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2. TECHNICAL FIELD 



The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with uses for these polynucleotides and proteins, for example 
5 in therapeutic, diagnostic and research methods. 

3. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs, chemokines, and interleukins) has matured 

10 rapidly over the past decade. The now routine hybridization cloning and expression 

cloning techniques clone novel polynucleotides "directly" in the sense that they rely on 
information directly related to the discovered protein (i.e., partial DNA/amino acid 
sequence of the protein in the case of hybridization cloning; activity of the protein in the 
case of expression cloning). More recent "indirect" cloning techniques such as signal 

1 5 sequence cloning, which isolates DNA sequences based on the presence of a now 

well-recognized secretory leader sequence motif, as well as various PCR-based or low 
stringency hybridization-based cloning techniques, have advanced the state of the art by 
making available large numbers of DNA/amino acid sequences for proteins that are 
known to have biological activity, for example, by virtue of their secreted nature in the 

20 case of leader sequence cloning, by virtue of their cell or tissue source in the case of 
PCR-based techniques, or by virtue of structural similarity to other genes of known 
biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications 
in, for example, diagnostics, forensics, gene mapping; identification of mutations 
25 responsible for genetic disorders or other traits, to assess biodiversity, and to produce 
many other types of data and products dependent on DNA and amino acid sequences. 

4. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
30 isolated polynucleotides encoding such polypeptides, including recombinant DNA 

molecules, cloned genes or degenerate variants thereof, especially naturally occurring 
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variants such as allelic variants, antisense polynucleotide molecules, and antibodies that 
specifically recognize one or more epitopes present on such polypeptides, as well as 
hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including 
5 expression vectors, containing the polynucleotides of the invention, cells genetically 

engineered to contain such polynucleotides and cells genetically engineered to express such 
polynucleotides. 

The present invention relates to a collection or library of novel nucleic acid 
sequences assembled from expressed sequence tags (ESTs) isolated mainly by sequencing 

10 by hybridization (SBH), and in some cases, sequences obtained from one or more public 

databases. The invention also relates to the proteins encoded by such polynucleotides, along 
with therapeutic, diagnostic and research utilities for these polynucleotides and proteins. 
These sequences of the present invention are designated herein as 748 SEQ ID NO: 1- 
45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 

15 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID 
NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 
30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. The nucleic acids and 
polypeptides are provided in the Sequence Listing, wherein for the nucleic acids, A is 
adenosine; C is cytosine; G is guanine; T is thymine; and N is any of the four bases. In the 

20 amino acids provided in the Sequence Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid 
sequences that hybridize to the complement of the aforementioned nucleic acid sequences 
under stringent hybridization conditions; nucleic acid sequences which are allelic variants or 
species homologues of any of the nucleic acid sequences recited above, or nucleic acid 

25 sequences that encode a peptide comprising a specific domain or truncation of the peptides 
encoded by said nucleic acid sequences, a polynucleotide comprising a nucleotide sequence 
having at least 90% identity to an identifying sequence or a degenerate variant or fragment 
thereof The identifying sequence can be 100 base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence 

30 information from the aforementioned nucleic acid sequences of 748 SEQ ID NO: 1 -45,1 96, 
752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID 
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NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 
3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 
SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 . The sequence information can be a 
segment of any one of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID 
5 NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 
785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ 
ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 
1-8502 that uniquely identifies or represents the sequence information of 748 SEQ ED NO: 
1-45,196, 752 SEQ ED NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128,782 

10 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID 
NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 
30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. 

A collection as used in this application can be a collection of only one 
polynucleotide. The collection of sequence information or identifying information of each 

1 5 sequence can be provided on a nucleic acid array. In one embodiment, segments of 

sequence information is provided on a nucleic acid array to detect the polynucleotide that 
contains the segment. The array can be designed to detect full-match or mismatch to the 
polynucleotide that contains the segment. The collection can also be provided in a 
computer-readable format. 

20 This invention also includes the reverse or direct complement of any of the nucleic 

acid sequences recited above; cloning or expression vectors containing the nucleic acid 
sequences; and host cells or organisms transformed with these expression vectors. Nucleic 
acid sequences (or their reverse or direct complements) according to the invention have 
numerous applications in a variety of techniques known to those skilled in the art of 

25 molecular biology, such as use as hybridization probes, use as primers for PCR, use in an 
array, use in computer-readable media, use in sequencing full-length genes, use for 
chromosome and gene mapping, use in the recombinant production of protein, and use in 
the generation of anti-sense DNA or RNA, their chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of 748 SEQ ID NO: 1- 

30 45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 

SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID 
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NO: 1-3960, 788 SEQ ED NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 
30,533, 791 SEQ ED NO: 1-5822, and 792 SEQ ID NO: 1-8502 or novel segments or parts 
of the nucleic acids of the invention are used as primers in expression assays that arewell 
known in the art. In a particularly preferred embodiment, the nucleic acid sequences of 748 
5 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID 
NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ED NO: 1- 
3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 
SEQ ID NO: 1-30,533, 791 SEQ ED NO: 1-5822, and 792 SEQ ID NO: 1-8502 or novel ; 
segments or parts of the nucleic acids provided herein are used in diagnostics for identifying - 

1 0 expressed genes or, as well known in the art and exemplified by Vollrath et al, Science 
258 :52-59 (1992), as expressed sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in 748 SEQ ED NO: 
1-45,196, 752 SEQ ED NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ED NO: 1-128, 782 

15 SEQ ED NO: 1-10,451, 784 SEQ ED NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ED 
NO: 1-3960, 788 SEQ ED NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 
30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502; a polynucleotide 
comprising any of the full length protein coding sequences of 748 SEQ ID NO: 1-45,196, 
752 SEQ ED NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ED 

20 NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 
3960, 788 SEQ ED NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 
SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502; and a polynucleotide comprising any of 
the nucleotide sequences of the mature protein coding sequences of 748 SEQ ID NO: 1 - 
45,196, 752 SEQ ED NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 

25 SEQ ID NO: 1-10,451, 784 SEQ ED NO: 1-10,289, 785 SEQ ED NO: 1-3796, 787 SEQ ID 
NO: 1-3960, 788 SEQ ED NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 
30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. The polynucleotides of the 
present invention also include, but are not limited to, a polynucleotide that hybridizes under 
stringent hybridization conditions to (a) the complement of any one of the nucleotide 

30 sequences set forth in 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID 
NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ED NO: 1-10,289, 
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785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ 
ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 
1-8502; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in 
the Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotides 
5 recited above; (d) a polynucleotide which encodes a species homolog (e.g. orthologs) of 
any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an amino 
acid sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a 

10 polypeptide comprising any of the amino acid sequences set forth in the Sequence Listing; 
or the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides with biological activity that are encoded by (a) any of the polynucleotides 
having a nucleotide sequence set forth in 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1- 
13,192, 778 SEQ ED NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 

15 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID 
NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1- 
5822, and 792 SEQ ID NO: 1-8502; or (b) polynucleotides that hybridize to the complement 
of the polynucleotides of (a) under stringent hybridization conditions. Biologically or 
immunologically active variants of any of the polypeptide sequences in the Sequence 

20 Listing, and "substantial equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 
80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity) that preferably retain 
biological activity are also contemplated. The polypeptides of the invention may be wholly 
or partially chemically synthesized but are preferably produced by recombinant means using 
the genetically engineered cells (e.g. host cells) of the invention. 

25 The invention also provides compositions comprising a polypeptide of the 

invention. Polypeptide compositions of the invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The invention also provides host cells transformed or transfected with a 
polynucleotide of the invention. 

30 The invention also relates to methods for producing a polypeptide of the invention 

comprising growing a culture of the host cells of the invention in a suitable culture 
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medium under conditions permitting expression of the desired polypeptide, and purifying 
the polypeptide from the culture or from the host cells. Preferred embodiments include 
those in which the protein produced by such process is a mature form of the protein. 
Polynucleotides according to the invention have numerous applications in a 
5 variety of techniques known to those skilled in the art of molecular biology. These 
techniques include use as hybridization probes, use as oligomers, or primers, for PCR, 
use for chromosome and gene mapping, use in the recombinant production of protein, 
and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. 
For example, when the expression of an mRNA is largely restricted to a particular cell or 
10 tissue type, polynucleotides of the invention can be used as hybridization probes to detect 
the presence of the particular cell or tissue mRNA in a sample using, e.g., in situ 
hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
15 exemplified by Vollrath et al. t Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of 
conventional procedures and methods that are currently applied to other proteins. For 
example, a polypeptide of the invention can be used to generate an antibody that 
20 specifically binds the polypeptide. Such antibodies, particularly monoclonal antibodies, 
are useful for detecting or quantitating the polypeptide in tissue. The polypeptides of the 
invention can also be used as molecular weight markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical 
condition which comprises the step of administering to a mammalian subject a 
25 therapeutically effective amount of a composition comprising a polypeptide of the 
present invention and a pharmaceutical^ acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be 
utilized, for example, in methods for the prevention and/or treatment of disorders 
involving aberrant protein expression or biological activity. 
30 The present invention further relates to methods for detecting the presence of the 

polynucleotides or polypeptides of the invention in a sample. Such methods can, for 
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example, be utilized as part of prognostic and diagnostic evaluation of disorders as 
recited herein and for the identification of subjects exhibiting a predisposition to such 
conditions. The invention provides a method for detecting the polynucleotides of the 
invention in a sample, comprising contacting the sample with a compound that binds to 
5 and forms a complex with the polynucleotide of interest for a period sufficient to form 
the complex and under conditions sufficient to form a complex and detecting the complex 
such that if a complex is detected, the polynucleotide of interest is detected. The 
invention also provides a method for detecting the polypeptides of the invention in a 
sample comprising contacting the sample with a compound that binds to and forms a 

10 complex with the polypeptide under conditions and for a period sufficient to form the 

complex and detecting the formation of the complex such that if a complex is formed, the 
polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or 
monoclonal antibodies, and optionally quantitative standards, for carrying out methods of ■ 

1 5 the invention. Furthermore, the invention provides methods for evaluating the efficacy of 
drugs, and monitoring the progress of patients, involved in clinical trials for the treatment 
of disorders as recited above. 

The invention also provides methods for the identification of compounds that 
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides 

20 and/or polypeptides of the invention. Such methods can be utilized, for example, for the 
identification of compounds that can ameliorate symptoms of disorders as recited herein. 
Such methods can include, but are not limited to, assays for identifying compounds and 
other substances that interact with (e.g., bind to) the polypeptides of the invention. The 
invention provides a method for identifying a compound that binds to the polypeptides of 

25 the invention comprising contacting the compound with a polypeptide of the invention in 
a cell for a time sufficient to form a polypeptide/compound complex, wherein the 
complex drives expression of a reporter gene sequence in the cell; and detecting the 
complex by detecting the reporter gene sequence expression such that if expression of the 
reporter gene is detected the compound the binds to a polypeptide of the invention is 

30 identified. 
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The methods of the invention also provides methods for treatment which involve 
the administration of the polynucleotides or polypeptides of the invention to individuals 
exhibiting symptoms or tendencies. In addition, the invention encompasses methods for 
treating diseases or disorders as recited herein comprising administering compounds and 
5 other substances that modulate the overall activity of the target gene products. 

Compounds and other substances can effect such modulation either on the level of target 
gene/protein expression or target protein activity. 

The polypeptides of the present invention and the polynucleotides encoding them 
are also useful for the same functions known to one of skill in the art as the polypeptides 
10 and polynucleotides to which they have homology. If no homology is set forth for a 
sequence, then the polypeptides and polynucleotides of the present invention are useful 
for a variety of applications, as described herein, including use in arrays for detection. 


5. DETAILED DESCRIPTION OF THE INVENTION 
15 5.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular 
forms "a", "an" and "the" include plural references unless the context clearly dictates 
otherwise. 

The term "active" refers to those forms of the polypeptide which retain the 
20 biologic and/or immunologic activities of any naturally occurring polypeptide. According 
to the invention, the terms "biologically active" or "biological activity" refer to a protein 
or peptide having structural, regulatory or biochemical functions of a naturally occurring 
molecule. Likewise "immunologically active" or "immunological activity" refers to the 
capability of the natural, recombinant or synthetic polypeptide to induce a specific 
25 immune response in appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are 
engaged in extracellular or intracellular membrane trafficking, including the export of 
secretory or enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
30 polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded 
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molecules may be "partial" such that only some of the nucleic acids bind or it may be 
"complete" such that total complementarity exists between the single stranded molecules. 
The degree of complementarity between the nucleic acid strands has significant effects on 
the efficiency and strength of the hybridization between the nucleic acid strands. 
5 The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 

differentiated cell types in an embryo or an adult, including the germ cells. The term 
"germ line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that 
provide a steady and continuous source of germ cells for the production of gametes. The 
term "primordial germ cells (PGCs)" refers to a small population of cells set aside from 

10 other cell lineages particularly from, the yolk sac, mesenteries, or gonadal ridges during 
embryogenesis that have the potential to differentiate into germ cells and other cells. 
PGCs are the source from which GSCs and ES cells are derived The PGCs, the GSCs 
and the ES cells are capable of self-renewal. Thus these cells not only populate the germ 
line and give rise to a plurality of terminally differentiated cells that comprise the adult 

1 5 specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides 
which modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably 
linked sequence" when the expression of the sequence is altered by the presence of the 

20 EMF. EMFs include, but are not limited to, promoters, and promoter modulating 

sequences (inducible elements). One class of EMFs are nucleic acid fragments which 
induce the expression of an operably linked ORF in response to a specific regulatory 
factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 

25 "oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or 
the sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic 
or synthetic origin which may be single-stranded or double-stranded and may represent 
the sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or 
RNA-like material. In the sequences herein A is adenine, C is cytosine, T is thymine, G 

30 is guanine and N is A, C, G or T (U). It is contemplated that where the polynucleotide is 
RNA, the T (thymine) in the sequences provided herein is substituted with U (uracil). 
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Generally, nucleic acid segments provided by this invention may be assembled from 
fragments of the genome and short oligonucleotide linkers, or from a series of 
oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid 
which is capable of being expressed in a recombinant transcriptional unit comprising 
5 regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," 
or "segment" or "probe" or "primer" are used interchangeably and refer to a sequence of 
nucleotide residues which are at least about 5 nucleotides, more preferably at least about 
7 nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 

10 11 nucleotides and most preferably at least about 17 nucleotides. The fragment is 

preferably less than about 500 nucleotides, preferably less than about 200 nucleotides, 
more preferably less than about 100 nucleotides, more preferably less than about 50 
nucleotides and most preferably less than 30 nucleotides. Preferably the probe is from 
about 6 nucleotides to about 200 nucleotides, preferably from about 15 to about 50 

1 5 nucleotides, more preferably from about 17 to 30 nucleotides and most preferably from 
about 20 to 25 nucleotides. Preferably the fragments can be used in polymerase chain 
reaction (PCR), various hybridization procedures or microarray procedures to identify or 
amplify identical or related parts of mRNA or DNA molecules. A fragment or segment 
may uniquely identify each polynucleotide sequence of the present invention. Preferably 

20 the fragment comprises a sequence substantially similar to any one of 748 SEQ ID NO: 
1-45,196, 752 SEQ ID NO:. 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 
782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 
SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID 
NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. 

25 Probes may, for example, be used to determine whether specific mRNA 

molecules are present in a cell or tissue or to isolate similar nucleic acid sequences from 
chromosomal DNA as described by Walsh et. al., (Walsh, P.S. et al, 1992, PCR Methods 
Appl 1 :241-250). They may be labeled by nick translation, Klenow fill-in reaction, PCR, 
or other methods well known in the art. Probes of the present invention, their preparation 

30 and/or labeling are elaborated in Sambrook, J. et al, 1989, Molecular Cloning: A 

Laboratory Manual, Cold Spring Harbor Laboratory, NY; or Ausubel, F.M. et al, 1989, 
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Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, both of 
which are incorporated herein by reference in their entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of 748 SEQ ID NO: 1-45,196, 752 SEQ ID 
5 NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1- 

10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 

788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 
SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. The sequence information can be a 

, segment of any one of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ 
10 ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1- 
10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 

789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 
SEQ ID NO: 1-8502 that uniquely identifies or represents the sequence information of 
that sequence of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID 

15 NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1- 

10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 
789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 
, SEQ ID NO: 1-8502. One such segment can be a twenty-mer nucleic acid sequence 
because the probability that a twenty-mer is fully matched in the human genome is 1 in 

20 300. In the human genome, there are three billion base pairs in one set of chromosomes. 
Because 4 20 possible twenty-mers exist, there are 300 times more twenty-mers than there 
are base pairs in a set of human chromosomes. Using the same analysis, the probability 
for a seventeen-mer to be fully matched in the human genome is approximately 1 in 5. 
When these segments are used in arrays for expression studies, fifteen-mer segments can 

25 be used. The probability that the fifteen-mer is fully matched in the expressed sequences 
is also approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a 
segment can be a twenty-five mer. The probability that the twenty-five mer would appear in 

30 a human genome with a single mismatch is calculated by multiplying the probability for a 
full match (l-^4 25 ) times the increased probability for mismatch at each nucleotide position 
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(3 x 25). The probability that an eighteen mer with a single mismatch can be detected in an 
array for expression studies is approximately one in five. The probability that a twenty-mer 
with a single mismatch can be detected in a human genome is approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets coding 
5 for amino acids without any termination codons and is a sequence translatable into 
protein. 

The terms "operably linked" or "operably associated" refer to functionally related 
. nucleic acid sequences. For example, a promoter is operably associated or operably 
linked with a coding sequence if the promoter controls the transcription of the coding 
10 sequence. While operably linked nucleic acid sequences can be contiguous and in the 
same reading frame, certain genetic elements e.g. repressor genes are not contiguously 
. linked to the coding sequence but still control transcription/translation of the coding 
sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a 

15 number of differentiated cell types that are present in an adult organism. A pluripotent 
cell is restricted in its differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an 
oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to 
naturally occurring or synthetic molecules. A polypeptide "fragment," "portion," or 

20 "segment" is a stretch of amino acid residues of at least about 5 amino acids, preferably at 
least about 7 amino acids, more preferably at least about 9 amino acids and most 
preferably at least about 17 or more amino acids. The peptide preferably is not greater 
than about 200 amino acids, more preferably less than 150 amino acids and most 
preferably less than 100 amino acids. Preferably the peptide is from about 5 to about 200 

25 amino acids. To be active, any polypeptide must have sufficient length to display 
biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by 
cells that have not been genetically engineered and specifically contemplates various 
polypeptides arising from post-translational modifications of the polypeptide including, 

30 but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation 
and acylation. 
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The term "translated protein coding portion" means a sequence which encodes for 
the full length protein which may include any leader sequence or any processing 
sequence. 

The term "mature protein coding sequence" means a sequence which encodes a 
5 peptide or protein without a signal or leader sequence. The "mature protein portion" 

means that portion of the protein which does not include a signal or leader sequence. The 
peptide may have been produced by processing in the cell which removes any 
leader/signal sequence. The mature protein portion may or may not include the initial 
methionine residue. The methionine residue may be removed from the protein during 
10 processing in the cell. The peptide may be produced synthetically or the protein may 

have been produced using a polynucleotide only encoding for the mature protein coding 
sequence. 

The term "derivative" refers to polypeptides chemically modified by such 
techniques as ubiquitination, labeling {e.g., with radionuclides or various enzymes), 

1 5 covalent polymer attachment such as pegylation (derivatization with polyethylene glycol) 
and insertion or substitution by chemical synthesis of amino acids such as ornithine, 
which do not normally occur in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created 

20 using, e g., recombinant DNA techniques. Guidance in determining which amino acid 
residues may be replaced, added or deleted without abolishing activities of interest, may 
be found by comparing the sequence of the particular polypeptide with that of 
homologous peptides and minimizing the number of amino acid sequence changes made 
in regions of high homology (conserved regions) or by replacing amino acids with 

25 consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides 
may be synthesized or selected by making use of the "redundancy" in the genetic code. 
Various codon substitutions, such as the silent changes which produce various restriction 
sites, may be introduced to optimize cloning into a plasmid or viral vector or expression 

30 in a particular prokaryotic or eukaryotic system. Mutations in the polynucleotide 

sequence may be reflected in the polypeptide or domains of other peptides added to the 
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polypeptide to modify the properties, of any part of the polypeptide, to change 
characteristics such as ligand-binding affinities, interchain affinities, or 
degradation/turnover rate. 

Preferably, amino acid "substitutions' 1 are the result of replacing one amino acid 
5 with another amino acid having similar structural and/or chemical properties, le., 

conservative amino acid replacements. "Conservative" amino acid substitutions may be 
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues involved. For example, 
nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, 

10 phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, 

serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) 
amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino 
acids include aspartic acid and glutamic acid. "Insertions" or "deletions" are preferably 
in the range of about 1 to 20 amino acids, more preferably 1 to 10 amino acids. The 

1 5 variation allowed may be experimentally determined by systematically making insertions, 
deletions, or substitutions of amino acids in a polypeptide molecule using recombinant 
DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such 

20 alterations can, for example, alter one or more of the biological functions or biochemical 
characteristics of the polypeptides of the invention. For example, such alterations may 
change polypeptide characteristics such as ligand-binding affinities, interchain affinities, 
or degradation/turnover rate. Further, such alterations can be selected so as to generate 
polypeptides that are better suited for expression, scale up and the like in the host cells 

25 chosen for expression. For example, cysteine residues can be deleted or substituted with 
another amino acid residue in order to eliminate disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the 
indicated nucleic acid or polypeptide is present in the substantial absence of other 
biological macromolecules, e.g., polynucleotides, proteins, and the like. In one 

30 embodiment, the polynucleotide or polypeptide is purified such that it constitutes at least 
95% by weight, more preferably at least 99% by weight, of the indicated biological 
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macromolecules present (but water, buffers, and other small molecules, especially 
molecules having a molecular weight of less than 1000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide 
separated from at least one other component (e.g., nucleic acid or polypeptide) present 
5 with the nucleic acid or polypeptide in its natural source. In one embodiment, the nucleic 
acid or polypeptide is found in the presence of (if anything) only a solvent, buffer, ion, or 
other component normally present in a solution of the same. The terms "isolated" and 
"purified" do not encompass nucleic acids or polypeptides present in their natural source: 
The term "recombinant," when used herein to refer to a polypeptide or protein, 

10, means that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, 
or mammalian) expression systems. "Microbial" refers to recombinant polypeptides or 
proteins made in bacterial or fungal (e.g., yeast) expression systems. As a product, 
"recombinant microbial" defines a polypeptide or protein essentially free of native 
endogenous substances and unaccompanied by associated native glycosylation. 

15 Polypeptides or proteins expressed in most bacterial cultures, e.g., E. coli, will be free of 
glycosylation modifications; polypeptides or proteins expressed in yeast will have a 
glycosylation pattern in general different from those expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage 
or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An 

20 expression vehicle can comprise a transcriptional unit comprising an assembly of (1) a 
genetic element or elements having a regulatory role in gene expression, for example, 
promoters or enhancers, (2) a structural or coding sequence which is transcribed into 
mRNA and translated into protein, and (3) appropriate transcription initiation and 
termination sequences. Structural units intended for use in yeast or eukaryotic expression 

25 systems preferably include a leader sequence enabling extracellular secretion of 

translated protein by a host cell. Alternatively, where recombinant protein is expressed 
without a leader or transport sequence, it may include an amino terminal methionine 
residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

30 The term "recombinant expression system" means host cells which have stably 

integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
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recombinant transcriptional unit extrachromosomally. Recombinant expression systems 
as defined herein will express heterologous polypeptides or proteins upon induction of 
the regulatory elements linked to the DNA segment or synthetic gene to be expressed. 
This term also means host cells which have stably integrated a recombinant genetic 
5 element or elements having a regulatory role in gene expression, for example, promoters 
or enhancers. Recombinant expression systems as defined herein will express 
polypeptides or proteins endogenous to the cell upon induction of the regulatory elements 
linked to the endogenous DNA segment or gene to be expressed. The cells can be 
prokaryotic or eukaryotic. 

10 The term "secreted" includes a protein that is transported across or through a 

membrane, including transport as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 

15 that are transported across the membrane of the endoplasmic reticulum. "Secreted" 
proteins are also intended to include proteins containing non-typical signal sequences 
(e.g. Interleukin-1 Beta, see Krasney, P.A. and Young, P.R. (1992) Cytokine 4(2): 134 
-143) and factors released from damaged cells (e.g. Interleukin-1 Receptor Antagonist, 
. see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 16:27-55) 

20 Where desired, an expression vector may be designed to contain a "signal or 

leader sequence" which will direct the polypeptide through the membrane of a cell. Such 
a sequence may be naturally present on the polypeptides of the present invention or 
provided from heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood 

25 in the art as stringent. Stringent conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 
1 mM EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately 
stringent conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary 
hybridization conditions are described herein in the examples. 

30 In instances of hybridization of deoxyoligonucleotides, additional exemplary 

stringent hybridization conditions include washing in 6X SSC/0.05% sodium 
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pyrophosphate at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligos), 55°C 
(for 20-base oligonucleotides), and 60°C (for 23 -base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino 
acid sequences, for example a mutant sequence, that varies from a reference sequence by 
5 one or more substitutions, deletions, or additions, the net effect of which does not result 
in an adverse functional dissimilarity between the reference and subject sequences. 
Typically, such a substantially equivalent sequence varies from one of those listed herein 
by no more than about 35% (i.e., the number of individual residue substitutions, 
additions, and/or deletions in a substantially equivalent sequence, as compared to the 

10 corresponding reference sequence, divided by the total number of residues in the 

substantially equivalent sequence is about 0.35 or less). Such a sequence is said to have 
65% sequence identity to the listed sequence. In one embodiment, a substantially 
equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no 
more than 30% (70% sequence identity); in a variation of this embodiment, by no more 

15 than 25% (75% sequence identity); and in a further variation of this embodiment, by no 
more than 20% (80% sequence identity) and in a further variation of this embodiment, by 
no more than 10% (90% sequence identity) and in a further variation of this embodiment, 
by no more that 5% (95% sequence identity). Substantially equivalent, e.g., mutant, 
amino acid sequences according to the invention preferably have at least 80% sequence 

20 identity with a listed amino acid sequence, more preferably at least 90% sequence 

identity. Substantially equivalent nucleotide sequences of the invention can have lower 
percent sequence identities, taking into account, for example, the redundancy or 
degeneracy of the genetic code. Preferably, nucleotide sequence has at least about 65% 
identity, more preferably at least about 75% identity, and most preferably at least about 

25 95% identity. For the purposes of the present invention, sequences having substantially 
equivalent biological activity and substantially equivalent expression characteristics are 
considered substantially equivalent. For the purposes of determining equivalence, 
truncation of the mature sequence (e.g., via a mutation which creates a spurious stop 
codon) should be disregarded. Sequence identity may be determined, e.g., using the 

30 Jotun Hein method (Hein, J. (1990) Methods Enzymol. 1 83:626-645). Identity between 
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sequences can also be determined by other methods known in the art, e.g. by varying 
hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of 
the cell types of an adult organism. 
5 The term "transformation" means introducing DNA into a suitable host cell so 

that the DNA is replicable, either as an extrachromosomal element, or by chromosomal 
integration. The term "transfection" refers to the taking up of an expression vector by a 
suitable host cell, whether or not any coding sequences are in fact expressed. The term 
"infection" refers to the introduction of nucleic acids into a suitable host cell by use of a 

10 virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of 
nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can 
be readily identified using known UMFs as a target sequence or target motif with the 
computer-based systems described below. The presence and activity of a UMF can be 

1 5 confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic 
acid molecule is then incubated with an appropriate host under appropriate conditions and 
the uptake of the marker sequence is determined. As described above, a UMF will 
increase the frequency of uptake of a linked marker sequence. 

Each of the above terms is meant to encompass all that is described for each, 

20 unless the context dictates otherwise. 

5.2 NUCLEIC ACIDS AND PEPTIDES OF THE INVENTION 

Sequences of the nucleic acids and peptides of the present invention are set forth 
in the Sequence Listing. Table 1 relates the SEQ ID's listed herein to their identification 
25 in parent applications from which this application claims priority. 


TABLE 1 


Gene 

Serial 

Date 

SEQ ID NO. 

SEQ ID NO: 

File Name 

Family 

Number 

Filed 

in Parent Applications 

in Current Application 

On CD 

748 

09/205,070 

Dec. 3, 1998 

SEQ ID NO: 1-45,196 

748 SEQ ID NO: 1-45,196 

N/A 


09/340,623 

Jun. 28, 1999 

SEQ ID NO: 1-45,196 

748 SEQ ID NO: 1-45,196 
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09/898,888 

Jul. 3, 2001 

SEQ ID NO: 1-45,196 

748 SEQ ID NO: 1-45,196 


752 

09/205,155 

09/359,922 
aq/q 1 o nm 

Dec. 3, 1998 
Jul. 22, 1999 

JUI. J\)y zuu i 

SEQ ID NO: 1-13192 
SEQ ID NO: 1-13192 

752 SEQ ID NO: 1-13192 
752 SEQ ID NO: 1-13192 

ar^ OCA ipv via. 1 111 AO 

N/A 

/ /O 

yJy/D** /, iz / 
09/905,059 

T«il 0 1 QQQ 

jui. z, iyyy 
Jul. 12, 2001 

OCA I PV XTfV 1 O/l 

SEQ ID NO: 1-94 

*7*70 c rr f~\ rr\ via. i c\a 
77o bbQ ID 1NU. 1-94 

778 SEQ ID NO: 1-94 

N/A 

779 

09/437,877 
09/952,981 

T~\ _ _ O 1 AAA 

Dec. o, 1999 
Sep. 14, 2001 

or , / r> v ir\ via. i i ^ o 

bhQ ID NO: 1-128 
SEQ ID NO: 1-128 

TTA PPA TT*\ VIA 1 1 O 

779 SEQ ID NO: 1-128 
779 SEQ ID NO: 1-128 

N/A 

7oZ 

09/47 1 ,275 

A_,„ "">0 1 AAA 

Uec. 23, 1999 

o t? /~\ ir\ via .1 i a /ici 

bbQ ID NO: 1-10,451 

TOO CT'/^V VIA 1 l A j r i 

782 SEQ ID NO: 1-10,451 


784 

09/488,725 
09/552,317 

Jan. 21,2000 
Apr. 25, 2000 

SEQ ID NO: 1-10289 
SEQ ID NO: 1-10289 

784 SEQ ID NO: 1-10289 
784 SEQ ID NO: 1-10289 


785 

09/491,404 
09/922,279 

Jan. 25, 2000 

A AA 1 

Aug. 3, 2001 

SEQ ID NO: 1-3796 
SEQ ID NO: 1-3796 

785 SEQ ID NO: 1-3796 
785 SEQ ID NO: 1-3796 

(Table 5 and 7 are hard copies) 

787 

09/496,914 
09/560,875 

Feb. 23, 2000 
Apr. 27, 2000 

SEQ ID NO: 1-3960 
SEQ ID NO: 1-3960 

787 SEQ ID NO: 1-3960 
787 SEQ ID NO: 1-3960 

Table8(787).doc; Table9(787).doc; 

Tahiti 0/7R7\ Hap 

788 

09/515,126 
09/577,409 

Feb. 28, 2000 

X jf . 1 O »^AAA 

May. 1 8, 2000 

SEQ ID NO: 1-14074 
SEQ ID NO: 1-14074 

788 SEQ ID NO: 1-14074 
788 SEQ ID NO: 1-14074 

Table 1 l(788).doc; Table 12(788).doc 

Tahiti V75MN rtnr 

789 

09/519,705 
09/574,454 

Mar. 7, 2000 

» i A AAA 

May. 19, 2000 

SEQ ID NO: 1-6391 

OrA TA VIA. 1 Z"1Al 

SEQ ID NO: 1-6391 

789 SEQ ID NO: 1-6391 
789 SEQ ID NO: 1-6391 

Table 1 4(789).doc; Table 1 5(789).doc 

Tahiti f\(7RQ\ Hnc 

790 

09/540,217 
09/649,167 

Mar. 31,2000 
Aug. 23, 2000 

SEQ ID NO: 1-30533 
SEQ ID NO: 1-30533 

790 SEQ ID NO: I -30533 
790 SEQ ID NO: 1-30533 

Table 1 7(790).doc; Table 1 8(790).doc 
Table 19(790).doc 

791 

09/552,929 
09/770,160 

Apr. 18, 2000 
Jan. 26, 2001 

SEQ ID NO: 1-5822 
SEQ ID NO: 1-5822 

79 1 SEQ ID NO: 1-5822 
79 1 SEQ ID NO: 1-5822 

Table20(79 1 ).doc; Table2 1 (79 1 ).doc 
Table22(791).doc 

792 

09/577,408 May. 18,2000 

SEQ ID NO: 1-8502 

792 SEQ ID NO: 1-8502 

(Table 23-25 are hard copies) 


The isolated polynucleotides of the invention include a polynucleotide comprising 
the nucleotide sequences of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 
5 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID 
NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1- 
14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, 
and 792 SEQ ID NO: 1-8502; a polynucleotide encoding any one of the peptide 
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sequences of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1- 
94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 
SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID 
NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 

5 1-8502; and a polynucleotide comprising the nucleotide sequence encoding the mature 
protein coding sequence of the polypeptides of any one of 748 SEQ ID NO: 1-45,196, 
752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID 
NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 
1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 

10 30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. The polynucleotides of 
the present invention also include, but are not limited to, a polynucleotide that hybridizes 
under stringent conditions to (a) the complement of any of the nucleotides sequences of 
748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ 
ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID 

15 NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1- 
6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 
; (b) nucleotide sequences encoding any one of the amino acid sequences set forth in the 
Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotide 
recited above; (d) a polynucleotide which encodes a species homolog of any of the 

20 proteins recited above; or (e) a polynucleotide that encodes a polypeptide comprising a 
specific domain or truncation of the polypeptides encoded by 748 SEQ ID NO: 1-45,196, 
752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID 
NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 
1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 

25 30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. Domains of interest 
may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like 
polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic 
domains, or combinations thereof; domains in immunoglobulin-like proteins include the 
variable immunoglobulin-like domains; domains in enzyme-like polypeptides include 

30 catalytic and substrate binding domains; and domains in ligand polypeptides include 
receptor-binding domains. 
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The polynucleotides of the invention include naturally occurring or wholly or 
partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The 
polynucleotides may include all of the coding region of the cDNA or may represent a 
portion of the coding region of the cDNA. 
5 The present invention also provides genes corresponding to the cDNA sequences 

disclosed herein. The corresponding genes can be isolated in accordance with known 
methods using the sequence information disclosed herein. Such methods include the 
preparation of probes or primers from the disclosed sequence information for identification 
and/or amplification of genes in appropriate genomic libraries or other sources of genomic 

10 materials. Further 5* and 3' sequence can be obtained using methods known in the art. For 
example, full length cDNA or genomic DNA that corresponds to any of the polynucleotides 
of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ 
ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 
• 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 

15 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 can be 
obtained by screening appropriate cDNA or genomic DNA libraries under suitable 
hybridization conditions using any of the polynucleotides of 748 SEQ ID NO: 1-45* 196, 752 
SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1- 
10,451, 784SEQIDNO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 

20 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533,791 SEQ 
ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or a portion thereof as a probe. Alternatively, 
the polynucleotides of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID 
NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 
785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ 

25 ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 
1-8502 may be used as the basis for suitable primer(s) that allow identification and/or 
amplification of genes in appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and 
sequences (including cDNA and genomic sequences) obtained from one or more public 

30 databases, such as dbEST, gbpri, and UniGene. The EST sequences can provide identifying 
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sequence information, representative fragment or segment information, or novel segment 
information for the full-length gene. 

The polynucleotides of the invention also provide polynucleotides including 
nucleotide sequences that are substantially equivalent to the polynucleotides recited 
5 above. Polynucleotides according to the invention can have, e.g., at least about 65%, at 
least about 70%, at least about 75%, at least about 80%, more typically at least about ' •' . • 
90%, and even more typically at least about 95%, sequence identity to a polynucleotide 
recited above. 

Included within the scope of the nucleic acid sequences of the invention are 

10 nucleic acid sequence fragments that hybridize under stringent conditions to any of the 

nucleotide sequences of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ - 
ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1- 
10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 
789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 

15 SEQ ID NO: 1-8502, or complements thereof, which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and 
most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides 
or more that are selective for (i.e. specifically hybridize to any one of the polynucleotides 
of the invention) are contemplated. Probes capable of specifically hybridizing to a 

20 polynucleotide can differentiate polynucleotide sequences of the invention from other * 
polynucleotide sequences in the same family of genes or can differentiate human genes 
from genes of other species, and are preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to 
these specific sequences, but also include allelic and species variations thereof. Allelic and 

25 species variations can be routinely determined by comparing the sequence provided 748 
SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID 
NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1- 
3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 
SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502, a 

30 representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 
95% identical, to 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 
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1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784SEQIDNO: 1-10,289, 785 
SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ED NO: 1-14,074, 789 SEQ ID 
NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1- 
8502 with a sequence from another isolate of the same species. Furthermore, to 
5 accommodate codon variability, the invention includes nucleic acid molecules coding for 
the same amino acid sequences as do the specific ORFs disclosed herein. In other words, in 
■ the coding region of an ORF, substitution of one codon for another codon that encodes the 
same amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present 

10 invention, including 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID 
NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 
785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ 
ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 
1-8502, can be obtained by searching a database using an algorithm or a program. 

1 5 Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to search 
for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and Altschul 
S.F. et al., J. Mol. Biol. 21:403-410 (1990)). Alternatively a FASTA version 3 search 
against Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are 

20 also provided by the present invention. Species homologs may be isolated and identified 
by making suitable probes or primers from the sequences provided herein and screening a 
suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 

25 which also encode proteins which are identical, homologous or related to that encoded by 
the polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences 
which encode variants of the described nucleic acids. These amino acid sequence 
variants may be prepared by methods known in the art by introducing appropriate 

30 nucleotide changes into a native or variant polynucleotide. There are two variables in the 
construction of amino acid sequence variants: the location of the mutation and the nature 
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of the mutation. Nucleic acids encoding the amino acid sequence variants are preferably 
constructed by mutating the polynucleotide to encode an amino acid sequence that does 
not occur in nature. These nucleic acid alterations can be made at sites that differ in the 
nucleic acids from different species (variable positions) or in highly conserved regions 
5 (constant regions). Sites at such locations will typically be modified in series, e.g., by 
substituting first with conservative choices (e.g., hydrophobic amino acid to a different 
hydrophobic amino acid) and then with more distant choices (e.g., hydrophobic amino 
acid to a charged amino acid), and then deletions or insertions may be made at the target 
site. Amino acid sequence deletions generally range from about 1 to 30 residues, 

10 preferably about 1 to 10 residues, and are typically contiguous. Amino acid insertions - 
include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino 
acid residues. Intrasequence insertions may range generally from about 1 to 10 amino 
residues, preferably from 1 to 5 residues. Examples of terminal insertions include the 

1 5 heterologous signal sequences necessary for secretion or for intracellular targeting in 
different host cells and sequences such as FLAG or poly-histidine sequences useful for 
purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences 
are changed via site-directed mutagenesis. This method uses oligonucleotide sequences 

20 to alter a polynucleotide to encode the desired amino acid variant, as well as sufficient 
adjacent nucleotides on both sides of the changed amino acid to form a stable duplex on 
either side of the site of being changed. In general, the techniques of site-directed 
mutagenesis are well known to those of skill in the art and this technique is exemplified 
by publications such as, Edelman et al, DNA 2:183 (1983). A versatile and efficient 

25 method for producing site-specific changes in a polynucleotide sequence was published 
by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may also be used to 
create amino acid sequence variants of the novel nucleic acids. When small amounts of 
template DNA are used as starting material, primer(s) that differs slightly in sequence 
from the corresponding region in the template DNA can generate the desired amino acid 

30 variant. PCR amplification results in a population of product DNA fragments that differ 
from the polynucleotide template encoding the polypeptide at the position specified by 
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the primer. The product DNA fragments replace the corresponding region in the plasmid 
and this gives a polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al, Gene 34:315 (1985); and other mutagenesis 
5 techniques well known in the art, such as, for example, the techniques in Sambrook et al, 
supra, and Current Protocols in Molecular Biology, Ausubel et. al., Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the 
same or a functionally equivalent amino acid sequence may be used in the practice of the 
invention for the cloning and expression of these novel nucleic acids. Such DNA 

10 sequences include those which are capable of hybridizing to the appropriate novel nucleic ■ 
acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can 
be used to generate polynucleotides encoding chimeric or fusion proteins comprising one 
or more domains of the invention and heterologous protein sequences. 

15 The polynucleotides of the invention additionally include the complement of any 

of the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, 
amplified, or synthetic) or RNA. Methods and algorithms for obtaining such 
polynucleotides are well known to those of skill in the art and can include, for example, 
methods for determining hybridization conditions that can routinely isolate 

20 polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the 
mature protein coding sequences corresponding to any one of 748 SEQ ID NO: 1-45,196, 
752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID 
NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 

25 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1- 

30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502, or functional equivalents 
thereof, may be used to generate recombinant DNA molecules that direct the expression 
of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also 
included are the cDNA inserts of any of the clones identified herein. 

30 A polynucleotide according to the invention can be joined to any of a variety of 

other nucleotide sequences by well-established recombinant DNA techniques (see 
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Sambrook J et. ah, (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, NY). Useful nucleotide sequences for joining to polynucleotides include an 
assortment of vectors, e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and 
the like, that are well known in the art. Accordingly, the invention also provides a vector 
5 including a polynucleotide of the invention and a host cell containing the polynucleotide. 
In general, the vector contains an origin of replication functional in at least one organism, 
convenient restriction endonuclease sites, and a selectable marker for the host cell. 
Vectors according to the invention include expression vectors, replication vectors, probe 
generation vectors, and sequencing vectors. A host cell according to the invention can be 

10 a prokaryotic or eukaryotic cell and can be a unicellular organism or part of a • 
multicellular organism. 

The present invention further provides recombinant constructs comprising a 
nucleic acid having any of the nucleotide sequences of 748 SEQ ID NO: 1-45,196, 752 
SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 

15 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 
3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 
791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or a fragment thereof or any other 
polynucleotides of the invention. In one embodiment, the recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a 

20 nucleic acid having any of the nucleotide sequences of 748 SEQ ID NO: 1-45,196, 752 
SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 
1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 
3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 
791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or a fragment thereof is inserted, 

25 in a forward or reverse orientation. In the case of a vector comprising one of the ORFs of 
the present invention, the vector may further comprise regulatory sequences, including 
for example, a promoter, operably linked to the ORF. Large numbers of suitable vectors 
and promoters are known to those of skill in the art and are commercially available for 
generating the recombinant constructs of the present invention. The following vectors 

30 are provided by way of example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, 
pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, 
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pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, 
PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an 
expression control sequence such as the pMT2 or pED expression vectors disclosed in 
5 Kaufman et al. r Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. 
General methods of expressing recombinant proteins are also known and are exemplified 
in R. Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein 
"operably linked" means that the isolated polynucleotide of the invention and an 

10 expression control sequence are situated within a vector or cell in such a way that the 
protein is expressed by a host cell which has been transformed (transfected) with the 
ligated polynucleotide/expression control sequence. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 

15 appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 

include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
mouse metallothionein-I. Selection of the appropriate vector and promoter is well within 
the level of ordinary skill in the art. Generally, recombinant expression vectors will 

20 include origins of replication and selectable markers permitting transformation of the host 
cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a 
promoter derived from a highly-expressed gene to direct transcription of a downstream 
structural sequence. Such promoters can be derived from operons encoding glycolytic 
enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat 

25 shock proteins, among others. The heterologous structural sequence is assembled in 

appropriate phase with translation initiation and termination sequences, and preferably, a 
leader sequence capable of directing secretion of translated protein into the periplasmic 
space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 

30 characteristics, e.g., stabilization or simplified purification of expressed recombinant 
product. Useful expression vectors for bacterial use are constructed by inserting a 
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structural DNA sequence encoding a desired protein together with suitable translation 
initiation and termination signals in operable reading phase with a functional promoter. 
The vector will comprise one or more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector and to, if desirable, provide amplification 
5 within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus 
subtilis, Salmonella typhimurium and various species within the genera Pseudomonas, 
Streptomyces, and Staphylococcus, although others may also be employed as a matter of * 
choice. 

As a representative but non-limiting example, useful expression vectors for 

10 bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, : 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega 
Biotech, Madison, WI, USA). These pBR322 "backbone" sections are combined with an 

1 5 appropriate promoter and the structural sequence to be expressed. Following 

transformation of a suitable host strain and growth of the host strain to an appropriate cell 
density, the selected promoter is induced or derepressed by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. . 
Cells are typically harvested by centrifugation, disrupted by physical or chemical means, 

20 and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. 
For example, as described in Fan et al, Nat. Biotech. 17:870-872 (1999), incorporated 
herein by reference, nucleic acid sequences encoding a polypeptide may be used to 
generate antibodies against the encoded polypeptide following topical administration of 

25 naked plasmid DNA or following injection, and preferably intramuscular injection of the 
DNA. The nucleic acid sequences are preferably inserted in a recombinant expression 
vector and may be in the form of naked DNA. 

5.3 ANTISENSE 

30 Another aspect of the invention pertains to isolated antisense nucleic acid 

molecules that are hybridizable to or complementary to the nucleic acid molecule 
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comprising the nucleotide sequence of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1- 
13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 
SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID 
NO: 1-14,074, 789 SEQ ID NO: 1-6391,790 SEQ ID NO: 1-30,533,791 SEQ ID NO: 1- 
5 5822, and 792 SEQ ID NO: 1-8502, or fragments, analogs or derivatives thereof. An . 
"antisense" nucleic acid comprises a nucleotide sequence that is complementary to a 
"sense" nucleic acid encoding a protein, e.g., complementary to the coding strand of a 
double-stranded cDNA molecule or complementary to an mRNA sequence. In specific 
aspects, antisense nucleic acid molecules are provided that comprise a sequence 

10 complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire 

coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, 
homologs, derivatives and analogs of a protein of any of 748 SEQ ID NO: 1-45,196, 752 
SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 
1.10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 

15 3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 
791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or antisense nucleic acids 
complementary to a nucleic acid sequence of 748 SEQ ID NO: 1-45,196, 752 SEQ ID 
NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1- 
10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 

20 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 
SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence of the invention. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 

25 translated into amino acid residues. In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
sequence of the invention. The term "noncoding region" refers to 5 1 and 3' sequences 
which flank the coding region that are not translated into amino acids {i.e., also referred 
to as 5' and 3' untranslated regions). 

30 Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., 

748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ 
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ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784SEQIDNO: 1-10,289, 785 SEQ ID 
NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1- 
6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1- 
8502, antisense nucleic acids of the invention can be designed according to the rules of 
5 Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 
complementary to the entire coding region of a mRNA, but more preferably is an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of a 
mRNA. For example, the antisense oligonucleotide can be complementary to the region 
surrounding the translation start site of a mRNA. An antisense oligonucleotide can be, 

10 for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be constructed using chemical synthesis or 
, enzymatic ligation reactions using procedures known in the art. For example, an 
antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously modified nucleotides designed to 

1 5 increase the biological stability of the molecules or to increase the physical stability of 
the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 

20 hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 

5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
. dihydrouracil, beta-D-galactosylqueosine, inosine, n6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethyl guanine, 2-methyladenine, 

2- methyl guanine, 3-methylcytosine, 5-methylcytosine, n6-adenine, 7-methylguanine, 
25 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 

beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-n6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
30 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 

2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
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using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

5 The antisense nucleic acid molecules of the invention are typically administered 

to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a protein according to the invention to thereby inhibit 
expression of the protein, e.g., by inhibiting transcription and/or translation. The 
hybridization can be by conventional nucleotide complementarity to form a stable duplex, 

10 or, for example, in the case of an antisense nucleic acid molecule that binds to DNA 
duplexes, through specific interactions in the major groove of the double helix. An 
example of a route of administration of antisense nucleic acid molecules of the invention 
includes direct injection at a tissue site. Alternatively, antisense nucleic acid molecules 
can be modified to target selected cells and then administered systemically. For example, 

1 5 for systemic administration, antisense molecules can be modified such that they 

specifically bind to receptors or antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell 
surface receptors or antigens. The antisense nucleic acid molecules can also be delivered 
to cells using the vectors described herein. To achieve sufficient intracellular 

20 concentrations of antisense molecules, vector constructs in which the antisense nucleic 
acid molecule is placed under the control of a strong pol II or pol III promoter are 
preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is 
an a-anomeric nucleic acid molecule. An oc-anomeric nucleic acid molecule forms 
25 specific double-stranded hybrids with complementary RNA in which, contrary to the usual 
(3-units, the strands run parallel to each other (Gaultier et ah, (1987) Nucleic Acids Res 15: 
6625-6641). The antisense nucleic acid molecule can also comprise a 
2-o-methylribonucleotide (Inoue et al s (1987) Nucleic Acids Res 15: 6131-6148) or a 
chimeric RNA -DNA analogue (Inoue et al. y (1987) FEES Lett 215: 327-330). 
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5.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have 
5 a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 
Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
mRNA transcripts to thereby inhibit translation of the mRNA. A ribozyme having 
specificity for a nucleic acid of the invention can be designed based upon the nucleotide 
sequence of a DNA disclosed herein {i.e., 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 

10 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 
784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 
SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ 
ID NO: 1-5822, and 792 SEQ ID NO: 1-8502). For example, a derivative of a 
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 

15 active site is complementary to the nucleotide sequence to be cleaved in a 

SECX-encoding mRNA. See, e.g., Cech et al, U.S. Pat. No. 4,987,071 ; and Cech et al, 
U.S. Pat. No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a catalytic 
RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., 
Bartele/ al, (1993) Science 261:1411-1418. 

20 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple 
helical structures that prevent transcription of the gene in target cells. See generally, 
Helene. (1991) Anticancer Drug Des. 6: 569-84; Helene. et al., (1992) Ann. N.Y. Acad. 
ScL 660:27-36; and Maher (1992) Bioassays 14: 807-15. 

25 In various embodiments, the nucleic acids of the invention can be modified at the 

base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup et al, (1996) Bioorg Med Chem 4: 5-23). As used herein, the terms "peptide 

30 nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the 
deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the 
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four natural nucleobases are retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under conditions of low ionic strength. 
The synthesis of PNA oligomers can be performed using standard solid phase peptide 
synthesis protocols as described in Hyrup et al, (1996) above; Perry-O'Keefe et al, 
5 ( 1 996) PNAS 93 : 1 4670-675 . 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 
inhibiting replication. PNAs of the invention can also be used, e.g., in the analysis of 

10 single base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., SI nucleases 
(Hyrup B. (1996) above); or as probes or primers for DNA sequence and hybridization 
(Hyrup et al, (1996), above; Perry-O'Keefe (1996), above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance 

15 their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of 
drug delivery known in the art. For example, PNA-DNA chimeras can be generated that 
may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA 

20 portion while the PNA portion would provide high binding affinity and specificity. 

PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup 
(1996) above). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup (1996) above and Finn et al, (1996) Nucl Acids Res 24: 3357-63. For example, a 

25 DNA chain can be synthesized on a solid support using standard phosphoramidite 
coupling chemistry, and modified nucleoside analogs, e.g., 

5 , -(4-methoxytrityl)amino-5 -deoxy-thymidine phosphoramidite, can be used between the 
PNA and the 5' end of DNA (Mag et al, (1989) Nucl Acid Res 17: 5973-88). PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 
30 5* PNA segment and a 3' DNA segment (Finn et al, (1996) above). Alternatively, 
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chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. 
See, Petersen et al, (1975) Bioorg Med Chem Lett 5:1119-11 124. 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see, e.g., Letsinger et al.,, 1989, Proc. Natl. Acad. 
Scl U.S.A. 86:6553-6556; Lemaitre et al.„ 1987, Proc. Natl. Acad Sci. 84:648-652; PCT 
Publication No. W088/09810) or the blood-brain barrier (see, e.g., PCT Publication No. 
W089/1 0 1 34). In addition, oligonucleotides can be modified with hybridization triggered 
cleavage agents (See, e.g., Krol et al.,, 1988, BioTechniques 6:958-976) or intercalating 
10 agents. (See, e.g., Zon, 1988, Pharm. Res. 5: 539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, e.g., a peptide, a hybridization triggered 
cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, etc. 


5.5 HOSTS 

15 The present invention further provides host cells genetically engineered to contain 

the polynucleotides of the invention. For example, such host cells may contain nucleic 
acids of the invention introduced into the host cell using known transformation, 
transfection or infection methods. The present invention still further provides host cells 
genetically engineered to express the polynucleotides of the invention, wherein such 

20 polynucleotides are in operative association with a regulatory sequence heterologous to 
the host cell which drives expression of the polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, 
or increase, expression of endogenous polypeptide. Cells can be modified (e.g., by 
homologous recombination) to provide increased polypeptide expression by replacing, in 

25 whole or in part, the naturally occurring promoter with all or part of a heterologous 
promoter so that the cells express the polypeptide at higher levels. The heterologous 
promoter is inserted in such a manner that it is operatively linked to the encoding 
sequences. See, for example, PCT International Publication No. WO94/12650, PCT 
International Publication No. WO92/20808, and PCT International Publication No. 

30 W09 1/0995 5. It is also contemplated that, in addition to heterologous promoter DNA, 
amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
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encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) 
and/or intron DNA may be inserted along with the heterologous promoter DNA. If 
linked to the coding sequence, amplification of the marker DNA by standard selection 
methods results in co-amplification of the desired protein coding sequences in the cells. 
5 The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a 

lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the recombinant construct into the host cell can 
be effected by calcium phosphate transfection, DEAE, dextran mediated transfection, or 
electroporation (Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host 

10 cells containing one of the polynucleotides of the invention, can be used in conventional 
manners to produce the gene product encoded by the isolated fragment (in the case of. an 
ORF) or can be used to produce a heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the 
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa 

15 cells, Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. 
coli and B. subtilis. The most preferred cells are those which do not normally express the 
particular polypeptide or protein or which expresses the polypeptide or protein at low 
natural level. Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation systems can 

20 also be employed to produce such proteins using RNAs derived from the DNA constructs 
of the present invention. Appropriate cloning and expression vectors for use with 
prokaryotic and eukaryotic hosts are described by Sambrook, et al, in Molecular 
Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New York (1989), 
the disclosure of which is hereby incorporated by reference. 

25 Various mammalian cell culture systems can also be employed to express 

i 

recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other 
cell lines capable of expressing a compatible vector are, for example, the CI 27, monkey 
COS cells, Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human 
30 epidermal A431 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other transformed 
primate cell lines, normal diploid cells, cell strains derived from in vitro culture of 
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primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or 
Jurkat cells. Mammalian expression vectors will comprise an origin of replication, a 
suitable promoter and also any necessary ribosome binding sites, polyadenylation site, 
splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking 
5 nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for 

example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be 
used to provide the required nontranscribed genetic elements. Recombinant polypeptides 
and proteins produced in bacterial culture are usually isolated by initial extraction from 
cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion 

10 chromatography steps. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
(HPLC) can be employed for final purification steps. Microbial cells employed in 
expression of proteins can be disrupted by any convenient method, including freeze-thaw 
cycling, sonication, mechanical disruption, or use of cell lysing agents. 

15 Alternatively, it may be possible to produce the protein in lower eukaryotes such 

as yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains 
include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, 
Candida, or any yeast strain capable of expressing heterologous proteins. Potentially 
suitable bacterial strains include Escherichia coli, Bacillus subtilis, Salmonella 

20 typhimurium, or any bacterial strain capable of expressing heterologous proteins. If the 
protein is made in yeast or bacteria, it may be necessary to modify the protein produced 
therein, for example by phosphorylation or glycosylation of the appropriate sites, in order 
to obtain the functional protein. Such covalent attachments may be accomplished using 
known chemical or enzymatic methods. 

25 In another embodiment of the present invention, cells and tissues may be 

engineered to express an endogenous gene comprising the polynucleotides of the 
invention under the control of inducible regulatory elements, in which case the regulatory 
sequences of the endogenous gene may be replaced by homologous recombination. As 
described herein, gene targeting can be used to replace a gene's existing regulatory region 

30 with a regulatory sequence isolated from a different gene or a novel regulatory sequence 
synthesized by genetic engineering methods. Such regulatory sequences may be 
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comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory 
elements, transcriptional initiation sites, regulatory protein binding sites or combinations 
of said sequences. Alternatively, sequences which affect the structure or stability of the 
RNA or protein produced may be replaced, removed, added, or otherwise modified by 
5 targeting. These sequence include polyadenylation signals, mRNA stability elements, 
splice sites, leader sequences for enhancing or modifying transport or secretion properties 
of the protein, or other sequences which alter or improve the function or stability of 
protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing 

10 the gene under the control of the new regulatory sequence, e.g., inserting a new promoter 
or enhancer or both upstream of a gene. Alternatively, the targeting event may be a 
simple deletion of a regulatory element, such as the deletion of a tissue-specific negative 
regulatory element. Alternatively, the targeting event may replace an existing element; 
for example, a tissue-specific enhancer can be replaced by an enhancer that has broader 

15 or different cell-type specificity than the naturally occurring elements. Here, the 

naturally occurring sequences are deleted and new sequences are added. In all cases, the 
identification of the targeting event may be facilitated by the use of one or more 
selectable marker genes that are contiguous with the targeting DNA, allowing for the 
selection of cells in which the exogenous DNA has integrated into the host cell genome. 

20 The identification of the targeting event may also be facilitated by the use of one or more 
marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to the exogenous DNA, but configured such that the 
negatively selectable marker flanks the targeting sequence, and such that a correct 
homologous recombination event with sequences in the host cell genome does not result 

25 in the stable integration of the negatively selectable marker. Markers useful for this 

purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance 
with this aspect of the invention are more particularly described in U.S. Patent No. 

30 5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et. al.,; International 

Application No. PCT/US92/09627 (WO93/09222) by Selden et. al.,; and International 
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Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al, each of which is 
incorporated by reference herein in its entirety. 

5.6 POLYPEPTIDES OF THE INVENTION 

5 The isolated polypeptides of the invention include, but are not limited to, a 

polypeptide comprising: the amino acid sequences set forth as any one of 748 SEQ ID 
NO: 1-45,196, 752 SEQ ID NO:. 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1- 
128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 
787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ 

10 ID NO: 1-30,533, 791 SEQ ID NO: l-5822, and 792 SEQ ID NO: 1-8502 or an amino 
acid sequence encoded by any one of the nucleotide sequences 748 SEQ ID NO: 1- 
45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 
SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ 
ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 

15 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides 
preferably with biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in 748 SEQ ID NO: 
1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 

20 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 
SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID 
NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or (b) 
polynucleotides encoding any one of the amino acid sequences set forth 748 SEQ ID NO: 
1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 

25 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 
SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID 
NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or (c) 
polynucleotides that hybridize to the complement of the polynucleotides of either (a) or 
(b) under stringent hybridization conditions. The invention also provides biologically 

30 active or immunologically active variants of any of the amino acid sequences set forth as 
748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ 
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ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID 
NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1- 
6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1- 
8502or the corresponding full length or mature protein; and "substantial equivalents" 
5 thereof (e.g., with at least about 65%, at least about 70%, at least about 75%, at least 
about 80%, at least about 85%, at least about 90%, typically at least about 95%, more 
typically at least about 98%, or most typically at least about 99% amino acid identity) 
that retain biological activity. Polypeptides encoded by allelic variants may have a 
similar, increased, or decreased activity compared to polypeptides comprising 748 SEQ 

10 ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: l-94> 779 SEQ ID NO: 
1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1- 
3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 
790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. 

Fragments of the proteins of the present invention which are capable of exhibiting 

1 5 biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for 
example, as described in H. U. Saragovi, et al, Bio/Technology 10, 773-778 (1992) and 
in R. S. McDowell, et al, J. Amer. Chem. Soc. 1 14, 9245-9253 (1992), both of which are 
incorporated herein by reference. Such fragments may be fused to carrier molecules such 

20 as immunoglobulins for many purposes, including increasing the valency of protein 
binding sites. 

The present invention also provides both full-length and mature forms (for 
example, without a signal sequence or precursor sequence) of the disclosed proteins. The 
protein coding sequence is identified in the sequence listing by translation of the 

25 disclosed nucleotide sequences. The mature form of such protein may be obtained by 

expression of a full-length polynucleotide in a suitable mammalian cell or other host cell. 
The sequence of the mature form of the protein is also determinable from the amino acid 
sequence of the full-length form. Where proteins of the present invention are membrane 
bound, soluble forms of the proteins are also provided. In such forms, part or all of the 

30 regions causing the proteins to be membrane bound are deleted so that the proteins are 
fully secreted from the cell in which they are expressed. 
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Protein compositions of the present invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutical ly acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the 
nucleic acid fragments of the present invention or by degenerate variants of the nucleic 
5 acid fragments of the present invention. By "degenerate variant" is intended nucleotide 
fragments which differ from a nucleic acid fragment of the present invention (e.g., an 
ORF) by nucleotide sequence but, due to the degeneracy of the genetic code, encode an 
identical polypeptide sequence. Preferred nucleic acid fragments of the present invention 
are the ORFs that encode proteins. 

10 A variety of methodologies known in the art can be utilized to obtain any one of 

the isolated polypeptides or proteins of the present invention. At the simplest level, the 
amino acid sequence can be synthesized using commercially available peptide 
synthesizers. The synthetically-constructed protein sequences, by virtue of sharing 
primary, secondary or tertiary structural and/or conformational characteristics with 

1 5 proteins may possess biological properties in common therewith, including protein 

activity. This technique is particularly useful in producing small peptides and fragments 
of larger polypeptides. Fragments are useful, for example, in generating antibodies 
against the native polypeptide. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 

20 compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be 
purified from cells which have been altered to express the desired polypeptide or protein. 
As used herein, a cell is said to be altered to express a desired polypeptide or protein 
when the cell, through genetic manipulation, is made to produce a polypeptide or protein 

25 which it normally does not produce or which the cell normally produces at a lower level. 
One skilled in the art can readily adapt procedures for introducing and expressing either 
recombinant or synthetic sequences into eukaryotic or prokaryotic cells in order to 
generate a cell which produces one of the polypeptides or proteins of the present 
invention. 

30 The invention also relates to methods for producing a polypeptide comprising 

growing a culture of host cells of the invention in a suitable culture medium, and 
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purifying the protein from the cells or the culture in which the cells are grown. For 
example, the methods of the invention include a process for producing a polypeptide in 
which a host cell containing a suitable expression vector that includes a polynucleotide of 
the invention is cultured under conditions that allow expression of the encoded 
5 polypeptide. The polypeptide can be recovered from the culture, conveniently from the 
culture medium, or from a lysate prepared from the host cells and further purified. 
Preferred embodiments include those in which the protein produced by such process is a 
full length or mature form of the protein. 1 

In an alternative method, the polypeptide or protein is purified from bacterial cells 

10 . which naturally produce the polypeptide or protein. One skilled in the art can readily 
follow known methods for isolating polypeptides and proteins in order to obtain one of 
the isolated polypeptides or proteins of the present invention. These include, but are not 
limited to, immunochromatography, HPLC, size-exclusion chromatography, 
ion-exchange chromatography, and immuno-affinity chromatography. See, e.g., Scopes, 

15 Protein Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., 
in Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in 
Molecular Biology. Polypeptide fragments that retain biological/immunological activity 
include fragments comprising greater than about 100 amino acids, or greater than about 
200 amino acids, and fragments that encode specific protein domains. 

20 The purified polypeptides can be used in in vitro binding assays which are well 

known in the art to identify molecules which bind to the polypeptides. These molecules 
include but are not limited to, for e.g., small molecules, molecules from combinatorial 
libraries, antibodies or other proteins. The molecules identified in the binding assay are 
then tested for antagonist or agonist activity in in vivo tissue culture or animal models 

25 that are well known in the art. In brief, the molecules are titrated into a plurality of cell 
cultures or animals and then tested for either cell/animal death or prolonged survival of 
the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the 
peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds 
30 that are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or 
other cell by the specificity of the binding molecule for 748 SEQ ID NO: 1-45,196, 752 
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SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 
1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 
3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 
791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502. 
5 The protein of the invention may also be expressed as a product of transgenic 

animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

The proteins provided herein also include proteins characterized by amino acid 

10 sequences similar to those of purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, modifications, in the peptide or DNA 
sequence, can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 

1 5 example, one or more of the cysteine residues may be deleted or replaced with another 
amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., U.S. Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. Regions of the protein that 

20 are important for the protein function can be determined by various methods known in 
the art including the alanine-scanning method which involved systematic substitution of 
single or strings of amino acids with alanine, followed by testing the resulting 
alanine-containing variant for biological activity. This type of analysis determines the 
importance of the substituted amino acid(s) in biological activity. Regions of the protein 

25 that are important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be 
expected to retain protein activity in whole or in part and are useful for screening or other 
immunological methodologies may also be easily made by those skilled in the art given 
the disclosures herein. Such modifications are encompassed by the present invention. 

30 The protein may also be produced by operably linking the isolated polynucleotide 

of the invention to suitable control sequences in one or more insect expression vectors, 
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and employing an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
e.g., Invitrogen, San Diego, Calif, U.S.A. (the MaxBat™ kit), and such methods are well 
known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
5 Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
under culture conditions suitable to express the recombinant protein. The resulting 

10 expressed protein may then be purified from such culture {i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 
containing agents which wilLbind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl™ (TOSOH Biosep LLC, 

15 Montgomeryville, PA) or Cibacrom blue 3GA Sepharose™ (Pharmacia, Piscataway, NJ); 
one or more steps involving hydrophobic interaction chromatography using such resins as 
phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which 
will facilitate purification. For example, it may be expressed as a fusion protein, such as 

20 those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
(TRX), or as a His tag. Kits for expression and purification of such fusion proteins are 
commercially available from New England BioLab (Beverly, Mass.), Pharmacia 
(Piscataway, N.J.) and Invitrogen, respectively. The protein can also be tagged with an 
epitope and subsequently purified by using a specific antibody directed to such epitope. 

25 One such epitope ("FLAG®") is commercially available from Kodak (New Haven, 
Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
30 all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
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purified is substantially free of other mammalian proteins and is defined in accordance 
with the present invention as an "isolated protein." 

The polypeptides of the invention include analogs (variants). This embraces 
fragments, as well as peptides in which one or more amino acids has been deleted, 
5 inserted, or substituted. Also, analogs of the polypeptides of the invention embrace 

fusions of the polypeptides or modifications of the polypeptides of the invention, wherein 
the polypeptide or analog is fused to another moiety or moieties, e.g., targeting moiety or 
another therapeutic agent. Such analogs may exhibit improved properties such as activity 
and/or stability. Examples of moieties which may be fused to the polypeptide or an 

10 analog include, for example, targeting moieties which provide for the delivery of 

polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, antibodies to immune 
cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well as receptor 
and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for 

15 example, immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 
antibodies and steroids. Also, polypeptides may be fused to immune modulators, and 
other cytokines such as alpha or beta interferon. 

5.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE 
20 IDENTITY AND SIMILARITY 

Preferred identity and/or similarity aire designed to give the largest match between 
the sequences tested. Methods to determine identity and similarity are codified in 
computer programs including, but are not limited to, the GCG program package, 
including GAP (Devereux, J., et aL, Nucleic Acids Research 12(1):387 (1984); Genetics 

25 Computer Group, University of Wisconsin, Madison, WI), BLASTP, BLASTN, 
BLASTX, FASTA (Altschul, S.F. etaL, J. Molec. Biol. 215:403-410 (1990), PSI- 
BLAST (Altschul S.F. et al, Nucleic Acids Res. vol. 25, pp. 3389-3402, herein 
incorporated by reference), eMatrix software (Wu et al, J. Comp. Biol., Vol. 6, pp. 219- 
235 (1999), herein incorporated by reference), eMotif software (Nevill-Manning et al, 

30 ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnhammer et al, Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein 
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incorporated by reference) and the Kyte-Doolittle hydrophobocity prediction algorithm 
(J. Mol Biol, 157, pp. 105-31 (1982), incorporated herein by reference). The BLAST 
programs are publicly available from the National Center for Biotechnology Information 
(NCBI) and other sources (BLAST Manual, Altschul, S., et. aL, NCB NLM NIH 
5 Bethesda, MD 20894; Altschul, S., et al t J. Mol. Biol. 215:403-410 (1990). 

5.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a 
"chimeric protein" or "fusion protein" comprises a polypeptide of the invention 
operatively linked to another polypeptide. Within a fusion protein the polypeptide 

10 according to the invention can correspond to all or a portion of a protein according to the 
invention. In one embodiment, a fusion protein comprises at least one biologically active 
portion of a protein according to the invention. In another embodiment, a fusion protein 
comprises at least two biologically active portions of a protein according to the 
invention. Within the fusion protein, the term "operatively linked" is intended to indicate 

15 that the polypeptide according to the invention and the other polypeptide are fused 

in-frame to each other. The polypeptide can be fused to the N-terminus or C-terminus. 

For example, in one embodiment a fusion protein comprises a polypeptide 
according to the invention operably linked to the extracellular domain of a second 
protein. In another embodiment, the fusion protein is a GST-fusion protein in which the 

20 polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., 
glutathione S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in 
which the polypeptide sequences according to the invention comprises one or more 
domains are fused to sequences derived from a member of the immunoglobulin protein 

25 family. The immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between a ligand and a protein of the invention on the surface of a cell, to thereby 
suppress signal transduction in vivo. The immunoglobulin fusion proteins can be used to 
affect the bioavailability of a cognate ligand. Inhibition of the ligand/protein interaction 

30 may be useful therapeutically for both the treatment of proliferative and differentiative 
disorders, e,g., cancer as well as modulating (e.g., promoting or inhibiting) cell survival. 
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Moreover, the immunoglobulin fusion proteins of the invention can be used as 
immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
to identify molecules that inhibit the interaction of a polypeptide of the invention with a 
ligand. 

5 A chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends 

10 as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and 

enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise to 
complementary overhangs between two consecutive gene fragments that can 

15 subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
example, Ausubel et. al., (eds.) Current Protocols in Molecular Biology, John 
Wiley & Sons, 1992). Moreover, many expression vectors are commercially available 
that already encode a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding 
a polypeptide of the invention can be cloned into such an expression vector such that the 

20 fusion moiety is linked in-frame to the protein of the invention. 

5.8 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of 
normal function of the encoded protein. The invention thus provides gene therapy to 

25 restore normal activity of the polypeptides of the invention; or to treat disease states 
involving polypeptides of the invention. Delivery of a functional gene encoding 
polypeptides of the invention to appropriate cells is effected ex vivo, in situ, or in vivo by 
use of vectors, and more particularly viral vectors {e.g., adenovirus, adeno-associated 
virus, or a retrovirus), or ex vivo by use of physical DNA transfer methods {e.g., 

30 liposomes or chemical treatments). See, for example, Anderson, Nature, supplement to 
vol. 392, no. 6679, pp. 25-20 (1998). For additional reviews of gene therapy technology 
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see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific American: 68-84 
(1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of the 
nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient 
5 expression) or artificial chromosomes (stable expression). Cells may also be cultured ex 
vivo in the presence of proteins of the present invention in order to proliferate or to 
produce a desired effect on or activity in such cells.' Treated cells can then be introduced 
in vivo for therapeutic purposes. Alternatively, it is contemplated that in other human 
disease states, preventing the expression of or inhibiting the activity of polypeptides of 

10 the invention will be useful in treating the disease states. It is contemplated that antisense 
therapy or gene therapy could be applied to negatively regulate the expression of 
polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of 
antisense molecules to the nucleic acids of the present invention, their complements, or their 

1 5 translated RNA sequences, by methods known in the art. Further, the polypeptides of the 
present invention can be inhibited by using targeted deletion methods, or the insertion of a 
negative regulatory element such as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to 
express the polynucleotides of the invention, wherein such polynucleotides are in operative 

20 association with a regulatory sequence heterologous to the host cell which drives expression 
of the polynucleotides in the cell. These methods can be used to increase or decrease the 
expression of the polynucleotides of the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of 
cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be 

25 modified {e.g., by homologous recombination) to provide increased polypeptide expression 
by replacing, in whole or in part, the naturally occurring promoter with all or part of a 
heterologous promoter so that the cells express the protein at higher levels. The heterologous 
promoter is inserted in such a manner that it is operatively linked to the desired protein 
encoding sequences. See, for example, PCT International Publication No. WO 94/12650, 

30 PCT International Publication No. WO 92/20808, and PCT International Publication No. 
WO 91/09955. It is also contemplated that, in addition to heterologous promoter DNA, 
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amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes 
carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron 
DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods 
5 results in co-amplification of the desired protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered 
to express an endogenous gene comprising the polynucleotides of the invention under the 
control of inducible regulatory elements, in which case the regulatory sequences of the 
endogenous gene may be replaced by homologous recombination. As described herein, 

1 0 gene targeting can be used to replace a gene's existing regulatory region with a regulatory 
sequence isolated from a different gene or a novel regulatory sequence synthesized by 
genetic engineering methods. Such regulatory sequences may be comprised of promoters, 
enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional 
, initiation sites, regulatory protein binding sites or combinations of said sequences. 

1 5 Alternatively, sequences which affect the structure or stability of the RNA or protein 
produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader 
sequences for enhancing or modifying transport or secretion properties of the protein, or 
other sequences which alter or improve the function or stability of protein or RNA 

20 molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 

25 element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different 
cell-type specificity than the naturally occurring elements. Here, the naturally occurring 
sequences are deleted and new sequences are added. In all cases, the identification of the 
targeting event may be facilitated by the use of one or more selectable marker genes that are 

30 contiguous with the targeting DNA, allowing for the selection of cells in which the 

exogenous DNA has integrated into the cell genome. The identification of the targeting 
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event may also be facilitated by the use of one or more marker genes exhibiting the property 
of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting 
sequence, and such that a correct homologous recombination event with sequences in the 
5 host cell genome does not result in the stable integration of the negatively selectable marker. 
Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) 
gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

. The gene targeting or gene activation techniques which can be used in accordance 

with this aspect of the invention are more particularly described in U.S. Patent No. 

10 5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et. al.,; International 

Application No. PCT/US92/09627 (WO93/09222) by Selden et. al.,; and International 
Application No. PCT/US90/06436 (WO91/06667) by Skoultchi et al, each of which is 
incorporated by reference herein in its entirety. 

15 5.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed 
or inactivated in the germ line of animals using homologous recombination (Capecchi, 
Science 244:1288-1292 (1989)). Animals in which the gene is over expressed, under the 

20 regulatory control of exogenous or endogenous promoter elements, are known as 
transgenic animals. Animals in which an endogenous gene has been inactivated by 
homologous recombination are referred to as "knockout" animals. Knockout animals, 
preferably non-human mammals, can be prepared as described in U.S. Patent No. 
5,557,032, incorporated herein by reference. Transgenic animals are useful to determine 

25 the roles polypeptides of the invention play in biological processes, and preferably in 
disease states. Transgenic animals are useful as model systems to identify compounds 
that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, 
are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

30 Transgenic animals can be prepared wherein all or part of a promoter of the 

polynucleotides of the invention is either activated or inactivated to alter the level of 


50 


JL O O a H-3 in O 2 id! £» O S 


expression of the polypeptides of the invention. Inactivation can be carried out using 
homologous recombination methods described above. Activation can be achieved by 
supplementing or even replacing the homologous promoter to provide for increased 
protein expression. The homologous promoter can be supplemented by insertion of one 
5 or more heterologous enhancer elements known to confer promoter activation in a 
particular tissue. 

The polynucleotides of the present invention also make possible the development, 
through, e.g., homologous recombination or knock out strategies, of animals that fail to 
express polypeptides of the invention or that express a variant polypeptide. Such animals 

10 are useful as models for studying the in vivo activities of polypeptide as well as for 
studying modulators of the polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed 
or inactivated in the germ line of animals using homologous recombination (Capecchi, 

15 Science 244:1288-1292 (1989)). Animals in which the gene is over expressed, under the 
regulatory control of exogenous or endogenous promoter elements, are known as 
transgenic animals. Animals in which an endogenous gene has been inactivated by 
homologous recombination are referred to as "knockout" animals. Knockout animals, 
preferably non-human mammals, can be prepared as described in U.S. Patent No. 

20 5,557,032, incorporated herein by reference. Transgenic animals are useful to determine 
the roles polypeptides of the invention play in biological processes, and preferably in 
disease states. Transgenic animals are useful as model systems to identify compounds 
that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, 
are produced using methods as described in U.S. Patent No 5,489,743 and PCT 

25 Publication No. W094/28 1 22, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of 
the invention promoter is either activated or inactivated to alter the level of expression of 
the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or 

30 even replacing the homologous promoter to provide for increased protein expression. The 
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homologous promoter can be supplemented by insertion of one or more heterologous 
enhancer elements known to confer promoter activation in a particular tissue. 

5.10 USES AND BIOLOGICAL ACTIVITY 

5 The polynucleotides and proteins of the present invention are expected to exhibit 

one or more of the uses or biological activities (including those associated with assays 
cited herein) identified herein. Uses or activities described for proteins of the present 
invention may be provided by administration or use of such proteins or of 
polynucleotides encoding such proteins (such as, for example, in gene therapies or 

10 vectors suitable for introduction of DNA). The mechanism underlying the particular 
condition or pathology will dictate whether the polypeptides of the invention, the 
polynucleotides of the invention or modulators (activators or inhibitors) thereof would be 
beneficial to the subject in need of treatment. Thus, "therapeutic compositions of the 
invention" include compositions comprising isolated polynucleotides (including 

1 5 recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
polypeptides of the invention (including full length protein, mature protein and 
truncations or domains thereof), or compounds and other substances that modulate the 
overall activity of the target gene products, either at the level of target gene/protein 
expression or target protein activity. Such modulators include polypeptides, analogs, 

20 (variants), including fragments and fusion proteins, antibodies and other binding proteins; 
chemical compounds that directly or indirectly activate or inhibit the polypeptides of the 
invention (identified, e.g., via drug screening assays as described herein); antisense 
polynucleotides and polynucleotides suitable for triple helix formation; and in particular 
antibodies or other binding partners that specifically recognize one or more epitopes of 

25 the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular 
activation or in one of the other physiological pathways described herein. 

5.10.1 RESEARCH USES AND UTILITIES 

30 The polynucleotides provided by the present invention can be used by the 

research community for various purposes. The polynucleotides can be used to express 
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recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either j 
constitutively or at a particular stage of tissue differentiation or development or in disease 
states); as molecular weight markers on gels; as chromosome markers or tags (when 
5 labeled) to identify chromosomes or to map related gene positions; to compare with 

endogenous DNA sequences in patients to identify potential genetic disorders; as probes 
to hybridize and thus discover novel, related DNA sequences; as a source of information 
to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and 

10 making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand 

15 interaction), the polynucleotide can also be used in interaction trap assays (such as, for 
example, that described in Gyuris et aL, Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. 

The polypeptides provided by the present invention can similarly be used in 

20 assays to determine biological activity, including in a panel of multiple proteins for 

high-throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding polypeptide is preferentially expressed (either constitutively or at a 

25 particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Proteins involved in these binding 
interactions can also be used to screen for peptide or small molecule inhibitors or agonists 
of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent 

30 grade or kit format for commercialization as research products. 
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Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, 
Sambrook, J., E. F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: 
5 Guide to Molecular Cloning Techniques", Academic Press, Berger, S. L. and A. R. 
Kimmel eds., 1987. 

5.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as 
1 0 nutritional sources or supplements. Such uses include without limitation use as a protein or 
amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source 
of carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be 
added to the feed of a particular organism or can be administered as a separate solid or liquid 
preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the 
1 5 case of microorganisms, the polypeptide or polynucleotide of the invention can be added to 
the medium in or on which the microorganism is cultured. 

5.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

20 A polypeptide of the present invention may exhibit activity relating to cytokine, 

cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 
inhibiting) activity or may induce production of other cytokines in certain cell 
populations. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Many protein factors discovered to date, including all known cytokines, have 

25 exhibited activity in one or more factor-dependent cell proliferation assays, and hence the 
assays serve as a convenient confirmation of cytokine activity. The activity of therapeutic 
compositions of the present invention is evidenced by any one of a number of routine 
factor dependent cell proliferation assays for cell lines including, without limitation, 32D, 
DA2, DA1G, T10, B9, B9/1 1, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, 

30 Til 65, HT2, CTLL2, TF-1 , Mo7e, CMK, HUVEC, and CaCo. Therapeutic 
compositions of the invention can be used in the following: 
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Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
5 Chapter 7, Immunologic studies in Humans); Takai et al, J. Immunol. 137:3494-3500, 
1986; Bertagnolli et al, J. Immunol. 145:1706-1712, 1990; Bertagnolli et al, Cellular 
Immunology 133:327-341, 1991; Bertagnolli, et al, I. Immunol. 149:3778-3783, 1992; 
Bowman^ al, I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node 

10 cells or thymocytes include, without limitation, those described in: Polyclonal T cell 

stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. 
J. E. e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
Measurement of mouse and human interleukin-y, Schreiber, R. D. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 

15 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current 
Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 

20 Sons, Toronto. 1991; deVries etaL, J. Exp. Med. 173:1205-1211, 1991; Moreau et al, 
Nature 336:690-692, 1988; Greenberger et al, Proc. Natl. Acad. Sci. U.S.A. 
80:2931-2938, 1983; Measurement of mouse and human interleukin 6— Nordan, R. In 
Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley 
and Sons, Toronto. 1991; Smith et al, Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; 

25 Measurement of human Interleukin 1 1— Bennett, F., Giannotti, J., Clark, S. C. and Turner, 
K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John 
Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 
9~Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in 
Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

30 Assays for T-cell clone responses to antigens (which will identify, among others, 

proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
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proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In vitro assays for Mouse Lymphocyte Function; Chapter 
5 6, Cytokines and their cellular receptors; Chapter 7, Immunologic studies in Humans); 
Weinberger et al, Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al, 
Eur. J. Immun. 1 1:405-41 1, 1981; Takai et al, J. Immunol. 137:3494-3500, 1986; Takai 
etal, J. Immunol. 140:508-512, 1988. 

1 0 5.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor 
activity and be involved in the proliferation, differentiation and survival of pluripotent 
and totipotent stem cells including primordial germ cells, embryonic stem cells, 
hematopoietic stem cells and/or germ line stem cells. Administration of the polypeptide 

15 of the invention to stem cells in vivo or ex vivo is expected to maintain and expand cell 
populations in a totipotential or pluripotential state which would be useful for re- 
engineering damaged or diseased tissues, transplantation, manufacture of bio- 
pharmaceuticals and the development of bio-sensors. The ability to produce large 
quantities of human cells has important working applications for the production of human 

20 proteins which currently must be obtained from non-human sources or donors, 
implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other 
neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, 
tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, 
gastrointestinal cells and others; and organs for transplantation such as kidney, liver, 

25 pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or 
cytokines may be administered in combination with the polypeptide of the invention to 
achieve the desired effect, including any of the growth factors listed herein, other stem 
cell maintenance factors, and specifically including stem cell factor (SCF), leukemia 

30 inhibitory factor (LIF), Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble 
IL-6 receptor fused to IL-6, macrophage inflammatory protein 1 -alpha (MIP-1 -alpha), G- 
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CSF, GM-CSF, thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth 
factor (PDGF), neural growth factors and basic fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, 
expansion of these cells in culture will facilitate the production of large quantities of 
5 mature cells. Techniques for culturing stem cells are known in the art and administration 
of polypeptides of the invention, optionally with other growth factors and/or cytokines, is 
expected to enhance the survival and proliferation of the stem cell populations. This can 
be accomplished by direct administration of the polypeptide of the invention to the 
culture medium. Alternatively, stroma cells transfected with a polynucleotide that 

10 encodes for the polypeptide of the invention can be used as a feeder layer for the stem 

cell populations in culture or in vivo. Stromal support cells for feeder layers may include 
embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to 

1 5 induce autocrine expression of the polypeptide of the invention. This will allow for 

generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as 
is or that can then be differentiated into the desired mature cell types. These stable cell 
lines can also serve as a source of undifferentiated totipotential/pluripotential mRNA to 
create cDNA libraries and templates for polymerase chain reaction experiments. These 

20 studies would allow for the isolation and identification of differentially expressed genes 
in stem cell populations that regulate stem cell proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in 
the treatment of many pathological conditions. For example, polypeptides of the present 
invention may be used to manipulate stem cells in culture to give rise to neuroepithelial 

25 cells that can be used to augment or replace cells damaged by illness, autoimmune 

disease, accidental damage or genetic disorders. The polypeptide of the invention may be 
useful for inducing the proliferation of neural cells and for the regeneration of nerve and 
brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and 
neuropathies, as well as mechanical and traumatic disorders which involve degeneration, 

30 death or trauma to neural cells or nerve tissue. In addition, the expanded stem cell 
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populations can also be genetically altered for gene therapy purposes and to decrease host 
rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also 
be manipulated to achieve controlled differentiation of the stem cells into more 
5 differentiated cell types. A broadly applicable method of obtaining pure populations of a 
specific differentiated cell type from undifferentiated stem cell populations involves the 
use of a cell-type specific promoter driving a selectable marker. The selectable marker 
allows only cells of the desired type to survive. For example, stem cells can be induced 
to differentiate into cardiomyocytes (Wobus et al, Differentiation, 48: 173-182, (1991); 

10 Klug et al, J. Clin. Invest, 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. 
W. In: Principles of Tissue Engineering eds. Lanza et al, Academic Press (1997)). 
Alternatively, directed differentiation of stem cells can be accomplished by culturing the 
stem cells in the presence of a differentiation factor such as retinoic acid and an 
antagonist of the polypeptide of the invention which would inhibit the effects of 

15" endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
invention exhibits stem cell growth factor activity. Stem cells are isolated from any one 
of various cell sources (including hematopoietic stem cells and embryonic stem cells) and 
cultured on a feeder layer, as described by Thompson et. al., Proc. Natl. Acad. Sci, 

20 . U.S.A., 92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone 
or in combination with other growth factors or cytokines. The ability of the polypeptide 
of the invention to induce stem cells proliferation is determined by colony formation on 
semi-solid support e.g. as described by Bernstein Blood, 77: 2316-2321 (1991). 

25 5*10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of 
hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. 
Even marginal biological activity in support of colony forming cells or of 
factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g. in 
30 supporting the growth and proliferation of erythroid progenitor cells alone or in 

combination with other cytokines, thereby indicating utility, for example, in treating 
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various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the 
production of erythroid precursors and/or erythroid cells; in supporting the growth and 
proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., 
traditional CSF activity) useful, for example, in conjunction with chemotherapy to 
5 prevent or treat consequent myelo-suppression; in supporting the growth and proliferation 
of megakaryocytes and consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as thrombocytopenia, and generally for use in 
place of or complimentary to platelet transfusions; and/or in supporting the growth and 
proliferation of hematopoietic stem cells which are capable of maturing to any and all of 

10 the above-mentioned hematopoietic cells and therefore find therapeutic utility in various 
stem cell disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or 
ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 

1 5 progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines 
are cited above. 

20 Assays for embryonic stem cell differentiation (which will identify, among others, 

proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et. al., Cellular Biology 15:141-151, 1995; 
Keller et al.. Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al.. 
Blood 81:2903-2915, 1993. 

25 Assays for stem cell survival and differentiation (which will identify, among 

others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of 
Hematopoietic Cells. R. I. Freshney, et. al., eds. Vol pp. 265-268, Wiley-Liss, Inc., New 
York, N.Y. 1994; Hirayama et al.. Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; 

30 Primitive hematopoietic colony forming cells with high proliferative potential, McNiece, 
I. K. and Briddell, R. A. In Culture of Hematopoietic Cells. R. I. Freshney, et. al., eds. 
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Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et al.. Experimental 
Hematology 22:353-359, 1994; Cobblestone area forming cell assay, Ploemacher, R. E. 
In Culture of Hematopoietic Cells. R. I. Freshney, et. al., eds. Vol pp. 1-21, Wiley-Liss, 
Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of stromal 
5 cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
Freshney, et. al., eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long 
term culture initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. 
I. Freshney, et. al., eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

10 5.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing 
and tissue repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone 

15 growth in circumstances where bone is not normally formed, has application in the 

healing of bone fractures and cartilage damage or defects in humans and other animals. 
Compositions of a polypeptide, antibody, binding partner, or other modulator of the 
invention may have prophylactic use in closed as well as open fracture reduction and also 
in the improved fixation of artificial joints. De novo bone formation induced by an 

20 osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic 
resection induced craniofacial defects, and also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming 
cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors 
of bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative 

25 disorders, or periodontal disease, such as through stimulation of bone and/or cartilage 
repair or by blocking inflammation or processes of tissue destruction (collagenase 
activity, osteoclast activity, etc.) mediated by inflammatory processes may also be 
possible using the composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide 

30 of the present invention is tendon/ligament formation. Induction of tendon/ligament-like 
tissue or other tissue formation in circumstances where such tissue is not normally 
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formed, has application in the healing of tendon or ligament tears, deformities and other 
tendon or ligament defects in humans and other animals. Such a preparation employing a 
tendon/ligament-like tissue inducing protein may have prophylactic use in preventing 
damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or 
5 ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. 
De novo tendon/ligament-like tissue formation induced by a composition of the present 
invention contributes to the repair of congenital, trauma induced, or other tendon or 
ligament defects of other origin, and is also useful in cosmetic plastic surgery for 
attachment or repair of tendons or ligaments. The compositions of the present invention 

10 may provide environment to attract tendon- or ligament-forming cells, stimulate growth 
of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or 
, ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo 
for return in vivo to effect tissue repair. The compositions of the invention may also be 
useful in the treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament 

15 defects. The compositions may also include an appropriate matrix and/or sequestering 
agent as a carrier as is well known in the art. 

The compositions of the present invention may also be useful for proliferation of 
neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and 

20 traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a composition may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 

25 syndrome. Further conditions which may be treated in accordance with the present 

invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a composition 
of the invention. 
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Compositions of the invention may also be useful to promote better or faster 
closure of non-healing wounds, including without limitation pressure ulcers, ulcers 
associated with vascular insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
5 regeneration of other tissues, such as organs (including, for example, pancreas, liver, 
intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 
(including vascular endothelium) tissue, or for promoting the growth of cells comprising 
such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic 
scarring may allow normal tissue to regenerate. A polypeptide of the present invention 
10 may also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
and conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or 
15 inhibiting differentiation of tissues described above from precursor tissues or cells; or for 
inhibiting the growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described 
in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); 
20 International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), 
Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. 
25 Invest. Dermatol 71 :382-84 (1978). 

5.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 
30 are described herein. A polynucleotide of the invention can encode a polypeptide 

exhibiting such activities. A protein may be useful in the treatment of various immune 
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deficiencies and disorders (including severe combined immunodeficiency (SOD)), e.g., 
in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as 
effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or 
5 fungal infections, or may result from autoimmune disorders. More specifically, infectious 
diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, herpes 
viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
. as candidiasis. Of course, in this regard, proteins of the present invention may also be 

10 useful where a boost to the immune system generally may be desirable, i.e., in the 
treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 

1 5 Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia gravis, graft- versus-host disease and autoimmune inflammatory eye disease. 
Such a protein (or antagonists thereof, including antibodies) of the present invention may 
also to be useful in the treatment of allergic reactions and conditions (e.g., anaphylaxis, 
serum sickness, drug reactions, food allergies, insect venom allergies, mastocytosis, 

20 allergic rhinitis, hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic 

dermatitis, allergic contact dermatitis, erythema multiforme, Stevens-Johnson syndrome, 
allergic conjunctivitis, atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant 
papillary conjunctivitis and contact allergies), such as asthma (particularly allergic 
asthma) or other respiratory problems. Other conditions, in which immune suppression is 

25 desired (including, for example, organ transplantation), may also be treatable using a 
protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo 
animals models such as the cumulative contact enhancement test (Lastbom et al.. 
Toxicology 125: 59-66, 1998), skin prick test (Hoffmann et al.. Allergy 54: 446-54, 

30 1999), guinea pig skin sensitization test (Vohr et al.. Arch. Toxocol. 73: 501-9), and 

murine local lymph node assay (Kimber et al.. J. Toxicol. Environ. Health 53: 563-79). 
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Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or 
blocking an immune response already in progress or may involve preventing the 
induction of an immune response. The functions of activated T cells may be inhibited by 
5 suppressing T cell responses or by inducing specific tolerance in T cells, or both. 
Immunosuppression of T cell responses is generally an active, non-antigen-specific, 
process which requires continuous exposure of the T cells to the suppressive agent. 
Tolerance, which involves inducing non-responsiveness or anergy in T cells, is 
distinguishable from immunosuppression in that it is generally antigen-specific and 

10 persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be r 
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the 
absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing 

15 high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 
transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that 

20 destroys the transplant. The administration of a therapeutic composition of the invention 
may prevent cytokine synthesis by immune cells, such as T cells, and thus acts as an 
immunosuppressant. Moreover, a lack of costimulation may also be sufficient to anergize 
the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance by B 
lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 

25 of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a 

subject, it may also be necessary to block the function of a combination of B lymphocyte 
antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
30 humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
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used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al.. Science 257:789-792 (1992) and Turka et aL. Proc. Natl. 
Acad. Sci USA, 89:1 1 102-1 1 105 (1992). In addition, murine models of GVHD (see Paul 
ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used 
5 to determine the effect of therapeutic compositions of the invention on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
activation of T cells that are reactive against self tissue and which promote the production 

10 of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate disease symptoms. 
Administration of reagents which block stimulation of T cells can be used to inhibit T 
cell activation and prevent production of autoantibodies or T cell-derived cytokines 
which may be involved in the disease process. Additionally, blocking reagents may 

1 5 induce antigen-specific tolerance of autoreactive T cells which could lead to long-term 
relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal 
models of human autoimmune diseases. Examples include murine experimental 
autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB 

20 hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and 
BB rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental 
Immunology, Raven Press, New York, 1989, pp. 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a 
means of up regulating immune responses, may also be useful in therapy. Upregulation of 

25 immune responses may be in the form of enhancing an existing immune response or 

eliciting an initial immune response. For example, enhancing an immune response may 
be useful in cases of viral infection, including systemic viral diseases such as influenza, 
the common cold, and encephalitis. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 

30 by removing T cells from the patient, costimulating the T cells in vitro with viral 

antigen-pulsed APCs either expressing a peptide of the present invention or together with 
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a stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 
acid encoding a protein of the present invention as described herein such that the cells 
5 express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient. The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation 
signal to T cells to induce a T cell mediated immune response against the transfected 

10 tumor cells. In addition, tumor cells which lack MHC class I or MHC class II molecules, 
or which fail to reexpress sufficient mounts of MHC class I or MHC class II molecules, 
can be transfected with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I alpha chain protein and P2 
microglobulin protein or an MHC class II alpha chain protein and an MHC class II beta 

1 5 chain protein to thereby express MHC class I or MHC class II proteins on the cell 
surface. Expression of the appropriate class I or class II MHC in conjunction with a 
peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a 
T cell mediated immune response against the transfected tumor cell. Optionally, a gene 
encoding an antisense construct which blocks expression of an MHC class II associated 

20 protein, such as the invariant chain, can also be cotransfected with a DNA encoding a 
peptide having the activity of a B lymphocyte antigen to promote presentation of tumor 
associated antigens and induce tumor specific immunity. Thus, the induction of a T cell 
mediated immune response in a human subject may be sufficient to overcome 
tumor-specific tolerance in the subject. 

25 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 

30 Associates and Wiley- Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al.. Proc. 
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Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et al.. J. Immunol. 128:1968-1974, 
1982; Handa et al.. J. Immunol. 135:1564-1572, 1985; Takai et al.. I. Immunol. 
137:3494-3500, 1986; Takai et al.. J. Immunol. 140:508-512, 1988; Bowman et al.. J. 
Virology 61:1992-1998; Bertagnolli et al.. Cellular Immunology 133:327-341, 1991; 
5 Brown et al.. J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In 
10 vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, 
Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without limitation, 

15 those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. 
Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In vitro assays for Mouse Lymphocyte 
Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al.. J. Immunol. 
137:3494-3500, 1986; Takai et al.. J. Immunol. 140:508-512, 1988; Bertagnolli et al.. J. 

20 Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
described in: Guery et al.. J. Immunol. 134:536-544, 1995; Inaba et al.. Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al.. Journal of Immunology 

25 154:5071-5079, 1995; Porgador et al.. Journal of Experimental Medicine 182:255-260, 
1995; Nair et al.. Journal of Virology 67:4062-4069, 1993; Huang et al.. Science 
264:961-965, 1994; Macatonia et al.. Journal of Experimental Medicine 169:1255-1264, 
1989; Bhardwaj et al.. Journal of Clinical Investigation 94:797-807, 1994; and Inaba et 
al.. Journal of Experimental Medicine 172:631-640, 1990. 

30 Assays for lymphocyte survival/apoptosis (which will identify, among others, 

proteins that prevent apoptosis after superantigen induction and proteins that regulate 
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lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 
et al.. Cytometry 13:795-808, 1992; Gorczyca et al.. Leukemia 7:659-670, 1993; 
Gorczyca et al.. Cancer Research 53:1945-1951, 1993; Itoh et al.. Cell 66:233-243, 1991; 
Zacharchuk, Journal of Immunology 145:4037-4045, 1990; Zamai et al.. Cytometry 
5 14:891-897, 1993; Gorczyca et al.. International Journal of Oncology 1:639-648, 1992. 
Assays for proteins that influence early steps of T-cell commitment and 
development include, without limitation, those described in: Antica et al.. Blood 
84:1 1 1-1 17, 1994; Fine et al.. Cellular Immunology 155:1 1 1-122, 1994; Galy et al.. 
Blood 85:2770-2778, 1995; Toki et al.. Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 

10 

5.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 

1 5 stimulating hormone (FSH), while activins and are characterized by their ability to 

stimulate the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the 
present invention, alone or in heterodimers with a member of the inhibin family, may be 
useful as a contraceptive based on the ability of inhibins to decrease fertility in female 
mammals and decrease spermatogenesis in male mammals. Administration of sufficient 

20 amounts of other inhibins can induce infertility in these mammals. Alternatively, the 
polypeptide of the invention, as a homodimer or as a heterodimer with other protein 
subunits of the inhibin group, may be useful as a fertility inducing therapeutic, based 
upon the ability of activin molecules in stimulating FSH release from cells of the anterior 
pituitary. See, for example, U.S. Pat. No. 4,798,885. A polypeptide of the invention may 

25 also be useful for advancement of the onset of fertility in sexually immature mammals, so 
as to increase the lifetime reproductive performance of domestic animals such as, but not 
limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be 
measured by the following methods. 

30 Assays for activin/inhibin activity include, without limitation, those described in: 

Vale et al.. Endocrinology 91 :562-572, 1972; Ling et al.. Nature 321 :779-782, 1986; Vale 
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et al.. Nature 321:776-779, 1986; Mason et al.. Nature 318:659-663, 1985; Forage et al.. 
Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986. 

5.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

5 A polypeptide of the present invention may be involved in chemotactic or 

chemokinetic activity for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 
cells. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Chemotactic and chemokinetic receptor activation can be used to mobilize or 

10 attract a desired cell population to a desired site of action. Chemotactic or chemokinetic 
compositions (e.g. proteins, antibodies, binding partners, or modulators of the invention) 
provide particular advantages in treatment of wounds and other trauma to tissues, as well 
. as in treatment of localized infections. For example, attraction of lymphocytes, 
monocytes or neutrophils to tumors or sites of infection may result in improved immune 

1 5 responses against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 

20 of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or 
prevent chemotaxis) consist of assays that measure the ability of a protein to induce the 

25 migration of cells across a membrane as well as the ability of a protein to induce the 

adhesion of one cell population to another cell population. Suitable assays for movement 
and adhesion include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. 
Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, 

30 Measurement of alpha and beta Chemokines 6.12.1-6.12.28; Taub et. al., J. Clin. Invest. 
95:1370-1376, 1995; Lind et. al., APMIS 103:140-146, 1995; Muller et al Eur. J. 
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Immunol. 25:1744-1748; Gruber et. aL, J. of Immunol. 152:5860-5867, 1994; Johnston 
et. aL, J. of Immunol. 153:1762-1768, 1994. 

5.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

5 A polypeptide of the invention may also be involved in hemostatis or 

thrombolysis or thrombosis. A polynucleotide of the invention can encode a polypeptide 
exhibiting such attributes. Compositions may be useful in treatment of various 
coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance 
coagulation and other hemostatic events in treating wounds resulting from trauma, 

10 surgery or other causes. A composition of the invention may also be useful for dissolving 
or inhibiting formation of thromboses and for treatment and prevention of conditions 
resulting therefrom (such as, for example, infarction of cardiac and central nervous 
system vessels {e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 

1 5 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et aL. J. Clin. Pharmacol. 26:131-140, 1986; Burdick et aL. 
Thrombosis Res. 45:413-419, 1987; Humphrey et aL. Fibrinolysis 5:71-79 (1991); 
Schaub, Prostaglandins 35:467-474, 1988. 

20 5.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, 
proliferation or metastasis. Detection of the presence or amount of polynucleotides or 
polypeptides of the invention may be useful for the diagnosis and/or prognosis of one or 
more types of cancer. For example, the presence or increased expression of a 

25 polynucleotide/polypeptide of the invention may indicate a hereditary risk of cancer, a 
precancerous condition, or an ongoing malignancy. Conversely, a defect in the gene or 
absence of the polypeptide may be associated with a cancer condition. Identification of 
single nucleotide polymorphisms associated with cancer or a predisposition to cancer 
may also be useful for diagnosis or prognosis. 

30 Cancer treatments promote tumor regression by inhibiting tumor cell 

proliferation, inhibiting angiogenesis (growth of new blood vessels that is necessary to 
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support tumor growth) and/or prohibiting metastasis by reducing tumor cell motility or 
invasiveness. Therapeutic compositions of the invention may be effective in adult and 
pediatric oncology including in solid phase tumors/malignancies, locally advanced 
tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic metastases, 
5 blood cell malignancies including multiple myeloma, acute and chronic leukemias, and 
lymphomas, head and neck cancers including mouth cancer, larynx cancer and thyroid 
cancer, lung cancers including small cell carcinoma and non-small cell cancers, breast 
cancers including small cell carcinoma and ductal carcinoma, gastrointestinal cancers 
including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 

10 associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers 
including bladder cancer and prostate cancer, malignancies of the female genital tract 
including ovarian carcinoma, uterine (including endometrial) cancers, and solid tumor in 
the ovarian follicle, kidney cancers including renal cell carcinoma, brain cancers 
including intrinsic brain tumors, neuroblastoma, astrocytic brain tumors, gliomas, 

1 5 metastatic tumor cell invasion in the central nervous system, bone cancers including 

osteomas, skin cancers including malignant melanoma, tumor progression of human skin 
keratinocytes, squamous cell carcinoma, basal cell carcinoma, hemangiopericytoma and 
Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention 
20 (including inhibitors and stimulators of the biological activity of the polypeptide of the 
invention) may be administered to treat cancer. Therapeutic compositions can be 
administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 
therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of 
25 tumor growth, inhibiting metastasis, or otherwise improving overall clinical condition, 
without necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as 
a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the 
polypeptide or modulator of the invention with one or more anti-cancer drugs in addition 
30 to a pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as 
a cancer treatment is routine. Anti-cancer drugs that are well known in the art and can be 
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used as a treatment in combination with the polypeptide or modulator of the invention 
include: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, 
Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, 
Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HC1, 
5 Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (VI 6-2 13), Floxuridine, 5- 
Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon 
Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, 
Mesna, Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, 

10 Procarbazine HC1, Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine 
sulfate, Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, 
Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for 
prophylactic treatment of cancer. There are hereditary conditions and/or environmental 

1 5 situations (e.g. exposure to carcinogens) known in the art that predispose an individual to 
developing cancers. Under these circumstances, it may be beneficial to treat these 
individuals with therapeutically effective doses of the polypeptide of the invention to 
reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of 

20 the invention as a potential cancer treatment. These in vitro models include proliferation 
assays of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, 
(1987) Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, 
NY Ch 18 and Ch 21), tumor systems in nude mice as described in Giovanella et al.. J. 
Natl. Can. Inst., 52: 921-30 (1974), mobility and invasive potential of tumor cells in 

25 Boyden Chamber assays as described in Pilkington et al.. Anticancer Res., 17: 4107-9 
(1997), and angiogenesis assays such as induction of vascularization of the chick 
chorioallantoic membrane or induction of vascular endothelial cell migration as described 
in Ribatta et al.. Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al.. Clin. Exp. 
Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, e.g. 

30 from American Type Tissue Culture Collection catalogs. 
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5.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide 
of the invention can encode a polypeptide exhibiting such characteristics. Examples of 
5 such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 

10 development of cellular and humoral immune responses. Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

15 The activity of a polypeptide of the invention may, among other means, be 

measured by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

20 Wiley- Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 

conditions 7.28.1- 7.28.22), Takai et al.. Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Bierer et al.. J. Exp. Med. 168:1 145-1 156, 1988; Rosenstein et al.. J. Exp. Med. 
169:149-160 1989; Stoltenborg et al.. J. Immunol. Methods 175:59-68, 1994; Stitt et al.. 
Cell 80:661-670, 1995. 

25 By way of example, the polypeptides of the invention may be used as a receptor 

for a ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may 
be identified through binding assays, affinity chromatography, dihybrid screening assays, 
BIAcore assays, gel overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists 

30 or a partial antagonist require the use of other proteins as competing ligands. The 
polypeptides of the present invention or ligand(s) thereof may be labeled by being 
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coupled to radioisotopes, colorimetric molecules or a toxin molecules by conventional 
methods. ("Guide to Protein Purification" Murray P. Deutscher (ed) Methods hv 
Enzymology Vol. 182 (1990) Academic Press, Inc. San Diego). Examples of 
radioisotopes include, but are not limited to, tritium and carbon- 14 . Examples of 
5 colorimetric molecules include, but are not limited to, fluorescent molecules such as 
fluorescamine, or rhodamine or other colorimetric molecules. Examples of toxins 
include, but are not limited, to ricin. 

5,10.13 DRUG SCREENING 

. 10 This invention is particularly useful for screening chemical compounds by using 

the novel polypeptides or binding fragments thereof in any of a variety of drug screening 
techniques. The polypeptides or fragments employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 

1 5 transformed with recombinant nucleic acids expressing the polypeptide or a fragment 

thereof. Drugs are screened against such transformed cells in competitive binding assays. 
Such cells, either in viable or fixed form, can be used for standard binding assays. One 
may measure, for example, the formation of complexes between polypeptides of the 
invention or fragments and the agent being tested or examine the diminution in complex 

20 formation between the novel polypeptides and an appropriate cell line, which are well 
known in the art. 

Sources for test compounds that may be screened for ability to bind to or 
modulate (i.e., increase or decrease) the activity of polypeptides of the invention include 
(1) inorganic and organic chemical libraries, (2) natural product libraries, and (3) 

25 combinatorial libraries comprised of either random or mimetic peptides, oligonucleotides 
or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of 
commercial sources, and may include structural analogs of known compounds or 
compounds that are identified as "hits" or "leads" via natural product screening. 

30 The sources of natural product libraries are microorganisms (including bacteria 

and fungi), animals, plants or other vegetation, or marine organisms, and libraries of 
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mixtures for screening may be created by: (1) fermentation and extraction of broths from 
soil, plant or marine microorganisms or (2) extraction of the organisms themselves. 
Natural product libraries include polyketides, non-ribosomal peptides, and (non-naturally 
occurring) variants thereof For a review, see Science 252:63-68 (1998). 
5 Combinatorial libraries are composed of large numbers of peptides, 

oligonucleotides or organic compounds and can be readily prepared by traditional 
automated synthesis methods, PCR, cloning or proprietary synthetic methods. Of 
particular interest are peptide and oligonucleotide combinatorial libraries. Still other 
libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic 

10 collection, recombinatorial, and polypeptide libraries. For a review of combinatorial 

chemistry and libraries created therefrom, see Myers, Curr. Opin. Biotechnol 8:701-707 
(1997). For reviews and examples of peptidomimetic libraries, see Al-Obeidi et al.. Mol. 
Biotechnol, 9(3):205-23 (1998); Hruby et al.. Curr Opin Chem Biol, 1(1): 1 14-1 9 (1997); 
Dorner et al.. Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

1 5 Identification of modulators through use of the various libraries described herein 

permits modification of the candidate "hit" (or "lead") to optimize the capacity of the 
"hit" to bind a polypeptide of the invention. The molecules identified in the binding assay 
are then tested for antagonist or agonist activity in in vivo tissue culture or animal models 
that are well known in the art. In brief, the molecules are titrated into a plurality of cell 

20 cultures or animals and then tested for either cell/animal death or prolonged survival of 
the animal/cells. 

The binding molecules thus identified may be complexed with toxins, e.g., ricin 
or cholera, or with other compounds that are toxic to cells such as radioisotopes. The 
toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity 
25 of the binding molecule for a polypeptide of the invention. Alternatively, the binding 
molecules may be complexed with imaging agents for targeting and imaging purposes. 

5,10.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide 
30 e.g. a ligand or a receptor. The art provides numerous assays particularly useful for 
identifying previously unknown binding partners for receptor polypeptides of the 
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invention. For example, expression cloning using mammalian or bacterial cells, or 
dihybrid screening assays can be used to identify polynucleotides encoding binding 
partners. As another example, affinity chromatography with the appropriate immobilized 
polypeptide of the invention can be used to isolate polypeptides that recognize and bind 
5 polypeptides of the invention. There are a number of different libraries used for the 
identification of compounds, and in particular small molecules, that modulate (i.e., 
increase or decrease) biological activity of a polypeptide of the invention. Ligands for 
receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical 

10 except for the expression of the receptor of the invention: one cell population expresses 
the receptor of the invention whereas the other does not. The response of the two cell 
populations to the addition of ligands(s) are then compared. Alternatively, an expression 
library can be co-expressed with the polypeptide of the invention in cells and assayed for 
an autocrine response to identify potential ligand(s). As still another example, BIAcore 

1 5 assays, gel overlay assays, or other methods known in the art can be used to identify 

binding partner polypeptides, including, (1) organic and inorganic chemical libraries, (2) 
natural product libraries, and (3) combinatorial libraries comprised of random peptides, 
oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade 

20 of the polypeptide of the invention can be determined. For example, a chimeric protein in 
which the cytoplasmic domain of the polypeptide of the invention is fused to the 
extracellular portion of a protein, whose ligand has been identified, is produced in a host 
cell. The cell is then incubated with the ligand specific for the extracellular portion of the 
chimeric protein, thereby activating the chimeric receptor. Known downstream proteins 

25 involved in intracellular signaling can then be assayed for expected modifications i.e. 
phosphorylation. Other methods known to those in the art can also be used to identify 
signaling molecules involved in receptor activity. 

5.10.15 ANTI-INFLAMMATORY ACTIVITY 

30 Compositions of the present invention may also exhibit anti-inflammatory 

activity. The anti-inflammatory activity may be achieved by providing a stimulus to cells 
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involved in the inflammatory response, by inhibiting or promoting cell-cell interactions 
(such as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells 
involved in the inflammatory process, inhibiting or promoting cell extravasation, or by 
stimulating or suppressing production of other factors which more directly inhibit or 
5 promote an inflammatory response. Compositions with such activities can be used to treat 
inflammatory conditions including chronic or acute conditions), including without 
limitation intimation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 

10 chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 
from over production of cytokines such as TNF or IL-1. Compositions of the invention 
may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or 
material. Compositions of this invention may be utilized to prevent or treat conditions 
such as, but not limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced 

1 5 shock, rheumatoid arthritis, chronic inflammatory arthritis, pancreatic cell damage from 
diabetes mellitus type 1 , graft versus host disease, inflammatory bowel disease, 
inflamation associated with pulmonary disease, other autoimmune disease or 
inflammatory disease, an antiproliferative agent such as for acute or chronic mylegenous 
leukemia or in the prevention of premature labor secondary to intrauterine infections. 

20 

5.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of 
a therapeutic that promotes or inhibits function of the polynucleotides and/or 
polypeptides of the invention. Such leukemias and related disorders include but are not 
25 limited to acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, 
myeloblasts, promyelocyte, myelomonocytic, monocytic, erythroleukemia, chronic 
leukemia, chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia 
(for a review of such disorders, see Fishman et al.. 1985, Medicine, 2d Ed., J.B. 
Lippincott Co., Philadelphia). 

30 
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5.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication 
5 of therapeutic utility, include but are not limited to nervous system injuries, and diseases 
or disorders which result in either a disconnection of axons, a diminution or degeneration 
of neurons, or demyelination. Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according to the invention 
include but are not limited to the following lesions of either the central (including spinal 
10 cord, brain) or peripheral nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated 
with surgery, for example, lesions which sever a portion of the nervous system, or 
compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous 
15 system results in neuronal injury or death, including cerebral infarction or ischemia, or 

spinal cord infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by 
human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme 

20 disease, tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed 
or injured as a result of a degenerative process including but not limited to degeneration 
associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or 
amyotrophic lateral sclerosis; 

25 (v) lesions associated with nutritional diseases or disorders, in which a portion 

of the nervous system is destroyed or injured by a nutritional disorder or disorder of 
metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, 
Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary 
degeneration of the corpus callosum), and alcoholic cerebellar degeneration; 
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(vi) neurological lesions associated with systemic diseases including but not 
limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
5 neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is 
destroyed or injured by a demyelinating disease including but not limited to multiple 
sclerosis, human immunodeficiency virus-associated myelopathy, transverse myelopathy 
or various etiologies, progressive multifocal leukoencephalopathy, and central pontine 

10 myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a 
nervous system disorder may be selected by testing for biological activity in promoting 
the survival or differentiation of neurons. For example, and not by way of limitation, 
therapeutics which elicit any of the following effects may be useful according to the 

1 5 invention: 


(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, 
e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 


Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the 
method set forth in Arakawa et. al., (1990, J. Neurosci. 10:3507-3515); increased 
sprouting of neurons may be detected by methods set forth in Pestronk et. al., (1980, Exp. 

25 Neurol. 70:65-82) or Brown et. al., (1981, Ann. Rev. Neurosci. 4:17-42); increased 
production of neuron-associated molecules may be measured by bioassay, enzymatic 
assay, antibody binding, Northern blot assay, etc., depending on the molecule to be 
measured; and motor neuron dysfunction may be measured by assessing the physical 
manifestation of motor neuron disorder, e.g., weakness, motor neuron conduction 

30 velocity, or functional disability. 


(i) 


increased survival time of neurons in culture; 


20 


(iv) decreased symptoms of neuron dysfunction in vivo. 
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In specific embodiments, motor neuron disorders that may be treated according to 
the invention include but are not limited to disorders such as infarction, infection, 
exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that may 
affect motor neurons as well as other components of the nervous system, as well as 
5 disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and 
including but not limited to progressive spinal muscular atrophy, progressive bulbar 
palsy, primary lateral sclerosis, infantile and juvenile muscular atrophy, progressive 
bulbar paralysis of childhood (Fazio-Londe syndrome), poliomyelitis and the post polio 
syndrome, and Hereditary Motorsensory Neuropathy (Charcot-Marie-Tooth Disease). 

10 

5.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following 
additional activities or effects: inhibiting the growth, infection or function of, or killing, 
infectious agents, including, without limitation, bacteria, viruses, fungi and other 

1 5 parasites; effecting (suppressing or enhancing) bodily characteristics, including, without 
limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue 
pigmentation, or organ or body part size or shape (such as, for example, breast 
augmentation or diminution, change in bone form or shape); effecting biorhythms or 
circadian cycles or rhythms; effecting the fertility of male or female subjects; effecting 

20 the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of 
dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other nutritional 
factors or component(s); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or 

25 other pain reducing effects; promoting differentiation and growth of embryonic stem cells 
in lineages other than hematopoietic lineages; hormonal or endocrine activity; in the case 
of enzymes, correcting deficiencies of the enzyme and treating deficiency-related 
diseases; treatment of hyperproliferative disorders (such as, for example, psoriasis); 
immunoglobulin-like activity (such as, for example, the ability to bind antigens or 

30 complement); and the ability to act as an antigen in a vaccine composition to raise an 
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immune response against such protein or another material or entity which is 
cross-reactive with such protein. 

5.10.19 IDENTIFICATION OF POLYMORPHISMS 

5 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for 
diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 
predisposition or susceptibility to various disease states (such as disorders involving 
inflammation or immune response) or a differential response to drug administration, and 

10 this genetic information can be used to tailor preventive or therapeutic treatment 
appropriately. For example, the existence of a polymorphism associated with a 
predisposition to inflammation or autoimmune disease makes possible the diagnosis of 
this condition in humans by identifying the presence of the polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 

1 5 generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence 
of the polymorphism in the DNA. For example, PCR may be used to amplify an 
appropriate fragment of genomic DNA which may then be sequenced. Alternatively, the 
DNA may be subjected to allele-specific oligonucleotide hybridization (in which 

20 appropriate oligonucleotides are hybridized to the DNA under conditions permitting 

detection of a single base mismatch) or to a single nucleotide extension assay (in which 
an oligonucleotide that hybridizes immediately adjacent to the position of the 
polymorphism is extended with one or more labeled nucleotides). In addition, traditional 
restriction fragment length polymorphism analysis (using restriction enzymes that 

25 provide differential digestion of the genomic DNA depending on the presence or absence 
of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences 
of the present invention. In the alternative, any one of the nucleotide sequences of the 

30 present invention can be placed on the array to detect changes from those sequences. 
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Alternatively a polymorphism resulting in a change in the amino acid sequence 
could also be detected by detecting a corresponding change in amino acid sequence of the 
protein, e.g., by an antibody specific to the variant sequence. 

5 5.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against 
rheumatoid arthritis is determined in an experimental animal model system. The 
experimental model system is adjuvant induced arthritis in rats, and the protocol is 
described by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al.. 1963, < 

10 Int. Arch. Allergy Appl. Immunol., 23:129. Induction of the disease can be caused by a 
single injection, generally intradermally, of a suspension of killed Mycobacterium 
tuberculosis in complete Freund's adjuvant (CFA). The route of injection can vary, but 
rats may be injected at the base of the tail with an adjuvant mixture. The polypeptide is 
administered in phosphate buffered solution (PBS) at a dose of about 1-5 mg/kg. The 

1 5 control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of 
intradermally injecting killed Mycobacterium tuberculosis in CFA followed by 
immediately administering the test compound and subsequent treatment every other day 
until day 24. At 14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an 

20 overall arthritis score may be obtained as described by J. Holoskitz above. An analysis of 
the data would reveal that the test compound would have a dramatic affect on the 
swelling of the joints as measured by a decrease of the arthritis score. 

5.11 THERAPEUTIC METHODS 

25 The compositions (including polypeptide fragments, analogs, variants and 

antibodies or other binding partners or modulators including antisense polynucleotides) 
of the invention have numerous applications in a variety of therapeutic methods. 
Examples of therapeutic applications include, but are not limited to, those exemplified 
herein. 
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5.11-1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of 
the polypeptides or other composition of the invention to individuals affected by a 
disease or disorder that can be modulated by regulating the peptides of the invention. 
5 While the mode of administration is not particularly important, parenteral administration 
is preferred. An exemplary mode of administration is to deliver an intravenous bolus. 
The dosage of the polypeptides or other composition of the invention will normally be 
determined by the prescribing physician. It is to be expected that the dosage will vary 
according to the age, weight, condition and response of the individual patient. Typically, 

10 the amount of polypeptide administered per dose will be in the range of about 0.01|ag/kg 
to 100 mg/kg of body weight, with the preferred dose being about 0.1(j.g/kg to 10 mg/kg 
of patient body weight. For parenteral administration, polypeptides of the invention will 
be formulated in an injectable form combined with a pharmaceutically acceptable 
parenteral vehicle. Such vehicles are well known in the art and examples include water, 

1 5 saline, Ringer's solution, dextrose solution, and solutions consisting of small amounts of 
the human serum albumin. The vehicle may contain minor amounts of additives that 
maintain the isotonicity and stability of the polypeptide or other active ingredient: The 
preparation of such solutions is within the skill of the art. 

20 5.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 

ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 
derived, including without limitation from recombinant and non-recombinant sources and 

25 including antibodies and other binding partners of the polypeptides of the invention) may 
be administered to a patient in need, by itself, or in pharmaceutical compositions where it 
is mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of 
disorders. Such a composition may optionally contain (in addition to protein or other 
active ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and 

30 other materials well known in the art. The term "pharmaceutically acceptable" means a 
non-toxic material that does not interfere with the effectiveness of the biological activity 
of the active ingredient(s). The characteristics of the carrier will depend on the route of 
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administration. The pharmaceutical composition of the invention may also contain 
cytokines, lymphokines, or other hematopoietic factors such as M-CSF, GM-CSF, TNF, 
IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, IL-13, IL-14, 
IL-1 5, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell factor, 
5 and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These 
agents include various growth factors such as epidermal growth factor (EGF), 
platelet-derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-p), 
insulin-like growth factor (IGF), as well as cytokines described herein. 

10 The pharmaceutical composition may further contain other agents which either 

enhance the activity of the protein or other active ingredient or complement its activity or 
use in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein or other active 
ingredient of the invention, or to minimize side effects. Conversely, protein or other 

1 5 active ingredient of the present invention may be included in formulations of the 

particular clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic 
or anti-thrombotic factor, or anti- inflammatory agent to minimize side effects of the 
clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or 
anti-thrombotic factor, or anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, IL-1 Hy2, 

20 anti-TNF, corticosteroids, immunosuppressive agents). A protein of the present 

invention may be active in multimers (e.g., heterodimers or homodimers) or complexes 
with itself or other proteins. As a result, pharmaceutical compositions of the invention 
may comprise a protein of the invention in such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the 

25 invention including a first protein, a second protein or a therapeutic agent may be 

concurrently administered with the first protein (e.g., at the same time, or at differing 
times provided that therapeutic concentrations of the combination of agents is achieved at 
the treatment site). Techniques for formulation and administration of the compounds of 
the instant application may be found in "Remington's Pharmaceutical Sciences," Mack 

30 Publishing Co., Easton, PA, 18 th edition. A therapeutically effective dose further refers to 
that amount of the compound sufficient to result in amelioration of symptoms, e.g., 


84 


;.l O O 846 * O H id! fe O 


treatment, healing, prevention or amelioration of the relevant medical condition, or an 
increase in rate of treatment, healing, prevention or amelioration of such conditions. 
When applied to an individual active ingredient, administered alone, a therapeutically 
effective dose refers to that ingredient alone. When applied to a combination, a 
5 therapeutically effective dose refers to combined amounts of the active ingredients that 
result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein or other active ingredient of the present 

10 , invention is administered to a mammal having a condition to be treated. Protein or other 
active ingredient of the present invention may be administered in accordance with the 
method of the invention either alone or in combination with other therapies such as 
treatments employing cytokines, lymphokines or other hematopoietic factors. When co- 
administered with one or more cytokines, lymphokines or other hematopoietic factors, 

15 protein or other active ingredient of the present invention may be administered either 
simultaneously with the cytokine(s), lymphokine(s), other hematopoietic factor(s), 
thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, the 
attending physician will decide on the appropriate sequence of administering protein or 
other active ingredient of the present invention in combination with cytokine(s), 

20 lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors. 

5.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, or intestinal administration; parenteral delivery, including intramuscular, 

25 subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of 
protein or other active ingredient of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 

30 cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. Intravenous 
administration to the patient is preferred. 
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Alternately, one may administer the compound in a local rather than systemic 
manner, for example, via injection of the compound directly into a arthritic joints or in 
fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the 
scarring process frequently occurring as complication of glaucoma surgery, the 
5 compounds may be administered topically, for example, as eye drops. Furthermore, one 
may administer the drug in a targeted drug delivery system, for example, in a liposome 
coated with a specific antibody, targeting, for example, arthritic or fibrotic tissue. The 
liposomes will be targeted to and taken up selectively by the afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an 

10 effective dosage to the desired site of action. The determination of a suitable route of 
administration and an effective dosage for a particular indication is within the level of 
skill in the art. Preferably for wound treatment, one administers the therapeutic 
compound directly to the site. Suitable dosage ranges for the polypeptides of the 
invention can be extrapolated from these dosages or from similar studies in appropriate 

15 animal models. Dosages can then be adjusted as necessary by the clinician to provide 
maximal therapeutic benefit. 

5.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention 
20 thus may be formulated in a conventional manner using one or more physiologically 

acceptable carriers comprising excipients and auxiliaries which facilitate processing of 
the active compounds into preparations which can be used pharmaceutically. These 
pharmaceutical compositions may be manufactured in a manner that is itself known, e.g., 
by means of conventional mixing, dissolving, granulating, dragee-making, levigating, 
25 emulsifying, encapsulating, entrapping or lyophilizing processes. Proper formulation is 
dependent upon the route of administration chosen. When a therapeutically effective 
amount of protein or other active ingredient of the present invention is administered 
orally, protein or other active ingredient of the present invention will be in the form of a 
tablet, capsule, powder, solution or elixir. When administered in tablet form, the 
30 pharmaceutical composition of the invention may additionally contain a solid carrier such 
as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% 
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protein or other active ingredient of the present invention, and preferably from about 25 
to 90% protein or other active ingredient of the present invention. When administered in 
liquid form, a liquid carrier such as water, petroleum, oils of animal or plant origin such 
as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The 
5 liquid form of the pharmaceutical composition may further contain physiological saline 
solution, dextrose or other saccharide solution, or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol. When administered in liquid form, the 
pharmaceutical composition contains from about 0.5 to 90% by weight of protein or other 
active ingredient of the present invention, and preferably from about 1 to 50% protein or 

1 0 other active ingredient of the present invention. 

When a therapeutically effective amount of protein or other active ingredient of 
the present invention is administered by intravenous, cutaneous or subcutaneous 
injection, protein or other active ingredient of the present invention will be in the form of 
a pyrogen-free, parenterally acceptable aqueous solution. The preparation of such 

15 parenterally acceptable protein or other active ingredient solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein or other active ingredient of the present invention, an 
isotonic vehicle such as sodium chloride injection, Ringer's injection, dextrose injection, 

20 dextrose and sodium chloride injection, lactated Ringer's injection, or other vehicle as 
known in the art. The pharmaceutical composition of the present invention may also 
contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those 
of skill in the art. For injection, the agents of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks's solution, 

25 Ringer's solution, or physiological saline buffer. For transmucosal administration, 

penetrants appropriate to the barrier to be permeated are used in the formulation. Such 
penetrants are generally known in the art. 

For oral administration, the compounds can be formulated readily by combining 
the active compounds with pharmaceutically acceptable carriers well known in the art. 

30 Such carriers enable the compounds of the invention to be formulated as tablets, pills, 
dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral 
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ingestion by a patient to be treated. Pharmaceutical preparations for oral use can be 
obtained from a solid excipient, optionally grinding a resulting mixture, and processing 
the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or 
dragee cores. Suitable excipients are, in particular, fillers such as sugars, including 
5 lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize 
starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, 
hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium 

10 alginate. Dragee cores are provided with suitable coatings. For this purpose, 

concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, 
polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may 
be added to the tablets or dragee coatings for identification or to characterize different 

1 5 combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as 
glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture 
with filler such as lactose, binders such as starches, and/or lubricants such as talc or 

20 magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds 
may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for 
oral administration should be in dosages suitable for such administration. For buccal 
administration, the compositions may take the form of tablets or lozenges formulated in 

25 conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 

30 dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and cartridges 
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of, e.g., gelatin for use in an inhaler or insufflator may be formulated containing a powder 
mix of the compound and a suitable powder base such as lactose or starch. The 
compounds may be formulated for parenteral administration by injection, e.g., by bolus 
injection or continuous infusion. Formulations for injection may be presented in unit 
5 dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. 
The compositions may take such forms as suspensions, solutions or emulsions in oily or 
aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing 
and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous 

10 solutions of the active compounds in water-soluble form. Additionally, suspensions of 
. . the active compounds may be prepared as appropriate oily injection suspensions. 

Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension, such 

1 5 as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may 
also contain suitable stabilizers or agents which increase the solubility of the compounds 
to allow for the preparation of highly concentrated solutions. Alternatively, the active 
ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile 
pyrogen-free water, before use. 

20 The compounds may also be formulated in rectal compositions such as 

suppositories or retention enemas, e.g., containing conventional suppository bases such as 
cocoa butter or other glycerides. In addition to the formulations described previously, the 
compounds may also be formulated as a depot preparation. Such long acting 
formulations may be administered by implantation (for example subcutaneously or 

25 intramuscularly) or by intramuscular injection. Thus, for example, the compounds may 
be formulated with suitable polymeric or hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, 
for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co- 

30 solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible 
organic polymer, and an aqueous phase. The co-solvent system may be the VPD 
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co-solvent system. VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar 
surfactant polysorbate 80, and 65% w/v polyethylene glycol 300, made up to volume in 
absolute ethanol. The VPD co-solvent system (VPD:5W) consists of VPD diluted 1 : 1 
with a 5% dextrose in water solution. This co-solvent system dissolves hydrophobic 
5 compounds well, and itself produces low toxicity upon systemic administration. 

Naturally, the proportions of a co-solvent system may be varied considerably without 
destroying its solubility and toxicity characteristics. Furthermore, the identity of the 
co-solvent components may be varied: for example, other low-toxicity nonpolar 
surfactants may be used instead of polysorbate 80; the fraction size of polyethylene 

10* glycol may be varied; other biocompatible polymers may replace polyethylene glycol, 
e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for 
. . dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical 

compounds may be employed. Liposomes and emulsions are well known examples of 
delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents such as 

15 dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
. Various types of sustained-release materials have been established and are well known by 
those skilled in the art. Sustained-release capsules may, depending on their chemical 

20 nature, release the compounds for a few weeks up to over 100 days. Depending on the 
chemical nature and the biological stability of the therapeutic reagent, additional 
strategies for protein or other active ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited 

25 to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, 
gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the 
invention may be provided as salts with pharmaceutically compatible counter ions. Such 
pharmaceutically acceptable base addition salts are those salts which retain the biological 
effectiveness and properties of the free acids and which are obtained by reaction with 

30 inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, 
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trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, 
potassium benzoate, triethanol amine and the like. 

The pharmaceutical composition of the invention may be in the form of a 
complex of the protein(s) or other active ingredient(s) of present invention along with 
5 protein or peptide antigens. The protein and/or peptide antigen will deliver a stimulatory 
signal to both B and T lymphocytes. B lymphocytes will respond to antigen through their 
surface immunoglobulin receptor. T lymphocytes will respond to antigen through the T 
cell receptor (TCR) following presentation of the antigen by MHC proteins. MHC and 
structurally related proteins including those encoded by class I and class II MHC genes 

10 on host cells will serve to present the peptide antigen(s) to T lymphocytes. The antigen 
components could also be supplied as purified MHC-peptide complexes alone or with 
co-stimulatory molecules that can directly signal T cells. Alternatively antibodies able to 
bind surface immunoglobulin and other molecules on B cells as well as antibodies able to 
bind the TCR and other molecules on T cells can be combined with the pharmaceutical - 

1 5 composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a 
j liposome in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 

20 in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, 
and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 
4,737,323, all of which are incorporated herein by reference. 

25 The amount of protein or other active ingredient of the present invention in the 

pharmaceutical composition of the present invention will depend upon the nature and 
severity of the condition being treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician will decide the amount of 
protein or other active ingredient of the present invention with which to treat each 

30 individual patient. Initially, the attending physician will administer low doses of protein 
or other active ingredient of the present invention and observe the patient's response. 
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Larger doses of protein or other active ingredient of the present invention may be 
administered until the optimal therapeutic effect is obtained for the patient, and at that 
point the dosage is not increased further. It is contemplated that the various 
pharmaceutical compositions used to practice the method of the present invention should 
5 contain about 0.01 |ig to about 100 mg (preferably about 0.1 (ig to about 10 mg, more 
preferably about 0.1 |j.g to about 1 mg) of protein or other active ingredient of the present 
invention per kg body weight. For compositions of the present invention which are 
useful for bone, cartilage, tendon or ligament regeneration, the therapeutic method 
includes administering the composition topically, systematically, or locally as an implant 

10 or device. When administered, the therapeutic composition for use in this invention is, of 
course, in a pyrogen-free, physiologically acceptable form. Further, the composition may . 
desirably be encapsulated or injected in a viscous form for delivery to the site of bone, 
cartilage or tissue damage. Topical administration may be suitable for wound healing 
and tissue repair. Therapeutically useful agents other than a protein or other active 

15 ingredient of the invention which may also optionally be included in the composition as 
described above, may alternatively or additionally, be administered simultaneously or 
sequentially with the composition in the methods of the invention. Preferably for bone 
and/or cartilage formation, the composition would include a matrix capable of delivering 
the protein-containing or other active ingredient-containing composition to the site of 

20 bone and/or cartilage damage, providing a structure for the developing bone and cartilage 
and optimally capable of being resorbed into the body. Such matrices may be formed of 
materials presently in use for other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 

25 application of the compositions will define the appropriate formulation. Potential 

matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically 
well-defined, such as bone or dermal collagen. Further matrices are comprised of pure 

30 proteins or extracellular matrix components. Other potential matrices are 

nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
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aluminates, or other ceramics. Matrices may be comprised of combinations of any of the 
above mentioned types of material, such as polylactic acid and hydroxyapatite or 
collagen and tricalcium phosphate. The bioceramics may be altered in composition, such 
as in calcium-aluminate-phosphate and processing to alter pore size, particle size, particle 
5 shape, and biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of 
lactic acid and glycolic acid in the form of porous particles having diameters ranging 
from 150 to 800 microns. In some applications, it will be useful to utilize a sequestering 
agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the protein 
compositions from disassociating from the matrix. 
10 A preferred family of sequestering agents is cellulosic materials such as 

alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 

hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being 
cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents 

15 include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, 
carboxyvinyl polymer and poly(vinyl alcohol). The amount of sequestering agent useful 
herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which 
represents the amount necessary to prevent desorption of the protein from the polymer 
matrix and to provide appropriate handling of the composition, yet not so much that the 

20 progenitor cells are prevented from infiltrating the matrix, thereby providing the protein 
the opportunity to assist the osteogenic activity of the progenitor cells. In further 
compositions, proteins or other active ingredients of the invention may be combined with 
other agents beneficial to the treatment of the bone and/or cartilage defect, wound, or 
tissue in question. These agents include various growth factors such as epidermal growth 

25 factor (EGF), platelet derived growth factor (PDGF), transforming growth factors 
(TGF-oc and TGF-(3), and insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins or other active ingredients 

30 of the present invention. The dosage regimen of a protein-containing pharmaceutical 
composition to be used in tissue regeneration will be determined by the attending 


93 


:i O O H M-fin 4-2S . O K iS £. O Si : 


physician considering various factors which modify the action of the proteins, e.g., 
amount of tissue weight desired to be formed, the site of damage, the condition of the 
damaged tissue, the size of a wound, type of damaged tissue (e.g., bone), the patient's 
age, sex, and diet, the severity of any infection, time of administration and other clinical 
5 factors. The dosage may vary with the type of matrix used in the reconstitution and with 
inclusion of other proteins in the pharmaceutical composition. For example, the addition 
of other known growth factors, such as IGF I (insulin like growth factor I), to the final 
composition, may also effect the dosage. Progress can be monitored by periodic 
assessment of tissue/bone growth and/or repair, for example, X-rays, histomorphometric 

10 determinations and tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, 

15 .without limitation, in the form of viral vectors or naked DNA). Cells may also be 

cultured ex vivo in the presence of proteins of the present invention in order to proliferate 
or to produce a desired effect on or activity in such cells. Treated cells can then be 
introduced in vivo for therapeutic purposes. 

20 5.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve its intended purpose. More specifically, a therapeutically effective amount 
means an amount effective to prevent development of or to alleviate the existing 

25 symptoms of the subject being treated. Determination of the effective amount is well 
within the capability of those skilled in the art, especially in light of the detailed 
disclosure provided herein. For any compound used in the method of the invention, the 
therapeutically effective dose can be estimated initially from appropriate in vitro assays. 
For example, a dose can be formulated in animal models to achieve a circulating 

30 concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a 
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circulating concentration range that includes the IC50 as determined in cell culture (i.e., 
the concentration of the test compound which achieves a half-maximal inhibition of the 
protein's biological activity). Such information can be used to more accurately determine 
useful doses in humans. 

5 A therapeutically effective dose refers to that amount of the compound that results 

in amelioration of symptoms or a prolongation of survival in a patient. Toxicity and 
therapeutic efficacy of such compounds can be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the 
dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 

10 50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio between LD 50 and ED50- 
Compounds which exhibit high therapeutic indices are preferred. The data obtained from 
these cell culture assays and animal studies can be used in formulating a range of dosage 
for use in human. The dosage of such compounds lies preferably within a range of 

1 5 circulating concentrations that include the ED50 with little or no toxicity. The dosage 
may vary within this range depending upon the dosage form employed and the route of 
administration utilized. The exact formulation, route of administration and dosage can be 
chosen by the individual physician in view of the patient's condition. See, e.g., Fingl et 
al.. 1975, in M The Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount 

20 and interval may be adjusted individually to provide plasma levels of the active moiety 
which are sufficient to maintain the desired effects, or minimal effective concentration 
(MEC). The MEC will vary for each compound but can be estimated from in vitro data. 
Dosages necessary to achieve the MEC will depend on individual characteristics and 
route of administration. However, HPLC assays or bioassays can be used to determine 

25 plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should 
be administered using a regimen which maintains plasma levels above the MEC for 
10-90% of the time, preferably between 30-90% and most preferably between 50-90%. 
In cases of local administration or selective uptake, the effective local concentration of 

30 the drug may not be related to plasma concentration. 
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An exemplary dosage regimen for polypeptides or other compositions of the 
invention will be in the range of about 0.01 fig/kg to 100 mg/kg of body weight daily, 
with the preferred dose being about 0.1 ng/kg to 25 mg/kg of patient body weight daily, 
varying in adults and children. Dosing may be once daily, or equivalent doses may be 
5 delivered at longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the 
subject being treated, on the subjects age and weight, the severity of the affliction, the 
manner of administration and the judgment of the prescribing physician. 


The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient. The 
pack may, for example, comprise metal or plastic foil, such as a blister pack. The pack or 
dispenser device may be accompanied by instructions for administration. Compositions 
15 comprising a compound of the invention formulated in a compatible pharmaceutical 
carrier may also be prepared, placed in an appropriate container, and labeled for 
treatment of an indicated condition. 


of the invention. The term "antibody" as used herein refers to immunoglobulin molecules 
and immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules 
that contain an antigen binding site that specifically binds (immunoreacts with) an 
antigen. Such antibodies include, but are not limited to, polyclonal, monoclonal, 

25 chimeric, single chain, F a b, F a t>' and F( a b')2 fragments, and an F a b expression library. In 
general, an antibody molecule obtained from humans relates to any of the classes IgG, 
IgM, IgA, IgE and IgD, which differ from one another by the nature of the heavy chain 
present in the molecule. Certain classes have subclasses as well, such as IgGi, IgG2, and 
others. Furthermore, in humans, the light chain may be a kappa chain or a lambda chain. 

30 Reference herein to antibodies includes a reference to all such classes, subclasses and 
types of human antibody species. 


10 


5.12.4 PACKAGING 


5.13 ANTIBODIES 


20 


Also included in the invention are antibodies to proteins, or fragments of proteins 
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An isolated related protein of the invention may be intended to serve as an 
antigen, or a portion or fragment thereof, and additionally can be used as an immunogen 
to generate antibodies that immunospecifically bind the antigen, using standard 
techniques for polyclonal and monoclonal antibody preparation. The full-length protein 
5 can be used or, alternatively, the invention provides antigenic peptide fragments of the 
antigen for use as immunogens. An antigenic peptide fragment comprises at least 6 
amino acid residues of the amino acid sequence of the full length protein, such as an 
amino acid sequence encoded by 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 
778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID 

10 NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1- 
14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, 
and 792 SEQ ID NO: 1-8502, and encompasses an epitope thereof such that an antibody 
raised against the peptide forms a specific immune complex with the full length protein 
or with any fragment that contains the epitope. Preferably, the antigenic peptide 

15 comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at least 
20 amino acid residues, or at least 30 amino acid residues. Preferred epitopes 
encompassed by the antigenic peptide are regions of the protein that are located on its 
surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 

20 antigenic peptide is a region of -related protein that is located on the surface of the 
protein, e.g., a hydrophilic region. A hydrophobicity analysis of the human related 
protein sequence will indicate which regions of a related protein are particularly 
hydrophilic and, therefore, are likely to encode surface residues useful for targeting 
antibody production. As a means for targeting antibody production, hydropathy plots 

25 showing regions of hydrophilicity and hydrophobicity may be generated by any method 
well known in the art, including, for example, the Kyte Doolittle or the Hopp Woods 
methods, either with or without Fourier transformation. See, e.g., Hopp and Woods, 
1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. Mol Biol 
157: 105-142, each of which is incorporated herein by reference in its entirety. 

30 Antibodies that are specific for one or more domains within an antigenic protein, or 
derivatives, fragments, analogs or homologs thereof, are also provided herein. 
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A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of 
5 polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs homologs or orthologs thereof (see, for example, 
Antibodies: A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference). Some of 
these antibodies are discussed below. 

10 

5.13.1 Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by one or more injections with 
the native protein, a synthetic variant thereof, or a derivative of the foregoing. An 

1 5 appropriate immunogenic preparation can contain, for example, the naturally occurring 
immunogenic protein, a chemically synthesized polypeptide representing the 
immunogenic protein, or a recombinantly expressed immunogenic protein. Furthermore, 
the protein may be conjugated to a second protein known to be immunogenic in the 
mammal being immunized. Examples of such immunogenic proteins include but are not 

20 limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and 
soybean trypsin inhibitor. The preparation can further include an adjuvant. Various 
adjuvants used to increase the immunological response include, but are not limited to, 
Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

25 dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin (BCG) 
and Corynebacterium parvum, or similar immunostimulatory agents. Additional 
examples of adjuvants which can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can 

30 be isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
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primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
5 . Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 
8 (April 1 7, 2000), pp. 25-28). 

5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", 

10 as used herein, refers to a population of antibody molecules that contain only one 

molecular species of antibody molecule consisting of a unique light chain gene product 
and a unique heavy chain gene product. In particular, the complementarity determining 
regions (CDRs) of the monoclonal antibody are identical in all the molecules of the 
population. MAbs thus contain an antigen binding site capable of immunoreacting with a 

1 5 particular epitope of the antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature , 256 :495 (1975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing 

20 antibodies that will specifically bind to the immunizing agent. Alternatively, the 

lymphocytes can be immunized in vitro. The immunizing agent will typically include the 
protein antigen, a fragment thereof or a fusion protein thereof. Generally, either 
peripheral blood lymphocytes are used if cells of human origin are desired, or spleen cells 
or lymph node cells are used if non-human mammalian sources are desired. The 

25 lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, 
such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: 
Principles and Practice , Academic Press, (1986) pp. 59-103). Immortalized cell lines are 
usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma 

30 cells can be cultured in a suitable culture medium that preferably contains one or more 
substances that inhibit the growth or survival of the unfused, immortalized cells. For 
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example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
include hypoxanthine, aminopterin, and thymidine ( M HAT medium"), which substances 
prevent the growth of HGPRT-deficient cells. 
5 Preferred immortalized cell lines are those that fuse efficiently, support stable 

high level expression of antibody by the selected antibody-producing cells, and are 
sensitive to a medium such as HAT medium. More preferred immortalized cell lines are 
murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell 
Distribution Center, San Diego, California and the American Type Culture Collection, 

10 Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also 
have been described for the production of human monoclonal antibodies (Kozbor, J. 
Immunol., 133 :3001 (1984); Brodeur et al.. Monoclonal Antibody Production Techniques 
and Applications , Marcel Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be 

1 5 assayed for the presence of monoclonal antibodies directed against the antigen. 

Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma 
cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art. The binding affinity of the monoclonal 

20 antibody can, for example, be determined by the Scatchard analysis of Munson and 

Pollard, Anal. Biochem., 107 :220 (1980). Preferably, antibodies having a high degree of 
specificity and a high binding affinity for the target antigen are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods. Suitable culture media for 

25 this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 
medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 
The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 

30 electrophoresis, dialysis, or affinity chromatography. 
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The monoclonal antibodies can also be made by recombinant DNA methods, such 
as those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal 
antibodies of the invention can be readily isolated and sequenced using conventional 
procedures (e.g., by using oligonucleotide probes that are capable of binding specifically 
5 to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells 
of the invention serve as a preferred source of such DNA. Once isolated, the DNA can 
be placed into expression vectors, which are then transfected into host cells such as 
simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not 
otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 

10 antibodies in the recombinant host cells. The DNA also can be modified, for example, by 
substituting the coding sequence for human heavy and light chain constant domains in 
place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 
368, 812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all 
. or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non- 

1 5 immunoglobulin polypeptide can be substituted for the constant domains of an antibody 
of the invention, or can be substituted for the variable domains of one antigen-combining 
site of an antibody of the invention to create a chimeric bivalent antibody. 

5.13.2 HUMANIZED ANTIBODIES 

20 The antibodies directed against the protein antigens of the invention can further 

comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 
the administered immunoglobulin. Humanized forms of antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab 1 , 

25 F(ab')2 or other antigen-binding subsequences of antibodies) that are principally 

comprised of the sequence of a human immunoglobulin, and contain minimal sequence 
derived from a non-human immunoglobulin. Humanization can be performed following 
the method of Winter and co-workers (Jones et al.. Nature , 321 :522-525 (1986); 
Riechmann et al.. Nature, 332:323-327 (1988); Verhoeyen et al.. Science , 239:1534-1536 

30 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences 
of a human antibody. (See also U.S. Patent No. 5,225,539.) In some instances, Fv 
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framework residues of the human immunoglobulin are replaced by corresponding non- 
human residues. Humanized antibodies can also comprise residues which are found 
neither in the recipient antibody nor in the imported CDR or framework sequences. In 
general, the humanized antibody will comprise substantially all of at least one, and 
5 typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the 
framework regions are those of a human immunoglobulin consensus sequence. The 
humanized antibody optimally also will comprise at least a portion of an immunoglobulin 
constant region (Fc), typically that of a human immunoglobulin (Jones et al.. 1986; 
10 Riechmann et al.. 1988; and Presta, Curr. Op. Struct. Biol.. 2:593-596 (1992)). 


5.13.3 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the 
entire sequences of both the light chain and the heavy chain, including the CDRs, arise 

15 from human genes. Such antibodies are termed "human antibodies", or "fully human 
antibodies" herein. Human monoclonal antibodies can be prepared by the trioma 
technique; the human B-cell hybridoma technique (see Kozbor, et al.. 1983 Immunol 
Today 4: 72) and the EBV hybridoma technique to produce human monoclonal 
antibodies (see Cole, et al.. 1985 In: Monoclonal Antibodies and Cancer Therapy, 

20 Alan R. Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the 

practice of the present invention and may be produced by using human hybridomas (see 
Cote, et al.. 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming human 
B-cells with Epstein Barr Virus in vitro (see Cole, et al.. 1985 In: Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

25 In addition, human antibodies can also be produced using additional techniques, 

including phage display libraries (Hoogenboom and Winter, J. MoL Biol. , 227 :381 
(1991); Marks et al.. J. Mol. Biol. , 222:581 (1991)). Similarly, human antibodies can be 
made by introducing human immunoglobulin loci into transgenic animals, e.g., mice in 
which the endogenous immunoglobulin genes have been partially or completely 

30 inactivated. Upon challenge, human antibody production is observed, which closely 
resembles that seen in humans in all respects, including gene rearrangement, assembly, 
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and antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 
5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et. al., 
(Bio/Technology 10, 779-783 (1992)); Lonberg et. al., (Nature 368 856-859 (1994)); 
Morrison ("Nature 368 , 812-13 (1994)); Fishwild et al,( Nature Biotechnology 14, 845-51 
5 (1 996)); Neuberger (Nature Biotechnology 14, 826 (1 996)); and Lonberg and Huszar 
( Intern. Rev. Immunol. 13 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman 
animals which are modified so as to produce fully human antibodies rather than the 
animal's endogenous antibodies in response to challenge by an antigen. (See PCT 

10 publication WO94/02602). The endogenous genes encoding the heavy and light 

immunoglobulin chains in the nonhuman host have been incapacitated, and active loci 
encoding human heavy and light chain immunoglobulins are inserted into the host's 
genome. The human genes are incorporated, for example, using yeast artificial 
chromosomes containing the requisite human DNA segments. An animal which provides 

1 5 all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the 
Xenomouse™ (Abgenix Inc., Freemont, CA) as disclosed in PCT publications WO 
96/33735 and WO 96/34096. This animal produces B cells which secrete fully human 

20 immunoglobulins. The antibodies can be obtained directly from the animal after 
immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, 
such as hybridomas producing monoclonal antibodies. Additionally, the genes encoding 
the immunoglobulins with human variable regions can be recovered and expressed to 

25 obtain the antibodies directly, or can be further modified to obtain analogs of antibodies 
such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 
Patent No. 5,939,598. It can be obtained by a method including deleting the J segment 

30 genes from at least one endogenous heavy chain locus in an embryonic stem cell to 
prevent rearrangement of the locus and to prevent formation of a transcript of a 
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rearranged immunoglobulin heavy chain locus, the deletion being effected by a targeting 
vector containing a gene encoding a selectable marker; and producing from the 
embryonic stem cell a transgenic mouse whose somatic and germ cells contain the gene 
encoding the selectable marker. 
5 A method for producing an antibody of interest, such as a human antibody, is 

disclosed in U.S. Patent No. 5,916,771 . It includes introducing an expression vector that 
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression .vector containing a nucleotide sequence encoding a 
light chain into another mammalian host cell, and fusing the two cells to form a hybrid 
10 cell. The hybrid cell expresses an antibody containing the heavy chain and the light 
chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody 
that binds immunospecifically to the relevant epitope with high affinity, are disclosed in 
1 5 PCT publication WO 99/53049. 

5.13.4 F ab FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. 

20 Patent No. 4,946,778). In addition, methods can be adapted for the construction of F a b 
expression libraries (see e.g., Huse, et al.. 1989 Science 246: 1275-1281) to allow rapid 
and effective identification of monoclonal F a b fragments with the desired specificity for a 
protein or derivatives, fragments, analogs or homologs thereof. Antibody fragments that 
contain the idiotypes to a protein antigen may be produced by techniques known in the 

25 art including, but not limited to: (i) an F(ab')2 fragment produced by pepsin digestion of 
an antibody molecule; (ii) an F a b fragment generated by reducing the disulfide bridges of 
an F(ab')2 fragment; (iii) an F a b fragment generated by the treatment of the antibody 
molecule with papain and a reducing agent and (iv) F v fragments. 
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5.13.5 BISPECIFIC ANTIBODIES 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
that have binding specificities for at least two different antigens. In the present case, one 
of the binding specificities is for an antigenic protein of the invention. The second 
5 binding target is any other antigen, and advantageously is a cell-surface protein or 
. receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have 
10 different specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the 
random assortment of immunoglobulin heavy and light chains, these hybridomas 
(quadromas) produce a potential mixture of ten different antibody molecules, of which 
only one has the correct bispecific structure. The purification of the correct molecule is 
usually accomplished by affinity chromatography steps. Similar procedures are disclosed 
15 in WO 93/08829, published 13 May 1993, and in Traunecker et aL, 1991 EMBOJ., 
10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody- 
antigen combining sites) can be fused to immunoglobulin constant domain sequences. 
The fusion preferably is with an immunoglobulin heavy-chain constant domain, 

20 comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the 
first heavy-chain constant region (CHI) containing the site necessary for light-chain 
binding present in at least one of the fusions. DNAs encoding the immunoglobulin 
heavy-chain fusions and, if desired, the immunoglobulin light chain, are inserted into 
separate expression vectors, and are co-transfected into a suitable host organism. For 

25 further details of generating bispecific antibodies see, for example, Suresh et aL. Methods 
in Enzvmology. 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between 
a pair of antibody molecules can be engineered to maximize the percentage of 
heterodimers which are recovered from recombinant cell culture. The preferred interface 

30 comprises at least a part of the CH3 region of an antibody constant domain. In this 

method, one or more small amino acid side chains from the interface of the first antibody 
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molecule are replaced with larger side chains (e.g. tyrosine or tryptophan). 
Compensatory "cavities" of identical or similar size to the large side chain(s) are created 
on the interface of the second antibody molecule by replacing large amino acid side 
chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
5 increasing the yield of the heterodimer over other unwanted end-products such as 
homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody 
fragments (e.g. F(ab') 2 bispecific antibodies). Techniques for generating bispecific 
antibodies from antibody fragments have been described in the literature. For example, 

10 bispecific antibodies can be prepared using chemical linkage. Brennan et al.. Science 

229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved 
to generate F(ab')2 fragments. These fragments are reduced in the presence of the dithiol 
complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular 
disulfide formation. The Fab' fragments generated are then converted to 

15 thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then 

reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is mixed with an 
equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The 
bispecific antibodies produced can be used as agents for the selective immobilization of 
enzymes. 

20 Additionally, Fab' fragments can be directly recovered from E. coli and 

chemically coupled to form bispecific antibodies. Shalaby et al.. J. Exp. Med. 
175:217-225 (1992) describe the production of a fully humanized bispecific antibody 
F(ab')2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected 
to directed chemical coupling in vitro to form the bispecific antibody. The bispecific 

25 antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and 
normal human T cells, as well as trigger the lytic activity of human cytotoxic 
lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments 
directly from recombinant cell culture have also been described. For example, bispecific 

30 antibodies have been produced using leucine zippers. Kostelny et al.. J. Immunol. 

148(5):1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins 
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were linked to the Fab' portions of two different antibodies by gene fusion. The antibody 
homodimers were reduced at the hinge region to form monomers and then re-oxidized to 
form the antibody heterodimers. This method can also be utilized for the production of 
antibody homodimers. The "diabody" technology described by Hollinger et aL Proc. 
5 Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for 
making bispecific antibody fragments. The fragments comprise a heavy-chain variable 
domain (V H ) connected to a light-chain variable domain (V L ) by a linker which is too 
short to allow pairing between the two domains on the same chain. Accordingly, the V H 
and V L domains of one fragment are forced to pair with the complementary V L and V H 

10 domains of another fragment, thereby forming two antigen-binding sites. Another 

strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) 
dimers has also been reported. See, Gruber et aL. J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, 
trispecific antibodies can be prepared. Tutt et aL. J. Immunol. 147:60 (1991). 

15 Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of 
an immunoglobulin molecule can be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or 
B7), or Fc receptors for IgG (FcyR), such as FcRn, FcyRI (CD64), FcyRII (CD32), 

20 FcyRIII (CD 16), so as to focus cellular defense mechanisms to the cell expressing the 
particular antigen. Bispecific antibodies can also be used to direct cytotoxic agents to 
cells which express a particular antigen. These antibodies possess an antigen-binding 
arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as 
EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the 

25 protein antigen described herein and further binds tissue factor (TF). 

5.13.6 HETEROCONJUGATE ANTIBODIES 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
30 antibodies have, for example, been proposed to target immune system cells to unwanted 
cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; 


107 


1, 0 O S Ml- 6. «+ 3 . O E5 E! 6 O 5 


WO 92/200373; EP 03089). It is contemplated that the antibodies can be prepared in 
vitro using known methods in synthetic protein chemistry, including those involving 
crosslinking agents. For example, immunotoxins can be constructed using a disulfide 
exchange reaction or by forming a thioether bond. Examples of suitable reagents for this 
5 purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, 
for example, in U.S. Patent No. 4,676,980. 


5.13.7 EFFECTOR FUNCTION ENGINEERING 

It can be desirable to modify the antibody of the invention with respect to effector 
10 function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have improved internalization capability and/or increased complement- . 
mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et 
15 al.. J. Exp Med., 176: 1191-1195 (1992) and Shopes, J. Immunol., 148: 2918-2922 

(1992). Homodimeric antibodies with enhanced anti-tumor activity can also be prepared 
using heterobifunctional cross-linkers as described in Wolff et. al., Cancer Research, 53: 
2560-2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions 
and can thereby have enhanced complement lysis and ADCC capabilities. See Stevenson 
20 et al.. Anti-Cancer Drug Design, 3: 219-230 (1989). 

5.13.8 IMMUNOCONJUGATES 

The invention also pertains to immunoconjugates comprising an antibody 
conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an 
25 enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments 
thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 
been described above. Enzymatically active toxins and fragments thereof that can be 
used include diphtheria A chain, nonbinding active fragments of diphtheria toxin, 
30 exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 

modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
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americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, 
crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, 
enomycin, and the tricothecenes. A variety of radionuclides are available for the 
production of radioconjugated antibodies. Examples include 212 Bi, 131 I, 125 I, 131 In, 90 Y, 
5 and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifiinctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) 
propionate (SPDP), iminothiolane (IT), bifiinctional derivatives of imidoesters (such as 
dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes 

10 (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), bis-diazonium derivatives (such as bis-(p-diazomumbenzoyl)- 
ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active 
fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared as described in Vitetta et aL. Science, 238: 1098 (1987). 

15 Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid 
(MX-DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the 
antibody. See W094/1 1026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate 

20 is administered to the patient, followed by removal of unbound conjugate from the 

circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that 
is in turn conjugated to a cytotoxic agent. 

5.14 COMPUTER READABLE SEQUENCES 

25 In one application of this embodiment, a nucleotide sequence of the present 

invention can be recorded on computer readable media. As used herein, "computer 
readable media" refers to any medium which can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, such as 
floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as 

30 CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these 
categories such as magnetic/optical storage media. A skilled artisan can readily 
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appreciate how any of the presently known computer readable mediums can be used to 
create a manufacture comprising computer readable medium having recorded thereon a 
nucleotide sequence of the present invention. As used herein, "recorded" refers to a 
process for storing information on computer readable medium. A skilled artisan can 
5 readily adopt any of the presently known methods for recording information on computer 
readable medium to generate manufactures comprising the nucleotide sequence 
information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 

10 invention. The choice of the data storage structure will generally be based on the means 
chosen to access the stored information. In addition, a variety of data processor programs 
and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable medium. The sequence information can be represented 
in a word processing text file, formatted in commercially-available software such as 

1 5 WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a 
database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can 
readily adapt any number of data processor structuring formats (e.g. text file or database) 
in order to obtain computer readable medium having recorded thereon the nucleotide 
sequence information of the present invention. 

20 By providing any of the nucleotide sequences 748 SEQ ID NO: 1-45,196, 752 

SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 
1-10,451, 784 SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1- 
3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 
791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 1-8502 or a representative fragment 

25 thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 

sequences of 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1- 
94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 
SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID 
NO: 1-6391,790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 

30 1-8502 in computer readable form, a skilled artisan can routinely access the sequence 
information for a variety of purposes. Computer software is publicly available which 
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allows a skilled artisan to access sequence information provided in a computer readable 
medium. The examples which follow demonstrate how software which implements the 
BLAST (Altschul et al.. J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al.. 
Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to 
5 identify open reading frames (ORFs) within a nucleic acid sequence. Such ORFs may be 
protein encoding fragments and may be useful in producing commercially important 
proteins such as enzymes used in fermentation reactions and in the production of 
commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, 

10 software means, and data storage means used to analyze the nucleotide sequence 
information of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing unit 
(CPU), input means, output means, and data storage means. A skilled artisan can readily 
appreciate that any one of the currently available computer-based systems are suitable for 

.15 use in the present invention. As stated above, the computer-based systems of the present 
invention comprise a data storage means having stored therein a nucleotide sequence of 
the present invention and the necessary hardware means and software means for 
supporting and implementing a search means. As used herein, "data storage means" 
refers to memory which can store nucleotide sequence information of the present 

20 invention, or a memory access means which can access manufactures having recorded 
thereon the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target 
structural motif with the sequence information stored within the data storage means. 

25 Search means are used to identify fragments or regions of a known sequence which 
match a particular target sequence or target motif. A variety of known algorithms are 
disclosed publicly and a variety of commercially available software for conducting search 
means are and can be used in the computer-based systems of the present invention. 
Examples of such software includes, but is not limited to, Smith- Waterman, MacPattern 

30 (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A skilled artisan can readily 
recognize that any one of the available algorithms or implementing software packages for 


111 


JL O CI B H- B M- a; m O E! S ft U & 


conducting homology searches can be adapted for use in the present computer-based 
systems. As used herein, a "target sequence" can be any nucleic acid or amino acid 
sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will 
5 be present as a random occurrence in the database. The most preferred sequence length 
of a target sequence is from about 10 to 300 amino acids, more preferably from about 30 
to 100 nucleotide residues. However, it is well recognized that searches for 
commercially important fragments, such as sequence fragments involved in gene 
expression and protein processing, may be of shorter length. 

10 As used herein, "a target structural motif," or "target motif," refers to any 

rationally selected sequence or combination of sequences in which the sequence(s) are 
chosen based on a three-dimensional configuration which is formed upon the folding of 
the target motif. There are a variety of target motifs known in the art. Protein target 
motifs include, but are not limited to, enzyme active sites and signal sequences. Nucleic 

1 5 acid target motifs include, but are not limited to, promoter sequences, hairpin structures 
and inducible expression elements (protein binding sequences). 

5.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be 
20 used to control gene expression through triple helix formation or antisense DNA or RNA, 
both of which methods are based on the binding of a polynucleotide sequence to DNA or 
RNA. Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in 
length and are designed to be complementary to a region of the gene involved in 
transcription (triple helix - see Lee et al.. Nucl. Acids Res. 6:3073 (1979); Cooney et al.. 
25 Science 15241:456 (1988); and Dervan et al.. Science 251:1360 (1991)) or to the mRNA 
itself (antisense - Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as 
Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple 
helix-formation optimally results in a shut-off of RNA transcription from DNA, while 
antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. 
30 Both techniques have been demonstrated to be effective in model systems. Information 
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contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide. 

5.16 DIAGNOSTIC ASSAYS AND KITS 
5 The present invention further provides methods to identify the presence or 

expression of one of the ORFs of the present invention, or homolog thereof, in a test 
sample, using a nucleic acid probe or antibodies of the present invention, optionally 
conjugated or otherwise associated with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
10 contacting a sample with a compound that binds to and forms a complex with the 

polynucleotide for a period sufficient to form the complex, and detecting the complex, so 
that if a complex is detected, a polynucleotide of the invention is detected in the sample. 
Such methods can also comprise contacting a sample under stringent hybridization 
conditions with nucleic acid primers that anneal to a polynucleotide of the invention 
15 under such conditions, and amplifying annealed polynucleotides, so that if a 

polynucleotide is amplified, a polynucleotide of the invention is detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polypeptide for a period sufficient to form the complex, and detecting the complex, so 
20 that if a complex is detected, a polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying 
for binding of the nucleic acid probes or antibodies to components within the test sample. 
Conditions for incubating a nucleic acid probe or antibody with a test sample 
25 vary. Incubation conditions depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the nucleic acid probe or antibody used in 
the assay. One skilled in the art will recognize that any one of the commonly available 
hybridization, amplification or immunological assay formats can readily be adapted to 
employ the nucleic acid probes or antibodies of the present invention. Examples of such 
30 assays can be found in Chard, T., An Introduction to Radioimmunoassay and Related 

Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, 
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G.R. et al.. Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 
(1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of immunoassays: 
Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1985). The test samples of the present 
5 invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described 
method will vary based on the assay format, nature of the detection method and the 
tissues, cells or extracts used as the sample to be assayed. Methods for preparing protein 
extracts or membrane extracts of cells are well known in the art and can be readily be 

10 adapted in order to obtain a sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain 
the necessary reagents to carry out the assays of the present invention. Specifically, the 
invention provides a compartment kit to receive, in close confinement, one or more 
containers which comprises: (a) a first container comprising one of the probes or 

1 5 antibodies of the present invention; and (b) one or more other containers comprising one 
or more of the following: wash reagents, reagents capable of detecting presence of a 
bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers or 

20 strips of plastic or paper. Such containers allows one to efficiently transfer reagents from 
one compartment to another compartment such that the samples and reagents are not 
cross-contaminated, and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the antibodies 

25 used in the assay, containers which contain wash reagents (such as phosphate buffered 
saline, Tris-buffers, etc.), and containers which contain the reagents used to detect the 
bound antibody or probe. Types of detection reagents include labeled nucleic acid probes, 
labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the 
enzymatic, or antibody binding reagents which are capable of reacting with the labeled 

30 antibody. One skilled in the art will readily recognize that the disclosed probes and 
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antibodies of the present invention can be readily incorporated into one of the established 
kit formats which are well known in the art. 

5.17 MEDICAL IMAGING 

5 The novel polypeptides and binding partners of the invention are useful in 

medical imaging of sites expressing the molecules of the invention (e.g., where the 
polypeptide of the invention is involved in the immune response, for imaging sites of 
inflammation or infection). See, e.g., Kunkel et al.. U.S. Pat. NO. 5,413,778. Such 
methods involve chemical attachment of a labeling or imaging agent, administration of 
10 the labeled polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging 
the labeled polypeptide in vivo at the target site. 

5.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides disclosed herein, the present 

1 5 invention further provides methods of obtaining and identifying agents which bind to a 
polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set 
forth in 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ ID NO: 1-94, 
779 SEQIDNO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1-10,289, 785 
SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 789 SEQ ID 

20 NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 SEQ ID NO: 
1-8502, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In 
detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the 
present invention, or nucleic acid of the invention; and 

25 (b) determining whether the agent binds to said protein or said nucleic acid. 

In general, therefore, such methods for identifying compounds that bind to a 
polynucleotide of the invention can comprise contacting a compound with a 
polynucleotide of the invention for a time sufficient to form a polynucleotide/compound 
complex, and detecting the complex, so that if a polynucleotide/compound complex is 

30 detected, a compound that binds to a polynucleotide of the invention is identified. 
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Likewise, in general, therefore, such methods for identifying compounds that bind 
to a polypeptide of the invention can comprise contacting a compound with a polypeptide 
of the invention for a time sufficient to form a polypeptide/compound complex, and 
detecting the complex, so that if a polypeptide/compound complex is detected, a 
5 compound that binds to a polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention 
can also comprise contacting a compound with a polypeptide of the invention in a cell for 
a time sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a receptor gene sequence in the cell, and detecting the complex by 
10 detecting reporter gene sequence expression, so that if a polypeptide/compound complex 
is detected, a compound that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate 
the activity of a polypeptide of the invention (that is, increase or decrease its activity, 
relative to activity observed in the absence of the compound). Alternatively, compounds 
1 5 identified via such methods can include compounds which modulate the expression of a 
polynucleotide of the invention (that is, increase or decrease expression relative to 
expression levels observed in the absence of the compound). Compounds, such as 
compounds identified via the methods of the invention, can be tested using standard 
assays well known to those of skill in the art for their ability to modulate 
20 activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
selected and screened at random or rationally selected or designed using protein modeling 
techniques. 

25 For random screening, agents such as peptides, carbohydrates, pharmaceutical 

agents and the like are selected at random and are assayed for their ability to bind to the 
protein encoded by the ORF of the present invention. Alternatively, agents may be 
rationally selected or designed. As used herein, an agent is said to be "rationally selected 
or designed" when the agent is chosen based on the configuration of the particular 

30 protein. For example, one skilled in the art can readily adapt currently available 

procedures to generate peptides, pharmaceutical agents and the like, capable of binding to 
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a specific peptide sequence, in order to generate rationally designed antipeptide peptides, 
for example see Hurby et al.. Application of Synthetic Peptides: Antisense Peptides," In 
Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al.. Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 
5 In addition to the foregoing, one class of agents of the present invention, as 

broadly described, can be used to control gene expression through binding to one of the 
ORFs or EMFs of the present invention. As described above, such agents can be 
randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a 
skilled artisan to design sequence specific or element specific agents, modulating the 

10 expression of either a single ORF or multiple ORFs which rely on the same EMF for 
expression control. One class of DNA binding agents are agents which contain base 
residues which hybridize or form a triple helix formation by binding to DNA or RNA. 
Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or 
can be a variety of sulfhydryl or polymeric derivatives which have base attachment 

1 5 capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple 
helix - see Lee et al.. Nucl. Acids Res. 6:3073 (1979); Cooney et al.. Science 241 :456 
(1988); and Dervan et al.. Science 251:1360 (1991)) or to the mRNA itself (antisense - 

20 Okano, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 
blocks translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences 

25 of the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present 
invention can be used as a diagnostic agent. Agents which bind to a protein encoded by 
one of the ORFs of the present invention can be formulated using known techniques to 

30 generate a pharmaceutical composition. 
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5.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the present invention is to provide for polypeptide-specific 
nucleic acid hybridization probes capable of hybridizing with naturally occurring 
nucleotide sequences. The hybridization probes of the subject invention may be derived 
5 from any of the nucleotide sequences 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1- 
13,192, 778 SEQ ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 
SEQ ID NO: 1-10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID 
NO: 1-14,074, 789 SEQ ID NO: 1-6391,790 SEQ ID NO: 1-30,533,791 SEQ ID NO: 1- 
5822, and 792 SEQ ID NO: 1-8502. Because the corresponding gene is only expressed 

10 in a limited number of tissues, a hybridization probe derived from of any of the 

nucleotide sequences 748 SEQ ID NO: 1-45,196, 752 SEQ ID NO: 1-13,192, 778 SEQ 
ID NO: 1-94, 779 SEQ ID NO: 1-128, 782 SEQ ID NO: 1-10,451, 784 SEQ ID NO: 1- 
10,289, 785 SEQ ID NO: 1-3796, 787 SEQ ID NO: 1-3960, 788 SEQ ID NO: 1-14,074, 
789 SEQ ID NO: 1-6391, 790 SEQ ID NO: 1-30,533, 791 SEQ ID NO: 1-5822, and 792 

15 SEQ ID NO: 1-8502 can be used as an indicator of the presence of RNA of cell type of 
such a tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in 
situ hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 
provides additional uses for oligonucleotides based upon the nucleotide sequences. Such 

20 probes used in PCR may be of recombinant origin, may be chemically synthesized, or a 
mixture of both. The probe will comprise a discrete nucleotide sequence for the detection 
of identical sequences or a degenerate pool of possible sequences for identification of 
closely related genomic sequences. 

Other means for producing specific hybridization probes for nucleic acids include 

25 the cloning of nucleic acid sequences into vectors for the production of mRNA probes. 
Such vectors are known in the art and are commercially available and may be used to 
synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled 
nucleotides. The nucleotide sequences may be used to construct hybridization probes for 

30 mapping their respective genomic sequences. The nucleotide sequence provided herein 
may be mapped to a chromosome or specific regions of a chromosome using well known 
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genetic and/or chromosomal mapping techniques. These techniques include in situ 
hybridization, linkage analysis against known chromosomal markers, hybridization 
screening with libraries or flow-sorted chromosomal preparations specific to known 
chromosomes, and the like. The technique of fluorescent in situ hybridization of 
5 chromosome spreads has been described, among other places, in Verma et al (1988) 

Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 
chromosome mapping techniques may be correlated with additional genetic map data. 
Examples of genetic map data can be found in the 1994 Genome Issue of Science 
1 0 (265 : 1 98 1 f). Correlation between the location of a nucleic acid on a physical 

chromosomal map and a specific disease (or predisposition to a specific disease) may 
help delimit the region of DNA associated with that genetic disease. The nucleotide 
sequences of the subject invention may be used to detect differences in gene sequences 
between normal, carrier or affected individuals. 

i 5 5.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared from 
the sequences disclosed herein by, for example, directly synthesizing the oligonucleotide by 
chemical means, as is commonly practiced using an automated oligonucleotide synthesizer. 
Support bound oligonucleotides may be prepared by any of the methods known to 

20 those of skill in the art using any suitable support such as glass, polystyrene or Teflon® 
(DuPont). One strategy is to precisely spot oligonucleotides synthesized by standard 
synthesizers. Immobilization can be achieved using passive adsorption (Inouye & Hondo, 
(1990) J. Clin. Microbiol. 28(6) 1469-72); using UV light (Nagata et al. 1985; Dahlen et al. 
1987; Morrissey & Collins, (1 989) Mol. Cell Probes 3(2) 189-207) or by covalent binding 

25 of base modified DNA (Keller et al. 1988; 1989); all references being specifically 
incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad. Sci. USA 91(8) 
3072-6, describe the use of biotinylated probes, although these are duplex probes, that are 
30 immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be 

purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating 
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any surface with streptavidin. Biotinylated probes may be purchased from various sources, 
such as, e.g., Operon Technologies (Alameda, CA). 

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be 
used. Nunc Laboratories have developed a method by which DNA can be covalently bound 
5 to the microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface 

grafted with secondary amino groups (>NH) that serve as bridge-heads for further covalent 
coupling. CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules 
may be bound to CovaLink exclusively at the 5'-end by a phosphoramidate bond, allowing 
immobilization of more than 1 pmol of DNA (Rasmussen et al. (1991) Anal. Biochem. 

10 198(1) 138-42). 

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5-end 
has been described (Rasmussen et al.. (1991). In this technology, a phosphoramidate bond 
is employed (Chu et al.. (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as 
immobilization using only a single covalent bond is preferred. The phosphoramidate bond 

1 5 joins the DNA to the CovaLink NH secondary amino groups that are positioned at the end 
of spacer arms covalently grafted onto the polystyrene surface through a 2 nm long spacer 
arm. To link an oligonucleotide to CovaLink NH via an phosphoramidate bond, the 
oligonucleotide terminus must have a 5-end phosphate group. It is, perhaps, even possible 
for biotin to be covalently bound to CovaLink and then streptavidin used to bind the probes. 

20 More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) 

and denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 
1-methylimidazole, pH 7.0 (1-Melni7), is then added to a final concentration of 10 mM 
1-Melm7. A ss DNA solution is then dispensed into CovaLink NH strips (75 ul/well) 
standing on ice. 

25 Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), 

dissolved in 10 mM 1-Melm 7? is made fresh and 25 ul added per well. The strips are 
incubated for 5 hours at 50°C. After incubation the strips are washed using, e.g., 
Nunc-Immuno Wash; first the wells are washed 3 times, then they are soaked with washing 
solution for 5 min., and finally they are washed 3 times (where in the washing solution is 0.4 

30 N NaOH, 0.25% SDS heated to 50°C). 
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It is contemplated that a further suitable method for use with the present invention is 
that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated 
herein by reference. This method of preparing an oligonucleotide bound to a support 
involves attaching a nucleoside 3 -reagent through the phosphate group by a covalent 
5 phosphodiester link to aliphatic hydroxyl groups carried by the support. The 

oligonucleotide is then synthesized on the supported nucleoside and protecting groups 
removed from the synthetic oligonucleotide chain under standard conditions that do not 
cleave the oligonucleotide from the support. Suitable reagents include nucleoside 
phosphoramidite and nucleoside hydrogen phosphorate. 

10 An on-chip strategy for the preparation of DN A probe for the preparation of DN A 

probe arrays may be employed. For example, addressable laser-activated photodeprotection 
may be employed in the chemical synthesis of oligonucleotides directly on a glass surface, 
as described by Fodor et al. (1991) Science 251(4995) 767-73, incorporated herein by 
reference. Probes may also be immobilized on nylon supports as described by Van Ness et 

15 al. (1991) Nucleic Acids Res. 19(12) 3345-50; or linked to Teflon using the method of 
Duncan & Cavalier (1988) Anal. Biochem. 169(1) 104-8; all references being specifically 
incorporated herein. 

To link an oligonucleotide to a nylon support, as described by Van Ness et al. 
(1991), requires activation of the nylon surface via alkylation and selective activation of the 

20 5-amine of oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et aL (1994) PNAS USA 91(11) 5022-6, 
incorporated herein by reference). These authors used current photolithographic techniques 
to generate arrays of immobilized oligonucleotide probes (DNA chips). These methods, in 

25 which light is used to direct the synthesis of oligonucleotide probes in high-density, 
miniaturized arrays, utilize photolabile 5-protected Af-acyl-deoxynucleoside 
phosphoramidites, surface linker chemistry and versatile combinatorial synthesis strategies. 
A matrix of 256 spatially defined oligonucleotide probes may be generated in this manner. 

5.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

30 The nucleic acids may be obtained from any appropriate source, such as cDNAs, 

genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC 
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inserts, and RNA, including mRNA without any amplification steps. For example, 
Sambrook et al. (1 989) describes three protocols for the isolation of high molecular weight 
DNA from mammalian cells (p. 9.14-9.23). 

DNA fragments may be prepared as clones in Ml 3, plasmid or lambda vectors 
5 and/or prepared directly from genomic DNA or cDNA by PCR or other amplification 

methods. Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng of 
DNA samples may be prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those 
of skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 

10 of Sambrook et al. ( 1 989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al. (1 990) 
.Nucleic Acids Res. 18(24) 7455-6, incorporated herein by reference). In this method, DNA 
samples are passed through a small French pressure cell at a variety of low to intermediate 
pressures. A lever device allows controlled application of low to intermediate pressures to 

15 the cell. The results of these studies indicate that low-pressure shearing is a useful 
alternative to sonic and enzymatic DNA fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using 
the two base recognition endonuclease, Cv/JI, described by Fitzgerald et al. (1992) Nucleic 
Acids Res. 20(14) 3753-62. These authors described an approach for the rapid 

20 fragmentation and fractionation of DNA into particular sizes that they contemplated to be 
suitable for shotgun cloning and sequencing. 

The restriction endonuclease Cv/JI normally cleaves the recognition sequence 
PuGCPy between the G and C to leave blunt ends. Atypical reaction conditions, which alter 
the specificity of this enzyme (Cv/JI**), yield a quasi-random distribution of DNA 

25 fragments form the small molecule pUC19 (2688 base pairs). Fitzgerald et al. (1992) 
quantitatively evaluated the randomness of this fragmentation strategy, using a Cv/JI** 
digest of pUC19 that was size fractionated by a rapid gel filtration method and directly 
ligated, without end repair, to a lac Z minus Ml 3 cloning vector. Sequence analysis of 76 
clones showed that Cv/JI** restricts pyGCPy and PuGCPu, in addition to PuGCPy sites, and 

30 that new sequence data is accumulated at a rate consistent with random fragmentation. 
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As reported in the literature, advantages of this approach compared to sonication and 
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 jug instead 
of 2-5 \ig); and fewer steps are involved (no preligation, end repair, chemical extraction, or 
agarose gel electrophoresis and elution are needed 
5 Irrespective of the manner in which the nucleic acid fragments are obtained or 

prepared, it is important to denature the DNA to give single stranded pieces available for 
hybridization. This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. 
The solution is then cooled quickly to 2°C to prevent renaturation of the DNA fragments 
before they are contacted with the chip. Phosphate groups must also be removed from 
1 0 genomic DNA by methods known in the art. 

5.22 PREPARATION OF DNA ARRAYS 

The nucleic acid sequences disclose herein can be used to create arrays. Arrays may 
be prepared by spotting DNA samples on a support such as a nylon membrane. Spotting 
may be performed by using arrays of metal pins (the positions of which correspond to an 

1 5 array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution 
to a nylon membrane. By offset printing, a density of dots higher than the density of the 
wells is achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type 
of label used. By avoiding spotting in some preselected number of rows and columns, 
separate subsets (subarrays) may be formed. Samples in one subarray may be the same 

20 genomic segment of DNA (or the same gene) from different individuals, or may be 

different, overlapped genomic clones. Each of the subarrays may represent replica spotting 
of the same samples. In one example, a selected gene segment may be amplified from 64 
patients. For each patient, the amplified gene segment may be in one 96-well plate (all 96 
wells containing the same sample). A plate for each of the 64 patients is prepared. By using 

25 a 96-pin device, all samples may be spotted on one 8x12 cm membrane. Subarrays may 
contain 64 samples, one from each patient. Where the 96 subarrays are identical, the dot 
span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, 
Illinois) which may be partitioned by physical spacers e.g. a plastic grid molded over the 

30 membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell 
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plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by 
exposure to flat phosphor-storage screens or x-ray films. 

5.23 PREPARATION OF A UNIVERSAL SET OF PROBES 

5 The nucleic acid sequences disclosed herein may be used to design universal 

hybidizaiton probes for detecting genetic markers. Two types of universal sets of probes 
may be prepared. The first is a complete set (or at least a noncomplementary subset) of 
relatively short probes, for example all 4096 (or about 2000 non-complementary) 6-mers, 
or all 16,384 (or about 8,000 non-complementary) 7-mers. Full noncomplementary 
10 subsets of 8-mers and longer probes are less convenient inasmuch as they include 32,000 
or more probes. 

A second type of probe set is selected as a small subset of probes still sufficient 
for reading every bp in any sequence with at least with one probe. For example, 12 of 16 
dimers are sufficient. A small subset for 7-mers, 8-mer and 9-mers for sequencing 

15 double stranded DNA may be about 3000, 10,000 and 30,000 probes, respectively. 

Probes may be prepared using standard chemistry with one to three non-specified 
(mixed A,T,C and El) or universal (e.g. M base or inosine) bases at the ends. If 
radiolabelling is used, probes may have an OH group at the 5' end for kinasing by 
radiolabeled phosphorous groups. Alternatively, probes labelled with any compatible 

20 system, such as fluorescent dyes, may be employed. Other types of probes, such as PNA 
(Protein Nucleic Acids) or probes containing modified bases which change duplex 
stability also may be used. 

Probes may be stored in bar-coded multiwell plates. For small numbers of 
probes, 96-well plates may be used; for 10,000 or more probes, storage in 384- or 864- 

25 well plates is preferred. Stacks of 5 to 50 plates are enough to store all probes. 

Approximately 5 pg of a probe may be sufficient for hybridization with one DNA 
sample. Thus, from a small synthesis of about 50 mg per probe, ten million samples may 
be analyzed. If each probe is used for every third sample, and if each sample is 1000 bp 
in length, then over 30 billion bases (10 human genomes) may be sequenced by a set of 

30 5,000 probes. 
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5.24 PROBES HAVING MODIFIED OLIGONUCLEOTIDES 

Modified oligonucleotides may be introduced into hybridization probes described 
in section 5.23 and used under appropriate conditions therefor. For example, pyrimidines 
with a halogen at the C5-position may be used to improve duplex stability by influencing 
5 base stacking. 

2,6-diaminopurine may be used to provide a third hydrogen bond in base pairing with 
thymine, thereby thermally stabilizing DNA-duplexes. Using 2,5-diaminopurine may 
increase duplex stability to allow more stringent conditions for annealing, thereby 
improving the specificity of duplex formation, suppressing background problems and 

10 permitting the use of shorter oligomers. The synthesis of the triphosphate versions of 
these modified nucleotides is disclosed by Hoheisel & Lehrach(1990). 

One may also use the non-discriminatory base analogue, or universal base, 
designed by Nichols et. al, (1994). This new analogue, l-(2 -deoxy-10 -D- 
ribftzranosyl)-3-nitropyrrole ignated M), was generated for use in oligonucleotide probes 

15 and primers for solving the design problems that arise as a result of the degeneracy of the 
genetic code, or when only fragmentary peptide sequence data are available. This 
analogue maximizes stacking while minimizing hydrogen-bonding interactions without 
sterically disrupting a DNA duplex. 

The M nucleoside analogue was designed to maximize stacking 

20 interactions using aprotic polar substituents linked to heteroaromatic rings, enhancing 

intra- and inter-stand stacking interactions to lessen the role of hydrogen bonding in base- 
pairing specificity. Nichols et al. (1994) favored 3-nitropyrrole2 -deoxyribonucleoside 
because of its structural and electronic resemblance to p-nitroaniline, whose derivatives 
are among the smallest known intercaltors of double-stranded DNA. 

25 The dimethoxytrityl-protectedphosphoramidite of nucleoside M is also available 

for incorporation into nucleotides used as primers for sequencing and polymerase chain 
reaction (PCR). Nichols et. al, (1994) showed that a substantial number of nucleotides 
can be replaced by M without loss of primer specificity. 

A unique property of M is its ability to replace long strings of contiguous 

30 nucleosides and still yield functional sequencing primers. Sequences with three, six and 
nine M substitutions have all been reported to give readable sequencing ladders, and PCR 
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with three different M-containing primers all resulted in amplification of the correct 
product (Nichols et ah, 1994). 

Probes may be retrieved one by one and added to subarrays covered by 
hybridization buffer. It is preferred that retrieved probes be placed in a new plate and 
5 labelled or mixed with hybridization buffer. The preferred method of retrieval is by 
accessing stored plates one by one and pipetting (or transferring by metal pins) a 
sufficient amount of each selected probe from each plate to specific wells in an 
intermediary plate. An array of individually addressable pipettes or pins may be used to 
speed up the retrieval process. 

10 

5.25 PREPARATION OF LABELED PROBES 

The oligonucleotide probes may be prepared by automated synthesis, which is 
routine to those of skill in the art, for example, using and Applied Biosystems system. 
Alternatively, probes may be prepared using Genosys Biotechnologies Inc. Methods 

15 using stacks of porous Teflon wafers. 

Oligonucleotide probes may be labeled with, for example, radioactive labels ( 35 S, 
32 P, and preferably, 33 P) for arrays with 100-200 |xm or 100-400 |jm spots; non- 
radioactive isotopes (Jacobsen et ah. 1990); or fluorophores (Brumbaugh et ah. 1988). 
All such labeling methods are routine in the art, as exemplified by the relevant sections in 

20 Sambrook et ah. (1989) and by further references such as Schubert et ah. (1990), 
Murakami et ah. (1991) and Cate et ah. (1991), all articles being specifically 
incorporated herein by reference. 

In regard to radiolabelling, the common methods are end-labeling using T4 
polynucleotide kinase or high specific activity labeling using Klenow or even T7 

25 polymerase. These are described as follows. 

Synthetic oligonucleotides are synthesized without a phosphate group at their 5' 
termini and are therefore easily labeled by transfer of the 32 P or 33 P from [ 32 P] ATP or 
[ P] ATP using the enzyme bacteriophage T4 polynucleotide kinase. If the reaction is 
carried out efficiently, the specificity activity of such probes can be as high as the specific 

30 activity of the [ 32 P]ATP or [ 33 P]ATP itself. The reaction described below is designed to 
label 10 pmoles of an oligonucleotide to high specific activity. Labeling of different 
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amounts of oligonucleotide can easily be achieved by increasing or decreasing the size of 
the reaction, keeping the concentrations of all components constant. 

A reaction mixture would be created using 1 .0 pi of oligonucleotide (10 
pmoles/|il); 2.0 pi of 10 x bacteriophage T4 polynucleotide kinase buffer; 5.0 pi of [ - 
5- 32 P]ATP or [ 33 P]ATP (sp. act. 5000 Ci/mmole; 10 mCi/ml in aqueous solution) (10 

pmoles); and 1 1.4 pi of water. Eight (8) units (lOu/pll) ofbacteriophagT4 polynucleotide 
kinase is added to the reaction mixture, mixed well, and incubated for 45 minutes at 
37°C. The reaction is heated for 10 minutes at 68°C to inactivate the bacteriophage T4 
polynucleotide kinase. * 

10 The efficiency of transfer of 32 P or 33 P to the oligonucleotide and its specific 

activity is then determined. If the specific activity of the probe is acceptable, it is 
purified. If the specific activity is too low, an additional 8 units of enzyme is added and - 
incubated for a further 30 minutes at 37°C before heating the reaction for 10 minutes at 
68°C to inactivate the enzyme. 

1 5 Purification of radiolabeled oligonucleotides can be achieved by precipitation 

with thanol; precipitation with cetylpyridinium bromide; by chromatography through 
Bio-gel P-60; or by chromatography on a Sep-Pak C\% column (Pharmacia). 
: Probes of higher specific activities can be obtained using the Klenow fragment of 

E. coli DNA polymerase I to synthesize a strand of DNA complementary to the synthetic 

20 oligonucleotide. A short primer is hybridized to an oligonucleotide template whose 
sequence is the complement of the desired radiolabeled probe. The primer is then 
extended using the Klenow fragment of E. coli DNA polymerase I to incorporate [- 32 P] 
dNTPs or [- 33 P] dNTPs in a template-directed manner. After the reaction, the template 
and product are separated by denaturation followed by electrophoresis through a 

25 polyacrylamide gel under denaturing conditions. With this method, it is possible to 

generate oligonucleotide probes that contain several radioactive atoms per molecule of 
oligonucleotide if desired. 

To use this method, one would mix in a microfuge tube the calculated amounts of 
[oc- 32 P]dNTP's or [a- 33 P]dNTP's necessary to achieve the desired specific activity and 

30 sufficient to allow complete synthesis of all template strands. The concentration of 

dNTPs should not be less than IpM at any stage during the reaction. Then add to the tube 
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the appropriate amounts of primer and template DNAs, with the primer being in three- to 
tenfold molar excess over the template. 

Klenow buffer is then added and mixed well. 2-4 units of the Klenow fragment of 
E. coli DNA polymerase I would then be added per 5 [d of reaction volume, mixed and 
5 incubated for 2-3 hours at 4°C. If desired, the process of the reaction may be monitored 
by removing small (0. 1 aliquots and measuring the proportion of radioactivity that has 
become precipitable with 10% trichloroacetic acid (TCA). 

The reaction would be diluted with an equal volume of gel-loading buffer, heated 
to 80°C for 3 minutes, and then the entire sample loaded on a denaturing polyacrylamide 

10 gel. Following electrophoresis, the gel is autoradiographed, allowing the probe to be 
localized and removed from the gel. Various methods for fluorphoric labeling are also 
available, as follows. Brumbaugh et ah. (1988) describe the synthesis of fluorescently 
labeled primers. A deoxyuridine analog with a primary amine "linker arm" of 12 atoms 
attached at C-S is synthesized. Synthesis of the analog consists of derivatizing 2 - 

1 5 deoxyuridine through organometallic intermediates to give 5 (methyl propenoyl)-2 - 
deoxyuridine. Reaction dimethoxytrityl-chloride produces the corresponding 5 - 
dimethoxytrityl adduct. The methyl ester is hydrolyzed, activated, and reacted with an 
appropriately monoacylated ldiamine. After purification, the resultant linker arm 
nucleosides are converted to nucleoside analogs suitable for chemical oligonucleotide 

20 synthesis. 

Oligonucleotides would then be made that include one or two linker arm bases by 
using modified phosphoridite chemistry. To a solution of 50 nmol of the linker arm 
oligonucleotide in 25 yX of 500 mM sodium biocarbonate (pH 9.4) is added 20 ul of 300 
mM FITC in dimethyl sulfoxide. The mixture is agitated at room temperature for 6 hrs. 

25 The oligonucleotide is separated from free FITC by elution form a 1 x 30 cm Sephadex 
G-25 column with 20 mM ammonium acetate (pH 6), combining fractions in the first 
UV-absorbing peak. 

In general, fluorescent labeling of an oligonucleotide at it s 5 '-end initially 
involved two steps. First, a N-protected aminoalkyl phosphoramidite derivative is added 

30 to the 5 5 -end of an oligonucleotide during automated DNA synthesis. After removal of 
all protecting groups, the NHS ester of an appropriate fluorescent dye is coupled to the 5 
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-amino group overnight followed by purification of the labeled oligonucleotide from the 
excess of dye using reverse phase HPLC or PAGE. 

Schubert et al. (1990) describes the synthesis of a phosphoramidite that enables 
olignucleotides labeled with fluorescent tags to be produced during automated DNA 
5 synthesis. 

Murakami et al. also described the preparation of fluorescein-labeled 
oligonucleotides. 

Cate et al, (1991) describes the use of oligonucleotide probes directly conjugated 
to alkaline phosphatase in combination with a direct chemiluminescent substrate 
1 0 (AMPPD) to allow probe detection. 

Labeled probes could readily be purchased form a variety of commercial sources, 
including GENSET, rather then synthesized. 

Other labels include ligands which can serve as specific binding members to a 
labeled antibody, chemiluminescers, enzymes, antibodies which can serve as a specific 
1 5 binding pair member for a labeled ligand, and the like. A wide variety of labels have 
been employed in immunoassays which can readily be employed. Still other labels 
include antigens, groups with specific reactivity, and electrochemically detectable 
moeities. 

In general, labeling of nucleic acids with electrophore mass labels ("EML") is 
20 described, for example, in Xu et al., J. Chromatography 764:95-102(1997). 

Electrophores are compounds that can be detected with high sensitivity by electron 
capture mass spectrometry (EC-MS). EMLs can be attached to a probe using chemistry 
that is well known in the art for reversibly modifying a nucleotide (e.g., well known 
nucleotide synthesis chemistry teaches a variety of methods for attaching molecules to 
25 nucleotides as protecting groups). EMLs are detected using a variety of well known 

electron capture mass spectrometry devices (e.g., devices sold by Finnigan Corporation). 
Further, techniques that may be used in the detection of EMLs include, for example, fast 
atomic bombardment mass spectrometry (see, e.g., Koster et aL 9 Biomedical Environ. 
Mass Spec. 14:1 1 1-1 16(1987)); plasma desorption mass spectrometry; 
30 electrospray/ionspray (see, e.g., Fenn et aL 9 J. Phys. Chem. 88:4451-59(1984), PCI 
Appln. No. WO 90/14148, Smith cial., Anal. Chem. 62:882-89(1990)); and matrix- 
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assisted laser desorption/ionization (Hillenkamp, a/.., "Matrix Assisted UV-Laser 
Desorption/Ionization: A New Approach to Mass Spectrometry of Large Biomolecules," 
Biological Mass Spcciromctry (Burlingame and McCloskey, eds.), Elsevier Science 
Publishers, Amsterdam, pp. 49-60, 1990); Huth-Fehre et al.. 9 "Matrix Assisted Laser 
5 Desorption Mass Spectrometry of Oligodeoxythymidylic Acids," Rapid Communications 
in Mass Spectrometry, 6:209-13 (1992)). 

In preferred embodmients, the EMLs are attached to a probe by a covalent bond 
that is light sensitive. The EML is released from the probe after hybridization with a 
target nucleic acid by a laser or other light source emitting the desired wavelength of 
10 light. The EML is then fed into a GC-MS (gas chromatogrph-mass spectrometer) or 
other appropriate device, and identified by its mass. 

5-26 PREPARATION OF SEQUENCING CHIPS AND ARRAYS 

The nucleic acids of he present invention can be affixed to chips to create arrays. 

15 A basic example is using 6-mers attached to 50 micron surfaces to give a chip with 
dimensions of 3 x 3 mm which can be combined to give an array of 20 x 2020cm. 
Another example is using 9-mer oligonucleotides attached to 10 x 10 micron surface to 
create a 9-mer chip, with dimensions of 5 x 5 mm. 4000 units of such chips may be used 
to create a 30 x 30 cm array. In one embodiment the nucleic acids comprise an oligochip 

20 array in which 4,000 to 1 6,000 oligochips are arranged into a square array. A plate, or 
collection of tubes, as also depicted, may be packaged with the array as part of the 
sequencing kit. 

The arrays may be separated physically from each other or by hydrophobic 
surfaces. One possible way to utilize the hydrophobic strip separation is to use 
25 technology such as the Iso-Grid Microbiology System produced by QA Laboratories, 
Toronto, Canada. 

Hydrophobic grid membrane filters (HGMF) have been in use in analytical food 
microbiology for about a decade where they exhibit unique attractions of extended 
numerical range and automated counting of colonies. One commercially-available grid is 
30 Iso-GRID™ from QA Laboratories Ltd. (Toronto, Canada) which consists of a square 
(60 x 60 cm) of polysulfone polymer (Gelman Tuffryn HT-450, 0.4 micron pore size) on 
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which is printed a black hydrophobic ink grid consisting of 1600 (40 x 40) square cells. 
HCMF have previously been inoculated with bacterial suspensions by vacuum filtration 
and incubated on the differential or selective media of choice. 

Because the microbial growth is confined to grid cells of known position and size 
5 on the membrane, the HGMF functions more like an MPN apparatus than a conventional 
plate or membrane filter. Peterkin et aL (1987) reported that these HGMFs can be used 
to propagate and store genomic libraries when used with a HGMF replicator. One such 
instrument replicates growth from each of the 1600 cells of the ISO-GRID and enables 
many copies of the master HGMF to be made (Peterkin et aL., 1987). 

10 Sharpe et aL. (1989) also used ISO-GRID HGMF form QA Laboratories and an < 

automated HGMF counter (MI-I 00 Interpreter) and RP- 100 Replicator. They reported a 
technique for maintaining and screening many microbial cultures. 

Peterkin and colleagues later described a method for screening DNA probes using 
the hydrophobic grid-membrane filter (Peterkin et aL., 1989). These authors reported 

15 methods for effective colony hybridization directly on HGMFs. Previously, poor results < 
had been obtained due to the low DNA binding capacity of the epoxysulfone polymer on 
which the HGMFs are printed: However, Peterkin et aL. (1989) reported that the binding 
of DNA to the surface of the membrane was improved by treating the replicated and 
incubated HGMF with polyethyleneimine,a polycation, prior to contact with DNA. 

20 Although this early work uses cellular DNA attachment, and has a different objective to 
the present invention, the methodology described may be readily adapted for Format 3 
SBH. 

In order to identify useful sequences rapidly, Peterkin et aL. (1989) used 
radiolabeled plasmid DNA from various clones and tested its specificity against the DNA on 

25 the prepared HGMFs. In this way, DNA from recombinant plasmids was rapidly screened 
by colony hybridization against 100 organisms on HGMF replicates which can be easily and 
reproducibly prepared. Manipulation with small (2-3 mm) chips, and parallel execution of 
thousands of the reactions. The solution of the invention is to keep the chips and the probes 
in the corresponding arrays. In one example, chips containing 250,000 9-mers are 

30 synthesized on a silicon wafer in the form of 8 x 8 mM plates (1 5 joM/oligonucleotide, Pease 
et aL., 1994) arrayed in 8 x 12 format (96 chips) with a 1 mM groove in between. Probes 
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are added either by multichannel pipette or pin array, one probe on one chip. To score all 
4000 6-mers, 42 chip arrays have to be used, either using different ones, or by reusing one 
set of chip arrays several times. In the above case, using the earlier nomenclature of the 
application, F=9; P=6; and F + P = 15. Chips may have probes of formula B x N n , where x is 
5 a number of specified bases B; and n is a number of non-specified bases, so that x = 4 to 1 0 
and n = 1 to 4. To achieve more efficient hybridization, and to avoid potential influence of 
any support oligonucleotides, the specified bases can be surrounded by unspecified bases, 
thus represented by a formula such as (N)nB x (N) m » 

1 0 5.27 HYBRIDIZATION AND SCORING PROCESS 

Labeled probes may be mixed with hybridization buffer and pipetted, preferably by 
multichannel pipettes, to the subarrays. To prevent mixing of the probes between subarrays 
(if there are no hydrophobic strips or physical barriers imprinted in the membrane), a 
corresponding plastic, metal or ceramic grid may be firmly pressed to the membrane. Also, 

15 the volume of the buffer may be reduced to about 1 jil or less per mm 2 . The concentration 
of the probes and hybridization conditions used may be as described previously except that 
the washing buffer may be quickly poured over the array of subarrays to allow fast dilution 
of probes and thus prevent significant cross-hybridization. For the same reason, a minimal 
concentration of the probes may be used and hybridization time extended to the maximal 

20 practical level. For DNA detection and sequencing, knowledge of a "normal" sequence 
allows the use of the continuous stacking interaction phenomenon to increase the signal. - In 
addition to the labelled probe, additional unlabelled probes which hybridize back to back 
with a labelled one may be added in the hybridization reaction. The amount of the hybrid 
may be increased several times. The probes may be connected by ligation. This approach 

25 may be important for resolving DNA regions forming "compressions". 

In the case of radiolabeled probes, images of the filters may be obtained, 
preferably. by phosphor storage technology. Fluorescent labels may be scored by CCD 
cameras, confocal microscopy or otherwise. In order to properly scale and integrate data 
from different hybridization experiments, raw signals are normalized based on the 

30 amount of target in each dot. Differences in the amount of target DNA per dot may be 
corrected for dividing signals of each probe by an average signal for all probes scored on 
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one dot. The normalized signals may be scaled, usually from 1-100, to compare data 
from different experiments. Also, in each subarray, several control DNAs may be used to 
determine an average background signal in those samples which do not contain a full 
match target. For samples obtained from diploid (polyploid) scores, homozygotic 
5 controls may be used to allow recognition of heterozygotes in the samples. 
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5.28 HYBRIDIZATION WITH OLIGONUCLEOTIDES 

Oligonucleotides were either purchased from Genosys Inc., Houston, Texas or made 
on an Applied Biosystems 381 A DNA synthesizer. Most of the probes used were not 
purified by HPLC or gel electrophoresis. For example, probes were designed to have both a 
5 single perfectly complementary target in interferon, a M13 clone containing a 921 bp 
Eco RI-Bgl II human Bl-interferon fragment (Ohno and Tangiuchi, Proc. Natl. Acad. Sci. 
74: 4370-4374 (1981)], and at least one target with an end base mismatch in M13 vector 
itself. 

End labelling of oligonucleotides was performed as described (Maniatis et al.. 9 

10 Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Cold Spring 
Harbor, New York (1982)) in 10 |il containing T4-polynucleotide kinase (5 units 
Amersham), y 32 P-ATP (3.3 pM, 10 mCi Amersham 3000 Ci/mM) and the 
oligonucleotide (4 pM, 10 ng). Specific activities of the probes were 2.5-5 xlO 9 cpm/nm. 
Single stranded DNA (2 to 4 ml in 0.5 NaOH, 1 .5 M NaCl) was spotted on a 

15 Gene Screen membrane wetted with the same solution, the filters were neutralized in 0.05 
M Na2HP0 4 pH 6.5, baked in an oven at 80°C for 60 min, and UV irradiated for 1 min. 
Then, the filters were incubated in hybridization solution (0.5 M Na2HP04 pH 7.2, 7% 
sodium lauroyl sarcosine for 5 min at room temperature and placed on the surface of a 
plastic Petri dish. A drop of hybridization solution(10 ml, 0.5 M Na2HP04 pH 7.2, 7% 

20 sodium lauroyl sarcosine) with a 32p end-labeled oligomer probe at 4 nM concentration 
was placed over 1 -6 dots per filter, overlaid with a square piece of polyethylene 
(approximately 1x1 cm.), and incubated in a moist chamber at the indicated 
temperatures for 3 hr. Hybridization was stopped by placing the filter in 6X SSC 
washing solution for 3 to 5 minutes at 0°C to remove unhybridized probe. The filter was 

25 either dried, or further washed for the indicated times and temperatures, and 

autoradiographed. For discrimination measurements, the dots were excised from the 
dried filters after autoradiography (a phosphoimager (Molecular Dynamics, Sunnyvale, 
California) may be used) placed in liquid scintillation cocktail and counted. The 
uncorrected ratio of cpms for IF and Ml 3 dots is given as D. 

30 The conditions reported herein allow hybridization with very short 

oligonucleotides but ensure discriminations between matched and mismatched 
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oligonucleotides that are complementary to and therefore bind to a target nucleic acid. 
Factors which influence the efficient detection of hybridization of specific short 
sequences based on the degree of discriminations (D) between a perfectly complementary 
target and an imperfectly complementary target with a single mismatch in the hybrid are 
5 defined. In experimental tests, dot blot hybridization of twenty-eight probes that were 6 
to 8 nucleotides in length to two Ml 3 clones or to model oligonucleotides bound to 
membrane filters was accomplished. The principles guiding the experimental procedures 
are given below. 

Oligonucleotide hybridization to filter bound target nucleic acids only a few 
10 nucleotides longer than the probe in conditions of probe excess is a pseudo-first order 
reaction with respect to target concentration. This reaction is defined by: 
S t /S„ = e- kh[OPlt 

Wherein S t and S 0 are target sequence concentrations at time t and to, respectively. 
(OP) is probe concentration and t is temperature. The rate constant for hybrid formation, 
15 k h increases only slightly in the 0°C to 30°C range (Porschke and Eigen, J. Mol Biol. 62: 
361 (1971); Craig et al.. J Mol Biol 62: 383 (1971). Hybrid melting is a first order 
reaction with respect to hybrid concentration (here replaced by mass due to filter bound 
state) as shown in: 

Ht/Ho^e*™ 1 

20 In this equation, H t and H c are hybrid concentrations at times t and to, 

respectively; k m is a rate constant for hybrid melting which is dependent on temperature 
and salt concentration (Ikuta et al.. Nucl Acids Res. 15: 797 (1987); Porsclike and Eigen, 
J. Mol Biol 62: 361 (1971); Craig et al.. J. Mol Biol 62: 303 (1971)). During 
hybridization, which is a strand association process, the back, melting, or strand 

25 dissociation, reaction takes place as well. Thus, the amount of hybrid formed in time is 
result of forward and back reactions. The equilibrium may be moved towards hybrid 
formation by increasing probe concentration and/or decreasing temperature. However, 
during washing cycles in large volumes of buffer, the melting reaction is dominant and 
the back reaction hybridization is insignificant, since the probe is absent. This analysis 

30 indicates workable Short Oligonucleotide Hybridization (SOH) conditions call be varied 
for probe concentration or temperature. 
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D or discrimination is defined above: 
D = H p (t w )/Hi(t w ) 

Hp (t w ) and Hi (t w ) are the amounts hybrids remaining after a washing time, t w , for 
he identical amounts of perfectly and imperfectly complementary duplex, respectively. 
5 For a given temperature, the discrimination D changes with the 10 length of washing time 
and reaches the maximal value when Hi = B which is equation five. 

The background, B, represents the lowest hybridization signal detectable in the 1 
system. Since any further decrease of H* may not be examined, D increases upon 
continued washing. Washing past t w just decreases H p relative to B, and is seen as a 
10 decrease in D. The optimal washing time, t w , for imperfect hybrids, from equation three 
and equation five is: 

t w =ln(B/Hi (to)/k m i 

Since H p is being washed for the same t w , combining equations, one obtains the 
optimal discrimination function: 

15 j-j = e ln B/Hi (tO)) km,p/km,i 

The change of D as a function, of T is important because of the choice of an 
optimal washing temperature. It is obtained by substituting the Arrhenius equation which 

is: 

K- = Ae' Ea/RT 

20 into the previous equation to form the final equation: 

D = H p ((to)/B X (B/Hi (to)) (Ap/Ai)e(E a,r E a,p )/RT ; 
Wherein B is less than Hj (to). 

Since the activation energy for perfect hybrids, E ap , and the activation energy for 
imperfect hybrids, E^, can be either equal, or Ea,i less than Ea, p D is temperature 

25 independent, or decreases with increasing temperature, respectively. This result implies 
that the search for stringent temperature conditions for good discrimination in SOH is 
unjustified. By washing at lower temperatures, one obtains equal or better 
discrimination, but the time of washing exponentially increases with the decrease of 
temperature. Discrimination more strongly decreases with T, if Hi(to) increases relative 

30 to H p (t 0 ). 
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D at lower temperatures depends to a higher degree on the H p (to)/B ratio than on 
the Hp (to) / Hj (to) ratio. This result indicates that it is better to obtain a sufficient 
quantity of H p in the hybridization regardless of the discrimination that can be achieved 
in this step. Better discrimination can then be obtained by washing, since the higher 
5 amounts of perfect hybrid allow more time for differential melting to show an effect. 
Similarly, using larger amounts of target nucleic acid a necessary discrimination can be 
obtained even with small differences between K m p and K m j. 

Extrapolated to a more complex situation than covered in this simple model, the 
result is that washing at lower temperatures is even more important for obtaining 

10 v discrimination in the case of hybridization of a probe having many end-mismatches 
within a given nucleic acid target. 

Using the described theoretical principles as a guide for experiments, reliable 
hybridizations have been obtained with probes six to eight nucleotides in length. All 
experiments were performed with a floating plastic sheet providing a film of 

15 hybridization solution above the filter. This procedure allows maximal reduction in the 
amount of probe, and thus reduced label costs in dot blot hybridizations. The high 
concentration of sodium lauroyl sarcosine instead of sodium lauroyl sulfate in the 
phosphate hybridization buffer allows dropping the reaction from room temperature 
down to 12°C. Similarly, the 4-6 X SSC, 10% sodium lauroyl sarcosine buffer allows 

20 hybridization at temperatures as low as 2°C. The detergent in these buffers is for 
obtaining tolerable background with up to 40 nM concentrations of labelled probe. 
Preliminary characterization of the thermal stability of short oligonucleotide hybrids was 
determined on a prototype octamer with 50% G+C content, i.e. probe of sequence 
TGCTCATG. The theoretical expectation is that this probe is among the less stable 

25 octamers. Its transition enthalpy is similar to those of more stable heptamers or, even to 
probes 6 nucleotides in length (Bresslauer et al.. Proc. Natl Acad, Sci. U.S.A. 83: 3746 
(1986)). Parameter T d , the temperature at which 50% of the hybrid is melted in unit time 
of a minute is 18°C. The result shows that T d is 15°C lower for the 8 bp hybrid than for 
an 1 1 bp duplex (Wallace et al.. Nucleic Acids Res. 6: 3543 (1979)). 

30 In addition to experiments with model oligonucleotides, an Ml 3 vector was 

chosen as a system for a practical demonstration of short oligonucleotide hybridization. 
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The main aim was to show useful end-mismatch discrimination with a target similar to 
the ones which will be used in various applications of the method of the invention. 
Oligonucleotide probes for the Ml 3 model were chosen in such a way that the Ml 3 
vector itself contains the end mismatched base. Vector IF, an Ml 3 recombinant 
5 containing a 921 bp human interferon gene insert, carries single perfectly matched target. 
Thus, IF has either the identical or a higher number of mismatched targets in comparison 
to the Ml 3 vector itself. Using low temperature conditions and dot blots, sufficient 
differences in hybridization signals were obtained between tie dot containing the perfect 
and the mismatched targets and the dot containing the mismatched targets only. This was 

10 true for the 6-mer oligonucleotides and was also true for the 7 and 8-mer oligonucleotides 
hybridized to the large JF-M13 pair of nucleic acids. 

The hybridization signal depends on the amount of target available on the filter 
for reaction with the probe. A necessary control is to show that the difference in sign 
intensity is not a reflection of varying amounts of nucleic acid in the two dots. 

15 Hybridization with a probe that has the same number and kind of targets in both IF and 
Ml 3 shows that there is an equal amount of DNA in the dots. Since the efficiency of 
hybrid formation increases with hybrid length, the signal for a duplex having six 
nucleotides was best detected with a high mass of oligonucleotide target bound to the 
filter. Due to their lower molecular weight, a larger number of oligonucleotide target 

20 molecules can be bound to a given surface area when compared to large molecules of 
nucleic acid that serves as target. 

To measure the sensitivity of detection with unpurified DNA, various amounts of 
phage supernatants were spotted on the filter and hybridized with 32 P-labelled octamer. 
As little as 50 million unpurified phage containing no more than 0.5 ng of DNA gave a 

25 detectable signal indicating that sensitivity of the short oligonucleotide hybridization 
method is sufficient. Reaction time is short, adding to the practicality. 

As mentioned in the theoretical section above, the equilibrium yield of hybrid 
depends oil probe concentration and/or temperature of reaction. For instance, the signal 
level for the same amount of target with 4 nM octamer at 13 °C is 3 times lower than with 

30 a probe concentration of 40 nM, and is decreased 4 to 5 times by raising the hybridization 
5 temperature to 25°C. 
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The utility of the low temperature wash for achieving maximal discrimination is 
demonstrated. To make the phenomenon visually obvious, 50 times more DNA was put 
in the Ml 3 dot than in the IF dot using hybridization with a vector specific probe. In this 
way, the signal after the hybridization step with the actual probe was made stronger in 
5 the, mismatched that in the matched case. The Hp/Hi ratio was 1 :4. Inversion of signal 
intensities after prolonged washing at 7°C was achieved without a massive loss of perfect 
hybrid, resulting in a ratio of 2: 1 . In contrast, it is impossible to achieve any 
discrimination at 25°C, since the matched target signal is already brought down to the 
background level with 2 minute washing; at the same time, the signal from .the 

10 >mismatched hybrid is still detectable. The loss of discrimination at 13°C compared to 
7°C is not so great but is clearly visible. If one considers the 90 minute point at 7°C and 
the 15 minute point at 13°C when, the mismatched hybrid signal is near the background 
level, which represents optimal washing times for the respective conditions, it is obvious 
that the amount of several times greater at 7°C than at 13°C. To illustrate this further, the 

15 time course of the change discrimination with washing of the same amount of starting 
; hybrid at the two temperatures shows the higher maximal D at the lower temperature. 

These results confirm the trend in the change of D with temperature and the ratio of 
, amounts of the two types of hybrid at the start of the washing step. 

In order to show the general utility of the short oligonucleotide hybridization 

20 conditions, we have looked hybridization of 4 heptamers, 10 octamers and an additional 
14 probes up to 12 nucleotides in length in our simple Ml 3 system. These include-the 
nonamer GTTTTTTAA and octamer GGCAGGCG representing the two extremes of GC 
content. Although GC content and sequence are expected to influence the stability of 
short hybrids (Bresslauer et al.. Proc. Natl Acad. Set U.S.A. 83: 3746 (1986)), the low 

25 temperature short oligonucleotide conditions were applicable to all tested probes in 

achieving sufficient discrimination. Since the best discrimination value obtained with 
probes 1 3 nucleotides in length was 20, a several fold drop due to sequence variation is 
easily tolerated. 

The Ml 3 system has the advantage of showing the effects of target DNA 
30 complexity on the levels of discrimination. For two octamers having either none or five 
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mismatched targets and differing in only one GC pair the observed discriminations were 
18.3 and 1.7, respectively. 

In order to show the utility of this method, three probes 8 nucleotides in length 
were tested on a collection of 5 1 plasmid DNA dots made from a library in Bluescript 
5 vector. One probe was present and specific for Bluescript vector but was absent in Ml 3, 
while the other two probes had targets that were inserts of known sequence. This system 
allowed the use of hybridization negative or positive control DNAs with each probe. 
This probe sequence (CTCCCTTT) also had a complementary target in the interferon 
insert. Since the Ml 3 dot is negative while the interferon insert in either Ml 3 or 

10 Bluescript was positive, hybridization is sequence specific. Similarly, probes that detect 
the target sequence in only one of 5 1 inserts, or in none of the examined inserts along 
with controls that confirm that hybridization would have occurred if the appropriate 
targets were present in the clones. 

Thermal stability curves for very short oligonucleotide hybrids that are 6-8 

15 nucleotides in length are at least 15°C lower than for hybrids 11-12 nucleotides in length 
(Wallace et aL Nucleic Acids Res, 6: 3543-3557 (1979). However, performing the 
hybridization reaction at a low temperature and with a very practical 0.4-40 nM 
concentration of oligonucleotide probe allows the detection of complementary sequence 
in known or unknown nucleic acid target. To determine an unknown nucleic acid 

20 sequence completely, an entire set containing 65,535 8-mer probes may be used. 

Sufficient amounts of nucleic acid for this purpose are present in convenient biological 
samples such as a few microliters of Ml 3 culture, a plasmid prep from 10 ml of bacterial 
culture or a single colony of bacteria, or less than 1 ml of a standard PCR reaction. 

Short oligonucleotides 6-10 nucleotides long give excellent discrimination. The 

25 relative decrease in hybrid stability with a single end mismatch is greater than for longer 
probes. Results with the octamer TGCTCATG support this conclusion. In the 
experiments, the target with a G/T end mismatch, hybridization to the target of this type 
of mismatch is the most stable of all other types of oligonucleotide. This discrimination 
achieved is the same as or greater than an internal G/T mismatch in a 19 base paired 

30 duplex greater than an internal G/T mismatch in a 19 paired duplex see, (Ikuta et aL. 
Nucl. Acids Res. 15: 797 (1987). Exploiting these discrimination properties using the 
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described hybridization conditions for short oligonucleotide hybridization allows a very 
precise determination of oligonucleotide targets. In contrast to the ease of detecting 
discrimination between perfect and imperfect hybrids, a problem that may exist with 
using very short oligonucleotides is the preparation of sufficient amounts of hybrids. In 
5 practice, the need to discriminate H p and Hi is aided by increasing the amount of DNA in 
the dot and/or the probe concentration, or by decreasing the hybridization temperature. 
However, higher probe concentrations usually increase background. Moreover, there are 
limits to the amounts of target nucleic acid that are practical to use. This problems was 
solved by the higher concentration of the detergent Sarcosyl which gave an effective 

10 background with 4 nM of probe. Further improvements may be effected either in the use 
of competitors for unspecific binding of probe to filter, or by changing the hybridization 
support material. Moreover, for probes having E a less than 45 Kcal/mol (eg., for many 
heptamers and a majority of hexamers, modified oligonucleotides give a more stable 
hybrid (Asseline, et al.. Proc. Natl Acad. Scl 81; 3297 (1984) than their unmodified 

1 5 counterparts. The hybridization conditions described in this invention for short 

oligonucleotide hybridization using low temperatures give better discriminating for all 
sequences and duplex hybrid inputs. The only price paid in achieving uniformity in 
hybridization conditions for different sequences is an increase in washing time from 
minutes to up to 24 hours depending on the sequence. Moreover, the washing time can 

20 be further reduced by decreasing the salt concentration. 

Although there is excellent discrimination of one matched hybrid over a 
mismatched hybrids, in short oligonucleotide hybridization, signals from mismatched 
hybrids exist, with the majority of the mismatch hybrids resulting from end mismatch. 
This may limit insert sizes that may be effectively examined by a probe of a certain 

25 length. 

The influence of sequence complexity on discrimination cannot be ignored. 
However, the complexity effects are more significant when defining sequence 
information by short oligonucleotide hybridization for specific, nonrandom sequences, 
and can be overcome by using an appropriate probe to target length ratio. The length 
30 ratio is chosen to make unlikely, on statistical grounds, the occurrence of specific 

sequences which have a number of end-mismatches which would be able to eliminate or 
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falsely invert discrimination. Results suggest the use of oligonucleotides 6, 7, and 8 
nucleotides in length on target nucleic acid inserts shorter than 0.6, 2.5, and 10 kb, 
respectively. 

5 5.29 DNA SEQUENCING 

An array of subarrays allows for efficient sequencing of a small set of samples 
arrayed in the form of replicated subarrays; for example, 64 samples may be arrayed on a 
8x8 mm subarray and 16 x 24 subarrays may be replicated on a 15 x 23 cm membrane 
with 1 mm wide spacers between the subarrays. Several replica membranes may be 

10 made. For example, probes from a universal set of three thousand seventy-two 7-mers 
may be divided in thirty-two 96-well plates and labelled by kinasing. Four membranes 
may be processed in parallel during one hybridization cycle. On each membrane, 384 
probes may be scored. All probes may be scored in two hybridization cycles. 
Hybridization intensities may be scored and the sequence assembled as described below. 

15 If a single sample subarray or subarrays contains several unknowns, especially 

when similar samples are used, a smaller number of probes may be sufficient if they are 
intelligently selected on the basis of results of previously scored probes. For example, if 
probe AAAAAAA is not positive, there is a small chance that any of 8 overlapping 
probes are positive. If AAAAAAA is positive, then two probes are usually positive. The 

20 sequencing process in this case consists of first hybridizing a subset of minimally 

overlapped probes to define positive anchors and then to successively select probes which 
confirms one of the most likely hypotheses about the order of anchors and size and type 
of gaps between them. In this second phase, pools of 2-10 probes may be used where 
each probe is selected to be positive in only one DNA sample which is different from the 

25 samples expected to be positive with other probes from the pool. 

The subarray approach allows efficient implementation of probe competition 
(overlapped probes) or probe cooperation (continuous stacking of probes) in solving 
branching problems. After hybridization of a universal set of probes the sequence 
assembly program determines candidate sequence subfragments (SFs). For the further 

30 assembly of SFs, additional information has to be provided (from overlapped sequences 
of DNA fragments, similar sequences, single pass gel sequences, or from other 
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hybridization or restriction mapping data). Competitive hybridization and continuous 
stacking interactions have been proposed for SF assembly. These approaches are of 
limited practical value for sequencing of large numbers of samples by SBH wherein a 
labelled probe is applied to a sample affixed to an array if a uniform array is used. 
5 Fortunately, analysis of small numbers of samples using replica subarrays allows efficient 
implementation of both approaches. On each of the replica subarrays, one branching 
point may be tested for one or more DNA samples using pools of probes similarly as in 
solving mutated sequences in different samples spotted in the same subarray (see above). 
If in each of 64 samples described in this example, there are about 100 branching 

10 points, and if 8 samples are analyzed in parallel in each subarray, then at least 800 

subarray probings solve all branches. This means that for the 3072 basic probings an 
additional 800 probings (25%) are employed. More preferably, two probings are used for 
one branching point. If the subarrays are smaller, less additional probings are used. For 
example, if subarrays consist of 16 samples, 200 additional probings may be scored (6%). 

15 By using 7-mer probes (N1-2B7N1-2) and competitive or collaborative branching solving 
approaches or both, fragments of about 1000 bp fragments may be assembled by about 
4000 probings. Furthermore, using 8-mer probes (NBsN) 4 kb or longer fragments may 
be assembled with 12,000 probings. Gapped probes, for example, NB4NB3N or 
NB4NB4N may be used to reduce the number of branching points. 

20 

5.30 DNA ANALYSIS BY TRANSIENT ATTACHMENT TO SUBARRAYS 
OF PROBES AND LIGATION OF LABELLED PROBES 

Oligonucleotide probes having an informative length of four to 40 bases are 
synthesized by standard chemistry and stored in tubes or in multiwell plates. Specific 

25 sets of probes comprising one to 10,000 probes are arrayed by deposition or in situ 

synthesis on separate supports or distinct sections of a larger support. In the last case, 
sections or subarrays may be separated by physicalor hydrophobic barriers. The probe 
arrays may be prepared by in situ synthesis. A sample DNA of appropriate size is 
hybridized with one or more specific arrays. Many samples may be interrogated as pools 

30 at the same subarrays or independently with different subarrays within one support. 
Simultaneously with the sample or subsequently, a single labelled probe or a pool of 
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labelled probes is added on each of the subarrays. If attached and labelled probes 
hybridize back to back on the complementary target in the sample DNA they are ligated. 
Occurrence of ligation will be measured by detecting a label from the probe. 

This procedure is a variant of the described DNA analysis process in which DNA 
5 samples are not permanently attached to the support. Transient attachment is provided by 
probes fixed to the support. In this case there is no need for a target DNA arraying 
process. In addition, ligation allows detection of longer oligonucleotide sequences by 
combining short labelled probes with short fixed probes. 

The process has several unique features. Basically, the transient attachment of the 

1 0 target allows its reuse. After ligation occur the target may be released and the label will 
stay covalently attached to the support. This feature allows cycling the target and 
production of detectable signal with a small quantity of the target. Under optimal 
conditions, targets do not need to be amplified, e.g. natural sources of the DNA samples 
may be directly used for diagnostics and sequencing purposes. Targets may be released 

15 by cycling the temperature between efficient hybridization and efficient melting of 

duplexes. More preferably, there is no cycling. The temperature and concentrations of 
components may be defined to have an equilibrium between free targets and targets 
entered in hybrids at about 50:50% level. In this case there is a continuous production of 
ligated products. For different purposes different equilibrium ratios are optimal. 

20 An electric field may be used to enhance target use. At the beginning, a 

horizontal field pulsing within each subarray may be employed to provide for faster 
target sorting. In this phase, the equilibrium is moved toward hybrid formation, and 
unlabelled probes may be used. After a target sorting phase, an appropriate washing 
(which may be helped by a vertical electric field for restricting movement of the samples) 

25 may be performed. Several cycles of discriminative hybrid melting, target harvesting by 
hybridization and ligation and removing of unused targets may be introduced to increase 
specificity. In the next step, labelled probes are added and vertical electrical pulses may 
be applied. By increasing temperature, an optimal free and hybridized target ratio may be 
achieved. The vertical electric field prevents diffusion of the sorted targets. 

30 The subarrays of fixed probes and sets of labelled probes (specially designed or 

selected from a universal probe set) may be arranged in various ways to allow an efficient 
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and flexible sequencing and diagnostics process. For example, if a short fragment (about 
100-500 bp) of a bacterial genome is to be partially or completely sequenced, small 
arrays of probes (5-30 bases in length) designed on the bases of known sequence may be 
used. If interrogated with a different pool of 10 labelled probes per subarray, an array of 
5 10 subarrays each having 10 probes, allows checking of 200 bases, assuming that only 
two bases connected by ligation are scored. Under the conditions where mismatches are 
discriminated throughout the hybrid, probes may be displaced by more than one base to 
cover the longer target with the same number of probes. By using long probes, the target 
may be interrogated directly without amplification or isolation from the rest of DNA in . 

10 the sample. Also, several targets may be analyzed (screened for) in one sample 

simultaneously. If the obtained results indicate occurrence of a mutation (or a pathogen), 
additional pools of probes may be used to detect type of the mutation or subtype of 
pathogen. This is a desirable feature of the process which may be very cost effective in 
. preventive diagnosis where only a small fraction of patients is expected to have an 

1 5 infection or mutation. 

In the processes described in the examples, various detection methods may be 
used, for example, radiolabels, fluorescent labels, enzymes or antibodies 
(chemiluminescence), large molecules or particles detectable by light scattering or 
interferometric procedures. 

20 

5.31 SEQUENCING A TARGET USING OCTAMERS AND NONAMERS 

Data resulting from the hybridization of octamer and nonamer oligonucleotides 
shows that sequencing by hybridization provides an extremely high degree of accuracy. 
In this experiment, a known sequence was used to predict a series of contiguous 

25 overlapping component octamer and nonamer oligonucleotides. 

In addition to the perfectly matching oligonucleotides, mismatch oligonucleotides, 
mismatch oligonucleotides wherein internal or end mismatches occur in the duplex 
formed by the oligonucleotide and the target were examined. In these analyses, the 
lowest practical temperature was used to maximize hybridization formation. Washes 

30 were accomplished at the same or lower temperatures to ensure maximal discrimination 
by utilizing the greater dissociation rate of mismatch versus matched 
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oligonucleotide/target hybridization. These conditions are shown to be applicable to all 

sequences although the absolute hybridization yield is shown to be sequence dependent. 
The least destabilizing mismatch that can be postulated is a simple end mismatch, 

so that the test of sequencing by hybridization is the ability to discriminate perfectly 
5 matched oligonucleotide/targetduplexes from end-mismatched 

oligonucleotide/targetduplexes. 

The discriminative values for 102 of 105 hybridizing oligonucleotides in a dot 

blot format were greater than 2 allowing a highly accurate generation of the sequence. 

This system also allowed an analysis of the effect of sequence on hybridization formation 
10 and hybridization instability. 

One hundred base pairs of a known portion of a human-interferon genes prepared 

by PCR, i.e. a 100 bp target sequence, was generated with data resulting from the 

hybridization of 105 oligonucleotides probes of known sequence to the target nucleic 

acid. The oligonucleotide probes used included 72 octamer and 21 nonamer 
1 5 oligonucleotides whose sequence was perfectly complementary to the target. The set of 

93 probes provided consecutive overlapping frames of the target sequence e displaced by 

one or two bases. 

To evaluate the effect of mismatches, hybridization was examined for 12 
additional probes that contained at least one end mismatch when hybridized to the 100 bp 

20 test target sequence. Also tested was the hybridization of twelve probes with target end- 
mismatched to four other control nucleic acid sequences chosen so that the 12 
oligonucleotides formed perfectly matched duplex hybrids with the four control DNAs. 
Thus, the hybridization of internal mismatched, end-mismatched and perfectly matched 
duplex pairs of oligonucleotide and target were evaluated for each oligonucleotide used 

25 in the experiment. The effect of absolute DNA target concentration on the hybridization 
with the test octamer and nonamer oligonucleotides was determined by defining target 
DNA concentration by detecting hybridization of a different oligonucleotide probe to a 
single occurrence non-target site within the co-amplified plasmid DNA. 

The results of this experiment showed that all oligonucleotides containing perfect 

30 matching complementary sequence to the target or control DNA hybridized more 

strongly than those oligonucleotides having mismatches. To come to this conclusion, we 
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examined H p and D values for each probe. H p defines the amount of hybrid duplex 
formed between a test target and an oligonucleotide probe. By assigning values of 
between 0 and 10 to the hybridization obtained for the 105 probes, it was apparent that 
68.5% of the 105 probes had an H p greater than 2. 
5 Discrimination (D) values were obtained where D was defined as the ratio of 

signal intensities between 1) the dot containing a perfect matched duplex formed between 
test oligonucleotide and target or control nucleic acid and 2) the dot containing a 
mismatch duplex formed between the same oligonucleotide and a different site within the 
target or control nucleic acid. Variations in the value of D result from either 1) 

1 0 perturbations in the hybridization efficiency which allows visualization of signal over 
background, or 2) the type of mismatch found between the test oligonucleotide and the 
target. The D values obtained in this experiment were between 2 and 40 for 102 of the 
105 oligonucleotide probes examined. Calculations of D for the group of 102 
oligonucleotides as a whole showed the average D was 10.6. 

15 There were 20 cases where oligonucleotide/targetduplexes exhibited an end- 

mismatch. In five of these, D was greater than 10. The large D value in these cases is 
most likely due to hybridization destabilization caused by other than the most stable (G/T 
and G/A) end mismatches. The other possibility is there was an error in the sequence of 
either the oligonucleotides or the target. 

20 Error in the target for probes with low H p , was excluded as a possibility because 

such an error would have affected the hybridization of each of the other eight overlapping 
oligonucleotides. There was no apparent instability due to sequence mismatch for the 
other overlapping oligonucleotides, indicating the target sequence was correct. Error in 
the oligonucleotide sequence was excluded as a possibility after the hybridization of 

25 seven newly synthesized oligonucleotides was re-examined. Only 1 of the seven 

oligonucleotides resulted in a better D value. Low hybrid formation values may result 
from hybrid instability or from an inability to form hybrid duplex. An inability to form 
hybrid duplexes would result from either 1) self complementarity of the chosen probe or 
2) target/target self hybridization. Oligonucleotide/oligonucleotideduplex formation may 

30 be favored over oligonucleotide/targethybrid duplex formation if the probe was self- 
complementary. Similarly, target/target association may be favored if the target was self- 
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complementary or may form internal palindromes. In evaluating these possibilities, it 
was apparent from probe analysis that the questionable probes did not form hybrids with 
themselves. Moreover, in examining the contribution of target/target hybridization, it 
was determined that one of the questionable oligonucleotide probes hybridized 
inefficiently with two different DNAs containing the same target. The low probability 
that two different DNAs have a self-complementary region for the same target sequence 
leads to the conclusion that target/target hybridization did not contribute to low 
hybridization formation. Thus, these results indicate that hybrid instability and not the 
inability to form hybrids was the cause of the low hybrid formation observed for specific 
oligonucleotides. The results also indicate that low hybrid formation is due to the 
specific sequences of certain oligonucleotides. Moreover, the results indicate that 
reliable results may be obtained to generate sequences if octamer and nonamer 
oligonucleotides are used. 

These results show that using the methods described long sequences of any target 
nucleic acid may be generated by maximal and unique overlap of constituent 
oligonucleotides. Such sequencing methods are dependent on the content of the 
individual component oligomers regardless of their frequency and their position. 

The sequence which is generated using the algorithm described below is of high 
fidelity. The algorithm tolerates false positive signals from the hybridization dots as is 
indicated from the fact the sequence generated from the 105 hybridization values, which 
included four less reliable values, was correct. This fidelity in sequencing by 
hybridization is due to the "all or none" kinetics of short oligonucleotide hybridization 
and the difference in duplex stability that exists between perfectly matched duplexes and 
mismatched duplexes. The ratio of duplex stability of matched and end-mismatched 
duplexes increases with decreasing duplex length. Moreover, binding energy decreases 
with decreasing duplex length resulting in a lower hybridization efficiency. However, the 
results provided show that octamer hybridization allows the balancing of the factors 
affecting duplex stability and discrimination to produce a highly accurate method of 
sequencing by hybridization. Results presented in other examples show that 
oligonucleotides that are 6, 7, or 8 nucleotides can be effectively used to generate reliable 
sequence on targets that are 0.5 kb (for hexamers) 2 kb (for septamers) and 6kb (for 
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octamers). The sequence of long fragments may be overlapped to generate a complete 
genome sequence. 

532 ANALYZING THE DATA OBTAINED 

5 Image files are analyzed by an image analysis program, like DOTS program 

(Drmanac et aL. 1993), and scaled and evaluated by statistical functions included, e.g., in 
SCORES program (Drmanac et aL. 1994). From the distribution of the signals an 
optimal threshold is determined for transforming signal into +/- output. From the 
position of the label detected, F + P nucleotide sequences from the fragments would be 
10 determined by combining the known sequences of the immobilized and labeled probes 
corresponding to the labeled positions. The complete nucleic acid sequence or sequence 
subfragments of the original molecule, such as a human chromosome, would then be 
assembled from the overlapping F + P sequence determined by computational deduction. 
One option is to transform hybridization signals e.g., scores, into +/- output 
1 5 during the sequence assembly process. In this case, assembly will start with aF + P 
sequence with a very high score, for example F + P sequence AAAAAATTTTTT. 
Scores of all four possible overlapping probes AAAAATTTTTTA, AAAAATTTTTT, 
p k j s jjsjs{XYYYTlC and AAAAATTTTTTG and three additional probes that are different 
at the beginning (T AAAAATTTTTT, C AAAAATTTTTT, G AAAAATTTTTT, are 
20 compared and three outcomes defined: (i) only the starting probe and only one of the 

four overlapping proves have scores that are significantly positive relatively to the other 
six probes, in this case the AAAAAATTTTTT sequence will be extended for one 
nucleotide to the right; (ii) no one probe except the starting probe has a significantly 
positive score, assembly will stop, e.g., the AAAAAATTTTT sequence is at the end of 
25 the DNA molecule that is sequenced; (iii) more than one significantly positive probe 

among the overlapped and/or other three probes is found; assembly is stopped because of 
the error or branching (Drmanac et aL., 1989). 

The processes of computational deduction would employ computer programs 
using existing algorithms (see, e.g., Pevzner, 1989; Drmanac et aL., 1991; Labat and 
30 Drmanac, 1 993 ; each incorporated herein by reference). 
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If, in addition to F + P, F (1)P, F (2)P, F(3)P or F(4)P are determined, algorithms 
will be used to match all data sets to correct potential errors or to solve the situation 
where there is a branching problem (see, e.g., Drmanac et al., 1989; Bains et al., 1988; 
each incorporated herein by reference). 

5 

533 CONDUCTING SEQUENCING BY TWO STEP HYBRIDIZATION 

Sequencing is accomplished as described herein. First, the whole chip is 
hybridized to a mixture of DNA as complex as 100 million of bp (one human 
chromosome). Guidelines for conducting hybridization can be found in papers such as 

10 Drmanac et al. (1990); Khrapko et al. (1991); and Broude et al. (1994). These articles 
teach the ranges of hybridization temperatures, buffers and washing steps that are 
appropriate for use in the initial steps of Format 3 SBH. 

The present invention particularly contemplates that hybridization is to be carried 
out for up to several hours in high salt concentrations at a low temperature (-2°C to 5°C) 

1 5 because of a relatively low concentration of target DNA that can be provided. For this 
purpose, SSC buffer is used instead of sodium phosphate buffer (Drmanac et al., 1990), 
which precipitates at 10°C. Washing does not have to be extensive (a few minutes) 
because of the second step, and can be completely eliminated when the hybridization 
cycling is used for the sequencing of highly complex DNA samples. The same buffer is 

20 used for hybridization and washing steps to be able to continue with the second 
hybridization step with labeled probes. 

After proper washing using a simple robotic device on each array, e.g., a 8 x 8 
mm array, one labeled, probe, e.g., a 6-mer, would be added. A 96-tip or 96-pin device 
would be used, performing this in 42 operations. Again, a range of discriminatory 

25 conditions could be employed, as previously described in the scientific literature. 

The present invention particularly contemplates the use of the following 
conditions. First, after adding labeled probes and incubating for several minutes only 
(because of the high concentration of added oligonucleotides) at a low temperature (0- 
5°C), the temperature is increased to 3-10°C, depending on F + P length, and the washing 

30 buffer is added. At this time, the washing buffer used is one compatible with any ligation 
reaction {e.g., 100 mM salt concentration range). After adding ligase, the temperate is 
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increased again to 15-37°C to allow fast ligation (less than 30 min) and further 
discrimination of full match and mismatch hybrids. 

The use of cationic detergents is also contemplated for use in Format 3 SBH, as 
described by Pontius & Berg (1991, incorporated herein by reference). These authors 
5 describe the use of two simple cationic detergents, dodecy- and cetyltrimethylamnionium 
bromide (DTAB and CTAB) in DNA renaturation. 

DTAB and CTAB are variants of the quaternary amine tetramethylammonium 
bromide (TMAB) in which one of the methyl groups is replaced by either a 12-carbon 
(DTAB) or a 16-carbon (CTAB) alkyl group. TMAB is the bromide salt of the 

10 tetramethylammoniumcation, a reagent used in nucleic acid renaturation experiments to 
decrease the G-C content bias of the melting temperature. DTAB and CTAB are similar 
in structure to sodium dodecyl sulfate (SDS), with the replacement of the negatively 
charged sulfate of SDS by a positively charged quaternary amine. While SDS is 
commonly used in hybridization buffers to reduce nonspecific binding and inhibit 

1 5 nucleases, it does not greatly affect the rate of renaturation. 

When using a ligation process, the enzyme could be added with the labeled probes 
or after the proper washing step to reduce the background. Although not previously 
proposed for use in any SBH method, ligase technology is well established within the 
field of molecular biology. For example, Hood and colleagues described a ligase- 

20 mediated gene detection technique (Landegren et al..; 1988), the methodology of which 
can be readily adapted for use in Format 3 SBH. Wu & Wallace also describe the use of 
bacteriophage T4 DNA ligase to join two adjacent, short synthetic olignucleotides. Their 
oligoligation reactions were carried out in 50 mM Tris HC1 pH 7.6, 10 mM MgCl 2 , 1 
mM ATP, 1 mM DTT, and 5% PEG. Ligation reactions were heated to 100°C for 5-10 

25 min followed by cooling to 0°C prior to the addition of T4 DNA ligase (1 unit; Bethesda 
Research Laboratory, gaithersburg, MD). Most ligation reactions were carried out at 
30°C and terminated by heating to 100°C for 5 min. 

Final washing appropriate for discriminating detection of hybridized adjacent, or 
ligated, oligonucleotidesof length (F + P), is then performed. This washing step is done 

30 in water for several minutes at 40-60°C to wash out all the non-ligated labeled probes, 
and all other compounds, to maximally reduce background. Because of the covalently 
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bound labeled oligonucleotides, detection is simplified (it does not have time and low 
temperature constraints). 

Depending on the label used, imaging of the chips is done with different apparati. 
For radioactive labels, phosphor storage screen technology and Phosphorlmager as a 
5 scanner may be used (Molecular Dynamics, Sunnyvale, CA). Chips are put in a cassette 
and covered by a phosphorous screen. After 1-4 hours of exposure, the screen is scanned 
and the image file stored at a computer hard disc. For the detection of fluorescent labels, 
CCD cameras and epifluorescent or confocal microscopy are used. For the chips 
generated directly on the pixels of a CCD camera, detection can be performed as 

10 described by Eggers et al. (1994, incorporated herein by reference). 

Charge-coupled device (CCD) detectors serve as active solid supports that 
quantitatively detect and image the distribution of labeled target molecules in probe- 
based assays. These devices use the inherent characteristics of microelectronics that 
accommodate highly parallel assays, ultrasensitive detection, high throughput, integrated 

15 data acquisition and computation. Eggers et al. (1994) describe CCDs for use with 
probe-based assays, such as Format 3 SBH of the present invention, that allow 
quantitative assessment within seconds due to the high sensitivity and direct coupling 
employed. 

The integrated CCD detection approach enables the detection of molecular 
20 binding events on chips. The detector rapidly generates a two-dimensional pattern that 
uniquely characterizes the sample. In the specific operation of the CCD-based molecular 
detector, distinct biological probes are immobilized directly on the pixels of a CCD or 
can be attached to a disposable cover slip placed on the CCD surface. The sample 
molecules can be labeled with radioisotope, chemiluminescent or fluorescent tags. 
25 Upon exposure of the sample to the CCD-based probe array, photons or 

radioisotope decay products are emitted at the pixel locations where the sample has 
bound, in the case of Format 3, to two complementary probes. In turn, electron-hole 
pairs are generated in the silicon when the charged particles, or radiation from the labeled 
sample, are incident on the CCD gates. Electrons are then collected beneath adjacent 
30 CCD gates and sequentially read out on a display module. The number of photoelectrons 
generated at each pixel is directly proportional to the number of molecular binding events 
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in such proximity. Consequently, molecular binding can be quantitatively determined 
(Eggers 1994). 

By placing the imaging array in proximity to the sample, the collection efficiency 
is improved by a factor of at least 10 over lens-based techniques such as those found in 
5 conventional CCD cameras. That is, the sample (emitter) is in near contact with the 

detector (imaging array), and this eliminates conventional imaging optics such as lenses 
and mirrors. 

When radioisotopes are attached as reporter groups to the target molecules, 
energetic particles Eire detected. Several reporter groups that emit particles of varying 

10 energies have been successfully utilized with the micro-fabricated detectors, including 
32 P, 33 P, 35 P, 14 C and 125 L The higher energy particles, such as from 32 P, provide the 
highest molecular detection sensitivity, whereas the lower energy particles, such as from 
35 S, provide better resolution. Hence the choice of the radioisotope reporter can be 
tailored as required. Once the particular radioisotope label is selected, the detection 

1 5 performance can be predicted by calculating the signal-to-noise ration (SNR), as 
described by Eggers et al. (1994). 

An alternative luminescent detection procedure involves the use of fluorescent or 
chemiluminescent reporter groups attached to the target molecules. The fluorescent 
labels can be attached covalently or through interaction. Fluorescent dyes, such as 

20 ethidium bromide, with intense absorption bands in the near UV (300-350 nm) range and 
principal emission bands in the visible (500-650 nm) range, are most suited for the CCD 
devices employed since the quantum efficiency is several orders of magnitude lower at 
the excitation wavelength then at the fluorescent signal wavelength. 

From the perspective of detecting luminescence, the polysilicon CCD gates have 

25 in capacity to filter away the contribution of incident light in the UV range, yet are very 
sensitive to the visible luminescence generated by the fluorescent reporter groups. Such 
inherently large discrimination against UV excitation enables large SNRs (greater than 
100) to be achieved by the CCDs as formulated in the incorporated paper by Eggers et 
aL (1994). 

30 For probe immobilization on the detector, hybridization matrices may be 

produced on inexpensive SiC>2 wafers, which are subsequently placed on the surface of 
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the CCD following hybridization and drying. This format is economically efficient since 
the hybridization of the DNA is conducted on inexpensive disposable SiC>2 wafers, thus 
allowing reuse of the more expensive CCD detector. Alternatively, the probes can be 
immobilized directly on the CCD to create a dedicated probe matrix. 
5 To immobilize probes upon the SiC>2 coating, a uniform epoxide layer is linked to 

the film surface, employing an epoxy-silane reagent and standard SiC>2 modification 
chemistry. Amine-modified oligonucleotide probes are then linked to the SiC>2 surface by 
means of secondary amine formation with the epoxide ring. The resulting linkage 
provides 17 rotatable bonds of separation between the 3 base of the oligonucleotide and 

10 the SiC>2 surface. To ensure complete amine deprotonation and to minimize secondary 
structure formation during coupling, the reaction is performed in 0. 1 M KOH and 
incubated at 37°C for 6 hours. 

In Format 3 SBH in general, signals are scored per each of billion points. It 
would not be necessary to hybridize all arrays, e.g., 4000 5x5 mm, at a time and the 

15 successive use of smaller number of arrays is possible. 

Cycling hybridizations are one possible method for increasing the hybridization 
signal. In one cycle, most of the fixed probes will hybridize with DNA fragments with 
tail sequences non-complementary for labeled probes. By increasing the temperature, 
those hybrids will be melted. In the next cycle, some of them (-0.1%) will hybridize 

20 with an appropriate DNA fragment and additional labeled probes will be ligated. In this 
case, there occurs a discriminative melting of DNA hybrids with mismatches for both 
probe sets simultaneously. 

In the cycle hybridization, all components are added before the cycling starts, at 
the T4, or a higher temperature for a thermostable ligase. Then the temperature is 

25 decreased to 15-37°C and the chip is incubated for up to 10 minutes, and then the 

temperature is increased to 37°C or higher for a few minutes and then again reduced. 
Cycles can be repeated up to 10 times. In one variant, an optimal higher temperature (10- 
50°C) can be used without cycling and longer ligation reaction can be performed (1-3 
hours). 

30 The procedure described herein allows complex chip manufacturing using 

standard synthesis and precise spotting of oligonucleotides because a relatively small 
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number of oligonucleotides are necessary. For example, if all 7-mer oligos are 
synthesized (16384 probes), lists of 256 million 14-mers can be determined. 

One important variant of the invented method is to use more than one differently 
labeled probe per base array. This can be executed with two purposes in mind; 
5 multiplexing to reduce number of separately hybridized arrays; or to determine a list of 
even longer oligosequences such as 3 x 6 or 3 x 7. In this case, if two labels are used, the 
specificity of the 3 consecutive oligonucleotides can be almost absolute because positive 
sites must have enough signals of both labels. 

A further and additional variant is to use chips containing BxNy probes with y 

10 being from 1 to 4. Those chips allow sequence reading in different frames; This can also 
be achieved by using appropriate sets of labeled probes or both F and P probes could 
have some unspecified end positions (/.e., some element of terminal degeneracy). 
Universal bases may also be employed as part of a linker to join the probes of defined 
sequence to the solid support. This makes the probe more available to hybridization and 

15 makes the construct more stable. If a probe has 5 bases, one may, e.g., use 3 universal 
bases as a linker. 

534 DETERMINING SEQUENCE FROM HYBRIDIZATION DATA 

Sequence assembly may be interrupted where ever a given overlapping (N-l) mer 
20 is duplicated two or more times. Then either of the two N-mers differing in the last 
nucleotide may be used in extending the sequence. This branching point limits 
unambiguous assembly of sequence. 

Reassembling the sequence of known oligonucleotides that hybridize to the target 
nucleic acid to generate the complete sequence of the target nucleic acid may not be 
25 accomplished in some cases. This is because some information may be lost if the target 
nucleic acid is not in fragments of appropriate size in relation to the size of 
oligonucleotide that is used for hybridizing. The quantity of information lost is 
proportional to the length of a target being sequenced. However, if sufficiently short 
targets are used, their sequence may be unambiguously determined. 
30 The probable frequency of duplicated sequences that would interfere with 

sequence y which is distributed along a certain length of DNA may be calculated. This 
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derivation requires the introduction of the definition of a parameter having to do with 
sequence organization: the sequence subfragment (SF). A sequence subfragment results 
if any part of the sequence of a target nucleic acid starts and ends with an (N-l) mer that 
is repeated two or more times within the target sequence. Thus, subfragments are 
5 sequences generated between two points of branching in the process of assembly of the 
sequences in the method of the invention. The sum of all subfragmentsis longer than the 
actual target nucleic acid because of overlapping short ends. Generally, subfragments 
may not be assembled in a linear order without additional information since they have 
shared (N-l) mers at their ends and starts. Different numbers of subfragments are 

10 obtained for each nucleic acid target depending on the number of its repeated (N-l) mers. 
The number depends on the value of N-l and the length of the target. 

Probability calculations can estimate the interrelationshipof the two factors. If the 
ordering of positive N-mers is accomplished by using overlapping sequences of length N- 
1 or at an average distance of Ao, the N-l of a fragment Lf bases long is given by 

15 equation one: 

N sf = 1+A 0 X KXP (K, L f ) 

Where K greater than or = 2, and P (K, Lf) represents the probability of an N-mer 
occurring K-times on a fragment Lf base long. Also, a computer program that is able to 
form subfragments from the content of N-mers for any given sequence is described 
20 below. 

The number of subfragments increases with the increase of lengths of fragments 
for a given length of probe. Obtained subfragments may not be uniquely ordered among 
themselves. Although not complete, this information is very useful for comparative 
sequence analysis and the recognition of functional sequence characteristics. This type of 

25 information may be called partial sequence. Another way of obtaining partial sequence is 
the use of only a subset of oligonucleotide probes of a given length. 

There may be relatively good agreement between predicted sequence according to 
theory and a computer simulation for a random DNA sequence. For instance, for N-l = 
7, [using an 8-mer or groups of sixteen 10-mers of type 5' (A,T,C,G) B 8 (A,T,C,G) 3'] a 

30 target nucleic acid of 200 bases will have an average of three subfragments. However, 
because of the dispersion around the mean, a library of target nucleic acid should have 
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inserts of 500 bp so that less than 1 in 2000 targets have more than three subfragments. 
Thus, in an ideal case of sequence determination of a long nucleic acid of random 
sequence, a representative library with sufficiently short inserts of target nucleic acid may 
be used. For such inserts, it is possible to reconstruct the individual target by the method 
5 of the invention. The entire sequence of a large nucleic acid is then obtained by 
overlapping of the defined individual insert sequences. 

To reduce the need for very short fragments, e.g. 50 bases for 8-mer probes. The 
information contained in the overlapped fragments present in every random DNA 
fragmentation process like cloning, or random PCR is used. It is also possible to use 

10 pools of short physical nucleic acid fragments. Using 8-mers or 1 1-mers like 5' (A, T, C, 
G) Ng (A, T, C ,G) 3' for sequencing 1 megabase, instead of needing 20,000 50 bp 
fragments only 2,100 samples are sufficient. This number consists of 700 random 7 kb 
clones (basic library), 1250 pools of 20 clones of 500 bp (subfragments ordering library) 
and 150 clones from jumping (or similar) library. The developed algorithm (see Example 

15 18) regenerates sequence using hybridization data of these described samples. 

5.35 ALGORITHM 

This example describes an algorithm for generation of a long sequence written in 
a four letter alphabet from constituent k-tuple words in a minimal number of separate, 

20 randomly defined fragments of a starting nucleic acid sequence where K is the length of 
an oligonucleotide probe. The algorithm is primarily intended for use in the sequencing 
by hybridization (SBH) process. The algorithm is based on subfragments (SF), 
informative fragments (IF) and the possibility of using pools of physical nucleic 
sequences for defining informative fragments. 

25 As described, subfragments may be caused by branch points in the assembly 

process resulting from the repetition of a K-l oligomer sequence in a target nucleic acid. 
Subfragments are sequence fragments found between any two repetitive words of the 
length K-l that occur in a sequence. Multiple occurrences of K-l words are the cause of 
interruption of ordering the overlap of K-words in the process of sequence generation. 

30 Interruption leads to a sequence remaining in the form of subfragments. Thus, the 
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unambiguous segments between branching points whose order is not uniquely determined 
are called sequence subfragments. 

Informative fragments are defined as fragments of a sequence that are determined 
by the nearest ends of overlapped physical sequence fragments. 
5 A certain number of physical fragments may be pooled without losing the 

possibility of defining informative fragments. The total length of randomly pooled 
fragments depends on the length of k-tuples that are used in the sequencing process. 

The algorithm consists of two main units. The first part is used for generation of 
subfragments from the set of k-tuples contained in a sequence. Subfragments may be 

10 generated within the coding region of physical nucleic acid sequence of certain sizes, or 
within the informative fragments defined within long nucleic acid sequences. Both types 
of fragments are members of the basic library. This algorithm does not describe the 
determination of the content of the k-tuples of the informative fragments of the basic 
library, i.e. the step of preparation of informative fragments to be used in the sequence 

1 5 generation process. 

The second part of the algorithm determines the linear order of obtained 
subfragments with the purpose of regenerating the complete sequence of the nucleic acid 
fragments of the basic library. For this purpose a second, ordering library is used, made 
of randomly pooled fragments of the starting sequence. The algorithm does not include 

20 the step of combining sequences of basic fragments to regenerate an entire, megabase 
plus sequence. This may be accomplished using the link-up of fragments of the basic 
library which is a prerequisite for informative fragment generation. Alternatively, it may 
be accomplished after generation of sequences of fragments of the basic library by this 
algorithm, using search for their overlap, based on the presence of common end- 

25 sequences. 

The algorithm requires neither knowledge of the number of appearances of a 
given k-tuple in a nucleic acid sequence of the basic and ordering libraries, nor does it 
require the information of which k-tuple words are present on the ends of a fragment. 
The algorithm operates with the mixed content of k-tuples of various length. The concept 
30 of the algorithm enables operations with the k-tuple sets that contain false positive and 
false negative k-tuples. Only in specific cases does the content of the false k-tuples 
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primarily influence the completeness and correctness of the generated sequence. The 
algorithm may be used for optimization of parameters in simulation experiments, as well 
as for sequence generation in the actual SBH experiments e.g., generation of the genomic 
DNA sequence. In optimization of parameters, the choice of the oligonucleotide probes 
5 (k-tuples) for practical and convenient fragments and/or the choice of the optimal lengths 
and the number of fragments for the defined probes are especially important. 

This part of the algorithm has a central role in the process of the generation of the 
e from the content of k-tuples. It is based on the unique ordering of k-tuples by means of 
maximal overlap. The main obstacles in sequence generation are specific repeated 

10 sequences and false positive and/or negative k-tuples. The aim of this part of the 

algorithm is to obtain the minimal number of the longest possible subfragments, with 
correct sequence. This part of the algorithm consists of one basic, and several control 
steps. A two-stage process is necessary since certain information can be used only after 
generation of all primary subfragments. 

1 5 The main problem of sequence generation is obtaining a repeated sequence from 

word contents that by definition do not carry information on the number of occurrences 
of the particular k-tuples. The concept of the entire algorithm depends on the basis on 
which this problem is solved. In principle, there are two opposite approaches: 1) 
repeated sequences may be obtained at the beginning, in the process of generation of 

20 pSFs, or 2) repeated sequences can be obtained later, in the process of the final ordering 
of the subfragments. In the first case, pSFs contain an excess of sequences and in the 
second case, they contain a deficit of sequences. The first approach requires elimination 
of the excess sequences generated, and the second requires permitting multiple use of 
some of the subfragments in the process of the final assembling of the sequence. 

25 The difference in the two approaches in the degree of strictness of the rule of 

unique overlap of k-tuples. The less severe rule is: k-tuple X is unambiguously 
maximally overlapped with k-tuple Y if and only if, the rightmost k-1 end of k-tuple X is 
present only on the leftmost end of k-tuple Y. This rule allows the generation of 
repetitive sequences and the formation of surplus sequences. 

30 A stricter rule which is used in the second approach has an addition caveat: k- 

tuple X is unambiguously maximally overlapped with k-tuple Y if and only if, the 
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rightmost K-l end of k-tuple X is present only on the leftmost end of k-tuple Y and if the 
leftmost K-l end of k-tuple Y is not present on the rightmost end of any other k-tuple. 
The algorithm based on the stricter rule is simpler, and is described herein. 

The process of elongation of a given subfragment is stopped when the right k-l 
5 end of the last k-tuple included is not present on the left end of any k-tuple or is present 
on two or more k-tuples. If it is present on only one k-tuple the second part of the rule is 
tested. If in addition there is a k-tuple which differs from the previously included one, 
the assembly of the given subfragment is terminated only on the first leftmost position. If 
this additional k-tuple does not exist, the conditions are met for unique k-l overlap and a 

10 given subfragment is extended to the right by one element. 

Beside the basic rule, a supplementary one is used to allow the usage of k-tuples 
of different lengths. The maximal overlap is the length of k-l of the shorter k-tuple of the 
overlapping pair. Generation of the pSFs is performed starting from the first k-tuple from 
the file in which k-tuples are displayed randomly and independently from their order in a 

1 5 nucleic acid sequence. Thus, the first k-tuple in the file is not necessarily on the 

beginning of the sequence, nor on the start of the particular subfragment. The process of 
subfragment generation is performed by ordering the k-tuples by means of unique 
overlap, which is defined by the described rule. Each used k-tuple is erased from the file. 
At the point when there are no further k-tuples unambiguously overlapping with the last 

20 one included, the building of subfragment is terminated and the buildup of another pSF is 
started. Since generation of a majority of subfragments does not begin from their actual 
starts, the formed pSF are added to the k-tuple file and are considered as a longer k-tuple. 
Another possibility is to form subfragments going in both directions from the starting k- 
tuple. The process ends when further overlap, i.e. the extension of any of the 

25 subfragments, is not possible. 

The pSFs can be divided in three groups: 1) subfragments of the maximal length 
and correct sequence in cases of exact k-tuple set; 2) short subfragments, formed due to 
the used of the maximal and unambiguous overlap rule on the incomplete set, and/or the 
set with some false positive k-tuples; and 3) pSFs of an incorrect sequence. The 

30 incompleteness of the set in 2) is caused by false negative results of a hybridization 

experiment, as well as by using an incorrect set of k-tuples. These are formed due to the 
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false positive and false negative k-tuples and can be: a) misconnected subfragments; b) 
subfragments with the wrong end; and c) false positive k-tuples which appears as false 
minimal subfragments. 

Considering false positive k-tuples, there is the possibility for the presence of a k- 
5 tuple containing more than one wrong base or containing one wrong base somewhere in 
the middle, as well as the possibility for a k-tuple with a wrong base on the end. 
Generation of short, erroneous or misconnected subfragments is caused by the latter k- 
tuples. The k-tuples of the former two kinds represent wrong pSFs with length equal to 
k-tuple length. 

10 In the case of one false negative k-tuple, pSFs are generated because of the 

impossibility of maximal overlapping. In the case of the presence of one false positive k- 
tuple with the wrong base on its leftmost or rightmost end, pSFs are generated because of 
the impossibility of unambiguous overlapping. When both false positive and false 
negative k-tuples with a common k-1 sequence are present in the file, pSFs are generated, 

15 and one of these pSFs contains the wrong k-tuple at the relevant end. 

The process of correcting subfragments with errors in sequence and the linking of 
unambiguously connected pSF is performed after subfragment generation and in the 
process of subfragment ordering. The first step which consists of cutting the 
< , misconnected pSFs and obtaining the final subfragments by unambiguous connection of 

20 pSFs is described below. 

There are two approaches for the formation of misconnected subfragments. In the 
first a mistake occurs when an erroneous k-tuple appears on the points of assembly of the 
repeated sequences of lengths k-1 . In the second, the repeated sequences are shorter than 
k-1. These situations can occur in two variants each. In the first variant, one of the 

25 repeated sequences represents the end of a fragment. In the second variant, the repeated 
sequence occurs at any position within the fragment. For the first possibility, the absence 
of some k-tuples from the file (false negatives) is required to generate a misconnection. 
The second possibility requires the presence of both false negative and false positive k- 
tuples in the file. Considering the repetitions of k-1 sequence, the lack of only one k- 

30 tuple is sufficient when either end is repeated internally. The lack of two is needed for 
strictly internal repetition. The reason is that the end of a sequence can be considered 
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informatically as an endless linear array of false negative k-tuples. From the "smaller 
than k-1 case", only the repeated sequence of the length of k-2, which requires two or 
three specific erroneous k-tuples, will be considered. It is very likely that these will be 
the only cases which will be detected in a real experiment, the others being much less 
5 frequent. 

Recognition of the misconnected subfragments is more strictly defined when a 
repeated sequence does not appear at the end of the fragment. In this situation, one can 
detect further two subfragments, one of which contains on its leftmost, and the other on • 
its rightmost end k-2 sequences which are also present in the misconnected subfragment. 

10 When the repeated sequence is on the end of the fragment, there is only one subfragment 
which contains k-2 sequence causing the mistake in subfragment formation on its 
leftmost or rightmost end. 

The removal of misconnected subfragments by their cutting is performed 
according to the common rule: If the leftmost or rightmost sequence of the length of k-2 

15 of subfragments is present in any other subfragment, the subfragment is to be cut into 
subfragments, each of them containing k-2 sequence. This rule does not cover rarer 
situations of a repeated end when there are more than one false negative k-tuple on the 
point of repeated k-1 sequence. Misconnected subfragments of this kind can be 
recognized by using the information from the overlapped fragments, or informative 

20 fragments of basic and ordering libraries. In addition, the misconnected subfragment will 
remain when two or more false negative k-tuples occur on both positions which contain 
the identical k-1 sequence. This is a very rare situation since it requires at least 4 specific 
false k-tuples. An additional rule can be introduced to cut these subfragments on 
sequences of length k if the given sequence can be obtained by combination of sequences 

25 shorter than k-2 from the end of one subfragment and the start of another. 

By strict application of the described rule, some completeness is lost to ensure the 
accuracy of the output. Some of the subfragments will be cut although they are not 
misconnected since they fit into the pattern of a misconnected subfragment. There are 
several situations of this kind. For example, a fragment, beside at least two identical k-1 

30 sequences, contains any k-2 sequence from k-1 or a fragment contains k-2 sequence 
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repeated at least twice and at least one false negative k-tuple containing given k-2 
sequence in the middle, etc. 

The aim of this part of the algorithm is to reduce the number of pSFs to a minimal 
number of longer subfragments with correct sequence. The generation of unique longer 
5 subfragments or a complete sequence is possible in two situations. The first situation 
concerns the specific order of repeated k-1 words. There are cases in which some or all 
maximally extended pSFs (the first group of pSFs) can be uniquely ordered. For 
example, in fragment S-Rl-a-R2-b-Rl-c-R2-E where S and E are the start and end of a 
fragment, a, b, and c are different sequences specific to respective subfragments and Rl 

10 and R2 are two k-1 sequences that are tandemly repeated, five subfragments are 

generated (S-Rl, Rl-a-R2, R2-b-Rl, R1-C-R2, and R-E). They may be ordered in two 
ways; the original sequence above or S-Rl-c-R-b-Rl-a-R-E. In contrast, in a fragment 
with the same number and types of repeated sequences but ordered differently, i.e. S-Rl- 
a-Rl-b-R-c-R-E, there is no other sequence which includes all subfragments. Examples 

15 of this type can be recognized only after the process of generation of pSFs. They 
represent the necessity for two steps in the process of pSF generation. The second 
situation of generation of false short subfragments on positions of nonrepeated k-1 
sequences when the files contain false negative and/or positive k-tuples is more 
important. 

20 The solution for both pSF groups consists of two parts. First, the false positive k- 

tuples appearing as the nonexisting minimal subfragments are eliminated. All k-tuple 
subfragments of length k which do not have an overlap on either end, of the length of 
longer than k-a on one end and longer than k-b on the other end, are eliminated to enable 
formation of the maximal number of connections. In our experiments, the values for a 

25 and b of 2 and 3, respectively, appeared to be adequate to eliminate a sufficient number 
of false positive k-tuples. 

The merging of subfragments that can be uniquely connected is accomplished in 
the second step. The rule for connection is: two subfragments may be unambiguously 
connected if, and only if, the overlapping sequence at the relevant end or start of two 

30 subfragments is not present at the start and/or end of any other subfragment. 
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The exception is if one subfragment from the considered pair has the identical 
beginning and end. In that case connection is permitted, even if there is another 
subfragment with the same end present in the file. The main problem here is the precise 
definition of overlapping sequence. The connection is not permitted if the overlapping 
5 sequence unique for only one pair of subfragments is shorter than k-2, or it is k-2 or 

longer but an additional subfragment exists with the overlapping sequence of any length 
longer than k-4. Also, both the canonical ends of pSFs and the ends after omitting one 
(or few) last bases are considered as the overlapping sequences. 

After this step some false positive k-tuples (as minimal subfragments) and some 

10 subfragments with a wrong end may survive. In addition, in very rare occasions where a 
certain number of some specific false k-tuples are simultaneously present, an erroneous 
connection may take place. These cases will be detected and solved in the subfragment • 
ordering process, and in the additional control steps along with the handling of uncut 
"misconnected" subfragments. 

1 5 The short subfragments that are obtained are of two kinds. In the common case, 

these subfragments may be unambiguously connected among themselves because of the 
distribution of repeated k- 1 sequences. This may be done after the process of generation 
of pSFs and is a good example of the necessity for two steps in the process of pSF 
generation. In the case of using the file containing false positive and/or false negative k- 

20 tuples, short pSFs are obtained on the sites of nonrepeated k-1 sequences. Considering 
false positive k-tuples, a k-tuple may contain more than one wrong base (or containing 
one wrong base somewhere in the middle), as well as k-tuple on the end. Generation of 
short and erroneous (or misconnected) subfragments is caused by the latter k-tuples. The 
k-tuples of the former kind represent wrong pSFs with length equal to k- tuple length. 

25 The aim of merging pSF part of the algorithm is the reduction of the number of 

pSFs to the minimal number of longer subfragments with the correct sequence. All k- 
tuple subfragments that do not have an overlap on either end, of the length of longer than 
k-a on one, and longer than k-b on the other end, are eliminated to enable the maximal 
number of connections. In this way, the majority of false positive k-tuples are discarded. 

30 The rule for connection is: two subfragments can be unambiguously connected if, and 
only if the overlapping sequence of the relevant end or start of two subfragments is not 
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present on the start and/or end of any other subfragment. The exception is a subfragment 
with the identical beginning and end. In that case connection is permitted, provided that 
there is another subfragment with the same end present in the file. The main problem 
here is of precise definition of overlapping sequence. The presence of at least two 
5 specific false negative k-tuples on the points of repetition of k-1 or k-2 sequences, as well 
as combining of the false positive and false negative k-tuples may destroy or "mask" 
some overlapping sequences and can produce an unambiguous, but wrong connection of 
pSFs. To prevent this, completeness must be sacrificed on account of exactness: the 
connection is not permitted on the end-sequences shorter than k-2, and in the presence of 

10 an extra overlapping sequence longer than k-4. The overlapping sequences are defined 
from the end of the pSFs, or omitting one, or few last bases. 

In the very rare situations, with the presence of a certain number of some specific 
false positive and false negative k-tuples, some subfragments with the wrong end can 
survive, some false positive k-tuples (as minimal subfragments) can remain, or the 

15- erroneous connection can take place. These cases are detected and solved in the 
subfragments ordering process, and in the additional control steps along with the 
handling of uncut, misconnected subfragments. 

The process of ordering of subfragments is similar to the process of their 
generation. If one considers subfragments as longer k-tuples, ordering is performed by 

20 their unambiguous connection via overlapping ends. The informational basis for 

unambiguous connection is the division of subfragments generated in fragments of the 
basic library into groups representing segments of those fragments. The method is 
analogous to the biochemical solution of this problem based on hybridization with longer 
oligonucleotides with relevant connecting sequence. The connecting sequences are 

25 generated as subfragments using the k-tuple sets of the appropriate segments of basic 

library fragments. Relevant segments are defined by the fragments of the ordering library 
that overlap with the respective fragments of the basic library. The shortest segments are 
informative fragments of the ordering library. The longer ones are several neighboring 
informative fragments or total overlapping portions of fragments corresponding of the 

30 ordering and basic libraries. In order to decrease the number of separate samples, 
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fragments of the ordering library are randomly pooled, and the unique k-tuple content is 
determined. 

By using the large number of fragments in the ordering library very short 
segments are generated, thus reducing the chance of the multiple appearance of the k-1 
5 sequences which are the reasons for generation of the subfragments. Furthermore, longer 
segments, consisting of the various regions of the given fragment of the basic library, do 
not contain some of the repeated k-1 sequences. In every segment a connecting sequence 
(a connecting subfragment) is generated for a certain pair of the subfragments from the 
given fragment. The process of ordering consists of three steps: (I) generation of the k- 

10 tuple contents of each segment; (2) generation of subfragments in each segment; and (3) 
connection of the subfragments of the segments. Primary segments are defined as 
significant intersections and differences of k-tuple contents of a given fragment of the 
basic library with the k-tuple contents of the pools of the ordering library. Secondary 
(shorter) segments are defined as intersections and differences of the k-tuple contents of 

1 5 the primary segments. 

There is a problem of accumulating both false positive and negative k-tuples in 
both the differences and intersections. The false negative k-tuples from starting 
sequences accumulate in the intersections (overlapping parts), as well as false positive k- 
tuples occurring randomly in both sequences, but not in the relevant overlapping region. 

20 On the other hand, the majority of false positives from either of the starting sequences is 
not taken up into intersections. This is an example of the reduction of experimental 
errors from individual fragments by using information from fragments overlapping with 
them. The false k-tuples accumulate in the differences for another reason. The set of 
false negatives from the original sequences are enlarged for false positives from 

25 intersections and the set of false positives for those k-tuples which are not included in the 
intersection by error, i.e. are false negative in the intersection. If the starting sequences 
contain 10% false negative data, the primary and secondary intersections will contain 
19% and 28% false negative k-tuples. respectively. On the other hand, a mathematical 
expectation of 77 false positives may be predicted if the basic fragment and the pools 

30 have lengths of 500 bp and 10,000 bp, respectively. However, there is a possibility of 
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recovering most of the "lost" k-tuples and of eliminating most of the false positive k- 
tuples. 

First, one has to determine a basic content of the k-tuples for a given segment as 
the intersection of a given pair of the k-tuple contents. This is followed by including all 
5 k-tuples of the starting k-tuple contents in the intersection, which contain at one end k-1 
and at the other end k-+ sequences which occur at the ends of two k-tuples of the basic 
set. This is done before generation of the differences thus preventing the accumulation of 
false positives in that process. Following that, the same type of enlargement of k-tuple 
set is applied to differences with the distinction that the borrowing is from the 
10 intersections. All borrowed k-tuples are eliminated from the intersection files as false 
positives. 

The intersection, i.e. a set of common k-tuples, is defined for each pair (a basic 
fragment) x (a pool of ordering library). If the number of k-tuples in the set is significant 
it is enlarged with the false negatives according to the described rule. The primary 

1 5 difference set is obtained by subtracting from a given basic fragment the obtained 
intersection set. The false negative k-tuples are appended to the difference set by 
borrowing from the intersection set according to the described rule and, at the same time, 
removed from the intersection set as false positive k-tuples. When the basic fragment is 
longer than the pooled fragments, this difference can represent the two separate segments 

20 which somewhat reduces its utility in further steps. The primary segments are all 

generated intersections and differences of pairs (a basic fragment) x (a pool of ordering 
library) containing the significant number of k-tuples. K-tuple sets of secondary 
segments are obtained by comparison of k-tuple sets of all possible pairs of primary 
segments. The two differences are defined from each pair which produces the 

25 intersection with the significant number of k-tuples. The majority of available 

information from overlapped fragments is recovered in this step so that there is little to be 
gained from the third round of forming intersections, and differences. 

(2) Generation of the subfragments of the segments is performed identically 
as described for the fragments of the basic library. 

30 (3) The method of connection of subfragments consists of sequentially 

determining the correctly linked pairs of subfragments among the subfragments from a 
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given basic library fragment which have some overlapped ends. In the case of 4 relevant 
subfragments, two of which contain the same beginning and two having the same end, 
there are 4 different pairs of subfragments that can be connected. In general 2 are correct 
and 2 are wrong. To find correct ones, the presence of the connecting sequences of each 
5 pair is tested in the subfragments generated from all primary and secondary segments for 
a given basic fragment. The length and the position of the connecting sequence are 
chosen to avoid interference with sequences which occur by chance. They are k+2 or 
longer, and include at least one element 2 beside overlapping sequence in both 
subfragments of a given pair. The connection is permitted only if the two connecting 

10 sequences are found and the remaining two do not exist. The two linked subfragments 
replace former subfragments in the file and the process is cyclically repeated. 
Repeated sequences are generated in this step. This means that some subfragments are 
included in linked subfragments more than once. They will be recognized by finding the 
relevant connecting sequence which engages one subfragment in connection with two 

1 5 different subfragments. 

The recognition of misconnected subfragments generated in the processes of 
building pSFs and merging pSFs into longer subfragments is based on testing whether the 
sequences of subfragments from a given basic fragment exist in the sequences of 
subfragments generated in the segments for the fragment. The sequences from an 

20 incorrectly connected position will not be found indicating the misconnected 
subfragments. 

Beside the described three steps in ordering of subfragments some additional 
control steps or steps applicable to specific sequences will be necessary for the generation 
of more complete sequence without mistakes. 

25 The determination of which subfragment belongs to which segment is performed 

by comparison of contents of k-tuples in segments and subfragments. Because of the 
errors in the k-tuple contents (due to the primary error in pools and statistical errors due 
to the frequency of occurrences of k-tuples) the exact partitioning of subfragments is 
impossible. Thus, instead of "all or none" partition, the chance of coming from the given 

30 segment (P(sf,s)) is determined for each subfragment. This possibility is the function of 
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the lengths of k-tuples, the lengths of subfragments, the lengths of fragments of ordering 
library, the size of the pool, and of the percentage of false k-tuples in the file: 
P(sf,s)=(Ck-F)/Lsf, 

where Lsf is the length of subfragment, Ck is the number of common k-tuples for 
5 a given subfragment/segment pair, and F is the parameter that includes relations between 
lengths of k-tuples, fragments of basic library, the size of the pool, and the error 
percentage. 

Subfragments attributed to a particular segment are treated as redundant short 
pSFs and are submitted to a process of unambiguous connection. The definition of 

10 unambiguous connection is slightly different in this case, since it is based on a probability 
that subfragments with overlapping end(s) belong to the segment considered. Besides, the 
accuracy of unambiguous connection is controlled by following the connection of these 
subfragments in other segments. After the connection in different segments, all of the 
obtained subfragments are merged together, shorter subfragments included within longer 

1 5 ones are eliminated, and the remaining ones are submitted to the ordinary connecting 
process. If the sequence is not regenerated completely, the process of partition and 
connection of subfragments is repeated with the same or less severe criterions of 
probability of belonging to the particular segment, followed by unambiguous connection. 
Using severe criteria for defining unambiguous overlap, some information is not 

20 used. Instead of a complete sequence, several subfragments that define a number of 

possibilities for a given fragment are obtained. Using less severe criteria an accurate and 
complete sequence is generated. In a certain number of situations, e.g., an erroneous 
connection, it is possible to generate a complete, but an incorrect sequence, or to generate 
"monster" subfragments with no connection among them . Thus, for each fragment of the 

25 basic library one obtains: a) several possible solutions where one is correct and b) the 

most probable correct solution. Also, in a very small number of cases, due to the mistake 
in the subfragment generation process or due to the specific ratio of the probabilities of 
belonging, no unambiguous solution is generated or one, the most probable solution. 
These cases remain as incomplete sequences, or the unambiguous solution is obtained by 

30 comparing these data with other, overlapped fragments of basic library. 
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The described algorithm was tested on a randomly generated, 50 kb sequence, 
containing 40% GC to simulate the GC content of the human genome. In the middle part 
of this sequence were inserted various All, and some other repetitive sequences, of a total 
length of about 4 kb. To simulate an in vitro SBH experiment, the following operations 
5 were performed to prepare appropriate data. 

1) Positions of sixty 5 kb overlapping "clones" were randomly defined, to 
simulate preparation of a basic library: 

2) Positions of one thousand 500 bp "clones" were randomly determined to 
simulate making the ordering library. These fragments were extracted from the sequence. 

10 Random pools of 20 fragments were made, and k-tuple sets of pools were determined and 
stored on the hard disk. These data are 1 used in the subfragment ordering phase: For the • 
, same density of clones 4 million clones in basic library and 3 million clones in ordering 
library are used for the entire human genome. The total number of 7 million clones is 
several fold smaller than the number of clones a few kb long for random cloning of 

15 almost all of genomic DNA and sequencing by a gel-based method. 

From the data on the starts and ends of 5 kb fragments, 117 "informative 
fragments" were determined to be in the sequence. This was followed by determination 
of sets of overlapping k-tuples of which the single "informative fragment" consist. Only 
the subset of k-tuples matching a predetermined list were used. The list contained 65% 8- 

20 mers, 30% 9-mers, and 5% 10-1 2-mers. Processes of generation and the ordering of 
subfragments were performed on these data. 

The testing of the algorithm was performed on the simulated data in two 
experiments. The sequence of 50 informative fragments was regenerated with the 100% 
correct data set (over 20,000 bp), and 26 informative fragments (about 10,000 bp) with 

25 10% false k-tuples (5% positive and 5% negative ones). 

In the first experiment, all subfragments were correct and in only one out of 50 
informative fragments the sequence was not completely regenerated but remained in the 
form of 5 subfragments. The analysis of positions of overlapped fragments of ordering 
library has shown that they lack the information for the unique ordering of the 5 

30 subfragments. The subfragments may be connected in two ways based on overlapping 
ends, 1-2-3-4-5 and 1-4-3-2-5. The only difference is the exchange of positions of 
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subfragments 2 and 4. Since subfragments 2, 3, and 4 are relatively short (total of about 
100 bp), the relatively greater chance existed, and occurred in this case, that none of the 
fragments of ordering library started or ended in the subfragment 3 region. 

To simulate real sequencing, some false ("hybridization") data was included as 
5 input in a number of experiments. In oligomer hybridization experiments, under proposed 
conditions, the only situation producing unreliable data is the end mismatch versus full 
match hybridization. Therefore, in simulation only those k-tuples differing in a single 
element on either end from the real one were considered to be false positives. These 
"false" sets are made as follows. On the original set of a k-tuples of the informative 

10 fragment, a subset of 5% false positive k-tuples are added. False positive k-tuples are 

made by randomly picking a k-tuple from the set, copying it and altering a nucleotide on 
its beginning or end. This is followed by subtraction of a subset of 5% randomly chosen 
k-tuples. In this way the statistically expected number of the most complicated cases is 
generated in which the correct k-tuple is replaced with a k-tuple with the wrong base on 

1 5 the end. 

Production of k-tuple sets as described leads to up to 10% of false data. This value 
varies from case to case, due to the randomness of choice of k-tuples to be copied, 
altered, and erased. Nevertheless, this percentage 3-4 times exceeds the amount of 
unreliable data in real hybridization experiments. The introduced error of 10% leads to 

20 the two fold increase in the number of subfragments both in fragments of basic library 
(basic library informative fragments) and in segments. About 1 0% of the final 
subfragments have a wrong base at the end as expected for the k-tuple set which contains 
false positives (see generation of primary subfragments). Neither the cases of 
misconnection of subfragments nor subfragments with the wrong sequence were 

25 observed. In 4 informative fragments out of 26 examined in the ordering process the 

complete sequence was not regenerated. In all 4 cases the sequence was obtained in the 
form of several longer subfragments and several shorter subfragments contained in the 
same segment. This result shows that the algorithmic principles allow working with a 
large percentage of false data. 

30 The success of the generation of the sequence from its k-tuple content may be 

described in terms of completeness and accuracy. In the process of generation, two 
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particular situations can be defined: 1) Some part of the information is missing in the 
generated sequence, but one knows where the ambiguities are and to which type they 
belong, and 2) the regenerated sequence that is obtained does not match the sequence 
from which the k- tuple content is generated, but the mistake can not be detected. 
5 Assuming the algorithm is developed to its theoretical limits, as in the use of the exact k- 
tuple sets, only the first situation can take place. There the incompleteness results in a 
certain number of subfragments that may not be ordered unambiguously and the problem 
of determination of the exact length of monotonous sequences, i.e. the number of perfect 
tandem repeats. 

10 . With false k-tuples, incorrect sequences may be generated. The reason for 

mistakes does not lie in the shortcomings of the algorithm, but in the fact that a given 
content of k-tuples unambiguously represents the sequence that differs from the original 
one. One may define three classes of error, depending on the kind of the false k- tuples 
present in the file. False negative k-tuples (which are not accompanied with the false. 

1 5 positives) produce "deletions". False positive k-tuples are producing "elongations 

(unequal crossing over)". False positives accompanied with false negatives are the reason 
for generation of "insertions", alone or combined with "deletions". The deletions are 
produced when all of the k-tuples (or their majority) between two possible starts of the 
subfragments are false negatives. Since every position in the sequence is defined by k- 

20 tuples, the occurrence of the deletions in a common case requires k consecutive false 

negatives. (With 10% of the false negatives and k=8, this situation takes place after every 
108 elements). This situation is extremely infrequent even in mammalian genome 
sequencing using random libraries containing ten genome equivalents. 

Elongation of the end of the sequence caused by false positive k-tuples is the 

25 special case of "insertions", since the end of the sequence can be considered as the 

endless linear array of false negative k-tuples. One may consider a group of false positive 
k-tuples producing subfragments longer than one k-tuple. Situations of this kind may be 
detected if subfragments are generated in overlapped fragments, like random physical 
fragments of the ordering library. An insertion, or insertion in place of a deletion, can 

30 arise as a result of specific combinations of false positive and false negative k-tuples. In 
the first case, the number of consecutive false negatives is smaller than k. Both cases 


172 


JL O O B ■+ 6 ■+ 3 « O 2! 2 6« O t 

require several overlapping false positive k-tuples. The insertions and deletions are 
mostly theoretical possibilities without sizable practical repercussions since the 
requirements in the number and specificity of false k-tuples are simply too high. 

In every other situation of not meeting the theoretical requirement of the minimal 
5 number an the kind of the false positive and/or negatives, mistakes in the k-tuples content 
may produce only the lesser completeness of a generated sequence. 

In SBH, a sample nucleic acid is sequenced by exposing the sample to a support- . 
bound probe of known sequence and a labeled probe or probes in solution. Wherever the 
probes ligase is introduced into the mixture of probes and sample, such that, wherever a 

10 support has a bound probe and a labeled probe hybridized back to back along the sample, 
the two probes will be chemically linked by the action of the ligase. After washing, only 
chemically linked support-bound and labeled probes are detected by the presence of the 
labeled probe. By knowing the identity of the support-bound probe at a particular location 
in an array, and the identity of the labeled probe, a portion of the sequence of the sample 

1 5 may be determined by the presence of a label at a point in an array on a Format with a 
sample of three substrates Not all of the sample to be sequenced may be a nucleic acid 
fragment or oligonucleotide of ten base pairs ("bp"). The sample is preferably four to one 
thousand bases in length. 

The length of the probe is a fragment less than ten bases in length, and, 

20 preferably, is between four and nine bases in length. In this way, arrays of support-bound 
probes may include all oligonucleotides of a given length or may include only 
oligonucleotides selected for a particular test. Where all oligonucleotides of a given 
length are used, the number of central oligonucleotides may be calculated by 4 N where N 
is the length of the probe. 

25 

536 RE-USING SEQUENCING CHIPS 

When ligation is employed in the sequencing process, then the ordinary 
oligonucleotide chip cannot be immediately reused. The inventors contemplate that this 
may be overcome in various ways. 
30 One may employ ribonucleotides for the second probe, probe P, so that this probe 

may subsequently be removed by RNAse treatment. RNAse treatment may utilize RNAse 
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A, an endoribonuclease that specifically attacks single-stranded RNA 3'-to pyrimidine 
residues and cleaves the phosphate linkage to the adjacent nucleotide. The end products 
are pyrimidine 3' phosphates and oligonucleotides with terminal pyrimidine 3* 
phosphates. RNAse A works in the absence of cofactors and divalent cations. 
5 To utilize an RNAse, one would generally incubate the chip in any appropriate 

RNAse-containing buffer, as described by Sambrook et al. (1989; incorporated herein by ■ 
reference). The use of 30-50 \x\ of RNAse-containing buffer per 8x8 mm or 9 x 9mm 
array at 37°C for between 10 and 60 minutes is appropriate. One would then wash with 
hybridization buffer. 

10 Although not widely applicable, one could also use the uracil base, as described 

by Craig et al. (1989), incorporated herein by reference, in specific embodiments. 
Destruction of the ligated probe combination, to yield a re-usable chip, would be 
achieved by digestion with the E. coli repair enzyme, uracil-DNA glycosylase which 
removes uracil from DNA. 

15 One could also generate a specifically cleavable bond between the probes and 

then cleave the bond after detection. For example, this may be achieved by chemical 
ligation as described by Shabarova et aL 9 (1991) and Dolmnnaya et at, (1988), both 
references being specifically incorporated herein by reference. 

Shabarova et al. (1991) describe the condensation of oligodeoxyribonucleotides 

20 with cyanogen bromide as a condensing agent. In their one step chemical ligation 

reaction, the oligonucleotides are heated to 97°C, slowly cooled to 0°C, then 1 |il lOmM 
BrCN in acetonitrile is added. 

Dolmnnaya et al. (1988) show how to incorporate phosphoramidite and 
pyrophosphate internucleotide bonds in DNA duplexes. They also use a chemical ligation 

25 method for modification of the sugar phosphate backbone of DNA, with a water-soluble 
carbodiimide (CDI) as a coupling agent. The selective cleavage of a phosphoamide bond 
involves contact with 15% CH3COOH for 5 mm at 95°C. The selective cleavage of a 
pyrophosphate bond involves contact with a pyridine- water mixture (9:1) and freshly 
distilled (CF 3 CO) 2 0. 
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5.37 DIAGNOSTICS - SCORING KNOWN MUTATIONS OR FULL 
GENE RESEQUENCING 

In a simple case, the goal may be to discover whether selected, known mutations 
5 occur in a DNA segment. Less than 12 probes may suffice for this purpose, for example, 
5 probes positive for one allele, 5 positive for the other, and 2 negative for both. Because 
of the small number of probes to be scored per sample, large numbers of samples may be 
analyzed in parallel. For example, with 12 probes in 3 hybridization cycles, 96 different 
genomic loci or gene segments from 64 patient may be analyzed on one 6 x 9 in 

10 membrane containing 12 x 24 subarrays each with 64 dots representing the same DNA 

segment from 64 patients. In this example, samples may be prepared in sixty-four 96-well 
plates. Each plate may represent one patient, and each well may represent one of the 
DNA segments to be analyzed. The samples from 64 plates may be spotted in four 
replicas as four quarters of the same membrane. 

15 A set of 12 probes may be selected by single channel pipetting or by a single pin 

transferring device (or by an array of individually-controlled pipets or pins) for each of 
the 96 segments, and the selected probes may be arrayed in twelve 96-well plates. Probes 
may be labelled, if they are not prelabelled, and then probes from four plates may be 
mixed with hybridization buffer and added to the subarrays preferentially by a 96-channel 

20 pipeting device. After one hybridization cycle it is possible to strip off previously- 
applied probes by incubating the membrane at 37° to 55°C in the preferably undiluted 1 
hybridization or washing buffer. 

The likelihood that probes positive for one allele are positive and probes positive 
for the other allele are negative may be used to determine which of the two alleles is 

25 present. In this redundant scoring scheme, some level (about 10%) of errors in 
hybridization of each probe may be tolerated. 

An incomplete set of probes may be used for scoring most of the alleles, 
especially if the smaller redundancy is sufficient, e.g., one or two probes which prove the 
presence or absence in a sample of one of the two alleles. For example, with a set of four 

30 thousand 8-mers there is a 91% chance of finding at least one positive probe for one of 
the two alleles for a randomly selected locus. The incomplete set of probes may be 
optimized to reflect G+C content and other biases in the analyzed samples. 
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For full gene sequencing, genes may be amplified in an appropriate number of 
segments. For each segment, a set of probes (about one probe per 2-4 bases) may be 
selected and hybridized. These probes may identify whether there is a mutation anywhere 
in the analyzed segments. Segments (i.e., subarrays which contain these segments) where 
5 one or more mutated sites are detected may be hybridized with additional probes to find 
the exact sequence at the mutated sites. If a DNA sample is tested by every second 6-mer, 
and a mutation is localized at the position that is surrounded by positively hybridized 
probes TGCAAA and TATTCC and covered by three negative probes: CAAAAC, 
AAACTA and ACT ATT, the mutated nucleotides must be A and/or C occurring in the 

10 normal sequence at that position. They may be changed by a single base mutation, or by a 
one or two nucleotide deletion and/or insertion between bases AA, AC or CT. 

One approach is to select a probe that extends the positively hybridized probe 
TGCAAA for one nucleotide to the right, and which extends the probe TATTCC one 
nucleotide to the left, With these 8 probes (GCAAAA, GCAAAT, GCAAAC, GCAAAG 

1 5 and ATATTC, TTATTC, CTATTC, GTATTC) two questionable nucleotides are 
determined. 

The most likely hypothesis about the mutation may be determined. For example, 
A is found to be mutated to G. There are two solutions satisfied by these results. Either 
replacement of A with G is the only change or there is in addition to that change an 

20 insertion of some number of bases between newly determined G and the following C. If 
the result with bridging probes is negative these options may then be checked first by at 
least one bridging probe comprising the mutated position (AAGCTA) and with an 
additional 8 probes: CAAAGA, CAAAGT, CAAAGC, CAAAGG and ACTATT, 
TCTATT, CCTATT, GCTATT, There are many other ways to select mutation-solving 

25 probes. 

In the case of diploid, particular comparisons of scores for the test samples and 

homozygotic control may be performed to identify heterozygotes (see above). A few 

consecutive probes are expected to have roughly twice smaller signals if the segment 

covered by these probes is mutated on one of the two chromosomes. 

30 5.38 IDENTIFICATION OF GENES (MUTATIONS) RESPONSIBLE FOR 

GENETICDISORDERS AND OTHER TRAITS 
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Using universal sets of longer probes (8-mers or 9-mers) on immobilized arrays of 
samples, DNA fragments as long as 5-20 kb may be sequenced without subcloning. 
Furthermore, the speed of sequencing readily may be about 10 million 
bp/day/hybridization instrument. This performance allows for resequencing a large 
5 fraction of human genes or the human genome repeatedly from scientifically or medically 
interesting individuals. To resequence 50% of the human genes, about 100 million bp is 
checked. That may be done in a relatively short period of time at an affordable cost. 

This enormous resequencing capability may be used in several ways to identify 
mutations and/or genes that encode for disorders or any other traits. Basically, mRNAs 

1 0 (which may be converted into cDNAs) from particular tissues or genomic DNA of 

patients with particular disorders may be used as starting materials. From both sources of 
DNA, separate genes or genomic fragments of appropriate length may be prepared either 
by cloning procedures or by in vitro amplification procedures (for example by PCR). If 
cloning is used, the minimal set of clones to be analyzed may be selected from the 

1 5 libraries before sequencing. That may be done efficiently by hybridization of a small 

number of probes, especially if a small number of clones longer than 5 kb is to be sorted. 
Cloning may increase the amount of hybridization data about two times, but does not 
require tens of thousands of PCR primers. 

In one variant of the procedure, gene or genomic fragments may be prepared by 

20 restriction cutting with enzymes like Hga I which cuts DNA in following way: 

GACGC(N5')/CTGCG(N10'). Protruding ends of five bases are different for different 
fragments. One enzyme produces appropriate fragments for a certain number of genes. 
By cutting cDNA or genomic DNA with several enzymes in separate reactions, every 
gene of interest may be excised appropriately. In one approach, the cut DNA is 

25 fractionated by size. DNA fragments prepared in this way (and optionally treated with 
Exonuclease III which individually removes nucleotides from the 3' end and increases 
length and specificity of the ends) may be dispensed in the tubes or in multiwell plates. 
From a relatively small set of DNA adapters with a common portion and a variable 
protruding end of appropriate length, a pair of adapters may be selected for every gene 

30 fragment that needs to be amplified. These adapters are ligated and then PCR is 

performed by universal primers. From 1000 adapters, a million pairs may be generated, 
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thus a million different fragments may be specifically amplified in the identical 
conditions with a universal pair of primers complementary to the common end of the 
adapters. 

If a DNA difference is found to be repeated in several patients, and that sequence 
5 change is nonsense or can change function of the corresponding protein, then the mutated 
gene may be responsible for the disorder. By analyzing a significant number of 
individuals with particular traits, functional allelic variations of particular genes could be 
associated by specific traits. 

This approach may be used to eliminate the need for very expensive genetic 
10 mapping on extensive pedigrees and has special value when there is no such genetic data 
or material. 

5.39 SCORING SINGLE NUCLEOTIDE POLYMORPHISMS IN 
GENETIC MAPPING 

Techniques disclosed in this application are appropriate for an efficient 

15 identification of genomic fragments with single nucleotide polymorphisms (SNUPs). In 
10 individuals by applying the described sequencing process on a large number of 
genomic fragments of known sequence that may be amplified by cloning or by in vitro 
amplification, a sufficient number of DNA segments with SNUPs may be identified. The 
polymorphic fragments are further used as SNUP markers. These markers are either 

20 mapped previously (for example they represent mapped STSs) or they may be mapped 
through the screening procedure described below. 

SNUPs may be scored in every individual from relevant families or populations 
by amplifying markers and arraying them in the form of the array of subarrays. Subarrays 
contain the same marker amplified from the analyzed individuals. For each marker, as in 

25 the diagnostics of known mutations, a set of 6 or less probes positive for one allele and 6 
or less probes positive for the other allele may be selected and scored. From the 
significant association of one or a group of the markers with the disorder, chromosomal 
position of the responsible gene(s) may be determined. Because of the high throughput 
and low cost, thousands of markers may be scored for thousands of individuals. This 

30 amount of data allows localization of a gene at a resolution level of less than one million 
bp as well as localization of genes involved in polygenic diseases. Localized genes may 
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be identified by sequencing particular regions from relevant normal and affected 
individuals to score a mutation(s). 

PCR is preferred for amplification of markers from genomic DNA. Each of the 
markers require a specific pair of primers. The existing markers may be convertible or 
5 new markers may be defined which may be prepared by cutting genomic DNA by Hga I 
type restriction enzymes, and by ligation with a pair of adapters. 

SNUP markers can be amplified or spotted as pools to reduce the number of 
independent amplification reactions. In this case, more probes are scored per one sample. 
When 4 markers are pooled and spotted on 12 replica membranes, then 48 probes (12 per 
10 marker) may be scored in 4 cycles. 

5.40 DETECTION AND VERIFICATION OF IDENTITY OF DNA 
FRAGMENTS 

DNA fragments generated by restriction cutting, cloning or in vitro amplification 

15 (e.g. PCR) frequently may be identified in a experiment. Identification may be performed 
by verifying the presence of a DNA band of specific size on gel electrophoresis. 

Alternatively, a specific oligonucleotide may be prepared and used to verify a 
DNA sample in question by hybridization. The procedure developed here allows for more 
efficient identification of a large number of samples without preparing a specific 

20 oligonucleotide for each fragment. A set of positive and negative probes may be selected 
from the universal set for each fragment on the basis of the known sequences. Probes that 
are selected to be positive usually are able to form one or a few overlapping groups and 
negative probes are spread over the whole insert. 

This technology may be used for identification of STSs in the process of their 

25 mapping on the YAC clones. Each of the STSs may be tested on about 100 YAC clones 
or pools of YAC clones. DNAs from these 100 reactions possibly are spotted in one 
subarray. Different STSs may represent consecutive subarrays. In several hybridization 
cycles, a signature may be generated for each of the DNA samples, which signature 
proves or disproves existence of the particular STS in the given YAC clone with 

30 necessary confidence. 
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To reduce the number of independent PCR reactions or the number of 
independent samples for spotting, several STSs may be amplified simultaneously in a 
reaction or PCR samples may be mixed, respectively. In this case more probes have to be 
scored per one dot. The pooling of STSs is independent of pooling YACs and may be 
5 used on single YACs or pools of YACs. This scheme is especially attractive when several 
probes labelled with different colors are hybridized together. 

In addition to confirmation of the existence of a DNA fragment in a sample, the 
amount of DNA may be estimated using intensities of the hybridization of several 
separate probes or one or more pools of probes. By comparing obtained intensities with 

10 intensities for control samples having a known amount of DNA, the quantity of DNA in 
all spotted samples is determined simultaneously. Because only a few probes are 
necessary for identification of a DNA fragment, and there are N possible probes that may 
be used for DNA N bases long, this application does not require a large set of probes to 
be sufficient for identification of any DNA segment. From one thousand 8-mers, on 

15 average about 30 full matching probes may be selected for a 1000 bp fragment. 

5.41 IDENTIFICATION OF INFECTIOUS DISEASE ORGANISMS AND 
THEIR VARIANTS 

DNA-based tests for the detection of viral, bacterial, fungal and other parasitic 
20 organisms in patients are usually more reliable and less expensive than alternatives. The 
major advantage of DNA tests is to be able to identify specific strains and mutants, and 
eventually be able to apply more effective treatment. Two applications are described 
below. 

The presence of 12 known antibiotic resistance genes in bacterial infections may 
25 be tested by amplifying these genes. The amplified products from 128 patients may be 
spotted in two subarrays and 24 subarrays for 12 genes may then be repeated four times 
on a 8 x 12 cm membrane. For each gene, 12 probes may be selected for positive and 
negative scoring. Hybridizations may be performed in 3 cycles. For these tests, a much 
smaller set of probes is most likely to be universal. For example, from a set of one 
30 thousand 8-mers, on average 30 probes are positive in 1000 bp fragments, and 10 positive 
probes are usually sufficient for a highly reliable identification. As described in Section 
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5.22, several genes may be amplified and/or spotted together and the amount of the given 
DNA may be determined. The amount of amplified gene may be used as an indicator of 
the level of infection. 

Another example involves possible sequencing of one gene or the whole genome 
5 of an HIV virus. Because of rapid diversification, the virus poses many difficulties for 
selection of an optimal therapy. DNA fragments may be amplified from isolated viruses 
from up to 64 patients and resequenced by the described procedure. On the basis of the 
obtained sequence the optimal therapy may be selected. If there is a mixture of two virus 
types of which one has the basic sequence (similar to the case of heterozygotes), the 
10 mutant may be identified by quantitative comparisons of its hybridization scores with 
scores of other samples, especially control samples containing the basic virus type only. 
Scores twice as small may be obtained for three to four probes that cover the site mutated 
in one of the two virus types present in the sample (see above). 

1 5 5.42 FORENSIC AND PARENTAL IDENTIFICATION 

Sequence polymorphisms make an individual genomic DNA unique. This permits 
analysis of blood or other body fluids or tissues from a crime scene and comparison with 
samples from criminal suspects. A sufficient number of polymorphic sites are scored to 
produce a unique signature of a sample. SBH may easily score single nucleotide 

20 polymorphisms to produce such signatures. 

A set of DNA fragments (10-1000 mer) may be amplified from samples and 
suspects. DNAs from samples and suspects representing one fragment are spotted in one 
or several subarrays and each subarray may be replicated 4 times. In three cycles, 12 
probes may determine the presence of allele A or B in each of the samples, including 

25 suspects, for each DNA locus. Matching the patterns of samples and suspects may lead to 
discovery of the suspect responsible for the crime. 

The same procedure may be applicable to prove or disprove the identity of parents 
of a child. DNA may be prepared and polymorphic loci amplified from the child and 
adults; patterns of A or B alleles may be determined by hybridization for each. 

30 Comparisons of the obtained patterns, along with positive and negative controls, aide in 
the determination of familial relationships. In this case, only a significant portion of the 
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alleles need match with one parent for identification. Large numbers of scored loci allow 
for the avoidance of statistical errors in the procedure or of masking effects of de novo 
mutations. 

5.43 ASSESSING GENETIC DIVERSITY OF POPULATIONS OR 
5 SPECIES AND BIOLOGICAL DIVERSITY OF ECOLOGICAL NICHES 

Measuring the frequency of allelic variations on a significant umber of loci (for 
example, several genes or entire mitochondrial DNA) permits development of different 
types of conclusions, such as conclusions regarding the impact of the environment on the 
genotypes, history and evolution of a population or its susceptibility to diseases or 

10 extinction, and others. These assessments may be performed by testing specific known 
alleles or by full resequencing of some loci to be able to define de novo mutations which 
may reveal fine variations or presence of mutagens in the environment. 

Additionally, biodiversity in the microbial world may be surveyed by 
resequencing evolutionary conserved DNA sequences, such as the genes for ribosomal 

1 5 RNAs or genes for highly conservative proteins. DNA may be prepared from the 

environment and particular genes amplified using primers corresponding to conservative 
sequences. DNA fragments may be cloned preferentially in a plasmid vector (or diluted 
to the level of one molecule per well in multiwell plates and than amplified in vitro). 
Clones prepared this way may be resequenced as described above. Two types of 

20 information are obtained. First of all, a catalogue of different species may be defined as 
well as the density of the individuals for each species. Another segment of information 
may be used to measure the influence of ecological factors or pollution on the ecosystem. 
It may reveal whether some species are eradicated or whether the abundance ratios 
among species is altered due to the pollution. The method also is applicable for 

25 sequencing DNA from fossils. 


5.44 DETECTION OR QUANTIFICATION OF NUCLEIC ACID SPECIES 

DNA or RNA species may be detected and quantified by employing a probe pair 
including an unlabeled probe fixed to a substrate and a labeled probe in a solution. The 
30 species may be detected and quantified by exposure to the unlabeled probe in the 

presence of the labeled probe and ligase. Specifically, the formation of an extended probe 
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by ligation of the labeled and unlabeled probe on the sample nucleic acid backbone is 
indicative of the presence of the species to be detected. Thus, the presence of label at a 
specific point in the array on the substrate after removing unligated labeled probe 
indicates the presence of a sample species while the quantity of label indicates the 
5 expression level of the species. 

Alternatively, one or more unlabeled probes may be arrayed on a substrate as first 
members of pairs with one or more labeled probes to be introduced in solution. 
According to one method, multiplexing of the label on the array may be carried out by 
using dyes which fluoresce at distinguishable wavelengths. In this manner, a mixture of 

10 cDNAs applied to an array with pairs of labeled and unlabeled probes specific for species 
to be identified may be examined for the presence of and expression level of cDNA 
species. According to a preferred embodiment this approach may be carried out to 
sequence portions of cDNAs by selecting pairs of unlabeled and labeled probes pairs 
comprising sequences which overlap along the sequence of a cDNA to be detected. 

15 Probes may be selected to detect the presence and quantity of particular 

pathogenic organisms genome by including in the composition selected probe pairs 
which appear in combination only in target pathogenic genome organisms. Thus, while 
no single probe pair may necessarily be specific for the pathogenic organism genome, the 
combination of pairs is. Similarly, in detecting or sequencing cDNAs, it might occur that 

20 a particular probe is not be specific for a cDNA or other type of species. Nevertheless, the 
presence and quantity of a particular species may be determined by a result wherein a 
combination of selected probes situated at distinct array locations is indicative of the 
presence of a particular species. 

An infectious agent with about lOkb or more of DNA may be detected using a 

25 support-bound detection chip without the use of polymerase chain reaction (PCR) or 
other target amplification procedures. According to other methods, the genomes of 
infectious agents including bacteria and viruses are assayed by amplification of a single 
target nucleotide sequence through PCR and detection of the presence of target by 
hybridization of a labelled probe specific for the target sequence. Because such an assay 

30 is specific for only a single target sequence it therefore is necessary to amplify the gene 
by methods such as PCR to provide sufficient target to provide a detectable signal. 


183 


.i o o a M-ft 1+3 . o h e fin o 


According to this example, an improved method of detecting nucleotide 
sequences characteristic of infectious agents through a Format 3-type reaction is provided 
wherein a solid phase detection chip is prepared which comprises an array of multiple 
different immobilized oligonucleotide probes specific for the infectious agent of interest: 
5 A single dot comprising a mixture of many unlabeled probes complementary to the target 
nucleic acid concentrates the label specific to a species at one location thereby improving 
sensitivity over diffuse or single probe labeling. Such multiple probes may be of : 
overlapping sequences of the target nucleotide sequence but may also be non-overlapping 
sequences as well as non-adjacent. Such probes preferably have a length of about 5 to 12 
10 nucleotides. 

A nucleic acid sample exposed to the probe array and target sequences present in 
the sample will hybridize with the multiple immobilized probes. A pool of multiple 
labeled probes selected to specifically bind to the target sequences adjacent to the 
immobilized probes is then applied with the sample to an array of unlabeled 

15 oligonucleotide probe mixtures. Ligase enzyme is then applied to the chip to ligate the 

adjacent probes on the sample. The detection chip is then washed to remove unhybridized 
and unligated probe and sample nucleic acids and the presence of sample nucleic acid 
may be determined by the presence or absence of label. This method provides reliable 
sample detection with about a 1000-fold reduction of molarity of the sample agent. 

20 As a further aspect of the invention, the signal of the labelled probes may be 

amplified by means such as providing a common tail to the free probe which itself 
comprises multiple chromogenic, enzymatic or radioactive labels or which is itself 
susceptible to specific binding by a further probe agent which is multiply labelled. In this 
way, a second round of signal amplification may be carried out. Labeled or unlabeled 

25 probes may be used in a second round of amplification. In this second round of 

amplification, a lengthy DNA sample with multiple labels may result in an increased 
amplification intensity signal between 10 to 100 fold which may result in a total signal 
amplification of 100,000 fold. Through the use of both aspects of this example, an 
intensity signal approximately 100,000 fold may give a positive result of probe-DNA 

30 ligation without having to employ PCR or other amplification procedures. 
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According to a further aspect of the invention an array or super array may be 
prepared which consists of a complete set of probes, for example 4096 6-mer probes. 
Arrays of this type are universal in a sense that they can be used for detection or partial to 
complete sequencing of any nucleic acid species. Individual spots in an array may contain 
5 single probe species or mixtures of probes, for example N(l-3) B(4-6) N(l-3) type of 
mixtures that are synthesized in the single reaction (N represents all four nucleotides, B 
one specific nucleotide and where the associated numbers are a range of numbers of 
bases i.e., 1-3 means "from one to three bases"). These mixtures provide stronger signal 
for a nucleic acid species present at low concentration by collecting signal from different 

10 parts of the same long nucleic acid species molecule. The universal set of probes may be 
subdivided in many subsets which are spotted as unit arrays separated by barriers that 
. prevent spreading of hybridization buffer with sample and labeled probe(s). 

For detection of a nucleic acid species with a known sequence one of more 
oligonucleotide sequences comprising both unlabelled fixed and labeled probes in 

1 5 solution may be selected. Labeled probes are synthesized or selected from the 

presynthesized complete sets of, for example, 7-mers. The labeled probes are added to 
. . corresponding unit arrays of fixed probes such that a pair of fixed and labeled probes will 
-* adjacently hybridize to the target sequence such that upon administration of ligase the 
probes will be covalently bound. 

20 If a unit array contains more than one fixed probe (as separated spots or within the 

same spot) that are positive in a given nucleic acid species all corresponding labeled 
probes may be mixed and added to the same unit array. The mixtures of labeled probes 
are even more important when mixtures of nucleic acid species are tested. One example 
of a complex mixture of nucleic acid species are mRNAs in one cell or tissue. 

25 According to one embodiment of the invention unit arrays of fixed probes allow 

use of every possible immobilized probe with cocktails of a relatively small number of 
labeled probes. More complex cocktails of labeled probes may be used if a multiplex 
labeling scheme is implemented. Preferred multiplexing methods may use different 
fluorescent dyes or molecular tags that may be separated by mass spectroscopy. 

30 Alternatively, according to a preferred embodiment of the invention, relatively 

short fixed probes may be selected which frequently hybridize to many nucleic acid 
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sequences. Such short probes are used in combination with a cocktail of labeled probes 
which may be prepared such that at least one labeled probe corresponds to each of the 
fixed proves. Preferred cocktails are those in which none of the labeled probes 
corresponds to more than one fixed probe. 
5 5.45 INTERROGATION OF SEGMENTS OF THE HIV VIRUS WITH 

ALL POSSIBLE 5-MERS 

In this example of Format III SBH, an array was generated on nylon membranes 

. . (e.g., Gene Screen) of all possible bound 5-mers (1024 possible pentamers). The bound 
5-mer oligonucleotides were synthesized with 5' tails of 5 ' -TTTTTT-NNN-3 ' (N = all 

10 four bases A, C, G, T, at this step in the synthesis equal molar amounts of all four bases 
. are added). These oligonucleotides were precisely spotted onto the nylon membrane, the 
spots were allowed to dry, and the oligonucleotides were immobilized by treating the 
dried spots with UV light. Oligonucleotide densities of up to 1 8 oligonucleotides per 
square nanometer were obtained using this method. After the UV treatment, the nylon 

15 membranes were treated with a detergent containing buffer at 60-80°C. The spots of 

oligonucleotides were gridded in subarrays of 10 by 10 spots, and each subarray has 64 5- 
mer spots and 36 control spots. 16 subarrays give 1024 5-mers which encompasses all 
possible 5-mers. 

The subarrays in the array were partitioned from each other by physical barriers, 
20 e.g., a hydrophobic strip, that allowed each subarray to be hybridized to a sample without 
cross-contamination from adjacent subarrays. In a preferred embodiment, the 
hydrophobic strip is made from a solution of silicone (e.g., household silicone glue and 
seal paste) in an appropriate solvent (such solvents are well known in the art). This 
solution of silicone grease is applied between the subarrays to form lines which after the 
25 solvent evaporates act as hydrophobic strips separating the cells. 

In this Format III example, the free or solution (nonbound) 5-mers were 
synthesized with 3' tails of 5'-NN-3' (N = all four bases A, C, G, T). In this embodiment, 
the free 5-mers and the bound 5-mers are combined to produce all possible I 0-mers for 
sequencing a known DNA sequence of less than 20kb. 20kb of double stranded DNA is 
30 denatured into 40 kb of single-stranded DNA. This 40 kb of ss DNA hybridizes to about 
4% of all possible 1 0-mers. This low frequency of 10-mer binding and the known target 
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sequence allow the pooling of free or solution (nonbound) 5-mers for treatment of each 
subarray, without a loss of sequence information. In a preferred embodiment, 16 probes 
are pooled for each subarray, and all possible 5-mers are represented in 64 total pools of 
free 5-mers. Thus, all possible I 0-mers may be probed against a DNA sample using 
5 1024 subarray s (16 subarray s for each pool of free 5-mers). 

The target DNA in this embodiment represents two-600 bp segments of the HIV 
virus. These 600 bp segments are represented by pools of 60 overlapping 3 0-mers (the 
30-mers overlap each adjacent 30 mer by 20 nucleotides). The pools of 30-mers mimic a 
target DNA that has been treated using techniques well known in the art to shear, digest, 

10 and/or random PGR the target DNA to produce a random pool of very small fragments. 

As described above in the previous Format III examples, the free 5-mers are 
labeled with radioactive isotopes, biotin, fluorescent dyes, etc. The labeled free 5-mers 
are then hybridized along with the bound 5-mers to the target DNA, and ligated. In a 
preferred embodiment, 300-1000 units of ligase are added to the reaction. The 

1 5 hybridization conditions were worked out following the teachings of the previous 

examples. Following ligation and removal of the target DNA and excess free probe, the 
. array is assayed to determine the location of labeled probes (using the techniques 
described in the examples above). 

The known DNA sequence of the target, and the known free and bound 5-mers in 

20 each subarray, predict which bound 5-mers will be ligated to a labeled free 5-mer in each 
subarray. The signal from 20 of these predicted dots were lost and 20 new signals were 
gained for each change in the target DNA from the predicted sequence. The overlapping 
sequence of the bound 5-mers in these ten new dots identifies which free, labeled 5-mer 
is bound in each new dot. 

25 Using the described methods, arrays and pools of free, labeled 5-mers, the test 

HIV DNA sequence was probed with all possible lOmers. Using this Format III 
approach, we properly identified the "wild-type" sequence of the segments tested, as well 
as several sequence "mutants" that were introduced into these segments. 
5.46 SEQUENCING OF REPETITIVE DNA SEQUENCES 

30 Repetitive DNA sequences in the target DNA are sequenced with "spacer 

oligonucleotides" in a modified Format III approach. Spacer oligonucleotides of varying 
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lengths of the repetitive DNA sequence (the repeating sequence is identified on a first 
SBH run) are hybridized to the target DNA along with a first known adjoining 
oligonucleotide and a second known, or group of possible oligonucleotides adjoining the 
other side of the spacer (known from the first SBH run). When a spacer matching the 
5 length of the repetitive DNA segment is hybridized to the target, the two adjacent 

oligonucleotides can be ligated to the spacer. If the first known oligonucleotide is fixed to 
a substrate, and the second known or possible oligonucleotide(s)is labeled, a bound 
ligation product including the labeled second known or possible oligonucleotide's 
formed when a spacer of the proper length is hybridized to the target DNA. 
1 0 5.47 SEQUENCING THROUGH BRANCH POINTS WITH FORMAT III 

SBH 

Branch points in the target DNA are sequenced using a third set of 
oligonucleotides and a modified Format III approach. After a first SBH run, several 
branch points may be identified when the sequence is compiled. These can be solved by 

15 hybridizing oligonucleotide(s)that overlap partially with one of the known sequences 
leading into the branch point and then hybridizing to the target an additional 
oligonucleotide that is labeled and corresponds to one of the sequences that comes out of 
the branch point. When the proper oligonucleotides are hybridized to the target DNA, the 
labeled oligonucleotide can be ligated to the other(s). In a preferred embodiment, a first 

20 oligonucleotide that is offset by one to several nucleotides from the branch point is 
selected (so that it reads into one of the branch sequences), a second oligonucleotide 
reading from the first and into the branch point sequence is also selected, and a set of 
third oligonucleotides that correspond to all the possible branch sequences with an 
overlap of the branch point sequence by one or a few nucleotides (corresponding to the 

25 first oligonucleotide) is selected. These oligonucleotides are hybridized to the target 

DNA, and only the third oligonucleotide with the proper branch sequence (that matches 
the branch sequence of the first oligonucleotide) will produce a ligation product with the 
first and second oligonucleotides. 

30 5.48 MULTIPLEXING PROBES FOR ANALYZING A TARGET 

NUCLEIC ACID 
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Sets of probes are labeled with different labels so that each probe of a set can be 
differentiated from the other probes in the set. Thus, the set of probes may be contacted 
with target nucleic acid in a single hybridization reaction without the loss of any probe 
information. In preferred embodiments, the different labels are different radioisotopes, or 
5 different fluorescent labels, or different EMLs. These sets of probes may be used in either 
Format I, Format II or Format III SBH. 

In Format 1 SBH, the set of differently labeled probes are hybridized to target 
nucleic acid which is fixed to a substrate under conditions that allow differentiation 
between perfect matches one base-pair mismatches. Specific probes which bind to the 
10 target nucleic acid are identified by their different labels and perfect matches are 
determined, at least in part, from this binding information. 

In Format II SBH, the target nucleic acids are labeled with different probes and 
hybridized to arrays of probes. Specific target nucleic acids which bind to the probes are 
identified by their different labels and perfect matches are determined, at least in part, 
1 5 form this binding information. 

In Format III SBH, the set of differently labeled probes and fixed probes are 
hybridized to a target nucleic acid under conditions that allow perfect matches to be 
differentiated from one base-pair mismatches. Labeled probes that are adjacent on the 
target to a fixed probe are bound to the fixed probe, and these products are detected and 
20 differentiated by their different labels. 

In a preferred embodiment, the different labels are EMLs, which can be detected 
by electron capture mass spectrometry (EC-MS). EMLs may be prepared from a variety 
of backbone molecules, with certain aromatic backbones being particularly preferred, 
e.g., see Xu et aL, J. Chromatog. 764:95-102 (1997). The EML is attached to a probe in a 
25 reversible and stable manner, and after the rpobe is hybridized to target nucleic acid, the 
EML is removed from the probe and identified by standard EC-MS (e.g., the EC-MS may 
be done by a gas chromotograph-massspectrometer). 

5.49 DETECTION OF LOW FREQUENCY TARGET NUCLEIC ACIDS 

30 Format III SBH has sufficient discrimination power to identify a sequence that is 

present in a sample at 1 part to 99 parts of a similar sequence that differs by a single 
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nucleotide. Thus, Format III can be used to identify a nucleic acid present at a very low 
concentration in a sample of nucleic acids, e.g., a sample derived from blood. 

In one embodiment, the two sequences are for cystic fibrosis and the sequences 
differ from each other by a deletion of three nucleotides. Probes for the two sequences 
5 were as follows, probes distinguishing the deletion from wild type were fixed to a 

substrate, and a labeled contiguous probe was common to both. Using these targets and 
probes, the deletion mutant could be detected with Format III SBH when it was present at 
one part to ninety nine pans of the wild-type. 

5.50 POLAROID APPARATUS AND METHOD FOR ANALYZING A 

10 TARGET NUCLEIC ACID 

An apparatus for analyzing a nucleic acid can be constructed with two arrays of 
nucleic acids, and an optional material that prevents the nucleic acids of the two arrays 
from mixing until such mixing is desired. The arrays of the apparatus may be supported 
by a variety of substrates, including but not limited to, nylon membranes, nitrocellulose 

1 5 membranes, or other materials disclosed above. In preferred embodiments, one of the 
substrate is a membrane separated into sectors by hydrophobic strips, or a suitable 
support material with wells which may contain a gel or sponge. In this embodiment, 
probes are placed on a sector of the membrane, or in the well, the gel, or sponge, and a 
solution (with or without target nucleic acids) is added to the membrane or well so that 

20 the probes are solubilized. The solution with the solubilized probes is then allowed to 
contact the second array of nucleic acids. The nucleic acids may be, but are not limited 
to, oligonucleotide probes, or target nucleic acids, and the probes or target nucleic acids 
may be labeled. The nucleic acids may be labeled with any labels conventionally used in 
the art, including but not limited to radioisotopes, fluorescent labels or electrophore mass 

25 labels. 

The material which prevents mixing of the nucleic acids may be disposed between 
the two arrays in such a way that when the material is removed the nucleic acids of the 
two arrays mix together. This material may be in the form of a sheet, membrane, or other 
barrier, and this material may be comprised of any material that prevents mixing of the 
30 nucleic acids. 
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This apparatus may be used in Format I SBH as follows: a first array of the 
apparattus has target nucleic acids that are fixed to the substrate, and a second array of 
the apparatus has nucleic acid probes that are labeled and can be removed to interrogate 
the target nucleic acid of the first array. The two arrays are optionally separated by a 
5 sheet of material that prevents the probes from contacting the target nucleic acid, and 
when this sheet is removed, the probes can interrogate the target. After appropriate 
incubation, and (optionally) washing steps the array of targets may be read to determine 
which probes formed perfect matches with the target. This reading may be automated or 
can be done manually (e.g. by eye with an autoradiogram). In format II SBH, the 

10 procedure followed would be similar to that described above except the target is labeled 
and the probes are fixed. 

Alternatively the apparatus may be used in Format III SBH as follows: to arrays 
of nucleic acid probes are formed, the nucleic acid probes of either or both arrays may be 
labeled and one of the arrays may be fixed to its substarate. The two arrays are separated 

15 by a sheet of material that prevents the probes from mixing. A Format II reaction is 

initiated by adding target nucleic acid and removing the sheet allowing the probes to mix 
with each other and the target. Probes which bind to adjacent sites on the target are 
bound together {e.g., by base stacking interactions or by covalently joining the 
backbones), and the results are read to determine which probes bound to the target at 

20 adjacent sites. When one set of probes is fixed to the substrate, the fixed array can be 
read to determine which probes from the other array are bound together with the fixed 
probes. As with the method above, this reading may be automated {e.g., with an ELISA 
reader) or can be done manually {e.g., by eye with an autoradiogram). 

25 
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6.0 EXAMPLES 

Selected tables are provided on the CD-ROM, supplied concurrently with this 
application. The entirety of the contents of the CD-ROM are intended to be a part of this 
application, and are incorporated by reference. The electronic file name, where appropriate, 
5 follows the first reference to the table throughout the application. 

6. 1 The 748 Gene Family 
Novel Contigs 

The novel contigs of the invention, were assembled from novel expressed 
sequence tags (EST's) isolated by methods described herein (e.g., SBH), and in some 

10 cases sequences obtained from one or more public databases. The inserts for the cDNA 
libraries from which the novel ESTs were obtained were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. These samples were 
spotted onto nylon membranes and interrogated with oligonucleotide probes to give 
sequence signatures. The clones were clustered into groups of similar or identical 

1 5 sequences, and single representative clones were selected from each group for gel 

sequencing. The 5' sequence of the amplified inserts was then deduced using the reverse 
Ml 3 sequencing primer in a typical Sanger sequencing protocol. PCR products were 
purified and subjected to fluorescent dye terminator cycle sequencing. Single pass gel 
sequencing was done using a 377 Applied Biosy stems (ABI) sequencer to obtain the 

20 novel ESTs. The novel contigs of the invention, were assembled from the novel ESTs 
and, in some cases, sequences obtained from one or more public databases. The 
sequences for the resulting contigs from the 748 gene family are designated as 748 SEQ 
ID NO: 1-45,196 and are provided in the Sequence Listing. 

25 6.2 The 752 Gene Family 

Novel Contigs 

The novel contigs of the invention, were assembled from novel expressed 
sequence tags (EST's) isolated by methods described herein (e.g., SBH), and in some 
cases sequences obtained from one or more public databases. The inserts for the cDNA 
30 libraries from which the novel ESTs were obtained were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. These samples were 
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spotted onto nylon membranes and interrogated with oligonucleotide probes to give 
sequence signatures. The clones were clustered into groups of similar or identical 
sequences, and single representative clones were selected from each group for gel 
sequencing. The 5' sequence of the amplified inserts was then deduced using the reverse 
5 Ml 3 sequencing primer in a typical Sanger sequencing protocol. PCR products were 
purified and subjected to fluorescent dye terminator cycle sequencing. Single pass gel 
sequencing was done using a 377 Applied Biosy stems (ABI) sequencer to obtain the 
novel ESTs. The novel contigs of the invention, were assembled from the novel ESTs 
and, in some cases, sequences obtained from one or more public databases. The 
10 sequences for the resulting contigs from the 752 gene family are designated as 752 SEQ 
ID NO: 1-13,192 and are provided in the Sequence Listing. 

6.3 The 778 Gene Family 

Novel Contigs 

1 5 The novel contigs of the invention, were assembled from novel expressed 

sequence tags (ESTs) isolated by methods described herein (e.g., SBH), and in some 
cases sequences obtained from one or more public databases. The inserts for the cDNA 
libraries from which the novel ESTs were obtained were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. These samples were 

20 spotted onto nylon membranes and interrogated with oligonucleotide probes to give 
sequence signatures. The clones were clustered into groups of similar or identical 
sequences, and single representative clones were selected from each group for gel 
sequencing. The 5' sequence of the amplified inserts was then deduced using the reverse 
M13 sequencing primer in a typical Sanger sequencing protocol. PCR products were 

25 purified and subjected to fluorescent dye terminator cycle sequencing. Single pass gel 
sequencing was done using a 377 Applied Biosy stems (ABI) sequencer to obtain the 
novel ESTs. The novel contigs of the invention, were assembled from the novel ESTs 
and, in some cases, sequences obtained from one or more public databases. The 
sequences for the resulting contigs from the 778 gene family are designated as 778 SEQ 

30 ID NO: 1-94 and are provided in the Sequence Listing. 
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6.4 The 779 Gene Family 

Novel Contigs 

The novel contigs of the invention, were assembled from sequences that were 
obtained from cDNA libraries by methods described herein (e.g., SBH). Briefly, clones 
5 from cDNA libraries were spotted on nylon membrane filters and screened with 

oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences, and single representative clones 
were selected from each group for gel sequencing. The inserts for the cDNA libraries 
from which the sequences were obtained were amplified with PCR using primers specific 

10 for the vector sequences which flank the inserts, or isolated from plasmid preparations. 
The 5' sequence of the amplified inserts was then deduced using the reverse Ml 3 
sequencing primer in a typical Sanger sequencing protocol, as well as internal primers in 
both the forward and reverse direction. In some cases RACE (Random Amplification of 
cDNA Ends) was performed to further extend the sequence in the 5' direction. In some 

15 cases all of a signature cluster was sequenced to generate overlapping clones to assemble 
the contigs. 

Chromatograms were base called and assembled using a software suite from 
Washington University, MO containing three applications designated PHRED, PEIRAP, 
and CONSED. The sequences for the resulting contigs for the 779 gene family are 

20 designated as 779 SEQ ID NO: 1-128 and are provided in the Sequence Listing. 

Additionally, contig sequences were BLASTed against Hyseq's database to to 
determine adequate sequence homology for addition to the contigs. Full length clones of 
the entire message were obtained either by identifying clones which contained the 
beginning of the open reading frame and fully sequenced and verified against the contig. 

25 If no clones were available containing the full length sequence, PCR was used to generate 
an amplicon from tissue libraries and the entire length sequenced and verified against the 
contig. 

6.5 The 782 Gene Family 

Novel Contigs 

30 A novel contig of the invention was assembled from sequences that were obtained 

from a cDNA library by methods described herein (e.g., SBH). Briefly, clones from cDNA 
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libraries were spotted on nylon membrane filters and screened with oligonucleotide probes 
{e.g., 7-mers) to obtain signature sequences. The inserts for the cDNA libraries from which 
the sequences were obtained were amplified with PCR using primers specific for the vector 
sequences which flank the inserts, or isolated from plasmid preparations. The 5' sequence of 
5 the amplified inserts was then deduced using the reverse Ml 3 sequencing primer in a typical 
Sanger sequencing protocol, as well as internal primers in both the forward and reverse 
direction. In some cases RACE (Random Amplification of cDNA Ends) was performed to 
further extend the sequence in the 5' direction. In all cases all of a signature cluster was 
sequenced to generate overlapping clones to assemble the contigs. Chromatograms were 

10 base called and assembled using a software suite from University of Washington, Seattle 
containing three applications designated PHRED, PHRAP, and CONSED. The sequences 
for the resulting contigs for the 782 gene family are designated as 782 SEQ ID NO: 1- 
10,451 and are provided in the attached Sequence Listing, inserts was then deduced in a 
typical Sanger sequencing protocol. The inserts of the library were, amplified with PCR 

1 5 using 5 primers specific for vector sequences which flank the inserts. 

The contigs were assembled using an EST sequence as a seed. Then a recursive 
algorithm was used to extend the seed EST into an extended assemblage, by pulling 
additional sequences from different databases (i.e., Hyseq's database containing EST 
sequences, dbEST version 1 14, gb pri 114, and UniGene version 101) that belong to this 

20 assemblage. The algorithm terminated when there was no additional sequences from the 
above databases that would extend the assemblage. Inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 

The nearest neighbor result for the assembled contig was obtained by a FASTA 

25 version 3 search against Genpept release 1 14, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
contains the translated amino acid sequences for which the assemblage encodes). The 
nearest neighbor results for 782 SEQ ID NO: 1-10,451 are shown in Table 2, and 

30 identified as Table2(782).doc on the enclosed compact disc. 
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6.6 The 784 Gene Family 

Novel Contigs 

Table 3 (identified as Table3(784).doc on the enclosed CD) sets forth the novel 
predicted polypeptides (including proteins) encoded by the novel polynucleotides (784 SEQ 
5 ID NO: 1-10,289) of the present invention, and their corresponding nucleotide locations to 
each of 748 SEQ ID NO: 1-10,289. Table 3 also indicates the method by which the 
polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 
program called FASTY (available from http://fasta.bioch.virginia.edu) which selects a 
polypeptide based on a comparison of translated novel polynucleotide to known 

10 polypeptides (W.R. Pearson* Methods in Enzymology, 183: 63-98 (1990), incorporated * 
herein by reference). Method B refers to a polypeptide obtained by using a software 
program called GenScan for human/vertebrate sequences (available from Stanford 
University, Office of Technology Licensing) that predicts the polypeptide based on a 
probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. 

15 Mol. Biol., 268: 78-94 (1997), incorporated herein by reference). Method C refers to a 
polypeptide obtained by using a Hyseq proprietary software program that translates the 
novel polynucleotide and its complementary strand into six possible amino acid sequences 
(forward and reverse frames) and chooses the polypeptide with the longest open reading 
frame. When the predicted beginning nucleotide of Table 3 is a higher number than the 
. 20 predicted end nucleotide of Table 3, then the amino acid sequence is derived from the 
complementary strand of the indicated SEQ ID NO. The locations of the predicted 
beginning and end nucleotides correlate to the nucleotide sequence of the indicated SEQ ID 
NO., not its complementary strand. 

The isolated polypeptides of the invention include, but are not limited to, a 

25 polypeptide comprising any of the amino acid sequences set forth in Table 3 or from six 
frame translations of 784 SEQ ID NO: 1-10,289; or the corresponding full length or 
mature protein. One of skill in the art could determine the corresponding amino acid 
sequence using techniques well known in the art to translate and analyze all possible six 
frames. Polypeptides of the invention also include polypeptides with biological activity 

30 that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth 
in the 784 SEQ ID NO: 1-10,289; or (b) polynucleotides that hybridize to the 
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complement of the polynucleotids of (a) under stringent hybridization conditions. 
Biologically or immunologically active variants of any of the polypeptide sequences in 
Table 3 or from six frame translations of 784 SEQ ID NO: 1-10,289, and "substantial 
equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 
5 98% or 99% amino acid sequence identity) that preferably retain biological activity are 
also contemplated. The polypeptides of the invention may be wholly or partially 
chemically synthesized but are preferably produced by recombinant means using the 
genetically engineered cells (e.g. host cells) of the invention. 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared 20 ■ 

10 from various human tissues and in some cases isolated from a genomic library derived 
from human chromosome using standard PCR, SBH sequence signature analysis and- 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 

15 probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
typical Sanger sequencing protocol. PCR products were purified and subjected to 
fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 

20 a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 

The contigs or the nucleic acids of the present invention, designated as 784 SEQ 
ID NO: 1-10,289 were assembled using an EST sequence as a seed. Then a recursive 

25 algorithm was used to extend the seed EST into an extended assemblage, by pulling 
additional sequences from different databases (i.e., Hyseq's database containing EST 
sequences, dbEST version 1 14, gb pri 1 14, and UniGene version 101) that belong to this 
assemblage. The algorithm terminated when there was no additional sequences from the 
above databases that would extend the assemblage. Inclusion of component sequences 

30 into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 
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Table 4 (identified as Table4(784).doc on the enclosed CD) shows the various 
tissue sources of the EST sequences from Hyseq's database which were used to assemble 
the contigs or nucleic acids of the present invention (identified by 784 SEQ ID NO: 1- 
10,289). 

5 The nearest neighbor result for the assembled contig was obtained by a FASTA 

version 3 search against Genpept release 1 14, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
contains the translated amino acid sequences for which the assemblage encodes). The 
10 nearest neighbor results for 784 SEQ ID NO: 1-10,289 are shown in the Table 5, infra. 

6.7 The 785 Gene Family 

Novel Nucleic Acid Sequences Obtained From Various Libraries 
A plurality of novel nucleic acids were obtained from cDNA libraries prepared 
15 from various human tissues and in some cases isolated from a genomic library derived 
from human chromosome using standard PCR, SBH sequence signature analysis and 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
20 probes {e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
typical Sanger sequencing protocol. PCR products were purified and subjected to 
fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 
25 a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 

The novel contigs of the invention were assembled from sequences that were 
obtained from a cDNA library by methods described above, and in some cases sequences 
30 obtained from one or more public databases. Chromato grams were base called and 

assembled using a software suite from University of Washington, Seattle containing three 
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applications designated PJRED, PHRAP, and CONSED. The sequences for the resulting 
contigs are designated as 785 SEQ ID NO: 1-3,796 and are provided in the Sequence 
Listing. The contigs were assembled using an EST sequence as a seed. Then a recursive 
algorithm was used to extend the seed EST into an extended assemblage, by pulling 
5 additional sequences from different databases (i.e., Hyseq's database containing EST 
sequences, dbEST version 1 14, gb pri 114, and UniGene version 101) that belong to this 
assemblage. The algorithm terminated when there was no additional sequences from the 
above databases that would extend the assemblage. Inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with 

10 BLAST score greater than 300 and percent identity greater than 95%. 

The nearest neighbor result for the assembled contig was obtained by a FASTA 
version 3 search against Genpept release 114, using Fastxy algorithm. Fastxy is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 

15 contains the translated amino acid sequences for which the assemblage encodes). The 

nearest neighbor results for 785 SEQ ID NO: 1-3,796 are shown in Table 6 (identified as 
Table6(785).doc on the enclosed CD), infra. 

The nucleotide sequence within the assembled contigs that codes for signal 
peptide sequences and their cleavage sites can be determined from using Neural network 

20 SignalP Vl.l program (from Center for Biological Sequence Analysis, The Technical 
University of Denmark). The process for identifying prokaryotic and eukaryotic signal 
peptidesand their cleavage sites are also disclosed iby Henrick Nielson, Jacob 
Englebrecht, Soren Brunak, and Gunnar von Heijne in the publication "Identification of 
prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites" Protein 

25 Engineering , vol. 10, no. 1, pp. 1-6 (1997) incorporated herein by reference. A maximum 
S score and a mean S score, as described in the Nielson et. al., reference, are obtained 
from each assembled contig. Table 7 sets forth the nucleotide sequence range for each 
sequence of 785 SEQ ID NO: 1-3,796 that encodes a corresponding forty-five amino acid 
sequence containing the signal peptide sequence and its cleavage site, the maximum S 

30 score and the mean S score obtained for each sequence. Not all forty-five amino acids in 
the sequence may comprise the signal peptide. 


199 


..1, 0 Cii B «+6 . O 2 iS £n O Sir 


6.8 The 787 Gene Family 

Table 8 (identified as Table8(787).doc on the enclosed CD) sets forth the novel 
predicted polypeptides (including proteins) encoded by the novel polynucleotides (787 SEQ 
5 ID NO: 1-3960) of the present invention, and their corresponding nucleotide locations to 
each of 787 SEQ ID NO: 1-3960. Table 8 also indicates the method by which the 
polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 
. program called FASTY (available from http://fasta.bioch.virginia.edu) which selects a 
polypeptide based on a comparison of translated novel polynucleotide to known 

10 polypeptides (W.R. Pearson, Methods in Enzymology, 183: 63-98 (1990), incorporated 
herein by reference). Method B refers to a polypeptide obtained by using a software 
program called GenScan for human/vertebrate sequences (available from Stanford 
University, Office of Technology Licensing) that predicts the polypeptide based on a 
probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. 

15 Mol. Biol., 268: 78-94 (1997), incorporated herein by reference). Method C refers to a 
polypeptide obtained by using a Hyseq proprietary software program that translates the 
novel polynucleotide and its complementary strand into six possible amino acid sequences 
(forward and reverse frames) and chooses the polypeptide with the longest open reading 
frame. When the predicted beginning nucleotide of Table 8 is a higher number than the 

20 predicted end nucleotide of Table 8, then the amino acid sequence is derived from the 
complementary strand of the indicated SEQ ID NO. The locations of the predicted 
beginning and end nucleotides correlate to the nucleotide sequence of the indicated SEQ ID 
NO., not its complementary strand. 

The isolated polypeptides of the invention include, but are not limited to, a 

25 polypeptide comprising any of the amino acid sequences set forth in Table 8 or from six 
frame translations of 787 SEQ ID NO: 1-3960; or the corresponding full length or mature 
protein. One of skill in the art could determine the corresponding amino acid sequence 
using techniques well known in the art to translate and analyze all possible six frames. 
Polypeptides of the invention also include polypeptides with biological activity that are 

30 encoded by (a) any of the polynucleotides having a nucleotide sequence set forth in the 
787 SEQ ID NO: 1-3960; or (b) polynucleotides that hybridize to the complement of the 
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polynucleotids of (a) under stringent hybridization conditions. Biologically or 
immunologically active variants of any of the polypeptide sequences in Table 8 or from 
six frame translations of 787 SEQ ID NO: 1-3960, and "substantial equivalents" thereof 
(e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid 
5 sequence identity) that preferably retain biological activity are also contemplated. The 
polypeptides of the invention may be wholly or partially chemically synthesized but are 
preferably produced by recombinant means using the genetically engineered cells (e.g. 
host cells) of the invention. 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared 20 
10 from various human tissues and in some cases isolated from a genomic library derived 
. from human chromosome using standard PCR, SBH sequence signature analysis and 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
15 probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
typical Sanger sequencing protocol. PCR products were purified and subjected to 
fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 
20 a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 
Novel Contigs 

The contigs or the nucleic acids of the present invention, designated as 787 SEQ 
25 ID NO: 1-3960 were assembled using an EST sequence as a seed. Then a recursive 
algorithm was used to extend the seed EST into an extended assemblage, by pulling 
additional sequences from different databases (i.e., Hyseq's database containing EST 
sequences, dbEST version 1 14, gb pri 1 14, and UniGene version 101) that belong to this 
assemblage. The algorithm terminated when there was no additional sequences from the 
30 above databases that would extend the assemblage. Inclusion of component sequences 
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into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 

Table 9 (identified as Table9(787).doc on the enclosed CD) shows the various 
tissue sources of the EST sequences from Hyseq's database which were used to assemble 
5 the contigs or nucleic acids of the present invention (identified by 787 SEQ ID NO: 1- 
3960). 

The nearest neighbor result for the assembled contig was obtained by a FASTA 
version 3 search against Genpept release 1 14, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 

10 neighbor result showed the closest homologue for each assemblage from Genpept (and 
contains the translated amino acid sequences for which the assemblage encodes). The 
nearest neighbor results for 787 SEQ ID NO: 1-3960 are shown in the Table 10, 
(identified as Table 10(787).doc on the enclosed CD) infra. 
6.9 The 788 Gene Family 

15 Table 11 (identified as Table 1 l(788).doc on the enclosed CD) sets forth the novel 

predicted polypeptides (including proteins) encoded by the novel polynucleotides (788 SEQ 
ID NO: 1-14,074) of the present invention, and their corresponding nucleotide locations to 
each of 788 SEQ ID NO: 1-14,074. Table 11 also indicates the method by which the 
polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 

20 program called FASTY (available from http://fasta.bioch.virginia.edu) which selects a 
polypeptide based on a comparison of translated novel polynucleotide to known 
polypeptides (W.R. Pearson, Methods in Enzymology, 183: 63-98 (1990), incorporated 
herein by reference). Method B refers to a polypeptide obtained by using a software 
program called GenScan for human/vertebrate sequences (available from Stanford 

25 University, Office of Technology Licensing) that predicts the polypeptide based on a 
probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. 
Mol. Biol., 268: 78-94 (1997), incorporated herein by reference). Method C refers to a 
polypeptide obtained by using a Hyseq proprietary software program that translates the 
novel polynucleotide and its complementary strand into six possible amino acid sequences 

30 (forward and reverse frames) and chooses the polypeptide with the longest open reading 
frame. When the predicted beginning nucleotide of Table 1 1 is a higher number than the 
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predicted end nucleotide of Table 11, then the amino acid sequence is derived from the 
complementary strand of the indicated SEQ ID NO. The locations of the predicted 
beginning and end nucleotides correlate to the nucleotide sequence of the indicated SEQ ID 
NO., not its complementary strand. 
5 The isolated polypeptides of the invention include, but are not limited to, a 

polypeptide comprising any of the amino acid sequences set forth in Table 1 1 or from six 
frame translations of 788 SEQ ID NO: 1-14,074; or the corresponding full length or 
mature protein. One of skill in the art could determine the corresponding amino acid 
sequence using techniques well known in the art to translate and analyze all possible six 

10 frames. Polypeptides of the invention also include polypeptides with biological activity 
that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth 
in the 788 SEQ ID NO: 1-14,074; or (b) polynucleotides that hybridize to the 
complement of the polynucleotides of (a) under stringent hybridization conditions. 
Biologically or immunologically active variants of any of the polypeptide sequences in 

15 Table 1 1 or from six frame translations of 788 SEQ ID NO: 1-14,074, and "substantial 
equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 
98% or 99% amino acid sequence identity) that preferably retain biological activity are 
also contemplated. The polypeptides of the invention may be wholly or partially 
chemically synthesized but are preferably produced by recombinant means using the 

20 genetically engineered cells (e.g. host cells) of the invention. 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared 20 
from various human tissues and in some cases isolated from a genomic library derived 
from human chromosome using standard PCR, SBH sequence signature analysis and 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 

25 primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 
In some cases the 5' sequence of the amplified inserts was then deduced using a typical 

30 Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied 
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Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In some cases 
RACE (Random Amplification of cDNA Ends) was performed to further extend the 
sequence in the 5' direction. 
Novel Contigs 

5 The novel contigs of the invention were assembled from sequences that were 

obtained from a cDNA library by methods described above, and in some cases sequences 
obtained from one or more public databases. The sequences for the resulting contigs are 
designated as 788 SEQ ID NO: 1-14,074 and are provided in the attached Sequence 
Listing; The contigs were assembled using an EST sequence as a seed. Then a recursive 

1 0 algorithm was used to extend the seed EST into an extended assemblage, by pulling * 
additional sequences from different databases (i.e., Hyseq's database containing EST 
sequences, dbEST version 1 14, gb pri 114, and UniGene version 101) that belong to this 
assemblage. The algorithm terminated when there was no additional sequences from the 
above databases that would extend the assemblage. Inclusion of component sequences 

1 5 into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 

Table 12 (identified as Table 12(788).doc on the enclosed CD) shows the various 
tissue sources of the EST sequences from Hyseq's database which were used to assemble 
the contigs or nucleic acids of the present invention (identified by 788 SEQ ID NO: 1- 

20 14,074). 

The nearest neighbor result for the assembled contig was obtained by a FASTA 
version 3 search against Genpept release 115, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
25 contains the translated amino acid sequences for which the assemblage encodes). The 
nearest neighbor results for 788 SEQ ID NO: 1-14,074 are shown in the Table 13, 
(identified as Tablel3(788).doc on the enclosed CD) infra. 

6.10 The 789 Gene Family 

30 Table 14 (identified as Table 14(789).doc on the enclosed CD) sets forth the novel 

predicted polypeptides (including proteins) encoded by the novel polynucleotides (789 
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SEQ ID NO: 1-6,391) of the present invention, and their corresponding nucleotide 
locations to each of SEQ ID NO: 1-6,391. Table 14 also indicates the method by which 
the polypeptide was predicted. Method A refers to a polypeptide obtained by using a 
software program called FASTY (available from http://fasta.bioch.virginia.edu) which 
5 selects a polypeptide based on a comparison of translated novel polynucleotide to known 
polypeptides (W.R. Pearson, Methods in Enzymology, 183: 63-98 (1990), incorporated 
. herein by reference). Method B refers to a polypeptide obtained by using a software 
program called GenScan for human/vertebrate sequences (available from Stanford 
University, Office of Technology Licensing) that predicts the polypeptide based on a 
10 probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. 
Mol. Biol., 268: 78-94 (1997), incorporated herein by reference). Method C refers to a 
polypeptide obtained by using a Hyseq proprietary software program that translates the 
. novel polynucleotide and its complementary strand into six possible amino acid 
sequences (forward and reverse frames) and chooses the polypeptide with the longest 
1 5 open reading frame. 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared 
. from various human tissues and in some cases isolated from a genomic library derived 
, from human chromosome using standard PCR, SBH sequence signature analysis and 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
20 primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
probes {e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
25 typical Sanger sequencing protocol. PCR products were purified and subjected to 

fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 
a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 
30 Novel Contigs 
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The novel contigs or the nucleic acids of the present invention of the invention 
were assembled from sequences that were obtained from a cDNA library by methods 
described above, and in some cases sequences obtained from one or more public 
databases. The sequences for the resulting contigs are designated as 789 SEQ ID NO: 1- 
5 6,391 and are provided in the attached Sequence Listing. The contigs were assembled 
using an EST sequence as a seed. Then a recursive algorithm was used to extend the seed 
EST into an extended assemblage, by pulling additional sequences from different 
databases (i.e., Hyseq's database containing EST sequences, dbEST version 1 14, gb pri 
1 14, and UriGene version 101) that belong to this assemblage. The algorithm terminated 

10 when there was no additional sequences from the above databases that would extend the 
assemblage. Inclusion of component sequences into the assemblage was based on a 
BLASTN hit to the extending assemblage with BLAST score greater than 300 and 
percent identity greater than 95 %. 

Table 15 (identified as Table 15(789).doc on the enclosed CD) shows the various 

15 tissue sources of the EST sequences from Hyseq's database which were used to assemble 
the contigs or nucleic acids of the present invention (identified by 789 SEQ ID NO: 1- 
6,391). 

The nearest neighbor result far the assembled contig was obtained by a FASTA 
version 3 search against Genpept release 115, using FASTXY algorithm. FASTXY is an 
20 improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
contains the translated amino acid sequences for which the assemblage encodes). The 
nearest neighbor results for 789 SEQ ID NO: 1-6,391 are shown in the Table 16, 
(identified as Table 16(789).doc on the enclosed CD) infra. 

25 

6. 1 1 The 790 Gene Family 

Table 17 (identified as Table 17(790).doc on the enclosed CD) sets forth the novel 
predicted polypeptides (including proteins) encoded by the novel polynucleotides (790 
SEQ ID NO: 1-30,553) of the present invention, and their corresponding start and stop 
30 nucleotide location to each of 790 SEQ ID NO: 1-30,553. Table 17 also indicates the 
method by which the polypeptide was predicted. Method A refers to a polypeptide 
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obtained by using a software program called FASTY (available from 
http://fasta.bioch.virginia.edu) which selects a polypeptide based on a comparison of 
translated novel polynucleotide to known polypeptides (W.R. Pearson, Methods in 
Enzymology, 183: 63-98 (1990), incorporated herein by reference). Method B refers to a 
5 polypeptide obtained by using a software program called GenScan for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that 
predicts the polypeptide based on a probabilistic model of gene structure/compositional 
properties (C. Burge and S. Karlin, J. MoL Biol., 268: 78-94 (1997), incorporated herein 
by reference). Method C refers to a polypeptide obtained by using a Hyseq proprietary 

10 software program that translates the novel polynucleotide and its complementary strand 
into six possible amino acid sequences (forward and reverse frames) and chooses the 
polypeptide with the longest open reading frame. 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared 
from various human tissues and in some cases isolated from a genomic library derived 

15 from human chromosome using standard PCR, SBH sequence signature analysis and 

Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 

20 of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 
typical Sanger sequencing protocol. PCR products were purified and subjected to 
fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 
a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 

25 some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 
Novel Contigs 

The contigs or the nucleic acids of the present invention, designated as 790 SEQ 
ID NO: 1-30,553 were assembled using an EST sequence from Hyseq's database as a 
30 seed. A recursive algorithm was used to extend the seed EST into an extended 

assemblage, by pulling additional sequences from different databases (e.g., Hyseq's 
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database containing EST sequences, dbEST version 1 15, gb pri 1 15, and UniGene 
version 10.3, and exons from public domain genomic sequences predicted by GenScan) 
that belong to this assemblage. The algorithm terminated when there were no additional 
sequences from the databases that will extend the assemblage. Further, the inclusion of 
5 component sequences into the assemblage was based on a BLASTN hit to the extending 
assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 18 (identified as Table 18(790).doc on the enclosed CD) shows the various 
tissue sources of the EST sequences from Hyseq's database which were used to assemble 
the contigs or nucleic acids of the present invention (identified by 790 SEQ ID NO:l- 
10 30,553). 

The nearest neighbor result for the assembled contig was obtained by a FASTA 
version 3 search against Genpept release 1.15, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
1 5 contains the translated amino acid sequences for which the assemblage encodes). The 
nearest neighbor results for 790 SEQ ID NO: 1-30,553 are shown in the Table 19, 
(identified as Tablel9(790).doc on the enclosed CD) infra. 

6.12 The 791 Gene Family 

20 Table 20 (identified as Table20(791).doc on the enclosed CD) sets forth the novel 

predicted polypeptides (including proteins) encoded by the novel polynucleotides (791 
SEQ ID NO: 1-5,822) of the present invention, and their corresponding nucleotide 
locations to each of 791 SEQ ID NO: 1-5,822. Table 20 also indicates the method by 
which the polypeptide was predicted. Method A refers to a polypeptide obtained by using 

25 a software program called FASTY (available from http://fasta.bioch.virginia.edu) which 
selects a polypeptide based on a comparison of translated novel polynucleotide to known 
polypeptides (W.R. Pearson, Methods in Enzymology, 183: 63-98 (1990), incorporated 
herein by reference). Method B refers to a polypeptide obtained by using a software 
program called GenScan for human/vertebrate sequences (available from Stanford 

30 University, Office of Technology Licensing) that predicts the polypeptide based on a 

probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. 
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Mol. Biol., 268: 78-94 (1997), incorporated herein by reference). Method C refers to a 
polypeptide obtained by using a Hyseq proprietary software program that translates the 
novel polynucleotide and its complementary strand into six possible amino acid 
sequences (forward and reverse frames) and chooses the polypeptide with the longest 
5 open reading frame. 

Novel Nucleic Acid Sequences Obtained From Various Libraries 
A plurality of novel nucleic acids were obtained from cDNA libraries prepared 
from various human tissues and in some cases isolated from a genomic library derived 
from human chromosome using standard PCR, SBH sequence signature analysis and 

10 Sanger sequencing techniques. The inserts of the library were amplified with PCR using 
primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 

15 In some cases, the 5' sequence of the amplified inserts was then deduced using a 

typical Sanger sequencing protocol. PCR products were purified and subjected to 
fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 
a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 

20 extend the sequence in the 5* direction. 
Novel Contigs 

The contigs or the nucleic acids of the present invention, designated as 791 SEQ 
ID NO: 1-5,822 were assembled using an EST sequence from Hyseq's database as a seed. 
A recursive algorithm was used to extend the seed EST into an extended assemblage, by 

25 pulling additional sequences from different databases (e.g., Hyseq's database containing 
EST sequences, dbEST version 1 15, gb pri 1 15, and UniGene version 103, and exons 
from public domain genomic sequences predicted by GenScan) that belong to this 
assemblage. The algorithm terminated when there were no additional sequences from the 
databases that will extend the assemblage. Further, the inclusion of component sequences 

30 into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 
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Table 21 (identified as Table21(791).doc on the enclosed CD) shows the various 
tissue sources of the EST sequences from Hyseq's database which were used to assemble 
the contigs or nucleic acids of the present invention (identified by 791 SEQ ID NO: 1- 
5,822). 

5 The nearest neighbor result for the assembled contig was obtained by a F ASTA 

version 3 search against Genpept release 115, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
contains the translated amino acid sequences for which the assemblage encodes). The 
10 nearest neighbor results for 791 SEQ ID NO: 1-5,822 are shown in the Table 22, 
(identified as Table22(791).doc on the enclosed CD) infra. 

6.13 The 792 Gene Family 

Table 23 sets forth the novel predicted polypeptides (including proteins) encoded 

15 by the novel polynucleotides (792 SEQ ID NO: 1-8,502) of the present invention, and 
their corresponding nucleotide locations to each of 792 SEQ ID NO: 1-8,502. Table 23 
also indicates the method by which the polypeptide was predicted. Method A refers to a 
polypeptide obtained by using a software program called FASTY (available from 
http://fasta.bioch.virginia.edu) which selects a polypeptide based on a comparison of 

20 translated novel polynucleotide to known polypeptides (W.R. Pearson, Methods in 

Enzymology, 183: 63-98 (1990), incorporated herein by reference). Method B refers to a 
polypeptide obtained by using a software program called GenScan for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that 
predicts the polypeptide based on a probabilistic model of gene structure/compositional 

25 properties (C. Burge and S. Karlin, J. Mol. Biol., 268: 78-94 (1997), incorporated herein 
by reference). Method C refers to a polypeptide obtained by using a Hyseq proprietary 
software program that translates the novel polynucleotide and its complementary strand 
into six possible amino acid sequences (forward and reverse frames) and chooses the 
polypeptide with the longest open reading frame. 

30 The isolated polypeptides of the invention include, but are not limited to, a 

polypeptide comprising any of the amino acid sequences set forth in Table 23 or from six 
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frame translations of 792 SEQ ID NO: 1-8,502; or the corresponding full length or 
mature protein. One of skill in the art could determine the corresponding amino acid 
sequence using techniques well known in the art to translate and analyze all possible six 
frames. Polypeptides of the invention also include polypeptides with biological activity 
5 that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth 
in the 792 SEQ ID NO: 1-8,502; or (b) polynucleotides that hybridize to the complement 
of the polynucleotides of (a) under stringent hybridization conditions. Biologically or 
immunologically active variants of any of the polypeptide sequences in Table 23 or from 
six frame translations of 792 SEQ ID NO: 1-8,502, and "substantial equivalents" thereof 

10 (e.g., with at least about 65%, 70%, 75 %, 80%, 85 %, 90%, 95 %, 98% or 99% amino 
acid sequence identity) that preferably retain biological activity are also contemplated. 
The polypeptides of the invention may be wholly or partially chemically synthesized but 
are preferably produced by recombinant means using the genetically engineered cells 
(e.g. host cells) of the invention. 

15 . Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared 
from various human tissues and in some cases isolated from a genomic library derived 
from human chromosome using standard PCR, SBH sequence signature analysis and 
Sanger sequencing techniques. The inserts of the library were amplified with PCR using 

20 primers specific for the vector sequences which flank the inserts. Clones from cDNA 
libraries were spotted on nylon membrane filters and screened with oligonucleotide 
probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered into groups 
of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a 

25 typical Sanger sequencing protocol. PCR products were purified and subjected to 

fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using 
a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In 
some cases RACE (Random Amplification of cDNA Ends) was performed to further 
extend the sequence in the 5' direction. 

30 Novel Contigs 
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The contigs or the nucleic acids of the present invention, designated as 792 SEQ 
ID NO: 1-8,502 were assembled using an EST sequence from Hyseq's database as a seed. 
A recursive algorithm was used to extend the seed EST into an extended assemblage, by 
pulling additional sequences from different databases (e.g., Hyseq's database containing 
5 EST sequences, dbEST version 1 15, gb pri 115, and UniGene version 103, and exons 
from public domain genomic sequences predicted by GenScan) that belong to this 
assemblage. The algorithm terminated when there were no additional sequences from the 
databases that will extend the assemblage. Further, the inclusion of component sequences 
into the assemblage was based on a BLASTN hit to the extending assemblage with 
1 0 BLAST score greater than 300 and percent identity greater than 95%. 

Table 24 shows the various tissue sources of the EST sequences from Hyseq's 
database which were used to assemble the contigs or nucleic acids of the present 
invention (identified by 792 SEQ ID NO 1-8,502). 

The nearest neighbor result for the assembled contig was obtained by a FASTA 
15 version 3 search against Genpept release 115, using FASTXY algorithm. FASTXY is an 
improved version of FASTA alignment which allows in-codon frame shifts. The nearest 
neighbor result showed the closest homologue for each assemblage from Genpept (and 
contains the translated amino acid sequences for which the assemblage encodes). The 
nearest neighbor results for 792 SEQ ID NO: 1-8,502 are shown in the Table 25, infra. 

20 
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table 5 Nearest neighbor (FastA v. Genbank, Genpept release 114) 


SEQ 

ACCESSION 

DESCRIPTION 

SMITH- 

% 

ID 

NO. 


WATERMAN 

IDENTITY 

NO. 



SCORE 



1 

L27428 

Homo sapiens reverse transcriptase 

253 

32 .129 

2 

X97675 

Homo sapiens plakophilin 2b 

167 

50.794 

3 

U49082 

Homo sapiens transporter protein 

1001 

56 .122 

4 

U17247 

Saccharomyces cerevisiae Imhlp 

164 

25 .217 

5 

Z38061 

Saccharomyces cerevisiae mal5, stal, 
len: 1367, CAI : 0.3, AMYH_YE AS T P08640 
GLUCOAMYLASE SI (EC 3.2.1.3) 

320 

26.144 

6 

AF080234 

Human endogenous retrovirus K polymerase 

330 

46 .774 

7 

M13101 

Rattus norvegicus unknown protein 

265 

49.398 

8 

M12140 

Homo sapiens envelope protein 

458 

40.865 

9 

U49974 

Homo sapiens mariner transposase 

545 

78.571 

10 

AB014549 

Homo sapiens KIAA064 9 protein 

588 

81.818 

11 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

266 

65.789 

12 

AB007903 

Homo sapiens KIAA044 3 

1527 

65 .445 

13 

X78933 

Homo sapiens zinc finger protein 

465 

65.979 

14 

AF081111 

Mus musculus domes ticus ORF2 

276 

63 . 158 

15 

AF081111 

Mus musculus domesticus ORF2 

178 

46.341 

16 

AB012223 

Canis familiaris ORF2 

271 

49.485 

17 

X03725 

Mus musculus ORF 2 (466 aa) 

222 

43.902 

18 

U49974 

Homo sapiens mariner transposase 

870 

71.491 

19 

D49677 

Homo sapiens U2AF1-RS2 

476 

67.391 

20 

AJ001714 

Homo sapiens Myosin- IXA 

261 

58.750 

21 

U93572 

Homo sapiens putative pi 50 

295 

50.000 

22 

AB012223 

Canis familiaris ORF2 

182 

43 .434 

23 

X53581 

Rattus norvegicus ORF4 

176 

40 .230 

24 

L27428 

Homo sapiens reverse transcriptase 

223 

35.714 

25 

U93568 

Homo sapiens putative pl50 

227 

65 .574 

26 

Y12713 

Mus musculus Pro- Pol -dUTPase polyprotein 

285 

52 .542 

27 

AF003535 

Homo sapiens ORF2-like protein 

252 

42 .149 

28 

AF123881 

multiple sclerosis associated retrovirus 
element gag polyprotein 

549 

84 .821 

29 

AF123881 

multiple sclerosis associated retrovirus 
element gag polyprotein 

391 

62 .500 

30 

AB022046 

Cynops pyrrhogaster alphal type II 
collagen 

151 

35.294 

31 

AF015539 

Mytilus edulis precollagen P 

265 

32 .950 

32 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

367 

57 . 983 

33 

AL032660 

Caenorhabditis elegans predicted using 
Gene finder 

930 

37 . 174 

34 

AJ005073 

Mus musculus Alix 

2666 

88 . 565 

35 

AL117237 

Homo sapiens hypothetical protein 

523 

72 .656 

36 

U93570 

Homo sapiens p4 0 

190 

33 .333 

37 

U09116 

Homo sapiens ORF2 , encodes a reverse 
transcriptase homolog 

248 

47 .107 

38 

M34651 

Pseudorabies virus ORF- 3 protein 

159 

40 .206 

39 

L76559 

Drosophila melanogaster mus308 

271 

51.724 

40 

X06745 

Homo sapiens DNA polymerase alpha - 

382 

68.317 
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subunit (AA 1 - 1462) 



A 1 
4 1 

TTO OTTO 

Rattus norvegicus ORF2 consensus 
sequence encoainy enaonucicase duu 

X. C; V G X. O C LLailo^JL XpLctoc HlJ.il LIS i\X m dot-il 

161 

39 . 831 

42 

TTQ QQC Q 

TV t^-sj 4 f^OTO c? "5 C3 ■ff - V»rjl T T1 J^l O -1 w\ ■y ~l a y~ t~ /~\ -j a f - 
rtiaUiuupoia L-llcA-L J.ca11c1 oxlllllai UC_> y caiiL 

crt^nf^val npaahi vp rpaiil a t-or* of 7 

transcription subunit 1 

/ 

38 . 435 

43 

AB012223 

faniq f amiliari «? ORF9 

33 7 

JO . ±z z 

44 

X52235 

Homo sapiens ORFII 

248 

46 .429 

45 

U93574 

Homo nairi ptiq tyi i t~ a t~ "i vro nl ^ tTl 

ajU 

4 / . 7Uo 

46 

an oi 2 22 ^ 

rtD v X /j J 

Vvdiixo xaiiixxxax Xs \JX\.r 

JDl 

/I C n AO 

4 7 

Ml 

Ml i d mi i ci 1 1 1 1 ct O Q C C tcj t~ Vic* riric "i t~ "l /"^r^ r~\ "P t~ V> a 
HUo llluaLUlUo Z O J J X. o L-lit: jJUolUXUIl Ul U lit; 

f iirst start codon in OR F O * mit*£i t* "i vp 

X. -1- X. I* iJ L- X- V— \^ w IX Wl X J.11 ^* y \~J LX l_ CX UX V C 

Ifi i7 

4U . 000 

48 

X65551 

Homo car^i ^> n <3 ant "i npn o "F t-Vio monripl nnal 

antibody Ki-67 


/ O .4 / 1 

49 

X03725 

Mus musculua ORF 2 (4 66 aa) 

291 

co /r 

50 

U23857 

Herpesvirus papio EBNA1 

125 

36.782 

51 

AF003 53 5 

Hnmn saioi pnci OPP9-1 t lr*^ nrntpin 
nviuv oauicuo v/i\f ^ x XJ\C JL/ x. ^ c; x. 1 1 

^ ^ V/ 

AO O Q 

52 

AF003535 

Homo sapiens ORF2-like protein 

233 

42 . 574 

53 

fUJU J. /• /. Z J 

V-dlllo XalllXXXax Xs UKrz 

-3 i> / 

53 . 077 

54 

X0314 5 

nuiuu odjjxcno pui . urtr x. 

O "7 n 

r a COO 

C C 

■M..T VJ / X. U O 1 

riycoDaCLerxuin tuDercuiosis proixne-ricn 
mucin homo log 

too 

39 . 000 


M£ il7Q^ 

Katuus norveyxcus salivary proiine-ricn 

f~} X w LCXll 


35 . 115 

57 

AF167320 

Mus musculus zinc finger protein ZFP113 

742 

55.026 

z> o 

tto "inpr 

ivx'cuyuoL'cxxuin umcjiaeuni LimA 

zUU 

2 9 . 126 



rionio sapiens Herpesvirus associaueo 
ubiquitin-specif ic protease (HAUSP) 

coo 

69.173 


TAE'mPpO p 

noiiio sapiens uann 

r r t 

71 .429 

D X. 

bio 7DD 

rsos tduius pyruvace aenyoiirogenase 

yJ L 1U C3 JJi id Udoc 

-iio y 

90 .685 

62 

f\f v O J. -1. J. ^» 

Ml 1 ct mi t a f-»i il 110 Hr*vmoQr* "5 i ci OI? TTO 

IIUO IIIUDL»UXUkj vjlk^l 1 It3 O X. UKr z 

zoo 

AO 1 CO 

63 

AF041330 

Bodo saltans NADH dehydrogenase subunit 
5 

166 

38.053 

64 

AF14 9422 

Homo <^ ^ o "i f> in q unVnr>un 

OnO 

^ u / 

AT O O Q 

4X . zzo 

65 

L27428 

Homo Rani f^npi T*« a v^T* c if a tran^rri ntacp 

1 Ivy* 1 1 W Quk/ -X vll fcj X. ^ — ' V X C-J ^ — . w X CI 1 X X- X. LJ ^ CX O 

1 7f) 

X. / V 

An 9in 

66 

AF003535 

Homo sapiens ORF2~like protein 

292 

56.731 

67 

■f-VkJ UU1 JO J 

Homo QaT"i i PTi q l mmi itinnl oVm il i n Vi ^ a if-vr /^K>a ■> n 
rxv^miu iaapicus xiuuiuiiuy xujjuxxii i it: dvy ciiciX.il , 

rnn<?t ant roni on 

V» ull O U. CXXX 1 — X- V-j J. Wll 

DDfi 

O O Q O C 

68 

AB012223 

Canis f amiliaris ORF2 

OOC 

OO rtrtfl 

69 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

148 

41.379 

70 

U83 303 

Homo cani pn q "| inp_l vpvpvcp 
nui i ivj oapxeilo line x. Icvcioc 

t iran s c y i d t a s e 

1^ i7 

O Q ATA 

71 

X03145 

Homo saDiens not ORF T 

217 

A 1 C O Q 
MX • ju 7 

72 

AF003535 

Homo sapiens ORF2-like protein 

197 

44.231 

73 

U83303 

Homo Qani pnq 1 i np-1 T*"<=*"\r«aT*c;«a 

transcriptase 

jX4 

DO . 1*1 v 

74 

X5223 5 

Homo saDiens ORFTT 

lb UZ 

co p o n 

75 

L49380 

Homo sapiens transcription factor ZFM1 

232 

28 . 261 

76 

AF078035 

Homo sapiens translation initiation 
factor IF2 

385 

62.376 

77 

M34059 

Homo sapiens beta-globin 

283 

75.000 

78 

M11841 

simian type D virus 1 pol protein 

234 

37.879 | 

79 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

190 

40.000 
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80 

M22333 

Homo sapiens unknown protein 

215 

42 .857 

81 

D86850 

Homo sapiens HADHB 

163 

58 . 491 

82 

M22334 

Homo sapiens unknown protein 

s. EL 

230 

60 . 563 

83 

S80119 

Rattus sp . reverse transcriptase homolog 

264 

48 . 000 

84 

AF081114 

Mus musculus domesticus ORF2 

213 

47 . 826 

85 

U09116 

Homo sapiens ORF2 , encodes a reverse 
transcriptase homolog 

244 

42 . 857 

86 

AJ010479 

Homo sapiens kinesin-like protein 

464 

69 . 091 

87 

X61296 

Rattus norvegicus open reading frame 2 

240 

41 . 085 

88 

Z24734 

Meloidogyne incognita cutioular collagen 

165 

34 .416 

89 

U72069 

Homo sapiens karyopherin beta2 

538 

67.361 

90 

M55409 

Homo sapiens pancreatic tumor- related 
protein 

273 

59.494 

91 

X07881 

Homo sapiens proline-rich protein Gl 

205 

3 8 . 182 

92 

U22961 

Homo sapiens similar to human albumin 
Swiss-Prot Accession Number P02768; 
Method: conceptual translation supplied 
by author 

42 8 


93 

D89053 

Homo sapiens Acyl-CoA synthetase 3 

683 

66.667 I 

94 

AF006514 

Homo sapiens CHD2 

293 

58 . 242 

95 

X59841 

Homo sapiens homeobox protein 

648 

70.130 

96 

M74816 

Homo sapiens sulfated glycoprotein- 2 

413 

76.829 

97 

X58251 

Mus musculus pro- alpha -2 (I) collagen 

190 

31 . 579 

98 

Z71173 

Mus musculus inositol 1,4,5- 
trisphosphate receptor type 2 

375 

66 .667 

99 

S80119 

Rattus sp. reverse transcriptase homolog 

183 

40.625 

100 

U22055 

Homo sapiens 100 kDa coactivator 

402 

76.667 

101 

X14690 

Homo sapiens lambda HuHITI-13 

209 

51 . 899 

102 

AF003535 

Homo sapiens ORF2-like protein 

BL EL 7L-Z. . t 

170 

34 . 234 

103 

M22333 

Homo sapiens unknown protein 

214 

43 .519 

104 

L11672 

Homo sapiens zinc finger protein 

685 

44 . 033 

105 

M22333 

Homo sapiens unknown protein 

261 

43.750 | 

106 

AB002312 

Homo sapiens KIAA0314 

433 

69.903 

107 

X55777 

Homo sapiens put. ORF 

309 

68.056 1 

108 

AF009668 

mil 1 t~ "i T) 1 Qrl PT*ncii q aQQnP"i ahpH rohmv-i rno 
hi*-* -*» \~ JL^f -A. ^ o ^ .L c; ±. <w» o j. £s aoouuxaLcu ic^iUVliUb 

polyprotein 

41 n 

11U 

DU.J// 

109 

U22961 

Homo sapiens similar to human albumin, 
Swiss-Prot Accession Number P02768; 
Method: conceptual translation supplied 
by author 

532 

76 . 364 

110 

AB012223 

Canis familiaris ORF2 

269 

41.497 

111 

L06147 

Homo sapiens golgin-95 

752 

57.752 

112 

X61048 

Hydra sp. mini -collagen 

141 

42 . 391 

113 

L27428 

Homo sapiens reverse transcriptase 

286 

45 . 299 

114 

X82153 

Homo sapiens Cathepsin O 

308 

54 . 167 

115 

D13629 

Homo sapiens KIAA0004 

169 

42 . 000 

116 

AF123881 

multiple sclerosis associated retrovirus 
element gag polyprotein 

409 

41 . 071 

117 

AB019602 

Homo sapiens IDN3-B 

237 

53.012 

118 

AF155099 

Homo sapiens NY-REN- 18 antigen 

237 

47 .423 

119 

AB020660 

Homo sapiens KIAA0853 protein 

153 

31.304 

120 

AJ249625 

Paracentrotus lividus Chaperonin 

413 

63.208 

121 

M22334 

Homo sapiens unknown protein 

282 

49.565 

122 

X70391 

Mus musculus inter-alpha- inhibitor HI 
chain 

170 

77 . 778 
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123 

D88899 

Mus musculus kidney- derived aspartic 
protease-like protein 

242 

66.071 

124 

U93572 

Homo sapiens p4 0 

1 CO 
lj" 

~x n r 

O V . O J J 

125 

AL117200 

Caenorhabditis elegans predicted using 
Genet lnder ; preliminary preaiccion 

X 27 J 

OO • \J Zs -J 

126 

AL117200 

Caenornaoaitis eiegans preaiccea using 
Genetinaer; preliminary pxrecij.Q,L.j.uii 

1QT 
JL O O 

37 . 349 

127 

AJ3 8 8 555 

Cams ramiiiaris nypotnecicai prymin 

1720 

75 . 703 

128 

L2742 8 

Homo sapiens reverse uxaimL.1. lptaoc 

226 

37 . 313 

129 

U93567 

Homo sapiens pucacive pisu 

470 

49 . 524 

130 

AL050060 

Homo sapiens hypothetical protein 

286 

64.000 

131 

Y12713 

Mus musculus Gag polyprotein 

270 

44 . 444 

132 

M58704 

Homo sapiens 12 -lipoxygenase 

306 

60 . 204 

133 

L19713 

Homo sapiens dematin 

236 

68 . 182 

134 

L36120 | 

Medicago sativa proline rich protein 

172 

39.655 

135 

L26245 

Homo sapiens effector cell protease 
receptor 1 

259 

47.619 

136 

AB028997 

Homo sapiens KIAA1074 protein 

286 

44.697 

137 

AF075575 

Homo sapiens dysferlin 

O A O 

c q "7 n T 
J?. / Vi 1 

138 

L29028 

Unknown amino acid feature : N- 
glycosylation sites, aa 41 . . 43, 46 . . 
48, 51 . . 53, 72 . . 74, 107 . 

lb 3 


139 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

485 

47.584 

140 

AB012223 

Canis familiaris ORF2 

323 

46.226 

141 

X97630 

Homo sapiens serine/threonine protein 
kinase 

322 

3 9-4 /4 

142 

D83776 

Homo sapiens The KIAA0191 gene is 
expressed ubiquitously. ; The KIAA0191 
protein retains the C2H2 zinc-finger at 
its N- terminal region. 

1932 

b3 . 1UJ 

143 

U80846 

Caenorhabditis elegans No definition 
line found 

lb -5 

-to . u / u 

144 

U60269 

Homo sapiens putative envelope protein; 
orf similar to env of Type A and Type B 
retroviruses ana co ciass ±± ruc*jKvt> 


6 9 079 

145 

X13885 

Nicociana caDacum exuensin v /-w-v. j. o^u; 

293 

26 . 138 

146 

AB012223 

cams ramiiiaris uKr^ 

352 

44 . 509 

147 

AF003535 

Homo sapiens UKtz-iiJce protein 

248 

47 . 423 

148 

AB012223 

cams ramiiiaris uKr/ 

276 

51 .685 

149 

AB014574 

Homo sapiens KIAA0674 protein 

4403 

96.296 

150 

X98494 

Homo sapiens M pnase pnospnoprouem 

254 

74 . 194 

151 

AB012223 

cams ramiiiaris ukcz 

482 

47 . 514 

152 

M80537 

urosopmia meianogaster tai. ptuuein 

549 

35 . 043 

153 

X51622 

caenornaoaitis eiegans coiiagen 

213 

36 . 702 

154 

X87629 

Homo sapiens nicotinic acetyiuiuiiiie 
receptor 

138 

29 . 365 

155 

ABOlll^b 

riorno sapiens AiaHUDJi piuucxii 

294 

46 .207 

156 

A55126 

rlUS muscuius Zii:p-^i7 

522 

44 . 907 

157 

M61120 

Homo sapiens loricrin 

183 

32.240 

158 

U02313 

Mus musculus protein kinase 

238 

51.807 

159 

AB002342 

Homo sapiens KIAA0344 

299 

32 .203 

160 

AF003535 

Homo sapiens ORF2-like protein 

246 

47.573 

161 

U95090 

Homo sapiens F19541 1 

1239 

59.950 

162 

X52235 

Homo sapiens ORFII 

405 

54 .478 


W3 


;£. O O O «4 i& 'Ml- 3 , O E!! E!! 6 CI S 


163 

AB012223 

Canis familiaris ORF2 

448 

51. 748 

164 

AB017600 

HaliotlS disCUS CollaOPn nro al nha-r»hai n 
-*- vaxquuo \_w-L ±ay Cli piu ci -J- \~) lid k_ 1 1 c* JL 1 1 

± i 3 

28 . 992 

165 

M13002 

MllS ItlUSClllus 28^*5 1 ci fViP* -r\r\<=i i t-i nn r-\-f t-Via 

first start codon in ORF 2 ; putative 

*i o 1 

iol 

60 . 177 

166 

X71602 

Nicotiana tabacum pxten^in 


36 . 364 

167 

AL117200 

Caenorhabditia elpaan<j nrpHi r^t-*=H n a s 
Genefinder; oreliminarv nrprtiri-inn 


34 . 641 

168 

X52235 

Homo sapiens ORFII 

231 

39. 024 

169 

M31524 

Saccharomvces cprpvisiap pppi <: nont- -5 Ho. 
(put. helicase) ; putative 

"3 "5 T 

34 . 254 

170 

AB012223 

Canis familiaris ORF? 

AAA 

444 

52.841 

171 

AB012223 

Canis familiaris nppo 

£" 1 /- 

40.294 

172 

M92040 

Stroncivlocentrotiis ryiir*miY*at-iiQ :=» 1 r-»Vi=a - 1 
collagen 

Z J4 

30 . 213 

173 

AF084205 

Rattus norvegicus serine/threonine 
protein kinase TAOl 

oqc 

O 270 

85 . 625 

174 

M63595 

Xenopus laevis alpha- 1 type II collagen 

351 

30.116 

175 

AF085809 

Mus musculus synapsin lb 

*> r» 

33 .880 

176 

AC004794 

Homo sapiens Acetolactate synthase 

253 

49.180 

177 

AL049482 

ArabidODS i s t*hal l^m^ nnt"ah T iro nrnf- c» -> t-» 

A O *7 

4 8/ 

42 . 366 

178 

U09413 

Homo sapiens zinc finger protein ZNF135 

906 

53.138 

179 

AL023781 

■jv^iiX£.WDau^iiaxuuiycco puitujc W LclTlllIiai 

acetyl transferase 1 

816 

41.212 

180 

Y14685 

dOT"iC31 CI t~Vli*T T ana vn^nI i/niinl &/-v#- -i y-3 

&-*-*j-\A\j£j£} j. o Liia j. -Ldiia poiynucxSOuiQu 
pho sphory 1 a s e 

61 

41.379 

181 

M80341 

nuiiiu o c*jj.l t£ii» uKrz concains a reverse 
transcriotase domain • drpo 

477 

51.282 

182 

AF003131 

CaPnOrhrihrl I t" i q pi prrariQ C~* ol nrrana TTXTO O Q 

\.c*ciiuj.iiajjux lx& c j. tr y ctiitf V—. clcUallS UWL~oy 

(GB:U33058) (NID-ollfiOlSR) 

282 

25 . 153 

183 

AL022374 

Streotomvces coelicolor m 1 1- ^ t- -i tt*> anp 
transporter 

TOT 

40 . 299 

184 

U93569 

Homo sapiens putative pl50 

O O D 

55 . 2 94 

185 

U93568 

Homo sapiens putative Jpl50 

c: /r 

Z JO 

3 8.583 

186 

X64697 

Homo sapiens titin 


82 . 724 

187 

AB025259 

Mus musculus granuphilin-b 

<p v X 


188 

D14663 

Homo sapiens KIAA0107 


ac n c c 

189 

AJ007798 

Homo sapiens nuclear protein SA3 


CO (\ A 1 

190 

AB018281 

Homo sapiens KIAA0738 protein 


A r\ q —i c 

191 

U07973 

Gallus gallus alpha-1 collagen type III 

207 

30. 189 

192 

M35547 

Human herpesvirus 4 LF3 gene product 

4S 27 fx 

27 . 523 

193 

U53445 

Homo sapiens DOC1 


zo.205 

194 

U76618 

Mus musculus N-RAP 

2955 

83 .422 

195 

M25984 

Gallus crallus aloha - 2 f vnp T rn1 1 aapn 

i n c 

28 . 6 34 

196 

U41557 

Caenorhabditis elegans proline and 
glycine -rich 

437 

35.018 

197 

AB028997 

Homo sapiens KIAA1074 protein 

•511 



50 . 000 

198 

U59694 

Homo sapiens zinc fincier nrnfp S n 
basonuclin 


44 . 853 

199 

Y10392 

Human endogenous retrovirus K gag 
protein 

r O X 

4y . bz>0 

200 

AF167320 

Mus musculus zinc finger protein ZFP113 

536 

52 .288 

201 

X92485 

Plasmodium vivax pval 

269 

59.036 | 

202 

U47856 

Araneus diadematus fibroin-4 

204 

29.588 

203 

AB020708 

Homo sapiens KIAA0901 protein 

406 

29.260 

204 

AF056936 

Plasmodium falciparum mature parasite- 

240 

21.245 
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205 

Z75330 

Homo saoiens nurlpar nrnt"pin cjti-.i 

1 ox 

48 . 810 

206 

X92887 

Human endocrenou *3 T~<=»t* r^ovi ruq fc r\r\i /<=»mr 

o y j 

41 . 833 

207 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

239 

31.274 

208 

AC005990 

majjiuuyoxo uiiaxXaila oLIUUy HiniXlariuy 

to PFAM PF 1 0006 9 Eukarvotic nrofpi'n 
kinase domain . 

o c c 

34 . 545 

209 

U93563 

Homo sapiens putative pl50 

1103 

53.670 

210 

X14420 

Homo saDiens ni'pnro-fl 1 nha - 1 fvnp "X 

collagen 

Jul 

28 . 918 

211 

AF133730 

Rattus norveaicus SI it* 2 

.5 3 Z7 

2 9.730 

212 

AL080123 

Homo saDiens hvDOthpt* "i r";=i1 nrnt-pin 

D £. / 

47 . 391 

213 

AL021918 

Homo saoiens b34I8 1 ( Knmnp 1 rpl^t-pH 
Zinc Finger protein 184) 

~i a a n 

56 .587 

214 

U35245 

Rattus norvegicus vacuolar protein 
sorting homolog r-vps3 3b 

7 JX 

c a on A 

215 

AB007860 

Homo sapiens KIAA04 00 


Z / . Dll 

216 

AJ010585 

Rattus rattus PTB-like protein 

1304 

80.216 

217 

M2 78 78 

Homo SaDiens DNA hindino nrnhpi n 

T *7 O 1 
X / ^ X 

69 . 252 

218 

U59655 

Pithecia pithecia MHC class I Pipi-G*04 

431 

69.474 

219 

AF159296 

JJ /^ IJ r t; ' ta - L ^* u ' 11 cbcuicntuin excensin-iiKe 
protein 

34 0 

33.333 

220 

AF010144 

xxv^mw oayxcilo llcui Ulld x. L-Xlx trclCX prOtSXIl 

AD7C-NTP 

2 8 7 

61 .458 

221 

AF071081 

x ly ^uxja^ lci J.UIU cuoercuiosis proline - ricn 
mucin homolog 

325 

26.979 

222 

X0602 1 

Xpnnnnn 1 aavi a vf -J n t- oi n / 7\ 7\ i 

ACiiu|jub idcvib A.x,m proccin vAr* J. - 
1350) 

1093 

36 . 266 

223 

AF010144 

Homo saDipns npnronal 1-Viy-<=>^H r^-r~r^t- e» -i n 
AD7C-NTP 

*5 O O 

64 . 646 

224 

AF052663 

Xenopus laevis gamma- tubulin interacting 
protein 

294 

29.528 

225 

AB017600 

Haliotis discus collagen pro alpha-chain 

252 

29.487 

226 

AB023065 


431 

61 . 429 

227 

M22 334 

Homo saDipns iinknown nrnh Pin 

oo4 

38 .438 

228 

U41557 

Caenorhabditis elpaanq nrnl inp anH 
glycine -rich 

T A 1 

30 . 530 

229 

S57132 

Homo SaDieilS tvnp YVT r'ol 1 aopn alnha T 

AAv^iiiv^ oa^xciio ^ y C. V X v^k-'X, X C*y till ol X LJ 1 1 cJ. X 

chain, alpha 1 (XVI) 

O "5 "7 

29.860 

230 

U97553 

murine herpesvirus 68 unknown 

262 

32 .899 

231 

X89453 

Rattus norvegicus DRPLA 

ZUl 

-> -f oca 

232 

AL118514 

Streotomvces coelicolor A3 (2) dna 
polymerase III subunit gamma 

lb J 

O "7 ACQ 

233 

D29642 

Homo sapiens KIAA0053 

312 


234 

AF153062 

Canis familiaris type I collagen pre- 
pro-alphal (I) chain 

218 

33.333 

235 

AF032103 

Homo sapiens ataxin-7 

313 

-5 -7 . J. D 

236 

AB002319 

Homo sapiens KIAA0321 

179 

~XC O *7 

237 

Y12713 

Mus musculus Pro -Pol -dUTPase polyp rot ein 

JX J 

"71 Qin 
/ ± . yju 

238 

U87607 

Rattus norvegicus putative RNA binding 
protein 1 

299 

46 .667 

239 

X53581 

Rattus norvegicus ORF3 

797 

49.858 

240 

Z11922 ! 

Canis familiaris Ran 

383 

81.579 

241 

M77194 

Rat leukemia virus polymerase 

261 

44.094 

242 

Z34465 

Zea mays extensin-like protein 

292 

32.527 
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243 

AF017178 

Homo saoiens oro aloha i (t) rnllanpn 

IOC 

log 

2 9.344 

244 

AC008075 

Arabidopsis thaliana F24J5.4 

193 

34 .932 

245 

U4 1 5 5 7 

Caenorhabdi t is elecian** hi Qt- \ Hi np-ri r»H 


31 . 646 

246 

U76618 

M\is musculus N— RAP 

"1 A *7 ft 

47 . 599 

247 

AC004561 

Arabidopsis thaliana putative proline- 
rich protein 

219 

28.041 

248 

X07441 

Turnio vellow mosaic viTii«* fiQK npp fzia 
1-628) 


3 0 . 08 5 

249 

AB012223 

Canis familiar is ORF2 


48 . 592 

250 

AF000657 

Arabidopsis thaliana hypothetical 
protein 

249 

28 .718 

251 

AB020629 

Homo saoiens KIAA0822 Drofpin 

XU r Z 

o4 . 103 

252 

AL118514 

Streptomyces coelicolor A3 (2) DNA 
tDolvmerase tit auhimi f rrflmma 

163 

35.811 

253 

X66358 

Homo SaDlPnfl SPKinp/hhrpnninp nmhpin 

kinase 

r» 

8 8.571 

254 

U09413 

Homo saoiens zinc fincrer* nrotpin znpi m 



255 

AL079308 

Streptomyces coelicolor putative 
serine/threonine protein kinase 

168 

42 . 593 

256 

AF169388 

Mus musculus alTlh^J 4 col "1 ;»rr<=»« T\T 


2 7 .609 

257 

U93564 

Homo sapiens putative pl50 

260 

37.727 

258 

M12140 

Homo sapiens envelope protein 

395 

59.322 

259 

V \J *± 17 J J 

numo bdpiens supporuea oy hoiS Id1992 
\iiiiL/,yDo jz jd/ ana wzoyDU vi>J.LiJ:gx3U/lo// 
and Genscan 

268 

64 . 103 

260 

Z82268 

Unknown cDNA EST yk338gl0.5 comes from 
this gene; cDNA EST EMBL:D27 934 comes 

f" ynm t~h"i ct opnp * /— T^MZi T? 

517 

30.081 

261 

M26927 

Gibbon leukemia virus pol polyprotein 

557 

50.754 

262 

AB011152 

nvjuiu aapxciio IVX/uivDOU piutcin 

712 

44 . 747 

263 

X87226 

Jaculus oriental is NAD-dependent 
y ±y ^ci d lueiiyutj j-pnospnate aenyorogenase 
(phosphorylating) 

214 

47.321 

264 

U44091 

Rattus norvegicus atrophin-1 related 

nr*ot pin 

178 

33.511 

265 

Z98980 

Qrhi 70c5Arph3rrtiTivrpQ nr»mho t»i -J el-A*-f . 

aldrich syndrome protein homolog 1 

O O *~7 

26 . 070 

266 

Z79694 

v.ac:iiui.iiauuJ. txo c±cy alio picUlULcQ US lily 

Genefinder; similar to collagen; cDNA 
EST vk552al2 3 comes from this cf^n«=» 

158 

29 . 333 

267 

AF144627 

Mus musculus SLIT1 

■377 

OQ "ICQ 

268 

D00824 

Gallus cxallus aloha 1 chain r>f hvnp ytt 
collagen 


An ace 

269 

AB011414 

Homo sapiens Kruppel-type zinc finger 
protein 

— *L 

792 

3D . D DU 

270 

X52046 

Mus musculus type III collagen 

481 

32 .472 

271 

M11723 

Homo sapiens coagulation factor XII 

1618 

O X . J / -7 

272 

M22334 

Homo sapiens unknown protein 

755 

*± -J . *± O 3 

273 

X69490 

Homo sapiens titin 

214 


274 

M22334 

Homo sapiens unknown protein 

360 

50 . 000 

275 

U35022 

Rattus norvegicus cis-Golgi matrix 
protein GM130 

323 

47.097 

276 

X03725 

Mus musculus ORF 2 (466 aa) 

287 

48.855 

277 

L08811 

Drosophila melanogaster adherin 

662 

32 .822 

278 

K02444 

Oryctolagus cuniculus beta-myosin heavy 

385 

46.753 
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chain 



279 

AB018312 

Homo ^ani ptiq v - T7\Arj'7^ Q r>y^f- ^ ■»-« 

435 

43 .541 

280 

AF071081 

i ij v-uwav, lcl xuiii LUJJt;iLUJ,OS13 pxOXme~ lTlCll 

mucin homo log 

196 

35 . 065 

281 

AB011164 

Homo saoiens KTAA05 9? nrnhPin 

*7 C O 

758 

82 .468 

282 

AF045239 

Homo sapiens brain expressed ring finger 
protein 

310 

28 .205 

283 

AL110218 

Homo nani pnR Vi A/n ot~V»<^t~ "i r» 1 n-i n 

473 

63.380 

284 

U10281 

Sus scrofa gastric mucin 

188 

24.706 j 

285 

U93570 

ncjiiKJ tidyj X tilio puLaClVc ploU 

305 

53.968 

286 

U56964 

v - c * t = i nauui u jlc* eieyans wcajc similarity 

w vcicvioiat: xn Li cUvCXXUlar protein 

transport protein US) 1 (SP:P25386) 

766 

31.866 

287 

U70136 

n(JI|lu ocipxtiiis ineydKaryocyLc SCxmuiaCing 
factor; MSF 

208 

23 . 928 

288 

Z78279 

Rattus norvegicus Collagen alphal 

194 

29.042 

289 

AF085185 

Apanh hamnpha r»a af-pl "1 =s ri *i *i M^/na -5 i-» t t\ 
rvuaiji-iiamucua v^ao LclXallXX l v ly03in~lA 

217 

35 .377 

290 

M33509 

Homo sapiens HLA-B-associated transcript 
2 (BAT2) 

197 

25 .791 

291 

ACO 06 53 0 

TI<"ifY1/ p *» G 7™> "1 d Vfc a 1 » V"* \r j^Y-m 

n<w>iuv^> bdpiens uuKnown 

207 

82 .222 

292 

\J J -5 J O O 

aivmeiia pompejana tiorillar collagen 

C V"i ;n -i n PZVr^l a 1 r-*Vira 
^iiaxii .r f\)~> x dxpilct 

279 

33 .333 

293 

Z38061 

Saccharomyces cerevisiae mal5, stal, 

xcii . -L-J o ' i LAI . U.j, AMxri xiiAJb 1 rUo640 

225 

21.692 

294 

X53556 

i.aui. uo v»ypt2 W. CuXXd^cli 

175 

28 . 221 

295 

M92913 

^^-ir 11 -*- x ^ Lxavipea urdyxxne S 1JLK IlDroin 

319 

28 . 778 

296 

U37012 

Homo sapiens cleavage and 

fWJ. ycau.criiy xaLXUil opcQ,lIXCXCy taCuOl 

781 

88 .194 

297 

L21990 

Homo Qani one cir'kT r^<a/~»c3 /"*im2i 1 >-/^ f-n-l n 

240 

35 . 542 

298 

AJ006754 

Yarrowia lipolytica hypothetical protein 

164 

36.496 

299 

U4 9974 

xauuiu bapxens manner cransposase 

599 

69 . 128 

300 

AF153062 

Canis familiaris type I collagen pre- 

OT*0 — £4 1 Tiln^ 1 / T \ r*Y\ a i t-i 
Jfx axpxiax \ A/ LilaXU 

308 

32.845 

301 

AF055904 

Mwnrnr Otic; Yanh Vino nnVriAnm 

213 

32 . 014 

302 

X13804 

x\.av»i«iao o j^j . iicavy JlcUI lldlllcnL. 

polypeptide (8 54 AA) 

246 

23 .618 

303 

AF000198 

Ca PnDT*lia i h-i c c»nan a C-imi 1 f-y-v 
v.Qcii^i.iiajJUXL.Xo cxtsydlltJ oXIilXXdi CO 

cuticular collagen 

159 

44 . 048 

304 

U97553 

murine herDe^viru*? £ft nnlrnnun 

191 

28 . 994 

305 

M13101 

Rattus norvecricus unknnwn nrrtt*oi'n 


44 . 444 

306 

AF144573 

Mesocricetus auratus Mx- interacting 
protein kinase PKM 

3766 

97.222 

307 

AB012223 

Canis familiaris ORF2 

337 

52.713 

308 

D70831 

Homo Ran"i PriQ 7.in^-f i nnov" y^i>~/~\ f- ai n 
A A ^ 1 » I 1 —' o cxjl^j. CUo OJLUL LlilUcI yXOCGin 

630 

55.208 

309 

U97553 

milir inP hPTDPCj V"i rnc P nnlrnnun 

317 

33.631 

310 

Y17832 

Human endogenous retrovirus K pol 
protein 

1047 

61.686 

311 

D31763 

Homo Saoiens hpiOQ4.fi nrnhpin -J a Vvimnal 

related. 

574 

50 . 000 

312 

X51394 

Xenopus laevis APEG precursor protein 

226 

32.068 

313 

Y10392 

Human endogenous retrovirus K gag 
protein 

273 

44 .144 

314 

AC007842 

Homo sapiens BC331191 1 

1005 

74.129 | 

315 

D63881 

Homo sapiens KIAA016 0 gene product is 

1100 

68.106 
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316 

U35376 

Homo sapiens repressor transcriptional 

773 

57.014 | 

317 

X65964 

Homo sapiens nestin 

3614 

99.313 

318 

M82 977 

Bos taurus alpha-collagen 

228 

29.528 

319 

Ml 4 1 0 ~K 

xiomo sapiens poi / env URr (cases 3 8 78- 
/ / tiisi scare couon at. qx/z; axx ; 
outative 

3111 

70.629 

320 

AF004211 

Mus musculus paired- like homeodomain 
ront*fli ni nor KiT~r**t~£**i n 

335 

40.606 

321 

AL080125 

"vinu oajj/icua iiypu Liit; ticdi procein 

1468 

— — _ 

62 . 893 

322 

M13101 

Rattus norvegicus unknown protein 

288 

35.784 

323 

L11672 

"Wiuu oajjicUb Z. _LI1LJ L 11 ly t; I prOCcin 

270 

40 . 120 

324 

AF084642 

Mi i o mi lapnl no /-* ^ 1 1 n 1 a -*~ -y a -J •»-» — > 1 <-» V» t r ^3 ~ 

* ius uiua l» ui us L-fcixj-uxciir icuinaiacnyue" 
bind"incr nrnhpi n • r^p zv t ,r> r> 

507 

34 .496 

325 

M12140 

Homo sapiens envelope protein 

432 

48.966 

326 

AB01543 8 

^-ymjps pyrrnogasLer aipna x type x 

> — V-/ X -L C* v-j t^li 

293 

31 . 635 

327 

X98705 

HfllTin c? ;a -r-v -J pnn 1 arron ♦""vrr^e* T =» T r-\V» o 1 
oa|JXCiio UwJ. J.ay ell L.yjL?c X aipila X 

347 

32 . 548 

328 

X15332 

xnjiuu oapxmm axpna— X VXXXJ COllaycn 

289 

30 . 361 

329 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 

-i- l}U^ A. Ill 

269 

75.510 

330 

Z54238 

Vs.aciiux. llaUUl U 1 0 t-J-tiydllo 1 Z OLD . X 

184 

41 . 667 

331 

AB0122? 

V-dilJ. t» J.ctHlJ. JL lallb UKrZ 

671 

42 . 784 

332 

U09413 

Homo sapiens zinc finger protein ZNF135 

978 

60.996 

333 


nomo sapxens ajz^tJHii . 3 (zinc txnger 

936 

72 . 928 

334 

AB000462 

Homo sapiens SH3 binding protein 

249 

47.273 

335 

■1—' O V/ V/ \i _7 

nomo sapxens maauih / 

569 

60 . 804 

336 

U41021 

Caenorhabditis elegans C. elegans mec-2 

239 

39.552 

337 

AJ243460 

Leishmania major proteophosphoglycan 

231 

27.246 


TTO "> A Q A 

Caenorhabditis elegans weakly similar to 
aerine/ cnreonine protein Kinase 

667 

35 . 840 

339 

AF01014 4 

rn_)nio sapxens neuronax enreaa protexn 
AD7c-NTP 

437 

59 . 690 

340 

U41557 

\.aciivx najjux L l o c J. cu al lo piUl 111c ci 1 1U 

crlvcine - rich 

444 

33 . 679 

341 

AB012223 

Canis familiaris ORF2 

279 

39.806 

342 

X52235 

Homo sapiens ORFII 

369 

56 .897 

343 

AF108843 

Homo sapiens env protein 

762 

56.696 

344 

X16711 

numu octjtJX trxis V^WXiZ/^X 

289 

31 . 186 

345 

AJ012371 

Homo sapiens NAALADase L» protein 

819 

72.512 

346 

rtD W U X J J 3 

Homo sapiens similar to C. elegans 
hypothetical protein 

OTTTTi IT-IP 1 r*T?f~T\ cpi o *a puce >i m c n j ^ t _ „ 
v-c x uj.no . x , \.r>i^.u a^l^s , j , Lbr . d . Similar 

f~ f\ t~ Y"r~l J5nr1 1~ t - !^* — 1 "5 lrc* nrrvf-oi no 

ctiiu. uip Juj.j\.t: pxotcms 

888 

38 . 795 

347 

M15103 

Plasmodium cynomolgi circumsporozoite 
ant icrf^n 

125 

24 .402 

348 

AF062655 

Mus musculus plenty-of-prolines-101; 
POP101 ; SH3-philo-protein 

470 

25 . 698 

349 

AF081111 

Mus musculus domes ticus ORF2 

301 

38 .255 

350 

L07924 

Mus musculus guanine nucleotide 
dissociation stimulator 

249 

31.799 

351 

L35601 

Drosophila melanogaster ankyrin 

247 ! 

25.092 
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352 

D88764 

Rana. catesbeiana alriha ? t~ t r^^i ~\ nrron 

ivuuu v,a^&ovcxaiia c*. _l J^"* Act l*yj^/ti X l^OXXdy ell 

o c o 

28 . 231 

353 

D10354 

R<3. t tllS rattUS alaninp ami nnfrariQforaoo 

bib 

53.439 

354 

AB014561 

Homo sapiens KIAA0661 protein 

46 

35.294 

355 

U97553 

ITVUiririf* t*l^r*r>f a *3'Vl T"TIS fift nnlrnnwn 


31 . 701 

356 

L27428 

iivtnv "^J^^ciio icvcioc LlallL}l,i JLpLaoc 


43 . 810 

357 

AB012223 

Canis familiaris ORF2 

221 

49.451 

358 

AC003682 

Homo sapiens R28830 1 

669 

54 . 067 

359 

r\C U vv" D D 

Mus musculus lymphocyte specific foirmin 

421 

52.071 

360 

AF015539 

Mytilus edulis precollagen P 

235 

29.258 

361 

277664 

uiijviiuwri preaiccea using benetinaer; 
similar to Zinc finger, C2H2 type; cDNA 

208 

39.623 

362 

Y07752 

Volvox carteri pherophorin-S 

239 

41. 053 

363 

K02623 

JJiUov^pxiXXct [lie 1 allU^ a 3 U 6 r LI OpOITiyOS J- 11 
isofoYTTl T*l / QP\ 

212 

36 . 774 

364 

D86983 

Homo sapiens similar to D.melanogaster 

T^£* yrt'V "l ^ cs -J T1 /TT1 1 H Q O \ 
J^/CT J_ UAXUciO XI 1 \J»JXXUO.i£/ 

450 

29.442 

365 

AL021747 

*jv*aij. iuoav-unai t^miy v^co kjouiuti iiypoi.riec.icai 
protein 

388 

31 . 599 

366 

AB020671 

1 lwLiiu oa.pj.t-Ho {VlAnUOD4 pXuUc lH 

681 

82 . 677 

367 

K0122 8 

Homo cjia T"» t on a nl *-\V» ^ n / t \ /— ■ V* — k -i ■»-» 
"wiuu oapiclib axpila X. \ JL ) CllHUl 

217 

32 .258 

368 

AF003535 

Homo Qani Pn«3 HPT?") . 1 i Vo r\ -y~/~\ t- /-k -i t-i 

3 57 

54 . 902 

369 

Z11974 

Mus musculus macrophage mannose receptor 

Jj-'-*- uljl. O WJL 

249 

27.014 

370 

AF070651 

Homo sapiens zinc finger protein 4 

246 

40. 000 

371 

AF068 74 9 

nua iiiuuLuxus bpniny os ins Kinase 

362 

65 . 657 

372 

AB012223 

^.ctii-Lo idiuxiidi xa UKr a 

364 

51 . 938 

373 

AF169633 

Mus musculus alpha 2 delta calcium 

i** Vi £i nn ^ ~\ ci i V^\i t t"i -» t~ 

^llalUiCX oLiXJLLllXL- 

332 

44 .633 

374 

U80846 

v^cicriitJi iiauux u. X fc> clc^oHS JNO QcIlnlClOn 

line found 

237 

24 . 710 

375 

X16711 

iiuiiiu e*c*jk»»j.tsiii> ^.wLiZ AX 

431 

29 . 918 

376 

AC004460 

"viiiu ociyxt;iio sxiuxxax co go±gi antigen, 
similar to (PTn'O^/iQfidni ^ 

412 

59 . 124 

377 

M2 07 8 9 

living oapxciia axpila x I— y LJ t; X w-UXXa^cIl 

169 

34 . 921 

378 

AF027735 

J- -i»a. v^xa v j.pco inxiiui allipux XaLc bllK 

protein MiSpl 

O Q O 

Z 9 8 

30 . 977 

379 

AF003535 

Homo saoiens ORP2-likf* nrnhp-in 

CI 1 

63 . 704 

380 

X83413 

Human herpesvirus 6 U88 

328 

36.649 

381 

Y12713 

MUS mUSCUluS Pm-Pnl -riTFTPa c:<a 7-^-^1 \rrivrvho-! t-i 

/ / J 

70 . 225 

382 

AF020261 

Saritalum ^> 1 hum nrol H r"*i Vi r^vr\t- ^ -i n 

k^aiiuaxuiii piUXXllC I plULcXu 

194 

39 . 200 

383 

AC002535 

ArahidoiDSis thall^nA m^ah i vp fi-hoha- 

repeat containing protein, 5' partial 


25 . 473 

384 

AC004883 

Homo sapiens similar to KIAA0766; 
similar* to PTD * crl flft'?!?^'* 

191 

27. 717 

385 

X98330 

Homo Satsipnfl nnHi no rpronhnr O 

o n c 
zlb 

41 . 176 

386 

AF145705 

MUS mUSCUlUS T'P'K nrnt"P1 n V"i naco Vinmrkl rtrr 


80 . 342 

387 

Z49125 

Unknown similarity to Trichostrongylus 
colubrif ormis 11 kd secretory protein 
(Swiss Prot accession 

312 

31.356 

388 

X03725 

Mus musculus ORF 2 (466 aa) 

270 

45. 185 

389 

M31627 

Homo sapiens X box binding protein- 1 

458 

68 . 148 

390 

L07545 

Leishmania tarentolae A 'c' was inserted 

212 

27.490 
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after nt 369 (=nt 10459 in genomic 
sequence (M10126) ) to correct -1 
frame shift probably due to gel 
compression 




TTQ "5 C *7 O 

Homo sapiens putative pl50 

486 

45.509 i 


/W UUU4 So 

Rattus norvegicus cyclic nucleotide- 
yacea cnannei oeua suduihc 

298 

36 . 604 

393 

Z38061 

Saccharomyces cerevisiae mal5, stal, 

Icll. IjD / , v_Mx . U.J, /VMxii iMol rUobiu 

351 

24.477 

394 

M13100 

Rattus norvegicus unknown protein 

231 

37.838 


-M-D U -L £. £. J 


3 00 

42 . 553 

396 

U35376 

Homo sapiens repressor transcriptional 

laCLOr 

521 

40. 940 


ALUDUUZZ 

nomo sapiens nypounecicai protein 

312 

35.060 

398 

AF038564 

Homo sapiens atrophin-1 interacting 

472 

69.811 

TOO 

J ZJ 

AO 

Rattus norvegicus LI retroposon, a 
portion of its ORF2 sequence 

413 

42 . 922 


RI7n£ A HA 
J\C U04DU4 

Homo sapiens KE03 protein 

12 7 9 

50.785 

401 

U93181 

Homo sapiens nuclear dual -specificity 
phosphatase 

1716 

55.626 

*±\J Z 


Mus musculus domesticus ORF2 

598 

44 . 156 

403 

U00978 

Mus musculus type I inosine 
monophosphate dehydrogenase 

1626 

83 .537 

404 

U67056 

Acanthamoeba castellanii myosin I heavy 
chain kinase 

222 

29.478 

405 

LI 54 19 

Caenorhabditis elegans col -4 0 collagen 

196 

35.681 

A f\C 

4Uo 


Homo sapiens Protein sequence and 
annotation available soon via Swiss- 
Prot ; available at present via e-mail 
from LABE I T@EMBL- Heidelberg . DE 

783 

79 . 268 

dm 

*X\J t 

VC1 "5 Q/l 

Xenopus laevis APEG precursor protein 

141 

32 . 584 

a n q 


Mus musculus TCF-3 protein 

133 7 

86 . 307 

A ft Q 


nomo sapiens prepro-aipna- x type 3 

i_ *—> x x ay cii 

339 

29 . 250 

*± x v 

A31D ID 

\Tf \ ~] ir/«\v <— • — i t~ <^ >^ "i O TTT T7* A TCTi GTTD'CA/" , C 
VU1 VUa Lai Lcl 1 OULirHl IltU oUKr/idi 

GLYCOPROTEIN 185 


37.190 

411 

U96 113 

nL>IIIU ci LJ _L Uiio rVWrl 


zo . y io 

412 

X83413 

Human Vi£»Y*rii=» cj-vr-i vnc: ttq q 


1 A 1 1 Q 
j4 . 11 O 

413 

Z34465 

Zea mays extensin- like protein 

306 

29.032 

414 

U Z t: Z H U 

l-'X, woOk/HX Xd IllcXallUydoLcr X / X / 

*> *"5 Q 

36 . 232 

415 

U08020 

Mus musculus collagen pro-alpha-1 type I 
chain 

369 

32.374 

416 

AO J 4 X J 

XI LI II let 11 IlcXpcoVlIUo o uoo 


6 5 . 04 9 

417 

Z67990 

Caenorhabditis elegans similar to 
cuticle collagen 

246 

35.829 

418 

AL078635 

Amycolatopsis orientalis putative WD- 
repeat containing protein 

142 

28.571 

419 

U43 541 

riuo llluOLUXUo X dl 1 1X1 1X11 UtJ La 

^ n *7 


420 

Z70756 

Caenorhabditis elegans predicted using 
Gene finder; similar to collagen; cDNA 
EST EMBL:D66041 comes from this gene; 
cDNA EST EMBL:D66295 comes from this 
gene; cDNA EST EMBL:D69679 comes from 

114 

27.632 


(^0 


i o y a *+& »+3; o 2 a & o l» 




tii-Lc» yt^iit:, tfxMi-V l!*o 1 CiiYLdXj . U / U / COTTieS 
"F r~om f"hi <5 rtpn 

Xw Jte ^/ 111 LliXu mCIIC 



421 

M22333 

Homo sapiens unknown protein 

268 

49.275 

422 

AROI 9 99 ~K 

UaillS J. ctlllX X Xdi. X £> \JK>U 

O Q O 

z\ y Z 

42 . 520 

423 

AL117201 

iVU -1. _1_ / U J. 

\_deuuinduuiLiD cicy alio prcdiCLca usiny 

Gene finder 

lo / 

25 . 934 

424 

U97553 

uivix xnt: ucxpcoviiuo oo uiiJvnown 

"3 T Q 

32 .599 

425 

U09116 

Homo sapiens 0RF2 , encodes a reverse 

LXctlloV/X XJL/LclDC XlUilKJlO^ 

361 

40.686 

426 

AF169388 

Mus musculus alpha 4 collagen IV 

292 

28.723 

497 


riuman enaoyenous retrovirus Jtv. pox 
protein 

850 

59 . 817 

428 

Y1 4490 

nui i Ltw> sctpicim prepio aipna 1 type j 

collagen 

O *7 C 

33 . 007 

42 9 


■jiiLurnyncnus rnyKiss axpna z, type 1 

>w v J_ J. C* Cll 

loo 

30 .485 

430 

U93 568 

Homn q ai~> H <=»yi ci rn i t* a r* \ ~\re> r%~\ 

ccc 
J JJ 

>1Q TT) 

431 

AB012223 

Canis familiaris ORF2 

195 

40 . 000 

432 

AU1 /DO 

riomo sapiens pro- aipna vv/coiiagen (aa 
1099) 

220 

30 . 078 

433 

US 0 076 



Jo . 4 y j 

434 

AC002398 

Homo sapiens F25965 3 

533 

69.697 

4 1 R 
" j j 

rir uuui jo 

tdenoiiiaDciiuis eieyans oimnar to 
cuticular collagen 


31.600 

4 1 fi 
f* «J o 

.rt.r 11440D 

urosopniia meianogasuer ^linp-o 

2 73 

30 . 114 

437 

AL021811 

Arabidopsis thaliana putative protein 

816 

51.154 

/l "3 Q 
*± J O 


Schizosaccliaromyces pombe hypothetical 
divergent repeat-containing protein 

178 

31 . 285 

439 

U73819 

Mus musculus polypeptide GalNAc 
transferase -T4 

366 

40.838 


Ao© uiy 

Homo sapiens SH3 -domain interacting 

}JL ULc ±11 

190 

32 . 000 

441 

AJ004801 

Bovine herpesvirus type 1.1 immediate- 
early transactivator protein (cell 

llUtlcUo / 

112 

30.539 

*a ** €» 


vjdiius yaiius novel coiiagen protein 

319 

29 . 730 

443 

AROI "*4fi4 

nut} mufatuius cytonesin i 

3 78 

48 . 810 

444 

L29028 

Unknown amino acid feature : N- 
yiytubyictLiun sites, aa ^l . . . . 
48 51 79 74 1 07 

205 

33.158 

445 

U97553 

mi i t~ H ti ^ V»^T~r^f*c3'v-i "r*i i a fift nnlfnr>uin 

lltui JLXXC llCiJ^CoVXI US OO UX1A.11UWI1 

9 1 

OQ ICC 

446 

AJB014608 

Homo ^aoipn*! KTAA07f)fl nrnh <=» "5 n 


7 Q fi 7 A 
/ y . O / U 

447 

AL031231 

StreDtomvces coplicoloy hvnnfhAt* i ral 

i~r v— X. will y V— ^ £3 l>l/v-JLX WJlvX ny UUUilC LXUCtX 

protein SC3C3.03C 

1 JO 

4** c 

448 

AF081110 

Mus musculus domesticus ORF9 

9 R ^ 

4 R 9 C Q 

449 

AF081111 

Mus musculus dom^st* i ctis ORP9 

22 7 

ii n T4i 

ft U . 11 X 

450 

AF132480 

Mus musculus Ese2 nrnhp i n 

9 "3 7 

^ J J D 

qn 79 fi 

451 

AB002304 

Homo sapiens KIAA0306 

1459 

74.656 

452 

AF067607 

v^aciivjx XictULlX » — La excy alio OXlllXXctx LU 

cuticular collagen; C18H7.3 

OCT 

"1 "1 "5 A "J 

453 

AB011370 

Mus musculus Ankhzn 

2127 

76.231 

454 

Z78279 

Rattus norvegicus Collagen alphal 

223 

32.472 

455 

AF115480 

Mus musculus cAMP- dependent Rapl 
guanine-nucleotide exchange factor 

230 

32.308 

456 

M22334 

Homo sapiens unknown protein 

1085 

58.544 
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/» C *7 

Li4U4 by 

Mus musculus latent transforming growth 
i_ctv_ ulji — jjtruc*. jjiuuiiiy protein 

2128 

89 . 969 

A *^ft 
f* o o 


HxdDiaopsis L.xicijLiaria pui-ative 

l fx 

37 . 059 

459 

i iz. ^1. ^ «> o 

.n<wMiit_> oapxciio uiifkiiuwii piuucxn 

T ~7A 
1 / *4 

30 . 137 

460 

AB023203 

Homo sapiens KIAA0986 protein 

165 

46.591 

ft D X 

7\T?ft ft O C O C 

riomo sapiens L/Kr z - x i\e prouein 

492 

54 . 301 

462 

X51394 

Xenopus laevis APEG precursor protein 

194 

33 .333 

fx D 5 

jlU / / z>2 

Volvox carteri pherophorin-S 

305 

45 . 918 

4 £ A 
fxD 1 

X loolfl 

Homo sapiens paraplegin-like protein 

323 

40 . 952 

fx D O 

^d /yyu 

Caenorhabditis elegans similar to 
cuticle collagen 

216 

40 . 476 

HDD 

j_ifx jo iy 

Homo sapiens polycystic kidney disease 1 
protein 

162 

34 . 266 

a a i 

f*D / 

ADD QUO 

Gallus gallus procKr2 

168 

33 . 577 

A K. Q 

nnni ooo*3 

Lams rdtnuidiis UKr^s 

535 

40 . 397 

ACQ 

AL. UUoU / O 

Arabidopsis thaliana Contains PF| 00069 

CiUlvdiyuUlu pXUUclIl KlllaSc uoiuain • 

181 

38 . 462 

470 

U58736 

Caenorhabditis elegans Similar to 

tUUlCUldl COlla^cU 

245 

33.333 

4 *7 1 
ft / X 


v_a.il j. s lamiiidris uki?z 

200 

43 . 269 

ft / Z 


Drosophila melanogaster eyelid 

315 

29 . 968 

A *7 "l 
fx / J 

Apni c A A n 
Vl Dfx fx U 

Rana catesbeiana alpha 1 type I collagen 

270 

27 . 672 

474 

Z98980 

Schizosaccharomyces pombe wiskott- 
aldrich syndrome protein homolog 1 

172 

31.416 

475 

AF090866 

Mus musculus CDO 

1473 

47 . 119 

>t *7 C 

4 / D 

L76559 

Drosophila melanogaster mus308 

477 

30 . 484 

477 

ACQ02528 

Homo sapiens alpha2 (I) collagen 

212 

28 . 060 

478 

AL033534 

Schizosaccharomyces pombe serine- rich 
protein 

253 

35. 065 

479 

X15120 

Pseudorabies virus immediate-early 
protein \AA 1-1460) 

105 

25. 869 

480 

AF071172 

Homo sapiens HERC2 

615 

85.000 

4 y l 

*7 "7 /"» <"\ O 

Z70208 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen 

199 

31.088 | 

/l o o 

U44 091 

Rattus norvegicus atrophin-1 related 
protein 

227 

32 . 168 

a q *a 
ftoo 


Mus musculus 2855 is the position of the 
first start codon in ORF 2 ; putative 

344 

52 . 212 

A ft A 

ft O ft 

A. Ifi ft Z LF 

nomo sapiens prepro-aipna- i cype J 

LLJ-L J. dy till 

244 

32 . 154 

485 

U07629 

Drosophila melanogaster beta-heavy- 

ope LLI XII 

228 

30.052 

486 

TTQ TCC'J 

XJ/^vrn /™v cj t~\ ~i on g i^i 1 1" ^ i T C C\ 
n\J\\\\J iscipicilb pLIL.ciL.lVc: pXDU 

C Q tZ 

bU . o4 y 

487 

Z34465 

Zea mays extensin-like protein 

250 

31.383 j 

488 

aRft? ft^Rfi 

«d wzuooo 

TJ"/ — vtti /~v c; n *r> i on a T A 7\ H Q ^7 Q v\y»/s ^ -J 

xiuiulv Ddpicim in. j./-i/-vu o i zf pruccin 

-51o 

34 . 884 

4ft 9 

fr O _7 

Ml 1 ft Q"7 

i v ius iHUscuius proiine-ricn salivary 
protein 

O "5 A 

28 . 571 

A Q ft 
ft j7 V 

Ml 9 1 ft ft 

i v ilis iiiLiscLii us proline — ricn procein riir - _3 

O O "7 

22. / 

36 . 585 

491 

AF045567 

Xenopus laevis nucleoporin Nupl53 
homolog 

94 

OA 17 R 

4& ft . O O J 

492 

AF003535 

Homo sapiens ORF2-like protein 

370 

44 . 805 

493 

U35376 

Homo sapiens repressor transcriptional 
factor 

903 

62.312 

494 

AF109907 

Homo sapiens S164 

350 

40 . 361 
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495 

X52046 

Mus mus cuius type III collagen 

212 

35.417 

496 

D42063 

Homo sapiens RanBP2 (Ran-binding protein 
2) 

354 

48.588 



Homo sapiens The KIAA0136 gene product 
is novel . 

315 

58 .491 

A Q Q 

Ar UIU144 

Homo sapiens neuronal thread protein 

nm« "NTT T> 
A1J / C-JN 1 It* 

218 

58 . 974 

*» -7 -7 

VI 071 7 

Mus musculus Pro- Pol -dUTPase polyprotein 

665 

76 . 744 

cnn 

vi m q*> 

Human endogenous retrovirus K protease 

582 

72 .581 



Homo sapiens neuronal thread protein 

AT* "7 1~> _ VPTTJ 
J\XJ / C — JM 1 tr 

276 

69 . 841 


1 J.Z. f J_ j 

Mus musculus Pro- Pol-dUTPase polyprotein 

231 

45 . 926 

503 

X52495 

Glycine max DNA-directed RNA polymerase 

248 

39.153 



Homo sapiens transcription factor 

24 7 

36 . 816 

J U J 

AD J 

Mus musculus formin 

320 

44 . 898 

506 

Y11871 

Arabidopsis thaliana Oxal protein 

330 

26.710 

c rv *7 


Mus musculus proline- rich protein 

240 

37 . 500 

bU O 


Rattus norvegicus calmodulin -binding 
protein 

930 

79.803 



Homo sapiens prepro-alphal (I) collagen 

312 

30 . 227 

Jlu 


Homo sapiens density regulated protein 
urpi 

752 

88 .281 

511 

M74165 

Gallus gallus tensin 

77 

28.125 

z> X Z 

TTQ 1 *7 Q Q 

Drosophila melanogaster kinesin-73 

625 

45 . 174 

Jl J 


Homo sapiens envelope 

956 

55 .814 

^ 1 A 

MQA 111 

Homo sapiens mucin 

265 

41 .333 

CI c 

ALillUlbl 

Homo sapiens hypothetical protein 

361 

36 .269 

DID 

71 A O Alt 

Mus musculus Wiscott-Aldrich Syndrome 
protein homolog 

228 

41 .600 

517 


Mycobacterium tuberculosis PE PGRS 

190 

33 . 880 

JlO 

a an 1 ft o 1 

Homo sapiens KIAA0720 protein 

421 

53 . 788 

519 

D38024 

numu oapicns UKr 

2 93 

45.985 

520 

X69838 

Homo sapiens G9a 

809 

46.885 



Mus musculus Ten-m3 

518 

78 . 095 

522 

AL050276 

Homo sapiens hypothetical protein 

270 

55 . 072 

COT 

Ao bib b 

Volvox carteri extensin 

233 

33 .838 

C O A 
b ^2 ft 


Caenorhabdi tis elegans Y48ElB.2b 

326 

26 .300 

525 

AF071081 

Mycobacterium tuberculosis proline-rich 
mucin homolog 

222 

35.233 

DZ D 

VI TQT) 

Human endogenous retrovirus K env 

339 

43 . 103 

527 

AF116463 

Streptomyces lincolnensis unknown 

183 

33 .649 

ft 

o o n c 
i\U J / Ub 

Homo sapiens salivary proline-rich 
protein precursor 

182 

34 . 586 

Z> Z? 

l v l 2/ D _/ *i J 

Homo sapiens prof ilaggrin 

248 

27 . 817 

con 

Z) J v 

VIA AC C 

Zea mays extensin- like protein 

213 

29 . 680 



Alvinella pompejana fibrillar collagen 

nha i n TP A 1 al t-\ Vi ^ 
OXlciJ.Il fHpi dipnd 

218 

35 . 628 

532 

U00048 

Caenorhabditis elegans No definition 
line found 

520 

32.759 

533 

AF020261 

Santalum album proline rich protein 

233 

31.579 

534 

AC002310 

Homo sapiens Unknown gene product 

907 

65.929 

535 

AJ243997 

Homo sapiens ERIC1 

264 

50.355 

536 

U32189 

Oryctolagus cuniculus histidine-rich 
glycoprotein precursor 

291 

47.458 


^3 
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537 

AF003535 

Homo sapiens ORF2-like protein 

355 

44 .000 

z> 0 O 


Canis fa.milia.iris ORF2 

219 

38.211 

539 

L11672 

Homo sapiens zinc finger protein 

1730 

42 .364 

C4 A 

VI OTI 1 

Mus musculus Pro-Pol-dUTPase polyprotein 

337 

38.538 

541 

M34225 

Homo sapiens cytokeratin 8 

580 

72 .603 

b4z 

AF07108 1 

Mycobacterium tuberculosis proline-rich 
mucin homolog 

167 

27.559 

543 

U40953 

Caenorhabditis elegans No definition 
line found 

206 

31.746 

r /1 /i 


Rattus norvegicus vanilloid receptor 
subtype 1 

348 

39.735 



Homo sapiens pot ► ORF I 

227 

34 .395 

546 

w J D J f D 

Homo sapiens repressor transcriptional 
factor 

917 

51 . 136 

547 

U93564 

riouio sapiens pucacive pibu 

254 

47 . 222 

54 8 

AO j 

numdn ne xrp esvi ru s b Uob 

257 

41 .606 

549 


i"us tnuacuius semapnorm via 

531 

43 . 644 

550 


riomo sapiens type aviii collagen 

298 

30 . 162 

551 

AB028997 

Homo sapiens KIAA1074 protein 

701 

49.811 



Gallus gallus alpha-1 collagen type III 

184 

28 . 829 

553 

U93563 

Homo sapiens putative pl50 

278 

36 .508 

CCA 


Rattus norvegicus ORF3 

487 

64.800 

555 

U93568 

Homo sapiens putative pl50 

228 

42 .424 

556 

Z14015 

Nicotiana tabacum pistil extensin like 
protein 

187 

29.302 

c c 1 

557 

X55995 

Rattus norvegicus dimethylglycine 
dehyd r ogena s e 

1412 

91.111 

558 

AF062655 

Mus musculus plenty-of -prolines-101 ; 
POP101; SH3-philo-protein 

155 

25.652 

rr c Q 


Homo sapiens suppressor of white apricot 
homo log 2 

1369 

67 .204 

CCA 

act n oortn 
AF109907 

Homo sapiens SI 71 

450 

47. 849 

DO 1 

AT A1 1 C T /t 

AliO-3 J 534 

Schizosaccharomyces pombe serine- rich 
protein 

159 

24 . 759 


y c: O C Q T 

Rattus norvegicus ORF4 

310 

47 . 107 

D D J 

Aiar^T cr a ~i 0 

Cynops pyrrhogaster alpha 1 type I 
collagen 

534 

29.097 

C(~ A 
DO 

uujy / j 

Homo sapiens ZNF91L 

662 

49 . 794 


/\r UXU144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

461 

57 .432 

000 


Canis familiaris ORF2 

420 

45 . 789 

567 

TT4 "l^fin 

Peromyscus maniculatus reverse 

t* "K" 0 n O 1* T 1^1 -3 ft n 

L-i ctilo t-i lpudSc 

339 

48 . 592 

568 

rtr vu j j j j 

rioino sapiens UKr 4-iiKe protein 

26 7 

57 . 732 

569 

D38548 

Homo sapiens The ha0936 gene product is 
nove 1 

421 

34 .247 

570 

M22334 

Homo sapiens unknown protein 

299 

46.053 

571 

A. O J *± X 0 

numan nerpesvirus 0 U8 8 

407 

42 . 593 

— > / z. 

Af UD D i7 Uft 

Myxococcus xanthus unknown 

150 

38 . 889 

573 

K03204 

Homo saoiens salivarv nrnl inp-ri rh 
protein precursor 


J b . jiu 

574 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

368 

65.979 

575 

X51394 

Xenopus laevis APEG precursor protein 

317 

34.091 

576 

M22334 

Homo sapiens unknown protein 

631 

51.101 
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577 

X90569 

nuino £>dpxtilj.S claSulC Cltin 

270 

35.455 

578 

X51394 

Xenopus laevis APEG precursor protein 

263 

31.773 

579 


uncornyncnus myKiss axpna 2 type I 

w _L J. cty tin 

266 

31 . 116 

580 

AE001 727 

iiiCLitujLUya I i Id 1 1 1 — L 1 1 Id COuSci V6Q 

hypothetical protein 

226 

28 . 571 

581 

U08 020 

vy vy ^* w 

11US iiiuaLuius coixayen pro-aipna-i type I 
chain 

179 

32 .203 

582 

AJ132099 

Unmri C3f>-j <=»n c; \7MM1 t-» v /-"i t- e» -i n 

944 

70 . 936 

583 

AB020629 

Homo Qar»"i pnQ "K"TI1210Q">0 r^-rr\t~ a A r» 
iiumw oapxcilo XVliiriUOZZ pxvJL,tJXZ* 

1552 

70 . 904 

584 

M76671 

Lycopersicon esculentum extensin (class 
II ) 

238 

31.361 

585 

AC004144 

Homo sanipnq P^4.Dfn 1 

1611 

. 

78 . 571 

586 

D31909 

PnPlimftPVQt" "l cs oa. Ti nil OT?T? — 1 
t *ivi 1 1 » w jr O UXO V*- CI J- X11XX UI\ r O 

210 

32 . 857 

587 

AB012223 

Canis familiar is ORF2 

213 

39.695 

588 

AL117200 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

320 

25.123 

589 

MR"} 1 *7 Q 

baiius gaiius aipna-3 type IX collagen 

218 

28 . 713 

-J -7 V/ 

AU J / ZD 

Mus musculus ORF 2 (4 66 aa) 

262 

41 . 880 

591 

Z78018 

Caenorhabditis elegans predicted using 
Genefinder; similar to serine/threonine 
kinase; cDNA EST yk353dl0.5 comes from 
this gene 

1258 

36.444 

O _7 ^ 

*j -5 D D 

Zea mays extensin-like protein 

238 

30. 556 

COO 

O O / D \J 

Homo sapiens Similarity to Human 
diusLdxyxri vow:/\jmjvd nUMAN; ; CDNA EST 
Cii'io j_i . comes ixom ums gene; cd 

362 

37. 952 

594 

AB028236 

Tricholoma matsutake Pol (reverse 
transcriptase-RNase H-inteqrase) 

172 

31. Ill 

595 

AF053745 

Mus dunni endogenous virus gag protein 

205 

37.615 

596 

X92887 

Human endogenous retrovirus K pol/env 

586 

64 .493 

^ Q "7 

AbUUz .3 .<£ 1 

Homo sapiens KIAA032 3 

1358 

85.169 

COO 
J JO 

too Q*7** 
XiO Z? / j 

Mus musculus This ORF is capable of 
encoaing 4jz aa wnicn is similar to 
thymidylate kinases especially at two 
domains: the p-loop or catalytic site 
diivi Lilt; buxjaLidtc Dinainy slue ; OKlr 

1011 

80.952 

599 

M22333 

rzwitiv^ oa^xcils uiuvnuwil piotclll 

418 

46 . 000 

600 

AF033811 

Moloney murine leukemia virus Prl80 

359 

50.376 

601 

AF026 9R4 

out> tdurus pyruvate Qenyorogenase 
piiuapudLdoc icyuiaLory siujunit 
precursor; PDPr 

797 

72 . 131 

602 

M13 100 

ivauuuo iiux vey iLUb UIIKIIUWII protein 

286 

41 . 781 

603 

D42043 

Homo cj t~) -j ctr-i a TVio Via O HOO rrcinci nvrtrlii^f- ■» <-» 
^ iw oci^/xdio xntr lldz UZZ yclie UxOGuCC IS 

novel . 

1232 

75 . 746 

604 

AL110188 

Homo sapiens hypothetical protein 

657 

50.746 

605 

U41557 

Caenorhabditis elegans proline and 
glycine-rich 

255 

30. 154 

606 

M95610 

Homo cjani ono a 1 nna - O TY r*n1 1 nrran 
ra^-unw oapxcllb aXpUa Z 1A COllaycIl 

152 

29 . 795 

607 

AC004221 

Homo sapiens R2 9144 1 

144 

/ . Z> \J \J 

608 

AL031588 

Homo sapiens dJ1163Jl.l (ortholog of 
mouse transmembrane receptor Celsrl 
(KIAA0279 LIKE EGF-like domain 
containing protein similar to rat MEGF2) 

1006 

75. 622 

609 

AC004460 

Homo sapiens similar to golgi antigen; 

571 

78.448 
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similar to Q08379 (PID :g2498401) 



O J. W 

J_ll ^ U1D 

KdULus norveyicus tricarooxyxate 

"V" a V\ >-v /-v V* ^ - ^ ^» -i 

i- xTciri ispozr l. procein 

593 

64 .671 

611 

X69838 


426 

63 . 910 

612 

UUO U j£» u 

i v ius muscuius conagen pro - dipna — i type 1 
cha in 

268 

35 .401 

613 

X83429 

Rnmhw mnr* "i a 1 nha - f-iiKnl -in 
xjv-/mujyj^ \\\KJJL A. aipila LUJjUXIJI 

741 

81 . 119 

614 

AJ238221 

Homo sapiens RNA polymerase III subunit 

175 

64 .815 

615 

M58378 

Homo sapiens synapsin lb 

175 

35.616 

616 

AB015440 

Rana catesbeiana alpha 1 type I collagen 

251 

31.657 

617 

AJ001038 

Mus muscuius M-protein 

1798 

45.704 

618 

D70831 

Homo sapiens Zinc- finger protein 

563 

63 .504 

619 

U4 983 0 

Caenorhabditis elegans coded for by C. 
elegans cDNA ykl4el0.3; similar to S. 
pombe cell division control protein 16, 
v_JJL_lfo vbP : CC16_SCHPO / P36618 ; ana to S. 
cerevisiae cell cycle arrest protein 
BUB 2 (SP:BUB2 YEAST , P26448) 

628 

34 .571 

620 

AL021366 

Homo sapiens CICK0721Q. 4 . 1 (PHD finger 
protein 1) (isoform 1) 

443 

39.791 

621 

L03427 

Homo sapiens basonuclin 

327 

32 .510 

622 

TTQ T cm 

U93570 

Homo sapiens putative pl50 

287 

39.355 

/TOO 

b Z J 

Z75550 

Caenorhabditis elegans weak similarity 
with BRKA gene from Bordetella 
Pertussis; cDNA EST EMBL:T0106 0 comes 
from this gene; cDNA EST EMBL:T01361 
comes i. xrom Lnis yerie 

588 

49.524 

624 


Hraoiaopsis tnaiiana putative protein 

222 

31 . 950 

v3 ^ O 


Xenopus laevis Scythe 

274 

46.479 ; 

626 

K034 75 

noino sapiens pulmonary surraccanc- 

ClODUV^ pi UUcJ.II 

181 

46 . 078 

627 

AB020629 

Homo sapiens KIAA0822 protein 

1415 

72 .107 

628 

Z73619 

Saccharomyces cerevisiae ORF YPL263c 

386 

27.132 

629 

Z78279 

Rattus norvegicus Collagen alphal 

202 

26.866 

630 

AF068706 

Homo sapiens gamma2 -adapt in 

254 

64 . 198 

6 J 1 

Z22964 

Caenorhabditis elegans a2 (IV) collagen 

156 

34 .300 



Caenorhabditis elegans final exon in 
repeat region; similar to long tandem 
repeat region of sialidase 

voir . x^nj^. iai^k, ir ^ J Z j J ) ciliLl 

neurofilament H protein i 

223 

36.150 

633 

AB015440 

T?ATIA C*Z\ t" G> ct VkO n ana al r^Vira 1 f ^ rr\ c» T /-tr\l 1 -> rrnn 
xvciiicx ^aLCoUclalJa ct _L JLJ I lcl _L Cypti 1 COlldycn 


28 . 466 

634 

AB014564 

Homo sapiens KIAA0664 protein 

513 

58 . 140 

635 

AF095449 

ntuciuaera glycines mucin-iiKe protein 

143 

26 . 804 

636 

M63596 

Xenopus laevis alpha-1 type II 1 collagen 

298 

32 .249 

6^7 1 


Streptomyces coelicolor hypothetical 

pXULclil O V 5 V 5 . UJC 

187 

29 . 365 

638 


Rattus norvegicus protein tyrosine 

319 

33 . 968 

639 

D88440 

Gallus gallus high molecular mass 
nuclear antigen 

177 

22 . 642 

640 

M92913 

Nephila clavipes dragline silk fibroin 

185 

31.278 

641 

Z74615 

Homo sapiens prepro-alphal (I) collagen 

306 

29.108 

642 

U41387 

Homo sapiens Gu protein 

843 ! 

60 .484 

643 

Z92546 | Homo sapiens bK65A6.1 

1700 ! 

90.175 
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644 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

365 

44.390 

£ A C 

Jjj o o o i 

neisseria gonorrnoeae pucauive 

181 

38 . 462 

646 

D38162 

Mus musculus mouse al(XI) collagen chain 

176 

37.264 

D / 


Myxococcus xanthus unknown 

192 

33 . 166 

f /1Q 
D4 O 


Homo sapiens KXAA1005 protein 

472 

37 . 805 

649 

U43200 

Boreogadus saida antifreeze glycopeptide 
AFGP polyprotein precursor 

149 

40.157 

^ ^ n 

TTyl 10C\(\ 

Boreogadus saida antifreeze glycopeptide 
jr\c vjir poAypiotein precursor 

232 

31 . 687 

D -J X 

r\r \J / 11 / 

nonio sapiens rinrCv^z 

7 97 

81 . 765 

652 

AFfiQR7RR 

./"1..I7 U JO / O O 

vsaiius yaxiuy nuciear caiinociuiin-Diriaing 
protein 

z 3 83 

61 . Ill 

653 

U4 7855 

m dueus uiaUciriacuy iiDioin- j 

job 

28 . 099 

654 

AF083334 

Antheraea pernyi fibroin 

256 

33 . 923 

ODD 

t no n q*7 

cams ramiiiaris oeca-speccrin 

164 

27 . 962 

tec 

D D O 


Homo sapiens putative pl50 

377 

55 . 814 

657 

AF071081 

Mycobacterium tuberculosis proline- rich 
mucin homo log 

266 

33.562 

658- 

L28125 

Podospora anserina beta transducin-like 
protein 

316 

26 . 525 

65 9 

AF03 1588 

Homo sapiens WASP interacting protein 

266 

30.102 

C C f\ 

AC006135 

Arabidopsis thaliana putative vicilin 
storage protein (globulin-like) 

389 

41.618 

6 6 X 

Z3806 1 

Saccharomyces cerevisiae mal5, stal, 
Xen : 1367, CAI : 0.3, AMYH YEAST P08640 
LjIjUCUAiYIYIjAoJS SX I EC 3.2.1.3) 

230 

22 . 689 

D D 


LaenornaDuitis exegans x/t>ooA.X2 

510 

41.441 

O O J 

Al/ jZO 

liicicum aescivum ririw yiucenin suounic b 
(AA 1-848) 

312 

30.033 


TTQ1 R ^ A 
U J? J D O *4 

Homo sapiens putative pl50 

933 

62 . 195 

665 

U93572 

Homo sapiens putative pl50 

338 

48 . 673 

D D O 

IlZ / 1 3 

Mus musculus Gag polyprotein 

288 

45 . 528 

£ £ "7 
O D / 

Mi 11 nr\ 

Rattus norvegicus unknown protein 

239 

42 . 056 

668 

AF000298 

Caenorhabditis elegans weak similarity 
to collagens; glycine- and proline-rich 

246 

29.562 

669 

M19419 

Mus musculus proline-rich salivary 
protein 

261 

38.743 

670 

X53581 

Rattus norvegicus ORF3 

332 

54.491 

O / 1 

vi moo 

Human endogenous retrovirus K gag 

-y~ /~\ 4~ "1 f-k 

p xro tcin 

273 

44 . 144 


■A.O 1Z J7fi 

Rattus norvegicus IjI retroposon, a 
pui liuxi w j_ ilo UKr Ejetjueiice 

291 

38 . 144 

673 

U43360 

Peromyscus maniculatus reverse 

215 

45. 614 

674 

U93572 

Homo sapiens putative pi 50 

396 

57.554 

675 

At?fift ill"* 

nus muscuius uoniescicus UKr l 

188 

45 . 370 

6 76 

TT4. QQ74 

tioiiio sapiens manner cransposase 

590 

72 . 222 

6 77 

AF0AHR4 

nuiiiLj sapiens 4DKua splicing LaCLor , t^trt 
45 

6 54 

73 . 649 

678 

Y17833 

Human endogenous retrovirus K gag 
protein 

111 

44 . 860 

679 

X67863 

Mus musculus T2 

179 

37.857 

680 

X59244 

Homo sapiens ZNF4 3 

519 

50.282 

681 

Y12713 

Mus musculus Pro- Pol -dUTPase polyprotein 

194 

41.538 

682 

U64849 | 

Caenorhabditis elegans Contains 

974 

44.475 
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similarity to Pfam domain: PF00646 (F- 
box) , Score=28.7, E- value =4 . 3e- 05 , N=l 



/" Q O 


Mycobacterium tuberculosis H37Rv 
hypothetical protein JV0534 

273 

31.429 

684 

X01469 

Plasmodium lophurae histidine-rich 
protein 

298 

56.410 

685 

AB012223 

Canis familiaris ORF2 

290 

54 . 545 

686 

Z92546 

Homo sapiens bK65A6.1 

411 

43.333 

DO / 

Z81515 

Unknown cDNA EST EMBL:T01055 comes from 
this gene; CDNA EST EMBL:D74 98 0 comes 
from this gene; cDNA 

328 

24.746 

boo 

At 1 Ub / o 0 / 

Caenorhabditis elegans Similar to 
cuticular collagen; C18H7.3 

183 

30 .594 

689 

X17403 

human herpesvirus 5 HCMVUL61 

185 

31.188 



Homo sapiens meningioma -expressed 

dntiycu o 

675 

70 . 909 

CQ1 
O _7 J. 

O ft "5 ft 

opermaLOzopsis simnis pziu protein 

229 

30 . 802 

692 

X58251 

Mus musculus pro-alpha-2 (I) collagen 

201 

26.974 

693 

X63134 

Zea mays hydroxyproline-rich 
glycoprotein 

195 

26 .210 

o 

Abo 04 4 

Mus musculus protein Htf 9C 

1143 

63 . 023 


*7 ^ A A C C 

Zea mays extensin-like protein 

188 

32 . 800 

696 

V01555 

Human herpesvirus 4 BRLF1 reading frame, 
(immediate?) early gene, acts as 
transcription activator . 

164 

30.319 

697 

AB012223 

Canis familiaris ORF2 

540 

46.377 

£Z O O 

7\ t~> r\ c> ~l ~l 1 i 

Ar 081111 

Mus musculus domesticus ORF2 

727 

46.201 

699 

AF010144 

Homo sapiens neuronal thread protein 

AD7C-NTP 

347 

61.538 

/oo 

a unoi r" -> r- 

AFOOJ 53 5 

Homo sapiens ORF2-like protein 

328 

49 . 032 

701 

Z844 76 

Homo sapiens dJ25J6.4 (ret finger 
proLeinj 

292 

32.639 


UJI / O 1 

Ancnopieura eiegantissima Antno-LWamiall 
preprohormone 

300 

32 . 584 

703 

L26953 

Homo sapiens chromosomal protein 

199 

68. 000 

704 

D12983 

Pyrococcus furiosus DNA polymerase 

1140 

96.923 

/Uj 


Homo sapiens KIAA06 96 protein 

723 

86 .232 

1 Ub 

Ar 03 0131 

Mus musculus Plenty of SH3s; POSH 

132 

25.328 

707 

L29029 

Chlamydomonas reinhardtii amino acid 
feature: Rod protein domain, aa 266 
468; amino acid feature: globular 
protein domain, aa 32 . . 26 5 

205 

33.846 

708 

X83413 

Human herpesvirus 6 U88 

213 

40.299 

709 

Z81503 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen; cDNA 
EST EMBL»:D6 5450 comes from this gene; 
cDNA EST EMBL:D688 88 comes from this 
gene 

130 

30.151 

71 0 


riomo sapiens kiaauuzU 

641 

80 . 435 

711 

U94855 

Homo sapiens translation initiation 
factor 3 4 7 kDa subunit 

276 

/ o . o o o 

712 

AC007192 

Homo sapiens P85B_HUMAN; PTDINS- 3 -KINASE 1 
P85-BETA 

1055 

77.432 

713 

D83146 

Mus musculus Six5 

327 

53.600 

714 

M60832 

Homo sapiens alpha-2 type VIII collagen 

200 

33.673 
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715 

L.48440 

Rattus norvegicus collagen alpha 1 type 
II 

286 

33 .639 

716 

L48440 

Rattus norvegicus collagen alpha 1 type 
II 

283 

29.070 

717 

AB012223 

Canis familiaris ORF2 

343 

37 .727 

718 

U09367 

Homo sapiens zinc finger protein ZNF136 

605 

59.494 

719 

AL078579 

Arabidopsis thaliana putative proline - 
rich protein 

186 

33 .333 

72 0 

Y12 713 

Mus musculus Pro-Pol-dUTPase polyprotein 

1796 

73 .351 

72 1 

AF071081 

Mycobacterium tuberculosis proline -rich 
mucin homolog 

213 

29 .618 

n i "~> 

uu / y / J 

oajLius gaiius aipna-i collagen type III 

190 

30 . 837 

/ ^ o 

ZVT ftRft7rtC 

Homo sapiens dJ475B7.2 (novel protein) 

217 

52 .381 

/ *I ** 


Rattus norvegicus unknown protein 

234 

58 .333 

'7'? 
/ZD 


Cams famxliaris ORF2 

337 

52 . 713 

726 

X64698 

Homo sapiens titin 

1055 

84 .314 

72 7 

AF04144 9 

Homo sapiens advillin; p92 

267 

60.204 

728 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

503 

69.748 

729 

U10281 

Sus scrofa gastric mucin 

198 

28 .326 

730 

X64322 

Chironomus ten tans BR1 

223 

45.349 

731 

AF043636 

Plasmodium chabaudi circumsporozoite 
protein 

345 

55.682 

732 

AE000114 

Escherichia coli possible synthesis of 
cofactor for carnitine racemase and 
dehydratase 

955 

93 .038 

733 

U93569 

Homo sapiens putative pl50 

362 

46 .296 

734 

AF104328 

Arabidopsis thaliana cell wall -plasma 
membrane linker protein homolog 

186 

45 .263 

735 

AB032552 

Schizosaccharomyces pombe mipl 

1068 

61.355 

736 

AL079348 

Streptomyces coelicolor putative serine- 
threonine protein kinase 

195 

34 .010 

737 

X54162 

Homo sapiens 64 Kd autoantigen 

460 

48.876 

738 

U93572 

Homo sapiens putative pi 50 

352 

48.734 

739 

L29028 

Unknown amino acid feature : N- 
glycosylation sites, aa 41 .. 43, 46 
48, 51 . . 53, 72 . . 74, 107 . 

188 

37.931 

74 0 

Z38061 

Saccharomyces cerevisiae mal5, stal, 
len: 1367, CAI: 0.3, AMYH_YE AS T P08640 
GLUCOAMYLASE SI (EC 3.2.1.3) 

164 

27 . 500 

741 

AF151865 

Homo sapiens CGI - 107 protein 

556 

73.050 

742 

AF094519 

Mus musculus diaphanous -related formin; 
pl34 mDia2 

1533 

87.500 

743 

U2 2 376 

Homo sapiens alternatively spliced 
product using exon 13A 

372 

55.200 

744 

L17318 

Rattus norvegicus proline- rich 
proteoglycan 

252 

46.154 

745 

AF153604 

Homo sapiens ubiquitin-specif ic protease 
homolog 

224 

48.673 

746 

ZB 2 26 8 

Unknown cDNA EST yk338gl0.5 comes from 

this oene • cDNA E^?T FMRT.*n97Q^4 rr^-me* c: 

from this gene; cDNA E 

489 

39.405 

747 

L27428 

Homo sapiens reverse transcriptase 

241 

45.528 

748 

AF003522 

Homo sapiens Delta 

501 

75 . 000 

749 

U76759 

Mus musculus nuclear protein NIP4 5 

533 

57.778 

750 

AJ000517 ! 

Homo sapiens spinocerebellar ataxia 7 

1214 

42.626 


X H J O S M-6 «4-3; ,„ O 2 Hi 6 O 2 


751 


"j^a t»v-iuid XJcLd dCClU 

53 7 

69 . 118 

752 

AF004161 

Oryctolagus cuniculus peroxisomal Ca- 

Uc^cIlUtillL bUlUCc Crtxx XSx 

813 

67. 797 

753 

AC006264 

Arabidopsis thaliana unknown protein 

335 

45.217 

754 

298979 

ocnizosaccnaromyces pornoe cat Dinaing 

526 

49 . 333 

755 

^AX7 y WU J 

noiiiv^ sdpiens lmmunoyioDuiin nsavy cnain 
variable rpaion 

"5 *1 C% 

3 79 

76 . 056 

756 

AF084205 

Ratt.us norvegicus serine/threonine 
orotein leinas** TAOl 

X V-f U t J.il /v JL X 1 GX O X ^XVX JL. 

2886 

98 .670 

757 

U4 0 3 4 2 

Mils mimnil hq ni tipt n 


64 . 072 

758 

AF05507 7 

Homo saT"»i" on Q 71 nr f innpr nrnhoi n /IO 

c: q c 

32 . 44 7 

759 

M12100 

Mus musculus proline- rich protein MP-3 

207 

48.810 

760 

AF14 06 7 ■=» 
nr ww 

nuuiu yctpxcllb ^ ITlcLallOprOLcaSe 

ADAMTS7 

413 

46 . 970 

761 

X53581 

Rattus norvegicus ORF2 

188 

42 . 718 

762 

U4 0342 

riUO IIILiwJ L* Ul Llt> IlXIlfc; XU 

l / / y 

76 . 903 

763 

AC003027 

Arabidopsis thaliana lcl|prt_seq No 

ucLiuition x xiifc: luunu 

277 

41.406 

764 

U8854 9 

tJt\ 1 a TYii 1 q tl 11a r^iT f—j-^ — % V* ■*«• ^ 

nus iiiuouuiud ul pruLOCaQiicrin 

76 0 

89 . 516 

765 

U4 5958 

iNXV_t->L,Xclilci dldLd piSCll CALcnSin-llK6 

piuucxii 

182 

32 . 800 

766 

U55816 

JAdLLUa iiUJ. veyitua LUiOoclIliUc~ScnslUlve 

X\. V_- J- LULI dllopu X, L. C2 X> 


98 .710 

767 

D3 0612 

Homo c; ^ i pn q ypriTPGCrtT* r^^r^r*on n 
nuiuw oapiCllo icpicooUI Ly JL L> L. tZ J. I J. 

t o q 

42 . 786 | 

768 

AB011135 

Homo ctani Pnci If THZiri^(\l r^y*^t"OT r\ 


77 . 193 

769 

AB029014 

Homo ^airi ptiq KTZiill DPI nrnhoi n 

» 1 1 l\J sajJXCllo ivXrVrVX L« _7 JL pL L/LCxll 

Q111 

i?XX X 


770 

X83413 

Hum^n Y"o#^ ci-ir"i rue C TTR Q 
n. La 1 1 let 1 1 IlcipcS VII Uo D UOO 


48 . 673 

771 

X03145 

Homo can "i priQ nnl" \T 


3 9.286 

772 

AB023183 

Homo cj=»y-v -i f=»Y-» Q K'TJVZiOQA^ nrnhoin 

/i n "7 

98 .485 

773 

X15334 

Homo c 3 t™i "i pri p prpa t* "i np lo' rtacp X) 

X _? D 

Ob . 66 / 

774 

M98502 

Muc; innqrnl nc; nMT,7-d 

X lUiJ IIIUO^UJi Jk^ 1 J X-J TC 

^ *7 *7 

bJ . D /l 

775 

AF003535 

Homo sani one; nPF9 — 1 n Icp* nrnhpin 

IXUIIIU 43dkVX.t7XXO VX\X7 £ IIJVC piULCIll 

X / D 

C (\ c c\~> 
DU . bU^ 

776 

AF109905 

Mus musculus Hsc70t 

^ ^ 
j j j 

oo . UoU 

777 

U90126 

Ros t" 7\\ i t~i i s Z\Rr* t~ v^n Qnnrh 
x_jv_/cj i-ciux lx cz> r\ i > v ox aiiopui l— C; X 

OTA 


778 

U09411 

Homo sairi prm 7 \t\c f i ncrpy nrnh n 7Kfl?l ~X 

O X X 

bo. o / -5 

779 

U49974 

Homo sapiens mariner transposase 

494 

77.670 

780 

D86974 

Homo sapiens KIAA0220 

1169 

97.238 

781 

Y12 713 

i v ju.o iiiu.ocu.xus f jlo — foi — au x rase poxyprOLein 

c /T a 
669 

79 . 699 

782 

AB014607 

Homo «!3n"i PtlQ VTADn^m r~»-r*^r* O "J n 
IIUUIU ocipJLCllO x\. JL t\r-\\J / W / piULclIl 

i q a 

75.3 09 

783 

M62626 

Homo sapiens homeobox protein 

264 

74 . 545 

784 

L01790 

Drosophila melanogaster potential zinc- 
xxiiytix uotudins centeicu at aa j._5_> ana aa 

•jo**/ _> JVL/a pi ULCIH , pULdLX Vc 

743 

48.230 

785 

AL080157 

Homo Qani pn<3 hi/nnhViPh i pa 1 nrot-pin 
x x*— /n \\j Dwpiciio iiypu LxlC Ll^dX piULclll 

o Z>Z 

"7*7 xC "3 O 

/ / .biz 

786 

L01790 

ivJ-UOW^Ulla lllclaliuy ao Lcl LJL-> L- till L. .Let X X IlvJ 

f increr domains cpntPTPd at aa i anH =i ^ 
364; 43 kDa protein; putative 

o / 3 

dO . 24 9 

787 

X83413 

Human herDp«ivi f% TTRft 

21 *7 
fx / 

/it no 

788 

X94082 

Xenopus laevis KLP2 protein 

497 

39 . 844 

789 

AB006628 

Homo sapiens KIAA02 90 

414 | 

44 . 898 

790 

X78932 

Homo sapiens zinc finger protein 

361 

53.043 

791 

U93909 

Cercopithecine herpesvirus 15 nuclear 
antigen EBNA-1 

137 

41.379 

792 

AB002304 

Homo sapiens KIAA03 06 

910 

73 . 729 


100 


,;t O O H H & 4-3 „, O a S it;',, O E! 


793 

AJT000517 

Hnmo d ;=» •»-> -J pna en n nnrorohol lav at-ia-vi^ *7 
xivjmixj £japj.ciio ojjJJ.uut,cI. euex IaX d LdXl d / 

8 90 

43 . 280 

794 

L2 2858 

nucleopolyhedrovirus AcOrf-91 peptide 


51 . 515 

795 

A09561 

synthetic construct human serum albumin 

1295 

99.487 

796 

U0002 9 

oauL-nai L^iuy l_ t_: t> v,ex e v loxdc liir^l /Cp 

16 9 

45 . 946 

797 

AB005803 

Hnrflft qan i on a \\\ a t~ "I H n no - >~ t /~» V» <*-r 1 t r/^> -y^-v a -i n 
nuuiu Oapicllo iUoLlUlHc i JLCJI1 yiyCOprOucin 

244 

43 . 038 

798 

U25725 

Homo sapiens AH antigen 

1128 

98.333 

799 

D614A7 

xiomo sapiens ine jviaauid j gene proauct 

xo icldLcu L-O d putative L . cic^aiiS gene 

pnrnHprl "in r^oQmi e\ A. O 2\ Q 
"suv-v vxcivx -i-ii l- w £3 1 1 1 jl yjL r'z^/iO • 

349 

69 . 512 

800 

AJ222801 

Homo fl3m PTIQ ripnt" r*a "1 anh i nrromvol nr^aac* 
numu oajjicua iicuui al ojuii xny uiliy ex Xllaac 

OCA 

73 . 214 

801 

AL035652 

Homo S3T11 PT1R T 1 . T£* T / t~ t~\r-\i~\ "i anmoraoo 
*Av>iiivy DapiCllD UU1UD .X \ LUpuioUlilCLaoc 

(DNA) I) 

xob 

92 . 000 

802 

S38742 

Homo saniens HOX11 hompftHoma i n-HDY 1 i 


74 .684 

803 

S70011 

Rattus so tricaTboxvlafp parri or* 

z> o 

A O 1 1 ft 

4 J . 119 

804 

AB028948 

Homo saoiens KIAA102 5 nrnf p-i n 

DU / 


805 

U95760 

DrosoDhila mplanoaaster Cnn 

X D 


806 

AF164610 

Homo Rani ona fS^icr nmhPi n 

•7QC 

/ o o 

55 . 399 

807 

AJ001189 

Homo saoiens oliaoohrpnin 1 

c; 

CO Q C tl 

oo« y 0 0 

808 

L20315 

Mus musculus MPS1 protein 

X D X / 

/ / . 0 93 

809 

AF132480 

Mus musculus Rqpo nrni-pin 


OO OCT 

0 9 . 05 7 

810 

D70831 

Homo saoi ^ns^i Zinr-f t nopr r*i "yr^t - «=» i n 

ri o 

O / . 090 

811 

M21582 

Trypanosoma cruzi chagas antigen 

177 

51.639 

812 

X65165 

Vnl VOY P3 Y~ t~ T'' "i pvt* Anc i n 
v V— ' -i- v V— 'vv v-cix L- C X X CA Lcilb Xli 

O Q T 

41 . 463 

813 

AF155095 

Homo sapiens NY-REN- 2 antigen 

379 

51. 773 

814 

rti v J Jlt U 

Mus musculus tip associating protein 

192 

77 . 500 

815 

A J dSi *± / a 

j-ixuxcum aescxvum proiine-ricn protein 

339 

30 . 876 

816 

Y18890 

Human endogenous retrovirus K env 

k/X UUClIl 

490 

60.345 

817 

U4 1107 

v-d.enornajDa.xcxs exegans jno aerinicion 

1 i rip f- iri H 
1J.11C JL LU Id 

308 

28 . 571 

818 

AF092091 

l\.aLLUO 11U1 VCLjlLiUO V» ty" -5 X 

4 / o 

46 . 667 

819 

AE001406 

Plasmodium falciparum predicted membrane 

aoouLictLcu pi ULclIl 

166 

38 . 136 

820 

AB020709 

Homo c; m t #a» ri <2 WT A&nQflO r\-rr\i-o-in 
numu oa^Jicnb IV X i-\jt\ \J 2s \J kyXt-JL-tiXIl 

yis 

55 . 516 

821 

X03725 

Mus musculus ORF 2 (466 aa) 

348 

52.414 

822 

Z70310 

v-dciiui iiduui uib eiegans kiiao . /d 

1729 

53 . 875 

823 

U353 76 

rxL^iiiLj £>apieni3 repressor uranscripcionax 
f actor 

54 7 

73 . 451 

824 

AL031177 

Homo aani pne HiTftR^Ml tr o fnnvpl nrnhoinl 

a *a a 

3 2 . 800 

825 

Y17832 

Human endocrenous retrovirne; K nnl 
protein 

X X / O 

c r\ 0 r\ 1 

o u . zu x 

826 

AJ010641 

Drosophila melanogaster Dof protein 

186 

24 . 042 

827 

AF045454 

Cavia DOrcellus nhosnhnlinaQp R 

X^i / 27 


828 

AB020678 

Homo sapiens KIAA0871 protein 

1181 

66.182 

829 

Y14318 

Homo sapiens peroxisomal ABC- transporter 

201 

56.250 

830 

AB014607 

"Uiuu oaJJXcllb J\d.nf\\J / \J / proLcin 

902 

77.436 

831 

AC003 007 

iivjiuu oapiciio uiij\ii(jwi i gene product, 
(partial) 

2180 

91 . 733 

832 

AC004893 

Homo sapiens similar to NEDD-4 
(KIA0093) ; similar to P46934 
(PID:gll71682) 

316 


833 

AL080091 

Homo sapiens hypothetical protein 

402 

59.223 

834 

M19538 

Cricetulus griseus RNA polymerase II 
largest subunit 

171 

27. 917 


"701 


i o o b 4":?;; ,„ o as & o la 


835 

M87049 

Escherichia coli similar to putative 
regulatory protein AtsB of Klebsiella 
pneumoniae 

1020 

100. 000 

QIC 


Rattus norvegicus serine/threonine 
kinase 

2896 

91.268 

837 

AF102855 

Rattus norvegicus synaptic SAPAP- 
interacting protein bynamon 

871 

58.621 

O O O 

atirio on 0 0 

Homo sapiens K1AA1099 protein 

1091 

64.583 

O .3 

MO O 1 A 

Homo sapxens unknown protein 

934 

67 . 143 

Oft LJ 


Mus musculus Plenty of SH3s; POSH 

2810 

87 . 500 

841 

U50157 

Homo sapiens cAMP-specif ic 
phosphodiesterase HPDE4D1 variant 

573 

83 .333 

ft4 o 

vn 0700 

nomo sapiens mucin 

219 

30 . 052 

O *i J5 

anno 100c 

Homo sapiens KIAA1009 protein 

756 

75 . 410 

844 

D50857 

Homo sapiens DOCK180 protein 

1779 

69.211 



Drosophila melanogaster RNA polymerase 
III second- largest subunit 

841 

69.110 

846 

AB018353 

Homo sapiens KIAA0810 protein 

191 

53.731 

84 7 

AJ011928 

Drosophila melanogaster Fidipidine 

377 

37.056 

848 

Z50144 

Rattus norvegicus kynurenine/alpha- 
aminoadipate aminotransferase 

337 

62.821 

84 y 

at nr\oi on 

AXj009197 

Schizosaccharomyces pombe putative pre- 
mRNA splicing factor ATP -dependent RNA 
helicase 

251 

47.619 

850 

U06713 

Rattus norvegicus SM-2 0 

750 

66.129 

OCT 

851 

Z70310 

Caenorhabditis elegans R11A8 . 7b 

1310 

57.487 

8 52 

U3 2 4 98 

Rattus norvegicus rsec8 

2904 

95 .238 

853 

U35376 

Homo sapiens repressor transcriptional 
factor 

1212 

78 .281 

854 

AF081941 

Rattus norvegicus soluble adenylyl 
cyclase 

230 

30 . 682 

obb 

At 1 0122 52 

Gallus gallus Coch-5B2 

626 

46 . 632 

0 cr c 

ABOz 7757 

Cicer arietinum NADPH oxidoreductase 
homo log 

886 

43 . 817 

Q C *7 

c> /4y Uz 

Homo sapiens P2U nucleotide receptor 

197 

34 .375 

858 

D50928 

Homo sapiens The KIAA013 8 gene product 
is novel . 

539 

38 . 971 

OCA 

Xb 7704 

Drosophila melanogaster sperm protein 

196 

38 .393 

860 

X99145 

Canis familiaris overexpressed in 
thyroid tissue after TSH stimulation 

1278 

84 .100 

861 

AJ006278 

Mus musculus 

acetylglucosaminyl transferase - 1 ike 
protein 

1759 

70.637 

0 O z. 

AL U U /obi 

Arabidopsis thaliana Hypothetical 
protein 

247 

27 . 155 

0 0 0 


Homo sapiens alpha -2 chain precursor (AA 
-25 to 1018) (3416 is 2nd base in codon) 

1095 

100 . 000 

ft £4 


Rattus norvegicus putative integral 
lutjiilux cine transport protein 

1254 

53.867 

865 

X66957 

Homo saoiens hexokina^f* f vnp 1 



866 

AB007938 

Homo sapiens KIAA0469 protein 

296 

31.841 

867 

L20303 

Gallus gallus actin filament-associated 
protein 

230 

41.758 

868 

L08240 

Homo sapiens located at OATL1 

484 

40.183 

869 

D86947 

Pseudomonas aeruginosa chemo tactic 

186 

26.923 


nox. 
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transducer 



870 

AF043697 

Caenorhabditis elegans contains 
similarity to NAD(P)H oxidases 

307 

38. 889 

871 

Z46861 

Saccharomyces cerevisiae Met30p 

179 

37.209 

872 

U09413 

Homo sapiens zinc finger protein ZNF13 5 

538 

51 . 049 

873 

U95760 

Drosophila melanogaster Sno 

1513 

50 . 763 

874 

AL022165 

Homo sapiens dJ71L16.5 (KIAA0267 LIKE 
putative Na(+)/H(+) exchanger) 

1207 

73.462 

875 

AF038615 

Caenorhabditis elegans No definition 
line found 

272 

23 . 145 

876 

U72194 

Mus musculus muskelin 

1156 

90.722 

877 

M62324 

Homo sapiens modulator recognition 
factor I 

828 

75 . 401 

878 

AB023202 

Homo sapiens KIAA0985 protein 

397 

48 . 649 

879 

AL117472 

Homo sapiens hypothetical protein 

905 

94 . 904 

880 

AC006017 

Homo sapiens similar to ALR; similar to 
AAC51735 (PID:g2358287) 

2584 

98.718 

881 

AF079974 

Mus musculus Rac GTPase-activating 
protein 

2441 

86.998 

882 

Z83868 

Rattus norvegicus serine/threonine 
kinase 

2896 

91.268 

883 

U09367 

Homo sapiens zinc finger protein ZNF136 

761 

65 . 089 

884 

Z19555 

Unknown predicted using Genefinder; 
similar to Propionyl -CoA carboxylase 
beta chain; cDNA EST EMBL : 

2310 

65.483 

885 

LI 13 16 

Mus musculus ect2 

1642 

97 . 287 

886 

AC006123 

Homo sapiens KIAA0616 protein 

307 

33.831 

887 

X76013 

Homo sapiens glutaminyl- tRNA synthetase 

4984 

99 . 597 

888 

AB020721 

Homo sapiens KIAA0914 protein 

852 

50.177 

889 

X83413 

Human herpesvirus 6 U88 

205 

30 . 769 

890 

AB019003 

Mus musculus MRP5 

609 

50.691 

891 

U67056 

Acanthamoeba castellanii myosin I heavy 
chain kinase 

363 

35 . 125 

892 

U4 7661 

Lupinus luteus proline -rich protein PRP2 
precursor 

184 

40 .397 

893 

AL0502 98 

Homo sapiens hypothetical protein 

1751 

98 . 881 

894 

AB020718 

Homo sapiens KIAA0911 protein 

611 

63.265 I 

895 

AB017498 

Homo sapiens Lipoprotein Receptor 
Related Protein 5 

1453 

75 . 962 

896 

AB001735 

Mus musculus ADAMTS - 1 

309 

34 . 783 

897 

X78933 

Homo sapiens zinc finger protein 

582 

58 . 333 

8 98 

AC006 942 

Homo sapiens Human alpha -adapt in A 
homo log 

423 

78 . 571 

D Q Q 

m Q Q Q A 

Homo sapiens similar to ankyrin of 
Chromatium vinosum. 

395 

32 . 231 

900 

L20302 

Gallus gallus actin filament protein 

244 

28 . 082 

901 

X87224 

Canis familiaris ribosome receptor 

5288 

81.853 

902 

AB014 56 8 

Homo sapiens KIAA0668 protein 

558 

75 . 940 

yuj 

xy 084 y 

Gallus gallus polybromo 1 protein 

2365 

95.767 



TJ<*"\m<**\ cr? ^ t"% ~i art cs XT T" 7V 7V f\ Q C 
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z y . on 

905 

U07609 

Rattus norvegicus brain specific Na+- 
dependent inorganic phosphate 
cotransporter 

383 

90.909 

906 

AB011127 

Homo sapiens KIAA0555 protein 

2001 

60. 036 

907 

X90587 

Homo sapiens Rod cGMP phosphodiesterase 

513 

60.221 
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908 

X69063 

Mus musculus erythroid ankyrin 

475 

33.813 

909 

AB029016 

Homo sapiens KIAA1093 protein 

499 

40 . 549 

910 

AF117754 

Homo sapiens thyroid hormone receptor- 
associated protein complex component 

497 

43 . 627 

911 

X58288 

Homo sapiens protein- tyrosine 
phosphatase 

5123 

99.870 

912 

D4 9489 

Homo sapiens human P5 

274 

50 .000 

913 

U683 8 0 

Gallus gallus csdp 

1158 

61 . 888 

914 

D78255 

Mus musculus PAP-1 

1261 

92.821 

915 

U13642 

Caenorhabditis elegans exon 5 similar to 
transmembrane domain of S. cerevisiae 
zinc resistance protein 

351 

38 . 860 

916 

X73874 

Homo sapiens phosphorylase kinase 

1161 

93.137 

917 

X52127 

Mus musculus domesticus testis-specif ic 
protein, clone 46 

244 

34 . 746 

918 

L36434 

Mus musculus basic domain/ leucine zipper 
transcription factor 

225 

39.860 

yiy 


Rattus norvegicus glutamate receptor 
subtype delta— 1 

Q A Q 

It .4.3 1 

y.«2 U 


Homo sapiens zinc finger protein ZNF13 5 

*7 "7 Q 

/ / y 


921 

U73200 

Mus musculus pll6Rip 

3211 

88.328 

922 

AC004528 

Homo sapiens R32184 3 

2194 

95.342 

923 

AF067165 

Homo sapiens zinc finger protein 3 

4 91 

62 . 992 

924 

U66707 

Rattus norvegicus densin-180 

271 

31.832 

925 

U41538 

Caenorhabditis elegans proline rich 

226 

36 . 066 

926 

AF071544 

Spinacia oleracea ribulose-1, 5- 
bisphosphate carboxylase /oxygenase small 
subunit N-methyltransf erase I 

195 

30 . 147 

927 

Z95334 

Schizosaccharomyces pombe hypothetical 
protein 

2386 

52.191 

928 

AF007872 

Homo sapiens torsinB 

235 

43 . 011 

929 

AF151847 

Homo sapiens CGI- 89 protein 

796 

56 . 784 

930 

AL031848 

Homo sapiens dJ202O8.1 (novel rat Espin 
LIKE protein containing Ank repeats) 

171 

41.304 

931 

AF0533 56 

Homo sapiens leucin rich neuronal 
protein 

12 5 9 

33 . 490 

932 

U37501 

Mus musculus laminin alpha 5 chain 

449 

53 .289 

933 

AL031447 

Homo sapiens dJ12 6A5.2.1 (novel protein) 
(isoform 1) 


26 .3 70 

934 

AJ010901 

Homo sapiens MUC4 

2 76 

2 5 . 872 

y j b 

ajou / jyb 

Homo sapiens QA79 membrane protein 

C A 1 

£Z O /TOO 

oi.o yy 

936 

AF064254 

Homo sapiens very long- chain acyl-CoA 
syntnetase nomolog l; VLiCS-Hl 

454 

81. 720 

yj / 


Escherichia coli ORF f246; alternate 
name ygg£» ; or to ox. ax^uo 


Q A C O 1 

938 

U37248 

Homo sapiens alpha-mannosidase 

363 

52.113 

y 3 y 

t o m n o 

Rattus norvegicus developmental ly 
regulated protein 

b jy 

/b . Job 

Oil rv 

nDU L> / O _5 

TJy— \TT\ y— y c°* T~\ ^ w— i r-y \T T 7\ A {°\ A "5 ^ 

rlOniCJ bdpicllis I\.lrtAU4 —> 3 

QQQ 


941 

U22815 

Homo sapiens LAR- interacting protein la 

976 

68.254 

942 

AF043695 

Caenorhabditis elegans similar to the 
protein phosphates 2c family 

259 

30.435 

943 

D86560 

Schizosaccharomyces pombe 
carboxypeptidase Y 

183 

31.783 


X O U B H-6 *+3 .., O id! €t O 2 


944 

AB015289 

Gallus gallus BASH 

191 

42 .857 

945 

X65165 

Volvox carteri extensin 

235 

39.806 

946 

AB023216 

Homo sapiens KIAA0999 protein 

1157 

79.661 

947 

AF010403 

Homo sapiens ALR 

180 

36.029 

948 

AF003535 

Homo sapiens ORF2-like protein 

176 

52 .459 

949 

Z99271 

Caenorhabditis elegans similar to zinc 
metallopeptidase (M8 family) ; cDNA EST 
EMBL:C07771 comes from this gene; cDNA 
EST EMBL:C09261 comes from this gene; 
cDNA EST yk259cl.5 comes from this gene 

495 

40.506 

950 

Y17920 

Drosophila melanogaster CALO protein 

741 

42.525 

951 

AF074086 

Homo sapiens polymerase 

2590 

84 . 774 

952 

D86983 

Homo sapiens similar to D .melanogaster 
peroxidasin (U11052) 

1173 

67.568 

953 

U13152 

Mesocricetus auratus guanine nucleotide- 
binding protein beta 5 

2174 

85.158 

954 

S58722 

Homo sapiens X- linked retinopathy 
protein {O terminal, clone XEH.8c} 

179 

76.316 

955 

AF084205 

Rattus norvegicus serine/threonine 
protein kinase TAOl 

2886 

98 .670 

956 

Y11710 

Homo sapiens collagen type XIV 

1892 

100.000 

957 

AC004893 

Homo sapiens similar to NEDD-4 
(KIA0093) ; similar to P46934 
(PID:gll71682) 

316 

80.000 

958 

X52046 

Mus musculus type III collagen 

260 

32 .215 

959 

AF149422 

Homo sapiens unknown 

329 

42 . 537 

960 

AF037364 

Homo sapiens paraneoplastic neuronal 
antigen MAI 

289 

36 .810 

961 

U15784 

Mus musculus Shcp52 

216 

45.098 

962 

AB029290 

Homo sapiens actin binding protein 
ABP620 

1780 

100.000 

963 

AF057140 

Homo sapiens cargo selection protein 
TIP47 

511 

38 . 095 

964 

U29156 

Mus musculus involved in signaling by 
the epidermal growth factor receptor ; 
Method: conceptual translation supplied 
by author 

1321 

95.516 

96 5 

X6 7863 

Mus musculus T2 

182 

37 . 895 

966 

AB029014 

Homo sapiens KIAA1091 protein 

9111 

99.852 

967 

AB014568 

Homo sapiens KIAA066 8 protein 

558 

75 . 940 

968 

AF017777 

Drosophila melanogaster misato 

654 

30 . 948 

969 

AF071186 

Mus musculus WW domain binding protein 
11 

230 

35.915 

970 

AC004865 

Homo sapiens similar to KIAA0319; 
similar to AB002317 (PID :g2224579) 

1715 

93.857 

971 

AF109719 

Mus musculus BAT2 

1132 

36 . 805 

972 

AC007019 

Arabidopsis thaliana hypothetical 
protein 

648 

37 . 183 

973 

AF003739 

Caenorhabditis elegans No definition j 

IXUc 1UUI1U > 

412 

51 .402 

974 

AJ000517 

Homo sapiens spinocerebellar ataxia 7 

890 

43 .280 

975 

AF016252 

Rattus norvegicus Spinophilin 

2415 

98.947 

976 

Z70208 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen 

165 

34 .459 

977 

AJ010482 

Homo sapiens Myopodin protein 

699 

36 .627 
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978 

U92072 

Rattus norvegicus m- tomosyn 

1148 

94.505 


979 

Z74037 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen 

203 

30.366 

980 

U55816 

Rattus norvegicus furosemide- sensitive . 
K-Cl cotransporter 

729 

83.436 

981 

Z46259 

Saccharomyces cerevisiae N034 8 

273 

26.923 

982 

U2 63 97 

Rattus norvegicus inositol polyphosphate 
4 -phosphatase 

384 

28 .378 

983 

AB002320 

Homo sapiens KIAA0322 

930 

94.231 


7\ -n f\ «"v r\ f- t-t r> 

AB02O6 78 

Homo sapiens KIAA08 71 protein 

1181 

66.182 

985 

AB026190 

Homo sapiens Kelch motif containing 
protein 

930 

47.975 

986 

X56044 

Mus musculus protein Htf9C 

400 

39.037 

987 

AF109907 

Homo sapiens S164 

317 

27.341 

988 

AB000215 

Rattus norvegicus CCA1 protein 

1516 

89.615 

989 

AF091457 

Rattus norvegicus zinc finger protein 
RIN ZF 

1474 

81.851 

990 

AC006193 

Arabidopsis thaliana Hypothetical 
Protein 

481 

32.155 

991 

D86966 

Homo sapiens similarto human ZFY 
protein . 

544 

46.023 

992 

U22818 

Cricetulus griseus mutant sterol 
regulatory element binding protein- 2 

729 

74.342 

993 

AB017614 

Mus musculus OASIS protein 

724 

64 .062 

994 

D87682 

Homo sapiens similar to a C. elegans 
protein encoded in cosmid T26A5 . 

1380 

80.135 

995 

M17921 

human herpesvirus 1 latency associated 
transcript (LAT) ORF-2 

266 

56.818 

996 

AF151821 

Homo sapiens CGI -63 protein 

180 

85 .294 

997 

AF12 8625 

Homo sapiens CDC42 -binding protein 
kinase beta 

5379 

99.509 

998 

U43194 

Mus musculus rhophilin 

1007 

45. 966 

999 

AF009039 

Homo sapiens synaptojanin 

180 

34.127 ! 

1000 

AF005036 

Mus musculus secretory carrier membrane 
protein 

584 

55.705 

1001 

U022 8 9 

Caenorhabditis elegans GTPase -activating 
protein 

650 

42.606 

1002 

X52 022 

Homo sapiens collagen type VI, alpha 3 chain 

412 

50. 758 

1003 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

322 

71.622 

1004 

AF032103 

Homo sapiens ataxin-7 

833 

44 . 702 

1005 

U56732 

Rattus norvegicus KRAB/zinc finger suppressor 
protein 1 

1459 

50.336 

1006 

AJ010949 

Mus musculus calcium channel alpha-2-delta-C 
subunit 

604 

49.444 

i n c\ i 
1UU7 


Homo sapiens protein- tyrosine kinase 

234 

89.583 

1008 

U28831 

Homo sapiens protein that is immuno- reactive 
with anti-PTH polyclonal antibodies 

2221 

95.263 

1009 

AF103939 

Homo sapiens echinoderm microtubule-associated 
protein- like EMAP2 

1283 

56.051 

1010 

U76992 

Homo sapiens Tat-SFl 

410 

48 . 000 

1011 

U93872 

Kaposi's sarcoma-associated herpesvirus ORF 73, 
contains large complex repeat CR 73 

164 

25.874 

1012 

Z75543 

Caenorhabditis elegans cDNA EST EMBL:M89063 
comes from this gene; cDNA EST yk384fl.3 comes 

250 

32.773 
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from this gene; cDNA EST yk384fl.5 comes from 
cms gene 





Homo sapiens protein tyrosine phosphatase 

996 

91 . 617 


y q a r a q 

Homo sapiens elastic titin 

1746 

82 .390 



Homo sapiens neuronal thread protein AD7c-NTP 

226 

69.811 

lUlO 

Ar l o / ybl 

Schizosaccharomyces pombe ubiquitin carboxyl- 
terminal hydrolase 

498 

30.311 

1017 

AF186461 

Rattus norvegicus ring finger protein Fxy 

53 

30.233 

i a i o 

ALU JbDZb 

Arabidopsis thaliana extensin-like protein 

222 

36 . 196 

XU Xzf 


Volvox carteri extensin 

223 

40.458 

1020 

X98411 

Homo sapiens myosin- IE 

192 

33 . 140 

1021 

Z81561 

Caenorhabditis elegans cDNA EST yk338f6.5 comes 
from this gene; cDNA EST EMBL:D752 96 comes from 
this gene 

279 

33.333 

T A O O 

Y17833 

Human endogenous retrovirus K pol protein 

813 

85.034 

102 3 

Z74 03 5 

Unknown similar to Zinc finger, C3HC4 type 
(RING ringer); cDNA EST EMBL:D32547 comes from 
this gene 

237 

22 .492 

t ao >i 

71 >i A O A 

^14 UZ U 

Nicotiana tabacum Pistil extensin like protein, 
partial CDS only 

188 

50.649 

t a o q 


Homo sapiens Unknown gene product 

187 

56 . 140 

1026 

X90849 

Gallus gallus polybromo 1 protein 

2869 

86.614 

102 7 

D13 8 96 

Rattus norvegicus cytoplasmic dynein heavy 
chain 

1642 

98 . 540 

l a o q 

TT A Q 1 T C 

Homo sapiens ORF1, encodes a 4 0 kDa product 

371 

50 .407 

i ao q 

X \J Z. 17 

TTO 1 C T Q 

Caenorhabditis elegans No definition line found 

278 

31 . 343 

1 ATA 

Tn o o c i 
/Zoo 

Homo sapiens KRAB zinc finger protein; Method: 
conceptual translation supplied by author 

331 

46 . 847 

1 ATI 

1*4 3 1 _L 

Homo sapiens KIAA0611 protein 

976 

65.339 

i ai o 

iiuj yz 

Human endogenous retrovirus K gag protein 

706 

43 . 983 


T O C A C A 

Homo sapiens serine/proline-rich protein 

521 

57 .396 

n a "2 /i 


Homo sapiens KIAA0286 

433 

31.897 


Ar Ujjb^b 

Homo sapiens leucine-rich glioma- inactivated 
protein precursor 

934 

52 .459 

103 6 

TTfifi Tft7 

Kautus norvey icus Qensin- ibu 

807 

84 . 049 

1037 

APft Q£TO Qfi 
t\C \J ;7 0 o _? o 

L/iosopni j.a meianogascer pusnover 

2 94 

33 . 333 

1038 

API 1Q1 pq 

KdttuB iiorveyicus uiyoiueyaiin 

2 08 

30 . 366 

1039 

Z11527 

Drosophila melanogaster CYS3HIS finger protein 

389 

51.754 

i n/in 

X Ufi u 

AO f J> Q Z 

Homo sapiens Human giant larvae homologue 

1057 

95 . 181 

1041 

Z97184 

Homo sapiens BING1 

784 

96 . 992 


Vft Q AO O 

Homo sapiens Not56-like protein 

1062 

90 . 217 


ADftl Q "1 O C 

Homo sapiens KIAA0782 protein 

202 

43 . 363 

1044 

AF155101 

Homo sapiens putative kruppel -related zinc 
finger protein NY-REN- 23 antigen 

740 

42 .388 


ArUUUUDo 

Aeropyrum pernix 246aa long hypothetical 
protein 

252 

41 . 818 

X. \J*± o 

AODIO j 

Volvox carteri extensin 

257 

46 . 250 

1047 

AC003080 

Homo sapiens Similar to KIAA0299; 60% 
simixarmy to ABUUzzyy (PiD :g2224539; 

2058 

99.068 

1048 

AL110151 

Homo saoiens hvioothf^t* i ral n-rnh <=k -i -n 


J /.ZOO 

1049 

U25116 

Dictyostelium discoideum cytoplasmic dynein 
intermediate chain 

187 

29.655 

1050 

D28863 

Anthocidaris crassispina dynein intermediate 
chain 3 

1166 

71.193 

1051 

X12609 

Homo sapiens anion transport protein (AA 1 - 

245 

45.390 


'l o o e «+£» "•i-;?;; ,., n a a is o a 




Q1 1 \ 

y XX ; 





Caenorhabdi tis elegans C26D10 . 4 

451 

33.028 

1053 

U82808 

Homo sapiens muscle- specif ic serine kinase 1 

233 

78.000 

1054 

AB002384 

Homo sapiens KIAA0386 

241 

25. 566 

1055 

AF117814 

Mus musculus odd- skipped related 1 protein 

984 

94.156 

1056 

AF151840 

Homo sapiens CGI- 82 protein 

363 

45.890 

1057 

AL117448 

Homo sapiens hypothetical protein 

1107 

74 . 180 

1058 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss -Prot; available at 
present via e-mail from IiABEIT@EMBL- 
Heidelberg . DE 

969 

98.658 


rlZb o X^J 

Oryctolagus cuniculus unknown protein 

190 

36.220 

lubU 


Homo sapiens KIAA0855 protein 

520 

37.808 

X U O X 

MO *7 Q "7 O 

Homo sapiens DNA binding protein 

721 

71.852 

1062 

U22058 

Mus musculus ADAM 4 protein precursor 

689 

63.704 

106 3 

AF003622 

Drosophila melanogaster A-kinase anchor protein 
DAKAP55 0 

947 

62. 128 

10o4 

AF0173 68 

Mus musculus faciogenital dysplasia protein 2 

2115 

86.301 

1 A/- r 

luob 

X834 13 

Human herpesvirus 6 U88 

550 

55. 645 

1 f\ C c 

XUbb 

AF040642 

Caenorhabditis elegans contains similarity to 
transacylases 

800 

35.476 

1067 

U88157 

Rattus norvegicus PAM COOH- terminal interactor 
protein 10a 

541 

73.171 

T n C Q 

At lbbll / 

Homo sapiens NY-REN- 62 antigen 

1320 

69.470 

1 A/TQ 

AC 0 O 2 3 1 0 

Homo sapiens Unknown gene product 

187 

56 . 140 

1U / U 

Liuyxy o 

Homo sapiens trichohyalin 

239 

21.471 

1 ATI 

X U / X 

X57017 

Saccharomyces cerevisiae acetylglutamate 
kinase 

418 

27.020 j 

i mo 

Atsu^ o y / o 

Homo sapiens KIAA1055 protein 

394 

35.455 

X U / j 


murine herpesvirus 68 unknown 

234 

30. 065 

i nod 

XiX D D U / 

Sus scrofa f ormiminotransf erase-cyclodeaminase 

776 

90. 076 

1075 

t\X2\J 4 & zf Z? 1 

Homo sapiens KIAA1074 protein 

263 

40. 940 

J. \J 1 D 

VI C7CQ 
AID / D " 

Mus musculus U1RNA- associated 70-kDa protein 

142 

30. 075 

1077 

X61296 

Rattus norvegicus open reading frame 2 

309 

47. 794 

X U / O 

A PI Q 1 *T O *f 

AJf lolbz / 

Drosophila melanogaster BcDNA. GH0412 0 

699 

37.669 1 

X LF / 17 

Zi / X *± u o 

Saccharomyces cerevisiae ORF YNL132w 

1757 

65. 541 

1080 

Z47747 

Homo sapiens NFKB1 

322 

100.000 

XUoX 

TT C A ATO 

U50078 

Homo sapiens p532 

1085 

77.209 

T n o o 

AT ATI tn O 

ALiO^ 1918 

Homo sapiens b34I8.1 (Kruppel related Zinc 
Finger protein 184) 

893 

49.801 

1083 


Homo sapiens KIAA03 99 

805 

99.213 


at no o. o n a 

Schizosaccharomyces pombe putative 
translocation elongation factor-Tu fa mily 

800 

32.688 

1 A Q C 
J.UO J 

•rir U O X D D O 

Mus musculus delta tubulin 

61 

56.522 

1086 

AF181645 

Drosophila melanogaster BcDNA. GH12144 

207 

36. 567 

i n q *7 

xuo / 

AT r»OT T/l"7 

Schizosaccharomyces pombe hypothetical protein 

349 

21.492 

1088 

AB023216 

Homo sapiens KIAA0999 protein 

1157 

79.661 

iuo y 

At 13 0441 

Arabidopsis thaliana UVB- resistance protein 
UVR8 

347 

33.766 

1090 

AE001399 

■tJ.cioiiivjvaj.u.ui iai,L iparum vyl\r QOiuain protein 

(cyclic nt signal transduct . ) 

203 

34 . 058 

1091 

AF082556 

Homo sapiens TRF1- interacting ankyr in- related 
ADP-ribose polymerase 

299 

38.462 

1092 

S66427 

Homo sapiens retinoblastoma binding protein 1, 
RBP1 

603 

63.576 
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1093 

X51591 

Homo sapiens beta-myosin heavy chain (1151 AA) 

158 

26.846 

l nod 


Homo sapiens KIAA0337 

193 

60.294 

1095 

AB007950 

Homo sapiens KIAA0481 protein 

316 

43.931 


Ar 10 9 719 

Mus mus cuius BAT2 

1119 

36.714 

1 Q "7 

AB028 982 

Homo sapiens KIAA1059 protein 

1244 

74.900 


Ar 076183 

Rattus norvegicus cytosolic sorting protein 

Jr/\<w £> - 1 a 

1620 

95.849 

X v .7 

TTH A CDC 

Rattus norvegicus double -stranded RNA specific 
duenos xne aeaininase 

393 

— - 

68 .478 

1 1 on 

AO jl Jl j 

Human nerpes virus b UoO 

334 

40 . 909 

X X W X 


Rattus norvegicus Muncl3-4 protein 

4605 

87 . 377 



Homo sapiens antigen NY-CO- 8 

2056 

78 .465 

1103 

AF082664 

Gallus gallus interferon alpha/beta receptor 1 

242 

28 . 782 



bacteriophage lambda exonuclease 

886 

96 . 528 

1105 

U79142 

Sus scrofa putative inhibitor of apoptosis 

471 

39.024 

1106 

7\B012223 

Canis familiaris ORF2 

203 

52 . 941 

1107 

AB029014 

Homo sapiens KIAA1091 protein 

9111 

99.852 

1108 

D86214 

Mus musculus Ca2+ dependent activator protein 
for secretion 

1702 

93.141 

1109 

Z93393 

Unknown Similarity with snail BR-1 protein 
(Swiss Prot accession number Q25112) ; cDNA EST 
EMBL : D3 71 

673 

31.447 

1110 

AC005048 

Homo sapiens laminin beta precursor; similar to 
AAB92586 ( PID : g2 708 707 ) 

770 

40.625 

1111 

U93909 

Cercopithecine herpesvirus 15 nuclear antigen 
EBNA- 1 

172 

41.525 

1112 

AF031939 

Mus musculus RalBPl -associated EH domain 
protein Repsl 

2397 

95.968 

1113 

U72972 

Sus scrofa calcium/calmodul in-dependent protein 
kinase II isoform gamma-E 

1737 

98 .195 

1114 

AC005142 

Arabidopsis thaliana putative calcium channel 

296 

25.705 

1115 

AF0914 57 

Rattus norvegicus zinc finger protein RIN ZF 

468 

52 . 703 

"I 1 t c 

1116 

/A-L050019 

Homo sapiens hypothetical protein 

793 

89.041 

1117 

D8 928 5 

Mesocricetus auratus inter-alpha- trypsin 
inhibitor heavy chain 1 

1086 

35.727 

1118 

AC004594 

Homo sapiens Ca2+ dependent activator protein 
for secretion; similar to D86214 (NID :gl3 98 903 ) 

1891 

98.936 


AHOOz 3-<i3 

Homo sapiens KIAA0325 

3563 

99.821 

11^ u 

•7 Q D C Q C 

Z98595 

Schizosaccharomyces pombe hypothetical protein 

477 

28 . 155 

1121 

Z35719 

Unknown cDNA EST EMBL :D6 7419 comes from this 
gene; cDNA EST EMBL:C13853 comes from this 
gene ; cDNA 

487 

40 . 574 


Ui7D X,} 1 

Homo sapiens HPV16 El protein binding protein 

937 

70.270 

1 1 


Homo sapiens KIAA0246 j 

219 

32 .479 

1 1 OA 


Homo sapiens supported by human ESTs AA4124 02 
(NID:g2070990) NH44021 (NID : gll82 54 9 ) , mouse 
r*oi MUDbyjj \ jMiu : gibo /o9; , and genscan , 

223 

72 . 222 

112 5 


Mus musculus sphingosine kinase 

557 

50 .289 

1126 

AB020664 

Homo sapiens KIAA0857 protein 

507 

40.293 

1127 

AC007660 

Arabidons is thai i ana rMit~;=» t* l v<^ PMZv Vie»i -J raoa 

A Q1 
*± 91 

J / . Z. Ul 

1128 

M69183 

Plasmodium falciparum mature -parasite -infected 
erythrocyte surface antigen 

284 

51.923 

1129 

AF114378 

Mus musculus cypherl 

219 

44 . 144 

1130 

U88154 

Homo sapiens proline and glutamic acid rich 
nuclear protein isoform 

145 

40.000 


n o so a «4-3; ,„ o 2 2 & o e; 


-L -L -5 1 


Mus m\i sculus zinc finger ZF-12 

3 99 

50 . 394 

1 1 J> 

z*o 0 luo 

Caenorhabdi t is elegans F41E7 . 1 

187 

26 . 036 

1133 

Y17920 

Drosophila melanogaster CALO protein 

491 

35.798 


/ 

Homo sapiens uj1o2H14 . l vf iBULilN 1 liJce 

316 

36 . 757 

1 1 *1C 
J.JL J J 

tto no i *7 

i v »us muscuiua tiDriiiin-^ 

1020 

74 . 556 

1 1 

.rtxS U 1 U *5 ft / 

nus uiuscuius King J 

196 

34.286 

1137 

U13152 

Mesocricetus auratus guanine nucleotide -binding 
protein. Deca z> 

1165 

82 . 096 

X x 0 0 

VI A QAC 

riomo sapiens oriw proue m 

234 

52 . 941 

1 1 7Q 

t*»i 1 ^ r» 0 

Kdccus sp . uiNA. Dinaiiiy proLein a 

2 08 

41 . 026 

1 i >in 

114U 

ZVRH 1 Q7/IQ 
M_D U 1 OjflO 

Homo sapiens KIAA0805 protein 

987 

53 . 962 

1111 

21Ff>1 QflQO 
U170 0 / 

wus rn u. scuius mecaiioprouease-aisiriuegrin 

IIICX L.X 111 IJcLd 

1508 

80.385 

1142 

177 

M<^mrt cj t~4 4 on a rl.TQ Q QMl C O / n^tro 1 -r* ^-\t* o-i n \ 
XiWlllW adjJXCilo UJ007rixj . j \IlUVcl piUUclIl/ 

A "3 A 

4 J4 

32.800 

1143 

AF1 1??(1Q 

nuiiiv^ bdpieiis pi cpru- iuca j oit Dasic protein nornoiog 

ICQ 

54 . 000 

1144 

AFO 70 ^79 

nus muscuius lysosomal cranicKing regulator z 

fl y~ r\ 

96 0 

94 . 156 

114 5 

rir 1 0 1 0 *x 0 

JJi UoUpilXXa lUcXallugctotcl Dtl/lNrt. unl / j^D 

O *7 Q 

36 .42 9 

1146 

i-iu J XO O 

T-J fNim/^ cj a t*v ~i on o -f- • ^ ~l ^ >-J V» q o n V» ^ t*i ~> <~* a 
XT.*JIIHJ bapXcUb lULal aLulCD lOIl Kxllaoc 

ziy 

74 .419 

1147 

X03145 

Homo sapiens pot. ORF II 

233 

52 . 874 



Human endogenous retrovirus K pol protein 

807 

84 . 416 

1149 

X83413 

Human herpesvirus 6 U88 

314 

51.852 


C A *7 O 

A05472 

Rattus norvegicus ORF 3 

194 

46 . 053 

1151 

U40342 

Mus muscuius ninein 

1799 

76 . 903 

1152 

D90713 

Escherichia coli Hypothetical protein in hrsA 
3 1 region . 

727 

96 .639 

1153 

AE000464 

Escherichia coli formate dehydrogenase- O, major 
subunit 

894 

92 . 199 

1154 

D3 /918 

Escherichia coli Reverse transcriptase like 
protein 

427 

85 . 714 

nice 

D30 719 

Escnericnia coli ORF ID:o207#4 

843 

90 . 000 

1156 

D90704 

Escherichia coli Penicillin-binding protein 2 
vp^p-^s / . 

838 

99 . 174 

1157 

AJ002735 

Klebsiella oxytoca initiation factor IF2 -alpha 

1252 

93 . 953 

1158 

AF083501 

Macaca mulatta rhadinovirus 17577 latent 
nuclear antigen 

169 

44 . 086 

"I "1 C Q 
1137 

i\ ~c r\ a ~i a ~i tz 
Ar U4ibio 

Plasmodium chabaudi circumsporozoite protein 

237 

63 . 830 

llbU 

At Ub / O / J 

Homo sapiens DNA fragmentation factor DFF3 5 

369 

61 . 017 



Schi zosaccharomyces pombe putative 
pnopnoaiescerase-nucieocxae pyropnosp nacase 
precursor 

391 

31 . 746 

1162 

Y17793 

Mus muscuius Duttl protein 

1969 

61 . 075 

11DJ 

vft-i c c n 

nomo sapiens do- rvua. protein vaa 1-4 /o) 

1225 

62 . 581 

1 1 0 *± 

U'ilDDZ 

Kautus norvegicus neuroiigin z. 

1931 

98 . 662 

1 1 O J 

aRfiOd on 

Homo sapiens Kelch motif containing protein 

275 

46 . 789 

1 1 

HDD 


Unknown similar to Phosphoglucomutase and 
phosphomannomutase phosphoserine ; cDNA EST 

H<l*II_> Xj . LJ j O 1 O O 

719 

43 . 494 

1167 

AF079974 

i v iufc> uiuotuiuo rccivj iairasc~activatiny protein 

r\—i 

CO TOO 

53 .333 

1168 

U79716 

Homo sapiens Human Reel in 

74 5 

84 . 034 

1169 

AC008075 

Arabidopsis thaliana F24J5.4 ' 

167 

35 . 000 

1170 

X13916 

Homo sapiens LDL-receptor related precursor (AA 
-19 to 4525) 

766 

87.603 

1171 

X75296 

Homo sapiens TUP1 like enhancer of SPLIT gene 1 

256 

59.016 

1172 

U70935 

Peromyscus maniculatus reverse transcriptase 

163 

29.371 
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1173 

U22961 

Homo sapiens similar to human albumin, Swiss - 
Prot Accession Number P02768; Method: 
conceptual translation supplied by author 

543 

73.043 

1174 

AB020629 

Homo sapiens KIAA0822 protein 

508 

71.698 

1175 

D50927 

Homo sapiens The KIAA013 7 gene product is 
related to Arabidopsis thaliana protein kinase 
(TOUSLED) . 

390 

93 .846 

1176 

AF111423 

Xenopus laevis chromosome condensation protein 
XCAP-G 

177 

57.447 

1177 

AB023188 

Homo sapiens KIAA0971 protein 

362 

79.452 

1178 

Z70200 

Homo sapiens U5 snRNP- specif ic 200kD protein 

232 

51.899 

1179 

S61070 

Homo sapiens reverse transcriptase homolog=pol 
{retroviral element} 

453 

75.758 

1180 

AF046001 

Homo sapiens zinc finger transcription factor 

187 

59 . 016 

1181 

AB028981 

Homo sapiens KIAA1058 protein 

480 

64 . 078 

1182 

X01455 

Reticuloendotheliosis virus C end of pol 
protein (reverse transcriptase) 

237 

34 .314 

1183 

L13391 

Homo sapiens helix- loop -helix phosphoprotein 

482 

70.940 

1184 

AB011152 

Homo sapiens KIAA0580 protein 

185 

74.359 

1185 

AC006233 

Arabidopsis thaliana hypothetical protein 

175 

41.111 

1186 

X62681 

Gallus gallus limb deformity protein 

408 

57.377 

1187 

Z14093 

Homo sapiens branched chain decarboxylase alpha 
subunit 

739 

85.106 

1188 

X97999 

Homo sapiens transcription factor IID 

203 

37.008 

1189 

M85168 

Homo sapiens glycogen debranching enzyme 

251 

57.447 

1190 

X90569 

Homo sapiens elastic titin 

365 

75.610 

1191 

U80736 

Homo sapiens CAGF9 

743 

90.323 

1192 

U35376 

Homo sapiens repressor transcriptional factor 

678 

74 . 590 

1193 

U90215 

Rattus norvegicus polysialyltransf erase 

225 

59. 155 

1194 

AB011169 

Homo sapiens KIAA05 97 protein 

211 

96 .429 

1195 

U48736 

Homo sapiens serine/threonine-protein kinase 
PRP4h 

269 

71.667 

1196 

AF026169 

Homo sapiens SALF 

245 

82 . 500 

1197 

U93570 

Homo sapiens p4 0 

173 

37.895 

1198 

X13621 

Homo sapiens HNP-3 defensin (AA 1- 94) 

205 

71.795 

1199 

U51432 

Homo sapiens nuclear protein Skip 

280 

77.193 

1200 

AL050037 

Homo sapiens hypothetical protein 

355 

56 . 731 

1201 

AF076783 

Rattus norvegicus plasma membrane Ca2+ ATPase 
isoform lkb 

312 

73 .134 

1202 

AF085691 

Homo sapiens multidrug resistance-associated 
protein 3A 

626 

39.858 

1203 

AF069603 

Homo sapiens myosin light chain kinase isoform 
3B 

228 

70.455 

1204 

AB007945 

Homo sapiens KIAA04 76 protein 

529 

46 .269 

1205 

AF083037 

Homo sapiens A- kinase anchoring protein AKAP3 50 

292 

96 .000 

1206 

X01469 

Plasmodium lophurae hist idine- rich protein 

214 

45 .283 

1207 

AF145705 

Mus musculus T2K protein kinase homolog 

841 - 

77.381 

1208 

X78933 

Homo sapiens zinc finger protein 

407 

57. 732 

1209 

AF042169 

Homo sapiens putative ATP-dependent 
mitochondrial RNA helicase 

420 

65.421 

1210 

X06596 

Homo sapiens complement protein Cls precursor 

318 

58 . 065 

1211 

D38595 

Homo sapiens inter-alpha- trypsin inhibitor 
family heavy chain-related protein (IHRP) 

406 

65. 909 

1212 

L16558 

Homo sapiens ribosomal protein L7 

318 

81. 967 

1213 

Z79757 

Unknown Similarity to Candida CDC4 gene 

35 

42.857 
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(TR:E234056) ; cDNA EST EMBL : D2 7 6 9 9 comes from 
this gene ; cD 



X £. JL *i 

TTAO CO rt 
U*± S. Z> O \J 

Paramecium bursaria Chlorella virus 1 Pro- , 
Lys-ricn, papk IjOx) ; similar to wheat Pro-, 
■uyt* pxocein, beiiDanjc Accession Numoer 
X52472 

174 

29.661 

1215 

U93570 

Homo tjar^l P*Y\ Q nnh at*-! itcx r»1 Crt 

145 

37 . 778 

1216 

D50487 

Homo sapiens RNA helicase (HRH1) 

331 

58.416 

1217 

M20471 

Homo CZ T"V 1 PTia f'Tof^Vl'V'lvi T "i ♦» »»*V\ -r* A n — k 

nuiiii; odj_> -Leila cidcnrin Ilyflu- Cnain a 

292 

88 . 679 

1218 

AF049910 

Homo a "i c»t*i a T'Zif'f*! 
* iwiiiw aapicuo J. J. 

392 

85 . 333 

1219 

D80009 

Homo qani <=*na VTnani 0*7 

443 

77 . 174 

1220 

Z26653 

Homo sapiens laminin M chain (merosin) 

351 

79. 167 

1221 

M96264 

Homo sapiens galactose- 1-phosphate uridyl 
transferase 

252 

58.333 

1222 


no i no sapiens nypocnetical protein 

194 

73 . 333 

1223 

A6 9020 

HTll Hon!* i ^ "i £3 DDATPTi OTP 

485 

41. 808 

1224 

V004AR 

nomo sapiens aipna gloom 

466 

69. 725 

1225 

AL008 7? 

nomo sapiens auyoCj24.4 (SAAT1 (low affinity 
sodium glucose cotransporter (sodium : solute 

G VTDD nrf"Pr f ami 1 \r\ \ \ 
j tr ^ ■*- i— J- J- ctlllX J. y ) / I 

152 

41.429 

1226 

AL110218 

Homo QAni priQ Vi -\rr\r~\ a\- \ -r-w-^t- ^ A 
uu,llu c»ct.j^xdio XlyjJtJ L_Ilt; 1 — LVJdJ. protein 

287 

92 . 000 

1227 

U02032 

Homo ct saT*» "i on a v* t V*!*"* es /^vmo 1 v^4- -4 *a too-* 

i i"— ' 1 1 i^j octjL^x ciia x xuuouinai protein 

291 

56 . 977 

1228 

M13100 

rvc%uuL&o nui vcyAtuo unKnuwn protein 

196 

35 . 345 

1229 

M1214 0 

"wuiu oct^xtjiisa ciivcit/pc pr otein 

567 

46 . 067 

1230 

D30648 

xiwuiv^ scipiexia tidvoprotein suDunit or complex II 

440 

64 . 286 

1231 

AB020629 

Homo Q^ni pna WTZi2\f^QOO r-»v«y-\*- ^ -J « 

1 i^uiw oa^/xcuo i\.iaauo/4 protein 

125 

46 . 667 

1232 

M69297 

Homo sapiens ORF 3 

163 

33.333 

1233 

M624 1 9 

"us muse ux us ciacnrin- associated, protein 

214 

52 . 308 

1234 

A15293 

Homo sapiens Mature HSA 

473 

73.267 

1235 

U83 90R 
*-» t> j ^ \j t> 

Homo sapiens nuclear antigen H731 

307 

70. 588 

1236 

L28010 

Homo sapiens HnRNP F protein 

283 

85.714 

1237 

X534 14 

Homo sapiens L- alanine : glyoxylate 
aminotransferase 

244 

77. 778 

-L *l O O 


Homo sapiens non-lens beta gamma -crys tall in 
xxrLt± protein 

341 

46.552 

1239 

M19938 

Homo sapiens f ructose- 6 -phosphate, 2 -kinase : 
tructose-z, b -Dispnospnatasse 

185 

60.417 

1240 

X84157 

nomo sapiens suDunit or tne aimeric cap binding 
complex CBC 

405 

70. 652 

1241 

AB029008 

Homo o a t"\ t an o VXftTiT O Q C v^-*^^ *_ J _^ 

ntmio sapiens iviA/iiUoo protein 

331 

59. 223 

1242 

X01677 

Homo sapiens glyceraldehyde- 3 -phosphate 

ueayaroycnaSc 

467 

72.549 

1243 

U09367 

nomo bdpiens zinc ringer protein ZNF136 

467 

52.500 j 

1244 

AC006128 

ntmiu bdpiens riuman nomoiog or mus musculus wizS 
orotein 

559 

70. 732 

1245 

X15005 

uw " iu ocijl» xciis ^^t . idiiiiHiin-Dinciiny protein \aa l 
- 300) 

570 

68 . 595 

1246 

S63654 

MUS SO tVDP VTT rnl 1 aopti 

2 78 

50 . 000 

1247 

M10905 

Homo sapiens fibronectin 

172 

36.264 

1248 

D13635 

Homo sapiens KIAA0010 

566 

79 . 279 

1249 

AB028948 

Homo sapiens KIAA1025 protein 

553 

75.221 

1250 

D45131 

Homo sapiens basigin 

471 

65. 741 

1251 

AF015308 

Homo sapiens nucleolar protein 

311 

68.354 

1252 

AL049557 

Homo sapiens dJ773A18.2 (PROBABLE ATP - DEPENDENT 
RNA HELICASE P4 7 HOMOLOG) 

456 

58. 333 


:i O O O M-6 M-3 ... O a Ii2 6 O 2 


1253 

U8862 9 

xii«^Hiv_/ oa^XCilo pui y I lit; JL d fc> fci 11 tr X C_>Iiy ci 1 LU11 

factor EIjTjP 

171 

93 . 103 

1254 

AF098788 

Gallus aallus nucleair r^l mnHnl l n-hi ndi na 
protein 

T Q Q t~\ 


1255 

AF077207 

Homo sapiens HSPC021 

<<£ / j 

CI i C£ 
51 . frOO 

1256 

M13100 

Rattus norvegicus unknown protein 

ftf* 1 

Do . 2 3 U 

1257 

U93568 

Homo sapiens putative pl50 

206 

36 . 000 

1258 

AB014571 

Homo saoipnfl KTAA0fi71 T"»Y*r>t"**"t n 

obl 

58 . 009 

1259 

D16815 

Homo f?at">i en FAR — It* 


/ u . y b o 

1260 

AF167320 

Mus musculus zinc finger protein ZFP113 

640 

66.923 

1261 

Z95334 

O /~* V> ~i *y r~\ a Cx \~\ 2j Vrtmi/CD a T*ir\mV\o Viin^^ I" V* -1 «s *l v^-v^j" 1 **- 4 -r— 1 
O CUJ. iUoaH^nai uco |JUllixJfc: IiypOLilcUXCaX prOtcin 

OCT 

60 . 976 

1262 

X014 55 

Rpt" "i fill o^n<^ot~Vi#^l "i *i a vi vi i a f~* onH f\ 

protein (reverse transcriptase) 


36.364 

1263 


LicxoiiiuaiuLd iiid jui proucupnuopnuy lycdn 

232 

32 .468 

1264 

D63481 

Homo sapiens The KIAA014 7 gene product is 

T*^l ^f"pH t~0 aHflTI \r 1 \r/— 1 a pa 

lcxaucu lu aucny x y x cy^xaoc • 

1027 

57.597 

1265 

U4 1559 

uaciiuj. iiajjux Lib trj-tiyciiiy inu utii lnicion line lounu 


26 . 606 

1266 

U76846 

Arabidopsis thaliana ubiquitin-specif ic 

r»T*ot* O 3 C3 A 

jkyx. uucaoc 

217 

30 . 108 

1267 

D6 34 81 

nuuiu 0 a JJ l c 1 1 0 xjlt; xv X r\J-\. U 14 / ^cllc piOUULL IS 

related t~o adpnvl \/ 1 r ,, vr*l^«ip» 

102 / 

57 . 597 

1268 

X52235 


243 

60.870 

1269 

AF019082 

uul i.cxxa uuiyuui i_ x. x. viiuiCHU oLIalll cl^SC>Gl« ueCl 

1 inoDrotp in 

J- J-k^V^Jk/ X. ^-C ill 

OCT 
2 o A 

2 y . iso 

1270 

U4 93 79 

Homo sai)ien9 diacvlal vfprnl lei na«?p pn<?i 1 r»n rifSK" 

icq 

1 3 _7 

O O . J 

1271 

AC007228 

Homo cj 3*0 "i on BC5729^ O / n^rt* 4 a 1 ^ 

CQQ 

J J o 


1272 

AF091090 

Homo cianipnQ 1 1 nlcnnwn 

o n n 

Q"i i *a *> 
y j . jjj 

1273 

X694 90 

Homo QAtri on q f ifin 


y o . y 3 y 

1274 

Y12713 

Mi 1 0 mi 1 qpm 1 tic; pr-n-Pnl — HTTTOiaao 1 x at^ >*■ t- » -i n 
1 iuo iiiuouuxuo n w rux uuiraoc puxyjJi uucxll 


/ JL . DbU 

1275 

AF078933 

Homo sapiens WAIT-1 

403 

87.879 

1276 

Y16241 

Homo t^v H ones no Villi of~t"0 

llVJULvy OCIL/XCllO 11CJJUXCLL.C 


bb . jUb 

1277 

AB018348 

Homo qani one ICTAAflRfm nr^P"i n 

3 £. 1 

D± . U42 

1278 

AJ2 42 97 9 

Hnmn -J onQ TfTZiZin/l^l / O /l — „ 
nuuiu oapxcilo IMrVrtu *x 0 1 / ^*±o JpiULclli 

C Q O 

by 2 

b ± . 2 02 

1279 

D89660 

i\atuuo iiui vc^xuus {Jcx uax^uiuc aoocllujxy £dCLOx Z 

ZD / 

OJ . 3 3 3 

1280 

U67056 

Aran t~ hamnpha r* ^\ cfpl 1 Hi mvrr^c; -in T hoamr r» H -n 

nuaiiuiiaiuucua v — c* ^> i„ x. x. c* 1 1 _L _L ill yuo Xll 1 JIl"dVy \ 1 lri 1 11 

kinase 


2 y . 2 o y 

1281 

AF072810 

Homo saoiens transcription factor wctp 

Ton 

OA nnn 

O ft . UUU 

1282 

AB014607 

Homo saoiens KIAA0707 Drotpin 

DUO 

f* i . ob fi 

1283 

AF052831 

Trypanosoma cruzi unknown 

1J1 

i A ill 

JO . Ill 

1284 

AC003038 

Homo sapiens R3 0923 1 

575 

/ Z . _7 J/ O 

1285 

D25215 

Homo sapiens KIAA0032 

214 

33 . 010 

1286 

AF108843 

Homo saoiens on v nrotein 


j* . z z y 

1287 

AJ243460 

Leishtnania ma*ioir nrotponhoqnhool vran 

utAiJiiHiwiixtt ma j v^x. j^/x v^c^^/iiv/opiiu^-j x y uctii 

1 PA 

lof* 

Z 27 . ODl 

1288 

U8 94 3 9 

Bos taurus ubicmitin- 1 ilco nrntpin 

fx O 

ZD . b fx 1 

1289 

AL049759 

Homo saoiens dJ93 0L11 1 (similar to KTAAO"*, 9 7 ) 

ZOO 

Q A CI 
Ofl .D13 

1290 

X99211 

Drosophila melanogaster ubiquitin-specif ic 
protease 

589 

39.373 

1291 

X53556 

Bos taurus type X collagen 

252 

30.534 

1292 

AC002333 

Arabidopsis thaliana SF16 isolog 

180 

32.192 

1293 

Y17832 

Human endogenous retrovirus K pol protein 

213 

45 . 000 

1294 

AF071081 

Mycobacterium tuberculosis proline- rich mucin 
homo log 

192 

35.766 

1295 

Z79752 

Homo sapiens predicted using Genefinder; 
Similarity to Human RNA helicase 
(SW:P68 HUMAN); cDNA EST EMBL : 

188 

45.205 


"115 
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1296 

Z22968 

Homo sapiens M13 0 antigen 

929 

54.386 

1297 

AB007862 

Homo sapiens KIAA04 02 

1117 

82.126 

1298 


no mo sapiens xr^Uii / \ ori — lr con J a 

434 

37.611 j 

1299 

API ftfiQ9 ft 
nt X O \J j a* \J 

nuiuu octpiclls LyLllll dnia~ba 

295 

29 . 500 

1300 

.T1A7 \J \J S> \J J> 

v-ctt;iioi iiajjui lis eieyans jno uciinicion line tounci 

1156 

37 .434 

1301 

AL035311 

Unlcnovm l-evidence=predicted by content; 1- 
me tnoa-y ene t lnaer ; u ; i-eviaence ena; z - 

G V 1UC11LC — jj X t=U 

484 

36.564 

1302 

M92040 

fit" yonrtvl nppnt* YTihn a nnmi irahiia -» 1 r^V»o _ 1 1 fmaii 

loo 

31.319 

1303 

AF003535 

Homn f5^iT> i «i OT?F>> — 1 H r*»"»**ot" a t n 

o c c 
Z. bo 

40 . 777 

1304 

L31840 

i\aLLUj iiuJL vey i(„uo llUL-lcctl {JUL c t^OIIl£JX protein 

NUP107 

bl3 

85 . 714 

1305 

U97553 

murine hfiDesvirus 6fl unknown 

1?1 

3 / . 14 3 

1306 

X00318 

Homo sanipn<? anr>f Arri fin T-T rha i n 

D / U 

/3 . 2 76 

1307 

D30648 

Homo saoipnfl f lavnnrnfpin miV*mn "i t~ 0"P rnmnl TT 

Off 

DO A Q C 

1308 

AF010144 

Homo S3r>"i pn«? npnrnnal hhrpaH nrnhoin amn-MTD 

O *7/l 
Z /4 

54 . 321 

1309 

U01317 

Homo sanipnq R-oamma cil nhi n 

r r q 

70.4 92 

1310 

D79996 

Homo sapiens KIAA0174 

220 

81.395 

1311 

Y17267 

nuo iiiuouuiuo ujjicjux l, xii — louj uyauiny enzyine 

3 22 

57 . 143 

1312 

D63881 

Homo sapiens KIAA0160 gene product is novel. 

3156 

99.161 

1313 

D4 2150 

Lidiiua gaiius cnicKen cacuierin- / 

906 

84 . 146 

1 T1A 
X O X** 

mi ?1 An 

VIA. 14 u 

Homo sapiens envelope protein 

397 

47 . 794 

1315 

AF00797 O 

HidDiuopsi s cnaiiana contains similarity to 

iiiyoiaxii Heavy oricixri 

86 

23 . 864 

1316 

U6 6 R6 1 

no i no aapiens Jtruppei - reiacea zinc linger 
protein 

1607 

57 . 067 

1317 

AFftft 1 1 44 

n.V \J O X X ** *x 

XvaLLUo 11UX VcyiLUb UiiXnA 

277 

58 . 537 

1318 

M2 5 984 

wdxxvxa ydiiub dipxia z type i coxiagen 

180 

31 . 169 

1319 

AU U JU J 

rioiuo sapiens serine/ tnreonine protein Kinase 

427 

59.259 

1320 

AT.0PP6O"? 

/ii djjiuupms cnaiiana puLai.ive protein 

219 

43.590 

1321 

Y11145 

Pacifastacus leniusculus masquerade -like 
protein 

174 

27.919 

1322 

XX / o _> o 

Human endogenous retrovirus K pol protein 

840 

84 . 000 

1323 

XX r O j 

Human endogenous retrovirus K env protein 

598 

66.667 

1324 

VI 971 7 
X X / X J 

Mus musculus Pro- Pol -dUTPase polyprotein 

748 

49 . 770 

1325 

AJ004801 

Bovine herpesvirus type 1.1 immediate-early 
Lidiibd^civauor procein witn lxx linger (cell 

192 

32.168 

1326 

X99753 

Homo sapiens Arno protein (ARF exchange factor) 

217 

35.965 

1327 


noiiio bapieiis uiiiyurouroLdue aenyarogenase 

4 09 

72 . 917 

1328 

AF0 72 75 8 

iiiubtuiub lauLy dcia transport protein J ; 
FATP3 

2164 

80 . 653 

1329 

AJ006693 

Homo sapiens ultra high sulfer keratin 

182 

28 . 155 

1330 

Y10256 

Homo sapiens NIK, serine/threonine protein- 

JV X 1 let O Ci 

74 

22 . 989 

1331 

X79448 

Homo cani <^no t"vnp T 

464 

48 . 780 

1332 

AF071172 

Homo sani pnq 

o c n 

60 . 6 06 

1333 

U50193 

v.acuuiiiauuiLio Clcy alio wcdJs. t» XlllX XciX 1 Cy CO 
SP-YAD5 CLOAB (P3T746) hvnnfhpf n c*z* 1 nrnhPin nnrl 

k - J *^ • \ XT -J J ' i O / XiypULIlCLXUaX pi ULclll ctllU. 

to PIR:C48583 stress-inducible protein STI1 


31 . 579 

1334 

AB012223 

Canis familiaris ORF2 

228 

37 . 963 

1335 

AF106682 

Homo sapiens spindlin 

1009 

93 .168 

1336 

U07358 

Homo sapiens serine/threonine protein kinase 

718 

37.353 

1337 

AF016507 

Homo sapiens C- terminal binding protein 2 

786 

83 . 916 

1338 

L15309 

Homo sapiens zinc finger protein 

574 

74.138 
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1339 

AR0?407 I i 

f\I-> \J 4Lt ^ \J 1 J 

nvjuiu OClj^/liCliO O _L ^ w 

yuo 

47 . 826 

1340 

M12140 

Womo c?or*i"J cartel priv&l nno nrnfein 
nui i llj sajJlcwia ciivciupc piULclH 

t>U O 

54 . 615 j 

1341 

U09413 

Homo sapiens zinc finger protein ZNF135 

1034 

60.090 i 

1342 



699 

71 . 533 

1343 

M11902 

i v iu.y iiiuatuiub proiiric iicn Salivary protein 

221 

33 . 582 

1344 

D50464 

Mus musculus SDR2 

263 

28.636 

1345 

X57527 

Homo sapiens alpha l(VIII) collagen 

201 

33.125 

1 1 A C 
I jib 


Homo sapiens similar to zinc finger proteins; 
similar co /vAtuiyoo \riD:g2843i7i; 

240 

30 . 405 

1^4 7 

T.nA "icq 

Plasmodium falciparum 3 ' end. , gene product 

151 

24 . 519 

1 T^o 

1 o 

HJjU 70040 

ocm zosaccnaromyces pomoe similar to yeast ubLi 
mitochondrial inheritance and actin 

f**\rt~ f~\ ct lr o lot"* r"»n mrrv a n t o a ^ i /"\n ■n - ** - ^>'H' o *i 

v^y Luajvcic t un uitjaiimciuioii piutcin 

281 

22 . 651 

134 9 

MV_ U u Z J u J 

numu oopiciio \jM.xr Lvn ~* ^ iiKc 


47 . 525 

1350 


nui i iu isajL/xciia rv _2> _i_ o o _) .z. 

IbU 

4 J . 4o4 

135 1 

M63 595 

Aciiupuu xdtivia cAxpn.ci-1 type 11 collagen 

-) o o 

27 . 397 

13 52 


riytuAJacterium leprae putative protein syntnesis 

T n -i t"i ahi /^»n Fapf-or" 7 

J.11X ~L C* l_» -L 1 X X. Ca V x. 

1 /.Z 

35 . 780 

1353 

M25984 

Gallus gallus alpha-2 type I collagen 

180 

31.169 

13 54 

nil \J *± -7 / -J j7 

noiiicj &c*pieiifc» ao 7 jULiii . i isimiiar to KiAAUjy / / 

olj 

66 . 509 

1355 

Z24725 

Homo sapiens mitogen inducible gene mig-2 

1952 

91.437 

J- J O D 


Mus musculus envelope polyprotein precursor 

176 

34 . 375 

1357 

AF132180 

Drosophila melanogaster unknown 

338 

31. 792 

TOCO 

m i o o *3 

Homo sapiens similar to an actin bundling 
protein, dematn. 

399 

54 . 074 

T *J C Q 

AT T T T *"> O n 

Caenorhabditis elegans predicted using 
beneiinaer, preliminary preaiction 

208 

37 . 719 

X J O KJ 

/\J? l^OU^U 

Ldvid porceiius enaotnexiai nitric oxiae 

d \/n t" Via aa 

185 

30 . 986 

1361 

AC002310 

Homo sapiens Unknown gene product 

630 

52 . 151 

1 T. & 7 

J. J O Z. 

M£<±7 Ql 

Rattus norvegicus salivary proline~rich protein 

201 

32 . 308 

1 ^ 7 
-L -J O J 

AT.H7 17/17 

/-V1jI/w£ X / *x / 

oL/iiizobdccfidroinyces pomDe nypotneticai protein 

O f\ c 

205 

55 . 172 

1364 

X53581 

Rattus norvegicus ORF2 

165 

49.123 


XUo / XD 

Mus musculus vascular cadherin-2 

1317 

78 . 599 

X J D D 

TTft Tlli; 

riomo sapiens non-iens oeta gamma -crystamn 

T •» Ira rwrti-ni n 

iiA.c pxutciii 

244 

82 . 927 

1367 

U83590 

Rattus norvegicus PAR interacting protein 

1542 

68 .421 

i £ ft 

_L -j O O 

TTl O'lQft 
U1Z J J \J 

Lionmg vector pbporci oeca-gaiactosiaase alpna 

LycLy * — LUc 

176 

44 . 928 

136 9 

U013 51 

rnjiuu oapieim yiucocorticoiu receptor aipna-z 

/bo 

9 J . 443 

1370 

X51591 

Homo sapiens beta-myosin heavy chain (1151 AA) 

286 

53.271 

1371 

APOD Qfifift 

multiple sclerosis associatea retrovirus 

"i O "5 

4 7. 143 

1372 

AB002348 


7 r» n q 

OA Q Q d 

O** . o yo 

1373 

M30023 

nrf \r "i y*iiq APP9 

1^7 
ID / 


1374 

AL021918 | 

Homo sapiens b34I8.1 (Kruppel related Zinc 

Pi napy nTnfpi n i Q4 \ 

*■ -1-1 J-^jj w ±. kJ -L. W I — .L 1 1 J. O M / 

1474 

57.100 

1375 

Z49651 

oa^v^iicii. vjiiijy L-ciD Lcievxoiav KJt\.r lUKi jit 

ICQ 

loo 


1376 

AC002 365 

Homo <?anipnc! qpyt, 

ZOO 

A 7 Q Ql 

4« . yyi 

1377 

AB015440 

Rana catesbeiana alpha 1 type I collagen 

320 

2 9.470 

1378 

AF176688 

Rattus norvegicus sodium/calcium/potassium 
exchanger NCKX1 

191 

33.113 

1379 

Y15635 

Homo sapiens ABCR 

200 

42.391 

1380 

X75926 

Mus musculus abel 

2263 

55.857 

1381 

V01555 

Human herpesvirus 4 BYRF1 , encodes EBNA-2 

181 

31.707 
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1382 

AF015454 


D 13 

51 

.485 

1383 

X16468 


200 

29 

. 911 

1384 

D80004 


C A f\ 

b 4 U 

90 

. 000 

1385 

M11902 

1 »uo uiuo^uiito ^lUllllc licit ydll vaiy piULcin 

1 Q Q 

iyy 

33 

.793 

1386 

nft6969 

numu i>ctpj.t;iia similar to riuman zinc-nnyer 

147 9 

68 

.403 



jLJioccin, DKifiu \ri :UL*Uoy j 




1 TOT 
1 J O / 

nj n -5 -a c -a a 

Schizosaccharomyces pombe serine- rich protein 

176 

30 

. 061 

1 TOO 

1j O o 

SOI CTlOn 

Mus musculus zinc finger protein ZFP113 

789 

55 

.140 

1 o o 

1389 

Z68335 

Caenorhabditis elegans predicted using 

157 

35 

.514 



Gene finder; similar to collagen; cDNA EST 






fc.Mtjjj : jJo o 9o / comes trom this gene; cDNA EST 






EMBL:D6 92 98 comes from this gene; cDNA EST 






EMBIj:D6 9331 comes from this gene; cDNA EST 






Eji v i±3j_» . u / u j o o comes rrom cms gene 




i -ion 

Zibyjb 1 

Caenorhabditis elegans Weak similarity to 

199 

40 

.000 



nimeria cnromoosponain IPIR Acc. No. A45517) ; 






cuiMA Cit>i liMBLi : My 9^266 comes rrom this gene; cDNA 






EST vk^QRhQ R rnmPQ frnm t~ Vi -i « rrono 
■» y*v^7Jjjj • — > wutiico JLivJiii Liiio ytjuti 




1 OQ1 
lj 71 


Arabidopsis thaliana putative proline -rich cell 

197 

38 

.793 



wdii piotcin 





o / y^iu 

Mus sp . nuclear localization signals (NIjS) - 

184 

48 

. 077 



jjinciirig proLein=spoL i 




1393 

n j / z> z> jl 

Pseudomonas aeruginosa transcription regulatory 

156 

37 

.000 



Ul W l_ t- -Lll 




1394 

AF01 fifift7 

v-citiiicjx xiciijui iio eiegans oimnar co cucicuiar 

181 

35 

.652 



LUiio|jcu ( LUUCU 1UL L»y v_ . cxc^aUo CUlM/\ y K.D G *i . D 




1395 

M83822 

numw o o xcilo ucl^jc llJ^c pi UUclIl 

413 

82 

. 192 

1396 

AF085185 

mc aiiLiiaiuucjJa CaoLcllallll nyOBlil 1ft 

o o c 

42 

.400 

1397 

U87318 

Acuupuu lac vlo IN cl U Vw A. 

i c r\ r\ 
1500 

56 

. 041 

1398 

Y15491 

fuiiyu pyyiucieus leruiiin aipna protein 

973 

94 

. 161 

1399 



2351 

99 

.403 

14 00 

X75926 

i mo uiuo^uiUo ctiP^l 


55 

.857 

1401 

M18247 

rciinc icuAemia viius gay - poi precursor 

259 

36 

.792 



nolvttrDhPin rrPrRD 

^VJl jr Jt'i w uciii y n ou 




1402 

Z97340 

*"».i auxuujjo i o uiiaiiana CALCiiDlll Xlivt: piutclll 

144 

39 

. 344 

1403 

Y12713 

Mus musculus Pro- Pnl -rlTTTPn c?<=» nnl vnrnfpin 

1 O *i 

46 

.377 

1404 

S80119 

i\at.uuo *-*Jr * ic vcisc uidllo^I lptaoc nuulOluy 

"3 "7 Q 

3 /y 

53 

.608 

1405 

AF071172 

*■ l^iiiw oa^/iciio n Cj jl. 

O Q O 

88 

. 636 

1406 

AF134304 


i y u 

31 

. 967 

1407 

L06863 

viiLCLUiuu y i lbcUo type Vll COllaycIl 

164 

33 

. 333 

14 08 

AF032872 

Kattua iiuivcyicus potassiuni cnannei reguiacory 

22 9 

84 

. 091 



Drotein KChAP 




1409 

M37759 

Ml If? milRril 1 HQ cj <=i y* i np 1 nlf- -r- -» Vi"5 rrVi qi 1 1 -Ft i-k* r^v/^t- a -i -r-k 
iiiuo\*uxu£) senile i Ul Ll a iiiyii ouXllii LyiCjUtilll 

£ o4 

40 

. 336 

1410 

S74439 

Bombvx mori=ci"i 1 kwnrmq P#=»Y^»t* "i r^o Darhial All -> 
uwuiwyA inwii — oxxA.wuiiiiQf rcp^lUc rctl tldl ^ D J J cLa. 

A O 
4 Zo 

40 

. 314 



silk fibroin heaw chain / c*- 1~ ormi 1 \ 




1411 

D80009 

Homo sapiens KIAA018 7 

323 

57 

. 944 

1412 

M12140 

Homo sapiens envelope protein 

316 

71 

. 186 

1413 

M13 100 

i\ci u uuo xxv^i vcy l^Uo Li.1 1 jS.1 1U Wl 1 piULclll 

ZD J 

O 4 

. Dlo 

1414 

AF116463 

Streptomyces lincolnensis unknown 

219 

33 

.173 

1415 

U00039 

Escherichia coli No definition line found 

796 

97 

561 

1416 

AB017614 

Mus musculus OASIS protein 

1919 

92 

.652 

1417 

AF042379 

Homo sapiens spindle pole body protein spc97 

180 

25 

532 



homolog GCP2 




1418 

D87459 

Homo sapiens Similar to Volbox carteri extensin 

204 

35. 

766 


11U 
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(S22697) 



14 iy 


Homo sapiens helicase 

350 

52 . 713 

1420 

AB007871 

Homo sapiens KIAA0411 

668 

60.938 

1421 

Z4814 9. 

Saccharomyces cerevisiae similarity with H. 
polymorpha hypothetical protein in LEU2 region 

169 

32 .432 

14ZZ 


Homo sapiens salivary proline- rich protein 
precursor 

177 

33 . 077 



Homo sapiens putative RHO/RAC effector protein; 
^ jt> similarity co r^yzUD ifiu : gi jibbbU) 

5501 

99 . 166 

1424 

AlT0 04 fi 1*? 

xiwiiiw aapiciia iicuxupauiiy LaiycL coiciaoc 

lz05 

79 . 762 

1425 

AO ** J *± O 

Qa i tn 1 ri i no V»c» y >~\ f*- air-! y*i 10 O /~VD 1? 71 . tt>/~«t tt> 1 

odimiriine nerpesvirus z UKr /i ; hLLir 1 

O O 1 

221 

29 . 070 

1426 

Y08986 

Brassica napus oleosin-like protein 

190 

36.842 

1427 


iiomo sapiens cycoKeracin z 

372 

54 .348 

X *± D 


Mus musculus schlaf en4 

710 

47 . 012 

1429 

LSX J O O D 

fivjinu sapiens hiaauuij. 

3 74 

41 . 579 

1430 

AJ245569 

Mus musculus hypothetical protein 

249 

67.273 

14 .3 1 

Ar ljy /44 

Streptococcus pyogenes serum opacity factor 
precursor 

144 

31 . 304 


U ' Uo j 1 

Homo sapiens Zinc- finger protein 

710 

59 . 016 

1 ATT 
X *± -5 J 


nomo sapiens UKrii 

197 

42 . 667 

1 Aid 

/\r u 4 z 0 / y 

Homo sapiens spindle pole body protein spc97 
iivjuiux uy V3V_ ±r z 

534 

82 . 569 

1 4*1 C 

7AT OA QCOD 
/VLUt JO UO 

Arabidopsis thaliana extensin-like protein 

160 

37 . 662 

14 3 6 


riointi sapiens maauoi^ protein 

34 7 

73 . 973 

143 7 

JAU U1 J 

myLiius eauiis nongraaienc oyssai precursor 

223 

31 . 092 

1438 

U44898 

Homo sapiens SNAP4 5 subunit 

301 

54.237 

1 A 1 O 

14 .3 y 

Y O O "5 O "5 

Rattus norvegicus SNFl-related kinase 

308 

73 . 684 

1440 

AF097183 

Cryptosporidium parvum unknown 

238 

33 . 735 

1441 

AF 05 5 904 

Myxococcus xanthus unknown 

225 

29 . 082 

T A A *"> 

V C O COI 

Rattus norvegicus ORF1 

200 

40 . 506 

T A A "3 
14 4 J 

Ar 04 3 2 9 7 

Chlamydomonas reinhardtii poly (A) binding 
protein RB47 

192 

32 .484 

1444 

D29766 

Rattus norvegicus Crk-associated substrate, 
pi J U 

1853 

86 .420 

~\ A AfZ 
Xfx fi O 

M"5 TQ7Q 

Homo sapiens DNA binding protein 

734 

74 . 265 

X*± *± O 

A I? ft CTCCI 

Homo sapiens anti- Fas- induced apoptosis 

196 

63 . 462 

x*± ** / 

•ttxJ X 0 O X Z D 

mus muscuius immuniuy assocatea protein 3 8 

315 

48 . 387 

1448 

U64608 

Caenorhabditis elegans coded for by C. elegans 
cDNA ykl73cl2.5 

334 

34.706 

1 A A Q 

Apn en e /r 

Homo sapiens leucin rich neuronal protein 

886 

90 . 972 

1 a n n 

/■VDU 14 D o4 

Homo sapiens KIAA0684 protein 

209 

77 . 083 

14CI 


Homo sapiens KIAA07 50 protein 

1090 

50 . 146 

Xft D 

/£» / O Uo 4 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen; cDNA EST 
Hi1 v ij3Xj . x*o y / j\j cornes x ruin Luis yens 

151 

31 . 544 

1453 

U12134 

Homo sapiens RAD52 

223 

69.091 

1454 

U07973 

Gallus gallus alpha- 1 collagen type III 

134 

30. 597 

14 D D 

AT ft Q ni /IT 
AI1U0 U141 

Homo sapiens hypothetical protein 

1117 

67 . 293 

1 iCC 

14 DO 


Schizosaccharomyces pombe putative 
t ranscriot ion or .qnl i m' ncr fartnr 

162 

28 . 571 

1457 

U43 585 

Mus musculus protein kinase related to Raf 
protein kinases; Method: conceptual translation 
supplied by author 

280 

57.143 

1458 

AB029335 

Halocynthia roretzi HrPET-3 

238 

32 . 738 

1459 

U82982 

Cavia porcellus GEC-3 

227 

38.679 


717 


JL till O S 4-6 HhSS », Oi fi H & O ES 


1460 

Z19574 

Homo sapiens cytokeratin 17 

178 

61.364 

1461 

AB014521 

Hoino sapiens K1AA0621 protein 


o ft . o 

1462 

AJ131526 

Mus mu.scu.lns TEF-5 

1 KG. 
J- Do 

£ZQ O Q "3 

1463 

AB022927 

Orvctolaous cuniculus hvDemolari 7afir>n 

jr \«uvxa^ui9 ^uiij.<wuxuk> 11ypci.puj.a1 x^auiUii 

activated cation channel 

lot) 

Jl . ZoU 

1464 

U64608 

Caenorhabditis elegans coded for by C. elegans 
cDNA ykl73cl2.5 

334 

34.706 

1465 

X70944 

Homo sapiens PTB-associated splicing factor 

231 

* JOO 

1466 

AJ243460 

Leishmania major proteophosphoglycan 

221 

"^7 1 "7** 

1467 

U81788 

Drosophila melanogaster kinesin-73 

~j f 


1468 

U14635 

Caenorhabditis elegans similar to GABA and 
glycine receptors 

681 

34.925 

1469 

M24355 

Homo sapiens filaggrin 

518 

34 . 014 

1470 

M12140 

Homo saDiens pnvplonp nrofpin 

*X^7 f 


1471 

X97675 

Homo sapiens plakophilin 2b 

loo 

"7 *7 77Q 

1472 

AC004893 

Homo sapiens similar to NEDD-4 (KIA0093) ; 
similar to P46934 (PID -01171682) 

211 

54.688 

1473 

U42471 

Mus musculus Wiscott-Aldrich Syndrome protein 
homo 1 og 

226 

39.669 

1474 

AF001305 

Pneumocvstis carini \ f an carini "i nrni-paQp l 

1 QC 

JLjZ> 

Z / . ^OD 

1475 

X75931 

Bos taurus Cleavaae and Polvadpnvl at-i nn 
specificity factor (CPSF) lOOkD subunit 

£. ** fi O 

JO . OOU 

1476 

AF084521 

Homo sapiens brefeldin A- inhibited guanine 
nucleotide - exchange protein 2 

1552 

94 .286 

1477 

X83413 

Human herpesvirus 6 U88 

175 

41. 509 

1478 

AF164612 

Homo ^» r~i H pnq *=»TTvr#a>T nnp nrnhpi n 

O .3 ft 

/IT O O T 

ft J. . Z.2. JL 

1479 

AF115435 

Rat"t"iif? nnTVPfi 4 cn<3 a vm t" ^ v "5 x\ 17 

* v ^* AA^JL VCVj±LUi3 O X JL LuA JL 11 JL / 


O Z . j Jo 

1480 

AC002563 

Homo e; r-) -j pnq 1 1~ t" "1 VP PHO/'PTVO f" f pz-'hriT nrAt*o-J « • 
11VIIIU oa^/xcuo Jkv ULQLX V C IvTlV/ INJTVV-. cl Leu LUI UI UUci.Il , 

95% similaritv to P49205 ( PID • al345S60 ) 



1481 

D13644 

Homo sapiens protein related N-ternimus of tre 
oncogene 

238 

84.444 

1482 

AF009243 

Homo sapiens proline-rich Gla protein 2 

1 ~\ l 
~j ~> ~j 

Q4 lift 

1483 

D25538 

Homo sapiens KIAA003 7 

387 

7fi "^fiQ 

1484 

Y17137 

Mus musculus mCASK-A 

237 

79 . 167 

1485 

AE000789 

Borrelia burgdorferi B. burgdorferi predicted 
coding reqion BBI16 

218 

2 9.885 

1486 

AF146531 

Homo sapiens bridging integrator- 2 

175 

64 . 286 

1487 

AB023161 

Homo sapiens KIAA0944 protein 

566 

56 . 954 

1488 

AB018288 

Homo sapiens KIAA074 5 protein 

524 

3 9.837 

1489 

AL021929 

Mycobacterium tuberculosis PPE 

193 

37 . 500 

1490 

AB028998 

Homo sapiens KIAA107 5 protein 

386 

57 . 692 

1491 

AF124435 

Danio rerio p5 5 -related MAGUK protein DLG3 

776 

82 . 482 

1492 

X65546 

Mycobacterium leprae proline-rich antigen 

145 

44.186 

1493 

AB023178 

Homo sapiens KIAA0961 protein 

1615 

79.061 1 

1494 

Z46791 

Caenorhabditis elegans similar to coilagen 

188 

-> -> - O T« J7 

1495 

AF085185 ! 

Acanthamoeba castellanii Myosin- IA 

205 

3 5.811 

1496 

J05499 

Rattus norvegicus L-glutamine amidohydrolase 

1327 

79 . 259 

1497 

AL021492 

Unknown similar to Glycosyl transferases; cDNA 
EST EMBL:D3 3 950 comes from this gene; cDNA EST 
EMB 

246 

3 3 . 14 3 

1498 

U27459 

Homo sapiens hORC2L 

213 

60. 000 

1499 

U50078 

Homo sapiens p532 

333 

88.679 

1500 

AE000351 

Escherichia coli orf, hypothetical protein 

899 

98.473 

1501 

X69089 

Homo sapiens 165kD protein 

374 

62.766 ! 

1502 

L19201 

Escherichia coli glutamine synthetase 

1255 

93.564 


11* 


.1, Hi O S M-6 1+3 . O 2 ES 6 O 2 


1503 

D90846 

Escherichia Col T 2X<~*t*H "Fl avi n vdc i c? t~ <~* "nv/ot* o *i n 

F (EnvD protein) 

1 £ 1 Q 

yo . ol 7 

1504 

AE000248 

Escherichia, coli nersistencp t~ o inhihiti r»n r-k-F 
murein or DNA biosynthesis, DNA-binding 
regulator 

"I 1 R 

iff* . J4U 

1505 

D90730 

Escherichia coli MukB protein 

1416 

94 . 515 

1506 

D90709 

Escherichia coli Yhhl protein 

784 

90.152 

1507 

X69089 

Homo sapiens 165kD protein 

196 

70.455 

1508 

AB004659 

Acidiphilium multivorum ArsB 

DZX 

on /i o o 

1509 

X73143 

Escherichia coli NikA 

O O Zj 

. o z u 

1510 

L10328 

Escherichia coli f772 

641 

90.909 

1511 

D90731 

Escherichia coli Asnaraainvl _t*Rwa qvnhhpfaao 
(EC 6.1.1.22) (asparagine-tRNA ligase) 
(asnRS) . 

*± O 3 


1512 

X57560 

Escherichia coli pspB protein 

294 

86 . 000 

1513 

D90716 

Escherichia coli HvT}Ot~h*=»t" i r*p*1 R4. ^ lrH nrrti-o-in 

in lpd-3 5 1 region (orf2) 

■71 c 

/lb 

.73 7 

1514 

D90748 

Escherichia coli Virulence membrane protein 
phoQ . 

Oft ft 

on too 

1515 

X16531 

Escherichia coli oxyR gene product 34kD protein 
(AA 1-305) 

853 

91. 096 

1516 

AB011180 

Homo sapiens KIAA06 08 protein 

JO u 


1517 

AB012725 

Mus musculus zinc finger protein 


JL . *± ft / 

1518 

AB018274 

Homo sapiens KIAA0731 protein 

COT 

O / . 5 00 

1519 

AL035632 

Unknown /Drediction- ( mp t~ VioH - » " rr^-n ^ -F S nHo >- " » 
version : " "084 " " , score : " " 113 . 36 " " ) ; 
/prediction= (meth 

C "7 "3 

59 . 02 8 

1520 

Z11518 

Homo sapiens hist idyl- tRNA synthetase 


1UU . uuu 

1521 

U08350 

Sus scrofa E-selectin 

187 

30.392 

1522 

AF007170 

Homo sapiens unknown 

"1 C 

-L Dft 

O C Ti x 

85 . /14 

1523 

AF160798 

Rattus norvegicus calcium transporter CaTl 

1 CI 1 

90 . 04 1 

1524 

D63478 

Homo sapiens The KIAA0144 gene product is 
novel . 

155 

84 .615 

1525 

AJ133120 

Rattus norvecricus Proline t~i r-Vi qvnancjp 
associated protein 2 

ullo 

Q f\ 1 Q O 

1526 

A58331 

Homo sapiens unnamed protein product 

831 

QQ TTC 

Zj j . JL X D 

1527 

U39060 

Mus musculus glucocorticoid receptor 
interacting protein 1 

219 

O "* . U3X 

1528 

U71273 

Sus scrofa glucosidase II 

704 

60.667 

1529 

Y00826 

Rattus norvegicus gp210 (AA 1-1886) 

258 

Ol . *4 *4 

1530 

U66707 

Rattus norvegicus densin-180 

558 

60.432 

1531 

M18907 

Homo sapiens nifedipine oxidase 

654 

81. 967 

1532 

AB020716 

Homo sapiens KIAA0909 protein 

318 

QQ COO 
O _7 . 3 O O 

1533 

AF156856 

Mus musculus cytosolic sialic acid 9-0- 
acetylesterase 

26 8 

7t> Ql *7 

1534 

U86074 

Homo sapiens tesmin 

662 

/ O . O / / 

1535 

Z19550 

Homo sapiens N-acetyllactosaminide beta-l,6-N- 
acetylglucosaminyl transferase 

33 9 

CO TOO 
. JL Z O 

1536 

AL096749 

Homo sapiens DKFZp434G153 

271 

36 . 522 

1537 

AF156529 

Mus musculus Msx2 interacting nuclear target 
protein 

179 

100.000 

1538 

AB009024 

Homo sapiens capping enzyme IB 

157 

100.000 

1539 

U13019 

Caenorhabditis elegans No definition line found 

426 

56.731 

1540 

AB002370 

Homo sapiens KIAA03 72 

263 

97.436 

1541 

U39940 

Sinorhizobium meliloti choline sulfatase 

254 

35.417 


HI 


:t ci o s 1+3 ,„ o a a & o s; 


1542 

D78572 

Mus musculus membrane glycoprotein 

747 

65.000 

1543 

M63180 

Homo sapiens threonyl - tRNA synthetase 

174 

42.500 

1544 

Z68011 

Unknown Similarity to Xenopus F-spondin 
precursor (PIR Acc. No. A47723) ; cDNA EST 
EMBL:D33135 comes 

191 

33 .708 

154 5 

U37373 

Xenopus laevis up -regulated by thyroid hormone 
in tadpoles; expressed specifically in the tail 
and only at metamorphosis; membrane bound or 
extracellular protein; C- terminal basic region 

387 

81 .250 

1546 

AL096768 

Homo sapiens dJ858B16.1.2 (KIAA0542 (isoform 
2/ ) 

176 

83 .333 

t C A O 

7A ITA O O QO 

Mus musculus Sec8 

194 

93 . 939 

1 C A ft 


Caenorhabdi tis elegans cDNA EST EMBL:D32 6 93 
comes from this gene; cDNA EST EMBL:D3 54 05 
comes x x oiTi Lnis gene; cuv*j\ bbi yK^u /ciu . d comes 
from this gene 

510 

52 . 632 

1549 

AF023261 

Human endogenous retrovirus K pol -env 

412 

54.167 

1 CCrt 

Id jU 

TTC Q O O "7 

ut> yz o / 

Gallus gallus paranemin 

162 

57 . 895 

lDDl 

AL UUz jiz 

Arabidopsis thaliana putative NAD (P) -dependent 
cnoiesteroi aenyQroyenase 

303 

39 . 412 

n c co 


uixKixown id lmiiaricy co i ease iai - JDinamg nomoiog 
/ \ o w . i tDf i I xji-io i / ; cuiNA xiio l jioxojli : Do i jlz Ht comes 
from 

T r Q 

34 . 375 

1553 

Jrxi. \J V t _J _/ _> 

1 IUo IIIIX£3 LI X UO 1NOL/1 piULclil 

& a n 

i7U . DZD 

1554 

API 

Homo a o pn □ CC2 T-1 1 Q nvnt* ^ t n 
ni^uiv^ oa]Jiciio v_»vjx ~ x X _7 j^xULclll 


DA "5 O C 

1555 

11-/ / QO£ 

rvd. L. L. UO vcy ILUo IJcUa alalllllt: byHLXlaoti 

7 K 

on ri ftft 
O U . UuU 

1556 

r\c u o / z^D 


ont; 

C C TOO 

1557 

ABO 1 ft 2 74 


4Q7 

CO Q C O 

1558 

X/O v> D \J \J 

Homo c3^oi pnq MHTPWA 

TAT 
J CI / 

1UU . UUU 

1559 


HUG tllUi o t_ Ul X U t> Vorovj-d 

J ZO 

ao rtc Q 

1560 

U09413 

TTomo a -r~k H pri a 7 t np f i nrtPY" r-\ y~/~\ t~ q i n "1 "3 ^ 

nWI 1 1U OClpXCJlO ^tlilL JL JLll^jCL pi vJLcXIl ijlM 17 X J Z> 

O X o 

CP T O C 
DO . j / O 

1561 

L2 816 7 

nuo iiiLio ix x z>±iiv> i xn^cx yivJUciit 


Oft OA 7 

1562 

X91788 

Homo sapiens Icln protein 

639 

100.000 

X J O J 

x/o / x z / 

nvjiiivwP bapicim Lxaiis lULd i_ lull protcXu-i 

con 

co one 

1564 

U17342 

Thermus sp. serine proteinase 

262 

41.121 

T CC. C 
X DO D 

rsct n n n c 

xiomo sapiens kiaauio J 

55 

A C C C ^7 

46 .667 

X D O O 

QRm A C QA 

TJ/^m/^ o o t~\ "i on o V X 7V A f \ C Ck A *\ v>/>f- ^ -i 
XlvJlllvJ odLJXtillti JMrtrtUOS^ pj.OL.t3 XII 

ICC 
ODD 

CO O T O 
D2 . 212 

1 cc 7 


nomo sapiens eiuDryonic eccoaerm aevexopment, 

r^T~or"^Tn chrtT"h i oofovm 

JJ1ULC111 OllUl I— loUlUllll 


OC T Qrt 

/ O . 190 

1568 

U95171 

xviuoupilllcl lllc lallUy do Lcl llllLlULUJJUlt: dooULlaLcU 

protein 

Z. Z. 

O C i on 
z o . i y i 

1569 

AF064748 

Mus musculus S3 - 12 

569 

79 . 464 

1570 

A36790 

unidentified LIGHT CHAIN VARIABLE REGION C21-L3 

154 

96 .000 

1571 

AF017152 

Mus musculus She binding protein 

234 

83 .333 

1572 

AF095136 

If U J JX JO 

Homo a^r~»"i pn q nrnhoi n O-mannriavl - ^ranaf oraeo 1 
ixuiiiu oajjxciiD ir 111 W lllallliusyi * ulallSlclaoc X 


ft ft A OA 

1573 

AF165161 

Homo sapiens FLASH 

732 

100. 000 

1574 

AL008637 

Homo sapiens NCF4 

247 

83 .333 

1 ETC 
lJ / J 


xiouhj sapiens i\i/\>iuftD/i proLcin 

li)l 


ID / o 

nT?rt ceo q i 

Kautus notvcyicus signal cransaucer ana 
activator of transcription 4 


Q "7 n r n 

1577 

AL031667 

Homo sapiens dJ620Ell.la (novel Helicase C- 
terminal domain and SNF2 N- terminal domains 
containing protein, similar to KIAA0308) 

440 

78.889 

1578 

AF143946 

Homo sapiens transcriptional activator SRCAP 

963 

98 .675 

1579 

AF022256 

Mus musculus corneal proteoglycan, keratocan 

221 

37.069 


.1. 0 O & H- ib «+3 CI i2 E!! tB O E? 


1580 

X67052 

Gallus gallus c-Rmil 

301 

95.745 

1581 

AC006539 

Homo sapiens BC3 94 98 2 

584 

63.866 

J. — » O *C 


fiomo sapiens nypocnecicai procein 

260 

38 . 462 

1583 

S67826 

Homo sapiens immunoglobulin heavy chain 
variable region 

549 

67.568 


iiDO / D U 

Homo sapiens Similarity to Human ankaryin 

/ CM • T-TT TM 7\ KT 1 . /~ii"YIvT7\ I?CT T7TVTT3 T . T^~j A ^ O IZ t~, 
V o W . J-\X>* i\X5 riUl v l>AlN ) f CUrtJ\ doI hniiL :UJ4zbb COmeS 

from this gene; cD 

306 

38 .655 

1585 

£j 1 \J -C / X 

OXlllXXclXXl_y U-O IcaSt EiX AiraSc 
f rnm th"i ci apnp 

1~ A- III U ll-L O H C11C 

7 09 

44 . 737 

1586 

AB018334 

Homo saoiens KIAA0791 orofpin 

X / D 

Q A Q A Q 

o4 . o4o 

1587 

AF091624 

Drosophila melanogaster Pelle associated 
orotein Pellino 

420 

60.952 

1588 

AB004884 

Homo qani pnQ "DKTT— x-% 1 loVia 

Xo X 

OA COO 

/ O . boo 

1589 

U52193 


Z7 A A 

99.259 

1590 

M80902 

Homo saDiens AHNAK nurlpnnrohpin 

£ fi £ 
DOD 

no AO o 

1591 

AB019440 

Homo Rflfii pn a -5 mmnnoo'l oV^l "i n Vt p» ■\r~\r rhai n ira -j -a Kl Q 
nv^uivj oc*^/xcxiD j_uiiuuj.n_»y jl ujjj. xii I lei d v y l_ I id JL I! vctl xdJJXc 

region 

D fi ^ 

Q 1 OCA 

1592 

X60155 

Homo ^fliri pit? 71 np f i nripr A ~l 

TOO 
/ O O 

CO /I o o 

1593 

Y17793 

Mii^ mil Qfii 1 ii q Dut^t"! nrnhpi n 

1 1 LI III LA O V*» V-A JL La O U LA I Lc X X t-* ^ _1_ A X 

K A £ 

Cf coo 

1594 

AB007876 

Homo ^ani pnq KTAADdl £ 

ACQ 
ft -3 27 

A\ 1 O O O 

1595 

D63850 

Mil*? mil sen 1 n ei hpr>atnma-HpT"i vpH rTr*OTArf-Vi ■F.ar" , t-r , i - r- 


O O jIIQ 

1596 

AJ010949 

Mus musculus calcium channel alpha- 2-delta-C 

ci 1 1 V-*i i n *i t~ 

O UJjUilJi L- 

1093 

97.605 

1597 

U34 360 

Homo eani ana T.QC- A 

Oil 

82 . 353 

1598 

V00401 

Gallus gallus collagen 

559 

54 .861 

A- ZJ z* z? 


HxdDxaopsxs cnaiiana nypouneuicax procein 

147 

46 . 667 

1600 

U35371 

Rattus norvegicus neural cell adhesion protein 
BIG-2 precursor 

690 

93.519 

1DU1 

jnoc o c 

Rattus norvegicus prealpha- 2 -macroglobulin 

304 

41 . 667 

X O w 


Tripneustes gratilla dynein heavy chain isotype 
IB 

694 

76 . 515 

1603 

7Q1 i oo 
oOll JO 

L-aciiurncijjux tis g x eyans preuiccea using 
Gene finder; cDNA EST EMBL:D65543 comes from 

t" V| 1 C ffOTlO 
tllXO LjCHC 

193 

35 . 246 

1604 

Y12713 

Ml 1 Q mi IQflll lies pvn — D^i 1 — r~1 T I" 1 " D n a A 1 \ rr\-r~/^ f" o -i n 
i iiiudl. lxj. uo riu r UX Uvlrabc k7UXypxUL.cXn 

d *7 *7 

83 . 761 

1605 

D86983 

Homo qan t c»r% c; a i m i 1 a "k* |- /— \ iyiaI ^ nrr a f o v 
nwiiiL' Oaiyxcllo oXIUXXctX L. LJ U . lucl dllLjy do tci 

Deroxidasin (U11052 ) 

Z _> b 

O C 11C 

3 b . lib 

1606 

AL021918 

Homo saoiens b34T8 1 ( Ki^iit^d^I rpl at*prt 7ino 
Finger protein 184) 

J / X 

C *7 OO A 

1607 

D87437 

Homo sapiens KIAA02 50 

165 

96 . 429 

1608 

X51394 

Xenopus laevis APEG precursor protein 

305 

4 7 fil Q 

** 1 . DX7 

1609 

X72473 

Homo sapiens Ig kappa light chain (VJC) 

376 

82 .609 

1610 

AF132726 

Mus musculus FLASH 

396 

60.606 

1611 

X69942 

Mus musculus pnhanrpr- 1~ ran- T 0011 «5 - 1 

O "31 ^ 
^ J J 

Z7f* . O / A 

1612 

L24907 

Rattus nnrvpcri nrnrpi n Vi na qp T 

*»WI*I#Ui3 11L/1> vcy J.L/UD |J1 L/LC J.11 JV JLlldDC JL 

ID / 

OJ .Zoo 

1613 

M98326 

Homo qanipne! val vl — t*P"MA cj-wnt-Hot-a <3<a 


O 1 Q O A 

1614 

AL031667 

Homo sapiens dJ620Ell.la (novel Helicase C- 
terminal domain and SNF2 N- terminal domains 
containing protein, similar to KIAA0308) 

545 

100. 000 

1615 

D90188 

Homo sapiens phSR2 

201 

78.125 

1616 

AJ004810 

Zea mays cytochrome P45 0 monooxygenase 

159 

68 .293 

1617 

U93872 

Kaposi's sarcoma-associated herpesvirus ORF 73, 
contains large complex repeat CR 73 

149 

31.298 


:£, CI ii.il B M-3 u O K ir* 6 O H 


1 1 ft 

-LDlo 


Mil 0 mil /ti 1 1 it n Utjc 
HUS lUUSCUlUS Ei Wo 

175 

74 . 194 ' 

1619 

Z54216 

Unknown similar to DNAJ protein; cDNA EST 
hMbb: iuujJ4 comes rrom tnis gene; cDNA EST 
EMBL:T01898 CO 

255 

41.237 

162 0 

7AF1 Q9Q c; 

Homo sapiens serine/threonine protein kinase 
Kp78 splice variant CTAK75a 

164 

71 . 795 

1621 

-rtU V/ X <& J / o 

Homo sapiens ATP-binding cassette transporter- 1 
(ABC-1) 

181 

33 . 043 

1622 

AL031432 

Homo sapiens d«J465N24.1 (PUTATIVE novel protein 
ssimixar co preaicuea yeast, ano worm proteins) 

168 

80.645 


74 Q1 44 

Uiyctoidyus cunicuius muiciuiug resistance- 
associated protein 2 

513 

48 . 125 

1 fi24 

Zj 27 / OZO 

notno sapiens oimiiarity co Human bL-ricn UNA- 
binding factor (GCF) (SW:P16383); cDNA EST 
yk238ell.3 comes 

178 

31 . 624 

162 5 

U58134 

i v ius inustuius poiy \n/ poxymerase vi 

242 

97.297 

1626 

TTQT 1 ft T 

UJJlOl 

rioino sapiens nuciear auai-speciiicity 

^iiuo^iict Laoc 

240 

48 . 889 

1627 

za 0220 

£J VJ \J £m £t \J 

uiijuiuwii biuuidi L.<j nu^-xcULiUc Jjinainy protein } 
CDNA EST EMBL:M75897 comes from this gene; cDNA 
EST 

2 8 9 

27.326 

1628 

U31961 

Drosophila melanogaster ORF2 

199 

37.931 

1629 

M63438 

Homo sapiens , gene product 

518 

81.188 

1 CTft 
J. D J \J 

ZD D.5 

Drosophila melanogaster unknown product 

340 

50 . 459 



Homo sapiens nuclear pore complex protein 
hnupl53 

283 

100 . 000 

ID 

tat (in acn 
iibUilbb / 

Homo sapiens dCF620Ell.la (novel Helicase C- 
Lciiniiidi aomain ano. orjrz im — terminal aomains 
containing protein, similar to KIAA0308) 

531 

67 . 544 

1633 

AF01 7ROfi 

nub muscuius /jii i j t lanscr ip t ion ractor 

713 

94 . 690 

1634 

U09116 

Homo sapiens ORF2 , encodes a reverse 
transcriptase homolog 

198 

56.604 

1635 

X94082 

Xenopus laevis KLP2 protein 

191 

90 .323 

IDjo 


Homo sapiens immunoglobulin heavy chain 
variauie region 

653 

88 . 073 

1637 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail rrom LABhi iQfcinBLi- 
Heidelberg . DE 

488 

97.403 

1638 


riuino sapiens Air cassette oinamg transporter l 

179 

42 . 424 

163 9 

o J J / O D 

LaenornaDaicis eiegans similar to Protein 
comes from this gene 

202 

56 . 250 

1640 

AF127979 

Homo r^> "i pnq 1 omhrla O \ mrm innrrl oVvtil t n T -i r»Vk#- 
1 1 w u iu oapicnis xaiiix'uct c» x iiiuiLiiioy iujjui xii iiCjuu 

chain variablp rpcrion 


/4 . /4 / 

1641 

AB007939 

Homo sapiens KIAA04 70 protein 

197 

93 . 750 

1642 

D26067 

Homo cani on cj TfTA7AOO"^'^ 

Z U o 

yb . 9 /{j 

1643 

U04267 

Gossypium barbadense proline-rich cell wall 
protein 

210 

44 . 578 

1644 

AB018311 

Homo sapiens KIAA0768 protein 

13 21 

43 . 644 

1645 

A65888 

unidentified PUROMYCIN- SENSITIVE AMINOPEPTIDASE 
(PSA) -99 

182 

100.000 

1646 

U22961 

Homo sapiens similar to human albumin, Swiss- 
Prot Accession Number P02768; Method: 
conceptual translation supplied by author 

279 

78 . 571 


i, o o u 4-6. '4-:. : «;: „ o a id! & o a 


164 7 

TT1 QQ71 

urosopniia meianogaster protein disulfide 

189 

30.328 

1648 

AF080229 

Human endogenous retrovirus K polymerase 

537 

68 .644 

J. O *4 _7 


Homo sapiens unknown 

186 

44.286 i 

1650 

AF055634 

Homo sapiens transmembrane receptor UNC5C 

347 

100. 000 

1651 

U35376 

Homo sapiens repressor transcriptional factor 

533 

67.273 


AT AH Tin 

Homo sapiens dJ73M23.3 (KIAA0319) 

198 

63.043 


AOo 148 

Rattus norvegicus ribosomal protein L5 (AA 1- 
2 97) 

411 

86 .301 

lO D^l 

uzz y© l 

Homo sapiens similar to human albumin, Swiss - 
Prot Accession Number P02 768; Method: 
conceptual translation supplied by author 

271 

84 .783 

X O j D 

TTQ H CIA'I 

Homo sapiens butyrophilin 

274 

38 .333 

ID DO 

AC U U / O D U 

Arabidopsis thaliana putative serine/threonine 
procein Kinase 

336 

29.304 

1657 

Jl /X 

Drosophila meianogaster microtubule associated 

340 

49 . 057 

1658 

A074. on 

nomo sapiens vi i i in 

229 

100 . 000 

1659 

X6 5165 

voivox carteri extensm 

177 

36 .449 

1660 


nomo sapiens kiaaod /4 protein 

186 

68 . 085 

1661 

AiT005897 

v V J U y / 

n<_>rno sapiens UMd 

347 

71 . 429 

1662 

U23502 

Plasmodium chabaudi chabaudi POM1 

487 

46 . 012 

1663 

D88154 

Homo sapiens villin-like protein 

209 

96 . 970 

1DD1 

AUo yob 

Homo sapiens alpha- tubulin 

481 

93 .671 

1665 

AF070470 

Mus musculus SPARC- related protein 

1803 

96 . 578 

t a c c 

C? "7 C C O 

Homo sapiens 4-aminobutyrate aminotransferase 

279 

97.674 

T c d *7 

lob / 

Zt> 82 9 7 

Unknown Similarity to Yeast TAT-binding homolog 
7 (SW:TBP7_YEAST) ; cDNA EST EMBL:D37124 comes 
from 

732 

82 . 540 

1668 

AF076183 

Rattus norvegicus cytosolic sorting protein 
pacs - la 

957 

98.675 

1DD J 


Macaca mulatta mucin 

148 

48 . 387 

1670 

AJ130711 

Homo sapiens QA79 membrane protein, splice 
product airm-2 

481 

48 .864 

ID / 1 

ttc r\r\A a 

Homo sapiens signaling inositol polyphosphate 5 
phosphatase SIP-110 

210 

97.222 

ID / 4* 


Kattus rattus PCTAIRE3 

1106 

92 .222 

-L D f J 

AUDZ3U 

Homo sapiens apolipopreprotein (a) (AA -19 to 

ft i7 } 

375 

78 . 788 

1674 

XX/ f O r 

Cams familiaris albumin 

157 1 

52 . 000 

1675 

£-1 -L J_ U O 

Lanis tamiiiaris stimulatory GTP binding 
protein 

1211 

88 . 614 

1676 

D16626 

"A U O Ji D 

rnjiiio sapiens nisciQase 

198 

79 .487 

1677 

Jl-D V _L O Zj U O 

ricino faapiens i\XHJ\{j/zt> protein 

216 

100.000 

1678 

AB004885 

nuiuu sapiens irjvu — oeca 

172 

92 . 593 

1679 

Z26876 

ntpnio sapiens rioosomai protein 

24 

57 . 143 

1680 


Homo sapiens nuclear receptor co-repressor N- 
CoR 

362 

82 . 540 

1681 

AF151827 

iiwiiiw tjapxciis lvji procein 

299 

78 . 571 

1682 

AB002299 

Homo sapiens KIAA0301 

272 

64 .286 

1683 

AF060076 

Mus musculus polyhomeotic 2 protein 

245 

67 . 797 

1684 

AF015911 

Rattus norvegicus NAC-1 protein 

796 

97.541 

1685 

AJ243997 

Homo sapiens ERIC1 

167 

96 .667 

1686 

AF118023 

Homo sapiens SH3 domain- binding protein SNP70 

441 

86.957 I 

1687 

X69086 

Homo sapiens utrophin (dystrophin related 

200 

22 . 326 


1« 


.11 0 O B M-6 H-:S „. O SiEJ 6. 0 iS 




protein) 



1688 

AF016903 

Homo sapiens agrin precursor 

476 

39 . 394 

1689 

L13923 

Homo sapiens fibrillin 

zzu 

83.333 

1690 

X89858 

Drosophila melanocraster actin bindina nrnhpin 

1 O A 

64 . 865 

1691 

U31629 

Mu s mu s cu lus unknown 

z 01 

65 . 789 

1692 

U68233 

Homo sapiens farnesol receptor HRR-1 

Z 04 

100 . 000 

1693 

AB011370 

Mus musculus Ankhzn 

148 

100.000 

1694 

M74165 

Gallus gallus tensin 

589 

65 . 714 

1695 

AB020662 

Homo sapiens KIAA0855 protein 

22 5 

59 . 322 

1696 

AL117237 

Homo sapiens hypothetical protein 

597 

90 . 816 

1697 

AF042191 

Danio rerio paraxial orotocadhprin * pzxpr* 

4 3 9 

50.382 

1698 

U08813 

OrvctolaCTUS CUniculllS 597 aa nrnt-Pin volat-^a^ 

Na/glucose cotransporters 

220 

94 . 444 

1699 

U40342 

Mus musculus ninein 

ffc 2? o 

68 . 103 

1700 

AF039698 

Homo sapiens antigen NY-CO- 33 

*7 AC 

39 . 523 

1701 

M90656 

Homo sapiens gamma -glutamyl cysteine synthetase 

215 

100.000 

1702 

AF166261 

Xenopus laevis nuclear protein Sopo 

3 60 

55 . 856 

1703 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

545 

76.577 

1704 

AF129756 

Homo sapiens NG26 

272 

75 . 000 

1705 

D44497 

Homo sapiens human p57 

368 

80 . 000 

1706 

U97189 

cxcyano ouiuuy s inn xari uy to tnw 
P13/P14 family of kinases 

141 

29.412 i 

1707 

AL117204 

Caenorhabditis elegans predicted using 
Gene finder 

202 

40 . 845 

1708 

AF017275 

MUS milRCtl 1 11 <5 fr 7" f~\ t»T \~ H fartf-nr -i nrJrkVArt^^^T-.^.^ i m 

1 niuDuuj. yiuwLii idctor lnaepenaencs — IB 

594 

77 .679 

1709 

D26069 

Homo saniens KTAAnf)4i 

227 

23 . 963 

1710 

Z81525 

Unknown cDNA t? q»t» vlr ai i cr «nmoo F*-r-^T« «_ u 
w * AV -^ vwA * x^oA yj\.joieii . d comes xrom tnis 

Qene ; CDNA EST vkP??rfi ^ r'omo a f rrkm t-u-i a „„_ . 
j , s^^/nn x _y/v^^^#^o.j k-uiuti o j_ i om cms gene/ 

CDNA EST 

372 

41 . 026 

1711 

D10627 

Mus musculus zinc finger protein 

576 

54 . 412 

1712 

X69942 

Mus musculus enhancer- trap- locus -l 

1250 

96.373 

1713 

AC007204 

Homo sapiens BC2 73 23 9 1 

691 

60 . 927 

1714 

AF016448 

Caenorhabditis eleoans No dpf ini i--lnn line* fnim^ 

564 

41 . 791 

1715 

U56732 ! 

RattUS norvecricus KRAB/7inr f inapr cunnvaooni- t 

protein 1 

903 

66 . 667 

1716 

X64228 

Homo sapiens putative oncogene 

273 

92.000 

1717 

AF064553 

Mus musculus NSD1 protein 

T r r r 

1556 

69 . 429 

1718 

AF028789 

Homo sapiens UNC-119b 

iy y 

67 . 500 

1719 

L01986 

Homo sapiens trithorax 

"> O T 

ZZ1 

97 . 297 

1720 

U09413 

Homo sapiens zinc f inaer nrofpin 7Mi?i 

C O *7 

bo / 

62 . 963 

1721 

Z66568 

Schizosaccharomvces Dombe hvoothet-iral fm-acn 
repeats containing protein 

/ten 

54 . 237 

1722 

X69942 

Mus musculus enhancer- trap- locus- 1 7 

/r Q Q 

96 . 296 

1723 

AF038007 

Homo sapiens FIC1 

/ion 

84 . 615 

1724 

M86664 

Equine herpesvirus 1 membrane glycoprotein 

166 

25. 150 

1725 

Z26634 

Homo sapiens ankyrin B (44 0 kDa) 

312 

33 . 898 

1726 

AF082556 

Homo Saoiens TRFl-infArarfinrr anlrvrin val 

ADP-ribose polymerase 

2343 

82 . 082 

1727 

AB029290 

Homo sapiens actin binding protein ABP620 

674 

51. 515 

1728 

Z66511 

Unknown similar to RNA recognition motif, (aka 
RRM, RBD, or RNP domain) ; cDNA EST CEMSA01F 
comes fr 

201 

36 . 364 

1729 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail from LABE I TOEMBL - 

1637 

99.209 
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-j Hoi V*i*=»"r*rT TlT? 
nc luciuci y . LJtli 



1730 

X90569 

"viiiu oap J.CU0 claoLlL LlUln 

1831 

99.286 

1731 

Z48045 

CAf^TlO'rh;^ nH "5 hi q al cirrrj t-» a o-»-*=» O 
^aciiuj. iiajjul L-JLo clCjallo Sl6"A 

320 

25 . 664 

1732 

Z22968 

HOITIO qani on a Ml "iffl anhi rr on 

944 

63 . 043 

1733 

AJ132751 

BOS hauniQ Ypnfihi nt" i r /moH-i nm-nVia ■! n r 
wwo "-wUJ-UO AC11UU1UL -L^-/ UltJVAJL Villi** Cllalll laUCy 

acid:CoA ligase form XL-III 

685 

70 . 073 

1734 

AB014577 

Hnmn q^iri one WTRtnc *7 "7 rt >*• *~s*-a 4 
HUUIU ocip X cIlo I\.X/^M.UO / / piOCcin 

740 

65 . 409 

1735 

U32305 

Caenorhabditis elegans No definition line found 

272 

51.899 

1736 

AL080125 

Homo sapiens hypothetical protein 

422 

53.543 

1737 

U06641 

Homo sapiens UDP glucuronosyltransf erase 

420 

80. 769 

X / J O 


Homo sapiens nonmuscle myosin heavy chain 
(NMHC) 

329 

95.918 

17*10 

voncro 

AJU JOO 

Homo sapiens Protein sequence and annotation 
available soon via Swiss- Prot; available at 

essenc via e-mail irom 1-iA£EIT@EMBIj- 
Heidelberg . DE 

1114 

99.412 

1740 

U64598 

Caenorhabditis elegans weakly similar to S. 
vci viside rini precursor \oP : P32857) 

316 

45.045 

1741 

Z21507 

Homo sapiens human elongation f actor- 1-delta 

388 

76.190 

1742 

219092 

Oryctolagus cuniculus trichohyalin 

207 

26.038 

1743 

Z75536 

LdenornaDaitis elegans similar to DnaJ domain; 
cDNA EST yk398hl2.5 comes from this gene; cDNA 
i-»oi yjv^ovjuo . _> comes irom cms gene 

326 

39.264 

1744 

D80003 

Hnmn e3am' one VTAArtl On 

n<jnu_> odpiens iviAAUlbl 

2675 

99. 761 

1745 

AE001032 

Archaeoglobus fulgidus purine NTPase, putative 

212 

22. 901 

1746 

AL080141 

Homo sapiens hypothetical protein 

1943 

74.347 

1747 


Caenorhabditis elegans similar to oxysterol- 
uxiicixriy proteins 

272 

63.768 

1748 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

228 

76. 744 

1749 

Z81029 

Unknown Similarity to S.pombe hypothetical 
protein C1D4 . 09C (SW:Q10154); cDNA EST 
EMBL:T00543 Comes 

19 

75. 000 

1750 

AB029012 

Homo sapiens KIAA1089 protein 

198 

80. 952 

1751 

AF078856 

Homo sapiens p4 7 

393 

50.400 

i 71:0 

ttq Tnm 

Caenorhabditis elegans similar to acyl-CoA 
dehydrogenases and epoxide hydrolases; Pfam 
domain PF00441 (Acyl-CoA_dh) , Score=57.4, E- 
value=l . 7e-16 , N=2 ; contains similarity to Pfam 
uuincixn fruu / u/ iHyaroiasej , Score=57.4, E - 
value=le-13, N=l 

389 

51.200 

1753 

U09411 

Homo sapiens zinc finger protein ZNF132 

889 

61.497 j 

1754 

/ Oil 

Unknown similar to glycerophosphoryl diester 
phosphodiesterase domain; cDNA EST EMBL:D27842 
comes f 

345 

28.205 

1755 

U2 9096 

V-* ^* \J ~S \J 

^aenornaDaicis elegans coded for by C. elegans 
cDNA yk44f2.5; similar to P59 protein (HSP 
binding immunophilin) and to TPR domain 

622 

32.768 

1756 

Z682 97 

unK.no wn cijjna hbl EMBL : D32434 comes from this 
yeiie, clnn/\ lioi £.1*11311:1^33/10 comes trom this 
gene ,- cDNA 

593 

47.847 

1757 

Z27079 

Unknown cDNA EST CEMSF67FB comes from this 
gene; cDNA EST CEMSF6 7R comes from this gene; 
cDNA EST y 

497 

35. 039 

1758 

AL035601 

Arabidopsis thaliana putative protein 

241 

27.811 

1759 

AJ005821 

Homo sapiens X-like 1 protein 

507 

51.299 
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i *7 n 

now O X _7 VJ 

noiuo bdpiens i\.eicn mo u x x conLaming protein 

264 

42 . 478 

1761 

AL021768 

Arabidopsis thaliana putative protein 

321 

28.947 

1 "JO 

uyu y u 

nouio sapiens r lrbi-associacea RfiObAr 

909 

47.097 

1 *7 £ "5 
1 / DO 

TT*7 CT77 

Mus musculus skm-BOPl 

960 

96 . 403 

J. / o *± 

Ml 71 -1A 

Mus mus cuius RNA polymerase II 

3 77 

88 . 060 

1765 

AF144477 

Homo sapiens myotilin 

645 

98 . 947 

T -J C C 
1 /DO 

AB00554 9 

Rattus norvegicus atypical PKC specific binding 
protein 

875 

79 . 412 

1767 

AF020261 

Santalum album proline rich protein 

176 

34.444 

1768 

X17617 

Mus musculus zinc finger protein (AA 1-580) 

658 

48 . 148 

1769 

U3 9850 

Caenorhabditis elegans coded for by C. elegans 

CDNA yK13lgl2.5 

207 

29.221 

1770 

M84911 

Pseudomonas aeruginosa ORF located downstream 
of mmsAB operon, has sequence similarity to an 
acetyl -CoA synthetase; ORF1; putative 

670 

59.006 

17 71 

AF051944 

Gallus gallus Xin 

326 

42 . 857 

1772 

AB002388 

Homo sapiens KIAA0390 

480 

34 . 557 

1773 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss- Prot; available at 
present vxa e-mail from LABEIT@EMBL»- 

nclucliJcitJ . UCt 

932 

99. 301 

x / / r* 

/1DUZ J X / O 

riomo sapiens MAAi/yox protein 

919 

59 . 906 

-L / / 3 


rius muscuius Ain 

526 

61 . 194 

X / / O 

MXjX X / 0 0 d 

Homo sapiens hypothetical protein 

2 8 5 

100 . 000 

X / / / 


Bos taurus chloride channel protein 

441 

74 . 725 

1 *7*7ft 
X / / O 

Tin fl/1 

Oryctolagus cuniculus preproacrosin 

188 

59 . 459 

x / / y 

Ar lb / J ^1 U 

Mus musculus zinc finger protein ZFP113 

807 

56 . 633 

X / o U 


Homo sapiens hypothetical protein 

12 97 

56 . 592 

"1 7 ft 1 
X / O X 

UoUl X^ 

Mus musculus phosphoinositide 3 -kinase p8 5alpha 

501 

94 . 595 

i 709 

<o j 0 O U X 

Schi zosaccharomyces pombe hypothetical protein 

3 34 

31 . 500 

1 *7 ft "5 
X / O O 

VI 77 C7 
XX / 4& D / 

Mus musculus ubiqui tin-con jugating enzyme 

1650 

98 .450 

1 7 ft 4 
x / 0 *± 

7ATTO Q"7 "7 

rcniciiiiuni cnrysogenum nypotnecicai protein 

~i i Q 

Jo . 806 

1785 

X64418 

Drosophila melanogaster kurz protein 

458 

48.611 

X / 00 

a / y 0 

Mus musculus NK10 

1285 

94 . 054 

1 *7 ft "7 
1 / 0 / 

7\ TD /-\ -1 Q "7 O. 

Homo sapiens KIAA072 7 protein 

1715 

57 . 683 

1 *7 ft ft 
X / OO 

Zj / UjJLU 

Caenorhabditis elegans R11A8 . 7b 

1732 

59 . 785 

1 *7 ft Q 
X / O -7 

fiirmfi 7nc 

Caenorhabditis elegans No definition line found 

4 95 

41 . 000 

nqn 

X / 2* V 

J-\H U U D Oft j7 

Rattus norvegicus atypical PKC specific binding 

2 594 

88 . 791 

1791 

AF015454 

Xenopus laevis ERl 

185 

64.286 

1792 

X52943 

Homo sapiens ATF-a protein (AA 1-483) 

391 

100.000 

1793 

U97006 

Caenorhabditis elegans No definition line found 

406 

43 .671 

1794 

M20031 

Homo sapiens V-III-J region 

383 

95.312 

"1 *7 G C 

1 / y b 

Ar 0 7 976 5 

Mus musculus enhancer of polycomb 

616 

96 . 000 

17 96 

Z71180 

Caenorhabditis elegans similar to BPTI/KUNITZ 
inhibitor domain; cDNA EST EMBL:D682 93 comes 
from this gene; cDNA EST yk448h4„5 comes from 
this gene; cDNA EST yk249e6.5 comes from this 
gene, cjjjna ri^i yK44on4 . j comes rrom cms gene 

731 

44 . 141 

1797 

AB023216 

Homo saniens KTAA0 9 99 nrotpin 

809 


1798 

AL022600 

Schizosaccharomyces pombe hypothetical protein 

428 

40.860 

1799 

U34932 

Rattus norvegicus Fos- related antigen 

300 

68.421 

1800 

Y00062 

Homo sapiens precursor polypeptide (AA -23 to 
1120) 

203 

82.857 

1801 

U35376 

Homo sapiens repressor transcriptional factor 

2183 

78.358 


:l ill O iB ,., O H E! 6 O S! 


X o u z. 

r\.V UOJU?! 

Drosophila melanogas ter eyelid 

661 

40 . 741 


n ni qq 

Caenorhabditis elegans weak similarity to HSP90 

48 

60.000 


HIlU J X D O -3 

uriKnowri /preaiction= (rnetnoa: genet meter" " , 

vora "5 nn • >i n n Q A " » a . union c 0 n 11 \ . 

version. uo^ , score : izu.oo / / 

634 

52 . 885 

1805 

U41538 

Caenorhabditis elegans No definition line found 

668 

47.748 

1806 

X90565 

oc* t t n aiotnyccs cerevisiac miuz 

475 

43 . 860 

1807 

X7ft 99fi 

rnjuivj bdpiens zinc linger protein 

601 

67 . 213 

1808 

ZVR007QO 1 

H*Oin<0 aaT~k"5 Ana If T"&"&f^.*iCO nv^fa^ *■» 
nuiUU oapicllo lN.X>\M.Ufs D piOCcin 

1180 

97.895 ! 

1809 

rVDVX'i j / w 

nomo sapiens maaud /u protein 

223 

24 . 229 

1810 

Y15197 

i jua muatuiuy microtuDuie-associatea protein, 
MAP- 115 

339 

26 . 036 

1811 

-A. O O D O -J 

urosopniid meianoydscer aeep orange iQorj 

358 

40 . 909 

1812 

AP151014 

Aunupua ldcvib smaxx Kno-iiKe birase Knai 

265 

86 . 047 

1813 

U1798 9 

xiwui^j octj^/xeiit> uozin>\. 

243 

84 . 091 

1814 

Y167 90 

riLJiiiu adpiens Keratin type i 

345 

36 . 585 

1815 


nomo sapiens guanos in-aipnospnatase HJce 
protein 

614 

63 . 636 

1816 

AB011094 

Homo sapiens KIAA052 2 protein 

322 

35.714 

1 ftl "7 

HdU^O X u 

Homo sapiens Kelch motif containing protein 

438 

39 . 196 

X O X o 

A*? "* £ n c; 

O O w O 

unidentified unnamed protein product 

1057 

100 .000 

1819 

AF102129 

Rattus norvegicus KPL2 

1094 

88.660 

T o o r\ 

AT no 1 AO«" 

AJLjUz 10 y 6 

Unknown /predictions (method : " "gene finder" " , 
version : " "084 " " , score : 11 " 14 7 . 90 " " ) ; 
/niatcn= v uesc : iji 

297 

60.811 

X O £, X 

Mfi 1 1 PR 

l 1 D X X O _> 

Bos taurus glutamic acid- rich protein 

217 

52 . 174 

1822 

D16226 

Oryctolagus cuniculus one of the members of 
sodium-glucose cotransporter family 

1335 

90. 868 

182 3 

anno -loi c 

numo sapiens jviAAuyyo protein 

567 

35 . 385 

1824 

r\JD \J \J X / / ^ 

v_ x Ljiici sdviynyi fHii v i — d 

211 

45 . 588 

182 5 

J-^*i Z U 4 J 

Homo sapiens The ha2022 gene product is novel. 

240 

23 . 019 

1826 

U938 72 

i\.c*pt>si s sarcoma - associa tea nerpesvirus orf 73, 
contains large complex repeat CR 73 

322 

27.451 

1827 

AC0OR614 

noniu sapiens r <i j^d y z 

635 

64.463 

1828 

AB002374 

Homo sapiens KIAA0376 

600 

47. 964 

X O 27 


Dictyos telium discoideum TipD 

146 

37 . 288 

1830 

M12140 

Homo sapiens envelope protein 

297 

48 .421 

1831 

D90756 

Escherichia coli Hypothetical protein in pth- 
prsA intergenic region . 

878 

100.000 

1 ft ~\o 

1 W 1D *± O O O 

Oryctolagus cuniculus phosphorylase kinase 
beta-subunit 

191 

96 . 429 

X O J o 

7700m 

Unknown Similarity to E.coli 2 -oxoglutarate 
dehydrogenase (SW:0D01_ECOLI) ; cDNA EST 
EMBL:D32590 com 

776 

58 . 163 

1834 


Homo sapiens cytosolic phospholipase A2 beta 

559 

45 . 506 

1835 

AJ248284 

Pyrococcus abyssi chromosome segregation 
protein ismcij 

177 

22 .467 

1836 

D83776 

Homo sapiens The KIAA0191 gene is expressed 
uDiquicousiy. ; ine kiaa.0191 protein retains the 
C2H2 zinc- finger at its N- terminal region. 

219 

96.552 

1837 

X64346 

Saimiriine herpesvirus 2 ORF 48; EDLF5 ; sim. to 
EBV BRRF2 

720 

66 .355 

1838 

U93872 

Kaposi* s sarcoma-associated herpesvirus ORF 73, 
contains large complex repeat CR 73 

218 

20.952 

1839 

AC007138 

Arabidopsis thaliana predicted protein of 

191 

50.000 


137 
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unknown f uric 1 1 on 



1840 

Z75540 

Unknown similar to BRCA1 C Terminus (BRCT) 
aomain 14 aomains) ; cuna bbr embl :D3 6 64i comes 

XXvJITl Ulllo 

173 

30.508 

1841 

X15657 

ijrusopniid HicianoyaSLer mi-i proEein l — 
1063 ) 

374 

58 . 163 

1842 

»J03 916 

T* , Ctl^"ViOT*T Vl "1 i5 /~»/~v "1 -! T"\Vl /~s C3 Vl 3 h 1 r^i rl no VI n a 

dpcaTboyvl n c; o 

V— * CI ^ 3*J\*J jf JL CL O »C- 

945 

98 .630 

1843 

U90880 

Fugu rubripes zinc finger protein 

665 

92.632 

1844 

D6 3881 

nui i iu octpiciio x\. xj-\j~\\j idu y t2i le pirouucL is no vex . 

219 

97 . 059 

1845 

U3 53 76 

fiv^riiii^ so^iicus icpicaUUi L.JL alio ti XLJt XOilcxX LaCCOx 

*7 f\ C 

/Ub 

73 . 134 

1846 

AB028978 

Homo sapiens KIAA1055 protein 

603 

62 .411 

184 7 

AT71 Ofi 7"l£ 

numo Sapiens uDicjuicin processing protease 

44 8 

46 . 897 

184 8 

*-J / -J J \J D 

LdcuornaDQi tis eiegans cuin/\ Jboi Drioijirioobob 
^uiuco x a. vjui Liiio yciic 

168 

36 . 145 

184 9 

AL031032 

nL au±uupDJ.o Lilallalla caUcIISIU HJ^c piuLclU 

o o c 
-•sob 

47 . 059 

1850 

AF181856 

Rattus norvegicus tRNA selenocysteine 
associated protein 

153 

95.833 

1 QC1 
lO Jl 


Mus musculus kinesin light chain 2 

203 

100 . 000 

1852 

Z12 841 

uiycLOiagus cunicuius rnospnonpase 

1236 

79 . 646 

1853 

V\£* 1 o / o 

rioino sdpiens Lfv*A mnaing protein 

873 

73 . 054 

i ft ^/l 
xo o** 

■MXjUjXOO / 

Homo sapiens dJ620Ell.la (novel Helicase C- 

Lciiiiiiial uuillaxu allU DWrZ W-ucilllinai QOIIlainS 

206 

93 . 548 

1855 

AB023155 

Homo sapiens KIAA0 938 protein 

955 

59.615 

18 56 

API CTion 

./A XT X O 1 S> \J 

i v iu.a niusouius zinc linger procein ZirJfiu 

82 9 

76 . 871 

1857 

AB0OOS1 2 

uaXxUS (jallUo t aUIlcX XH X \J 

/3b 

78 . 571 

1858 

rtV^ \J \J *± £. 1 

nuiiiu uctLyxexio x\.x/iu u xxjd 


100 .000 

1 QCQ 

I'll/ / D J? 

Homo sapiens transmembrane receptor 

84 0 

87 . 591 

1860 

D63476 

noiinj sapiens ine i\.x/VH.ui*t^ gene is reiacea to 
human KIAA0 006 gene. 

zio 

85 . 000 

1861 

AF091628 

Mus musculus ERG-associated protein ESET 

526 

33 . 977 

1862 

■t\D\J ^ JlOD 

nomo sapiens i\.i/\M.uyoy protein 

581 

31 . 343 

i ft 

X O D «j 

ytoo-i o 

-A. / O ? J J 

Homo sapiens zinc finger protein 

1018 

64 . 516 

1864 

AL049784 

Homo sapiens hypothetical protein 

1152 

95.977 

i ft £ 

XO O D 

■7/L ppnq 

Unknown similarity to the yeast MET3 0 protein 
(PIR accession number S43750) ; cDNA EST 

ch'ujli • no jzd i 

377 

29 . 880 

1866 

nltv jv JO / 

"Uiuu sdpiens nypocneuicai protein 

378 

46 .400 

1867 

AJ242 914 

1*1 lx o uino v- ux Lit* iicuxotxopnin receptor interacting 
factor (NRIF1) 

3 b3 

50 . 83 3 

1868 

AF043695 

Caenorhabditis elegans Similar to mitochondrial 

C* J- X. JU <w JL X~ v — ' v- Ci Xil 

287 

37.778 

1869 

Z70310 

Caenorhabditis elegans R11A8 . 7b 

540 

34 . 277 

1870 

D83146 

Ml 1 C3 nfll 1 Q /"•l ll 11 Q Q 1 

riuo uiuo ux lis oxad 1 

o o c 

o^5 b 

81 . 935 

1871 

AB029290 

xnwiiiv^ oct^XdlO dv>i.xn U X I iu x x xy pxuucxii J-\D Xr o ^ u 

lb lb 

1UU . uuu 

1872 

AJ243806 

Chlamydomonas reinhardtii 1- alpha dynein heavy 
cha in 

256 

73 . 913 

1873 

AF004 813 

l-T^imi^i csaTn ones a i A/"^t~ An 4 /~» XT ^ i Vi-i /-» ■K'Vsj^n -a ^ <r» 
nuiliu oapxcllD cXCLLlU^cIlXC XMcl-r XJX (JctX JJUlla t e 

cotransporter ; NBC 

/ u u 

1 A A C O 

/4 .4b J 

1874 

AF135440 

Mus musculus huntington yeast partner C 

270 

97 . 500 

1875 

U18991 

Homo sapiens retinal pigment epithelium- 
specific 61 kDa protein 

293 

33.333 

1876 

X74546 

Escherichia coli LysR homologue A 

935 

100. 000 

1877 

M67467 

Macaca fuscata 3 - beta -hydroxy- 5 -ene steroid 

858 

85.430 


72* 
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dehydrogenase/delta- 5 -delta-4 isomerase 



1878 

Z21707 

Homo sapiens polypeptide 

202 

32 . 576 

1879 

U37775 

Mus musculus tuberin 

349 

69.231 

1880 

AF022729 

Rattus norvegicus HNK-l sulfotransf erase 

361 

41 . 667 

1881 

AF081941 

Rattus norvegicus soluble adenylyl cyclase 

178 

38 . 961 

1882 

AF004715 

Homo sapiens jerky gene product homolog 

328 

41.270 

1883 

M60172 

Gallus gallus novel collagen protein 

342 

48 . 696 

1884 

AF056617 

Homo sapiens BWSCR2 associated zinc- finger 
protein BAZ1 

825 

70.440 

1885 

AL021482 

Caenorhabditis elegans Y39A1B.2 

269 

36 . 220 

1886 

D63478 

Homo sapiens The KIAA0144 gene product is 
novel . 

312 

45.455 

1887 

AJ131021 

Mus musculus ribosomal protein S6 kinase 3 

720 

79.365 

1888 

U4 9974 

Homo sapiens mariner transposase 

158 

64 . 865 

1889 

AF01644 8 

Caenorhabditis elegans No definition line found 

719 

47 . 083 

1890 

AB008164 

Homo sapiens ST1C2 

234 

94 . 444 

1891 

U27196 

Gallus gallus zinc finger protein 

250 

41 . 304 

1892 

Y08564 

Homo sapiens UDP-GalNAc : polypeptide N- 
ace tylgalactosaminyl transferase 

546 

63 .303 

1893 

AL021492 

Caenorhabditis elegans Y45F10D.il 

230 

51.429 

1894 

AF180728 

Drosophila melanogaster sulfate transporter 

429 

36 .709 

1895 

U88172 

Caenorhabditis elegans No definition line found 

202 

25.000 

1896 

AL117626 

Homo sapiens hypothetical protein 

553 

62.590 

1897 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail from LABE I T@EMBL - 
Heidelberg . DE 

1907 

98.233 

18 98 

AB023155 

Homo sapiens KIAA0938 protein 

726 

66 . Ill 

1899 

Y13367 

Homo sapiens phosphoinositide 3 -kinase 

442 

98.413 

1900 

AF083391 

Homo sapiens putative WHSC1 protein 

191 

41.791 

1901 

AB007934 

Homo sapiens KIAA046 5 protein 

1347 

64 .286 

1902 

X69490 

Homo sapiens titin 

1208 

97.980 

1903 

AF117888 

Homo sapiens myosin- IXa 

1547 

100.000 

1904 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail from LABEIT@EMBL- 
Heidelberg . DE 

856 

98.485 

1905 

X69490 

Homo sapiens titin 

926 

99.281 

1906 

X69490 

Homo sapiens titin 

1687 

100.000 

1907 

X69089 

Homo sapiens 16 5kD protein 

598 

36.630 

1908 

AB029290 

Homo sapiens actin binding protein ABP620 

1524 

61.039 

1909 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail from LABEITOEMBL- 
HeiaeiJoerg . ua 

1635 

100 . 000 

i oi n 

MJDsJ l^b Ul 

Homo sapiens KIAA0701 protein 

*i ear* 

4 0.909 

1911 

AL031230 

Homo sapiens dJ73M23.3 (KIAA0319) 

258 

41.129 

1912 

M32865 

Homo sapiens Ku protein subunit 

187 

100.000 

1913 

AF038007 

Homo sapiens FIC1 

547 

63 . 971 

*t on a 

t>i mi o 
JJ1 U / i^i 

Mus musculus nedd-1 protein 

6 14 

92 . 07 9 

1915 

D26069 

Homo sapiens KIAA0041 

156 

29.921 

1916 

Z66496 

Unknown cDNA EST EMBL:D71941 comes from this 
gene; cDNA EST EMBL:D74 6 91 comes from this 
gene ; cDNA 

240 

28.387 

1917 

AF140360 

Homo sapiens histone acetyl transferase 

159 

100.000 
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1918 

D42046 

Homo sapiens The ha3631 gene product is related 
to S.cerevisiae protein encoded in chromosome 
VIII. 

299 

97.917 

1919 

AF125386 

Drosophila melanogaster L82C 

226 

31.532 

1920 

D16611 

Homo sapiens coproporphyrinogen oxidase 

207 

75.610 

1921 

AB028958 

Homo sapiens KIAA103 5 protein 

402 

48.000 

1922 

X79131 

Mus musculus IB3/5 -polypeptide 

1535 

78.105 

1923 

U72192 

Homo sapiens lysosomal trafficking regulator 
LYST 

160 

100.000 

1924 

AF027955 

Mus musculus G protein-coupled receptor 

754 

60.221 

1925 

M34551 

Homo sapiens 52 -kD Ro/SSA ribonucleoprotein 

226 

34.307 

1926 

U10401 

Caenorhabditis elegans No definition line found 

391 

30.741 

1927 

U87965 

Mus musculus putative G-protein 

337 

50.000 

1928 

AB023157 

Homo sapiens KIAA094 0 protein 

519 

39.241 

1929 

AF016191 

Rattus norvegicus potassium channel 

629 

97.917 

1930 

Z71264 

Unknown predicted using Genefinder; Weak 
similarity to Mouse T-complex-associated- 
testes- expressed- 

325 

33.908 

1931 

J05194 

Oryctolagus cuniculus myosin light chain kinase 
(EC 2.7.1.-) 

1077 

97.590 

1932 

AF124396 

Danio rerio R-cadherin precursor 

389 

41.722 

1933 

U79587 

Homo sapiens immunoglobulin V- region light 
chain 

320 

79.032 

1934 

AF035537 

Homo sapiens DNA polymerase zeta 

285 

89. 583 

1935 

U93181 

Homo sapiens nuclear dual-specificity 
phosphatase 

183 

52 . 941 

1936 

AF129756 

Homo sapiens BAT 2 

793 

40.568 

1937 

D89677 

Mus musculus Kryn 

2393 

78.005 

1938 

Z93372 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk345d5.5 comes from this 
gene; cDNA EST yk345d5.3 comes from this gene 

746 

48.052 

1939 

AF104414 

Mus musculus large tumor suppressor 1 

944 

78.261 

1940 

AB002376 

Homo sapiens KIAA03 78 

780 

76.220 

1941 

L20303 

Gallus gallus actin filament-associated protein 

582 

37.370 

1942 

AJ133521 

Drosophila buzzatii protease, reverse 
transcriptase, ribonuclease H, integrase 

228 

29.787 

1943 

L38620 

Mus musculus mSin3A gene product 

809 

100.000 

1944 

Z71264 

Unknown predicted using Genefinder; Weak 
similarity to Mouse T-complex-associated- 
testes-expressed- 

462 

34.812 

1945 

X15187 

Homo sapiens precursor polypeptide (AA -21 to 
782) 

1761 

97.112 

1946 

D78572 

Mus musculus membrane glycoprotein 

184 

50 . 000 

1947 

D87077 

Homo sapiens KIAA024 0 

483 

93 . 750 

1948 

AE000699 

Aquifex aeolicus chromosome assembly protein 
homolog 

143 

22 .414 

1949 

AJ131244 

Homo sapiens Sec24A protein 

433 

95.161 

1950 

U09820 

Homo sapiens helicase II 

259 

100 . 000 

1951 

AB030502 

Xenopus laevis XDRP1 

729 

82 . 576 

19 52 

1VI74 16 5 

Gallus gallus tensin 

464 

45 . 276 

1953 

AE000142 

Escherichia coli putative transport protein 

1049 

98.246 

1954 

AF025467 

Caenorhabditis elegans contains similarity to 
drosophila DNA-binding protein K10 (NID:g8148) 

181 

43.548 

1955 

A56817 

unidentified unnamed protein product 

255 

53.571 

1956 

AF060246 

Mus musculus zinc finger protein 106 

1295 

80.543 


12>o 
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1957 

X75342 

Homo sapiens Shb 

335 

42.636 


M 7416 5 

Gallus gallus tensin 

1269 

87 . 879 

1959 

AB002384 

Homo sapiens KIAA0386 

676 

54.545 

1960 

AF017112 

Mus musculus non- erythrocyte beta spectrin 

170 

100 . 000 

1961 

X12492 

Homo sapiens CTF-1 factor (AA 1 - 499) 

1087 

95 . 906 

1962 

Z11527 

Drosophila melanogaster CYS3HIS finger protein 

742 

74.219 

1963 

J02974 

Acanthamoeba castellanii myosin IB heavy chain 

183 

37 . 000 

1964 

X90840 

Homo sapiens axonal transporter of synaptic 
vesicles 

1904 

99.647 

1965 

M27266 

Mus musculus p59fyn 

168 

25 . 424 

1966 

L4 0459 

Mus musculus latent transforming growth factor- 
beta binding protein 

1279 

88 . 701 

1967 

Z48583 

Unknown similar to ATPases associated with 
various cellular activities (AAA) ; cDNA EST 
EMBL : Z14623 

267 

42.553 

1968 

AF181631 

Drosophila melanogaster BcDNA. GH04 92 9 

245 

28.481 

1969 

S48472 

Felis catus arylsulf atase B, ARSB 

727 

57 . 838 

1970 

Z70683 

Unknown Weak similarity to Human tyrosine- 
protein kinase CSK ( Sw : CSK__HUMAN) ; cDNA EST 
EMBL:C10908 c 

326 

47 . 273 

1971 

AF078790 

Caenorhabditis elegans No definition line found 

159 

25.564 

1972 

Z22642 

Homo sapiens PO-GA 

214 

28.488 

1973 

X56203 

Plasmodium falciparum liver stage antigen 

314 

23 . 566 

1974 

X73882 

Homo sapiens microtubule associated protein 

298 

38 . 776 

1975 

Z71178 

Unknown similar to pro-collagen domains; cDNA 
EST EMBL:D27978 comes from this gene; cDNA EST 
EMBL : D 

380 

40.411 

1976 

D10627 

Mus musculus zinc finger protein 

747 

56 . 287 

1977 

U09413 

Homo sapiens zinc finger protein ZNF135 

1082 

52 . 797 

1978 

AF106682 

Homo sapiens spindlin 

770 

75 . 159 

197 9 

AL023781 

Schizosaccharomyces pombe N- terminal 
acetyl transferase 1 

686 

40 . 876 

1980 

U53332 

Caenorhabditis elegans No definition line found 

190 

30.769 

1981 

AF11116 9 

Homo sapiens unknown 

1357 

77 . 74 1 

1982 

U41164 

Rattus norvegicus Cys2/His2 zinc finger protein 

1089 

75.879 

1983 

AU 131720 

Homo sapiens alpha integrin binding protein 8 0 

2396 

98 . 652 

1984 

U69262 

Mus musculus matrilin-2 precursor 

2150 

86.765 

1985 

AC007193 

Homo sapiens Putative homolog of hypoxia 
inducible factor three alpha 

1849 

95 . 172 

1986 

AF113131 

Homo sapiens host cell factor homolog LCP 

481 

51 . 969 

1987 

L20450 

Mus musculus DNA- binding protein 

703 

63.699 

1988 

AE000798 

Methanobacterium thermoautotrophicum O- linked 
GlcNAc transferase 

185 

31.293 

1989 

AB011414 

Homo sapiens Kruppel-type zinc finger protein 

780 

82.222 

1990 

S79915 

Drosophila sp. Hls=155 kda putative DE-H type 
RNA-dependent ATPase-helicase/RNA localizing 
protein 

335 

27.037 

1991 

Z48241 

Caenorhabditis elegans similar to coiled coil 
domains; cDNA EST yk302gl2.5 comes from this 
gene; cDNA EST yk365dl0.5 comes from this gene; 
cDNA EST yk461cl.5 comes from this gene 

166 

23.316 

1992 

AF012942 

Dictyostelium discoideum HelD 

580 

50. 532 

1993 

U39851 

Caenorhabditis elegans coded for by C. elegans 
cDNA ykl2le3.3; coded for by C. elegans cDNA 
ykl21e3.5; coded for by C. elegans cDNA cm06g4 

511 

43.367 


72>l 
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1994 

AF067622 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00628 (PHD), Score=36.7, E- 
value=l . 7e- 07 , N-2 

320 

32 . 227 

1995 

AJ003112 

Homo sapiens doublecortin 

202 

28 . 947 

1996 

AL109832 

Schizosaccharomyces pombe putative gtpase 
activating protein 

433 

35.484 

1997 

D87908 

Mus musculus nuclear protein np95 

1561 

85.057 

1998 

AB014538 

Homo sapiens KIAA0638 protein 

238 

36 . 774 

1999 

M95762 

Rattus norvegicus GABA transporter 

791 

92 . 913 

2000 

Z49967 

Unknown cDNA EST EMBL:T00743 comes from this 
gene; cDNA EST EMBL:D693 56 comes trom tnis 
gene ; cDNA 

238 

41 . 837 

2001 

U78180 

Homo sapiens sodium channel 2 

/ 

bo .zoi 

2002 

AB011094 

Homo sapiens KIAA0522 protein 

274 V 

yb . y^o 

2003 

L25055 

Escherichia coli NADH dehydrogenase 

A A C\ 

QQ All 

2004 

AF126867 

Mus musculus calpain-like protease 

JLUoo 

to . bob 

2005 

L02897 

Canis familiaris beta-spectrin 

OAT 

AO 71 C 

*± 27 . J> J. b 

2006 

X85214 

Mus musculus ox4 0 

n n C Q 

i Art n n n 

1UO . ooo 

2007 

D13159 

Escherichia coli transaldolase 

821 

98.450 

2008 

AF064604 

Homo sapiens KE03 protein 

493 

54 . 264 

2009 

X52574 

Mus musculus GTP binding protein 

363 

50.420 

2010 

AF060503 

Homo sapiens zinc finger protein 

780 

83.453 

2011 

D78174 

Mus musculus Zic4 protein 

1436 

92.453 

2012 

L07736 

Rattus norvegicus carnitine 
palmitoyltransferase I 

12 92 

55.828 

2013 

L26049 

Chlamydomonas reinhardtii dyne in heavy chain 
alpha 

184 

28.431 

2014 

M80537 

Drosophila melanogaster fat protein 

180 

28 . 767 

2015 

AF100956 

Mus musculus Bingl 

367 

41.497 

2016 

AF047659 

Caenorhabditis elegans No definition line found 

331 

52.336 

2017 

AF005654 

Homo sapiens actin-binding double -zinc -finger 
protein 

1835 

96 . 471 

2018 

S62941 

Homo sapiens Ps 2=basic proline-rich 
protein (PRB1L precursor protein=basic proline- 
rich proteins (Ps, PmF, PmS, and Pe) precursor) 
{C-terminal} 

263 

33 . 929 

2019 

AF071081 

Mycobacterium tuberculosis proline-rich mucin 
homo log 

269 

35.507 

2020 

AB029013 

Homo sapiens KIAA1090 protein 

n r\ c c 
lUbb 

o / . faJi 

2021 

AL080125 

Homo sapiens hypothetical protein 

111 

68 . 387 

2022 

U75276 

Homo sapiens TFIIB related factor hBRF 

112 

96 . 154 

2023 

D90714 

Escherichia coli Aldose l-epimerase (EC 
5.1.3.3) (mutarotase) . 

1371 

97 . 156 

2024 

AF015297 

Human herpesvirus 6 (strain Uganda-1102) IE2hom 

194 

24 . 583 

2025 

D87743 

Homo sapiens Similar to Human Na+/H+ exchanger 
2 (A57644) 

193 

57 .692 

2026 

AB001735 

Mus musculus ADAMTS-1 

54 6 

b4 .zob 

2027 

U94585 

Homo sapiens requiem homolog 

1231 

59.044 

2028 

AL031583 

Unknown /predict ion = (method: " "genef inder " " , 
version: uof* , score. _lzu.oo / , 
/prediction= (meth 

274 

42 . 857 

2029 

AF057019 

Dictyostelium discoideum interaptin 

194 

20 .866 

2030 

AC004893 

Homo sapiens similar to NEDD-4 (KIA0093) ; 
similar to P46934 (PID : gll71682 ) 

773 

98 .214 

2031 

U09413 

Homo sapiens zinc finger protein ZNF13 5 

998 

55 .459 


nix 
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Uli /do 

Murine leukemia virus gag-pol polyprotein 

159 

46 . 154 

^ VJ -5 -3 

TTO DQTi 
UZ OOjI 

Homo sapiens protein that is immuno- reactive 
wicn anci-Fin polyclonal antiboclies 

518 

82.524 

2034 

AF038599 

Sus scrofa env protein 

196 

25.000 


JL/O J *4 / O 

tiomo sapiens ine kiaaoi42 gene is related to 
human KIAA0006 gene. 

210 

75 . 610 


tilo J O J. 

Ovis aries trichohyalin 

243 

23 . 514 


MJtJU U^J U«5 

Homo sapiens KIAA0305 

252 

38 . 043 


rtJJU IjOZ J 

Homo sapiens type II membrane protein similar 
t.o niv ypizu -Dinainy c-cype lectin 

637 

78.231 I 

2039 

Y12400 

Drosophila melanogaster putative organic cation 
transporter 

470 

42 .222 

j£ U f± VJ 

Y7QQOC 

Homo sapiens zinc finger protein 

816 

57.592 

9 OAT 


Homo sapiens trk-2h polypeptide 

301 

89.130 | 

2042 

L26507 

Mus musculus myocyte nuclear factor 

2438 

97.268 

z u *± o 

UUDO OX 

Homo sapiens homologous to members of the I- 
kappa B family; protein binds NF- kappa B 
proteins 

236 

35 . 036 

2044 

M55169 

Homo sapiens tripeptidyl peptidase II 

339 

96 . 154 

^ U *A 3 


Rattus norvegicus sodium-calcium exchanger form 

■j 

894 

95.652 

2046 

U84248 

Aedes aegypti blood meal -induced protein 

630 

57.042 

2047 

AF121781 

Homo sapiens unknown 

244 

25.201 


AFU45022 

Bos taurus phosphatidic acid-preferring 
phospholipase Al 

1521 

98.696 

o n a q 

TV TSfiO 11 TQ 

Homo sapiens KIAA0961 protein 

687 

60.248 


£ib o / o U 

Homo sapiens Similarity to Human ankaryin 
li>W :/USIKB_HUMAN; ; CDNA EST EMBL:D34286 comes 
from this gene; cD 

365 

40. 141 

2051 

D87433 

Homo sapiens KIAA024 6 

1430 

47.103 

onto 


Homo sapiens hiwi 

395 

39.412 ! 


At UJZboo 

Rattus norvegicus rseclS 

897 

98 . 561 

4* V/ Of* 

ar*nrk too q 

Homo sapiens BC37295 1 

1253 

65 . 748 


£»zj j z j y 

Unknown predicted using Gene finder; cDNA EST 
cii^wij . ub wb ou comes trom tnis gene; cDNA EST 

y jv z. y jts .3 v— 

288 

51.899 

2056 

U09874 


1468 

92 . 500 

2057 

Y10601 

Homo sapiens ankyrin-like protein 

285 

97.619 

2058 

0094 11 

"Omo sapiens zinc ringer protein ZNF132 

689 

60 .839 

2059 

AB023163 

Homo sapiens KIAA0946 protein 

747 

99.115 

2 06 0 


Mus musculus NG22 

481 

25.778 

2061 

AF07RRR7 

Homo sapiens protein associated with Myc 

373 

98 .214 

2062 

AP04. 1 ft "? 

Drosophila melanogaster microtubule binding 

piULCXll U Llilr"! 

316 

33 . 588 

2063 

AF055636 

noiiiu jjctpiena leucine-ricn glioma- inaccivacea 
protein precursor 

649 

51 . 813 

2064 

AF115509 

Homo sapiens LRR FLI-I interacting protein 2 

214 

100.000 

2065 

AB01 1 Rl? 

"D -a f- 1 1 ci finvirorfi nun MTP^f^T 

KdLLua iioir VcylCUS rlnbrb 

1378 

74 . 762 

2066 

n.C \J ~J A. Is ^ Zj 

Ml 1 C3 TY1T 1 O ll lift V -i i»» 

I V 1U.S IUUSCUJ.US Ain 

990 

77 . 835 

2067 

AB023209 

Homo sapiens KIAA0992 protein 

857 

D / . D O Z 

2068 

AF069300 

Arabidopsis thaliana contains similarity to 
Arabidopsis membrane-associated salt-inducible- 
like protein ( GB : AL0 2 163 7) 

261 

34 .483 

2069 

AL050134 

Homo sapiens hypothetical protein 

402 

38.418 

2070 

S60312 

Mus sp. DMR-N9 {C- terminal} 

512 

70.370 
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2071 

AF005050 

Homo sapiens aspartyl aminopeptidase 

255 

82.609 

2072 

AB000113 

Rattus norvegicus cationic amino acid 
transporter 3 

1304 

85.281 

2073 

AC005954 

Homo sapiens ZO-3 

807 

100.000 

2074 

Z78543 

Caenorhabditis elegans predicted using 
Genef inder 

310 

32 .663 

2075 

AC003 02 6 

Homo sapiens Multiple drug resistance gene MRP1 
l 5 partial ; 

556 

67 .500 

2U /© 

mo n c n 
MoUDJ / 

Drosophila melanogaster fat protein 

482 

37 . 727 

2 077 

L32162 

Homo sapiens transcription factor 

283 

60 . 526 

Z U / O 


callus galius CSH-PTP2 

766 

94 . 118 

2079 

X65157 

Mus musculus desmoyokin 

271 

39.855 

2080 

AB029290 

Homo sapiens actin binding protein ABP620 

766 

63 .542 

2081 

AB002347 

Homo sapiens KIAA0349 

216 

75 . 000 

2082 

X62528 

Rattus norvegicus ribonuclease inhibitor 

578 

42 .347 

2083 

L40933 

Homo sapiens phosphoglucomutase- related protein 

369 

100.000 

2084 

Z81051 

Homo sapiens predicted using Genef inder; 
Similarity in 3 ' end to Human KIAA0173 protein 
(TR:Q14679); cDN 

219 

42 .254 

2085 

U50078 

Homo sapiens p53 2 

388 

88.235 

2086 

D86604 

Mus musculus Bach2 

1079 

92.982 

2087 

AB026190 

Homo sapiens Kelch motif containing protein 

377 

42.857 

2088 

AF119816 

Oryctolagus cuniculus sodium bicarbonate 
cotransporter 

694 

88 .235 

2089 

AE000350 

Escherichia coli putative ATP-binding component 
of a transport system 

1186 

98.953 

2090 

AC005065 

Homo sapiens determined by GENSCAN prediction 
and spliced EST; match to EST R84329 
(NID : 942 735 ) 

589 

52.874 

2 091 

AJ010973 

Homo sapiens DEDD protein 

186 



40 . 244 

2092 

AF117210 

Homo sapiens host cell factor 2 

191 

36 . 905 

2093 

AB013605 

Mus musculus Per3 

669 

46 . 275 

2094 

X64346 

Saimiriine herpesvirus 2 ORF 73; ECLF1 

179 

26 .829 

2095 

Y15054 

Rattus norvegicus 70 kD tumor- specif ic antigen 

173 

38.202 

2096 

U55042 

Bos taurus myosin X 

1676 

89.726 


AJ01094 9 

Mus musculus calcium channel alpha-2 -delta-C 
subunit 

306 

92.453 

2098 

AB020715 

Homo sapiens KIAA0908 protein 

400 

47.863 

t n n 0 
2 099 

M28231 

Drosophila melanogaster neuroglian precursor 

287 

32 . 370 

2100 

Y17048 

Rattus norvegicus caldendrin 

269 

48.611 

2101 

D32210 

Mus musculus cell surface protein 

4497 

93 . 557 

2102 

X86368 

Mus musculus transcription factor 

835 

72 .222 

2103 

AF023450 

Homo sapiens Down syndrome cell adhesion 
molecule 

720 

70.064 

2104 

AL031118 

Homo sapiens dJ153G14.3 (novel C2H2 type Zinc 
Finger protein) 

159 

22.656 

2105 

AF082556 

Homo sapiens TRF1- interacting ankyrin- related 
ADP-ribose polymerase 

313 

32 . 800 

2106 

U32517 

Saccharomyces cerevisiae Ydr324cp 

296 

36 .691 


HLUUDUUD 

Homo sapiens similar to phosphatidyl inositol 
(4 , 5) bisphosphate 5 -phosphatase ; match to 
PID:gl399105 

1949 

98 . 592 

2108 

J04526 

Rattus norvegicus hexokinase 

983 

74.611 

2109 

AL032626 

Unknown cDNA EST EMBL:D70654 comes from this 
gene; cDNA EST EMBL:Z14359 comes from this 

472 

39.130 
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gene ; cDN 



2110 

U 2 0105 

Rattus norvegicus synaptotagmin VI 

1371 

92 .444 

2111 

U5 5816 

Rattus norvegicus furosemide- sensitive K-Cl 
cotransporter 

4723 

98.462 

OTTO 

z i iz 

At* 03 8 554 

Homo sapiens density regulated protein drpl 

233 

97.143 



Plasmodium falciparum liver stage antigen 

304 

26 . 737 

Z ± ±*± 

Ar u / li / ^ 

Homo sapiens HERC2 

385 

90.476 

/lib 

AdUIozoU 

Homo sapiens KIAA0737 protein 

173 

96.429 

OTIC 

T 11 Q O Q 

Homo sapiens mucin 

408 

22 . 769 

Zll / 

AoUUU^ / z> 

Homo sapiens DAP -2 

369 

83 . 333 

2118 

AF151110 

Mus musculus COP1 protein 

203 

96 . 552 

2119 

AB002379 

Homo sapiens KIAA0381 

329 

50 . 000 

2120 

J05499 

Rattus norvegicus L-glutamine amidohydrolase 

393 

100 . 000 

2121 

U49974 

Homo sapiens mariner transposase 

252 

78 . 000 

2122 

J03796 

Homo sapiens erythroid protein 4.1 isoform B 

233 

29.412 

2123 

AF013969 

Mus musculus antigen containing epitope to 
monoclonal antibody MMS- 85/12 

655 

55.224 

2124 

D90828 

Escherichia coli Lysostaphin precursor (EC 
3.5.1.-) . 

946 

100.000 

O "1 O C 

^Jl^s 5 

T"\ n ii o o r 

D9082 5 

Escherichia coli ORF ID:o334#5; similar to 

84 

41 . 176 

zlzo 

AIj096 883 

Homo sapiens hypothetical protein 

327 

27.376 

2127 

AB015484 

Dugesia japonica myosin heavy chain 

175 

24.891 

2128 

U23516 

Caenorhabditis elegans No definition line found 

407 

33 .476 

2129 

AB023148 

Homo sapiens KIAA0931 protein 

523 

97 . 531 

2130 

AF047347 

Homo sapiens adaptor protein Xllalpha 

265 

90.698 

2131 

D87436 

Homo sapiens Similar to Human KIAA0188 protein 

234 

100.000 

2132 

AC003040 

Arabidopsis thaliana putative nicotinate 
phosphor ibosyl transferase 

1483 

44.241 

2133 

U73199 

Mus musculus Rho-guanine nucleotide exchange 
factor 

1694 

72.654 

2134 

X95190 

Homo sapiens branched chain acyl-CoA oxidase 

273 

97 . 561 

2135 

AB023212 

Homo sapiens KIAA0995 protein 

259 

89. 362 

2136 

AF103939 

Homo sapiens echinoderm microtubule -associated 
protein- like EMAP2 

249 

39. 706 

213 7 

AB028997 

Homo sapiens KIAA1074 protein 

809 

49. 123 

213 8 

U82761 

Homo sapiens S-adenosyl homocysteine hydrolase 
homo log 

189 

100 .000 

TITO 

Ao JDU / 

unidentified unnamed protein product 

1113 

100.000 


Do 1 *i*k 3 

Homo sapiens KIAA02 56 

198 

100 . 000 


ajjU uyi /i 

Drosophila melanogaster 62D9.a 

2515 

66 .415 


Z Ub o 

Homo sapiens RanBP2 (Ran-binding protein 2) 

1014 

96 . 875 



Rattus norvegicus dynein heavy chain 

2612 

98 . 272 

Z 1*4 f* 

Ar UDollo 

Fugu rubripes All-1 related protein 

1045 

69.444 

J. *± J 


Drosophila melanogaster BcDNA. GH09358 

300 

35 . 333 

2146 

AL031324 

Schizosaccharomyces pombe membrane atpase 

630 

50.256 

2147 

Z97211 

Schizosaccharomyces pombe kinesin-like protein 

321 

51.515 

2148 

AB002379 

Homo sapiens KIAA0381 

302 

48. 276 

214 9 

AB002374 

Homo sapiens KIAA03 76 

1155 

100.000 

2150 

AF007116 

Mus musculus cadherin 7 precursor 

1048 

93 . 373 

2151 

670000 

unKnown preaiccea using Cjenerinaer ; cDNA EST 
yk261h2.3 comes from this gene; cDNA EST 
yk261h2.5 com 

174 

29 . 508 

2152 

AF043643 

Xenopus laevis NF-protocadherin 

665 

43 . 396 

2153 

U81036 

Rattus norvegicus ankyrin binding cell adhesion 
molecule neurofascin 

722 

84. 848 


72>S- 


.1, 0 O El »+€», Ml- 3; „, i|.J a H & O H 


2154 

X62379 

Mus musculus formin, isoform IV 

744 

73 .203 

2155 

TJ2 91^6 

\j4Lt 71 JO 

mus mus cuius involved in signaling by trie 
epiaermai growcn ractor receptor ; Method : 

v^uiiv,c^uua± LiaiioJ.aUl.Ull o V1J^>^) 1 ICQ Oy aUuilOr 

1290 

86.047 

2156 

X12517 

Hnmn pani pna C* r\ -r~/~\t~ ^ n / a a i i c a \ 

nv -"i'*-' octjjitiiifc* ^ procein v -"-H i-ibyj 

247 

93 . 750 

2157 

D10171 

i lua mubuuiub y lULdiiidLe recepcor cnannei suounit 
delta- 1 

1458 

97 .391 

2158 

AB028997 

Homo sanipnq KTA&1 fl74 nrnhPin 

324 

57 . 303 

2159 

Y14946 

Homo saoiena SPTN nrnt - p-i n 

870 

78 . 286 

2160 

AF084396 

SVIlthfttir ron^frupt" pal mnHnl -In miiH^nf* cvmpami a 

*-* J A * v-aa^ v» a. v- ^^iio Li. LallKUUUl 111 iUULallL O I Vi v^/ifl J U 

153 

26 . 667 

2161 

AJ243460 

Leishmania m a i ni* oY~ot~^oT"iV"ioar*il"i<'*K-*n i/rian 

uuxoiiuiaiixa ma jui pi^LcupilUopilU^iyCail 

249 

30.488 

2162 

AF091457 

i^<=» i_ i_ uo iiwi vcyiLua ^iiic 1 liiycl ptOCcin K.X1N Zi r* 

471 

27 . 716 

2163 

AF151863 

Homo saoiens CGI -105 orot-pin 


91 . 667 

2164 

X70326 

Homo saoiens MarMARPKCi 

68 

33 . 333 

2165 

U13766 

Murine leukemia virus gag-pol polyprotein 

159 

46.154 | 

2166 

D32064 

nv^uio adpiens z -oxogiucarace oenyarogenase 

2420 

81 . 840 

2167 

X98055 

iiuo uiusuuiua yiuLdcnione b-cransierase tneca 

302 

65 . 753 

2168 

Y16610 

Homo sapiens paraplegin 

368 

98.305 

2169 

AC006530 

Homo sapiens unknown 

214 

86 .486 

2170 

11 jO jj 

Drosophila melanogaster ubiquitin activating 

it»y~i *y vrmo 

215 

47.143 

2171 

U024 76 

c>us scrora jmauph oxidase heavy chain subunit 

182 

39.130 

2172 

— vV JU J J 

riomo sapiens matcn to ESTs AA316181 
(NID:g3165221) , AA032221 (NID : gl502183 ) , and 
AI16 7942 (NID :g3 701112) 

370 

46.078 

2173 

U49114 

Homo sapiens prohormone convertase 5 precursor 

208 

93 . 750 

1 / *± 

a t^n m a~i a 

Jn-V U U 1 *k 3 *± 

Homo sapiens Hpast 

1306 

80.000 

2175 

Il/OO / 

Homo sapiens tenascin-X 

2768 

99.263 

2176 

Zt / / D O O 

Caenorhabditis elegans cDNA EST yk221bll.5 
comes rrom cms gene; cDNA EST yk221bll.3 comes 
- Liul " ciiio yciit:, cjuiM-rt. i yKoi4hi> . J comes rrom 
this crf^n^ 

306 

26.873 

2177 

AF055985 

v^xav^a *»w/v--c;i. ^a vwivuiuy pyiioiiaouc ricn anuicfen 

116 

38 . 776 

2178 

AF070065 

Rathu*? norvpoi rnq f 1 -J >-An — if Ir-inaoo 
i\auuuo iiui »cy 1UUD L1UIUU I\ JvlHaSc 

388 

94 . 828 

2179 

A56817 

unidPOf 1 •p i p»H nnnamoH r^>*/~»t* Ann n>-/^k^t*r-»4- 
uiij.uciiuxi icu ui ii id lilted £JiUl»dIl piOQUCU 

389 

44 . 304 

2180 

M37190 

Homo sapiens ras inhibitor 

319 

46.610 

2181 

D90790 

DOL-iiei n,iiici con unr iu : oz /ytfo f similar to 

970 

100 . 000 

2182 

AL050156 

Homo Qani a Vivrr^*^it- Vie»t- i r>al nvi^t* ^ A *-« 
aav^iiiw oapi cut* liy^JvJ L-llc I — LC— ell pi OLclU 

189 

100 . 000 

2183 

M81787 

vjchiuo yanuo / y trilt: J^Ji UUUCL 

245 

32 . 090 

2184 

X51829 

Mus musculus MyD116 protein (AA 1-657) 

168 

40. 000 

2185 

AF075461 

uus uiuocui uc> HLff-riDosyiacion ractor-uirectea 

mt^c. av>ui vaLlll^ pi l_ ti 1 1 1 loUlOl III a. 

1131 

94.220 \ 

2186 

L07599 

Homo sapiens ribosomal protein S6 kinase 3 

182 

64.583 

2187 

U55042 

Rocs haiiyilQ m\rr~^ a i n Y 

1149 

97 . 159 

2188 

S57132 

Homo Rani pne f"imo YA7T /~«/^l T arrAn 1 y^Vk -> *i _ i_ _ ' ^ 

nuillu oc*pj.eii£i type Avi coiiagen aipna l chain, 
alpha 1 (XVI) 

517 

45.355 

2189 

L2633 5 

Cavia OOrcellllS 71 nr f i nrrpr rtrofoi n 
>" , ** v ia tr v-.*; 1 1 blllU 1 Illy cl piOtcill 

617 

50 . 602 

2190 

D42041 

HomO Saoiens Th£» hal O S rr p^n <=» nmrlnpf -i -k-c i ^. *- „ j 

to human aloha.— crltico«5i da 

226 

100 . 000 

2191 

AC004240 

Homo sapiens match to Z43555 (NID : g572788 ) 

994 

99 . 338 

2192 

AB020665 

Homo sapiens KIAA0858 protein 

1454 

100.000 

2193 

AB007407 

Mus musculus myeloid zinc finger protein- 2 

185 

68.182 

2194 

AB018333 

Homo sapiens KIAA0790 protein ' 

857 

45.954 

2195 

A68194 

unidentified unnamed protein product 

793 

92.424 

2196 

U14003 

Escherichia coli apparent frameshift in GenBank 

940 

100. 000 


in, 
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Accession Number X55662 



z j. y / 

uy 2 y4 y 

Mus musculus kinesin motor protein 

2353 

83 . 641 

2198 

D80004 

Homo sapiens KIAA0182 

246 

90.476 


AF0586 93 

Mus musculus M-RdgB2 retinal degeneration 
protein B subtype 2 

212 

76 . 190 

oonn 
22 UU 


Homo sapiens transcriptional activator SRCAP 

929 

97.857 

0 0 e\ *i 
22 01 

T\ T3 /^l 1 A C C *"1 

ABU14 557 

Homo sapiens KIAA065 7 protein 

499 

30 .400 

2202 

AF058693 

Mus musculus M-RdgB2 retinal degeneration 
protein B subtype 2 

1255 

81.858 

2203 

AF050183 

Rattus norvegicus GTPase activating protein 
SynGAP-c 

979 

94. 969 

2204 

D21239 

Homo sapiens C3G protein 

174 

89.655 

2205 

AF125964 

Caenorhabditis elegans contains similarity to 
collagens 

249 

33 . 333 

0 0 n c 

Tin r\ 0 0 1 
U2U281 

Gallus gallus cell division cycle control 
protein 37 

3 56 

46 . 491 

2207 

X05173 

Escherichia coli NR(II) (glnL gene product) (AA 

792 

100.000 

2208 

D70831 

Homo sapiens Zinc- finger protein 

727 

63.576 

2209 

X78932 

Homo sapiens zinc finger protein 

794 

80.000 

2210 

U873 05 

Rattus norvegicus transmembrane receptor UNC5H1 

796 

90 . 000 

2211 

M74094 

Schizosaccharomyces pombe mitotic control 
protein 

207 

27. 128 

2212 

AL022018 

Unknown /prediction= (method: " "gens can" " , 
vers ion : " " 1 . 0 " " , score : " "294 .38" ") ; 
/matcn= (aesc : "Thiaz 

413 

55 . 140 

2213 

AF081825 

Rattus norvegicus sodium- dependent high- 
affinity dicarboxylate transporter 

980 

91.358 

2214 

X74904 

Gallus gallus alpha-2-macroglobulin receptor 

1086 

57 . 795 

2215 

AF016903 

Homo sapiens agrin precursor 

1538 

95.902 

2216 

AF038564 

Homo sapiens atrophin-l interacting protein 4 

219 

39 . 286 

2217 

Z75712 

Unknown Similarity with yeast gene L3502.1 
(TREMBL ID G609424) ; cDNA EST EMBL:D33317 comes 
from thi 

45 

25. 000 

2218 

Z75537 

Caenorhabditis elegans Similarity to 
Aspergillus acid phosphatase (TR:G755244) 

396 

47 . 934 


TT*7 A C Q d 

Rattus norvegicus double- stranded RNA specific 
adenosine deaminase 

688 

89.286 

0 0 0 /"» 

ZZZKJ 


Escherichia coli f ormylglycineamide 
ribonucleotide synthetase (EC 6.3.5.3) 

814 

96 . 032 

2221 

M96625 

Gallus gallus cardiac muscle tensin 

1139 

53.165 

Z Z Z Z 

z>Zx o 

Homo sapiens KIAA0032 

492 

50 . 694 

2223 

AF006465 

Mus musculus B cell antigen receptor Ig beta 
associated protein 1 

941 

80.838 

O O O /l 
2224 

Ar 1zd4dd 

Caenorhabditis elegans No definition line found 

349 

37.368 

2225 

AL035634 

Homo sapiens dJ403L10.1 (SNX9 (Sorting Nexin 
9} ) 

555 

49. 718 



Unknown cDNA EST yk422gl.5 comes from this 
gene; cuima r,t>i yKiy/c4 . b comes trom cms gene; 
cDNA EST 

304 

26 . 688 

2227 

AF176688 

Rattus norvegicus sodium/calcium/potassium 
exchanger NCKX1 

216 

32.632 

2228 

X52022 

Homo sapiens collagen type VI, alpha 3 chain 

2313 

99. 180 

2229 

AB004906 

Ipomoea purpurea transposase 

227 

21.290 

2230 

AL023799 

Homo sapiens dJ322P7.1 (zinc finger) 

561 

64.138 


J. O O S H-6 H-3 . ill m;!: E! 6 O EE 


2231 

D38255 

Homo sapiens CAB1 

226 

24.176 

O O o 
Z Z 3 Z 

A±j009 171 

Drosophila melanogaster 62D9.o 

1052 

59.109 

2233 

AF060500 

Homo sapiens liver specific transporter 

803 

53 .219 

2234 

Y14946 

Homo sapiens SPIN protein 

224 

100/000 

2235 

U20554 

Drosophila melanogaster UDP- 

glucose : glycoprotein glucosyl transferase 

precursor 

375 

33 .945 

2236 

Z68753 

Unknown predicted using Genefinder; Similarity 
to Glucose- repressible alcoihol dehydrogenase 
transc 

293 

43 .269 

Z Z 3 1 


Homo sapiens neuropathy target esterase 

1350 

72.830 

JO 


Unknown similar to nucleotide binding protein; 
cdjna hiiai hiMBi-i : m / b o 9 / comes rrom tnis gene; cDNA 

182 

25 . 882 

2239 


I\.d l_ l_ U S I ltj X. Vtjy ltUo ^^Aj 

1 O A O 

61 . 747 

224 0 


numan endogenous retrovirus J\. env protein 

616 

63 . 636 

2241 

AB009024 

Homo sapiens capping enzyme IB 

240 

97.297 

& £,HtZ 

U -D U'iD D 

Rattus norvegicus phodpholipase C delta4 

195 

84 . 375 

2243 

AB002377 

Homo sapiens KIAA0379 

241 

29.949 


AT A11CO/I 

Schizosaccharomyces pombe serine-rich protein 

214 

28 . 934 



Homo sapiens zinc finger protein ZNF13 6 

570 

60 .800 


Ac 01938 0 

Arabidopsis thaliana putative 
phosphatidyl inositol - 4 -phosphate 5-kinase 

172 

35 . 714 

O O A 1 
ZZ<* I 

TTA *7 o n *7 

U0 /ol / 

Dictyostelium discoideum glutamine-asparagine 
ncn procein 

164 

25 . 532 


\j / o / o D 

Caenorhabdit is elegans No definition line found 

210 

32 . 836 

2249 

X94335 

Saccharomyces cerevisiae YOR3348c 

198 

26 . 500 


Ar lUUybU 

Rattus norvegicus protocadherin 

282 

39 . 161 

O O C T 

Z Z D _L 

At 14 06 74 

Homo sapiens zinc metal loprot ease ADAMTS6 

372 

44 . 030 

2252 

AF094508 

Homo sapiens dentin phosphoryn 

166 

19.870 

*> O C 1 

A J 010 04 5 

Mus musculus guanine nucleotide- exchange factor 

1262 

69 . 811 


LJo by / l 

Homo sapiens no similarities to reported gene 
products 

192 

27 . 717 

*> ^ ^ 

Ar 1 J4yit5 

Mus musculus semaphorin subclass 4 member G 

952 

92 . 517 

3D 

-rir j. / yjoy 

Mus musculus insulin- like growth factor binding 

piUk.CJ.il -D X. U l_ Ct O t= 

517 

52 . 071 

2257 

D37793 

Mus musculus synaptotagminII/IP4BP 

348 

63 .636 

z, ^. ^ o 

Air J. D / JZU 

ptus niuscuius zinc ringer procein z,t fiu 

626 

63 . 281 

*i J? 

i-VJUwfi 7DOO 

xiomo sapiens nypotnecicai procein 

493 

39 . 512 

2260 

jfAiD W _J _L _) J_ 

rioiiit? sapiens j\.xmj\kj z) j fk procein 

o r\ a a 

73 . 500 

2261 


riiis IUU3CU1U3 uxm/\ mecnyi crans c erase 1 


55 . 814 

2262 


tioirio sapiens oxvi-j^ijrz^ . i inovei procein similar 

i-u iuuuoc xiiuuuiitJo upt: x. iaIUliy prOLclli dLjZ ) 

552 

50 . 968 

2263 

U35376 

Homo sapiens repressor transcriptional factor 

890 

65.922 

2264 

MJ3 UU a A -/ O 

rtouio sapiens jvihaujuu 

1347 

99 .519 

2265 


uiobopniid meianoyascer eir-^aipna Kinase 

TOO 

48 . 214 

2266 

U2 83 73 

oavuilax »Jiii_y uclcvlbldc Idijo DCp 

a "j q 

DJO 

J / . b 00 

2267 



o4y 

49 . 162 

2268 

S66427 

Homo sapiens retinoblastoma binding protein 1, 
RBP1 

476 

47.399 

2269 

AL121800 

Drosophila melanogaster BACN5I9.i 

237 

28 . 916 

2270 

U10281 

Sus scrofa gastric mucin 

205 

24 .497 

2271 

AF111168 

Homo sapiens unknown 

882 

52 .000 

2272 

AL021997 

Homo sapiens dJ874C20.1 (Zinc Finger Protein 
ZFP47 LIKE) 

427 

63.636 


7^ 
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O *7 "5 

At ui j yb y 

Mus musculus antigen containing epitope to 
monoclonal antibody MMS- 85/12 

929 

73 . 301 

"> *"> "7 /I 

ZjD o o U ^ 

Caenorhabdi t is elegans ZK792 . 8 

280 

36 . 000 

OOTC 
/ J 

/UjU4 yob / 

Homo sapiens hypothetical protein 

760 

98 . 305 

/ D 

At UU / X D / 

Homo sapiens unknown 

139 

78 . 125 

ZZ / / 


Mus musculus mkr4 

1169 

71 . 493 

2278 

Z46970 

Leishmania mexicana secreted acid phosphatase 2 

V SAP2 J 

141 

30.667 

O O *7 Q 

Abzlb j 

Homo sapiens Cathepsin O 

233 

64 . 286 

2280 

U64675 

Homo sapiens sperm membrane protein BS-63 

437 

95.714 

2281 

U80227 

Mus musculus ELL 

181 

34 . 615 

2282 

AC004943 

Homo sapiens alpha- fetoprotein enhancer -binding 
protein; 99% identical to A41948 (PID :g283975) 

599 

68 . 992 

2283 

U82535 

Homo sapiens fatty acid amide hydrolase 

224 

69.388 

2284 

X97674 

Homo sapiens transcriptional intermediary 
factor 2 

336 

100 . 000 

2285 

AC004991 

Homo sapiens ATM-like; similar to AL022373 
(PID:g3036812 ) 

452 

98.529 

2286 

L19102 

Rattus norvegicus sodium dependent sulfate 
transporter 

719 

53 . 171 

2287 

AB007901 

Homo sapiens HH0601 cDNA clone for KIAA0441 has 
an 82 -bp deletion at positions between 14 55 and 
1538 of the sequence of KIAA0441. 

220 

100.000 

2288 

AB023624 

Rattus norvegicus SCOP 

1147 

88 . 325 

2289 

Z72499 

Homo sapiens herpesvirus associated ubiquitin- 
specific protease (HAUSP) 

474 

100.000 

2290 

U80953 

Caenorhabditis elegans weakly similar in serine 
repeat region to rat thyroxine -binding globulin 
(PIR:A3 956 7) and to D. melanogaster ecdysone- 
inducible protein E75-C (SP:E75C DROME, P13055) 

547 

32 . 961 

zz91 

Mb 3 510 

Rattus norvegicus uromodulin 

445 

50.000 

22 92 

Ar 081158 

Rattus norvegicus CL3BB 

308 

72 . 308 

2293 

AB020684 

Homo sapiens KIAA0877 protein 

224 

67.308 

2294 

U10556 

Saccharomyces cerevisiae Yhr074wp 

343 

76 . 562 

2295 

U75321 

Mus musculus chromaffin granule ATPase II 
homo log 

601 

67 . 391 

2296 

M58583 

Homo sapiens precerebellin 

212 

54 .237 

2297 

AF08333 9 

Mus musculus double- stranded RNA- binding zinc 
finger protein JAZ 

358 

47 . 934 

O *"> Q Q 

ABQ11370 

Mus musculus Ankhzn 

816 

98 . 496 

2299 

U23514 

Caenorhabditis elegans similar to S. cerevisiae 
obUl protein ISP : SSD1_YEAST, P24276) ana to E. 
coli VACB and Ribonuclease II genes 

315 

39.231 


yu J*i 

riomo sapiens i\.i/4/\iiii protein 

23 0 

34 . 021 

2301 

AB014600 

Homo sapiens KIAA0700 protein 

2128 

99.689 

2302 

U81035 

Rattus norvegicus ankyrin binding cell adhesion 
molecule neurofascin 

942 

94.000 

2303 

U58203 

Mus musculus Lsc 

563 

42 .424 

2304 

AF047714 

Mus musculus melastatin 

681 

60. 784 

Z J U _> 

77QC QQ 

Zi / yoyo 

unKnown cujna bbi bMob:DJ4 /4 8 comes rrom tins 
gene; cDNA EST yk218e6.5 comes from this gene; 
CDNA ES 

217 

23 . 894 

2306 

AB023658 

Rattus norvegicus Ca/calmodul in-dependent 
protein kinase kinase alpha, CaM-kinase kinase 
alpha 

754 

87.402 


17A 


1. 0 O S 'Ml- US <+3 ,., 11 H £2 6. 0 £ 


2307 

AL031583 

Unknown /predict ion= (method: " "genef inder" " , 
version: " "084" » , score: ""120.68"") ; 

/ v~\Tr r\ i r~» t" i <*vn— ■ I mot" Vi 

384 

48.872 

2308 

AB014566 

Homo sapiens KIAA0666 protein 

245 

28.492 

2309 

AF0 7178 7 


322 

33 . 523 

2310 

Z35597 

Unknown Weak similarity with sea squirt nidogen 
precursor protein (blastp score 71) ; cDNA EST 

303 

46.875 

2311 

■r\C vu j 7 J D 

nomo sapiens putacive neuronal ceil acuiesion 
molecule 

235 

94 . 444 

2312 

S466?? 

nuiuu fcJcipifcJIlo CalCincurin A CaCaiyCIC SUDUIUC , 

pa 1 modi 1 1 "i t~i - HpnpnHAnt" nrAhoin nVmortVinf naa 
'■■» xuiuuux J. i i ucpciiucllL pxULclU pilOSpilaLaoc 

catalytic subunit, CaM-PrP catalytic subunit 

167 

100 . 000 

2313 

AB007298 

Homo sapiens hGLI2 

754 

69.799 

2314 

D3 7918 

i^o^xjitsx iconic* con Keverse transcriptase HKe 

1144 

100 . 000 

2315 

AC005600 


1113 

100 . 000 

2316 

U41543 

Unknown Similar to Rat trg gene product; coded 

-L w j. xjy v. . cXc^ailo \^xJvit\ yJv^olc / . j , COCieCl IOi DV 

C. ele 

250 

28.994 

2317 

AC005600 

nuuiu od^/x.dlo LUJJcl 111 

641 

94 . 340 

2318 

X97675 

Homo <SP4T"»"i f^n ci t~>1 ralmi^V»"i 1 1 n OK 

226 

69 . 231 

2319 

U93 8 72 

ivapuoi is oaiCUllia < 'aooULxauca IlcipeSVirUS UK r '3, 
contains T arop rrimnl py rpnoa f* fl? "7 1> 

290 

28 . 629 

2320 

Z12840 

Oryctolagus cuniculus protein of unknown 

f* 1 mot - i on 

792 

78.231 

2321 

X97675 

Homo cspir^T ana r%~\ at nnhi 1 i n OK 

166 

92 . 308 

2322 

D64000 

oyiic^nu^yoLio ^P - iiypuuxicuicetx protein 

231 

33.333 

2323 

AF02 34 5 9 

xia j.xulXl> i UlcoLcllo lUoLI in A 

235 

27 . 232 

2324 

AP000058 

ncj.upyi.uiu pciiiiA zioaa _L tjiiy xiypo ulle ClCal 

protein 

257 

34 .247 

2325 

AF145634 

Drosonhila mplannaactl-pr RrDMfl RHOfii 

Job 

33 . 04 / 

2326 

U10281 

Sll55 fif Tnf a or^ctt~T*"io tm in n 

one 

2 1 . 84 9 

2327 

AF176069 

Homo cani on Q nKi rnin 1 i n 
Ai^iiiw oaj^xcno lLUlL[Ulllii 

4 09 

52 . 94 1 

2328 

U09367 

Homo Qani pnq ^inr* f inrror r*»v*/-\t-*a -i -n 7MT?i l c 

1331 

59 . 819 

2329 

AF006465 

associated protein 1 

Z 1 !> 

2 9 „ 474 

2330 

AF060152 

Homo sapiens METH1 protein 


ft j . / 7D 

2331 

M81591 

Mus musculus CD10 neutral endopeptidase 24 . 11 

576 

jo . ftOZ 

2332 

L08505 

Rattus norveaicus dvnein hpaw rhain 

1 COA 

lozy 

QQ 1 "7*7 

2333 

AC005169 

Arabidopsis thaliana hypothetical protein 


-> Q O O C 
-5 O . Z 

2334 

AJ000517 

Homo sapiens spinocerebellar ataxia 7 


13 1 . ffc 

2335 

AF017433 

Homo sapiens outative transcrintion factor- fR^^ 

-j 

<r c one 

2336 

U41663 

Rattus norvegicus neuroligin 3 

1003 


2337 

AB020678 

Homo sapiens KIAA0871 protein 

1157 

59.871 

2338 

X70514 

Mus musculus nodal 

640 

64.238 

2339 

AC002328 

Arabidoosis thaliana FS>!?OS> s>o 


1 3 . o / o 

2340 

M77697 

Caenorhabditis elegans acid- rich protein 

237 

31.847 

2341 

AF14 3 94 6 

Homo tjaiTl PriQ hrancPTi nt" t nnal snf 1 trat-<^i' CDf^AT"* 
"viiw oap ±ciib tlalloui.ipLlUIldl aCtlVaUOl oK^—Air 

293 

39 . 3 94 

2342 

AF098504 

Caenorhabditis elegans contains similarity to 
protein kinases (Pf am:pkinase . hmm, score: 
149.36) 

338 

40 . 449 

2343 

U88181 

Caenorhabditis elegans similar to glycerol 
kinase 

632 

44.340 

2344 

AB014532 

Homo sapiens KIAA0632 protein 

1956 

100.000 
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baiius gaiius aipna-i collagen type VI 

220 

24 . 667 



hus miiscuxus paiaain 

1033 

80 .208 

234 7 


i ¥ ius mus cuius bbiii 

405 

31 . 967 

234 8 

•rt-iT W O O -> O ^7 

^acnornaDaiuis Driyy sae ujn*~ — fio 

195 

23 . 897 

2349 

AB018293 

Homo sapiens KIAA07 50 protein 

1097 

84.896 

2350 

X54135 

Homo sapiens protein- tyrosine phosphatase 

729 

79.433 

^ j j! 

AJBOlz 033 

Mus musculus keratin 6 alpha 

641 

71 .014 

o *a e -> 

MQQT7C 

M9877D 

Homo sapiens keratin 1 

602 

62 .805 



Rattus norvegicus CCA3 

201 

46 .512 


Z4 6 24 1 

Unknown carboxyl terminus of the predicted 
protein shows similarity to chimaerin; cDNA EST 

602 

45 .685 

2355 

AB018348 

Homo sapiens KIAA0805 protein 

1146 

76.168 

2356 

U81453 

Mus musculus myosin Vila 

722 

55.959 

2357 

J04425 

Gallus gallus type VI collagen, alpha -2 subunit 

433 

42 .857 

2358 

U3 957 3 

Homo sapiens salivary peroxidase 

203 

93 .333 

2359 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

240 

65.574 

2360 

v <~\ r\ t~ s~ r\ 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss -Prot; available at 
present via e-mail from LABEIT@EMBL- 
Heidelberg . DE 

1192 

87.156 

Z So ± 

U4 8363 

Mus musculus alpha-NAC, muscle -specific form 
gp22 0 

472 

37 . 860 

2362 

M87306 

Tetrahymena thermophila micronuclear linker 
histone polyprotein 

192 

24 .567 

2363 

AF031834 

Caenorhabditis elegans GLY4 ; ppGaNTase 

423 

41.714 

2364 

X69490 

Homo sapiens titin 

1246 

98.980 

2365 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail from LABE I T@EMBL - 
Heiaeioerg . DE 

1478 

98 .298 

2366 

X90569 

Homo sapiens elastic titin 

207 

94 .286 

2367 

AF051944 

Gallus gallus Xin 

301 

38.462 

236 8 

AB005047 

Homo sapiens SH3 binding protein 

475 

42 . 162 

236 9 

AC004974 

Homo sapiens spa-l-like; similar to AF026504 
(PID:g2555183) 

888 

66 .176 

o o *? r\ 

U3 7591 

Homo sapiens similar to the following EST 
sequences: GenBank Accession Numbers T96213 and 
T96131; 3'UTR nmd sequence found in U30998 

518 

42 .424 

Z j / 1 


Plasmodium falciparum liver stage antigen 

210 

27 .273 

ZS / Z 


Homo sapiens a\J333H23 . 1 . 1 (60S Ribosomal 
rrocem l»s ) 

372 

96 . 552 

2 373 

U2 573 9 

rjus mus cuius x o y Lj ~ x rorm z 

228 

49 . 351 

^ o / *± 

7Q-50 Ail 

nOmo sapiens DKXlorb.z (PUTATIVE RnoGAP (CDC42 

GTPAse Activating Protein) LIKE protein) 

125 

77 . 273 

2375 

U09413 

Homo sapiens zinc finger protein ZNF13 5 

904 

54 . 338 

2376 

AF162681 

Drosophila melanogaster maroon- like protein 

327 

37.419 

2377 

X78933 

Homo sapiens zinc finger protein 

747 

77.344 

O "2 T Q 

At loo 795 

Rattus norvegicus schlafen-4 

601 

38 . 060 

2379 

AE000799 

i ic Liiaiiuuav« 1 UUl tXlCllllUaULULlUpniwUIH U~11I1J\.CU 

GlcNAc transferase 


z y . jou 

2380 

Z38061 

Saccharomyces cerevisiae mal5, stal, len: 1367, 
CAI: 0.3, AMYH YEAST P08640 GLUCOAMYLASE SI (EC 
3.2.1.3) 

196 

23.973 

2381 

AC004877 

Homo sapiens sco-spondin-mucin-like ; similar to 

1073 

92.453 


7HI 
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P98167 (PID :crl711548) ♦ details of infrnn/pynn 
structure uncertain 



2382 

AF060246 

Mus musculus zinc finger protein 106 


72 . 837 

2383 

AF132478 

Mus musculus Esel protein 

211 

27 .155 

2384 

AF115480 

Mus musculus cAMP- dependent Rapl guanine - 
nucleotide exchange factor 

± o o 

34 . 177 

2385 

AF135440 

Mus musculus huntington yeast partner C 


Q C r AO 

yb . bOo 

2386 

AL049481 

Arabidopsis thaliana putative protein 


48 . 148 

2387 

AC002451 

Homo sapiens pyruvate dehydrogenase kinase 
isoform 4 

J JL 

1UU . QUO 

2388 

AB007897 

Homo sapiens KIAA0437 

587 

AO ins 

2389 

AF016679 

Caenorhabditis elegans No definition line found 

199 

26 . 776 

2390 

AB023139 

Homo sapiens KIAA0922 protein 

"± ^ .3 

QQ OTC 

2391 

Z71264 

Caenorhabditis elegans similar to C2 domain 

JL / O 

O *7 QIC 

2 / . ol5 

2392 

AF053368 

Mus musculus lysyl oxidase- related protein 2 


yt> . Ubi 

2393 

U16726 

Chlamydomonas reinhardtii his tone HI 

164 

32 .258 

2394 

AF098066 

Homo sapiens squamous cell carcinoma antigen 
recognized by T cell 

530 

39.437 

2395 

AB017615 

Mus musculus Eos protein 


93 . 642 

2396 

AB000199 

Rattus norvegicus CCA2 protein 


82 . 456 

2397 

Z73428 

Caenorhabditis elegans similar to Zinc finger, 
C3HC4 type (RING fincrer) * cDNA E<5T PMRT,'nfi7i9i 
comes from this gene 

553 

67 . 797 

2398 

AF032668 

Rattus norvegicus rseclS 

y j jl 

94 . 667 

2399 

AF094520 

Mus musculus NET1 homolog 

305 

40.909 | 

2400 

D87450 

Homo Saoiens Similar to D mplannnaQhpr na 1 1 ol 

sister chromatids protein 

"1 "7 "3 

12 73 

83 . 333 

2401 

L04159 

Plasmodium falciparum 3' end., gene product 

163 

26 . 738 

2402 

AJ388557 

Canis familiaris zinc f inoer nrnt-pin 

y d o 

45 . 918 

2403 

Z33905 

Homo SaDiens 4 3 kE) Acetvlrhnl i np rpronhnr- 

associated protein (Rapsyn) 

Q *7 1 

972 

98 . 592 

2404 

AB018317 

Homo sapiens KIAA0774 protein 

OCT 
.Z O / 

95 . 556 

2405 

AF012273 

Mus musculus rho-tvoe GTPase-activatina nrnt-Pi n 
rhoGAPX-1 


4 J . 731 

2406 

AF109906 

Mus musculus G9A 

1276 

58 . 140 

2407 

U4 0410 

Caenorhabditis elegans C54G7 . 4 gene product 


AO . D / J. 

2408 

X51760 

Homo sapiens zinc finger protein (583 AA) 

-3 J7 JL 

cr a inn 

2409 

M61199 

Homo sapiens cleavage signal 1 protein 

260 

86 . 364 

2410 

AC002336 

Arabidopsis thaliana hvDothetical n^pin 

Oil 

2 JL JL 

32 . 759 

2411 

AF186273 

Homo sapiens leucine-rich reneats roni-ain-ina v?- 
box protein FBL3 

Oil Q 

3 0 . 994 

2412 

AC005614 

Homo sapiens F23269 2 

701 


2413 

U03976 

Tripneustes gratilla dynein heavy chain isotype 
5C 

494 

73 . 958 

2414 

AF061758 

Gallus gallus poly (A) polymerase II 

1460 

83 . 794 

2415 

AF184226 

Drosophila melanogaster BcDNA. GH0904 5 

7Q-3 

.5 o . o y 

2416 

AB012223 

Canis familiaris ORF2 

ODD i 
^ o O 

/ JL . -LOO 

2417 

U37263 

Homo sapiens KRAB zinc finger protein; Method: 
conceptual translation supplied by author 

z _> o 

o o . lOz 

2418 

L09742 

Carassius auratus gene, complete cds . , gene 
product 

146 

40.449 

2419 

AF038007 

Homo sapiens FIC1 

701 

51.643 ! 

2420 

Z73424 

Caenorhabditis elegans C44B9.1 

352 

26 .254 

2421 

AF117675 

Homo sapiens nebulin 

183 

71. 053 

2422 

X64697 

Homo sapiens titin 

1593 

98.016 




IOOBM-6H-3 «C 

i!E!E!!i& 

O 

2423 


Homo sapiens Protein sequence and annotation 
dvdiiaDie soon vxa Swiss -Prot; available at 

tr 1 - CocllL Vld fc: - Illci 1 X rrom LjJ\rir, L 1 (ajE|VlB ]_, — 

Heidelberg . DE 

953 

100.000 

2424 

X90569 

Homo sapiens elastic titin 

1126 

98.830 

2425 

AF060246 

Mus musculus zinc finger protein 106 

574 

89.362 

2426 

*\ «7 \J J O O 

Homo sapiens Protein sequence and annotation . 
dvdixduie soon via owiss-prot; available at 
present via e-mail from LABEITOEMBL- 
Heidelberg . DE 

1391 

100 . 000 

2427 

X90 56 9 

Homo sapiens elastic titin 

3791 

99.307 

2428 

U4 1534 

LdenornaDaicis eiegans contains similarity to 

Pf am flrtma "in* DTrnnO "7 1 /Viol i f~* \ <r> ^ ^ jt n r\ 

irj_dui uuiuctj.li. Fr\j\j<&/± tnencase l.j , score=49.0, 
E-value = 3 7p_i i m— i 

J-» V L4 JL. 14. — ' J . J C -I L / A* J. 

332 

37.500 

2429 

AB014542 

Homo sapiens KIAA064 2 protein 

455 

54 . 610 

2430 

AB029335 

AictxtJL.yiiL.xiia rotcLZi ririrJbil — 3 

243 

36.641 

2431 

AB029018 

xtl^iulj odpiens t\ xj-\j\x uy j protein 

584 

49.038 1 

2432 

AL031731 

noino sapiens uj^bDj / . 1 {cellular repressor of 
ElA-stimulated genes CREG) 

443 

39.416 

2433 

X14355 

Homo sapiens FcRI b form (AA 1-344) 

154 

39.241 

2434 


Mus musculus Ankhzn 

193 

96.774 

2435 

270269 

unjoiown preaiccea using Genetinaer; Similarity 
i ccib l iiypocneLicai procein xHCai 

\ »-J » » • -I. 1XL7.L 1 Cirto ± f f 

501 

58.400 

2436 

X97650 

Ml 1 ITIllQPlll iia mvncin-T 

* iiiuoluxuo iiiyvJoin.— x 

878 

90 .411 

2437 

U1184 3 

nLJiuLJ sd,pi.ens iructose transporter 

356 

65 . 333 

2438 

AC003007 

Homo sapiens KIAA0220 

333 

100.000 

2439 

U91318 

xiljiiuj sapiens pi v i z> v j partial j 

246 

70.968 

2440 

AC003681 

Homo sapiens match to AB002369 (NID :g2224 6 82 ) 

295 

60.606 

2441 

D87845 

Homo sapiens platelet-activating factor 
ductyinyuroidse -<s 

202 

83 .333 

2442 

Z22181 

Caenorhabditis elegans cDNA EST CEESN66F comes 
from this gene; cDNA EST yk3 95c9.5 comes from 
this gene 

199 

34.641 

2443 

70 ~> 0 0 c 
00 J Z Z J 

Caenorhabditis elegans similar to ankyrin 
aomain; cujna iibi yK2l9g4.5 comes from this 
gene, cijxsi/v z,z> i yKbyugii . i comes from this gene; 
v^ivim^a c*o ± yA3 joud . j comes xrom cms gene 

266 

35.075 

2444 

AF045640 

Caenorhabditis elegans No definition line found 

385 

47.518 

2445 

AB018336 

Homo sapiens KIAA0793 protein 

230 

34 . 711 

2446 


numo sapiens j\iaaud44 protein 

628 

70.714 

2447 

AC006550 

LiiiJLiivj wii iuenticai to goiui^bJo 3 — 
methyl crotonyl-CoA carboxylase precursor 

}J ^ l-vi Xll 11UIII HlaJJlUQpSl 

677 

55.191 

2448 

X64697 

IHJIllU octj^/x Clio LI Lxll 

903 

99 .259 

2449 

Y18204 

£j^u.u.a Ldudiiub nxgxi dinniLy immunogioouiin E 
receptor alpha subunit 

198 

50 . 943 

2450 

AB028954 

Homo sapiens KIAA1031 protein 

177 

45.283 

2451 

AF04 0944 

1 T 1L4&3 UlLloL. Ul Ll£> ±r Xf» U 

413 

41 . 317 

2452 

AL021392 

Homo sapiens dJ439F8.2 (novel KIAA0279 LIKE 
Ldunciin aouidxn procein (similar to mouse 
Celsrl, rat MEGF2) ) 

191 

32.090 

2453 

AL022322 

Homo sapiens bK228A9.1 (85 KDA CALCIUM- 
INDEPENDENT PHOSPHOLIPASE A2 ) 

175 

100. 000 

2454 

Z67990 

Caenorhabditis elegans similar to cuticle | 

167 

37.705 


-7M2> 
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collagen 



2455 

U41543 

Unknown Similar to Rat trg gene product; coded 
■tor r>y u . eiegans cjjima yK3ie/.5; coded for by 
C. ele 

199 

28.571 

2456 

D87908 

nuo uiuov^uxuo iiuuxcaL pioccin np^7 J 

244 

62 . 963 

2457 

Z73428 

v^aciiui nauux Lis cicy ctllc> ollUllaJ. L.O ZilnC linger, 

(~* ~i HC-4. f* {"PTftm f i nrro-r^ • /-■OXTA COT i^ivrQT .ncTin 
jn^_*± L/pc v*LXiNLar XXIlyex/ , CJUiMA E»o J. xirloij : JJo /j^«3 

comes from this gene 

604 

69 . 697 

2458 

Z50142 

Schizosaccharomyces pombe unknown 

213 

34.821 

2459 

AF169411 

Rattus norvegicus PAPIN 

423 

92 . 537 

2460 

M9662 9 

v«.aiiJ.o i.alllll±aliy IlUinuiU^Uc l_0 SGCbl 

570 

95 .604 

2461 

AF098070 

L/iuaupiiixa uicicinoyaSLer j-iisj. nomoiog 

153 

25 . 166 

2462 

Z99708 

Aiauiuujjaia i_ i id j. xctiid deem niLeract lny protein 

628 

71 . 875 

2463 

Z82268 

T Tt~i Xr n own /-^O'KT A POT 1 \r"\r "i 1 Qrrl A C /t^moe -P „ _ *_ v% -I 

uxiAuuwii t-jL/AM/ii c*o x yjvj joyiu . d comes iirom cms 
gene; cDNA EST EMBL:D27934 comes from this 

208 

48 .276 

2464 

S45710 

Homo sar* "i Fp namma rpronhnr 

2 54 

36 .496 

2465 

X90849 

Or5 I 1 HQ Cf a T "Ilia nol - \/'V"\ VOTY!/"* T -r\ -v~ 4 r-» 
«»aauo yaxiuo i-rLJX jr X>X OHIO X piOLclIl 

2241 

88 . 525 

2466 

AB023230 

Homo <3 an *i <=*n d V'TAZil m 1 r^-i-r^t- 

1012 

58 . 582 

2467 

AF067136 

subunit 7 alpha2 

203 

37 . 374 

2468 

U24070 

Rattus norvpcri niq Mnnrl "^-1 

163 7 

83 . 013 j 

2469 

Y00826 

Rattus norvegicus gp210 (AA 1-1886) 

1915 

86 .604 

2470 

AB002365 

Homo Q^ni ptiq lr T 21 ft ni C T 

321 

53 . 846 

2471 

U41552 

t^TiOT'Via Vifl l hi o 1 orranc T\Jr^ ^ q -f- -» y-i -i t- -i ^-r-, t ^ — — £ _ , , j 
v-aciiui iiauui Lia cxc^jallo IMO Ucl-J.IIlL.10n line lOUnu 

46 0 

33 . 955 

2472 

AF083424 


275 

29 . 082 

2473 

Y16008 

Ml 1 Q mi 1 QPl 1 1 iiq n f^l 1 von rs 1 _ CTHD t-\ -v-y-v 4- a « n 

i luo uilxol* ux no llcuxoilax o AUir piOLcin 

1523 

60 . 227 

2474 

M30270 

Torpedo californica electromotor neuron- 

aiaoUL.laL.cU JL/iLJL_tiXH 

507 

54 .225 

2475 

AF173829 

Mus musculus neuropathy target esterase homolog 

1046 

68 .778 

2476 

AF057019 

Dictyostelium discoideum interaptin 

183 

21 . 918 

2477 

AF145690 

Drosophila melanogaster BcDNA. LD28657 

787 

60.317 

T47Q 

TO QOQl 

Homo sapiens polycystic kidney disease- 
aabuLidtea protein 

772 

96 .748 

2479 

AJguOl 

oaccnarotnyces cerevisiae tne product of PRP22 
ytsnc aLLb late in cne splicing or yeast pre- 

iiicoodxycx. f ititiLiXcl L- Xiiy L- 11 1: iclcdSc Ol LIlG 

spliced mRNA from the spliceosome 

171 

59.524 

2480 

D86966 

Homo saoiens similarto human 7,fy o-r-ot- pin 

All 


2481 

AB029035 

Homo sapiens KIAA1112 protein 

jUj 

ci Oil o 

2482 

U43194 

Mus musculus rhophilin 

215 

36 .290 

2483 

AF087697 

Rattus norvegicus dig 3 

1 *7 C 
1/3 


2484 

AB014533 

Homo sapiens KIAA0633 protein 

2237 

99.707 

2485 

X86683 

wo \^jkJA x j. jl c* uicxaiiuy cto LCI UccU OXclilLj6 ^ OO X ) 

355 

29 . 218 

2486 

AB025411 

Mus musciilu*? Tpn-m9 

1087 

98 . 137 

2487 

AF019380 

Arabidopsis thaliana putative j 

DhoSDfia t idvl innq \ t* ol -4-nhnQn)iafp ^ - Iri -n a a 

295 

38.261 

2488 

X15430 

Dictvostelium di <?rni Hpnm nplahinn f ar-hniv { t\ t\ 1 
- 857) 

243 

37 . 097 

2489 

AB014600 

Homo sapiens KIAA0700 protein 

298 

88 . 462 

2490 

X80535 

Rattus rattus thyrotropin- releasing hormone 
degrading enzyme 

921 

94 .521 

2491 

AB002086 

Rattus norvegicus p4 7 

308 | 

41.129 

2492 

X53959 

Drosophila melanogaster slit protein 

228 

35.211 

2493 

AF055357 

Arabidopsis thaliana respiratory burst oxidase 

238 

54.386 


7Vf 
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protein D 



2494 

U22376 

Homo sapiens alternatively spliced product 
usincr pxon 1 "^A 

464 

80.233 

2495 

AB023230 

Homo sapiens KIAA1013 protein 

1385 

76.357 

2496 

AJ007395 

iiwiuu c* \~> J-dio \£r\ 1 ^ IllcllUJi. dllc |JIUUt.JLIl 

440 

49 . 689 

2497 

AL021497 

Caenorhabditis elegans predicted using 

(2at-i ca-p-i y\c\g*TT * fDMS T7QT -i/'Vr STIpC "X /^nmao f mm *-V\>t<-» 

gene 

254 

36 . 585 

2498 

Z54308 

* J *- ,1 ***»v»o<auv*iiai. viuy \<cd j~ pU(#dUx V c 11 Id J O X. 

facilitator superfamily protein 

642 

42 . 435 

2499 

AJ242914 

Mus musculus neurotroohin recentor infprarhinn 
factor (NRIF1) 


c: a coo 
31 . boo 

2500 

AB023227 

Homo sapiens KIAA1010 protein 

766 

<&y . o ud 

2501 

AF151847 

Homo sapiens CGI -89 protein 

9 ft ^ 


2502 

Z49212 

Saccharomyces cerevisiae unknown 

277 

45.217 

2503 

Z23024 

Homo sat) i ens rhoGAP Drotein 

1 o zf 

J / . SOU 

2504 

U67322 

Homo sapiens HBV associated factor 

216 

74 .419 

2505 

U44731 

Mus musculus purine nucleotide binding protein 

676 

70.629 

2506 

U62967 

Homo sapiens Na+/nucleoside cotransporter 

263 

95.000 

2507 

AT.O35701 

rvu v/ -j mJ I \J -X. 

nuniu sapiens 00001.0 (Similar CO FLiAoMA-CEIjLi 
MFMRPAWF nT.VrnDPnTPTM Df-l ^ 

I I Hi I 1XJXV/A1>< Hi VJJJ_| X 1 CillN XT *w — t J 

420 

38 . 068 

2508 

Z81557 

vacxiux. najuui u jlo cic^ans px7eo.1ct.GCi using 
ucuci. j.nucL / do j. yjv^D Dy d . j comes ixvom cms 
Cfene * CDNA EST vie 1? Scr S rnmpQ frnm t-Vi-J <3 rrono - 
cDNA EST vk6 56cr4.3 comes from this ci^ne* 

2 92 

52.439 

2509 

X96618 

Mus musculus novel stromal rpl 1 nrnf p-| n 

X ft 

/o . iyu 

2510 

AF117892 

Homo sapiens aspartic-like protease 


O Z7 . U Z *± 

2511 

AJ007395 

Homo sapiens QA7 9 membrane protein 

194 

73 . 810 

2512 

U15131 

Homo sapiens pi 2 6 

f± Z / 


2513 

AL022238 

Homo saoiens dJ1042K10 3 (nnvpl nrnt- pin) 


.52 . lfc>4 

2514 

AL031324 

Schizosaccharomyces pombe beta transducin 

478 

38 . 021 

2515 

U81788 

DrOQfinh i 1 a mpl annrfa q t~ "K* If inoain.T) 

13 09 

67 . 961 

2516 

Y13245 

Moraxella sp . NAD-dependent formate 
dehydrogenase 

748 

72 . 078 

2517 

L33861 

HpI inri Hari <3 prvt"hrnfiramma •F -J Vivrtnal 1 -i n T 
" c J-xuuAuai lis ci. y Liiiu^jiaiiuiia lljjiopcllin la 

54 0 

41.209 

2518 

AJ243535 

Synechococcus elongatus possible thiophene and 
furan oxidation protein 

242 

37. 778 

2519 

AC002394 

Homo sapiens Gene product with similarity to 

uy itCXil WCL d o ULJL/ Ui 11 L 

749 

59. 538 

2520 

AL117610 

Homo ^SDl H \m o t~ Vi#at~ i 1 nyrkho-l « 

O C Q 

58 . 904 

2521 

Z78417 

v.acii iiauux L lo cicy alio v^lVJlMrt. do 1 I2»I v LdJ_i * XJ/L / O Jo 

comes from this gene; cDNA EST EMBL:D2 7837 
comes from this aene ; cDNA EST vk3l5a;9 5 mmpq 
from this gene 

TOO 

2 8 9 

23 . 367 

2522 

AF101361 

Drosophila melanogaster Abnormal X segregation 

318 

JO . 

2523 

K02581 

Homo sapiens thymidine kinase (EC 2.7.1.21) 

192 

J7 D . O O / 

2524 

X61276 

Streptococcus pyogenes ML2 . 1 

192 

£ / . j34 

2525 

L07911 

Mus musculus DNA-binding protein 

11 1 

O O TOO 
O O . O O J 

2526 

AF098066 

Homo sapiens squamous cell carcinoma antigen 
recognized by T cell 

4 57 

T 04 

2527 

AJ223324 

Homo sapiens MAX . 3 cell surface antigen 

207 

33 . 333 

2528 

AF127374 

Streptomyces lavendulae unknown 

586 

38.645 

2529 

AF181626 

Drosophila melanogaster BcDNA.GH03694 

480 

40.171 

2530 

AF184921 

Rattus norvegicus ion transporter protein 

299 

45 . 055 

2531 

Z36233 

Mus musculus sialoadhesin 

394 

70. 130 


:i o o s «+6 . o a a 6 o k 


2532 

S62035 

Homo saDl&ns q _ cjn^o i f i r m inn -i no nnr^l o^t" "i rlea 

releasing factor, H-GRF55=CDC25 homoloq 

OCT 

£ bl 

10O . 00O 

2533 

AB023202 

Homo saDiens KTAAOQR^ nrnhpin 


61 . 017 

2534 

U63809 

Homo Rani pnq T">ro«?t~;* t~f* i"jOT~it~ o ct -i a roonnnQo 

protein, par- 4 

J 13 

97 . 959 

2535 

AC004794 

Homo sapiens F02569 2 

141 

41 . 791 

2536 

AF003134 

Caenorhabditis elegans strong similarity to the 

CDC2/CDX aubfamilv or^ Q£»r/rV»r r*rr»t-o-5 r» lr-i n=i aaa 

532 

54 .483 

2537 

L06147 

Homo saDiens crolcrin — 


51.000 

2538 

L06898 

Act" i nnmvrpa vi qcocjh c? qi al i H^qp 

1/4 

30 . 657 

2539 

AC004997 

Unknown match to E^T** AAfifI7Q ( }Q fMTn«fi9fi9fi7nn\ 
AA165465 (NID •crl741481) Z45871 (NID-a57Rl om 

and 


y 7 .43 6 

2540 

X79568 

Homo sapiens protein- tyrosine -phosphatase 

630 

97.826 

2541 

AJ388557 

Canis familiaris zinc finger protein 

C) 7 T 

0 1-13/ 

2542 

D86491 

Xenopus laevis Nfrl 

947 

77.273 

2543 

U19361 

Petromvzon marinu e ? NF-lfiO 

lo4 

3 U . /by 

2544 

U25739 

Mus musculus YSPL-1 form 2 

367 

80.303 

2545 

AF040642 

uaciiux xictuux u it> eieydns contains siniiidricy lo 
transacylases 

498 

39.583 

2546 

AF061026 

i*i uti uiuoouiuo leucine zipper— n*r —nana containing 
transmembrane nrnfpi n 1 

3 76 

60 . 870 

2547 

U13836 

* iiiuouuxucj vauuuxai aUcHUblllc LI ipilUopnaLaoc 

subunit Acll6 

14 j y 

bl . 891 

2548 

D87467 

Homo saDiens Similar f n a C pi pa^nn mi^an-ino 

^cij^'^ciia iji.iiuj.ai. v^ij c* V* . C X cy cli 1 L> ^UctUXlie 

nucleotide releasina factor homol na (^A ?**fiftl 

1 Q Q 

Ioj 

y 0 . 323 

2549 

U70825 

Rattus norvegicus 5-oxo-L-prolinase 

269 

74 . 000 

2550 

M37760 

MllS mUSCUlus Serine ? ill rra Viicrh fny nrnh^i n 

441 

4 y . 123 

2551 

AB028978 

Homo saoiena KIAA1055 nrofpin 

1 i»o 

3 / . DUU 

2552 

U75686 

Homo Sanipns nnl vaHptivI at*P hinHinrr nrrvho-l n 
iiviuv/ " a r ^ jj-' v y o vx i i j» x c* c uiiiuxii^j px uuexn 

/ z r> 

5 9 . 14 0 

2553 

AC004755 

Homo sapiens fos3 7502 2 

SOU j 

Q Q 1 

y y . loo 

2554 

D86970 

Homo sanipns similar t~o m\/r>ci -i n V> ^ t n • 
Containing ATP/GTP- binding site motif A (P- loop) 

O >1 "7 
Z 4 / 

0 y . 13 u 

2555 

U63818 

Xenopus laevis RING finger protein 

251 

37.037 

2556 

L12147 

Mus musculus parlv R-cel 1 "factor 

17-57 

Of ICQ 

y 0 . 20 y 

2557 

AB014551 

Homo sapiens KIAA0651 protein 

287 

100 . 000 

2558 

D83703 

Homo sapiens peroxisome assembly factor- 2 

242 

100.000 

2559 

X05418 

Homo «?ani PTlQ lfprah-i n 1-\rr\c* TT / A A1 _Ol 
iiwiiiu Dctpxcilo JVcxciLlll Lypc XX v**«»i <o 1 z> y 

1 /8 

46 . 753 

2560 

AB005549 

ivauuuo uux vcy xuuo atypicctl xr spcClUC Dinuiny 

orotein 

963 

88 . 125 

2561 

Z32683 

Unknown cDNA EST EMBL:Z14 902 comes from this 
ciene ; CDNA EST EMBL ^8 915 5 comes from this 

gene ; cDNA 

327 

30.293 

2562 

U81006 

Homo sapiens p76 

394 

100.000 

2563 

X07695 

Homo saoiens cvtokeratin 4 (408 AA) 

J\J f 

0 u . y U2 

2564 

U78516 

Homo saDiGns cAMP - recul a t" f^ri cni An irif* nnrl pnh ■! rlo 
exchange factor II 

D D / 

3 3 . 333 

2565 

X97818 

Mus musculus samaphorin G 

Q1 A 
J JL *± 

y 3 . / y 3 

2566 

AF003384 

Caenorhabditis elegans weak similarity to the 
peptidase family A2 

343 

39. 130 

2567 

AB028959 

Homo sapiens KIAA103 6 protein 

554 

60.448 

2568 

D90272 

Tachypleus tridentatus limulus factor C 
precursor 

202 

35.789 

2569 

AL021154 

Homo sapiens dJ150O5 . 1 (transcription factor 
E2F-2) 

188 

83 . 333 


cj a a ■+ & "+ z; ,„ o e: k £> o K 


2570 

D84296 

xav^iuv^F oauicilki If iUJXX J. 

863 

89 . 189 

2571 

X15573 

Homo SSDiens 1 — t*»Vio«3T"»H nf rnr t"olr"i r»a a <=» 

236 

62 . 264 

2572 

AC002310 

Homo sapiens Unknown gene product 

604 

100. 000 

2573 

AF041373 

natLuo uui vcvjituo ^J.aLuX 111 abbcluOiy prOU6in 
n horti form 

445 

60 . 800 

2574 

M64780 

RAtt~n«? nnrvPfri fiia arrri n 
ivaLuuLi uui. vcy J_X.11 

26 7 

38 . 614 

2575 

X74818 


2 90 

35 . 948 

2576 

X15657 

Drosophila melanogaster Elf-1 protein (AA 1- 
1063 ) 

229 

46 .429 

2577 

AB018281 

Homrt <3£af> "i *=»T1 Q "K T & ZV n *7 1 R nrnt-oin 

46 9 

52 . 817 

2578 

AF151843 

Homo Q^ni ptiq f^^lT — nrrthoi n 

O *7 O 

278 

78 . 182 

2579 

U41558 

^aciiuiiiauux L-lfl cxc^allo IN kJ Uti X» XI1JL U. XUH Xlllc JLOUnd 

A O *i 

48.361 

2580 

Y09945 

R t~ t~ 1 1 nOTVArtl flici m 1 1~ t~ ~i ira 1 nhprrval mamKvariA 

n.ai,i.uo mji vcy iuuo puLaUi. vc XllUcy I al [nt=liuJX7<Xlie 

transoort orotein 

4 98 

50 .000 

2581 

AF100960 

Rattus norvegicus protocadherin 

CI Q 


2582 

AF104260 

Homo s aniens hiwi 

iDO 

3d . 6 07 

2583 

AB018258 

Homo saoiens KTAAQ71 ^ nrnhPi n 

001 
OOl 

55 . 981 

2584 

U20086 

Mus musculus NF2d9 

317 

97.959 

2585 

D90734 

Escherichia coli Helicase (EC 3.6.1.-) IV. 

517 

96.386 

2586 

Y12781 

"Uinu bdpxens cxansaucin tDeca/ ixKe l protein 

303 

80 . 000 

2587 

AF05S446 

<t? ~3 Tina rta 1 1 no Vi^ a 4- m — » «^-» ■ v ^«- x tjo at o 

225 

26 . 027 

2588 

U19463 

1*1 u o iiiuouuxud t±z. \j protein 

174 

56 . 098 

2589 

AT. 099971 

jriUu&£Z / X. 

Unknown similar to Guanine nucleotide exchange 
factor for Rho/Rac/Cdc42-like GTPases ; cDNA EST 
vk 

279 

40 . 952 

2590 

AB007864 

Homo ci^'rvJ pn <3 If T 21 O A C\ A. 

345 

47 . 573 

2591 

AB011535 

Xivjiuu BoUXCllO 11 ill \J V X. 

184 

100 . 000 

2592 

AF030430 

Mus musculus semaphorin Via 

487 

98.630 

2593 

U14 635 

^ctenui xidxjLix. uis eiegans similar co (jAi3A ana 

Cfl H TIP* f" <=»T"i t~ r^l T" C3 
y J. Jr 1.11C J. C ^ tr 

511 

43 . 011 

2594 

M2 88 8 0 

Homo «?^ir»'i on<i anlrvri n 

224 

94 . 444 

2595 

AF064 553 

l 'Uo UlVXo V_ L4.X- Ud IMoJJX |J X ) l_ t- X 11 

311 

95.349 

2596 

Z93241 

Homo sapiens dJ222E13 . 3 . 2 (PUTATIVE partial 
isofofm y) 

204 

90.625 

2597 

270310 

v_.aciiux iidijux uxo eicyallb rv. J L /\o . / D 

267 

39.416 

2598 

Z70200 

Homo cam" i^n q TT^ cnPMD - enpr«i -F-i r* ^ (\C\\rT\ 

rawing oaj^xcilo W-j olinXN rr — olJti t-i X> 1C Z UU K. J_/ prOCSlIl 

5272 

99.137 

2599 

AF099974 

Mus musculus i?chlaf <=»ri3 

0 A 0 

3 1 . 984 

2600 

AF117754 

Homo sapiens thyroid hormone receptor- 
associated protein complex component TRAP24 0 

426 

95.588 

2601 

D25538 

Homo sapiens KIAA0037 

251 

100.000 

2602 

Y12713 

i Y JUS IHUSCUIUS rxO-rOl-uUiraSG pOiyprOtSin 

793 

80 . 916 

2603 

M33874 

Aciiu Ljutt xacvxb AULcn prouein 

285 

44 . 444 

2604 

278417 

f~* ra pnHT'Via V>/-1 t 4- t q T on a r"i o oHKIA COT* pwm t . T~\ *1 o t a 

Laciiui xidxjcix. cis eieyans cxjina hbi bnoL : / oJo 
romp<? f" T*om f- Vi -i a o*=*n<^ • rHMZi uqt pmut . m 7007 ' 

wiuco i_j.^jiu Llixo y Cllc , \r LJV4J-\ do 1 12ii v iOXj . JL/^d fO J / 

comes from this aene * cDNA eqt vlc^i RctQ r oom<=»Q 
from this gene 

260 

29 . 596 

2605 

M76720 

Xenopus laevis egg- specific protein 

523 

64 . 516 

2606 

U76764 

Homo sapiens CD 9 7 

415 

64.130 

2607 

U92072 

Rattus norvegicus m-tomosyn 

1290 

98. 396 

2608 

U23516 

Caenorhabditis elegans No definition line found 

249 

4. 1 414. 

2609 

Z70038 

Caenorhabditis elegans cDNA EST EMBL:D32579 
comes from this gene; cDNA EST EMBL:D35254 
comes from this gene; cDNA EST yk224b3.5 comes 
from this gene; cDNA EST yk357fl0.5 comes from 
this gene 

543 

37.949 


..i o s '4-6 Ni»3 u, o a e 6. o & 


2610 

Z24680 

Homo sapiens garp 

271 

84.615 

2611 

Z70038 

Caenorhabditis elegans cDNA EST EMBL:D32 579 
comes from this gene; cDNA EST EMBL:D3 52 54 
comes from this gene; cDNA EST yk224b3 . 5 comes 
from this gene; cDNA EST yk357fl0.5 comes from 
this gene 

793 

38.292 

2612 

M21389 

Homo sapiens keratin type II 

295 

58.333 

2613 

AL023777 

Schizosaccharomyces pombe rna binding protein 

241 

59.016 

2614 

U21317 

Caenorhabditis elegans similar to YBS4 YEAST 
(P38244) hypothetical 47.8 kD protein in HSP26- 
SEC18 intergenic region 

196 

28.986 

2615 

Y12781 

Homo sapiens transducin (beta) like 1 protein 

1121 

93.939 

2616 

AF097887 

Rattus norvegicus Chp 

1103 

98.780 

-sbl7 

M77003 

Mus musculus glycerol- 3 -phosphate 
acyl trans f erase 

237 

35.032 

OCT Q 

T f\OOl 1 

LiUooll 

Drosophila melanogaster adherin 

656 

53 . 804 

2619 

U20217 

Mus musculus fibrillin-2 

1007 

66.477 

262 0 

Z38112 

Caenorhabditis elegans E03A3.6 

231 

30.827 

2621 

Z26653 

Homo sapiens laminin M chain (merosin) 

191 

32.283 

2622 

AF155117 

Homo sapiens NY-REN- 62 antigen 

355 

50.000 

2623 

AF104260 

Homo sapiens hiwi 

682 

55.495 

2624 

U0 3 979 

Tripneustes gratilla dynein heavy chain isotype 
7B 

1034 

82.386 

2625 

AF113614 

Cricetulus griseus Toll-like receptor 2 

176 

31.973 

2626 

D833 90 

Gallus gallus connectin/titin 

193 

29.371 

2627 

U09413 

Homo sapiens zinc finger protein ZNF135 ' 

894 

65.405 

2628 

M92443 

Homo sapiens zinc finger protein 41 

591 

44 . 920 

2629 

AF002251 

Rattus norvegicus Maxpl 

1052 

87.027 

2630 

AF062256 

Homo sapiens immunoglobulin heavy chain 
variable region 

633 

75.591 

2631 

D12621 

Homo sapiens cytochrome P-4 50LTBV 

1152 

90.526 

2632 

U97002 

Caenorhabditis elegans similar to acyl-CoA 
dehydrogenases and epoxide hydrolases; Pfam 
domain PF00441 (Acyl-CoA_dh) , Score=57.4, E- 
value=l . 7e-16, N=2 ; contains similarity to Pfam 
domain PF00702 (Hydrolase), Score=57.4, E- 
value=le-13, N=l 

690 

53.125 


Ar 12 10 81 

Mus musculus cAMP inducible 2 protein 

1113 

80. 905 

ZD 

AL UUbbo / 

Homo sapiens FAA 

228 

100.000 

O £ c: 

Jjy u / z / 

Escherichia coli Hypothetical protein Y 

732 

100 . 000 

i a 

JO 

uzt U o XX 

Escherichia coli ORF ID:o320#13; similar to 

1006 

97.403 


AJC.UU0232 

Escherichia coli orf, hypothetical protein 

981 

100.000 

^ ^ "I ft 
Z O J o 

ZtXiX X / t> X o 

Homo sapiens hypothetical protein 

179 

68.571 


7\ T?1 Ai c 

Caenorhabditis elegans No definition line found 

146 

47.727 


TTO OT T7 

Escherichia coli ORF f329 

1430 

98.190 

2641 

D90713 

Escherichia coli TolA protein. 

880 

100.000 

Z O ft 

Ar UJ U4 J U 

Mus musculus semaphorin Via 

1910 

90.820 

OCA"? 
Z O ft .5 

Abuz o yy / 

Homo sapiens KIAA1074 protein 

189 

64.286 

2644 

U00039 

Escherichia coli No definition line found 

940 

100.000 

2645 

D90239 

i lwiu^j ac4^jit2xit* yiycine QecarDoxyxase precursor 

167 

100 . 000 

2646 

S80905 

Homo sapiens Conl=salivary concanavalin-A 
binding protein {exon 3} 

195 

26.891 

2647 

AB029324 

Rattus norvegicus TIP120-f amily protein TIP120B 

1743 

91.549 

2648 

Y08260 

Mus musculus cytoplasmic polyadenylation 
element-binding protein (CPEB) 

1313 

93.137 


Ik * 


j„ o o a a-t'fe <+3 „, o s si 161, o e 


2649 

AF100960 

Rattus norvegicus protocadherin 

555 

57.246 

0 ^ c n 

1UTO C A A A 

IYI8o444 

Sus scrofa moesin B 

892 

96 .429 

2651 

AL032684 

Schizosaccharomyces pombe 60s ribosomal protein 
12 

506 

58.400 

2652 

D90739 

Escherichia coli Orfl 5* of phoH. 

448 

100.000 

O £ CT "5 

26 53 

AIj031540 

Schizosaccharomyces pombe hypothetical ATP 
binding protein 

261 

29.956 


Tm a nor 
UUUUUb 

Escherichia coli similar to O.berteriana 
mitochon. protein involved, in cytochrome c 

896 

99.248 

AO J J 

J--* -7 UZZ / 

Cisciicticnid coil /iir-Dinainy procexn 

580 

100 . 000 

2656 

XV// 0 <o 

voivox carter x pneropnorxn-o 

238 

61 . 702 

2657 

AR009 1 

nUIUU Sd^JX tJUfci I\.1HHW J J J 

2 3 0 

30 . 952 

2658 

nnni 1 cio 

./""VXj UXX J JZ 


1100 

72 . 115 

2659 

AF09R QQ7 

Udt;iiuiiiclXJUJ.Llo clc^auS iMO Qc IiniLl Ou line j.OUnQ. 

3 62 

37 . 662 

266 0 

vJ / J. 

Katcus norvegicus nuncij- j 

782 

81 . 690 

0 0 j. 


mus tnuscuius synapcotagmxn 3 

1341 

97 . 549 

2662 

AC006593 

Arabidopsis thaliana unknown protein 

221 

29. 104 

Z t) O J 


Homo sapxens DOC1 

435 

40 . 642 


YQO CQi 

Mus musculus transcription factor 

194 

40 . 625 

2665 

AC007519 

Arabidopsis thaliana F16N3.15 

263 

26.606 

2666 

AF052042 

Rattus norvegicus zinc finger protein Yl 

563 

76 . 316 

2667 

U81259 

Pseudomonas aeruginosa carbamoylphosphate 
synthetase large subunit 

181 

76.316 

2668 

U35376 

Homo sapiens repressor transcriptional factor 

617 

74 . 775 

2669 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss -Prot; available at 
present via e-mail from LABEITOEMBL- 
Heidelberg . DE 

826 

100.000 

2670 

AJ006486 

Mus musculus RNA binding protein 

178 

92 . 857 

2671 

X90568 

Homo sapiens Protein sequence and annotation 
available soon via Swiss-Prot; available at 
present via e-mail from I±ABEIT@EMBL»- 
Heidelberg . DE 

1021 

99.412 

2672 

AF034611 

Homo sapiens intrinsic factor-B12 receptor 
precursor; cubilin 

503 

26 . 196 

2673 

X75603 

Pleurodeles waltl fibroblast growth factor 
receptor 3 

536 

32.738 

2674 

Z75527 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to Human ARNT 
interacting protein (TR : G1144013 ) 

560 

28.820 

O *7 ^ 
ZD / D 

Uooxb / 

Caenorhabditis elegans contains similarity to 
v--*- uomains 

557 

29.268 

*o / o 

A x?n no 0 7 c 

hus muscuius TTixsmaccn- speciric nnymxne-JUNA 
glycosylate 

156 

95 . 652 

2677 

AxlT U J J 1 \J \J 

jfujinu odpiens aeiuex 

2 50 

40 . 952 

2678 

AB002377 

Homo sapiens KIAA0379 

2295 

68.738 

ZD / J 

ni -a <r t c 

Homo sapiens KIAA0010 

203 

96 . 667 


A "Dm QOT/1 
rtoUlOZ / 

Homo sapxens KIAA0731 protein 

601 

82 . 524 

2681 

D80001 

Homo saDiens similar t~o Vivmot~V»^t- -! r>z* 1 nrr»t-o-i n 
D4478 of S . cerevisiae. 

o O 3 

1UU . uuu 

2682 

U84725 

Mus musculus GAT A- 5 cardiac transcription 
factor 

687 

77.600 

2683 

AF003535 

Homo sapiens ORF2-like protein 

320 

78 . 947 

2684 

U12336 

Rattus rattus acetylcholine receptor alpha 9 

293 

69.841 


7^ 


.;£, O ilJ O «4-6 "43; „ 11 S id! 6. 0 2 




oUJJUlll *— 



2685 

U5 O 0 7 8 


430 

97 . 183 

2686 

D38582 

c*o V^-llC? X. X l^ilX. C* V_ W J. -L ruin 

1092 

96 . 591 

2687 

D90828 

Escherichia coli Pyruvate kinase (EC 2.7.1.40) 

793 

100.000 

268 8 

TT000"* 9 

cocnericnia con iNO aennition ims touna 

82 0 

100 . 000 

2689 

M80458 

Escherichia coli biotin carboxylase 

522 

100.000 


tti a a ni 

Escherichia coli vacB gene product 

993 

93 . 671 

2691 

D90725 

Escherichia coli 0RF ID:o212#3 

1008 

99.342 

o ^ a o 

M18747 

Escherichia coli glutamate synthase large 
suDunit \tz*\~ ^ . 0 . l . 

716 

98 .230 



Escherichia coli Transcriptional activator 
protein riecK . 

745 

100 . 000 

2694 

D90876 

Escherichia coli dihydrodipicolinate synthase 

V CiVw. x i. . Z. . J. . Z> Z. ) 

498 

97 .403 

2695 

AF00041 9 

CoUllcJL luilXa COll pUUaCIVS LrdllspOl C 

1123 

98 .315 


Ml 1 fi Q 

tscxiericnia con posicive regulatory protein 

436 

100 . 000 

2697 

L08399 

Escherichia coli peripheral membrane protein 

1199 

98.907 


none oq 

Escherichia coli Sensor protein copS (EC 
2.7.3.-) . 

938 

100 . 000 

2699 

X69089 

Homo sapiens 165kD protein 

339 

100.000 

^ /oo 

LI 3 601 

Escherichia coli homoserine kinase 

749 

100 . 000 

2701 

D90868 

Escherichia coli PUTATIVE PEPTIDASE IN GCVT- 
SPOIIIAA INTERGENIC REGION (EC 3.4.-.-). 

693 

97 .273 

2 702 

Z19601 

Escherichia coli ORF, trpS . Hall C.V., van 
Cleemput M. , Muench K.H. , Yanofsky C. ; J. Biol. 
Lnem. 257(11; : 6132-6136 (1982) 

538 

68 . 800 

2703 

D90701 

Escherichia coli Aspartate transaminase (EC 
2*0.1.1/ 

860 

89. 189 

O *7 f\ A 

TT*7 m 1 /I 

U7U214 

Escherichia coli hypothetical 

540 

100 . 000 



Escherichia coli Probable ATP-dependent 
helicase dinG homolog . 

909 

98 . 582 


vai i *7/i 

iS-U ± 1 / 4 

Escherichia coli DnaB replication protein 
\ cinac5 ) 

834 

100 . 000 

2 707 


oyLiicricnia con rixt protein . 

765 

98 . 319 

2708 

M38777 

Escherichia coli htpG ORF 

870 

99.286 

O "7 A Q 

UUUUUb 

Escherichia coli matches PS00443 : Glutamine 
amidotransf erases class- II active site 

775 

82 . 432 

2710 

AE000443 

Escherichia coli orf, hypothetical protein 

1135 

98.765 

Z / 11 

rlj / / UZ 

Escherichia coli queA 

774 

90 . 152 

I JLZ, 

/ICiUUUfi Ub 

Escherichia coli putative ATP -binding component 
of a transport system 

340 

100 . 000 

2 713 

T.i n^*?n 

Escherichia coli similar to R. meliloti 

•i-i^vxu. j. d u xsjii procein u, simnar co lysK ciass or 

i.CyUJ-«^V^J-jr Jk/JL^^C^11£3 

610 

93 . 000 

2714 

013159 

bo^iici xuiilcl X. jl CI ctlioctXUUlctac 

A K O 

inn n n n 
XKJXJ . UUU 

2715 

X71917 

*-*o v^iicx icnia ' j_ _l ^jiuudiiiciLc vie v.dl JJUAyxaoc 

1 1 CO 

ytj . xoz 

2716 

D90721 

E ^ r* h 1^ "i c\\ "5 ^ rnl "i TranQmamhranp nrnhoi n r1t*\T^^ 

ijOUiiCi. J,CllXa W -i- -L x i dllOllldlLUXi dliC piUUcXU WlkyiyV^. 

*k O O 

~\ r\r\ AAA 

2717 

D38504 

Escherichia coli formaldehyde dehydrogenase 

959 

99.301 

2718 

D90746 

cociici lcuxa (^vJii oulhj jUxOLcin 


y b . 192 

2719 

AJ224871 

Escherichia coli GTPase 

1178 

95 . 745 

2720 

X14152 

Escherichia coli SrmB protein 

802 

93.985 

2721 

D10483 

Escherichia coli baiF homolog (PIR : F37844 ) 

966 

99.324 

2722 

D90744 

Escherichia coli Flagellar hook-associated 
protein 1 (hapl) . 

617 

100.000 

2723 

X53696 

Escherichia coli glutamate- 1- semi aldehyde 2,1- 

757 

100.000 


7S0 


J . 11 0 S a + 6 4- 3 . 0 12 S! i& O 2 




aminomutase 



2724 

M13457 

Escherichia coll nmpC 

Rift 

O X o 

Q*7 ^4.1 

2725 

AE000391 

Escherichia col i outat ive transDort Drohein 

<3 J £j> 

inn nnn 

X VJVJ . \J \J\J 

2726 

X04890 

Escherichia coli threonine deaminase 

X X / \J 

qo qi o 

2727 

D90836 

Escherichia, coli Invasin. 

773 

on "*70 

2728 

D90878 

Escherichia coli DOlvDhoanhate kinase (EC* 
2.7.4.1) 


Q*7 ft 9^ 

2729 

X13141 

Escherichia coli ugpE protein (AA 1-281) 

841 

91.489 

2730 

AE000341 

Escherichia coli orf , hypothetical protein 

1479 

ft Q H7R 

2731 

U82598 

Escherichia coli hvDothetical nrotein 

X 


2732 

D90801 

Escherichia coli Mlc protein. 

1 ftTO 
XV JO 

ftfi 1 ft 
70 . O X -7 

2733 

L03845 

Escherichia coli glyoxylate carboligase 

896 

94 . 521 

2734 

M19501 

Rflrhpri rhi pi col "5 f OTtnvl rrl xrr*! n*=*;^m"i r!<^ 

ribonucleotide synthetase (EC 6.3.5.3) 

ft ft ft 
o o o 

qo AO 

2735 

U4 3 5 6 8 

Brugia malayi microf ilarial sheath protein SHP1 

178 

4 8 . 077 

2736 

AE000362 

Escherichia coli putative glucarate dehydratase 

1646 

98.361 

2737 

D90701 

Escherichia coli Hypothetical protein 200 (entA 
3 ' region) 

1342 

99.024 

2738 

D90825 

Escherichia coli ORF ID:o334#8; similar to 

936 

94. 156 

O "7 ^ Q 


nscnericnia con secA procein 

82 9 

96 .26 9 

2740 

D90888 

Escherichia coli PHOS PHO - 2 - DEHYDRO - 3 - 
DEOXYHEPTONATE ALDOLASE, TYR- SENSITIVE (EC 
A ~\ o i c\ / nunc nun o vcto ~> npnvvuunTAmTP 

840 

97.674 

2741 

D90823 

Escherichia coli 3-isopropylmalate 

r^Vi-v/HrorTf^ri;^ r<=» ( VC 111 Q c \ f H- TDM 
uciiyui uy cuaoc \ n>V— x.x.x.od/ \ X/ X xr i i 

dehvdroaenase) ( TmriH ) H-TPM-DH) 

815 

100.000 

2742 

D10483 

Escherichia coli hsp70 protein 

965 

100 .000 

2743 

U14 003 


*7 C c: 


2744 

U00006 

DD^ilCi XLUXCt £i X UlX J. cl X L. kJ d . v^(JXX Uyi U VdLC 

f orma te — 1 vaf?p act* i v^at* *i ncr pn7vmp 

O / fi 

i f\t\ r\ r\ n 
1UU . uuu 

2745 

D90732 

Escherichia coli Hypothetical protein K 

1269 

100.000 

2746 

X72298 

Escherichia coli homnl no t*o cell r»rot*<=»"!n ■F r-om 
Rhodobacter caDsulatus 

iqq 

J J7 

qo 41-5 

7 0 . Hxj 

2747 

AF009204 

Homo sapiens PSD-95/SAP90-associated protein-2 

278 

97. 619 

2748 

X15237 

Escherichia coli AnraD nrnfpin ( AA i _i pi \ 

/ 3D 

QO ICC 
27 a . X O D 

2749 

D90803 

Escherichia coli ^pnior r>rot - ein Rqt*R ^ "Pt"* 
2.7.3.-) . 

/ JV 

i nn nnn 

2750 

U82598 

Escherichia coli CitG homolog 

820 

100 000 

X w V . \J V/ V 

2751 

D90849 

Escherichia coli D-mannonate oxidoreductase (EC 
1.1.1.57) (Fructuronate reductase) 

1346 

99.495 

2752 

D90702 

Escherichia coli Sensor kinase CitA 

457 

100 000 

X V V/ • V V \J 

2753 

U14003 

Escherichia coli ORF f26 0 

743 

100.000 

2754 

M24905 

Escherichia coli recE 

1123 

99 .415 

2755 

D90777 

Escherichia coli Copper amine oxidase precursor 
(EC 1.4.3.6) (Tyramine oxidase). 

876 

100.000 

2756 

M13495 

Escherichia coli IlvrA orotein 

-7 O 

_7 / . J J J 

2757 

D90788 

Escherichia coli Alcohol dphvdrnapna qp T / T^f 
1.1.1.1) (ADH I) . 

1 1 TO 
X J — J .J 

i on nnn 

2758 

D90783 

Escherichia coli Tyrosine aminotransferase (EC 
2.6.1.5) (L- tyrosine : 2 -oxoglutarate 
aminotransferase) (TAT) . 

448 

98. 551 

2759 

D90741 

Escherichia coli MdoG protein. 

257 

100.000 

2760 

D90887 

Escherichia coli similar to 

706 

100.000 


IS I 
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2761 

D13328 

Escherichia coli Uronate isomerase 

1155 

99.438 

2762 

AE000249 

Escherichia coli putative LACI-type 
transcriptional regulator 

436 

100.000 

2763 

D85613 

Escherichia coli 30KD protein, similar to ADHX- 
RAT 

188 

100.000 

2764 

D90865 

Escnericnia con long- chain fatty acid 

T'TD A "KTC "D^TD T DD HTT T XT DDrPTTDCrtD /fYTTTTTD MT?MnDaKTP 

TTZiTlT. T^POTT^TTCM fOT'Tr'PT? M'PM'RT? ZiMTJ* T?T,P D'POT'PT'Nr^ 
C t\Lfxj c t\.\J X J2j X IM ) \ U U 1 Ij K l v Jljjl JDxv/\J.nJ il> C JUJr ir X I-> X IN / . 

189 

84 . 848 

2765 

D90716 

Escherichia coli Molybdenum cof actor 

628 

94.624 

2766 

U28377 

Escherichia coli ORF o2l9; alternate name yghB 

761 

100.000 

2 76 7 

U93405 

Escherichia coli 2 , 4 -dienoyl-CoA reductase 

"7 *7 C 

. o b o 

2768 

tta n r\ *7 
U00007 

Escherichia coli yohG 

O / £ 

34 . o / Z 

276 9 

M3 82 88 

Escherichia coli RNA polymerase beta subunit 

/■mnP^ tWC O 1 1 \ 

\ xpuL / \ z . / . / . o / 

4 Jo 

y / . 14 J 

2770 

X59460 

Escherichia coli penicillin-binding protein 4 
(PBP4) 

786 

97.414 

27 71 

L19346 

Escherichia coli urfl 

y i y 

Q -7 O O O 

y / . zzz 

2772 

X13002 

Escherichia coli exonuclease III 

554 

98.718 

2773 

AE000399 

Escherichia coli orf , hypothetical protein 

85 7 

98 . 561 

2774 

D90715 

Escherichia coli Aconitate hydratase, 
mitochondrial precursor (EC 4.2.1.3) (Citrate 

48 5 

97 . 368 

2775 

M24148 

Escherichia coli 2 , 3-dihydro-2 , 3- 
ainyuroxyDenzoaLe aenyuroyenase 

933 

100.000 

2776 

U28377 

Escherichia coli ORF f413 

359 

92.982 

2777 

U2 83 7 9 

Escherichia coli alternate name ygiP; ORF 310 

716 

97.24 8 

2778 

M32793 

Escherichia coli amylomaltase (malQ) 

780 

100.000 

2779 

D90756 

Escherichia coli 3-deoxy-D-manno-octulosonic 
acid 8 -phosphate synthetase 

406 

86.301 

2780 

M87625 

Escherichia coli cobalamin- independent 
methionine synthase 

805 

98.333 

2781 

AE000475 

Escherichia coli B12 -dependent homocysteine-N5- 
methyl tetrahydrof olate transmethylase , 
repressor oi me ucj ci i i ex metr 

1042 

98.171 

£ I o Z 


tiScnericnia con pucauxve enzyme 

CAT 

1UU . uuu 

/ O J 


H>h>LJIltri X.OI1X.C1 Lull pX7t>l-t>ll — ^lULdulaLc 

O / \J 

jr. iC. Z Z> 

Z / Ofi 


CjoLIIcX XCIlld CU11 putative ta^atUsc O pxlODpnatc 
=i "1 ^oq<=» / Vr <=» t~ o "i QnmPT^ 

di.UUOC/ JvC w O C IDvlllCluOC 

/ O 

qq T n a 

2785 

M38777 

Escherichia coli htpG ORF 

658 

100.000 

2786 

Y00544 

Escherichia coli PufX protein 

1000 

100.000 


VUUzb / 

escnericnia con reacuny irame yamma 

joy 

y ^ . y j y 

2788 

U82664 

Escherichia coli similar to E. coli ydhB 

868 

97.826 

278 9 

D9082 5 

Escherichia coli ORF ID:o334#7; similar to 

lb Ul 

1UU . UUU 

2790 

D90715 

Escherichia coli Aconitate hydratase, 
mitochondrial precursor (EC 4.2.1.3) (Citrate 
nyuxu~xyaac/ \AuuiixLabc/ . 

536 

86.598 

z / i*l 


TC o Vi A t* n /~»Vi T 3 1 A - rr>i ima r*a t* o _ r*nl\ 

Ciotiiciitiiia tui l 4 - t^uuuiai ate 

homolog (PIR:S01667) 

4D1 

i nn nnn 

±.\JVJ . U v \J 

? 7 99 

J^O J J o 

Fcjche^Tfichia. coli DNA no! vitipts^p TTT alnha 
chain (EC 2.7.7.7) . 

1178 

99 . 432 

2793 

M12858 

Escherichia coli beta-cystathionase 

787 

100.000 

2794 

L03845 

Escherichia coli glyoxylate carboligase 

771 

100.000 

2795 

D90880 

Escherichia coli IMP dehydrogenase (EC 
1.1.1.205) 

1051 

93 .750 


1SX 
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2796 

X04341 

Escherichia coli recF protein 

1087 

99 .379 

2797 

X76979 

Escherichia coli orf303 

486 

93.976 

2798 

X54945 

Escherichia coli product appears to be 
membrane bound 

786 

98.305 

2799 

D90791 

Escherichia coli ORF ID:o280#4; similar to 

1029 

98 . 137 

2800 

AE000415 

Escherichia coli putative transport 

735 

100.000 

2801 

D90868 

Escherichia coli PTS SYSTEM, FRUCTOSE -SPECIFIC 
IIBC COMPONENT (EIIBC-FRU) (FRUCTOSE- PERMEASE 
IIBC COMPONENT) (PHOSPHOTRANSFERASE ENZYME II, 
BL. LOMPUNhW 1 ; (hiC 2. / .1.69) \EII-rKU; . 

859 

100 . 000 

2802 

D83536 

Escherichia coli Lipid-a-disaccharide synthase 

I 2.4.1.182/ . 

721 

99.115 

Tom 

D90842 

Escherichia coli ORF ID:o352#3; similar to 

786 

96 . 800 

28U4 


Escherichia coli SrmB protein 

712 

97 .321 

28 05 


Escherichia coli Lipoprotein RlpA precursor . 

123 9 

100 .000 

2o0o 

D90758 

Escherichia coli hyothetical protein (purT 

L "y J. Oil / 

775 

n n AAA 

99 . 099 

o u / 


Escherichia coli putative enzyme 

Hot) 

1 Art f\f\f\ 

2808 

AB000275 

Homo sapiens DAP -2 

182 

100.000 

2809 

X04619 

Escherichia coli A protein (AA 1-388) 

1555 

100 . 000 

2810 

Y17108 

Homo sapiens rhomboid- related protein 

358 

42.748 

2811 

AF071071 

Mus musculus protein kinase Myak-S 

540 

47 . 312 

2812 

U06713 

Rattus norvegicus SM-20 

349 

45 . 370 

2813 

U07817 

Dictyostelium discoideum glutamine-asparagine 
rich protein 

238 

25 . 098 

2814 

Z49216 

Homo sapiens mitoxantrone- resistance associated 
gene 

338 

84 . 211 

2815 

Z95334 

Schizosaccharomyces pombe hypothetical protein 

1206 

41 . 880 

2816 

AB023186 

Homo sapiens KIAA0969 protein 

193 

36 . 842 

2 817 

AF00966 8 

multiple sclerosis associated retrovirus 
polyprotein 

376 

53 . 097 

2 818 

M2 3236 

Mus musculus proline-rich protein 

344 

33 . 110 


ABU28942 

Homo sapiens KIAA1019 protein 

6540 

96 . 555 

2820 

D31885 

Homo sapiens KIAA006 9 

1122 

94.545 

2821 

LOO 92 3 

Mus musculus myosin I 

5487 

96 . 445 

2822 

AF06 9307 

Homo sapiens sodium-dependent multivitamin 
transporter 

1050 

52 . 535 

O Q O *J 


Mus musculus ribosomal protein L21 

4 98 

TO TOO 

/8 . 788 

O O O /t 

Ar UolzbO 

Mus musculus testis-specif ic chromodotnain Y- 

~\ "J If d T"» 'V* /"^ ♦"" £^ "i T"» 

1 lKc piOLclll 

628 

61 . 988 

2825 

AF052573 

Homo sapiens DNA polymerase theta 

11511 

99. 773 

j£ O Z, D 

77Q1 /I /l 

Mus musculus unknown 

r Aft 

buo 

/ o . /by 

/ItJZ / 

TTH Q/l T 1 

nomo sapiens zinc unyer procexn &»x*c ijj 

C O A 

D24 

a r\ con 

4U . D2 U 

O Q O Q 

Ax3U J.4 r> / 8 

Homo sapiens KIAA0678 protein 

6414 

98 . 438 

2829 

AF102546 

Homo sapiens dachshund 

3539 

93 . 944 

2()iU 

no o ~5 -7 /r 
U2 2 J /o 

Homo sapiens alternatively spliced product 

lie? i nfr £a*v/*^n 1 ^ R 
Ublliy CAUll 1 JH 

380 

70 . 103 

2831 

M74002 

Homo sapiens arginine-rich nuclear protein 

458 

30.311 

O Q "3 O 

zbiz 

Ar U / o 2 b U 

Lyt echinus variegatus hyalin 

295 

2 9.210 



rlLJlI Iw bd^JlcIlo oUJJUIllL {_/!_ KiN>i puiyillcl citifc; XI 

transcription factor TFIID 


Q"7 Q (Z Q 
zf / . zf o zf 

2834 

AF074086 

Homo sapiens envelope 

697 

56.571 

2835 

AL021918 

Homo sapiens b34I8.1 (Kruppel related Zinc 
Finger protein 184) 

1611 

44 .073 

2836 

M63311 

Cricetulus sp . cAMP- dependent protein kinase 

2226 

97 .436 


nib 


1, 0 O B H-6 4-3 ,„ O £!! 2 £» O i2 




alpha- catalytic subunit 



283 7 

D7083 1 

Homo sapiens Zinc- finger protein 

1032 

66.109 

O Q *5 O 

Jo 

U57347 

Sus scrofa Spl transcription factor 

267 

57.143 

O Q "3 Q 

z. o j y 

At* 063243 

Bos taurus ribosomal protein L30 

407 

76 .000 


TTQ "5 1— "-7 /-V 

U9 3570 

Homo sapiens p4 0 

228 

47.368 


AF183139 

Cercopithecine herpesvirus 15 trucnated type 2 

T?DM7\ O 
fc.BNA2 

202 

30.303 


Ml 1 C\C\ 
I V 1_L 

Rattus norvegicus unknown protein 

373 

42 .857 

2843 

AF159297 

Zea mays extensin-like protein 

412 

25.638 

2844 


Homo sapiens mucin 

257 

26 . 044 

2845 

ALi080l40 

Homo sapiens hypothetical protein 

4718 

99.018 

o q a a 

AF 104413 

Homo sapiens large tumor suppressor 1 

1861 

92.759 

2847 

U35376 

Homo sapiens repressor transcriptional factor 

1288 

53 .352 

2848 

AC004883 

Homo sapiens similar to KIAA0766; similar to 
PID : g3882253 

722 

36 . 548 

284 9 

D86980 

Homo sapiens KIAA02 27 

340 

36.481 

2850 

% /• ^ « ft A/- 

M34 986 

Homo sapiens erythropoietin receptor precursor 

269 

70.588 

-id bl 

U15174 

Homo sapiens BCL2 /adenovirus E1B 19kD- 
interacting protein 3 

508 

66 .667 

open 

TAT rtQCTCI 

ALuyo / bx 

Homo sapiens hypothetical protein 

5007 

99.493 

2853 

AL080170 

Homo sapiens hypothetical protein 

824 

46 .471 

-2854 

X61280 

Oryza sativa hydroxyproline-rich glycoprotein 

181 

39.623 

2855 

AB023191 

Homo sapiens KIAA0974 protein 

1166 

87 . 054 

2856 

AB018288 

Homo sapiens KIAA0 74 5 protein 

302 

72 . 857 

2857 

AB014587 

Homo sapiens KIAA0687 protein 

7749 

99. 915 

2858 

U37263 

Homo sapiens KRAB zinc finger protein; Method: 
conceptual translation supplied by author 

352 

49.541 

2859 

M73713 

Homo sapiens pregnancy- specif ic beta-l- 
glycoprotein 5 

362 

70.588 

"i o el r\ 
286 0 

X94335 

Saccharomyces cerevisiae YOR334 8c 

361 

27.184 

2861 

AC004144 

Homo sapiens R34001 1 

6792 

97 . 348 

2862 

AF038611 

Caenorhabditis elegans contains similarity to 
sulphatases 

784 

28.440 

2863 

X92485 

Plasmodium vivax pval 

364 

45.614 

2 864 

U51723 

Plasmodium vivax V- SERA 1 

168 

37.500 

2865 

AF083242 

Homo sapiens HSPC024-iso 

994 

94 . 706 

2866 

U60416 

Rattus norvegicus myr 6 myosin heavy chain 

2689 

93 .407 

O O £T T 

286 7 

Z75713 

Unknown cDNA EST EMBL:D36107 comes from this 
gene; cDNA EST yk208cl.3 comes from this gene; 
cDNA ES 

298 

24 . 127 


AU 004810 

Zea mays cytochrome P45 0 monooxygenase 

200 

78 . 049 

Tor q 

2oby 

AB007888 

Homo sapiens KIAA04 28 

1169 

59.365 

28 70 

Alt 02 1748 

Schizosaccharomyces pombe hypothetical protein 

293 

31.884 

/X 

Z81525 

Unknown cDNA EST yk282b7.5 comes from this 
gene; cDNA EST EMBL:D28011 comes from this 
gene ; cDNA ES 

766 

41.889 

9 ft 79 

Y ^ c: £: 

Homo sapiens gamma-G globin 

180 

46 . 835 

/ J 

HJjUZ _7 U U O 

Homo sapiens KIAA1085 protein 

3580 

96 .447 

OQ74 


Mus musculus tousled-like kinase 

370 

77 . 778 

2875 

AF151827 

Homo sapiens CGI -69 protein 

-L O -5 / 

/ O . O X 1 

2876 

M92443 

Homo sapiens zinc finger protein 41 

540 

35.959 

2877 

AF155096 

Homo sapiens NY-REN- 6 antigen 

2445 

87. 555 

2878 

AB014595 

Homo sapiens KIAA0695 protein 

4636 

99.861 

2879 

U38847 

Homo sapiens TAR RNA loop binding protein 

10152 

97.111 

2880 

U22376 

Homo sapiens alternatively spliced product 

474 

74.257 
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u s ing exon 1 3 A 



2881 

AF029249 

Mytilus edulis precollagen D 

192 

29. 815 

2882 

U80837 

Caenorhabditis elegans contains similarity to 
C2H2-type zinc fingers 

382 

36.257 

2883 

AF113615 

Homo sapiens FH1/FH2 domain- containing protein 
FHOS 

1228 

54.743 

2884 

Z11115 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk281d4.5 comes from this 
gene; cDNA EST yk288cl2.5 comes from this gene; 
cDNA EST yk406gll.5 comes from this gene 

719 

28.653 

2885 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

434 

50.000 

2886 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

293 

63.333 

2887 

X55995 

Rattus norvegicus dimethylglycine dehydrogenase 

1011 

89.535 

2888 

AL035709 

Arabidopsis thaliana putative protein 

704 

37.231 ! 

2889 

U4 1 3 8 7 

Homo sapiens Gu protein 

2182 

64.794 ■ 

2890 

Y09060 

murine herpesvirus 68 serine threonine rich 
glycoprotein 

220 

28.426 

2891 

AF125386 

Drosophila melanogaster L82C 

669 

46.083 

2892 

Z81552 

Caenorhabditis elegans similar to Thioredoxins ; 
cDNA EST yk4 91h3.5 comes from this gene 

704 

36.494 

2893 

AF002592 

Pterolebias xiphophorus cytochrome oxidase I 

223 

67. 089 

2894 

U49973 

Homo sapiens ORF1; MER37; putative transposase 
similar to pogo element 

315 

75.248 

2895 

AC006592 

Arabidopsis thaliana unknown protein 

243 

26 . 648 

2896 

AB020684 

Homo sapiens KIAA08 7 7 protein 

1161 

71. 795 

2897 

AL080125 

Homo sapiens hypothetical protein 

452 

53 .488 

2898 

AF041054 

Mus musculus E1B 19K/Bcl -2 -binding protein 
homo log 

292 

63.636 

2899 

AB020725 

Homo sapiens KIAA0918 protein 

400 

35.548 

2900 

266563 

Unknown Similarity to myosin; cDNA EST 
yk24 9a4.5 comes from this gene; cDNA EST 
yk270h6.5 comes fro 

593 

29.004 

2901 

U79260 

Homo sapiens unknown 

328 

51.145 

2902 

AL032655 

Caenorhabditis elegans predicted using 
Genefinder; similar to Inositol monophosphatase 
family; cDNA EST yk255ell.5 comes from this 
gene; cDNA EST yk610f9.3 comes from this gene 

372 

32.827 

2903 

AC004883 

Homo sapiens similar to KIAA0766; similar to 
PID:g3882253 

394 

32.370 

2904 

U83176 

Mus musculus ROSA26AS 

323 

23.451 

2905 

AF004161 

Oryctolagus cuniculus peroxisomal Ca- dependent 
solute carrier 

1006 

66 . 814 

2906 

M94 131 

Homo sapiens mucin 

291 

24 . 758 

2907 

U2 7196 

Gallus gallus zinc finger protein 

1616 

64 .494 

2908 

AC004877 

Homo sapiens zinc finger- like; similar to 
P52 742 (PID:gl731411) 

670 

31.159 

2909 

AF167320 

Mus musculus zinc finger protein ZFP113 

786 

67.702 

2910 

M23236 

Mus musculus proline -rich protein 

199 

30.488 

2911 

AF184226 

Drosophila melanogaster BcDNA. GH09045 

894 

37. 915 


Ay yZ 71 

Caenorhabditis elegans similar to zinc 
metallopeptidase (M8 family) ; cDNA EST 
EMBL:C07771 comes from this gene; cDNA EST 
EMBL:C09261 comes from this gene; cDNA EST 
yk259cl.5 comes from this gene 

324 

36 . 184 

2913 

Z72506 

Unknown Similarity to Hydra RAS like protein 

291 

47.107 


iff 
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RAS2 (SW:RAS2_HYDMA) ; cDNA EST EMBL:D72418 
comes from 



2914 

D67066 

Bos taurus N-WASP 

167 

33.511 

2915 

X56044 

Mus musculus protein Htf9C 

2251 

84.938 

2916 

X73974 

Homo sapiens ribosomal protein L4 

334 

64.800 

2917 

U72882 

Homo sapiens interferon- induced leucine zipper 
protein 

1638 

91.447 

2918 

X92352 

Mus musculus homology to nucleosome assembly 
proteins; specifically expressed in neurons 

373 

33 .422 

2919 

S67970 

Homo sapiens ZNF75=KRAB zinc finger 

1206 

75.641 

2920 

AB017616 

Mus musculus homologous to the yeast YGR163 
gene 

1923 

81.989 

2921 

AL080141 

Homo sapiens hypothetical protein 

1134 

54.695 

2922 

AF055084 

Homo sapiens very large G-protein coupled 
receptor- 1 

11286 

99.319 

2923 

AL050395 

Homo sapiens hypothetical protein 

846 

88.506 

2924 

AF135440 

Mus musculus huntington yeast partner C 

384 

68.041 

2925 

U07974 

Gallus gallus unknown 

175 

28 .105 

2926 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

358 

52 .258 

2927 

Z78544 

Caenorhabditis elegans predicted using 
Genefinder,- Similarity to C. elegans Guanine 
nucleotide binding protein (WP : C14B1 . 4 ) ; cDNA 
EST yk567gl2.3 comes from this gene 

176 

69.767 

2928 

S80119 

Rattus sp. reverse transcriptase homolog 

325 

53 . 782 

2929 

M59227 

Homo sapiens alpha -1 type III collagen 

165 

37.615 

2930 

AF039187 

Schistosoma japonicum myosin 

274 

24 .217 

2931 

Z68756 

Homo sapiens Huntington Disease (HD) gene exon 
1 

171 

35.632 

2932 

Y11770 

Mus musculus very- long- chain acyl-CoA 
dehydrogenase 

1714 

47.430 

2933 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

384 

50.000 

2934 

U10414 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00005 (ABC t ran) , Score=245.2, 
E-value=3e- 70 , N=2 

446 

46.087 

2935 

AF040105 

Homo sapiens RCL 

517 

67.647 

2936 

Z69635 

Unknown Similarity to Yeast uridine kinase 
(SW:URK1_YEAST) ; cDNA EST EMBL:Z14695 comes 
from this ge 

705 

49.393 

2937 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

439 

62.281 

2938 

X65120 

Homo sapiens alphal (X) collagen 

487 

36 .424 

2939 

AF151825 

Homo sapiens CGI -67 protein 

1379 

79.592 

2940 

AC004893 

Homo sapiens similar to NEDD-4 (KIA0093) ; 
similar to P46934 (PID :gll71682) 

1249 

96.939 

2941 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

438 

77.895 

2942 

D50685 

Trypanosoma cruzi trans -sialidase 

274 

41.709 

2943 

M33784 

Dictyostelium discoideum protein- tyrosine 
kinase-2 (DPYK2) 

299 

30.449 

2944 

X53581 

Rattus norvegicus ORF3 

190 

37.143 

2945 

S80119 

Rattus sp. reverse transcriptase homolog 

167 

56.818 

2946 

U53153 

Caenorhabditis elegans one short region of weak 
similarity to S. cerevisiae protease A 
inhibitor 3 (SP:P01094) and another short 
region of weak similarity to S. cerevisiae 
glucose repression mediator protein (SP:P14 922) 

513 

34.756 
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2947 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

389 

64 . 151 

2948 

AB020675 

Homo sapiens KIAA0868 protein 

8903 

97 . 470 

2949 

AF067165 

Homo sapiens zinc finger protein 3 

728 

52 . 423 

2950 

AB014580 

Homo sapiens KIAA0680 protein 

779 

39 .492 

2951 

U71601 

Homo sapiens zinc finger protein zfp4 7 

248 

26 . 866 

2952 

M64793 

Rattus norvegicus salivary proline- rich protein 

266 

30 . 943 

2953 

AF060173 

Rattus norvegicus SV2 related protein 

394 

30 . 370 

2954 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

413 

67 .647 

2955 

U79260 

Homo sapiens unknown 

356 

71.429 

2956 

AF095155 

Mus musculus Clq-related factor 

310 

33 .000 

2957 

X91617 

Mus musculus 5' -3' exonuclease 

1721 

56 .593 

2958 

AC002505 

Arabidopsis thaliana unknown protein 

189 

21.921 

2959 

AC006550 

Arabidopsis thaliana Similar to gb|U70015 
lysosomal trafficking regulator from Mus 
musculus and contains 2 PF| 00400 WD40, G-beta 
repeats. ESTs gb|T43386 and gb|AA3952 3 6 come 
from this gene. 

528 

33 .542 

2960 

K02298 

Rattus norvegicus chymotrypsin B 

1644 

91 .288 

2961 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

295 

64 .865 

2962 

AL080155 

Homo sapiens hypothetical protein 

2487 

91.991 

2963 

AF020261 

Santalum album proline rich protein 

206 

27.099 

2964 

Z69730 

Schizosaccharomyces pombe putative ranbp7- 
importin-beta-cselp superfamily protein 

445 

24 .756 

2965 

AF071103 

Drosophila melanogaster myo-inositol-1- 
phosphate synthase 

1866 

61.169 

2966 

X70944 

Homo sapiens PTB-associated splicing factor 

1674 

67.196 

2967 

AF095350 

Homo sapiens RAB-like protein 2A 

755 

55.839 

2968 

AF036705 

Unknown Similar to phytoene desaturase; coded 
for by C. elegans cDNA CEESX74F; coded for by 
C. el 

746 

43 .987 

2969 

AL117452 

Homo sapiens hypothetical protein 

5690 

97.068 

2970 

X59720 

Saccharomyces cerevisiae YCR009c, len:265 

264 

26 .446 

2971 

AC005396 

Arabidopsis thaliana putative proline-rich cell 
wall protein 

219 

34 .694 

2972 

AF078848 

Homo sapiens BUP 

985 

62 .136 

2973 

D87076 

Homo sapiens similar to human bromodomain 
protein BR140 ( JC2069) 

3649 

97 . 909 

2974 

AF056116 

Fugu rubripes unknown 

963 

51 .304 

2975 

AB011084 

Homo sapiens KIAA0512 protein 

523 

34.426 

2976 

U73522 

Homo sapiens AMSH 

1185 

53.736 

2977 

AF170708 

Homo sapiens T-box protein TBX3 

4480 

98 . 207 

2978 

Z49068 

Unknown similar to GTP-binding protein; cDNA 
EST EMBL:M8 9111 comes from this gene; cDNA EST 

C1V/ITJ3 T TV "** 

EMBL : D2 

64 

25 . 000 

2 97 9 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

410 

71 . 579 

2 980 

AB023160 

Homo sapiens KIAA0943 protein 

2508 

99 . 728 

2 y o l 

U7 0 93 5 

Peromyscus maniculatus reverse transcriptase 

265 

48 . 214 

2 982 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

3 97 

51.462 

*"> G O "3 

2 yoo 

Ar loblol 

Homo sapiens FLASH 

6603 

97 . 103 


FlR 

riomo sapiens cysteine protease 

JL / 2 


2985 

AJ005071 

Gallus gallus Tapasin 

315 

34 .091 

2986 

U30292 

Mus musculus collagen type XIII alpha- 1 chain 

725 

51.835 

2987 

AJ243459 

Leishmania major proteophosphoglycan 

324 

31.414 

2988 

U85995 

Homo sapiens unknown 

2241 

90.618 

2989 

U49082 

Homo sapiens transporter protein 

1046 

54 .785 


■7<n 
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2990 

AB023209 

Homo sapiens KIAA0992 protein 

4871 

95 . 460 

2991 

M14123 

Homo sapiens pol/env ORF (bases 3 878-8257) 
first start codon at 4172; Xxx; putative 

8432 

95 . 434 

2 y y2 

M20789 

Homo sapiens alpha- 1 type I collagen 

225 

31 . 307 

2993 

D63876 

Homo sapiens KIAA0154 gene product is related 
to mouse gamma adapt in . 

4406 

94.077 

2994 

X14401 

Rattus rattus ribosomal protein L34 

284 

67.470 

o a a c 
2 995 

AF167321 

Mus musculus zinc finger protein ZFP2 3 5 

1873 

60 . 664 

2996 

AF15 92 97 

Zea .mays extensin-like protein 

404 

27 . 051 

2997 

v n c o ao 

X95808 

Homo sapiens X- linked mental retardation 
candidate gene 

860 

33 . 448 

*"> Q Q O 

2 y y o 

T1*"> O *5 "7 £T 

U22 J /O 

Homo sapiens alternatively spliced product 
using exon 13 A 

361 

65 . 591 

2999 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

709 

71. 687 

3000 

AJ243459 

Leishmania major proteophosphoglycan 

186 

29.461 

3001 

U97553 

murine herpesvirus 6 8 unknown 

281 

28 . 708 

3002 

X56044 

Mus musculus protein Htf9C 

602 

46 . 154 

3003 

Z46242 

Caenorhabditis elegans similar to WD domain, G- 
beta repeat; cDNA EST yk283e3.3 comes from this 
gene; cDNA EST yk238e2.3 comes from this gene ; 
cDNA EST yk283e3.5 comes from this gene; cDNA 
EST yk2 3 8e2.5 comes from this gene 

1011 

39. 009 



Mus musculus arachidonate 12 (S) -lipoxygenase 

32 52 

74 . 548 

*a n n c 

Ay 24 o t> 

Plasmodium vivax pval 

181 

51 . 613 

~a n o 
j UUb 

TV TT»rt -1 rk T A A 

Homo sapiens neuronal thread protein AD7c-NTP 

431 

72 . 165 

JUU / 

V / UoJ 1 

Homo sapiens Zinc- finger protein 

373 

35 . 439 



Mus musculus PAP-1 

810 

56 . 391 


AbUz Jib J 

Homo sapiens KIAA0946 protein 

4478 

98 . 8 02 

3010 

Y10746 

Homo sapiens methyl -CpG binding protein 

3952 

99.102 

3011 

D2 5215 

Homo sapiens KIAA0032 

2351 

55 . 199 


At Uj4 / 4 fa 

Mus musculus LNXp70 

260 

82 . 812 


AJjUoUIjo 

Homo sapiens hypothetical protein 

5255 

99 . 264 

3014 

AF056116 

Fugu rubripes unknown 

381 

36 .715 

3015 

AF036699 

Caenorhabditis elegans Similar to cuticular 
collagen; F58F6.2 

180 

29. 956 

3016 

K033 32 

Human herpesvirus 4 nuclear antigen 2 

227 

28 . 800 

3017 

AB011084 

Homo sapiens KIAA0512 protein 

856 

40 . 670 

J 0 lb 

U4 1534 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00271 (helicase_C) , Score=49.0, 
b-vaiue=3 . Je-ii, N=i 

606 

41.833 

juiy 

VUU14o 

Caenorhabditis elegans unnamed protein product 

176 

36 . 406 


u / y z o u 

Homo sapiens unknown 

4 02 

75 . 824 

3021 

AB022915 

Mus musculus sif and Tiaml-like exchange factor 

3634 

84 . 978 

3022 

AJ001616 

Mus musculus myeloid associated differentiation 
protein 

237 

39.623 

jUz J 

1*1X2 uy y 

Mus musculus proline -rich protein 

2 57 

36 . 000 

i no a 

AliUJDD 3D 

Schizosaccharomyces pombe hypothetical protein 

471 

40 . 693 


o / y y lb 

Drosophila sp. Hls=155 kda putative DE-H type 
KNii - aepenaenc AiFase-nencase/ kjna localizing 
protein 

258 

28 . 879 

3026 

AL031709 

Homo sapiens C316G12.3 (novel protein) 

1807 ! 

76 . 804 

3027 

AC006539 

Homo sapiens BC3 94 98 2 

694 

48 . 864 

3028 

AL110228 

Homo sapiens hypothetical protein 

4620 

97.931 

3029 

AB020698 

Homo sapiens KIAA08 91 protein 

3695 

94 . 435 

3030 

AJ005890 

Homo sapiens JM1 

1043 

86 . 301 


157 
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3031 

AJ388557 

Canis familiaris zinc finger protein 

1818 

56 .838 

3032 

V00147 

Caenorhabditis elegans unnamed protein product 

206 

35.429 

3033 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

385 

61.111 

3034 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

366 

65.657 

3035 

U29154 

Caenorhabditis elegans T07F12 . 1 gene product 

451 

31.939 

3036 

AL050396 

Homo sapiens filamin 

5534 

77 .814 

3037 

L01042 

Homo sapiens TATA element modulatory factor 

5656 

99.460 

3038 

Z81503 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen; cDNA EST 
EMBL:D654 50 comes from this gene; cDNA EST 
EMBL:D68888 comes from this gene 

146 

33 .000 

3039 

D10627 

Mus musculus zinc finger protein 

756 

58.730 

3040 

U93571 

Homo sapiens p4 0 

263 

28 . 959 

3041 

AJ243459 

Leishmania major proteophosphoglycan 

270 

36.123 

3042 

AL080159 

Homo sapiens hypothetical protein 

637 

48 . 980 

3043 

Z77664 

Unknown predicted using Genefinder; similar to 
Zinc finger, C2H2 type; cDNA EST CEMSC43F comes 
from 

162 

34.884 

3044 

AL079308 

Streptomyces coelicolor putative 
serine/threonine protein kinase 

194 

31.795 

3045 

AB007887 

Homo sapiens KIAA042 7 

325 

29.221 

3046 

AB026190 

Homo sapiens Kelch motif containing protein 

480 

26 .882 

3047 

Y13374 

Homo sapiens putative prenylated protein 

966 

82 .530 

3048 

Y08766 

Homo sapiens SFl-Bo isoform 

86 

29.101 

3049 

M80650 

Caenorhabditis elegans alpha- collagen 

206 

31.414 

3050 

AJ242777 

Mus musculus ABINs , A2 0 -binding inhibitor of 
NF-kappa B activation (small) 

567 

36 .503 

3051 

X71621 

Saccharomyces cerevisiae hypothetical 58.9 kD 
protein 

186 

24.924 

3052 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

274 

24 .568 

3053 

Y11395 

Homo sapiens seven transmembrane -domain protein 

1205 

57.190 

3054 

AF156271 

Homo sapiens RING finger protein terf 

202 

30.769 

3055 

V00148 

Caenorhabditis elegans unnamed protein product 

249 

34 . 766 

3056 

U00025 

Caenorhabditis elegans weak similarity to ATP 
synthase B chain 

674 

28 .320 

3057 

AL080125 

Homo sapiens hypothetical protein 

525 

61 .268 

3058 

AF151827 

Homo sapiens CGI- 69 protein 

1344 

76 . 144 

3059 

AL080123 

Homo sapiens hypothetical protein 

528 

30.366 

3060 

L11672 

Homo sapiens zinc finger protein 

442 

26 .448 

3061 

X74764 

Homo sapiens protein- tyrosine kinase 

5319 

96.647 

3062 

AF149093 

Mus musculus zinc finger ZF-12 

354 

59.036 

3063 

U79260 

Homo sapiens unknown 

242 

56.180 

3064 

D31887 

Homo sapiens KIAA006 2 

1316 

50 .000 

3065 

AF071081 

Mycobacterium tuberculosis proline-rich mucin 
homo log 

188 

25.721 

3066 

D86966 

Homo sapiens similarto human ZFY protein. 

554 

36 . 797 

3067 

X62681 

Gallus gallus limb deformity protein 

564 

49.721 

3068 

AF087142 

Homo sapiens TED protein 

1040 

41 . 016 


21171 n QQflC 

Mus musculus NG22 

720 

30.270 

3070 

AB000459 

Homo sapiens unnamed protein product 

603 

36 .000 

3071 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

383 

31.976 

3072 

AF053356 

Homo sapiens leucin rich neuronal protein 

1429 

67.016 

3073 

AB011164 

Homo sapiens KIAA0592 protein 

8471 

97 . 791 

3074 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

715 

66 .851 
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3075 

AF039698 

Homo sapiens antigen NY- CO- 3 3 

489 

70.642 

3076 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

476 

50.228 

3077 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

706 

69.492 ; 

3078 

M13100 

Rattus norvegicus unknown protein 

325 

35.266 

3079 

L29028 

Unknown amino acid feature: N-glycosylation 
sites, aa 41 . . 43, 46 . . 48, 51 . . 53, 72 
74, 107 . 

224 

31.609 

3080 

AJ243459 

Leishmania major proteophosphoglycan 

177 

25.858 

3081 

AF043944 

Mytilus edulis nongradient byssal precursor 

277 

27.792 

3082 

S69693 

Leishmania donovani, ssp. infantum, Ethiopian 
LV9, amastigote, Peptide, 236 aa stage-specific 
S antigen homolog=A2 {repetitive sequence} 

226 

36.076 

3083 

X79881 

Rattus norvegicus aggrecan like protein/ 
brevican 

522 

56.329 

3084 

AF039939 

Canis familiaris type II collagen 

94 

44 .444 

3085 

U04267 

Gossypium barbadense proline- rich cell wall 
protein 

143 

27.749 

3086 

AC006220 

Arabidopsis thaliana hypothetical protein 

354 

51.724 

3087 

X61295 

Rattus norvegicus LI retroposon, a portion of 
its ORF2 sequence 

308 

39.735 

3088 

AB02 9011 

Homo sapiens KIAA1088 protein 

4037 

86.686 

3089 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

419 

71.134 

3090 

Z2 8201 

Saccharomyces cerevisiae ORF YKL2 01c 

151 

27 .273 

3091 

AB002336 

Homo sapiens KIAA03 38 

6169 

99.893 

3092 

AF159297 

Zea mays extensin-like protein 

284 

25.967 

3093 

X92485 

Plasmodium vivax pval 

269 

58.333 

3094 

AF060570 

Mus musculus rig-1 protein 

1698 

73.156 

3095 

AF118223 

Arabidopsis thaliana No definition line found 

217 

50.704 

3096 

U20107 

Mus musculus synaptotagmin VIII 

1134 

69.922 

3097 

X513 94 

Xenopus laevis APEG precursor protein 

376 

36.861 

3098 

X79510 

Homo sapiens protein- tyrosine-phosphatase 

6982 

97.056 

3099 

AF112299 

Homo sapiens integral inner nuclear membrane 
protein MAN1 

342 

44 .800 

3100 

Z28201 

Saccharomyces cerevisiae ORF YKL2 01c 

357 

36 . 747 

3101 

U09367 

Homo sapiens zinc finger protein ZNF136 

577 

52.717 

3102 

AF121009 

Mycobacterium tuberculosis H3 7Rv hypothetical 
protein Jv0534 

183 

31.609 

3103 

U37263 

Homo sapiens KRAB zinc finger protein; Method: 
conceptual translation supplied by author 

385 

57.692 

3104 

AB007447 

Homo sapiens Fln2 9 

165 

62.745 

3105 

D13641 

Homo sapiens mitochondrial outer membrane 
protein 19 

416 

75.962 

3106 

U41007 

Caenorhabditis elegans similar to G beta 
repeats (PROSITE : PS00670) 

280 

37.226 

3107 

Z82268 

Unknown predicted using Genefinder; similar to 
CUTICLE COLLAGEN 34; cDNA EST EMBL:D65629 Comes 
from 

227 

35.971 

3108 

M73980 

Homo sapiens TAN1 

6446 

98.970 

3109 

AC006293 

Homo sapiens immunoglobulin- like transcript 10 

2290 

88. 060 

3110 

AF060248 

Arabidopsis thaliana unknown 

420 

35.547 

3111 

X07881 

Homo sapiens proline -rich protein Gl 

248 

27.313 

3112 

U91318 

Homo sapiens Gene product with similarity to 
Multidrug resistance protein MRP1 

397 

60.484 

3113 

U68488 

Homo sapiens 5 -hydroxytryptamine7 receptor 

2150 

82.989 


7kt> 
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isoform d 



3114 

Au 24 55 87 

Homo sapiens Kruppel-type zinc finger 

247 

35 .648 

3115 

U33460 

Homo sapiens DNA- directed RNA polymerase I, 
largest subunit 

1923 

98 .371 

3116 

AF159297 

Zea mays extensin-like protein 

231 

35.581 

3117 

Al»03 5423 

Homo sapiens dJ20I3.1 (brain mitochondrial 
carrier protein- 1 (BMCP1) ) 

932 

86 . 982 

3118 

U2 23 76 

Homo sapiens alternatively spliced product 
using exon 13A 

322 

58 . 000 


ALill /zOl 

Caenorhabditis elegans predicted using 
Gene finder 

228 

26 . 063 


£o / yyu 

Caenorhabditis elegans similar to cuticle 
collagen 

217 

29 . 097 

3121 

AF151840 

Homo sapiens CGI -82 protein 

463 

42.000 

3122 

D1453 9 

Homo sapiens LTG19 

72 

29 . 885 

3123 

U05227 

Homo sapiens Rar protein 

1386 

66 .460 

3124 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

248 

56.818 

3125 

U53155 

Unknown Similar to cuticular collagen; coded 
for by C. elegans cDNA yk58e6.3; coded for by 
C. elega 

182 

30.544 


K03.<£ 08 

Homo sapiens salivary proline- rich protein 
precursor 

73 

33 . 333 

•3107 


Homo sapiens neuronal thread protein AD7C-NTP 

199 

53.333 

3128 

AB014578 

Homo sapiens KIAA0678 protein 

6383 

98 .242 

O 1 O Q 

Jlz9 

U4 y 9 74 

Homo sapiens mariner transposase 

763 

73 . 298 

3130 

AF 010144 

Homo sapiens neuronal thread protein AD7c-NTP 

230 

52 . 475 

3131 

U15181 

Mycobacterium leprae 4-coumarate-coA ligase 

787 

40.000 

3132 

X06956 

Homo sapiens alpha- tubulin 

2225 

92.521 

3133 

U223 76 

Homo sapiens alternatively spliced product 
using exon 13 A 

448 

65 . 179 

3134 

Z97336 

Arabidopsis thaliana hypothetical protein 

722 

37.755 

3135 

AF045646 

Caenorhabditis elegans contains similarity to 
collagens 

138 

29.646 

3136 

Tk /"» it ^> f r\ 

AC005360 

Homo sapiens FAA 

501 

41 . 962 

J13 / 

Z35597 

Unknown Weak similarity with sea squirt nidogen 
precursor protein (blastp score 71) ; cDNA EST 

Hil" Id Lj . 

760 

36 . 188 



Homo sapiens ORF1 codes for a 4 0 kDa product 

333 

50 .450 

J -L J J 


nomo sapiens neuronal tnreaa protein ajj/c-ntp 

1 yl r 

345 

54 . 032 

-J -L *x v/ 

t m "7 "7 

uducus caroLd proline- ricn protein 

2 73 

35 . 211 

0141 

UJUOl J 

Lbcnericnia coxi UKr xij:0-5zz+f / ; similar co 

/lo 

34.38 8 

3142 

rVr UD J D X J 

xnjiiiQj fc>cip-Lt;xits /l. 3 01 lgoauenyjiaus syncneuass j 

1 "2 Q C 
/ O O D 

y a . o y o 

3143 

J04974 

Homo sapiens alpha- 2 type XI collagen 

196 

29.435 



Caenorhabditis elegans unknown 

213 

23.353 

3145 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

332 

59.048 

3146 

AF159297 

Zea mays extensin-like protein 

368 

29.979 

3 14 7 

M74027 

Homo sapiens mucin 

262 

26 .357 

3148 

U22 3 76 

Homo sapiens alternatively spliced product 
using exon 13A 

399 

73 . 494 

3149 

Z68? 1 R 

TTnVnnijri ci-J m "i 1 ar t~ r~\ r^r^l 1 rar**»T"k • i^TYNTTA CCP 
uiijvahj wii o j. nix la JL \—KJ UUX J.dy till , OUiNi-V do! 

EMBL:D6 9371 comes from this gene; cDNA EST 
EMBL:D65671 comes 

loo 


3150 

M63798 

Blaberus discoidalis cytochrome P450 

653 

50 . 711 

3151 

Z48045 

Caenorhabditis elegans sre-2 

1047 

48.817 

3152 

AF104413 

Homo sapiens large tumor suppressor 1 

1687 

72.566 


.1 o o a "•i-ts . o ei •& o h 


3153 

AF121009 

Mycobacterium tuberculosis H37Rv hypothetical 
protein <Jv0534 

206 

33.333 

O 1 C A 

L3 5013 

Homo sapiens spliceosomal protein 

215 

32 . 877 

3155 

U2424 6 

Drosophila melanogaster 171-7 

302 

28 . 926 

jl JO 


Unknown cDNA EST EMBL:D26959 comes from this 
gene; cDNA EST EMBL:D26963 comes from this 
gene , cjjjna 

380 

27 . 711 

jIj / 


nomo sapiens duj /zys x 

64 7 

47 . 005 

1 1 ft 

Til OA A "7 

Trypanosoma cruzi mucin- like protein 

182 

37.795 

3159 

U97006 

Caenorhabditis elegans No definition line found 

308 

36 . 913 

oxo U 


Zea mays Hydroxyproline-rich Glycoprotein 

\ rlKCjf ) 

204 

22 . 464 

n c i 

JXOX 

Ufi OO Ji 

Cricetulus griseus HT protein 

1918 

76 . 453 

3162 

AF177758 

Homo sapiens ubiquitin specific protease 16 

1185 

75 . 781 

3163 

U38252 

Mus musculus fractionated X- irradiation- induced 
2 9 thymoma 

1015 

60. 702 

JXb4 

U76618 

Mus musculus N- RAP 

1163 

48 . 603 

JXbo 


Zea mays extensin-like protein 

252 

29 . 502 

3166 

U40029 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF01060 (Worm_f amily_2) , 
Score=203.8, E-value=8 . 6e-58 , N=l 

431 

33.918 


M36912 

Zea mays cell wall protein (put.); putative 

245 

30 . 072 

3168 

AF10 8 843 

Homo sapiens env protein 

680 

34 . 195 

316 9 

AF022985 

Unknown Similar to collagen; coded for by C. 
elegans cDNA yk55f3.3; coded for by C. elegans 

CL)v4J\ 

215 

32 . 719 

JX / U 

7A 11 0 ft Q Q vl 

Homo sapiens KIAA1031 protein 

2361 

48 . 748 

jX / X 

mt *7 0 m 
mx / ouz 

Plasmodium falciparum circumsporozoite protein 

213 

28 . 676 

3172 

AF067165 

Homo sapiens zinc finger protein 3 

780 

61.677 

3173 

U53154 

Caenorhabditis elegans No definition line found 

310 

21.905 

3174 

Y12713 

Mus musculus Gag polyprotein 

516 

43 . 210 

3175 

U95090 

Homo sapiens F19541 1 

1909 

64 . 974 

3176 

X6 7156 

Rattus norvegicus (S) - 2 -hydroxy -acid oxidase 

1033 

69 . 068 

3177 

S62936 

Homo sapiens PRB1S precursor protein=basic 
proline-rich proteins (Ps, PmF, PmS, and Pe) 
precursor {c- terminal} 

206 

31 . 140 

OX / O 

MQ71A "7 

Homo sapiens beta-l,6-N— 

at t= \,y xy lULUoainxny x l l. aimiciaSc 

34 3 

72 . 973 

3179 

U60315 

Molluscum contagiosum virus subtype 1 MC132L 

484 

41.429 

n ft n 

TTC Q C. CO 
U -J O030 

Homo sapiens unknown 

257 

62 . 162 

11 ft i 

TTQ 1 l?7/l 
\JZ7 ~> O / fx 

nomo sapiens put.at.xve piou 


4 0 . 092 

3182 

U93571 

Homo sapiens p4 0 

409 

42 . 941 

3183 

AC004235 

Homo sapiens Mytl 

359 

51. 163 


TT>I T O T 

U4 UX 0 / 

Caenorhabditis elegans C. elegans cuticle 
collagen col-8 

169 

31 . 472 

n ft ^ 

TT7 Q"7 C fl 

Homo sapiens unknown 

288 

58 . 6 96 

3186 

Z46787 

Unknown cDNA EST EMBL:D75506 comes from this 
gene, cdjna fcioi bMBLiu/^boo comes rrom tins 

o n o * i"»TYNIZV 

y tiii t; , oi_^iM/-v 

376 

46 . 497 

3187 

U23514 

Caenorhabditis elecraris No dpf init*i on line* "Found 

321 

32 . 000 

3188 

U79260 

Homo sapiens unknown 

288 

58.696 

3189 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

445 

56.115 

3190 

AC005175 

Homo sapiens R3144 9 3 

1854 

57. 739 

3191 

74J010949 

Mus musculus calcium channel alpha-2-delta-C 

448 

41.667 


7U 
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subuni t - 



3192 

U97553 

murine hemes vi ni 68 unknnwn 


33 . 992 

3193 

L06147 

Homo sapiens golgin-95 

A C Q 


3194 

AF010144 

Homo saDisns npuronA 1 f hi t*^^ H nrot" p-i n Z\r">'7 — wttd 

/i 1 O 

77 . 895 

3195 

U97553 

murine hert)GSvirus 68 unknown 

li7 / 

31 . 2 /o 

3196 

AF010144 

Homo saDiens neuronal thread orot" Pin ar)7r-MTD 

C Q O 

OO . Ill 

3197 

X54162 

Homo sapiens 64 Kd autoantigen 

958 

42 . 523 

3198 

S78234 

Homo Qat*»l PnQ H-NTTP-nurl par T"YKT1Y \\\ Y\r\ \ rtrr y~\-vr-\*- e*. A v> 
xiwuiw oo^xciio xi iNuu^iiu^xcai LJVin. xjxilixllltx protein 

976 

74 .2 97 

3199 

U79260 

Homo c 5air>'i pii<j unVnriwn 


62 .63 7 

3200 

AF152502 

Homo flaniens nrot*nrariheri n hpt"a Q 

1 i-d y 

73 . 061 

3201 

AF006065 

Pnwl nnv vi rtan 

zyb 

31 . 4 74 

3202 

AF056617 

Homo sapiens BWSCR2 associated zinc -finger 
nrotein RAZ1 

1075 

61.176 

3203 

U93566 

Homo o a t~\ H pnq nA Ci 


4 7 . 706 

3204 

M34551 

Homo sapiens 52 -kD Ro/SSA ribonucleoprotein 

376 

36 .321 

3205 

Ml 9 5 03 

Homo Qiavn on a ODT71 « mifahi vd 
nuiuu oap J. cxio UKr 1 , pULaClvc 

223 

31 . 679 

3206 

AC004 8 83 

Homo o n pn q qi mi 1 ar f- i~\ V"TZ\Zin"7£CC - ai'mi 1 av* t~ 
nuuiu oaLyxciio oxilixxax I\XA/iU / DO/ 13 lull 1 cLJ. UO 

PID *cr3 8 82 2 53 

/l "3 A 

4 34 

26 . 087 

3207 

L11672 

Homo cj^r^H pnq 7 1 no f i nrrpr nrnfoi n 

oo4 

43 . 077 

3208 

AJ22 5122 

1 1U « UIUOL.UX no iiypcLpuxaX xzaUXUIl caCLXVaUcU (_ ct I L Oil 

channel, HAC1 

loo 

34 . 783 

3209 

AF071172 

Homo saniens HRRC^ 

-1 lv/ hfCtj^/X^llO IliJl\Vb 

*t 0 y 0 

Q *7 TOT 

3210 

AB011532 

Rattus norvper i mic? MRGFfi 

once 

Z U 0 0 

"7 A 1 A r> 

/ 4 . 143 

3211 

AF010144 

Homo cjani onc» npiivnnal t" VlT-Oiirl r\T*r»t*Di n J\ri7r« - MTD 

"2 "7 "i 

72 . 82 6 

3212 

U09116 

Homo ^anipnci ORP1 pnrnHpq r» Afl L-Fla nrnHnph 

ii JO 

3 y . / Oo 

3213 

U85494 

mavq T .HMl nrnt*pa qp 

1 O O *1 

loo 1 

bo . 23 8 

3214 

AF027973 

Nephila clavipes flagellif orm silk protein 

430 

29.014 

3215 

M174 91 

Ml 1 cj mi i dot ll 11a r>Y"r»Prtl 1 anon ^ rr~^ X -a 1 *-\V^ a j>-»V» — > -\ v» 

nua uiubLuiub prucoiidyen type 1 aipna cnain 

0 0 n 

31 . 556 

3216 

AF132947 

Homo oani e^rt ct T _ 1 "3 ni^r^t- oi n 
nwiuvJ oaJJlcllS V-VjXU piULcxu 

*T *7 C 

0 /o 

86 . 131 

3217 

U93564 

Homo 0 pi -i ^nc; r^A. Ci 

O O 1 

ZZ 7 

2 8 . 571 

3218 

X6 9115 

Homo ear\i ono 7MT**"^'7ZV 

2 o4 

33 .333 

3219 

rtf v "x .J O r± O 

v-c»fc-ii^w>x xicixjui uis eiegans conuains simiiaricy to 
col lagens 

182 

34 . 363 

3220 

X63005 

Mus musculus proline-rich protein 

240 

36 .123 

3221 

AF132181 


y b 1 

33 . 564 

3222 

AF08178 9 

I I" O IIIU.OL>UXUO ^>CXX oUI laLC dill. Xy ell t 

IRQ 

2o . Ub3 

3223 

AF020261 

Santalum album proline rich protein 

210 

29.811 

3224 

U09413 

Homo sapiens zinc finger protein ZNF135 

1531 

53 .753 

3225 

U a £i .-> / D 

rxt>rno sapiens alternatively spiiceu proauct 

itq-i nrr q yhti T "I Jx 

Uoxliy CAUX 1 X O /A 

305 

65 . 517 

3226 

AL023776 

SoVl T ^OCT^l <~"oVl ^ T^omv/'O*^ c nr\mVMa Vi\/T^<~*t" Viol- i r*r± 1 -»-\ /^i -I ■»-"> 
^^iixi^ociv-v-iiaxwiliy uco LJvJULUtr IiypULXlcLlCdX piOCein 

"3 £T "5 

O C ICQ 

26 . 3 by 

3227 

AL009196 

Unknown / nrpH -i /-> f i nn — ^mpt" VtnH • " || rronp , F , 5 nHrav'i n 

version • " ** 084 " " qrnrp - n » c q 41 mii\ , 
/predict ion= (metho 

Q C A 

37 . 209 

3228 

U09413 

Homo sapiens zinc finger protein ZNF135 

1801 

58 . 093 

3229 

Y12713 

Mus miisciiliis Pro- Pol -dUTPa^p nnlvnrnfpin 

27 O 

jo . O 3 O 

3230 

M73980 

Homo s an i en s TAN 1 

1 A Q Q "3 
1 U _7 _? J> 

QQ Q Q C 

y 0 . yob 

3231 

AF181640 

Drosoohila melanoaaqtpr RrDMA RHflQfti 7 

^ 1 

ft b . b / b 

3232 

AF043642 

Rattus norveaicu^ maf rin rvrlnnlri Tin 

i\«ui.Uk7 iiux • x^uo uiaui xii L>y ^xupiixxxii 

JZO 

31 . 10 4 

3233 

AF121009 

Mycobacterium tuberculosis H37Rv hypothetical 
protein Jv0534 

212 

33 . 163 

3234 

M73491 

Mus musculus N- ace tylglucosaminyl transferase I 

499 

37 . 193 

3235 

X55777 

Homo sapiens put. ORF 

261 

60.526 

3236 

X15332 

Homo sapiens alpha -1 (III) collagen 

118 

32.824 I 

3237 

X05830 

Mus musculus ORF2 product 

778 

73.034 j 
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3238 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

416 

71.134 

3239 

AF067946 

Caenorhabditis elegans similar to Drosophila 
ring canal protein (kelch) (SW:Q04652) 

376 

25 . 683 

3240 

AF01503 / 

Oryctolagus cuniculus endooligopeptidase A 
reiaceci protein; r.^x*A reiatec* protein 

3012 

90 . 154 


7AI71 1 C\A A 1 

AidDiaopsis cnaxiaiia uvo-icsiSLaiice protein 
TT\rpft 

UVKO 

o c c 
255 

-j ri too 
3 0 . 723 

j ^ *± & 

TT1 fl9fl1 

oud DLiuia y do il. iiiui^xii 

i y 

9 4 4nc 
24 . 4 Ob 



nuiILU oapicim UIipXc Xj XIM UCJIUciXIl pxuLcxn 


4 / . o 63 

7 9 4 4. 

TT9 m 

i\ct 1* i, uo iiui vcy iL.ua oyiictiJi_v_/i_.ciyiii_Lii vxx 

1U jo 

oo.o / y 

3245 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

359 

66.337 

1 0 /1 a 


tiomo sapiens kjn/\ nencase 

C Q A 

bo4 

it /ran 
31.690 

3247 

X92517 

Saccharomyces cerevisiae N1751 

291 

29.614 

3248 

Z11773 

Homo sapiens SRE-ZBP 

2709 

97.810 

324 9 

AB011164 

Homo sapiens KIAA05 92 protein 

8487 

97.938 

3250 

Mb 5014 

Ovine pulmonary adenocarcinoma virus pol 
protein 

377 

41 . 358 

3251 

Z69635 

Unknown Similarity to Yeast uridine kinase 
(SW:URK1_YEAST) ; cDNA EST EMBL:Z14695 comes 
from this ge 

536 

40 . 728 

3252 


Homo sapiens KIAA1064 protein 

821 

34 . 926 

3253 

U2 2 3 76 

Homo sapiens alternatively spliced product 
using exon 

398 

78 . 824 

3254 

M14123 

Homo sapiens pol/env 0RF (bases 3878-8257) 
first start codon at 4172; Xxx ; putative 

781 

33 . 830 

1 0 c c 

Au 0102 6 2 

Mus musculus MT5-MMP protein 

338 

2 9.845 

3256 

U09366 

Homo sapiens zinc finger protein ZNF13 3 

1237 

54.545 

3257 

AJ38 8 5 57 

Canis familiaris zinc finger protein 

703 

46 . 154 

3258 

U4 6068 

Mus musculus von Ebner minor salivary gland 
protein 

573 

55 . 172 

3 2 b y 

U13766 

Murine leukemia virus gag-pol polyprotein 

794 

2 9 . 54 2 


At* 06 y J 0 / 

Homo sapiens sodium-dependent multivitamin 
transporter 

104 0 

52 . 76 5 

9 9 ^ 1 

AJ? U 4 JL U 0 2 

Rattus norvegicus transmembrane receptor Robol 

312 

26 . bio 

9 9 £ 9 

AO / 

riomo sapiens numan giant larvae nomoiogue 

11 no 

310o 

a c c r\ o 

y b . 6 oy [ 

3263 

AF159297 

Zea mays extensin-like protein 

291 

31.768 

3264 

D32050 

Homo sapiens alanyl-tRNA synthetase 

1303 

37 . 074 

3265 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

333 

70.886 

3266 

D10923 

Homo sapiens HM74 

601 

39 . 597 

3267 

AL0238 93 

Drosophila melanogaster 
/prediction^ (method : " "genef inder " " , 
version: uofk , score: in. 64 / ; 
/ picuicuioii- \meLnou . genscan , 
version: ""1.0"") 

489 

32 . 500 

J j£ O O 

ni rt9 ft fi 

U X VJ O 

ui yuLUiayuD k>^y . ui_ytJo-Lii ntjovy tiictui 

£4 9 


J 4b D 

T.I Q9fi 7 


■3 j Ol 

qc i n4 

17 O . 1U4 

3270 

jfn-D v X T U v ^ 


9 9 ft 9 

Q9 ^ 1 Q 

3271 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

463 

43.443 

32 72 

AFT ft 9 946 

XT JLO & J/^t v 

i\an,uo uui v x L-Uo I3i\.v^r\x aosuuia Leu rtxi^vj uuilla ill 

protein 1 

J U J 

J X . D ^ X 

3273 

U09413 

Homo sapiens zinc finger protein ZNF135 

558 

41.277 

3274 

U60416 

Rattus norvegicus myr 6 myosin heavy chain 

2703 

93.187 

3275 

AF100318 

Homo sapiens mitogen- activated protein kinase 
kinase kinase 6 

183 

40 .268 


J. 01 Ol fell «, 4-3; ,„ O ia 2 ci. O H 


32 76 

7*1 37 of 

Unknown Similarity to Candida CDC4 gene 

liK:HiiJ4UbbJ ; CD1NA. hbi J2.WBL1 : D2 7 6 9 9 COmeS £rom 

this gene; cD 

201 

22 . 659 



nOuio sapxens noojcz protein 

773 

48 . 630 

3278 

AB011096 

Homo sapiens KIAA0524 protein 

347 

61.607 

3 z / y 


Mus musculus ultra-high sulphur keratin 

255 

48 . 193 


AT T T "7 C "J O 
ALII / DJ<6 

Homo sapiens hypothetical protein 

5467 

97.852 

3281 

AB017616 

Mus musculus homologous to the yeast YGR163 
gene 

2208 

96 . 089 

*> 0 0 0 

0 c 0 1 0 0 

Homo sapiens X- linked retinopathy protein {C- 
terinxnai, Clone AJkri.oC/ 

243 

69 . 231 

0 J 


nomo sapiens envelope protein 

165 

43 . 182 

3284 

AL096811 

Streptomyces coelicolor A3 (2) putative acyl-CoA 
dehydrogenase 

916 

36 . 057 

~i 0 0 c 

AJ UlUU4b 

Homo sapiens guanine nucleotide -exchange factor 

2311 

99 . 711 


TT"7 C\ Q O O 

u / uyjz 

Peromyscus leucopus reverse transcriptase 

231 

41 . 071 

*a 0 0 *7 

TTQ mi C 

Homo sapiens CAGF2 8 

4578 

97 .312 

3288 

AL009147 

Unknown /prediction^ (method : " "genscan" " , 
version: 1.0 , score: 184.75 ) ; 
/prediction= (method: 

1309 

49.883 

32 8 9 

A T C\ t\ {\ f\ t\ O 

Homo sapiens PI3 -kinase 

9514 

99 . 517 

32 90 

AC004 983 

Homo sapiens similar to PID:g3877944 

2934 

94 . 456 

3291 

U15174 

Homo sapiens BCL2 /adenovirus E1B 19kD- 
interacting protein 3 

287 

65 . 169 

3292 

AL117204 

Caenorhabditis elegans predicted using 
Gene finder 

301 

35 . 047 

3293 

M13100 

Rattus norvegicus unknown protein 

216 

53 . 750 

3294 

Z46241 

Unknown carboxyl terminus of the predicted 
protein shows similarity to chimaerin; cDNA EST 
EMBL:Z14 

582 

33 . 038 

32 95 

Do 934 0 

Rattus norvegicus dipeptidyl peptidase III 

3642 

87 . 106 

32 96 

U2 2 3 76 

Homo sapiens alternatively spliced product 
using exon 13A 

355 

63 . 636 

0 z y 1 

AV- UUIbb j 

Arabidopsis thaliana unknown protein 

270 

24 . 756 

1 0 Q Q 


Ambystoma tigrinum electrogenic Na+ bicarbonate 

UULX dllbpUX Lei , MDU 

1829 

56 . 031 

a j j 

& J?1 R QO Q7 

rt.F J. J 3 / 3 / 

^fcjci niciys eAtensin-iiKe procem 

1 Q O 

3 y 3 

30.4 81 

3300 

U70932 

Peromyscus leucopus reverse transcriptase 

231 

54 . 878 

J JUl 


mo mo sapiens neuronal unreao. protein f\D xir 

625 

58 . 768 

J jUZ 

71 i 77") 
Ct X. X. f to 

T-T/'-km /~"\ nnrM >— » O ^5 "C f 1 T > 

nUmO bapiens oKt — 6or 

2709 

97 .810 

1 7 C\ ~x 

-f-i.tr \J f _L f4 j7 1 

Homo sapiens potassium channel 

5524 

98 . 171 

3304 

X92485 

Plasmodium vivax pval 

259 

48. 980 

3305 

AF055084 

Homo sapiens very large G-protein coupled 
receptor- 1 

11286 

99.319 

3306 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

287 

70.130 

3 3 07 

AF11116 9 

Homo sapiens KIAA0759 

1021 

44 . 965 

33 08 

D12485 

Homo sapiens The first in- frame ATG codon is 
located at nucleotides 17-19, followed by a 
second ATG codon 52 codons downstream. And the 

SPmnd ATT5 rnHnn "5 a r^r^f" pn hi ol in - ! f*-i ah i r\n r%r% int* 
oc^uiiu f\x\j l-uuuj i j-c> puucutxax jlxijl t la t xuxi puxxit 

for translation of NPPase . 

5619 

93 . 545 

3309 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

627 

56.682 

3310 

Z79694 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen 

164 

31.414 

3311 

X92485 

Plasmodium vivax pval 

335 

64 .516 
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3312 

AC004005 

Arabidopsis thaliana unknown protein 

246 

30.286 

3313 

D90886 

Escherichia coli CDP-DIACYLGLYCEROL- - SERINE O- 
PHOSPHATIDYLTRANSFERASE (EC 2.7.8.8) 
(PHOSPHATIDYLSERINE SYNTHASE) . 

880 

86.335 

3314 

U79260 

Homo sapiens unknown 

288 

58.696 

3315 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

481 

84.706 

3316 

U22376 

Homo sapiens alternatively spliced product 
using exon . 13 A 

417 

73 .913 

3317 

AL023702 

Streptomyces coelicolor hypothetical protein 
SC1C3 . 11 

420 

39.524 

*"> 1 1 O 

J J lo 

X91114 

Gorilla gorilla interleukin- 8 receptor type B 

1216 

86.036 

*"l "5 *1 O 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

445 

82 .353 

3320 

AB023164 

Homo sapiens KIAA0947 protein 

9067 

100. 000 

3321 

X89985 

Homo sapiens BCL7B 

1109 

96 .629 

3322 

L08475 

Xenopus laevis ubiquitin-like fusion protein 

582 

38 . 936 

3323 

Y07829 

Homo sapiens RING finger protein 

2983 

97 .516 

3324 

Z66519 

Caenorhabditis elegans similar to Thiamine 
pyrophosphate enzymes; cDNA EST EMBL:D3 6315 
comes from this gene; cDNA EST EMBL:D33464 
comes from this gene 

1141 

44.550 

3325 

AC00558 1 

Homo sapiens R31237 1, partial CDS 

439 

96 . 825 

3326 

U80082 

Homo sapiens putative protein B2 

5058 

100. 000 

3327 

AB011113 

Homo sapiens KIAA0541 protein 

7644 

99.828 

3328 

AF061443 

Rattus norvegicus G protein- coupled receptor 
LGR4 

1574 

91.603 

~i ~i O Q 

AC 0 0 716 8 

Arabidopsis thaliana hypothetical protein 

157 

44 .643 

inn 

7\ *o /*\ /"\ *~> ~i c c 
AB002 36 5 

Homo sapiens kIAA0367 

4745 

99.860 

3331 

AL080173 

Homo sapiens hypothetical protein 

2968 

100.000 

3332 

AF06 7864 

Homo sapiens transferrin receptor 2 alpha 

5197 

99.501 

"I *> O 1 

X64 58 7 

Mus musculus orf 

3202 

99.184 

3334 

U22232 

Felis catus ribosomal protein S4 

1702 

99.237 

3335 

AB000910 

Sus scrofa ribosomal protein 

168 

72.917 

3336 

U88154 

Homo sapiens proline and glutamic acid rich 
nuclear protein isoform 

6446 

81.135 

-5 -> "3 "~T 

33 3 7 

AF094480 

Homo sapiens cholesterol 24 -hydroxylase 

3332 

100.000 

3 3 3 8 

7v n r\ r\ s~ s~ ^ 

AB006622 

Homo sapiens No similarities to any reported 
proteins 

8935 

100. 000 

3339 

AB018273 

Homo sapiens KIAA0730 protein 

6572 

99.801 

*5 ""1 vl f"1 

V f\ 1 A Q A 

AUJ*i o fl 

Homo sapiens raf protein (aa 1-648) 

2262 

88 . 993 


TTD *7 *TJ f\ C 

Uo /3 Ub 

Rattus norvegicus transmembrane receptor UNC5H1 

2655 

93 . 119 


Ay / *k y u 

Mus musculus PNG protein 

445 

70.833 

"1*1/1 "> 

-j J -5 

rl / 4 bO 9 

Homo sapiens , gene product 

541 

87.255 

1 A /l 
.3 O ft ft 

Ajod j4 

Homo sapiens homologue of Drosphila retinal 
degeneration B gene 

7913 

98.030 

J J4 J 


Homo sapiens hypothetical protein 

5256 

100. 000 

J J*iO 


Drosophila melanogaster unknown 

601 

54 . 187 

3347 

M22332 

Homo sapiens unknown protein 

166 

26 . 050 

3348 

AB020695 

nuniu oapxcilo I\lnnUOOO pxULcin 

42 58 

98 . 108 

3349 

AB012162 

Homo sapiens APCL protein 

14441 

100. 000 

3350 

AF011359 

Bos taurus regulator of G-protein signaling 7 

2957 

98.874 

3351 

AL050321 

Homo sapiens dJ717M23.1 (novel gene) 

4999 

99.868 

3352 

U60269 

Homo sapiens putative envelope protein; orf 
similar to env of Type A and Type B 

329 

86 .441 
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retroviruses and to class II HERVs 



3353 

AF182946 

Rattus norvegicus BRCA1 -associated RING domain 
protein 1 

449 

41 . 711 

3354 

AB002299 

Homo sapiens KIAA0301 

13484 

99 . 951 

3355 

L08483 

Drosophila melanogaster ring canal protein 

466 

31 . 104 

3356 

AL080198 

Homo sapiens hypothetical protein 

4659 

99 . 713 

33 57 

X73874 

Homo sapiens phosphorylase kinase 

4572 

98 . 770 

33 58 

U21163 

Ictalurus punctatus No definition line found 

449 

71.698 

3359 

AF083249 

Homo sapiens Rb binding protein homolog 

2736 

98 . 265 

3360 

AB007931 

Homo sapiens KIAA0462 protein 

14285 

98 . 466 

3361 

A01592 

Homo sapiens haemoglobin A beta chain 

329 

81 . 159 

3362 

U4 1387 

Homo sapiens Gu protein 

5179 

99 . 750 

3363 

X71997 

Rattus norvegicus myosin I 

4 970 

98.037 

3364 

AF071076 

Homo sapiens Nup98-Nup96 precursor 

11033 

98.775 

3365 

AB006630 

Homo sapiens KIAA02 92 

11483 

99 . 883 

3366 

AF038554 

Homo sapiens density regulated protein drpl 

1548 

98 . 745 

3367 

U29659 

Human endogenous retrovirus pol gene product 

206 

42 . 708 

3368 

AB000884 

Sus scrofa glutathione S- transferase 

390 

78 . 571 

3369 

AF151800 

Homo sapiens CGI -41 protein 

587 

99 . 010 

3370 

AF151889 

Homo sapiens CGI -131 protein 

430 

81 . 395 

3371 

AJ131581 

Homo sapiens latrophilin-2 

6260 

94 . 515 

3372 

AF077030 

Homo sapiens hypothetical 43.2 kDa protein 

496 

73 . 684 

3373 

D83776 

Homo sapiens The KIAA0191 gene is expressed 
ubiquitously. ,- The KIAA0191 protein retains the 
C2H2 zinc- finger at its N- terminal region. 

10130 

99 . 081 

3374 

AF145681 

Drosophila melanogaster BcDNA. LD23181 

1099 

36 .813 

3375 

AL050022 

Homo sapiens hypothetical protein 

4172 

99.675 

3376 

AF054284 

Homo sapiens spliceosomal protein SAP 155 

8400 

98 .471 

3377 

U72194 

Mus musculus muskelin 

1229 

94 .359 

3378 

Z70038 

Homo sapiens Similarity to Human hnRNP F 
protein (PIR Acc . No. S43484) ; cDNA EST 
EMBL:D34218 comes from 

436 

38.298 

3379 

L14684 

Rattus norvegicus elongation factor G 

2295 

86 . 885 

3380 

X96586 

Homo sapiens FAN protein 

4394 

99 .695 

3381 

AF076612 

Homo sapiens chordin 

5666 

99. 750 

3382 

AL080159 

Homo sapiens hypothetical protein 

2446 

99 . 169 

3383 

AF139639 

Oryctolagus cuniculus serum and glucocorticoid- 
regulated protein kinase 

1791 

67 . 990 

3384 

AC003058 

Arabidopsis thaliana unknown protein 

443 

43 . 671 

3385 

AB019409 

Homo sapiens unique gene expressed in 
fibroblasts of periodontal ligament 

942 

88.780 

33 86 

AF1253 92 

Homo sapiens insulin induced protein 2 

1354 

99 . 048 

3387 

AL050261 

Homo sapiens hypothetical protein 

296 

58 . 333 

3388 

U44803 

Rattus norvegicus ovarian- specif ic protein 

1501 

78 .472 

3389 

X66285 

Mus musculus HC1 ORF 

207 

43 .243 

3390 

AC004542 

Homo sapiens OXYSTEROL- BINDING PROTEIN- like ; 
similar to P22059 (PID : gl29308 ) 

2447 

99.461 

3391 

AF067164 

Homo sapiens zinc finger protein 2 

1051 

67.647 

3392 

Z78018 

Caenorhabditis elegans predicted using 
Genefinder; similar to serine/threonine kinase; 
cDNA EST yk353dl0.5 comes from this gene 

1442 

35.812 

3393 

AB011084 

Homo sapiens KIAA0512 protein 

963 

43 . 026 

3394 

AB018288 

Homo sapiens KIAA0745 protein 

1002 

63 .248 

3395 

U12535 

Homo sapiens epidermal growth factor receptor 
kinase substrate 

1336 

42.159 
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3396 

AF009674 

Homo sapiens axin 

6111 

99 .667 

3397 

AF049528 

Homo sapiens huntingtin- interacting protein 
HYPA/FBP11 

2902 

99 . 115 

3398 

D86978 

Homo sapiens similar to a C.elegans protein 
encoded in cosmid K12D12 (Z49069) 

13202 

99. 950 

3399 

Z15005 

Homo sapiens CENP-E 

3619 

93 .344 

3400 

AB011166 

Homo sapiens KIAA05 94 protein 

5703 

100 . 000 

3401 

AC007229 

Homo sapiens dynamin II {AA 4 74- 866] 

474 

88 . 889 

3402 

D63487 

Homo sapiens The KIAA0153 gene product is 
related to a putative C.elegans gene encoded in 
cosmid F42A8. 

4319 

99 . 844 

3403 

AL032626 

Caenorhabdxtxs elegans predxcted using 
Genef inder 

322 

47 . 826 

3404 

X63563 

Homo sapiens RNA polymerase II 14 0 kDa subunit 

4568 

96 . 728 

3405 

D88315 

Mus musculus tetracycline transporter-like 
protein 

1371 

70.279 

3406 

AB026257 

Homo sapiens organic anion transporter OATP-C 

1047 

35.031 

3407 

AL080220 

Homo sapiens hypothetical protein 

2359 

100.000 

3408 

AJ005898 

Homo sapiens shal-type potassium channel 

4209 

99 . 691 

3409 

U03399 

Homo sapiens T- complex protein 10A 

801 

74 . 011 

3410 

AF111170 

Homo sapiens unknown 

1357 

100 .000 

3411 

AB029032 

Homo sapiens KIAA1109 protein 

12865 

99 . 898 

3412 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

326 

72 . 368 

3413 

X78926 

Homo sapiens zinc finger protein 

2804 

98 . 500 

3414 

AF064254 

Homo sapiens very long- chain acyl-CoA 
synthetase homolog 1; VLCS-H1 

3003 

96 .473 

3415 

AF090834 

Homo sapiens malonyl coenzyme A decarboxylase 

3243 

100 . 000 

3416 

D38521 

Homo sapiens The ha09l9 gene product is novel. 

11907 

99 . 333 

3417 

AF023674 

Homo sapiens nephrocystin 

4666 

98.909 

3418 

D86957 

Homo sapiens similar to Drosophila melanogaster 
sept in (sep2) . 

2094 

69.456 

3419 

L41686 

Rattus norvegicus ORF 

219 

30 . 435 

3420 

AF062378 

Mus musculus calmodul in-binding protein SHA1 

1641 

68 . 085 

3421 

Z36948 

Unknown contains a valine and arginine rich 
domain, possesses weak similarity with the RNA 
binding 

243 

46 . 212 

3422 

AB014524 

Homo sapiens KIAA0624 protein 

13 lb4 

y y . 9d u 

3423 

X85991 

Mus musculus semaphorin B 

1768 

78 . 916 

3424 

AC007018 

Arabidopsis thaliana unknown protein 

1778 

42.653 

3425 

AB029040 

Homo sapiens KIAA1117 protein 

8856 

99 . 92 7 

3426 

AB028956 

Homo sapiens KIAA1033 protein 

7779 

100.000 

3427 

AB014576 

Homo sapiens KIAA06 76 protein 

8297 

99 .367 

3428 

AF094508 

Homo sapiens dentin phosphoryn 

193 

20 . 090 

3429 

AF094758 

Homo sapiens nasopharyngeal epithelium specific 
protein l 

1779 

98 . 940 

3430 

AC006135 

Arabidopsis thaliana putative vicilin storage 
protein (globulin-like) 

358 

31 . 250 

3431 

AL110210 

Homo sapiens hypothetical protein 

6927 

n n o o o 



Mus musculus zinc finger protein 

c o *7 

. U J a 

3433 

AB014534 

Homo sapiens KIAA06 34 protein 

8726 

99.244 

3434 

AF145632 

Drosophila melanogaster BcDNA. GH06032 

2127 

51.073 

3435 

AF005632 

Homo sapiens phosphodiesterase 1/nucleotide 
pyrophosphatase beta 

6023 

99. 543 

3436 

AB016930 

Cricetulus griseus Phosphatidylglycerophosphate 

3235 

89.292 
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synthase 



3437 

AL031765 

UnKnown /prediction= (method: " "generinaer" " , 
version : Ub4 , score :31.9b""/ ; 
/prediction^ (metho 

463 

35 . 021 

3438 

AB023151 

Homo sapiens KIAA0934 protein 

6529 

79. 710 

3439 

AF155112 

Homo sapiens NY- REN- 50 antigen 

1773 

100. 000 

3440 

AF124726 

Homo sapiens acinusL 

8287 

99 . 070 

3441 

AF132180 

Drosophila melanogaster unknown 

796 

36 . 267 

3442 

Z66524 

Caenorhabditis elegans Homology with Squid 
retinal -binding protein (PIR Acc. No. A53057) ; 
cDNA EST yk463dl0.3 comes from this gene; cDNA 

HO X yJVDD JIIIa • -3 VvUlUco X. X Olll LUIS vjcllc 

1534 

44 . 925 

1 A A 1 

A<~. UU / U X / 

Arabxdopsxs thaliana putative RNA helicase A 

Z? J D 


3444 

M26576 

Homo sapiens alpha- 1 type IV collagen 

12444 

99. 880 

3445 

AC004381 

Homo sapiens Unknown gene product 

2096 

76.321 

3446 

AF126867 

Mus musculus calpain-like protease 

1171 

75.229 

3447 

X98411 

Homo sapiens myosin- IE 

1052 

89 . 888 

3448 

AB018339 

Homo sapiens KIAA0796 protein 

7069 

99 . 722 

3449 

U49829 

Caenorhabditis elegans short region of weak 
similarity to human folicular variant 
translocation protein 1 precursor (FVT-1) 
(GB :X6 36 57) 

152 

38 . 053 

3450 

AB007925 

Homo sapiens KIAA04 56 protein 

7124 

99 . 635 

3451 

AL110295 

Schizosaccharomyces pombe hypothetical protein 

173 

44 . 156 

3452 

Z8156 9 

Caenorhabditis elegans K11B4.2 

188 

33 . 663 

3453 

D86 957 

Homo sapiens similar to Drosophila melanogaster 
septin (sep2) . 

3283 

100 . 000 

3454 

AF131739 

Homo sapiens Unknown 

354 

48 . 077 

3455 

U62961 

Homo sapiens succinyl CoA: 3 -oxoacid CoA 
transferase precursor 

1658 

75.802 

34 56 

AF02 076 0 

Homo sapiens serine protease 

A A A /I 

2 004 

92 .6 55 

34 57 

AF152 961 

Homo sapiens chromatin- specif ic transcription 
elongation factor FACT 14 0 kDa subunit 

5752 

99 .66 5 

34 58 

T TO O O *"7 C 

U89876 

Mus musculus AXiY 

14 19 

93 . 562 

"3 A C Q. 

34 59 

7\ DAft ftft (T 1 

Santalum album proline rich protein 

2o4 

41 . UoU 

3460 

AC002544 

Homo sapiens Unknown gene product splice form-1 

1337 

82.186 

3461 

7v Ti f~\ *~\ r\ f\ o *i 

ABO 2 9022 

Homo sapiens KIAA1099 protein 

16 00 

58 . 085 

3462 

U766 3 8 

Homo sapiens BRCA1 -associated RING domain 
protein 

"> O A 

2 84 

29 . 231 

34 b 3 

Ax* UU /J.DZ 

Homo sapiens unknown 

O Q/l Q 

1UU . uuu 

"J /4 £Z A 

JJo UUU1 

Homo sapiens similar to hypothetical protein 

U*± *± / O \J X. O • UCIcVXoXdt: . 


9o . DDO 

J D O 

ARftl Q TL A 7 


7DQ1 
/ O O X 

QQ 

34 6 6 

nJJv U J ii D D 

T-Tornrt cj saT"> "i pti Q ViTrrvfit" Vi at" l pal nyot - pin 

11 W [ I IU OCipXCllO J J~ L- L. JL d JL pi.ULCJ.il 

889 

56 . 623 

7 <4 7 
J *x O / 



902 

51 . 2 03 

34 6 8 

ruJ u X t -J / 

UnmA ciflT^i atiq K - TZ\ZVftA^7 nrnhpi n 

5242 

100 000 

J> *d O _7 

AT?1 7fi Ofi Q 

Hnmn q ar\i ones nVii rni i 1 i" n 
nun ilj adpi.t.iio uux^uxxxii 

3 836 

99 .832 

la 7 n 

A T? i O Q QO -> 

rlub IIIU0L.UXU0 IcLcpLUI aLLX VctLcU La J.L.XU1I1 v~XlcLllllC.X. 

576 

76 . 119 

■5471 
J 1 ! / 1 

M.O 1 jIa i i 

nuiliL' oapxcllo OcL^^rt LJX ULcUl 

7ft Qn 

QQ Qft 7 

34 72 

AF121858 

Homo saDiens sortincr npxin R 

2551 

94 . 444 

3473 

AF042838 

Homo sapiens MEK kinase 1 

9864 

99.666 

3474 

U62325 

Homo sapiens FE65-like protein 

4917 

99.454 

3475 

AC007017 

Arabidopsis thaliana putative RNA helicase A 

1507 

44.128 

3476 

AL050367 

Homo sapiens hypothetical protein 

5840 

99.327 

3477 

Z82090 

Unknown similar to Alpha- 2 -macroglobulin family 

539 

31.844 


7tf 
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(3 domains) ; CDNA EST KMRT. - Df^RO? rnmpq frnm 

this g 



3478 

AL050156 

Homo sapiens hypothetical protein 

2269 

100.000 

3479 

U05343 

MllS mus cu 111 7"i nr f "i rinpr nrnh o -! n r> "7 TT 

1083 

95 . 906 

3480 

U57368 

Mus musculus EGF repeat transmembrane protein 

4656 

94.102 

3481 

L20900 

Rattus norvegicus autoantigen p69 

536 

38. 138 

3482 

AP07 9S? 9 

nomo sdpiens c/^iF-speciric pnospnooxesterase 
phosphodiesterase 

4297 

99 . 848 

3483 

AF127142 

Homo c; v-\ -i (3»ti q KTq 1 1 7V _ a 1 r^V» _ O 1 _ ^ 1 Vv*=» f- -> t -j 
x l^i 1 iu Dapiciio JLNtSLLZ-H^ — d-LpxlcA ~* / O "bai. ~ DSCa — X f J — 

x-»*-v\»» axyua ^ , 0 OlaiyxLI cil 10 J. c -L doc c* xpilci *o ,0 

sialyl transferase 

809 

47 . 510 

3484 

D44480 

Mus musculus MATH- 2 protein 


Q Q COO 

3485 

AF173937 

Homo saoiens secretpd nrnt" H n of 7 unlfnnwn 
function 


98 . 160 

3486 

AF059516 

Homo sapiens tolloid-like 2 protein 


1UU . u u u 

3487 

AB018329 

Homo sapiens KIAA0786 protein 

6700 

99.804 

3488 

AF055666 

MUS musculus kine*sin 1 lCrVit* rhain O 


69 . 417 

3489 

AF073481 

Homo saoiens oolvcvshin-I. 

CI Q Q 

99 . 009 

3490 

AF041483 

Homo S3.t)ieris historic mo r*r*oT-T9 ZX 1 

-L o U 

86 . 842 

3491 

AL117626 

Homo saoiens hVDOthetical nrnf pin 


A "7 OCT 

4 / . Ob / 

3492 

D87742 

Homo sapiens Similar to Human C219-reactive 
oeotide (L34688) 

7834 

99.749 

3493 

X59131 

Homo cr p» r-) -j Q Vivnohhohi r«a1 rtrnt-o-i n 

7217 

99 . 634 

3494 

U11036 

Homo sapiens Xbdl 

ICC 

97 . 2 97 

3495 

AB025258 

Mus mil S f*i 1 1 net crT"^ni ir^Vi ■» 1 i n — 

680 

38 . 344 

3496 

Y08134 

Homo flani Pir^ afi H cjt™»Vii nrrnmvo 1 inaao-1 i 
iiv^iuu oa^xcus auiu opillIiyuiuycllIlaSc XlKc 

phosphodiesterase 

3187 

99 . 14 0 

3497 

AB017563 

Homo sapiens IGSF4 


CC *7 C *7 

3498 

AB028983 

Homo sapiens KIAA1060 protein 

5731 

99 . 887 

3499 

AF074091 

Homo sapiens islet-brain 1 

4770 

98.312 

3500 

AF015287 

Homo ^\ r^k S on cprino nT*Ahoaco 

1109 

55 . 593 

3501 

AB007903 

Homo sapiens KIAA044 3 

664 

28.660 

3502 

AF056116 

X. V/ V-/ JL JL VJ t 

r uy u. x lxux. x^tro UiiA.nuwn 

1400 

57.398 

3503 

AC004997 

Homo oani pnc mahpVi POTo "7/t O qto 

(NID:g573097) , R19699 (NID : g774 333 ) , T59198 
(NID:g661035) , and AA027979 (NID : gl494038) 

1188 

98 . 953 

3504 

U32743 

nacmupjixxua xiix.xut;iiz,dt: KU LUCOSc OpsrOu PiOCcin 

( f ucU) 

286 

44.203 

3505 

Y08715 

MUS mus CU "L 11 VAQHiI raHhorin-O 

O Q O 

26.43 9 

3506 

Z73428 

Unknown oredictpd usiricf fii=»rifa-F-i nH^r- • r*T"YKfzi pot 
EMBL:T01774 comes from this gene; cDNA EST 
yk470a9.3 c 

c a c 

JO J 

44 . 053 

3507 

AL080170 

Homo sapiens hypothetical protein 

2968 

99 77*? 

3508 

AF151807 

Homo sapiens CGI-49 protein 

2752 

99.532 

3509 

D88747 

Arabidopsis thaliana AR4 01 

4 50 


3510 

U47024 

Mus musculus MEM3 

■J J _7 D 

Q7 i m 

3511 

Y09022 

Homo sapiens Not56-like protein 

i T-in 

1UU . uuu 

3512 

AF116826 

Homo sapiens outative orotein-tvrosinp kinac-p 


qq ioi 

3513 

U93181 

Homo sapiens nuclear dual-specificity 
phosphatase 

11068 

99. 000 

3514 

AB011180 j 

Homo sapiens KIAA0608 protein 

5244 

99.871 

3515 

X73882 

Homo sapiens microtubule associated protein 

708 

39.868 

3516 

AC004475 

Homo sapiens F23 8 58 1 

3752 

97.213 

3517 

D87071 

Homo sapiens similar to C.elegans protein 

693 

56.054 
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encoded in cosmid T20D3 (Z68220) . 



3518 

U93868 

Homo sapiens RNA polymerase III subunit 

447 

46.196 

3519 

X84693 

Mus musculus testis nuclear RNA binding protein 

1245 

88.444 

3 52 0 

AF134726 

Homo sapiens NG3 7 

3676 

98.435 

3521 

AF070594 

Homo sapiens HNK-1 sulfotransf erase 

403 

28.344 

3522 

AF121080 

Mus musculus cAMP inducible 1 protein 

2332 

81.406 

3523 

AF141315 

Homo sapiens alpha- 1, 4 -N- 
acetylglucosaminyl transferase 

336 

36.301 

3524 

U72520 

Mus musculus mena protein 

1198 

80.992 

3525 

AJ003125 

Homo sapiens procollagen I N-proteinase 

8516 

99.422 

3526 

Z19585 

Homo sapiens thrombospondin-4 

2679 

99.737 

3527 

AF111785 

Homo sapiens myosin heavy chain Ilx/d 

12060 

99.433 

3528 

U43148 

Homo sapiens patched gene homolog; similar to 
Drosophila patched protein, Swiss -Prot 
Accession Number P18502; transmembrane protein; 
Method: conceptual translation supplied by 
author 

364 

70. 930 

3529 

AB002361 

Homo sapiens KIAA0363 

9948 

99.277 

353 0 

D31886 

Homo sapiens KIAA0066 

6604 

100.000 

3531 

AF062389 

Rattus norvegicus kidney- specif ic protein 

1795 

78.593 

3532 

AB020716 

Homo sapiens KIAA0909 protein 

8247 

98.464 

3533 

D89285 

Mesocricetus auratus inter-alpha- trypsin 
inhibitor heavy chain 1 

939 

34.375 

3534 

AB023221 

Homo sapiens KIAA1004 protein 

1327 

67.266 

3535 

D38231 

Oryza sativa RWD 

192 

31.707 

3536 

AF071544 

Spinacia oleracea ribulose-1, 5-bisphosphate 
carboxylase /oxygenase small subunit N- 
methyl transferase I 

405 

28.090 

3 53 7 

A*J131245 

Homo sapiens Sec24B protein 

8242 

99. 370 

3 538 

A4 8861 

Homo sapiens unnamed protein product 

410 

40. 711 

353 9 

A31036 

Nicotiana alata PRP2 

126 

41 . 270 

3540 

AJ2 2263 6 

Homo sapiens hypothetical protein 

1372 

100 . 000 

354 1 

U8 074 7 

Homo sapiens CAGH3 

256 

97.619 

3542 

AB020716 

Homo sapiens KIAA0909 protein 

8247 

98 . 464 

"i C A O 

AF151822 

Homo sapiens CGI - 64 protein 

591 

96 . 386 

3544 

AB011665 

Mus musculus BAZF 

2205 

87. 569 

3 54 5 

AL096881 

Homo sapiens hypothetical protein 

432 

45. 143 

3 54 6 

U41060 

Homo sapiens LIV-l protein 

483 

45. 087 

3547 

AF064748 

Mus musculus S3 -12 

405 

63.158 

3548 

AF032666 

Rattus norvegicus rsec5 

3300 

90. 893 

3549 

D50925 

Homo sapiens The KIAA013 5 gene is related to 
pirn- 1 oncogene . 

8355 

99. 597 

3550 

U79260 

Homo sapiens unknown 

333 

70. 886 

3551 

AL022314 

Homo sapiens dJ1170K4.1 (novel protein similar 
to KIAA0176 and mouse, worm and fly proteins) 

1658 

89.655 

3552 

AB020671 

Homo sapiens KIAA0864 protein 

7713 

99. 590 

3553 

AL031033 

Homo sapiens C321D2 . 1 (Ribosomal Large Subunit 
Pseudouridine Synthase (EC 4.2.1.70, 
Pseudouridylate Synthase, Uracil Hydrolase) 

■Li -L XVCj piUUclIl/ 

2175 

100.000 

3554 

AL009196 

Unknown /predict ion= (method: " "genef inder " " , 
version: ""084" •') ; 

/predict ion= (method : " "genscan" " , ve 

487 

53.049 

3555 

AF132608 I 

Homo sapiens histone deacetylase 5 

7365 

99. 114 

3556 

U61538 | 

Homo sapiens calcium-binding protein chp 

221 

31. 928 


T7I 
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3557 

L07335 

Homo sapiens putative 

2166 

99.085 

3558 

AB011173 

Homo sapiens KIAA0601 protein 

5852 

100.000 

3559 

D79991 

Homo sapiens putative hydrophobic domain in 
amino acid positions 373-390. 

11158 

99.712 

3560 

AF125451 

Caenorhabditis elegans contains similarity to 
the NIFR3/SMM1 family 

444 

68.687 ; 

3561 

U75467 

Drosophila melanogaster Atu 

599 

54 . 974 

3562 

AL096750 

Homo sapiens hypothetical protein 

6025 

93 .699 

3563 

Y18314 

Homo sapiens paraplegin-like protein 

880 

66 .234 

3564 

AF177292 

Homo sapiens genethonin 3 

2211 

100.000 

3565 

X97675 

Homo sapiens plakophilin 2b 

1407 

99.556 

3566 

AL034399 

Homo sapiens dA191P20.2 (novel Fibronectin type 
III domain containing protein similar to Ring 
finger protein MIDI (Midline 1) ) 

2843 

99 . 770 

3567 

S45936 

Homo sapiens HTS1 

1434 

49 . 772 

3568 

AF045244 

Klebsiella pneumoniae ribitol kinase 

768 

41.617 

3569 

AF155108 

Homo sapiens NY-REN-41 antigen 

1578 

100.000 

3570 

AL022394 

Homo sapiens dJ511B24.3 (KIAA0395 (probable 
Zinc Finger Homeobox protein) ) 

3860 

99.829 

3571 

X74654 

Zea mays beta3 tubulin 

876 

33 .698 

3572 

U17133 

Rattus norvegicus ZnT-1 

1945 

81.671 

3573 

AJ010317 

Fugu rubripes Sand 

1593 

68 .232 

3574 

AB011180 

Homo sapiens KIAA06 08 protein 

1272 

77.366 

3575 

AF115313 

Thermomonospora curvata PkwA 

249 

27 . 099 

3576 

X52138 

Homo sapiens L7a protein 

296 

60 . 784 

3577 

AF097887 

Rattus norvegicus Chp 

840 

61 .321 

3578 

AC005005 

Homo sapiens similar to phosphatidylinositol 
(4 , 5) bisphosphate 5 -phosphatase ; match to 
PID:gl399105 

7224 

99.905 

3579 

AF130366 

Homo sapiens LISCH protein 

4074 

99 . 327 

3580 

AF045642 

Caenorhabditis elegans No definition line found 

446 

29.762 

3581 

AF170723 

Homo sapiens protein kinase STK10 

435 

43 .386 

3582 

AL110124 

Homo sapiens hypothetical protein 

3965 

99.836 

3583 

X62446 

Gallus gallus PR 264 

877 

62.105 

3584 

AF119334 

Homo sapiens zinc finger protein FOG- 2 

7618 

99.298 i 

3585 

Y16187 

Homo sapiens metallopeptidase 

1830 

98.127 | 

3586 

AF030430 

Mus musculus semaphorin Via 

2240 

91.826 

3587 

U73199 

Mus musculus Rho-guanine nucleotide exchange 
factor 

655 

57 .292 

3588 

AJ388553 

Canis familiaris hypothetical protein 

536 

73 .529 

3589 

AL050051 

Homo sapiens hypothetical protein 

774 

100 . 000 

3590 

U66003 

Xenopus laevis ADAM 13 

1196 

50. 920 

3591 

AC006276 

Homo sapiens R2 83 7 9 3 

452 

56 . 376 

3592 

M31209 

Gallus gallus skeletal muscle C-protein 

374 

35.096 

3 593 

AF000657 

Arabidopsis thaliana hypothetical protein 

423 

32 . 886 

3594 

AF152101 

Homo sapiens paracellin-1 

1810 

98.168 

3595 

AB018253 

Rattus norvegicus voltage-gated ca channel 

2023 

85. 946 

3596 

D80005 

Homo sapiens KIAA0183 

7286 

,99.812 

3597 

AF015041 

Homo sapiens NUMB-R protein 

3525 

99.625 

3598 

AF083384 

Homo sapiens 4 5kDa splicing factor; SPF 45 

1387 

94 . 937 

3599 

AF067946 

Caenorhabditis elegans similar to Drosophila 
ring canal protein (kelch) (SW:Q04652) 

279 

27.160 

3600 

U63648 

Mus musculus pl60 myb-binding protein 

373 

58 . 974 

3601 

L41690 

Homo sapiens tumor necrosis factor receptor 
type 1 associated protein 

2149 

99.390 
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3602 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

445 

82 . 353 

3603 

AC005896 

Arabidopsis thaliana unknown protein 

245 

29.697 

3604 

U60269 

Homo sapiens putative envelope protein; orf 
similar to env of Type A and Type B 
retroviruses axici to ciass 11 xit^Kvs 

162 

60 . 870 


AUOZDb 

jtiomo sapxens lnteynn axpna o suouniu precursor 

/OO / 

99 . 333 


TT1 Qfll Q 

nomo sapiens aaenovirus ii x>i ennancer oxnoxng 
protein 


100 . ooo 

JUU / 

TTO 1 A. 1 Q 

U3 / fk 3 1? 

no mo sapxens enaoy lin 

4202 

99 . 544 

3608 

AF003130 

Caenorhabditis elegans No definition line found 

324 

35.088 

jbuy 


Homo sapxens KIAA1118 protein 

7413 

99 .485 

3610 

U29056 

Mus musculus Src-like adapter protein 

514 

39.286 

3611 

AF1364 50 

Homo sapiens goodpasture antigen -binding 
protein 

4158 

99 . 519 

3612 

AF07432 9 

Mus musculus SH2-B PH domain containing 
signaling mediator 1 gamma isoform 

3244 

90.185 

■) r "i 0 

ibij 

AC U O b 1 3 5 

Arabidopsis thaliana putative vicilin storage 
protein (globulin- like) 

258 

26 . 642 

TCI /I 


Giardia intestinalis unidentified reading 

X X clmfc: , aXtcXnatlvc CUdOIl Uoc 

OAT 

36 . 2 57 

3615 

X82209 

Homo sapiens MN1 

3066 

100.000 

jolb 


Homo sapxens CGI -15 protein 

"3 q r\ 

95 . 522 

JDl / 

abozo 9 99 

Homo sapiens KIAA1076 protein 

5582 

100.000 

ibltJ 

XjZ b 3 3 O 

Cavia porcellus zinc finger protein 

O Q Q 

93 .333 

0 c n a 
3b 19 

AB014553 

Homo sapiens KIAA06 53 protein 

3 82 9 

99 . 821 

*3 *r 0 n 
Jb2U 

AC 

Homo sapiens Gene with similarity to rat 
kidney- specif ic (KS) gene 

31/ 

58 . 182 

3621 

U20780 

Mus musculus ubiquitinating enzyme E2-230 kDa 

906 

86 .420 

3622 

D2 5218 

Homo sapiens KIAA0112 

2639 

100 . 000 

362 3 

Ul 0536 

Pan paniscus MHC class I A 

872 

84 . 049 

3624 

L00352 

Homo sapiens low density lipoprotein receptor 

4599 

99.850 

3625 

AL031228 

Homo sapiens dGT1033B10 . 10 (membrane protein 
with histidine rich charge clusters (HKE4 , 

K X IN V7 _> ^ ) 

593 

38 . 920 

jDZD 

/ibUUUii X D 

Rattus norvegicus CCA1 protein 

O Q ~7 
JO / 

o4 . y j z 

3 D Z / 

7v t n 0 c c n 
/UjU b b U 

Caenorhabditis elegans predicted using 

fianaf i nrfov 
Vjcllt: J- IIIUcX 

COT 

j x . y4 y 

362 8 

L3 9211 

T-T/^TTt/^ cj si n i on c o vn t t n o t>o i m t f~ r ~1 f~ n o ^ a a a T 

nuiuu octu xciio LdLiiXLXiic L/ctxiuxLuy x LLdiioLcx aoc x 

D w O 

3 O . Z ? J 

362 9 

AC \J W J. J \_/ 

LctCUUI ilaJJUl Lib t.Xt.y alio UcJLXIlXLi.UIl X 1 1 It; XCVXXlvX 

P ^ cr 

ODD 


3630 

AF057026 

Rattus norvegicus protein kinase KID-1 

1480 

94 .444 

J D J 1 

TTQ fll A 1 

T-X/— \rn nam oria V\i i /^i^\Vi ^ ~1 -i yi ■v\ , *~*~\t'~ ir\ 

nuniu acLjj xcim xjixuyjLCjj^/inxxii pxot-tjxn 



3632 

AB007883 

Homo sapiens KIAA0423 

10829 

99.529 

1 ^ ^ 1 

JO J J 


nomo sapxens jmaaiuiu procexn 

O iT "7 fk 

Q Q O A O 

y 9 . o4 o 

3634 

Z94864 

Schizosaccharomyces pombe hypothetical protein 

384 

36 .923 

o 0 c 
3b3 i> 

tto one 
Uzzi /b 

Homo sapiens alternatively spliced product 

ix o x iiy exon i jh 

320 

71 . 084 

3 b 3 b 

/\.Jf J.XXXUJ 

Homo sapiens mitogen- activated protein kinase 

JVXIlCloC JVXllCtoC 1*1 HiZVtVxi 


Q O C A C 

90 . 546 

3637 

AF123880 

multiole sclerosis associated retrovirus 
element unknown protein U5/1 

317 

79 . 032 

3638 

AB020720 

Homo sapiens KIAA0913 protein 

2943 

98 .633 

3639 

U15131 

Homo sapiens pl26 

2467 

64 . 726 

3640 

Y09631 

Homo sapiens PIBF1 protein 

643 

79.452 

3641 

AL035263 

Schizosaccharomyces pombe hypothetical protein 

628 

24 .409 


111 
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3642 

AL050283 

Homo sapiens hypothetical protein 

976 

54.015 

3643 

AF037454 

Mus musculus ubiquitin protein ligase 

5172 

95.758 

3644 

AJ005891 

Homo sapiens JM2 

620 

38.671 

3645 

AF149413 

Arabidopsis thaliana contains similarity to 
protein kinase domains; Pfam PF00069, 
Score=15.8, E=0.0027, N=l 

406 

48.052 

3646 

M83297 

Rattus norvegicus protein phosphatase 2A 55 kD 
regulatory subunit B 

1912 

78.919 

3647 

Z19152 

Caenorhabditis elegans cDNA EST EMBL:T02216 
comes from this gene; cDNA EST ykl93b5.3 comes 
from this gene; cDNA EST ykl93b5.5 comes from 
this gene 

364 

29.218 

3648 

U95044 

Homo sapiens zinc finger protein 

1472 

82.449 

3649 

X92666 

Bos taurus cysteine string protein 

329 

82.258 

3650 

AB002384 

Homo sapiens KIAA03 86 

715 

29.856 

3651 

Z99709 

Caenorhabditis elegans cDNA EST EMBL:D73217 
comes from this gene; cDNA EST yk478c5.3 comes 
from this gene; cDNA EST EMBL:M89187 comes from 
this gene 

237 

28 . 044 

3652 

U71383 

Homo sapiens OB binding protein- 2 

578 

96 . 809 

3653 

U09874 

Mus musculus SKD3 

3142 

95.050 

3654 

U79260 

Homo sapiens unknown 

318 

70.513 

3655 

D87458 

Homo sapiens Similar to Human estrogen- 
responsive finger protein, efp (A496 56) 

3757 

99.636 

3656 

AB014522 

Homo sapiens KIAA0622 protein 

8194 

99. 153 

3657 

D87470 

Homo sapiens KIAA02 8 0 

784 

53.252 

3658 

AF082657 

Homo sapiens Era GTPase A protein 

2941 

99. 774 

3659 

AF123344 

Homo sapiens Kruppel-like zinc finger 
transcription factor 

2537 

99.155 

3660 

U2 3 1 7 2 

Caenorhabditis elegans No definition line found 

197 

40.206 

3661 

L25125 

Mus musculus RNA helicase 

533 

100.000 

3662 

AL049955 

Homo sapiens hypothetical protein 

1060 

87. 958 

3663 

U21556 

Homo sapiens similar to rat integral membrane 
glycoprotein, PIR Accession Number A40670 

2083 

91.193 

3664 

AC008075 

Arabidopsis thaliana F24J5.4 

224 

35 . 075 

3665 

D86983 

Homo sapiens similar to D . melanogaster 
peroxidasin (U11052) 

10048 

99.466 

3666 

AB018313 

Homo sapiens KIAA07 70 protein 

4841 

100.000 

3667 

AB028989 

Homo sapiens KIAA1066 protein 

7056 

99 . 625 

3668 

U13262 

Mus musculus myelin gene expression factor 

701 

80.714 

3669 

X85019 

Homo sapiens UDP-GalNAc : polypeptide N- 
acetylgalactosaminyl transferase 

1777 

52 . 008 

3670 

D87325 

Mus musculus GSG1 

471 

45.588 

3671 

AF000996 

Homo sapiens ubiquitous TPR motif, Y isoform 

191 

71 . 739 

3672 

S58722 

Homo sapiens X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 

289 

78.689 

3673 

U41543 

Unknown Similar to Rat trg gene product; coded 
for by C. elegans cDNA yk31e7.5; coded for by 
C . ele 

881 

45.652 

3674 

AL1104 90 

Caenorhabditis elegans predicted using 
Genef inder 

481 

51 . 266 

3675 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

445 

82.353 

3676 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

445 

82 . 353 
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36 7 7 

X64995 

Homo sapiens HGMP07J 

856 

55.469 

3678 

AB007862 

Homo sapiens KIAA04 02 

11152 

99 . 885 

3679 

AB002374 

Homo sapiens KIAA03 76 

5532 

99.548 

3680 

AB020671 

Homo sapiens KIAA0864 protein 

7713 

99.590 

3681 

AC006201 

Arabidopsis thaliana hypothetical protein 

261 

27 .397 

3682 

AF124490 

Homo sapiens ARF GTPase- activating protein GIT1 

3300 

96 . 507 

3683 

X75887 

Bos taurus brevican 

5154 

89.180 

3684 

AF036249 

Mus musculus polymerase I -transcript release 
factor; PTRF 

2102 

93 . 817 

3685 

U95031 

Homo sapiens sublingual gland mucin 

6305 

99.659 

3686 

U94991 

Xenopus laevis transcription factor XLMOl 

535 

77 .451 

3687 

AC004021 

Homo sapiens kelch protein; ring canal 
component involved in cytoplasmic bridges; 77% 
Similarity to A45773 (PID : gl079096 ) 

582 

32 .773 

3688 

Z47811 

Unknown similar to ubiquitin carboxyl- terminal 
hydrolase; cDNA EST EMBL:D34 519 comes from this 
gene 

266 

34 .228 

3689 

AF117756 

Homo sapiens thyroid hormone receptor- 
associated protein complex component TRAP150 

852 

93.197 

3690 

Y07800 

Oryctolagus cuniculus ryanodine receptor 

760 

91.597 

3691 

D86957 

Homo sapiens similar to Drosophila melanogaster 
sept in (sep2) . 

2094 

69.456 

3692 

Z19555 

Unknown predicted using Genefinder; similar to 
Propionyl-CoA carboxylase beta chain; cDNA EST 
EMBL: 

2434 

65.177 

3693 

X73608 

Homo sapiens testican 

2368 

95.628 

3694 

AB014533 

Homo sapiens KIAA0633 protein 

8740 

99.393 

3695 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

333 

70.588 

3696 

AL021816 

Schizosaccharomyces pombe SPBC24E9 . 03c, 
unknown, len:2 51aa 

207 

34 .694 

3697 

D50913 

Homo sapiens The KIAA0123 gene product is 
related to rat general mitochondrial matrix 
processing protease (MPP) . 

3457 

99.811 

3698 

D87930 

Homo sapiens myosin phosphatase target subunit 
1 

532 

34 .483 

3699 

AJ13 34 8 8 

Bos taurus SCO-spondin 

1570 

79.377 

3700 

AJ 001981 

Homo sapiens OXA1L 

3268 

98 .400 

3 701 

AF061936 

Homo sapiens diacylglycerol kinase iota 

7093 

99.438 

3702 

Ali0344 17 

Homo sapiens bK215D11.2 (similar to rat gene 
33 ) 

2219 

98 . 176 

3703 

AL080143 

Homo sapiens hypothetical protein 

2615 

96.649 

"5 *7 f\ A 
3 /04 

D78020 

Rattus norvegicus NFI-A4 

413 

81 . 579 

3705 

AB001772 

Ciona savignyi PEM-5 

380 

35.217 

3706 

AF133086 

Homo sapiens membrane- type serine protease 1 

5839 

99 . 647 

3707 

AB020654 

Homo sapiens KIAA0847 protein 

4251 

100.000 

3708 

Z14014 

Nicotiana tabacum Pistil extensin like 
protein, partial CDS only 

165 

46 . 154 

3709 

U37143 

Homo sapiens cytochrome P4 50 monooxygenase 
CYP2 J2 

1025 

43 . 056 

J / lu 

ZVPOfi 1 Q79 

nomo sapiens una cycosine metnyi trans r erase 3 
alpha 

6 312 

9 9.233 

3711 

X83957 

Homo sapiens nebulin 

6801 

98.712 

3712 

D10250 

Homo sapiens alpha- fetoprotein enhancer binding 
protein 

821 

45.556 

3713 

AC004794 

Homo sapiens F02 56 9 2 

1052 

79.630 
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3714 

AJ000644 

Homo sapiens SPOP 

933 

71.635 

3715 

AC006029 

Homo sapiens Similar to Sperm Surface Protein 
PH-20; Similar to P38568 (PID:585674) 

3039 

99 . 585 

3716 

X62528 

Rattus norvegicus ribonuclease inhibitor 

614 

38 . 909 

3717 

D16226 

Oryctolagus cuniculus one of the members of 
sodium-glucose cotransporter family 

843 

64 . 362 

3718 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

347 

63 .107 

3719 

M35297 

Rattus norvegicus G-protein coupled receptor 

286 

75 . 385 

3720 

AB024400 

Rattus- norvegicus LAT4 

832 

57 . 627 

3721 

U79775 

Homo sapiens NNP-l/Nop52 

759 

96 . 667 

3722 

Z83838 

Homo sapiens GTPASE- ACTIVATING PROTEIN 

2113 

99.676 

3723 

U90653 

Homo sapiens DHHC- domain- containing cysteine - 
rich protein 

290 

37.500 

3724 

AB029016 

Homo sapiens KIAA1093 protein 

815 

54 . 122 

3725 

U10991 

Homo sapiens G2 

10859 

99 . 764 

3726 

U41543 

Unknown Similar to Rat trg gene product; coded 
for by C. elegans cDNA yk3le7.5; coded for by 
C. ele 

948 

41 . 289 

3727 

U76373 

Mus musculus skm-BOPl 

1636 

91 . 760 

3728 

AF155595 

Homo sapiens CoREST protein 

3121 

100 . 000 

3729 

AF167320 

Mus musculus zinc finger protein ZFP113 

877 

72.189 

3730 

AB011089 

Homo sapiens KIAA0517 protein 

5113 

99.114 

3731 

AC004542 

Homo sapiens OXYSTEROL- BINDING PROTEIN- like ; 
similar to P22059 ( PID : gl29308 ) 

580 

77.778 

3732 

U80745 

Homo sapiens CTG7a 

2275 

98 . 841 

3733 

Z75536 

Caenorhabditis elegans similar to dynein heavy 
chain; cDNA EST EMBL:D2 7549 comes from this 
gene; cDNA EST EMBL:D34 8 59 comes from this gene 

244 

35.385 

3734 

AJ002424 

Rattus norvegicus p65 protein 

1152 

63 . 235 

3735 

L36434 

Mus musculus basic domain/leucine zipper 
transcription factor 

459 

84 .884 

3736 

D80005 

Homo sapiens KIAA0183 

7297 

99.906 

3737 

U79263 

Homo sapiens unknown 

2128 

99 . 115 

3738 

X55126 

Mus musculus Zfp-2 9 

648 

39 . 858 

3739 

U81375 

Homo sapiens equilibrative nucleoside 
transporter 1 

510 

36 . 965 

3740 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

444 

73 . 404 

3741 

A J 0 01531 

Homo sapiens neurotrypsin 

6316 

99.657 

3742 

X66405 

Mus musculus collagen alphal type VI -precursor 

6248 

90 . 578 

3743 

X66902 

Mus musculus En-2/lacZ fusion protein 

402 

92 . 857 

3744 

AF121081 

Mus musculus cAMP inducible 2 protein 

395 

90 . 000 

3745 

AB020671 

Homo sapiens KIAA0864 protein 

7713 

99 . 590 

3746 

U75329 

Homo sapiens serine protease 

577 

42 . 512 

3747 

D86980 

Homo sapiens KIAA0227 

447 

63 . 793 

3748 

U2 2376 

Homo sapiens alternatively spliced product 
using exon 13A 

322 

62.245 

3749 

AB023206 

Homo sapiens KIAA0989 protein 

5583 

99 . 767 

3 75 0 

X52949 

Giardia intestinalis unidentified reading 

J_ J- ctllltr / ct-L LcillaL J. vc LUUUI1 Uoc 

181 

3 5 . 06 5 

3751 

AB01H05 

Homo sapiens KIAA0533 protein 

10943 

99.818 

3752 

L07809 

Homo sapiens dynamin 

323 

77. 049 

3753 

AB028957 

Homo sapiens KIAA1034 protein 

4933 

98.558 

3754 

AF078165 

Homo sapiens conduct in 

4436 

98 .678 

3755 

X52876 

Gallus gallus myosin light chain kinase 

263 

38 .312 
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3756 

U23452 

Caenorhabditis elegans No definition line found 

355 

40.397 

3757 

U80445 

Unknown coded for by C. elegans cDNA ykl3g5.3; 
coded for by C. elegans cDNA yk21g6.3; coded 
for by 

1584 

45. 819 

3758 

U80223 

Drosophila melanogaster eukaryotic initiation 
factor eIF-2 alpha kinase; DGCN2 

372 

27.586 

3759 

U12392 

Haematobia irritans putative ATPase 

628 

45.588 

3760 

Y15913 

Homo sapiens COL1A1 and PDGFB fusion transcript 

102 

41. 071 

3761 

AL050089 

Homo sapiens hypothetical protein 

5354 

99. 753 

3762 

AF063231 

Mus musculus cytoplasmic dynein intermediate 
chain 2 

3011 

96.646 

3763 

AF151840 

Homo sapiens CGI- 82 protein 

639 

54 . 040 

3764 

AB014600 

Homo sapiens KIAA0700 protein 

7538 

99. 735 

3765 

AF000996 

Homo sapiens ubiquitous TPR motif, Y isoform 

425 

75.610 

3766 

D42055 

Homo sapiens KIAA0093 gene product is related 
to NEDD-4 protein. 

6302 

99. 892 

3767 

M27685 

Mus musculus ultra-high sulphur keratin 

396 

39.759 

3768 

AL031177 

Homo sapiens dJ889M15.3 (novel protein) 

366 

30.534 

3769 

AF128527 

Homo sapiens breast cancer associated gene 1 
protein 

768 

42.663 

3770 

AL022393 

Homo sapiens p3 73c6.1 

3523 

100 .000 

3771 

U66496 

Homo sapiens leptin receptor 

265 

63 . 380 

3772 

M34915 

Bos taurus retina-specific 15.7 kDa protein 

376 

57.778 

3773 

D86984 

Homo sapiens similar to yeast adenylate cyclase 
(S56776) 

1155 

61.056 

3774 

D88750 

Bacillus circulans beta-galactosidase 

434 

36.792 

3775 

Z50097 

Drosophila melanogaster hdc protein 

691 

30.839 

3776 

U83246 

Homo sapiens copine I 

1011 

53 .667 

3777 

AB014516 

Homo sapiens KIAA0616 protein 

263 

36 . 757 

3778 

AL031393 

Homo sapiens dJ733D15.1 (Zinc-finger protein) 

562 

41.026 

3779 

Z24725 

Homo sapiens mitogen inducible gene mig-2 

4368 

95. 921 

3780 

U37376 

Xenopus laevis MAM domain protein 

1475 

66. 134 

3781 

AC007228 

Homo sapiens BC37295 2 (partial) 

1091 

100.000 

3782 

AF124440 

Homo sapiens MAGE tumor antigen Dl 

1496 

57 . 176 

3783 

AF043695 

Caenorhabditis elegans similar to the protein 
phosphates 2c family 

732 

37 . 789 

3784 

AL049733 

Homo sapiens dJ875H3.1 (APK1 antigen) 

1628 

81.034 

3785 

AC006225 

Arabidopsis thaliana putative multidrug 
resistance protein 

948 

45 . 723 

3786 

AB001636 

Homo sapiens ATP -dependent RNA helicase #46 

480 

36 . 134 

3787 

U63839 

Rattus norvegicus nucleoporin p58 

701 

94 . 017 

3788 

U75329 

Homo sapiens serine protease 

782 

39.441 

3789 

AF127389 

Rattus norvegicus putative taste receptor TR1 

1201 

43 . 715 

3790 

X80038 

Homo sapiens Polio virus receptor protein 

2979 

99.560 

3791 

AF143003 

Perca f lavescens lysyl oxidase related protein 
homo log 

1350 

58.610 

3792 

S44213 

Saccharomyces cerevisiae, Peptide, 323 aa 
YKL522=mitochondrial ADP/ATP carrier protein 
homo log 

479 

35. 918 

3793 

AB018342 

Homo sapiens KIAA07 99 protein 

44 94 

97 . 899 

3794 

Z29371 

Oryctolagus cuniculus adenylyl cyclase type V 

1178 

100.000 

3795 

AF004715 

Homo sapiens jerky gene product homolog 

964 

56.154 

3796 

AJ001403 

Homo sapiens MUC5AC protein 

7672 

98 . 885 

3797 

X52875 

Mus musculus Prx2 

1371 

88.845 

3798 

AF071172 

Homo sapiens HERC2 

533 

32 .626 
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3799 

X75342 

Homo sapiens Shb 

3266 

98.780 

•j O n /\ 

M94 3o2 

Homo sapiens latnin B2 

3268 

99 . 806 

i o r\ i 
3 8 U 1 

AD /J.J J 

Homo sapiens mitotic kinase- like protein- 1 

4488 

98 . 875 

3 8 02 

AF 13 2 972 

Homo sapiens CGI- 38 protein 

694 

62 . 424 

Tom 
3803 

TTO Q1 

U2 91bo 

Mus mus cuius involved in signaling by the 
epidermal growun raccor receptor; wecnoa: 

^UHL-cpLUdl LI allbXaLlUIl ouppxlcU *Jy aUHIUJL 

566 

63 . 399 

3804 

AB011104 

Homo sapiens KIAA0532 protein 

10989 

99.939 

TO AC 

3 O U b 

Tk T7»r\ QQQ1 /; 

Homo sapiens emilin precursor 

6 793 

99 . 902 

3806 

L41834 

Ensis minor nuclear protein 

466 

31.944 

1 Q A *7 

At 118 8 8 9 

Rattus norvegicus b-tomosyn isoform 

2613 

94 . 601 

3808 

Lil4 00 9 

Drosophila melanogaster zinc finger protein 

294 

39 . 655 

3809 

D38538 

Anthocidaris crassispina dynein intermediate 
chain 2 

320 

61.250 

101 ft 

\j / y y y fi 

Homo sapiens similar to ankyrin of Chromatium 
vmosum . 

352 

54 . 918 


T A/1 QD 

Homo sapiens NADH dehydrogenase ( ubiquinone ) 

n O a 
/ 8 U 

83 . 8 /l 


m*7 *7 r» n 1 

i v ius muscuius glycerol - 3 -pnospnate 
acyl transferase 

1 1 A A 

1344 

96 . 744 

J u J. J 

UZ" DUX 

Mus muscuius Zf p6 7p 

407 

57 . 500 

3814 

AF055993 

Homo sapiens mSin3A associated polypeptide p3 0 

761 

67.935 

3815 

ALi080123 

Homo sapiens hypothetical protein 

828 

60 . 317 

3816 

D80005 

Homo sapiens KIAA0183 

7297 

99.906 

3817 

AL110490 

Caenorhabditis elegans predicted using 
Gene finder 

490 

53 . 503 

3818 

L31840 

Rattus norvegicus nuclear pore complex protein 
NUP107 

3280 

92.559 

3819 

D86 976 

Homo sapiens similar to C. elegans protein 

\ Z J 7093 ; 

7679 

99 . 914 


Ay / ZJU 

Homo sapiens NK receptor 

2 96 8 

98 . 621 

o o o n 

U22 3 / b 

Homo sapiens alternatively spliced product 

78 

30 . 769 



riorno Sapiens .dijnhji 

TCI Q 

l no ftnn 
1UU . uuu 

3823 

AF041835 

Homo sapiens laminin gamma 3 chain precursor 

11183 

99.685 


ADftTft/l in 

Ar UJU4 JU 

Mus muscuius semaphorin Via 

170 

30 . 76 9 

3825 

AB014516 

Homo sapiens KIAA0616 protein 

323 

31.915 

O Q O ZT 

3826 

AB02 8 94 8 

Homo sapiens KIAA102 5 protein 

7802 

1 ft ft ft ft ft 

100 . 000 

1 Q O "7 

T\Q O "7 C ft 

JJ88 /bO 

Bacillus circulans beta-galactosidase 

433 

36 . 792 

3828 

D50918 

Homo sapiens The KIAA0128 gene is related to 
cdclO . 

880 

98.601 

"3 O *> Q 

382 9 

D90716 

Escherichia coli Hypothetical 18.7 kd protein 
in rhlE-dinG/rarB intergenic region (F160) . 

703 

99 . 115 

"3 Q "3 r\ 

38 3 0 

AiJU 1 8 2 34 

Homo sapiens KIAA0711 protein 

285 

31.818 

3831 

AC004 010 

Homo sapiens similar to Leucine-rich 
transmembrane proteins; 44% similarity to 
U42767 (PID:gl736918) 

3422 

99 . 808 

JO J/I 

Dj 0 b 3 8 

Anthocidaris crassispina dynein intermediate 
chain 2 

bl 9 

b / . /4o 

JO J J 

t^i nice 

Homo sapiens alanine aminotransferase 

1 /ii « 

141U 

CI Olft 

3834 

AF0 RQfil 1 




3835 

D87467 

Homo sapiens Similar to a C. elegans guanine 
nucleotide releasing factor homolog (S4 2368) 

751 

60.101 

3836 

AF022212 

Homo sapiens Rho GTPase activating protein 6 
isoform 2 

4372 

98.671 

3837 

AF135491 

Mus muscuius neuronal apoptosis inhibitory 

364 

23.143 


nil 
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protein 



3838 

AF089897 

Homo sapiens topoisomerase- related function 
protein 

701 

72 . 414 

3839 

AC006951 

Arabidopsis thaliana putative 3-oxoacyl carrier 
protein synthase II 

1115 

54 . 655 

3840 

TV T-»"l r\ A ft ft 

AF104260 

Homo sapiens hiwi 

IOC 


3841 

X90849 

Gallus gallus polybromo 1 protein 

890 

84.967 

3842 

AF045022 

Bos taurus phosphatidic acid-preferring 
phosphol ipase Al 

2411 

91 . 847 

3843 

AC007661 

Arabidopsis thaliana hypothetical protein 

344 

36 . 548 

3844 

AL080062 

Homo sapiens hypothetical protein 

O A O O 

99 .6 53 

3845 

D83536 

Escherichia coli Acyl- -UDP-n-acetylglucosam ine 
o-acyltransf erase (EC 2.3.1.129) 

570 

98.901 

3846 

AF115509 

Homo sapiens LRR FLI-I interacting protein 2 

256 9 

97 . 092 

3847 

S58722 

Homo sapiens X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 

261 

80 . 000 

3848 

Z81143 

Caenorhabditis elegans cDNA EST yk481g5.5 comes 
from this gene; cDNA EST yk508e4.3 comes from 
this gene 

187 

24 . 766 

384 9 


jjrosopniid mej.anogascer dcjl>jn/\ . unuz zj i *± 

*i _/ fx 

fx -3 • O 3 _3 

"> □ C ft 

C ft ft Q^J 

riomo sapiens vj j. u _> protein 


i Aft nnn 

38 51 

AB018274 

Homo sapiens KIAA0731 protein 

TCI 1 


3852 

X0S4 72 

Rattus norvegicus ORF 3 

-L / -5 

co ftft r» 

3853 

D29954 

Homo sapiens KIAA0056 

QOCl 


3854 

AF12 64 84 

Homo sapiens CARD 4 

O Ui7 

it nan 

3855 

AF151824 

Homo sapiens CGI -66 protein 

1 O O Q 

qi -3 n a 

3 856 

Z38102 

Homo sapiens interleukin- 11 receptor 

iy / 

O / . ouu 

3857 

AFOb 1346 

Mus musculus Edpl protein 

T QC 
1 J D 

C Q fiQ1 

38 58 

U586 58 

Homo sapiens unknown 

Z 1 

ci c A 

38 5 9 

Tin n o T O 

Peromyscus leucopus reverse transcriptase 

i cn 
1 D U 


3860 

tv t~> o rv r~v *i 

AB02 8 997 

Homo sapiens KIAA1074 protein 

1 £ o 
loz 

c C CCt 
jj . DDO 


Ar V Zoo Zo 

Homo sapiens Tax interaction protein 33 



3 862 

TV. O O 1 ft ft ft C 

AB012725 

Mus musculus zinc finger protein 


O ft . Uf± o 

3863 

AF0 3 18 3 b 

caenornaoai lis eiegans ijiixoa; ppvjaiNiase 

D J J 

1 A R A P 
Of* . Of* O 

"3 O C A 

3ob4 

f\Z>2 oUb 

uniaenciEieu unnainea proLein ptociiict. 


07 An 

O / . O *i U 

~> o a c 
Job b 

A / lob b 

Bos taurus calcineurin 

_J D D 

/ J . O fi _L 

"3 Q tZ C 
JODO 

z,xx / y j 

riomo sapiens seicnoprouein f 


qq A7C 

-/ -7 . ft / 3 

Job/ 


vjcoiemur crassicauaacus o — axpria-riernoy lojjin 

1 fiQ 
J. O _7 

Al 77? 
Hl X . tic. 

JODO 

>\r U .Lbftf* o 


4 51 

29 . 392 

"3 Q £T Q 

job? 

7A Tft 1 1 Q C c 


IOC 
X z, ^ 

AH 47fi 

3870 

A01592 

Homo sapiens haemoglobin A beta chain 

212 

58 .462 

3871 

V00662 

Homo sapiens cytochrome oxidase I 

1742 

91.667 

3872 

U222 32 

Felis catus ribosomal protein S4 

ACT 

O / - 0**Z 

3 8 73 

AF15 90 92 

Homo sapiens syld709613 protein 

/ 

/ 1 . Z 1 J> 

3874 

V ft C ft ft ft 

X85807 

Saccharomyces cerevisiae ORF G6623 

Z3*± 

ft c can 
zb . byu 

38 75 

U22 961 

Homo sapiens similar to human albumin, Swiss- 
rxOu Accession XNUiuDer ruz / bo, iriecnoa.: 

<-i^vi roril*i ial hvanol at* i at? cjh wri 1 1 pH ant" Vi c\ v 
^UUOcpuUax LlallolaUlUIl t> uppi -LtrLJ. U y aULllUI 

O O Q 

Q1 P "3 7 

Jo / b 

T Q 
bUb4 X y 

T_T y-\TYV ~» >— \ ^ a vv 1 If 1 r 1 r/'H y^VTfl ^ C? CV 

riorno odpiciiD lybyi i ly lii. KJj^y x dot; 

464 

77 . 273 

3877 

AF041378 

Homo sapiens cell death activator CIDE-A 

260 

44 . 444 

3878 

AL109630 

Drosophila melanogaster BACR7A4 . z 

198 

42 .169 

3879 

D50134 

Homo sapiens inward rectifying K channel 

180 

59.322 

3880 

U49439 

Drosophila melanogaster ASH1 

614 

35.754 

3881 

AF081947 

Mus musculus tektin 

191 

49.057 

3882 

AL034559 

Plasmodium falciparum predicted using hexExon,- 

178 

47 . 170 
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MAL3P7.14 <PFC0925w), Hypothetical protein, 
len : 489 aa 



3 883 

1*1 27 / OOZ 

i\.ci LLus norveyicus Deta-aianine syncnase 

1836 

86.469 

J O O f* 

'h.cnnA q q n 

/-iv^ KJ \J *4 27 27 U 

Homo sapiens supported by Genscan and several 

CiOlO . ^.O Jul J \lN!L'.yjUOZyUD / , JrWAO ^> »5 / D U 

(NID:g2893628) , AA215791 (NID : gl815572 ) , 
AI095488 (NID:g3434464) , and AA969095 
(NID -a3 144 2 75) 

3070 

98.488 

3885 

AF163254 

oa^xciiis c* vjciky L.v_^x jL/iULClll LJ^\.lr tr X 

ion 
J Z. U 

90.196 ] 

3886 

AB010710 

Homo 5^ Pi o i o n <=j 1 prt" *i n - 1 i If p rw "5 H i T7 orl T .PIT . voronf-nv 

/i n "i 
U J 


3887 

L29029 

Chi amvdomona s i nViAvrit" i i amino n r\ f o ;=i t~ 1 1 r-o * 
Rod protein domain, aa 266 . . 468; amino acid 
feature: globular protein domain, aa 32 . . 265 


n nil 

3888 

X67863 

Mus musculus T2 

133 

34 .444 

3889 

AF028722 

Mus musculus fetal globin inducing factor 

915 

80.34 7 

3890 

AF044208 

Drosophila melanogaster Strabismus 

1389 

4 7.228 

3891 

D10712 

Mus musculus nedd-1 protein 

1450 

78.3 97 

3892 

U07974 

Gallus gallus unknown 

349 

J *± . 

3893 

AF107295 

Rattus norvegicus outer membrane protein 

962 

PI ^ 1 R 

OX . ilD 

3894 

M36912 

Zea mays cell wall protein (put.) ; putative 

247 

31 073 

3895 

M23568 

Mus musculus transmembrane protein 

404 

75 . 862 

3896 

AB002317 

Homo sapiens KIAA0319 

1066 

J U . -J W _7 

3897 

X92485 

Plasmodium vivax pval 

318 

CO COO 
J ^ » D O O 

3898 

X68060 

Homo sapiens DNA topoisomerase II 

10692 

99.753 

3899 

U09366 

Homo sapiens zinc finger protein ZNF13 3 

1807 

59 .069 

3900 

X66366 

Rattus norvegicus Gephyrin 

2988 

86 .289 

3901 

AF113131 

Homo sapiens host cell factor homolog LCP 

1948 

92.105 



ttomo sapiens iviAAuy^i protein 

259 

53 . 425 



Caenorhabdi tis elegans similar to S . cerevisiae 

T"k y~o — mPMZi ei-il -J A yi/~t f- a t~ /~\ -r~ "DMA V<q 1 -I <-i — » <-» /-> no T> o O 

pxre mrciMM. spxxciny taccor kjn/a. nencasc rKViiz 
(SP:PR22_YEAST, P24 384) and other DEAH 
subfamily members of the DEAD box family 
helicases 

695 

40 . 370 

3 904 

298949 

nv-»uiu oapxciio jjivi^ jiiz . J- \myosxn neavy crict in/ 

6 922 

99 . 813 

3905 


O • V» -J >7 /— » o 3 /~" Vi ^3 •*~<"*i , m\/'f"' , o e MomV^o )i\/r^r\t" V» o -S nal »-\>-/-vl- e» -i ■*-» 
o^iix iUbaLLlldl l ly t; fc> pLJlllxjfcJ *iypot.llSUXCo.X pxOLGin 

1088 

40 . 222 

3906 


oapxciio oitppux LcU JJy iiuulclll Cj£> l S j j 73 

(NID:g892090) and AA126363 (NID : gl687976 ) , 
mouse ESTs W83982 ( NID ■ a\ 7 94 } atiH a^7i 7^1"^ 
(NID:g2729907) , and Genscan 

*"i f\ A O 

zU4z 

98 .697 

3907 

AF133124 

Homo sapiens transcription factor IIIC63 

2190 

94 334 

3908 

AF074086 

Homo sapiens envelope 

781 

53 .266 

3909 

AF035285 

Homo sapiens dihydroxyacetone phosphate 
acyl transferase 

177 

53 . 968 

3910 

AB018272 

Homo sapiens KIAA072 9 protein 

1517 

79.193 

3911 

AC008075 

Arabidopsis thaliana F24J5.4 

155 

35 . 922 

3912 

U38904 

Homo saoiens zinc fincier* Drotein C?W? - ^ 

117 9 

(Z -t one 

3913 

AF049099 

Mus musculus SPAF 

851 

40.223 

3914 

AF01014 4 

Homo cj^ni one npnrnnal hbrparl nrnhPi n Zkr» r 7r^ — mtd 

j ft u 

C ZL ore: 
of* . o o D 

3915 

AL033125 

Unknown l-evidence=predicted by content; 1- 
method=genef inder ; 084 ; 1- evidence end; 2- 
evidence=pred 

245 

29. 907 

3916 

AJ002474 

Rattus norvegicus Testis-specif ic A-kinase- 
anchoring-protein 

244 

26.106 

3917 

AB003503 

Mus musculus Guanine Nucleotide Regulatory 
Protein 

2849 

95.730 


180 
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3918 

AF060539 

Mus musculus channel interacting PDZ domain 
protein 

710 

82 .014 

3919 

AF016427 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00004 (AAA), Score=268.1, E- 
value=3 . 7e-77 , N=l 

845 

60.000 

3920 

Z30320 

Plasmodium falciparum liver stage antigen- 1 

41 

31 .250 

3921 

AB009698 

Homo s ap i ens hOAT 1-2 

168 

51 . 786 

3922 

AF136234 

Lytechinus variegatus microtubule-associated 
protein 

482 

35 .371 

3923 

AF131833 

Homo sapiens Unknown 

312 

69 . 014 

3924 

AF006064 

Fowlpox virus protein kinase homolog 

268 

40 . 566 

3925 

M12140 

Homo sapiens envelope protein 

638 

66 . 906 

3926 

AL022724 

Homo sapiens dJ413H6.1.1 (hamster Androgen- 
dependent Expressed Protein LIKE PUTATIVE 
protein) (isoform 1) 

346 

74 . 324 

3927 

AJ007798 

Homo sapiens nuclear protein SA3 

1098 

92 . 090 

3928 

AF127374 

Streptomyces lavendulae unknown 

715 

33 . 125 

3929 

Y15908 

Homo sapiens DIA-12C protein 

918 

96 .622 

3930 

M74824 

Drosophila melanogaster D-E-A-D box protein 

510 

38 . 768 

3931 

A00279 

synthetic construct Human serum albumin 

519 

67 . 857 

3932 

X78933 

Homo sapiens zinc finger protein 

1703 

64 . 571 

3933 

AL117557 

Homo sapiens hypothetical protein 

628 

64 .671 

3934 

AF077040 

Homo sapiens SIH003 

167 

92 . 308 

3935 

AF038963 

Homo sapiens RNA helicase 

564 

32 . 749 

3936 

AB014579 

Homo sapiens KIAA0679 protein 

232 

78 . 571 

3937 

AJ010045 

Mus musculus guanine nucleotide-exchange factor 

703 

49.351 

3938 

AJ005021 

Styela plicata intermediate filament protein 
IFB 

188 

25 .294 

3939 

S48220 

Homo sapiens type I 5 • iodothyronine 
deiodinase, 5 1 DI 

1696 

99.593 

3940 

U89984 

Acanthamoeba castellanii transformation- 
sensitive protein homolog 

324 

28 . 631 

3941 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

402 

64 . 948 

3942 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

315 

58 .621 

3943 

M93018 

Rattus norvegicus , gene product 

3698 

95 . 805 

3944 

AL032657 

Unknown predicted using Genefinder; similar to 
DnaJ domain ; Thioredoxin; cDNA EST yk433f3.5 
come 

943 

35 . 279 

3945 

U40411 

Caenorhabditis elegans Similar to ubiquitin- 
conjugating enzyme . 

254 

41 .83 7 

3946 

AF070572 

Homo sapiens unknown 

745 

35 . 980 

3947 

X68314 

Homo sapiens glutathione peroxidase-GI 

1295 

100.000 

3948 

J04801 

Homo sapiens open reading frame A 

321 

86 . 207 

3949 

M77836 

Homo sapiens pyrrol ine- 5- carboxylate reductase 

285 

42 . 400 

3950 

AP000060 

Aeropyrum pernix 14 3aa long hypothetical 
protein 

169 

37 .681 

3951 

U12762 

Caenorhabditis elegans prolyl 4 -hydroxylase 
alpha subunit 

249 

52.23 9 

1 q r ^ 

Tvnrti o •*> T /I 

ABUloz /4 

Homo sapiens KIAA0731 protein 

OCA 

DD . UUU 

3953 

AC005396 

Arabidopsis thaliana putative proline-rich cell 
wall protein 

174 

31.677 

3954 

U24657 

Myxococcus xanthus putative O-methyltransf erase 

371 

39.205 

3955 

U13876 

Caenorhabditis elegans similar to 4- 
hydroxybenzoate octaprenyl transferase 

588 

43 .882 
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3956 

Z37139 

Unknown similar to guanine nucleotide binding 
protein; cDNA EST EMBL:T00917 comes from this 
gene ; c 

514 

64.865 

3957 

AF 010144 

riomo sapiens neuronal tnreaa protein hu/l. imiit 


O 1 . O Z> D 

3958 

AF010144 

Homo sapiens neuronal cnreaa procein *\xj /c-jn x jf 

■J -3 o 

J D . Jo J 

3959 

U56966 

Caenorhabditis elegans coded for by C. elegans 
CJJlSiA yKiUDJ . d ; coaeu lor oy \~ . eiegana tf inm. i 
yk3 0b3 .3 " 

844 

40.116 

3960 

Y14999 

Drosophila melanogaster Bip2 protein 

325 

53.425 

3961 

M89471 

gallid herpesvirus 1 MDV Eco Q protein 

1 7C 
I/O 

4i D . J J 1 

3962 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 


CO DOC 

3963 

AC004 997 

Homo sapiens matcn to bo is auzj/u 
fMTn . cr661 0"*m and AA027979 (NID • crl4 94038 ) 

*± *± / 

fin t; c;n 

D U * DOW 

3964 

L24907 

Rattus norvegicus protein kinase I 

311 

63 .636 

3 96 5 


TN-^-^-v £3 /— »v> Vi i 1 a tt\c* 1 anr»ffa or - <T/^1 i ah Vl nvr\t" 1 n 
JJI OaOpillla ulclaiiuyabLCA. yyiiaun |jluucj.h 

368 

54 . 000 

3966 

U13644 

LacuOrnajjaiLlo clctjailo IMU UtS J- Xll-L U- lUil -L -Liic J.uuiiu 

4 24 

40 . 18 7 

3 96 7 


riomo sapiens i^.i/i-H.u^-5 3 protein 

4930 

96 . 834 

396 8 


riomo sapiens js.±*\i\\jo ii? protein 

2 94 

— > w . www 

396 9 


riomo sapiens alternatively spiitcu piuuutu 

Uo lily ti-A.OI i J- 

3 97 

70 . 93 0 

3970 

D16593 

Homo sapiens hippocalcin 

190 

68.750 

3 971 

ABO lb d J U 

riomo sapiens type 11 nierrLDirane protein 

4 96 

44 . 172 

3972 

X75342 

Homo sapiens Shb 

~J / ~3 

— > -L- . U JX 

3973 

U2 8 993 

caenornaoai t is eiegans r^^uo.««2 gene prouutt 

2 56 

26 . 244 

3974 

V ^ >1 O O D 

X6422o 

Homo sapiens putative oncogene 

366 

95 . 000 

3975 

L123 51 

i^accnaromyces cerevisiae centromere / microtuijuie 
Dinoiny protein 


o . J ^ *± 

3976 

AF095737 

Homo sapiens unknown 

339 

65.556 

3977 

Y09321 

Homo sapiens TBP associated factor 

352 

96 .491 

3978 

M12140 

Homo sapiens envelope protein 

iiii 

fin nnn 

3979 

AF13 2148 

Drosophila melanogaster unknown 

i j j 

26 . 396 

3980 

AF010130 

Mus musculus neuregulin-3 

1 il c; 

7fl 1Q7 

3981 

rr ^ t /" a 

Z77660 

Homo sapiens Similarity to Human enoyl-CoA 

V» i vat-aoo f CM . T?PUM TTTTM ANT \ - r-TTMZl T?CT PIVTRT . * TO f> 1 1 

nyaratase \ovv . ctttuM rnji v irtj>i/ , ciox di^ioxj . ± w w o j. j. 
comes from th 



O Q O O 

/Villi / DDii 

riomo ydpicuo iiyputiictiLdi piutciu 

182 

59 . 016 

1 Q D "3 

von "JQA 

otis sorotct tuiiiciiueu jjiuuciu px.uuuuL 

580 

85 . 556 

"J Q Q A 

AT 131000 

XIOUIO UapiCllD v-Ul _L £t O {JlULCXll 

284 

86.275 

1 Q Q C 

j y 0 b 


rial 1 no rra 1 1 na t'^/no /*~»/~\ 1 1 rrp n 
uaiiUo yaiiua type v tuxxaycii 

597 

35 . 404 


U j£ Z >5 / O 

Urtmri oann pnci al t- f»T*n ^ r* i ir*=» 1 \r cnl "i r^*^»r? lDi^ridnct" 
riomo oapiciik) aiLciuaLxvcxy opx x^cu ^/•luuu^i. 

i]G-i ncr exon 13A 

294 

59 . 770 

J-70 / 

X61047 

HvHra «5t*» mini- rol 1 aoen 

204 

41 . 121 

-3QQO 

jtVIT U U J J 0 0 

rAPnoY"habdi i - i «5 pleoans No definition line found 

236 

24 . 242 

■1QQQ 
J I/O J 

Mft06 31 

Rattus norveaicus adenvlvl cvclase tvoe IV 

1363 

92 .478 

-Joan 

Mr w / w 0 

Homo fianipns enveloDe Drotein 

221 

49 . 333 

-J _? -L 

W VT J fa J 

Homo ^aniens 9 1 

1074 

69 . 758 

j -7 

x~i.LJ 1 J 1 i Tt J 

Colnmba livia 5 ' -nucleotidase 

382 

71 . 084 

3993 

D38112 

Homo sapiens cytochrome c oxidase subunit 3 

1577 

93 .333 

3994 

AF125175 

Homo sapiens angiopoiet in- related protein- 2 

418 

62.366 

3995 

S80119 

Rattus sp. reverse transcriptase homolog 

345 

31.937 

3996 

AB023186 

Homo sapiens KIAA096 9 protein 

268 

44 .231 

3997 

AF047695 

Homo sapiens cap-binding protein 4EHP 

338 

30.890 

3998 

AF004161 

Oryctolagus cuniculus peroxisomal Ca-dependent 

238 

48 .780 
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solute carrier 



3999 

U72973 

Sus scrofa calcium/ calmodulin- dependent protein 
kinase II isoform gamma-G 

613 

66.447 

4000 

U23450 

Caenorhabditis elegans No definition line found 

252 

28.994 

4001 

Z95584 

Mycobacterium tuberculosis hypothetical protein 
Rvll58c 

143 

35.294 

4002 

U50929 

Homo sapiens betaine : homocysteine 
methyl transferase 

1061 

76 . 166 

4003 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

270 

49.020 

4004 

U76846 

Arabidopsis thaliana ubiquitin-specif ic 
protease 

215 

30 . 168 

4005 

M32865 

Homo sapiens Ku protein subunit 

221 

85 . 366 

4006 

AJ235270 

Rickettsia prowazekii PROBABLE OXYGEN- 
INDEPENDENT COPRO PORPHYRINOGEN III OXIDASE 
(hemN) 

709 

34 . 188 

4007 

AB029022 

Homo sapiens KIAA1099 protein 

2590 

75.940 

4008 

AB023210 

Homo sapiens KIAA0993 protein 

166 

35 . 135 

4009 

Y13367 

Homo sapiens phosphoinositide 3 -kinase 

667 

79 . 310 

4010 

U83115 

Homo sapiens non-lens beta gamma- cry stall in 
like protein 

479 

38 . 587 

4011 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

296 

61 . 842 

4012 

Z75331 

Homo sapiens nuclear protein SA-2 

563 

69.065 

4013 

U60553 

Homo sapiens carboxylesterase hCE-2 

254 

81 .250 

4014 

L29457 

Mus musculus dynamin 

247 

47 .312 

4015 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

371 

70.667 

4016 

AL050321 

Homo sapiens dJ717M23.1 (novel gene) 

2757 

94.533 

4017 

M18247 

Feline leukemia virus gag-pol precursor 
polyprotein gPr80 

394 

31.780 

4018 

AE001691 

Thermotoga maritima conserved hypothetical 
protein 

189 

33 .333 

4019 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

423 

59 . 167 

4020 

Z97200 

Homo sapiens dJ79C4.1.2 (Homeobox protein PMX-1 
(PHOX1) isoform 2) 

68 

34 . 783 

4021 

S70011 

Rattus sp. tricarboxylate carrier 

2110 

92 . 711 

4022 

Z37525 

Xenopus laevis XDCoH 

251 

65.385 

4023 

X92485 

Plasmodium vivax pval 

232 

58.929 

4024 

M19651 

Rattus norvegicus fos- related antigen 

137 

33 . 684 

4025 

AJ010071 

Homo sapiens TOMl-like protein 

902 

94 . 702 

4026 

AB020676 

Homo sapiens KIAA0869 protein 

478 

46 . 707 

4027 

Y17833 

Human endogenous retrovirus K env protein 

417 

47 . 863 

4028 

U15174 

Homo sapiens BCL2 /adenovirus E1B 19kD- 
interacting protein 3 

239 

66 .667 

4029 

AF132150 

Drosophila melanogaster unknown 

445 

44 . 828 

4030 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

358 

63 . 158 

4031 

AF056116 

Fugu rubripes unknown 

955 

70 .732 

4032 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

280 

50 . 000 

4033 

K03036 

Mus musculus alpha -1 type I procollagen 

166 

32.895 

4034 

X01068 

Bombyx mori Hc-B chorion protein 

282 

63.077 | 

4035 

X71973 

Homo sapiens phospholipid hydroperoxide 
glutathione peroxidase 

1382 

100.000 

4036 

V00662 

Homo sapiens ATPase 6 

1039 

92 .473 
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4037 

AC005546 

Homo sapiens R2 94 25 1 

2208 

89. 973 

4038 

X13459 

Mus sp. laminin C- terminal fragment 

921 

65.581 

4039 

AF049588 

Canis familiaris synapsin I 

188 

30.288 

4040 

AF027956 

Homo sapiens G protein- coupled receptor 

320 

25.132 

4041 

AJ133120 

Rattus norvegicus Proline rich synapse 
associated protein 2 

847 

93.382 

4042 

AL110226 

Homo sapiens hypothetical protein 

535 

31.694 

4043 

AP000061 

Aeropyrum pernix 235aa long hypothetical 
protein 

170 

30.337 

4044 

AF007826 

Homo sapiens bax epsilon 

165 

50. 000 

4045 

AF100426 

Streptococcus parasanguinis f imbriae-associated 
protein Fapl 

164 

26.667 

4046 

Z82268 

Unknown predicted using Genefinder; similar to 
CUTICLE COLLAGEN 34; cDNA EST EMBL:D65629 comes 
from 

211 

33 . 775 

4047 

AL049946 

Homo sapiens hypothetical protein 

1937 

46.597 

4048 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

364 

64 . 198 

4049 

AJ001701 

Homo sapiens deoxyhypusine synthase 

488 

81.720 

4050 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

213 

63.636 

4051 

X74370 

Homo sapiens mucin 

167 

29. 762 

4052 

D87908 

Mus musculus nuclear protein np95 

703 

64 .458 

4053 

X92485 

Plasmodium vivax pval 

230 

47.826 

4054 

AF010144 

Homo sapiens neuronal thread protein AD7C-NTP 

323 

67. 089 

4055 

Z71264 

Caenorhabditis elegans similar to C2 domain 

61 

22 . 642 

4056 

AF108843 

Homo sapiens env protein 

348 

72 . 727 

4057 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

467 

73.196 

4058 

M74055 

Homo sapiens thromboxane synthase 

203 

82 . 051 

4059 

AL050382 

Homo sapiens hypothetical protein 

225 

34 .694 

4060 

AF132883 

Caenorhabditis elegans UNC- 52/Perlecan 

384 

32.240 

4061 

AB007930 

Homo sapiens KIAA04 61 perotein 

393 

22 . 981 

4062 

Z35602 

Caenorhabditis elegans similar to monoamine 
oxidase; cDNA EST EMBL:T01957 comes from this 
gene; cDNA EST yk324d6.3 comes from this gene; 
cDNA EST yk348h2.3 comes from this gene 

528 

38.168 

4063 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

342 

66 .234 

4064 

X92485 

Plasmodium vivax pval 

140 

63 .889 

4065 

X92485 

Plasmodium vivax pval 

171 

37.079 

4066 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

141 

50.000 

4067 

AB029018 

Homo sapiens KIAA1095 protein 

867 

59.641 

4068 

AF145690 

Drosophila melanogaster BcDNA. LD2 86 57 

1142 

46.036 

4069 

M34551 

Homo sapiens 52 -kD Ro/SSA ribonucleoprotein 

716 

34.574 

4070 

X78928 

Homo sapiens zinc finger protein 

183 

34.694 

4071 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

347 

64.000 

4072 

AC005396 

Arabidopsis thaliana putative proline-rich cell 
wall protein 

174 

28.571 

4073 

D87515 

Rattus norvegicus aminopeptidase-B 

1786 

49.130 

4074 

AF015264 

Rattus norvegicus golgi peripheral membrane 
protein p65 

1576 

72.997 


AD O^OO 

Mus musculus proline rich protein 

231 

31.333 

4076 

S67247 

Homo sapiens smooth muscle myosin heavy chain 
isoform SMemb 

197 

38.554 

4077 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

364 

67 .442 

4078 

AJ243460 

Leishmania major proteophosphoglycan 

189 i 

31.278 
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4079 

AJ248288 

Pyrococcus abyssi hypothetical protein 

378 

39.394 

4080 

AJ3 88 557 

Canis familiaris zinc finger protein 

1058 

55.311 

4081 

AF041082 

Rattus norvegicus transmembrane receptor Robol 

194 

34.286 

4082 

AF181646 

Drosophila melanogaster BcDNA. GH12326 

300 

31.847 

4083 

L36315 

Mus musculus zinc finger protein 

2474 

89.367 

4084 

D70831 

Homo sapiens Zinc- finger protein 

1303 

58 . 657 

4085 

AF083110 

Homo sapiens sirtuin type 5 

261 

93 . 182 

4086 

AF072508 

Homo sapiens envelope protein 

370 

79.221 

4087 

M38257 

Escherichia coli ORF 235 

696 

99.057 

4088 

X05173 

Escherichia coli NR(I) (glnG gene product) (AA 
1-468) 

995 

98.065 

4089 

U88169 

Caenorhabditis elegans similar to molybdoterin 
biosynthesis MOEB proteins 

922 

57.874 

4090 

AF105228 

Bos taurus tuftelin 

355 

31.308 

4091 

AF169635 

Sus scrofa Niemann-Pick C disease protein 

478 

59.167 

4092 

Z46793 

Caenorhabditis elegans cDNA EST EMBL:D75782 
comes from this gene; cDNA EST EMBL:D72838 
comes from this gene; cDNA EST yk504a5.3 comes 
from this gene 

244 

23.858 

4093 

J03137 

Bos taurus phospholipase C 

7783 

97.366 

4094 

AF146277 

Homo sapiens adapter protein CMS 

3820 

100 . 000 

4095 

U09367 

Homo sapiens zinc finger protein ZNF136 

965 

42 . 756 

4096 

D90907 

Synechocystis sp . isoleucyl- tRNA synthetase 

965 

40.189 

4097 

U10435 

Mus musculus Requiem 

432 

33.645 

4098 

X80111 

Drosophila melanogaster synapse-associated 
protein 

468 

39.556 

4099 

AL032626 

Caenorhabditis elegans predicted using 
Genef inder 

735 

51.031 

4100 

AF081258 

Homo sapiens testis- specific chromodomain Y- 
like protein 

3952 

100.000 

4101 

D63877 

Homo sapiens KIAA0157 gene product is novel. 

2756 

99.523 

4102 

AF116547 

Homo sapiens cysteine sulfinic acid 
decarboxylase- related protein 3 

3299 

99.797 

4103 

AL031515 

Streptomyces coelicolor hypothetical protein 
SC5C7 . 08 

377 

46.721 

4104 

Y10388 

Homo sapiens Graf protein 

293 

41.667 

4105 

D61689 

Mus musculus SOX-LZ 

1146 

96.591 

4106 

AL035424 

Homo sapiens dA22D12.1 (novel protein similar 
to Drosophila Kelch (Ring Canal protein, KEL) 
and a heterogenous set of other types of 
proteins ) 

2002 

75.956 

4107 

AL035652 

Homo sapiens dJ"U6.1 ( topoisomerase (DNA) I) 

179 

100 . 000 

4108 

Z92770 

Mycobacterium tuberculosis fadE2 

267 

56 . 627 

4109 

AL035524 

Arabidopsis thaliana putative protein 

498 

50.993 

4110 

X01630 

Homo sapiens argininosuccinate synthetase (aa 
1-412) 

151 

100 .000 

4111 

L24920 

Pan troglodytes complement receptor 1 

987 

28 . 800 

4112 

AC002126 \ 

Homo sapiens R31240 1 

655 

100 . 000 

4113 

AL033377 

Homo sapiens dJ287G14.2 (PUTATIVE novel seven 
transmembrane domain protein) 

3224 

100.000 

4114 

AB006533 

Homo sapiens DNA helicase 

553 

77 .477 

4115 

AF087697 

Rattus norvegicus dig 3 

283 

43 . 697 

4116 

AL021917 

Homo sapiens dJ45P21.2 (butyrophilin) 

613 

98.889 

4117 

AF113136 

Homo sapiens IL-1 receptor-associated-kinase-M; 
IRAK-M 

2084 

100.000 
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4118 

AB023207 

Homo sapiens KIAA0990 protein 

533 

37.968 

4119 

X58374 

Drosophila melanogaster crn 

2515 

66 .415 

4120 

Y08302 

Homo sapiens mitogen-activated protein kinase 
phosphatase 4 

1182 

100. 000 

4121 

AL031667 

Homo sapiens dJ620Ell.la (novel Helicase C- 
terminal domain and SNF2 N- terminal domains 
containing protein, similar to KIAA0308) 

4841 

100.000 

412 2 

AB011121 

Homo sapiens KIAA054 9 protein 

3094 

99.787 

4123 

AF15 5115 

Homo sapiens NY-REN- 5 8 antigen 

172 

100. 000 

4124 

APO 00005 

Pyrococcus horikoshii l4 9aa long hypothetical 
methylmalonyl-CoA decarboxylase gamma chain 

225 

38.554 

4125 

L48516 

Homo sapiens paraoxonase-3 

2210 

98.534 

4126 

AC0 072 92 

Homo sapiens R31167 1, partial protein 

2907 

99.302 

412 7 

D90746 

Escherichia coli SocA3 protein 

647 

100. 000 

4128 

AC005594 

Homo sapiens R26 984 1 

608 

100.000 

412 9 

AF143 536 

Homo sapiens colon cancer-associated protein 
Micl 

3880 

97.537 

4130 

U49082 

Homo sapiens transporter protein 

635 

62.179 

4131 

AB018275 

Homo sapiens KIAA0732 protein 

5592 

100.000 

413 2 

M34059 

Homo sapiens beta-globin 

236 

80.488 

4133 

D13637 

Homo sapiens KIAA0012 

267 

34.307 

4134 

U90126 

Bos taurus ABC transporter 

468 

60.870 

4135 

S61069 

Homo sapiens reverse transcriptase homolog=pol 
{retroviral element} 

271 

75.806 

4136 

X66899 

Homo sapiens RNA binding protein 

440 

100.000 

4137 

M34458 

Homo sapiens lamin B 

203 

91.667 

4138 

J03998 

Plasmodium falciparum glutamic acid- rich 
protein 

153 

30 .645 

4139 

AE001394 

Plasmodium falciparum predicted integral 
membrane protein 

78 

62.500 

414 0 

AF036 54 8 

Rattus norvegicus RGC-3 2 

683 

91.304 

4141 

AF000423 

Rattus norvegicus synaptotagmin XI 

845 

99.219 

4142 

X53773 

Rattus norvegicus alpha- c large chain (AA 1- 
938) 

1831 

68 . 974 

4143 

A68194 

unidentified unnamed protein product 

4024 

100.000 

4144 

AB011174 

Homo sapiens KIAA0602 protein 

6402 

99.896 j 

4145 

AF035526 

Mus musculus kanadaptin 

2540 

79.716 

4146 

AF023657 

Rattus norvegicus endo-alpha-D-mannosidase 

1590 

68.285 

4147 

U6 2 587 

Cricetulus griseus beta-l,6-N- 
acetylglucosaminyl transferase 

225 

56.863 

414 8 

7vt r\ a n c n 
AXj04 9557 

Homo sapiens dJ773A18.2 (PROBABLE ATP-DEPENDENT 
KJMA HkJjlOA&E P4 7 HOMOLOG) 

5454 

99.879 


awrti/i 7 a c 

nr U j4 / *k 5 

Mus musculus LNXp8 0 

2793 

88 .486 



Homo sapiens KIAA0882 protein 

6096 

100 . 000 

41 CI 

4 -L J X 

T 1 coi 0 

Caenorhabdi t is elegans putative 

743 

70 . 667 


7^171 cool 1 

Homo sapiens protocadherin alpha 3 

6162 

99.368 

4153 

U88908 

Mus musculus inhibitor of apoptosis protein 1 

223 

41.667 

4154 


Homo sapiens KIAA0001 

865 

49.807 

4155 

X94912 

Homo sapiens Pr22 

252 

92.857 

4156 

1^ -L. «J J U O 

r\Kj\\\\j bapiens uunior suppressing oXJr CL/WA 4 

2239 

99 . 392 

4157 

AL110239 

Homo sapiens hypothetical protein 

247 

97.436 

4158 

U02082 

Homo sapiens guanine nucleotide regulatory 
protein 

822 

43 . 910 

4159 

Z66496 

Unknown cDNA EST EMBL:D71941 comes from this 
gene; cDNA EST EMBL.-D746 91 comes from this 

240 

28.387 \ 
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gene ; cDNA 



4160 

Z48615 

Homo sapiens serine/threonine kinase with SH3 
domain, leucine ziDDer domain, and nroline irich 
domain 

275 

43 .846 

4161 

X66435 

Homo sapiens Hydroxymethylglutaryl CoA Synthase 

359 

96 . 721 

4162 

Z49125 

Unknown similarity to Trichostrongylus 
colubrif ormis 11 kd secretory protein (Swiss 
Prot accession 

448 

36 . 932 

4163 

AF072372 

Mus musculus lysosomal trafficking regulator 2 

1554 

99 . 127 

4164 

M36501 

Homo sapiens alpha- 2 -macroqlobulin 

642 

44 . 643 

4165 

U41164 

Rattus norvegicus Cys2/His2 zinc finger protein 

721 

86 . 957 

4166 

AL035678 

Arabidopsis thaliana putative protein 

2127 

65 . 971 

4167 

AF078779 

Rattus norvegicus putative four repeat ion 
channel 

796 

93 .600 

4168 

Z28278 

Saccharomyces cerevisiae ORF YKR053C 

281 

25 .463 

4169 

AB028981 

Homo sapiens KIAA1058 protein 

2502 

67 . 016 

4170 

AF007157 

Homo sapiens unknown 

283 7 

lOO OflO 

4171 

Z77654 

Caenorhabdi t is elegans predicted using 
Genefinder; Similarity to Drosophila RNA 
binding protein squid (SW:SQD DROME); cDNA EST 
yk638al.3 comes from this gene 

215 

34 .483 

4172 

U62810 

Mesocricetus auratus potassium channel Kv8 . 1 

468 

100 . 000 

4173 

X97999 

Homo sapiens transcription factor IID 

386 

47 . 297 

4174 

A01592 

Homo sapiens haemoglobin A beta chain 

434 

95 .455 

4175 

AB001563 

Homo sapiens RES4-22D 

277 

35 . 099 

4176 

AF103017 

Homo sapiens uroporphyrinogen decarboxylase 

248 

77 . 273 

4177 

A01592 

Homo sapiens haemoglobin A beta chain 

382 

86 .567 

4178 

AB023207 

Homo sapiens KIAA0990 protein 

663 

35 . 621 

4179 

Z83844 

Homo sapiens similar to SH3 -binding protein 

372 

98 . 214 

4180 

AF146688 

Fugu rubripes sex comb on midleg-like 2 protein 

188 

40 . 000 

4181 

X13621 

Homo sapiens HNP-3 defensin (AA 1- 94) 

162 

95.238 

4182 

AC002544 

Homo sapiens Translation initiation factor elF- 
pllO 

1750 

98 . 893 

4183 

A01592 

Homo sapiens haemoglobin A beta chain 

434 

92 .754 

4184 

AF032668 

Rattus norvegicus rseclS 

411 

73 . 810 

4185 

AF073299 

Homo sapiens Na+/H+ exchanger isoform 2 

2255 

100 . 000 

4186 

U25691 

Mus musculus lymphocyte specific helicase 

258 

87 . 500 

4187 

S72008 

Homo sapiens CDC10 homolog=hCDC10 

£. , 2 

199 

94 .286 

4188 

X16491 

Dictyostelium discoideum spore coat protein 
sp96 

198 

29.240 

4189 

AJ001019 

Homo sapiens ring finger protein 

343 

46 .429 

4190 

AF125569 

Homo sapiens tumor suppressing STF cDNA 6 

589 

100 . 000 

4191 

AB002321 

Homo sapiens KIAA0323 

5064 

100 . 000 

4192 

L41560 

Homo sapiens pterin- 4a- carbinolamine 
dehydratase 

319 

67 . 105 

4193 

AB006625 

Homo sapiens The human homolog of a mouse 
imprinted gene, Peg3 . 

7665 

99. 912 

4194 

AC004678 

Homo sapiens R34 094 1 

292 

93 . 878 

4195 

L13200 

Caenorhabditis elegans putative 

503 

49. 367 

4196 

AL032639 

Unknown similar to Zinc finger, C3HC4 type 
(RING finger); cDNA EST EMBL:C08103 comes from 
this ge 

171 

29.070 

4197 

U87318 

Xenopus laevis NaDC-2 

682 

69.286 

4198 

AF111423 

Xenopus laevis chromosome condensation protein 
XCAP-G 

1706 

54 .386 


t*7 
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MotTio sapiens ujna. oinoing protein RFXi 

3 080 

99 . 785 

4200 

U06631 

Homo sapiens homologous to mouse gene 
rL^zb :beiiDanA Accession jNumDer r^iy _3 d t> fi 

350 

39.735 

4201 

AL080123 

Homo sapiens hypothetical protein 

978 

64.000 

4202 

AC0O53 95 

Arabidopsis thaliana hypothetical protein 

503 

35 . 176 

4203 

X83544 

Homo sapiens DAP -3 

247 

97.368 

4204 

AF099013 

Homo sapiens glucocorticoid modulatory element 
binding protein- 1 

3686 

100.000 

4205 

AF005355 

Oryctolagus cuniculus translation initiation 
factor eIF2C 

3537 

99 . 809 

4206 

AC002131 

Arabidopsis thaliana Similar to seryl-tRNA 
synthetase gb|U10400 from S cerevisiae. EST 
go|JNyDD2 / comes iroiu cms gene. 

374 

49.573 


>vi_i\j 0 w u a 0 

nomo sapiens nypocneticai prouein 

1 A C A 

1464 

99 .533 

4208 

AL031427 

Homo sapiens dJ167A19.4 (novel protein) 

146 

92 . 000 

4209 

L10326 

Rattus norvegicus GTP-binding protein alpha- s 
subuni t 

154 

100.000 

4210 

AB028944 

Homo sapiens KIAA1021 protein 

5116 

99.742 

4211 

Z50194 

Homo sapiens PQ-rich protein 

208 

52 . 174 

4212 

AC006042 

Homo sapiens supported by human ESTs 

AI681256 .1 (NID:g4891438) , N32168 . 1 (NID :gll52567 ) 

, and genscan 

1016 

100 . 000 

4213 

X67337 

Homo sapiens Human pre-mRNA cleavage factor I 
68 kDa subunit 

370 

65.169 

4214 

AF034803 

Homo sapiens liprin-beta2 

581 

95 .604 

4215 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

264 

77.358 

4216 

AF117758 

Homo sapiens secreted f rizzled-related protein 
5 

2253 

100.000 

4217 

AF096300 

Homo sapiens HPK/GCK-like kinase HGK 

1707 

92 . 958 

4218 

AF043250 

Homo sapiens mitochondrial outer membrane 
protein 

1319 

64.561 

4219 

Z81505 

Unknown similar to Zinc finger, C3HC4 type 
(RING finger); cDNA EST EMBL:D28025 comes from 
this gene 

1156 

57.045 

422 0 

Li04 ODD 

Homo sapiens carbonic anhydrase- related protein 

V X ± ± 

220 

94 . 118 

4221 

AC004084 

Homo sapiens similar to GTPase-activating 
proteins; 35% similar to JC5047 (PID :g2136083 ) 

1268 

81.200 

4222 

A01573 

synthetic construct cystatin C 

314 

92 .000 

422 3 

ABQ14 57 9 

Homo sapiens KIAA067 9 protein 

1735 

98 . 872 

4224 

AF087433 

Rattus norvegicus leprecan 

3317 

92 . 075 

4225 

AL034488 

Caenorhabditis elegans predicted using 
tenet lnaer ; cdna est emb.L:C087 71 comes from 
cms gene; cjjxma c*oi Jc.rixJJLi : \~\) /412 comes irom cms 
gene 

400 

47.015 

*± 0 

YOQCQl 



D / . JUb 

4227 

AJ007583 

Homo sapiens acetylglucosaminyltransf erase- like 

LJ X UUCJ.I1 

1044 

67.757 

4228 

AF132552 

Drosophila melanogaster BcDNA. GM01838 

1600 

65 . 395 

4229 

L05779 

Homo sapiens cytosolic epoxide hydrolase 

159 

100.000 

4230 

X97818 

Mus musculus samaphorin G 

188 

96.296 

4231 

U38252 

Mus musculus fractionated X- irradiation- induced 
29 thymoma 

469 

97.333 

4232 

X74794 

Homo sapiens PI Cdc21 protein 

5616 

99.770 
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4233 

X98259 

Homo sapiens M-phase phosphoprotein 8 

454 

100. 000 

4234 

D83043 

Homo sapiens allele A*2711 

312 

90.000 

4235 

AL.080150 

Homo sapiens hypothetical protein 

5789 

99. 542 

4236 

AC004594 

Homo sapiens Ca2+ dependent activator protein 
for secretion; similar to D86214 (NID :gl398903) 

2026 

98.418 

4237 

AF117897 

Bos taurus rabll binding protein 

278 

88.000 

4238 

D31888 

Homo sapiens KIAA0071 

410 

67.347 

4239 

AF160934 

Drosophila melanogaster BcDNA. LD1418 9 

204 

64.286 

4240 

Z75330 

Homo sapiens nuclear protein SA-1 

3665 

99.819 

4241 

AB007918 

Homo sapiens KIAA044 9 protein 

745 

69.565 

4242 

AB030644 

Rattus norvegicus tudor repeat associator with 
PCTAIRE 2 

3647 

89.474 

4243 

AB030502 

Xenopus laevis XDRP1 

435 

72 . 000 

4244 

Z24725 

Homo sapiens mitogen inducible gene mig-2 

1993 

52.783 

4245 

Z67961 

Schizosaccharomyces pombe hypothetical protein 

440 

33 . 188 

4246 

AB002326 

Homo sapiens KIAA0328 

11013 

100 . 000 

4247 

M63180 

Homo sapiens threonyl - tRNA synthetase 

1846 

59.111 

4248 

AF093097 

Homo sapiens putative RNA-binding protein Q99 

264 

90.476 

4249 

AL080158 

Homo sapiens hypothetical protein 

801 

35.698 

4250 

U23502 

Plasmodium chabaudi chabaudi POM1 

487 

46.012 

4251 

Y17392 

Homo sapiens prefoldin subunit 1 

705 

98.291 

4252 

Z69634 

Unknown cDNA EST EMBL:C11678 comes from this 
gene; cDNA EST EMBL:C13926 comes from this 
gene ; cDNA 

704 

34.375 

4253 

U2 2 3 8 7 

Homo sapiens immunoglobulin heavy chain 

198 

64.583 

4254 

AL050163 

Homo sapiens hypothetical protein 

735 

100.000 

4255 

AB007945 

Homo sapiens KIAA04 76 protein 

9511 

99. 711 

4256 

Z37166 

Homo sapiens nuclear RNA helicase (DEAD family) 

1864 

100. 000 

4257 

X06764 

Homo sapiens Ig(k) L- chain precursor 

696 

84.328 

4258 

AF048731 

Homo sapiens cyclin T2a 

4386 

100.000 

4259 

M28732 

Mus musculus beta -tubulin 

281 

66 . 154 

4260 

Y10392 

Human endogenous retrovirus K protease 

452 

49. 206 

4261 

AC003682 

Homo sapiens R27945 2 

1686 

100.000 

4262 

AF044033 

Marmota marmota olfactory receptor 

267 

92.857 | 

4263 

L07924 

Mus musculus guanine nucleotide dissociation 
stimulator 

3514 

84.968 

4264 

X75931 

Bos taurus Cleavage and Polyadenylation 
specificity factor (CPSF) lOOkD subunit 

1686 

99.222 

4265 

AC006264 

Arabidopsis thaliana unknown protein 

278 

41.573 

4266 

X84101 

Streptomyces clavuligerus Proclavaminic acid 
amidino hydrolase 

631 

38.340 

4267 

AF108420 

Fugu rubripes 1-aminocyclopropane-carboxilate 
synthase 

1001 

59. 149 

4268 

AC009325 

Arabidopsis thaliana putative 
phosphate/phosphoenolpyruvate translocator 

197 

38.571 

4269 

Z74201 

Saccharomyces cerevisiae ORF YDL153C 

215 

30.380 

4270 

AL110193 

Homo sapiens hypothetical protein 

2638 

99.229 1 

4271 

AF187318 

Homo sapiens F-box protein Fbx2 

444 

55.263 

4272 

Z77655 

Caenorhabditis elegans predicted using 
^enetinaer, similar co c>v nana \z domains) 

644 

34.184 

4273 

AF124511 

Gallus gallus BVES 

339 

36.757 

4274 

AF056035 

Rattus norvegicus s-nexilin 

3148 

85. 841 

4275 

AJ005621 

Mus musculus skeletal and cardiac muscle- 
specific gene 

1791 

95.139 

4276 

X58288 

Homo sapiens protein- tyrosine phosphatase 

4182 

99. 836 
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4277 

M74495 

Mus musculus adenylosuccinate synthetase 

1924 

92.233 

4278 

M26460 

Homo sapiens retinoblastoma 1 

155 

36.667 

4279 

M83679 

Rattus norvegicus RAB15 

368 

96.491 

4280 

AL021106 

Unknown /predict ion= (method : " "genscan" " , 
version: ""1.0"", score: ""113.71"") ; 
/predictions (method: 

747 

50.215 

4281 

X05806 

Acetabularia mediterranea put . ORF 

189 

71.429 

4282 

AB007900 

Homo sapiens HH0452 cDNA clone for KIAA0440 has 
a 438-bp insertion at position 1711 of the 
sequence of KIAA0440. 

774 

40.389 

4283 

M95762 

Rattus norvegicus GABA transporter 

507 

81.395 

4284 

Z99118 

Bacillus subtilis aspartyl- tRNA synthetase 

1374 

43 . 667 

4285 

U04301 

Oryctolagus cuniculus mannosyl -oligosaccharide 
alpha- 1, 2-mannosidase 

1749 

67.385 

4286 

M55176 

Rhizomucor racemosus MRAS2 gene product 

207 

32 . 990 

4287 

Z70310 

Caenorhabditis elegans similar to Glutathione 
S-transf erases . ; cDNA EST yk536e7.3 comes from 
this gene 

763 

44.565 

4288 

AB020676 

Homo sapiens KIAA0869 protein 

814 

42. 338 

4289 

AJ006054 

Homo sapiens UDP glucuronosyltransf erase 

345 

76. 923 

4290 

U80223 

Drosophila melanogaster eukaryotic initiation 
factor eIF-2 alpha kinase; DGCN2 

783 

29.412 

4291 

Z34286 

Oryctolagus cuniculus ORF might extend further 
in 5' -direction 

432 

39.205 

4292 

AB007925 

Homo sapiens KIAA04 56 protein 

235 

45.122 

4293 

AF095150 

Homo sapiens protein O-mannosyl- transferase 1 

2731 

100.000 

4294 

D87433 

Homo sapiens KIAA0246 

727 

38 . 323 

4295 

D86984 

Homo sapiens similar to yeast adenylate cyclase 
(S56776) 

1377 

51.768 

4296 

AF151811 

Homo sapiens CGI- 53 protein 

2164 

99.096 

4297 

AB011399 

Homo sapiens AF-6 

515 

100.000 

4298 

AL080123 

Homo sapiens hypothetical protein 

4248 

99. 832 

4299 

AF134726 

Homo sapiens NG22 

2967 

97.821 

4300 

AB011084 

Homo sapiens KIAA0512 protein 

347 

30.342 

4301 

AF092094 

Homo sapiens AP-4 adaptor complex beta4 subunit 

3564 

99.441 

4302 

L14745 

Caenorhabditis elegans homology with GTP 
binding protein; putative 

546 

51. 592 

4303 

AF078832 

Homo sapiens methyl -CpG binding protein splice 
variant 3 

3931 

100. 000 

4304 

AF132484 

Mus musculus unknown 

523 

51.266 

4305 

AL050393 

Homo sapiens hypothetical protein 

420 

51.261 

4306 

D88158 

Sus scrofa cytochrome b561 

433 

50.350 

4307 

D86964 

Homo sapiens similar to a human major CRK- 
binding protein DOCK180. 

12180 

100.000 

4308 

AL110151 

Homo sapiens hypothetical protein 

2418 

99. 721 

4309 

AF140690 

Homo sapiens melusin 

307 

47 . 778 

4310 

AC002396 

Arabidopsis thaliana Hypothetical protein 

1347 

51 . 157 

4311 

AF068718 

Caenorhabditis elegans No definition line found 

514 

32 . 394 

4312 

AB023157 

Homo sapiens KIAA0940 protein 

1944 

91. 391 


At U o y b o y 

Homo sapiens act in binding protein MAYVEN 

404 

28.421 

4314 

AB028981 

Homo sapiens KIAA1058 protein 

1609 

68. 946 

4315 

AF133123 

Homo sapiens transcription factor IIIC102 

4090 

100.000 

4316 

X78927 

Homo sapiens zinc finger protein 

3146 

99.307 

4317 

AF028722 

Mus musculus fetal globin inducing factor 

490 

83 . 908 

4318 

M55542 

Homo sapiens guanylate binding protein isoform 

1774 

64 . 678 
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4319 

AB029015 

Homo sapiens KIAA1092 protein 

7731 

100. 000 

4320 

AB014548 

Homo sapiens KIAA0648 protein 

5548 

100.000 

4321 

Z68753 

Unknown predicted using Genefinder; Similarity 
to Glucose- repressible alcoihol dehydrogenase 
transc 

420 

46 . 875 

4322 

U70855 

Caenorhabditis elegans similar to the RAS gene 
family 

552 

35.039 

4323 

Y00649 

Homo sapiens CR2 receptor 

217 

27. 933 

4324 

D88315 

Mus musculus tetracycline transporter- like 
protein 

2647 

98 . 768 

4325 

Z68215 

Caenorhabditis elegans C53B4.4c 

156 

24 . 370 

4326 

AJ009698 

Rattus norvegicus embigin protein 

535 

80 .392 

4327 

AB001488 

Bacillus subtilis FUNCTION UNKNOWN. 

182 

36.364 

4328 

D63877 

Homo sapiens KIAA0157 gene product is novel. 

208 

28.972 

4329 

M14949 

Homo sapiens R-ras 

1443 

100.000 

4330 

D42041 

Homo sapiens The hal225 gene product is related 
to human alpha -glucos idase . 

6494 

99. 787 

4331 

AJ001015 

Homo sapiens RAMP2 

574 

98 . 824 

4332 

AF186461 

Rattus norvegicus ring finger protein Fxy 

295 

25.075 

4333 

AF064255 

Homo sapiens very long- chain acyl-CoA 
synthetase homolog 2; VLCS-H2 

1347 

100.000 

4334 

AC004955 

Homo sapiens supported by ESTs T61992 
(NID:g665235) and W26450 (NID : gl307l67 ) and 
Genscan 

1096 

100.000 

4335 

Z46676 

Unknown weak similarity to microtubule 
associated proteins; cDNA EST EMBL:T01154 comes 
from this ge 

443 

35.514 

4336 

U40029 

Caenorhabditis elegans No definition line found 

1552 

47 . 647 

4337 

AL032626 

Unknown cDNA EST EMBL:D70654 comes from this 
gene; cDNA EST EMBL:Z143 59 comes from this 
gene ; cDN 

957 

48 . 000 

4338 

AC004883 

Homo sapiens similar to KIAA0766; similar to 
PID:g3882253 

1875 

99.296 

4339 

X78998 

Homo sapiens endosomal protein 

372 

24 . 702 

4340 

AF153208 

Homo sapiens GC-rich sequence DNA- binding 
factor candidate 

588 

81.667 

4341 

U00050 

Caenorhabditis elegans No definition line found 

362 

32 . 512 

4342 

Z81586 

Caenorhabditis elegans cDNA EST yk335d8.5 comes 
from this gene; cDNA EST yk335d8.3 comes from 
this gene; cDNA EST yk656d4.3 comes from this 
gene 

624 

31.796 

4343 

AJ388555 

Canis familiaris hypothetical protein 

915 

43 .021 

4344 

U79776 

Mus musculus ajuba; jub 

431 

82.090 

4345 

U20554 

Drosophila melanogaster UDP- 

glucose : glycoprotein glucosyl transferase 

precursor 

2134 

67.550 

4346 

AB011097 

Homo sapiens KIAA052 5 protein 

1360 

44 . 270 

4347 

AF124512 

Homo sapiens BVES 

907 

100 . 000 

4348 

A47122 

unidentified unnamed protein product 

1067 

100 . 000 

4349 

AC006593 

Arabidopsis thaliana putative transmembrane 
protein 

490 

30. 350 

4350 

M23159 

Cricetus cricetus DHFR-coamplif ied protein 

496 

85.542 

4351 

D87515 

Rattus norvegicus aminopept idase -B 

2615 

92.593 

4352 

AF090136 

Rattus norvegicus lin-7-C 

322 

100.000 
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4353 

D79998 

Homo sapiens KIAA0176 

511 

70.370 

4354 

AF098993 

Caenorhabditis elegans No definition line found 

296 

25.000 

4355 

AF087826 

Mus musculus claudin-8 

646 

79.310 

4356 

AL023704 

Schizosaccharomyces pombe weak similarity to 
B.subtilis spore outgrowth f actor B 

1525 

56.585 

4357 

AJ223301 

Bos taurus aralkyl acyl-CoA: amino acid N- 
acyl transferase 

694 

38 .806 

4358 

AC004908 

Homo sapiens zinc finger protein from gene of 
uncertain exon structure; similar to Q99676 
(PID :g3025333 ) 

1961 

100.000 

4359 

AF099032 

Homo sapiens embryonic ectoderm development 
protein short isoform 

173 

96 . 154 

a ~i c n 

Ar 1 ,33670 

Mus musculus ARL-6 interacting protein- 2 

620 

96.939 

4361 

M98502 

Mus musculus pMLZ-4 

1035 

59.535 

4362 

M97204 

Drosophila melanogaster goliath protein 

305 

67.273 

4363 

U47924 

Homo sapiens C9 

905 

100.000 

4364 

AL080144 

Homo sapiens hypothetical protein 

4653 

99.866 

4365 

AJ222798 

Lycopersicon esculentum tDETl protein 

277 

37.607 

4366 

AF024497 

Caenorhabditis elegans weak similarity to 
drosophila tyrosine kinase (GB:G455391) 

333 

29.515 

4367 

Z82083 

Caenorhabditis elegans ZK1010.2 

407 

28.621 

4368 

AB020626 

Homo sapiens KIAA0819 protein 

330 

40 . 972 

4369 

D26488 

Homo sapiens This sequence is almost identical 
to HUMRSC433, D13632. 

4588 

100.000 

4370 

AL031387 

Homo sapiens dJ596C15.1.1 (novel protein) 
(isoform 1) 

1083 

100.000 

4371 

AL049699 

Homo sapiens dJ747H23.2 (novel protein) 

1297 

100.000 

4372 

D90916 

Synechocystis sp. hypothetical protein 

522 

52.695 

4373 

AC006530 

Homo sapiens unknown 

681 

55.758 

4374 

AB018323 

Homo sapiens KIAA0780 protein 

866 

98 .473 

4375 

AC004020 

Homo sapiens Unknown gene product 

958 

97.351 

4376 

AL023518 

Schizosaccharomyces pombe conserved 
hypothetical protein 

667 

52 . 857 

4377 

AF151799 

Homo sapiens CGI -4 0 protein 

424 

71.605 

4378 

AF070637 

Homo sapiens unknown 

1278 

100.000 

4379 

S60885 

Mus sp. LYAR=cell growth regulating nucleolar 
protein 

536 

67.797 

4 380 

Z93244 

Homo sapiens bK116F5.1 (unknown PUTATIVE 
protein) 

448 

53.600 

4 381 

Z95619 

Caenorhabditis elegans H21P03.2 

201 

29. 060 

43 82 

U29154 

Caenorhabditis elegans T07F12 . 1 gene product 

426 

33.617 


AF16 26 8 0 

Homo sapiens TRIF 

922 

99.270 

4384 

D87433 

Homo sapiens KIAA0246 

15929 

99. 955 

4385 

AF129812 

Homo sapiens candidate tumor suppressor protein 
NOC2 

668 

97.895 

4386 

AB002347 

Homo sapiens KIAA0349 

8658 

99.843 

4387 

Z68166 

Schizosaccharomyces pombe unknown 

373 

35.176 

4388 

Z35641 

Caenorhabditis elegans cDNA EST yk273d8.5 comes 
from this gene 

280 

32.402 

4389 

D7 9994 

Homo sapiens similar to ankyrin of Chromatium 
vinosum. 

1049 

46 . 667 

4390 

U40420 

Caenorhabditis elegans weak similarity to 
procollagen alpha chain 1 (V) chain 

537 

50. 000 

4391 

AL023828 

Caenorhabditis elegans cDNA EST yk289g5.5 comes 
from this gene; cDNA EST yk391h4.5 comes from 

1017 

41.943 
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this gene; cDNA EST EMBL:C09408 comes from this 
gene; cDNA EST yk332h9.5 comes from this cyene 



4392 

U49056 

Rattus norvegicus rAl 

1640 

94 . 779 

4393 

X78801 

Gallus gallus ovomacroglobulin, ovostatin 

515 

47.159 | 

4394 

AB014590 

Homo sapiens KIAA0690 protein 

1584 

99. 580 

43 95 

AB000216 

Rattus norvegicus CCA3 

338 

73 .239 

4396 

Z81515 

Caenorhabditis elegans F26H11.3c 

516 

39.444 

4397 

X02488 

Homo sapiens collagen N-prepropeptide (aa -22 
to 72) 

132 

41.379 

4398 

AF067172 

Homo sapiens RNA cyclase homolog 

1142 

97.790 

4399 

AF181623 

Drosophila melanogaster BcDNA. GH02 974 

223 

44.737 

4400 

Z75543 

Caenorhabditis elegans CDNA EST EMBL:M89063 
comes from this gene; cDNA EST yk384fl.3 comes 
from this gene; cDNA EST yk384fl.5 comes from 
this gene 

295 

39.091 

4401 

A27266 

Homo sapiens TGR-CL7 

790 

100.000 

4402 

AF041377 

Mus musculus cell death activator CIDE-B 

218 

63 . 793 

4403 

X64600 

Rattus norvegicus trans golgi network (TGN) 
specific integral membrane protein TGN38 

198 

27.317 

4404 

AF061555 

Mus musculus ubiquitin-protein ligase E3 -alpha 

1751 

94.141 ! 

4405 

AF117723 

Glycine max seed maturation protein PM2 7 

231 

29.189 

4406 

AL117204 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk381b7.5 comes from this 
gene 

1167 

41.606 

4407 

AL117518 

Homo sapiens hypothetical protein 

9043 

99. 925 

4408 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

396 

64 . 286 

4409 

AL050018 

Homo sapiens hypothetical protein 

2489 

99.478 

4410 

AL049943 

Homo sapiens hypothetical protein 

2268 

99.682 

4411 

Y08991 

Homo sapiens adaptor protein 

2325 

100 . 000 

4412 

AB004538 

Schizosaccharomyces pombe HYPOTHETICAL 5 9.2KD 
PROTEIN IN PFK26-SGA1 INTERGENIC REGION 

267 

43 .434 

4413 

U73820 

Mus musculus polypeptide GalNAc transf erase-Tl 

246 

28 . 090 

4414 

AL121804 

Drosophila melanogaster BACR7C10.a 

541 

52 . 381 

4415 

X83973 

Homo sapiens transcription factor 

1246 

100 . 000 

4416 

M96860 

Homo sapiens dipeptidyl aminopeptidase like 
protein 

493 

47.436 

4417 

AF024691 

Drosophila ananassae putative inorganic 
phosphate cotransporter 

523 

45.614 

4418 

A3 88 09 

Homo sapiens unnamed protein product 

564 

98.824 

4419 

U89336 

Homo sapiens unknown 

197 

30. 702 

4420 

U41543 

Unknown Similar to Rat trg gene product; coded 
for by C. elegans cDNA yk31e7.5; coded for by 
C. ele 

898 

46 . 795 

4421 

AF017418 

Homo sapiens homeobox protein MEIS2 

859 

99.194 

4422 

Z47075 

Unknown similar to Yeast DEG-1 protein (Swiss 
Prot accession number P31115) ; cDNA EST 
EMBL:D70252 c 

558 

32.000 

4423 

AJ011812 

Homo sapiens transcription factor NRF 

2606 

100 . 000 

4424 



330 

49.495 

4425 

U76759 

Mus musculus nuclear protein NIP4 5 

1207 

81 . 140 

4426 

AL117499 

Homo sapiens hypothetical protein 

1109 

100 . 000 

4427 

AJ132192 

Mus musculus HS1 binding protein 3 

781 

67.568 

4428 

AL022018 

Unknown /prediction= (method: " "genscan" " , 
version: ""1.0"", score: " "133 . 82 " " ) ; 

423 

35.156 
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/prediction= (method: 



4429 

AB018345 

Homo sapiens KIAA0802 protein 

9178 

100 . 000 

4430 

AL050095 

Homo sapiens hypothetical protein 

4235 

100 . 000 

4431 

AF151363 

Mus musculus Cdc42 GTPase-activating protein 

276 

60 . 938 

4432 

AF060153 

Homo sapiens METH2 protein 

6295 

99 . 888 

4433 

AF186115 

Mus musculus putative secreted protein SIG9 

243 

45 . 000 

4434 

AJ222636 

Homo sapiens hypothetical protein 

219 

45 .370 

4435 

AB028980 

Homo sapiens KIAA1057 protein 

6539 

99.795 

4436 

D83206 

Mus -musculus P24 protein 

184 

46.429 

4437 

AF132946 

Homo sapiens CGI -12 protein 

2097 

99.388 

4438 

AB028985 

Homo sapiens KIAA1062 protein 

10207 

100.000 

4439 

Z93386 

Unknown Similarity to Yeast hypothetical 52.9 
KD protein (SW:P43616); cDNA EST EMBL:M89432 
comes fr 

1110 

48.036 

4440 

U41107 

Caenorhabditis elegans No definition line found 

680 

40 . 071 

4441 

U64601 

Caenorhabditis elegans Gene probably begins in 
the next cosmid 

289 

51.765 

4442 

AL117204 

Caenorhabditis elegans predicted using 
Genef inder 

299 

35 .616 

4443 

L32372 

Mus musculus AM PA selective glutamate receptor 

332 

94 . 545 

4444 

M63180 

Homo sapiens threonyl - tRNA synthetase 

2594 

76 . 170 

4445 

U89529 

Rattus norvegicus fatty acid transport protein 

289 

84 . 000 

4446 

X93357 

Mus musculus homolog of human SYT 

2215 

95.652 

4447 

AB014565 

Homo sapiens KIAA066 5 protein 

323 

58 . 824 

4448 

AF144757 

Homo sapiens PR-domain zinc-finger protein PFM1 

5398 

99.497 

4449 

AF131826 

Homo sapiens Unknown 

763 

52 .804 

4450 

AF038007 

Homo sapiens FIC1 

742 

49.780 

4451 

Z12840 

Oryctolagus cuniculus protein of unknown 
function 

831 

33.014 

4452 

AF015811 

Mus musculus putative lysophosphatidic acid 
acyltransf erase 

925 

81.595 

4453 

U40800 

Caenorhabditis elegans similar to thymidine 
diphosphoglucose 4,6- dehydratase 

1647 

67.341 

4454 

AL117233 

Homo sapiens hypothetical protein 

3180 

99 . 792 

4455 

D17629 

Homo sapiens GALNS 

3665 

100.000 

4456 

Z82053 

Unknown predicted using Genef inder; similar to 
MUTT protein like; cDNA EST EMBL:C07418 comes 
from t 

286 

56.757 

4457 

U71205 

Mus musculus rit 

358 

34 . 906 

4458 

AF112481 

Homo sapiens RAD54B protein 

2357 

100 . 000 

4459 

Z81137 

Unknown Similarity to Yeast YIP1 protein 
(SW:P53039); cDNA EST EMBL:T01608 comes from 
this gene; cD 

643 

50.256 

4460 

D83146 

Mus musculus Six5 

306 

92 .308 

4461 

AJ133768 

Homo sapiens ZASP protein 

3315 

99 . 787 

4462 

Z99277 

Caenorhabditis elegans cDNA EST CEMSA26F comes 
from this gene; cDNA EST yk575gll.3 comes from 
this gene 

410 

31.206 

4463 

AB020629 

Homo sapiens KIAA0822 protein 

1959 

69 . 784 

4464 

AF074086 

Homo sapiens envelope 

146 9 

62 . 121 

4465 

Z83123 

Caenorhabditis elegans predicted using 
Genef inder 

300 

37.956 

4466 

AB007828 

Homo sapiens necdin 

407 

52.679 ! 

4467 

AF101361 

Drosophila melanogaster Abnormal X segregation 

310 

31.963 | 

4468 

AL021106 

Unknown /prediction= (method : " "genscan" " , 

228 

29.114 i 
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version :" "1.0" ") ; 

/predictions (method : " "genef inder " " , ve 



4469 

AC003672 

Arabidopsis thaliana putative zinc finger 
protein 

506 

30. 164 

4470 

D29766 

Rattus norvegicus Crk-associated substrate, 
P 130 

3067 

91.339 

4471 

X86779 

Homo sapiens FAST kinase 

162 

27.119 

4472 

U20286 

Rattus norvegicus lamina associated polypeptide 
1C 

1622 

74 . 775 

4473 

M76720 

Xenopus laevis egg-specif ic protein 

149 

35.185 

4474 

AL035086 

Homo sapiens dJ44A20.2 (novel protein) 

1717 

100.000 

4475 

AF056116 

Fugu rubripes unknown 

1570 

71 . 976 

4476 

Z29115 

Unknown similar to RNA helicases, deleted exon 
13 97-14 95 which introduced stop codon at 3' 
splice; 

1917 

64 .027 

4477 

AF115435 

Rattus norvegicus syntaxin 17 

629 

95.098 

4478 

AF025424 

Rattus norvegicus RNA polymerase I 12 7 kDa 
subunit 

1970 

95.302 

4479 

Z54328 

Schizosaccharomyces pombe hypothetical protein 

289 

34.266 

4480 

U15765 

Rattus norvegicus nonmuscle myosin heavy chain- 
B 

256 

89. 744 

4481 

AF003140 

Caenorhabditis elegans No definition line found 

432 

31.229 

4482 

AJ224306 

Arabidopsis thaliana PRTl 

218 

34.000 

4483 

AB002584 

Rattus norvegicus beta-alanine-pyruvate 
aminotransferase 

2102 

85.515 

4484 

AL117204 

Caenorhabditis elegans predicted using 
Genef inder 

253 

26.250 

4485 

Z11502 

Homo sapiens intestine-specific annexin 

1581 

98 . 814 

4486 

AF111713 

Homo sapiens junctional adhesion molecule 

504 

38 . 565 

4487 

AF151968 

Gallus gallus RGS protein RGS-17 

480 

91.250 

4488 

Z69944 

Schizosaccharomyces pombe hypothetical protein 

490 

28. 615 

4489 

Z69637 

Caenorhabditis elegans predicted using 
Genef inder; Similarity to E.coli hypothetical 
protein YCAC ( SW : YCAC_ECOLI ) ; cDNA EST 
yk555dl2.3 comes from this gene 

309 

63.291 

4490 

Z79754 

Caenorhabditis elegans Similarity to some 
phosphatases and kinases; cDNA EST EMBL:Z14643 
comes from this gene; cDNA EST yk531b4.3 comes 
from this gene; cDNA EST yk642e5.3 comes from 
this gene 

502 

30.508 

4491 

L41816 

Homo sapiens cam kinase I 

1106 

85.484 

4492 

AF120102 

Homo sapiens calsenilin 

630 

80. 531 

4493 

AL110300 

Homo sapiens hypothetical protein 

2338 

98.352 

4494 

U28412 

Caenorhabditis elegans similar to polyposis 
locus protein 1 (SP:DP1 HUMAN, Q00765) 

362 

51.429 

4495 

L12147 

Mus musculus early B-cell factor 

1671 

99.609 

4496 

D86964 

Homo sapiens similar to a human major CRK- 
binding protein DOCK180. 

1450 

99.563 

4497 

AB020690 

Homo sapiens KIAA0883 protein 

479 

41. 176 

4498 

AC004780 

Homo sapiens F17127 1 

2136 

94.253 

4499 

AF057365 

Canis familiaris UDP N-acetylglucosamine 
transporter 

254 

32.639 

4500 

AF099186 

Mus musculus EH domain- containing protein EHD1 

952 

67 . 347 

4501 

U27838 

Mus musculus glycosyl -phosphatidyl- inositol - 
anchored protein homolog 

460 

40.693 


A O O IB M-6 i+3 » O 2 S: & O 2 


43UZ 


Homo sapiens CDC4 2 -binding protein kinase beta 

426 

100.000 

4503 

X69489 

Rattus norvegicus beta-chimaerin 

290 

34 . 711 

/i c ft /i 
4 504 

AB012933 

Rattus norvegicus acyl-CoA synthetase 5 

2880 

78.571 


Tic ~i ~i c e? 

U5336 6 

Oncorhynchus mykiss terminal deoxynucleotidyl 
transferase 

550 

51.099 


i\E U i7 D / *i X 

Rattus norvegicus unknown 

1198 

58.123 

4507 

U83115 

Homo sapiens non-lens beta gamma- crystall in 
like protein 

10968 

100.000 

4508 

U62906 

Mus musculus zinc finger protein 94 

1328 

69.231 

a c n q 

ACT ft ft T *7 "7 

Mus musculus IdlBp 

782 

84 .138 


-f\ T7> "t C 1 *7 ft ft 

Ar 1S1799 

Homo sapiens CGI -40 protein 

3487 

95.963 


X162 62 

Rattus norvegicus myosin heavy chain 21 (AA 
621) 

199 

24.437 

*± -> XxS 

TT A C ft *7 

U4 Jb 07 

Mus musculus transcription factor-like protein 
4 beta 

1481 

87.732 

ART"* 

Ar X3b421 

Homo sapiens GDP-mannose pyrophosphorylase B 

2421 

100.000 

4514 

AL031427 

Homo sapiens dJ167A19.4 (novel protein) 

146 

92 . 000 

4515 

Z9912 9 

Homo sapiens dJ425C14.2 (Placental protein 
DIFF33 LIKE) 

884 

52.893 

4516 

Y08135 

Mus musculus acid sphingomyelinase- like 
phosphodiesterase 

2365 

81.995 

4517 

ALi023286 

Schizosaccharomyces pombe probable atp- 
dependent ma helicase 

215 

32.374 

A C T O 

AT All C C *1 

AJjOJ 166 7 

Homo sapiens dJ620Ell.le (novel Helicase C- 
terminal domain and SNF2 N- terminal domains 
containing protein, similar to KIAA0308) 

1028 

99.342 

4519 

D63850 

Mus musculus hepatoma-derived growth factor 

1444 

74 . 441 

/f C ft ft 

AC004523 

Homo sapiens F2232 9 1 

214 

84 . 615 

4521 

M60706 

Homo sapiens topoisomerase I 

2869 

68.243 

4522 

AF019236 

Dxctyostelium discoideum TipD 

691 

37.459 

4523 

Z75550 

Unknown limited similarity with some myosins; 
cDNA EST EMBL:C08402 comes from this gene; cDNA 
EST E 

192 

28.125 

/i coy! 

TTft TJT/T 

Homo sapiens alternatively spliced product 
using exon 13 A 

363 

67.500 

4525 

L13977 

Homo sapiens prolylcarboxypeptidase 

453 

37.297 

4526 

D78572 

Mus musculus membrane glycoprotein 

4177 

81.630 

4527 

AC004883 

Homo sapiens similar to KIAA0766; similar to 
PID:g3882253 

386 

27.046 

4528 

AB029005 

Homo sapiens KIAA1082 protein 

1769 

52.795 

452 9 

Z70307 

Caenorhabditis elegans Similarity to B.subtilis 
tetracycline resistance protein 
(SW:TCR2_BACSU) ; cDNA EST EMBL:C09951 comes 
from this gene; cDNA EST EMBL:C08265 comes from 
this gene; cDNA EST yk619cll.3 comes from this 
gene 

384 

35.135 

T J JU 

X XO U U O 

Mus musculus neuronal -STOP protein 

914 

57.692 

"J J J. 

TTQ ft QO n 

Caenorhabditis elegans strong similarity to 
class- III of pyridoxal-phoshate-dependent 
aminotransferases 

1001 

48.377 

4532 

AF039023 

Homo sapiens Ran-GTP binding protein; RanBP6 

1209 

97 . 861 

4533 

AF051945 

Mus musculus Xin 

492 

51.515 

4534 

AF005856 

Drosophila yakuba anon2A5 

499 

34 .454 

4535 

AF105374 

Homo sapiens heparan sulfate D-glucosaminyl 3- 
O-sulf otransf erase-2 

2512 

99.455 


1% 
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A C *5 ^ 

AC0U44 72 

Homo sapiens PI. 11659 3 

1465 

75 . 078 

453 7 

Z66519 

Caenorhabditis elegans similar to phytoene 
synthase precursor; cDNA EST yk340f7.3 comes 
from this gene; cDNA EST yk340f7.5 comes from 
this gene; cDNA EST yk565e5.3 comes from this 
gene 

351 

30.769 

4 53 8 


cdenornaDaiLis eiegans pucative 

229 

39.669 

4539 

Z78201 

Unknown Similarity to E.coli 2-oxoglutarate 
dehydrogenase ( SW : ODOl^ECOLI ) ; cDNA EST 
ejI v ud±j . L>j z dj\j com 

1681 

52 .665 

4540 

Y17833 

Human endogenous retrovirus K pol protein 

741 

80 . 597 



Arabidopsis thaliana unknown protein 

383 

27 . 596 

4542 

U20861 

Caenorhabditis elegans similar to yeast 
ancivirai protein SKI2 ana ATP-aepenaent DNA- 
helicases 

278 

24 .416 

4543 

Z69384 

Caenorhabditis elegans Similarity to Salmonella 
regulatory protein UHPC (SW:UHPC SALTY) 

345 

54 .023 

4544 

AC004890 

Homo sapiens similar to zinc finger proteins; 
similar to AAC01956 (PID : g2843171 ) 

2856 

100. 000 

b«t b 


Homo sapiens supported by GENSCAN prediction 
and spliced EST; similar to Z35641 
lirlJJ .yib ana AlObyoUO INID : g33333 77 ) 

1394 

100 . 000 

*i J *± O 

M7 TRnp 
/DUO 

nomo sapiens oeta-gaiactosiaase related protein 
precursor ■ 

966 

49.379 

M *i / 


xioiuo sapiens j\.i/\/\uyDi7 protein 

5305 

99 . 026 

4C4fl 


Caenorhabditis elegans coded for by C. elegans 

K—LJVit^t^ VjCilDallA. . HO O O O ^7 dHU J / pULciLlVS 

647 

46 . 445 

4549 

AJ011523 

Caenorhabditis elegans CHE- 2 protein 

532 

37 . 811 

4Ccn 

LID J fi / D 

Rattus norvegicus GTPase Rab8b 

245 

97 .436 


TJ74 O Q*7 

Oryctolagus cuniculus PiUS 

218 

96 . 774 

4552 

AL117434 

Homo sapiens hypothetical protein 

4171 

99.685 

4553 

AL021492 

Caenorhabditis elegans Y45F10D.il 

518 

33.158 

4 CC/1 

1J14 JJo 

Mus musculus RNA polymerase I associated factor 
(PAF53) 

1494 

77 . 700 

ft J J J 

At U U Z? O b 

Homo sapiens unknown protein IT12 

5694 

99 . 771 



Homo sapiens match to ESTs AA3 16181 
(NID:g3165221) , AA032221 (NID : gl502 183 ) , and 
AI167942 (NID:g3701112) 

2522 

100 . 000 

4 3D / 

/\r* uyy 1 *±z 

Rattus norvegicus putative short-chain 

ucnyuiuyciidat:/ r cuuc taSc 

448 

63 . 158 

4558 

TT"3 Qfi4. ft 

cdenurnaDuic.is eiegans iijld.d gene proauct 

4 97 

51.250 

4559 

AL011 07 7 

vjiijviivjwii / picuiLtiuii — ^iitc: uiiuu . yensiinaer , 
/oredi c t ion= (metho 

754 

56 . 614 

4560 

AF132164 

Drosophila melanogaster unknown 

278 

35. 762 

4561 

AB020669 

XlUlllv/ oapiCUO I\lnnu O O Z. ^Ji. L- t. JLI1 

O Art 

3 o . bob 

4562 

AL117430 

Homo c >~) ■} on o Vivrr»ot"Hi=»t* i 1 nrnl-p i n 

T O O *7 

i An r\r\f\ 
10U . UOO 

4563 

U21324 

\,acnvi najjuj. l j.d cxcy alio ixy UCllilXLXvll 1111c J_ vll 1Q 

loo 

A Q QIC 

4564 

AF098633 

Ml 1 c* mi l c» <*-m 1 1 lies nTTlTd irooi r~> ~] <=> nvnho-i -n 
i'luo iiiuov.uj.Uo ouuit vcolulc piUUclll 

A *1 1 

411 

A *1 C C C 

4 7.656 

4565 

AF053630 

Homo sapiens monocyte /neutrophil elastase 
inhibitor 

1490 

100 000 

-A. \J \J m \J \J \J 

4566 

J05499 

Rattus norvegicus L-glutamine amidohydrolase 

462 

92.208 

4567 

AF068749 

Mus musculus sphingosine kinase 

1716 

81. 908 

4568 

Z48804 

Homo sapiens OA1 

1258 i 

100. 000 

4569 

X85030 

Homo sapiens calpain 

1541 

100.000 
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4570 

AL079349 

Arabidopsis thaliana putative protein 

289 

30.962 

4571 

AF155110 

Homo sapiens NY -REN- 4 5 antigen 

5381 

99.877 

4572 

AB002301 

Homo sapiens KIAA03 03 

14352 

99.953 

4573 

AC002400 

Homo sapiens Acyl carrier protein, 
Mitochondrial (ACP) (5' partial) 

1000 

100.000 

4574 

U20554 

Drosophila melanogaster UDP- 

glucose :glycoprotein glucosyltransf erase 

precursor 

2320 

59.331 

4575 

S45663 

Rattus sp. SC2=synaptic glycoprotein 

722 

50.538 

4576 

AB025411 

Mus musculus Ten-m2 

4766 

98 .476 

4577 

246381 

Unknown Weak similarity with the Ysy6 protein 
(Yeast) (PIR accession number JQ0912) ; cDNA EST 
EMBL: 

550 

42.157 

4578 

AF151863 

Homo sapiens CGI -105 protein 

187 

93 . 333 

4579 

AB014564 

Homo sapiens KIAA066 4 protein 

7558 

100 . 000 

4580 

X98494 

Homo sapiens M phase phosphoprotein 10 

4243 

98 . 960 

4581 

AL008635 

Homo sapiens dJ510H16.1 

844 

54 .412 

4582 

AF041206 

Homo sapiens midline 1 cerebellar isoform 1 

460 

26 . 515 

4583 

AC006416 

Arabidopsis thaliana Similar to 

530 

43 .750 

4584 

AF123653 

Homo sapiens FEZ1 

362 

41.341 

4585 

AF000195 

Caenorhabditis elegans similar to oxysterol- 
binding proteins 

490 

42 .246 

4586 

U93569 

Homo sapiens p4 0 

167 

54 . 545 

4587 

X94313 

Mus musculus p6 8 RNA helicase 

153 

95.455 

4588 

D87438 

Homo sapiens Similar to a C. elegans protein in 
cosmid C14H10 

5406 

99.512 

4589 

U61953 

Caenorhabditis elegans No definition line found 

764 

49.798 

4590 

U93569 

Homo sapiens p4 0 

238 

37.963 

4591 

Z70780 

Unknown similar to Zinc finger, C2H2 type; cDNA 
EST EMBL:D27516 comes from this gene; cDNA EST 
EMBL 

975 

40.000 

4592 

AB007923 

Homo sapiens KIAA04 54 protein 

701 

91.200 

4593 

AC002336 

Arabidopsis thaliana hypothetical protein 

647 

33 .668 

4594 

AB011096 

Homo sapiens KIAA0524 protein 

3955 

100 . 000 

4595 

ALO50331 

Homo sapiens dJ486I3.4 (TSPY-like (testis 
specific protein, Y-linked like) ) 

750 

65.269 

4596 

AB029033 

Homo sapiens KIAA1110 protein 

4928 

99 . 595 

4597 

AB002364 

Homo sapiens KIAA03 66 

406 

30 . 000 

4598 

AF017639 

Mus musculus carboxypeptidase X2 

923 

90 . 411 

4599 

Y17462 

Fugu rubripes cysteine conjugate beta- lyase 

1613 

55.904 

4600 

AF060173 

Rattus norvegicus SV2 related protein 

1828 

96 . 140 

4601 

M36501 

Homo sapiens alpha- 2 -macroglobulin 

211 

91 . 429 

4602 

X65964 

Homo sapiens nest in 

1907 

95 .652 

4603 

X03743 

Homo sapiens L apoferritin (aa 126-175) 

174 

92 . 857 

4604 

Z69240 

Schizosaccharomyces pombe putative 
amidohydrolase 

996 

52.708 

4605 

U73819 

Mus musculus polypeptide GalNAc transf erase -T4 

734 

39 . 527 

4606 

AF187318 

Homo sapiens F-box protein Fbx2 

492 

43 . 103 

4607 

AF104260 

Homo sapiens hiwi 

1062 

4 8 . 951 

4608 

Z46786 

Drosophila melanogaster acetyl -CoA synthetase 

~i A Q 

CL*\ A A C 

4609 

X76092 

Homo sapiens DNA binding protein RFX3 

417 

100. 000 

4610 

X68880 

Homo sapiens EMX2 

1076 

100. 000 

4611 

AF093097 

Homo sapiens putative RNA-binding protein Q99 

5782 

100. 000 

4612 

AF038960 

Homo sapiens SKD1 homolog 

340 

97 . 917 

4613 

AB020807 

Homo sapiens TLR6 

1680 

56.849 
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4614 

U39546 

Rattus norvegicus surface protein MCA- 32 

372 

30.904 

4615 

AL031228 

Homo sapiens dJ1033B10 . 8 . 1 (Ring finger protein 
1 (RING1, RNF1) ) 

1300 

100.000 

4616 

AB015330 

Homo sapiens HRIHFB2 007 

341 

55.914 

4617 

U20086 

Mus musculus NF2d9 

297 

89.796 

4618 

AF016417 

Caenorhabditis elegans Similar to BZIP 
transcription factor 

648 

42 .105 

4619 

L12701 

Homo sapiens engrailed protein 

667 

100 . 000 

4620 

X55885 

Homo sapiens KDEL receptor 

497 

100.000 

4621 

Y08200 

Homo sapiens rab geranylgeranyl transferase 

169 

33.803 j 

4622 

U79260 

Homo sapiens unknown 

191 

58 . 730 

4623 

U95973 

Arabidopsis thaliana endomembrane protein EMP70 
precusor isolog 

1683 

59.698 

4624 

AL034433 

Schizosaccharomyces pombe ubiquitin-activating 
enzyme el 

456 

48 . 538 

4625 

AL117404 

Homo sapiens hypothetical protein 

458 

100.000 

4626 

L07765 

Homo sapiens carboxylesterase 

273 

50.000 

4627 

AC003038 

Homo sapiens R3 0 923 1 

482 

98.611 

4628 

AJ010482 

Homo sapiens Myopodin protein 

1271 

100.000 

4629 

AJ010305 

Mus musculus mouse smoothelin, large isoform 

834 

54 . 751 

4630 

AF118637 

Homo sapiens feline leukemia virus subgroup C 
receptor FLVCR 

446 

28.342 

4631 

AB029023 

Homo sapiens KIAA1100 protein 

1059 

72 .093 

4632 

AB014568 

Homo sapiens KIAA0668 protein 

4888 

99.865 

4633 

X92857 

Homo sapiens NFI /CAAT-binding transcription 
factor 5 (CTF5) 

1014 

100.000 

4634 

U13875 

Caenorhabditis elegans No definition line found 

485 

43 .367 

4635 

U34925 

Drosophila melanogaster TH1 

220 

52 . 727 

4636 

AF031897 

Meleagris gallopavo G protein coupled P2Y 
nucleotide receptor 

563 

35 .632 

4637 

L22005 

Homo sapiens ubiquitin conjugating enzyme 

2038 

99.664 

4638 

AF145615 

Drosophila melanogaster BcDNA . GH03 3 77 

221 

33 .333 

4639 

Z49909 

Caenorhabditis elegans weak similarity with a 
B. Flavum translocation protein (Swiss Prot 
accession number P38376) ; cDNA EST yk220el0.5 
comes from this gene; cDNA EST yk54 9el2.3 comes 
from this gene; cDNA EST yk618d6.3 comes from 
this gene 

628 

36 . 508 

4640 

AF078850 

Homo sapiens steroid dehydrogenase homolog 

1175 

98 . 925 

4641 

D88153 

Homo sapiens HYA2 2 

1155 

71.429 

4642 

AF155739 

Mus musculus axotrophin 

1761 

90.203 

4643 

AF079527 

Mus musculus IER5 

450 

78.261 

4644 

AF080217 

Sinorhizobium meliloti acetoacetyl-CoA 
synthetase; acetoacetyl-CoA ligase; acyl- 
activating enzyme 

883 

47.826 

4645 

X04823 

Bos taurus cGMP phosphodiesterase (AA 1-87) 

275 

88 . 889 

4646 

Z95559 

Caenorhabditis elegans cDNA EST yk236d4.5 comes 
from this gene; cDNA EST EMBL:C13455 comes from 
this gene; cDNA EST yk32 9g6.5 comes from this 
gene; cDNA EST CEMSH4 5R comes from this gene 

648 

56.497 

4647 

X80169 

Mus musculus tsg24 

3148 

92 . 032 

4648 

X92750 

Mus musculus red-1 

1174 

98.352 

4649 

J02459 

bacteriophage lambda ea22 (182) 

856 

96.296 

4650 

U06631 

Homo sapiens homologous to mouse gene 
PC326 :GenBank Accession Number M95564 

400 

64.583 
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4651 

L36340 

Xenopus laevis importin alpha lb 

194 

57 . 692 

4652 

AF135027 

Homo sapiens OB binding protein- like protein 

2931 

89 . 655 

4653 

U22818 

Cricetulus griseus mutant sterol regulatory- 
element binding protein- 2 

469 

63 .636 

4654 

AB028981 

Homo sapiens KIAA1058 protein 

10125 

100 . 000 

4655 

AL049996 

Homo sapiens hypothetical protein 

51 

36 . 364 

4656 

D86979 

Homo sapiens KIAA0226 

1103 

55 . 172 

4657 

U28789 

Mus musculus PACT 

2740 

78 . 743 

4658 

X97674 

Homo sapiens transcriptional intermediary- 
factor 2 

8231 

100.000 

4659 

A58799 

unidentified unnamed protein product 

407 

94 .444 

4660 

AF072506 

Homo sapiens envelope protein precursor 

373 

75.949 

4661 

X98374 

Rattus norvegicus KIS 

314 

100 . 000 

4662 

Z71259 

Caenorhabditis elegans Weak similarity to Yeast 
mitochondriual carrier protein YIL006W 
(SW:YIA6_YEAST) ; cDNA EST EMBL:C09181 comes 
from this gene 

365 

42.748 

4663 

AB028986 

Homo sapiens KIAA1063 protein 

811 

80.000 

4664 

AL021748 

Schizosaccharomyces pombe hypothetical protein 

232 

35 . 971 

4665 

AB002374 

Homo sapiens KIAA0376 

962 

45 .433 

4666 

AB023203 

Homo sapiens KIAA0 986 protein 

242 

100 . 000 

4667 

AJ010973 

Homo sapiens DEDD protein 

578 

53 . 049 

4668 

AF116553 

Drosophila melanogaster antennal- specif ic 
short -chain dehydrogenase /reductase 

390 

32 . 663 

4669 

X07311 

Drosophila melanogaster heat shock protein 

167 

35 . 714 

4670 

X75042 

Homo sapiens c-rel 

2965 

100 . 000 

4671 

X17531 

Strongylocentrotus purpuratus epidermal growth 
factor 

407 

57 . 831 

4672 

AB011123 

Homo sapiens KIAA0551 protein 

443 

87 . 671 

4673 

AB020711 

Homo sapiens KIAA0 904 protein 

312 

100 . 000 

4674 

AF000234 

Homo sapiens P2x purinoceptor 

216 

100.000 

4675 

Z36715 

Homo sapiens Net 

2006 

98 .101 

4676 

AL109819 

Arabidopsis thaliana extensin-like protein 

414 

24 . 194 

4677 

AJ006268 

Homo sapiens ATPase 

811 

92 .308 

4678 

AF132180 

Drosophila melanogaster unknown 

940 

35.849 

4679 

Z72511 

Unknown possible zinc finger protein; cDNA EST 
EMBL:M8 9115 comes from this gene; cDNA EST 
EMBL : D715 

228 

33 .333 

4680 

AC005053 

Homo sapiens match to ESTs AA316181 
(NID :g3165221) , AA032221 (NID : gl502l83 ) , and 
AI167942 (NID : g3 701112 ) 

881 

49 . 580 

4681 

AE000119 

Escherichia coli orf, hypothetical protein 

675 

100.000 

4682 

AB015630 

Homo sapiens type II membrane protein 

551 

34 . 448 

4683 

AB005541 

Rattus rattus PCTAIRE3 

2521 

91 . 707 

4684 

AF089730 

Rattus norvegicus potassium channel subunit 

488 

94 . 937 

4685 

AB011541 

Homo sapiens MEGF8 

12808 

99 . 942 

4686 

AF006466 

Mus musculus lymphocyte specific formin related 
protein 

2651 

81 . 426 

4687 

U71273 

Sus scrofa glucosidase II 

179 

30 . 488 

46 88 

D90750 

Escherichia coli Hypothetical transcriptional 
regulator in metG-dld intergenic region. 

C r a 

1UU . DUO 

4689 

M59742 

Rattus norvegicus GABA transporter protein 

2732 

90.991 

4690 

AB014568 

Homo sapiens KIAA06 68 protein 

511 

37.838 

4691 

D29013 

Homo sapiens DNA polymerase beta 

512 

33.333 

4692 

AB028972 

Homo sapiens KIAA104 9 protein 

201 

100 . 000 
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4693 

AC004685 

Homo sapiens Unknown gene product 

287 

100.000 



Methanobacterium thermoautotrophicum O- linked 

ox v-XN.rt.v-. LidiiDLcLaac 

193 

38 . 776 

4695 

X99209 

Homo sapiens arginine methyltransf erase 

250 

34 .884 

4696 

AL031685 

Homo sapiens dJ963K23.2 (novel protein) 

533 

41.765 

4 C Q "7 

7\T T 71 7/11 

ALii J 1 /4 1 

Schizosaccharomyces pombe vacuolar protein 
sorting-associated protein 

1078 

35 . 199 

/i a q q 

VI C ACd 

Rattus norvegicus 70 kD tumor- specif ic antigen 

591 

86 . 735 

4699 

Z93239 

Unknown predicted using Genefinder; cDNA EST 
EMBL:D6 8680 comes from this gene; cDNA EST 

- _1_ 7 17^7 r— 

yKz±zg2 . 5 c 

1063 

44.986 

a 7 f\ n 

AT 070^77 
AliU / OOZ / 

Schizosaccharomyces pombe actin-like protein; 
(2 act in domains) 

403 

46 . 043 

A *7 t\ 1 

** 1 U 1 


Caenorhabditis elegans Similarity to Human 
Transcriptional repressor protein PRDI-BF1 (PIR 

Hrr* M/7 7\ 7 qcr a \ 

«.VkI- • INV-J . rtj ->D ft J 

194 

39 . 216 

4702 


nomo sapiens riKxrir ozzoo 

293 

65 . 556 

4703 

Ag JD 

rioiuo sapiens jujma \cyuosine-b - / - 
methyltransf erase 

1422 

99 . 510 

4704 

AF053130 

Mus musculus unconventional myosin MY015 

2 54 

97.561 

4*7ft c: 

J-\C lZlUOl 

Mus musculus cAMP inducible 2 protein 

473 

68 . 182 

t / vD 


Homo sapiens Unknown gene product (partial) 

1141 

96.237 

ffc / u / 

ADrtT /t C 7 7 

Homo sapiens KIAA06 73 protein 

8121 

100 . 000 

fi / U O 

ZjO 0 / bl 

Unknown Similarity to Yeast hypothetical 
protein ykku \bw:YKJ\U YJ2.A0T/ ; CDNA EST 

PMDT . r*1 9C7Q p nmp cj -P 

drlDij . ^l^D /o (-WlTlcS X 

662 

47 . 222 

4709 

API 111 fiR 

/IT X X X X O O 

nOlllLJ CjCtLJX Clio IXIliVilC^) WI1 

Id 3 

55 . 556 

4710 

AF146688 

Fugu rubripes kelch protein 

264 

31.056 

I'll 


nomo sapiens i\.iAAUzoy 

8 904 

99.313 

4712 


nomo sapiens nypocnecicai protein 

2277 

99 . 701 

4713 

AB001772 

Ciona savignyi PEM-5 

301 

32 .576 

*± / X*± 

anno rxtz i 

Homo sapiens KIAA0869 protein 

5658 

99 . 217 

471 c 


Caenorhabditis elegans cDNA EST EMBL:M88866 
comes from this gene 

298 

30 . 290 

4716 

AF077032 

Homo sapiens sec61 homolog 

476 

95.890 

471 7 
*s / X / 

arn^4 no*? 

Homo sapiens mutant guanine nucleotide-binding 
jjioceiii \ s / , axpna suouniu 

158 

100 . 000 

4718 

AF030253 

Rattus norvegicus vesicular GABA transporter 

398 

98 .305 

471 9 


baiius gaiius rnovjap protein 

877 

48.235 



nomo sapiens au^bbii . i i loFx -luce ^testis 
sucmic protein/ i-iinKea luce/ / 

1554 

99 . 556 

4721 

X77681 

riLCci ctJJl.co ^ LI 1. cl 

1 Z D 

A C A C C 

4722 

U6 6411 

uiuoupiii.ia uic laliuyao Lcl piluaLl Vc U-ypts XXX 

alcohol dehydrogenase 

O QA 
ZtJ<* 

C A Q 7 O 

4723 

L07063 

Mus musculus FKBP65 binding protein 

2255 

89.444 

4724 

AB007923 

Homo sapiens KIAA04 54 protein 

292 

33 . 790 

4 72 5 

wo y\j j\j 

Homo sapiens 190kD protein 

3235 

99.3 93 

4726 

X61047 

Hydra sp. mini- collagen 

124 

53 .333 



Homo sapiens dA22D12 . 1 (novel protein similar 
to Drosoohila Kelch (Rincr Canal nrot-pin TCFT.) 
and a heterogenous set of other types of 
proteins) 

3848 

100 . 000 

4728 

AF003384 

Caenorhabditis elegans weak similarity to the 
peptidase family A2 

713 

34 .988 

4729 

U43317 

Mus musculus transmembrane receptor 

1084 

99.401 
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4730 

AC002400 

Homo sapiens Gene product with similarity to 
Ubiquitin binding enzyme 

1378 

90.870 

4731 

L15313 

Caenorhabditis elegans homology with leucine 
aminopeptidase; coded for by C. elegans cDNAs 
CE2F12 (GenBank: Z14714) and CE15D11 (GenBank: 
Z14518) ; putative 

1227 

46.991 

4732 

AJ238706 

Drosophila melanogaster monocarboxylate 
transporter 1 homologue 

297 

30.400 

4733 

AF072934 

Homo sapiens translational release factor, l 

1050 

48. 773 

4734 

AC005306 

Homo sapiens R2 7216 1 

1480 

80 . 970 

4735 

AF113131 

Homo sapiens host cell factor homolog LCP 

276 

80. 000 

4736 

AB007926 

Homo sapiens KIAA04 57 protein 

5616 

99. 763 

4737 

AC004500 

Homo sapiens GDF- 9 

3109 

100.000 

4738 

AF007170 

Homo sapiens unknown 

404 

26.415 

4739 

X60152 

Homo sapiens zinc finger 2.2 

2795 

100.000 

4740 

D25215 

Homo sapiens KIAA0032 

593 

38.565 

4741 

AC005534 

Homo sapiens supported by human ESTs AA4124 02 
(NID:g2070990) NH44021 (NID : gll82549) , mouse 
EST AA065933 (NID :gl562789) , and genscan 

1486 

88.845 

4742 

Z83115 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to Human P619 protein 
(TR:Q15751) 

212 

48.387 

4743 

AF153230 

Xenopus laevis allantoicase 

817 

56 . 345 

4744 

Y07759 

Homo sapiens mysoin heavy chain 12 

413 

61 . 798 

4745 

M98529 

Homo sapiens 21 kDa protein 

418 

100.000 

4746 

U23522 

Caenorhabditis elegans No definition line found 

574 

31 . 875 

4747 

AF070996 

Monodelphis domestica lactate dehydrogenase A 

767 

41.534 

4748 

Z25420 

Gallus gallus class II INCENP protein 

238 

26 . 103 

4749 

AF117892 

Homo sapiens aspartic-like protease 

3428 

99. 807 

4750 

AL050126 

Homo sapiens hypothetical protein 

740 

73 . 077 

4751 

Z82268 

Unknown cDNA EST yk338gl0.5 comes from this 
gene; cDNA EST EMBL:D2 7934 comes from this 
gene; cDNA E 

369 

32 . 035 

4752 

AL031320 

Homo sapiens dJ20N2.1 (novel protein similar to 
yeast and bacterial cytosine deaminase) 

260 

97.436 

4753 

U34932 

Rattus norvegicus Fos- related antigen 

1129 

82.432 

4754 

AF083108 

Homo sapiens sirtuin type 3 

2730 

100.000 

4755 

AL049938 

Homo sapiens hypothetical protein 

1438 

99.552 

4756 

A0T006470 

Homo sapiens cartilage-associated protein 
(CASP) 

1944 

100.000 

4757 

AF000195 

Caenorhabditis elegans similar to oxysterol- 
binding proteins 

839 

44 . 966 

4758 

U54638 

Mus musculus rhotekin 

1438 

81.176 

4759 

Z78542 

Caenorhabditis elegans similar to Mitochondrial 
carrier proteins; cDNA EST EMBL:T01651 comes 
from this gene 

523 

51. 592 

4760 

X13916 

Homo sapiens LDL- receptor related precursor (AA 
-19 to 4525) 

1252 

96.196 

4761 

AF053308 

Drosophila affinis putative guanine nucleotide 
releasing factor 

656 

57. 962 

4762 

AF167320 

Mus musculus zinc finger protein ZFP113 

672 

67.669 

4763 

AF049611 

Homo sapiens huntingtin interacting protein 
HYPE 

1221 

100.000 

4764 

AB014596 

Homo sapiens KIAA06 96 protein 

3747 

100 . 000 

4765 

AL034408 

Homo sapiens dJ710L4.2 (similar to 

1985 

99. 660 
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MYOTUBULARIN- RELATED PROTEIN) 



4766 

Z46787 

Caenorhabditis elegans similar to Glutaredoxin, 
Zinc finger, C3HC4 type (RING finger) 

1209 

51.582 

4767 

D78359 

Rattus norvegicus consensus repeat domain: 
ntl20-174; consensus repeat domain: nt262-317; 
consensus repeat domain: nt57-115; 
transmembrane domain: nt376-413 

235 

32 .168 

4768 

AF083217 

Homo sapiens WD repeat protein WDR3 

6181 

99 . 788 

4769 

X95272 

Rattus norvegicus ORF 

304 

25.000 

4770 

AL096842 

Homo sapiens hypothetical protein 

409 

95 .385 

4771 

AF061817 

Rattus norvegicus DNA- binding protein PREB 

1471 

92 .373 

4772 

AL021481 

Unknown similar to Phosphoglucomutase and 
phosphomannomutase phosphoserine ; cDNA EST 
EMBL» : D3 6168 

1138 

44.089 

4773 

Z92832 

Caenorhabditis elegans F31D4.2 

578 

42 .273 

4774 

X52022 

Homo sapiens collagen type VI, alpha 3 chain 

4690 

99.582 | 

4775 

U70854 

Caenorhabditis elegans No definition line found 

985 

39.231 

4776 

X76116 

Caenorhabditis elegans carrier protein (c2) 

693 

50.877 

4777 

U06713 

Rattus norvegicus SM-20 

993 

61.860 

4778 

Z46373 

Saccharomyces cerevisiae orf, len: 423, CAI : 
0.18, 27.4% identity in 307 aa overlap with 
S36201 S36201 hypothetical protein 1 - 
Rhizobium leguminosarum 

945 

44 .142 

4779 

Z46354 

Homo sapiens hexokinase II 

200 

70.732 

4780 

Z93239 

Unknown predicted using Genefinder; cDNA EST 
EMBL:D686 80 comes from this gene; cDNA EST 
yk212g2.5 c 

593 

37 .917 

4781 

AB023233 

Homo sapiens KIAA1016 protein 

4916 

99 . 185 

4782 

U49954 

Caenorhabditis elegans coded for by C. elegans 
cDNA CEESG19F; short region of weak similarity 
to Thermomonospora protein kinase (GB:U23820) 

281 

27.759 

4783 

M17099 

Oryctolagus cuniculus progesterone -induced 
protein 

1158 

91.623 

4784 

AL117482 

Homo sapiens hypothetical protein 

2001 

100. 000 

4785 

Z29094 

Caenorhabditis elegans similar to Na/Ca, K 
antiporter 

234 

33 .884 

4786 

D49473 

Mus musculus truncated form of Soxl7 

1402 

78 .400 

4787 

L31349 

Drosophila melanogaster out at first protein 

644 

41 .228 

4788 

U66088 

Homo sapiens sodium iodide symporter 

186 

33 .654 

4789 

AL117204 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk381b7.5 comes from this 
gene 

325 

39.344 

4790 

AF038957 

Homo sapiens translation initiation factor 4e 

169 

100.000 

4791 

AF002196 

Caenorhabditis elegans No definition line found 

227 

38.462 | 

4792 

U21324 

Caenorhabditis elegans similar to entire S. 
cerevisiae ABC1 protein (Swiss-Prot Acc: 
P27697) 

420 

50.862 

4793 

AL050102 

Homo sapiens hypothetical protein 

3182 

99 .798 

4794 

AF097887 

Rattus norvegicus Chp 

1135 

98 .810 

*± l )Jz> 


Mus musculus tyrosine phosphatase 

200 

33 . 708 

4796 

L15313 

Caenorhabditis elegans homology with leucine 
aminopeptidase; coded for by C. elegans cDNAs 
CE2F12 (GenBank: Z14714) and CE15D11 (GenBank: 
Z14518) ; putative 

1227 

46.991 

4797 

L43631 

Homo sapiens scaffold attachment factor B 

377 

44 . 937 
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4798 

AF083389 

Homo sapiens putative WHSC1 protein 

3574 

100.000 

yf "7 O O 

Av_0 044 /-3 

Arabidopsis thaliana T13D8 . 31 

630 

36 . 901 

/I OA A 

AJ 2 3 8097 

Homo sapiens Lsm5 protein 

24 

60 . 000 

/i o n i 
4 801 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

363 

67 . 568 

4802 

Z75552 

Unknown predicted using Genefinder; Similarity 
to Prototheca mitochondiral ribosomjal protein 

Oil / 

£>11 { 

382 

43 . 704 

4803 

X92763 

Homo sapiens tafazzins 

2043 

100.000 

4804 

Z75170 

Saccharomyces cerevisiae ORF YOR262w 

727 

44 . 939 

4805 

Z48583 

Unknown similar to ATPases associated with 
various cellular activities (AAA) ; cDNA EST 
EMBL:Z14623 

1246 

59 . 868 

a a f\ c 
4oub 

Ar iu4yiy 

Unknown Arabidopsis thaliana ABC1 protein 

V vjits : Au UUlJLoo) 

819 

45 . 946 

*fc O U / 


Homo sapiens keratin 

340 

98 . 214 

A p n o 

Ar j. ft o 0 *± y 

Drosophila melanogaster BcDNA. GH083 8 8 

907 

38 . 362 

A R n Q 

ffc O U J 

TTC Of CO 

Oryctolagus cuniculus ubicfu it in— conjugating 

fciIlZ,yulfci CiZ ~ J ZK 

1413 

99 . 515 

4810 

X76 013 

ntjino oapiciiD yj_u.i_ciuij.iiyj. T_.rCisj.f-v syHLnetaSc 

4 012 

100 . 000 


nf \JH / O Z> j? 

Caenorhabdi t is elegans No definition line found 

1162 

46 . 731 

4812 

AL080214 

Homo sapiens hypothetical protein 

693 

100.000 


AoU 3 Iz y^ 

Mus musculus proteolipid protein 2 

192 

30 . 208 


M.r' uo jZ7l 

Homo sapiens hT41 

925 

44 . 970 

4815 

AL109832 

Schizosaccharomyces pombe hypothetical protein 

175 

32 .632 

4816 

AF 070470 

Mus musculus SPARC- related protein 

1583 

58 . 530 

4 817 

Uz2 3 76 

Homo sapiens alternatively spliced product 
using exon 13A 

402 

62 . 245 

4818 

D50086 

Mus musculus neuropilin 

248 

37.500 

4819 

L2 0319 

Rattus norvegicus developmentally regulated 
protein 

221 

55 . 172 

4820 

M81088 

Rattus norvegicus EF-l-alpha 

165 

96.667 

4821 

U82 982 

Cavia porcellus GEC-3 

544 

47 . 399 

4822 

AF000195 

Caenorhabditis elegans similar to oxysterol- 
binding proteins 

1346 

47 . 899 

A Q O O 

ALU04 912 

Homo sapiens similar to CR16, SH3 domain 
binding protein; similar to 2205340A 

1309 

99.459 

4824 

D87973 

Mus musculus Impact 

396 

86.765 


V7C7 c (? 
A / D / DO 

nomo sapiens protein Kinase c_ mu 

675 

99 - 020 


Ar \J / UD14 

jiottio Sapiens urin.no wn 

o c c o 
2 bo 8 

lOO . 000 

4OO7 

JL> W O W D -7 

nomo sapiens jjxmau nomoioyue-z 


68 . 613 

4828 

AL117525 

Homo sapiens hypothetical protein 

172 

96 . 000 

4829 

U22376 

Homo sapiens alternatively spliced product 
using exon ±ja 

425 

73 .626 

zi A7 n 

4 O J w 

Ml 1 All 
1 V J J. J. ft O / 

Homo sapiens kininogen 

2074 

100 . 000 

4831 

AB005142 

Homo sapiens klotho 

576 

37 . 097 

4832 

Y18208 

Rattus norvegicus serine- threonine specific 
protein phosphatase, glycogen-binding (GL) 
subuni t 

873 

90.226 

4833 

U6 3 84 0 

D a f t" 1 1 Q nnrvprri r^nc tiii/^T onnriri r^t^zl 
kciluuo iiui vcy iluo iiuticupux xxi poft 

1 1 cc 
HDD 

QC OC7 

4834 

AL031588 

Homo sapiens dJ1163J1.2.1 (novel protein 
similar to C. elegans B0035.16 and bacterial 
tRNA (5-Methylaminomethyl-2-thiouridylate) - 
Methyltransf erases) (isoform 1) 

892 

97 .744 

4835 

AF116509 

Homo sapiens Ets transcription factor TEL- 2b 

2415 

100.000 
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4836 

AB011168 

Homo sapiens KIAA0596 protein 

8110 

99.836 

/ion 

AD D JO / 

Homo sapiens serine /threonine protein kinase 

1667 

99 . 595 

4838 

X86682 

Mus musculus HNP3 6 protein 

275 

34.320 

/I O "3 Q 

4 o 3 y 

A T7rt C C C C C 

Mu.s musculus kinesin light chain 2 

2075 

73.884 ! 

Q A f\ 

Ar IUUdd / 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00614 (PLDc) , Score=13.8, E- 
value=0.2, N=l 

217 

38.400 j 

/I Q A 1 

loll 

7\T /"111 /I /l "7 

Homo sapiens d»J12 6A5.2.1 (novel protein) 
(isoform 1) 

1670 

100 . 000 

4 Qii O 
ft Oft Z 


nOmO Sapiens illAAUozi. 

10277 

100 . 000 

A Q A ~i 

ft oft 3 

AT HQC7£a 
i\±jU "D /DO 

Homo sapiens dJ858Bl6.2 (novel protein similar 
to hamster PSSC (Phosphatidylserine 
Decarboxylase Proenzyme, EC 4.1.1.65) 

2108 

99 . 670 

4844 

U86074 

Homo sapiens tesmin 

206 

54.386 

4845 

AB023136 

Homo sapiens KIAA0919 protein 

4673 

100.000 

4846 

X87237 

Homo sapiens a-glucosidase I 

4405 

97 . 869 

4847 

AF016421 

Caenorhabditis elegans Similar to 
nitrophenylphosphatase ; coded for by C. elegans 
cujna yKjiiiii4 . 3 , coaea ror oy c . eiegans cuwa 
yttzo /co .a, coaea tor Dy l» . eiegans cuv*j\ 

344 

37 . 013 

4 848 

22 9121 

^acliui llaUUl U _L o cXcy alio ol\ / j / . _L 

TXA. 

1 Q Q C O 

4849 

AB020702 

Homo sapiens KIAA0895 protein 

834 

64.324 

a q n 

ttc c n-> 1 

oaccnaromyces cerevisiae <jjd^dp 

too 
13 o 

3 3 . 3 33 

ft o !3-l 

AdU^ y UUU 

tiomo sapiens r^±j\j\±u / / procein 

5570 

99 .750 

4. Q O 


rionio sapiens i\.ia/\iii^i procem 

33 99 

95 . 131 

A Q ^ "1 

i\r U / ft D U j 

Streptomyces griseus subsp. griseus NonF 

o c c 

38 . 571 

fl OD4 

Ar U o fJ y 

Homo sapiens JAW1- related protein MRVIlA long 
isoform 

584 9 

99 . 215 

4, p c c 

TOO J 

AT Oil *7ft£ 

ocm zosaccnaromyces poiruje nypocnecicai procem 

2 0 9 

33 .628 

1o Do 

P»ft 7Q^O 
DO / jUO 

Mus musculus nuclear protein np95 

512 

46 . 286 

4. ft ^7 
ft o O 1 

/AV_ U U / 1 / O 

Arabidopsis thaliana hypothetical protein 

£T o f\ 

6 8 0 

32 . 326 

4 858 

v"7 n ft n/i 

nils lllULo\_U J. U t> laJJI / 


on c\c\c\ 

4 859 

L v l j4 U J4 

La V I ci puX CcllUo CCJHiy 1 t:mc:IlC v_ j piOLclIl ^ Lair ^ 3 ) 

ir\ ~y~ en pii it - 

/I -J /T 

3 9. / /3 

4860 

M25579 

duo oaut n 0 auciiyxyi Ly Liabc xjy^tr J. 

O / XJ 

3X . j u u 

4861 

AT.03 1 644 

nllv J J. \J t M: 

o^lll^UoaLL-llalulliy^co LUc xly J^ivJ J.v_c*X ^lUUclll 

Z 7Z 

zo . jx j 

4 862 

Y17832 


1 TC 

ft c> . uuu 

4863 

M76411 

Escherichia coli cadA 

590 

100.000 

4864 

aRni 117 7 

Hnmn o n t^i i pno i/" T A D H ^ r^>~/^t*o"i n 
IaLJIIIvJ bapXcllo I\XnnUDU J UX UUcXil 

4 3 / 

JO . 3 / ft 

4865 


IIUHIU OdJJXCllO JLI1L, 1, XIl^JcL JJitJUtiill oIMir^J^ 

X?OZ 

1UU . UUU 

4866 

AT.11 7462 

Womn qani pn a Vivmot" t~ H pal nvri!*oi r*i 

-3 U O 

A AK1 
ft O . ft D / 

4867 

X6487R 

TTf^mO 0 t~\ ~\ pn q /— v-y--\/-4- /~\r~* "i n TPppn^rtT" 
rxwiuvj octLyxciio %_/Jt_y lul xii x cuc^lux 


1UU . UUU 

4868 

M92843 

TTomo caD"i ^t^cj 7 1 np f i rifiPT* f ranop'ri r^k f- ~5 f^na T 
nuiuu oapicllo I lliyci Ll alloLI XLJt LUllclX 

rpcnil 3 hoy 

o X. 

1 uu . UUU 

4869 

AF059569 

Homo saioieiis act in bindincf nrri fpin MAYVFIM 

305 

50 . 000 

4870 

AB023216 

Homo saoiens k TAAO 9 9 9 T>r*ot~ pi n 

7456 

QQ 70Q 

4871 

AF026794 

Mus musculus galectin-6 

201 

28.877 

4872 

AT.117478 

nuiuu ocxiyxdxo iiyjJuuiicLXUctx piULcxn 

ft O O 4 

QQ "71 

4873 

J04605 

Homo sapiens prolidase 

301 

91 . 837 

4874 

AF019236 

Dictyostelium discoideum TipD 

746 

39.869 1 

4875 

AB001451 

Homo sapiens Sck 

992 

100.000 

4876 

X85019 

Homo sapiens UDP-GalNAc : polypeptide N- 
acetylgalactosaminyl transferase 

902 

48.029 

4877 

U86074 

Homo sapiens tesmin 

2133 

100.000 


?0< 
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4878 

D86980 

Homo sapiens KIAA022 7 

501 

45.312 

4879 

D86979 

Homo sapiens KIAA0226 

234 

43 .662 

4880 

Z92809 

Caenorhabditis elegans predicted using 
Genefinder; similar to Thrombospondin type 1 
domain 

414 

30.126 

4881 

Z37139 

Unknown similar to guanine nucleotide binding 
protein; cDNA EST EMBL:T00917 comes from this 
gene ; c 

1556 

64 .756 

4882 

AF095286 

Homp sapiens guanine deaminase GDA 

793 

100.000 

4883 

AF126162 

Homo sapiens HERV-H LTR associating protein 2 

160 

100.000 

4884 

AB002331 

Homo sapiens KIAA033 3 

6685 

100. 000 

4885 

X85545 

Homo sapiens protein kinase 

427 

86 . 957 

4886 

AF149045 

Homo sapiens Sex comb on midleg homolog 1 
isoform 1 

1784 

92 .568 

4887 

U13149 

Pennisetum ciliare possible apospory-associated 
protein 

189 

45 .588 

4888 

U23172 

Caenorhabditis elegans No definition line found 

619 

37 .631 

4889 

AB002304 

Homo sapiens KIAA03 06 

9825 

99 . 724 

4890 

M32512 

Sus scrof a Na+ , K+ -ATPase alpha subunit 

167 

92 .308 

4891 

D10923 

Homo sapiens HM74 

530 

45 . 226 

4892 

AL049944 

Homo sapiens hypothetical protein 

193 

51 .613 

4893 

AF089745 

Homo sapiens FK5 06 -binding protein 

264 

80 .851 

4894 

M16538 

Homo sapiens G protein beta subunit 

745 

91 . 964 

4895 

U00063 

Caenorhabditis elegans No definition line found 

140 

20.792 

4896 

AF077831 

Homo sapiens tumor-related protein 

3255 

100.000 

4897 

M16141 

Gallus gallus ovoinhibitor 

184 

37 .681 

4898 

Z81030 

Caenorhabditis elegans similar to citrate lyase 
beta chain; cDNA EST yk302b4.5 comes from this 
gene 

602 

54 .023 

4899 

J04621 

Homo sapiens heparan sulfate proteoglycan core 
protein 

2591 

99.748 

4900 

AJ003125 

Homo sapiens procollagen I N-proteinase 

444 

33 .508 

4901 

AF053367 

Mus musculus carboxyl terminal LIM domain 
protein 

433 

45.578 

4902 

AF002668 

Homo sapiens MLD 

634 

72 .034 

4903 

Z82266 

Caenorhabditis elegans predicted using 
Genefinder; similar to WD domain, G-beta 
repeats 

959 

39 .118 

4904 

M28821 

Mus musculus Tcte-1 peptide 

202 

30.769 

4905 

AF076957 

Homo sapiens SDP1 protein 

526 

98 .824 

4906 

U76638 

Homo sapiens BRCAl-associated RING domain 
protein 

419 

40.957 

4907 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

390 

72 . 727 

4908 

AF060152 

Homo sapiens METHl protein 

6696 

100. 000 

4909 

AF117892 

Homo sapiens aspartic-like protease 

1132 

47 . 965 

4910 

U3 2 3 94 

Mus musculus Max- interacting transcriptional 
repressor 

280 

80 .000 

4911 

D21205 

Homo sapiens estrogen responsive finger protein 
letp) 

294 

37 .857 

4912 

AL034381 

Schizosaccharomyces pombe conserved 
hypothetical PFAM UPF0031 containing protein 

298 

40.708 

4913 

X63692 

Homo sapiens DNA (cytosine-5- ) - 
methyl transferase 

5745 

99.880 

4914 

Z75526 

Caenorhabditis elegans Weak similarity to 

692 

48.485 I 


got 
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Staphyloccus autolysin gene (TR:G76 5072) ; cDNA 
EST EMBL:M89336 comes from this gene; cDNA EST 
yk505dl2.3 comes from this gene 



4915 

AF000198 

Caenorhabditis elegans weak similarity to HSP90 

594 

30.952 

4916 

AB020721 

Homo sapiens KIAA0914 protein 

666 

56.716 

4917 

AF083246 

Homo sapiens HSPC028 

1540 

100.000 

4918 

AF001434 

Homo sapiens Hpast 

1630 

84 .698 

4919 

U36340 

Mus musculus BKLF 

956 

99.242 

4920 

AF126484 

Homo sapiens CARD4 

424 

44 . 944 

4921 

AF186469 

Rattus norvegicus TM6P1 

951 

90.385 

4922 

AC007071 

Arabidopsis thaliana hypothetical protein 

221 

26 .400 

4923 

AC003093 

Homo sapiens OXYSTEROL- BINDING PROTEIN; 45% 
similarity to P22059 (PID:gl29308) 

503 

53.731 

4924 

AF067660 

Mus musculus Bel -2 homolog 

407 

47 .368 

4925 

AF145204 

Homo sapiens E2a-Pbxl-associated protein 

1458 

79.333 

4926 

X69089 

Homo sapiens 16 5kD protein 

815 

96.875 | 

4927 

AF022080 

Homo sapiens R- ras3 

302 

100.000 

4928 

AC006585 

Arabidopsis thaliana hypothetical protein 

212 

25.108 

4929 

D42047 

Homo sapiens The ha3662 gene product is related 
to mouse glycerophosphate dehydrogenase . 

2715 

100.000 

4930 

AB028998 

Homo sapiens KIAA1075 protein 

9734 

99 .427 

4931 

AF051162 

Drosophila melanogaster SLOB 

443 

39.574 

4932 

AF001533 

Mus musculus mitogen- induced 

497 

98 . 750 

4933 

Z72604 

Saccharomyces cerevisiae ORF YGL082w 

230 

27 .500 

4934 

Z70780 

Caenorhabditis elegans cDNA EST yk465dl0.3 
comes from this gene; cDNA EST yk465dl0.5 comes 
from this gene; cDNA EST yk481d9.5 comes from 
this gene 

548 

49.032 

4935 

AB007948 

Homo sapiens KIAA04 7 9 protein 

2327 

100.000 

4936 

AF187318 

Homo sapiens F-box protein Fbx2 

873 

48 .289 

4937 

Y17048 

Rattus norvegicus caldendrin 

1274 

94 .500 

4938 

AB020626 

Homo sapiens KIAA0819 protein 

349 

39 . 583 

4939 

L06237 

Homo sapiens microtubule-associated protein IB 

671 

35.782 

4940 

U81491 

Mus musculus polyhomeotic 2 

197 

44 . 776 

4941 

AF007787 

Enterococcus faecalis orfC 

309 

44 .444 

4942 

AF058919 

Arabidopsis thaliana regions of weak similarity 
to 1- asparaginase 

267 

45.556 

4943 

AF051155 

Rattus norvegicus G beta- like protein GBL 

1280 

94 .388 

4944 

X57108 

Homo sapiens cerebroside sulfate activator 

338 

100.000 

4945 

AL117589 

Homo sapiens hypothetical protein 

1787 

100.000 

4946 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

185 

88 .571 

4947 

AL050157 

Homo sapiens hypothetical protein 

747 

100.000 

4948 

AF090133 

Rattus norvegicus lin-7-A 

860 

96 . 350 

4949 

U37150 

Bos taurus peptide methionine sulfoxide 
reductase 

1402 

89.238 

4950 

AF025441 

Homo sapiens Opa- interacting protein OIP5 

1557 

100.000 

4951 

Z69240 

Schizosaccharomyces pombe putative 
amidohydrolase 

996 

52 . 708 

4952 

U21309 

Caenorhabditis elegans No definition line found 

307 

39 . 157 

4953 

AB018188 

Bos taurus myocilin 

323 

35 . 583 

4954 

AL035424 

Homo sapiens dA22D12.1 (novel protein similar 
to Drosophila Kelch (Ring Canal protein, KEL) 
and a heterogenous set of other types of 
proteins) 

382 

66 .667 
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4955 

AL031261 

Schizosaccharomyces pombe conserved 
hypothetical protein 

400 

35 .519 

4956 

U55376 

Caenorhabditis elegans coded for by C. elegans 
cDNA cm21e6; coded for by C. elegans cDNA 
cm01e2; similar to melibiose carrier protein 
{thiomethylgalactoside permease II) 

515 

35.350 

4957 

X78933 

Homo sapiens zinc finger protein 

1012 

74 . 731 

4958 

AF007139 

Homo sapiens unknown 

1597 

98 . 780 

4959 

D83327 

Homo sapiens DCRR1 

390 

23 . 862 

4960 

Z81052 

Caenorhabditis elegans Similarity to Yeast 
ABC1P protein (SW : ABC1_YEAST) ; cDNA EST 
yk229g8.3 comes from this gene; cDNA EST 
yk229g8.5 comes from this gene 

613 

42 .326 

4961 

AF118274 

Homo sapiens DNb-5 

1759 

100. 000 

4962 

AL050197 

Homo sapiens hypothetical protein 

496 

48 .529 

4963 

AL031393 

Homo sapiens dJ733D15.1 (Zinc- f inqer protein) 

3520 

100.000 

4964 

L07780 

Bos taurus UDP-GalNAc : polypeptide , N- 
acetylgalactosaminyl transferase 

663 

34 .713 

4965 

AF155117 

Homo sapiens NY-REN-62 antigen 

505 

49.708 

4966 

X80903 

Mus musculus DELTA- like 1 

716 

40.240 

4967 

AB002323 

Homo sapiens KIAA0325 

13746 

100.000 

4968 

AB014520 

Homo sapiens KIAA0620 protein 

13395 

99. 950 

4969 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

390 

71 .622 

4970 

AF154671 

Homo sapiens CRB1 

9674 

99.626 

4971 

U29096 

Caenorhabditis elegans coded for by C. elegans 
cDNA yk44f2.5; similar to P59 protein (HSP 
binding immunophilin) and to TPR domain 

527 

38 . 000 

4972 

AF125188 

Homo sapiens adenosine deaminase acting on tRNA 
1 

301 

91 . 667 

4973 

D26549 

Bos taurus bovine adseverin 

4422 

90 . 795 

4974 

AJ007 7 98 

Homo sapiens nuclear protein SA3 

481 

92 . 000 

4975 

AF068749 

Mus musculus sphingosine kinase 

175 

41 . 791 

4976 

AL031685 

Homo sapiens dJ963K23.2 (novel protein) 

1621 

100. 000 

4977 

AF187552 

Homo sapiens ezrin 

201 

90 . 909 

4978 

U95098 

Xenopus laevis mitotic phosphoprotein 44 

611 

71 . 774 

4979 

X77383 

Homo sapiens cathepsin O 

1878 

100. 000 

4980 

Z49967 

Unknown cDNA EST EMBL:T00743 comes from this 
gene; cDNA EST EMBL:D693 56 comes from this 
gene ; cDNA 

535 

32 . 240 

4981 

Z78542 

Caenorhabditis elegans similar to Mitochondrial 
carrier proteins; cDNA EST EMBL:T01651 comes 
from this gene 

250 

54.412 

4982 

D87757 

Mus musculus ECF-L precursor 

762 

68. 919 

4983 

L20319 

Rattus norvegicus developmentally regulated 
protein 

670 

83.636 

4984 

U4 9057 

Rattus norvegicus rA9 

1098 

70 . 472 

4985 

AC007504 

Arabidopsis thaliana Hypothetical Protein 

1211 

35.835 

4986 

X61047 

Hydra sp. mini -collagen 

197 

39. 759 

4987 

U85969 

Xenopus laevis middle molecular weight 
neurot ixamenn protein NF-M(l) 

259 

29. 102 

4988 

AL022238 

Homo sapiens dJ1042Kl0.4 (novel protein) 

793 

51. 029 

4989 

AC006128 

Homo sapiens Human homolog of Mus musculus wizS 
protein 

382 

100.000 

4990 

U90880 

Fugu rubripes zinc finger protein 

1285 

75. 336 

4991 

U23175 

Caenorhabditis elegans similar to anion 

1184 

46 . 250 
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exchange protein \ 




AT nil TOO 

Schizosaccharomyces pombe putative 
mitochondrial inner membrane protease s ubun i t 2 

203 

55 . 000 

4993 

U96113 

Homo sapiens WW PI 

4692 

99.853 

4994 

X57228 

Rattus norvegicus beta COP 

4867 

98.830 

4995 

U35776 

Rattus norvegicus ADP-ribosylation factor 1- 
directed GTPase activating protein 

1453 

80. 505 

A Q Q £T 

AF040944 

Mus musculus P14 0 

454 

33 . 158 

/I Q O "7 

AC007858 

Oryza sativa Similar to gb|U43629 integral 
membrane protein from Beta vulgaris and is a 
member of the sugar transporter family 

F r | UUUoJ . bib 

677 

37. 243 

4998 

AL117654 

Homo sapiens hypothetical protein 

2059 

100 . 000 

4999 

AB006629 

Homo sapiens KIAA02 91 

6027 

99. 895 

5000 

L04673 

Saccharomyces cerevisiae phosphatase 

198 

30.000 

5001 

AF047659 

Caenorhabditis elegans No definition line 
found 

291 

51.020 

5002 

U29378 

Caenorhabditis elegans No definition line 
found 

910 

53 .462 

5003 

U23515 

Caenorhabditis elegans weakly similar to 
gastrula zinc finger protein 

375 

40.000 

5004 

D64001 

Synechocystis sp. hypothetical protein 

396 

39.888 

5005 

AL035416 

Homo sapiens dJ782L23.1 (HOOK1) 

475 

60. 294 

5006 

S66407 

Homo sapiens receptor tyrosine kinase isoform 
FLT4 long, FLT41 {C- terminal} 

498 

100.000 

5007 

AB015630 

Homo sapiens type II membrane protein 

388 

48.819 

5008 

Z70269 

Unknown predicted using Genefinder; Similarity 
to Yeast hypothetical protein YHG1 
(SW:YHG1 YEAST) ; 

1526 

44 . 857 

5009 

AF105228 

Bos taurus tuftelin 

711 

94 . 915 


U2316 9 

Caenorhabditis elegans No definition line 
round 

329 

51. 923 

Cftl 1 

OU1 J. 

AdUz Ufa j / 

Homo sapiens KIAA083 0 protein 

3451 

100 . 000 

jU lz 

Ml i rti o 
nj XUX J 

Homo sapiens nonmuscle myosin heavy chain 

/ "MA/TLX/""' \ 

7708 

99.840 

5013 

AF125158 

Homo sapiens zinc finger DNA binding protein 
99 

635 

100 . 000 

5014 

AE001719 

i iit;i iiiouoy ci ludriciiiia jn/ujm oxiaase, putative 

221 

26 . 126 

5015 

U82163 

yiy^Luiayua uuiiicuius iNo uci lnic ion line round 

364 

53 . 571 

5016 

AB006013 


214 

49 . 180 

5017 

L26955 

o t-i t!|->u.t>iiiy^t;t> vt;j. Liciiiua oecd - nyaroxyiase 

313 

51 . 579 

5018 

AF140501 

null IV-/ ocl]JXt:XJo lUiUj vD 

468 9 

99 . 441 

5019 

X98374 

Rattus norvegicus KIS 

963 

98.621 

5020 

AF172849 

Homo sapiens AIM-1 protein 

3564 

100.000 

5021 

Z72795 

Saccharomyces cerevisiae ORF YGROlOw 

655 

41 . 538 

5022 

AB020682 

Homo sapiens KIAA08 75 protein 

4162 

98.885 

5023 

U90908 

Homo sapiens unknown 

352 

41 . 007 

5024 

Z70034 

Unknown similarity to 35.1KD hypothetical 
yeast protein (Swiss Prot accession number 
P38 805) ; cDNA 

1020 

51.678 

5025 

AB000280 

Rattus norvegicus peptide/histidine 
transporter 

3185 

87. 057 

5026 

AB029010 

Homo sapiens KIAA1087 protein 

6354 

100.000 

5027 

AC005620 

Homo sapiens R33 5 90 1 

2089 

100.000 

5028 

M38222 

Trypanosoma brucei procyclic acidic repetitive 

182 

88.462 


801 
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protein 



5029 

ABQ2 9018 

Homo sapiens KIAA1095 protein 

7344 

99.454 

5030 

AB018299 

Homo sapiens KIAA0756 protein 

5678 

99.761 

5031 

AB019280 

Mus musculus sprouty-4 

1559 

95.853 

5032 

AB007948 

Homo sapiens KIAA04 79 protein 

519 

36.634 

5033 

U22376 

Homo sapiens alternatively spliced product 
using exon 13 A 

202 

69.565 

5034 

U21317 

Caenorhabditis elegans similar to 
SP:YR40_BACSU (P37512) hypothetical 78.8 kD 
protein in TETB-EXOA intergenic region 

566 

31 .268 

5035 

AB018324 

Homo sapiens KIAA0781 protein 

4316 

100.000 

5036 

AL031525 

Schizosaccharomyces pombe ubiquitin carboxyl- 
terminal hydrolase 

987 

42 . 147 

5037 

D63477 

Homo sapiens The KIAA0143 gene product is 
related to a putative C. elegans gene encoded 
on cosmid C32D5. 

496 

62 . 500 

5038 

U50193 

Caenorhabditis elegans weak similarity to 
SP:YAD5_CLOAB (P33746) hypothetical protein 
and to PIR:C4 8583 stress -inducible protein 
STI1 

706 

35.135 

5039 

Z74031 

Unknown Similarity to Yeast D- lactate 
dehydrogenase ( SW : DLD1_YEAST) ; cDNA EST 
EMBL:C12235 comes fro 

701 

50.495 

5040 

AB009593 

Tetragenococcus halophilus xylose transporter 

169 

35 . 294 

5041 

U97107 

Mus musculus membrane glycoprotein CIG3 0 

651 

48 . 901 

5042 

AF055084 

Homo sapiens very large G-protein coupled 
receptor- 1 

689 

100.000 

5043 

X55019 

Homo sapiens acetylcholine receptor delta 
subunit 

1138 

99.408 

5044 

X70223 

Rattus norvegicus peroxisomal membrane protein 

882 

76 . 829 

5045 

M29015 

Mus musculus ribosomal protein L7 

689 

43 . 983 

5046 

Z99124 

Bacillus subtilis urocanase 

414 

33 . 772 

5047 

AC004010 

Homo sapiens similar to Leucine- rich 
transmembrane proteins; 44% similarity to 
U42767 (PID :gl736918) 

474 

39.819 

5048 

AF087940 

Homo sapiens basic helix- loop-helix 
transcription factor HAND2 

433 

98.529 

c n a q 

Ml Q A A 1 

Mus musculus Kruppel- related protein 

767 

87 . 500 

5050 

AC006017 

Homo sapiens similar to ALR; similar to 

AAC51735 vPID :g2358287) 

2363 

98 . 596 


Ar 14boli 

Drosophila melanogaster BcDNA. GH03108 

498 

30 . 278 


AT 1 1 *7 /l *1 C 

Homo sapiens hypothetical protein 

681 

49 . 315 

c n c *a 

AT 1 1 T C "7 Q 

Homo sapiens hypothetical protein 

1812 

100 . 000 

Z> U Dft 


Homo sapiens alternatively spliced product 
using exon ija 

333 

65 . 909 

5055 

AJ010901 

Homo sapiens MUC4 

8100 

99.654 

5056 

X14549 

Chlamydomonas reinhardtii spoke protein 

565 

49.412 

bUb / 

A(_ UUyj^Z 

Arabidopsis thaliana Hypothetical protein 

235 

24 . 460 

bUbo 


Homo sapiens TRGV10 

669 

99 . 057 

5059 

u j j j ^ j 

du o Ldurufa JjuLyropni lin 

115 

31 . 343 

5060 

AC005587 

Homo sapiens similar to meningioma-expressed 
antigen 6 ( MEA6 ) ; similar to U94780 
(PID:g2231999) 

1868 

99.612 

5061 

U27486 

Pseudorabies virus EP0 

109 

38.806 

5062 

AF041106 

Rattus norvegicus tulip 1 

729 

60. 571 
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5063 

AF077032 

Homo sapiens sec6l homolog 

971 

96.026 


7AT 1 1 "7/1 1 Q 
ALII /ft JO 

nomo sapiens nypocnecicai protein 

6181 

99 . 891 

3 UO 3 

TTO (Z(Z fl 
UOJD O U 

noinu sapiens mux c lciruy resiscance-dssociateci 
protein homolog 

158 

66 . 667 

3 \J D O 

atrm oo.fra 

<ri<T VJ 1 / 3 O _7 

Hub IdUo^Ul Uo LaUlUy clll Ldl Uyspidola piOtcin 3 

O "7 O 

O /o 

61 . 765 

3 VJO / 


nomo sapiens caspase-iijce apopcosis jreguiacoiry 

242 

90 . 698 

5068 

U02098 

Miiq mi 1 q 1 T 1 1 ci T5i i v* -» a 1 'rtVia 

IJUo IIIUO^UIUO XT LXJL a J. JL^ilcl 

o n q 

3 u . /by 

506 9 

AF02782G 

T-I<^»tY1*^ Q A T*\ T Oil Ct i H D f" n ^ro e mrori a o a f~ ->^o ^ otnomKv* •!■> v\ a 

nuuiu oapicno puLaLivc stJvtiil pay t> UX olloiltcniDr allc 

1 O 1 
J J J 

-5 o . 994 

5070 

Z22181 

Caenorhabditis elegans ZK632.11 

299 

36.527 

5071 

AF058445 

Gallus gallus histone macroH2Al . 1 

193 

27.326 

5072 

AC007228 

Homo sapiens BC37295 2 (partial) 

575 

58.915 

DU / i 

ZiA.Z. 1 / 3 

Homo sapiens N- ace tylglucosamine-6 -sulphatase 

1861 

100 . 000 

3U /fk 


Homo sapiens hypothetical protein 

5031 

100 . 000 


Uz J 3 ±4 

Caenorhabditis elegans similar to S . 
cerevisiae SSD1 protein (SP : SSD1_YEAST, 
fzfiz/o; ana co ti. con vAtB ana Ki-Donuciease 

TT noil o <3 

1365 

45.591 

5076 

AF167319 

Mus musculus zinc finger protein ZFP112 

834 

87 . 121 


7AT?1 T Q71 T 
Ar 11? /II 

riomo sapiens cysiiii l» j. Ufi recepcor 

2233 

99 . 703 

5078 

AF143171 

Mus musculus high affinity immunoglobulin 
gamma r c recepcor i 

364 

47 .692 

^no q 

TTA Tl Qd 
Uf±3 J. 2? ft 

rius museums rnopniiin 

2 14 

4 7 .2 97 

DUOU 


rioiiio sapiens riomoiog or rau a /is. protein 

T — i f~\ /"^ 1 1 « 

pruuucL 

16 9 

50 . 000 

5081 

7AP0 ^ 9"* 90 

noiilO bdpieuo VdaCUldl cUUOLllcl lal Cell grOWCn 
"i tiHt V"H t*or 

±1111 J-Jk/J. LWX. 

llO 3 

1 /\A r\ r\ e\ 

lOO . 000 

5082 

AF027514 

"UiiiU oapicllo ZilXXv^ llilvjci piULclil 

bbl 

1UU . uuu 

5083 

AL035478 

Streptomyces coelicolor putative transferase 

187 

32.479 \ 

3 KJ O ** 


riLJiiitj adpiciia iMHrtu 0 j 1 protein 


4 7 . 664 

508 5 


nuiiiu sdpiciio maaiuj / pruLcin 

241 

100 . 000 

5086 

AT.l i m 24 

tWJX. 1U1Z1 

rivjlllvJ bdpicllb IiypULIlcUlCdl piOLclIl 

O Q C 

Z o o 

3 0. 622 

5087 

avxt J l \-> -7 J. \J 

nuillU bdpieilb pULdLXVt; I lJjUIlUClcdSc 111 

o o *a o 

n rtft 

10U . 000 

5088 

AF072758 

Mus musculus fatty acid transport protein 3 ; 
FATP3 

1219 

87.864 

3 \J O J 

Uw/. JDl 

rtidDiuopsis unanana nypotnecicai procem 

271 

25 . 123 


npnn cot o 

riomo sapiens rooiiA-c 

23 5 

30 . 114 

5091 

AT.07A 

Ai dl/lUUpo lo Lildlldlld pULdLlvc pi ULclIl 

1 Q 1 

O O O 1 c 

2 / . 2 lb 

5092 

247811 

uiirviiowii aimiidi t.o yiyceropnospnoryi oiesuer 

^llUO^llUUlCDLClaoc UUIIICllH/ ^1/1X7-1 J&O 1 1_>1 It3J_» . VJ Z> / u4 Z 

comes f 


4 J . OOO 

5093 

AiJ2 4 2 97 6 

Homo ci^r^-} ckn a nOAl nrnhpi n 

ZliU 

on Add 

5094 

AC004077 

Af^lhi rtonfl "i r t"haliPiT!a t>i i t~ Pi t" "i vp *=*nrt 1 1 nrnh pin 

-A- VAw k/ 47 -i, i^j y — _i_ ana i—/ n i — ci l- v v^i xvi » ^> l v-/ w 1X1 

OCT 
-7 3/ 

3D . ^ / fx 

5095 

X14968 

Homo <5p5o i pnej pTT-al T»V»^i Qiihim -i t - ( 7\7\ 1 — A C\ A \ 

>J U Jt^ Clio 1\ JL JL CI -1- K-V -1 XCa O U4.J_/ 1 _J_ |_. y J- 1 ut ^ 

£ O "2 

i7U . / f» 1 

5096 

AF118562 

Rar*t"ii e ; nnTVPrri n i a ovoot** i n — T 

o U 3 


5097 

AF096771 

Homo c; rar-^ -j pn c: amoot"h miiQfl p /nnnmncrl p ihvaci n 

light chain kinase 

0£ Q 

zoo 

Afl OAT 
f* Lf . / ft 1 

5098 

AB028955 

Homo sapiens KIAA1032 protein 

6551 

1 V W . W W V/ 

5099 

AL117537 

Homo sapiens hypothetical protein 

1799 

100 . 000 

5100 

U95044 

Homo sapiens zinc finger protein 

975 

76.571 

5101 

U14372 

Rana catesbeiana myosin I alpha 

183 

96 .552 

5102 

X95073 

Homo sapiens Translin associated protein X 

437 

100.000 

5103 

U16800 

Xenopus laevis ribonucleoprotein 

510 

89.157 

5104 

AF100421 

Rattus norvegicus p80 

1617 

82 .524 
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5105 

Y17267 

Mus musculus ubiqui tin -conjugating enzyme 

4532 

93 .690 

5106 

AB028949 

Homo saoiens KIAA1G2 6 Drotein 

■1400 

inn nnn 

5107 

AB011126 

Homo sapiens KIAA0554 protein 

1560 

48.527 

5108 

AL0344 91 

hvoothetical oro^p i n 

ZoKJ 

4© . j^ii 

5109 

AF061025 

Homo sapiens leucine zipper-EF-hand containing 
transmembrane protein 1 

1521 

98.413 

5110 

AF083340 

Homo sapiens double- stranded RNA-binding zinc 
finger protein JAZ 

1996 

100.000 

5111 

AF042001 

Homo sapiens zinc finger protein slug 

R t> 

I 1 . D -5 D 

5112 

AB008548 

Mus musculus type 1 procollagen C-proteinase 
enhancer protein 

1091 

41.176 

5113 

U70854 

uaciiui. iiauu j, v, AO cicyaiio o xul± lai t-O CiUUcXULUCCUa 

faecalis TRAB (GI: 388268) 

0 a c 

ODD 

/i c m n 
4i> . 0 /0 

5114 

AB023419 

Mus musculus mSox7 

631 

91.176 

5115 

270270 

Lat;nuincijju.iLi3 eieyans preaiccea using 

f^or-i a f 7 -j T1 <=» Y~ - Si mi 1 ari hv \~ c\ Monao anrri nt-ona-i n TT 
vj^iici, xiiuci. / o J-iu J. J-dX. J. i iKJ Uoc dlly 1U Lcllo _LI1 -L _L 

receptor (SW:AG2R MOUSE) 

18 7 

35 . 106 

5116 

Z83230 

Caenorhabdit" i <i <=*erAr»<3 rDMA t?ct -v/Vo. ccn-o c; 
v_-ca ci x. iiciuu _i_ t_ j_ c icy ciiio >— j_/am/-\ c*o a y js. j j jy j . 

comes from this gene; cDNA EST yk645c2.3 comes 
from this gene 

0 t n 

£. X U 


5117 

AT.021748 


1 A f\ 

14 U 

22 . 826 

5118 

U88308 

Caenorhabditis elegans No definition line 
found 

807 

60.185 

5119 

AF14 9413 

y ri V"\ "i (~ > iX~\ C3 "i a t~ Vl J^J H pnn t"a t na q -i m -I T ra>~-i k~ •% r /-\ 
ni awA uu^s x o L-iic* x lctna ^—wIlt_cl_LIlL3 oXIIlXXdl 1 *~ y CO 

histone deacetylases ; Pfam PF00850, 

U V» V-/ X. — » -L. J . J f d — _J d J. V/ f AM — JL 

O 1 A 

58 . 730 

5120 

Z99129 

Homo sapiens dJ425C14.2 (Placental protein 
DIFF33 LIKE) 

2021 

100.000 

5121 

U66496 

Homo saDiens l^fitin '^~£*^*'*^=>'r^t•o'r• 

-i j J 

DO ACT 

5122 

L37380 

Rattus norvegicus apical endosomal 

^-j .a, y > — - ys x. v_-/ V- ' — xii 

860 

72 . 626 

5123 

X74946 

GallllS GPl 1 1 1 1 q nha -M-^rphvl 1 ar>l*r»cami n-i Ho 

alpha -2,6- s ialyl trans f erase 


a 1 1 c "a 

0 1 . X Z>3 

5124 

AB011536 

Homo sapiens MEGF2 

94 1 P. 


5125 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

428 

70 

/V - XU J 

5126 

L07045 

Strongylocentrotus purpura tus fibropellin c 

1007 

45.455 

5127 

AB014531 

Homo sapiens KIAA0631 protein 

4250 

99.842 

5128 

AL050100 

Homo sapiens hypothetical protein 

1137 

100 000 

-i- w w . u u w 

5129 

AC004770 

Homo sapiens BC26 973 0 2 

3002 

99 . 765 

5130 

AF097518 

Homo sapiens 1 iver- specif ic transporter 

927 

100.000 

5131 

AB007876 

Homo sapiens KIAA0416 

318 

28 .497 

5132 

AF069442 

Arabidopsis thaliana putative 
ribonucleoprotein 

1 1 
x 0 X 

->r\ q *7 0 

5133 

U78978 

Homo sapiens putative ATPase 

2177 

^ / . DZD 

5134 

Z46241 

Unknown carboxyl terminus of the predicted 
protein shows similarity to chimaerin; cDNA 
EST EMBL:Z14 

4 75 


5135 

AL117444 

Homo sapiens hypothetical protein 

2760 

100.000 

5136 

AB014532 

Homo sapiens KIAA0632 protein 

4861 

100.000 

5137 

AC002398 

Homo sapiens F2 596 5 1 

702 

96.117 

5138 

AF152361 

Drosophila melanogaster Kua protein 

909 

64 . 706 

5139 

AB021179 

Homo sapiens HEXIM1 protein 

365 

47 . 586 

5140 

U92010 

Rattus norvegicus lin-10 protein homolog 

2248 

96. 997 
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5141 

J04695 

Mus musculus alpha- 2 type IV collagen 

4422 

87.816 

5142 

AL009196 

Unknown /predict ion= (method : " "genef inder " " , 
version : " " 084 " n ) ; 

/predict ion= (method : " "genscan" " , ve 

1136 

48.092 

5143 

AE0013 94 

Plasmodium falciparum predicted integral 
membrane protein 

111 

59.259 

5144 

AF065389 

Homo sapiens tetraspan NET- 4 

490 

47.794 

5145 

AB012886 

Mus musculus mac25 

470 

37.561 

5146 

AJ006692 

Homo sapiens ultra high sulfer keratin 

756 

88.776 

5147 

AB030502 

Xenopus laevis XDRP1 

174 

60.345 

5148 

AB023143 

Homo sapiens KIAA0926 protein 

328 

53.125 

5149 

X99140 

Homo sapiens type II intermediate filament of 
hair keratin 

1885 

100.000 

5150 

AB013607 

Mus musculus c2 9 

1331 

77.985 

5151 

AC006053 

Arabidopsis thaliana putative potassium 
transport protein 

149 

36.905 

5152 

AF176838 

Homo sapiens N-acetylglucosamine 6-0- 
sulf otransf erase 

620 

100.000 

5153 

U14417 

Homo sapiens Ral guanine nucleotide 
dissociation stimulator 

382 

34.400 

5154 

U55208 

Homo sapiens myosin Vila 

458 

71.111 

5155 

AF188634 

Drosophila melanogaster F protein 

766 

51.373 

5156 

Z70203 

Caenorhabditis elegans cDNA EST yk414c9.3 
comes from this gene; cDNA EST yk414c9.5 comes 
from this gene 

631 

32.099 

5157 

AB021981 

Homo sapiens UDP -N-acetylglucosamine 
transporter 

292 

30.396 

5158 

AB014527 

Homo sapiens KIAA0627 protein 

259 

97.500 

5159 

AL021571 

Caenorhabditis elegans predicted using 
Genef inder 

282 

48.837 

5160 

U10861 

Rattus norvegicus calpain small subunit 

220 

31.897 

5161 

X07715 

Homo sapiens protein Po (partial) (217 is 2nd 
base in codon) 

1909 

98.551 

5162 

X85992 

Mus musculus semaphorin C 

1542 

80.645 j 

5163 

U00050 

Caenorhabditis elegans similarity across 
entire gene to DNA-directed RNA polymerase 

474 

71.111 

5164 

U46569 

Homo sapiens aquaporin-5 

1693 

99.623 

5165 

AF125955 

Caenorhabditis elegans contains similarity to 
AMP-binding domains (Pfam: PF00501, 
Score=82.1, E=l.le-20, n=l) ; similar to long- 
chain- f atty-acid- -CoA ligases 

1244 

43.875 

5166 

U41548 

Caenorhabditis elegans weak similarity to 
hemolysins 

202 

48.333 

5167 

X69490 

Homo sapiens titin 

838 

99.231 

5168 

Z80220 

Unknown Similarity to yeast protein TREMBL ID 
E246895); cDNA EST EMBL:T00018 comes from this 
gene ; 

774 

56.944 


ABOH109 

Homo sapiens KIAA053 7 protein 

511 

42.810 

3± /{J 

J04204 

Bos taurus 32 kd accessory protein 

962 

69.951 

5171 

AF095193 

** ui,iu oayxclia tmyj- idmiiy moiecuiar cnaperone 
regulator- 3; BAG- 3 

3926 

99 . 304 

5172 

U50927 

Rattus norvegicus zinc transporter ZnT-2 

1420 

86.166 

5173 

AC004472 

Homo sapiens PI. 11659 5 

397 

54.867 

5174 

M62972 

Drosophila melanogaster RP140-upstream 

161 

31.111 

5175 

Z71180 

Caenorhabditis elegans similar to BPTI/KUNITZ j 

442 

41.146 


Sl3 
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inhibitor domain* rnwa i?ct PMRT.«r»fifl?Q"? prima o 
from this gene; cDNA EST yk448h4.5 comes from 
this aene* cDNA EST t> 4 Q tz rompq f rnm t-"h-! a 
gene; cDNA EST yk448h4.3 comes from this gene 



5176 

X53556 

Bos taurus type X collagen 

£ "7 

0 3/ 

*± Z . 

5177 

AB018295 

Homo sapiens KIAA0752 protein 

J»- «c o z> 

Oil AlfZ 

5178 

AE001079 

Archaeoglobus fulgidus acetyl -CoA synthetase 
(acs-2) 

3 O 3 

O Q Ol Q 

5179 

AF097707 

Bos taurus serine protease 

395 

80.263 

5180 

X68011 

Homo sapiens ZNF81 

lion 

XXOu 

O «* • O KJ \J 

5181 

M13536 

Homo sapiens ceruloplasmin 



5182 

Z36531 

Homo sapiens fibrinogen- like protein 

500 

38.462 

5183 

X59372 

Homo saoiens homeobox r-iyot - o -i n 


o o coo 
y o . 5 J 8 

5184 

AF030558 

Rattus norvecricus Dhosnhaf i Hvl i nocti t*r»l c: _ 
phosphate 4 -kinase gamma 


ob.oUU 

5185 

L27479 

Homo sapiens X123 

TOO 

y / . yby 

5186 

AC005005 

Homo sapiens similar to smoothelin; similar to 
PID :g4128006 

3446 

99.811 

5187 

AC005328 

Homo sapiens R26660 1, partial CDS 

521 

97.436 

5188 

X52140 

28 to 1152) 

o c c c 

83 . 084 

5189 

AF151977 

Homo SSDiens omhan npurnfranQmi ht*pr 
transporter NTT5 


yo . oo / 

5190 

AL035461 

Homo sapiens dJ967N21.5 (novel MCM2/3/5 family 
member) 

2110 

99. 699 

5191 

AF064553 

Mus musculus NSD1 protein 


oU . bUb 

5192 

S67156 

Homo sapiens aspartoacylase, ASP 

669 

44 . 131 

5193 

AJ000522 

Homo saT)i£*n£? axnnpmal Hvnpi n boa -\.r\r r^HaTn 
'j**j^'xc;iio aAviiciuax uyxicxil lied V y k_ 1 1 cl J_ 1 1 


q o "ion 
y o . / o U 

5194 

U22376 

Homo sapiens alternatively spliced product 
using exon 13A 

199 

76.190 

5195 

AJ010901 


429 

81 . 159 

5196 

AF128406 

Homo Sam on a i^v^pnvl -HpnpnHpnh vsrol am'! n TV 
■ AAWlllv ^' oapxciio piciiyx ucjJciiuciiL px cxaUlin J\ 

bindincr Drotein Marf 

r a a 

52 . 143 

5197 

AC006014 

Homo sapiens similar to KIAA0618 and nuclear 
enveloDe Drotein POM ~\ "?i • qimilar to 
PID:g3327050 and P52591 (PID : gl709213 ) 

921 

94. 702 

5198 

U47924 

Homo sapiens C9 

44 5 

3 yj . joz 

5199 

D25215 

Homo sapiens KIAA0032 

COT 


5200 

M26217 

Gallus gallus prolyl 4 -hydroxylase , alpha 
subunit (EC 1.14.11.2) 

233 

30. 500 

5201 

U80736 

Homo sapiens CAGF9 

197 

78 . 94 7 

5202 

AC005328 

Homo sapiens R26660 1, partial CDS 

321 

80. 357 

5203 

Z71181 

Caenorhabdit is elegans similar to hydrolase 

^ X o 


5204 

D83777 

Homo sapiens expressed ubiquitously with 
strong expression in brain 

778 

49.351 

5205 

AF164623 

Homo sapiens trypsin- like serine protease 

171 

t^ft Q7/1 
JO . 7 / ^ 

5206 

AF117064 

Homo sapiens transitional epithelia response 
protein 

1016 

100 .000 

5207 

AL080318 

Arabidopsis thaliana stress- induced protein 
stil-like protein 

215 

27. 451 

5208 

AB023155 

Homo sapiens KIAA093 8 protein 

658 

68. 939 

5209 

L15313 

Caenorhabdit is elegans homology with leucine 
aminopeptidase; coded for by C. elegans cDNAs 
CE2F12 (GenBank: Z14714) and CE15D11 (GenBank: 
Z14518) ; putative 

1227 

46 . 991 


8iM 
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5210 

AC006841 

Arabidopsis thaliana hypothetical protein 

308 

24 .903 

5211 

AF169963 

Homo sapiens WNT16 protein 

2594 

100.000 

5212 

AB007877 

Homo sapiens KIAA0417 

173 

54.167 

5213 

Z73974 

Caenorhabditis elegans cDNA EST yk291f5.3 
comes from this gene; cDNA EST yk291f5.5 comes 
from this gene; cDNA EST yk638b3.3 comes from 
this gene 

202 

28.090 

5214 

U33630 

Mus musculus myeloid ecotropic viral 
integration site- lb 

664 

94 .681 

5215 

X75314 

Homo sapiens SEB4D 

1491 

97 . 817 

DZlt> 

X98 62 5 

Mus musculus sialic acid- specif ic 9-0- 
acety lest erase 

742 

74 .306 

Oil / 

tto one 

Homo sapiens alternatively spliced product 
using exon 13A 

451 

— 

69 . 072 

5218 

AF101361 

Drosophila melanogaster Abnormal X segregation 

731 

37.931 

52 1 9 

7\ f~* f\ f\ c c ~i e\ 

AC0 06 53 9 

Homo sapiens BC3 94 98 3 

217 

100.000 

5220 

AF131826 

Homo sapiens Unknown 

564 

56.944 

5221 

AJ24 8283 

Pyrococcus abyssi hypothetical protein 

163 

20.747 

5222 

Z73102 

Caenorhabditis elegans Similarity to 
B.subtilis DNAJ" protein (SW : DNAJ_BACSU) ; cDNA 
EST yk4 37al.5 comes from this gene 

735 

40.312 

5223 

K03202 

Homo sapiens salivary proline -rich protein 
precursor 

146 

50.000 

5224 

AF136715 

Homo sapiens taxol resistant associated 
protein 

394 

80.519 

522 5 

AF082556 

Homo sapiens TRF1- interacting ankyr in- related 
ADP-ribose polymerase 

1843 

79.940 

bz z o 

Ar 14 904 6 

Homo sapiens Sex comb on midleg homolog 1 
isoform 2 

427 

61 . 404 

c o O *7 

at n'jQem 

/VLiU j jdUI 

Arabidopsis thaliana putative protein 

237 

32.667 

C O O Q 

/\r> u z uooJ 

Homo sapiens KIAA0876 protein 

5650 

100 . 000 


A MIIj 

Homo sapiens fast MyBP-C 

2418 

99 . 722 

52 30 

TT4 Q Oil 

rius musculus ^Ipb4 

102 7 

83.237 

COOT 
D Z j X 


Bos taurus phospholipase C-III 

1335 

54 . 271 

c O 

3 Z O Z. 


Homo sapiens EH-binding protein 

248 

100 . 000 

Zj J Ji 


Homo sapiens hHR21spB 

3712 

100 . 000 

5234 

AF027955 

Mus musculus G protein- coupled receptor 

910 

94.203 

5235 

S76975 

Mus sp. cerebellin 2, Cbln2 

429 

100.000 

5236 

AB023138 

Homo sapiens KIAA0921 protein 

9710 

100.000 

5237 

X91856 

Fugu rubripes valyl-tRNA synthetase 

621 

36.278 

5238 

D43949 

Homo sapiens This gene is novel . 

1073 

100.000 

5239 

U32575 

Rattus norvegicus similar to yeast Sec6p, 
Swiss-Prot Accession Number P32844; similar to 
mammalian B94 , Swiss-Prot Accession Number 
Q0316 9; Method: conceptual translation 
supplied by author 

3968 

94.255 


AC0 05 970 

Arabidopsis thaliana putative translation 
initiation factor eIF-2B, alpha subunit 

700 

67 . 066 


t T/r i /i -*) r\ 
Uo 0 

Homo sapiens Spl4 0 protein 

760 

61 . 798 

5242 


acetylgalactosaminyl transferase; similar to 
Q104 73 (PID:gl709559) 

42 06 

99 . 836 

5243 

AF004161 

Oryctolagus cuniculus peroxisomal Ca- dependent 
solute carrier 

841 

96 . 947 

5244 

Y08026 

Mus musculus immune associated protein 38 

590 

56.000 
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5245 

AB014540 

Homo sapiens KIAA064 0 protein 

411 

35 . 885 

5246 

AF093098 

Homo sapiens transcription factor TBLYM; T-box 
transcription factor family member 

3821 

100.000 

5247 

X05908 

Homo sapiens lipocortin (AA 1-346) 

200 

86 .486 

5248 

AF163254 

Homo sapiens adaptor protein DAPP1 

1099 

100.000 

5249 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

390 

71.622 

5250 

U97006 

Caenorhabditis elegans No definition line 
found 

395 

26 .075 

5251 

Z75550 

Caenorhabditis elegans Similarity with 
Schizosaccharomyces hypothetical gene (TREMBL 
ID G84 7708) ; CDNA EST EMBL:M89418 comes from 
this gene 

179 

28.814 

5252 

AE001727 

Thermotoga maritima conserved hypothetical 
protein 

374 

35.233 

5253 

L072 94 

Homo sapiens T-cell receptor beta 

1919 

93 . 204 

5254 

AB014460 

Homo sapiens tuberin 

255 

100 . 000 

5255 

AF094508 

Homo sapiens dentin phosphoryn 

214 

22.532 

5256 

AL022718 

Homo sapiens dJ1052M9.3 (mouse DOC4 LIKE 
protein) 

8523 

100.000 

5257 

Z33879 

Sus scrofa G-beta like protein 

1978 

100 . 000 

5258 

AF060244 

Mus musculus zinc finger protein 106 

2094 

94 . 969 

5259 

AL050331 

Homo sapiens dJ486I3.2 (KIAA0721 (NAP 
(Nucleosome Assembly Protein) domain containg 
protein) ) 

3136 

100 . 000 

5260 

X63436 

Bos taurus poly (A) polymerase 

525 

93 .258 

5261 

AB001684 

Chlorella vulgaris ORF49b 

77 

60.000 

5262 

X97668 

Homo sapiens XLRP3 

170 

96.000 

5263 

AL117390 

Schizosaccharomyces pombe putative nuclear 
pore protein 

254 

24.194 

5264 

U41060 

Homo sapiens LIV-l protein 

483 

49.324 

5265 

AC002131 

Arabidopsis thaliana Contains similarity to 
hypothetical protein C18bll.05 gb|Z50728 from 
S. pombe. EST gb|H76601 comes from this gene. 

267 

30.612 

5266 

AL032646 

Caenorhabditis elegans cDNA EST yk330ell.3 
comes from this gene; cDNA EST yk330ell.5 
comes from this gene 

561 

41.294 

5267 

AL049474 

Schizosaccharomyces pombe hypothetical protein 

179 

35 . 165 

5268 

X99142 

Homo sapiens type II intermediate filament of 
hair keratin 

397 

100 . 000 

C O C Q 

r» /- -7 q q r\ 

£0 /999 

Schizosaccharomyces pombe hypothetical protein 

3 97 

32 . 090 

5270 

X86019 

Homo sapiens SH3 -domain interacting protein 

157 

35.507 

5271 

AB013452 

Homo sapiens ATPasell 

443 

54 . 135 


AB023207 

Homo sapiens KIAA0990 protein 

663 

35 . 621 

5273 

Z74866 

Saccharomyces cerevisiae ORF YOL124c 

671 

36 .420 

5274 

X604 6 5 

Gallus gallus delta-9 desaturase 

935 

69 . 730 

5275 

AF037364 

Homo sapiens paraneoplastic neuronal antigen 
MAI 

1148 

54.655 

5276 

X66286 

Gallus gallus tensin 

819 

71 . 676 

5277 

L16559 

Caenorhabditis elegans homology with ATP- 
dependent RNA helicase; putative 

409 

38.272 

5278 

AF064447 

Mus musculus sex-determination protein homolog 
Femla 

1023 

90.303 

5279 

AF096870 

Homo sapiens estrogen- responsive B box protein 

266 

32.727 

5280 

AF151816 

Homo sapiens CGI - 58 protein 

482 

49.624 

5281 

X80343 

Homo sapiens regulatory partner for cdk5 

1759 

100.000 
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kinase 



5282 

AC006577 

Arabidopsis thaliana Contains similarity to 
gb|U45880 X~linked inhibitor of apotosis 
protein from Homo sapiens and contains 
PF | 00097 Zinc finger C3HC4 (Ring finger) 
domain . 

248 

24.895 

5283 

U4 2 5 8 0 

Paramecium bursaria Chlorella virus 1 contains 
10 ankyrin-like repeats; similar to human 
ankyrin, corresponds to Swiss -Prot Accession 

360 

38.217 

5284 

X13255 

Homo sapiens dopamine beta -hydroxylase 
preprotein (AA -25 to 578) 

972 

99.281 

D ^ O .5 

Zii? b Z 

Schizosaccharomyces pombe threonine synthase 

821 

35 . 135 


TTA/l "3 Q A 

Naegleria gruberi calcineurin B 

223 

30 . 882 

COOT 
_> ^ o / 

U U ^ *k / D 

nomo sapxens rzJoDo x 

4072 

99 . 671 

COOQ 

At*. \J \J ft O i7 U 

Homo sapxens similar to zinc finger proteins; 

SinHlal L.O tiMJ\jL 4 j oU 

227 

53 . 125 


1/ J -L O O / 

nomo sapxens iviAAUUoz 

3500 

99 . 623 

5290 


riUb o}J . JbtrlllcipilO XTJ.il ilOlllC/ lOy = 1 V I — oeiUd J? 

10bl 

95 . 541 

5291 

1 ij / ** 0 j 

TsT^ 1 1 ~V~ /~\ >~\ f~\ ~V* x* - ! "V" ^ o o ^ T~*\ "\ /■ /~\ s"\ /-^ ^"-v X-i "v» rvrmM — ^ 

iNcUiUopuia v«Iaooo pily tOcllc UcnyurOycIlaSe 

182 

23 . 874 

5292 

AB014558 

Homo sapiens KIAA06 58 protein 

4074 

100.000 

3 -7 -3 


Homo sapxens hypothetical protein 

922 

100 . 000 

5294 

X91655 

Bacillus subtilis lepA 

735 

54 . 167 

5295 

AF095741 

Rattus norvegicus unknown 

575 

81 . 443 

5296 

AF053130 

Mus musculus unconventional myosin MY015 

860 

35.211 

5297 

U13019 

Caenorhabditis elegans No definition line 
found 

1315 

50.272 

5298 

AC006978 

Homo sapiens supported by human and rodent 
ESTs; match to AA454028 (NID:g2167697) , 
similar to AA9255224 (NID : g4236415) and 
AA023712 (NID : gl487627 ) 

537 

78 .824 

C O Q Q 

U5696 5 

Caenorhabditis elegans No definition line 
found 

819 

39 . 077 

DO 00 

AE000887 

Methanobacterium thermoautotrophicum N2,N2- 
dimethylguanosine tRNA methyl transferase 

323 

27 . 554 

JJWi 


Gallus gallus cartilage link protein 

1090 

60 . 417 

5302 

AF132479 

Mus musculus Ese2L protein 

2132 

94.611 



Borrelia burgdorferi virulent strain 
assocxacea. xxpoprocexn 

219 

24 . 603 



nomo sapiens xvi /\/\u & h j> protein 

1370 

99 . 535 

53 05 

APflft 1 4Q7 

Homo sapiens tumor-related protein 

3235 

100 . 000 

5306 

AF098505 

Caenorhabditis elegans similar to Arabidopsis 
Lnaiiand mdie sccriiicy protein z \oW :yuooyi^ 

482 

31.604 

5307 

11DOD J 

Mus musculus hedgehog- interacting protein 

4485 

93 . 899 

O ~> \J O 


xioiTio Sdpiens jviAftUbbo protein 

3117 

98 . 008 

5309 

zxt .0 74774 

xicjiiio sapxens au^o ji\.id . z inovex protein; 

914 

100 . 000 

5310 

B.T7 0 c: -1 0 4 

I'lUo lllUbLUXUb lly|Jfc:X^JtJ-Lcli lZdt 1UI1 aC tlvatcCl 

o c c 

84 . 091 

5311 

AF140538 

Homo sapiens histamine H3 receptor 

1396 

98 . 985 

5312 

U17989 

Homo sapiens GS 2 NA 

1756 

52 000 

5313 

X80930 

Saccharomyces cerevisiae RHC18 

687 

22 . 506 

5314 

AF125102 

Homo sapiens HSPC041 protein 

398 

82.192 

5315 

AC004076 

Homo sapiens R3 0217 1 

4916 

100.000 

5316 

U22015 

Mus musculus retinoid X receptor interacting 
protein 

1343 

65 .257 


8r? 
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5317 

U66561 

Homo sapiens kruppel- related zinc finger 
protein 

305 

30 . 000 

53 18 

AB00240 5 

Homo sapiens LAK- 4p 

514 

38 . 693 

5319 

D14572 

Mus musculus 1 PEBP2bl protein* 

1230 

98.889 

5320 

U79298 

Homo sapiens unknown 

396 

56.881 

5321 

D86640 

Homo sapiens stac 

554 

54 .194 

5322 

AB011084 

Homo sapiens KIAA0512 protein 

767 

45 . 907 

5323 

AC004490 

Homo sapiens R29381 1 

301 

59.223 

5324 

AF036696 

Caenorhabditis elegans contains similarity to 
Brassica oleracea non-green plastid 
phosphate /triose -phosphate translocator 
precursor (GB:U13632) 

591 

41 .150 

5325 

U02082 

Homo sapiens guanine nucleotide regulatory 
protein 

976 

54 . 027 

5326 

D2 6 018 

Homo sapiens KIAA003 9 

3182 

100 . 000 

5327 

AF077738 

Mus musculus metallocarboxypeptidase CPX-l 

3977 

87.790 

5328 

AF062006 

Homo sapiens orphan G protein-coupled receptor 
HG38 

1523 

45 . 941 

5329 

AB007936 

Homo sapiens KIAA0467 protein 

636 

100 . 000 

5330 

AL049689 

Homo sapiens hypothetical protein 

8803 

100 . 000 

5331 

Z14016 

Nicotiana tabacum pistil extensin like 
protein, partial CDS 

179 

33 .673 

5332 

X70804 

Mus musculus rabl7 

271 

65 . 714 

5333 

U75329 

Homo sapiens serine protease 

843 

52 . 521 

5334 

Y18620 

Arabidopsis thaliana DsPTPl protein 

316 

41 . 791 

5335 

AL023854 

Caenorhabditis elegans similar to Regulator of 
chromosome condensation (RCC1) ; cDNA EST 
yk246c2.5 comes from this gene 

421 

35 . 981 

5336 

Z73944 

Lotus japonicus RAB8A 

253 

34 .400 

5337 

U04379 

Mus musculus ZAP- 70 

3112 

92 . 323 

5338 

AL110261 

Homo sapiens hypothetical protein 

1579 

100 . 000 

5339 

AF106062 

Homo sapiens Wiskott-Aldrich syndrome protein 
interacting protein 

361 

47 .287 

5340 

U46068 

Mus musculus von Ebner minor salivary gland 
protein 

494 

63 . 710 

5341 

D14478 

Rattus norvegicus calpain 

516 

80 .435 

5342 

Z68297 

Unknown Similarity to Yeast TAT-binding 
homolog 7 (SW : TBP7_YEAST) ; CDNA EST 
EMBL:D37124 comes from 

330 

22 .820 

5343 

AC006014 

Homo sapiens similar to RFP transforming 
protein; similar to P14373 (PID :gl32517) 

425 

94 .118 

5344 

S67247 

Homo sapiens smooth muscle myosin heavy chain 
isoform SMemb 

202 

28 . 834 

5345 

Z93385 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to B.subtilis GTP- 
binding protein (SW:P20964); cDNA EST 
yk457c2 . 5 comes from this gene 

687 

50 .213 

C 1 A C 

b 346 

Ar 1 / b 9 o / 

Mus musculus Leman coiled-coil protein variant 
1 

683 

58 . 173 


Anno qnm 

T-Ty"-v tt"I f - * qar\i ^Tica TfTEiZlT nifi t"^i vr\t" O "i ri 
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-7 J \J 

a q 0 co 

5348 

D83785 

Homo sapiens expressed ubiquitously; product 
similar to D . melanogaster mam protein. 

450 

30.812 

5349 

M37884 

Homo sapiens muscle beta spectrin 

404 

100. 000 

5350 

AF156884 

Homo sapiens RIP- like kinase 

2130 

100. 000 

5351 

AF091035 

Homo sapiens GTP-binding protein RAB21 

1496 

100.000 
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5352 

D10523 

Homo sapiens 2-oxoglutarate dehydrogenase 
precursor 

1043 

71.649 

5353 

AL021768 

Arabidopsis thaliana ATP binding protein- like 

650 

58 .491 

5354 

X16934 

Homo sapiens B2 3 nucleophosmin (28 0 AA) 

990 

100 . 000 

5355 

U09284 

Homo sapiens PINCH protein 

343 

86 . 538 

5356 

X96770 

Saccharomyces cerevisiae P2558 protein 

232 

27.451 

5357 

D79998 

Homo sapiens KIAA0176 

1078 

83.333 

5358 

AL117664 

Homo sapiens hypothetical protein 

215 

96 . 774 

5359 

Z69725 

Schizosaccharomyces pombe hypothetical protein 

293 

35. 838 

5360 

U93305 

Homo sapiens triple LIM domain protein 

244 

65.116 

5361 

AC005757 

Homo sapiens R32611 1 

600 

98.837 

5362 

AF159567 

Homo sapiens C2H2 (Kruppel- type) zinc finger 
protein 

2768 

100.000 

5363 

Y08139 

Rattus norvegicus dermo-1 protein 

903 

99.281 

5364 

X87852 

Homo sapiens SEX protein 

1126 
0 

99.821 

5365 

X91619 

Rattus norvegicus seal 

359 

39.645 

5366 

L42612 

Homo sapiens keratin type II 

981 

63 . 052 

5367 

X07695 

Homo sapiens cytokeratin 4 (4 08 AA) 

2515 

99.755 

5368 

AL031583 

Unknown /prediction= (method : " "genef inder " " , 
version: ""084"", score: ""124.90"") ; 
/prediction= (meth 

690 

33.735 

5369 

AF106518 

Homo sapiens sialomucin CD164 

212 

32.258 

5370 

D83596 

Mus musculus unnamed protein product 

2350 

91.645 

5371 

U43194 

Mus musculus rhophilin 

727 

67.722 

5372 

AB005549 

Rattus norvegicus atypical PKC specific 
binding protein 

712 

81.955 

5373 

AJ011855 

Homo sapiens PAK4 protein 

966 

74.737 

5374 

U70369 

Mus musculus hematopoietic-specif ic IL-2 
deubiquitinating enzyme 

757 

49.282 

5375 

J05065 

Bos taurus calpain II regulatory subunit (EC 
3 .4 .22 . 17) 

751 

78 .676 

5376 

Z75542 

Unknown cDNA EST EMBL:D34386 comes from this 
gene; cDNA EST EMBL:D3 74 34 comes from this 
gene ; cDNA 

343 

21.836 

5377 

AB013607 

Mus musculus c29 

1351 

89.916 

5378 

AF135440 

Mus musculus huntington yeast partner C 

4056 

94 . 977 

5379 

AF132297 

Homo sapiens cytokine -inducible SH2- containing 
protein 

2466 

99.723 

5380 

D45913 

Mus musculus leucine- rich- repeat protein 

1243 

95.960 

5381 

AL022018 

Unknown /prediction= (method: " "genef inder" " , 
version : " " 084 " " , score : " "165 . 48 "" ) ; 
/prediction^ (meth 

1242 

38.814 

5382 

AB0232 07 

Homo sapiens KIAA0990 protein 

1092 

66 . 977 

5383 

U88167 

Caenorhabditis elegans No definition line 
round 

395 

23 . 502 

5384 

U05340 

Homo sapiens p55CDC 

904 

92.361 

5385 

AF064604 

Homo sapiens KE03 protein 

173 

96 . 154 

5386 

AF043725 

Homo sapiens PHD- finger protein 

5849 

96.544 

COOT 

A J 0 0314 7 

Homo sapiens marenostrin 

654 

57 . 055 

5388 

AF124435 

Danio rerio p55-related MAGUK protein DLG3 

629 

43.636 

5389 

AC003989 

Homo sapiens argininosuccinate synthase 
(citrulline-aspartate ligase) ; 84% Similarity 
to P09034 (NID:gll429l) 

1100 

100.000 

5390 

AF155108 

Homo sapiens NY- REN- 41 antigen 

559 

77.778 


x u o e m-6 M-3 ,„ o a is: 6. o a; 


5391 

Z27116 

Saccharomyces cerevisiae ORF YKR4 07 

191 

33.333 

5392 

AB018349 

Homo sapiens KIAA0806 protein 

1588 

62 . 916 

5393 

U41534 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00096 (zf-C2H2) , Score=17.4, E- 
value=0.11, N=3 

531 

36 .255 

5394 

AF075461 

Mus musculus ADP-ribosylation factor-directed 
GTPase activating protein isoform a 

2124 

91.916 

5395 

AB023207 

Homo sapiens KIAA0990 protein 

216 

42 .857 

5396 

AF098505 

Caenorhabditis elegans similar to Arabidopsis 
thaliana male sterility protein 2 (SW:Q08891) 

637 

34.783 

5397 

AF151857 

Homo sapiens CGI- 99 protein 

212 

89.189 

5398 

D87077 

Homo sapiens KIAA024 0 

6429 

99.898 

5399 

AF153362 

Dictyostelium discoideum adenylyl cyclase 

417 

45.890 

5400 

D87449 

Homo sapiens Similar to a C. elegans protein 
encoded in cosmid C52E12 (U50135) 

2444 

100.000 

5401 

Z77666 

Unknown cDNA EST EMBL.-T01059 comes from this 
gene; cDNA EST EMBL:D71534 comes from this 
gene ; cDNA 

502 

37.500 

5402 

AB020527 

Homo sapiens Na/P04 cotransporter homolog 

306 

95.833 

5403 

AL079314 

Homo sapiens hypothetical protein, similar to 
(U06944) PRAJA1 

169 

100.000 

5404 

AB023212 

Homo sapiens KIAA0995 protein 

6648 

100.000 

5405 

U22321 

Rattus norvegicus casein kinase 1 gamma 3 
isoform 

701 

61.749 

5406 

AF144573 

Mesocricetus auratus Mx- interacting protein 
kinase PKM 

615 

93 . 000 

5407 

AE000209 

Escherichia coli orf, hypothetical protein 

569 

100. 000 

5408 

AF009603 

Rattus norvegicus SH3p4 

223 

26.829 

5409 

X98614 

Homo sapiens cytokeratin 

199 

61.017 

5410 

M76665 

Homo sapiens 11 -be t a- hydroxys tero id 
dehydrogenase 

1075 

100.000 

5411 

AF183961 

Homo sapiens carbon catabolite repression 4 
protein homolog 

2962 

99.768 

5412 

AB018302 

Homo sapiens KIAA0759 protein 

954 

45.506 

5413 

X85237 

Homo sapiens human splicing factor 

4766 

100.000 

5414 

U51032 

Saccharomyces cerevisiae Ydr341cp 

1366 

39.226 

5415 

D63476 

Homo sapiens The KIAA0142 gene is related to 
human KIAA0006 gene. 

220 

100.000 

5416 

AB004538 

Schizosaccharomyces pombe HYPOTHETICAL 59.2KD 
PROTEIN IN PFK26-SGA1 INTERGENIC REGION 

157 

34 .483 

C A 1 "7 

ALUBU159 

Homo sapiens hypothetical protein 

610 

64 .336 

CA 1 O 

Ar 1418 84 

Homo sapiens oligophrenin-1" like protein 

5117 

100.000 

c: a n Q 
3*1 J. 


Mus musculus Ankhzn 

1396 

94 . 836 


TTr r\1 Q ^ 

Caenorhabditis elegans ZK328.4 gene product 

398 

37 . 748 

34^ -L 


Homo sapiens methyl -CpG binding protein MBD3 

1947 

99.656 



Drosophila melanogaster unknown 

1024 

60 .474 

5423 

AL049481 

Arabidopsis thaliana putative protein 

497 

51.948 

b4.«£4 

AF1053 6 5 

Homo sapiens K-Cl cotransporter KCC4 

1856 

100. 000 

C /I o c 

AF03 3 56 6 

Mus musculus cdc2/CDC2 8-like protein kinase 4 

1590 

98 .283 

5426 

AL.031685 

riLJU l^J o djkJX Clio Uu Ji\Z j • Z. \ I1U V fci _L prOCclu/ 

3 78 

30 . 811 

5427 

AF083955 

Homo sapiens G protein- coupled receptor 

994 

100.000 

5428 

Y18101 

Mus musculus macrophage actin-associated- 
tyrosine-phosphorylated protein 

1327 

84 .211 

5429 

AF134149 

Homo sapiens 2 -pore K+ channel subunit TOSS 

2053 

100. 000 

5430 

AF117888 

Homo sapiens myosin- IXa 

2539 

100.000 


;L O O B NHS 4-3 „, O 5! El! 6, 0 2 


5431 

fACL C COO 
11 ZZ> ZJ zj o ^. 

i\.cn_t_u£? norvegicus carDonyura ue - Dinuing 
receptor 

ACQ 

4o o 

47 . 407 

5432 

AF031919 

Ml 1 a mi 1 C3 fM line Dal RD1 ra a a r«ir« i /-3 ^>>rv-t >-» -i •»-» 


91 . 3 04 

5433 

AL021816 

•J -L. ^ UOaL. VfllCl 1. VJIlljr C 9 \J 1 1 LXJ t3 O £r *± xl» 27 . v J^, ^ 

unknown , len : 2 51aa 



5434 

M95046 

MvcotdI asma feirmentans tranqlat"! on i ni t* i At* n on 
factor IF3 

Aon 

A *7 Q1 7 

5435 

AF071777 

Mus musculus IRE1 

2248 

O C Q*7 Q 
O ZJ . Zs 1 -J 

5436 

AF030131 

Mus musculus Plenty of SH3s; POSH 

178 

85.714 

5437 

U40410 

Caenorhabdi t is elegans C54G7 . 2 gene product 

con 

"^fi /l fi D 

5438 

AC004381 

Homo sapiens SA gene 


CC\ Q O O 
OU . O O ^ 

5439 

U78597 

Plectonema boryanum kinesin light chain 


AO 7Q"3 
■* 4& * / z? J 

5440 

M58297 

Homo sapiens zinc finger protein 42 

914 

51.131 

5441 

U80955 

^aciiuxuauuiLio cicy ClUi3 uUHLalllU cl vXCJlUci Ail LUUilCl 
in band 4 1 P»7 ri n mnp qi ri rarli yi n anH f- o 1 

■X- x x ijaiivx ~x . jl / Cx>X Xll , IIIVJCO -Lxx f JL CIU1A J. 11 Ctl 1U Ldl JL 11 

/ U o 

bU . uuu 

5442 

AC006234 

Arabidopsis thaliana hypothetical protein 

269 

36 .607 

5443 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

424 

77.632 

5444 

AB01 9438 

v * \J -X. -s ^ cj 

nuuiu bapxciia j. uuiiLxinjy luui J.11 Heavy cjiici ill 
variable rpai nn 

/lb 

90 . 598 

5445 

U57053 

Homo sapiens myosin- ID 

426 

74 .227 

5446 

AF040652 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00400 (G-beta) , Score=62.3, E- 
value=3 . 5e-15, N=3 

311 

36.957 

jti / 

VUUD JO 

bacteriophage lambda reading frame gam 

776 

99 . 138 



Schi zosaccharomyces pombe SPAC4A8.15C, cdc3 ; 

pi UI -LJ. Ill , J.eii . XjZ. I clcl , laentlCal CO IrKL/r bLHfU, 

P39825, (127aa) , similar eg. to PROF_YEAST, P0 

79 74 nrof i 1 i n ( t TCaa \ f ao f-3 e» /-*/->! o /\y\i> . /itq 
f ^ f *± t p-LUi. , \X«Oaa/ / i. cl l_ cl SCO XV cS , upC \ *± 1 zf t 

xj \ / . v / v —* * o o 4.V1C11UX xii x. / dd vvcilap/ 

238 

37 . 168 

5449 

M85227 

Escherichia col i pic t" i va t"nr nrnt"Pin 

1UU1 

i nn n rt rt 

5450 

L27153 

Mus musculus kinesin heavy chain 

220 

21 .689 

5451 

U03416 

Ra t" i~ 1 1 nnrvpffi rnq npiimnal r^l ■Fa<- , t~r^mfari-l t-i — 
x^ciu.l.uo iiui vc^jXLUo iicui unai \J-L x. aL LUIIlculll 

related ER localized protein 

A "7 "3 

o o too 

5452 

L06443 

Mus musculus arnwt" Vi f arhnr 

one 

7Q nc 

/ y . J / i> 

5453 

D90716 

Esrhpfi fhi a col i Mn^R nrnfpi n 

xxuy 

i rt r\ rt rt rt 
1UU . UUU 

5454 

AE000451 

Rschpri rhi a col t t>i i t" ^ t~ t 9-rnmnnnpnt- 

regulator 

ft W D 

"1 rt rt rt rt rt 

J.UU . uuu 

5455 

AF039034 

Caenoirhabdi t i e^le* cranes rnn f ai nq qimi 1 ari t*v t-o 
G- coupled protein receptors 

"1 £ *7 

on 714 

5456 

AB014604 

Homo sapiens KIAA0704 protein 

1680 

68.902 

5457 

AF132961 

Homo saDiens CGI -27 oT*ot" pi n 

^. W X. O 

1 rtrt rtrt rt 
1UU - uuu 

5458 

S63848 

Bos taurus G-protein coupled receptor type B, 
GCR type B {clone PPRl} 

708 

86.777 

5459 

AF189817 

Mus musculus eve c t in - 2 

9 9£ 

ZJ 1 . / / o 

5460 

AL022721 

Homo sapiens dJ109F14 . 1 . 1 (Transcriptional 
Enhancer Factor TEF-5) (isoform 1) 

420 

100. 000 

5461 

U73960 

Homo sapiens ADP-ribosylation factor-like 
protein 4 

X 3 D 

J. u u . uuu 

5462 

AB014601 

Homo sapiens KIAA0701 protein 

~ 

4013 

99 . 683 

5463 

D37979 

Rattus norvegicus AIR carboxylase -SAICAR 
synthetase 

157 

74 .194 

5464 

AC007292 

Homo sapiens R31167 1, partial protein 

2907 

99.302 

5465 

Z46787 

Unknown similar to leucyl-tRNA synthetase; 
cDNA EST EMBL:D64208 comes from this gene; 

1396 

45.766 


%7\ 
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5466 

AB020672 

Homo sapiens KIAA086 5 protein 

7122 

100.000 

O^O / 

ilbbzl 

Homo sapiens start position 1 

2663 

98 . 025 

r /» q 

ABOOolo4 

Bos taurus ganglioside sialidase 

354 

30. 992 

c A a Q 

7\ "CI A A 1 

Ar 144 /il 

Rattus norvegicus putative splicing factor 

Y 1 521 -B 

3563 

93 . 285 

5470 

U41548 

Caenorhabditis elegans weak similarity to 
hemolysins 

291 

47.475 

5471 

AB020715 

Homo sapiens KIAA0908 protein 

689 

39. 803 

b4 72 

TV "1 O f\ '1 

A18 0 07 

Penici Ilium chrysogenum FacA 

228 

46 . 575 

C >1 T "i 

54 7 3 

AC00552 5 

Homo sapiens F2216 2 1 

718 

87 . 879 

5474 

AL031588 

Homo sapiens dJH63Jl.l (ortholog of mouse 
transmembrane receptor Celsrl (KIAA0279 LIKE 
EGF-like domain containing protein similar to 

rat, ntvjr 2 ; 

403 

100.000 

5475 

AB023178 

Homo sapiens KIAA0961 protein 

958 

83.019 

3^ /b 

Ar Uddo Jo 

Homo sapiens leucine- rich glioma -inactivated 
protein precursor 

387 

50 . 505 

5477 

AB029029 

Homo sapiens KIAA1106 protein 

6832 

99. 705 

D4 /o 

ttq r »7 /i rt 

U9 5 74 0 

Homo sapiens Unknown gene product 

722 

64 . 677 

5479 

AF049344 

Rattus norvegicus UDP-GalNAc : polypeptide N- 
acetylgalactosaminyl transferase T5 

2978 

90.260 

5480 

U4 3 959 

Homo sapiens beta 4 adducin 

203 

100.000 

5481 

AF005654 

Homo sapiens act in-binding double- zinc- finger 
protein 

3272 

99. 584 

5482 

AL021481 

Unknown similar to Phosphoglucomutase and 
phosphomannomutase phospho serine ; cDNA EST 
EMBL:D36168 

1095 

50.456 

C A Q O 

D10627 

Mus musculus zinc finger protein 

876 

47 . 368 

5484 

AB014575 

Homo sapiens KIAA0675 protein 

664 

100 . 000 

5485 

X07037 

Escherichia coli ORF A 

669 

100.000 

5486 

D90728 

Escherichia coli Hypothetical protein HI1265 

298 

100.000 

5487 

U70214 

Escherichia coli gamma -glutamyl phosphate 
reductase 

998 

99.367 

5488 

D90705 

Escherichia coli Apolipoprotein n- 
acyltransf erase (EC 2.3.1.-) (alp n- 
acyl transferase) (copper homeostasis protein 
cute) . 

222 

96 . 970 

C /I O Q 

MO *7 f\ A Q 
i v l O / \J fi 27 

Escherichia coli guanos ine pentaphosphatase 

646 

88 . 889 


LJ-?\J Oil 

UiScnericnia coil UKr ±jj : oj2Utf l J ; similar to 

761 

99.130 

54 91 


Cibonericnia con yiucose- lnniDiceu qivisi on ; 
chromosome replication? 

735 

100 . 000 

54 92 

LJ J \J 1 O 

bscnericnia con nypocneticai protein M 

571 

90 . 722 



Escherichia coli yehV 

555 

100 . 000 

54 94 

U11 v v J 

£jacnerxcnia con vjKjf T254 

804 

100 . 000 

54 95 


HiSciicricnia con yenir 

8 01 

100 . 000 

5496 

D90702 

fibtiiei itnia con LiLidte lyase oeua cnain 

( /~»"V/ ~| "1 \ra C3 <=» oi iVvi lfii t" \ /ni HT7\ V» i^>rr>i^« T 

\ dL.y X XydOC oUJJUIllL/ j xiornoJLOy 

592 

92 . 157 

54 97 

D85081 

x^iixct uuii Lii ii let lilted protein piouucu 

1101 

99.3 98 

5498 

D64044 

Escherichia coli YFHH-ECOLI protein similarity 

573 

88 . 679 

5499 

U28377 

Escherichia coli ORF ol83 

427 

78.889 

5500 

AE000401 

Escherichia coli putative enzyme 

674 

99.020 

5501 

D90701 

Escherichia coli ORF ID:ol66#7 

651 

95.370 

5502 

AE000390 

Escherichia coli orf, hypothetical protein 

656 

100.000 

5503 

D90715 

Escherichia coli Molybdenum transport ATP- 

944 

98.621 


:io o a Nh& h-3 ,., o e a 6, o i5 







5504 

U29579 

Escherichia coli ORF ol91 

381 

100.000 


fit aoca 

hscnericma con partial ORF 

164 

89.655 



bscnericnia con wo aetinition line round 

794 

100 . 000 

5507 

X52227 

Escherichia coli fhlA gene product <AA 1-692) 

1486 

97.107 

C C f\ o 

TTA Q "I "7 "7 

uuyi / / 

Escherichia coli hydrogenase-2 large subunit 

248 

100. 000 


JJ^ o 5 y 5 

Escherichia coli transcriptional activator 
protein from homology search 

650 

100. 000 

o i \j 

ly O j J. _/ ** 

bnewaneixa sp . kjma polymerase aipna suounit 

437 

100 . 000 

jjII 


Escherichia coli acriflavine resistance 
protein 

265 

100 . 000 


■no n ^ nil 

bscnericma con urf id:o169#14 

622 

100 . 000 

5513 

M60916 

Escherichia coli cytidine deaminase 

615 

97.895 

_> 51^ 

V(*\ Ton') 
ID / oUz 

Escherichia coli membrane protein 

263 

95 . 122 

5515 

AE000324 

Escherichia coli orf, hypothetical protein 

1012 

98.026 

5516 

D90820 

Escherichia coli Synaptic vesicle protein 2 
(SV2) . 

459 

98.649 

5517 

U14 003 

Escherichia coli soluble lytic 
transglycosylase 

893 

100. 000 

C Cl Q 

TTQ TC(T/1 

Escherichia coli similar to human protein that 
oxidizes 11-cis retinol into 11-cis 
ret maiaenyae 

384 

98 .333 



Escherichia coli mannitol permease 

345 

96 .610 

5520 

AE000394 

Escherichia coli orf, hypothetical protein 

695 

95 .283 


Mb 74 52 

Escherichia coli cadB 

692 

98 . 148 

5522 

D90891 

Escherichia coli GLYCINE BETAINE- BINDING 
PERIPLiASMIC PROTEIN PRECURSOR. 

365 

94 .643 

5523 

D90713 

Escherichia coli TolR protein 

539 

98.837 

C C O /I 

5524 

D90 /89 

Escherichia coli Dipeptide transport ATP- 
binding protein DppD. 

1482 

94 . 828 

5525 

D90722 

Escherichia coli Hypothetical protein f4l0 

733 

95 . 798 

5526 

L27665 

Escherichia coli lipoate-protein ligase A 

1195 

92 .746 

5527 

AE000309 

Escherichia coli f erredoxin- type protein: 
electron transfer 

697 

100.000 

5528 

L20915 

Escherichia coli alternative putative coding 
sequence; GTG start codon; homology to acyl 
CoA dehydrogenases and isovaleryl CoA 
dehydrogenases 

405 

100. 000 

5529 

U82664 

Escherichia coli similar to H. influenzae 

illxj U 5 

998 

98.658 

5530 

nqn7 ci'l 
yj z? \j / 3 j 

fiscnericma con UKr xu : o^4b|f l 

62 7 

100 . 000 

5531 

X01666 

Escherichia coli GSH-II 

1239 

100. 000 

5532 

V00279 

Escherichia coli ORF 2 (AA 1-301) 

1670 

98.770 


uyuo^ / 

Escherichia coli DNA-directed DNA polymerase 
(EC 2.7.7.7) III q chain 

326 

100 . 000 

C C O /I 

TTO 1 n O /I 

Saccharomyces cerevisiae Ylr4 3 5wp 

232 

28 .395 

^ R 1 

<ci / Uzo4 

Caenorhabditis elegans cDNA EST EMBL:T01421 
comes from this gene; cDNA EST yk413bl.5 comes 
from this gene 

349 

66 . 667 

5536 

D42054 

Homo sat3if>ns KTAAOOQ9 rrf=»n nrnHnrt- i <a 

distantly related to smooth muscle myosin. 

Z. o ft 

oo . bzl 

5537 

AB028968 

Homo sapiens KIAA1045 protein 

424 

100. 000 

5538 

AF026565 

Mus musculus ring finger protein 

462 

32.326 

5539 

U90946 

Dictyostelium discoideum myosin heavy chain 
kinase B 

540 

35.165 


&23 
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5540 

AL109739 

Schizosaccharomyces pombe trp-asp repeat 
protein 

699 

36.544 

5541 

Z69240 

Schizosaccharomyces pombe putative 
ami dohy dr o 1 a s e 

996 

52 . 708 

5542 

AL024499 

Caenorhabditis elegans cDNA EST EMBL:C08 541 
comes from this gene; cDNA EST EMBL:C07241 
comes from this gene; cDNA EST yk562a6.3 comes 
from this gene 

718 

45.374 

5543 

ALU 7483 

Homo sapiens hypothetical protein 

2477 

100 . 000 

5544 

AF13 12 2 0 

Homo s ap i ens HEMK homo 1 og 

272 

94 . 737 

5545 

AF062740 

Rattus norvegicus pyruvate dehydrogenase 
phosphatase isoenzyme 1 

1142 

96 .023 

5546 

fk X 1 1 f\ ^ 

AL110295 

Schizosaccharomyces pombe conserved 
hypothetical protein 

1198 

35 .494 

554 7 

AF11116 8 

Homo sapiens unknown 

922 

100 . 000 

C C yi O 

bb4 e 


Homo sapiens putative polymerase; orf similar 
to the integrase domain of Type A and Type B 
retroviruses and to class II HERVs 

663 

100 . 000 

5549 

AB020690 

Homo sapiens KIAA08 83 protein 

258 

43 .396 

5550 

U5322 5 

Homo sapiens sorting nexin 1 

3359 

99 . 808 

5551 

AF132963 

Homo sapiens CGI-29 protein 

1349 

99.495 

5552 

U96963 

Mus musculus pl40mDia 

439 

30 . 605 

5553 

AC002131 

Arabidopsis thaliana Similar to seryl-tRNA 
synthetase gb|U10400 from S cerevisiae. EST 
gb|N96627 comes from this gene. 

374 

49.573 

5554 

AF110647 

Homo sapiens translocon-associated protein 
gamma subunit 

359 

98 .276 

5555 

AF127035 

Homo sapiens calcium-activated chloride 
channel protein 2 

6038 

99. 782 

5556 

AB011102 

Homo sapiens KIAA053 0 protein 

1010 
3 

99 . 424 

5557 

M31423 

Homo sapiens cerebellar- degeneration- related 
antigen (CDR34) 

1162 

86 . 283 

5558 

X86779 

Homo sapiens FAST kinase 

3652 

98.548 

5559 

U70935 

Peromyscus maniculatus reverse transcriptase 

319 

49.153 j 

5560 

n r~ ^ 

S58722 

Homo sapiens X- linked retinopathy protein (C- 
terminal, clone XEH.8c} 

183 

50 . 000 

5561 

AF119231 

Homo sapiens histone acetyltransf erase MORF 
beta 

1391 
9 

99 . 904 

5562 

X02761 

Homo sapiens fibronectin precursor 

1529 
3 

99. 057 

5563 

AJ011679 

Homo sapiens Rab6 GTPase activating protein, 

6605 

99.321 

5564 

Y17816 

Suberites domuncula cytochrome P4 50 

600 

28 . 838 

ODD 5 

Ar 117 7 54 

Homo sapiens thyroid hormone receptor - 
associated protein complex component TRAP240 

1455 
9 

99 . 724 

C C ^ £T 

M58 511 

Homo sapiens iron- responsive element -binding 
protein/ iron regulatory protein 2 

6263 

99 . 481 

3 3D / 


Homo sapiens inter- alpha- trypsin inhibitor 
hpaw chain TTTH1 

5534 

95.425 

5568 

AF128406 

Homo sapiens prenyl -dependent prelamin A 
binding protein Narf 

1757 

82.030 

5569 

D87442 

Homo sapiens KIAA02 53 

4348 

96 . 783 

5570 

AF104413 

Homo sapiens large tumor suppressor 1 

7569 

98.060 

5571 

D87078 

Homo sapiens similar to D.melanogaster pumilio 

5451 

98 . 595 
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protein (S22026) : similar to human KIAA0099 
protein (D43951) 



5572 

X02661 

Homo sapiens 2-5A synthetase fragment (229 aa) 

769 

80 . 537 

5573 

Z15005 

Homo sapiens CENP-E 

1639 
8 

99.100 

5574 

AF132969 

Homo sapiens CGI- 35 protein 

1230 

98 . 387 

5575 

U79260 

Homo sapiens unknown 

400 

74 . 194 

5576 

AJ243460 

Leishmania major proteophosphoglycan 

222 

30 . 502 

5577 

M59216 

Homo sapiens gamma- aminobutyric acid receptor 
beta-l subunit 

3077 

99 . 578 

5578 

AC006069 

Arabidopsis thaliana unknown protein 

334 

25 . 829 

5579 

D31884 

Homo sapiens KIAA0063 

562 

54.301 

5580 

AF098799 

Homo sapiens RanBP7/ import in 7 

6699 

98 . 460 

5581 

AF093250 

Homo sapiens P3 8IP 

4675 

99 . 184 

5582 

AF188706 

Homo sapiens g20 protein 

262 

35 . 758 

5583 

AB029030 

Homo sapiens KIAA1107 protein 

8318 

99 . 844 

5584 

D87446 

Homo sapiens Similar to a C.elegans protein 
encoded in cosmid C27F2 (U40419) 

1144 
8 

98.672 

5585 

AF047663 

Caenorhabditis elegans W09G12.7 gene product 

261 

29 . 963 

5586 

X52138 

Homo sapiens L7a protein 

1499 

92 . 593 

5587 

V00488 

Homo sapiens alpha globin 

434 

70 . 093 

5588 

AF083107 

Homo sapiens sirtuin type 2 

1761 

82 . 796 

5589 

AF151848 

Homo sapiens CGI -90 protein 

1863 

94 . 937 

5590 

X79440 

Homo sapiens NADP+- dependent malic enzyme 

3291 

90.879 

5591 

AC002398 

Homo sapiens F25965 1 

1075 

74.477 

5592 

AJ011812 

Homo sapiens transcription factor NRF 

320 

27 . 083 

5593 

AB023215 

Homo sapiens KIAA0 998 protein 

7978 

99.674 

5594 

AL031588 

Homo sapiens dCT1163J1.3 (novel protein similar 
to mouse B99) 

4350 

97.896 

5595 

AF071070 

Mus musculus protein kinase Myak-L 

3927 

95 . 556 

5596 

X81889 

Homo sapiens p0071 protein 

7433 

96 .478 

5597 

AL023828 

Caenorhabditis elegans cDNA EST EMBL :M8 9008 

w 1 1 ic; c3 J- J- win y cut: t v. Xjo l jv<& o^vi-J ♦ -> ^.vJlllco 

f* r~om this opn r 

735 

37.752 

5598 

D83781 

Homo saDiens the KIAA0197 extent* i q f^vrn-^cicif^ri 
ubiquitously. ; the KIAA0197 protein has 
histidine acid phosphatase signature at amino 
acid positions 1047-1061. 

8491 

98 .62 7 

5599 

X99802 

Homo sapiens ZYG homologue 

4280 

94 . 444 

5600 

AJ238248 

Homo sapiens centaurin beta2 

4569 

94 . 891 

5601 

Y15164 

Homo sapiens CxorfS (71- 7A) protein 

5882 

97.544 

5602 

U53450 

Rattus norvegicus »Jun dimerization protein 1 
JDP-1 

257 

50 . 000 

5603 

AP000060 

Aeropyrum pernix 118aa long hypothetical 
protein 

81 

39 . 623 

5604 

M54788 

Homo sapiens pyruvate dehydrogenase El -beta 
subunit 

1699 

91.233 

5605 

AL117233 

Homo sapiens hypothetical protein 

4241 

98 . 268 

5606 

M27826 

Homo sapiens neutral protease large subunit 

649 

74 . 522 

5607 

AB020688 

Homo sapiens KIAA0881 protein 

4895 

76.923 

5608 

U38253 

Rattus norvegicus initiation factor eIF-2B 
gamma subunit 

1471 

87.153 

5609 

AF132938 

Homo sapiens CGI- 03 protein 

3980 

98 . 148 

5610 

AF055470 

Homo sapiens ZNF2 58 

4709 

98.056 

5611 

Z78416 

Unknown predicted using Genefinder; Similarity 

787 

24 .825 
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CFEQX52 



56X2 



■ -ion 

XJ6U 

on ATC 

5613 

M99375 

Borna disease virus duplicated domain within 

pal aiiiyAU v xi uo cti ikx ntciJJUv/vx^uo pux y uici. cttpt- 

Droteins coniDlete cds crene Droduct 

724 

39.402 

5614 

AL035307 

Homo saniens hvDOthetical nrotft in 

1068 

55 . 220 

5615 

AF056929 

Homo sanipns sarcosin 

3750 

98 . 325 

5616 

AF159164 

Homo aaoiens anlcvriii reoeat- containincr orotein 
ASB-2 

3163 

96 . 360 

5617 

AL117629 

Homo sapiens hypothetical protein 

496 

42 .471 

5618 

X89750 

Homo sapiens TGIF protein 

1520 

93 . 116 

5619 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

411 

41 . 558 

5620 

AB020694 

Homo sapiens KIAA0887 protein 

2821 

99 099 

5621 

AC003114 

Arabidopsis thaliana T12M4 . 6 

485 

30 . 652 

5622 

AB029025 

Homo sapiens KIAA1102 protein 

5385 

98.585 

5623 

AL080156 

Homo sapiens hypothetical protein 

575 

35.714 

5624 

AB018289 

Homo ci p\ y~\ "j <=krn €3 TCTAA0746 nrnhpi n 

6654 

96 . 609 

5625 

D38549 

Homo c; ^ Ti i n cj h^ilOP^ iq npw 

8361 

96.796 

5626 

L77968 

Ovi ci ar-i pq t"vnp T T ^ma 1 1 nrol i nA-r"i fh nrnf pin 

100 

32 .468 

5627 

AB018276 

Homo sapiens KIAA073 3 protein 

3382 

87 . 960 

5628 

7-\_Lj W -J JL 3> -_> 1 

c V"i -i *?■ r-\ c? Vi ^3 vfirrwrf ic> cj nrtitihp t^i 1 1* a t* t 3 Qnayam np 

Ovll-Lx^UOdC^llCli. Ulliy L^CO ^/Ul 1 Lk^Ci puuctL JL v c aopaL a^lllc 

7 9ft 


562 9 

AF131766 

Homo can i *a.-n «=: mi 1 ar t~o Ftipi — \7 A ^ P 1 i lr<=» nrnh pin 

2266 

98 . 8 98 

5630 

AT.1 1 7 6 6 S 

Homo qnni <=*ti ci VTv/Y H jot"Vi*=>t~ t o^i 1 r.rnt"Pi n 
nuiiiv/ ociuxctjio J H ^ i~ 

76 81 

95 . 211 

5631 

AF0 9R066 

Homo ejani pn<? om lammi c; pal 1 rarrinnma anhiapn 
recocrnizGd hv T cell 

6387 

99 . 269 

5632 

AC007842 

Homo saoiens BC331191 1 

x a Vh/ 1 1 ■ Vh/ ii_> ca jky x. 1 1. iO iJv.j^xx.yx x> 

2 054 

49 . 922 

5633 

X05472 

Rattus norvegicus ORF 2 

159 

39.394 

5634 

X70040 

Homo Qani ptiq fvrnQi np Vr i na cp 
iiwiiiw ocijl/ jlciio u y i. waxiit- /vxiiaoc 

9361 

99 . 572 

5635 

AC004 9S3 

Homo qanipnq qitiri lar f o PTD *alfl77944 

2 3 54 

83 . 811 

5636 

Z83838 

Liu J OJ □ 

Homo saDieng GTPASE -ACTIVATING PROTEIN 

IlVylllU OdJLS -L Cll O V3 X XT JrVLj X-i X X- V JT^. X Jw X>» VJJ XT XV \_/ X X_t X. ll 

1369 

87 . 967 

563 7 

X6 0016 

*v W L/ W <J> O 

Homo i pnQ nhriQnha t~ pa yyi pv T"&Y~ot*£ i iT'i 

XT Wl 1 i\J OdLyXCllO LyxlWoiyi let L-C Cdi i 1CJ. UL>CX11 

2195 

78 .431 

5638 

X89633 

Saccharonivces cprevisiae t.RNA 

fc^dL^^XXdJL ^— ' 1 1 IV w * — ^— ' * — • ^- ^ — ■ V J- J -1- v — \— AVI -i V 

isopentenyl transferase 

710 

34 . 813 

5639 

X06272 

Homo sapiens docking protein 

3891 

97 .488 

5640 

AJ238248 

Homo QAni ptiq rpnt'^iirin f- ^» o 

4565 

94 . 757 

5641 

AR007979 

Homo cani priq TfTAA046(i nrohpi n 

6025 

98 .4 53 

5642 

AJ233591 

Mus musculus reverse transcriptase 

232 

49.515 

5 64 


RMBL • TO! 909 rompq from t"ih"L c 5 a^n^ * oDNA RST 
yk278all . 3 

i jo 


5644 

X89602 

Homo sapiens rTSbeta 

2599 

94 . 313 

5645 

AL110218 

Homo sapiens hypothetical protein 

1038 
5 

99.249 

5646 

Z34278 

Homo sapiens mucin 

222 

24 . 749 

5647 

AL117530 

Homo sapiens hypothetical protein 

814 

37. 594 

5648 

AJ243 797 

Homo saniens deoxvr i bonu c 1 ea s e III (DNase TIT) 

1940 

97 . 368 

564 9 

AF051325 

Homo sapiens SH3 domain containing adaptor 
protein 

225 

26 . 603 

5650 

X73902 

Homo sapiens nicein 

8104 

98 . 996 

5651 

AF019413 

Homo sapiens complement component C4 

1147 
4 

99.599 

5652 

X55777 

Homo sapiens put - ORF 

248 

53 .247 

5653 

X75888 

Mus musculus cyclin E 

2381 

73 .428 
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D O D ft 

7v RRQQi; 
M.F UDD70D 

tjncnocerca voivuius pyrr 01 luone - ricn 3.11c xy en 

166 

/I O O C *7 

42 . od7 



Homo sapiens integrin alpha6 subunit 

6161 

94 . 161 

DODO 

Y*7 ft Q"i 1 

rioino Sapiens zinc linger procem 

108 5 

93 . 023 

ODD/ 


Homo sapiens snRNA activating 1 protein complex 

X _7 JVX'cl OlUJUIlll. 

463 

6 5. 772 

J O J o 

AD f D A / 

nuinu oapiciia dXLyiict x v vxxxy uuiiavjcii 

D ft O J 

QQ QO C 

jo. y^i d 

D O D .7 

nr W O X 72 3 J 

nomo Sapiens scauten procein 

1 O *7 *3 
± Z I D 

Ad "2 A d 
ftO . JflO 

5660 

X61585 

Bos taurus polynucleotide adenylyl transferase 

4357 

93.883 

5661 

AF083068 

Homo sapiens NAD+ ADP-ribosyl transferase 2 

3354 

96.834 

c ^ £ o 

Ayz4o D 

Plasmodium vivax pval 

329 

58 . 889 

c -a 
job J 

AdUIo JUo 

Homo sapiens KIAA0765 protein 

3935 

98 . 658 

5664 

U58658 

Homo sapiens unknown 

267 

58.537 

566 5 

U3 984 9 

Caenorhabditis elegans similar to leucine- rich 
repeat regions of L. monocytogenes internalin 

320 

37 . 931 

jDDD 

Af 1 D X o / D 

Homo sapiens CGI —11 7 protein 


94 . 972 

ODD/ 

TTQ Q T Q n 

UooXoU 

LaenornaDuitis eiegans similar co moiyoaenum 
cofactor biosynthesis protein E 

*7 "3 Q 

26 . 033 

DO O O 

TTQ "7 C C O 

u y / bbj 

murine herpesvirus 6 8 unknown 

100 

3 0 . 682 

C/T/TQ 

Dboy 

rllJ 1UU 

Rattus norvegicus unknown protein 

*"> O Q 

zoo 

o o o**> *"> 

3 o . 922 


Xj2 2 U D U 

Glycine max hydroxyproline-rich glycoprotein 

iyo 

34 . 314 

bo / X 

Ai5021ob 0 

Homo sapiens carbonic anhydrase VB 

d73 

42 . 54 7 

DO / 2 

tvt non oi o 

Homo sapiens b34I8.1 (Kruppel related Zinc 
Finger protein 184) 

5228 

98 . 936 

C d "7 1 

bb / J 

Ab U -7 D / 

Homo sapiens receptor tyrosine kinase 

763 7 

98 . 599 

n a *7 /i 
ob /4 

AoU^i U / Ub 

Homo sapiens KIAA08 99 protein 

d914 

99 . 355 

r>6 / d 

A / U / 6ft 

Mus musculus serine/ threonine protein kinase 

6 83 

51 . 149 

C *T "7 £T 

bo /6 

U4 3360 

Peromyscus maniculatus reverse transcriptase 

419 

46 . 907 

C £T *7 *7 

bb / / 

Id / y 4 1U 

Mus sp . nuclear localization signals (NLS) - 
Dinainy proucin- spot- — l 

145 

47 . 170 

bo /o 

Ub^4^b 

Homo sapiens GOK 

1 Q Q O 

c r\ d o c 
DO . oJ b 

3D / 27 

-MXJ U U DO XO 

udiius gaiius enromouox procein \\^ti\^x5A / 

D Ol 

/ 2 . Jfi U 

JO o \j 

/-VHi UUll'i o 

Hal i rnViarhor r-w -i ,TQQ TWDPOMTHIT q V"XTTT-r ACT? 
ncllcODaCter; pyiOxl U j zf IrlKEjUJNliNct oiJNIrl/ioil 

i *± / 

"5/1 CQ1 
J rr - D y 1 

CCOI 
D O O X 

a*Rm ftO'7'? 

U^mrt »-» i a T A A H *7 O Q v^ , v*^ > v't- /"^ 4 r» 

ritjmo bapiciio xvxM-rt.vj / j7 prouexn 

/ y^i / 

y y . / *± y 

56 82 

X X O D X f* 

nuiutj octpxeuo pdiopicyiii'HJvc pxrotcin 

D 1 lO 

on Q "70 
y / . O / 2 

3DOO 

COR 

./-llr v / D v O 

nvjiin_j octpicllo envelope piOCclll 

A t Z, 

-J ft . fx O D 

56 84 

U76714 


4b D D*x 


5685 

X98259 

Homo sapiens M-phase phosphoprotein 8 

1232 

75.735 

5686 

AL022316 

Homo sapiens bK126B4.1 (novel protein) 

268 

60.759 

CCQT 
DDO / 

AOu / / y 

Homo sapiens FAST kinase 


O C O /I o 

yb . ofto 

C C O Q 

_> b o o 

TAT n /I Q Q /I O 

Homo sapiens hypothetical protein 

«bO 

31 . 304 

5689 

AC006284 

Arabidopsis thaliana putative ankyrin 

267 

39.189 | 

56 90 

Z3 713 9 

Unknown similar to guanine nucleotide binding 
procein; cuisja ii>o i cii*ldj_i : luuyi/ comes xrom cms 

431 

33 . 469 

5691 

X61048 

Hydra sp. mini -collagen 

189 

34.815 

5692 

AL032654 

Caenorhabditis elegans similar to Heme -binding 
domain in cytochrome b5 and oxidoreductases 

698 

31.870 

do yj 

■v c tz d q d 
AD DO O O 

Lycopersicon esculentum extensin (class II) 

74 

2 7.711 

C C QA 

7A J?fi"*£ Q"7*7 

nUIUU SdpicilS UllliVllO WH 

^lZb 

y *» . /ub 

5695 

U26743 

Homo sapiens similar to the 87 kDA Torpedo 
acetylcholine receptor-associated protein ; 
similar to human dystrophin- related protein, 
PIR Accession Number S03966 

207 

52 . 000 

5696 

AL050131 

Homo sapiens hypothetical protein 

265 

39.548 
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5697 

M12 14 0 

Homo sapiens envelope protein 

9 m *? 

t^T 1 A** 

5698 

U93 56 7 

Homo sapiens p4 0 

AHA 

i i t; 

5699 

AF108 84 3 

Homo sapiens env protein 


-3 / . J.13 

5700 

TV T"» A1 A1 A J\ 

AF010144 

Homo sapiens neuronal thread protein AD 7c — NTP 

Ten 


5701 

AC U O 3 9 / 9 

7\ „ _ V-k -! 1 0 t-Vi-jl ^ ^r-i^i TPCTa *-tV\ 1 17 1 A f\ *7 C; rrV"» I 77dfl'i C 

AidDiaopsis cnaiiand fiois y-t> | ^« -3f±u / d , yD|6j4ojD 
and gb|AA404888 come from this gene. 

D J. J 

A1 fl4^ 

5702 

L06 505 

Homo sapiens rioosomai protein iiiz 

DO O 

71 A O Q 

5703 

U7926 0 

Homo sapiens unknown 

"1 A *7 

Do . jZ / 

5704 

V A T t n /I 

XO / /04 

Homo sapiens Po protein 


ji . f \J f 

5705 

M63835 

Homo sapiens IgG Fc receptor I 

2307 

98.667 

5706 

K01075 

synthetic construct circumsporozoite (CS) 
fusion prot (partial) 

TIC 

o c n ft 

5707 

M11902 

Mus musculus proline-rich salivary protein 

216 

30.126 

5708 

AF126164 

Homo sapiens alternative HHLA3 protein 

AT) 

D7. Ui7l 

5709 

AF111106 

Homo sapiens protein serine/threonine 
pnospnatase 4 reguiacory suDuniu i 

Dflc .5 1 

Q9 CQC 

yz . djo 

5710 

AF083107 

Homo sapiens sirtuin type 2 



5711 

U22818 

Cricetulus griseus mutant sterol regulatory 
element binding protein-2 

601 

46.512 

5712 

AF06 9781 

Drosophila melanogaster Bem46-like protein 

OZu 

A 9 9 ft A 

5713 

AL080119 

Homo sapiens hypothetical protein 


Q^ 7n7 

5714 

L27104 

Bos sp . muscarinic receptor, M4 subtype 

QQ 


5715 

Y14690 

Homo sapiens procollagen alpha 2 (V) 

lUol 
7 

Q7 Q-l O 

5716 

L34001 

Homo sapiens ORF; putative 

348 

87.500 

5717 

U58755 

Caenorhabditis elegans C34D4.11 gene product 

J.JU 

JO . Z U / 

5718 

X82157 

Homo sapiens hevin 


q c mc 

5719 

U95098 

Xenopus laevis mitotic phosphoprotein 4 4 

1 A "J Q 
1*4 J i7 

£T Q n Q c 

5720 

X70944 

Homo sapiens PTB-associated splicing factor 

3439 

83.062 

5721 

U66796 

Homo sapiens laminin alpha 2 chain 

2132 
4 

99 . loo 

5722 

M84379 

Homo sapiens lymphocyte antigen 

zUll 

q a a a a 

5723 

X65165 

Volvox carteri extensin 


. 0 3Z 

5724 

AF129085 

Homo sapiens carboxy terminus of Hsp70- 
interacting protein 

1864 

96.066 

5725 

AP000060 

Aeropyrum pernix 14 3aa long hypothetical 
protein 

178 

35.256 

5726 

X07881 

Homo sapiens proline-rich protein Gl 

O O A 

">*> *7 C A 

5727 

U80848 

Caenorhabditis elegans No definition line 
found 

«S O O 


5728 

AC002544 

Homo sapiens Unknown gene product splice form- 
J. 

667 

54.430 

5729 

Y09615 

Homo sapiens mTERF 

498 

30.357 

5730 

AL0500 7 1 

Homo sapiens hypothetical protein 

ZZ / Z 

88 . 278 

5731 

AB018331 

Homo sapiens KXAA0788 protein 

794D 
/ ZIO 

Q9 arc; 

5732 

r* ft n r\ '"f a 

Z98974 

Schizosaccharomyces pombe putative vacuolar 
protein sorLiny*associatea proLein 

O A A 

49 7nfl 

*± 4 . /wo 

C *7 ~> 1 

U J> / 

Miio mncpiil lie DanoT"Vi pal ~\ DnViannoH C± Yfi y*o c Q i rtn 
l v l\JS IDUSC-UiUS Jr ciliti HI Lcll CllildULcU CApicbolUll 

777 

77 . 005 

5734 

AJ24 97 32 

Homo sapiens G8 protein 

792 

90. 000 

5735 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

416 

61.165 

5736 

X57110 

Homo sapiens c-cbl protein 

6141 

98.677 

5737 

X86779 

Homo sapiens FAST kinase 

3487 

96 . 926 

5738 

AL080318 

Arabidopsis thaliana putative protein 

757 

40.456 

5739 

AF019413 

Homo sapiens complement component C2 

4106 

93 .054 
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5740 

AF123881 

mul t inlp c'f-'l F»T"O c ?"i <z aQQnri ah^H t-<=* t- >»mr-i vug 

element gag polyprotein 

Z DO 

*3 "7 TOO 

5741 

X55777 

Homo sapiens put . ORF 

243 

ZJ it . JZD 

5742 

AF127142 

Homo sapiens NeuAc-alpha-2 , 3-Gal-beta-l , 3- 
GalNAc-alpha-2 , 6 -sialyl transferase alpha2 , 6- 
sialyl transferase 

1633 

0 / . / *± 

5743 

AF072506 

Homo sapiens envelope protein precursor 

3502 

97 . 774 

5744 

X83413 

Human herpesvirus 6 U8 8 

555 

43 . 719 

5745 

AJ010099 

Homo sapiens NKp4 4RGl 

230 

29 .500 

5746 

Y13374 

Homo sapiens putative prenylated protein 

1201 

79 .426 

5747 

U58658 

Homo sapiens unknown 

266 

64.935 

5748 

D00189 

Rattus norvegicus Na+, K+ -ATPase alpha-subunit 

6543 

98 .422 

5749 

Ul 0 1 8 5 

XenoDus laevis XPMC2 Drotpin 

1 'SOT 

x J 4 

JU . fl O J 

5750 

U00043 

Casnorhabditis elecrana qimi lar to H^t-a- 
mannosyl transferase 

O O 


5751 

S62929 

Homo sapiens PRB1L precursor protein=basic 
proline rich proteins (Ps, PmF, PmS, and Pe) 
precursor {C- terminal} 

167 

26 .203 

5752 

Y16610 

Homo sapiens paraplegin 

5001 

QO 74c 
-7 O . /4 J 

5753 

AJ243460 

Leishmania manor oroteoDhosDhoalvcan 


. Z. J 

5754 

AC004473 

Arabidopsis thaliana EST gb|N37577 comes from 
this gene. 

167 

Of . J J7l 

5755 

K03208 

Homo sapiens salivary proline-rich protein 
precursor 

312 

"11 cno 

5756 

M13100 

Rattus norvegicus unknown protein 

O *7 Q 

il A 7"J7 

5757 

AJ252550 

Homo saoiens crlvceirol kinase 

j J O J 

Q*7 "7/1 /l 

5758 

U09407 

Rattus norvegicus putative protein kinase C 
inhibitor 

500 

-j * . j 3 j 

5759 

D87450 

Homo sapiens Similar to D.melanogaster 
parallel sister chromatids protein 

8232 

^ 0 . ^ 3 

5760 

AL096753 

Homo sapiens hypothetical protein 

780 

43 .450 

5761 

X84909 

Homo sapiens phosphorylase kinase 

6934 

93 910 

5762 

X52851 

Homo sapiens peptidylprolyl isomerase 

709 

74 . 847 

5763 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

509 

42 . 484 

5764 

X94912 

Homo sapiens Pr22 

292 

62 .179 

5765 

Z25535 

Homo sapiens nuclear pore complex protein 
hnup!53 


Q7 47 "7 

5766 

X57766 

Homo sapiens stromelysin-3 precursor 

3138 

96 . 349 

5767 

AL117610 

Homo sapiens hypothetical protein 

3606 

96 . 970 

5768 

X15311 

Woolly monkey sarcoma virus reverse 
transcriptase (476 AA) 

507 

48 . 000 

5769 

AF023261 

Human endogenous retrovirus K pol-env 

628 

48.469 

5770 

U78312 

Mus musculus zinc finger protein 

632 

35.955 

5771 

X13293 

Homo sapiens B-myb protein (AA 1-700) 

4641 

99 . 144 

5772 

Z96047 

Caenorhabditis elegans DY3 . 6 

491 

28.718 

5773 

AB002363 ! 

Homo sapiens KIAA0365 

5491 

96 . 931 

5774 

M74027 

Homo sapiens mucin 

364 

27.138 

5775 

D87469 

Homo sapiens Similar to D.melanogaster 
cadherin-related tumor suppressor 

1625 

0 

99.295 

5776 

AF145732 

Homo sapiens endoplasmic reticulum alpha- 
mannosidase I 

4511 

99 .397 

5777 

U48251 

Homo sapiens protein kinase C-binding protein 
RACK7 

3002 

94 .414 

5778 

M12240 

human herpesvirus 1 infected cell protein 

195 

29.288 

5779 

X74764 

Homo sapiens protein- tyrosine kinase 

5477 

96.636 
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57 80 

At 16 9 3 4b 

LaVld porCcllUS prO - c4X| r vIlci — JL type X LUllaycll 

122 

29 . 091 

5781 

X55686 

LyCOpS i S 1 COn cSCUlcIltUm cALcilblll v v_ X do o XX/ 

It J 

-2 *X . W _7 -L 

5782 

AF1210 0 9 

MycoDacueriuni cujjcrcuiosis hj / r<. v nypot.nec.icai 

4— n -I -|—| _~Tt r COyl 

procciii u vu jj 1 * 

jti 

/.O • Oil 

5783 

U04706 

Bos taurus 50 kDa protein 

2579 

83 .476 

5784 

AF092z 0 / 

KdLLus norveyicus uiij\.iiuwu 

X x / ^ 

78 . 648 

5785 

X59892 

Homo Sapiens *± i x. aa poxypepuxae \yaininci4s; 

£1 1 ** z> 

OO > O 27 *± 

5786 

AF065391 

Homo sapiens z»j.ox 

627 

81 . 618 

5787 

X55777 

riomo Sapiens put. . OKr 

241 

61 . 194 

5788 

U2 938 0 

LaenornaOCiiLis cieyans ANO UcI.iiiii.iuii line 

found 

343 

33 . 654 

5789 


Homo sapiens Airf* procein 

3145 

84 .72 2 

5790 

AF1 / /zUo 

nOino Scipxeiio cexeuictx wexx auncoiuu uivxcv*uic 

3328 

96 . 73 1 

5791 

TV Tttl **> X •"» 

AF13 4 72 6 

riomo sapiens 

6856 

99 .800 

5792 

AF064z0b 

riomo sapiens o.yiicit-L.xii x piou iouluj.ui 

7889 

98 .753 

5793 

U21317 

Caenorhabditis elegans No definition line 
found 

184 

46 .835 

5794 

TV t?1 "5 1 T A "5 

AF131 /4 3 

riomo sapiens unjuiown 

515 

67.153 

5795 

AF111178 

Homo sapiens glypican-6 

1733 

93 .357 

5796 

U97553 

murine herpesvirus 6 8 unknown 

141 

1 "7 7"IC 
.5 / . /JO 

5797 

D87076 

Homo sapiens similar to human bromodomain 
protein BR140 t JLzUby ; 

2835 

89.876 

5798 

M73491 

Mus musculus N-acetylglucosaminyltransf erase I 


0 / . iyj 

5799 

AF128406 

Homo sapiens prenyl -dependent prelamin A 
binding protein Narf 

2058 

81.028 

5800 

AF109126 

Homo sapiens stromal ceii-aerivea receptor- i 
beta 

*> 4 7 A 

Q*7 O n 

5801 

U43360 

Peromyscus maniculatus reverse transcriptase * 

246 

37.714 

5802 

AF027826 

Homo sapiens putative seven pass transmembrane 
protein 

4o 7 

4b . b y 1 

5803 

M63438 

Homo sapiens , gene product 

1629 

79.834 

5804 

U00059 

Saccharomyces cerevisiae Yhrl21wp 


^ *i . I/O 

5805 

AF090867 

Rattus norvegicus guanosine monophosphate 
reductase 

1749 

70. 694 

5806 

AF070588 

Homo sapiens unknown 

991 Q 

27_> . X 

5807 

AF022985 

Caenorhabditis elegans No definition line 
round 

580 

40.179 

5808 

tv /*■» /\ /\ a r* a a 

AC004544 

Homo sapiens cynocnrome v_ oxiaase; matcn to 
rl^^uo v f±u . gi i / iz i ; 

j *± / 

QQ 11 Q 

O O . X X _7 

C O f\ Q 

booy 


riomo sapiens vui \j £. piutciii 

4002 

98 . 875 

5810 

1-7 -*j 

oaccnaroinyucb tcicvioiac ux i_ , icii . oow, v-mi . 
0.13, some similarity to gag polyproteins 

278 

23 . 711 

C O T 1 

boll 

tv Tftl 1 A Q"7 

AjU ui 14 y / 

XlOlliO odpicllb V_XctUvXXH ' 

997 

92 . 958 

bolz 

ad 3 bb 3 

riOmO odpicllo KJLNiA puiyillcl doc XI X*xU J\.L/ck ouuuiixl 

6432 

91 . 221 

581 J 

TV TJ-1 CI QOC 

Ar ibloZb 

Ur*\m^v c dT"v"i on cj C~*(~Z T — ^ *7 nynhPi n 
rlOTTlO bapicllb \-ul O / piUUCJ.il 

1516 

89 . 726 

5814 

AF067226 

Homo sapiens cGMP phosphodiesterase A4 

2920 

94 . 553 

5815 

Ao2o77 

UryCuOlayUS CUnitUlUa IcLIUVII Llo I Cla Lt;U 

reverse transcriptase 

238 

54 .622 

5816 

ao 9718 

oils scrora zoo protease suDunit 

1 479 

1** / £m 

67 . 981 

J O X / 

API ^6*587 

Homo sapiens retinoic acid- induced protein 

3449 

98. 120 

5818 

AL031187 

Arabidopsis thaliana kinesin-related protein 
katA (fragment) 

191 

21.923 

5819 

AF062249 

Homo sapiens immunoglobulin heavy chain 
variable region 

627 

84.167 

5820 

AF042386 

Homo sapiens cyclophilin-33B 

452 

44 . 976 


1, til OS «•■!!■• 6> «+3! ,„ O a !r;!! i6 O H 


5821 

D30747 

Acropora donei mini -collagen 

*■> r\ a 
209 

"3 T Q *7 C 

3 7 . 9 / b 

5822 

AF036 705 

Unknown Similar to phytoene desaturase; coded, 
for by C. elegans cDNA CEESX74F; coded for by 
c . ex 

"7 /I C 

746 

4j . yo / 

C Q O O 


ore virus UKr^ 

XOu 

^ *7 *7 ^ 

C O O yl 

AJ24 J4oy 

JLi e i s niud nia major prOLeopnuspnoyxytciii 

Z. X Z7 

Z / . OOH 

c o o c 
bozo 


nOIUU Sapicua ti J. Hit; 1 1 L. IIIUUUloLvJI y J.aLLUI 



5826 

Y00064 

Homo sapiens precursor polypeptide (AA -20 to 
657) 

4510 

99.409 

5827 

AU223830 

Rattus norvegicus ARE1 


Ck *a /too 
93 . o o y 

5828 

U17000 

Gaiius gaiius iof ap 

/If 1 
4 O X 

J J . 3 j j 

5829 

AC004 07 7 

Arabidopsis t ha liana hypothetical protein 

/ 

31 . yu / 

5830 

TV T /"V *"* 1 ""J T £T 

AXi021726 

unKnown /matcn= vtiesc : lkuu jzd . bprime v^iv. 

T^-v**"* « />\v\Vt ■{ 1 ^ mo 1 ^ Y™i f^i-r a af" onuKvi/rt lit O C V* T T*% t~ 

LTOSOpnila lUcXdnoy aS Lei cilUJX yU Diucoi*iXpv, 

T^v/^aor^Vi i 1 z$ mo 1 
uiuoupniia iiici 

o on 


con 
jo j1 

TTii nnci 

U4 U U O X 


33 0 

3 0 . 526 

5832 

U92820 

Homo sapiens unnamed HERV-H protein 

378 

85. 075 

583 3 

At U©4 /bz 

Mus muscuius unknown 

Tl CO 

o i a a a 
ox . ooi 

C Q ~1 A 

DO 


nOIUO Sapicllo l v JUl\ piUUclU KlllaUc 


O ^ . -J -J -7 

583 5 

Ar 11 /ob J 

Homo sapiens double homeobox protein 

OX J 


5836 

AF000412 

Plasmodium berghei merozoite surface protein- 1 

151 

36.364 

5837 

X56010 

Sorghum bi color hydroxyproline-rich 
glycoprotein 

186 

29. 197 

5838 

U4 78 56 

Araneus diadematus fibroin- 4 

O O *7 

j u . /by 

5839 

AL050261 

Homo sapiens hypothetical protein 

1084 

93 .237 

5840 

U80931 

Caenorhabditis elegans strong similarity to 
class- III of pyridoxal-phoshate- dependent 
aminotransferases 

1331 

46 . 2 56 

5841 

U102 81 

Sus scrofa gastric mucin 

r> c a 
Zb4 

O O O O "3 

5842 

D87684 

Homo sapiens similar to a C. elegans ZK3 53.8 
protein (S44655) 

iJbb 

Q o c n *7 

yy . bu / 

5843 

X52479 

Homo sapiens protein kinase C alpha (AA 1-672) 

44 zb 


5844 

AF140507 

Homo sapiens Ca2+/calmodulin- dependent protein 
kinase kinase beta 

3443 

91.709 

5845 

V00572 

Homo sapiens coding sequence 

2389 

92.723 

5846 

AF071081 

Mycobacterium tuberculosis proline- rich mucin 
homo log 

243 

27 . 253 

5847 

Z83227 

Caenorhabditis elegans predicted using 
Genefinder; Weak similarity to high- sulphur 
keratins . ; cDNA EST yk663al . 3 comes from this 
gene 

270 

29.297 



XlOmO Uapicilo I1X o LUIlc r\z.t\ , A. 

424 

75 . 694 

5849 

K03208 

Homo sapiens salivary proline -rich protein 

226 

27.645 


Ar uzz / *s y 

Dat* t*n a nnv^ron "i r^na HKTTv^— 1 Gill f Ahrancf pra 

43 8 

33 . 010 

CQC1 

DODl 

AblJ y 4 

AcIJUpUb lacvlo t\zr Hi\J picLULbUI JJi ULClil 


30 . 124 

C Q C O 

mi oi nn 

i v iu.o muotuiuo pxuiiuc i lvyii pxuucxii nr j 

2 96 

26 . 667 

C Q C T 


ilKJ 1 1 IU bapXCli&l pULaUX VC i\.C*U XllLCXClL-LXUy £J 1 U UC111 

{clone Ll-57} 

676 

86.777 

5854 

X55683 

Lycopersicon esculentum extensin (class I) 

168 

41 .892 

5855 

AJ007666 

Cryptosporidium parvum unnamed protein product 

163 

43.750 

5856 

U70935 

Peromyscus maniculatus reverse transcriptase 

265 

37.500 

5857 

Z97184 

Homo sapiens HKE2 

388 

48.430 

5858 

Z83227 

Caenorhabditis elegans predicted using 
Genefinder; Weak similarity to high- sulphur 

243 

32.857 
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keratins.; cDNA EST yk663al.3 comes from this 
gene 



5859 

AF111784 

Homo sapiens myosin heavy chain I la 

1084 
1 

93 .496 

5860 

D00723 

Homo sapiens hydrogen carrier protein 
precursor 

627 

67 . 630 

COC1 

bobl 


Homo sapiens NADP dependent leukotriene b4 12- 
nyaroxyaenyarogena s e 

1935 

89 . 275 

5862 

U84371 

Homo sapiens adenylate kinase 2A 

876 

64.664 


r\Q *j o r\c. 
Uo JzUd 

Mus musculus P24 protein 

"3 C "1 

«3b 7 

42 . 143 



Homo sapiens hypothetical protein 

64 9 

33 . 555 

C: O £T tt 

Dob!) 

TV T ftCftO£7 

Homo sapiens hypothetical protein 

3 956 

96 . 825 

c o c ez 
boob 

At* 0 /2bUb 

Homo sapiens envelope protein precursor 

3502 

97 . 774 

Dob / 


Homo sapiens DNA-directed RNA polymerase I, 

-L d Jl y x± LS C SUDUniu 

1126 

99 . 477 

5868 

X54326 

Homo sapiens glutaminyl-tRNA synthetase 

9593 

99.722 

bob 3 

o/4 bb2 

Human T-cell lymphotropic virus type 1 Gag 

267 

24 . 924 

5870 

AF004107 

Mus musculus unknown 

1215 

64.839 

5871 

AB03 0503 

Mus musculus UBE-la 

518 

36 . 728 

5872 

U58658 

Homo sapiens unknown 

210 

52.381 

5873 

Z21487 

Saccharomyces cerevisiae internal membrane 
protein 

680 

39. 198 

5874 

S58722 

Homo sapiens X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 

209 

59. 155 

5875 

X12966 

Homo sapiens 3-oxoacyl-CoA thiolase propeptide 
(424 AA) 

2434 

92.056 

5876 

AF070660 

Homo sapiens HSPC004 

917 

72 . 851 

5877 

AF071081 

Mycobacterium tuberculosis proline- rich mucin 
homo log 

334 

26 . 977 

5878 

U59453 

Macaca mulatta flavin- containing monooxygenase 
form 2 

3483 

96 . 449 

C Q "7 Q 

bo / y 

Z11 / /3 

Homo sapiens SRE-ZBP 

2703 

97.567 

CQQ A 

boo U 


Homo sapiens yeast methionyl-tRNA synthetase 
homo log 

5178 

95 . 710 

C O Q 1 

boo 1 

Ab J /bb 

Homo sapiens high-sulpher keratin 

178 

30 . 481 

5882 

X61280 

Oryza sativa hydroxyproline-rich glycoprotein 

217 

29. 000 

5883 

MX94 19 

Mus musculus proline -rich salivary protein 

206 

35 . 862 

CO Q A 

boo4 

J04670 

Haemonchus contortus collagen 2c 

143 

31 . 690 

c o o c 
boob 


Bos taurus H+ ATPase 31kDa subunit (EC 
J . b . X . .J J 

1006 

75 . 664 

5886 

X95518 

Mus musculus neuronal tyrosine threonine 
pnospnacase x 

1647 

46.696 

COOT 

ooo / 


bdiius gaiius aniinopeptiaase ri 

1918 

70 . 194 

coop 
j o o o 


nomo sapiens procein- uyrosine -pnospnacase 

"7 IT C *7 

/ bb 7 

97 . 44 7 

COO Q 
J O O _7 


nomo sapiens zinc linger procein 

r r A 

bb (J 

CO TOO 

coon 


IN X V-/ U. JL ctl let U ct U ct U 1 U pioLll CALcIlSin llKc 

protein, partial CDS 

loU 


jojx 

J-\L U O Z J vj 

ajjiuopbi s cnaxiana araoinogaxaccan— procem 

1 CA 
1D4 

J> / . / / O 

COQO 
3 O -? ^ 

<t*^ O W± / 

LdcnornaLjuiLis eiegans ui j . d 

c n q 

buy 

"3 r\ cat 

3 o . b y 1 

5893 

AB008227 

Ad i an turn rani 1 1 ti<3 — vf»n*^T*i "Rv t* f^n q *i n 

2 01 

■j o . u u u 

5894 

M36912 

Zea mays cell wall protein (put.); putative 

287 

37.173 

5895 

Z66523 

Caenorhabditis eiegans similar to gamma - 
butyrobetaine, 2-oxoglutarate dioxygenase 

670 

34.540 

5896 

L32610 

Homo sapiens ribonucleoprotein 

1734 

94 . 757 

5897 

X55681 

Lycopersicon esculentum extensin (class I) 

202 

31.250 


.1. 0 O S 4-6, «+3 „ O Ei id 6. ill E 



?VT 1 1 7C1 C 

/UjII / b 15 

Homo sapiens hypothetical pirotein 

3 855 

9b . 082 

5899 

L17318 

Rattus norvegicus proline-rich proteoglycan 

264 

30.137 

C Q A A 

\r -t c O O Q 

Y1522 8 

Homo sapiens Leu2 

415 

84 . 444 


X6 0155 

Homo sapiens zinc finger 41 

5499 

99 . 362 

5902 

AF098499 

Caenorhabditis elegans No definition line 
round 

291 

37.500 


C! C O Ti O 

&58 Z 

Homo sapiens X- linked retinopathy protein {C- 
ceirrnina-L , c-iorie Ai&ri . oc / 

294 

72 . 059 

c on a 

AUbbb y 

synunecic conscrucc pre Aor -oecai 

O *> A O 

95 . 92 9 


2VT?1 ROT C4 
J\r X D ¥ X D ft 

Homo sapiens ankyrin repeat -containing protein 
ASB-2 

11 Tl 

-51/3 

yb . 1 /b 



riomo sapiens pxaKopnxnn 3 

*i ob y 

Q 1 Q TO 

y 7 . 878 

5907 

AF140598 

Homo sapiens ring-box protein 1 

479 

69.444 

byuy 

AU / o o 1 

Homo sapiens proline-rich protein Gl 

22 9 

31 . 718 

r A A Q 

5y oy 

ABU JUiS 

Canis familiaris D4 dopamine receptor 

133 

35.789 

59X0 

Y15 918 

Homo sapiens COL1A1 and PDGFB fusion 
cranscripc 

134 

37 . 168 

Z> J X X 

u U j jjZ 

Kattus norvegicus procein pnospnacase 

ltllllJJlLUl 1 pXULclll 

"1 C\ 1 

3 03 

3 7 . 805 

5912 

M29297 

Transposon Tn4 556 unknown protein 

225 

36 . 000 

5913 

XU/OD / 

Homo sapiens pirin 

1552 

93 . 197 

5914 

X59720 

Saccharomyces cerevisiae YCL054w, len:841 

1680 

35.862 

5915 

X1689 9 

Mus mus cuius SAP 

671 

52 . 821 

5916 

TTA t~ t\ A A 

U06 944 

Mus mus cuius PRAJA1 

1 944 

78.068 

5917 

tv r\ t\ f\ f\ r\ t~ r\ 

AP00006 0 

Aeropyrum pernix 143aa long hypothetical 
protein 

166 

34 . 167 

5918 

AE000789 

Borrelia burgdorferi B. burgdorferi predicted 
coding region BB116 

250 

26 . 259 

5919 

X03145 

Homo sapiens pot . ORF V 

210 

37.864 

5920 

X63005 

Mus musculus proline-rich protein 

170 

26 . 389 

5921 

AF072 8 80 

Homo sapiens SOCS box- containing WD protein 

OM-i D "I 

oWlF - 1 

1942 

47 . 826 

r> y z. z 

Ml C Q 1 A 

Zea mays cell wall protein (put . ) ; putative 

T A C 

14 5 

31.3 56 



Aeropyrum pernix l^iaa long nypocnecicai 

P J. UUclU 

151 

"3 D D A "> 

3 8 . 843 
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QO TOO 


TTO QTIC 

UO?j JO 

T_I/"\m/~v a a »^ *i cine? i mVn <"vt»»t"» 

XZ / 

A / . UOJ 

5927 

X75042 

Homo sapiens c-rel 

3919 

98.546 

5928 

AF036906 

Homo sapiens LAT 

188 

33.621 

5y2 y 

ArS015b30 

Homo sapiens type II membrane protein 

774 

40 . 777 

5y 3 u 

At 1 o 1 b 5 y 

urosopmia meianogaster jdcl>jma. vjM05 iUo 

1 9b 5 

52 . 914 


Y R *7 Q 1 O 
A5 / OJ.^ 

Homo sapiens immunoglobulin lambda light chain 

y 5b 

a 3 . b 8-5 


•7/1 QftCQ 

46 fi y ub 0 

unKnown similar co bir-Dinainy procein; cuina. 

XLO X XjI'IDJU - 1"10 -7 X JL X ^ initio X X <wHU Llllo ycllc , V-X^AM/\ CjO 1 

JCil U9JU • LJjZ. 

104 o 

4 0.3 / 3 

5933 

AF010144 


6 1 

2 8 169 

5934 

TTQ7408 

Homo c? ;=> r"\ "i p>ti c? nnlrnnwn 

Z. X _> D 

ft c; 1 4.4. 

O 3 . X*x *± 

5935 

M1553 0 

Unmo cani ^n*5 R-ppl 1 nTowf Vi f arhnr 

1 1 V— ' 1 1 wj ci j_/ -i- w jid u w -L. j. x. w wuii j_ a. i — w -l 

14 0 

6 0 000 

O L/ . \J\J \J 

5936 

M80596 

Saccharomyces cerevisiae VAC1 

231 

24 . 918 

5937 

M19338 

Oryctolagus cuniculus protein kinase delta 

4637 

96 . 011 

5938 

X55687 

Lycopersicon esculentum extensin (class II) 

128 

37.179 

5939 

AF131851 

Homo sapiens Unknown 

365 

38.235 

5940 

AJ133352 

Homo sapiens ZNF23 7 protein 

1644 

70.681 

5941 

X97675 

Homo sapiens plakophilin 2b 

5249 

96.288 


jl o u e h e, ■+ 3 ,., o ;a ;s t» o a 


5942 

AC008075 

Arabidopsis thaliana F24J5.4 

178 

29.714 

5943 

AC005581 

Homo sapiens R3123 7 1, partial CDS 

2485 

80.235 

5944 

Y13141 

Bromheadia f inlaysoniana extensin 

94 

37.500 

5945 

AB002320 

Homo sapiens KIAA0322 

. . 

1006 | 
5 

98.403 

5946 

Z83128 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk433e5.3 comes from this 
gene; cdna est yK433eb.b comes irom cms gene; 
cDNA est yK4o9eii.b comes rrom cms gene 

250 

23 . 929 

5947 

U79260 

Homo sapiens unknown 

JOl 


5948 

Alill7481 

Homo sapiens nypocnecicai procem 


OA "IOC 

5949 

X77055 

Homo sapiens gp'" region or endogenous retro 
virus erv-10 

OA O 
ft *Z 

J / , OUU 

5950 

X8 5134 

Homo sapiens RB protein binding protein 

T C *3 Q 

OR QO7 

5951 

M30023 

orf virus 0RF2 

193 

35.135 ! 

5952 

U16360 

Homo sapiens caudal -type homeobox protein 

1 / z 

J 4 . lz / 

5953 

L36120 

Medicago sativa proline rich protein 

1 "3 C 

13 b 

27.338 

5954 

AF061741 

Homo sapiens retinal short-chain 
dehydrogenase/reductase retSDRl 

133d 

81.132 

5955 

AF000198 

Caenorhabditis elegans Similar to cuticular 
collagen 

O •"> Q 

. oflo 

5956 

M17236 

Homo sapiens MHC HLA-DQ alpha precursor 

_ _ _ 

1,5 / 3 

y ft .loo 

5957 

AF099505 

Homo sapiens colon carcinoma related protein 

2279 

94 .416 

5958 

AL021366 

Homo sapiens CICK0721Q.5 (polypeptide from 
patented CDNA EMBL:E06811) 

687 

84 . 932 

5959 

X59656 

Homo sapiens CRKL 

204 8 

99.6 70 

5960 

X91012 

Mus musculus alpha 3 type IX collagen 

212 

36 . 552 

5961 

D42044 

Homo sapiens The ha3523 gene product is 
related to S.cerevisiae gene product located 
in chromosome III. 

5690 

99.449 

5962 

Z75166 

Saccharomyces cerevisiae ORF YOR258w 

195 

28 . 505 

5963 

AL035453 

Homo sapiens CB42E1.1 (PUTATIVE novel protein 
similar to various different known and 
predicted proteins) 

163 

29.83 9 

5964 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

405 

34.091 

5965 

Y13618 

Homo sapiens abundant transcript 

1 1 1 Q 
1 / 1 O 

3 

. /z / 

5966 

X69090 

Homo sapiens 190kD protein 

9483 

yy . Zft j 

5967 

AL078468 

Arabidopsis thaliana putative protein 

Q O 1 
o Z. 1 

j / . boi 

5968 

M28515 

Mus musculus zinc finger protein mfg3 mRNA 
(put . ) ; putative 

i n d 
3Uo 


5969 

Z98046 

Homo sapiens dJ1409 . 2 (Melanoma-Associated 
Antigen MAGE LIKE) 

3060 

92.869 

5970 

U3108 9 

Homo sapiens Abl binding protein 3 


QTQ 

5971 

AF044 924 

Homo sapiens hook2 protein 

ft DO 1 

. fi 3D 

5972 

X12451 

Homo sapiens pro- (cathepsin L) 

1757 

90.029 

5973 

D45371 

Homo sapiens a novel adipose specific 
collagen-like factor, apMl (adipose most 
abundant gene transcript 1) 

O O T 

32 7 

3 1 . olo 


IlOUflD 

Unmn cam' ona "RY^lFi'PT nnrnrrpnp narhnpT* ( FOP \ 

2466 

98 . 030 

5975 

AF123320 

Homo sapiens lymphocyte activation-associated 
protein 

312 

26 .437 

5976 

L41270 

Homo sapiens natural killer associated 
transcript 4 

2781 

89.451 

5977 

M11902 

Mus musculus proline-rich salivary protein 

333 

36.986 
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r Q -» q 

mi 0 1 a a 
M1Z14 0 

riomo sapiens pox yene protein, axx 

X O fx 

O J • / x*± 

597 9 

Tin Q O C T 

Homo sapiens unknown. 

J O \J 


5980 

X56681 

Homo sapiens junD protein 

1750 

88.701 

5981 

X97477 

Rattus norvegicus in xvk -fib protein 

1 A 1 


5982 

U41557 

Caenorhabditis elegans proline and glycine- 
ricn 

345 

28 .816 

5983 

M19441 

Mus muscuius Kruppei - reiacea protein 

0*A*A\ 

■ic qoc 
.5 o . y OO 

5984 

'ft'CATI AO"! 
At 0 /10B1 

MycoDaCteriuin tuDercuiosis proiine-iicn iiiutiu 
nomoioy 

7"lp 
/ J o 

o q ogi 


At uuy ODD 

multiple bLiciUaio aoSuuiaLCU ictiuviiuo 
nrnh pqqp 

179 

35 . 294 

CQOf 
J JOD 


U/>m/\ cj^i^it «i f nq"i oa nTn1"Pi A 

4919 

95 . 495 

CQ07 
o y 0 / 

VI A 7 "I 7 

TTrvmn oani 0 n a -» mmnnrtfrl nKn Inn 1 ^mViH^ hPAW f'VlA "i A 
11 k_) 1 1 \kj UciyicXlo xiiuiiuiivy xui/ux xii xaiiujuci 1 ice* v y v^-iicu-ii 

2 919 

90 . 985 

5988 

AB023151 

Homo sapiens KIAA0934 protein 

9056 

99.852 

C Q Q Q 

Dy 0 y 


jnomt? oapieiio pyxyn uxiiuxii^ piutciii \ r-w^. x. 0 j / 

554 

54 . 106 

c o q a 

by y u 

Ar UODyoo 

uncnotcicd vuivuius pyx xuiiuunc iiwi cm » — i-y en 

102 

34 .24 7 

i>y y 1 


jjrosopmia meianogastei Dtur»n . unu ^01/ 

753 

41 . 692 

5992 

X78929 

Homo sapiens zinc finger protein 

61 

28 .758 

5993 

AP000062 

Aeropyrum pernix 141aa long hypothetical 
protein 

151 

38 .843 

5994 

X79389 

Homo sapiens glutathione transferase Tl 

1334 

96 .234 

5995 

U4 3701 

Homo sapiens ribosomal protein L23a 

793 

84 . 516 

5996 

M99578 

Homo sapiens 550 amino acids MW=61kDa, 
glycosylated=75 kDa; expressed on endothelium, 
activated lymphocytes and syncytiotrophoblast , 
contains leucine zipper and basic region 
homologous to myc; 72 IP 

3531 

95 . 130 

5997 

M12140 

Homo sapiens envelope protein 

1454 

44 .621 

5998 

AL117608 

Homo sapiens hypothetical protein 

J jl 

41 4 A A 

5999 

AE000446 

Escherichia coli regulator protein for dgo 
ope r on 

673 

87 .395 

6000 

Xy 5677 

Homo sapiens argBPIB 


*5 A 1"J7 
JU . X J / 

6001 

17 e *t o o 
Z6 7 990 

Caenorhabditis elegans similar to cuticle 
collagen 

z z u 

7 O AAA 
J . uuu 

6002 

M31013 

Homo sapiens nonmuscle myosin heavy chain 
(NMHC) 

4764 

67.062 

6003 

AF0833 91 

Homo sapiens putative WHSC1 protein 


O 1 . U / O 

6004 

U0 984 8 

Homo sapiens zinc finger protein 

O 1 A Q 

y y . o o y 

6005 

Tift r\ /\ c 1 

U00051 

Caenorhabditis elegans coded for by C. elegans 
cDNA yk50b2.5; coded for by C. elegans cDNA 
Lbao v j£ o r ; similar to lipases over a snox t 
x tiy J. on 

Q A 1 

O Q Q 77 

6006 

X07173 

Homo sapiens trypsin inhibitor 

6197 

100.000 

c a a "7 

Ar Uol / J y 

Homo sapiens unknown 

1123 

98 .8 70 

c n a 0 
6OU0 

aoat con 

uryctoiagus cunicuius leccin-iiKe oxiaizcu 
receptor 

97*1 

7fi ^4Q 

o uuy 

AD / lOO 

KaltUS riUXVcy ltUo \ o / -z-nyuiUAyatiu UAiuaoc 

1627 

72 . 521 

C AT A 

bUXU 

TTQ C/IQ1 

DOS taurub y lytclaluciiyuc j pilUoLyiiaLC 

dehydrogenase like- 17 protein 


71 . 028 

6011 

AB011179 

Homo sapiens KIAA06 07 protein 

4804 

99.726 

6012 

Z29481 

Homo sapiens 3 -hydroxyanthranilic acid 
dioxygenase 

1805 

97.232 

6013 

AJ005566 

Mus muscuius SPR2H protein 

161 

38 .667 

6014 

U28131 

Homo sapiens novel transcript; similar to 
transcription factors activation domains; 

171 

49.206 
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6015 

AB001452 

Rattus rattus Sck 

2 805 

ft G CkO ft 

6016 

Y00644 

Homo sapiens precursor polypeptide (AA -34 to 
287) 

1929 

QO AOft 
-7 O • *iiiO 

6017 

AF151848 

Homo sapiens CGI -90 protein 

410 

34.4 98 

6018 

AF095791 

Homo sapiens TACC2 protein 

4105 

98 .930 

6019 

M63274 

Plasmodium falciparum malaria antigen 

186 

.0/0 

6020 

S80864 

Homo sapiens cytochrome c-like polypeptide 

Q7 n 

ft ^ *^4. ft 

6021 

AF161081 

Homo sapiens activatory receptor PIRIIbeta 

199 

74 ftft ft 

6022 

AF151800 

Homo sapiens CGI-41 protein 

2945 

99.115 

6023 

AL117496 

Homo saoiens hvoothetical orotein 

1 nQ9 

1 


6024 

AJ011779 

Homo saoiens SEC61 nrntpin 

/ 


6025 

X78933 

Homo saDi&ns zinc f inert*** ornf Pin 


QQ CQA 

6026 

X01655 

Homo saoiens tvoe III Drocollaapn /aa q qo - 
1023) 

X / w 

*5 a /i ft n 

6027 

AF1053 32 

Homo sani pn«> vi t"anii n F)"i T~^r*i=»r"ki~OY* inhorarfinn 

protein 

Q ft Q c: 

Q Q "3 A C 

6028 

AF155113 

Homo saDiens NY-REN- 55 antiapn 

** -7 W «& 

. j / u 

6029 

AL032821 

Homo saoiens dtl55C?"^ 1 (vanin 1 ^ 

J fk X X 

1 ri r» ft ft ft 
1UU . uuu 

6030 

X06562 

Homo saoiens arowth ho rmnnp rpppnfor f AA 1 - 
638) 


QQ Q A "3 

6031 

M69181 

Homo sapiens non- muscle myosin B 

12 3 5 
8 

-7 _/ . 343 

6032 

X61046 

Hydra sp. mini -collagen 

215 

** X . X / 0 

6033 

X51798 

Homo sapiens precursor 

Qft n 

-7 V U 

QQ ft n n 

6034 

AF084256 

Homo saoiens beta alucuron i rip* i qoform c\ 

14c 

X *± 3 

G ft dCR 
OU .403 

6035 

AF152307 

Homo sapiens protocadherin alpha 11 

5459 

89 .053 

6036 

Y12711 

Homo c; rs 1 pne m i f t~ "5 nynrff* o f~ » y~/~*t"» <=» hi nHi nrr 
nuiiiu ocijJXCiio puLdl — LvC piU^CoLcIUIlt: JJlIlUlIiy 

protein 

Q "3 Q 


6037 

AB002330 

Homo sapiens KIAA03 32 

6743 

99.415 

6038 

AF106681 

Homo RaniPIlQ Q — Y*^1 £»t~^H O r PI5_V^"i nH i nrr r^-r~r>i- oi n 

Q O Q 

Of . OD^ 

6039 

AF083419 

Homo qan"i pnc; r*?i 1 nm/ral mnHi 1 1 i n - HononHonh 

protein kinase II beta e subunit 



6040 

AB020649 

Homo saDiens KIAA0849 or*ot~ pin 

OOi<6 

Q Q CI rt 

6041 

AF123 8 80 

multiple sclerosis associated retrovirus 
element unknown protein U5/1 

OQ7 

TC TCQ 
/ Zj . /DO 

6042 

AB002306 

Homo sapiens KIAA03 08 

8346 

99.692 

6043 

X06745 

Homo saoiens DNA oolvmerase aloha -subunit (AA 
1 - 1462) 

Qfi "*ft 

QQ 1 ft 1 

J7 J7 . X O O 

6044 

AF028827 

Homo sapiens Tax interaction protein 40 

1520 

99.567 

6045 

X71428 

Homo sapiens PUS gycline rich protein 

16 01 

QC Hfi 

6046 

D86984 

Homo sapiens similar to yeast adenylate 
cyclase (S56776) 

3034 

99.580 

6047 

AF151877 

Homo sapiens CGI- 119 protein 


QQ ^Qft 

6048 

AF165124 

Homo sapiens gamma -aminobutyric acid A 
receptor gamma 2 

3150 

QQ 7Rfi 

6049 

AF110267 

Rattus norvegicus golgi stacking protein 
homolog GRASPS 5 

2305 

85.092 

6050 

AF084530 

Homo sapiens cyclin-D binding Myb-like protein 

4818 

99.213 

6051 

U82319 

Homo sapiens novel ORF 

382 

92.754 

6052 

AB007922 

Homo sapiens KIAA04 53 protein 

6946 

99.905 

6053 

Y00816 

Homo sapiens CR1 precursor protein 

1472 

99.461 
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6054 

X52053 

riomo sapiens nir - i iaa. i - :?f± / 

£ *■% f* 

ftfi Q49 
OO • 0 ft a 

6055 

AjJ238 /64 

homo sapiens uur* jm - ace uyiy iuco saminc - z - 

c±v~\ "i ma oo / KT — is pohvl mannrtQam i np Ir -? n ;a o o 
cpilllcl aoc / 1.M clv^ti L. y x UlallllUoaUl J.11C JVXilaOC 

fx O v J 

Qft a. ft 

£- n c £T 
oObo 

T 1 rt Q C\ Q 

l v lUb IllUbuUXUo *jV^dk7X y CilC jji.uuuv.u 

167 

34 . 568 

6057 

AB002312 

Homo sapiens KIAA0314 

8357 

99.919 

6058 

Xbl / 98 

riomo sapiens precursox. 

-7 V W 

Qft non 

6059 

AF123534 

Homo sapiens nucleolar protein NOP5/NOP58 

3278 

100.000 

6060 

AJ223948 

Homo sapiens RNA helicase 

o ^2 / U 


6061 

X66276 

Homo sapiens C protein 

/ JO j 

. O /.It 

6062 

X6 7055 

Homo sapiens mter-aipna- trypsin lnmoiuor 
heavy chain H3 

D / 1 D 


^ A C O 

606 J 

I Oob 1Z 

riomo sapiens oojvua nucieai poic t^oiupxtsyt 
px oce xn 

4897 

99 . 595 


VI X. j ft ft Z 

Mi i a mi i a f~* i ilna al nha — t*i i Vm il i n H qnr*VTJP M — Pi 1 nha — fi 

2922 

98 . 667 

606 b 

V C Q Q C /l 

Abyo6fi | 

Dahhna y-j t"ii a OT?T? 

467 

57 . 407 

oUbo 



643 9 

99 . 897 

606 / 

Ar iz y / do 

UAmA —s y-v ^ Ann "D A T 1 /t 

2409 

99 .719 

606 o 

Ar lji / / b 

riomo bdpiciib uiiivixowii 

223 9 

98 . 387 

6 06 y 

at n q-t m 

nCJIHO bapicilb r iUUJ.^ 

122 8 

100 000 

\J \J m \J \J V/ 

6 07 0 

Ar lo / jU j 

nyxine yiutinosa caxiuoauiin 

396 

45.270 

6071 

X52966 

Homo sapiens ribosomal protein L35a (AA 1-110) 

485 

75.000 

6072 

X15954 

Homo sapiens mannose binding protein 

15 97 

i no 000 

6073 

X78990 

Mus musculus testin 


^ X . o 0 

6074 

AB01113 9 

Homo sapiens KIAA056 7 protein 


Dft ft 

-7-7 . \J O \J 

6075 

AF153606 

Homo sapiens angiopoiet in- related protein 

2754 

100.000 

6076 

Z98885 

Homo sapiens aJ522u7 . 2 (Peregrin (BR140) like 
protein) 

/Obi 

yy .bzj 

6077 

L76571 

Homo sapiens nuclear hormone receptor 

1 / b O 

yy . buy 

6078 

AF036145 

Homo sapiens meningioma- expressed antigen 5 

b z ft -6 

Q Q "7 A O 
Z7 ^ . / flO 

6079 

D87685 

Homo sapiens similar to human transcription 
factor TFIIS (S34159) . 

llj / 

2 

QQ Qzl O 
J7 J7 . J<± A 

6080 

D80007 

Homo sapiens similar to hypothetical protein 
YM9959.11C of S . cerevisiae . 

Q 

QQ O Q "5 
-7 -7 . O J J 

6081 

D88769 

Mus musculus latexin 

1275 

84.685 

6082 

X56807 

Homo sapiens desmocollin type 2a 


q q fit;n 
yy . d jv 

6083 

X78710 

Homo sapiens metal -regulatory transcription 
factor 

5075 

100.000 

6084 

AL050258 

Homo sapiens hypothetical protein 

5719 

100.000 

6085 

AF141289 

Homo sapiens bo,+ amino acid transporter; 

DO , +A1 

"J1 ft! 
J 1 O / 

1UU . uuu 

f r\ c> 

6086 

X15D1Z 

Kactus norvegicus apoiipoproLcin ^*i/ 
preprotein 

14 7 

4 5.349 

6087 

AC0024 6 4 

Homo sapiens organic cation cransporcer, out 
similarity to JC4884 (PID :g2143892) 

0077 

99 .344 

H A D O 

6 088 

at ncnOTO 
AliObO-s / 

riomo sapiens iiypoi.ncLj.cai pxutcin 

854 

95 . 522 

6089 

AB023147 

Homo sapiens KIAA093 0 protein 

2197 

99.425 

6090 

X15652 

cricecuius xongicauaacus xsi-eunyimaieimicie 
sensitive rusion procein 

4 7ft ft 

97 . 872 

O v _7 X 

rUJ U /, /- J -6 -7 

Homo qaniens bK4 07Fll 2 ( adireneraic - beta- 
receptor kinase 2) 

4647 

99.855 

6092 

AF087438 

Homo sapiens protein phosphatase 2 subunit A 
isoform beta 

3799 

99.834 

6093 

L11702 

Homo sapiens phospholipase D 

5659 

99.881 

6094 

AL110160 

Homo sapiens hypothetical protein 

1460 

100.000 
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6095 

X95586 

Homo sapiens proteasome 

1730 

100 . 000 

6096 

AB011149 

Homo sapiens KIAA05 77 protein 

6750 

99. 904 

6097 

M34449 

Homo sapiens complement receptor 1 

859 

78 . 882 

6098 

X63564 

Homo sapiens RNA polymerase II largest subunit 

1313 
8 

99.493 

6099 

AJOOb 984 

Capsicum annuum proline- rich protein 

199 

35 . 052 

6100 

AF111162 

Homo sapiens guanine nucleotide exchange 
factor 

1206 
6 

99 . 568 

6101 

AJ24 5600 

Homo sapiens hypothetical protein 

3243 

100 . 000 

6102 

X95762 

Homo sapiens Aminopeptidase P-like 

4163 

99 . 839 

6103 

AL110469 

Schizosaccharomyces pombe hypothetical protein 

872 

45.833 

6104 

L36531 

Homo sapiens integrin alpha 8 subunit 

6712 

98 . 254 

6105 

X98296 

Homo sapiens ubiquitin hydrolase 

1709 
3 

99 . 451 

6106 

Z97074 

Homo sapiens p40 

2279 

93 . 651 

6107 

M16973 

Homo sapiens complement protein C8 beta 
subunit precursor 

4125 

100.000 

6108 

AL117646 

Homo sapiens hypothetical protein 

1877 

100 . 000 

6109 

X82200 

Homo sapiens gpStafSO 

2904 

98 . 655 

6110 

AB011139 

Homo sapiens KIAA056 7 protein 

6345 

99.080 

6111 

AL117496 

Homo sapiens hypothetical protein 

1092 
1 

99 . 046 

6112 

AL110219 

Homo sapiens hypothetical protein 

3897 

99 . 180 

6113 

X9706 5 

Homo sapiens Sec23 protein 

5155 

99 . 739 

6114 

X80909 

Homo sapiens Nascent polypeptide associated 
complex alpha subunit 

1267 

97 . 727 

6115 

M64 982 

Homo sapiens fibrinogen alpha- E chain 

5694 

98 . 274 

6116 

X66363 

Homo sapiens serine/threonine protein kinase 

3213 

98.413 

6117 

Z83067 

Homo sapiens FAA 

9638 

99 . 794 

6118 

U19617 

Mus musculus Elf-1 

3289 

87 . 279 

6119 

D14531 

Homo sapiens 1 human homologue of rat ribosomal 
protein L9 ' 

1110 

93 . 814 

6120 

X15875 

Homo sapiens cAMP response element binding 
protein (AA 1-505) 

3213 

99.802 

6121 

Y13115 

Homo sapiens serine/threonine protein kinase 

6264 

99 . 794 

6122 

AF132947 

Homo sapiens CGI -13 protein 

2822 

96.368 

6123 

D29810 

Homo sapiens unknown 

2011 

94 . 022 

6124 

AF141289 

Homo sapiens bo,+ amino acid transporter 
bo , + AT 

3107 

100 . 000 

612 5 

AB0183 19 

Homo sapiens KIAA0776 protein 

5036 

100 . 000 

6126 

7\T r\ ~3 A A CO 

AL0344 52 

Homo sapiens dJ682»J15.1 (novel Collagen triple 
helix repeat containing protein) 

2539 

100 . 000 

CI OT 
OIZ / 


Homo sapiens unknown 

lyzz 

Q Q *5 Q A 

jo . z y4 

olzo 

VI >t /l Q/l 

x m y *i 

Homo sapiens aralarl 

433 0 

y« .oio 


AoUD 17 D 

rlOluO Sapiens KJJSJ\±ti3 

O O Q A 

yy . uo / 

-in 
Dl jU 

/\r lzo jo j 

bdj.j.us gaiius osceoyiycin 


. 11 


TV C*l •JOQCQ 

Homo sapiens CGI- 34 protein 


yo . bz4 

Ol 

VI QAQI 

riomo sapiens oiA- /ao protein 

J4o J 

y y . Uob 


ado / yy 

unidentified unnamed protein product 

o Q r» 
3 900 

99 . 644 

6134 

X91879 

Hfimn oani one; ciT~><=»Y~m cr\ppi -p i <-» nrnt'Pi n 

4 810 

ion ooo 

_L \J \J . U U U 

6135 

U66140 

Canis familiaris forssman synthetase 

1948 

82 .421 

6136 

AB023065 

Rattus norvegicus O-sialoglycoprotease 

2017 

94 .410 

6137 

AB018350 

Homo sapiens KIAA0807 protein 

8370 

99. 921 

6138 

AF140690 

Homo sapiens melusin 

2317 

97.994 

6139 

AF045646 

Caenorhabditis elegans No definition line 

293 

30. 726 
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6140 

AFT F, 1 R F. 0 

nui IILJ actpXclio V_\jjX X v» X piOUclil 

14 08 

lOO . 000 

6141 

ABOH125 



99 . 543 

6142 

rir u j j j. 

Homo c; o T*i i one; V>t~ "i <H rr S r~\ fr i nhonyat* *^»-»- _ o 
ilUIUU Ddpicllb XJX xtxy x xiy lXlLt-y IctLUI 

**t c a -3 

99.823 

6143 

AF171877 

Homo sapiens cleavage and polyadenylation 

Qnpri "F -i t t~ \r f" .a ot~ ov 7"^ lrT")^ ct i i V»i m "i t~ 

4362 

99.701 

6144 

AJ01034 6 

Homo qanipnci RTNPI-H9 

/I CI A 

*± _> 1U 

i Art rtrtri 
lOO . OOO 

6145 

*VX-> UD JO 



97 . 244 

6146 

D25217 

Homo etani phq K"TZ\Aftft77 

O Q Q A 

y y . 543 

6147 

X59543 

Homo o a n> i phq M 1 <ca i i V\i in t t~ o y* t notii i o 1 oot- i Ho 

reductase 

con 

1 / 

100 . 000 

6148 

X85786 

Homo saDiens bindinc* r*£*cr\il a t~ orv f artnr 

A i ft "5 

QQ Q 1 Q 

y y . ojo 

6149 

Y08319 

Homo saoipnf? kinesin-2 

49*77 
4« / / 

QQ C CO 

6150 

AF135422 

Homo ^aoipn 4 ? nDP-mannn<3P nvrnnhnQnhnrvl aqp 7x 

O 7 ft 7 

QQ "7 A Q 

y y . / ** y 

6151 

Z47362 

Homo sapiens T cell factor 1 splice form E 

2423 

93 . 947 

6152 

AF151820 

J. ~J -X. O .ti V/ 

Homo qani on cj CCZT — nrnhoi n 


Q O A C C 

y« . 4do 

6153 

X85133 

Homo Q3Ti"i pn g T? R r>t*r>h q -i n V»i nrli nrr ■»-. -»-/-\ 4- « -J n 
"wlllU oa|JXcllo I\D LJX V-/ C-t* XII UXlXvJXXXy pxtJt-tiXll 

bzil 

yy . udz 

6154 

AB023139 

Homo sapiens KIAA0922 protein 

5160 

98.108 

6155 


noil lO bdpxcllb XvJC.clt.c2Cl at vji-i 1 XjX 

4361 

98 .618 

6156 

TT9 4 m ft 

rnjiiio faapiens poo iiacuxax Kiiier ceil recepcor 

MX CcUX oUX 

2346 

99 . 713 

6157 

AB020663 

Homo cam' PnQ VT7\7\OQR<^ nrnhoin 

JxvJIUvJ OCtpXCUo XV. X/A/WJ ODD JJXtJUfcJXIl 


QQ O C A 

y y . zb4 

6158 

A _? O O 

Homo Q a "5 on G TTHD _f2al "NT 7\ « . 1 •! /^n 1\T_ 
nUI 1 BdpXcIlb UU±r — vjcIXXM/\C . puiypcptlUc JM — 

c* vz- xy c\ _i_ c\ v — l_ v_/ia ct 11 1 x. 11 y x_ l. x. alio x cz x d. o 

/lido 

99 . 211 

6159 

X773 95 

OaULi la x vjll iy v^cJo LcicVlolac xM^SVJfiC 

oo b 

1 A O C Q 

3-1 . z^y 

6160 

AB029334 

nctxu^yiinixct xuxcl^x nx rCiX x 

*7 O 7 

3Z . oOo 

6161 

AT.050114 

Homo cani on ct Vivmot - Vi ot* t 1 nrrih 0 t n 

i ez q o 
xo y y 

Q Q ICQ 

y o . lb O 

6162 

AF156098 

Homo sapiens RNA binding motif protein 7 

357 

58.824 

6163 

AF1 0074 ft- 

T-TonrtO o o "i Con')') VS^mr^T 

nvjuivj oapxciib jclzz iiuiiiuxoy 

T O /I O 

99.2 91 

6164 

AR09 ft 7 ft 1 -; 

Homo cj^nT one; lfTSfiflfiQP n>^f^k t~ q t ti 

o4o4 

99 . 694 

6165 

M74 08 9 

Homo ci ^ n "5 one: TR1 

O O "7 >1 

1UU . UUU 

6166 


ricjiucj odpiens nypoLnetxCdi procein 

C Q O £T 

100 .000 

6167 


ri(jiii\j bdpieiib pi ccui sux poiypcptiuc \ t\t\ z J CO 
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Homo sapiens myosin I beta 

6636 

98 . 444 

6224 

AF132962 

Homo sapiens CGI -2 8 protein 

2087 

99 . 678 

6225 

AF151891 
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Homo sapiens myosin regulatory light chain 

1104 

99.422 

6362 

X99404 

Homo sapiens Berg36 

2117 

98 . 750 

6363 

U64601 

Caenorhabditis elegans Gene probably begins in 
the next cosmid 

531 

50.336 

6364 

AL080065 

Homo sapiens hypothetical protein 

2287 

98 . 880 

6365 

Y17999 

Homo sapiens DyrklB protein kinase 

4307 

99 . 841 

6366 

AF152961 

Homo sapiens chromatin- specif ic transcription 
elongation factor FACT 140 kDa subunit 

6721 

99 . 618 

6367 

L31783 

Mus musculus uridine kinase 

982 

70.155 

6368 

AL117462 

Homo sapiens hypothetical protein 

849 

49 . 841 

6369 

AB020710 

Homo sapiens KIAA0903 protein 

6183 

99 .480 

6370 

AF151870 

Homo sapiens CGI -112 protein 

1355 

96.602 

6371 

AF143321 

Homo sapiens unknown 

829 

65.789 

6372 

X06256 

Homo sapiens integrin alpha 5 subunit 
precursor 

7083 

99. 905 

6373 

AF051151 

Homo sapiens Toll/interleukin-1 receptor- like 
protein 3 

5659 

99.650 

6374 

Y13620 

Homo sapiens BCL.9 

9699 

99.498 

6375 

Z73102 

Caenorhabditis elegans Similarity to 
B.subtilis DNAvT protein (SW:DNAJ_BACSU) ; cDNA 
EST yk4 3 7al.5 comes from this gene 

884 

40.943 

6376 

AB018257 

Homo sapiens KIAA0714 protein 

7325 

99 . 820 

63 77 

AJ 224 741 

Homo sapiens matrilin- 3 

3300 

100 . 000 

6378 

U6 5416 

Homo sapiens MHC class I molecule 

2560 

99 . 215 

6379 

AF004876 

Homo sapiens 54TMp 

1031 

56 .803 

6380 

AL021878 

Homo sapiens dJ257I20.4 

205 

72 . 000 

6381 

D90734 

Escherichia coli ORF ID:o223#ll 

785 

100 . 000 

63 82 

X86779 

Homo sapiens FAST kinase 

3626 

99.637 

6383 

AJ005890 

Homo sapiens JMl 

3931 

99 . 518 

6384 

AF069762 

Homo sapiens map kinase phosphatase -like 
protein MK-STYX 

2111 

100 . 000 

6385 

AF131826 

Homo sap i en s Unknown 

1980 

99 . 664 

6386 

AF157600 

Mus musculus pleckstrin 2 

2055 

92 . 918 

6 38 7 

X83425 

Homo sapiens Lutheran blood group glycoprotein 

4087 

98 . 722 

6388 

AL096749 

Homo sapiens DKFZp434G153 

3100 

99 . 360 

r t n o 

638 9 

AF036 13 0 

Homo sapiens collagen type IX alpha I chain, 

J. Ul iy -L kJi III 

6706 

99 . 457 

6390 

AF151814 

Homo sapiens CGI -56 protein 

1900 

99.051 

6391 

AB020662 

Homo sapiens KIAA0855 protein 

4070 

97 . 372 

6392 

M58378 

Homo sapiens synapsin I 

4798 

99. 008 

6393 

AF151846 

Homo sapiens CGI -88 protein 

1351 

99 . 000 

6394 

X03528 

Homo sapiens lambda L- chain C region 

588 

87 . 736 
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6395 

AF03 9697 

Homo sapiens antigen NY- CO- 31 

1580 

97 . 414 

6396 

AFO 93543 

Homo sapiens transforming acidic coiled- coil 
containing protein 3 

5319 

99 . 404 

o^y / 

Ar UyblbU 

Homo sapiens protein phosphatase 2A BR gamma 
suDuniL 

2686 

99 . 294 

D .> 2? O 

a .to o o q m 

riomo sapxens neutral spningomyei mase 

26 75 

97 . 887 

D J J J 

J-JO JUD J. 

riomo sapiens transcription iaccor oiii 

3 74 5 

99 . 639 

fi a. n. n 

D *i U \J 

nDnn-j O QO 

riomo sapxens is.iAAUzyb 

64 94 

99 . 898 


W *4 / / *4 

riorno sapiens l/ix- d 

"1 Q C A 

1 A f\ AAA 
100 . 000 

fid. no 
o f* uz 


riomo sapxens Lbi-iu j protein 

2052 

97 . 771 

fid fi*} 
w *x U O 

/-Vir UO JZ ID 

ncjnio sapiens cycosoiic pnospnoi lpase az oeca 

C C *7 1 
DD/7 

99 . 3 06 

6404 

AJ005892 

Homo sapiens JM23 

2079 

95 .482 

fid. n c; 

O ** \J o 

TTd *7 fi 1 d 
Uft / D J ft 

riomo sapiens oeca- tUDuiin 

2892 

97 . 517 


Ml 1 A A *> 

Mus musculus alpha- tubulin isotype M-alpha-6 

2942 

98 . 667 

O fl U / 

TV TJ A 1 /I C"7 /t 

Homo sapiens KIAA0674 protein 

7874 

99 . 354 

fid a q 

A TP A O A A AT 

Mus musculus unknown 

1499 

71 . 510 

c a n q 

"V7 C C 1 

Homo sapiens tuoerin 
. . 

1155 
1 

99 . 006 

fid i n 

TV T?l 

i-ilr ID" D4o 

Homo sapiens gamma- synerg in 

8623 

99 . 924 

fid 1 1 


nomo sapxens ragrj 

24 26 

99 . 465 

6412 

AF104923 

Homo sapiens putative transcription factor 

5936 

96.013 



Homo sapiens CGI- 13 8 protein 

1000 

99 . 346 

6414 

AF151873 

Homo sapiens CGI- 115 protein 

1628 

100. 000 

6415 

ALi050276 

Homo sapiens hypothetical protein 

4457 

99 . 850 

6416 

AL080157 

Homo sapiens hypothetical protein 

2850 

99 . 533 

6417 

AF077033 

Homo sapiens putative ATP-dependent RNA 
helicase ROKl 

3791 

99.165 

6418 

Z99707 

Arabidopsis thaliana methionyl aminopeptidase- 
like protein 

904 

53 . 069 

6419 

Z48570 

Homo sapiens Spl7 

961 

100 . 000 

642 0 

Z81053 

Unknown predicted using Genefinder; Similarity 
to Yeast mitochondrial ribosomal protein S5 

/ C T.T . T"> T* f\ C 

\ bW : RT0 5 

455 

33 . 129 


Alb JUb 

Homo sapiens heavy neurofilament subunit 

6271 

98 . 150 

D fi Z Z 

ADAATQQ/1 
A±3U U / O Ofl 

Homo sapiens KIAA04 24 

3501 

99 . 806 

6423 

AB018323 

Homo sapiens KIAA0780 protein 

7519 

99 . 909 

6424 

Z97 992 

Schizosaccharomyces pombe conserved 
hypothetical protein 

403 

33 . 096 

fid o c 

D *± Z J 

nui "1 o Q rc a 
Ar 1 j jU 

Homo sapxens CGI -16 protein 

2625 

99 .750 

fid o fi 

Ar UfiA / J. ,5 

Rattus norvegxcus neurexophilin 3 

1675 

95 . 635 

fid O 7 

TTfK 1 1 fi fi 

Oncorhynchus mykiss terminal deoxynucleotidyl 
transferase 

620 

43 . 866 

fidO Q 
O f* 4. o 

TV 13 A T O O AO 

riomo sapxens rvXAAU/bU protein 

82 54 

99 . 503 

6429 

X83573 

Homo sapiens ARSE 

4104 

99.321 

fid i n 

7A TO /l "> O *7 j4 
AU Z*± J Z / fl 

Homo sapiens AP-2rep protein 

2454 

94 . 828 

fi d 1 1 

7A T?1 1 fi 1 Q 
Ar X^lOJO 

Homo sapiens CGI -80 protein 

2128 

100 .000 

fid TO 

7\T nicrno 

Homo sapiens dJ479CT7.1 (similar to 

1791 

100 . 000 

fid ^ 

ft J?A C 1 Ovf A 

Picea mariana probable ubigui tin- conjugating 
enzyme E2 

680 

48 . 598 

6434 

AF153606 

Homo sapiens angiopoiet in- related protein 

2754 

100.000 

6435 

AF151904 

Homo sapiens CGI -146 protein 

778 

74 . 869 

6436 

X83973 

Homo sapiens transcription factor 

5570 

99.548 

6437 

AL080169 

Homo sapiens hypothetical protein 

1135 

94 . 536 

6438 

AF130367 

Mus musculus Fas-apoptosis inhibitory molecule 

1029 

90.395 
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6439 

M63193 

Homo sapiens endothelial cell growth factor 

801 

72 . 917 

6440 

AF151810 

Homo sapiens CGI -52 protein 

2256 

99. 721 

6441 

AL022237 

Homo sapiens bK1191B2.3.1 (PUTATIVE novel Acyl 
Transferase similar to C. elegans C50D2.7) 
(isoform 1) 

1620 

75. 819 

6442 

Z67743 

Homo sapiens CLC-7 chloride channel protein 

5148 

99.620 

6443 

L14610 

Rattus norvegicus transcription factor 

3065 

96 .281 

6444 

AF006264 

Homo sapiens hHR21spB 

3324 

98 . 543 

6445 

Z92825 

Unknown predicted using Genefinder; Similarity 
to Yeast low-afinity glucose transporter HXT4 
(PS : 32 

1178 

42 . 576 

6446 

AJ006266 

Homo sapiens AND-1 protein 

7551 

100 . 000 

6447 

AF151908 

Homo sapiens CGI -150 protein 

3180 

99.405 

6448 

AF093744 

Homo sapiens unknown 

293 

100 . 000 

6449 

AF180919 

Homo sapiens RNA lariat debranching enzyme 

3611 

99 . 816 

6450 

AF100750 

Homo sapiens SLAP- 2 homolog 

2697 

91 . 616 

6451 

S41204 

Mus sp. H beta 58=essential for embryogenesis 

1141 

58 . 537 

6452 

Z83844 

Homo sapiens dJ3 7E16.4 (similar to mouse 
p!16Rip protein) 

1515 

98 . 062 

6453 

X83378 

Homo sapiens putative chloride channel 

5711 

99.541 

6454 

AL110240 

Homo sapiens hypothetical protein 

916 

98 . 571 

6455 

A69608 

unidentified unnamed protein product 

3965 

99 .353 

6456 

AF153192 

Homo sapiens ras-related protein 

1827 

100 . 000 

6457 

AB029031 

Homo sapiens KIAA1108 protein 

4913 

99.737 

6458 

D79994 

Homo sapiens similar to ankyrin of Chroma tium 
vinosum. 

8478 

99. 847 

6459 

AL050149 

Homo sapiens hypothetical protein 

3634 

98 . 599 

6460 

AB018301 

Homo sapiens KIAA0758 protein 

6455 

99. 595 

6461 

AB008376 

Sus scrofa 17-kDa PKC-potentiated inhibitory 
protein of PP1 

798 

87 . 248 

6462 

AB028991 

Homo sapiens KIAA1068 protein 

2343 

100 . 000 

6463 

U58658 

Homo sapiens unknown 

275 

60 . 227 

6464 

AB023190 

Homo sapiens KIAA0973 protein 

1044 
6 

99.369 

6465 

AB018295 

Homo sapiens KIAA07 52 protein 

2139 

100 . 000 

6466 

AL050015 

Homo sapiens hypothetical protein 

1245 

100 . 000 

6467 

AF124512 

Homo sapiens BVES 

259 

39 . 130 

6468 

AC004531 

Homo sapiens Gene with similaity to DEAD box 
helicases 

3400 

99. 076 

6469 

A12142 

synthetic construct IFN-pseudo-omega 2 

987 

99.351 

6470 

AF065215 

Homo sapiens cytosolic phospholipase A2 beta 

6677 

99.306 

6471 

U31629 

Mus musculus unknown 

1855 

94 .257 

6472 

Z82268 

Caenorhabditis elegans cDNA EST yk478b4.5 
comes from this gene; cDNA EST EMBL:D74 716 
comes from this gene; cDNA EST yk456bl2.5 
comes from this gene; cDNA EST EMBL:T00892 
comes from this gene 

393 

26 . 519 

6473 

M32317 

Homo sapiens HLA protein allele B7 

2284 

97.245 

6474 

X07384 

Homo sapiens GLI protein (AA 1-1106) 

7827 

99 . 910 

6475 

AJ000327 

Homo sapiens adreno leukodystrophy related 
protein 

4340 

99 . 112 

6476 

AL031228 

Homo sapiens dJ1033B10.10 (membrane protein 
with histidine rich charge clusters (HKE4, 
RING5) ) 

544 

38.973 

6477 

Y11395 

Homo sapiens seventransmembrane -domain protein 

2638 

99.250 
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6478 

L24444 

Homo sapiens DNA repair protein 

175 

35.294 




/ j 


6480 

AF062655 

Mus raus cuius plenty- of -prolines -101 ; POP101; 

On J U11JL XU JUiULCXU 

5466 

92.705 

6481 

/>17 W ^> X f o 

W X j L. LUl ay Uo L-U11XV^U J. Uo CXUll^CtLXUll X. C* \— I— V_/ J- J_ 


QQ 7R4 

j 27 . / O *± 

o *± o ^ 

L10244 

Mn «3 miie*/-*iil lid qr>p T~m l Hi / qnp vmi n p» Ml _ 

1 IllD ULLXO . LXX Ua £3 tr IIIXUXIIC / OJJCX IllXliC INX 

acetvlhransf Gi*as6 


*± f . U J -7 

6483 

AL03122 8 

Homo saoiens dJ1033B10 3 (GalT3 (beta3- 
Galactosyl transferase ) ) 

2521 

99 . 2 08 

6484 

AC003965 

Homo sapiens SP001LA 

456 

33 . 740 

6485 

AF151B21 

Homo saniens CGT-63 nrotein 

2425 

99 . 464 

6486 

Z24680 

*J A* T w v V 

Homo saoiens aaro 

4207 

100 000 

JL \J v/ * x/ \J V/ 

6487 

AL117429 

Homo sapiens hypothetical protein 

1320 

100.000 

6488 

U73585 

duo Uu ux uo r aiiuuux uiiciuxu H-^ uuju ui.uucxn 

2586 

73 .488 

6489 

AFT RR1 OS 

Homo qani on o t->-i i f £4 t" i \re* 71 nr f* "i nnpy T~>Y*ot"* "i tj 1NTY — 
RFNF-34 anfi apn 

Xv xj x^t J j c* iiL.xy ^ x x 

4 04 

i no noo 

X U u . \J \J \J 

6490 

AF07816 5 

Homo saDiens conduct i n 

1 -A > — ' * I I w (JCluX&llU ^ wll\XUw UXiA 

5558 

98.935 

6491 

AF038844 

Homo <=? a 0 1 pn q MKP-1 like* Dfofpin hvrnq i tip* 
phosphatase 

205 

56 . 061 

6492 

U69127 

Homo sapiens FUSE binding protein 3 

3989 

99.497 

6493 

U79304 

Homo s ap i ens unknown 

2190 

96 . 953 

6494 

X17094 

Homo sapiens furin (AA 1*794) 

5413 

99 . 874 

6495 

AF100745 

Homo sapiens PTD016 protein 

1286 

99.497 

6496 

AF073344 

Homo sapiens ubicjuit in- specif ic protease 3 

3533 

99 . 616 

6497 

AB007918 

Homo saoiens KIAA0449 Drotein 

4 931 

99 . 735 

6498 

X68561 

Homo sapiens SPR-1 

5000 

99.745 

6499 

AF056116 

Fugu rubripes unknown 

1934 

66 .596 

6500 

AB014547 

Homo sapiens KIAA0647 protein 

7086 

99 . 902 

6501 

AB002360 

Homo sapiens KIAA0362 

7335 

99.278 

/•CAT 

Aol /o o 

Homo sapiens ICT1 protein 

i Ton 
1J20 

yy . iM4 

C C A "3 

7\T mTDTQ 

Caenorhabdi t is elegans similar to HECT-domain 

V UDiqUl L. Ill - CI7clIx£>JL cxdoc / . , to 1 O UUXU . D 
V_LJlUt-i3 X x UIU UilXo y cllU ; \^UV4r\ XLOX. y JV J J X XJ _> . _> ^.UIIICO 

from this oene - cDNA EST vkl96dl0 5 comes from 
this gene 

/JUUb 

Jo . 5 y© 

6504 

AB014593 

Homo sapiens KIAA0693 protein 

2598 

96 . 543 

6505 

S70011 

Rattus sp . tricarboxylate carrier 

1156 

55.732 

6506 

Z99709 

Caenorhabdi t is elegans C47B2.2b 

670 

51 . 961 

6507 

X97065 

Homo sapiens Sec23 protein 

5155 

99.739 ! 

6508 

AF107834 

Homo sapiens aminopeptidase 

2968 

98 . 943 

6509 

L38709 

Mesocricetus auratus orf 

1847 

71 . 968 

6510 

AB018318 

Homo sapiens KIAA0775 protein 

191 

26 . 935 

6511 

D12816 

Bos taurus actin2 

2597 

90.670 

6512 

AF072733 

Homo sapiens putative secreted protein 

1550 

80 . 831 

6513 

AL080141 

Homo sapiens hypothetical protein 

6209 

99 . 891 

6514 

L29554 

Rattus norvegicus alpha 2 , 6-sialyltransf erase 

745 

42 .629 

6515 

AJ011654 

Homo sapiens triple LIM domain protein 

4277 

98.703 

6516 

X03528 

Homo sapiens lambda L- chain C region 

588 

87.736 

6517 

AF151819 

Homo sapiens CGI -61 protein 

2063 

94 . 022 

6518 

AB011148 

Homo sapiens KIAA0576 protein 

7231 

99.349 1 

6519 

K02882 

Homo sapiens immunoglobulin delta-chain 

2480 

98.695 

6520 

U88567 

Mus musculus secreted frizzled related protein 
sFRP-2 

1911 

97.627 ; 

6521 

X07979 

Homo sapiens integrin beta 1 subunit precursor 

5368 

97 . 378 

6522 

L76200 

Homo sapiens guanylate kinase 

1202 

95 . 610 
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KtJ \J X / J .3 ^ 

TLl^mr^ aani pna Zin rriya / Tt^I 1 - rol ~^ ^ 1^ t n a e « 7 
nun IU oapicilo i-i. UL UL a / IjJil icXaUcU JvlUaSc •-> 

7 

X 

J J . ODD 

6524 

AB007902 

Homo sapiens HH0712 cDNA clone for KIAA0442 

I let o d. 3 / *± ~* lllacl d L» pUol LIUil -L *± / *x *J X Lilc 

Sf^fTii^Ticp of KTAA0442 

8176 

99.659 

6525 

X59727 

Homo sapiens 6 3kDa protein kinase 

3602 

98.925 

6526 

Y00291 

Homo sapiens hap protein 

2971 

99.777 

D D Z / 


nomo sapiens vafacuidi KdD-brtr/ iDL-containiny 

y\y<At" Din 
v-' *— ti Xll 

r q 7 n 

J Zf . ODD 

^9 ft 


T-T/*">mr"v cj^st^t ones T .XT l^ r P T nrppiiToriT 

/ J W X 

Q O il "7 C 

6529 

A16794 

WfiTYif* oani pnn r^TYKTZX 7 ci<"7l afpH "F<"7V t~ Vi "i a r»rnt*Pi n 
iiQi na Pi mnnnrl nnal anf i Viorl'v Hi vprfpH acta i T7 «5t* 

the p27k prosomal protein 

7 74 n 

7 nn nnn 

luU . u u u 

6530 

X78873 

Homo sapiens inhibitor 2 

1304 

99 . 512 

6531 

AJ005895 

Homo sapiens protein translocase 

1033 

97 . 110 

6532 

AJ224819 

Homo sapiens tumor suppressor 

2561 

97 . 066 

6533 

D29958 

Homo sapiens KIAA0116 

1252 

88 . 255 

6534 

AB011154 

Homo sapiens KIAA0582 protein 

3020 

99.782 

6535 

X51985 

Homo saoiens LAG- 3 orotein orecursor 

3603 

99.619 

6536 

Y07595 

Homo sapiens transcription factor TFIIH 

2845 

98 . 4 91 

6537 

AB002631 

Homo sapiens collectin 34 

933 

50 . 598 

6538 

AC007017 

ArabidoDsis thaliana Diitative rwa h^licast* A 

2366 

42 . 857 

6539 

AF052052 

Homo sapiens unknown 

812 

82.209 

6540 

AJ245738 

Homo sapiens claudin-15 

1447 

98.673 

6541 

AF159092 

Homo sapiens syld709613 protein 

2333 

96.226 

6542 

AL109652 

Schizosaccharomyces pombe hypothetical protein 

773 

44 . 884 

D Oft J 


Rattus norvegicus calpain 

O "7 O C 

o c one 
OO . ODD 


M-Z -5 liJ / 

riomo sapiens □lLterenLiaLion anciyen 

lion 

bu.iy / 

CCA C 
O Dfi D 

f\Xj\J Z Z J Z7 0 

fioiuu sapiens cu 4 * j^ui^ . o \novei ruiAiivb 

p JL v.? L. t2 X 1 1 oXlillldl L.LJ XlJLi\Jj7ll». ycdbL Ilypt/y Xlti L, 1 LdX 

84 kD protein from SGA1-KTR7 intergenic 
region) 

4 y lb 

9 9.735 

6 54 6 

YTQQQO 
A. ' 700a 

XxL^( I \\J t> ctJLJ -L cllo J. I {J 


n nn r\c\r\ 
1UU . UUU 

6 54 7 

T,nR97 7 

IjU O j£ J / 

nLJUlOJ tjciJL/ JL. Clio iC^dL-tSLJl d I, V^//AJLJLil 

7 Q7 

1 i7 1 

AC Qfl7 

6 54 8 

AFT ft771 ft 

iiUl I \\J oaJJiCllo XT JJvJA [JX U LC Xll C *JJ\.^ 

X 27 3 \J 

QQ QQ£ 

6549 

AF036702 

Caenorhabditis elegans contains weak 

0 xiiix xcix x Ly L- nx v tr x / met v_ x. Xv\. Lyx. ulcxu 
(GB • L35970) 

285 

26.335 

6550 

AB018290 

Homo sapiens KIAA074 7 piroteiri 

7076 

99 . 907 

6551 

AL050369 

Homo saoiens hvnotlif* t i cal nrot* Pin 

3085 

-7 O . L/ O X 

6552 

AC005917 

Arabidopsis thaliana putative WD -40 repeat 
protein 

1145 

42 . 4 72 

6553 

D26311 

Gallus gallus unknown protein 

1318 

58 . 53 7 

6554 

AF091083 

Homo sapiens unknown 

1871 

99 . 650 

6555 

M29581 

Homo sapiens zinc finger protein 8 (ZFP8) 

3533 

96 . 324 

6556 

AL096879 

Homo sapiens hypothetical protein 

616 

37.631 

6557 

AF139682 

Homo sapiens putative N6-DNA- 
methyltransf erase ; N6AMT1 

1383 

100.000 

6558 

X16135 

Homo sapiens L protein (AA 1-558) 

1847 

56 . 098 


n.V XjO / / / 

nuillU odpXcllb HOD ~X {JIvJLC Xll 

Z, Z ft o 

Q Q 7 7 n 

6560 

Z19555 

Caenorhabditis elegans cDNA EST yk425a6.3 
comes from this gene; cDNA EST yk406e6.3 comes 
from this gene; cDNA EST yk425a6.5 comes from 
this gene; cDNA EST yk480c6.5 comes from this 
gene; cDNA EST yk406e6.5 comes from this gene 

347 

25 . 818 

6561 

AL032623 

Unknown cDNA EST yk331el2.5 comes from this 

200 

35.338 


VA2 


JL O O O Mhfi, «+3 . O H S £> O H 




gene; cDNA EST EMBL:D69131 comes from this 
gene ; cDNA 



c c c o 

At 06 y y o4 

Mus musculus unknown 

2305 

84 .235 

6563 

AB011139 

Homo sapiens KIAA056 7 protein 

6357 

99.183 

6564 

AF003151 

Caenorhabdit is elegans No definition line 
found 

481 

28 . 231 

6565 

X55684 

Lycopersicon esculentum extensin (class I) 

98 

32 . 143 

6566 

AB028968 

Homo sapiens KIAA104 5 protein 

2622 

96 .471 

6567 

D63484 

Homo sapiens The KIAA0150 gene product is 
novel . 

6425 

100.000 

6568 

Z82215 

Homo sapiens dJ6802.2 
. . 

1232 
4 

99 . 796 

6569 

Y13492 

Homo sapiens smoothelin-B 

5831 

99 . 346 

6570 

AL096768 

Homo sapiens dJ858B16.2 (novel protein similar 
to hamster PSSC ( Phosphatidyl serine 
Decarboxylase Proenzyme, EC 4.1.1.65) 

2574 

99 . 733 

D3 / l 


Homo sapiens HSPC038 protein 

466 

89 . 412 

6572 

AF132964 

Homo sapiens CGI- 30 protein 

1764 

94 . 94 9 

6573 

AB029012 

Homo sapiens KIAA1089 protein 

6572 

99 . 899 

6574 

X15218 

Homo sapiens ski protein (AA 1 - 72 8) 

4677 

100 . 000 

6575 

AL035461 

Homo sapiens dJ967N21.6 (novel CDP-alcohol 
phosphatidyltransf erase family member protein) 

1936 

99 . 007 

6576 

U4 5998 

Onchocerca volvulus mitochondrial solute 
carrier 

705 

47 . 489 

6577 

AF091080 

Homo sapiens unknown 

2120 

100.000 

6578 

Y082 00 

Homo sapiens rab geranylgeranyl transferase 

3709 

98.604 

6579 

AF151868 

Homo sapiens CGI-110 protein 

813 

100.000 

6580 

X74142 

Homo sapiens transcription factor 

3301 

99 . 161 

6581 

AJ132545 

Homo sapiens protein kinase 

3757 

99 . 820 

6582 

M60165 

Homo sapiens guanine nucleotide-binding 
regulatory protein 1 

2297 

99.435 

6583 

AF151522 

Homo sapiens hairy and enhancer of split 
related- l 

1989 

100.000 

6 584 

AJ2 4 346 0 

Leishmania major proteophosphoglycan 

182 

25 . 909 

6585 

AL117664 

Homo sapiens hypothetical protein 

889 

49.508 

6586 

AF106685 

Homo sapiens myelin gene expression factor 2 

3579 

97.441 

6587 

AF124490 

Homo sapiens ARF GTPase-activating protein 

CjtITI 

4971 

99 . 081 

c a o o 
b bo o 

AC UU41J1 

Homo sapiens Unknown gene product 

2 567 

98 . 020 

6589 

L77967 

Ovis aries small proline-rich protein with 
paired repeat 

111 

46.667 

6590 

AB011129 

Homo sapiens KIAA0557 protein 

3444 

100.000 

r c Q i 

ABOlo 3 10 

Homo sapiens KIAA0767 protein 

3775 

100 . 000 

6 5 92 

AB01o33o 

Homo sapiens KIAA0795 protein 

2834 

94 . 872 

6593 

AF079098 

Homo sapiens arginine- tRNA-protein transferase 
1-lp; Al£»l-lp 

3506 

99.611 

C C Q A 

au oiuny 

Homo sapiens Ribosomal protein kinase B (RSK- 
& ) 

4 956 

r\ O A A O 

98 . 44 8 

fT C Q IT 

T?n 1 7T77 

Drosophila melanogaster helicase 

1 C "> A 

lb JU 

A Q A 'y 1 

\J J -7 O 

U96 724 

phosphatase type II 

^ O *x O 

*7 A 14 1 ; 

6597 

U71363 

Homo sapiens zinc finger protein zfp6 

3057 

98.144 

6598 

Z35227 

Homo sapiens small G protein 

1247 

100.000 

6599 

AJ133115 

Homo sapiens TSC-22-like protein 

2583 

97 . 727 

6600 

AC004798 

Homo sapiens R31546 1 

4899 

99.863 


8H1 
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6601 

AJ005559 

Mus musculus SPR2A protein 

158 

41.333 

6602 

X83572 

Homo sapiens ARSD 

4 046 

99 . 157 

6603 

AF121859 

Homo sapiens sorting nexin 9 

3 985 


6604 

AF102166 

Homo sapiens intracellular chloride channel 
CLIC3 

1115 

91 . 905 

6605 

X68277 

Homo sapiens protein- tyrosine phosphatase 

z z y z 

y / . 5oo 

6606 

AL117491 

Homo sapiens hypothetical protein 


inn nnn 
1UU . uuu 

6607 

AL117634 

Homo sapiens hypothetical protein 


inn nnn 

6608 

X74226 

Rattus norvegicus putative 

A A 1 C 

OD . JoU 

6609 

U71601 

Homo sapiens zinc finger protein zfp47 


Q*7 CQQ 

y / . !>yu 

6610 

X63368 

Homo sapiens HSJlb 

2325 

99.715 

6611 

X90530 

Homo sapiens ragB 

*"> A O C 

Q Q VI /T C 

99 . 4b b 

6612 

AB020724 

Homo sapiens KIAA0917 protein 

3 8 02 

95 . 15b 

6613 

Z29067 

Homo sapiens protein kinase 

3051 

99. 564 

6614 

AB011135 

Homo sapiens KIAA0563_protein 

5824 

99 .310 

6615 j 

AF061025 

Homo sapiens leucine zipper-EF-hand containing 
transmembrane protein 1 

4406 

100 . 000 

6616 

L00352 

Homo sapiens low density lipoprotein receptor 

6091 

99 . /b / 

6617 

AF042831 

Homo sapiens f orkhead-related transcription 
factor FREAC-10 

ne 

100.000 

6618 

X65873 

Homo sapiens kinesin heavy chain 

6033 

99 . 274 

6619 

M62362 

Mus musculus CCAAT/ enhancer binding protein 

2 3 82 

91 .4 79 

6620 

Y14488 

Homo sapiens 14 kDa protein 

831 

96.124 | 

6621 

S58722 

Homo sapiens X- linked retinopathy protein {C- 
termirial, clone XEH.8c} 

221 

65 . 000 

6622 

AJ224997 

Rattus norvegicus huntingtin 

87 

48 .276 

6623 

AB029016 

Homo sapiens KIAA1093 protein 

8451 

99 . 668 

6624 

AB027013 

Homo sapiens Nucleosome Assembly Protein 1- 
like 2 

3028 

99.565 

6625 

X98258 

Homo sapiens M-phase phosphoprotein 9 

1112 

Q O A C *7 

99.45 / 

6626 

U97191 

Caenorhabditis elegans strong similarity to 
the YPT1 sub- family of RAS proteins 

1199 

Q C IOC 

85 . lob 

6627 

AB006624 

Homo sapiens KIAA0286 

O P A 1 

qq n c *7 

6628 

AF151805 

Homo sapiens CGI -4 7 protein 

ZD Z Z> 

inn nnn 

J.UU . uuu 

6629 

AF106858 

Homo sapiens G-protein-coupled receptor 

A C C Q 

qq n *a a 

6630 

X76057 

Homo sapiens phosphomannose isomerase 


QQ ^ *? "? 

6631 

AL049746 

Arabidopsis thaliana putative protein 

Zo 1 

1 Q "7"? Q 

6632 

X97064 

Homo sapiens Sec23 protein 

5093 

99.869 

6633 

L09634 

Caenorhabditis elegans putative 

"5 A C 

Z D . _7 / f* 

6634 

AJ005642 

Rattus rattus serine protease 



6635 

AL079310 

Homo sapiens hypothetical protein 


Ql£ 7CQ 

yo . /do 

6636 

X79563 

Homo sapiens 8.2 kDa differentiation factor 

c. c\ a 
b Uo 

yt> . yu / 

6637 

Z66520 

Caenorhabditis elegans similar to ERG- 3 like 
protein 

•7AQ 

/uy 

1 ft Q*71 

j o . y / 1 

6638 

M19529 

Sus scrofa follistatin A 

2420 

98.246 

6639 

AF060154 

Homo sapiens activated B-cell factor- 1 


QQ R41 

6640 

U79260 

Homo sapiens unknown 

O Q A 

CO A "I 
DO . 410 

6641 

AJ249457 

Trichomonas vaginalis centrin, putative 

193 

29 . 787 

6642 

AF078850 

Homo sapiens steroid dehydrogenase homolog 

i9b / 

Q Q C 1 Q 

y y . b / y 

664 J 



1118 

4 

99 .416 

6644 

AC004410 

Homo sapiens fos3 9554 1 

2699 

100.000 

6645 

AL035541 

Homo sapiens dJ718J7 . 1 (PUTATIVE novel protein 
similar to Tr:015168) 

1407 

100.000 

6646 

AC004537 

Homo sapiens similar to tumor suppressor 

2611 

99.265 
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p33INGl; similar to AF044076 (PID : g2829208 ) 



6647 

X03528 

Homo sapiens lambda L- chain C region 

588 

87.736 

6648 

D12768 

Rattus norvegicus Spl 

4713 

93.932 

6649 

AF132944 

Homo sapiens CGI -10 protein 

2956 

98 .718 

6650 

AB011136 

Homo sapiens KIAA0564 protein 

9414 

99.584 

6651 

AB011138 

Homo sapiens KIAA0566 protein 

7537 

99.312 

6652 

Z92825 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk315el2.3 comes from 
this gene; cDNA EST yk315el2.5 comes from this 
gene; cDNA EST yk605bl2.3 comes from this gene 

655 

43 .359 

6653 

AB010491 

Homo sapiens natriuretic peptide A type 
receptor 

7000 

100.000 

6654 

D79205 

Homo sapiens ribosomal protein L39 

238 

76.471 

6655 

D83782 

Homo sapiens the KIAA0199 gene is expressed 
ubiquitously.; the KIAA0199 protein shows 
similarity to sea urchin 

hydroxymethylglutalyl-CoA reductase, and 
retains 8 hydrophobic domains . 

8679 

100.000 

6656 

AF016903 

Homo sapiens agrin precursor 

1433 
0 

99.557 

6657 

Z22181 

Unknown similar to PH (pleckstrin homology) 
domain; cDNA EST EMBL:C074 93 comes from this 
gene ; cDN 

747 

50 . 000 

6658 

X61045 

Hydra sp . mini-collagen 

200 

36 . 036 

6659 

AF019926 

Mus musculus protein kinase 

2099 

89 . 944 

6660 

AB020715 

Homo sapiens KIAA0908 protein 

3301 

99 . 802 

6661 

AF118838 

Homo sapiens citrin; adult -onset type II 
citrullinemia protein 

4359 

100.000 

6662 

U30831 

Rattus norvegicus B/K protein 

3068 

96 .835 

6663 

M26312 

Oryctolagus cuniculus unknown protein 

162 

42 .466 

6664 

L32162 

Homo sapiens transcription factor 

1758 

99.265 

6665 

X02585 

Xenopus laevis unidentified open reading frame 
2 

350 

62 . 766 

6666 

D87930 

Homo sapiens myosin phosphatase target subunit 
1 

6441 

99 . 903 

6667 

AF036717 

Homo sapiens FGFR signalling adaptor SNT-1 

3373 

99 . 803 

6668 

AL.050060 

Homo sapiens hypothetical protein 

2185 

99 . 692 

6669 

U07151 

Homo sapiens ARL3 

230 

32 . 090 

6670 

AL110245 

Homo sapiens hypothetical protein 

931 

64 . 259 

6671 

Z72497 

Gallus gallus CEPU-l 

1817 

78 . 632 

6672 

AB014590 

Homo sapiens KIAA06 90 protein 

7783 

99 . 013 

6673 

AF041377 

Mus musculus cell death activator CIDE-B 

1214 

84 . 793 

6674 

X79237 

Mustela vison ribosomal protein S26 

128 

57 . 500 

6675 

AB007914 

Homo sapiens KIAA044 5 protein 

7847 

98 .477 

6676 

X03528 

Homo sapiens lambda L- chain C region 

588 

87.736 

6677 

AF002223 

Homo sapiens myotubularin related 1 

4423 

99 . 698 

6678 

Y10812 

Homo sapiens fructose- 1 , 6 -bisphosphatase 

2073 

100 . 000 

6679 

AF055000 

Homo sapiens unknown 

2557 

98 . 297 

6680 

AF165217 

Homo sapiens tropomodulin 

2229 

99 . 710 

a ez a i 

T -JOI CO 

i_i J zloz 

xiomo sapiens transcription lacLor 

coo 

Zj J 

ZJ r± - O O _L 

6682 

AF135028 

Homo sapiens kallikrein-like protein 2 KLK-L2 

2010 

98.639 

6683 

AL031541 

Streptomyces coelicolor putative dehydrogenase 

481 

37.308 

6684 

AF084256 

Homo sapiens beta glucuronidase isoform d 

182 

68.182 

6685 

U38934 

Gallus gallus histone H2A 

758 

98 .400 

6686 

AF065215 

Homo sapiens cytosolic phospholipase A2 beta 

6677 

99.306 
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6687 

U76374 

Mi i q mi i cs rM il lie g1ctti-ROP9 

/ O 3 

ftOR 

3 X . O \J O 

6688 

X71623 

Homo sapiens zinc finger 

4073 

99.825 

0003 

Vfi ft "*fi7 

Mi i q mi i ca i"»i lino nr*t* — 1 A t~i tt^T" pi n 


ft ft d79 

66 90 


Womo qam' prtQ Qimn'TPCiqnT* O'F CZ'? al 1 pi nf 7 oVnl 
nuuiu oapxciio oupjjicooui u a. z> axxcxc vjl ojvux 

homo log 

,__ X D *± 

9Q 701 

O O _7 X 



Z VJ X 3 

i fin nnn 

66 92 

U U *± D O 

ivaLLUb iiui v t.y iuud -j iiyuLUAyj.DUJJUi_yi.aLt. 

dehydrogenase 

9n^"^ 

Z U J j 


fifi 9** 

"*9 

Womo aani ona VTZVA0977 nrnt"Din 

IIUI HU o ci JL^> XvJllo l\J.nnv _7 -£ UJ. ULcXll 

ci en 

3 X D U 

Qft 1 rm 


Aul O 3 3 

riu 1 1 iu Ucipiciio l_ y yJKZ xxx ui ucu J. lay en \ del o _7 ^ 
1023 ) 

119 

X X _7 

3 / . iyu 

6695 

Y15521 

Homo sapiens start position 1 

4126 

99.524 

6696 

AL117496 

Homo sapiens hypothetical protein 

1090 
i 

X 

99. 046 

6697 

YTC tr "I Q 
A / O 3 3 O 

T-Tnmn cjsit^t ones ViMr"i'\7'1 7 
n kj 1 i iu octpiciio i ii ijLy v x / 

1 1 Tj 

9 7 O O 7 

6698 

API "^1 ft 7 

nui i iu oauit-iio umviiuwii 


7 7 ^ 7 

6699 

AL049176 

Homo sapiens dA141H5.1 (C-terminal part of a 

L.11UJ.U111 XjXIVCj Ul ULclll W1U11 v Ull nil ICUl ex 11 IX 

f art"nr 1~ vdp f~* rlnma l nQ^ 

1786 

100.000 

6700 

AJ133421 

i iU J. J J I Z. 1 

Homo Qani phq v^Piml £*t* OT'Ot* #^ i n Qnrhintf 

3 543 

99 £4 9 

6701 

AF125807 

A, JL ^« —J u V-/ / 

Homo qani (^ti c? TlMA ^ * — 1c H n^^p / ~K ' — oViocsoVipi t~ p\ *5f> 

3468 

99 . 616 

6702 

AC004923 

Homo sapiens similar to UNC-93; similar to 
U89424 (PID:g3642687) 

2082 

97.619 

o / *j 3 

API m Rf;7 

nUllIU octpicllo v_. v_» X X W _7 L/lvJLcJ.11 

i i 

X X D D 

Z7*± . O \JO 

704 

J-\C \J VJ J O £i *i 

nui i iu oc&Ly x ciiti i\ir j iciaLcu uiului piULcin 

1 Cil ft 
X 3 X O 

qq i cr\ 

6705 

X92475 

Homo sapiens ITBA1 

1761 

98.855 

D / wD 

7V T nil 71 £T 

riomo sapiens cjdud^.i \jfui>\ixvii novel protein 
siniiidi CO preaicceci Dciccenax ana worm 

T^T~ot" pi nQ \ 

llii 

O 1 -l /» o 

o7 . 14 J 

6707 

AF07770 C ; 

xtuiiiu oapxciio no c u x _7 

ft i 

O X D 

i no nnn 

6708 

U79745 

Worn*"* cr ^ n anq mnnnpa T*hnwl at-p f- T*on c3T^/~v^f- 

JiUUlU oapiCllo lUUllUUd J. JJUAjr XCl LC I— JL ClllopUl I— X 

homoloaup MCT6 

3 O 3 

99 .17 £ 

Z7 27 . 4 Z O 

6709 

AJ010482 

Homo sapiens Myopodin protein 

5470 

100.000 

6710 


T-Tomo qani c? Vrv/nnt - hpf i 1 t~>T"ot~<=*"i ti 

ilUIIIU OCipiLlID liy UUL11C.I, IVwClX UlULClll 

1061 

81 . 905 

6711 

Y16 752 

nuuiu oauiciiD oc^v-r j. ciy uy 111 

1786 

98 . 913 

6712 

AB00467R 

Momo cani VivH ynnvr "i m l Hi n^^^ 
xiuitiu ci _i_ ^3 no Liniy ui uwy x lununiaijc 

1990 

58 382 

6713 

X51416 

Homo cjadi pnci hormnnp rprpttfrjr hRT?l?l ( AA 1 — t ^t>l ) 

3447 

99 . 04 0 

6714 

D50911 

Homo sapiens The KIAA0121 gene product is 
novel . 

1365 

98.571 

6715 

X99459 

Homo sapiens sigma 3 protein 

1243 

100 000 

JL. V* w • v/ V V 

6716 

AJ006591 

Homo sapiens cvsteine-rich nroteiii 

2295 

100 000 

JL. w • v yj \s 

6717 

AL080155 

Homo sapiens hypothetical protein 

2563 

94 . 771 

6718 

Z97653 

Homo sapiens C380A1.2.1 (novel protein 
(isoform 1) ) 

644 

45.324 

6719 

X54232 

Homo sapiens glypican 

3531 

98 . 571 

6720 

U17989 

Homo sapiens GS2NA 

2343 

54.331 

6721 

AF176642 

Homo sapiens ubicruitin- specific protease ISG43 

2499 

99 . 464 

6722 

AF117336 

Aolvsia californica mollusk-derived crrowth 
factor; MDGF 

122 9 

38.833 

6723 

AL050003 

Homo sapiens hypothetical protein 

1970 

100 . 000 

6724 

AJ223324 

Homo sapiens MAX. 3 cell surface antigen 

414 

32 . 797 

6725 

A08695 

Homo sapiens rap2 

943 

93 .478 

6726 

Z12173 

Homo sapiens N-acetylglucosamine- 6 -sulphatase 

3750 

99. 819 

6727 

AF151814 

Homo sapiens CGI -56 protein 

703 

42 .405 

6728 

M37030 

Mus musculus ORF 

2258 

94 . 823 
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6729 

D13744 

Holotrichia diomphalia holotricin 3 

108 

59.375 

6730 

AB007930 

Homo sapiens KIAA0461 perotein 

8939 

96 .028 

6731 

U32575 

Rattus norvegicus similar to yeast Sec6p, 
Swiss-Prot Accession Number P32844; similar to 
mammalian B94, Swiss-Prot Accession Number 
Q0316 9; Metnoa : conceptual translation 
supplied by author 

4586 

94 . 430 

6732 

D38491 

Homo sapiens maauii / 

1018 

78 . 829 

6733 

AJ012590 

Homo sapiens glucose l - aenyarog enase 

5158 

98 . 235 

6734 

Z83 86 9 

KattUS norvegicus serine / lux.c:uiixiic jvinaoc 

4210 

89 . 529 

6735 

X74331 

Homo sapiens una primase ip^o aujjuiiit/ 

3328 

98.821 

6736 

X68362 

Mus musculus Oct-IA protein 

3 93 8 

88 . 472 

6737 

AL080133 

Homo sapiens nypo txict lLdi jjiutcm 

7089 

99 . 543 

6738 

M84725 

Rattus norvegicus neuronal protein 

1386 

96.347 

6739 

X75316 

Mus musculus SEB4 

971 

64 . 591 

6740 

AF151833 

Homo sapiens CGI- 75 protein 



6741 

A58331 

Homo sapiens unnamed protein product 


ZJ O . Zj .7 

6742 

AF153686 

Homo sapiens calcium Dinaing protein precursor i 


97 .63 8 

6743 

AL008583 

Homo sapiens aJ327Jl6 .2 inuman ortnoiog or rat 
Neuronal Pentraxin Receptor) 

o q c 

J Z J? D 

q q con 

6744 

AF151888 

Homo sapiens CGI -13 0 protein 

1 o 

81 . 250 

6745 

AB017112 

Mus musculus mCAC 


l ft r 

6746 

AL080121 

Homo sapiens hypothetical protein 

J. _> / J 

i on noo 

6747 

X04434 

Homo sapiens IGF- I receptor 

9297 

99.854 

6748 

AJ224538 

Homo sapiens AMP-activated protein kinase beta 
2 subunit 

lolz 

QQ "> 

6749 

AF151829 

Homo sapiens CGI -71 protein 


q q CQi 

-7 i7 . D31 

6750 

M17466 

Homo sapiens coagulation factor XII 

4464 

100.000 

6751 

AF106473 

Mus musculus leucine-rich-domain inter-acting 
protein 1; LeR inter-acting protein 1; LEAP1 


/ U . / 1*1 

6752 

M25826 

Kluyveromyces lactis actin 

-| pQI 

81.432 

6753 

U92819 

Homo sapiens unnamed HERV-H protein 

*? j1 

O ^ . Zj 

6754 

AF061262 

Mus musculus semaF cytoplasmic domain 
associated protein 2 

1644 

83 .387 

6755 

AF108831 

Homo sapiens K:C1 cotransporter 3 

7183 

99 . 909 

6756 

AF151806 

Homo sapiens CGI -48 protein 


^ ZJ . *± ^. -3 

6757 

AJ004832 

Homo sapiens neuropathy target esterase 

8691 

99.623 

6758 

AF097439 

Mus musculus brain expressed X- linked protein 
2 

O Z J7 

/J. . J io 

6759 

AJ005562 

Mus musculus SPR2D protein 

99 

35.556 

6760 

AF132600 

Homo sapiens B- lymphocyte stimulator 

i ft 9 n 

J. o ^ / 

i nn noo 

6761 

AL079292 

Homo sapiens hypothetical protein, similar to 

/ -j\ /-i r\ r\ —r r\ ~t *7\ w-v-i .b^t. ^ t . a d XT 7\ Viol i paoo A 

(AC00701 / ) putative kjma neiicase j\ 

J 1 ! 

QQ 752 

Z7 Z7 - 1 ~J 

6762 

X07876 

Homo sapiens irp protein \aa i-jouj 

2511 

99 . 446 

6763 

D14592 

KattUS nOrvegiCUS rlMr Klllaoc JVUlaSc iciaLcu 

protein 

2462 

93 . 216 

6764 

A65892 

unidentified MURINE PSA- 99 

5941 

97 . 177 

6765 

Z22819 

Mus musculus Rab24 protein 

1232 

85.470 

6766 

U79260 

Homo sapiens unknown 

'X ft A 

71 9fi 
/ J . o z. o 

G. "7 £ 7 


Arabidopsis thaliana putative protein 

1383 

58.127 

6768 

Z97029 

Homo sapiens ribonuclease HI large subunit 

1798 

98.333 

6769 

AC006042 

Homo sapiens supported by human ESTs 

AI681256 . 1 (NID:g489143 8) ,N32168 . 1 (NID :gll52567 

) , and genscan 

854 

45 .278 

6770 

A1.033514 

Caenorhabditis elegans predicted using 

991 

53 .160 


$53 
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Gene finder; cDNA EST EMBL:D71127 comes from 
this gene; cDNA EST EMBL:D73 731 comes from 
cms gene ; cdjn/* boi yKo^ /cj . j comes irom tins 
gene; cDNA EST yk645b5.3 comes from this gene 



6771 

A6334 0 

unidentified unnamed protein product 

Z. o £ :i 

o o no i 

6772 

X67325 

Homo sapiens p2 7 

415 

87 .097 

6773 

AF13 3 911 

Mus musculus ARL-6 interacting protem-4 

O HA 

-3UU 

7 9 . o o o 

6774 

AF010258 

Homo sapiens homeodomain protein 

1 OCT 

lob z 


6775 

A3 7078 

Homo sapiens nucjuc.oi xdj& wuh uuKKkbPuwiJiisivj 
PROTEIN 

271 

81 . 132 

6776 

AF003145 

Caenorhabdi t is elegans No definition line 
found 



6111 

AJ005564 

Mus musculus SPR2F protein 

103 

34 .694 

6778 

X85237 

Homo sapiens human splicing factor 

C O O "3 

1 Art rt A A 

lUU . uuu 

6779 

AL1174 96 

Homo sapiens hypothetical protein 

. — — — — 

T A Q "> 
X 

y y . u4o 

6 78 0 

T\T3 f\ f\1 A A *7 

AiJUU /44 / 

Homo sapiens Fln2 9 


Q Q (TCf 

6781 

X5342 7 

Kactus norvegicus glycogen syncnase Kinase .s 

ciXpilct V -"-rv -I- *±0 J / 


jo . 1 o o 

6782 

AC007055 

Homo sapiens unknown 

2053 

100.000 

6 783 

AF 184 "by 

Homo sapiens cytokine homolog CYT07 


ion rsrsn 

6784 

AF0 7 7 03 o 

Homo sapiens unc -50 related protein homolog 

1 TAC 

1 / UD 

Q"7 1 £ Q 

6785 

AC 0 0 4 J a 

riomo sapiens unKnown gene prociucc 

1 TOO 

O 1 . / I 

6786 

AF04 5584 

Homo sapiens PB3 9 

/TCI 

ia n t >i 
o U . 114 

6787 

AF173 3 8 0 

Mus musculus angiotensin II AT2 receptor- 
interacting protein 



6788 

X68596 

Homo sapiens parathyroid hormone receptor 

4020 

100.000 

6789 

U13021 

Homo sapiens ICH-1L 

2837 

95 .302 

6790 

AB014585 

Homo sapiens KIAA06 85 protein 

5753 

95.503 

6791 

M95178 

Homo sapiens alpha-actinin 

5377 

97 . 213 

6792 

U264 01 

Homo sapiens galactokinase 

2454 

97 . 970 

6793 

D89141 

Schizosaccharomyces pombe similar to 
Saccharomyces cerevisiae hypothetical 27.6KD 
protein in chromosome VII, SWISS-PROT 
Accession Number P46948 

564 

43 . 802 

6794 

AF151836 

Homo sapiens CGI- 77 protein 

1798 

100.000 

6795 

AF042800 

Homo sapiens suppressor of white apricot 
homolog 2 

4277 

99 . 54 5 

6796 

Z46973 

Homo sapiens phosphatidyl inositol 3 -kinase 

5722 

97.750 

6797 

AL.030996 

Homo sapiens auil89B24.4 (novel PUTATIVE 
protein similar to hypothetical proteins S. 
pomoe l^zi 1 j . ana. I**, eiegans Lio/ij . o / 

7299 

99 . 731 

6798 

AB015331 

Homo sapiens HRIHFB2 017 

1301 

99.539 

6799 

AF042752 

Gorilla gorilla cytochrome c oxidase subunit 
IV 



6800 

U18468 

Homo sapiens pregnancy- specif ic beta 1- 
glycoprotein 4 precursor 

ZD ^ t> 


6801 

Z14 954 

Homo sapiens codes for a 184 aminoacid peptide 
(BCM) 

TOT"! 

1UU . uuu 

f Qf|9 
OoUZ 

/-\Jr u ^/ j jt j 

nuiuu d x ciio i» ±. alio J- ui uixiiy auiuiu 1 — w -i — l c v_a luix 

containing protein 3 

5399 

99 . 404 

6803 

AF073920 

Homo sapiens regulator of G-protein signaling 
6 

3721 

98.239 

6804 

M99487 

Homo sapiens prostate- specific membrane 
antigen 

5005 

98 .800 


.i. o tj e .a. „ w a s? & o a 


6 805 

AF070663 

Homo sapiens HSPC0 0 7 

112 7 

99 .465 

6806 

U0 8377 

Homo sapiens similar to the Drosophila 
splicing regulator , suppressor-of — white— 

618 5 

99 . 579 

6807 

AB025905 

Homo sapiens neighbor of A- kinase anchoring 

piULClll 2? Z> 

4088 

98.609 

£ ft ft ft 

o-c -7L/0 / 

Wnmn eaT*ii on q T~»i~nt~ 01 n V" "i na no 
riLJniLJ odpxt.Uo jjx uut-xn JV,xxiaoc 

~> \J X 

qq cc^ 

1717 . 3b^ 

6809 

AL035295 \ 

Homo sapiens hypothetical protein 

751 

94 .215 

/■Qi n 
bolU 

/\r 1? / 56 Z 

Homo sapiens unknown 

1 ft Q *7 
IUj / 

. .so / 

boll 


xiomo sapxens similar co jsx/\/\v/yLJf* # similar to 
AAA58424 (PID : gl8 04 92 ) 

A Q7 1 

yy . 8oz 

6812 

AF073770 

Homo sapiens carnitine octanoyltransf erase 

3986 

99.668 

6 OlJ 

Af 1U b iby 

Homo sapiens actin- associated protein 

£hi*± / Kdpuin 


y b . o y l 

Cftl A 
O Ol4 

/\1 bob 

synuucLic conscrucc protein dntxyen 

1 ftft7 
x o o 

QO CA1 

bolb 

T1*l c *> 1 c\ 

Pseudomonas aeruginosa similar to E. coli 

pi ULcXIl rUlD/UlllA. OWlby rlUl. MCCcooXUXl XM LxllLMtJX 

XT 1 1 O J 

lift 
1JU 

/l C Q /I 

^ d . y ^ o 

6816 

.TT-LJ UiUO JO 

1 1 ^ 1 1 IW DCtpiCllO IN. JLrlrVv O ^ -/ pj. UUCXI1 

7599 

99 . 915 

O O X / 

71 7ft 

t_T/^\ty>/-\ cjrj>-\-" pno Piini" Hnmj^ "i n rnnt""ii ni nrr nyrifpi n 

nUlllvJ OdJkJXdlO KUI1L UvJlllClXil UUlll.ClXllXllVj JT -HI 

2668 

98 .7 95 

6 818 

A \J \J 

Hnmn c- ^> t-> H on <=? 1 -runViH^ Ti- pha i n C* y~f*ci "i on 

x i*>^ 1 1 iij octjk/xciio -i- c* ti ixyv^icx xj v^iici i. j. vii 

588 

87 . 736 

fiftl Q 

ziprjft7ftft 7 

rii dUXUUpoXa Llldx lctild HI 1 JV1 1UW1I piULCXil 

246 

23 . 58 9 

coon 

Y12781 

T-frMno a -a-n -J on cs f T*an csrliir--in f Ha t - *^^ 1 llrp l nrnf pin 

nui 1 l\J octpxciio X ailoUUV»XIl VL>C UCl / JL J- "vC JL pi-WUClll 

3607 

99 . 449 

£ ft 7 1 
DC3Z1 

/A I L# J U ^ ^ / 

nuuiu t>ci Ly iciib vav iJx ulc in 

J O ** -7 


fift77 

AO O O «7 17 

iJ'/^rn/^i ear>i on a DMA l-\-5 nrli nrr nrnhPi n 

nUIIIU oapXt-Ho XtX>i/-V LJXIiUXliy piULClll 

4687 

i on ooo 

£ ft 7 1 
D O ^ J 

vnnfi ift 

V \J \J D 3 O 

V*> a phpri r*i"p»Vi arto la rnVifl Tr~o *a <^ "i nrr f -ra mo pa 1 f) 
j-Jct \_, l \z x x ^jji iciy c j.oiiUm'vici i cauiiiy xxctiwt^ cctxu 

7 99 

99 . 180 

6824 

AE000406 

Escherichia coli putative DNA topoisomerase 

1219 

100.000 

c o o c 


riomo sapxens nuntiny tin lnteractiny protein 

iijj 

QQ A 1 c: 

bozb 

Ar UobbUo 

Kattus norvegicus neurestin ceta 


17 7 1 ft 
J Z . /JO 

6827 

M55593 

Homo sapiens type IV collagenase 

4683 

100.000 

6828 

D85884 

Homo sapiens glutamate transporter 

3642 

99.652 

682 9 

U4 0 97 8 

Homo sapiens DNA mismatch repair protein 
homo log 

4 779 

99 . 6 03 

6830 

D31765 

Homo sapiens KIAA0061 

6234 

100.000 

6 831 

ALiOOy 191 

unJcnown /prediction--, (metnoci : "generinaer" . 
version: '»»084 M ") / 

/predict ion= (method : " "genscan " " , ve 

4 56 

30 . 056 

C Q 1 O 

1\I71 r\ Q ""7 "I Q 

At iuy / i y 

Mus musculus unknown 

i ^ t n 
l^o U 

TC OCA 

/ 5 . 8 bU 

C O *1 o 

6 833 

U71075 

Homo sapiens protein tyrosine phosphatase 

v* o /"^ o ■***» r\ "f* /*\rrt "i /^v**/***r*» 
I cv-cpLUX OtnJLCxCJIl 

7DlD 

y / . oz4 

O O J ft 

AT ftQ^"7£R 

xiorno Sapiens ujoddoio . z \iiovcx piuucxn uxnixxax 

o hamctf pr P*-?*-!^ f PhiocfDhiat" *i rlvl qpri no 
Dprarboxvlasp Proenzyme EC 4 1 1 65) 

7 c:7 ft 

Qft 4 ft ft 

6835 

X57351 

Homo sapiens 1 - 8D 

819 

96 . 970 

6 ft 

TJ06617 

U V O O -J 

Hnmn qani on<3 nftft — 1 in 

3729 

99 . 307 

6837 

AC005594 

Homo sapiens R26 984 1 

3433 

98.641 

£ ft 7 ft 

O O J o 


Hnmn a a n i one f— ( ~i . C \ / /»■) c "l . «pk *l a \ -f- ■« lainn frono 
nv^iiii^/ oapxciio \ o t ~j i vvjyiio.-Lfpj**/ x, us xuu y cue 

1712 

98 .8 85 

fift 7 Q 

O O J -7 

77fi 7 1 7 

T-Tnmn eani on g ^ocmnnl o \ n 7 

7096 

99 . 015 


O / LOO? 

protein=H4 (D10S170) 

O O V 

qq tic 

6841 

AF044195 

Homo sapiens IkappaB kinase complex associated 
protein; IKAP 

8837 

99.775 

6842 

M62415 

Pseudopleuronectes americanus HPLC6 

114 

39.623 

6843 

X73113 

Homo sapiens fast MyBP-C 

7378 

98.953 

6844 

AF083249 

Homo sapiens Rb binding protein homolog 

4573 

98 .670 


.1 o o a <+6i ., o 2 h ns 


6845 

M64934 

Homo sapiens kell blood group protein 

4481 

96 - 744 

6846 

AJ005162 

Homo sapiens UDP-glucuronosyl transferase 

3555 

99 . 811 

6847 

AB011097 

Homo sapiens KIAA0525 protein 

5680 

99 .654 

6848 

M55614 

Homo sapiens TK14 protein 

5498 

99 . 757 

6849 

U15155 

Gallus gallus tripsinogen 

459 

37 . 500 

6850 

AB018320 

Homo sapiens KIAA0777 protein 

7314 

98 . 268 

6851 

AF018956 

Homo sapiens neuropilin 

6334 

99 . 783 

6852 

X00776 

Escherichia coli lipoprotein signal peptidase 

1071 

99 . 390 

6853 

AE000236 

Escherichia coli putative enzyme 

1457 

94 . 902 

6854 

D86974 

Homo sapiens KIAA0220 

3648 

99 .458 

6855 

X07936 

Homo sapiens epoxide hydrolase (AA 1-455) 

2732 

98 . 253 

6856 

Z11518 

Homo sapiens hist idyl- tRNA synthetase 

3185 

98 . 232 

6857 

M84725 

Rattus norvegicus neuronal protein 

1386 

96 . 347 

6858 

AF097025 

Homo sapiens cysteine desulfurase 

2961 

98 . 687 

6859 

AL117424 

Homo sapiens hypothetical protein 

1640 

100.000 

6860 

AJ236885 

Homo sapiens ZBP-89 protein 

5179 

99 . 748 

6861 

M74002 

Homo sapiens arginine-rich nuclear protein 

467 

34 .495 

6862 

U05812 

Herpetomonas muscarum 3' -end 

115 

31 . 325 

6863 

L06505 

Homo sapiens ribosomal protein L12 

627 

66 .460 

6864 

Y08769 

Rattus norvegicus microvascular endothelial 
differentiation gene 2 

151 

47 . 761 

6865 

X82260 

Homo sapiens RanGAPl 

3708 

99 . 830 

6866 

Y13186 

Homo sapiens dystrophin 

142 

71 .429 

6867 

M15386 

Homo sapiens gamma -globin 

522 

70 . 175 

6868 

U79260 

Homo sapiens unknown 

203 

50.794 

6869 

AB023179 

Homo sapiens KIAA0 962 protein 

4135 

99 . 049 

6870 

Y08162 

Homo sapiens heptahelix receptor 

316 

25 .364 

6871 

U18745 

Homo sapiens corticostatin/def ensin HP-4 
precursor 

330 

56.522 

6872 

X55777 

Homo sapiens put . ORF 

360 

62 . 921 

6873 

L01664 

Homo sapiens lysophospholipase 

441 

47 . 143 

6874 

AF081110 

Mus musculus domesticus ORF2 

194 

47 . 761 

6875 

U23452 

Caenorhabditis elegans No definition line 
found 

631 

33 .758 

6876 

M19419 

Mus musculus proline-rich salivary protein 

207 

36 .496 

6877 

AL031907 

Schizosaccharomyces pombe hypothetical protein 

275 

26 . 531 

6878 

AF033664 

Mus musculus cbpl46 

1197 

83 .491 

6879 

AB023167 

Homo sapiens KIAA0950 protein 

982 

48.071 I 

6880 

Z72946 

Saccharomyces cerevisiae ORF YGR159C 

236 

44 . 660 

6881 

AP000060 

Aeropyrum pernix 108aa long hypothetical 
protein 

144 

31 . 731 

6882 

AF064604 

Homo sapiens KE03 protein 

459 

51 . 634 

6883 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

3 51 

67 . 500 

6884 

M15530 

Homo sapiens B-cell growth factor 

162 

41.758 

6885 

AC002294 

Arabidopsis thaliana Unknown protein 

721 

50 . 000 

6886 

AJ243460 

Leishmania major proteophosphoglycan 

173 

28 . 319 

6887 

J04076 

Homo sapiens early growth response 2 protein 

167 

24 . 468 

6888 

M17294 

Human herpesvirus 4 unknown protein 

172 

33 . 161 

6889 

AF138957 

Bos taurus type II collogen cyanogen bromide 
fragment CB8 


o £ - U t> J. 

6890 

AB023201 

Homo sapiens KIAA0984 protein 

4765 

98 .072 

6891 

U41557 

Caenorhabditis elegans glycine -rich 

134 

46 .429 

6892 

AB030237 

Canis familiaris D4 dopamine receptor 

126 

36.667 

6893 

U58658 

Homo sapiens unknown 

296 

65.385 
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6894 

M97662 

Rattus norvegicus beta- alanine synthase 

2276 

83.206 

6895 

AB028972 

Homo sapiens KIAA104 9 protein 

3676 

98 . 919 

6896 

X64897 

Bos taurus MLRQ subunit of the NADH: 
ubiquinone oxidoreductase complex 

343 

64 . 935 

6897 

AF172328 

Homo sapiens unknown 

679 

97 . 895 

6898 

U26358 

Rattus norvegicus S100A1 gene product 

420 

55.556 

6899 

AF174394 

Homo sapiens apoptotic- related protein PCAR 

66 

36 . 957 

6900 

Z35093 

Homo sapiens SURF- 1 

1818 

92 . 593 

6901 

AL096725 

Homo sapiens hypothetical protein 

1120 

97.576 

6902 

AF044923 

Homo sapiens hookl protein 

4543 

93.565 

6903 

Z99494 

Mycobacterium leprae hypothetical protein 
MLCB57 . 05c 

110 

33 . 684 

6904 

Z79695 

Unknown Similarity to Yeast hypothetical 
protein YOR3160W (TR • E217727 ) ; cDNA EST 
EMBL:T004 98 comes 

1020 

47. 500 

6905 

L78671 

Homo sapiens CoxII/D-loop DNA fusion protein 

513 

91.358 

6906 

AF174481 

Typhlonectes natans gonadotropin- releasing 
hormone receptor 

405 

45.033 

6907 

Z95620 

Schizosaccharomyces pombe putative dna binding 
protein 

267 

36 . 364 

6908 

S85655 

Homo sapiens prohibitin 

273 

64 . 706 

6909 

L08816 

Hepatitis E virus poly-proline hinge 

80 

33 . 333 

6910 

AF072508 

Homo sapiens envelope protein 

178 

44. 776 

6911 

X17400 

Mus sp. TIS7 protein (AA 1-449) 

2606 

93 .381 

6912 

X54518 

Gossypium hirsutum late embryogenesis abundant 
protein 

109 

32.000 

6913 

AF041060 

Mus musculus co-chaperone mt-GrpE#2 precursor 

1228 

83. 929 

6914 

Z74029 

Unknown Similarity to C.elegans alcohol 
dehydrogenase (WP : C17G10 . 8 ) ; cDNA EST 
EMBL:D66106 comes fro 

778 

66 .484 

6915 

AB018423 

Mus musculus SH2 domain- containing protein 

1772 

76 .453 

6916 

U49973 

Homo sapiens ORFl; MER37; putative transposase 
similar to pogo element 

230 

30. 147 

6917 

Z73497 

Homo sapiens CU240C2.1 (Core histone 
H2A/H2B/H3/H4) 

738 

83 . 803 

6918 

U63542 

Homo sapiens FAP protein 

217 

70.455 

6919 

U97553 

murine herpesvirus 68 unknown 

144 

39.189 

6920 

AF125569 

Homo sapiens tumor suppressing STF cDNA 6 

961 

97.203 

6921 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

91 

41.667 

6922 

X72963 

Nicotiana tabacum pAP8 product 

149 

33 . 333 

6923 

Z83246 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:M79771 comes from 
this gene 

1147 

59.524 

6924 

CT01055 

Chironomus tentans giant secretory protein 

138 

29. 078 

6925 

X55777 

Homo sapiens put . ORF 

365 

63 .218 

6926 

U61947 

Caenorhabditis elegans No definition line 
found 

124 

32.051 

6927 

AF063866 

Melanoplus sanguinipes entomopoxvirus ORF 
MSV2 3 3 hypothetical protein 

110 

34 .483 

6928 

X98485 

Plasmodium vivax putative 

112 

30.208 

6929 

X16282 

Homo sapiens zinc finger protein (217 AA) (1 
is 2nd base in codon) 

1420 

97.512 

6930 

U88368 

Sus scrofa inositol ( 1 , 3 , 4 , 5) tetrakisphosphate 
receptor 

1054 

59.919 


SS7 
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6931 

Z92539 

Mycobacterium tuberculosis pth 

396 

35 .450 

6932 

AL008730 

Homo sapiens dJ487J7.1.1 (putative protein 
dJ487J7.1 isoform 1) 

3980 

99.127 

6933 

U28131 

Homo sapiens novel transcript; similar to 
transcription factors activation domains; 
linked at 5* end to AT nook motir or HMGI-C; 
Method: conceptual translation supplied by 
author 

197 

49 . 351 

6934 

AF169346 

Cavia porcellus pro-alpha- 1 type 1 collagen 

1 zf / 

J ft . J 

6935 

M62324 ! 

Homo sapiens modulator recognition factor I 


. loo 

6936 

X68600 

Hordeum vulgare pZE40 

ID / 


6937 

X55777 

Homo sapiens put . ORF 

319 

54 .639 

6938 

Z99113 

Bacillus subtilis similar to long-chain acyl- 
CoA synthetase 

"7 C A 

4 9. 074 

6939 

X55777 

Homo sapiens put . ORF 


DO .DO / 

6940 

AB030483 

Mus musculus B9 

294 

27 .083 

6941 

X01655 

Homo sapiens type III procollagen (aa 892- 
1023 ) 

176 

38 . 393 

6942 

U43200 

Boreogadus saida antifreeze glycopeptide AFGP 
polyprotein precursor 

183 

25 . 110 

6943 

U37150 

Bos taurus peptide methionine sulfoxide 
reductase 

1 o o c\ 

o J . fl 

6944 

X92109 

Homo sapiens hcglX 


ft . ft Z ft 

6945 

AL117555 

Homo sapiens hypothetical protein 

984 

98 .496 

6946 

AC005328 

Homo sapiens R26660 2, partial CDS 

1226 

85 .66 / 

6947 

Z68752 

Caenorhabditis elegans T12G3.5 

179 

32 . 203 

6948 

X63508 

Mycobacterium tuberculosis predicted ORF 

14 5 


6949 

L34807 

Musca domestica transposase 

261 

18 . 375 

6950 

S58722 

Homo sapiens X- linked retinopathy protein {c- 
terminal, clone XEH.8c} 

116 

61.538 

6951 

M24543 

Homo sapiens prostate-specific antigen 

480 

85 . 185 

6952 

AB020719 

Homo sapiens KIAA0912 protein 

8368 

99.769 

6953 

S68106 

Ascaris suum, Peptide Partial, 100 aa type IV 
collagen alpha 2 chain, alpha 2 (IV) 
{alternatively spliced} 

120 

32 .530 

6954 

AF151832 

Homo sapiens CGI -74 protein 

725 

52 .652 

6955 

AF095737 

Homo sapiens unknown 

1 A A 
J ft ft 

o / . ft lb 

6956 

AF076607 

Mus musculus prediabetic NOD sera- reactive 
autoantigen 

3374 

94 .727 

6957 

AJ010100 

Homo sapiens NKp4 4RG2 

191 

27 .632 

6958 

AJ011736 

Homo sapiens growth factor receptor binding 
protein (GRBLG) 

701 

82.812 

6959 

AF022985 

Caenorhabditis elegans No definition line 
found 

23 9 

£*k .boy 

6960 

AL022018 

Unknown /predict ion= (method: " "genscan" " , 

, rnv .« -J — ,„ .11111 D II II r>onvn . M 111 TJ QO II II ^ . 

version : l . u , score: ijj.oz / ; 
/predictions: ^mcLnou . 

q n q 
o U o 

j O . O Z 3 

c q a i 

AC UU / U b / 

AldDiaopSlS Cnailana X J. \J\JZ. ft . Z JL 

577 

29 . 653 

b 

UUb b 

XiCJUlD odpicllb UlLIa 11 Xy il ibUllCl JVCI C* LUl 

1011 

66 . 162 

6963 

U2 3037 

Oryctolagus cuniculus eIF-2Bepsilon 

4335 

90.390 

6964 

U05313 

Trypanosoma brucei CR3 

73 

29.412 

6965 

Y08250 

Homo sapiens immunoglobulin kappa light chain 

100 

39.062 

6966 

AF061812 

Homo sapiens keratin 16 

2994 

98.943 

6967 

AF151899 

Homo sapiens CGI -141 protein 

535 

56 . 835 

6968 

AF057489 

Mycobacterium thermoresistibile RNA polymerase 

129 

30.476 


a 01 o s «+ is «+ .5 , o a a & o is 




beta 



6969 

AF0056 54 

Homo sapiens act in- binding double- zinc- finger 
protein 

611 

31 . 570 

6970 

Y07604 

Homo sapiens nucleoside-diphosphate kinase 

1063 ! 

89. 730 

6971 

U01849 

Trypanosoma brucei 0RF2 

165 

30.435 

6972 

X02585 

Xenopus laevis unidentified open reading frame 
2 

241 

58.462 

6973 

AC004990 

Homo sapiens supported by Genscan and several 
ESTs: C83049 (NID : g3 062006 ) , AA823760 

(NID : g2893628 ) , AA215791 (NID :gl815572 ) , 
AI095488 (NID:g3434464) , and AA969095 

(NID :g3144275; 

3418 

98 .450 

6974 

AB029034 

Homo sapiens KIAA1111 protein 

7038 

99.060 

6975 

AJ009761 

Homo sapiens putative dimethyladenosine 
transferase 

252 

76 . 923 

6976 

L10326 

Rattus norvegicus GTP-binding protein alpha- s 
subunit 

281 

93.182 

6977 

X66403 

Homo sapiens acetylcholine receptor epsilon 

SUDUniC LHRNb 

2897 

78 . 06 9 
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UbJ JJz 

nomo sapiens super cysteine ricn protein; ot^Kir 
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zoo 

A AO 
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Xenopus laevis unidentified open reading frame 
z 
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noiuo sapiens recinooias tomd i 
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J f . O / _7 

£T Q Q O 

TTC Q /T C Q 

nomo sapiens unKnown 

O A Q 
Zfi O 


o y o o 

T\ T7> -1 ITT D f\ C\ 

Homo sapiens CGI -41 protein 

"IT "7 

0 J . b U / 


i\v Uob / u y 

Homo sapiens NAG— 7 protein 


QQ QIC 

CQOC 

J\C J. jlODD 

Homo sapiens CGI- 108 protein 

X Z. I t) 

qq a an 
jj . 4 0 / 

b y oo 


Hydra sp. mini -collagen 

ICC 
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6987 

AL022117 

Schizosaccharomyces pombe hypothetical protein 

422 

39. 535 

6988 

At lbozoz 

Arabidopsis thaliana HAL 3 A protein 

b 11 

A Q yl Oft 

6989 

M61745 

Bos taurus phosphatidyl inositol 3 -kinase 

4634 

96.409 

6 990 

IV 1~1 *1 1 ft 1 ft "1 

AF119121 

Homo sapiens putative RNA binding protein 

3 74 9 

ft ft VI O ft 

99.480 

6 991 

TV r A "i r 1" *^ ^ 

AL035523 

Arabidopsis thaliana hypothetical protein 

216 

21 . 782 

6992 

Vi. r*t —% ft ft ft ft 

AF12 38 8 0 

multiple sclerosis associated retrovirus 
element unknown protein U5/1 

45 

50.000 

6993 

X98709 

Homo sapiens COL1A1 and PDGFB fusion 
transcript 

119 

37.143 

6994 


Homo sapiens T-cell receptor gamma -chain 

A Q C 

. y j / 

6995 

X84194 

Homo sapiens acylphosphatase 

201 

64.151 

6996 

Z93785 

Unknown predicted using Genefinder; similar to 
RNA recognition motif, (aka RRM, RBD, or RNP 
domain) 

500 

31.492 

6997 

AF160909 

Drosophila melanogaster BcDNA. LD03471 

1277 

53.959 

6998 

AL035700 

Homo sapiens d*T75K24.1 (novel protein similar 
to SH3BGR (SH3 domain binding glutamic acid- 
ricn protein/ ano orioovjKii/ 

202 

41 . 053 

o y y y 

l v I± _) DJU 

Homo sapiens B-cell growth factor 

ID / 

ca c A R 

/uuu 

Ar Ub / Z 1 y 

Caenoirhabdi t is elegans No definition line 
found 

Q O 
O Z 

Jb . zU / 

7001 

AC005587 

Homo sapiens similar to mouse olfactory 
receptor 13; similar to P34984 (PID :g464305) 

1450 

71.382 

7002 

AB011154 

Homo sapiens KIAA0582 protein 

373 

87 . 879 

7003 

X63797 

Gallus gallus decorin 

342 

32 . 547 

7004 

AF064597 

Homo sapiens LINE-1 like protein 

157 

49. 153 
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7005 

AB0023 54 

Homo sapiens KIAA0356 

co n a 

OZU4 


7006 

X67703 

Drosophila melanogaster Mst84Db 

88 

33.871 

7007 

A6 1971 

unidentified MCSP 
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riomo sapiens unKnown procein 


CI A Q Q 
D J ■ ft O O 

/01o 

A±SU 1^5 y 0 J 

KaLcus noivegicus acyx loa syiiuxit;i-.ac>tJ o 

•3 71 " 

pi 11 I 
O JL . HO 

7019 

AF026198 

Fugu rubripes putative protein 2 

677 

62 . 195 

7020 

AJ24 2724 

Homo sapiens putative nxitogen-activated 
piroceiri Kinase Kinase Kinase 

lOtJ J 

DO C O A 

7021 

7\t t n t r r ^ 
Alill /bo / 

xiOiuo sapiens nypocnecicai protein 


QQ TQC 

7022 

TT1 O **> "7 

riomo sapiens ai r-Dinainy cassecie protein 


i/o . /Ul 

7023 

ALi021366 

Homo sapiens cilku /zi^> ^ \ ivines m rexatea 

P 1 ULclll / 

A 1 A *7 

QQ 1 1 rt 
i7 . 11 U 

7024 

UU Ul / J 

xiomo sapiens cycocnroine r ^ du 

-197C 

1UU . \J\J\J 

*7 n o c 

AO OUOl 
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17Ui 

qc i "in 
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7026 

AF035940 

Homo sapiens similar to mago nashi 

968 

100.000 

702 7 

AC0 05 /oJ 

riomo sapiens kjjuoj i 
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7028 

AB014599 

Homo sapiens KIAA0699 protein 

5341 

100.000 

7029 

AL05006 9 

Homo sapiens hypothetical protein 

1751 

yu . 4 y o 

7030 

AF132 96 3 

Homo sapiens CGI-29 protein 

1 o o 1 

Q Q 1 "7 A 

yy . i / 4 

7031 

AF134983 

Mus musculus energy- dependent regulator of 
proteolysis 

j" jO 

Q CT Tin 

yo . iiu 

7032 

AF100753 

Homo sapiens ancient ubiquitous 46 kDa protein 
AUP1 

2748 

100 .000 

7033 

AB015982 

Homo sapiens serine/threonine kinase 

6 054 

100 . 000 

7034 

M6 422 9 

Homo sapiens type I collagen 

TOT 
lb 1 

. 4z y 

7035 

Z81037 

Caenorhabditis elegans predicted using 
Genefinder; Weak similarity in N- terminus to 

U1N V_ — *±Z. \ nr . T DoCiD . 1^ / DDI CjtIDXj , J<i J 

rnmp c? from fhi q npn p> 
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U4 0952 
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65 . 000 

/ \J J o 
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128 

34 . 211 

1 \J O ZJ 

AB023210 

Homo saoiens k T A A 09 93 r>rotein 

2577 

100 . 000 

7040 

V\ Dt JO J 

Homo «;anipn c 3 f ibrinoaen beta chain 

X JLV^s 1 1 is^s O QL/ X S — X X O X. -X X_/ X X X Iv/VJ X X XV v— L*. CX V^ilCA X. X X 

3373 

99 . 793 

/ U ■* JL 

Dft ft 01 0 

Homo ^ani pti^ t~\ hnQnmal nrotpin 5513 

978 

100 . 000 

7042 

L2 53 14 

Drn^nnh i 1 a mplanoaaster actin-related t5jrotein 

1269 

47 . 328 
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1278 

64 . 260 

7044 

AC00 5614 

Homo sapiens F2 326 9 2 

4179 

96 . 546 

7045 

U90549 

Homo sapiens non-histone chromosomal protein 

601 

100.000 

7046 

AB018288 

Homo sapiens KIAA0745 protein 

5822 

98.472 

7047 

AF132960 

Homo sapiens CGI -26 protein 

2162 

99.107 

7048 

S42658 

Homo sapiens S3 ribosomal protein 

1589 

100 . 000 

7049 

U09813 

Homo sapiens mitochondrial ATP synthase 

8 96 

100 .000 
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7050 

X12517 

Homo sapiens C protein (AA 1-159) 

JL 4* ^> O 

10O OOO 

7051 

AI*079277 

Homo sapiens hypothetical protein, similar to 
(U32865; imotce procein 

JL / / O 

100 000 

JL. V V • w V/ V/ 

7052 

AF006087 

Homo sapiens p2 0-Arc 

1059 

100.000 

7053 

AF125101 

Homo sapiens HSPC040 protein 

/ *± *± 

100 000 
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7054 

X52425 

Homo sapiens interleukin 4 receptor 


99 7 ^fl 

7055 

AF067730 

Homo sapiens TLS-associated protein TASR-2 

815 

55. 597 

7056 

X60489 

Homo sapiens elongation f actor- 1 -beta 

JL*i O J 

1UU . U \J V-/ 

7057 

AF186264 

Homo sapiens brain specific membrane -anchored 
protein BSMAP 

4. 0 fc> U 

i nn nnn 

7058 

AB007876 

Homo sapiens KIAA0416 

3504 

100. 000 

7059 

X54938 

Homo sapiens inositol 1, 4 , 5- triphosphate 3- 
kinase 


lUv . \J\J\J 

7060 

U27831 

Homo sapiens striatum-enriched phosphatase 

J 0 3 0 

9ft "^^4 

7061 

AF001533 

Mus musculus mitogen- induced 

O /l 7/1 

97 91 1 

7062 

D16181 

Homo sapiens peripheral myelin protein 2 
(PMP2) 

Q "> Q 

i nn nnn 

7063 

Z50053 

Homo sapiens alpha2i- subunit of soluble 
guanylyl cyclase 

n r £ 

D U O O 

1 00 000 

7064 

L26288 

Rattus norvegicus LaM-iiKe protein jtxndtic 

1803 

82 . 663 

7065 

Y15060 

Homo sapiens GalT2 protein 

2925 

100 . 000 

7066 

AB023223 

Homo sapiens KIAA1006 protein 

8196 

99 . 918 

7067 

AL032684 

Schizosaccharomyces pombe hypothetical protein 

•j <t» 

37 . 126 

7068 

X76199 

Bos taurus synaptobrevin 

747 

99 . 138 

7069 

AJO 0558 5 

Homo sapiens unnamed protein product 

1 7QC 

99 £1 "7 

7070 

AF077045 

Homo sapiens ATP synthase epsilon chain 


n nn nnn 

7071 

AL050028 

Homo sapiens hypothetical protein 



7072 

X78444 

Rattus norvegicus ribosomal protein L22 

565 

69.291 

7073 

U27838 

Mus musculus glycosyl -phosphatidyl- inositol - 
anchored protein homolog 

4 Zffc b 

y 0 . 1 0 7 

7074 

U25034 

Homo sapiens neuronatin beta 

7 n ^ 
^; v d 

79 "I4f| 

7075 

X03484 

Homo sapiens raf protein (aa 1-648) 

4 3 70 

i nn nnn 

7076 

AC004539 

Homo sapiens unknown function; similar to 
Y09105 (PIDrglbbbl /1J 

9£ 7 

99 ^11 

7077 

X60188 

Homo sapiens protein serine/threonine kinase 

2534 

99.736 

7078 

AF132961 

Homo sapiens CGI-27 protein 

2 018 

100 000 

7079 

AJ001258 

Homo sapiens NIPSNAP1 protein 

1967 

99.648 

7080 

AL031266 

Caenorhabditis elegans VM106R.1 

Z zf U 

*i Z . J -7 J 

7081 

AF151898 

Homo sapiens CGI -14 0 protein 

7C4 

TOO OOO 

X U U . V \J \J 

7082 

AF044953 

Homo sapiens NADH : ubiquinone oxidoreductase 
PGIV subunit 

i oa n 

1 nn nnn 

7083 

AF155105 

Homo sapiens pucacive zinc unyeji uit: inx 
REN- 34 antigen 

1219 

100 000 

-ju w \y • v \/ v 

7084 

L00354 

Homo sapiens cnoxecyscoKinin 

775 

100 . 000 

7085 

AC0043 81 

Homo sapiens unKnown gene piuuuv.L 

1570 

100 . 000 

7086 

Y13276 

Homo sapiens Tailless protein 

2557 

100. 000 

7087 

AF159055 

Homo sapiens leucine zipper- like protein 

X -5 _7 

81.481 

7088 

AJ010793 

Triturus carnifex Newt Ovary Ribozyme- 
Associated protein 



7089 

AF181685 

Mus musculus membrane protein TMS-2 

2988 

95 . 806 

7090 

S65091 

Rattus sp. cyclic AMP-regulated 
phosphoprotein, ARPP-21 

443 

82 .759 

7091 

U41315 

Homo sapiens ZNF127-Xp 

2663 

82 . 099 

7092 

AF067937 

Caenorhabditis elegans contains similarity to 

417 

40.698 
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E. coli cation transport protein (GB:L28709) 



7093 

Z12830 

Homo sapiens SSR alpha subunit 

1856 

99.650 

7094 

AL049548 

Homo sapiens dJ3 98G3.1 (ortholog of rat CPG2) 

4426 

99 . 856 

7095 

X55954 

Homo sapiens HL2 3 ribosomal protein 


t ft ft ft ft ft 

7096 

AL023653 

Homo sapiens dJ753P9.2 (novel protein) 

2 517 

Tftft ft ft ft 

7097 

U01317 

Homo sapiens delta-globin 

987 

1 ft ft ft ft ft 

100 . oou 

7098 

A18921 

synthetic construct tissue- specif ic secretory 
protein 

1025 

100.000 

7099 

Z28407 

Homo sapiens ribosomal protein L8 

1732 

100.000 

7100 

X57346 

Homo sapiens HS1 

1568 

100 . 000 

7101 

AF140598 

Homo sapiens ring-box protein 1 

797 

100.000 

7102 

AC006389 

Homo sapiens similar to Schizosaccharomyces 
pombe splicing factor; similar to PID: 3395591 

593 

100 . 000 

7103 

X04327 

Homo sapiens 2,3 biphosphoglycerated mutase 
(AA 1 - 259) 

1747 

100 . 000 

7104 

U00043 

Caenorhabditis elegans similar to D . 
melanogaster trithorax protein 

519 

40 . 359 

7105 

Z22968 

Homo sapiens M130 antigen 

8047 

100.000 

7106 

AF189817 

Mus musculus evectin-2 

1417 

87 . 391 

7107 

X60367 

Mus musculus cellular retinol binding protein 
I 

553 

55 . 970 

7108 

Z93388 

Unknown predicted using Genefinder; cDNA EST 
EMBL:D70912 comes from this gene; cDNA EST 
EMBL:D73452 

911 

61.966 

7109 

AL050225 

Homo sapiens hypothetical protein 

1788 

98.885 

7110 

AJ011007 

Homo sapiens 6.2 kd protein 

365 

98 . 182 

7111 

AC002343 

Arabidopsis thaliana Ser/Thr protein kinase 
isolog 

511 

33 . 431 

7112 

X14329 

Homo sapiens carboxypeptidase N precursor (AA 
-20 to 438) 

3148 

99.782 

7113 

A23031 

Homo sapiens trophoblast membrane expressed 
protein 

2903 

99 . 766 

7114 

Y17392 

Homo sapiens prefoldin subunit 1 

66 9 

98 . 198 

7115 

X75500 

Homo sapiens microsomal triglyceride transfer 
protein, 97kDa subunit 

5680 

T ft ft ft ft ft 

100 . 000 

7116 

AF111713 

Homo sapiens junctional adhesion molecule 

1950 

100.000 

7117 

X08055 

Homo sapiens preglycophorin B 

C "i *7 

53 7 

T r\r\ ft ft ft 

1UU . UUU 

7118 

AL050254 

Homo sapiens hypothetical protein 

3548 

99.808 

7119 

Z46522 

Drosophila subobscura bcn92 

310 

55 .422 

7120 

Z48334 

Caenorhabditis elegans F10B5.8 

2274 

S~ 0 -I T 1 

68 . 113 

7121 

AL021546 

Homo sapiens Cytochrome C Oxidase Polypeptide 
Via- liver precursor (EC 1.9.3.1) 

762 

1 e\ r\ ft ft c\ 
100 . 000 

7122 

X13923 

Homo sapiens cytochrome c oxidase subunit VIb 
(AA 1-86) 

A "5 A 

CZ1 ft ^ Q 

7123 

Y08915 

Homo sapiens alpha 4 protein 

091 c 


7124 

AL050101 

Homo sapiens hypothetical protein 

jOU J 

_L \J \J . \J \J \J 

7125 

X04494 

Homo sapiens precursor polypeptide 

O A 1 *7 
Z 4 J / 

i on nnn 

7126 

L02956 

Xenopus laevis ribonucleoprotein 


pc 0 ft 

7127 

AF082526 

Mus musculus MEK binding partner 1 

759 

97 .581 

/I/O 


ri(_)iii(_) ct jw> itrilb nioLUiJc n^o 

787 

100 . 000 

7129 

AF100761 

Homo sapiens PTD017 

1820 

100.000 

7130 

L31783 

Mus musculus uridine kinase 

1619 

92.692 

7131 

AL117608 

Homo sapiens hypothetical protein 

541 

92 .473 

7132 

AF084259 

Mus musculus bromodoma in -containing protein 
BP75 

3884 

88 .000 
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7133 

Z81108 

Unknown similar to Ufoi qui tin -conjugating 
enzymes; cDNA EST EMBL:T01737 comes from this 
gene; cDNA E 

567 

53 .459 

7134 

AF134772 

Mus musculus LIM protein 

2134 

99.286 

7135 

AF132952 

Homo sapiens CGI -18 protein 

2008 

88 .483 

7136 

X63692 

Homo sapiens DNA (cytosine- 5- ) - 
methyltransf erase 

1101 
7 

100 . 000 

7137 

X63527 

Homo sapiens ribosomal protein L»19 

12 59 

100 . 000 

7138 

U39402 

Homo sapiens ORF; Method: conceptual 
translation supplied by author. 

2084 

100 . 000 

7139 

AF083110 

Homo sapiens sirtuin type 5 

2134 

100.000 

7140 

AF151884 

Homo sapiens CGI -126 protein 

116 5 

100 . 000 

7141 

AF092134 

Homo sapiens PTD013 

1548 

99.588 

7142 

AL117629 

Homo sapiens hypothetical protein 

796 

73 .545 

7143 

M37679 

Mus musculus Ig heavy chain precursor 

133 

72 . 727 

7144 

Z49068 

Caenorhabditis elegans mitochondrial carrier 
protein 

674 

41 . 176 

7145 

Z94160 

Homo sapiens dJ63G5.1 (human SEC7 homolog B2-1 
(cytohesin-2 , Arno, ARF exchange factor) LIKE 
protein) 

2319 

100 . 000 

7146 

X04085 

Homo sapiens catalase 

3642 

100 . 000 

7147 

Z28339 

Homo sapiens delta 4-3-oxosteroid 5 beta- 
reductase 

22 07 

100 . 000 

7148 

X55656 

Homo sapiens gamma- G globin 

104 7 

100 . 000 

7149 

AL050273 

Homo sapiens hypothetical protein 

677 

100 . ooo 

7150 

M61832 

Homo sapiens S-adenosylhomocysteine hydrolase 

2 8 90 

100 . 000 

7151 

U40952 

Caenorhabditis elegans C03B1.10 gene product 

164 

94.737 

7152 

U01317 

Homo sapiens beta -globin 

991 

100 . 000 

7153 

U31449 

Homo sapiens tetraspan membrane protein 

1471 

100 . 000 

7154 

Y14769 

Gallus gallus paralemmin 

474 

36 . 525 

7155 

L42374 

Homo sapiens protein phosphatase 2A B56-beta 

3317 

100 . 000 

7156 

AF118108 

Homo sapiens lymphatic endothelium- specif ic 
hyaluronan receptor LYVE-1 

2059 

98 . 758 

7157 

AB003103 

Homo sapiens 26S proteasome subunit p55 

o o n o 

1UU . UUU 

7158 

L05093 

Homo sapiens ribosomal protein L18a 

1222 

100.000 

7159 

AF043254 

Homo sapiens heat shock protein 75 

4094 

98 . 773 

7160 

D13630 

Homo sapiens KIAA0005 

2051 

72 . 596 

7161 

AF035262 

Homo sapiens BAF5 7 

2702 

100.000 

7162 

U28413 

Homo sapiens CSA protein 

2691 

100 . 000 

7163 

Z69904 

Caenorhabditis elegans cDNA EST yk428d5.3 
comes from this gene; cDNA EST yk428d5.5 comes 
from this gene; cDNA EST yk537a3.3 comes from 
this gene; cDNA EST yk621hll.3 comes from this 
gene 

400 

29 . 694 

7164 

K03020 

Homo sapiens phenylalanine hydroxylase 

1452 

98 .636 

7165 

X00129 

Homo sapiens precursor RBP 

1 TO ft 

-7 / . Jl/ 

7166 

D78134 

Homo sapiens CIRP 


J.UU . UUU 

7167 

AF076191 

Trichosurus vulpecula gamma-actin 

2491 

100-000 

7168 

X00910 

Homo sapiens IGF- I I precursor 

124 5 

100 .000 

7169 

Y0759 J 

homo sapiens coxsacKie ana aaenovirus reccpLur 
protein 

a n q 
** u o 


7170 

X61118 

Homo sapiens TTG-2a/RBTN-2a 

1142 

100 . 000 

7171 

Y12860 

Homo sapiens peroxisomal integral membrane 
protein 

1967 

100.000 

7172 

M10014 

Homo sapiens fibrinogen gamma chain 

3008 

99.771 1 
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7173 

Y11435 

Homo sapiens b4 integrin interactor 

1593 

100.000 

7174 

AF070666 

Homo sapiens Krxippel- associated box protein 

C Q C 

byb 

98 . 93b 

7175 

A»J131057 

Homo sapiens immunoglobulin gamma chain ( BAB 3 - 

215 

9b . 875 

7176 

AF077207 

Homo sapiens HSPC021 

O f 1 Z 

t n c\ Ann 
xuu . uuu 

7177 

AF126 531 

Homo sapiens putative DNA- directed RNA 
poxymerase xxx tii suDunit 

one 
OUD 

XUU . UUU 

7178 

Lj2 J /b / 

riomo sapxens az, j sxaxyxcransxerse 

Z A. O £, 

XUU . uuu 

/ X / 27 

VUU^bo 

riomo Sapiens aipna giooin 

-J jLj Zj 

i nn nnn 

7180 

Vft /I >l 1 Q 

A0442 9 

riomo sapxens precursor poxypepciue 


qq Tci 
-J • / D X 

7181 

D17793 

Homo sapiens KIAA0119 

2172 

100.000 

7182 

M26759 

Rattus norvegicus thymosin beta- 4 precursor 

x / y 


7183 

AF072 832 

Homo sapiens UbcH 7 -binding protein 

*± UUl 

qq Ron 

7184 

M62831 

Homo sapiens ETR101 

1474 

99.103 

7185 

M63109 

Leishmania major glycoprotein 96-92 

187 

27.128 

7186 

D28915 

Homo sapiens hepatitis C-associated 
microtubular aggregate protein p44 

2923 

99 . 550 

7187 

M92383 

Homo sapiens thymosin beta- 10 

313 

10O . 000 

7188 

AL031673 

Homo sapxens dJ694B14.1 (PUTATIVE novel KRAB 
box protein with 18 C2H2 type Zinc finger 
domains ) 

5240 

99 . 865 

718 9 

Y11652 

Homo sapiens phosphate cyclase 

O O Q 
2 2 7 

o a n Q1 

7190 

AF155095 

Homo sapiens NY- REN- 2 antigen 

3897 

100. 000 

7191 

AB020648 

Homo sapiens KIAA0841 protein 

A O C T 

42 5 J 

100 . 000 

7192 

AF151880 

Homo sapiens CGI -12 2 protein 

X 2 2 4 

XUU . uoo 

7193 

i\ r\ f~ r\ '"i 

A06977 

Homo sapiens albumin 

4118 

100 . OOO 

7194 

AF052432 

Homo sapiens katanin p80 subunit 

4366 

99.542 

7195 

U64028 

Homo sapiens NAD PH : ubiquinone oxidoreductase 
subunit B13 

772 

100 . 000 

7196 

if « /-y ^ 

X79805 

Homo sapiens PC4 , pi 5 

O O ~5 

XUU . uuu 

7197 

U7 546 7 

Drosophila melanogaster Atu 

1197 

36 .66 7 

7198 

X17 04 2 

Homo sapiens hematopoetic proteoglycan core 
protein iaa x xooj 

y y x 

. 324 

7199 

X9 92 oy 

Homo sapiens arginine methyltransf erase 

Z J / D 

inn nnn 

*7 O A A 

/ZUU 

7\ Tn m C A O 

au uu / z> u y 

riomo sapxens bio-oDKua-associacea procem 

J3 3D 

?o ■ j o j 

7201 

Ar X J 2 y b b 

Homo sapiens CGI -31 protein 

1^ / o 

t nn nnn 
XUU • uuu 

/ z uz 

At X / jobw 

riomo Sapiens uin/a jjiiiuiny ^>iCJL.tixii p^orir 

J D U 27 

QQ R99 

_7 J . O Z 

7203 

AL031427 

Homo sapiens dJ167A19.1 (novel protein) 

2037 

99.673 

7204 

7\ t-i /-V CT /I "1 Till 

Ar 0541 /4 

Homo sapiens hxstone macroH2Al . 2 

lOJi 

Do . 

"7 O A C 

/20b 

Au ljj /by 

riomo sapxens nucxear transport receptor 

D X D X 

i nn nnn 

XUU . uuu 

Tone 
7206 

AFXdXo J2 

Homo sapxens CGI -74 protein 

Z j / J 

QQ QQO 

*■? "> rv "7 

X52 X42 

Homo sapiens CTP synthetase (AA 1-591) 


QQ CO 

/zUo 

AXjU 2 1 J j1 

riomo sapxens au jdowz^ . i \pucative ^ — eieyans 
UNC-93 (protein 1, C46F11.1) LIKE protein) 

-L _> O -J 

QQ 

7209 

V T /I £" A O 

Al4oOo 

Homo sapiens propionyl-CoA carboxylase 

/I C C Q 

i nn nnn 
XUU . uuu 

f 2 X U 

7VT 1 1 m/t Q 

AXjXXU24 y 

riomo sapxens nypotnetxcax protein 

J / Zj 

QQ ICQ 

TOT T 
/2 X X 

Ay do4o 

rlOmO SapxeiiS alia SUJL>UilJ.t 


i nn nnn 

XUU . uuu 

7212 

X64177 

Homo sapiens metallothionein 

513 

100. 000 

7213 

AB028 942 

Homo sapiens KIAA1019 protexn 

1 i no 
XX27 O 

5 

QQ Q "I Q 

y y . o o y 

7214 

X56932 

Homo sapiens 23 kD highly basic protein 

1329 

100. 000 

7215 

AF134404 

Homo sapiens delta-6 fatty acid desaturase 

3152 

100.000 

7216 

AC004982 

Homo sapiens similar to yeast hypothetical 
protein ybk4 ; similar to P38164 (PID :g586461) 

2616 

100. 000 

7217 

AL049548 

Homo sapiens dJ398G3.2 (novel protein) 

987 

100. 000 
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7218 

AF114263 

Homo sapiens unknown 

1295 

100.000 

7219 

Z68220 

Homo sapiens Similarity to Human ADP/ATP 
carrier protein (SW : ADT1_HUMAN) ; cDNA EST 
EMBL:D71893 comes fro 

566 

43 . 519 

7220 

AJ002308 

Homo sapiens synaptogyrin 2 • 

1524 

100 . 000 

7221 

Z49130 

Caenorhabditis elegans cDNA EST yk486b9.3 
comes from this gene; cDNA EST yk486b9.5 comes 
from this gene; cDNA EST yk615bl.3 comes from 
this gene; cDNA EST yk626c2.3 comes from this 
gene 

169 

26 .364 

7222 

AL021937 

Homo sapiens dJ149A16.6 (novel protein, human 
ortholog of worm F16A11.2 and bacterial and 
archea-bacterial predicted proteins) 

3381 

100.000 

7223 

Y16241 

Homo sapiens nebulette 


n n n nnn 

7224 

X97544 

Homo sapiens preprotein translocase 

no y 


7225 

X15949 

Homo sapiens interferon reguiacory iaccor-z 
(AA 1-349) 


QQ AO O 

7226 

AF095927 

Rattus norvegicus protein phosphatase 2C 

41 / O 

QC 1 CO i 

7227 

AF151895 

Homo sapiens CGI -13 7 protein 

1164 

100. 000 

7228 

AC005757 

Homo sapiens R32611 2 

T A C C 

lUb j 

1 rt A AAA 

1UU . uuu 

7229 

L.41254 

Rattus norvegicus transmembrane protein 

J JO 

CO C "J o 

7230 

AF106583 

Caenorhabditis elegans contains similarity to 
BC-2 protein (GB : AF042384) 

727 

56 .923 

7231 

AF106580 

Caenorhabditis elegans No definition line 
found 

382 

50 .442 

7232 

X13227 

Homo sapiens D-amino acid oxidase (AA 1 - 347) 

2389 

99 . 135 

7233 

Z84395 

Mycobacterium tuberculosis hypothetical 
protein Rv0712 

723 

39 . 933 

7234 

Z66511 

Unknown similar to ribokinase; cDNA EST 
EMBL:D6 9553 comes from this gene; cDNA EST 
EMBL : D6 5 93 8 come 

657 

43.6 51 

7235 

X93498 

Homo sapiens 21 -Glutamic Acid-Rich Protein 


QQ CO') 

7236 

Y11651 

Homo sapiens phosphate cyclase 

z J> J / 

1 A A AAA 

1UU . uuu 

7237 

X79417 

Sus scrofa 40S ribosomal protein S12 

O o U 

1 AA AAA 

7238 

AL078579 

Arabidopsis thaliana putative acyl-CoA binding 
protein 

en 

jO.U / / 

723 9 

AF095737 

Homo sapiens unknown 

j D D 

OO . U O iJ 

7240 

AB023203 

Homo sapiens KiAAuyob procein 


1 00 000 

X U U - uuu 

7241 

AL117463 

Homo sapiens hypothetical protein 

1293 

100.000 

7242 

U34350 

Gallus gallus cThy28kD 

1 AO R 


7243 

D78592 

Rattus norvegicus glucose- 6 -phosphatase 
catalytic subunit 

/ D / 

OO 

7244 

AF073298 

Homo sapiens small c*L>KJ\.-ricn lacuor z 


1 nn nnn 

X u u . uuu 

7245 

U53155 

Caenornaoai t is eiegans imo aetinicion line 
found 

O _7 J 

J J . U ' X 

7246 

X90858 

Homo sapiens uridine phosphorylase 

1878 

99.677 

7247 

AB014589 

Homo sapiens KIAA068 9 protein 

3783 

100.000 

7248 

X72755 

Homo sapiens Humig 

one 

O U D 

l nn nnn 

X u u . uuu 

724 9 

Z74167 

Saccnaromyces cerevisiae UKr lULiiyc 

All 

T J J 

O 1 n*7 

/ Z 3 U 

AT i ~l 1 CLC.O 
/\XjX JL / DD/ 


1393 

99 . 517 

7251 

AC005058 

Homo sapiens similar to calcium- independent 
phospholipase A2; similar to AC004392 
(PID:g3367519) 

2532 

100. 000 

7252 

AB029020 

Homo sapiens KIAA1097 protein 

6718 

100.000 

7253 

X01410 

Homo sapiens T-cell receptor beta chain 

1311 

99.492 


Star 
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7254 

AB014540 

Homo sapiens KIAA064 0 protein 


1 An 0 n r\ 

7255 

AF0203 13 

Mus musculus proline -rich protein 48 

i o c a 


7256 

Yin A «"> A 

U79260 

Homo sapiens unknown 


OX. /Dj 

7257 

X66171 

Homo sapiens CMRF-3 5 antigen 

T C 1 Q 

QQ CCA 

7258 

Z14 093 

rioino Sapiens oraiicnea cnain aecarjjOAyictoc 
aipna siiijuxii c 

jUIO 

X.\J\J m \J\J\J 

725 9 

AJBUIIIaJ 

rlOmO SaplcIlS MAAUDDJL pxOtcXIl 

O O J o 


7260 


riomo sapiens riypoc.riei-.iccix p1ruut5j.11 


Q 7 coc 

/ a b 1 


nOlllO odpicuo MnrtiUAj piULCJ.ll 

3583 

99 .819 

7262 

U72245 

Homo sapiens phospholemman chloride channel 

252 

51.220 

7263 

X53280 

Homo sapiens general transcription ractor 


-L V \J . U U U 

7264 

X72964 

Homo sapiens caltractin 

1087 

100.000 

7265 

AL080178 

Homo sapiens hypothetical protein 

My 

1UU . UUU 

7266 

AB007863 

Homo sapiens KIAA04 03 

O *7 O 

z / Jo 


7267 

Z99106 

Bacillus subtilis similar to hypothetical 
proteins 


0 . !)4U 

7268 

U67536 

Met nanoc 0 c cu s jannascnn conservea 
nypocneciCdi, procein 

ADO 


7269 

TTO 1 O C C 

MUS muscuius itiLrtti prouciii 

1914 

99 . 64 9 

72 70 


nOtnO bdpicilo I1UL led! kyj. <w>L-t:-Lll o/A 

7581 

99 6 56 

7271 

Z75134 

Canis familiaris rod transducin 

2322 

100.000 

7272 

AB014601 

Homo sapiens KIAA0701 protein 

A C\ C\ Q 

Q Q COA 

yy - z>^*± 

7273 

AB007889 

Homo sapiens KIAA042 9 

2432 

100.000 

7274 

Y09615 

Homo sapiens mTERF 

2bo4 

1 flA AAA 

100 . 000 

7275 

AL117635 

Homo sapiens hypothetical protein 

116 9 

99 . 4 54 

7276 

Z14136 

Homo sapiens spermidine/spermine Nl- 
acetyl transferase 

242 

57 . 627 

7277 

X16663 

Homo sapiens haematopoietic lineage cell 
protein iAA 1-400/ 

linn 

1UU . UUU 

7278 

X70476 

Homo sapiens subunit of coatomer complex 

D U J> O 

inn nnn 

72 79 

M55531 

nomo sapiens olui d procein 

I ACT 

II J / 

45 . 895 

7280 

7\ T-> A, *7 O O A O. 

riomo sapiens aynein liynt cnain- 

O ** D J 

QQ 4?fi 

72 81 

AF09 99 J 5 

Homo sapiens MDC-3 . 13 isoform 2 

o / 

t; nnn 

7282 

voi o r\ a 
AB18U4 

rsos t.avirij.s ozt 

214 7 

95 223 

"7 O O O 

TVT 1 T T C ""7 "3 
.AXill / O / 3 

riomo sapiens iiypouneticdi proLcin 

•371 A 

99 . 824 

7284 

X95648 

riomo sapiens aira suDuiiit 


l nn nnn 

7285 

Ar U o 3 y 3 U 

riomo sapiens fcoio 

141 1 

Qft IfiR 

7286 

AF151839 

Homo sapiens CGI-81 protein 

1890 

98 . 940 

7287 

TTCQO/I n 

U59240 

Rattvis norvegiens N - tropomodulin 

1 1 A ft 

77 c;7 

72 88 

Z49213 

isaccnaromyces cerevisiae lmp/p 

1 *7 ft 

AT "J "J 
** -J . J O J 

7289 

AL080177 

Homo sapiens hypothetical protein 

785 

100 . 000 

7290 

X76534 

Homo sapiens NMB 

3859 

99.821 

7291 

Z68493 

Caenorhabditis elegans predicted using 
Genef inder 

466 

At "JTQ 

41 . j /y 

7292 

AF125101 

Homo sapiens HSPC040 protein 

744 

100 . UUU 

7293 

Z99281 

Unknown similar to ADP-ribosylation factor ; 
cDNA EST EMBL:C08179 comes from this gene; 

CDNA EST EMB 

1100 

84 .865 

7294 

TTO A/I OO 

caenornaooi c is eiegans wo aetinicion une 
found 


co T c c. 

7295 

AF103731 

Homo sapiens putative glycolipid transfer 
protein 

2466 

98 .210 

7296 

AF150100 

Homo sapiens small zinc finger-like protein 

597 

100.000 

7297 

AL050277 

Homo sapiens hypothetical protein 

655 

100 . 000 

7298 

AJ238098 

Homo sapiens Lsm6 protein 

521 

100.000 
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7299 

AF023611 

Homo sapiens Dimlp homo log 

263 

39.583 

7300 

AB011115 

Homo sapiens KIAA0543 protein 

7541 

100.000 

7301 

AE001724 

Thermotoga maritima conserved hypothetical 
protein 

479 

36.032 

7302 

X76013 

Homo sapiens glutaminyl- tRNA synthetase 

5223 

99. 871 

7303 

L34041 

Homo sapiens L-glycerol- 3 -phosphate :NAD 
oxidoreductase 

2316 

99.713 

7304 

AP000364 

Oryza sativa Similar to sequence of BAC F7G19 
from Arabidopsis thaliana. (AC000106) 

194 

41.667 

7305 

AF073839 

Rattus norvegicus bithoraxoid-like protein 

562 

93 . 750 

7306 

U51999 

Caenorhabditis elegans C43H6.7 gene product 

411 

27.645 

7307 

AF102850 

Homo sapiens dolichyl -phosphate beta- 
glucosyl transferase 

2142 

100.000 

7308 

AL031228 

Homo sapiens dJ1033B10.2 (WD40 protein BING4 
(similar to S. cerevisiae YER082C, M. sexta 
MNG10 and C. elegans F28D1.1) 

4111 

100.000 

7309 

AF160934 

Drosophila melanogaster BcDNA.LD1418 9 

1140 

61. 508 

7310 

AL117629 

Homo sapiens hypothetical protein 

796 

73.545 

7311 

AF151826 

Homo sapiens CGI- 68 protein 

2240 

99.401 

7312 

AL110295 

Schizosaccharomyces pombe conserved 
hypothetical protein 

1455 

36.475 

7313 

AF016441 

Caenorhabditis elegans No definition line 
found 

851 

41.892 

7314 

X58141 

Homo sapiens erythrocyte alpha adducin 

4876 

99.457 

7315 

AF056490 

Homo sapiens cAMP- specif ic phosphodiesterase 
8A 

4713 

99. 719 

7316 

Z93382 

Caenorhabditis elegans F4 5G2.9 

545 

41. 935 

7317 

X91817 

Homo sapiens transketolase 

3532 

96. 948 

7318 

Z69727 

Schizosaccharomyces pombe putative peroxisoaml 
membrane protein 

248 

30. 769 

7319 

AL117576 

Homo sapiens hypothetical protein 

2984 

99. 566 

7320 

AJ237946 

Homo sapiens DEAD Box Protein 5 

3108 

100 . 000 

7321 

X69480 

Saccharomyces cerevisiae uORFl 

665 

53 . 552 

7322 

AF050641 

Homo sapiens NADH- ubiquinone oxidoreductase 
3 9kDa subunit 

2527 

100.000 

7323 

U47924 

Homo sapiens C10 

809 

100.000 

7324 

AF054986 

Homo sapiens putative transmembrane GTPase 

2387 

100.000 

7325 

AF146018 

Homo sapiens hydroxypyruvate reductase 

1100 

99 .401 

7326 

Z97184 

Homo sapiens HKE2 

794 

100 . 000 

7327 

AF077044 

Homo sapiens RNA polymerase I 16 kDa subunit 

874 

100.000 

7328 

AJ131389 

Homo sapiens PEX3 protein 

2410 

100.000 

7329 

L54057 

Homo sapiens deduced protein product shows 
significant homology to coactosin from 
Dictyostelium discoideum 

922 

100.000 

7330 

X73478 

Homo sapiens phosphotyrosyl phosphatase 
activator 

2196 

99.690 

7331 

U51032 

Saccharomyces cerevisiae Ydr341cp 

1365 

41. 556 

7332 

AF026198 

Fugu rubripes putative protein 2 

855 

66 . 146 

7333 

M83104 

Bos taurus cytochrome b-5 reductase 

1198 

56 . 757 

7334 

A J2 4 5587 

Homo sapiens Kruppel-type zinc finger 

2460 

100 . 000 

7335 

D87453 

Homo sapiens KIAA0264 

2704 

100.000 

7336 

AL031666 

Homo sapiens dJ56 9M2 3.1 (similar to BS69 
protein) 

2168 

100.000 

7337 

Z22820 

Canis familiaris Rab22a protein 

1272 

98 .454 

7338 

AF151841 

Homo sapiens CGI -83 protein 

1915 

100.000 


'Sol 
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7339 

AF019082 

Borrelia burgdorferi virulent strain 
associated lipoprotein 

209 

22 .458 

7340 

AF112968 

Homo sapiens ornithine transporter 

1991 

100.000 

7341 

AF141882 

Homo sapiens APMCF1 

315 

100 . 000 

7342 

X58022 

Homo sapiens corticotropin releasing factor- 
binding protein 

2195 

99.068 

7343 

AF118023 

Homo sapiens SH3 domain- binding protein SNP7 0 

4484 

99.844 

7344 

L04733 

Homo sapiens kinesin light chain 

2521 

70.107 

7345 

U47618 

Drosophila melanogaster ovary2 

195 

27 .152 

7346 

AF143676 

Homo sapiens multispanning nuclear envelope 
membrane protein nurim 

1747 

100.000 

7347 

AF001160 

Homo sapiens G-protein gamma subunit 

448 

100.000 

7348 

U62940 

Rattus norvegicus mt-GrpE#l precursor 

1267 

88 . 479 

7349 

X14046 

Homo sapiens CD37 (AA 1-244) 

1891 

99 . 644 

7350 

X78606 

Rattus norvegicus ras -homologous GTPase rab28 

1417 

96 . 380 

7351 

AB002348 

Homo sapiens KIAA0350 

6010 

99.891 | 

7352 

M32486 

Mus musculus 19.5 protein 

1300 

64 . 364 

7353 

X72879 

Homo sapiens ORF 

149 

88 . 462 

7354 

AF085481 

Homo sapiens serum deprivation response 

2664 

100 . 000 

7355 

AB020681 

Homo sapiens KIAA08 74 protein 

3936 

100 . 000 

7356 

AJ242015 

Homo sapiens eMDC II protein 

5433 

100 . 000 

7357 

AF026816 

Homo sapiens putative oncogene protein 

1101 

100.000 

7358 

AF015913 

Homo sapiens SkblHs 

4349 

99.686 

7359 

L34734 

Homo sapiens T-cell receptor beta 

2042 

96 . 835 

7360 

AF117065 

Homo sapiens male- specif ic lethal -3 homolog 1 

3240 

99.590 

7361 

Y13736 

Homo sapiens Protein Enriched in Diabetes 

829 

100.000 

7362 

AJ223353 

Homo sapiens Histone H2B 

787 

100.000 

7363 

M55542 

Homo sapiens guanylate binding protein isoform 
I 

3356 

87.372 

7364 

AB014550 

Homo sapiens KIAA06 50 protein 

5623 

100.000 

7365 

U41857 

Xenopus laevis WD- 4 0 motifs; up- regulated by 
thyroid hormone in tadpoles 

1285 

51 . 105 

7366 

Y10319 

Homo sapiens carnitine carrier 

2052 

100. 000 

7367 

U66059 

Homo sapiens Vsegment translation product 

794 

100 . 000 

7368 

A68104 

unidentified unnamed protein product 

130 

28 .455 

7369 

U71598 

Homo sapiens zinc finger protein zfp2 

1096 

100.000 

7370 

X68505 

Homo sapiens myocyte- specific enhancer factor 
2 (MEF2) 

3397 

100.000 

7371 

AJ242975 

Homo sapiens p3 8 protein 

2583 

100 . 000 

7372 

AL034488 

Caenorhabditis elegans predicted using 
Genef inder 

925 

35.118 

7373 

AF044671 

Homo sapiens MM46 

698 

87.069 i 

7374 

AF171099 

Xenopus laevis Mi -2 histone deacetylase 
complex protein 66 

871 

48.396 

7375 

AB011182 

Homo sapiens KIAA0610 protein 

4424 

100 . 000 

7376 

X83218 

Homo sapiens ATP synthase, oligomycin 
sensitivity conferring protein 

1310 

100 . 000 

7377 

U41558 

Caenorhabditis elegans No definition line 
found 

537 

31.474 

7378 

AF 03 96 92 

Homo sapiens antigen NY-CO- 10 

2 3 97 

99 . 462 

7379 

X56389 

Canis familiaris rab4b 

1395 

100.000 

7380 

X06820 

Homo sapiens rhoB 

1324 

100.000 

7381 

AJ243721 

Homo sapiens dTDP-4 -keto-6-deoxy-D-glucose 4- 
reductase 

2124 

99.682 

7382 

X77953 

Rattus norvegicus ribosomal protein SI 5a 

860 

100.000 
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7383 

Z46676 

Caenorhabditis elegans cDNA EST yk484gl.3 
comes from this gene; cDNA EST yk484gl.5 comes 
from this gene 

401 

40.690 

7384 

J01163 

Oxytricha fallax actin 

381 

27 . 083 

7385 

AF156098 

Homo sapiens RNA binding motif protein 7 

1820 

100.000 

7386 

AF157028 

Homo sapiens protein phosphatase 
methylesterase-1 

2599 

100.000 

7387 

L24804 

Homo sapiens p23 

423 

46 . 094 

7388 

AF151876 

Homo sapiens CGI- 118 protein 

1336 

100.000 

7389 

Z97210 

Schizosaccharomyces pombe hypothetical protein 

201 

40 . 196 

7390 

X95592 

Homo sapiens C1D protein 

901 

100 . 000 

7391 ! 

AE000984 

Archaeoglobus fulgidus dinitrogenase reductase 
activating glycohydrolase (draG) 

294 

26 .608 

7392 

Z98979 

Schizosaccharomyces pombe hypothetical protein 

348 

40 . 157 

7393 

D50807 

Bos taurus synaphin 

194 

33 . 088 

7394 

X56667 

Homo sapiens calretinin 

1793 

99 .631 

7395 

M23159 

Cricetus cricetus DHFR-coamplif ied protein 

194 

28 . 761 

7396 

D64000 

Synechocystis sp. hypothetical protein 

220 

33 . 803 

7397 

AB020684 

Homo sapiens KIAA0877 protein 

3836 

99.310 

7398 

U22229 

Felis catus ribosomal protein L41 

170 

100.000 

7399 

AJ010392 

Rattus norvegicus Bdeight protein 

1427 

92.340 

7400 

AF093673 

Cricetulus griseus layilin 

2179 

84 . 182 

7401 

X74801 

Homo sapiens gamma subunit of CCT chaperonin 

3492 

99.632 

7402 

Z97630 

Homo sapiens dJ466N1.2 (2 -amino- 3- 
ketobutyrate-CoA ligase mRNA, nuclear gene 
encoding mitochondrial protein) 

2785 

99. 761 

7403 

AC002301 

Homo sapiens Homolog of rat Zymogen granule 
membrane protein 

270 

37.594 

7404 

AF147717 

Homo sapiens ubiquitin C-terminal hydrolase 
UCH37 

2165 

100.000 

7405 

AL096779 

Homo sapiens hypothetical protein 

1978 

100.000 

7406 

AF069442 

Arabidopsis thaliana putative WD- repeat 
protein 

1130 

42 . 721 

7407 

AF020262 

Bos taurus general protein transport factor 
P 16 

776 

100.000 

7408 

AF097439 

Mus musculus brain expressed X- linked protein 
2 

648 

72.868 

7409 

U97001 

Caenorhabditis elegans similar to 
Schizosaccharomyces pombe 4- 

nitrophenylphosphatase (PNPPASE) (GB : X62722 , 
NID:g5005) 

767 

51.969 

7410 

X71978 

Mus musculus Fif 

1923 

95 . 904 

7411 

AL117526 

Homo sapiens hypothetical protein 

5576 

99 . 883 

7412 

AC002550 

Homo sapiens Unknown gene product 

1104 

99.379 

7413 

U81002 

Homo sapiens TRAF4 associated factor 1 

1710 

99.267 

7414 

AF151900 

Homo sapiens CGI - 142 protein 

1167 

100. 000 

7415 

AB011145 

Homo sapiens KIAA0573 protein 

3074 

100. 000 

7416 

AF059569 

Homo sapiens actin binding protein MAYVEN 

3881 

99.658 

7417 

AL031765 

Unknown /predict ion= (method: " "genef inder" " , 
version: " "084" ", score: ""31.96"") ; 
/prediction= (metho 

473 

34.348 

7418 

U80447 

Caenorhabditis elegans similar to the beta 
transducin family 

742 

32 . 767 

7419 

AL080186 

Homo sapiens hypothetical protein 

1860 

99.664 

7420 

AL049758 

Homo sapiens dJ437M21.2 (novel putative GTP- 

1415 

100. 000 
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ase activating protein for Arf similar to worm 
F07F6 .4) 



7421 

AL023780 

Schizosaccharomyces pombe zinc finger protein 

359 

31.606 

7422 

AJ224326 

Homo sapiens ribulose- 5 -phosphate -epimerase 

1179 

100. 000 

7423 

AB023191 

Homo sapiens KIAA0974 protein 

3780 

100.000 

7424 

Z35663 

Caenorhabditis elegans Weak similarity with 
non- his tone chromosomal protein HMG-1 (human) . 
Glutamate rich carboxyl terminus; cDNA EST 
EMBL:D74688 comes from this gene; cDNA EST 
EMBL:D71938 comes from this gene 

211 

27.184 

7425 

X16901 

Homo sapiens 30kb subunit of RAB30 /74 

1623 

100 . 000 

7426 

D14696 

Homo sapiens KIAA0108 

703 

44 .395 

7427 

AB014581 

Homo sapiens KIAA06 81 protein 

659 

36.336 

7428 

AF000423 

Rattus norvegicus synaptotagmin XI 

2699 

94 . 896 

7429 

U90313 

Homo sapiens glutathione- S- transferase homolog 

1044 

64 . 06 9 

7430 

AB015345 

Homo sapiens HRIHFB2216 

3300 

99 . 588 

7431 

L02241 

Mus musculus protein kinase inhibitor 

354 

74 . 359 

7432 

AL021068 

Homo sapiens dJ206D15.3 

3310 

99 . 607 

7433 

U40628 

Rattus norvegicus unknown 

181 

50 . 000 

7434 

AC003028 

Arabidopsis thaliana unknown protein 

172 

43.13 7 

7435 

X65724 

Homo sapiens ORF2 

1092 

100.000 

7436 

Z97341 

Arabidopsis thaliana hypothetical protein 

292 

36 . 424 

7437 

D88157 

Ovis aries cytochrome b561 

591 

45 . 238 

7438 

AF133207 

Homo sapiens protein kinase 

1362 

99 .490 

7439 

AJ235271 

Rickettsia prowazekii unknown 

458 

30.638 

7440 

AF094583 

Homo sapiens putative HIV-1 infection related 
protein 

627 

98 . 925 

7441 

J04970 

Homo sapiens carboxypeptidase M precursor 

3050 

100.000 

7442 

AC006978 

Homo sapiens supported by human and rodent 
ESTs; match to AA454028 (NID : g2l676 97 ) , 
similar to AA9255224 (NID :g4236415) and 
AA02 3712 (NID:gl4 8 762 7) 

1048 

99.315 

7443 

AL021683 

Homo sapiens unnamed protein product 

1820 

100 . 000 

7444 

A68112 

unidentified unnamed protein product 

175 

28 . 926 

7445 

U41515 

Homo sapiens Method: conceptual translation 
supplied by author 

494 

100 . 000 

7446 

X57351 

Homo sapiens 1-8D 

873 

99 . 242 

7447 

AB018325 

Homo sapiens KIAA07 82 protein 

7932 

100 . 000 

7448 

X13482 

Homo sapiens U2 snRNP- specif ic A 1 protein (AA 
1-255) 

1630 

99.608 

7449 

X66901 

Mus musculus En-2/lacZ fusion protein 

307 

90 . 196 

7450 

AF129756 

Homo sapiens NG33 

808 

100 . 000 

7451 

JO0911 

Gallus gallus alpha- tropomyosin (partial) 

164 

96 . 296 

7452 

Z80220 

Unknown Similarity to yeast protein TREMBL ID 
E246895) ; cDNA EST EMBL:T00018 comes from this 
gene ; 

774 

56 . 944 

7453 

X67325 

Homo sapiens p2 7 

318 

o / . dUU 

7454 

AL117530 

Homo sapiens hypothetical protein 

3104 

r\ a CCA 

yy . bb4 

7455 

AF151857 

Homo sapiens CGI-99 protein 

1611 

100.000 

74 56 

AL»035301 

Homo sapiens hypothetical protein 

One 

i nn r»or» 
1UU . uuu 

7457 

U19142 

Homo sapiens GAGE - 1 protein 

327 

51 .351 

7458 

Z49858 

Rattus norvegicus plasmolipin 

435 

45 . 062 

7459 

AL050147 

Homo sapiens hypothetical protein 

3647 

100. 000 

7460 

Z82096 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to E.coli guanosine-3 1 , 

419 

56 .410 
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5 1 -bis (diphosphate) -pyrophosphohydrolase 
(SW:P17580) 



7461 

AF143956 

Mus musculus coronin-2 

3101 

94 . 262 

7462 

AB007836 

Homo sapiens Hie- 5 

3323 

100 . 000 

7463 

AC008263 

Arabidopsis thaliana Contains similarity to 
gb | Z 9 5 6 3 7 acyl - CoA : 1 - acylg lycerol - 3 - phospha t e 
acyltransf erase from Brassica napus . 

495 

28 . 981 

7464 

AF117210 

Homo sapiens host cell factor 2 

5379 

100 . 000 

7465 

J03801 

Homo sapiens lysozyme precursor (EC 3.2.1.17) 

1033 

100 . 000 

7466 

AF149720 

Danio rerio unknown 

165 

41.379 

7467 

AL022724 

Homo sapiens dJ413H6.1.1 (hamster Androgen- 
dependent Expressed Protein LIKE PUTATIVE 
protein) (isoform 1) 

1716 

100.000 

7468 

AF151844 

Homo sapiens CGI -86 protein 

2246 

100 . 000 

7469 

X77631 

Cricetulus griseus intermediate filament 
associated protein 

100 

37 . 500 

7470 

X78686 

Homo s ap i en s ENA - 7 8 

741 

100.000 

7471 

AL049610 

Homo sapiens dJ1055C14.1 (transcription 
elongation factor A (SI I) -like 1) 

181 

37.097 

7472 

AL008637 

Homo sapiens NCF4 

2221 

100 . 000 

7473 

Z81108 

Unknown similar to Ubiqu it in- conjugating 
enzymes; cDNA EST EMBL:T0173 7 comes from this 
gene; cDNA E 

567 

53 . 459 

7474 

M10942 

Homo sapiens human metal lothionein- Ie 

512 

100 . 000 

7475 

AF077034 

Homo sapiens HSPC010 

611 

100.000 

7476 

K01383 

Homo sapiens MT1A 

508 

98 . 361 

7477 

X90999 

Homo sapiens glyoxalase II 

1757 

100 . 000 

7478 

M74161 

Homo sapiens inositol polyphosphate 5- 
phosphatase 

6355 

99.894 

7479 

AC004883 

Homo sapiens similar to KIAA0766; similar to 
PID:g3882253 

536 

27 . 381 

7480 

AF181685 

Mus musculus membrane protein TMS-2 

2988 

95 . 806 

7481 

AF145615 

Drosophila melanogaster BcDNA. GH033 77 

1057 

46 . 133 

7482 

U09410 

Homo sapiens zinc finger protein ZNF131 

3134 

99 . 366 

7483 

AF047431 

Homo sapiens AAPTl-like protein 

1580 

98 . 340 

7484 

AF176012 

Homo sapiens J domain containing protein 1 
isoform a 

1335 

100 . 000 

7485 

X94917 

Drosophila melanogaster head-elevated 
expression in 0.9 kb 

194 

33 . 000 

7486 

U54807 

Rattus norvegicus GTP-binding protein 

1483 

97 . 357 

7487 

AF058807 

Bos taurus GTP-binding protein rah 

1248 

85.106 

7488 

L20427 

Rattus norvegicus dihydroxypolyprenylbenzoate 
methyl transferase 

1578 

82 . 500 

7489 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

108 

34 . 375 

7490 

AF072864 

Homo sapiens peroxisomal membrane protein PMP 
24 

1406 

99.528 

7491 

AL050269 

Homo sapiens hypothetical protein 

1339 

99.034 

7492 

n i-i ^» r» r» k\ 

AF078850 

Homo sapiens steroid dehydrogenase homo log 

73 9 

40 . 065 

/4 y 3 

Zy / ZK)*k 

Schi zosaccharomyces pombe hypothetical protein 



7494 

U23484 

Caenorhabditis elegans No definition line 
found 

627 

52 .660 

7495 

X63679 

Homo sapiens TRAM protein 

2432 

100.000 

7496 

X57352 

Homo sapiens 1-8U 

883 

98.496 

7497 

Z81097 

Caenorhabditis elegans cDNA EST EMBL:D6 9071 

190 

31.858 
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comes from this gene 



7498 

AL035419 

Homo sapiens dJ1100H13.1 (putative novel 
protein) 

1440 

100.000 

7499 

Z37166 

Homo sapiens nuclear RNA helicase (DEAD 
family) 

2817 

100 . 000 

7500 

Z66515 

Unknown cDNA EST EMBL:C07816 comes from this 
gene; cDNA EST EMBL:C0932 8 comes from this 
gene ; cDNA 

411 

35.135 

7501 

AF131220 

Homo sapiens HEMK homo log 

2311 

99 . 704 

7502 

Z49128 

Unknown similar to cAMP- dependant protein 
kinase; cDNA EST EMBL:T00719 comes from this 
gene ; cDNA E 

1098 

44 . 759 

7503 

AJ223980 

Homo sapiens BCL7C 

1487 

99.078 

7504 

X61381 

Rattus rattus interferon- induced protein 

238 

43 . 023 

7505 

U16697 

Torpedo marmorata 14 kDa transmembrane protein 

113 

32.692 

7506 

Z82214 

Homo sapiens dJ526I14.3a (fragment of novel 
CUB and EGF-like domain protein) 

1387 

80 . 995 

7507 

AL008583 

Homo sapiens dJ327J16.1 (human ortholog of 
mouse outer arm Dynein light chain 4) 

716 

100.000 

7508 

D38169 

Homo sapiens inositol 1 , 4 , 5- trisphosphate 3- 
kinase isoenzyme 

4207 

99.834 

7509 

D90053 

Sus scrofa destrin 

1109 

100 . 000 

7510 

AF016685 

Caenorhabditis elegans similar to short chain- 
type dehydrogenases 

776 

42.073 

7511 

U23484 

Caenorhabditis elegans No definition line 
found 

517 

33.133 

7512 

X52509 

Homo sapiens tyrosine aminotransferase 

3098 

100 . 000 

7513 

X13923 

Homo sapiens cytochrome c oxidase subunit VIb 
(AA 1-86) 

206 

52 . 941 

7514 

Z70208 

Caenorhabditis elegans predicted using 
Genefinder; similar to Zinc finger, C3HC4 type 
(RING finger) 

272 

28 . 141 

7515 

AE000715 

Aquifex aeolicus ribosomal protein L20 

270 

41.053 

7516 

AL031432 

Homo sapiens dJ465N24 . 2 . 1 (PUTATIVE novel 
protein) (isoform l) 

1476 

99 . 558 

7517 

Ali03 5588 

Homo sapiens acr696Pi9.i itfeb; 

3221 

98 . 975 

7518 

U79275 

Homo sapiens unknown 

918 

100.000 

7519 

AJ011306 

Homo sapiens guanine nucleotide exchange 
factor (long isoform) 

2716 

99.523 

7520 

AE001788 

Thermotoga maritima ribosomal protein S15 

178 

37 . 037 

7521 

AE001070 

Archaeoglobus fulgidus ribonuclease PH (rph) 

328 

33 . 937 

7522 

X87176 

Homo sapiens 17beta-hydroxysteroid 
dehydrogenase 

4839 

100.000 

7523 

AC005189 

Homo sapiens match to ESTs H97758 
(NID:glll8643) and AA085546 (NID :gl628773 ) 

976 

100.000 

7524 

AL049802 

Homo sapiens hypothetical protein 

4303 

100 . 000 

7525 

AF067219 

Caenorhabditis elegans No definition line 
found 

117 

41.860 ; 

7526 

AF060568 

Homo sapiens promyelocytic leukemia zinc 
finger protein; kruppel-like zinc finger 
protein; PLZF 

4578 

99.554 

7527 

AL031670 

Homo sapiens similar to Zinc finger, C3HC4 
type (RING finger) ; match PFAM PF00097; 
Similar to SW : GOLI_DROME Q06003 GOLIATH 
PROTEIN 

1054 

100.000 
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7528 

AF140501 

Homo sapiens RAD3 0B 

4696 

99 . 720 

7529 

AF116827 

Homo sapiens unknown 

3851 

100 . 000 

7530 

U28282 

Homo sapiens zinc finger protein 

2382 

99 .443 

7531 

AF18 0920 

Homo sapiens eye 1 in ania-6a 

2016 

100 . 000 

7532 

AL031121 

Homo sapiens d»J495O10.2 (novel protein similar 
to worm E04F6 . 2 

343 

100 . 000 

7533 

AB031292 \ 

Mus musculus proteolipid protein 2 

195 

31.746 

7534 

AL117562 

Homo sapiens hypothetical protein 

2411 

99.425 

7535 

U25801 

Homo sapiens Taxi binding protein 

801 

98 .374 

7536 

X94910 

Homo sapiens ERp28 

1682 

99 .234 

7537 

X15722 

Homo sapiens glutathione reductase (AA 1-479) 

3163 

100 . 000 

7538 

Z70213 

Caenorhabditis elegans predicted using 
Genefinder; Weak similarity to Sea Urchin 
myosin heavy chain (PIR Acc . No. A37352) 

296 

28 .910 

7539 

AF151818 

Homo sapiens CGI- 60 protein 

2142 

97.605 

7540 

AF144700 

Homo sapiens small zinc finger- like protein 

654 

100.000 

7541 

U46690 

Mus musculus ATP-dependent RNA helicase 

2643 

85.000 

7542 

L14331 

Caenorhabditis elegans coded for by C. elegans 
cDNAs GenBank: CE5D1 (Z14791) , CEL01F1 
(M88817) , CEL04B5 (M88849) , and 
CEL04C1 (M75812) ; putative 

938 

44 .407 

7543 

AF139461 

Homo sapiens hypothetical protein SBBI31 

976 

100 . 000 

7544 

AL031775 

Homo sapiens dJ30M3.2 (novel protein) 

599 

100. 000 

7545 

AF049523 

Homo sapiens huntingtin- interacting protein 
HYPA/FBP11 

2775 

99.764 

7546 

AC005521 

Homo sapiens similar to yeast SSU72 ; similar 
to P53538 ( PID : gl7H532 ) 

964 

69 . 543 

7547 

AJ005894 

Homo sapiens JM27 

703 

100 . 000 

7548 

AL035464 

Homo sapiens dJ1043E3.1 (novel protein) 

1211 

100 . 000 

7549 

AF153605 

Homo sapiens androgen induced protein 

1625 

99 . 580 

7550 

U4 3 7 0 1 

Homo sapiens ribosomal protein L23a 

999 

100 . 000 

7551 

X07523 

Homo sapiens complement factor H 

3307 

100 . 000 

7552 

AF028823 

Homo sapiens Tax interaction protein 1 

744 

100 . 000 

7553 

X67155 

Homo sapiens mitotic kinase-like protein- 1 

6116 

98 .857 

7554 

AL034353 

Schizosaccharomyces pombe putative 60s 
ribosomal protein 

282 

41 . 071 

7555 

AF143889 

Homo sapiens unknown 

1190 

99 .465 

7556 

X08055 

Homo sapiens preglycophorin B 

537 

100 . 000 

7557 

Z93783 

Homo sapxens du377F16.1 (PUTATIVE novel 
protein) 

975 

100 . 000 

7558 

AF15184 5 

Homo sapiens CGI- 87 protein 

2474 

99 . 472 

7559 

Au 000414 

Homo sapiens Cdc42- interacting protein 4 

3628 

100 . 000 

7560 

Z36531 

Homo sapiens fibrinogen- like protein 

3024 

100 . 000 

7561 

M2 94 58 

Homo sapiens carbonic anhydrase III 

1852 

100 . 000 


X953 84 

Homo sapiens 14.5 kDa translational inhibitor 
protein, pl4 . 5 

854 

10 0 . 000 

7563 

AF132971 

Homo sapiens CGI- 37 protein 

1192 

100.000 

7564 

AF1324 84 

Mus musculus unknown 

897 

80 . 864 

7565 

X54 94 1 

Homo sapiens Cksl protein homologue 

561 

100 . 000 

/jOD 
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7567 

U94991 

Xenopus laevis transcription factor XLMOl 

1051 

97 .973 

7568 

AF017096 

Drosophila melanogaster similar to C. elegans 
R10H10.6 and S. cerevisiae YD8419.03C 

634 

61 .379 

7569 

S73775 

Homo sapiens calmitine; calsequestrine 

2557 

99 .231 

7570 

AL049522 

Schizosaccharomyces pombe hypothetical protein 

859 

51 .464 
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7571 

Y13141 

Bromheadia f inlaysoniana extensin 

122 

44 . 118 

7572 

U85431 

Homo sapiens unknown protein 

159 

70.000 

7573 

AF151882 

Homo sapiens CGI -124 protein 

1153 

100.000 

7574 

AF022913 

Homo sapiens GPI transamidase 

2640 

100.000 

7575 

Z35604 

Unknown cDNA EST EMBL : Z14593 comes from this 
gene; cDNA EST EMBL:T01764 comes from this 
gene ; cDNA 

1003 

37.168 

7576 

X77635 

Drosophila melanogaster lethal (2) essential for 
life 

179 

35 . 000 

7577 

X65020 

Bos taurus PSST subunit of the NADH : 
ubiquinone oxidoreductase complex 

1249 

88 .725 

7578 

Z93382 

Caenorhabditis elegans F4 5G2.10 

440 

57.377 

7579 

AF079317 

Sphingomonas aroma ticivorans unknown 

777 

39.763 

7580 

Z99278 

Caenorhabditis elegans cDNA EST EMBL:D75703 
comes from this gene; cDNA EST yk513g5.3 comes 
from this gene; cDNA EST yk528bl0.3 comes from 
this gene 

817 

52 . 964 

7581 

AF152583 

Homo sapiens unknown 

377 

100 . 000 

7582 

Z30093 

Homo sapiens basic transcription factor 2, 35 
kD subunit 

2025 

99 . 669 

7583 

D43636 

Homo sapiens KIAA0096 gene product is related 
to a protein kinase. 

3491 

100 . 000 

7584 

AF177758 

Homo sapiens ubiquitin specific protease 16 

3900 

100 . 000 

7585 

X16166 

Homo sapiens cytokine 21 

624 

98 . 913 

7586 

U73522 

Homo sapiens AMSH 

2818 

100 . 000 

7587 

Y17454 

Homo sapiens LSFR1 protein 

892 

98.496 

7588 

Y07968 

Homo sapiens TFG 

2711 

99.501 

7589 

AB023222 

Homo sapiens KIAA1005 protein 

6867 

100.000 

7590 

U88167 

Caenorhabditis elegans No definition line 
found 

124 

30 .208 

7591 

AF117657 

Homo sapiens thyroid hormone receptor- 
associated protein complex component TRAP 80 

4754 

99.860 

7592 

U46754 

Aphrodite aculeata nerve myoglobin 

186 

30 .496 

7593 

X91809 

Homo sapiens GAIP 

1523 

100.000 

7594 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

102 

57 . 143 

7595 

X76228 

Homo sapiens vacuolar H+ ATPase E subunit 

1427 

100. 000 

7596 

X80333 

Mus musculus rabl8 

1344 

99 . 029 

7597 

AL022238 

Homo sapiens dJ1042K10 . 2 . 1 (novel protein with 
probable rabGAP domains and Src homology 
domain 3) (isoform 1) 

5195 

99 . 871 

7598 

AJ224875 

Homo sapiens glucosyltransf erase 

3557 

99.060 

7599 

AB020680 

Homo sapiens KIAA0873 protein 

3012 

99.785 

7600 

Z37986 

Homo sapiens phenylalkylamine binding protein 

1611 

100 . 000 

7601 

D43949 

Homo sapiens This gene is novel . 

4103 

100 . 000 

7602 

Y07923 

Homo sapiens GTP-binding protein 

1530 

99 . 138 

7603 

Z32683 

Caenorhabditis elegans similar to RNA binding 
protein; cDNA EST EMBL:D73574 comes from this 
gene; cDNA EST yk613f4.3 comes from this gene 

229 

41 . 379 
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7605 

A58552 

unidentified unnamed protein product 

720 

100.000 

7606 

AF055016 

Homo sapiens unknown 

1511 

99.539 

7607 

AF078848 

Homo sapiens BUP 

1272 

100.000 

7608 

U28168 

Mus musculus GP106 

760 

60.819 

7609 

AL117237 

Homo sapiens hypothetical protein 

6229 

99.783 
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7610 

AJ010277 

Homo sapiens TBX19 protein 

3093 

100.000 

7611 

X95190 

Homo sapiens branched chain acyl-CoA oxidase 

4547 

100 . 000 

7612 

K01075 

synthetic construct circumsporozoite (CS) 
fusion prot (partial) 

141 

26 . 271 

7613 

AL023859 

Unknown SPBC19C7 . 07c, putative tRNA splicing 
endonuclease ga mma subunit, len:2 84aa, 
similar eg. 

281 

25 , 43o 

7614 

AF100 /4b 

Homo sapiens rioirv^u^j procein 

9 ^ ft Q 


7615 

S75997 

Rattus sp . nucleoporin p62 homolog 



7616 

X662 95 

Mus musculus Clq C chain 

1 *7 ** 


7617 

U72680 j 

MUS muscuius ion cnannex nonioioy kiu 

417 

4 ft 911 

7618 

AB028962 

Homo sapiens KIAA103 9 protein 

2901 

99.097 

7619 

AF132966 

Homo sapiens CGI- 32 protein 

1 "7 ^ n 

qq "9 <C *7 
33 . zo / 

7620 

AF042284 

Homo sapiens unknown 



7621 

AB002377 

Homo sapiens KIAA03 79 

5874 

100.000 

7622 

X55681 

Lycopersicon esculentum extensin (class I) 

1 7 J 


7623 

AC005023 

Homo sapiens match to EST AA361117 
(NID :g2 01343 6) 

10-<21 

10U . UUU 

7624 

X78925 

Homo sapiens zinc finger protein 

3 Xj .3 

QQ *7*9 *7 

7625 

AF044956 

Homo sapiens NADH : ubiquinone oxidoreductase 
B22 subunit 

1-511 


7626 ! 

X03084 

Homo sapiens Clq B-chain precursor 

1 JjO 

i nn nnn 

X U U . UUU 

7627 

AF064257 

Homo sapiens unmi-iiKe procein 

CCOQ 
O _> O ,7 


7628 

AC007055 

Homo sapiens unknown 

1337 

100.000 

7629 

AC005253 

Homo sapiens R2644 5 1 

1191 

100 . 000 

7630 

AL031179 

Schizosaccharomyces pombe hypothetical protein 

192 

29.57 7 

7631 

X03077 

Homo sapiens lactate dehydrogenase -A 

216 7 

100 . 000 

7632 

M16424 

Homo sapiens beta-hexosaminidase alpha chain 

3651 

100 .000 

7633 

Z77667 

Caenorhabditis elegans cDNA EST EMBL:C08125 
comes from this gene; cDNA EST EMBL:C09753 
comes from this gene 

1034 

38 . 673 

7634 

X02544 

Homo sapiens alphal-acid glycoprotein 

13 3 8 

99 . 502 

7635 

AC004000 

Homo sapiens match to EST AA085966 
(NID:gl629547) 

491 

100.000 

7636 

U47924 

Homo sapiens B-cell receptor associated 
protein 

1 Q "7 O 
lb /^S 

inn nnn 
1UU . UUU 

7637 

X17025 

Homo sapiens homologue of yeast IPP isomerase 

1 TIT 

Ijj / 

i nn nnn 

X U U . \J\J\J 

7638 

AF129131 

Xenopus laevis putative Zic3 binding protein; 
CBP3 protein homolog 

1094 

75. 000 

763 9 

M57424 

Homo sapiens adenine nucieociae transiocacor-z 

1 R7fi 
x 0 / 0 

92 . 905 

7640 

U57344 

Mus musculus Meis3 

2357 

89. 153 

7641 

U57344 

Mus musculus Meis3 

zUb / 

O X . ZDD 

7642 

AF033120 

Homo sapiens p53 regulated PA26-T2 nuclear 
protein 

1765 

58.439 

7643 

AF052193 

Gallus gallus translation repressor 

DUO 

1 nn nnn 

7644 

U2 7 8 3 8 

Mus musculus glycosyl -phosphatidyl -inositol- 
anchored protein homolog 


QfZ "IRQ 
3D . X O 3 

7645 

AB008549 

Homo sapiens type i procollagen L-proceinase 
enhancer protein 


100 000 

7646 

D79990 

Homo sapiens KIAA0168 

1147 

58.567 

7647 

L14429 

Caenorhabditis elegans putative 

497 

88.235 

7648 

AL023496 

Streptomyces coelicolor hypothetical protein 

202 

34.483 

7649 

AF098669 

Emericella nidulans pantothenate kinase 

608 

36. 943 

7650 

AF151869 

Homo sapiens CGI -111 protein 

1256 

98 .492 

7651 

U79258 

Homo sapiens unknown 

797 

100.000 
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7652 

U77327 

Homo SaDieilS Ki-l/57 intrarpl lul anfiapn 

\J O *± 

Q Q Q Q *7 

7653 

X51466 

Homo saDiens elonaation factor ? 

3/11 

i fin fififi 

7654 

X83618 

Homo saDiPiis h vH fn yvtup f h vl rrl nt*^ rvl — f^oZi 

synthase 

1 ^ S fl 

Q Q Q fi 1 

7655 

X02544 

Homo sapiens alphal-acid glycoprotein 

1338 

99. 502 

7656 

D50646 

Mng musculus snF? 



7657 

AF102805 

Drosophila melanogaster Peter Pan 

976 

35 . 745 

7658 

AL031431 

Homo ct =5 i pnq rLT4fi^O/2"l (nnv^l nrnhpin) 

O 1 A 
D JL*± 


7659 

X83300 

Homo flAT>"i pnpi SMA4 



7660 

K032 07 

Homo <3P1T> "i t^rt Q qal -j \r?i y\r r"»r*ol inp-ri r*»V* T>i^r*>t- <=» -i t-i 

precursor 

1 / 1^5 


7661 

AJ000342 

Homo saDiens DMBTl/6Jcb 1 Drotein 

5 

qq a a a 

. o o o 

7662 

X54134 

Homo sapiens protein- tyrosine phosphatase 

4739 

i no nnn 

X V v . \J KJ \J 

7663 

AL031709 

Homo sapiens C316G12.3 (novel protein) 

2093 

100 000 

7664 

S45367 

Canis familiaris centractin 

2488 

100. 000 

7665 

M27444 

Bos taurus Dhoj3Dhor>rof pin 

i on o 
x«io 

. iyo 

7666 

AF007889 

Symbiodinium microadriaticum calmodulin 

845 

91.304 

7667 

X97324 

Homo Qar>i pn q H T~»r%r">Vi 1 1 i n 

Of/in 

o o r / o 

7668 

X14479 

Canis familiaris calcyphosin (AA 1-189) 

726 

55.738 

7669 

23 8113 

oa^Liicix oinyces cerevisiac on, len: 99, 
CAT — 0 91 

*i ^ o 

42 . 424 

7670 

M6 5 06 6 

Homo c^^-r-^T pri q p21MD-rfor»ov»Hpnh t* - » t n If -i naoo D T _ 

beta regulatory subunit 

/ 

O Q TOT 

99 .73 7 

7671 

AJ001340 

Homo sapiens U3 snoRNP associated 55 kDa 

piUUCUl 

3168 

100.000 

7672 

U88573 

Homo sapiens NBR2 

419 

95 . 082 

/ D / .5 

/-vr* u / / u j u 

riomo sapiens nypocnecicai 4J . z Kua protein 

581 

32 . 283 

76 74 

7 o o o 4 rj 

viijvriowri siiuiiai lo ±t^Jr~x iiKe cnaperonin; CJJrJA 
EMBL : 

2514 

68 .470 

7675 

U40060 

V-' T V VJ u w 

uaciiui iioJJUl Lis clcy alio WcdA. J. y Dimilai L-O E* . 

nidulans himA crpnt* OTodtir't fSP ■ Pi 7ftflR^ 

i nor 

lUob 

.3 9 . 3o 7 

7676 

AL080125 

Homo saoipns hvnnf Vif^t* i r*?i 1 nrnt'P'i n 

o O ^ / 

1 fifi fififi 

7677 

D80002 

Homo sapiens KIAA0180 

^ joy 

inn fififi 
1UU . uuu 

7678 

Z49878 

Homo saoiens auanidinoacetabp M- 
methyl transf erase 

lean 

X O *± \J 

i fin fififi 

7679 

AF092092 

Homo saoiens AP-3 adantor comnlpy mi ha csnhiinHt* 

A / ou 

1 fifi fififi 
1UU . uuu 

7680 

X95073 

Homo sapiens Translin associated protein X 

1890 

100.000 

7681 

Z68218 

Caenorhabditis elegans K01H12.1 

348 

67.143 

7682 

Y12065 

Homo oani pn cr Vi M/^v^ ^ 

j / jU 

9o . o3 3 

7683 

AF110956 

Homo aani #n»n a QTTMO— 1 arl-i -i r-» f t n /~t pn 01 iVtim -J t- 

1 

^ o U 

1 AA AAA 

1UU . uuu 

7684 

AL.022476 

Homo saDiens d»J323M?S > S> 1 (novpl nrnt" pin 
similar to KIAA0173 and worm Tubulin Tyrosine 
Ligase) (isoform 1) 


1 fifi fififi 

1 u u • UUU 

7685 

AL050373 

Homo sapiens hypothetical protein 

2230 

100.000 

7686 

AF151879 

Homo sapiens CGI -121 protein 

1092 

1 00 000 

7687 

AF093680 

Homo sapiens transcription factor IIB 

1281 

100.000 

7688 

AL080125 

Homo sapiens hypothetical protein 

4064 

100 . 000 

7689 

X57814 

Homo sapiens immunoglobulin lambda light chain 

1483 

96.203 

7690 

AF153609 

Homo sapiens serine/threonine protein kinase 
sgk 

1878 

67 . 561 

7691 

AF056617 

Homo sapiens BWSCR2 associated zinc- finger 
protein BAZ1 

4365 

100.000 
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/ O 4* 


riomo sapiens a ipna-aaaucin |IT10C2, 
alternatively spliced} 

209 

97 . 059 

7fi 93 

' O 7 J 

ZVT.n 71 OOP 

nomo sapiens auiu^jDiu .0.1 iKing ringer 

px.Ul.Cl.il J. \ i\. X IN \j J. / xCIMl? 1 ) ) 

2607 

100 . 000 

7694 

AL031324 

Schi zosaccharomyces pombe very hypothetical 

T*>T~Ot~ o ~i n 
J- W LCXli 

210 

40.909 

7695 

X83543 

numu od^/Xdlc> rilr AJj 

1102 

100 . 000 

7696 

AL035307 

Homo sapiens hypothetical protein 

1985 

89.736 

7697 

AF067136 

Hnmn 0 p» fi "I on Q ny/^h o i ti t~» n<""*aniVi ahaoo-1 >~ofri il a h r 
xiuttiu Dapxciia pjLULcxxi pxiuopna Lasc - J. LcyUlatOiy 

subunit 7 alnhA9 

o o o 

42 . 958 

7698 

AJ012008 

Homo sapiens DDAH protein 

1901 

100.000 

7699 

AL117428 

Homo sapiens hypothetical protein 

3990 

100. 000 

/ / \J\J 


Mus musculus bromodoma in- containing protein 
BP75 

1137 

36 . 013 

7701 

U713 83 

rioiinj sapiens kjd jjinainy procein-^ 

3754 

99.819 

7702 

AF067406 

nuiuu oopicllo VdaLUldi aUIlCDlOIl UrOLclIi 1 f 
ocuixuai uaz. J.UC ocllolLl vc dluj.llt; UAlUdbc ; COppGlV" 

containing amine oxidase homolog 

5178 

97 . 772 

7703 

X74837 

Homo sapiens Man9-marmosidase 

4199 

99 . 840 

77 04 

X044 94 

riuino sapiens precursor poiypepcioe 

2437 

100 . 000 

7705 

API 3 5157 

numu octpicuo ccjiup xeiiieiiL. i_iq /\ cnam precursor 

1718 

100 . 000 

7706 

AB009282 

Homo sapiens cytochrome b5 

968 

99.315 

7707 

sJ j A. _> *± 1 

Homo sapiens human formiminotransf erase 
cycioaeaminase ucca/protein, carooxy- terminal 
end 

928 

100 . 000 

7708 

AF1 518 93 

ill. X J X O J/ J 

nuiuu oapxciio v»Vj1"1JO piULcin 

947 

98 . 684 

7709 

Y00752 

ndLLUb nui vtiyiLUb senne aenycuracase v/\/v i — 
327) 

1281 

61 . 562 

7710 

AF159133 




7711 

M94065 

Homo sapiens dihydroorotate dehydrogenase 

2551 

99.241 

7712 

AJ0 06 973 


"3 O O C 

100 . 000 

7713 

AC004528 

Homo sapiens R32184 1 

2891 

99.304 

7714 

LI 1702 

nomo sapiens pnospnoiipase jj 

5653 

99 . 761 

7715 

AXi050?flO 

rUiv ~J \J \j \J 

nuiuu oopxciio iiypuLXlcClLdl protein 

2481 

100 . 000 

7716 

\J O rx -A- 

v.aciiui.iiaiJuj.LXb exclaim similar co polyposis 
locus protein 1 (SPrDPl HUMAN, Q00765) 

558 

52 . 174 

7717 

AE001044 

A T~ In P» OfT 1 oVmiq "Pnl rri rliiQ r>aT*Kr»v\rl a a o» v* r» £i /nnf- 

mj. uiiacu^ xuxy luua LaijjOAyicscerase vesu~ 

1 ) 

258 

25 . 726 

7718 

Z71316 

Saccharomvces cerevisiae nop YTJT.04flw 

jIj 

Z o . \J2zt 

7719 

AF132954 

Homo saniPris PflT-^O r^r^ol" <=» *i n 

T Q O /I 

y y .638 

7720 

X71129 

Homo sapiens electron transfer flavoprotein 
beta sub unit 

1613 

99.608 

7721 

AL035602 

Arabidopsis thaliana putative protein 

840 

37.845 

7722 

AiT13 3 534 

xiuinu oapxcuo picXiyicttcQ KaD aCCcpuOx 1 \trtiJ\l\) 

1224 

99 .459 

7723 

X67250 

Rattus norvegicus n-chimaerin 

2184 

97 . 006 

7724 

£j / f o *± 

Ldenornaoax lis eiegans preaictea using 
ucucLiiiuci ; oxiiixidi xcy l.o uiosopmia KlM/\ 
binding protein squid (SW : SQDJDROME) ; cDNA EST 
yk638al.3 comes from this gene 

225 

31 . 356 

7725 

S82637 

Homo sapiens Ig lambda-like gene/beta- 
glucuronidase exon 11 homolog 

317 

100. 000 

7726 

Y07847 

Homo sapiens RRP22 protein 

1420 

100. 000 

7727 

X82224 

Homo sapiens glutamine- -phenylpyruvate 
aminotransferase 

2906 

100. 000 


$77 
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7728 

AL049989 

Homo sapiens hypothetical protein 

2730 

99 . 045 

7729 

U05784 

Rattus norvegicus light chain 3 subunit of 
microtubule-associated proteins 1A and IB 

777 

96 . 000 

7730 

AF056184 

Homo sapiens WS basic-helix-loop-helix leucine 
zipper protein 

1499 

99 . 142 

7731 

U82381 

Homo sapiens proline dehydrogenase/proline 
oxidase 

3335 

99 .802 

7732 

Y16346 

Homo sapiens COL1A1 and PDGFB fusion 
transcript 

99 

37 . 500 

7733 

S77099 

Drosophila pseudoobscura, Peptide, 14 9 aa Jan 
A 

343 

44 . 800 

7734 

AF118240 

Homo sapiens peroxisomal biogenesis factor 16 

2218 

99 . 405 

7735 

AF060862 

Homo sapiens unknown 

547 

96 . 296 

7736 

AC005917 

Arabidopsis thaliana putative transmembrane 
transport protein 

657 

29.472 

7737 

Z47356 

Caenorhabditis elegans similar to DNAJ 

1348 

58 . 8 92 

7738 

AC003028 

Arabidopsis thaliana unknown protein 

150 

2 9 . 825 

7739 

AC002550 

Homo sapiens Unknown gene product 

1244 

100 . 000 

7740 

AF095927 

Rattus norvegicus protein phosphatase 2C 

24 75 

95 . 153 

7741 

U14003 

Escherichia coli Kenn Rudd identifies as gpmB 

199 

35 . 156 

7742 

AJ224639 

Homo sapiens Surf5b 

12 90 

99 . 500 

7743 

AB004534 

Schizosaccharomyces pombe pi015 

246 

25 . 105 

7744 

AB002368 

Homo sapiens KIAA03 70 

5083 

100 .000 

7745 

X64002 

Homo sapiens RAP74 

3384 

99 . 807 

7746 

AF070572 

Homo sapiens unknown 

4096 

100 . 000 

7747 

Z96932 

Homo sapiens nuclear autoantigen fo 14 kDa 

735 

99.160 

7748 

AL117458 

Homo sapiens hypothetical protein 

2341 

100 . 000 

7749 

X69910 

Homo sapiens P63 protein 

3775 

99.336 

7750 

U80736 

Homo sapiens CAGF9 

2129 

99 . 091 

7751 

AC005609 

Homo sapiens KIAA034 5-like 4 

5346 

99 .279 

7752 

Z99112 

Bacillus subtilis similar to hypothetical 
proteins 

506 

34 . 127 

7753 

AB014514 

Homo sapiens KIAA0614 protein 

1073 
6 

100.000 

7754 

AF097518 

Homo sapiens liver-specific transporter 

3611 

100 . 000 

7755 

U47101 

Homo sapiens NifU-like protein 

779 

100 . 000 

7756 

AJ005897 

Homo sapiens JM5 

2429 

100 . 000 

7757 

AC004997 

Unknown match to ESTs AA667999 <NID:g2626700) , 
AA165465 (NID : gl74 14 81 ) , Z45871 (NID : g575105 ) , 
and 

3045 

100 . 000 

7758 

Y16790 

Homo sapiens keratin type I 

2 592 

99 . 239 

7759 

AF091621 

Catharanthus roseus ubiqui tin- conjugating 
enzyme E2 

513 

51 . 370 

7760 

Z98981 

Schizosaccharomyces pombe hypothetical protein 

o o c 

•JO Atzr\ 

7761 

V00148 

Caenorhabditis elegans unnamed protein product 


O Q C\"> A 

7762 

AL022729 

Homo sapiens hypothetical protein 


QQ QO C 

7763 

AF091087 

Homo sapiens unknown 

~i n q 

i n n nn n 
1UU . uuu 

7764 

AL049746 

Arabidopsis thaliana putative protein 

862 

65 . 746 

7765 

AJ222 96 9 

Mus musculus S-periaxin 

156 

2 7.83d 

7766 

AJBU14 b 

nOUlO Sapicllo XvlftrtUDZ D X. <J U tJ _L H 

5286 

99 . 874 

7767 

AC002342 

Arabidopsis thaliana Dreg- 2 like protein 

348 

32 .075 

7768 

AF151855 

Homo sapiens CGI- 97 protein 

1538 

96.400 

7769 

U41559 

Caenorhabditis elegans No definition line 
found 

311 

26 .724 

7770 

AF025459 

Caenorhabditis elegans No definition line 

261 

28 .326 


ST8 
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found 



7771 

M74555 

Mus musculus house -keeping protein 

1452 

53 .061 

7772 

Z50194 

Homo sapiens PQ-rich protein 

2794 

96 .750 

7773 

X79888 

Homo sapiens AU-binding protein/Enoyl-CoA 
hydra tase 

514 

59.504 

/ / /4 

U3 7251 

Homo sapiens Description: KRAB zinc finger 
protein; this is a splicing variant that 
contains a stop codon and frame shift between 
the KRAB box and the zinc finger region; 

rlCLIiUU . LUllLcpLUal L X. dllfc} J. cl L XL>I1 HUppi led 

£»nt* no v 

254 

43 .689 

7775 

D42084 

Homo sapiens KIAA0094 gene product is related 

t~ O R ppypvi Ql 3P m <^ t~ Vl l on 1 tio ri m H n nriorit" i Ha on 
. ucicviaiac 1 1 ic Uiixuiixilc cllllXi 1L>JLvcJL> L lUaotj . 

2745 

100.000 

7776 

AT.022117 

fUJV <i« JL± / 

C Vl "5 *7 0<3£1 f~* V> .£1 rrmivp o ct nnmKo Vi"\/T"\r"i f~ Viol- nvrkf a-i n 


41 . 379 

7777 

AF173378 

Homo sapiens 60S acidic ribosomal protein PO 

1567 

99.582 

7778 


rxLM llLs ^ctjJ-Lcllo UUUcl-tlilUy 1 L,L»\ UclLa lSOITlcraSc 

i y d d 

100 . 000 

7779 

V_/ \_/ / ^s, o x 

7\ ■yo V\ -i rtt*i o ■> ex fr* Vi a 1 t ana t^i if al-i /*1 -? oo 3 
/ilaUlUU^oXU LlldX XclllcL puCatlvc UXbcdSc 

O ~% C 

39.640 

7780 


Homn c: r» r~v -J pn cj T7MD — PMD V" t n n a a 
nLMUL> oa^lclis \JVltr v-rlf ivXllaoc 


100 . 000 

7781 

U8 8173 

Lacnuiiiauuinj} cx t;y ctiio wcaJv 0J.utJ.Xc1X. iLy t.O 

RT*ahi H on i e; fVl^T -5 A Y"l ;=i 11 V*>1 rni i hi n-1 i lr a nT-r^fr- oi n Q 
niauiuu^Dio uiiaxxctlia UUlvjUlUlil llAc pi ULci.Il O 


ou . aZZ 

7782 

X95826 

Homo saDiens mono-ADP-ribnqvl t"T"Pin cfppvpi «=:c» 

X / o o 

QQ OCT 
3? . /Dl 

7783 

M12098 

Rat" t~ 1 1 nnTVPCfi rues mvo ci n hpa \r\s rhai n 
u u v*iO iiwii vc^j^i^uo iiiy wo J.11 licet v y i^iidXil 

X D f* 

O A /ICQ 

7784 

AF1518 99 

Unmn qnni a — 1 A 1 nr*o t- <=k -i n 
nuuiu oapxcilo V_ vj X X *± X pLULclll 

Q A Q 

Q/1 T Of" 

7785 

AF177862 

Hnmo cjafji pnq TTM1 nrnhPin 

X U J u 

1 fin nnn 

7786 

M97589 

Homo ^airi pn«! v»Y*oc3fr" a t~ i r» £»r»-irl r^VincjT-vVir^t-r^c!^ 
iiuiuu " a r c ^ ptuotat Aw ctwXU LylH-'tiJL/IIct Lab t: 


^ / . z y r> 

7787 

X543 52 

1 1UO IIIUOLUIUO UUI 1 1" O L XL Uo 1*11 JO 

TT7Q 

y / . ooz 

7788 

AF04 9090 

JniL'iiiL' o c*jl/ x cilo Ldoclll Jvillaoc X yctllUUd j I i 


Q Q era 

y y . bo u 

7789 

U69263 

Homo saDiens mat" ri 1 i n - O nrppurcnr 

llWIll^ OUL/ J. W11L7 1 1 Id 1— J- J 1 L X 1 U1CLUJ.OU1 

o o 

o z . /of 

7790 

AF053356 

Homo cani *=»tic: HPP * qnl i ppva-ri anf" V% 

X I> i7 17 

1UU .UUU 

7791 

X54162 

Homo qani one; (\A. \C f~\ ^>iit*nant" i cioy\ 


y y . j u x 

7792 

AF184939 

Homo sapiens LDL induced EC protein 

575 

52 . 023 

7793 

X75593 

Womo csiiT*i"i one T*i»V» 1 ~3 

inn 
x .5 Z U 

1U0 . uou 

7794 

AF1186 70 

T-Tomo cam on <3 n^nh a n r\rr\ hoi n - f~* /^vi >- q /— • Qr s ^ > - 
riL'iiiL' oapicllo kji^ yJlLcLlk \j pxULclIl~L.OUpicU icCcpLOi 

O A Q »C 

1UU . UUU 

7795 

AL11763 9 

Homo o^t*» S on c: Vi\7Y*>o1~ V» o t~ i T i-iv-r^f- o i -n 
nuiinj oapxcUo nytJL»L-Ilc L, lLaX kyJL ULcXil 

x / ±o 

y y . ozi 

7796 

U2 946 3 

Womo cani ones /^ir^Vi rnmo 

nuniu oapiciio Ly LULUIUUlc X>DD X 

1 C A 1 

y o .oUb 

7797 

AL110479 

Caenorhabditis elegans predicted using 

npn c» -f- i nHf*T* * nT"p1 i mi v~\r nroH i r^t*"i on 
vcnc j — Liiuci / picxxiiixiictiy jpxcUXwLXun 

916 

47.535 

7798 

AB024984 

Mus musculus Sid3 2 9p 

2524 

98 .649 

7799 

A*J005273 

T-TomO qani pn c: V i nl *7 
nun iu oapxciio I\X11X / 


1UU . UUU 

7800 

AF152463 

nul 1 \KJ w> cl XLUo xv /A O X t LJ X ULcJ.il 

141/ 

y y . b j 5 

7801 

M81231 

RaLLUb UUl Vcy ILUb LU- XIIlcJX y LXaUSiaLXOn piOQUCU 

of SP-D 

rift 

■3b . /jo 

7802 

X02530 

Homo Rarji (*Y\ c* par*! -\/ rpcnnn nrppnrqnr 

iiumu L3uyj.ciio Cai.Xjr x. C O V^'Vj'l 1 0 C UI CLU1 OU1 

polypeptide (aa - 2 1 to 77 ) 

D O X 

QQ Q Q O 

-70 . you 

7803 

AF121860 

Homo sapiens sorting nexin 10 

1341 

98 . 507 

7804 

AC006233 

AirabidoDsis thaliana hvnot*}"iPt" *i ral nrohp in 

**i ^ sjw^y x. unci X XCA11C* 11 V W v U11C WXWCIX ULUUCXll 

X J D 

A(Z A O Q 

7805 

AF100740 

Homo saDiens ARF-f^amHlv of T?acj vc* 1 ^» t* <=>H fiTPaqpq 

X X o / 

X u V . UUU 

7806 

X86030 

Vigna unguiculata extensin-like protein 

218 

31.405 

7807 

AJ243972 

Homo sapiens 6-phosphogluconolactonase 

1694 

99.612 

7808 

AF058953 

Homo sapiens ATP- specific succinyl-CoA 
synthetase beta subunit 

2741 

100.000 

7809 

Z82192 

Homo sapiens dJ18601.1 

820 

100. 000 

7810 

S58722 

Homo sapiens X- linked retinopathy protein {c- 
terminal, clone XEH.8c} 

272 

66 .667 


sit 


O O S 51 fl- 6. H- .£ ,„ O S S Li! S! 


/ O JL JL 


Homo sapiens origin recognition complex 

965 

100 . 000 

7812 


■tsos caurus sz. ica accessory procein 

2213 

100 . 000 

7813 

L37877 

r liujjasiuiciia neoiornidns riDosomai protein 
Li27 

241 

45 . 349 

7814 

AF116865 

* ius iHu.ov~Li.Lu.E» neayenoy-inceractiny procein 

4208 

93 . 858 

7815 

X66975 

Hnmn cj rj r% i ona nupl oav ■»**. Vi/^m 1 1 «=» «— ■>--/-> t- a -i n 
x xt_ji i iu oap J. ens llU^lcai xlJjUnUL.lcOpxOLcJ.Il 

3362 

95 . 332 

7816 

AF151864 

Homo sapiens CGI -106 protein 

1431 

100 . 000 

7817 


nomo sapiens aipna enuosuiiine 

798 

100 . 000 

7818 

AP110R20 

XT. IT J — L U JL u 

1'IUo IIIUoLUXUo JNVjZ Z* 

968 

56 . 884 

7819 

AB015339 

Homo sapiens HRIHFB2255 

868 

55,556 

7820 

jMv- UVJDZ J J 

Araoiuopsis cnanana nypotneticai protein 

147 

48 . 000 

/ Oii J. 

rir 11XO jO 

Homo sapiens sodium dependent phosphate 

LXaliopul Lei XoOXOXlU War 1 JU 

4595 

99 .855 

7822 

AF151835 

Homo sapiens CGI- 78 protein 

973 

59.167 

7823 

rtr W _/ \J _7 *± 

nomo sapiens Ar - 4 adaptor complex oeta4 
subunit 

4916 

99 . 865 

"7ft*v>_l 


Mus musculus 8hs2 0 protein precursor 

514 

65 .487 

782 5 

YQ4 one 

Lowpox virus Ul /!• 

261 

29 .412 

/ OZD 


Arabidopsis thaliana 60S ribosomal protein L30 

-L bUXUy 

628 

60 . 140 

7R77 

U Z O 4 1 Z 

Caenorhabdit is elegans similar to polyposis 
locus protein 1 (SP:DP1 HUMAN, Q00765) 

515 

47 . 134 

7828 

X93036 

Homo sapiens MAT 8 protein 

608 

100 . 000 

7QOQ 


Homo sapiens unknown 

302 

85 .417 


AdUz Ub JU 

Homo sapiens KIAA082 3 protein 

977 

43 . 765 

7Q"3 1 
/ O -3 X 

A Trtrtl Q 1 A 

J\*J U V JL o / 4 

Homo sapiens orf 

524 

98 . 701 

7832 

Z78542 

Unknown predicted using Gene finder; cDNA EST 
EMBL:Z14514 comes from this gene; cDNA EST 
b*Mi$Li : D 7 ±0 3 3 

261 

35.156 

7833 

AJ002309 

Homo sapiens synaptogyrin 3 

1471 

100 . 000 


D ZT C 

Homo sapiens CIB 

50 

38 . 095 


ALUUbUb / 

Homo sapiens Supported by Human EST H08032.1 
^JMIU : go /^«i>4 ; , mouse EST AA870042.1 
(NID:g2965487) , and genscan 

2465 

100.000 

/ O ^> D 


Homo sapiens no similarities to reported gene 

■»-\ v-/^\f~Ji \ f- q 
J^XUUULtS 

4509 

99 .851 

7837 

X91504 

Homo sapiens GTPase 

1339 

100.000 

7838 

A J J7 JOT 

nomo sapiens oiyi i ja protein 

669 

98 . 058 

/ O -J> J7 


xiomo sapiens au j4Bzi . j (putative novel 

jJXULCXll/ 

539 

100 . 000 

7840 

X66295 

Mus musculus Clq C chain 

1273 j 

73 .171 

7841 

M96264 

Homo sapiens galactose -1 -phosphate uridyl 
Lxans i erase 

2668 

100.000 

7842 

X7 73 07 

nomo sapiens D-rii«dh> serotonin receptor 

3109 

100 . 000 

7843 

/»V— v/ \/ *x O «& J 

/ixajjiuopsis tnaiiana unxnown protein 

244 

44.660 

7844 

Z49968 

Caenorhabdit is elegans weak similarity to the 
ycdbu ooi v if* protein \owiss rxoc accession 
number P4 0318) 

200 

40 . 909 

7845 

AF001602 

Homo sapiens paraoxonase 

154 

i no ono 

XUU . \J \J\J 

7846 

Y11411 

Homo sapiens pristanoyl-CoA oxidase 

4620 

99.000 

7847 

AJ235270 

Rickettsia prowazekii GLUTAMYL- tRNA 
AMIDOTRANSFERASE SUBUNIT A (gatA) 

1206 

40.562 

7848 

AF181856 

Rattus norvegicus tRNA selenocysteine 
associated protein 

1941 

99.303 


,i. o o a 4" & a+rs „, id a: a 6, o s=s 


7849 

AL035398 

Homo sapiens dJ796I17.2 (CGI-51) 

3174 

99. 787 

7850 

X53064 

Homo sapiens small proline- rich protein 

628 

100.000 

7851 

X04366 

Homo sapiens CANP, large subunit (aa 1-714) 

4816 

100.000 

7852 

AF151834 

Homo sapiens CGI -76 protein 

2156 

99.077 

7853 

U89649 

Chlamydomonas reinhardtii Mrl9,000 outer arm 
dynein light chain 

319 

34 .454 

7854 

AL050008 

Homo sapiens hypothetical protein 

761 

56 . 923 

7855 

J05019 

Mus musculus high affinity IgE receptor beta 
subunit 

229 

28 .641 

7856 

AJ009985 

Homo sapiens annexin 31 (annexin XXXI) 

2147 

100.000 

7857 

AF115345 

Homo sapiens calcium- regulated heat stable 
protein CRHSP-24 

1017 

99. 320 

7858 

Z29328 

Homo sapiens Ubiqui tin- conjugating enzyme 
UbcH2 

1228 

100.000 

7859 

AF125175 

Homo sapiens angiopoiet in- related protein- 2 

3433 

100.000 

7860 

U29488 

Caenorhabditis elegans No definition line 
found 

729 

55.349 

7861 

AJ002030 

Homo sapiens progresterone binding protein 

1492 

100.000 

7862 

AL080097 

Homo sapiens hypothetical protein 

1835 

100. 000 

7863 

AJ249248 

Homo sapiens putative G protein-coupled 
Receptor 

2270 

100.000 

7864 

X07743 

Homo sapiens pleckstrin (AA 1-350) 

2360 

99.714 

7865 

X57158 

Gallus gallus CHOX M product 

907 

78 . 736 

7866 

X70649 

Homo sapiens member of DEAD box protein family 

5029 

100 . 000 

7867 

AF141309 

Homo sapiens polyamine modulated factor- 1 

1072 

98 . 788 

7868 

AF072128 

Mus musculus claudin-2 

1404 

91.304 

7869 

AF093204 

Gallus gallus unknown 

452 

62 . 626 

7870 

AC005162 

Homo sapiens probable carboxypeptidase 
precursor; 64% similar to P42660 
(PID:gl718107) 

2014 

100.000 

7871 

Z72576 

Saccharomyces cerevisiae ORF YGL054c 

357 

42 .446 

7872 

AF020797 

Homo sapiens AP-mu chain family member mulB 

2730 

99. 764 

7873 

AF145316 

Homo sapiens vacuolar proton pump delta 
polypeptide 

1515 

100.000 

7874 

X63417 

Homo sapiens IRLB 

1331 

100.000 

7875 

AC004472 

Homo sapiens PI . 11659 4 

2107 

95.798 

7876 

J00287 

Homo sapiens pepsinogen 

2579 

99.742 

7877 

AL022020 

Mycobacterium tuberculosis hypothetical 
protein Rvl920 

254 

33.835 

7878 

AB006628 

Homo sapiens KIAA02 90 

6072 

100.000 

7879, 

AF159063 

Homo sapiens SKDl-homolog 

2872 

100.000 

7880 

Y09333 

Rattus norvegicus mitochondrial very- long- 
chain acyl-CoA thioesterase 

2285 

74.066 

7881 

M59488 

Homo sapiens S100 protein beta subunit 

606 

100.000 

7882 

X98506 

Solanum tuberosum acetyl -CoA synthetase 

2624 

59.685 

7883 

AC005545 

Homo sapiens R26660 2, partial CDS 

199 

100.000 

7884 

AF022977 

Caenorhabditis elegans contains similarity to 
leucine-rich repeats (LRR) 

227 

36.885 

7885 

S62904 

Homo sapiens thiopurine methyltransf erase , 
1 PMT \ EC 2 . 1 . 1 . 6 7 | 

1692 

100.000 

7886 

AC006153 

Homo sapiens similar to Aquifex aeolicus GTP- 
binding protein; similar to AE000771 
(PID:g2984292) 

1283 

88.793 

7887 

AJ001016 

Homo sapiens RAMP3 

1044 

100.000 

7888 

AL031633 

Caenorhabditis elegans cDNA EST yk404dl.5 

530 

43 .243 
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comes from this gene; cDNA EST yk404dl.3 comes 
from this gene; cDNA EST yk672a9.3 comes from 
this gene 




AJ1UUIO45 J 

Archaeoglobus fulgidus A. fulgidus predicted 

luuii x t-y _Luii Aril / 0 

206 

35 . 000 

7ft 90 

/ O 27 V/ 


nuiutj saj^/iciio kyx ui_trcit>uiiit; 0 Liu ixn x. c v»y 

1682 

100 . 000 

/ O y 1 


Homo sapiens nypocnecicai protein 

2713 

100 . 000 


A J J / 27 27 

raoincj sapiens macropjidye inziammacory protein— 
2 alpha precursor 

682 

100 . 000 

7fl Q'} 

/-IF 1DXO J J 


1546 

100 . 000 

7894 

X57802 

Homo sapiens immunoglobulin lambda light chain 

1439 

92 .704 

/ 0 y o 

V -J C O A O 

Homo sapiens protein tyrosine kinase-receptor 

6755 

100 . 000 

/ O y © 

AT HOT A cr "> 

Homo sapiens aJb2iDii . 1 (putative protein) 

335 

100 . 000 

*7 Q Q*7 

ATI T C\d A C 

Homo sapiens candidate tumor suppressor p3 3 
ING1 homolog 

1683 

100 . 000 


vn q c c c: 

Homo sapiens TJDP-GalNAc : polypeptide N- 
acecyiy aiaccosammyicrans z erase 

4280 

99.839 

7RQQ 

/A.r* u *± j> J 0 U 

Homo sapiens lymphocyte -specific protein 1 

293 

100 . 000 

7900 

AF061023 

Gallus gallus ChTl 

180 

20.879 

Tom 

/ y u j. 

TTO Q A 1 *> 
U2 

Caenorhabdit is elegans similar to polyposis 

lOCUS protein 1 (JbPrUPl HUMAN, Q00765) 

543 

49 . 080 


/\r ±00 / / Z 

Homo sapiens surfeit 6 protein 

2301 

99 . 446 

/ iJUo 

T T CQQ1 

JLii d y y x 

Homo sapiens thymidylate kinase 

1376 

98 . 585 

/ y u<i 

VI "JOT C 

Homo sapiens f arnesylated-proteins converting 

2269 

100 . 000 

7905 

Z22555 

Homo sapiens CLA-1 

3422 

99.607 

7906 

AL035521 

Arabidopsis thaliana putative protein 

417 

41.772 

/yo / 

AF 026246 

Homo sapiens HERV-E integrase 

498 

86 . 250 

/908 

Al»1174 73 

Homo sapiens hypothetical protein 

2778 

100 . 000 

1 Q A Q 

790 9 

AI_il09736 

Schizosaccharomyces pombe WD repeat protein 

822 

41 .319 

/y iu 

AT "11 C\ A *7 *7 
ALlll04 /7 

Caenorhabdit is elegans predicted using 
benennaer; ouna kttsi yKi>bigb.3 comes rrom tnis 
gene 

1000 

39 . 484 

7911 

nliut 27 -7 *± O 

rnjuitj oapiciio nyvjtji_iitii lcjcxx protein 

3 936 

99 . 4 86 


AVI; 

T-Ii— *mrt earn ai^i a T OO 
ilUlllU bapXcIlS XjK-O 

174 7 

98 . 519 

7913 

X982 53 



lOO . 000 

7914 

AF151816 

Homo sapiens CGI -58 protein 

2412 

99.713 

791 ^ 

/ 2? x 

T.I Qfi ft 4 
XjX 270 O *± 

rlOiTlO Sapiens KalllSCaCin 

1128 

43 . 972 

7Q1 £ 

M.U Z 0 O !7 / 27 

xiomo sapiens annexm aiu procem 

2198 

99 . 074 

701 7 


xionio sapiens jviaau^ou 

1989 

100 . 000 

791 ft 

/4..T U4 D D v D 

X1Q->IIIU bapXcilo ^ZlOXIft 

1091 

99 . 367 

7919 

AF119297 

Homo sapiens neuroendocrine- specif ic protein- 

11A.C piULCXIl X 

1482 

99.576 

7920 

AF151906 

Homo sapiens CGI -148 protein 

1184 

98.324 

7921 

AL121733 

Homo sapiens hypothetical protein 

414 

36.946 

7922 

AF094516 

Homo sapiens El -like protein 

4764 

100.000 

7923 

AL117183 

Schizosaccharomyces pombe very hypothetical 
protein 

191 

33 .113 

7924 

Z50028 

Caenorhabditis elegans cDNA EST yk321h8.5 

UUlllc o XXvJIll Lilla y til It; , V^UlM/\ fio 1 Hj I v 1x5 Xj r Ut> O O J7 o 

comes from this gene; cDNA EST yk395f9.5 comes 
from this gene; cDNA EST yk360fl2.5 comes from 
this gene 

716 

33.915 

7925 

U16307 

Homo sapiens glioma pathogenesis -related 
protein 

437 

38.953 


VST- 
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7926 

Y10696 

Homo sapiens INE1 

370 

98. 039 

7927 

X13956 

Homo sapiens 9kD protein (AA 1-82) 

243 

91.667 

792 8 

Z46789 

Bos taurus cylicin II 

202 

24.590 

7929 

AL050137 

Homo sapiens hypothetical protein 

2921 

99.769 

7930 

Z66521 

Caenorhabditis elegans similar to 
mitochondrial RNA splicing MSR4 like protein; 
cDNA EST EMBL:C09217 comes from this gene 

1012 

51.333 

/ y.5 j. 

iVLiX-L / 54 U 

Homo sapiens hypothetical protein 

856 

96.875 

TOT) 

AB02 8 954 

Homo sapiens KIAA1031 protein 

6512 

100.000 

*7 Q "5 T 

X5574 0 

Homo sapiens 5 1 -nucleotidase 

3833 

99. 826 

/ yj 4 

At 132174 

Drosophila melanogaster unknown 

361 

29. 310 

/ 5 

A01046 

Homo sapiens lipase 

2717 

100.000 


Ab /4 

Homo sapiens TAG-l/axonin-1 

6974 

99. 519 

Ton 

Ar UloU34 

Homo sapiens endothelin converting enzyme- 1 

147 

36 . 364 

/ yj o 


Mus mus cuius unknown 

1142 

66. 192 

7939 

AB001684 

Chlorella vulgaris ORF54d 

105 

53 . 125 

794 0 

L08069 

Homo sapiens DNAJ homologue-2 

2030 

73.618 

7941 

X82 8 95 

Homo sapiens DLG2 

3799 

99. 826 

7942 

Y14768 

Homo sapiens V-ATPase G-subunit like protein 

753 

100.000 

7 94 3 

AB013721 

Oryctolagus cuniculus mitsugumin 23 

1374 

88.477 

7944 

U72970 

Sus scrofa calcium/calmodul in- dependent 
protein kinase II isoform gamma-B 

3487 

100.000 

7945 

AL110235 

Homo sapiens hypothetical protein 

1099 

99. 387 

7946 

L13291 

Homo sapiens ADP-ribosylarginine hydrolase 

1035 

45. 915 

7947 

AF125096 

Homo sapiens HSPC042 protein 

697 

100.000 

7948 

D50917 

Homo sapiens The KIAA012 7 gene product is 
novel . 

244 

26.923 

7949 

AF085361 

Homo sapiens HSPC032 

2043 

100.000 

7950 

M20030 

Homo sapiens small proline rich protein 

126 

39.623 

7951 

U00032 

Caenorhabditis elegans No definition line 
found 

383 

43 .333 

7952 

X92814 

Homo sapiens homologous to rat HREV107 
(ACC.NO. X764 53) 

1065 

99.383 

7953 

Z50026 

Bos taurus phosphatidylcholine transfer 
protein 

1217 

81.132 

7954 

AF059531 

Homo sapiens protein arginine N- 
methyltransf erase 3 

3391 

99.805 

nor r 

7955 

AF009242 

Homo sapiens proline- rich Gla protein 1 

1493 

100.000 

7956 

U94586 

Homo sapiens NADH : ubiquinone oxidoreductase 
MLRQ subunit 

567 

100.000 

7957 

AL050352 

Arabidbpsis thaliana putative protein 

212 

41.333 

7958 

AL050100 

Homo sapiens hypothetical protein 

211 

53 . 968 

7959 

Tl T ft IT ft CT ~> 

A1j050253 

Homo sapiens hypothetical protein 

5426 

100.000 

7960 

AL034562 

Homo sapiens dJ684024.2 (prodynorphin (Beta- 
Neoendorphin-Dynorphin precursor, 
Proenkephalin B precursor) ) 

1709 

100.000 

7961 

AL117444 

Homo sapiens hypothetical protein 

2760 

100. 000 

7962 

AF144638 

Homo sapiens sphingosine-1 -phosphate lyase 

3786 

99. 824 

"7 Ci tT ~1 

7963 

Y152 86 

Homo sapiens vacuolar proton-ATPase subunit 
M9 . 2 

444 

71.250 

7964 

AF071062 

Homo sapiens disabled- 1 

3689 

99.457 

7965 

X83006 

Homo sapiens neutrophil gelatinase associated 
lipocalin 

276 

40. 367 

7966 

U34973 

Mus musculus protein tyrosine phosphatase- like 

1449 

95. 964 

7967 

X78627 

Homo sapiens translin 

1452 

100. 000 
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7968 

AF150087 

Homo sapiens small zinc finger- like protein 

557 

100.000 

7969 

AJ011895 ! 

Homo sapiens Nafl alpha protein 

4210 

100.000 

7970 

M27071 

Mus musculus protein phosphatase 1 

2177 

99.682 

7971 

M23234 

Homo sapiens P-glycoprotein 

8147 

99.609 

7972 

AJ132445 

Homo sapiens claudin-14 

1487 

99.103 

7973 

U47924 

Homo sapiens B-cell receptor associated 
protein 

1872 

100.000 

7974 

U66411 

Drosophila melanogaster putative type III 
alcohol dehydrogenase 

1307 

49.403 

7975 

AF151863 

Homo sapiens CGI- 105 protein 

2122 

99 . 682 

7976 

M36803 ! 

Homo sapiens hemopexin 

3365 

100.000 

7977 

X63629 

Homo sapiens p-cadherin 

5547 

99.759 

7978 

Y10376 

Homo sapiens SIRP-betal 

2636 

99.497 

7979 

M86510 

Schistosoma mansoni glutathione peroxidase 

431 

43 .125 

7980 

AJ238096 

Homo sapiens Lsm4 protein 

973 

100 . 000 

7981 

AF092128 

Homo sapiens putative transmembrane protein 
E3-16 

1798 

100.000 

7982 

X89969 

Bos taurus polyA binding protein II 

2062 

99.346 

7983 

AL110239 

Homo sapiens hypothetical protein 

1579 

99. 578 

7984 

AF007170 

Homo sapiens unknown 

3714 

99.645 

7985 

AF176116 

Homo sapiens Down Syndrome candidate region 1- 
like protein 2 

1617 

100.000 

7986 

AE000850 

Methanobacterium thermoautotrophicum 
transcriptional regulator 

541 

55.303 

7987 

AF098284 

Cloning vector pERV3 retinoic acid receptor 
RXR 

3127 

99.784 

7988 

AL023592 

Schizosaccharomyces pombe hypothetical protein 

241 

28.934 

7989 

AL035496 

Homo sapiens dJ437022.1 (novel VHS domain 
containing protein similar to predicted worm 
and human proteins) 

438 

100.000 

7990 

AF058448 

Homo sapiens herpesvirus entry protein B 

3219 

100.000 

7991 

Z48795 

Unknown similarity to a thioredoxin- like 
protein from Bacillus subtilis (Swiss Prot 
accession numbe 

322 

37 . 255 

7992 

AJ005866 

Homo sapiens Sqv-7-like protein 

1669 

99 .617 

7993 

AL049929 

Homo sapiens hypothetical protein 

2118 

99.697 

7994 

AL049699 

Homo sapiens dJ747H23.2 (novel protein) 

577 

35 . 069 

7995 

AB007952 

Homo sapiens KIAA04 83 protein 

1917 

100 . 000 

7996 

Y13323 

Homo sapiens disintegrin-protease 

3182 

100 . 000 

7997 

X15393 

Homo sapiens motinlin 

734 

100 . 000 

7998 

X61615 

Homo sapiens leukemia inhibitory factor 
receptor 

7398 

99 . 544 

7999 

AB015330 

Homo sapiens HRIHFB2 007 

1194 

98 . 844 

8000 

X04325 

Homo sapiens gap junction protein (aa 1-283) 

1909 

99.647 

8001 

U08215 

Mus musculus NST-1 

2986 

90 . 215 

8002 

X74504 

Mus musculus T10 

412 

83 . 784 

8003 

AL110244 

Homo sapiens hypothetical protein 

369 

29.204 

8004 

AB029004 

Homo sapiens KIAA1081 protein 

3233 

100.000 

8005 

Z50022 

Homo sapiens putative surface glycoprotein 

1269 

100 . 000 

8006 

L31649 

Saccharomyces cerevisiae cdc91 

616 

27 . 873 

8007 

AL117629 

Homo sapiens hypothetical protein 

796 

73 . 545 

8008 

AF156957 

Homo sapiens NTF2- related export protein NXT1 

935 

100.000 

8009 

U05321 

Homo sapiens X- linked PEST- containing 
transporter 

4196 

100.000 

8010 

U42580 

Paramecium bursaria Chlorella virus 1 Pro- and 

188 

37.975 i 


334 
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Glu-rich, PENPEV (lOx) ; similar to 
Streptococcus B antigen, corresponds to Swiss- 
Prot Accession Number P27951 



8011 

X53390 

Homo sapiens upstream binding factor (AA 1- 
764) 

5104 

100.000 

8012 

AB018268 

Homo sapiens KIAA072 5 protein 

3843 

99.825 

8013 

AF105201 

Homo sapiens G-protein alpha subunit 14 

2329 

100.000 

8014 

AL117587 

Homo sapiens hypothetical protein 

1151 

100.000 

8015 

AL031010 

Homo sapiens dJ422F24.1 (PUTATIVE novel 
protein similar to C. elegans C02C2.5) 

336 

100. 000 

8016 

U28016 

Mus musculus parathion hydrolase 
(phosphotriesterase) -related protein 

2065 

87.679 

8017 

AC004918 

Homo sapiens structure confirmed by Gens can, 
human EST AA447021 (NID : g2 159686 ) and mouse 
EST AA119040 (NID : gl676735 ) 

1219 

100.000 

8018 

AC004839 

Homo sapiens similar to IgD B-cell receptor- 
associated protein (BAP) ,- similar to S46997 
(PID:gl085495) 

1540 

100.000 

8019 

AL034488 

Caenorhabditis elegans predicted using 
Genef inder 

430 

69. 512 

8020 

AB018260 

Homo sapiens KIAA0717 protein 

4288 

99.841 

8021 

AB001568 

Arabidopsis thaliana phospholipid 
hydroperoxide glutathione peroxidase- like 
protein 

432 

40.244 

8022 

U56965 

Caenorhabditis elegans No definition line 
found 

664 

43 .182 

8023 

AB029334 

Halocynthia roretzi HrPET-1 

849 

37.903 

8024 

AJ006291 

Homo sapiens leucine rich protein 

1934 

100.000 

8025 

AL035304 

Homo sapiens hypothetical protein 

860 

100.000 

8026 

AF137386 

Homo sapiens plasmolipin 

1190 

100.000 

8027 

AL035678 

Arabidopsis thaliana putative protein 

2214 

63.916 

8028 

AF155114 

Homo sapiens NY-REN- 57 antigen 

2932 

100.000 

8029 

AF155196 

Rattus norvegicus mindin precursor 

1951 

85. 843 

8030 

AL109831 

Schizosaccharomyces pombe conserved 
hypothetical protein 

399 

24.664 

8031 

L27421 

Rattus norvegicus neuronal calcium sensor 

1257 

100.000 

8032 

Y12735 

Homo sapiens Dyrk3 protein 

3726 

99. 636 

8033 

M16279 

Homo sapiens antigen 

251 

36 . 788 

8034 

AC002985 

Homo sapiens R27090 2 

3158 

100.000 

8035 

AF007791 

Homo sapiens secreted cement gland protein 
XAG-2 homo log 

1131 

100.000 

8036 

AF092878 

Homo sapiens zinc RING finger protein SAG 

850 

100.000 

8037 

A64586 

unidentified unnamed protein product 

4352 

100. 000 

8038 

AF045644 

Caenorhabditis elegans No definition line 
found 

364 

39.623 

8039 

AF161081 

Homo sapiens activatory receptor PIRIIbeta 

1524 

100.000 

8040 

X64588 

Cricetulus longicaudatus cyclin B 

2435 

87.097 

8041 

AF132967 

Homo sapiens CGI- 33 protein 

1483 

97.826 

8042 

AL049758 

Homo sapiens dJ437M21.2 (novel putative GTP- 
ase activating protein for Arf similar to worm 
F07F6 .4) 

900 

63.592 

8043 

AF070657 

Homo sapiens glutathione S- transferase subunit 
13 homolog 

1522 

100. 000 

8044 

AF151848 

Homo sapiens CGI -90 protein 

299 

32 . 143 

8045 

L40357 

Homo sapiens thyroid receptor interactor 

692 

93.750 
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Y18 5 03 

Homo sapiens XAP-5-like protein 

2142 

100.000 

804 7 

Z97340 

Arabidopsis thaliana isomerase like protein 

780 

53 .241 

q r\ A o 
8048 

A1j080076 

Homo sapiens hypothetical protein 

2598 

97.297 


Mi J 14 1 

Bos taurus GTP-binding protein (smg p21B) 

1200 

100.000 


X03475 

Rattus norvegicus ribosomal protein L35a (aa 
1 - 110 ) 

731 

99.091 

q n i 

7VT n A Q C O O 

A±jUf± y o o o 

Homo sapiens hypothetical protein 

3230 

100. 000 



Dictyostelium discoideum ras protein 

519 

50.303 

p fi c; 

o U D O 


Homo sapiens calsenilin 

1713 

100 . 000 

8054 

AF132951 

Homo sapiens CGI - 17 protein 

2475 

98.961 

8055 

AF 151842 

Homo sapiens CGI -84 protein 

1310 

100.000 

oUbo 

TV T -I -1 S~ f~ 1 

Alill7661 

Homo sapiens hypothetical protein 

4447 

99.558 

o n rr ,- 7 

7\ n*n C O a r r 

At 15 9055 

Homo sapiens leucine zipper-like protein 

139 

81 .481 

ft a r o 

oUbo 

7\ T7 1 -1 j| T O T A 

AF14 3 859 

Mus musculus DEBT- 91 

1860 

96.622 

8059 

AL117195 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

525 

34 . 746 

O Ar a 

8 06 0 

AF03 956 8 

Homo sapiens vesicle trafficking protein 

142 

73 . 913 

8061 

AL022316 

Homo sapiens bK126B4.3 (novel protein) 

1869 

100.000 

8062 

Au 2 4 5709 

Homo sapiens Akt-3 protein 

3181 

98 .330 

8063 

U82382 

Homo sapiens PIN1 pept idyl -prolyl cis/ trans 
isomerase -like 

124 

55.882 

o n a a 
8064 

Z14122 

Xenopus laevis XLCL2 

585 

77 .885 

806 5 

tti Art r* r~ 

U70855 

Caenorhabditis elegans similar to the RAS gene 
family 

1684 

44.302 

806 6 

AF1524 98 

Homo sapiens protocadherin beta 5 

5120 

99.245 

806 7 

Y13647 

Homo sapiens stearoyl CoA desaturase 

2458 

99.443 

oUbo 

At* lbl83b 

Homo sapiens CGI -78 protein 

1573 

98 .780 

806 9 

X6 7712 

Psychrobacter immobilis triacylglycerol lipase 

339 

27 .799 

oU / U 

V" C C T C /I 

Abb /64 

Homo sapiens llbeta-hydrolase precursor 

3358 

99.602 

oU / l 

At 1 34726 

Homo sapiens NG36 

1215 

100 . 000 

8072 

AF070637 

Homo sapiens unknown 

537 

37 .549 

8073 

AF0844 57 

Homo sapiens beta- cop homolog 

6071 

99.265 

8074 

Ali050190 

Homo sapiens hypothetical protein 

1996 

99.675 

807 5 

AF131746 

Homo sapiens Unknown 

873 

100.000 

8076 

A184 11 

Homo sapiens PIGF 

1055 

100. 000 

8077 

U97006 

Caenorhabditis elegans No definition line 
round 

173 

39.189 

8078 

AJ000217 

Homo sapiens CLIC2 

1636 

99.588 

8079 

AB030505 

Mus musculus UBE- lc2 

1365 

78 .707 

8080 

AB007191 

Homo sapiens AMY- 1 

649 

99 . 029 

8081 

AC004472 

Homo sapiens TERA HUMAN 

5315 

100.000 

8082 

AF077200 

Homo sapiens HSPC014 

917 

100.000 

8083 

Z81035 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to Yeast DNA-directed 
RNA polymerase I 13.7 KD polypeptide 
(SW:P32529) 

270 

41.964 

8084 

AL080118 

Homo sapiens hypothetical protein 

945 

94 .521 

8085 

L40410 

Homo sapiens thyroid receptor interactor 

1040 

100.000 

8086 

X63422 

Homo sapiens H (+) -transporting ATP synthase 

1035 

100. 000 

8 08 7 


i v ius muscuxus oupiibn 

173 

26 . 455 

8088 

AP000559 

Oryza sativa ESTs 

AU030008 (E50477) , AU07823 9 (E50477 ) correspond 
to a region of the predicted gene.; Similar to 
pept idyl -prolyl cis-trans isomerase 10 
&CELB0252 4 (P52017) 

770 

69.375 
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8089 

AF155111 

Homo sapiens NY -REN- 4 9 antigen 

1083 

99.401 

8090 I 

AL023553 

Homo sapiens dJ347H13.4 (novel protein) 

711 

100.000 

8091 

AL109978 

Homo sapiens hypothetical protein 

2154 

100.000 

8092 

Z35597 

Unknown Weak similarity with sea squirt 
nidogen precursor protein (blastp score 71) ; 
CDNA EST EMBL: 

770 

35.825 

8093 

J00073 

Homo sapiens alpha- cardiac actin 

2508 

100.000 

8094 

AF105365 

Homo sapiens K-Cl cotransporter KCC4 

7183 

99.908 

8095 

M77172 

Homo sapiens -zinc finger protein 

198 

57.778 

8096 

Z50749 

Homo sapiens yeast sds22 homolog 

2266 

100.000 

8097 

AB028966 

Homo sapiens KIAA1043 protein 

8243 

99.762 

8098 

M17783 

Homo sapiens glia-derived nexin precursor 

2588 

100.000 

8099 

AL021481 

Unknown similar to WD domain, G-beta repeat (2 
domains); cDNA EST yk258d4.3 comes from this 
gene ; 

948 

42 .308 

8100 

AB002380 

Homo sapiens KIAA0382 

4913 

99.733 

8101 

M64749 

Homo sapiens orphan receptor 

2396 

98.619 

8102 

AL117558 

Homo sapiens hypothetical protein 

918 

99 . 320 

8103 

M62419 

Mus musculus clathrin-associated protein 

2778 

99.291 

8104 

AB006191 

Mus musculus cornichon-like protein 

548 

77 .660 

8105 

AL050345 

Homo sapiens hypothetical protein 

809 

100.000 

8106 

AB023185 

Homo sapiens KIAA096 8 protein 

3495 

97 . 753 

8107 

AF090402 

Mus musculus KH domain RNA binding protein 
QKI-5A 

2113 

99.691 

8108 

L03303 

Oryctolagus cuniculus small GTP-binding 
protein 

1300 

96 .244 

8109 

L11317 

Homo sapiens rhoG 

1305 

100. 000 

8110 

U00025 

Caenorhabditis elegans weak similarity to ATP 
synthase B chain 

689 

27 .554 

8111 

AL050286 

Homo sapiens hypothetical protein 

657 

100.000 

8112 

Y14768 

Homo sapiens lymphotoxin-beta 

1627 

100.000 

8113 

AL049654 

Homo sapiens hypothetical protein 

2709 

100. 000 

8114 

AF098807 

Homo sapiens lipoma HMGIC fusion partner 

1430 

100. 000 

8115 

Y12473 

Homo sapiens centrin 

1064 

99.401 

8116 

U77667 

Mus musculus tyrosine kinase 

3980 

93 .376 

8117 

Z35641 

Caenorhabditis elegans cDNA EST yk273d8.5 
comes from this gene 

425 

31.650 

8118 

AF102147 

Homo sapiens putative dimethyladenosine 
transferase 

2067 

100. 000 

8119 

Z69881 

Homo sapiens adenosine triphosphatase, calcium 

6546 

100. 000 

8120 

X80473 

Mus musculus rabl9 

757 

56 . 995 

8121 

AB023221 

Homo sapiens KIAA1004 protein 

3360 

100.000 

8122 

X03884 

Homo sapiens 20K polypeptide 

1442 

100.000 

8123 

AL023694 

Homo sapiens dJ511E16.2 (putative protein 
based on ESTs) 

850 

100.000 

8124 

M91669 

Homo sapiens autoantigen 

1063 
6 

99.673 

8125 

AF151892 

Homo sapiens CGI -134 protein 

975 

100.000 

8126 

AJ270205 

Entodinium caudatum putative 

phospha t idylinositol -4 -phosphate 5-kinase 

204 

39.326 

8127 

U09808 

Drosophila melanogaster cDNA and genomic are 
colinear,* similar to human connective tissue 
growth factor, GenBank Accession Number M92 934 

284 

38 .333 

8128 

AJ011400 

Bos taurus NADH : ubiquinone oxidoreductase 
bl7.2 subunit 

132 

30.000 


#87 
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812 9 

AF003999 

Mus musculus GS15 

619 

90 . 741 


7kt r\-y a c\ 

Ali079349 

Arabidopsis thaliana putative protein 

289 

30.962 


AoUU /oz>y 

Homo sapiens KIAA03 99 

5161 

100.000 


"VI "1 A C A 

Fugu rubripes LSFR2 protein 

308 

84 .615 


A96b9o 

Homo sapiens D1075-like 

1934 

100.000 


AF 07 8 84 5 

Homo sapiens 16 . 7Kd protein 

1030 

100.000 

8135 

Z99259 

Schizosaccharomyces pombe hypothetical protein 

148 

35.000 

8136 

AF0396 98 

Homo sapiens antigen NY-CO- 3 3 

4372 

98 .403 

813 7 

AF116865 

Mus musculus hedgehog- interacting protein 

4208 

93.858 

8138 

AL050371 

Homo sapiens hypothetical protein 

2726 

87.447 

813 9 

M20729 

Chlamydomonas reinhardtii calmodulin 

245 

39.316 

8140 

AF129812 

Homo sapiens candidate tumor suppressor 
protein NOC2 

2202 

100.000 

8141 

AL109665 

Homo sapiens SLP-1 

2575 

99.744 

8142 

AF085355 

Homo sapiens N- terminal acetyltransf erase 
complex ardl subunit 

1188 

100.000 

8143 

Y09858 

Homo sapiens unknown protein 

1576 

99.569 

8144 

AF133123 

Homo sapiens transcription factor IIIC102 

5780 

100.000 

8145 

D50918 

Homo sapiens The KIAA0128 gene is related to 
cdclO . 

2751 

100.000 

8146 

Y17849 

Homo sapiens ganglioside- induced 
differentiation associated protein 1 

2328 

98.603 

8147 

Z93241 

Homo sapiens dJ222E13.3.2 (PUTATIVE partial 
isoform 2) 

2761 

100.000 

8148 

AB028990 

Homo sapiens KIAA1067 protein 

4419 

99 . 855 

8149 

X89984 

Homo sapiens BCL7A 

1515 

100 . 000 

8150 

AB018337 

Homo sapiens KIAA0794 protein 

3263 

99.796 

8151 

AB018285 

Homo sapiens KIAA0742 protein 

8392 

100 . 000 

8152 

U53588 

Homo sapiens HCG V 

885 

100.000 

8153 

AF070657 

Homo sapiens glutathione S- transferase subunit 
13 homo log 

1522 

100.000 

8154 

AE001014 

Archaeoglobus fulgidus transcription 
initiation factor IIB 

182 

24 .000 

8155 

D13634 

Homo sapiens KIAA0009 

292 

29.268 

8156 

X62155 

Homo sapiens Myf4 protein 

1529 

100.000 

8157 

X16576 

Homo sapiens KUP protein 

2892 

99.540 

8158 

X07948 

Homo sapiens TP1 (AA 1-55) 

367 

100.000 

8159 

AF155107 

Homo sapiens NY-REN- 3 7 antiqen 

1300 

99.425 

8160 

AF059198 

Homo sapiens protein kinase/endoribonulcease 

6522 

99.693 

8161 

AL031320 

Homo sapiens dJ20N2.1 (novel protein similar 
to yeast and bacterial cytosine deaminase) 

1070 

99.367 

8162 

Z69637 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to E.coli hypothetical 
protein YCAC (SW : YCAC_ECOLI ) ; cDNA EST 
yk555dl2.3 comes from this gene 

313 

37.949 

olb J 

AB023167 

Homo sapiens KIAA0950 protein 

2290 

100. 000 

O JLO *i 


Homo sapiens cystinosin 

2445 

100 . 000 

OlDD 

Y15062 

Homo sapiens GalT4 protein 

2275 

100. 000 

olbo 

AjJ UIjS J / D 

Homo sapiens ATP-binding cassette transporter- 
1 (ABC- 1 ) 

1462 

99 . 955 

8167 

L13977 

Homo sapiens prolylcarboxypeptidase 

1249 

43 .653 

8168 

X52987 

Homo sapiens rap2b gene product (AA 1-183) 

1185 

99.454 

8169 

AC003002 

Homo sapiens R2 9515 1 

960 

100. 000 

8170 

AF156271 

Homo sapiens RING finger protein terf 

3273 

100.000 

8171 

L77967 

Ovis aries small proline-rich protein with 

103 

32 .609 
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paired repeat 



8172 

AC006085 

Arabidopsis thaliana Putative acyl-CoA:l- 
acylglycerol - 3 -phosphate acyltransf erase 

835 

38.369 

8173 

AC004382 

Homo sapiens Unknown gene product 

2119 

100.000 

8174 

AF129332 

Homo sapiens MUM2 

977 

100. 000 

8175 

U82267 

Oryctolagus cuniculus sarcosine oxidase 

2655 

99.487 

8176 

U37026 

Rattus norvegicus sodium channel beta 2 
subunit 

198 

30.769 

8177 

AF050145 

Homo sapiens iduronate-2 -sulf atase 

153 

25.899 

8178 

AF132000 

Homo sapiens TADA1 protein 

1446 

100.000 

8179 

AF131852 

Homo sapiens Unknown 

911 

100.000 

8180 

Y15057 

Homo sapiens STK9 protein 

6901 

99.903 

8181 

X82207 

Homo sapiens beta-centracetin 

2486 

100.000 

8182 

AF033107 

Grus americana B-G-like protein 

317 

32 .642 

8183 | 

AF107885 

Homo sapiens unknown 

303 

100.000 

8184 

AF112444 

Lupinus luteus L-asparaginase 

653 

41.121 

8185 

AF151848 

Homo sapiens CGI- 90 protein 

433 

33 . 624 

8186 

AJ012295 

Rhizobium etli apaG protein 

261 

40.476 

8187 

X94769 

Rattus rattus choline dehydrogenase 

2727 

88 .435 

8188 

AL049638 

Arabidopsis thaliana putative protein 

219 

27 . 982 

8189 

Z82244 

Homo sapiens bK286B10.2 (Heme Oxygenase 1 (HO- 
1, EC 1.14.99.3) ) 

1882 

100 . 000 

8190 

AC004955 

Homo sapiens supported by EST AA16 04 84 
(NID:gl735912) and Genscan 

1529 

99 . 574 

8191 

AJ007590 

Homo sapiens XRP2 protein 

2394 

100.000 

8192 

X99642 

Mus musculus HP1-BP74 protein 

1653 

92.958 

8193 

AJ010973 

Homo sapiens DEDD protein 

326 

36 .612 

8194 

U40952 

Caenorhabditis elegans C03B1.10 gene product 

230 

65 . 854 

8195 

X52011 

Homo sapiens muscle determination factor 

1633 

100 . 000 

8196 

AF003130 

Caenorhabditis elegans No definition line 
found 

894 

41 . 114 

8197 

AC004382 

Homo sapiens Unknown gene product 

1303 

68 . 132 

8198 

Y10571 

Homo sapiens dinG 

2206 

100.000 

8199 

AB002372 

Homo sapiens KIAA03 74 

1117 

43.750 

8200 

AL109630 

Drosophila melanogaster BACR7A4.h 

737 

32.083 

8201 

X65293 

Homo sapiens protein kinase C epsilon 

5079 

100. 000 

8202 

AB018268 

Homo sapiens KIAA0725 protein 

3852 

100.000 

8203 

U58761 

Caenorhabditis elegans C01F1.6 gene product 

617 

34 . 746 

8204 

U23139 

Caenorhabditis elegans similar to NIFS protein 
(nitrogen fixation) 

348 

41.317 

8205 

X99920 

Homo sapiens S100 calcium-binding protein A13 
(S100A13) 

614 

100.000 

8206 

AJ006276 

Homo sapiens transient receptor potential 
protein 

6153 

100.000 

8207 

Y07707 

Homo sapiens ITBA4 

170 

50.000 

8208 

AL117442 

Homo sapiens hypothetical protein 

1723 

100.000 

8209 

AF040964 

Homo sapiens unknown protein IT1 

3774 

99. 008 

8210 

Z14014 

Nicotiana tabacum Pistil extensin like 
protein, partial CDS only 

170 

52 .459 

8211 

AB020713 

Homo sapiens KIAA0906 protein 

5926 

99 . 567 

8212 

M89797 

Mus musculus Wnt-4 

2425 

98.860 

8213 

AF125044 

Homo sapiens ubiquitin- conjugating enzyme 
HBUCE1 

1036 

100. 000 

8214 

AJ245417 

Homo sapiens G5b protein 

1026 

100.000 

8215 

AF102548 

Mus musculus ATI receptor-associated protein 

804 

77 . 019 
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8216 

AF165429 

Arabidopsis thaliana protein phosphatase 2A 62 
kDa B 1 1 regulatory subunit 

430 

26 . 562 

8217 

AL022238 

Homo sapiens dJ1042K10.3 (novel protein) 

1665 

99. 588 

8218 

Z54271 

Caenorhabditis elegans F21D5.6 

220 

25 . 000 

8219 

AJ238236 

Rattus norvegicus ribosome associated membrane 
protein RAMP4 

283 

79 . 032 

8220 

AJ24 3 503 

Mus musculus m4 6 protein 

660 

82 . 906 

8221 

U43384 

Mus musculus gp91phox 

3675 

93 . 146 

8222 

AF076531 

Homo sapiens minK- related peptide 2; MiRP2 

683 

100.000 

8223 

U41278 

Caenorhabditis elegans F3 3G12.3 gene product 

421 

27 .407 

8224 

X58769 

Homo sapiens V alpha gene segment 

362 

83 . 582 

8225 

AL050159 

Homo sapiens hypothetical protein 

1691 

100.000 

8226 

AC003096 

Arabidopsis thaliana putative protein 
phosphatase 2C 

370 

40 . 606 

8227 

AF132174 

Drosophila melanogaster unknown 

1119 

52 . 308 

8228 

AL035306 

Homo sapiens hypothetical protein 

1690 

100 . 000 

8229 

AF04 906 9 

Pinus radiata No definition line found 

511 

57 . 031 

8230 

AF039687 

Homo sapiens antigen NY-CO-1 

2343 

100 . 000 

8231 

X76105 

Homo sapiens DAP-1 

724 

100 . 000 

8232 

AF039716 

Caenorhabditis elegans similar to ATP synthase 
B chain 

628 

55.357 

8233 

AJ243320 

Canis familiaris hypothetical protein 

294 

32.278 

8234 

Y00757 

Homo sapiens polypeptide 7B2 precursor 

1461 

99.057 

8235 

AB011106 

Homo sapiens KIAA0534 protein 

4833 

100.000 

8236 

Y11711 

Homo sapiens collagen type XIV 

195 

100.000 

8237 

AJ006470 

Homo sapiens cartilage-associated protein 
(CASP) 

2723 

100.000 

8238 

AB017112 

Mus musculus mCAC 

508 

33 . 227 

8239 

X82240 

Homo sapiens T cell leukemia/ lymphoma 1 

800 

100.000 

8240 

AJ130941 

Homo sapiens claudin-9 protein 

1442 

100.000 

8241 

Z73906 

Caenorhabditis elegans Similarity to 
B.subtilis YQJC protein (TR : G1303 954 ) ; cDNA 
EST EMBL:T01187 comes from this gene 

607 

70.229 

8242 

AF094761 

Mus musculus Rfxank 

506 

72 . 642 

8243 

X57985 

Homo sapiens histone H2B 

495 

93 . 023 

8244 

AP000062 

Aeropyrum pernix 152aa long hypothetical 
protein 

183 

30.857 

8245 

AF023130 

Homo sapiens Ras-GRF2 

8176 

99 . 919 

8246 

AC002505 

Arabidopsis thaliana unknown protein 

803 

59.239 

8247 

AB018352 

Homo sapiens KIAA08 09 protein 

8351 

100 . 000 

8248 

Au 223 35 5 

Rattus norvegicus mitochondrial dicarboxylate 
carrier 

1726 

89 . 474 
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Homo sapiens proline rich peptide P-B 

107 

35 . 294 

8250 

AF072933 

Homo sapiens Mad2-like protein 

1398 

100.000 

82 51 

L2 764 5 

Danio rerio growth-associated protein 

185 

37 . 500 

8252 

AF0984 81 

Gallus gallus cadherin 

648 

69 . 231 

82 53 

AF141377 

Mus musculus Ly- 6 /neurotoxin homolog 

682 

81 . 034 

82 54 

AB01 1157 

Homo sapiens KIAA058 5 protein 

2778 

100 . 000 

8255 

U2316 9 

Caenorhabditis elegans No definition line 
found 

1107 

51 . 724 

8256 

AF151901 

Homo sapiens CGI- 143 protein 

897 

99.270 

8257 

M93698 

Oncorhynchus mykiss ependymin 

249 

24 . 752 

8258 

AB014562 

Homo sapiens KIAA0662 protein 

4279 

99.398 

8259 

X68879 

Homo sapiens EMX1 

1050 

99. 346 

8260 

M92441 

Bos taurus ornithine decarboxylase 

1528 

54 . 126 
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8261 

M76424 

Homo sapiens carbonic anhydrase VII 

1742 

99.603 

8262 

U88958 

Rattus norvegicus neuritin 

244 

32.075 

8263 

Z99116 

Bacillus subtilis similar to hypothetical 
proteins 

223 

35.537 

8264 

M98539 

Homo sapiens prostaglandin D2 synthase 

1272 

100. 000 

8265 

AF005050 

Homo sapiens aspartyl aminopeptidase 

3160 

100. 000 

8266 

U31332 

Homo sapiens DP prostanoid receptor 

1901 

100. 000 

8267 

AC002130 

Arabidopsis thaliana F1N21.1 

275 

27 .807 

8268 

X67098 

Homo sapiens ORF1 

238 

97 .297 

8269 

D83198 

Homo sapiens homology to a plant EST:RICS27 53A 

1535 

99.559 

8270 

U32828 

Haemophilus influenzae Rd ribosomal protein S6 
modification protein (rimK) 

381 

31.308 

8271 

X82027 

Sus scrofa BM88 antigen 

654 

72.000 

8272 

AF154831 

Rattus norvegicus PV-1 

1816 

60 .860 

8273 

X54673 

Homo sapiens GABA transporter 

4063 

99.332 

8274 

AF026528 

Rattus norvegicus stathmin- like-protein RB3 

1215 

99 .471 

8275 

U35244 

Rattus norvegicus vacuolar protein sorting 
homolog r-vps33a 

3790 

96 .817 

8276 

AJ001189 

Homo sapiens oligophrenin 1 

5367 

100. 000 

8277 

D28595 

Escherichia coli unknown 

260 

24 .671 

8278 

D78255 

Mus musculus PAP-1 

638 

82 .203 

8279 

AB026054 

Homo sapiens brain finger protein 

4323 

100.000 

8280 

AL031394 

Arabidopsis thaliana putative protein 

376 

60.204 

8281 

AF001308 

Arabidopsis thaliana predicted glycosyl 
transferase 

327 

28 .986 

8282 

X91257 

Homo sapiens seryl-tRNA synthetase 

3400 

99.805 

8283 

AF125443 

Caenorhabditis elegans contains similarity to 
S. pombe phosphatidyl synthase (GB:Z2 8295) 

820 

40.922 

8284 

AB002359 

Homo sapiens KIAA0361 

9350 

99.927 

8285 

AF077202 

Homo sapiens HSPC016 

412 

100. 000 

8286 

AF132960 

Homo sapiens CGI -26 protein 

2162 

99.107 

8287 

AF133422 

Homo sapiens HMP19 protein 

1139 

100. 000 

8288 

AB012309 

Cyprinus carpio allograft inflammatory factor- 
1 

740 

78 .169 

8289 

AC002505 

Arabidopsis thaliana hypothetical protein 

403 

32 .472 

8290 

AL049730 

Arabidopsis thaliana putative protein 

680 

42 .616 

8291 

Z78418 

Unknown cDNA EST EMBL:D71020 comes from this 
gene; cDNA EST EMBL:D73593 comes from this 
gene ; cDNA 

814 

47 .791 

8292 

AL050273 

Homo sapiens hypothetical protein 

603 

98 .947 

8293 

A52140 

unidentified HUMAN NDR 

1000 

98 .675 

8294 

AF151907 

Homo sapiens CGI -14 9 protein 

1385 

100.000 

8295 

AL031775 

Homo sapiens dJ30M3.3 (novel protein similar 
to C. elegans Y63D3A.4) 

2425 

100. 000 

8296 

AF065382 

Yersinia pestis adenylate kinase 

267 

30.556 

8297 

Z30425 

Homo sapiens orphan nuclear hormone receptor 

2322 

98.584 

8298 

Z33905 

Homo sapiens 4 3kD Acetylcholine receptor- 
associated protein (Rapsyn) 

2775 

100. 000 

8299 

Y16787 

Homo sapiens keratin, type I 

2786 

100.000 

8300 

M21103 

Ovis aries BIIIB4 high-sulfur keratin 

648 

82 .653 

8301 

X76488 

Homo sapiens sterol esterase 

1680 

60.957 

8302 

X14420 

Homo sapiens prepro-alpha- 1 type 3 collagen 

1099 
5 

99.864 

8303 

Z85986 

Homo sapiens dJ108K11.3 (similar to yeast 
suppressor protein SRP40) 

1461 

78.161 
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8304 

U18762 

Rattus norvegicus retinol dehydrogenase type I 

1148 

52.581 

8305 

AF072467 

Homo sapiens unknown 

3245 

100.000 

8306 

X63745 

Homo sapiens KDEL receptor 

1374 

100.000 

8307 

AF117646 

Homo sapiens long CBL-3 protein 

3331 

99.789 

8308 

AL117557 

Homo sapiens hypothetical protein 

1780 

100.000 

8309 

AL117495 

Homo sapiens hypothetical protein 

2932 

99.753 

8310 

AC002301 

Homo sapiens Homolog of rat Zymogen granule 
membrane protein 

1120 

100.000 

8311 

U60024 

Ovis aries BIIIA3 

69 

50.000 

8312 

AF188285 

Homo sapiens bone morphogenetic protein 9 

2890 

100.000 

8313 

AF112982 

Homo sapiens group IID secretory phospholipase 
A2 

1113 

100.000 

8314 

297208 

Schizosaccharomyces pombe hypothetical protein 

683 

47.009 

8315 

X16560 

Homo sapiens precursor polypeptide (AA -16 to 
47) 

420 

100.000 

8316 

X04085 

Homo sapiens catalase 

3642 

100.000 

8317 

M13444 

Mus musculus alpha- tubulin isotype M-alpha-6 

3047 

100.000 

8318 

AF042276 

Pseudomonas put i da o251 homolog 

752 

45 . 818 

8319 

U88958 

Rattus norvegicus neuritin 

931 

98 . 592 

8320 

L40806 

Neurospora crassa Restriction enzyme 
inactivation of met- 10 complementation in this 
region. Sequence similarity to S. cerevisiae 
chromosome VIII cosmid 9205, accession no. 
U10556 CDS residues 22627-24126 

821 

33 .623 

8321 

AC004131 

Homo sapiens Unknown gene product 

919 

40 . 759 

8322 

U19596 

Mus musculus pl8 protein 

207 

35.115 

8323 

U21549 

Mus musculus Ac3 9/physophilin 

1623 

68.208 

8324 

X79888 

Homo sapiens AU-binding protein/Enoyl-CoA 
hydratase 

2181 

100.000 

8325 

AF036694 

Caenorhabditis elegans No definition line 
found 

237 

25.478 

8326 

D45370 

Homo sapiens unknown product specific to 
adipose tissue 

462 

100.000 

8327 

M88469 

Rattus norvegicus f-spondin 

5595 

96.654 

8328 

X87237 

Homo sapiens a-glucosidase I 

5754 

99.283 

8329 

AC007231 

Arabidopsis thaliana putative disease 
resistance protein 

632 

34 .393 

8330 

AF161181 

Mus musculus P55T protein 

3458 

97.593 

8331 

AF117814 

Mus musculus odd- skipped related 1 protein 

1218 

66 . 667 

8332 

D13126 

Rattus norvegicus neural visinin-like Ca2+- 
binding protein type 3 

264 

32.593 

8333 

D14849 

Mus musculus meiosis-specif ic nuclear 
structural protein 1 

2498 

77.393 

8334 

AB020711 

Homo sapiens KIAA0904 protein 

6583 

99.900 

8335 

AL008729 

Homo sapiens predicted protein dJ257A7.2 

878 

98.519 

8336 

Z72510 

Unknown similarity to yeast UTR3 protein 
(Swiss Prot accession number P21374) ; cDNA EST 
EMBL:D72822 

800 

49.446 

8337 

Z97992 

Schizosaccharomyces pombe hypothetical protein 

223 

47.761 

8338 

Y14768 

Homo sapiens I Kappa B-like protein 

2673 

100.000 

8339 

AF069737 

Xenopus laevis notchless 

2762 

82 .452 

8340 

Z98944 

Schizosaccharomyces pombe hypothetical protein 

345 

31.884 

8341 

AF144235 

Homo sapiens hypothetical protein SBBI42 

1103 

100.000 

8342 

Y00796 

Homo sapiens LFA-1 alpha subunit precursor (AA 
-25 to 1145) 

7821 

99.829 
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8343 

L10910 

Homo sapiens splicing factor 

1526 

59.773 

8344 

AB014589 

Homo sapiens KIAA06 89 protein 

3783 

100.000 

8345 

Y12653 

Homo sapiens diubiquitin 

1068 

99. 394 

8346 

AL034397 

Homo sapiens dA159Al.l (novel protein) 

2669 

100 . 000 

8347 

Z70750 

Caenorhabditis elegans similar to vanadate 
resistance protein transmembranous domains; 
cDNA EST yk664g4.3 comes from this gene 

1213 

58.457 

8348 

AC002045 

Homo sapiens Unknown protein product 
CIT987SK-A-589H1 1 splice form 1 

2224 

98 . 784 

8349 

AL034488 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk433c6.3 comes from this 
gene; cDNA EST EMBL:D72601 comes from this 
gene; cDNA EST EMBL:D75524 comes from this 
gene; cDNA EST yk433c6.5 comes from this gene 

734 

54.502 

8350 

X02585 

Xenopus laevis unidentified open reading frame 
1 (166 aa) 

169 

65.000 

8351 

AF155100 

Homo sapiens zinc finger protein NY-REN- 21 
antigen 

2895 

100 .000 

8352 

AL035297 

Homo sapiens hypothetical protein 

1067 

100.000 

8353 

J05071 

Bos taurus GTP -binding regulatory protein 
gamma -6 subunit 

454 

100.000 

8354 

AF171060 

Mus musculus RING finger protein A07 

2565 

81. 917 

8355 

X63417 

Homo sapiens IRLB 

649 

51.042 

8356 

AF093680 

Homo sapiens transcription factor I IB 

1281 

100.000 

8357 

AF087825 

Mus musculus claudin-7 

211 

47 . 500 

8358 

AL021960 

Arabidopsis thaliana adrenodoxin-like protein 

414 

51.282 

8359 

Y18007 

Homo sapiens seven transmembrane domain 
protein 

1412 

93 .304 

8360 

X75315 

Homo sapiens SEB4B 

1500 

97.826 

8361 

AL050170 

Homo sapiens hypothetical protein 

935 

100.000 

8362 

AF176784 

Rattus norvegicus eps8 binding protein 

897 

39.446 

8363 

AL080071 

Homo sapiens hypothetical protein 

847 

100 . 000 

8364 

X84003 

Homo sapiens Poll I transcription factor TFIID 

816 

100.000 

8365 

X64037 

Homo sapiens RNA polymerase II associated 
protein RAP74 

3388 

100.000 

8366 

AF098798 

Homo sapiens unknown 

2301 

99.713 

8367 

X04085 

Homo sapiens catalase 

3642 

100.000 

8368 

AF132172 

Drosophila melanogaster unknown 

1053 

40.705 

8369 

X54449 

Oryza sativa Glycine-rich protein 

203 

46.512 

8370 

AL080201 

Homo sapiens hypothetical protein 

1079 

99.383 

8371 

X69089 

Homo sapiens 16 5kD protein 

9811 

99.795 

8372 

Z99130 

Homo sapiens dJ83L6 . 1 (DNA binding Zinc finger 
protein ZFPA (ZXDA) ) 

5467 

98 .257 

8373 

X92972 

Homo sapiens protein phosphatase 6 

2131 

100.000 

8374 

M87307 

Xenopus laevis fast skeletal muscle beta- 
tropomyosin 

1514 

87.324 

8375 

AF045564 

Rattus norvegicus development -related protein 

2200 

93.182 

8376 

AL050297 

Homo sapiens hypothetical protein 

2957 

98.886 

8377 

M11900 

Mus musculus 15-kDa proline-rich salivary 
protein 

216 

36.508 

8378 

L22342 

Homo sapiens phosphoprotein 

233 

97.059 

8379 

AF172854 

Homo sapiens cardiotrophin- like cytokine CLC 

1579 

100.000 

8380 

AJ224878 

Homo sapiens T-cell receptor interacting 
molecule (TRIM) protein 

1242 

100.000 

8381 

M84379 

Homo sapiens lymphocyte antigen 

2482 

99. 178 
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8382 

Z99297 

Homo sapiens dJ262D12.2 

( (mitochondrial /chloroplast 3 OS ribosomal 
protein S14) -LIKE protein) 

870 

100.000 

8383 

AF007130 

Homo sapiens unknown 

1694 

100.000 

8384 

D00099 

Homo sapiens Na , K-ATPase alpha- subunit 

6746 

100.000 

8385 

X05908 

Homo sapiens lipocortin (AA 1-346) 

2207 

100. 000 

8386 

M27288 

Homo sapiens oncostatin M 

1682 

99.603 

8387 

X97571 

Mus musculus HCMV- interacting protein 

563 

96 . 809 

8388 

X04391 

Homo sapiens put. precursor polypeptide 

3463 

99. 798 

8389 

AF161080 

Homo sapiens inhibitory receptor PIRIIalpha 

2014 

99.340 

8390 

AF054176 

Homo sapiens angiotensin/vasopressin receptor 
AII/AVP 

3449 

99.416 

8391 

L08239 

Homo sapiens located at OATL1 

2701 

100.000 

8392 

AF087679 

Sus scrofa tropomyosin 4 

1515 

100.000 

8393 

AF121863 

Homo sapiens sorting nexin 14 

2498 

100.000 

8394 

AL050101 

Homo sapiens hypothetical protein 

3605 

100.000 

8395 

AL109846 

Schizosaccharomyces pombe hypothetical protein 

305 

33 . 146 

8396 

Y18101 

Mus musculus macrophage actin-associated- 
tyrosine-phosphorylated protein 

1995 

87 .425 

8397 

Z81490 

Unknown similar to WD domain, G-beta repeats 
(2 domains) ; cDNA EST EMBL : T00482 comes from 
this gene 

2184 

62 .427 

8398 

U25353 

Gallus gallus homeodomain protein AKR 

650 

51 . 915 

8399 

AF081353 

Homo sapiens GTP-binding protein 

1461 

100.000 

8400 

Y15286 

Homo sapiens vacuolar proton-ATPase subunit 
M9.2 

594 

100.000 

8401 

AL031431 

Homo sapiens dJ462023.1 (novel protein) 

2140 

99.676 

8402 

D87973 

Mus musculus Impact 

1796 

82 . 500 

8403 

AF156884 

Homo sapiens RIP- like kinase 

3567 

99.807 

8404 

X91257 

Homo sapiens seryl-tRNA synthetase 

3400 

99.805 

8405 

Z82062 

Caenorhabditis elegans cDNA EST yk415cl2.5 
comes from this gene; cDNA EST yk526h3.3 comes 
from this gene; cDNA EST yk599bl.3 comes from 
this gene 

545 

42 . 941 

8406 

Z93386 

Unknown Similarity to Yeast hypothetical 52.9 
KD protein (SW:P43616); CDNA EST EMBL:M89432 
comes fr 

1807 

58 . 030 

8407 

AJ010059 

Homo sapiens SIT protein 

1335 

100.000 

8408 

AF068302 

Homo sapiens 

choline/ethanolaminephosphotransf erase 

2854 

100.000 

8409 

U89432 

Mus musculus unknown 

656 

69. 799 

8410 

AL050107 

Homo sapiens hypothetical protein 

1732 

100.000 

8411 

D86964 

Homo sapiens similar to a human major CRK- 
binding protein DOCK180. 

1218 
0 

100.000 

8412 

AL080091 

Homo sapiens hypothetical protein 

1755 

99.225 

8413 

X89399 

Homo sapiens Ins P4 -binding protein 

5564 

99. 520 

8414 

AL031775 

Homo sapiens dJ30M3.1 (novel protein similar 
to (predicted) plant, worm, yeast and archaea 
bacterial proteins) 

712 

100.000 

8415 

AL021453 

Homo sapiens dJ821D11.3 (PUTATIVE protein) 

1256 

100. 000 

8416 

AF015811 

Mus musculus putative lysophosphatidic acid 
acyl transferase 

1900 

96 . 564 

8417 

AF127761 

Homo sapiens ribonucleoprotein RBM8 

1194 

100.000 

8418 

A42942 

unidentified unnamed protein product 

219 

34 .314 

8419 

X71975 

Drosophila melanogaster put. homologue to 

861 

59 . 740 
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S.cerevisiae GAR1 gene 



8420 

AF095927 

Rattus norvegicus protein phosphatase 2C 

2475 

95.153 

8421 

U10414 

Caenorhabditis elegans Contains similarity to 
Pfam domain: PF00005 (ABC tran) , Score=245.2, 
E-value=3e-70, N=2 

2268 

50.142 

8422 

AL117472 

Homo sapiens hypothetical protein 

5442 

99.877 

8423 

U61953 

Caenorhabditis elegans No definition line 
found 

1001 

47.904 

8424 

AL031774 

Homo sapiens dek (putative oncogene) 

2390 

100.000 

8425 

D42039 

Homo sapiens The hal009 gene product is novel. 

1563 

100.000 

8426 

Z99104 

Bacillus subtilis similar to hypothetical 
proteins 

461 

32.955 

8427 

AJ001612 

Homo sapiens L-3-phosphoserine-phosphatase 
homologue 

507 

100.000 

8428 

X15525 

Homo sapiens acid phosphatase 

2891 

99. 764 

8429 

AF025468 

Caenorhabditis elegans No definition line 
found 

815 

34 .286 

8430 

AC004874 

Homo sapiens similar to N- 

acetylgalactosaminyltransf erase; similar to 
Q07537 (PID:gll71989) 

1211 

98.895 

8431 

AF120102 

Homo sapiens calsenilin 

998 

69.388 

8432 

Y13148 

Rattus norvegicus PAG608 

1746 

88 . 194 

8433 

X17320 

Mus musculus put. brain specific antigen (AA 
1-62) 

161 

46.296 

8434 

AF146738 

Rattus norvegicus testis specific protein 

992 

83.523 

8435 

U83194 

Homo sapiens TRAF4 -associated factor 2 

2612 

100.000 

8436 

D10376 

Bos taurus mitochondrial adenylate kinase 
isozyme 3 

1405 

92.511 

8437 

Y09305 

Homo sapiens protein kinase 

1640 

100. 000 

8438 

X94917 

Drosophila melanogaster head-elevated 
expression in 0.9 kb 

170 

23 . 711 

8439 

AB017644 

Homo sapiens ubi qui tin -conjugating enzyme E2 

1169 

85 . 990 

8440 

AL050173 

Homo sapiens hypothetical protein 

1177 

97.312 

8441 

AC006538 

Homo sapiens BC41195 1 

1062 

78.894 

8442 

AJ001019 

Homo sapiens ring finger protein 

584 

40.756 

8443 

AF156857 

Homo sapiens actin-binding protein 

3965 

100.000 

8444 

AF133124 

Homo sapiens transcription factor IIIC63 

3466 

98 .479 

8445 

AJ005578 

Homo sapiens 6 -phosphof ructo-2 -kinase 

3353 

100.000 

8446 

U58884 

Mus musculus SH3P7 

2448 

85.747 

8447 

AF151890 

Homo sapiens CGI -132 protein 

929 

100.000 

8448 

U53148 

Caenorhabditis elegans similar to protein 
kinase C inhibitors 

271 

35.461 

8449 

M77836 

Homo sapiens pyrroline-5-carboxylate reductase 

1739 

84.591 

8450 

AB004316 

Bos taurus mitochondrial methionyl- tRNA 
transf ormylase 

2227 

87 . 838 

8451 

235094 

Homo sapiens SURF-2 

1734 

97.266 

8452 

AL050275 

Homo sapiens hypothetical protein 

3077 

99. 520 

8453 

D13308 

Sus scrofa glycine N-methyltransf erase 

1887 

93.493 

8454 

Z69944 

Schizosaccharomyces pombe putative 
endonuclease 

350 

43 .165 

8455 

AJ223183 

Homo sapiens DORA protein 

1568 

99.585 

8456 

AC004890 

Homo sapiens similar to HUB1; similar to 
BAA24380 (PID : g2789430 ) 

5085 

99.720 

8457 

U09280 

Homo sapiens type XIX collagen 

79 

34.091 

8458 

AJ224442 

Homo sapiens methyltransf erase 

1169 

99.415 
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ALUUbU14 

Homo sapiens similar to RFP trans foonin^ 
protein; similar to P14373 (PID :gl32517) 

114z * 

00 Ton 

Oft D X 


riomo sapxens maccn co aa z ^ ^ d / ^ : gjou^t / / 0} 

C A "*5 

lOO.OOO 

8462 

V00507 

Homo sapiens coding sequence of DHFR (1 is 1st 
case xn coaon/ vddx xs jtu oase xn coaon; 

1129 

92.896 

8463 

AJ005259 

Homo sapiens homologous to Bombyx mori 
muxcxprocexn orxagxng Eaccor \ E*i v Lt>jLi ! aduuiu / oj 

948 

100.000 
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JO.Z1 / 

8465 

AL079281 

Homo sapiens hypothetical protein, similar to 
(U77 968) neuronal PAS domain protein 1 

1258 

98.000 

8466 

M19351 

Mus musculus immunoglobulin heavy chain 
binding protein 

188 

34.021 

8467 

ArOlzbbz 

Trypanosoma cruzi Tcrab27 

233 

68 . 000 

8468 

AC005594 

Homo sapiens R33729 1, partial CDS 

973 

96.711 

8469 

Z81051 

Caenorhabditis elegans predicted using 
Gene finder; similar to Zinc finger, C3HC4 type 
IRING linger) ; cdna EST yK443nb*3 comes from 
this gene; cDNA EST yk443h5.5 comes from this 
gene; cDNA EST yk633hl.3 comes from this gene 

257 

29 . 348 

8470 

D86984 

Homo sapiens similar to yeast adenylate 
cyclase iobo7/oj 

1801 

56 . 263 

8471 

AF109674 

Rattus norvegicus late gestation lung protein 

X 

1104 

77 . 841 

8472 

AE000909 

Methanobacterium thermoautotrophicum 
serine/ threonine protein kinase related 
proucin 

464 

30.032 

8473 

M80783 

Homo sapiens B12 protein 

672 

68.750 

8474 

AL117511 

Homo sapiens hypothetical protein 

654 

100.000 

8475 

ABOz /137 

Homo sapiens RAB-26 

q r a 

11 A f\ A 

13. 4U4 

8476 

X53744 

Canis familiaris 68kDA subunit of signal 
recognition particle 

3855 

97.049 

8477 

M2U4bo 

Homo sapiens aldehyde dehydrogenase 
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2238 

lOO OOO 

8481 

AJ001509 

Homo R^ni onct T^n^iT nrnhpi n 

2834 

100 000 

8482 

Z81069 

Caenorhabditis elegans cDNA EST yk552d5.3 

pAmp o f T*om t~ Vl 1 cj rrpnp 
v— i o J- -i. twJin liixq y cue 

216 

39.394 

8483 

AJ006068 

Homo sapiens dTDP-D- glucose 4 , 6 -dehydratase 

2379 

100.000 
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8486 

Z81592 

Caenorhabditis elegans predicted using 
Gene finder 

546 

55.782 

8487 

AI/L17204 

Caenorhabditis elegans predicted using 
Gene finder 

605 

37.276 

8488 

AC004449 

Homo sapiens R33683 3 

715 

100.000 

8489 

AF061817 

Rattus norvegicus DNA- binding protein PREB 

2520 

89.688 

8490 

AF117615 

Homo sapiens heme -binding protein 

1276 

99-471 
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8491 

S94541 

Homo sapiens clone 4-3 

359 

97 . 959 

8492 

Z71181 

Caenorhabditis elegans similar to hydrolase 

784 

36 . 391 

8493 

AB014589 

Homo sapiens KIAA0689 protein 

3783 

100 . 000 

8494 

Z81105 

Caenorhabditis elegans similar to alpha/beta 
hydrolase fold; cDNA EST EMBL:T02320 comes 
from this gene 

588 

37.681 

8495 

AJ242832 

Homo sapiens calpain 

4743 

99. 713 

8496 

S94421 

Homo sapiens T cell receptor eta-exon 

641 

100.000 

8497 

AL050214 

Homo sapiens hypothetical protein 

1218 

99.448 

8498 

AP135016 

Homo sapiens protein phosphatase 2A 48 kDa 
regulatory subunit 

2845 

100.000 

8499 

AF117272 

Octopus dof leini O- crystal 1 in 

319 

31.176 

8500 

AF089106 

Homo sapiens unknown 

919 

100 . 000 

8501 

U04968 

Cricetulus griseus nucleotide excision repair 
protein 

4897 

97.628 

8502 

AL050131 

Homo sapiens hypothetical protein 

1517 

100.000 

8503 

AL031266 

Caenorhabditis elegans VM106R.1 

198 

33.333 

8504 

AF019661 

Mus musculus zeta proteasome chain; PSMA5 

1538 

100 .000 

8505 

AB020316 

Homo sapiens dermatan/chondroitin sulfate 2- 
sulfo transferase 

2845 

100.000 

8506 

A40202 

unidentified unnamed protein product 

619 

100.000 

8507 

AL035593 

Homo sapiens dJ310J6.1 (novel protein) 

1040 

98.675 

8508 

Z82244 

Homo sapiens bK286B10.1 

288 

61. 176 

8509 

AL022318 

Homo sapiens bK150C2.3 (PUTATIVE novel protein 
similar to APOBEC1 (Apolipoprotein B mRNA 
editing protein) and Phorbolin) 

1457 

100.000 

8510 

AF132794 

Homo sapiens anaphase promoting complex 
subunit 10 

1258 

99.459 

8511 

X63657 

Homo sapiens FVT1 gene is disrupted in a 
t(2;18) chromosomal translocation involving Ig 
kappa gene in a follicular lymphoma 

2116 

100.000 

8512 

Y14780 

Homo sapiens lymphocyte function associated 
antigen-3, TM-linked precursor 

1685 

100.000 

8513 

Z81038 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk488a2.5 comes from this 
gene 

359 

34.562 

8514 

AF070594 

Homo sapiens HNK-1 sulfotransf erase 

546 

32 . 013 

8515 

AF060883 

Mus musculus endomucin 

639 

48.302 

8516 

U05784 

Rattus norvegicus light chain 3 subunit of 
microtubule-associated proteins 1A and IB 

669 

82 . 500 

8517 

X56351 

Homo sapiens delta- aminolevulinate synthase 
(housekeeping) 

4284 

100.000 

8518 

AF151889 

Homo sapiens CGI -131 protein 

1018 

100. 000 

8519 

D50617 

Saccharomyces cerevisiae YFL046W 

225 

24 . 865 

8520 

AF171055 

Homo sapiens thioredoxin reductase TR2 

3716 

99.288 

8521 

AF123880 

multiple sclerosis associated retrovirus 
element unknown protein U5/2 

323 

82 .456 

8522 

AL110151 

Homo sapiens hypothetical protein 

589 

38 .261 

8523 

AC007193 

Homo sapiens PPP5 HUMAN 

3358 

99 . 800 

8524 

Y17282 

Homo sapiens cytokeratin type II 

3498 

99 . 819 

8525 

X66901 

Mus musculus En-2/lacZ fusion protein 

117 

49.123 | 

8526 

U41012 

Caenorhabditis elegans C06A6 . 3 gene product 

212 

24 . 242 

8527 

X76029 

Homo sapiens neuromedin U 

1183 

99.425 

8528 

X94991 

Homo sapiens zyxin 

4124 

99. 825 

8529 

U30521 

Homo sapiens P311 HUM 

326 

92 . 157 
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8530 

AB020967 

Rattus sp . kinase 

1708 

72.905 

8531 

D87457 

Homo sapiens KIAA02 81 

1336 

79.098 

8532 

Z97207 

Mus musculus B-IND1 protein 

1212 

94 . 737 

8533 

AJ001019 

Homo sapiens ring finger protein 

1675 

99.593 

8534 

D42073 

Homo sapiens reticulocalbin 

1198 

58 .446 

8535 

X80035 

Oryctolagus cuniculus cysteine rich hair 
keratin associated protein 

805 

65.385 

8536 

X73462 

Ovis aries hair keratin cysteine rich protein 

796 

70 .229 

8537 

299129 

Homo sapiens dJ425C14.2 (Placental protein 
DIFF33 LIKE) 

1733 

54 .955 

8538 

X99140 

Homo sapiens type II intermediate filament of 
hair keratin 

3354 

100.000 

8539 

X90763 

Homo sapiens HHa5 hair keratin type I 
intermediate filament 

2824 

99.529 

8540 

AL034488 

Unknown predicted using Genefinder; cDNA EST 
yk490cl.5 comes from this gene; cDNA EST 
yk256e4.5 c 

250 

24 . 746 

8541 

L35604 

Drosophila melanogaster ethanolamine kinase 

921 

40.921 

8542 

AF003388 

Caenorhabditis elegans No definition line 
found 

461 

38 .342 

8543 

M32334 

Homo sapiens intercellular adhesion molecule 2 
(ICAM-2) 

1835 

100.000 

8544 

AF064448 

Mus musculus sex-determination protein homolog 
Femlb 

4097 

98.884 

8545 

AF067855 

Homo sapiens geminin 

1328 

100.000 

8546 

M34513 

Homo sapiens omega protein 

1408 

96 . 714 

8547 

AF093419 

Homo sapiens multi PDZ domain protein MUPP1 

1317 
0 

100.000 

8548 

AC004876 

Homo sapiens similar to predicted proteins 
AAB54240 ( PID : g2088822 ) and S67138 
(PID:g2132925) 

995 

55.144 

8549 

M69238 

Homo sapiens Arnt 

5341 

100.000 

8550 

X12433 

Homo sapiens put . ORF 

2902 

100.000 

8551 

D63880 

Homo sapiens KIAA0159 gene product is related 
to yeast protein L8479.14. 

9109 

99. 929 

8552 

M30185 

Homo sapiens cholesteryl ester transfer 
protein precursor 

3186 

100.000 

8553 

AF102265 

Homo sapiens N-acetylglucosamine -phosphate 
mutase 

3544 

100.000 

8554 

M19507 

Homo sapiens myeloperoxidase 

5048 

99.732 

8555 

Y14318 

Homo sapiens peroxisomal ABC- transporter 

4000 

100.000 

8556 

X85750 

Homo sapiens expression associated with 
monocyte to macrophage, differentiation 

1662 

98.739 

8557 

AF161703 

Homo sapiens gammaS- crystal 1 in 

1237 

99.419 

8558 

X01060 

Homo sapiens put. transferrin receptor (aa 1- 
760) 

5000 

99.605 

8559 

U72678 

Mus musculus EF-9 

1014 

92.857 

8560 

X81372 

Homo sapiens biphenyl hydrolase- related 
protein 

1890 

99.635 

8561 

D284 83 

Homo sapiens SCR3 

2702 

99 . 263 

8562 

U66372 

Bos taurus ribosomal protein S2 9 

420 

100.000 

8563 

D50063 

Homo sapiens proteasome subunit p4 0 / Mov34 
protein 

2063 

98.765 

8564 

J05594 

Homo sapiens NAD+ - dependent 15- 
hydroxyprostaglandin dehydrogenase 

1725 

99.624 
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8565 

AC006033 

Homo sapiens similar to MLN 64; similar to 
138027 (PID:g2135214) 

1517 

100. 000 

8566 

AF002697 

Homo sapiens E1B 19K/Bcl- 2 -binding protein 
Nip3 

1285 

100.000 

8567 

X79536 

Homo sapiens hnRNPcore protein Al 

2211 

100.000 

8568 

AF012652 

Trypanosoma cruzi Tcrab2 7 

233 

68.000 

8569 

D86438 

Homo sapiens Ibal (ionized calcium binding 
adapter molecule 1) 

955 

100.000 

8570 

X77639 

Sus scrofa cellular retinol binding protein II 

582 

60.150 

8571 

X83441 

Homo sapiens DNA ligase IV 

5616 

99.408 

8572 

AB023811 

Homo sapiens TU3A 

450 

54 .135 

8573 

U21855 

Mus musculus mCAFl protein 

1914 

99.649 

8574 

AF078857 

Homo sapiens PTD002 

1269 

100.000 

8575 

Z73420 

Homo sapiens match: protein P25325; match: DNA 
X59434 

2072 

100.000 

8576 

AC006929 

Arabidopsis thaliana unknown protein 

283 

34 .286 

8577 

AB018288 

Homo sapiens KIAA074 5 protein 

5881 

99.020 

8578 

AF065441 

Mus musculus FGF binding protein 1 

228 

24 .402 

8579 

AL031824 

Schizosaccharomyces pombe conserved 
hypothetical protein 

193 

30.078 

8580 

AJ008112 

Homo sapiens C17orfl protein 

2986 

100.000 

8581 

AF003386 

Caenorhabditis elegans No definition line 
found 

1557 

44 .186 

8582 

D80004 

Homo sapiens KIAA0182 

7708 

99.825 

8583 

AC003027 

Arabidopsis thaliana lcl|prt_seq No definition 
line found 

481 

40.329 

8584 

AB029028 

Homo sapiens KIAA1105 protein 

596 

29.783 

8585 

X57802 

Homo sapiens immunoglobulin lambda light chain 

1513 

97 . 835 

8586 

AB028996 

Homo sapiens KIAA1073 protein 

325 

23 .600 

8587 

AF111941 

Dictyostelium discoideum development protein 
DG114 8 

304 

56 .962 

8588 

AB001993 

Homo sapiens glia maturation factor homologous 
protein 

941 

100.000 

8589 

AF124249 

Homo sapiens SH2 -containing protein Nspl 

3932 

99.826 

8590 

D25304 

Homo sapiens this sequence overlaps D13631, it 
covers 954.. 4359 of this sequence. 

5066 

100.000 

8591 

AF132209 

Homo sapiens prepro-major basic protein 
homo log 

1608 

100.000 

8592 

AJ002078 

Homo sapiens syntaxin 6 

1622 

100.000 

8593 

U29488 

Caenorhabditis elegans No definition line 
found 

849 

61.333 

8594 

M13444 

Mus musculus alpha- tubulin isotype M-alpha-6 

3047 

100.000 

8595 

Z81137 

Unknown Similarity to Yeast YIP1 protein 
(SW:P53039); cDNA EST EMBL:T01608 comes from 
this gene; cD 

255 

27 .803 

8596 

AF151903 

Homo sapiens CGI -14 5 protein 

2479 

99.482 

8597 

AL050405 

Homo sapiens hypothetical protein 

2129 

99.688 

8598 

D87463 

Homo sapiens KIAA02 73 

1782 

76 .364 

8599 

AF073518 

Homo sapiens small EDRK-rich factor 1, short 
isof orm 

385 

100.000 

8600 

AL117468 

Homo sapiens hypothetical protein 

1121 

100.000 

8601 

L05425 

Homo sapiens nucleolar GTPase 

4793 

100.000 

8602 

AL117600 

Homo sapiens hypothetical protein 

3670 

99.458 

8603 

Y13936 

Homo sapiens protein phosphatase 2C gamma 

3621 

100.000 

8604 

AJ249735 

Homo sapiens claudin-6 

1469 

100.000 
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8605 1 

AF175409 

Homo sapiens unknown 

2750 

99. 757 

8606 

AJ224677 

Homo sapiens scrapie responsive protein 1 

707 

100.000 

8607 

X99656 

Homo sapiens SH3 -containing Grb-2-like 1 

2426 

100.000 

8608 

D90703 

Escherichia coli Hypothetical 13.8 kd protein 
in cspE-lipA intergenic region. 

819 

98.413 

8609 

X57560 

Escherichia coli pspE protein 

682 

100.000 

8610 

L37368 

Homo sapiens RNA-binding protein 

2011 

100.000 

8611 

X77584 

Homo sapiens ATL-derived f actor/thioredoxin 

705 

100.000 

8612 

X78990 

Mus musculus test in 

1191 

48.916 

8613 

AF151849 

Homo sapiens CGI -91 protein 

2001 

100.000 

8614 

D87447 

Homo sapiens KIAA0258 

2628 

100. 000 

8615 

X77858 

Human papillomavirus type 59 ORF putative E5 

99 

31. 746 

8616 

M97935 

Homo sapiens transcription factor ISGF-3 

5010 

100.000 

8617 

AB002294 

Homo sapiens KIAA0296 

1298 
8 

99.891 

8618 

AF151878 

Homo sapiens CGI -12 0 protein 

644 

75.397 

8619 

AL035064 

Schizosaccharomyces pombe queuine trna- 
ribosyl transferase 

1535 

56.743 

8620 

AF006088 

Homo sapiens pl6-Arc 

685 

68.831 

8621 

AF077042 

Homo sapiens 3 OS ribosomal protein S7 homolog 

1625 

100.000 

8622 

U29195 

Homo sapiens neuronal pentraxin II 

3473 

98 .868 

8623 

AB018322 

Homo sapiens KIAA077 9 protein 

2019 

100.000 

8624 

X67209 

Mus musculus npdcf-1 

1511 

75.585 

8625 

AF180819 

Homo sapiens LAK1 protein 

3537 

99.806 

8626 

AF136382 

Homo sapiens JNK MAP kinase scaffold protein 
JIP2 

5730 

100.000 

8627 

AJ248283 

Pyrococcus abyssi methylmalonyl-CoA mutase, 
subunit alpha, C- terminus (mcmA2) 

126 

30.488 

8628 

Z75134 

Canis familiaris rod transducin 

2322 

100. 000 

8629 

AF007160 

Homo sapiens unknown 

915 

100.000 

8630 

AL050007 

Homo sapiens hypothetical protein 

506 

98.795 

8631 

AC009465 

Arabidopsis thaliana unknown protein, contains 
TNFR/NGFR cysteine -rich region 

375 

38.519 

8632 

Z73277 

Saccharomyces cerevisiae ORF YLRlOSc 

256 

29.240 

8633 

AF156102 

Homo sapiens ELL complex EAP3 0 subunit 

1678 

99.612 

8634 

Z95114 

Homo sapiens bK212A2.1 (TNF- inducible protein 
CG12-1) 

1198 

59.627 

8635 

U88964 

Homo sapiens HEM4 5 

249 

90 . 909 

8636 

AL050143 

Homo sapiens hypothetical protein 

759 

100.000 

8637 

U40953 

Caenorhabditis elegans No definition line 
found 

659 

28. 989 

8638 

AB018566 

Homo sapiens Proline synthetase associated 

1828 

100.000 

8639 

D83767 

Homo sap i ens Rep - 8 

1750 

100.000 

8640 

U36340 

Mus musculus BKLF 

1473 

93 . 103 

8641 

U96769 

Homo sapiens chondroadherin 

2393 

99. 721 

8642 

U66411 

Drosophila melanogaster putative type III 
alcohol dehydrogenase 

1307 

49.403 

8643 

X55989 

Homo sapiens eosinophil cat ionic- related 
protein 

1167 

99.371 

8644 

AF177533 

Homo sapiens tight junction protein ZO-2 
isoform A 

7916 

99.328 

8645 

AF007151 

Homo sapiens unknown 

1908 

100.000 

8646 

AB018307 

Homo sapiens KIAA0764 protein 

2802 

100. 000 

8647 

Z32684 

Homo sapiens membrane transport protein 

2964 

100.000 

8648 

U09367 

Homo sapiens zinc finger protein ZNF136 

3902 

99. 815 
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8649 

M89928 

Oryctolagus cuniculus binding protein 

736 

100.000 

8650 

D90701 

Escherichia coli Hypothetical protein 2 . 

464 

100.000 

8651 

U19577 

Escherichia coli galactonate dehydratase 

508 

94 .805 

8652 

AP000064 

Aeropyrum pernix 12 0aa long hypothetical 
protein 

123 

36.620 

8653 

D90699 

Escherichia coli ORF ID:ol63#3 

696 

98.182 

8654 

AL117660 

Homo sapiens hypothetical protein 

609 

77.273 

8655 

AL117195 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

544 

33 .898 

8656 

U79298 

Homo sapiens unknown 

2465 

100.000 

8657 

Z78542 

Caenorhabditis elegans similar to 
Mitochondrial carrier proteins; cDNA EST 
EMBL:T01651 comes from this gene 

697 

47.191 

8658 

M23236 

Mus musculus proline- rich protein 

523 

39.592 

8659 

AJ010063 

Homo sapiens telethonin 

1132 

100.000 

8660 

AL049711 

Arabidopsis thaliana hypothetical protein 

283 

33 .032 

8661 

AF143235 

Homo sapiens apoptosis related protein APR-l 

1247 

100.000 

8662 

J04173 

Homo sapiens phosphoglycerate mutase 2 

1721 

100. 000 

8663 

X71440 

Homo sapiens peroxisomal acyl-CoA oxidase 

4364 

99.394 

8664 

AF151878 

Homo sapiens CGI -12 0 protein 

1113 

100. 000 

8665 

AF044774 

Homo sapiens breakpoint cluster region protein 
2 

3415 

99 .412 

8666 

Z68227 

Caenorhabditis elegans cDNA EST EMBL:D726 91 
comes from this gene; cDNA EST yk566e9.3 comes 
from this gene 

167 

37.349 

8667 

AF034801 

Homo sapiens liprin-alpha4 

3274 

98.031 

8668 

AL008729 

Homo sapiens predicted protein dJ257A7.l 

836 

100.000 

8669 

Y17849 

Homo sapiens ganglioside- induced 
differentiation associated protein 1 

2328 

98.603 

8670 

AF084458 

Homo sapiens sec61 homolog 

3083 

100.000 

8671 

Y12642 

Homo sapiens E4 8 antigen 

787 

100.000 

8672 

Z73102 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to Bacillus subtilis 
DNAJ" protein <SW : DNAJ_BACSU) ; cDNA EST 
EMBL:D74098 comes from this gene; cDNA EST 
EMBL:C12 52 0 comes from this gene; cDNA EST 
EMBL:D71409 comes from this gene 

608 

38 . 800 

8673 

X78933 

Homo sapiens zinc finger protein 

630 

49.296 

8674 

M17614 

Homo sapiens transferrin 

212 

47.561 

8675 

X61497 

Mus musculus 154 protein 

198 

56 .098 

8676 

M12140 

Homo sapiens envelope protein 

588 

31 .894 

8677 

X57432 

Rattus rattus ribosomal protein S2 

694 

58.547 

8678 

X03145 

Homo sapiens pot. ORF III 

235 

51 . 923 

8679 

S67513 

Borna disease virus BDV, WT-1, Halle Bl/91, 
horse brain, field isolate, Peptide, 370 aa 
p4 0 

763 

42 .138 

8680 

U01849 

Trypanosoma brucei ORF2 

173 

39.130 

8681 

D90176 

Mus musculus ORF of NFI-B6 

1593 

93 .841 

8682 

AF153062 

Canis familiaris type I collagen pre-pro- 
alpnal(I) chain 

230 

27 .186 

8683 

AB030237 

Canis familiaris D4 dopamine receptor 

173 

41.000 

8684 

AF134825 

Homo sapiens small nuclear ribonucleoprotein B 

528 

85.294 

8685 

X51394 

Xenopus laevis APEG precursor protein 

310 

30. 918 

8686 

AF052432 

Homo sapiens katanin p8 0 subunit 

194 

31.282 

8687 

M13100 

Rattus norvegicus unknown protein 

237 

41.667 
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8688 

X03145 

Homo sapiens pot. ORF II 

304 

56 . 190 

8689 

AB012223 

Canis familiaris ORF2 

226 

47.500 

8690 

Z34802 

Caenorhabditis elegans cDNA EST yk372hll.3 
comes from this gene; cDNA EST yk372hll.5 
comes from this gene 

126 

32.692 

8691 

S80905 

Homo sapiens Conl=salivary concanavalin-A 
binding protein {exon 3} 

168 

28.239 

8692 

U04267 

Gossypium barbadense proline- rich cell wall 
protein 

267 

34 . 731 

8693 

U67988 

Homo sapiens guanylate kinase associated 
protein 

570 

78 . 571 

8694 

U73522 

Homo sapiens AMSH 

266 

60 . 526 

8695 

AF123344 

Homo sapiens Kruppel-like zinc finger 
transcription factor 

283 

56.250 

8696 

U71363 

Homo sapiens zinc finger protein zfp6 

430 

51.678 

8697 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

277 

42 . 982 

8698 

X15804 

Homo sapiens alpha-actinin (AA 1-892) 

266 

72 . 131 

8699 

U83303 

Homo sapiens line-1 reverse transcriptase 

168 

46 . 835 

8700 

AF015454 

Xenopus laevis ERl 

829 

66.667 

8701 

X90875 

Mus musculus FXR1 

521 

64 . 844 

8702 

Z12172 

Homo sapiens putative homeotic protein 

429 

76 . 190 

8703 

U72514 

Homo sapiens C2f 

363 

57 . 944 

8704 j 

X56158 

Homo sapiens immunoglobulin from VH4 family 

534 

76 . 923 

8705 

U01317 

Homo sapiens G- gamma glob in 

217 

50.602 

8706 

AL021396 

Homo sapiens dJ971N18.2 

861 

85 . 161 

8707 

U49974 

Homo sapiens mariner transposase 

360 

70.526 

8708 

U47924 

Homo sapiens C8 

490 

68 . 966 

8709 

AC006233 

Arabidopsis thaliana hypothetical protein 

190 

52 . 542 

8710 

M15386 

Homo sapiens gamma -globin 

466 

71 . 698 

8711 

D88385 

Sus scrofa A-Raf-l 

396 

72 .414 

8712 

U95044 

Homo sapiens zinc finger protein 

221 

43 . 382 

8713 

AF064553 

Mus musculus NSD1 protein 

223 

57.895 

8714 

M15386 

Homo sapiens gamma -globin 

525 

76 . 786 

8715 

M96982 

Homo sapiens U2 snRNP auxiliary factor small 
subunit 

398 

39.608 

8716 

U49082 

Homo sapiens transporter protein 

950 

62 . 172 

8717 

AF000422 

Homo sapiens TTF-I interacting peptide 5 

1802 

84 . 091 

8718 

U01317 

Homo sapiens G-gamma globin 

215 

50. 000 

8719 

AF053356 

Homo sapiens insulin receptor substrate like 
protein 

317 

39.631 

8720 

U58337 

Mus musculus ligatin 

330 

63 . 636 

8721 

S61973 

Rattus sp. NMDA receptor glutamate-binding 
subunit 

2375 

74 . 723 

8722 

M29622 

Mus musculus open reading frame 2 

120 

50.000 

8723 

X74330 

Homo sapiens DNA primase (subunit p48) 

307 

57.292 

8724 

M16550 

Baboon endogenous virus pol polyprotein 

660 

34 . 990 

8725 

Z77655 

Caenorhabditis elegans Weak similarity to 
Human calcium-dependent proetase 
(SW:CANS HUMAN) 

194 

48 . 718 

8726 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

2155 

65.565 

8727 

AL033125 

Unknown l-evidence=predicted by content; 1- 
method=genef inder ; 084 ; l-evidence_end; 2 - 
evidence=pred 

1625 

45. 960 

8728 

U49974 

Homo sapiens mariner transposase 

889 

66 . 045 
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8729 

X55777 

Homo sapiens put . ORF 

240 

59.211 

8730 

AF151887 

Homo sapiens CGI -12 9 protein 

177 

75.000 

8731 

X03145 

Homo sapiens pot. ORF II 

275 

47. 934 

8732 

X75042 

Homo sapiens c - rel 

2333 

90.453 

8733 

AF003535 

Homo sapiens ORF2-like protein 

419 

63 .478 

8734 

AF000194 

Caenorhabditis elegans No definition line 
found 

356 

25.364 

8735 

U93565 

Homo sapiens putative pi 50 

195 

37. 864 

8736 

AF003535 

Homo sapiens ORF2-like protein 

440 

58.088 ! 

8737 

X51394 

Xenopus laevis APEG precursor protein 

203 

27.875 

8738 

U93565 

Homo sapiens putative pi 50 

472 

52 .381 

8739 

AC006585 

Arabidopsis thaliana putative extragenic 
suppressor protein 

1243 

43.545 

8740 

AB002306 

Homo sapiens KIAA0308 

900 

40.748 

8741 

AL031177 

Homo sapiens dJ88 9M15.3 (novel protein) 

1109 

90.099 

8742 

L11366 

Herpesvirus papio EBNA2 gene product 

263 

25.731 

8743 

AB012139 

Rattus norvegicus procollagen C-proteinase 3 

267 

38.312 

8744 

M27878 

Homo sapiens DNA binding protein 

407 

31.359 

8745 

X07704 

Homo sapiens Po protein 

304 

45. 912 

8746 

X53581 

Rattus norvegicus ORF4 

444 

39.648 

8747 

AF006740 

Homo sapiens No definition line found 

276 

41.935 

8748 

AC005825 

Arabidopsis thaliana putative glucokinase 

227 

51.389 

8749 

L22031 

Glycine max hydroxyproline-rich glycoprotein 

155 

41.379 

8750 

Z14019 

Nicotiana tabacum pistil extensin like protein 

197 

32 . 540 

8751 

AJ243460 

Leishmania major proteophosphoglycan 

245 

34 .123 

8752 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

219 

53.731 

8753 

Z81074 

Unknown Similarity to Soybean 3- 
methylcrotonyl-CoA carboxylase (TR:Q42777) ; 
cDNA EST EMBL:M75819 co 

1366 

64.223 

8754 

AF107727 

Rattus norvegicus sertolin 

370 

61. 818 

8755 

AF071081 

Mycobacterium tuberculosis proline- rich mucin 
homo log 

323 

29. 964 

8756 

U97553 

murine herpesvirus 6 8 unknown 

181 

28.381 

8757 

M26111 

Anser anser beta-actin 

527 

82.979 

8758 

AC004136 

Arabidopsis thaliana hypothetical protein 

272 

25.468 

8759 

AF015539 

Mytilus edulis precollagen P 

420 

29.711 

8760 

AF129269 

Homo sapiens DNA methyl transferase 3 beta 5 

320 

61.628 

8761 

AB012223 

Cani s f ami 1 iar i s 0RF2 

208 

45.763 

8762 

AB015054 

Rhizomucor pusillus Alg2 

162 

43 . 750 

8763 

D31763 

Homo sapiens ha0946 protein is Kruppel- 
related. 

758 

38.259 

8764 

AF153127 

Gallus gallus SAPK interacting protein 

2179 

72.782 

8765 

AF139185 

Rattus norvegicus myomegalin 

861 

65.066 

8766 

U09116 

Homo sapiens ORF2 , encodes a reverse 
transcriptase homolog 

346 

44 . 056 

8767 

M74002 

Homo sapiens arginine-rich nuclear protein 

692 

79.605 

8768 

AF055904 

Myxococcus xanthus unknown 

294 

33 .438 

8769 

AL110249 

Homo sapiens hypothetical protein 

183 

36 .449 

8770 

L22760 

Rattus norvegicus DNA binding protein 

176 

26 . 984 

8771 

AB032904 

Hylobates syndactylus dopamine receptor D4 

163 

37 . 008 

8772 

U97553 

murine herpesvirus 68 unknown 

346 

30.680 

8773 

X67863 

Mus musculus T2 

236 

40.690 

8774 

J00123 

Homo sapiens preproenkephalin ( 

44 

30. 000 

8775 

U49974 

Homo sapiens mariner transposase 

307 

67.742 

8776 

AB029014 

Homo sapiens KIAA1091 protein 

1362 

61.096 
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8777 

X61046 

Hydra sp . mini -collagen 

167 

47.541 

8778 

D88587 

Homo sapiens Hakata antigen 

337 

40.816 

8779 

M12140 

Homo sapiens envelope protein 

554 

32.624 

8780 

X55777 

Homo sapiens put . ORF 

245 

67.742 

8781 

M80341 

Homo sapiens ORF2 contains a reverse 
transcriptase domain.; ORF2 

320 

52.809 

8782 

Z96047 

Caenorhabditis elegans DY3.6 

251 

28 . 846 

8783 

AF132552 

Drosophila melanogaster BcDNA. GM01838 

1292 

79.498 

8784 

X65120 

Homo sapiens alphal (X) collagen 

312 

29.081 

8785 

AL033545 

Arabidopsis thaliana extensin-like protein 

198 

32.812 

8786 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

421 

54.839 

8787 

AC002542 

Homo sapiens similar to C. elegans F11A10.5; 
80% similarity to Z68297 (PID :gll30619 ) 

1547 

62.439 

8788 

M64793 

Rattus norvegicus salivary proline-rich 
protein 

242 

30.493 

8789 

U04267 

Gossypium barbadense proline-rich cell wall 
protein 

210 

36 . 601 

8790 

X86019 

Homo sapiens SH3 -domain interacting protein 

246 

29.936 

8791 

Z14016 

Nicotiana tabacum pistil extensin like 
protein, partial CDS 

179 

31 . 579 

8792 

U57368 

Mus musculus EGF repeat transmembrane protein 

437 

56 . 522 

8793 

M11902 

Mus musculus proline-rich salivary protein 

221 

35 . 789 

8794 

AF153685 

Homo sapiens truncated calcium binding protein 

89 

28 .421 

8795 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

184 

37.838 

8796 

X15332 

Homo sapiens alpha-1 (III) collagen 

294 

28.367 

8797 

AF124663 

Mus musculus UbcM4 interacting protein 28; 
UIP28 

515 

43 . 777 

8798 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

428 

61 . 146 

8799 

U97553 

murine herpesvirus 68 unknown 

390 

29 . 197 

8800 

S62941 

Homo sapiens Ps 2=basic proline-rich 
protein (PRB1L precursor protein=basic proline- 
rich proteins (Ps, PmF, PmS, and Pe) 
precursor) {C- terminal} 

353 

32 . 915 

8801 

X61395 

Lycopersicon esculentum proline rich protein 

185 

26 . 016 

8802 

AJ243460 

Leishmania major proteophosphoglycan 

245 

30 . 189 

8803 

X61045 

Hydra sp. mini -collagen 

193 

35.043 

8804 

X57010 

Homo sapiens collagen II alpha 1 chain 

158 

37.419 

8805 

U05342 

Mus musculus zinc finger protein 

644 

50.505 

8806 

U04267 

Gossypium barbadense proline-rich cell wall 
protein 

223 

35 . 115 

8807 

X55777 

Homo sapiens put. ORF 

348 

72 . 368 

8808 

AFO 55904 

Myxococcus xanthus unknown 

163 

30 . 682 

8809 

X60432 

Zea mays prolin rich protein 

195 

32.680 

8810 

AF055904 

Myxococcus xanthus unknown 

238 

30.712 

8811 

U25281 

Rattus norvegicus SH3 domain binding protein 

207 

32 . 035 

8812 

U35730 

Mus musculus jerky 

260 

33.846 

8813 

U88895 

Homo sapiens ORF2 

302 

71.212 

8814 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

256 

44.000 

8815 

AF019969 

Mus musculus Su(var) 3-9 homolog 

420 

49.315 

8816 

U21310 

Caenorhabditis elegans No definition line 
found 

164 

32 . 824 

8817 

L29169 

Chironomus tentans giant secretory protein 
(sp-Ic) 

189 

40.800 
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8818 

AL008583 

Homo sapiens dJ327J16.3 (novel CHROMObox 
family protein) 

532 

59.236 

8819 

U07973 

Gallus gallus alpha- 1 collagen type III 

346 

28 . 075 

8820 

D29833 

Homo sapiens proline rich peptide P-B 

393 

73.750 

8821 

AL034421 

Homo sapiens dJ1137F22.2 (core-binding factor, 
runt domain, alpha subunit 2; translocated to, 
2 (MTGRla, MTG8/ETO/CDR family protein) 
(isoform 2) ) 

147 

32.075 

8822 

Y10529 

Homo sapiens olfactory receptor 

1145 

75.200 

8823 

AF055904 

Myxococcus xanthus unknown 

200 

30.545 

8824 

U03714 

Mus musculus alpha 1 (XVIII) collagen 

165 

39.815 

8825 

AF017777 

Drosophila melanogaster waclaw 

1208 

52 . 099 

8826 

X07882 

Homo sapiens Po protein 

182 

34 . 746 

8827 

X15311 

Woolly monkey sarcoma virus reverse 
transcriptase (476 AA) 

491 

47.236 

8828 

AJ222580 

Mus musculus B99 protein 

184 

27.987 

8829 

M69297 

Homo sapiens ORF 2 

109 

33 . 735 

8830 

Y18620 

Arabidopsis thaliana DsPTPl protein 

251 

43 . 925 

8831 

AF010170 

synthetic construct Pol 

659 

26 . 002 

8832 

U68412 

Arenicola marina fibrillar collagen 

198 

34 . 711 

8833 

X61047 

Hydra sp . mini-collagen 

123 

33.684 

8834 

AB032907 

Hylobates lar dopamine receptor D4 

175 

35.811 

8835 

D64052 

Nicotiana tabacum cytochrome P4 50 like TBP 

183 

36.691 

8836 

AF020261 

Santalum album proline rich protein 

242 

26 . 027 

8837 

AL078606 

Arabidopsis thaliana putative protein 

359 

45.113 

8838 

M15103 

Plasmodium cynomolgi circumsporozoite antigen 

194 

26.210 

8839 

AJ004832 

Homo sapiens neuropathy target esterase 

1218 

58 . 583 

8840 

AC006283 

Arabidopsis thaliana hypothetical protein 

170 

29.605 

8841 

U32305 

Caenorhabditis elegans No definition line 
found 

378 

40.271 

8842 

AL031603 

Schizosaccharomyces pombe conserved 
hypothetical protein . 

1253 

51.969 

8843 

Z97184 

Homo sap i en s B ING 1 

484 

41 .494 

8844 

AF055904 

Myxococcus xanthus unknown 

264 

31.985 j 

8845 

Z81074 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to Yeast ORF YOR070C 
(TR:Q08484); cDNA EST EMBL:T01610 comes from 
this gene; cDNA EST EMBL:D36648 comes from 
this gene; cDNA EST yk303b2.5 comes from this 
gene 

419 

35 . 821 

8846 

U25281 

Rattus norvegicus SH3 domain binding protein 

233 

31.837 

8847 

X55686 

Lycopersicon esculentum extensin (class II) 

56 

31.818 

8848 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

206 

76.744 ; 

8849 

X64173 

Zea diploperennis hydroxyproline-rich 
glycoprotein 

181 

33.117 

8850 

AC006233 

Arabidopsis thaliana hypothetical protein 

188 

42.342 

8851 

U62528 

Equus caballus type II collagen 

326 

29.173 

8852 

Z14015 

Nicotiana tabacum pistil extensin like protein 

253 

29.469 

8853 

AF121861 

Homo sapiens sorting nexin 11 

68 

30.000 

8854 

U93569 

Homo sapiens putative pl50 

296 

42 .373 

8855 

L24521 

Homo sapiens transformation- related protein 

249 

56.667 

8856 

AF086608 

Rattus norvegicus neurestin beta 

297 

30.802 

8857 

AF000381 

Homo sapiens non- functional folate binding 
protein 

806 

89.506 

8858 

S70011 

Rattus sp . tricarboxylate carrier 

298 

53 . 846 
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8859 

AF016671 

Caenorhabditis elegans Similar to collagen 

274 

45.361 

8860 

U49974 

Homo sapiens mariner transposase 

368 

68.041 

8861 

M22333 

Homo sapiens unknown protein 

585 

49.393 

8862 

M92913 

Nephila clavipes dragline silk fibroin 

326 

28.439 

8863 

AL031282 

Homo sapiens dJ283E3.6.1 (PUTATIVE novel 
protein similar to many (archae) bacterial , 
worm and yeast hypothetical proteins) 

1589 

55.644 

8864 

AF017777 

Drosophila melanogaster tweety 

475 

25.971 

8865 

AB011126 

Homo sapiens KIAA0554 protein 

426 

58.678 

8866 

Y14690 

Homo sapiens procollagen alpha 2 (V) 

238 

30. 038 

8867 

AB028997 

Homo sapiens KIAA1074 protein 

504 

50.575 

8868 

AC005498 

Homo sapiens R31665 2 

366 

27.083 

8869 

AF159131 

Mus musculus nucleolar RNA helicase II/Gu 

174 

32 .335 

8870 

D80009 

Homo sapiens KIAA0187 

827 

87.333 

8871 

AC006085 

Arabidopsis thaliana Hypothetical protein 

294 

28 . 516 

8872 

U53155 

Unknown Similar to cuticular collagen; coded 
for by C. elegans cDNA yk58e6.3; coded for by 
C. elega 

109 

34 . 127 

8873 

X15081 

Crithidia fasciculata MURF2 protein (AA 1-348) 

196 

32 . 639 

8874 

Z70780 

Caenorhabditis elegans cDNA EST yk465dl0.3 
comes from this gene; cDNA EST yk465dl0.5 
comes from this gene; cDNA EST yk481d9.5 comes 
from this gene 

323 

27.626 

8875 

AC003979 

Arabidopsis thaliana ESTs gb|Z34075, gb|Z34835 
and gb|AA4 04 888 come from this gene. 

424 

40.299 

8876 

AL022537 

Arabidopsis thaliana putative protein 

172 

33.121 

8877 

AL078637 

Arabidopsis thaliana serine/threonine kinase- 
like protein 

220 

30.270 

8878 

AB029022 

Homo sapiens KIAA1099 protein 

1687 

79.878 

8879 

AL022393 

Homo sapiens p373c6.2 

538 

50. 521 

8880 

U76557 

Rattus norvegicus O-GlcNAc transferase, pllO 
subunit 

1063 

64.537 

8881 

X75919 

Pseudomonas fluorescens TRANSFERRED ENTRY : 
3.4.19.3 

53 

28. 916 

8882 

X99699 

Homo sapiens XIAP associated factor- 1 (ZAP-1) 

689 

64 . 324 

8883 

AF026802 

Homo sapiens alpha- 3 type IX collagen 

120 . 

36 . 735 

8884 

AF041330 

Bodo saltans NADH dehydrogenase subunit 5 

144 

32.824 I 

8885 

AF003535 

Homo sapiens ORF2-like protein 

231 

44 . 882 

8886 

Y12713 

Mus musculus Gag polyprotein 

228 

37.324 

8887 

X55777 

Homo sapiens put . ORF 

309 

54 . 444 

8888 

U44091 

Rattus norvegicus atrophin-1 related protein 

224 

36.683 

8889 

AJ012582 

Homo sapiens hyperpolarization-activated 
cation channel HCN2 

113 

32 . 653 

8890 

AF091234 

Mus musculus putative transcription factor 

1363 

67 . 341 

8891 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

1439 

77.891 

8892 

AL009266 

Homo sapiens hypothetical protein 

803 

67. 568 

8893 

U97553 

murine herpesvirus 6 8 unknown 

287 

30.478 

8894 

AJ004801 

Bovine herpesvirus type 1 . 1 glycoprotein C 

169 

34 . 746 

8895 

AJ132828 

Spermatozopsis similis p210 protein 

177 

27 . 039 

8896 

AF167320 

Mus musculus zinc finger protein ZFP113 

567 

34 .409 

8897 

Z72495 

Carassius auratus ZP2 

413 

30.882 

8898 

D29642 

Homo sapiens KIAA0053 

398 

49.612 

8899 

U45958 

Nicotiana alata pistil extensin-like protein 

210 

31. 780 

8900 

M64793 

Rattus norvegicus salivary proline -rich 
protein 

273 

34 . 701 
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8901 

S71333 

Platyrrhini alpha 1,3 galactosyl transferase, 
alpha 1 , 3GT 

622 

65. 584 

8902 

M13101 

Rattus norvegicus unknown protein 

348 

50.794 

8903 

M16976 

Glycine max N-75 

274 

36.364 

8904 

X52235 

Homo sapiens ORFII 

308 

43.275 

8905 

X52851 

Homo sapiens peptidylprolyl isomerase 

766 

74.332 

8906 

L24521 

Homo sapiens transformation- related protein 

347 

60.638 

8907 

AF007269 

Arabidopsis thaliana No definition line found 

525 

28.500 

8908 

M24732 

Homo sapiens lamin-like protein 

174 

40.708 

8909 

AF055904 

Myxococcus xanthus unknown 

256 

32.090 

8910 

AB028070 

Homo sapiens activator of S phase Kinase 

181 

36 . 522 

8911 

L28802 

Mus musculus zinc finger protein 

206 

31.343 

8912 

AJ243460 

Leishmania major proteophosphoglycan 

197 

28-854 

8913 

AF009829 

Mycobacterium bovis unknown 

183 

37.234 

8914 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

475 

38.871 

8915 

X68684 

Homo sapiens ZNF11B 

258 

39.370 

8916 

M64791 

Rattus norvegicus salivary proline-rich 
protein 

150 

34.127 

8917 

M17463 

Human papillomavirus type 5 ORF E4 from bp 
3285 to 4022; putative 

112 

30.058 

8918 

AL021086 

Unknown /prediction= (method : " "genef inder " " , 
version: ""084" ") ; 

/prediction= (method: " "genscan" " , ve 

1994 

52.736 

8919 

AL035289 

Homo sapiens hypothetical protein 

2986 

69. 146 

8920 

AC006232 

Arabidopsis thaliana putative proline-rich 
protein PRP2 precursor 

393 

28 .254 

8921 

U37263 

Homo sapiens KRAB zinc finger protein ; 
Method: conceptual translation supplied by 
author 

481 

40.529 

8922 

X02585 

Xenopus laevis unidentified open reading frame 
2 

249 

48.315 

8923 

U75308 

Homo sapiens TBP-associated factor 

232 

33 .209 

8924 

AF073770 

Homo sapiens carnitine octanoyltransf erase 

488 

98 . 750 

8925 

M15182 

Homo sapiens beta-glucuronidase precursor (EC 
3.2.1.31) 

743 

55.000 

8926 

AB023206 

Homo sapiens KIAA098 9 protein 

327 

50.000 

8927 

L10910 

Homo sapiens splicing factor 

947 

59. 170 

8928 

U88895 

Homo sapiens ORF derived from Dl leader region 
and integrase coding region 

189 

42.222 

8929 

M13100 

Rattus norvegicus unknown protein 

214 

47.312 

8930 

AL031581 

Unknown /prediction^ (method : " "genscan" " , 
version: ""1.0"", score: ""198.31"") ; 
/predict ion= (method: 

236 

25.786 

8931 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

319 

43.802 

8932 

M19419 

Mus musculus proline-rich salivary protein 

217 

43 .411 

8933 

A31038 

Nicotiana alata PRP3 

157 

44.304 

8934 

AF020261 

Santalum album proline rich protein 

201 

42 . 718 

8935 

U58755 

Caenorhabditis elegans C34D4.11 gene product 

175 

43 . 564 

O Q O C 

a y 3 b 

Z47357 

Caenorhabditis elegans cDNA EST EMBL: TO 08 22 
comes from this gene; cDNA EST EMBL: TO 082 3 
comes from this gene 

327 

43 . 382 

8937 

X83543 

Homo sapiens APXL 

547 

39. 852 

8938 

U54638 

Mus musculus rhotekin 

1149 

89.401 

8939 

AF047690 

Homo sapiens ATP-binding cassette protein M- 

225 

54.054 
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8940 

AF075575 

Homo sapiens dysferlin 

341 

37.563 

8941 

A31038 

Nicotiana alata PRP3 

155 

33.708 

8942 

AL033534 

Schizosaccharomyces pombe serine- rich protein 

226 

27.891 

8943 

U07974 

Gallus gallus unknown 

210 

33.816 

8944 

L22030 

Glycine max hydroxypro line -rich glycoprotein 

167 

31.613 

8945 

AC004997 

Homo sapiens match to ESTs Z43979 
(NID:g573097) , R19699 (NID : g774333 ) , T59198 
(NID:g661035) , and AA027979 (NID :gl494038) 

721 

60.476 

8946 

AF080070 

Mus musculus zinc finger protein 54 

236 

43 .220 

8947 

M92913 

Nephila clavipes dragline silk fibroin 

276 

30.651 

8948 

D90899 

Synechocystis sp. hypothetical protein 

392 

31.818 

8949 

AF071081 

Mycobacterium tuberculosis proline- rich mucin 
homolog 

314 

26 . 752 

8950 

U09413 

Homo sapiens zinc finger protein ZNF135 

1055 

49.547 

8951 

AF019236 

Dictyostelium discoideum TipD 

211 

26.250 

8952 

AF121009 

Mycobacterium tuberculosis H3 7Rv hypothetical 
protein Jv0534 

296 

35.122 

8953 

X76203 

Drosophila virilis major larval glue protein 

180 

31.098 

8954 

U41543 

Caenorhabditis elegans No definition line 
found 

262 

33 . 742 

8955 

Z68747 

Homo sapiens imogen 38 

697 

59.055 

8956 

AF045646 

Caenorhabditis elegans contains similarity to 
collagens 

126 

33 . 775 

8957 

AF018432 

Homo sapiens dUTPase 

464 

64.615 

8958 

U97553 

murine herpesvirus 6 8 unknown 

214 

28.668 

8959 

L27428 

Homo sapiens reverse transcriptase 

173 

26 .437 

8960 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

286 

75 . 000 

8961 

M12099 

Mus musculus proline -rich protein 

263 

33 .333 

8962 

L24521 

Homo sapiens transformation- related protein 

327 

64 . 045 

8963 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

371 

25.828 

8964 

AL050331 

Homo sapiens dJ486I3.4 (TSPY-like (testis 
specific protein, Y-linked like) ) 

1548 

94 . 981 

8965 

AJ388557 

Canis familiaris zinc finger protein 

358 

45.098 

8966 

U83303 

Homo sapiens line-1 reverse transcriptase 

269 

43.750 

8967 

X03145 

Homo sapiens pot. ORF II 

238 

48.421 

8968 

AB012223 

Canis familiaris ORF2 

268 

53 . 846 

8969 

U88154 

Homo sapiens proline and glutamic acid rich 
nuclear protein isoform 

731 

91.270 

8970 

AF164612 

Homo sapiens Gag protein 

272 

44 . 706 

8971 

AB012223 

Canis familiaris ORF2 

295 

40.426 

8972 

AL024499 

Caenorhabditis elegans cDNA EST EMBL:C1012 3 
comes from this gene 

244 

34.746 

8973 

X83413 

Human herpesvirus 6 U88 

685 

36 . 986 

8974 

L22343 

Homo sapiens nuclear phosphoprotein 

2191 

86 .410 

8975 

AB012223 

Canis familiaris ORF2 

361 

52 .212 

8976 

M64791 

Rattus norvegicus salivary proline -rich 
protein 

253 

35.638 

8977 

Z98204 

Hordeum vulgare extensin 

209 

28.652 

8978 

AL035423 

Homo sapiens dJ20I3.1 (brain mitochondrial 
carrier protein-1 (BMCP1) ) 

381 

51.007 

8979 

AJ132828 

Spermatozopsis similis p210 protein 

206 

30.047 

8980 

AF132856 

Homo sapiens suppressor of G2 allele of skpl 
homolog 

209 

64 .815 

8981 

U93872 

Kaposi's sarcoma-associated herpesvirus ORF 

109 

31.915 


.1. 0 U e «4tiS «••!!■• 3 ,„ O E! lib O 2 




73, contains large complex repeat CR 73 



8982 

X61294 

Rattus norvegicus LI retroposon, a portion of 
its ORF2 sequence 

276 

44 .521 

8983 

AL035461 

Homo sapiens dJ967N21.3 (novel protein similar 
to predicted worm, yeast and plant proteins) 

2526 

95.455 

8984 

D29956 

Homo sapiens This gene is similar to tre 
oncogene (X63547) . 

526 

63 .690 

8985 

AJ005282 

Homo sapiens NPR-Bi 

6054 

96 .677 

8986 

Z54141 

Saccharomyces cerevisiae unknown 

128 

31.250 

8987 

D67066 

Bos taurus N-WASP 

247 

41 .221 

8988 

U01849 

Trypanosoma brucei ORF1 

178 

31.034 

8989 

AL022537 

Arabidopsis thaliana putative protein 

193 

29.327 

8990 

AF062389 

Rattus norvegicus kidney-specific protein 

1344 

60.857 

8991 

AF104411 

Mus musculus neuronal -specif ic septin 3 

2395 

82 .452 

8992 

270208 

Caenorhabditis elegans predicted using 
Genefinder; similar to collagen 

246 

30.612 

8993 

M86699 

Homo sapiens kinase 

644 

65.934 

8994 

Z72807 

Saccharomyces cerevisiae ORF YGR023w 

171 

32 .895 

8995 

AF055904 

Myxo coccus xanthus unknown 

202 

29.694 

8996 

AB012223 

Canis familiaris ORF2 

199 

45.977 

8997 

AF071172 

Homo sapiens HERC2 

260 

79.245 

8998 

U60269 

Homo sapiens putative envelope protein; orf 
similar to env of Type A and Type B 
retroviruses and to class II HERVs 

216 

43 .333 

8999 

AF087573 

Homo sapiens DNA fragmentation factor DFF35 

427 

68 .613 

9000 

AL031174 

Schizosaccharomyces pombe hypothetical protein 

215 

40 .404 

9001 

AF017777 

Drosophila melanogaster tweety 

472 

30 .380 

9002 

U49973 

Homo sapiens ORF1 ; MER37; putative transposase 
similar to pogo element 

476 

41.573 

9003 

U97553 

murine herpesvirus 68 unknown 

242 

32 .783 

9004 

AF073344 

Homo sapiens ubiquitin- specif ic protease 3 

2084 

90.909 

9005 

AB028965 

Homo sapiens KIAA1042 protein 

979 

57 .621 

9006 

M69297 

Homo sapiens ORF 3 

136 

41 .096 

9007 

AL117589 

Homo sapiens hypothetical protein 

902 

59.480 

9008 

L29435 

Gallus gallus beta-5 tubulin 

2300 

95 .845 

9009 

X78928 

Homo sapiens zinc finger protein 

551 

64 .463 

9010 

L41827 

Homo sapiens sensory and motor neuron- derived 
factor 

1404 

98 .643 

9011 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

298 

79 .688 

9012 

AL035632 

Unknown /predict ion= (method : " "genefinder" " , 
version: " "084" " , score : " "113 . 36" " ) ; 
/prediction^ (meth 

673 

55 .435 

9013 

X94910 

Homo sapiens ERp28 

468 

74 .074 

9014 

X53476 

Mus musculus HMG-14 (AA 1-96) 

211 

79.167 

9015 

M15386 

Homo sapiens gamma -globin 

556 

71 .094 

9016 

S39127 

human, testis, Peptide, 331 aa cathepsin 
S=cysteine proteinase 

157 

67 .442 

9017 

AF115384 

Homo sapiens LR8 

279 

82 .456 

9018 

K02212 

Homo sapiens alpha-l-antitrypsin 

2458 

98 .223 

9019 

A12029 

Homo sapiens MRP- 14 

322 

51 .923 

9020 

AB028997 

Homo sapiens KIAA1074 protein 

189 

69 .811 

9021 

L13440 

Nicotiana tabacum cysteine-rich extensin- like 
protein-2 

202 

52 .459 

9022 

M15386 

Homo sapiens gamma -globin j 

898 

98 .621 

9023 

U10362 

Homo sapiens GP3 6b glycoprotein | 

506 

53 .741 
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9024 

AB011157 

Homo sapiens KIAA0585 protein 

332 

88.525 

9025 j 

Z32683 

Unknown cDNA EST EMBL:Z14902 comes from this 
gene; cDNA EST EMBL:M8 9155 comes from this 
gene ; cDNA 

711 

28.841 

9026 

AF000560 

Homo sapiens TTF-I interacting peptide 20; 
TIP20; Transcription Termination Factor I 
Interacting Peptide 20 

224 

52 .381 

9027 

AF171099 

Xenopus laevis Mi -2 his tone deacetylase 
complex protein 66 

648 

61 . 137 

9028 

AB018341 

Homo sapiens KIAA0798 protein 

1398 

62 .281 

9029 

AF026954 

Bos taurus pyruvate dehydrogenase phosphatase 
regulatory subunit precursor; PDPr 

1070 

71.374 

9030 

U93569 

Homo sapiens putative pl50 

256 

57 . 143 

9031 

AB017614 

Mus musculus OASIS protein 

276 

34 .091 

9032 

L10110 

Octopus dofleini alpha tubulin 

369 

74 .667 

9033 

AL023706 

Schizosaccharomyces pombe hypothetical protein 

358 

34 .300 

9034 

Z70683 

Unknown cDNA EST EMBL:T01585 comes from this 
gene; cDNA EST EMBL:D72333 comes from this 
gene ; cDNA 

750 

33 .613 

9035 

AB007198 

Agkistrodon blomhoffii siniticus phospholipase 
A2 inhibitor 

248 

37 . 143 

9036 

X80433 

Mus musculus tex2 92 

278 

80.597 

9037 

Y09945 

Rattus norvegicus putative integral membrane 
transport protein 

173 

56 .863 

9038 

X52574 

Mus musculus GTP binding protein 

2154 

84 .635 

9039 

U63840 

Rattus norvegicus nucleoporin p54 

1202 

69.381 

9040 

AB011169 

Homo sapiens KIAA0597 protein 

260 

43.011 

9041 

D14336 

Mus musculus RNA polymerase I associated 
factor (PAF53) 

697 

71.233 

9042 

AB029008 

Homo sapiens KIAA108 5 protein 

2554 

99.482 

9043 

M37190 

Homo sapiens ras inhibitor 

708 

40.431 

9044 

AL050262 

Homo sapiens hypothetical protein 

543 

39.922 

9045 

AF071059 

Mus musculus zinc finger RNA binding protein 

3718 

96 . 721 

9046 

X58636 

Mus musculus lymphoid enhancer factor 1 

1428 

94.009 

9047 

U05681 

Homo sapiens homologous to members of the I- 
kappa B family; protein binds NF-kappa B 
proteins 

329 

32.534 

9048 

AL117557 

Homo sapiens hypothetical protein 

1432 

83 .600 

9049 

AF125455 

Caenorhabditis elegans No definition line 
found 

231 

29.167 

9050 

X52063 

Escherichia coli orfB 

397 

42.553 

9051 

AL031770 

Homo sapiens dJ20Bll.l (ortholog of rat RSEC5 
(mammalian exocyst complex subunit) ) 

2189 

99.432 

9052 

AF132953 

Homo sapiens CGI -19 protein 

2422 

100.000 

9053 

U47024 

Mus musculus MEM3 

1159 

92 . 708 

9054 

AF007109 

Arabidopsis thaliana similar to yeast dcpl 

347 

34 .416 

9055 

U41534 

Caenorhabditis elegans No definition line 
found 

380 

31.724 

9056 

AF120499 

Homo sapiens DEMI protein 

764 

88 .406 

9057 

X89401 

Homo sapiens ribosomal protein L21 

350 

57 . 759 

9058 

Z82271 

Unknown Similarity to Mouse kinensin-like 
protein KIF4 (SW:P33174); CDNA EST EMBL:D27320 
comes from 

756 

42 .804 

9059 

M26312 

Oryctolagus cuniculus unknown protein 

204 

33.571 

9060 

AF077188 

Homo sapiens cullin 4A 

425 

98.571 
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9061 

AC005546 

Homo sapiens R2 942 5 1 

2564 

93 .271 

9062 

A52744 

unidentified unnamed protein product 

237 

37 .624 

9063 

L27428 

Homo sapiens reverse transcriptase 

399 

72 .043 

9064 

Z70783 

Caenorhabditis elegans cDNA EST yk575f9.3 
comes from this gene 

372 

38.150 

9065 

AL031579 

Schizosaccharomyces pombe hypothetical protein 

270 

25.993 

9066 

AF027219 

Homo sapiens ZNF2 02 beta 

431 

52 .857 

9067 

Z85986 

Homo sapiens dJ108K11.3 (similar to yeast 
suppressor protein SRP40) 

1223 

96 .296 

9068 

U88587 

Nicotiana alata 120 kDa style glycoprotein 

303 

35.622 

9069 ! 

D90868 

Escherichia coli GLUCOKINASE (EC 2.7.1.2). 

923 

98.639 | 

9070 

AL035461 

Homo sapiens dJ967N21.3 (novel protein similar 
to predicted worm, yeast and plant proteins) 

3064 

99.786 

9071 

AL096749 

Homo sapiens truncated by frame shifts- 
corresponding STS: EMBL:G37487 

274 

26.433 

9072 

U73379 

Homo sapiens cyclin-selective ubiquitin 
carrier protein 

787 

79.762 

9073 

AL078630 

Mus musculus 573K1.15 (mml7Ml-6 (novel 7 
transmembrane receptor (rhodopsin family) 
(olfactory receptor LIKE) protein) ) 

669 

86 .667 

9074 

Z81505 

Unknown similar to Zinc finger, C3HC4 type 
(RING finger); cDNA EST EMBL:D28025 comes from 
this gene 

486 

34 .672 

9075 

AF132956 

Homo sapiens CGI-22 protein 

2064 

98 .433 

9076 

AB025793 

Bos taurus casein kinase I-alpha 

149 

71.429 

9077 

L29457 

Mus musculus dynamin 

423 

60.484 

9078 

S60885 

Mus sp. LYAR=cell growth regulating nucleolar 
protein 

1188 

74 .532 

9079 

L05147 

Homo sapiens phosphatase tyrosine/serine 

212 

45 . 783 

9080 

AF126736 

Homo sapiens ubiquitin processing protease 

2997 

98 .468 

9081 

AL080125 

Homo sapiens hypothetical protein 

442 

75.510 

9082 

U64856 

Caenorhabditis elegans weak similarity to TPR 
domains 

583 

44 .845 

9083 

AL022117 

Schizosaccharomyces pombe hypothetical protein 

289 

38 .621 

9084 

AL110241 

Homo sapiens hypothetical protein 

1458 

97.510 

9085 

AJ005698 

Homo sapiens poly (A) - specif ic ribonuclease 

712 

82 .639 

9086 

AF117897 

Bos taurus rabll binding protein 

1302 

82.609 

9087 

AB033168 

Mus musculus nuclear protein ZAP 

1451 

93 . 627 

9088 

M58378 

Homo sapiens synapsin I 

1816 

97 .872 

9089 

AF051098 

Mus musculus seven transmembrane domain orphan 
receptor 

1745 

92.000 

9090 

AJ131693 

Homo sapiens AKAP4 50 protein 

1238 

97.500 

9091 

AF183910 

Rattus norvegicus frizzled receptor 4 

915 

96.947 

9092 

AF067608 

Caenorhabditis elegans No definition line 
found 

57 

42 .857 

9093 

X76717 

Homo sapiens MT-ll protein 

200 

75 . 610 

9094 

M64488 

Rattus norvegicus synaptotagmin II 

306 

73 . 973 

9095 

AF000422 

Homo sapiens TTF-I interacting peptide 5 

439 

41 . 714 

9096 

AF125569 

Homo sapiens tumor suppressing STF cDNA 6 

187 

47 .674 

9097 

A18812 

Brass ica napus extensin 

324 

32 . 143 

9098 

U73682 

Homo sapiens meningioma -expressed antigen 11 

196 

29.703 

9099 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

199 

36.000 

9100 

AF180920 

Homo sapiens cyclin ania-6a 

1370 

90.361 

9101 

AP000063 

Aeropyrum pernix l75aa long hypothetical 

158 

35.780 
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9102 

D88315 

Mus musculus tetracycline transporter- like 
protein 

553 

71.901 

9103 

AF051400 

Gallus gallus fibulin-1, isoform C precursor 

241 

47.500 

9104 

X92521 ! 

Homo sapiens MMP-19 (matrix metalloproteinase) 

600 

41.356 

9105 

AF106702 

Mus musculus testis -enriched protein tyrosine 
phosphatase 

457 

32.646 

9106 

Y17113 ! 

Xenopus laevis ribosomal protein L24 

1009 

68 .037 

9107 

AL031447 

Homo sapiens dJ126A5.1.2 (novel DnaJ domain 
protein) (isoform 2) 

427 

92.754 

9108 

M34024 

Homo sapiens immunoglobulin heavy chain 

734 

82 .270 

9109 

X59727 

Homo sapiens 63kDa protein kinase 

2621 

98.765 

9110 

D90741 

Escherichia coli ORF2 

378 

52 .055 

9111 

AF091624 

Drosophila melanogaster Pelle associated 
protein Pellino 

840 

55 .285 

9112 

AF125158 

Homo sapiens zinc finger DNA binding protein 
99 

4417 

100.000 

9113 

X07311 

Drosophila melanogaster heat shock protein 

197 

40.000 

9114 

Z83826 

Homo sapiens dJ473B4.1 (novel protein similar 
to predicted human and worm genes) 

494 

72.131 

9115 

X03145 

Homo sapiens pot. ORF III 

274 

59.000 

9116 

AL050008 

Homo sapiens hypothetical protein 

1449 

88.281 

9117 

Z49909 

Caenorhabditis elegans weak similarity with a 
B. Flavum translocation protein (Swiss Prot 
accession number P38376) ; cDNA EST yk220el0.5 
comes from this gene; cDNA EST yk549el2.3 
comes from this gene; cDNA EST yk618d6.3 comes 
from this gene 

584 

38.839 

9118 

X63546 

Homo sapiens oncogene 

2573 

80.467 

9119 

AJ223782 

Mus musculus CDC10 

1675 

91.958 

9120 

X85991 

Mus musculus semaphorin B 

1074 

78.641 

9121 

M55264 

Saimiriine herpesvirus 2 gene products 

125 

42.857 

9122 

AB028997 

Homo sapiens KIAA1074 protein 

602 

56.970 

9123 

X76104 

Homo sapiens DAP-kinase 

5061 

100.000 

9124 

X58826 

Drosophila melanogaster RNA polymerase III 
second- largest subunit 

517 

69. 912 

9125 

D89821 

Mus musculus RhoM 

766 

53 .202 

9126 

AF092091 

Rattus norvegicus cp431 

1700 

82 . 647 

9127 

D87908 

Mus musculus nuclear protein np95 

401 

47.134 

9128 

U38979 

Homo sapiens hPMSR3 

1141 

94.444 

9129 

X51760 

Homo sapiens zinc finger protein (583 AA) 

373 

59.292 

9130 

Y12713 

Mus musculus Pro-Pol-dUTPase polyprotein 

338 

69.620 

9131 

AF098796 

Mus musculus SLM-1 

1278 

96 .618 

9132 

Z81030 

Caenorhabditis elegans similar to citrate 
lyase beta chain; cDNA EST yk302b4.5 comes 
from this gene 

255 

45.000 

9133 

AF098863 

Mus musculus Ets -protein Spi-C 

459 

46 . 199 

9134 

U95097 

Xenopus laevis mitotic phosphoprotein 43 

611 

71 . 523 

9135 

AF000413 

Plasmodium berghei merozoite surface protein- 1 

153 

33 .621 

9136 

Z82268 

Unknown cDNA EST yk338gl0.5 comes from this 
gene; cDNA EST EMBL:D27934 comes from this 
gene; cDNA E 

801 

38 .074 

9137 

U62940 

Rattus norvegicus mt-GrpE#l precursor 

258 

60.204 

9138 

M13100 

Rattus norvegicus unknown protein 

255 

50.685 

9139 

X67155 

Homo sapiens mitotic kinase- like protein- 1 

4436 

98.844 
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9140 

U83303 

Homo sapiens line-1 reverse transcriptase 

356 

58.416 

9141 

U79260 

Homo sapiens unknown 

335 

69.048 

9142 

AF022821 

Mus musculus putative potassium channel DP4 

336 

52 . 885 

9143 

AF055077 

Homo sapiens zinc finger protein 42 

932 

86 . 061 

9144 

U16790 

Mus musculus putative collagen alpha -2 (XI) 
chain 

212 

32 . 716 

9145 

AF071172 

Homo sapiens HERC2 

324 

57 . 009 

9146 

Y17 793 

Mus musculus Duttl protein 

644 

87 . 156 

9147 

X98259 

Homo sapiens M-phase phosphoprotein 8 

380 

55 . 285 

9148 

X85124 

Mus musculus PACSIN 

435 

87 . 342 

9149 

U64833 

Caenorhabditis elegans B0507.2 gene product 

888 

40.000 

9150 

M93017 

Rattus norvegicus , gene product 

1094 

92 . 228 

9151 

AB026190 

Homo sapiens Kelch motif containing protein 

675 

32.159 

9152 

AJ388555 

Canis familiaris hypothetical protein 

1163 

88 .559 

9153 

AB002312 

Homo sapiens KIAA0314 

597 

39.384 

9154 

AF055636 

Homo sapiens leucine-rich glioma- inactivated 
protein precursor 

723 

53.000 

9155 

D50930 

Homo sapiens The KIAA014 0 gene product is 
novel . 

198 

31.217 

9156 

AF006492 

Mus musculus FOG 

1384 

68.652 1 

9157 

AF043179 

Homo sapiens T cell receptor beta chain 

758 

58.768 

9158 

D86984 

Homo sapiens similar to yeast adenylate 
cyclase (S56776) 

1436 

63.772 

9159 

X55777 

Homo sapiens put . ORF 

351 

70.513 

9160 

AF076167 

Rattus norvegicus UDP-GalNAc : polypeptide N- 
acetylgalactosaminyl transferase T6 

1118 

91.892 

9161 

U67549 

Methanococcus jannaschii spore coat 
polysaccharide biosynthesis protein E 

675 

35.770 

9162 

AL022325 

Homo sapiens fF27C3.1.1 (protein similar to C. 
elegans protein B0035.16) (isoform 1) 

922 

75.115 

9163 

AL031266 

Caenorhabditis elegans VM106R.1 

275 

42.105 

9164 

U00051 

Caenorhabditis elegans coded for by C. elegans 
cDNA yk50b2.5; coded for by C. elegans cDNA 
CEESV26F; similar to lipases over a short 
region 

827 

31 . 369 

9165 

AB007898 

Homo sapiens KIAA04 38 

198 

80. 556 

9166 

AF072509 

Rattus norvegicus glutamate receptor 
interacting protein 2 

826 

84 .472 

9167 

U40942 

Caenorhabditis elegans No definition line 
found 

759 

45.956 

9168 

AB02064 9 

Homo sapiens KIAA0842 protein 

352 

84 . 932 

9169 

D80009 

Homo sapiens KIAA0187 

864 

93 .750 

9170 

AF155110 

Homo sapiens NY- REN- 4 5 antigen 

1914 

51 . 565 

9171 

U88165 

Caenorhabditis elegans No definition line 
found 

616 

48.374 

9172 

L04159 

Plasmodium falciparum 3* end., gene product 

204 

27.723 

9173 

AF022 985 

Caenorhabditis elegans No definition line 
round 

465 

43 .478 

9174 

AF098505 

Caenorhabditis elegans similar to Arabidopsis 
uric*. j_ lei i let rn<s.xe sceriiity protein a \ow :yuooyi/ 

525 

37 . 379 

9175 

AL021997 

Homo sapiens dJ874C20.1 (Zinc Finger Protein 
ZFP47 LIKE) 

266 

59.770 

9176 

D80006 

Homo sapiens KIAA0184 

738 

99.138 

9177 

M23613 

Homo sapiens nucleophosmin 

737 

93 .600 

9178 

L15309 

Homo sapiens zinc finger protein 

250 

77 . 778 
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9179 

U47619 

Drosophila melanogaster ovary2 

1262 

42 .715 

9180 

AB014536 

Homo sapiens KIAA0636 protein 

803 

47 .482 

9181 

U80741 

Homo sapiens CAGH44 

704 

68 .478 

9182 

U71363 

Homo sapiens zinc finger protein zfp6 

887 

78 .409 

9183 

X85786 

Homo sapiens binding regulatory factor 

358 

49.600 

9184 

L11275 

Saccharomyces cerevisiae selected as a weak 
suppressor of a mutant of the subunit AC4 0 of 
DNA dependant RNA polymerase I and III 

180 

27 .317 

9185 

AF177203 

Homo sapiens cerebral cell adhesion molecule 

492 

72 .414 

9186 

D79994 

Homo sapiens similar to ankyrin of Chromatium 
vinosum. 

305 

37 .453 

9187 

AF041083 

Rattus norvegicus RoBo-1 

189 

29.710 

9188 

AF099742 

Rattus norvegicus putative short -chain 
dehydrogenase/reductase 

921 

73 .575 

9189 

AF035209 

Mus musculus putative v- SNARE Vtila 

843 

95 .238 

9190 

AF181655 

Drosophila melanogaster BcDNA. LD2 7873 

298 

42 . 754 

9191 

M29852 

Oryctolagus cuniculus cytochrome P-450 isozyme 
5 

479 

50 .340 

9192 

AF151820 

Homo sapiens CGI- 62 protein 

994 

97 .297 

9193 

AB007876 

Homo sapiens KIAA0416 

1001 

56 .554 

9194 

U38980 

Homo sapiens hPMSR6 

340 

64 .634 

9195 

U26397 

Rattus norvegicus inositol polyphosphate 4- 
phosphatase 

384 

28 .378 

9196 

AE001058 

Archaeoglobus fulgidus adenylate kinase (adk) 

178 

31.034 

9197 

L05147 

Homo sapiens phosphatase tyrosine/serine 

242 

45 .833 

9198 

AF126746 

Mus musculus zinc finger protein splice 
variant FIZ1-A 

474 

42 .021 

9199 

M27685 

Mus musculus ultra -high sulphur keratin 

432 

47 . 934 

9200 

AF064604 

Homo sapiens KE03 protein 

2308 

99.717 

9201 

AF036145 

Homo sapiens meningioma- expressed antigen 5 

1458 

99 . 099 

9202 

AF184919 

Rattus norvegicus artemin 

142 

40 . 000 

9203 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

357 

76 .471 

9204 

AB002803 

Homo sapiens BACH1 

324 

38 .571 

9205 

X90849 

Gallus gallus polybromo 1 protein 

2847 

88 .200 

9206 

AL080125 

Homo sapiens hypothetical protein 

982 

64 . 082 

9207 

AB007898 

Homo sapiens KIAA04 38 

198 

80.556 

9208 

AF155352 

Mus musculus ankyrin repeat -containing protein 
Asb-1 

374 

41 .667 

9209 

AL050110 

Homo sapiens hypothetical protein 

4147 

100. 000 

9210 

S58722 

Homo sapiens X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 

179 

80 . 000 

9211 

AF177942 

Xenopus laevis katanin p60 

655 

48 .918 

9212 

AL080125 

Homo sapiens hypothetical protein 

540 

61 . 074 

9213 

AF016508 

Mus musculus C- terminal binding protein 2 

1107 

79 . 916 

9214 

AB018308 

Homo sapiens KIAA076 5 protein 

400 

32 .530 

9215 

U37263 

Homo sapiens KRAB zinc finger protein; 
Method: conceptual translation supplied by 
author 

548 

39 .464 

9216 

M14268 

Homo sapiens T-cell receptor V-region (V-D-J) 

736 

92 . 742 

x / 
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Homo sapiens KIAA0694 protein 

523 

60 . 140 

9218 

AF180920 

Homo sapiens cyclin ania-6a 

1016 

70 . 044 

9219 

X01655 

Homo sapiens type III procollagen (aa 8 92- 
1023) 

143 

39 .231 

9220 

U33007 

Saccharomyces cerevisiae Ydr449cp; CAI : 0.18 

337 

31 .959 

9221 

AJ005073 

Mus musculus Alix 

1066 

95 .455 
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9222 

AL110151 

Homo sapiens hypothetical protein 

240 

38 . 760 

9223 

D66904 

Homo sapiens suppressor for yeast mutant 

436 

77 . 778 

9224 

U58332 

Mus musculus receptor tyrosine kinase 

362 

81.159 

9225 

AF092091 

Rattus norvegicus cp4 31 

215 

71 . 154 

9226 

AF106682 ! 

Homo sapiens spindlin 

688 

100. 000 

9227 

AJ131730 

Homo sapiens DREAM protein 

205 

41.111 

9228 

AB025259 

Mus musculus granuphilin-b 

405 

40 . 123 

9229 

AB002336 

Homo sapiens KIAA0338 

1185 

84 . 163 

9230 

X57206 

Homo sapiens lD-myo- inositol - trisphosphate 3- 
kinase 

1194 

87.081 

9231 

AF002197 

Caenorhabditis elegans short region of weak 
similarity to protein kinase C; contains 
similarity to Pfam domain PF00130 (DAG_PE- 
bind) , Score=10.0, E- value=0 . 0034 , N=l 

453 

34 .061 

9232 

M63963 

Rattus norvegicus H , K-ATPase catalytic subunit 

285 

91 .667 

9233 

AJ388557 

Canis familiaris zinc finger protein 

327 

62 .366 

9234 

AC006264 

Arabidopsis thaliana unknown protein 

802 

43 .408 

9235 

X98507 

Homo sapiens myosin I beta 

2935 

97.650 

9236 

AF152963 

Rattus norvegicus NADH/NADPH mitogenic oxidase 
subunit p65-mox 

319 

36 .216 

9237 

U89280 

Rattus norvegicus oxidative 17 beta 
hydroxys teroid dehydrogenase type 6 

418 

69 .697 

9238 

Z21507 

Homo sapiens human elongation f actor- 1- delta 

511 

72 .308 

9239 

Z73619 

Saccharomyces cerevisiae ORF YPL263c 

438 

34 . 014 

9240 

AC004484 

Arabidopsis thaliana hypothetical protein 

611 

42 . 029 

924X 

Z83760 

Ciona intestinalis COS41.5 

181 

33 .333 

9242 

AF061025 

Homo sapiens leucine zipper-EF-hand containing 
transmembrane protein 1 

1628 

99.184 

9243 

J02638 

Escherichia coli polynucleotide phosphorilase 

601 

44 . 091 

9244 

AB013607 

Mus musculus c2 9 

671 

62 . 983 

9245 

AJ243310 

Homo sapiens C14orf3 protein 

357 

36 . 910 

9246 

X14046 

Homo sapiens CD3 7 (AA 1-244) 

217 

57 . 831 

9247 

M69181 

Homo sapiens non-muscle myosin B 

9236 

99.384 

9248 

X00947 

Homo sapiens alpha 1 antichymotrypsin 

275 

58.889 

9249 

AF104246 

Gallus gallus enhancer of f ilamentation 1 
homo log 

319 

39.490 

9250 

L24521 

Homo sapiens transformation- related protein 

227 

51.220 

9251 

AF021231 

Mus musculus acetyl cholinesterase-associated 
collagen 

177 

30.833 

9252 

A31036 

Nicotiana alata PRP2 

197 

36.275 

9253 

X80433 

Mus musculus tex2 92 

278 

80.597 

9254 

X61047 

Hydra sp. mini -collagen 

183 

43.038 

9255 

AF118838 

Homo sapiens citrin; adult -onset type II 
citrullinemia protein 

1574 

99.605 

9256 

D90869 

Escherichia coli similar to 

1633 

97.048 

9257 

AF114817 

Homo sapiens KRAB-zinc finger protein SZF1-2 

271 

82.143 

9258 

M15530 

Homo sapiens B-cell growth factor 

156 

55.102 

9259 

M15386 

Homo sapiens gamma -globin 

476 

67.241 

9260 

X55777 

Homo sapiens put . ORF 

157 

50.877 

9261 

V00488 

Homo sapiens alpha globin 

364 

74.684 

9262 

U36771 

Rattus norvegicus sn-glycerol 3 -phosphate 
acyl transferase 

226 

60.714 

9263 

L10910 

Homo sapiens splicing factor 

161 

52.941 

9264 

AB019438 

Homo sapiens immunogloblin heavy chain 
variable region 

506 

82.105 
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9265 

X53777 

Homo sapiens putative ribosomal protein (AA 1- 
184) 

513 

69.369 

9266 

X03725 

Mus musculus ORF 2 (466 aa) 

183 

42.466 

9267 

AF115435 

Rattus norvegicus syntaxin 17 

236 

90.244 

9268 

L38717 

Rattus norvegicus titin 

345 

65.000 

9269 

U23452 

Caenorhabditis elegans No definition line 
found 

224 

29.921 

9270 

AF134983 

Mus musculus energy- dependent regulator of 
proteolysis 

286 

62 . 857 

9271 

D10653 

Homo sapiens cell surface glycoprotein 

393 

66.990 

9272 

L07955 

Bos taurus factor activating exoenzyme S 

582 

72.656 

9273 

J05459 

Homo sapiens glutathione transferase M3 

597 

75.424 

9274 

AB000910 

Sus scrofa ribosomal protein 

356 

89.286 

9275 

L07599 

Homo sapiens ribosomal protein S6 kinase 3 

550 

81.188 

9276 

A12029 

Homo sapiens MRP- 14 

534 

71.296 

9277 

U01317 

Homo sapiens G- gamma globin 

287 

58 . 140 

9278 

AC002550 

Homo sapiens Unknown gene product 

181 

51.724 

9279 

AF028808 

Mus musculus hemin- sensitive initiation factor 
2 alpha kinase 

2818 

81.714 

9280 

X56411 

Homo sapiens alcohol dehydrogenase 

230 

87 . 179 

9281 

M15386 

Homo sapiens gamma -globin 

295 

61.039 

9282 

V00488 

Homo sapiens alpha globin 

228 

94.595 

9283 

AB025026 

Homo sapiens brain carboxylesterase hBrl 

242 

87.805 

9284 

M15386 

Homo sapiens gamma-globin 

455 

70 . 588 

9285 

M15386 

Homo sapiens gamma -globin 

199 

91.176 

9286 

M15386 

Homo sapiens gamma -globin 

382 

67.010 

9287 

U14966 

Homo sapiens ribosomal protein L5 

1157 

96 .667 

9288 

V01514 

Homo sapiens reading frame AFP 

460 

69.725 

9289 

AF115435 

Rattus norvegicus syntaxin 17 

236 

90.244 

9290 

X02515 

Homo sapiens T-cell receptor beta 1 chain 

209 

53 .425 

9291 

X00497 

Homo sapiens putative p3 3 

294 

54 . 167 

9292 

AL035593 

Homo sapiens dJ310J6.1 (novel protein) 

164 

61 . 905 

9293 

M15386 

Homo sapiens gamma -globin 

491 

73 . 786 

9294 

AF079873 

Rattus norvegicus splicing factor 1 homolog 

252 

49.451 

9295 

AF150089 

Homo sapiens small zinc finger- like protein 

163 

56.818 

9296 

M15386 

Homo sapiens gamma -globin 

333 

55.670 

9297 

M15386 

Homo sapiens gamma -globin 

415 

65 . 000 

9298 

V00488 

Homo sapiens alpha globin 

370 

90.323 

9299 

M15386 

Homo sapiens gamma -globin 

205 

53 .030 

9300 

D13891 

Homo sapiens Id-2H 

38 

32.000 

9301 

AF016370 

Homo sapiens U4/U6 small nuclear 
ribonucleoprotein hPrp3 

1798 

91.613 

9302 

X55777 

Homo sapiens put . ORF 

248 

62 . 963 

9303 

X17617 

Mus musculus zinc finger protein (AA 1-580) 

661 

54 .023 

9304 

U80955 

Caenorhabditis elegans contains a domain found 
in band 4.1, ezrin, moesin, radixin and talin 

822 

48.561 

9305 

X55777 

Homo sapiens put. ORF 

319 

58 . 537 

9306 

M13100 

Rattus norvegicus unknown protein 

158 

39.241 

9307 

M36899 

Cricetulus griseus uridine diphosphate N- 
acetyl D-glucosamine dolichol phosphate N- 
acetyl glucosamine -1 phosphate transferase 

1050 

91.765 

9308 

A67508 

Mus musculus MUS MUSCULUS GENOMIC DNA 
CONTAINING THE FV1 GENE. 

405 

44 .253 

9309 

AF143241 

Mus musculus ADP-ribosylation factor- like 
protein 3 

241 

37.838 


<?iu> 


,:i U U B «+6 4-3; . O E! 5 6 O 22 


9310 

D37885 

Rattus norvegicus choline kinase R2 

304 

41 .401 

9311 

AF126062 

Homo sapiens Arf-like 2 binding protein BART1 

292 

55 . 952 

9312 

U38979 

Homo sapiens hPMSR3 

177 

35.354 

9313 

U60269 

Homo sapiens putative envelope protein; orf 
similar to env of Type A and Type B 
retroviruses and to class II HERVs 

450 

77.907 

9314 

AF045641 

Caenorhabditis elegans No definition line 
found 

1008 

44 . 789 

9315 

U54644 

Homo sapiens tub homo log 

600 

65.922 

9316 

S36219 

Homo sapiens prostaglandin G/H synthase, 
PGG/HS 

204 

73 .469 

9317 

X55777 

Homo sapiens put . ORF 

248 

62 . 963 

9318 

AF055985 

Onchocerca volvulus pyrrol idone- rich antigen 

117 

28 .395 

9319 

AF043724 

Homo sapiens hepatitis A virus cellular 
receptor 1 

1578 

77.640 

9320 

U78294 

Homo sapiens 15S- lipoxygenase 

193 

54 .237 

9321 

AF037402 

Bos taurus butyrophilin 

253 

42 .308 

9322 

D90903 

Synechocystis sp. hypothetical protein 

211 

51.667 

9323 

AF017368 

Mus musculus faciogenital dysplasia protein 2 

299 

78.182 

9324 

U66220 

Nannocystis exedens unknown 

166 

32.584 

9325 

Z30425 

Homo sapiens orphan nuclear hormone receptor 

332 

92.308 

9326 

U87965 

Mus musculus putative G-protein 

972 

41.667 

9327 

Y08685 

Homo sapiens serine palmitoyltransferase, 
subunit I 

958 

94 . 737 

9328 

Z47072 

Caenorhabditis elegans similar to acid 
phosphatase 

722 

36 .607 

9329 

L27428 

Homo sapiens reverse transcriptase 

220 

43 . 836 

9330 

A22096 

Homo sapiens plasminogen 

854 

90 . 845 

9331 

D63477 

Homo sapiens The KIAA014 3 gene product is 
related to a putative C. elegans gene encoded 
on cosmid C32D5 . 

1332 

98 . 030 

9332 

AF115435 

Rattus norvegicus syntaxin 17 

236 

90.244 

9333 

AF008203 

Homo sapiens homeobox protein 

442 

42 . 424 

9334 

AF067972 

Homo sapiens DNA cytosine methyl transferase 3 
alpha 

1036 

82.902 

9335 

AJ243460 

Leishmania major proteophosphoglycan 

212 

34 .653 

9336 

Z80220 

Unknown similar to nucleotide binding protein ; 
cDNA EST EMBL:M758 97 comes from this gene; 
cDNA EST 

433 

50 . 806 

9337 

AC009322 

Arabidopsis thaliana Hypothetical protein 

232 

50.704 

9338 

AL023284 

Homo sapiens dJ406A7.2.1 (Microtuble 
Associated Protein E-MAP-115) 

401 

59.804 

9339 

AF055077 

Homo sapiens zinc finger protein 42 

469 

63.025 

9340 

Y11354 

Homo sapiens subunit of RNA polymerase II 
transcription factor TFIID 

215 

29.646 

9341 

AP000001 

Pyrococcus horikoshii 3 05aa long hypothetical 
L- asparaginase 

173 

44.928 

9342 

AB020690 

Homo sapiens KIAA0883 protein 

179 

50.000 

9343 

AF045640 

Caenorhabditis elegans No definition line 
found 

665 

45.378 

9344 

AF173378 

Homo sapiens 60S acidic ribosomal protein PO 

442 

89.333 

9345 

AC005328 

Homo sapiens R26660 1, partial CDS 

694 

73 .600 

9346 

AF022465 

Mus musculus high mobility group protein 
homo log HMG4 

492 

83.333 

9347 

X66357 

Homo sapiens serine/threonine protein kinase 

257 

91 . Ill 
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9348 

AL024499 

Caenorhabditis elegans cDNA EST EMBL:C10123 
comes from this gene 

452 

35.961 

9349 

AF076167 

Rattus norvegicus UDP-GalNAc : polypeptide N- 
acetylgalactosaminyltransf erase T6 

48 

23 .404 

9350 

AB014555 

Homo sapiens KIAA0655 protein 

965 

87.952 

9351 

AB014074 

Rattus norvegicus semaphorin Y short isoform 

668 

86 .441 

9352 

D89103 

Schizosaccharomyces pombe unnamed protein 
product 

211 

46.602 

9353 

Y10392 

Human endogenous retrovirus K gag protein 

335 

41.026 

9354 

AF176514 

Mus musculus E2F1- inducible protein 

169 

38.636 

9355 

AB024075 

Homo sapiens B120 

394 

50.000 

9356 

Z14310 

Human endogenous retrovirus tripartite fusion 
transcript PLA2L 

170 

59.574 

9357 

U55366 

Caenorhabditis elegans Similar to cuticle 
collagen 

186 

29. 775 

9358 

L29028 

Unknown amino acid feature: N-glycosylation 
sites, aa 41 . . 43, 46 . . 48, 51 . . 53 , 72 . . 
74, 107 . 

147 

30.345 

9359 

AF022985 

Unknown Similar to collagen; coded for by C. 
elegans cDNA yk55f3.3; coded for by C. elegans 
cDNA 

187 

29.730 

9360 

U56964 

Caenorhabditis elegans weak similarity to S. 
cerevisiae intracellular protein transport 
protein US) 1 (SP:P25386) 

371 

31. 939 

9361 

M26312 

Oryctolagus cuniculus unknown protein 

161 

40.000 

9362 

AL050156 

Homo sapiens hypothetical protein 

547 

49. 756 

9363 

S58722 

Homo sapiens X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 

204 

58.462 

9364 

AL021481 

Caenorhabditis elegans cDNA EST EMBL:T01200 
comes from this gene; cDNA EST EMBL:D72618 
comes from this gene; cDNA EST yk343c3.3 comes 
from this gene; cDNA EST EMBL:Z14 821 comes 
from this gene 

395 

31.050 

9365 

AC005498 

Homo sapiens R31665 2 

305 

56 . 098 

9366 

AJ011523 

Caenorhabditis elegans CHE-2 protein 

405 

41.135 

9367 

U49974 

Homo sapiens mariner transposase 

227 

55.422 

9368 

S61069 

Homo sapiens reverse transcriptase homolog=pol 
{retroviral element} 

291 

86 .275 

9369 

X55777 

Homo sapiens put. ORF 

332 

64 .557 

9370 

AF010314 

Homo sapiens PiglO 

223 

35.593 

9371 

X07858 

Saccharomyces cerevisiae ORF (1 is 3rd base in 
codon) (266 is 1st base in codon) 

109 

41.667 

9372 

U41020 

Caenorhabditis elegans coded for by C. elegans 
cDNA ykl00g4.5; coded for by C. elegans cDNA 
ykl00g4.3; weakly similar to human SREBP-2 
basic -helix- loop- helix- leucine zipper 
transcription factor 

407 

50.450 

9373 

U37263 

Homo sapiens KRAB zinc finger protein ,- 
Method: conceptual translation supplied by 
author 

222 

73.333 

9374 

U50193 

Unknown coded for by C. elegans cDNA 
CEMSG95FB; coded for by C. elegans cDNA 
CEMSG95RB; coded for b 

255 

33 .113 

9375 

AB020709 

Homo sapiens KIAA0902 protein 

624 

70.896 

9376 

AL021571 

Caenorhabditis elegans predicted using 

312 

36.943 
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9377 

AF050640 

Homo sapiens NADH- ubiquinone oxidoreductase 
NDUFS2 subunit 

211 

91 .429 

9378 

AL031432 

Homo sapiens dJ4 6 5N24.2.1 (PUTATIVE novel 
protein) (isoform 1) 

226 

37.838 

9379 

AC002291 

Arabidopsis thaliana Similar ATP -dependent RNA 
Helicase 

1818 

42 .125 

9380 

U53585 

Mycobacterium avium fibronectin attachment 
protein 

136 

37.179 

9381 

AJ243460 

Leishmania major proteophosphoglycan 

265 

36 .404 

9382 

AF116268 

Mus musculus G-protein XLAS 

157 

29.570 

9383 

AF090867 

Rattus norvegicus guanos ine monophosphate 
reductase 

976 

67.727 

9384 

U49974 

Homo sapiens mariner transposase 

348 

70 . 000 

9385 

L20319 

Rattus norvegicus developmentally regulated 
protein 

1188 

63 .869 

9386 

X66285 

Mus musculus HC1 ORF 

182 

44 .828 

9387 

Y00638 

Homo sapiens LCA (AA -23 to 12 81) 

322 

76 .389 

9388 

M31013 

Homo sapiens nonmuscle myosin heavy chain 
(NMHC) 

173 

73 .529 

9389 

Z70208 

Caenorhabditis elegans predicted using 
Genef inder; similar to collagen 

180 

30.000 

9390 

AF055995 

Homo, sapiens thyroid hormone receptor- 
associated protein complex component TRAP100 

646 

80 .645 

9391 

AF045646 

Caenorhabditis elegans contains similarity to 
collagens 

245 

32 .143 

9392 

U41021 

Caenorhabditis elegans contains two LIM 
domains 

240 

61.111 

9393 

AF022465 

Mus musculus high mobility group protein 
homolog HMG4 

492 

83 .333 

9394 

X53581 

Rattus norvegicus ORF2 

181 

55.814 

9395 

D90710 

Escherichia coli Hypothetical protein HI1731 

1041 

91.515 

9396 

U00039 

Escherichia coli overlaps end of previous orf 

566 

93 .258 

9397 

D64006 

Synechocystis sp. asparaginyl- tRNA synthetase 

997 

42 .359 

9398 

Z74036 

Caenorhabditis elegans predicted using 
Genef inder; similar to collagen 

176 

32 .847 

9399 

X89715 

Saccharomyces cerevisiae AOF1001 

221 

32 .616 

9400 

X99302 

Homo sapiens Popl 

762 

96 .610 

9401 

AC003682 

Homo sapiens ZNF134 

827 

51 .429 

9402 

Y17918 

Drosophila melanogaster CRAG protein 

1790 

48 .253 

9403 

X94976 

Brassica napus cell wall -plasma membrane 
linker protein 

250 

27 .551 

9404 

AF146396 

Homo sapiens soluble liver antigen/liver 
pancreas antigen 

234 

100.000 

9405 

AF054180 

Homo sapiens hematopoietic cell derived zinc 
finger protein 

753 

71.338 

9406 

AJ388555 

Canis familiaris hypothetical protein 

735 

67 . 832 

9407 

AF024636 

Homo sapiens STE20-like kinase 3 

681 

87 . 500 

9408 

D49835 

Homo sapiens DNA- binding protein 

253 

97.368 

9409 

AC006284 

Arabidopsis thaliana putative ankyrin 

205 

40 .659 

9410 

U09366 

Homo sapiens zinc finger protein ZNF133 

428 

60 .360 

9411 

A58792 

unidentified unnamed protein product 

201 

90.625 

9412 

U42208 

Oryza sativa OSBZ8 

82 

64 .706 

9413 

AF118838 

Homo sapiens citrin; adult-onset type II 
citrullinemia protein 

401 

90 .141 
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9414 

U27121 

Danio rerio G12 

212 

73 .810 

9415 

X99145 

Canis familiaris overexpressed in thyroid 
tissue after TSH stimulation 

608 

92.000 

9416 

J03535 

Mus musculus embigin precursor 

167 

60.000 

9417 

U35022 

Rattus norvegicus cis-Golgi matrix protein 
GM130 

681 

68.639 

9418 

X75090 

Homo sapiens PHAPI (Putative HLA DR Associated 
Protein I) 

838 

65.517 

9419 

AB011137 

Homo sapiens KIAA0565 protein 

219 

47.222 

9420 

D63881 

Homo sapiens KIAA0160 gene product is novel. 

3129 

98.323 

9421 

AB020716 

Homo sapiens KIAA0909 protein 

348 

100.000 

9422 

Z75330 

Homo sapiens nuclear protein SA-1 

651 

100.000 

9423 

AJ005890 

Homo sapiens JM1 

169 

100.000 

9424 

AC004697 

Arabidopsis thaliana unknown protein 

226 

29.143 

9425 

AF140691 

Mus musculus melusin 

733 

51.376 

9426 

AJ006591 

Homo sapiens cysteine-rich protein 

256 

97.500 

9427 

X90875 

Mus musculus FXR1 

183 

96.000 

9428 

Z27170 

Homo sapiens IG light chain variable region 
(VJ) 

518 

76.238 

9429 

AF061034 

Homo sapiens FIP2 

377 

65.060 

9430 

D64062 

Rattus norvegicus annexin V-binding protein 
(ABP-10) 

1326 

73 .448 

9431 

L22557 

Rattus norvegicus calmodul in-binding protein 

420 

80.723 

9432 

AF111168 

Homo sapiens unknown 

759 

100.000 

9433 

U16800 

Xenopus laevis ribonucleoprotein 

1367 

79.688 

9434 

AF105378 

Homo sapiens heparan sulfate D-glucosaminyl 3- 
O- sul f otransf erase- 4 

1257 

99.474 

9435 

AB014536 

Homo sapiens KIAA0636 protein 

1756 

61.358 

9436 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

226 

53 .448 

9437 

U85055 

Mus musculus rapl/rap2 interacting protein 

703 

93 .694 

9438 

AB007903 

Homo sapiens KIAA044 3 

416 

49.682 

9439 

Z66568 

Schizosaccharomyces pombe hypothetical trp-asp 
repeats containing protein 

522 

42 . 941 

9440 

AF157706 

Human herpesvirus 6B B4 

157 

36.923 

9441 

AC004908 

Homo sapiens zinc finger protein from gene of 
uncertain exon structure; similar to Q99676 
(PID:g3025333) 

855 

99.180 

9442 

AF153191 

Homo sapiens nm23-H7 

890 

79.558 

9443 

AF059569 

Homo sapiens actin binding protein MAYVEN 

519 

100.000 

9444 

AF065414 

Homo sapiens COBW-like placental protein 

188 

90.625 

9445 

AB019440 

Homo sapiens immunogloblin heavy chain 
variable region 

486 

79.310 

9446 

AB019439 

Homo sapiens immunogloblin heavy chain 
variable region 

577 

89.320 

9447 

V00488 

Homo sapiens alpha globin 

331 

81.356 

9448 

AF155105 

Homo sapiens putative zinc finger protein NY- 
REN- 34 antigen 

251 

35.577 

9449 

AF062146 

Homo sapiens immunoglobulin heavy chain 
variable region 

866 

93.525 

9450 

AJ131730 

Homo sapiens DREAM protein 

142 

70.370 

9451 

AJ224819 

Homo sapiens tumor suppressor 

572 

42 .132 

9452 

AF084521 

Homo sapiens brefeldin A- inhibited guanine 
nucleotide- exchange protein 2 

186 

100.000 

9453 

M93017 

Rattus norvegicus , gene product 

491 

92 . 500 
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9454 

M33328 

Homo sapiens calpastatin 

152 

95.833 

9455 

X00452 

Homo sapiens DC classll histocompatibility 
antigen alpha- chain 

255 

95.238 

9456 

AF117888 

Homo sapiens myosin- IXa 

5696 

99.649 

9457 

X60221 

Homo sapiens H+-ATP synthase subunit b 

284 

53.750 

9458 

X75785 

Rattus norvegicus SCP3 protein 

191 

58 . 824 

9459 

V00488 

Homo sapiens alpha globin 

374 

98.246 

9460 

AB020716 

Homo sapiens KIAA0 90 9 protein 

195 

79 .412 

9461 

AC005825 

Arabidopsis thaliana putative glucokinase 

265 

48 .193 

9462 

AE000854 

Methanobacterium thermoautotrophicum Na+/H+- 
exchanging protein :Na+/H+ antiporter 

191 

37.391 

9463 

A31039 

Nicotiana alata PRP3 

176 

38 .272 

9464 

AC002988 

Homo sapiens OLF4 

303 

78 .689 

9465 

AF151826 

Homo sapiens CGI -68 protein 

201 

68.889 

9466 

X52128 

Mus musculus domesticus pBS13 precursor 
polypeptide, testis-specif ic 

543 

40 .609 

9467 

AF070663 

Homo sapiens HSPC007 

297 

71.875 

9468 

U31449 

Homo sapiens tetraspan membrane protein 

252 

74 . 510 

9469 

D14340 

Mus musculus ZO-1 

174 

100. 000 

9470 

V00488 

Homo sapiens alpha globin 

118 

100. 000 

9471 

AC003007 

Homo sapiens Unknown gene product (partial) 

468 

89.024 

9472 

AC007842 

Homo sapiens BC331191 1 

290 

54 .545 

9473 

M74090 

Homo sapiens TB2 

356 

63 .043 

9474 

A06100 

synthetic construct synthetic antithrombin III 

459 

92 .105 

9475 

AL050062 

Homo sapiens hypothetical protein 

259 

94 .872 

9476 

L06498 

Homo sapiens ribosomal protein S2 0 

387 

84 .211 

9477 

X56932 

Homo sapiens 23 kD highly basic protein 

963 

100. 000 

9478 

AF151830 

Homo sapiens CGI -72 protein 

935 

85 . 556 

9479 

AL033503 

Candida albicans conserved hypothetical 
protein 

334 

40 .299 

9480 

AF054180 

Homo sapiens hematopoietic cell derived zinc 
finger protein 

234 

55.556 

9481 

X57821 

Homo sapiens immunoglobulin lambda light chain 

484 

79.787 

9482 

J02828 

Gallus gallus beta-tubulin 

1224 

82 .692 

9483 

AJ006710 

Rattus norvegicus phosphatidylinositol 3- 
kinase 

1147 

93.855 

9484 

AF041207 

Homo sapiens midline 1 cerebellar isoform 2 

285 

33 . 113 

9485 

X63797 

Gallus gallus decorin 

519 

55.245 

9486 

AL023799 

Homo sapiens dJ322P7.l (zinc finger) 

276 

89.130 

9487 

U09368 

Homo sapiens zinc finger protein ZNF14 0 

296 

62 .500 

9488 

X93207 

Homo sapiens NRD2 convertase 

285 

100. 000 

9489 

AF017369 

Mus musculus faciogenital dysplasia protein 3 

2523 

80 .328 

9490 

D83268 

Athalia rosae vitellogenin 

141 

43 .662 

9491 

U23514 

Caenorhabditis elegans similar to S. 
cerevisiae SSD1 protein (SP : SSD1_YEAST, 
P24276) and to E. coli VACB and Ribonuclease 
II genes 

231 

39.560 

9492 

AC005594 

Homo sapiens R26984 1 

786 

47 . 566 

9493 

AB023231 

Homo sapiens KIAA1014 protein 

1009 

51 .908 

94 94 

L4 23 24 

Homo sapiens G protein- linked receptor 

654 

96 . 970 

9495 

AC004262 

Homo sapiens R2 9368 2 

906 

83 .140 

9496 

U29488 

Caenorhabditis elegans No definition line 
found 

578 

44 .915 

9497 

AF116826 

Homo sapiens putative protein- tyrosine kinase 

818 

96 .923 

9498 

U95044 

Homo sapiens zinc finger protein 

402 

69.565 
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9499 

U05343 

Mus musculus zinc finger protein PZF 

2307 

93.151 

9500 

Z74031 

Unknown Similarity to Yeast D- lactate 
dehydrogenase ( SW : DLD1__YEAST) ; cDNA EST 
EMBL:C122 35 comes fro 

662 

44.017 

9501 

AJ011118 

Mus musculus skeletal muscle and cardiac 
protein 

988 

87.709 

9502 

X78927 

Homo sapiens zinc finger protein 

439 

98.529 

9503 

AF030131 

Mus musculus Plenty of SH3s; POSH 

382 

58.163 

9504 

Y18208 

Rattus norvegicus serine- threonine specific 
protein phosphatase, glycogen-binding (GL) 
subunit 

228 

80. 952 

9505 

AC002130 

Arabidopsis thaliana F1N21.13 

346 

48.148 

9506 

Z73428 

Caenorhabditis elegans similar to Zinc finger, 
C3HC4 type (RING finger); cDNA EST EMBL:D67323 
comes from this gene 

391 

34.197 

9507 

AF061258 

Homo sapiens LIM protein 

227 

100.000 

9508 

AB002584 

Rattus norvegicus beta-alanine-pyruvate 
aminotransferase 

448 

78. 049 

9509 

D42054 

Homo sapiens KIAA0092 gene product is 
distantly related to smooth muscle myosin. 

191 

46.970 

9510 

D87455 

Homo sapiens Similar to S.cerevisiae 
hypothetical protein 5 (S49634) 

1099 

82 . 775 

9511 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

263 

45.556 

9512 

X68670 

Mus musculus deoxynucleot idyl transferase 

442 

39. 906 

9513 

AF149414 

Arabidopsis thaliana contains similarity to 
Pfam family PF00145 (C-5 cytosine-specif ic DNA 
methylase) ,- score=10.4. E=0.051, N=l 

390 

36.946 

9514 

AF155739 

Mus musculus axotrophin 

302 

88.235 

9515 

U49385 

Mus musculus CTP synthetase homolog 

2213 

93.820 

9516 

X72467 

Homo sapiens Ig kappa light chain (VJC) 

717 

90.517 

9517 

AF109907 

Homo sapiens S164 

380 

26 . 140 

9518 

X17617 

Mus musculus zinc finger protein (AA 1-580) 

198 

69.767 

9519 

U28831 

Homo sapiens protein that is immuno- reactive 
with anti-PTH polyclonal antibodies 

712 

59.556 

9520 

AJ222636 

Homo sapiens hypothetical protein 

288 

41.618 

9521 

L11672 

Homo sapiens zinc finger protein 

270 

43.548 

9522 

U25691 

Mus musculus lymphocyte specific helicase 

427 

96.774 

9523 

AB011097 

Homo sapiens KIAA052 5 protein 

660 

57.297 

9524 

U35376 

Homo sapiens repressor transcriptional factor 

729 

39.655 

9525 

Z81030 

Unknown similar to O-sialoglycoprotein 
endopeptidase; cDNA EST EMBL:D73 065 comes from 
this gene; cD 

472 

42.593 

9526 

AL049608 

Arabidopsis thaliana extensin-like protein 

241 

27.372 

9527 

AF019085 

Homo sapiens BRDT 

231 

24.869 

9528 

AF039720 

Caenorhabditis elegans No definition line 
found 

974 

56.452 

9529 

D25215 

Homo sapiens KIAA0032 

277 

48.148 

9530 

U29096 

Caenorhabditis elegans coded for by C. elegans 
cDNA yk44f2.5; similar to P59 protein (HSP 
binding immunophilin) and to TPR domain 

272 

40.351 

9531 

D82080 

Gallus gallus leucine- zipper protein 

283 

40.816 

9532 

S45663 

Rattus sp. SC2=synaptic glycoprotein 

220 

42.667 

9533 

Z54327 

Caenorhabditis elegans similar to 
oxidoreductase 

449 

31.707 
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9534 

AL050159 

Homo sapiens hypothetical protein 

832 

60 .215 

9535 

U83176 

Mus mus cuius ROSA26AS 

622 

67 .808 

9536 

L31840 

Rattus norvegicus nuclear pore complex protein 
NUP107 

1227 

84 .071 

9537 

AF155595 

Homo sapiens CoREST protein 

225 

47.619 

9538 

L01042 

Homo sapiens TATA element modulatory factor 

1129 

100.000 

9539 

Z93386 

Unknown Similarity to Yeast hypothetical 52.9 
KD protein (SW:P43616); cDNA EST EMBL:M89432 
comes fr 

180 

38 .889 

9540 

AL022393 

Homo sapiens p373c6.2 

605 

52 .151 

9541 

Z48475 

Homo sapiens glucokinase regulator 

1390 

99.074 

9542 

Z30425 

Homo sapiens orphan nuclear hormone receptor 

573 

98 .734 

9543 

AB023191 

Homo sapiens KIAA0974 protein 

241 

34 .146 

9544 

AC002337 

Arabidopsis thaliana G-beta-repeat containing 
protein isolog 

156 

37.349 

9545 

AF109134 

Homo sapiens 7-60 

195 

80.000 

9546 

K02401 

Homo sapiens chorionic somatomammotropin 

789 

96 . 000 

9547 

X15187 

Homo sapiens precursor polypeptide (AA -21 to 
782) 

1315 

99 .057 

9548 

AF129756 

Homo sapiens NG32 

839 

100.000 

9549 

AF085279 

Arabidopsis thaliana hypothetical protein 

538 

49.032 

9550 

U42208 

Oryza sativa OSBZ8 

82 

64 .706 

9551 

D78132 

Homo sapiens ras- related GTP-binding protein 

435 

57.798 

9552 

AC004886 

Homo sapiens C- terminus matches KIAA0559, N- 
terminus similar to Bassoon protein; match to 
PID:g3043642; similar to PID:g3413810 

422 

100. 000 

9553 

A52806 

unidentified unnamed protein product 

677 

83 .186 

9554 

D50310 

Homo sapiens cyclin I 

245 

55.556 

9555 

M63180 

Homo sapiens threonyl - tRNA synthetase 

729 

44 .492 

9556 

AC004382 

Homo sapiens Unknown gene product 

968 

100. 000 

9557 

AF099973 

Mus musculus schlafen2 

680 

50.000 

9558 

D16200 

Sus scrofa sp32 precursor 

2428 

82 .118 

9559 

X74504 

Mus musculus T10 

1514 

73 .089 

9560 

AJ007970 

Mus musculus interferon-g induced GTPase 

168 

87.500 

9561 

Z18946 

Mycobacteriophage 15 predicted 21.3kd protein 

120 

31.624 

9562 

AB002819 

Perilla frutescens actin 

178 

96.667 

9563 

AB011089 

Homo sapiens KIAA0 517 protein 

986 

94.118 

9564 

Z66521 

Caenorhabditis elegans similar to 
mitochondrial RNA splicing MSR4 like protein; 
cDNA EST EMBL:C09217 comes from this gene 

698 

54 .688 

9565 

AF005497 

Bos taurus butyrophilin 

655 

43 .421 

9566 

D14168 

Bombyx mori 50kDa lectin 

221 

29.268 

9567 

AC004883 

Homo sapiens similar to KIAA0766; similar to 
PID:g3882253 

1418 

96.413 

9568 

AB018340 

Homo sapiens KIAA07 97 protein 

855 

100. 000 

9569 

X51760 

Homo sapiens zinc finger protein (583 AA) 

669 

66.258 

9570 

AJ132889 

Mus musculus kinesin like protein 9 

537 

86 .667 

9571 

L38933 

Homo sapiens the longest open reading frame 
predicts a protein of 202 amino acids, with 
fair Kozak consensus at the initial ATG codon; 
an in- frame TGA codon is seen at nucleotide 8; 
ORF; putative 

243 

92 .683 

9572 

U19617 

Mus musculus Elf-1 

1020 

91 .525 

9573 

AF022962 

Mus musculus Sec8 

748 

98 .333 

9574 

X96586 

Homo sapiens FAN protein 

1216 

100.000 


.i. o o e «+ 6. "+ 3; ,„ o e 2 6 o s 


9575 

Z74043 

Caenorhabditis elegans predicted using 
Gene finder; cDNA EST EMBL:C13 850 comes from 
this gene; cDNA EST EMBL:C115 75 comes from 
this gene; cDNA EST yk343f4.5 comes from this 
gene 

369 

65.476 

9576 

AF067855 

Homo sapiens geminin 

567 

98 . 901 

9577 

Z83107 

Unknown cDNA EST EMBL : D69907 comes from this 
gene; cDNA EST EMBL:C13424 comes from this 
gene ; cDNA 

202 

26 . 994 

9578 

AB023187 

Homo sapiens KIAA0970 protein 

436 

60.185 

9579 

X68249 

Xenopus laevis Proline rich protein 

983 

66.376 

9580 

AF166261 

Xenopus laevis nuclear protein Sojo 

303 

61.644 

9581 

AF056034 

Rattus norvegicus F- act in binding protein b- 
Nexilin 

625 

93 .269 

9582 

D13626 

Homo sapiens KIAA0001 

735 

47 .393 

9583 

AB020684 

Homo sapiens KIAA0877 protein 

427 

69.880 

9584 

K02113 

Unknown Gallus gallus vitellogenin 

186 

22 .280 

9585 

L36315 

Mus musculus zinc finger protein 

925 

97 . 727 

9586 

AJ001810 

Homo sapiens mRNA cleavage factor I 2 5 kDa 
sub unit 

265 

100.000 

9587 

Z78542 

Unknown cDNA EST EMBL:D72182 comes from this 
gene; cDNA EST EMBL:D72353 comes from this 
gene ; cDNA 

158 

28 .409 

9588 

AF067430 

Mus musculus Smarcel -related protein 

895 

52 .140 

9589 

AL031266 

Caenorhabditis elegans VM106R.1 

242 

46 . 237 

9590 

U23486 

Caenorhabditis elegans similar to S. 
cerevisiae zinc finger protein GCS1 
(SP:GCS1 YEAST) 

323 

64 .407 

9591 

U00483 

Macaca mulatta mucin 

164 

24 . 000 

9592 

X77953 

Rattus norvegicus ribosomal protein S15a 

257 

78 .431 

9593 

U80753 

Homo sapiens CAGL79 

1936 

98 . 962 

9594 

D50919 

Homo sapiens The KIAA012 9 gene product is 
novel . 

218 

97 . 222 

9595 

M68864 

Homo sapiens ORF 

197 

50.820 

9596 

AB023432 

Rattus norvegicus aspartoacylase 

261 

50.769 

9597 

AC004997 

Unknown match to ESTs AA667999 (NID : g2626700 ) , 
AA165465 (NID : gl74 14 81 ) , Z45871 (NID : g575105 ) , 
and 

1762 

71.131 

9598 

U29659 

Human endogenous retrovirus pol gene product 

626 

100.000 

9599 

AF134726 

Homo sapiens NG22 

741 

41.053 

9600 

X66405 

Mus musculus collagen alphal type VI- 
precursor 

2251 

92 .837 

9601 

AF153366 

Mus musculus tubulo- interstitial nephritis 
antigen 

1246 

50 . 578 

9602 

Z99118 

Bacillus subtilis similar to hypothetical 
proteins 

345 

50 . 526 

9603 

AC004780 

Homo sapiens F17127 1 

186 

82 . 759 

9604 

AC004849 

Homo sapiens similar to KIAA0662; similar to 
AB014562 (PID:g3327138) 

1459 

94 .286 

9605 

X72012 

Homo sapiens endoglin 

479 

82 . 609 

9606 

X05562 

Homo sapiens alpha- 2 chain precursor (AA -25 
to 1018) (3416 is 2nd base in codon) 

2224 

100.000 

9607 

AL050272 

Homo sapiens hypothetical protein 

667 

99 . 000 

9608 

X84692 

Mus musculus spermatid perinuclear RNA binding 
protein 

1617 

98 . 387 
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9609 

A31038 

Nicotiana alata PRP3 

161 

40 .323 

9610 

Z74042 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to Haemophilus 3- 
oxoacyl- (acyl- carrier protein) reductase 
(SW:FABG__HAEIN) ; cDNA EST yk470b2.3 comes from 
this gene; cDNA EST yk470b2.5 comes from this 
gene 

266 

43 .000 

9611 

AJ010317 

Fugu rubripes Sand 

481 

46 .860 

9612 

Z47808 

Unknown similar to beta- transducin; cDNA EST 
yk489h7.5 comes from this gene; cDNA EST 
yk399d9.5 com 

1099 

40.643 

9613 

V00639 

Bacteriophage 4 34 reading frame (rexl protein) 

1128 

100 . 000 

9614 

AB016816 

Homo sapiens MASLl 

336 

36.032 

9615 

AF071317 

Mus musculus COP9 complex subunit 7b 

537 

98 . 864 

9616 

AF182426 

Rattus norvegicus arylacetamide deacetylase 

755 

42 . 748 

9617 

AB014557 

Homo sapiens KIAA0657 protein 

2375 

100 . 000 

9618 

S69232 

Homo sapiens electron transfer f lavoprotein- 
ubiquinone oxidoreductase , ETF-QO {EC 1.5.5.1} 

219 

97 .297 

9619 

AF123880 

multiple sclerosis associated retrovirus 
element gag polyprotein 

170 

37.838 

9620 

AB020625 

Homo sapiens butyrophilin like receptor 

225 

27 .612 

9621 

U09367 

Homo sapiens zinc finger protein ZNF136 

1402 

52 .308 

9622 

AB014888 

Homo sapiens MR J" 

441 

87 . 838 

9623 

AF132726 

Mus musculus FLASH 

665 

88 . 793 

9624 

X90849 

Gallus gallus polybromo 1 protein 

196 

30.392 

9625 

S83545 

Homo sapiens Na+/H+ exchanger isoform NHE-3 

260 

97.500 

9626 

AF151904 

Homo sapiens CGI -14 6 protein 

208 

96 . 552 

9627 

AF181645 

Drosophila melanogaster BcDNA . GH12144 

292 

45.263 

9628 

AJ243895 

Mus musculus basic-helix-loop-helix protein 
(bHLH) 

417 

66 . 364 

9629 

Z80223 

Caenorhabditis elegans predicted using 
Genefinder 

258 

27.152 

9630 

U72661 

Homo sapiens ninjurinl 

356 

56 . 190 

9631 

Y18265 

Homo sapiens zinc finger protein SALL1 

149 

26 . 901 

9632 

AF111943 

Dictyostelium discoideum development protein 
DG1124 

196 

45 .614 

9633 

X56958 

Homo sapiens ankyrin (brank-2) 

580 

98 .936 

9634 

AL050231 

Drosophila melanogaster BACR37P7.g 

272 

46 .988 

9635 

AF080171 

Homo sapiens zinc finger protein ZNF2 32 

595 

100.000 

9636 

AF005392 

Homo sapiens alpha tubulin 

519 

90. 588 

9637 

AF011566 

Homo sapiens immunoglobulin- like transcript 4 

136 

35.000 

9638 

Z46935 

Caenorhabditis elegans weak similarity with 
quinone oxidoreductase; cDNA EST EMBL:C13104 
comes from this gene; cDNA EST yk264f3.5 comes 
from this gene 

247 

40.385 

9639 

U09367 

Homo sapiens zinc finger protein ZNF136 

207 

33.835 j 

9640 

AC002339 

Arabidopsis thaliana hypothetical protein 

185 

29.851 

9641 

Z22866 

Mus musculus skelemin 

573 

74 . 603 

9642 

Z50028 

Caenorhabditis elegans cDNA EST yk321h8.5 
comes from this gene; cDNA EST EMBL:D68896 
comes from this gene; cDNA EST yk395f9.5 comes 
from this gene; cDNA EST yk360fl2.5 comes from 
this gene 

616 

33 . 793 

9643 

AF067211 

Caenorhabditis elegans contains similarity to 
26S protease subunit 

194 

27 . 219 
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9644 

L12018 

Caenorhabditis elegans putative 

375 

47.656 

9645 

AF054180 

Homo sapiens hematopoietic cell derived zinc 
finger protein 

573 

70.492 

9646 

U19614 

Rattus norvegicus lamina-associated 
polypeptide 1C 

874 

66.667 

9647 

U79260 

Homo sapiens unknown 

229 

56.757 

9648 

U30883 

Homo sapiens SRp55-l 

1635 

99.213 

9649 

U97667 

Rattus norvegicus RSP29 

248 

42.045 

9650 

U64598 

Caenorhabditis elegans weakly similar to S . 
cervisiae PTM1 precursor (SP:P32857) 

285 

26.531 

9651 

X74764 

Homo sapiens protein- tyrosine kinase 

3799 

94.842 

9652 

AL032643 

Caenorhabditis elegans similar to 
Uncharacterized protein family UPF0034, 
Double -stranded RNA binding motif; cDNA EST 
yk489b3.5 comes from this gene; cDNA EST 
yk439g7.5 comes from this gene 

283 

50.000 

9653 

AB018353 

Homo sapiens KIAA0810 protein 

668 

100.000 

9654 

AF056618 

Homo sapiens BWSCR2 associated zinc- finger 
protein BAZ2 

273 

53.571 

9655 

AB014593 

Homo sapiens KIAA0693 protein 

298 

60.274 

9656 

AF073883 

Danio rerio myotubularin related protein 2 

134 

33 . 871 

9657 

AF090133 

Rattus norvegicus lin-7-A 

474 

98.684 

9658 

AF119668 

Rattus norvegicus lipolysis-stimulated remnant 
receptor alpha' subunit 

1383 

85.897 

9659 

U37263 

Homo sapiens KRAB zinc finger protein; 
Method: conceptual translation supplied by 
author 

461 

49.664 

9660 

U00017 

Mycobacterium leprae pimT; B2126 CI 16 5 

397 

34.862 

9661 

U65092 

Homo sapiens melanocyte- specif ic gene 1 
nuclear protein 

301 

91.667 | 

9662 

AF041378 

Homo sapiens cell death activator CIDE-A 

302 

54 . 023 

9663 

AB017508 

Bacillus halodurans truA homologue (identity 
of 62% to B. subtilis ) 

206 

38.636 

9664 

L29162 

Homo sapiens immunoglobulin light chain 
variable region 

733 

93.162 

9665 

AL096844 

Streptomyces coelicolor A3 (2) probable 3- 
oxacyl- (acyl-carrier-protein) reductase 

165 

36.111 

9666 

AJ010469 

Arabidopsis thaliana RNA helicase 

806 

42. 901 

9667 

Z14020 

Nicotiana tabacum Pistil extensin like 
protein, partial CDS only 

223 

44 .444 

9668 

AF044127 

Homo sapiens peroxisomal short- chain alcohol 
dehydrogenase 

195 

100.000 

9669 

AL009193 

Unknown /predict ion= (method : " "genef inder " " , 
version: ""084" ") ; 

/prediction= (method : " "genscan" " , ve 

178 

39.286 

9670 

AC007228 

Homo sapiens BC3 72 95 1 

407 

96.970 

9671 

Z81109 

Unknown predicted using Genef inder; cDNA EST 
EMBL:D71433 comes from this gene; cDNA EST 
EMBL:D74134 

188 

33.981 

9672 

D16226 

Oryctolagus cuniculus one of the members of 
sodium-glucose cotransporter family 

268 

83 . 333 

9673 

D10631 

Mus musculus zinc finger protein 

230 

44 . 776 

9674 

AL049943 

Homo sapiens hypothetical protein 

180 

70.270 

9675 

AC009465 

Arabidopsis thaliana unknown protein 

192 

50.000 

9676 

AF140675 

Homo sapiens zinc metal loprot ease ADAMTS7 

371 

41.436 
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9677 

AL023781 

Schi zosaccharomyces pombe N- terminal 
acetyltransf erase 1 

237 

49.351 

9678 

AF008554 

Rattus norvegicus implantation-associated 
protein 

478 

82.955 

9679 

Z48334 

Unknown similar to ribosomal protein L10 (QM 
protein) ; cDNA EST EMBL:T00732 comes from this 
gene ; c 

719 

80.469 

9680 

X62639 

Drosophila melanogaster hrp48.1 

675 

52 . 174 

9681 

AF053356 

Homo sapiens nucleoporin 

1484 

96 . 087 

9682 

L00923 

Mus musculus myosin I 

1299 

97.073 

9683 

M57547 

Rattus norvegicus ER alpha-mannosidase 

849 

85 . 507 

9684 

D83776 

Homo sapiens The KIAA0191 gene is expressed 
ubiquitously. ; The KIAA0191 protein retains 
the C2H2 zinc- finger at its N- terminal region. 

443 

41 . 143 

9685 

Z68006 

Caenorhabditis elegans K09C8.4 

231 

21.429 

9686 

U58728 

Caenorhabditis elegans C54H2 . 1 gene product 

239 

31.884 

9687 

AF031903 

Mus musculus ADP-ribosylation-like factor 
homo log ARL6 

373 

95.238 

9688 

AL117204 

Caenorhabditis elegans Y116A8C.9 

249 

43 .617 

9689 

AJ005821 

Homo sapiens X-like 1 protein 

410 

79.452 

9690 

Z66524 

Caenorhabditis elegans T13H5.6 

150 

30.928 

9691 

Z36715 

Homo sapiens Net 

420 

92.537 

9692 

Y09501 

Homo sapiens NADH- cytochrome- b5 reductase 

347 

79.730 

9693 

AL096768 

Homo sapiens dJ858B16.2 (novel protein similar 
to hamster PSSC (Phosphatidylserine 
Decarboxylase Proenzyme, EC 4.1.1.65) 

535 

96.386 

9694 

AF160893 

Drosophila melanogaster BcDNA . GM10765 

272 

45.745 

9695 

AL110477 

Caenorhabditis elegans Y113G7B.24 

184 

34 .211 

9696 

U94363 

Homo sapiens glycogenin-2 beta 

297 

97.959 

9697 

U79745 

Homo sapiens monocarboxylate transporter 
homologue MCT6 

335 

47.872 

9698 

AC003682 

Homo sapiens F18 54 71 

199 

65.217 

9699 

AB006628 

Homo sapiens KIAA0290 

1144 

57.746 

9700 

AB011110 

Homo sapiens KIAA053 8 protein 

801 

83 .448 

9701 

AB018288 

Homo sapiens KIAA074 5 protein 

1007 

65.086 

9702 

AL031856 

Schizosaccharomyces pombe putative DNA repair 
and recombination protein 

241 

38.542 

9703 

Y00978 

Homo sapiens PDC-E2 precursor (AA -54 to 561) 

129 

100 . 000 

9704 

AF017433 

Homo sapiens putative transcription factor 
CR53 

522 

69.159 

9705 

M64488 

Rattus norvegicus synaptotagmin II 

967 

85.882 

9706 

AF053368 

Mus musculus lysyl oxidase- related protein 2 

1035 

90.244 

9707 

X85750 

Homo sapiens expression associated with 
monocyte to macrophage differentiation 

450 

68. 966 

9708 

AB014566 

Homo sapiens KIAA0666 protein 

472 

63.025 

9709 

AF126736 

Homo sapiens ubiquitin processing protease 

353 

38 .211 

9710 

U29244 

Caenorhabditis elegans No definition line 
found 

379 

39. 860 

9711 

X67469 

Mus musculus AM2 receptor 

409 

49.020 

9712 

AB018293 

Homo sapiens KIAA0750 protein 

1014 

56 . 877 

9713 

AF006829 

Coturnix coturnix slow myosin heavy chain 2 

702 

69.231 

9714 

U23484 

Caenorhabditis elegans No definition line 
found 

582 

52.201 

9715 

Y00204 

Xenopus laevis nucleoplasmin 

468 

53.147 

9716 

AF059569 

Homo sapiens act in binding protein MAYVEN 

329 

67.089 
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9717 

AJ002204 

Zea mays polyamine oxidase 

228 

45.946 

9718 

U28831 

Homo sapiens protein that is immuno- reactive 
with anti-PTH polyclonal antibodies 

613 

63 .855 

9719 

AJ242723 

Drosophila melanogaster hypothetical protein 

209 

49.254 

9720 

AF132480 

Mus musculus Ese2 protein 

990 

94.839 

9721 

D63877 

Homo sapiens KIAA0157 gene product is novel. 

708 

37.500 

9722 

U18771 

Rattus norvegicus Rab2 6 

215 

57.143 

9723 

X58483 

Pseudomonas putida urocanate hydratase 

280 

34.513 

9724 

AF067165 

Homo sapiens zinc finger protein 3 

547 

98.765 

9725 

AL009195 

Drosophila melanogaster EG:30B8.1 

757 

79.412 

9726 

A26182 

Mus musculus beta3 -adrenergic receptor 

249 

30.488 

9727 

AB018315 

Homo sapiens KIAA0772 protein 

222 

94 . 444 

9728 

AL050306 

Homo sapiens dJ475B7.1 (novel KIAA0615 and 
KIAA0323 LIKE protein) 

272 

75. 000 

9729 

AC007055 

Homo sapiens unknown 

570 

35.593 

9730 

D86969 

Homo sapiens similar to Human zinc- finger 
protein, BR140 (PI : JC2069) 

321 

49. 020 

9731 

U97006 

Caenorhabditis elegans No definition line 
found 

172 

67 . 568 

9732 

AB025258 

Mus musculus granuphilin-a 

1425 

87 . 967 

9733 

AB029016 

Homo sapiens KIAA1093 protein 

295 

40. 559 

9734 

U96174 

Onchocerca volvulus OvB8 

509 

39.908 i 

9735 

AF151824 

Homo sapiens CGI- 66 protein 

379 

74 . 157 

9736 

X80754 

Homo sapiens GTP-binding protein 

968 

100 .000 

9737 

AL009066 

Caenorhabditis elegans Similarity with Human 
adenylate kinase KAD protein (PIR Acc. No. 
A33508) ; cDNA EST yk265c3.5 comes from this 
gene; cDNA EST yk265c3.3 comes from this gene 

172 

36.709 

9738 

AB011167 

Homo sapiens KIAA0595 protein 

270 

100 .000 

9739 

AF090326 

Mus musculus AE-1 binding protein AEBP2 

1535 

97 . 788 

9740 

AB014558 

Homo sapiens KIAA06 58 protein 

830 

100 .000 

9741 

Z84476 

Homo sapiens dJ25J6.2 (zinc finger protein) 

280 

28.877 

9742 

AB020629 

Homo sapiens KIAA0822 protein 

355 

58. 511 

9743 

Z75550 

Unknown limited similarity with some myosins; 
cDNA EST EMBL:C084 02 comes from this gene; 
CDNA EST E 

208 

29. 333 

9744 

AC003682 

Homo sapiens F1854 7 1 

212 

74 .419 

9745 

D10923 

Homo sapiens HM74 

1065 

94 .410 

9746 

U40419 

Caenorhabditis elegans No definition line 
found 

962 

47.416 

9747 

U42436 

Caenorhabditis elegans C49H3.6 gene product 

290 

35.616 

9748 

X78898 

Saccharomyces cerevisiae N1342 

892 

48.252 

9749 

AF069291 

Homo sapiens hT41 

505 

65.179 

9750 

L15309 

Homo sapiens zinc finger protein 

242 

87 . 805 

9751 

U53421 

Sus scrofa betaine- homocysteine 
methyl transferase 

297 

82 . 000 

9752 

AF173867 

Homo sapiens DNA binding protein p79PIF 

763 

92 .481 

9753 

AF070530 

Homo s ap i ens unknown 

160 

46.053 

9754 

Z12017 

Caenorhabditis elegans predicted using 
Gene finder; basic -rich; cDNA EST yk328f6.3 
comes from this gene; cDNA EST yk328f6.5 comes 
from this gene; cDNA EST yk393c3.5 comes from 
this gene; cDNA EST yk641dll.3 comes from this 
gene 

377 

33 .453 

9755 

D87438 

Homo sapiens Similar to a C. elegans protein in 

3322 

100 .000 
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9756 

AL034364 

Caenorhabditis elegans cDNA EST yk255b9.3 
comes from this gene; cDNA EST yk255b9.5 comes 
from this gene; cDNA EST EMBL:M75923 comes 
from this gene 

719 

30.582 

9757 

AF170301 

Mus musculus nuclear body associated kinase la 

702 

94 .595 

9758 

AF026565 

Mus musculus ring finger protein 

170 j 

47.368 

9759 

Y08134 

Homo sapiens acid sphingomyelinase -like 
phosphodiesterase 

1362 

100. 000 

9760 

AF056116 

Fugu rubripes unknown 

252 

79.545 

9761 

M96629 

Canis familiaris homologue to sec61 

691 

96 .262 

9762 

AF125385 

Drosophila melanogaster L82B 

188 

42 .667 

9763 

AL022603 

Arabidopsis thaliana hypothetical protein 

214 

28 .426 

9764 

U19729 

Saccharomyces cerevisiae Ylr4 09cp 

183 

36 .082 

9765 

AB003930 

Homo sapiens raplGAPII 

370 

45.985 

9766 

Z95397 

Schizosaccharomyces pombe Thslp 

308 

26 .780 

9767 

AJ007012 

Mus musculus Fish protein 

740 

78 .295 

9768 

U23037 

Oryctolagus cuniculus eIF-2Bepsilon 

631 

88.679 

9769 

J05499 

Rattus norvegicus L-glutamine amidohydrolase 

1770 

95.307 

9770 

L14684 

Rattus norvegicus elongation factor G 

709 

85 .606 

9771 

U00051 

Caenorhabditis elegans coded for by C. elegans 
cDNA yk50b2.5; coded for by C. elegans cDNA 
CEESV26F; similar to lipases over a short 
region 

192 

35.922 

9772 

D89340 

Rattus norvegicus dipeptidyl peptidase III 

806 

99.187 

9773 

AB023189 

Homo sapiens KIAA0972 protein 

332 

58 .333 

9774 

AF022789 

Homo sapiens ubiquitin hydrolyzing enzyme I 

839 

91 .729 

9775 

AF139185 

Rattus norvegicus myomegalin 

269 

65 . 152 

9776 

AC003682 

Homo sapiens F18547 1 

206 

45.833 

9777 

D10884 

Bos taurus neurocalcin 

595 

100.000 

9778 

D63876 

Homo sapiens KIAA0154 gene product is related 
to mouse gamma adaptin. 

657 

76 .923 

9779 

M15888 

Bos taurus endozepine-related protein 
precursor 

489 

39.698 

9780 

AJ011033 

Mus musculus KCC2 protein 

900 

97.761 

9781 

AF160973 

Homo sapiens p53 inducible protein 

1474 

100.000 

9782 

Z81547 

Caenorhabditis elegans predicted using 
Genef inder 

194 

36.036 

9783 

AJ011855 

Homo sapiens PAK4 protein 

265 

63 .333 

9784 

X87143 

Helianthus annuus cytochrome b5 containing 
fusion protein 

173 

42 . 105 

9785 

AC005328 

Homo sapiens R26660 1, partial CDS 

623 

98 . 925 

9786 

U85494 

Zea mays LON1 protease 

325 

27.019 

9787 

M96625 

Gallus gallus cardiac muscle tensin 

687 

60 . 326 

9788 

AF020351 

Homo sapiens NADH : ubiquinone oxidoreductase 18 
kDa IP subunit 

177 

90.909 

9789 

AB002354 

Homo sapiens KIAA03 56 

445 

87.500 

9790 

Z73423 

Caenorhabditis elegans cDNA EST EMBL:Z14 908 
comes from this gene; cDNA EST EMBL:M894 03 
comes from this gene; cDNA EST EMBL:T00022 
comes from this gene 

435 

46 .512 

9791 

AL032657 

Unknown predicted using Genef inder; similar to 
DnaJ domain ; Thioredoxin; cDNA EST yk433f3.5 
come 

960 

37 . 945 

9792 

AB003 503 | Mus musculus Guanine Nucleotide Regulatory 

622 

71.053 
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9793 

U51000 

Mus musculus DLX-1 

1344 

99.515 

9794 

U09367 

Homo sapiens zinc finger protein ZNF13 6 

429 

69.149 

9795 

AB004664 

Mus musculus Rab33B 

859 

94 .815 

9796 

Z98601 

Schizosaccharomyces pombe zinc finger protein 

236 

27 . 778 

9797 

AB017016 

Homo sapiens p2 5 alpha 

213 

91.667 

9798 

X82018 

Homo sapiens ZID, zinc finger protein with 
interaction domain 

768 

96.032 

9799 

Z97055 

Homo sapiens dJ388M5.3 (Sulfotransf erase 
(sulf okinase, EC 2.8.2.1) like protein) 

1395 

100.000 

9800 

AF030131 

Mus musculus Plenty of SH3s; POSH 

1571 

94 . 017 

9801 

U35113 

Homo sapiens metastasis-associated gene 

785 

73 .446 

9802 

U25842 

Saccharomyces cerevisiae Similar to several 
members of the Cdc48/Pasl/Secl8 family of 
proteins (Swiss Prot . accession numbers 
P25694, P24004, P18759) 

165 

31 . 507 

9803 

AF033566 

Mus musculus cdc2/CDC28 - like protein kinase 4 

686 

93 .684 

9804 

AC007193 

Homo sapiens Putative homolog of hypoxia 
inducible factor three alpha 

1917 

99.653 

9805 

U60644 

Homo sapiens HU-K4 

285 

39.695 

9806 

S73488 

Rattus sp. zinc finger transcription factor, 
Kid-1 {KRAB A and B regions} 

267 

62.500 

9807 

Z81135 

Unknown cDNA EST yk345gl.5 comes from this 
gene; cDNA EST yk496a3.3 comes from this gene; 
cDNA EST 

156 

38.272 

9808 

U63818 

Xenopus laevis RING finger protein 

493 

31.641 

9809 

AB002377 

Homo sapiens KIAA03 79 

506 

81 .633 

9810 

S71659 

Mus sp . LIM-type homeodomain=Gsh- 4 { C- 
terminal } 

398 

93 .651 

9811 

L15313 

Caenorhabditis elegans putative 

1088 

58.885 

9812 

AB032904 

Hylobates syndactylus dopamine receptor D4 

216 

31.206 

9813 

Z82215 

Homo sapiens dJ6802.2 

2663 

100.000 

9814 

AF180919 

Homo sapiens RNA lariat debranching enzyme 

2176 

100.000 

9815 

AF155105 

Homo sapiens putative zinc finger protein NY- 
REN- 34 antigen 

348 

38.235 

9816 

X79510 

Homo sapiens protein- tyrosine-phosphatase 

485 

100.000 

9817 

AF077000 

Rattus norvegicus protein tyrosine phosphatase 
TD14 

879 

99.265 

9818 

AF050182 

Mus musculus PERIOD 3 

518 

63.830 

9819 

X79828 

Mus musculus NK10 

368 

65.000 

9820 

AF002714 

Homo sapiens centromere protein B; CENP-B 

182 

31 . 532 

9821 

AB011146 

Homo sapiens KIAA0574 protein 

278 

100.000 

9822 

AF161181 

Mus musculus P55T protein 

252 

97.619 

9823 

Z83760 

Ciona intestinalis COS41.4 

324 

66.667 

9824 

X96973 

Mus musculus Lasp-1 

800 

84 .127 

9825 

AC002457 

Homo sapiens Rap2 interacting protein; similar 
to U73941 (PID:gl916018) 

819 

98.387 

9826 

AF008915 

Homo sapiens EVI-5 homolog 

839 

77.711 

9827 

Y18881 

Mus musculus midline 2 protein 

311 

29 . 064 

9628 

Z38062 

Saccharomyces cerevisiae orf, len: 432, CAI : 
0.15 similar to dnaJ proteins 

218 

47 . 059 

9829 

AF053970 

Homo sapiens outer dense fiber protein 2/2 

260 

100. 000 

9830 

Z83123 

Caenorhabditis elegans T04A11.2 

405 

49.275 

9831 

AF030558 

Rattus norvegicus phosphatidyl inositol 5- 
phosphate 4 -kinase gamma 

1106 

95.322 
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9832 

AF084530 

Homo sapiens cyclin-D binding Myb-like protein 

883 

100 .000 

9833 

X99145 

Canis familiar is overexpressed in thyroid 
tissue after TSH stimulation 

736 

53.646 

9834 

AB029025 

Homo sapiens KIAA1102 protein 

464 

46 . 023 

9835 

M20823 

Mus musculus mdm-1 

1013 

75 . 000 

9836 

U41534 

Caenorhabditis elegans similar to yeast MAK16 
protein (SP:MK16 YEAST, PI 096 2 ) 

644 

59.864 

9837 

AB007869 

Homo sapiens KIAA0409 

300 

68.571 

9838 

D14076 

Rattus norvegicus testicular dynamin 

701 

99. 091 

9839 

AF109907 

Homo sapiens S164 

1178 

100.000 

9840 

AB014536 

Homo sapiens KIAA0636 protein 

400 

50. 794 

9841 

AL031667 

Homo sapiens dJ620Ell.le (novel Helicase C- 
terminal domain and SNF2 N- terminal domains 
containing protein, similar to KIAA0308) 

913 

100.000 

9842 

U33821 

Homo sapiens taxi -binding protein TXBP151 

869 

33 . 190 

9843 

AL080080 

Homo sapiens hypothetical protein 

1065 

100.000 

9844 

AF168362 

Rattus norvegicus protein associating with 
small stress protein PASS1 

1052 

79.888 

9845 

U58280 

Mus musculus second largest subunit of RNA 
polymerase I 

832 

85.211 

9846 

AF119569 

Homo sapiens patched 2 

176 

89 . 286 

9847 

M35297 

Rattus norvegicus G-protein coupled receptor 

477 

89.333 

9848 

X83226 

Saccharomyces cerevisiae global 
transcriptional regulator 

734 

49 . 057 

9849 

AB007872 

Homo sapiens KIAA0412 

175 

90 . 000 

9850 

D29640 

Homo sapiens KIAA0051 

548 

75 . 000 

9851 

AL080125 

Homo sapiens hypothetical protein 

473 

64 .103 

9852 

AF041207 

Homo sapiens midline 1 cerebellar isoform 2 

190 

28 . 276 

9853 

AB029334 

Halocynthia roretzi HrPET-1 

350 

42 . 857 

9854 

AB011370 

Mus musculus Ankhzn 

1122 

91.237 

9855 

AL117407 

Homo sapiens hypothetical protein 

442 

47 .468 

9856 

AF125053 

Homo sapiens pyruvate dehydrogenase El alpha 
subunit 

169 

100.000 

9857 

U97191 

Caenorhabditis elegans No definition line 
found 

276 

46.316 

9858 

X71997 

Rattus norvegicus myosin I 

1848 

63 .699 

9859 

AJ012409 

Homo sapiens hypothetical protein 

211 

93 . 750 

9860 

AB000776 

Rattus norvegicus semaphorin Z 

424 

87 . 671 

9861 

Z27080 

Unknown cDNA EST EMBL:D33198 comes from this 
gene; cDNA EST EMBL:D3286 7 comes from this 
gene ; cDNA 

345 

55.422 

9862 

AF027514 

Homo sapiens zinc finger protein 

791 

99.174 

9863 

Z78019 

Unknown Similarity to Yeast LPG22P protein 
(TR:G1151240) ; cDNA EST EMBL:T00686 comes from 
this gene 

1006 

75.258 

9864 

AF051155 

Rattus norvegicus G beta- like protein GBL 

1358 

80.934 

9865 

L28174 

Acanthamoeba castellanii disulf ide-like 
protein 

349 

45 . 794 

9866 

AL096857 

Homo sapiens hypothetical protein 

1028 

98 . 101 

9867 

Z98531 

Schizosaccharomyces pombe hypothetical protein 

184 

38 . 158 

9868 

Z92669 

Mycobacterium tuberculosis hypothetical 
protein Rv0235c 

166 

31.858 

9869 

U96781 

Homo sapiens Ca2+ ATPase of fast-twitch 
skeletal muscle sacroplasmic reticulum, 
neonatal isoform 

1370 

99.539 
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9870 

L13687 

Homo sapiens ADP-ribosylation factor- like 
protein 2 

390 

37.838 

9871 

U26358 

Rattus norvegicus S100A1 gene product 

167 

36.585 

9872 

AB026190 

Homo sapiens Kelch motif containing protein 

1392 

43 .254 

9873 

AB000199 

Rattus norvegicus CCA2 protein 

306 

82 .143 

9874 

AC003682 

Homo sapiens F18547 1 

223 

74 .419 

9875 

Z69902 

Unknown predicted using Gene finder; cDNA EST 
EMBL:D2 757 9 comes from this gene; cDNA EST 
EMBL:D64778 

595 

32 .419 

9876 

AB011527 

Rattus norvegicus MEGF1 

353 

69.620 

9877 

U69262 

Mus musculus matrilin-2 precursor 

276 

73 . 016 

9878 

AF168418 

Homo sapiens activating signal cointegrator 1 

221 

97 . 059 

9879 

U40342 

Mus musculus ninein 

2234 

89.390 

9880 

AF016430 

Caenorhabditis elegans contains similarity to 
a BR-C/TTK domain 

184 

37.647 

9881 

AL031685 

Homo sapiens dJ963K23.2 (novel protein) 

276 

41.304 

9882 

AF073879 

Mus musculus myotubularin homologous protein 1 

382 

87.302 

9883 

L14745 

Caenorhabditis elegans homology with glucose 
induced repressor, GRR1; putative 

328 

29.944 

9884 

AF036705 

Unknown Similar to phytoene desaturase; coded 
for by C. elegans cDNA CEESX74F; coded for by 
C. el 

664 

50 . 971 

9885 

AC004890 

Homo sapiens similar to zinc finger proteins ; 
similar to BAA24380 

514 

84 .375 

9886 

AJ243591 

Xenopus laevis hnRNP-E2 protein 

694 

83 .594 

9887 

Z19153 

Unknown similar to Sodium/phosphate 
transporter; cDNA EST EMBL:D67894 comes from 
this gene; cDNA ES 

710 

43 .882 

9888 

AL022157 

Homo sapiens SPIN (SPINDLIN HOMOLOG (PROTEIN 
DXF34) ) 

274 

72 .000 

9889 

U29096 

Caenorhabditis elegans coded for by C. elegans 
cDNA yk44f2.5; similar to P59 protein (HSP 
binding immunophilin) and to TPR domain 

354 

39. 548 

9890 

AF016452 

Caenorhabditis elegans similar to 1-acyl- 
glycerol - 3 -phosphate acyl transferases 

327 

44 .792 

9891 

AC004557 

Arabidopsis thaliana F17L21.21 

523 

39.768 

9892 

AF003136 

Caenorhabditis elegans F28B3.4 

184 

42.353 

9893 

X52022 

Homo sapiens collagen type VI, alpha 3 chain 

244 

74.510 

9894 

L20303 

Gallus gallus actin filament-associated 
protein 

666 

45 . 387 

9895 

D80009 

Homo sapiens KIAA0187 

338 

88.333 

9896 

Z46242 

Unknown similar to beta- transducin ; cDNA EST 
EMBL:Z14703 comes from this gene; cDNA EST 
EMBL:D67532 

869 

50 . 871 

9897 

AB012725 

Mus musculus zinc finger protein 

483 

91 .429 

9898 

AJ001693 

Drosophila melanogaster PRUNE protein 

502 

36.071 

9899 

AJ235271 

Rickettsia prowazekii PUTATIVE ENOYL- 
REDUCTASE (fabl) 

419 

46 .825 

9900 

D79983 

Homo sapiens There is a C3HC4 zinc-finger in 
the C- terminal region. 

381 

54.348 | 

9901 

U95044 

Homo sapiens zinc finger protein 

396 

86 .301 

9902 

D90899 

Synechocystis sp. hypothetical protein 

251 

34 .884 

9903 

AB016496 

Mus musculus intelectin 

791 

76 .510 

9904 

X03342 

Homo sapiens rpL32 (aa 1-13 5) 

172 

84 .375 

9905 

AF022891 

Drosophila melanogaster Fuzzy 

165 

36 .000 
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9906 

AL096751 

Homo sapiens hypothetical protein 

1004 

100.000 

9907 

AF156271 

Homo sapiens RING finger protein terf 

196 

77 .419 

9908 

AB014561 

Homo sapiens KIAA0661 protein 

462 

65.766 

9909 

AF059569 

Homo sapiens act in binding protein MAYVEN 

253 

46 .667 

9910 

AF019082 

Borrelia burgdorferi virulent strain 
associated lipoprotein 

249 

26 .506 

9911 

AF058693 

Mus musculus M-RdgB2 retinal degeneration 
protein B subtype 2 

435 

94 . 118 

9912 

U95044 

Homo sapiens zinc finger protein 

675 

69.677 

9913 

U60666 

Homo sapiens testis specific leucine rich 
repeat protein 

225 

33 .155 

9914 

AB018374 

Mus musculus GARP4 5 

647 

51.915 

9915 

Z68014 

Caenorhabditis elegans Similarity to Human DRA 
protein (SW:DRA HUMAN) 

291 

43.333 

9916 

Z82095 

Caenorhabditis elegans similar to PDZ domain 
(Also known as DHR or GLGF) . ; cDNA EST 
EMBL:M75803 comes from this gene 

413 

53 .390 

9917 

AL022238 

Homo sapiens dJ1042K10.3 (novel protein) 

221 

41.414 

9918 

AC005328 

Homo sapiens R26660 1, partial CDS 

169 

72.222 

9919 

AC007369 

Arabidopsis thaliana Putative RNA helicase 

315 

41.667 

9920 

X14549 

Chlamydomonas reinhardtii spoke protein 

428 

46 .207 

9921 

AF117610 

Mus musculus inner centromere protein INCENP 

570 

79. 798 

9922 

AF151835 

Homo sapiens CGI -78 protein 

248 

100.000 

9923 

L14745 

Caenorhabditis elegans coded for by C. elegans 
cDNA GenBank : CEL16D4 (M89109) ; putative 

278 

30.808 

9924 

AB007865 

Homo sapiens KIAA04 05 

266 

53.731 

9925 

X90840 

Homo sapiens axonal transporter of synaptic 
vesicles 

2660 

100.000 

9926 

AF175969 

Rattus norvegicus Leman coiled- coil protein 

371 

39.181 

9927 

U80747 

Homo sapiens CAGH3 

188 

34 . 375 

9928 

S62597 

Drosophila melanogaster , Peptide, 753 aa 
neu=neuralized 

186 

23 .757 

9929 

U63131 

Homo sapiens CDC3 7 homolog 

181 

37.079 

9930 

Y09723 

Homo sapiens Miz-1 protein 

838 

31.299 

9931 

D86728 

Mus musculus TIS 

219 

86 .842 

9932 

X67683 

Homo sapiens keratin K4a 

824 

100.000 

9933 

AC007136 

Homo sapiens Putative map kinase interacting 
kinase 

2718 

100.000 

9934 

Z79755 

Caenorhabditis elegans predicted using 
Genef inder 

186 

54 . 902 

9935 

AC002131 

Arabidopsis thaliana Contains similarity to 
hypothetical protein C18bll.05 gb|Z50728 from 
S. pombe. EST gb|H76601 comes from this gene. 

367 

31.696 

9936 

U00050 

Caenorhabditis elegans No definition line 
found 

377 

35.979 

9937 

AL021918 

Homo sapiens b34I8.1 (Kruppel related Zinc 
Finger protein 184) 

377 

38.806 

9938 

AB005541 

Rattus rattus PCTAIRE3 

341 

67 .416 

9939 

U63648 

Mus musculus pl60 myb-binding protein 

651 

73 . 913 

9940 

Z78201 

Unknown Similarity to E.coli 2-oxoglutarate 
dehydrogenase (SW:OD01_ECOLI) ; cDNA EST 
EMBL:D32590 com 

226 

29.878 

9941 

AF010144 

Homo sapiens neuronal thread protein AD7c-NTP 

67 

75 . 000 

9942 

AC007067 

Arabidopsis thaliana T10O24.10 

364 

61.905 

9943 

U41663 

Rattus norvegicus neuroligin 3 

968 

96 .479 
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9944 

X91617 

Mus musculus 5*-3 f exonuclease 

745 

100.000 

9945 

AF167411 

Mus musculus pendrin 

347 

56.627 

9946 

U80953 

Caenorhabditis elegans weakly similar in 
serine repeat region to rat thyroxine -binding 
globulin (PIR:A3 956 7) and to D. melanogaster 
ecdysone- inducible protein E75-C 
(SP:E75C DROME , PI 3 0 5 5 ) 

163 

37.805 

9947 

AC007633 

Unknown similarity to several hypothetical 
proteins- Arabidopsis thaliana 

226 

61.818 

9948 

U35245 

Rattus norvegicus vacuolar protein sorting 
homolog r-vps3 3b 

431 

98.507 

9949 

M94315 

Oryctolagus cuniculus neurof ilament-H 

226 

24.528 

9950 

AF096896 

Drosophila melanogaster pushover 

197 

24.291 

9951 

Z68117 

Caenorhabditis elegans similar to Probable 
rabGAP domains; cDNA EST yk325d7.3 comes from 
this gene; cDNA EST yk325d7.5 comes from this 
gene 

280 

37 .705 

9952 

AC007018 

Arabidopsis thaliana unknown protein 

308 

33.163 

9953 

U29156 

Mus musculus involved in signaling by the 
epidermal growth factor receptor; Method: 
conceptual translation supplied by author 

1222 

92.857 

9954 

M95762 

Rattus norvegicus GABA transporter 

1153 

89.730 

9955 

U09284 

Homo sapiens PINCH protein 

1648 

85.020 

9956 

AB007872 

Homo sapiens KIAA0412 

320 

73.913 

9957 

U95825 

Homo sapiens androgen- induced prostate 
proliferative shutoff associated protein 

704 

83 .607 

9958 

U68535 

Mus musculus aldo-keto reductase 

1175 

68 .110 

9959 

AF003151 

Caenorhabditis elegans No definition line 
found 

241 

25.987 

9960 

AF060173 

Rattus norvegicus SV2 related protein 

745 

96.522 

9961 

AF004841 

Homo sapiens CDO 

243 

41.489 

9962 

U66561 

Homo sapiens kruppel - related zinc finger 
protein 

273 

56.338 

9963 

U09367 

Homo sapiens zinc finger protein ZNF136 

199 

72 .093 

9964 

U34925 

Drosophila melanogaster TH1 

665 

62 .025 

9965 

U32626 

Drosophila melanogaster unknown 

317 

57.333 

9966 

L08239 

Homo sapiens located at OATL1 

179 

100.000 

9967 

M61866 

Homo sapiens Krueppel -related DNA-binding 
protein 

226 

60.377 

9968 

AB007144 

Homo sapiens ZIP-kinase 

222 

33 .884 

9969 

U75321 

Mus musculus chromaffin granule ATPase II 
homolog 

487 

65.574 

9970 

AF036706 

Caenorhabditis elegans strong similarity to 
class - I aminoacyl - tRNA synthetases; most 
similar to glutaminyl-tRNA synthetases 

558 

50.000 

9971 

X82018 

Homo sapiens ZID, zinc finger protein with 
interaction domain 

540 

63 .415 

9972 

X97324 

Homo sapiens adipophilin 

312 

44 .340 

9973 

AF080070 

Mus musculus zinc finger protein 54 

240 

48.052 

9974 

AF169548 

Homo sapiens gamma- syne rg in 

1069 

100 . 000 

9975 

AF056302 

Drosophila melanogaster eIF-2alpha kinase 

454 

39.891 

9976 

AC003114 

Arabidopsis thaliana T12M4.6 

143 

39.759 

9977 

Z78198 

Caenorhabditis elegans Similarity to E.coli 
glyucerophosphoryldiester phosphodiesterase 
(SWrUGPQ ECOLI) 

203 

46.154 
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9978 

AF098505 

Caenorhabditis elegans similar to Arabidopsis 
thaliana male sterility protein 2 (SW:Q08891) 

498 

44.578 

9979 

D31763 

Homo sapiens ha0 946 protein is Kruppel- 
related. 

383 

55.660 

9980 

AC007591 ! 

Unknown Strong similarity to 

221 

41.667 

9981 

Z48758 1 

Saccharomyces cerevisiae unknown 

404 

46 .043 

9982 

U29187 

Mus musculus prion- like protein 

716 

76 .471 

9983 

AB013607 

Mus musculus c2 9 

528 

70.833 

9984 

X63337 

Homo sapiens high sulfur keratin 

839 

77.397 

9985 

U09824 

Rattus norvegicus 3 0 kDa S-type lectin 

328 

42.857 

9986 

AJ006692 

Homo sapiens ultra high sulfer keratin 

732 

76.068 

9987 

U43292 

Homo sapiens MDS1B 

384 

65.000 

9988 

AL117204 

Caenorhabditis elegans Y116A8C.9 

593 

40.773 

9989 

X55126 

Mus musculus Zfp-2 9 

354 

84 .375 

9990 

AF153208 

Homo sapiens GC-rich sequence DNA-binding 
factor candidate 

719 

100.000 

9991 

AL023781 

Schizosaccharomyces pombe N- terminal 
acetyltransf erase l 

522 

52 .564 

9992 

U59240 

Rattus norvegicus N- tropomodulin 

267 

50.000 

9993 

AF070651 

Homo sapiens zinc finger protein 4 

835 

74 .847 

9994 

AP000399 

Oryza sativa EST AU056133 (S20320) corresponds 
to a region of the predicted gene; similar to 
Caenorhabditis elegans cosmid D1054; 
hypothetical protein (Z74030) 

486 

43 .478 

9995 

U21549 

Mus musculus Ac3 9/physophilin 

379 

69.620 

9996 

AL022318 

Homo sapiens bK150C2 . 7 (PUTATIVE novel protein 
similar to APOBEC1 (Apolipoprotein B mRNA 
editing protein) and Phorbolin) 

1280 

100.000 

9997 

AJ242977 

Homo sapiens p24 3 

371 

100.000 

9998 

Y10495 

Mus musculus CDV-1R protein 

1736 

92.384 

9999 

Z93394 

Caenorhabditis elegans similar to Probable 
rabGAP domains 

829 

50.211 

10000 

U97571 

Mus musculus signaling molecule 

892 

90.714 

10001 

AL021068 

Homo sapiens dJ206D15.1 (Reduced Folate Carier 
protein RFC LIKE) 

764 

64.458 I 

10002 

AF056116 

Fugu rubripes All-1 related protein 

427 

35.156 

10003 

AP000059 

Aeropyrum pernix 115aa long hypothetical 
protein 

54 

29.787 

10004 

X51760 

Homo sapiens zinc finger protein (583 AA) 

345 

80 . 952 

10005 

AB020710 

Homo sapiens KIAA0903 protein 

602 

67 . 717 

10006 

AB025259 

Mus musculus granuphilin-b 

256 

30.769 

10007 

AJ011855 

Homo sapiens PAK4 protein 

391 

72 . 000 

10008 

U06944 

Mus musculus PRAJA1 

262 

66 . 102 

10009 

AF186469 

Rattus norvegicus TM6P1 

288 

29 .605 

10010 

D45913 

Mus musculus leucine- rich-repeat protein 

299 

83 .019 

10011 

AL117203 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

171 

33 .036 

10012 

AB020678 

Homo sapiens KIAA08 71 protein 

759 

77 . 397 

10013 

AF017368 

Mus musculus faciogenital dysplasia protein 2 

315 

57.732 

10014 

U13878 

Mus musculus neural -restrictive silencer 
factor 

361 

34.615 

10015 

S73488 

Rattus sp. zinc finger transcription factor, 
Kid-1 {KRAB A and B regions} 

173 

57 .447 

10016 

AF099011 

Homo sapiens EH-domain containing protein 
testilin 

396 

62 . 921 
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10017 

D61689 

Mus musculus SOX-LZ 

1050 

96.933 

10018 

AF070651 | 

Homo sapiens zinc finger protein 4 

423 

82 .432 

10019 

D63478 

Homo sapiens The KIAA0144 gene product is 
novel . 

600 

58 .282 

10020 

U40802 

Caenorhabditis elegans similar to other 
protein phosphatases 1, 2A and 2B 

420 

35.407 

10021 

AC004472 

Homo sapiens PI. 11659 5 

223 

35 . 593 

10022 

AF051944 

Gallus gallus Xin 

298 

33 .645 

10023 

AF007170 

Homo sapiens unknown 

177 

44.262 ( 

10024 

AC004997 

Unknown match to ESTs AA667999 (NID :g2626700) , 
AA165465 (NID :gl741481) , Z45871 (NID :g575105) , 
and 

532 

55.833 

10025 

Z25535 

Homo sapiens nuclear pore complex protein 
hnupl53 

273 

100 . 000 

10026 

U43503 

Saccharomyces cerevisiae Lph2p 

198 

45 .455 

10027 

D86043 

Homo sapiens SHPS-1 

337 

43 . 704 

10028 

AB011093 

Homo sapiens KIAA0521 protein 

1394 

97.309 

10029 

D17577 

Mus musculus Kiflb 

1152 

97 .159 

10030 

U89264 

Drosophila melanogaster kinesin like protein 
67a 

642 

47 . 926 

10031 

D14663 

Homo sapiens KIAA0107 

305 

94 .118 

10032 

AF132856 

Homo sapiens suppressor of G2 allele of skpl 
homo log 

183 

90.625 

10033 

AB023189 

Homo sapiens KIAA0972 protein 

331 

62 . 025 

10034 

AF031939 

Mus musculus RalBPl- associated EH domain 
protein Repsl 

1647 

92 .636 

10035 

AL117600 

Homo sapiens hypothetical protein 

326 

67.123 

10036 

AL031788 

Schizosaccharomyces pombe conserved 
hypothetical protein. 

429 

51.079 

10037 

AL031276 

Homo sapiens dJ1118D24.1d (part of novel 
protein similar to worm proteins T08G11.1 and 
C25H3 . 9) 

586 

98 . 901 

10038 

D42063 

Homo sapiens RanBP2 (Ran-binding protein 2) 

825 

86.014 

10039 

AJ011306 

Homo sapiens guanine nucleotide exchange 
factor (long isoform) 

317 

100.000 

10040 

AF047010 

Drosophila melanogaster asteroid protein 

196 

40.244 

10041 

Z54096 

Schizosaccharomyces pombe hypothetical protein 

328 

52.294 

10042 

X89571 

Mus musculus human homolog is GPI -anchored 
protein 

436 

52.713 

10043 

AC010077 

Homo sapiens MTDM_HUMAN ; DNA 

METHYLTRANSFERASE; DNA METASE; MCMT; M . HSAI 

249 

77 . 358 

10044 

D86971 

Homo sapiens no similarities to reported gene 
products 

967 

49.684 

10045 

AF099973 

Mus musculus schlafen2 

258 

36 .111 

10046 

X78077 

Equus caballus link protein 

214 

36 .250 

10047 

U80448 

Unknown coded for by C. elegans cDNA CEESS55F; 
coded for by C. elegans cDNA yk84al.3; coded 
for by 

185 

54.000 

10048 

U39703 

Mycoplasma genital ium DNA helicase II, 
putative 

183 

25 . 087 

10049 

AF100657 

Caenorhabditis elegans Contains similarity to 
Pf am domain: PF00614 (PLDc) , Score=13.8, E- 
value=0.2, N=l 

481 

33.660 

10050 

AC007259 

Arabidopsis thaliana Hypothetical protein 

189 

31.915 

10051 

AF099973 

Mus musculus schlafen2 

338 

37 .267 
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10052 

U41559 

Caenorhabditis elegans No definition line 
found 

300 

38.095 

10053 

AB018341 

Homo sapiens KIAA07 98 protein 

423 

70.787 

10054 

S44213 

Saccharomyces cerevisiae, Peptide, 323 aa 
YKL522=mitochondrial ADP/ATP carrier protein 
homo log 

211 

35.294 

10055 

AF111423 

Xenopus laevis chromosome condensation protein 
XCAP-G 

719 

64 . 162 

10056 

AJ242978 

Homo sapiens p621 

1835 

100.000 

10057 

AL023777 

Schizosaccharomyces pombe hypothetical protein 

428 

32.464 

10058 

AB018285 

Homo sapiens KIAA0742 protein 

166 

40 . 000 

10059 

AC006528 

Arabidopsis thaliana putative DNA replication 
licensing factor with an MCM family domain 
(prosite : PDOC00662 ) 

228 

29.612 

10060 

AF077818 

Mus musculus syntrophin-associated serine- 
threonine protein kinase 

205 | 

62.000 

10061 

U15173 

Homo sapiens BCL2 /adenovirus E1B 19kD- 
interacting protein 2 

449 

40 . 722 

10062 

AC002328 

Arabidopsis thaliana F20N2.6 

53 

50 . 000 

10063 

Z98974 

Schizosaccharomyces pombe hypothetical PSU1- 
like protein 

323 

45. 918 

10064 

AB018413 

Homo sapiens Gab2 

551 

58 . 462 

10065 

AF151816 

Homo sapiens CGI- 58 protein 

289 

100 . 000 

10066 

AF099973 

Mus musculus schlafen2 

204 

40.187 

10067 

Y12090 

Lycopersicon esculentum putative 3,4- 
dihydroxy-2-butanone kinase 

1197 

41 . 199 

10068 

U49046 

Mus musculus Zfp64 

1594 

89 .494 

10069 

L39211 

Homo sapiens carnitine palmitoyltransferase I 

252 

58 . 730 

10070 

AB000199 

Rattus norvegicus CCA2 protein 

554 

84 . 848 

10071 

AC007228 

Homo sapiens BC372 95 1 

252 

65 . 000 

10072 

AF060246 

Mus musculus zinc finger protein 106 

264 

97 . 436 

10073 

X81058 

Mus musculus tex261 

1140 

99.422 

10074 

AB020675 

Homo sapiens KIAA086 8 protein 

439 

46 . 715 

10075 

AJ011928 

Drosophila melanogaster Fidipidine 

586 

52 . 174 

10076 

X98834 

Homo sapiens zinc finger protein Hsal2 

706 

100 . 000 

10077 

AF151363 

Mus musculus Cdc42 GTPase- activating protein 

499 

69.608 

10078 

AF149413 

Arabidopsis thaliana contains similarity to 
histone deacetylases ; Pfam PF00850, 
Score=13.3, E=5e-10, N=l 

641 

53 . 093 

10079 

AC007842 

Homo sapiens BC331191 1 

834 

51 . 711 

10080 

AB002334 

Homo sapiens KIAA0336 

213 

97 . 143 

10081 

S76838 

Mus sp. Dbs=Dbl guanine nucleotide exchange 
factor homolog 

287 

59.420 

10082 

AB015306 

Homo sapiens Leukotriene B4 omega -hydroxylase 

750 

81 .818 

10083 

AB029000 

Homo sapiens KIAA1077 protein 

329 

88 . 679 

10084 

AC002528 

Homo sapiens Genscan gene prediction; 90% 
similarity to AA023673 (NID :gl4 87590) 

765 

99 . 099 

10085 

X54618 

Listeria monocytogenes phosphadidyl inositol 
specific phospholipase C 

197 

28 . 571 

10086 

X57303 

Homo sapiens REC1L 
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10087 

AB023178 

Homo sapiens KIAA0961 protein 

331 

68.421 

10088 

AF061243 

Homo sapiens metalloprotease 1 

1936 

98.913 

10089 

AF038563 

Homo sapiens membrane associated guanylate 
kinase 2 

665 

100. 000 

10090 

AJ010017 

Homo sapiens zinc finger protein 

200 

44 .615 


°&1 


JL O O S NHb „, 11 S 1 2! iS O 2 


10091 

AF067806 

Mus musculus cAMP- specif ic cyclic nucleotide 
phosphodiesterase PDE8 ; MMPDE8 

300 

54.167 

10092 

U44731 

Mus musculus purine nucleotide binding protein 

2293 

74.730 

10093 

Z70269 

Unknown predicted using Genefinder; Similarity 
to Yeast hypothetical protein YHG1 
(SW:YHG1 YEAST) ; 

167 

54.348 

10094 

AB014516 

Homo sapiens KIAA0616 protein 

235 

45.669 

10095 

AB018293 

Homo sapiens KIAA07 50 protein 

487 

66 . 038 

10096 

D90400 

Human papillomavirus type 58 open reading 
frame E5 

71 

44.444 

10097 

AB011128 

Homo sapiens KIAA0556 protein 

325 

100 . 000 

10098 

X89426 

Homo sapiens ESM-1 secretory protein 

197 

33 . 588 

10099 

Z48334 

Caenorhabditis elegans F10B5.8 

787 

76.471 

10100 

X14971 

Mus musculus alpha-adaptin (A) (AA 1-977) 

688 

100.000 

10101 

U81788 

Drosophila melanogaster kinesin-73 

402 

50.400 

10102 

L12579 

Homo sapiens alternatively spliced 

380 

88.406 

10103 

U22296 

Rattus norvegicus casein kinase 1 gamma 1 
isof orm 

408 

60.177 

10104 

M22743 

Oryctolagus cuniculus lambda- crys tall in 
precursor 

202 

81.081 

10105 

U44129 

Rattus norvegicus p58 

479 

46 .154 

10106 

Z92813 

Unknown similar to WD domain, G-beta repeat (3 
domains) ; cDNA EST EMBL:D694 52 comes from this 
gene ; 

213 

34 . 579 

10107 

AF031370 

Rattus norvegicus PLC-b4b 

1014 

95 .425 

10108 

Z35639 

Caenorhabditis elegans similar to TNF-alpha 
induced Protein B12; cDNA EST yk579c8.3 comes 
from this gene 

224 

45.055 

10109 

AF175292 

Mus musculus neuronal IL-16 

393 

84 .286 

10110 

AB012099 

Pyrococcus kodakaraensis Glycerol Kinase 

223 

36 .047 

10111 

AB023658 

Rattus norvegicus Ca/calmodul in- dependent 
protein kinase kinase alpha, CaM-kinase kinase 
alpha 

1070 

89.205 

10112 

L06940 

Escherichia coli tetracycline resistance 
protein 

296 

32 . 515 

10113 

U75329 

Homo sapiens serine protease 

173 

41 . 026 

10114 

L22557 

Rattus norvegicus calmodulin- binding protein 

673 

99.010 

10115 

AL035528 

Arabidopsis thaliana hypothetical protein 

207 

55.738 

10116 

M15888 

Bos taurus endozepine- related protein 
precursor 

1198 

86 . 829 

10117 

AF079974 

Mus musculus Rac GTPase-activating protein 

588 

77 . 778 

10118 

Z72511 

Unknown possible zinc finger protein; cDNA EST 
EMBL:M8 9115 comes from this gene; cDNA EST 
EMBL : D715 

1343 

57 . 576 

10119 

M83679 

Rattus norvegicus RAB15 

552 

82 . 524 

10120 

AF023450 

Homo sapiens Down syndrome cell adhesion 
molecule 

518 

60 .000 

10121 

U80955 

Caenorhabditis elegans contains a domain found 
in band 4.1, ezrin, moesin, radixin and talin 

335 

45.192 

10122 

U15002 

Rattus norvegicus COUP-TFI 

271 

74 . 000 

10123 

L40378 

Homo sapiens cytoplasmic antiproteinase 3 

354 

100.000 

10124 

AL121764 

Schizosaccharomyces pombe putative ATP- 
dependent RNA he li case 

491 

60.000 

10125 

AC003682 

Homo sapiens F18 54 7 1 

210 

67.391 

10126 

M58583 

Homo sapiens precerebellin 

234 

48 .485 
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10127 

AF055017 

Homo sapiens unknown 

398 

32 . 917 

10128 

AL050284 

Homo sapiens hypothetical protein 

216 

70.213 

10129 

AF078828 

Homo sapiens talin 

652 

72.180 

10130 

U58746 

Caenorhabditis elegans weak similarity to 
regions of guanine-nucleotide releasing 
factors 

168 

48.980 

10131 

Z81521 

Unknown predicted using Gene finder; cDNA EST 
EMBL:C09934 comes from this gene; cDNA EST 
EMBL:C08308 

259 

32.877 

10132 

AJ010585 

Rattus rattus PTB-like protein 

753 

97 . 581 

10133 

Z68297 

Unknown cDNA EST EMBL:D32434 comes from this 
gene; cDNA EST EMB1*:D33 710 comes from this 
gene ; cDNA 

335 

37.415 

10134 

D89928 

Homo sapiens HKL.1 

647 

86.555 

10135 

L16559 | 

Caenorhabditis elegans putative 

224 

33 . 793 

10136 

AB009372 

Rattus norvegicus Lysophospholipase 

1408 

86.585 

10137 

AL035496 

Homo sapiens dJ437022.1 (novel VHS domain 
containing protein similar to predicted worm 
and human proteins) 

200 

96 . 552 

10138 

AF187318 

Homo sapiens F-box protein Fbx2 

229 

43.243 

10139 

Z83760 

Ciona intestinalis COS41.5 

248 

32.283 

10140 

U81788 

Drosophila melanogaster kinesin-73 

927 

66 .351 

10141 

AB024400 

Rattus norvegicus LAT4 

1006 

64 . 103 

10142 

U23522 

Caenorhabditis elegans No definition line 
found 

180 

32.061 ; 

10143 

AJ243460 

Leishmania major proteophosphoglycan 

173 

21.577 

10144 

Y08774 

Silene latifolia Men-2 

115 

37.097 

10145 

Z71262 

Unknown similar to serine/threonine kinase; 
cDNA EST EMBL:D27596 comes from this gene; 
CDNA EST EMB 

161 

29. 032 

10146 

AB010363 

Mus musculus mszfSl 

164 

37.500 

10147 

AF000196 

Caenorhabditis elegans No definition line 
found 

160 

30.556 

10148 

AC005306 

Homo sapiens R27216 1 

423 

98.507 

10149 

X98709 

Homo sapiens COL1A1 and PDGFB fusion 
transcript 

146 

31.068 

10150 

AF099053 

Mus musculus phosphatidylserine synthase- 2 

2504 

89. 776 

10151 

U02928 

Dictyostelium discoideum Rab7 

362 

36.207 

10152 

X98475 

Mus musculus vasodilator- stimulated 
phosphoprotein 

164 

29.412 

10153 

AL080200 

Homo sapiens hypothetical protein 

138 

37.113 

10154 

AB002374 

Homo sapiens KIAA03 76 

712 

36.215 

10155 

L04733 

Homo sapiens kinesin light chain 

504 

53 . 521 

10156 

AL023592 

Schizosaccharomyces pombe zinc finger protein 

295 

45 . 055 

10157 

L07063 

Mus musculus FKBP65 binding protein 

819 

87.234 

10158 

X82200 

Homo sapiens gpStaf50 

907 

62 . 679 

10159 

AF106682 

Homo sapiens spindlin 

256 

68 .421 

10160 

AF125385 

Drosophila melanogaster L82B 

173 

44 .444 

10161 

Y17282 

Homo sapiens cytokeratin type II 

818 

59.244 

10162 

AB029023 

Homo sapiens KIAA110 0 protein 

784 

50 . 000 

10163 

Y17282 

Homo sapiens cytokeratin type II 

294 

43.972 

10164 

AF173681 

Mus musculus thioredoxin interacting factor 

348 

38.372 

10165 

AF125963 

Caenorhabditis elegans No definition line 
found 

159 

25.743 

10166 

X78925 

Homo sapiens zinc finger protein 

162 

60. 870 
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10167 

AF013969 

Mus musculus antigen containing epitope to 
monoclonal antibody MMS-85/12 

773 

66.667 

10168 

AB017437 

Gallus gallus avena 

1096 

93.143 

10169 

X75760 

Drosophila melanogaster LRR4 7 

485 

29.443 

10170 

AJ007014 

Homo sapiens AMMECR1 protein 

1134 

55 - 000 

10171 

J04802 

Homo sapiens open reading frame A 

161 

96.154 

10172 

AF057170 

Homo sapiens bestrophin 

401 

70.370 

10173 

U70855 

Caenorhabditis elegans similar to the RAS gene 
family 

382 

48.305 

10174 

AB025410 

Mus musculus Ten-ml 

835 

93.893 

10175 

AF091434 

Homo sapiens secretory growth factor- like 
protein fallotein 

184 

49 . 180 

10176 

U95044 

Homo sapiens zinc finger protein 

525 

76 . 786 

10177 

M20681 

Homo sapiens glucose transporter- like protein 

201 

83 . 784 

10178 

AF067165 

Homo sapiens zinc finger protein 3 

240 

55 . 072 

10179 

D88026 

Mus musculus Dhm2 protein 

385 

82.090 

10180 

AL035419 

Homo sapiens dJ1100H13.1 (putative novel 
protein) 

252 

100.000 

10181 

X97064 

Homo sapiens Sec23 protein 

2794 

100 . 000 

10182 

X79828 

Mus musculus NK10 

531 

90.588 

10183 

AC002332 

Arabidopsis thaliana putative NAD ( P )- dependent 
cholesterol dehydrogenase 

245 

40.336 

10184 

Z48166 

Schizosaccharomyces pombe gar2 

176 

28.175 

10185 

Z81525 

Unknown cDNA EST yk282b7.5 comes from this 
gene; cDNA EST EMBL:D28011 comes from this 
gene; cDNA ES 

512 

39.640 

10186 

AL110487 

Caenorhabditis elegans Y39E4B.7 

491 

47.794 

10187 

AB018341 

Homo sapiens KIAA0798 protein 

396 

74 . 074 

10188 

L42178 

Ovis aries carbonic anhydrase I 

620 

64 .615 

10189 

L11275 

Saccharomyces cerevisiae selected as a weak 
suppressor of a mutant of the subunit AC4 0 of 
DNA dependant RNA polymerase I and III 

191 

24 .876 

10190 

AB018260 

Homo sapiens KIAA0717 protein 

260 

94.595 

10191 

AF118767 

Homo sapiens endothelial lipase 

2456 

99.446 

10192 

AF096771 

Homo sapiens kinase related protein 

200 

26.596 | 

10193 

U47920 

Pseudomonas aeruginosa dihydrolipoamide 
acetyl transferase 

444 

72 . 917 

10194 

U18009 

Homo sapiens similar to Pacific ray VAT1 
protein, Swiss-Prot Accession Number P19333 

422 

49.618 

10195 

AL035086 

Homo sapiens dJ44A20.3 (novel protein similar 
to worm F32F2 . 1) 

369 

55 .455 

10196 

L01089 

Homo sapiens profilaggrin 

447 

38 . 095 

10197 

AL035702 

Homo sapiens dJ593C16.1 (ras GTPase activating 
protein) 

2171 

73 . 131 

10198 

X92969 

Mus musculus odorant receptor 

360 

52 . 525 

10199 

AF062249 

Homo sapiens immunoglobulin heavy chain 
variable region 

699 

85 . 593 

10200 

AF125569 

Homo sapiens tumor suppressing STF cDNA 6 

718 

83 . 333 

10201 

Z99118 

Bacillus subtilis similar to hypothetical 
proteins 

159 

32 . 143 

10202 

AB020654 

Homo sapiens KIAA0847 protein 

2158 

98.795 

10203 

Y18102 

Oryctolagus cuniculus titin 

436 

95.652 

10204 

D90706 

Escherichia coli PhoH protein homolog. 

1451 

100.000 

10205 

D9O708 

Escherichia coli Putrescine transport protein. 

704 

100.000 

10206 

D90717 

Escherichia coli Hypothetical 38.9 kd protein 

515 

100.000 


9m 
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in ding/rarB 3 ' region (o361) . 



10207 

X56958 

Homo sapiens ankyrin (brank-2) 

179 

96 .429 

10208 

D90730 

Escherichia coli Hypothetical protein 63 (MukB 
3 1 reqion) 

655 

98 . 058 

10209 

X55683 

Lycopersicon esculentum extensin (class I) 

122 

59 . 259 

10210 

Z70683 

Unknown Weak similarity to Human tyrosine- 
protein kinase CSK (SW : CSK_HUMAN) ; cDNA EST 
EMBL:C10908 C 

291 

41.284 

10211 

AE001373 

Plasmodium falciparum predicted secreted 
protein 

383 

25 . 198 

10212 

U28377 

Escherichia coli ORF fl41 


yb 

10213 

D90754 

Escherichia coli Hypothetical protein HI0761 

99b 

yd . ouu 

10214 

AF109134 j 

Homo sapiens 7-60 

con 

^U . 

10215 

M13934 

Homo sapiens ribosomal_protein S14 

885 

99.265 

10216 

AF055666 

Mus musculus kinesin light chain 2 

524 

ob . b4z 

10217 

X70032 

Mus musculus nebulin 

423 

42.143 

10218 

D42167 

Ciona intestinalis myoplasmin-Cl 

246 

28 . 477 

10219 

AF034580 

Mus musculus TSG118.1 

811 

54 . 426 

10220 

AF128535 

Mus musculus cytoplasmic phosphoprotein 
PACSIN2 

1389 

56 .891 

10221 

AF084928 

Homo sapiens erythroblast macrophage protein 
EMP 

811 

98 .450 

10222 

AL050321 

Homo sapiens dJ717M23.1 (novel gene) 

576 

98 . 864 

10223 

AF113615 

Homo sapiens FH1/FH2 domain- containing protein 
FHOS 

833 

62 . 189 

10224 

Z49967 

Unknown cDNA EST EMBL:T00743 comes from this 
gene; cDNA EST EMBL:D6 9356 comes from this 
gene ; cDNA 

173 

34 . 694 

10225 

AB025412 

Mus musculus Ten-m3 

1096 

96 .471 

10226 

X99145 

Canis familiaris overexpressed in thyroid 
tissue after TSH stimulation 

184 

48 . 980 

10227 

AF037454 

Mus musculus ubiquitin protein ligase 

447 

98 . 551 

10228 

Z21878 

Physarum polycephalum myosin- related protein 

A C "> 

4 S3 


10229 

X70208 

Homo sapiens immunoglobulin M chain 

631 

90 .291 

10230 

AB018285 

Homo sapiens KIAA0742 protein 

258 

Jl . bo / 

10231 

X82157 

Homo sapiens hevin 

466 

98 .684 

10232 

D90832 

Escherichia coli ORF ID:o341#12; similar to 

234 

100 .000 

10233 

AE000186 

Escherichia coli orf, hypothetical protein 

1155 

100 . ooo 

10234 

X73413 

Escherichia coli 0RF2 

24 7 

95 . OOO 

10235 

D90743 

Escherichia coli Rod protein FlgC 


yb . bb / 

10236 

D90703 

Escherichia coli Lipoic acid synthetase (lip- 
syn) . 

48 0 

1UU . uuu 

10237 

X01563 

Escherichia coli L5 (rplE) (aa 1-179) 

/ / y 


10238 

U28377 

Escherichia coli ORF o4 92 

/ b Z 


10239 

U00039 

Escherichia coli treF 

1305 

99.468 

10240 

D90872 

Escherichia coli similar to 

54 5 

lOO . ooo 

10241 

AJ001614 

Escherichia coli catabolite gene activator 
protein 

22 8 

9b . 9 / u 

10242 

U14003 

Escherichia coli ORF o243 

A "5 C 

4 Jo 

XUU . UUU 

10243 

XObl 16 

T?or«Vior*'i r»"h -J a nnl *i UP ( TT 1 ( rrl tiT. cr^nf* nroduct ) 
(AA 1-349) 

570 

89 . 423 

10244 

AE000336 

Escherichia coli orf, hypothetical protein 

852 

100-000 

10245 

AE000396 

Escherichia coli orf, hypothetical protein 

858 

98 . 571 

10246 

U28377 

Escherichia coli ORF ol79 

611 

97 . 872 

10247 

D90769 

Escherichia coli Sn-glycerol- 3 -phosphate 

416 

100.000 
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transport system permease protein UgpA. 



10248 

D90699 

Escherichia coli Hypothetical 16.8 kd protein 
in nfrA 3 ' region precursor. 

520 

100.000 

10249 

AE000406 

Escherichia coli orf, hypothetical protein 

469 

r\ C COO 

95 . boo 

10250 

U00006 

Escherichia coli No definition line found 

502 

85 - 542 

10251 

AE000458 

Escherichia coli putative enzyme 

378 

90.323 

10252 

D90809 

Escherichia coli Superoxide dismutase (EC 
1.15.1.1) (Fe) 

566 

100 .000 

10253 

AE000396 

Escherichia coli orf, hypothetical protein 

426 

100. 000 

10254 

X15996 

Escherichia coli narX 

165 

100 . ooo 

10255 

X12614 

Escherichia coli uroporphyrinogen III 
cosynthetase (AA 1 - 246) 

1188 

100 . 000 

10256 

AE000402 

Escherichia coli orf, hypothetical protein 

576 

100 . 000 

10257 

X04711 

Escherichia coli put. 16S rRNA 
methyltransf erase (ksgA) (AA 1-273) 

451 

100.000 

10258 

U00007 

Escherichia coli yehW 

434 

100 . 000 

10259 

D90740 

Escherichia coli ORF ID:o231#4 

321 

98 . 000 

10260 

U29579 

Escherichia coli alternate name ygbC; ORF1 of 
L07942, has different start due to frameshift 
in L07942; ORF f255 

1112 

98.837 

10261 

X14436 

Escherichia coli ORF4 (AA 1-197) 

1119 

yy . 4o b 

10262 

AE000334 

Escherichia coli orf, hypothetical protein 

748 

y y . u y i 

10263 

D90756 

Escherichia coli Protoporphyrinogen oxidase 
(EC 1.3.3.4) hemK 

638 

98.958 

10264 

AE000242 

Escherichia coli orf, hypothetical protein 

563 

92 . 784 

10265 

U29579 

Escherichia coli ORF o433 

591 

95 . o y y 

10266 

U82664 

Escherichia coli hypothetical protein 

927 

100.000 

10267 

AE000142 

Escherichia coli putative esterase (EC 
3.1.1.1) 

634 

100 .000 

10268 

AF009205 

Homo sapiens unknown 

163 

96.429 

10269 

D90872 

Escherichia coli similar to 

1280 

98 . 02 0 

10270 

AL032654 

Caenorhabditis elegans similar to Heme-binding 
domain in cytochrome b5 and oxidoreductases 

386 

52 . 83 0 

10271 

AB019280 

Mus musculus sprouty-4 

453 

83 . 750 

10272 

U40953 

Caenorhabditis elegans No definition line 
found 

437 

32 . Job 

10273 

AB002349 

Homo sapiens KIAA0351 

347 

6 3 . 52 9 

10274 

AF043695 

Caenorhabditis elegans similar to zinc 
metalloprotease family of peptidases 

168 

J / . obi 

10275 

L47274 

Encephalitozoon intestinalis beta-tubulin 

351 

O Q 111 

10276 

S66427 

Homo sapiens retinoblastoma binding protein 1, 
RBP1 

812 

82.759 

10277 

AB011166 

Homo sapiens KIAA0594 protein 

847 

99 . 24z 

10278 

U49973 

Homo sapiens ORF1 ; MER37; putative transposase 
similar to pogo element 

302 

O C A C 1 

3b . 4b 1 

10279 

Z93785 

Unknown predicted using Genefmder; similar to 
RNA recognition motif, (aka RRM, RBD, or RNP 
domain) 

4 90 

o b . jfi / 

10280 

AF132177 

Drosophila melanogaster unknown 

Z. /, Z. 

CC. (\1Q 

o o . woo 

10281 

AF03 3bb4 


1932 

88.785 

10282 

U93305 

Homo sapiens synaptophysin 

601 

66 .667 

10283 

X58636 

Mus musculus lymphoid enhancer factor 1 

932 

100. 000 

10284 

AF009329 

Rattus norvegicus enhancer-of -split and hairy- 
related protein 1 

529 

98 . 718 

10285 

AL049558 

Schizosaccharomyces pombe hypothetical protein 

267 

37 .725 
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10286 

L15309 

Homo sapiens zinc finger protein 

237 

85.714 

10287 

Z98596 

Schizosaccharomyces pombe putative SMC family- 
protein 

302 

55.000 

10288 

U13152 

Mesocricetus auratus guanine nucleotide- 
binding protein beta 5 

362 

53.636 

10289 

X93302 

Saccharomyces cerevisiae Msn5 protein 

173 

35.200 
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table 7 : Neural Network SignalP Program Data 


SEQ 

BEGINNING 

MAXIMUM 

MEAN 

AMINO ACID SEGMENT 

ID 

AND END 

S SCORE 

S 

CONTAINING SIGNAL PEPTIDE 

NO. 

NUCLEOTIDE 
LOCATION 
CORRES- 
PONDING TO 
AMINO ACID 
SEGMENT 


SCORE 

(A=Alanine, C=Cysteine, D=Aspartic 
Acid, E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, Q=Glutamine, 
R=Arginine, S=Serine, T=Threonine, 
V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown) 


1 

188:323 

0.946 

0.85 

MLKSSVDLLFLFSMCVLSINNLRSSLLAVHALS 
GDTLVTEIQTGN 

2 

916:1051 

0.94 

0.875 

MLSAILLLLQLWDSGAQETDNERSAQGTSAPL 
LPLLQRFQSIICR 

3 

13559:13424 

0.99 

0.968 

MLTLALLVLRICVCEAASTFVCPCLPWLSLLFL 
HLLPRLFQVQIW 

4 

11219:11354 

0.984 

0.936 

MLRSLLVYMLFLLVTLLASYGDASCHGHAYRL 
QSAIKQELHSRAF 

5 

34:169 

0.983 

0.936 

MAWKTLPIYLLLLLSVFVIQQVSSQDLSSCAGR 
CGEGYSRDATCN 

6 

7698:7563 

0.99 

0.928 

MESELTWQIAWRLLLLPVVAATWAVAAVRAG 
TQDRVLADRKLGS 

7 

13595:13730 

0.966 

0.884 

MFRFLGVLLGIAIRTGSPLSLNLAEPVWKQLA 
GMSLTIADLSEVD 

8 

1527:1392 

0.997 

0.92 

MWYGVFLWALVSSLFFHVPAGLLALFTLRHHK 
YGAAIAGVYRAAG 

9 

1683:1548 

0.997 

0.92 

MWYGVFLWALVSSLFFHVPAGLLALFTLRHHK 
YGAAIAGVYRAAG 

10 

466:331 

0.95 

0.651 

MGKSLASQFPITLIFSAFSSTFCLLDGLFISCPC 
TSTELPKVNSL 

11 

768:633 

0.956 

0.821 

MVLSGIHWYSVLLLAVEFCRYCPLRYRCSTFS 
SWARVSSTPQASS 

12 

604:739 

0.991 

0.957 

MMCLVMASFGFILCGLCADSSVCLGLCHLLNL 
GSFQPLVLWIFFS 

13 

189:54 

0.969 

0.795 

MYLVKGLLIGLHSILLCLREQGGLRRVERDEG 
TASWYSSQNTYN I 

14 

1012:877 

0.993 

0.897 

MJQLFFVLYGILALAFLSGYYVTLAAQILAVLLP 
PVMLLIDGNVA 

15 

191:326 

0.958 

0.769 

MTNFFWCCLSLWGLQSRFREYLTSTFLVLGE 
SPCHTHLLKGCSV 

16 

206:71 

0.941 

0.726 

MPWGSLFGVESLWCSLTFLPFDDLSLLRLWE 
GNCVYCFAVLMYYS 

17 

284:149 

0.972 

0.902 

MLHSQLLAVSFRLIVTLPLSIQDWDDAENMKG 
LQYIFNTLWSVSS 

18 

311:176 

0.974 

0.852 

MHLPTLFFFLRIILPTQGPLQFPYEFEDGLFHF 
CRKGGHWNFDRD 

| 19 

154:289 

0.962 

0.784 

MVLHRKLVSFFSIWIFSFPSVIHLKRLSFPQRM 
FLACSSKISWLE 
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IVIOO VvJLILI 1 1— 1 IVII v-/ 1 1 AAI_/A/A| r\ 1 1 OL AAI_ 1 1_ \~J 1 l_L_ 

TIFLDDIHFV 

155 

406:271 

0.943 

0.901 

MLLLWGRLTWRKMCRKLLDMTF^KTNTL W 

IVIL.L.L. V V \S 1 \L_ 1 V V 1 M \ 1 VI V— / 1 \l \ L_ [_ !_/ IVI 1 1 JUI\ 1 1 1 1 L V V 

RQRCGRPGGGVLLR 

156 

18:153 

0.977 

0.826 

MF^ CP LQTVPLAVFFI 1 AGIAVHI <*HI VI KM K 

IV| | «JV/r UW 1 V l L_/A VII uunvj l/A V 11 l_\_> III— V t_r\ 

SLIFYVDFDHPP 

157 

150:15 

0.921 

0.745 

MAPGGAKGQGASAl AI 1 Fll ASPATf^Gf^PRI 
WRAGGLGFTHCQAN 

158 

115:250 

0.955 

0.862 

MHRL PAATRATLILSLAFASLHSAPSAFASSSN 
SSTLTAHHPDPG 

159 

816:681 

0.939 

0.87 

Mil I 1 STFFPPFRFDSPFYKKYVni WOW! MPV 

IVI ILLLO 1 1 1 VWl IXLUOVu/l 1 r\r\ I VULVUVVLIVIrV 

IPALWEAKVGGSL 

160 

243:378 

0.937 

0.751 

MFI YASI ITVYGFVKIOPVAFTPI 1 Hm YMPI 

IVIIL I nOLI 1 V i orv rxiwr V AAl 1 ULLnULl IN i l_V_> 

SQPATHRTFPCA 

161 

70:205 

0.884 

0.588 

MWHAFPMAKI 1 Yl f^lROV^K'PI ANRIKFAAR 
ivi v vVJnrrivirtixuL i LOir\wVOr\rLnlNr\li\Cnr\r\ 

RSEFFKTYICLPP 

162 

213:348 

0.987 

0.932 

R4i HFYM IIFII NPI Fl POAI PTPII PI nn^VIPI l<^ 
FALAFLLW 

163 

508:373 

0.989 

0.875 

MKAI PI 1 1 1 PVI f^l 1 V^^KTI P^MFFAINFRIO 

IVIr\/AL_\»/L.L.I— l_l V LOLLVOOiN 1 LUOIViC Cr\l IN L_r\l W 

EVAGSLIFRAIS 

164 

18:153 

0.941 

0.818 

MVPWHFI F^l IRKITPAI 1 nWRPDV^Tn^Rni 
ivi v i v v i 1 1 i_i o i_ i r\rx i i r mllo vvrvr w v o i oorwjL 

DEQGVRGPGAQGP 

165 

20:155 

0.957 

0.763 

MI MPFIPTASTVWKITI RA^iHPVYI M^PD^WK 
ivi i_ivi riu i /ao i v win i l_ r\A\ vp n v*> v i liviowmOWix 

SPAEFSAHLSNFTV 

166 

292:427 

0.984 

0.861 

Ml PI LL WPIPI 1 SI PSPPWAPHF^^n^KYAVII 
EPLSLLSHLPGT 

167 

244:379 

0.964 

0.826 

MOKFFI ITYVl RFAI nV/ANIYIFIPIFIMMFIMI^M 

IVIVJ|\I 1 l_l 1 I V AAl_U/ V /AIN 1 ll l^/l r IIMIVIi 1 IVI IOIVI 

PLSFKFWGII 

168 

241:376 

0.981 

0.822 

MFFWRl | ii YSSPFITVPI HI FT^P;! KMDHI R 
SHCHLFRQALFL 

169 

8:143 

0.932 

0.681 

MY/OSI WKTVI AVFYKTKPTI TIOFM^AIVI 1 HI 
ivi v wou v v rv i v i_AA v r i im r\w I L I lyriNOni v lloi 

YPNELKTDAYKD 

170 

131:266 

• 

0.971 

0.585 

Ml FAWKV55FFPFOFKI PFI RFDI 1 1^1 FIKOAP 
ivi i—i aa v v i \ v o i i i » wrrxLurLrM-ULLiOLr i r\ 

GARKPLHQHSIC 

171 

107:242 

0.937 

0.766 

MKSFII PSSRYI *^l Vr^FWPTI VfiW^F^MMI 1 
FSKVILVYFIHGL 

172 

33:168 

0.953 

0.862 

MDHPHA! Fl PI PI 1 Tl ONATTFTWPFI 1 
NMQVSRGRSSVFS 

173 

198:333 

0.983 

0.903 

MLSFKSFIVLYFIFRSMICFELLLVKGIRSLSRFV 
FYMVWSICSS 

174 

138:273 

0.888 

0.626 

MPCSGSSVQTFRPLLIFHNVTFFILPVKCFNALI 
NVLERPFWQLL 

175 

362:227 

0.924 

0.818 

MMRVIILIWFRISKGTFQHSTTKCDVCFRVFLL 
SNCSFLSLNYKL 

176 

146:281 

0.944 

0.622 

MGGFKVNLFVKVKAEGSPLCWLKLACLGAWL 
LSLLISQKSDEQLC 

177 

59:194 

0.972 

0.68 

MSSSGQLGHPPRAPHSWRRWCWWLFMLAT 
SLSRRRRPSTPLIHYR 

178 

58:193 

0.975 

0.788 

MATPSSPVWVAHSGLPPLFSSGLSWRLVPLF 
WCLQSLTGFLGPCLP 
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17S 

) 48:183 

0.95C 

5 0.62C 

MK"IFI ^R\/l Pfif^pi A\/l CI PXAAA/Dl MDCT/T\r*r > 
ivir\i r lo r\ V Lro CLM V UV/ Lr VV VV Y LIN \r toUOO 

\ VFTPWHMGRENPGR 

18C 

) 222:87 

0.98e 

> 0.917 

ivir am_l_ v ur\v Vr\VLL"IVIVL 1 oLvj VtCLfStHVI VLL 

'LPVPCAAIIGSFK 

181 

174:309 

0.92C 

> 0.71 

MTFWI ADnQFFAATI 1 1 1 XA/ADI U'Tr^DI TMCC\/ 
ivi I cnLMUUoCr/\M 1 LLLLVVAKLfN 1 bKL 1 Nor V 

NGPGQHGKMCCIL 

182 

102:237 

0.931 

0.584 

MOYWNIf^HKTRR^I YTKMATMr^l if^Afil paap 
iviw i vviNonrv I unoL T 1 PJVJA 1 MoLloAuLrAAr 

LLWTYEAQATTCA 

182 

91:226 

0.95S 

> 0.811 

ivirAOiMLOOUVVL I CINVLL 1 VALL 1 1 O V T V rlllilN 1 

APKLFHLNVEWP 

184 

99:234 

0.96S 

0.927 

MKMI SOTYI 1 1 C)\ 1 1 Ql QOTPPI k'Fl rilTPCDC 
ivir\ivii_ow i i LLLULLUOLoU I trLrMzLOII 1 rrrr 

LHFPLTLFQPP 

185 

92:227 

0.981 

0.858 

MV/f^Ff^TMRRAriRI PCM \/l \A/I 1 \A/l\A/l APMV 

WSISSRHVLLEEE 

186 

699:834 

0.907 

0.56 

MPTRNJI Al 1 PAP^\/PHTnr:DPPUP\/DV/l nn I 

DGYISVFDIDSDQ 

187 

148:283 

0.911 

0.687 

MTIII OIFTVIFI 1 VI ADnT\/DDI T~I1TV*1VJ! APIV/l/Tl 

IVI I HLUIt 1 VlrLLYLArU 1 VKKL 1 II 1 CjMAoIVKU 

QIDSHITDP 

188 

181:316 

0.959 

0.818 

KAl UpCTCp/^pOUl 1 A\/l 1 Al 1 ^TA\A/A CT\ /\ A/DD 

lvll_nrc I oroixvjnLL/Av LLALLVj 1 AvvAcVWrr 

QLQEQAPMAGALNR 

189 

28:163 

0.95 

0.685 

MRIIQI Tl KAl 1 PI PnCPnDDHDCVI LVMII UDI V 

ivimioL I LiviLLcLrUobUKKUKbYLKlNlLnKLY 
GRMLGLRPYIHK 

190 

239:104 

0.954 

0.645 

yPAVT<^QnriQQTr'APi/ADi i i \/~ri aik^otii a 
ivirAT 1 booboo 1 UAOrvAKLC/LLV 1 LAIIvlo 1 ILA 

PLNCRDRALGRF 

191 

77:212 

0.977 

0.902 

MP! A\/^fMTQFI KA\ Ql 1 1 PA/1 AUDI DnDDDCDIC 

iviklmvoVw'I 1 orLMLoLLLrMAnKLKUKKKbKlb 
SLIGANLHHFNL 

192 

8:143 

0.958 

0.838 

iviiwlvvao vlvj 1 r l^oL»LlAI\Vb(jAVb 1 bPbPb 
LCQQTEWKSGQRWE 

193 

11:146 

0.973 

0.925 

MVPPQnVPQI A FfiFI \/TZ(^ \CKA\ A/IT\ OCI//^ 1 11 

ivivrUoUVr olArbrLVrulrMVVI 1 VCjorKCjIlL 
WYIFYFLLiPW 

194 

134:269 

0.965 

0.692 

MKAFFPIFYl ppcpi Mc\A/i cri pcudi/dcpi a a 

AKKLCENLLQTVS 

195 

129:264 

0.914 

0.74 

ivi v riLLLvr vvgornlNLbnr Wr^vlrvLn AlOLINvJo 

G Yl 1 AFFVLYTN RM 

196 

122:257 

0.97 

0.898 

ypcpcri WRII 1 1 \/l fSQ\/i qpcada a adi d\/i 
RQTALCCATEALV 

197 

110:245 

0.978 

0.797 

\A\ \ Al PI AAP^rPMI PTrvQCDDTy/crnAMMP 

VILLAALrLr\r\rOL»rivlLb l LfloOrr 1 V ObUAN IN I 

SSVPLSLPPSTQR 

198 

93:228 

0.985 

0.928 

V/tAFI YHA/AWI V/PMI rsi PPNPPPVCPi i ccc\/ 
y/*r\i i— T nvni vLV^iviLoLrwntrrTorLLrboV 

YRHQTLLNDIPCV 

199 

173:38 

0.984 

0.859 

i^wnvi YFP^tkyl\A/r*Fi nti rrn r^/Diccprm 
VLGLVLGVLSWGLL 

200 

130:265 

0.942 

0.817, 

VIHXISALCLHPSLYISSCLFLYlTFSiLSLLDLLS 
SYYSLSLLLL 

201 

52:187 

0.94 

0.686 1 

vIRKAGCLLELCCPLYTVLITSQGCFLPHLRPL 
^PHHFHIHPSYLP 

202 

302:167 

0.89 

I 

0.589 ( 

vlAWKPLGRQAVLRETPLATLCIDRRQVSSSLV 
3EGFHSKSCHCLG 

2oi: 

271:136 

0.923 

I 

o.8i 1: 

V1XXXXXXXFLSTTGFLPAAAAFSSDLACSAAX 
KTXPLYCRSLRFL 

204 { 

93:228 

0.995 

t 

0.941 > 

vIVPFSYFWFYVAFLLISPWAASVITRKMLSPS 
fGFQASTLLLTGV 

205 < 

*:139 

0.978 

0.74 r 

vILYFQLVIMAGTVLLAYYFECTDTFQVHIQGFF 
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CQDGDLMKPYPG 

206 

122:257 

0.919 

0.798 

VJII WV/FI \/l I nPPI WMAPCPCFCCFTVI Al VMf 
VluVVtZL V LLwCr L.V V IvIrtrOrgCoOC 1 V l_r\I_ V INO 

ISPLKYFSDFRPY 

207 

315:180 

0.924 

0.674 

MFRTTARI ATPMI 1 A\A/n^rR9RTHnfMI ^1 ^ 
VlC. r\ 1 I nr\Ln 1 OIVlLLrtVvUoV/rvOrx 1 nUUivlLOLO 

MLHGSDIFSIVFYE 

208 

395:260 

0.924 

0.831 

PTSRYRGYHLLAQD 

209 

19:154 

0.958 

0.789 

V1L V Li vn 1 VVOULOLUr\l\r\rl\roOVVIN 1 OOV«*r\ 

YPGLACPLGRPPGQW 

210 

27:162 

0.968 

0.919 

iuk'XAA/TFICI 1 Fl FCQAYCRf^PK'AFFAFVCKI V 
TDLTKVHTECCHG 

211 

84:219 

0.96 

0.876 

MIDDDAI U'WXAA/AI T\A/T A C F QT(^ PWP C P F P 
IV1KKKALM1 VV VMLOL 1 vv 1 MoCo 1 orVVrOrtr 

SVRAKEADPSGRRSL 

212 

677:542 

0.978 

0.886 

KA A ACDADD A\/l Al TCI Al 1 1 1 1 r\A/^P^(^l^nM 

KLKLMLQKREAPV 

213 

426:561 

0.968 

0.935 

K/iupci MPXA/AI Ql 1 VCA/AI AAA^PHPKI OHEG 

CISCAILNRLGNYL 

214 

101:236 

0.921 

0.771 

K>iii/^ir*xiQDi PPif^i r\/r\/PPP\/i\A/i PX/PPVI FT 
KILKVVFVIV 

215 

39:174 

0.903 

0.61 

TiATt rCAAHl Al IfWQ QQfiTl VWQOPK'HI WHI O 
M 1 LUoAAULnLlioinoooo 1 Lvnour rxnLtNriLw 

LLPEGSGAQEPQA 

216 

345:210 

0.916 

0.677 

RAWEEEGWEEEG 

217 

336:201 

0.991 

0.872 

*Al C\A/\/CDPQ\/CPI r*l C\AAA/F\A/HOACI HRA^ 

GCAPALRVGLIPGCR 

218 

80:215 

0.985 

0.962 

rv/iL'D\AAA/AT > l 1 \A/MI 1 1 \/DDI A AR WCl QPFF AC 
IVlrxr V VV V A 1 LLVVIVI LLLVrKLoAMr\r\oorcur\o 

FYYGTFPLGGHHSA 

219 

711:576 

0.897 

0.772 

KAn a\/i i \/r\ OTCiAARnni k'nA\/n c ^ni I I ANS. 
LSHFVPLWRNYL 

220 

182:317 

0.934 

0.749 

N/\ytil I Dll \/KA ADl/IC APlMCXA/QARCPCTI PCf^C 

LWRKVRDPRTINM 

221 

257:392 

0.929 

0.775 

KA/^l IA\/T\/IAATAC\/AI DOQinT\/HFVnKWOK 
MoLlMV 1 VIM/A 1 MoVALKUolU 1 Vnr vur\vvur\ 

NSTRMWNSQVLIK 

222 

43:178 

0.996 

0.874 

IUN\A/FI 1 FTI 1 1 FMkVnX/MCFFI FFYVKT^I HP 
MIIVVrLLr 1 LLLrlNrvT UvlvlorrUrc: T v r\ I OLnr 

WDKSQLIIVNYL 

223 

193:58 

0.982 

0.864 

MPirM\AA/CI VD\/I Wl k'FI 1 Wf^HYIFWV^RKMF 
MrlolVVVoL T UVLvVLr\L:LLVVvjn T IrVW orv r\i vi I 

VYGGVGGKTANIC 

224 

191:56 

0.96 

0.749 

IV >ti i isci rcrKyiDci i i qppi nQPRCf^inf^ciMf^N 

M Ll_r\ i LL> Ivl Jr O L.L l_o C r l_LJO r r\0 0 1 UOOINOIN 

SMFNFVKNCHFPT 

225 

223:358 

0.988 

0.904 

MrbLLrKoUorVVoVVIVIrMLrLiM'irUro 

TQYLGVACRSRQGV 

226 

334:199 

0.997 

0.949 

MFLLLFCLMFDFTKVFFILLLHIFCLSTCLFLGL 
HICASFHARAL 

227 

103:238 

0.939 

0.728 

MHMMKLSIKVLLQSALSLGRSLDADHAPLQQF 
FWMEHCLKHGLK 

228 

48:183 

0.978 

0.793 

MLWPWGPRLLVVGAPSALLQLRQVEELSGVG 
RGLGGIRMGWGGLP 

22S 

143:278 

0.992 

0.925 

MTIAMIFYLLEIFFCAWCTAFKFVPGGVYARER 
SDVLLGTMMLII 

23C 

1196:331 

0.996 

0.971 

MCISKCVCWGLCVFLCWCLH"n/FRCGLvWSI 
VQVKSASYCRGCWA 

231 

86:221 

0.97S 

0.947 

MACYLLVANILLVNLLIAVFNNTFFEVKSISNQV 
WKFQRYQLIMT 


ioHo 


232 

361:496 

0.995 

0.973 

MDILLLLLFFMIIFAILGFYLFSPNPSDPYFSTLE 
NSIVSLFVLL 

233 

150:285 

0.918 

0.863 

MSPQHLLLTLPLPLRSPILFSHTAQLLVLTRIAF 
RACELFFFVMV 

234 

207:342 

0.914 

0.656 

MPVFHLTPCLLPPTAVPIFFPTEKREINKNRGQ 
RKFRREMGVTRP 

235 

338:473 

0.971 

0.86 

MESWWGLPCLAFLCFLMHARGQRDFDLADA 
LDDPEPTKKPNSDIY 

236 

234:99 

0.977 

0.93 

MWIVLLGGFVGPLYLTPAPSPCTHTLGVRAVP 
LVTGLTSQLWLNA 

237 

125:260 

0.968 

0.864 

MVTGVLWTMEVLLWIPLPSQIIYQAVTVLSFHR 
GGRKAVLEVTEV 

238 

53:188 

0.988 

0.939 

MGVMAPRTLLLLLLGALALTETWAGECGVGR 
ERASAGRSEWPARP 

239 

457:592 

0.962 

0.871 

MPVPLAYFQSSIVLFPLIFSLVTCVSLDGEPKS 
WGVISISAYYR 

240 

382:247 

0.951 

0.655 

MLAEQVELWWSQQPRRSALCFWAVGLVAG 
CG AG G VALLSTTSS R 

241 

358:223 

0.95 

0.705 

MVFGWVSGGLQAGWMVLDWVSGALQAGW 
MVFGWVSGGGARLGSFW 

242 

1814:1679 

0.994 

0.957 

MPALRPALLWALLALWLCCATPAHALQCRDG 
YEPCVNEGMCVTYH 

243 

157:292 

0.947 

0.754 

MLLGLFTLQFPHLLKGDGILVARPPAWCGFPL 
LSFFTEIEELDSS 

244 

223:358 

0.915 

0.795 

MAGSPLMAICWMFRLGMAGDGLDSIPEQAGP 
ASQPAGQSGTHRSF 

245 

208:343 

0.979 

0.863 

MKFALFTSGVALTLSFVFMYAKCENEPFAGVS 
ESYNGTGELGNLI 

246 

566:701 

0.987 

0.792 

MENCVGERNHPLFWYLALQLWLLWGLYLA 
CPGVCGCGPAGSCS 

247 

251:386 

0.936 

0.72 

MILQVLYEWIFLHVAAPSSSHVCGSSPLHCIAD 
VYEGKRKIHPVC 

248 

334:199 

0.957 

0.882 

MLEILLKLVRLLTTQPYLTLFQAVRNLALNLSTS 
SGSLGPAPGEP 

249 

88:223 

0.958 

0.882 

MMHFLTGGWKVLFACVPPTEYCHGWACFGV 
SILVIGLLTALIGDL 

250 

291:156 

0.99 

0.952 

MSFTTFVALLLSLCLWVTSTNYPTRSLRISSFN 
KEESDVASYSPQ 

251 

35:170 

0.987 

0.761 

MNIIFCWXLTLKLSLIRLPGFPLLSYLSASLPLF 
SLPTSSDSWIS ! 
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u.yo4 

MPWVLGCTPFIALAYFFLWFLPPFTSLRGLWY 
1 1 r YULrUALA 1 V 

253 

341:206 

0.973 

0.907 

MRSSCWLTRVTVFFATIWTASAPGPVTLLDV 
WRLTGVFCTSVLST 

254 

298:163 

0.924 

0.567 

MGSSVLSIWILSPSIYPILSPLAMPCLSRTDLIR 
VRRIQGAWPSE 

255 

496:631 

0.992 

0.949 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLG 
LVHLGQLLIFH 

256 

9:144 

I 

0.894 

0.666 

MTTTFPPRKMVAQFLLVAGNVANITTVSLWEE 
FSSSDLADLRFLD 

257 

203:338 

0.989 

0.82 

MENCVGERTHPLFWYLALQLWLLWGLYLA 
WSGLRFFQPWGLWL 

258 

435:300 

0.989 

0.968 

MPIWKPFMAWMAAWALAVLSKLTKPIHLLWM 
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VARSINTLEEMILP 

259 

55:190 

0.914 

0.729 

MYGCLGAVAWCHVTTVTRLTFSSAYQGNSLM 
YHDSPCSNGYVYIP 

260 

247:382 

0.966 

0.687 

MINLGFYCLGLiPMTFLWRIKPGMAFNSAGIIN 
KTSDHSEILDN 

261 

29:164 

0.989 

0.884 

MWLLGPLCLLLSSAAESQLLPGNNFTNECNIP 
GNFVCSNGRCIPG 

262 

204:339 

0.991 

0.578 

MTGSRCEEHVFSQQQPGHIASILIPLLLLLLLVL 
AAGVVFWYKRR 

263 

247:382 

0.979 

0.849 

MVGFGANRRAGRLPSLVLGVLLVVIVVLAFNY 
WSISSRHVLLQEE 

264 

184:319 

0.92 

0.844 

1 1\ / A f — 1 \ All 1 Ik A n 1 — I t| S s-\ f\\ 1 A *"N y*N At A/I \ r\ , , — \ 1—1 1 

M VAR VWS LM RF LI KG S VAG G AVYLVYDQ ELL 
GPSDKSQAALQKAG 

265 

53:188 

0.962 

0.61 

MRNRRTERTCTPPLASPYNLVPHLQNLLAVLL 
MILVLTPMVLNPH 

266 

349:214 

0.971 

0.879 

MVWLPCFAVLKLLFGQSKLGPMQPSQSGLD 
PVGAGMSASIADGS 

267 

193:58 

0.904 

0.774 

MTMLPCDIFFPAVFTLGLNSVWFWQYNKNSP 
ENLLDESAFTKSPN 

268 

324:189 

0.961 

0.867 

MTPIQSLLYWWTFRIFIHFCYCMECTHVPIRVG 
QGGQFHVIPIFA 

269 

341:476 

0.923 

0.708 

MLPQVFLGFTKVRLLRLRNPWGCVEWTGAW 
SDRWDGSGVGVGLDP 

270 

159:294 

0.888 

0.694 

MTLLADLLLQTKSAMSLSCGCYFSVKPDLSWL 
PSRAVSFTWQPSW 

271 

59:194 

0.887 

0.64 

MDLGRVFITLILNLLRETIFKRDQSPEPKVPEQ 
SVKEDRKLCERP 

272 

6:141 

0.949 

0.835 

MRSLWKANRADLLIWLVTFTATILLNLDLGLED 
AVIFSLLLEEVR 

273 

126:261 

0.989 

0.858 

MLKKIFLIKLSSILSITLKLCLILESMLLSLAGTKL 
PELKKNFKI 

274 

354:219 

0.964 

0.794 

MFIIFLAFIALKRSKSVIGAFLYLASIFLAHGVAA 
HIVFMSAFYQ 

275 

149:284 

0.924 

0.683 

MYFSGPFLIQKMISLKLIQATYFFSFFWCLTFG 
HFSHYYYEREIK 

276 

287:152 

0.975 

0.909 

MLTEALTLLLLHSALEALILSELIHRGLYYFGWV 
FVLHCSVFGQI 

277 

6:141 

0.968 

0.925 

MRNPGFPLFVCTLFSLPLPCLALDFIFSLPFLSI 
SLLFFAFHPNF 

278 

22:157 

0.963 

0.661 

MQRGRNEWIGKIRPWWHRNCLLSLTCLLPLV 
NFGCIFVSKCHIR 

279 

33:168 

0.971 

0.781 

WIOIPI f^llCDVA/MGCDQCCTII \/CCMMMMni CI II 

MUltLUIIrrVVIVIoorbrrlLVbbNNNNULrU 
MVLLLVSLGLT 

280 

148:283 

0.909 

0.564 

MFIINMHILEGCTKFCFDCFFWLVCWGMWSN 
KFGELSLLISHNLA 

281 

79:214 

0.914 

0.597 

MLLLSRVFLSTFQSPHSHLPGLEPLPLGLHNS 
DRVSLTLSYSGIF 

282 

172:37 

0.926 

0.684 

MEVEGGGLVLIGCVTFAVFCIARLEGLVIFHW 
GCCSLFGVFRWE 

283 

360:225 

0.982 

0.822 

MWFAAPWALVRGPGFCLLDCYPRGGSGGGI 
TVLLAFGLLGEDAEG 

284 

21:156 

0.949 

0.774 

MMGVSGCMVLLAPLLARRSQSSLWKQFEKC 
SAGPKLMLSKFLPWG 
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285 

146:281 

0.948 

0.714 

MLFSAVKLYCCQFWHLILNRVPSPSLLYSCGL 
STNVLNTTVCYVR 

- 286 

30:165 

0.91 

0.654 

M L M M DTL W P I L L QTL K V I SQ VG HAG P LAN M I H 
DNPCIIAYRITLR 

287 

72:207 

0.945 

0.801 

MICSTTFLILKCFSVLKEILKPILTNVHCRSGVE 
AQAFFLQQFTS 

288 

190:325 

0.955 

0.831 

MIDIVRFAGLPSLLLHALCLISLTYPSSFRHSSY 
USPCASFWIL 

289 

46:181 

0.908 

0.638 

MPIWACARISSILFARLFADPPNSHLLLPTPNV 
TGCSSVPVYER 

290 

61:196 

0.926 

0.615 

MGATDSLRRCRQGPWRPSPWMFLLGRLFEL 
GSGAWLVEGPWPTGP 

291 

261:126 

0.98 

0.81 

MGWHRLWALLSPFACFRICGSFESQNNGSAS 
LPSTPPVGGWPRGC 

292 

615:750 

0.972 

0.926 

MIALTQLLTFILSCNSSLLHIFPFCEQVLVENGT 
KAGHSLLMDAR 

293 

175:310 

0.948 

0.782 

MIFSPSPRLCLFLVPCAHSCSLQSQPGLGIVN 
SFSFFSYCFLYSP 

294 

231:96 

0.97 

0.853 

MTHTNISLLLFMFAYFDCTCTLDLVGSLAKQVY 
FLKYTREVGVHS 

295 

49:184 

0.928 

0.636 

MVLSWLTLIEALADVMTTDGNMLQLFCVERTN 
LLVNQIRMTLYAQ 

296 

178:313 

0.96 

0.835 

MLSCFSSCRRLFLALWGASQTAWMDSLVFMP 
SSTLAPANSYLSLG 

297 

274:139 

0.955 

0.65 

MCVWLAVLRGILLKGPQIGGHSSWPLTPGCE 
PSPAVLVARAPGAW 

298 

153:18 

0.939 

0.649 

MVIGSLHTFTLLAASSLVDTPKQIQLLMQNLMN 
DPRKEVKILAIQ 

299 

415:280 

0.969 

0.734 

MWVSLVGKCLCGLFTVQKAAQQCNGLTALDR 
ESEAWVLWALAPP 

300 

441:576 

0.937 

0.722 

MSTPHFFCPFLILPCWSDWKWGCSSLDTGEK 
EGPWGCHFLCPFGW 

301 

297:162 

0.986 

0.878 

MKCFHCASALAFFLCLTLFSVSPTCHLSQVAL 
NRNLSFYVCFNSP 

302 

111:246 

0.94 

0.712 
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MIWDVHWSKKAIYELTLALLWPWATVFISEPE 
IYTDCILTEKKT 

303 

207:72 

0.976 

0.959 

MKIALLLLILARCFTMESEVSVQVSEFSNHIGT 
SIFKLLVFWFNT 

304 

50:185 

0.954 

0.602 

MWTVLWHRFSMVLRLPEEASAQEGELSLSSP 
PSPEPDWTLISPQG 

305 

226:361 

0.98 

0.947 

MILMLLLLIVDLVQLAGNAVISSGSWDSACTGT 
PSPSTPSTWPGP 

306 

35:170 

0.901 

0.784 

MLLIFTFDPSTSLCVLSFPIINSTGICFSGQELN 
VSPMPSNKVSD 

307 

9:144 

0.891 

0.71 

MDHKSLWAGVEVLLLLQGGSAYKLVCYFTNW 
SQDRQEPGKFTPEN 

308 

116:251 

0.928 

0.697 

MQCAICILLYLLNKKWWRCSRIHHNNTVVLTR 
ESSPFLTTCTLS 

309 

173:308 

0.904 

0.611 

MKRAQGIFLEPLFLTFLYIMDVHMPATFSHLTE 
LPQYRDGTPASS 

310 

142:277 

0.948 

0.676 

MIFYVILSSPSSRTFFKITLIMSLGLISKLLITSCT 
FDTVTFMML 

311 

293:428 

0.986 

0.881 

MLITELFITWFIYLLFDFKHLLLVDLHMLGLGNF 
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NIQVGIPVAIV 

312 

146:11 

0.942 

! 0.695 

MSPFSKLFFWLYPGSLIPEALRWMLVDTLRD 
5 PYPATSAGDIMGL 

312 

40:175 

0.94£ 

i 0.75C 

MLLCTCHVPNFSLWLIPYTAIEMNFHFFSCGR 
1LSFTIVFMLPKV 

314 

100:235 

0.977 

0.867 

MPAKAWPIWlLKAASFLIKLYlFARFSTIPSFLKK 
' NLYASFTPSL 

315 

297:162 

0.891 

0.702 

MIELAWKFIMHINALLSFGTTSLKIKFAHVFSFL 
QLCIMNKITLI 

316 

317:182 

0.995 

0.938 

MYALLGVFCLAILVFLINCATFALKYRHKQVPL 
EGQASMTHSHDW 

317 

97:232 

0.943 

0.839 

MGMGTLIIMNVWVLFIPTRLRIDQQPVHIKPSM 
RVLDKWVSAFVH 

318 

70:205 

0.946 

0.733 

Mil F YG FG VC VFM RVQP AL Y AL WN 1 R ITI F 1 VD F 1 
LFSHLCTYYW- ! 

319 

175:40 

0.931 

0.551 

MELETFVRFLWLLLKPKNKGKWQHIEMKKPQ 
VCNTLKG RG PKFV 

320 

187:322 

0.981 

0.933 

MWHYAFSLAFYFFGACLAFSHSGIIKDLIAHLH 
PFVSLSQISSHD 

321 

55:190 

0.974 

0.879 

MNILIWGRLIPWFSVPAWLSVIPYDVLVSPRVH 
SPRWLKLGTNP 

322 

401:266 

0.909 

0.59 

MSNFFLQRAILWSILPDTQCIPRFFHSDLWSH 
VPFLALGDLQELF 

323 

65:200 

0.959 

0.738 

MPHIQTLLRTLFASHLLVSLWQSEPMAKPRMR 
KYNTSSEYLSELD 

324 

387:252 

0.885 

0.73 

MGRNPVFTLLFMISQGFLGYNYIQLQVMVESTI 
REYFYKEKMAKL 

325 

414:279 

0.974 

0.944 

MVCTLPVLLLKVWCLNKTPTTLDMPRAGITQY 
TWLRPDENLPRSG 

326 

58:193 

0.99 

0.849 

MKWKTGVAIFVWWYLVTGGLVFRALEQPFE 
SSQKNTIALEKAE 

327 

66:201 

0.933 

0.811 

MSYWAGFHFFLTTAFCYYRYSTGIFYLVLINIT 
NQISIIWKILL 

328 

266:401 

0.991 

0.9 

MEFSYFCLIFFQALCCIRGFFAFCINSVCSHGD 
LTFLNPPLQFLW 

329 

35:170 

0.969 

0.724 

M ALALAAY VCG WW D R ETW P VP M PC N KG G R 
ACNLEMGMEWLNLHC 1 

330 

31:166 

0.986 

0.595 

MRILSLSSFTDYNWKNRLHVCILTSFCCGLLA 
LFLVLAVQGYLA 

331 

424:289 

0.932 

0.581 

MVFLLATSGFCASGCRYLSDAVDTEAVATSD 
REGCFPTQSQTCQW 

332 

472:337 

0.982 

0.809 

MIIAIGGVIVASfil VFlV/l l MiRVk'x/vr^nrinQOP 

viii/*ai\jv? v i vnovjLV nv i_l_IVIlr\ Y r\ V Y bUoUoKK 

VKGSRALPRVR 

333 

16:151 

0.955 

0.704 

MILCPPCSFKQFIVFIICLILERIHIYYRKLLNWK 
KKKKPAEIP 

334 

182:317 

0.986 

0.889 

MCWIAVLTFAISASTVFIALKVCVAWAAAASR 
3LGGSLLNLSAV 

335; 

33:168 j 

0.942 

0.568 

VISLPPSLACVGRACCLLPLGRQRPGGCGALA 
RVTRSGCGRQGGCD 

336; 

280:415 

0.959 

0.742 1 

VIHLIGSATLARSTINLPSTLIIVTMLILLTILEIAG 
DLIQAYVFT 

337; 

275:140 

0.944 

I 

0.709 1 

MLSSVFLDLPVLLLGGFQDVITVAPLAGLVPGP 
HQSTLCCDPTSG 
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. 338 

321:186 

0.957 

0.807 

MSSTAAFYLLSTLGGYLGTSFLLLKYPTLLHQR 
KKQRFLSKHISH 

339 

477:612 

0.984 

0.921 

MFLHSVNLWNLAFYVFMVFLATLGLWDVFFG 
FEENKCSMSYMFEY 

340 

316:181 

0.937 

0.757 

MWRRFLSFFQCLVKNYNVCKCSVMLRFSWL 
DSTCQTKLPLGCNEF 

341 

125:260 

0.904 

0.724 

MRQWGQHWWVTFGWREFGALWRTQQGW 
CLGHRAGGEGQTHHPTL 

342 

30:165 

0.965 

0.775 

M N KP F I N S AS I YC L Y I LS A I TL E L W AL VL VSS W 
ASLGPFYGFSW 

343 

313:448 

0.992 

0.961 

MKLYKWSFSLLILLSLLLINFSYLDLLIIQHFQSD 
SMGLFGLPQG 

344 

34:169 

0.975 

0.903 

MYFFLFFLFAEMGLYCNHPSFFYFLHTIPSYGY 
TVIYLVSSLCFR 

345 

269:134 

0.978 

0.896 

MFSFSMFMMLFFKTFNSLVQVEFILIWGIEPF 
PQiVKQSQSLIL 

346 

42:177 

0.972 

0.646 

MGCDGRVSGLLRRNLQPTLTYWSVFFSFGLC 
IAFLGPTLLDLRCQ 

347 

384:519 

0.996 

0.96 

MVSKFLLSHLVLAVPLRVLLVLWALCVGLSRV 
MIGRHHVTDVLSG 

348 

37:172 

0.969 

0.681 

MMALFTGKLLQWSKVLWLYQTN FSLHTHYSF 
NRGQIFKRKTVQN 

349 

274:409 

0.958 

0.867 

M M LVLP RLP EWWL VS VACM RTGTVM I PGVTQ 
LTEKDLKYRLQASR 

350 

87:222 

0.964 

0.752 

MLTFKSDNGKTXXXILLCLLLLLFISFLDDPRAS 
IFSSPIPNCVS 

351 

295:160 

0.962 

0.603 

MLGFRKPCQCLRLRWVCLHLLRRDFSILGLSF 
PICETKGPEERKL 

352 

209:344 

0.951 

0.857 

MNHCLSFHLLFYGWAYTLQASLPSLLTIHNISK 
TQFFLFPSECHI 

353 

262:397 

0.973 

0.905 

MSFVKILIWELFIACFPQGPLVHSGKMLKHGLD 
WHRTLLQKHPCI 

354 

206:71 

0.963 

0.916 

MCTVTAWWPLVTTAGCWPSRPAGMSPGSW 
CPSCGSTASQALCSAL 

355 

368:233 

0.943 

0.775 

MPTFFAWLLPAGPGATTACSLHSQAAQRRRA 
PLSCAAEPVAAGSG 

356 

370:235 

0.976 

0.927 

MLDLIFNFWLLLPPAVLIPKPWLLLRPAVLIPKP 
WLLLRPAVLIP 

357 

119:254 

0.968 

0.784 

MDLPFFHLSLSTTALGFSWAVCLFKIFLSALNP 
ARNLGHVFLEST 

358 

581:716 

0.971 

0.875 

MPVQAGSGTPLLLFSISLISALNIISVLLFSVLFC 
CPFSNFLKRD 

359 

86:221 

0.954 

0.782 

MSVRFIHVAVRSIVCACSLLKTSIVWLYRSSQI 
HFTVSQACPMHA 

360 

128:263 

0.973 

0.806 

MNQKNPEKLFPAGLFLLLMPFLSLCVPPHLCH 
QFKMTLKVLLHPS 

361 

620:485 

0.985 

0.881 

MCCWIQFASILLRICTSMFIKDTGLKFSFSWSL 
PGFGIKMMLAS 

362 

319:184 

0.992 

0.902 

MFLLEPCLLVTCLCTGXAALILCQHLRLEHKCV 
AVPSGIQQEREL 

363 

255:120 

0.899 

0.62 

MIKSLYLFFDIIFISMTTLPSSKHRNILLYKWKR 
HRAYRELHNNV 

364 

174:309 

0.979 0.778 

MIVCLLFMMILLAKEVQLVDQTDSPLLSLLGQT 
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SSLSWHLVDIVS 

365 

14:149 

0.923 

0.717 

MKMLPSCKAALSLLHIWLQSGGHHRPVPCLA 
PAQTVKCAWCMPMV 

366 

402:537 

0.984 

0.872 

MPTYWLANLRPGLQPFLLHFLLEWLAVFCCKI 
MVLAAAGLLPTLH 

367 

141:276 

0.962 

0.646 

MSLLSDFFILAIVFSTSRISIQFFFIIAIPTDTLCL 
GKYISYTFS 

368 

423:558 

0.96 

0.791 

MLIVSHLHYFLFIPFAGFGGSRKLIKFLSKMEK 
SLFAYFLLTSLV 

369 

186:51 

0.901 

0.76 

MAIWLSYPSRWVALIYAEVNLANLICLIAPKAF 
RGRPKKILIYNP 

370 

59:194 

0.956 

0.624 

MSWLANGVCLYEYLFFRCGFLILQPCSFDASL 
TDEESRKNWEEFG 

371 

318:453 

0.939 

0.764 

MCVALLVLGLTVQSDRCFPELPGGRSLLFCAL 
RRSERPLPQASRG 

372 

222:357 

0.903 

0.669 

MQVLHHCWGALKAQWRLVGFCERKLHLCHH 
VRKKDLLLARFLLTT 

373 

110:245 

0.915 

0.72 

MRSLVWVLIQQLTPLYKGETWTQTCTEDHVT 
MKAEIRVMLLEARE 

374 

167:302 

0.9 

0.696 

MNSNPLLVIKLSSLFLCHSSQNLFSVLMHKNV 
FKIVCLFSLKKHA 

375 

18:153 

0.931 

0.657 

MYVPGRILKCKMTLLFSPWGLTLDKLKNKSILY 
SVFSIYCDIFDQ 

376 

162:297 

0.967 

0.719 

MLLFLYTLXWFFYKXEYFHAQINFFHMLQKV 
QEFILVRKTRESN 

377 

96:231 

0.956 

0.802 

MADSVLTLVFTSCLLSELSLVCSDFRPTPISYQ 
SRYGSGDGWIRC 

378 

165:30 

0.888 

0.572 

MLSSQRLGHLPRLPSWGLLSWTPRVSAVIFID 
KKESCFLDKNLHS 

379 

248:383 

0.994 

0.871 

MLLFGLCWGPYVATLLLSVLAYEQRPPLGPGT 
LLSLLSLGSAKAA 

380 

177:312 

0.911 

0.74 

MANYSLCQCIMVISFLCHGTFISYFLKVIIVSFRI 
EKPKLYYLVT 

381 

204:69 

0.969 

0.786 

MVRASHRSLLAFFFACWTSLIRLIRTSTRGSRL 
PTFCSFCTISIK 

382 

24:159 

0.989 

0.923 

MKVLLAVALIARTVFFLLLAGPSAADDKKKGPK 
VTVKVYFDLRIG 

383 

167:32 

0.921 

0.562 

MLPYYATSLFFFVIGKSEIRFSNAYSVMPSAYA 
NLVKSKNLSEGA 

384 

234:369 

0.96 

0.911 

MLLSFLFPQTMALPSRMLVCGTDPFIHSFVHS 
FICKHSSNSHHMP 

385 

144:9 

0.974 

0.774 

MAAADDTILGFRAALLILVAAAAALSPKVACRV 
GTVRRRETPQPS 

386 

158:293 

0.938 

0.73 

MKFFVLSAILCIKLRHFTTGKKIWPQELFTRITT 
QLRVLNIHFQK 

387 

245:380 

0.897 

0.56 

MKMINTMGLFLLNLEMRIGTRIHFIALQLGREK 
CYWGKNKTDTLC 

388 

271:136 

0.931 

1 0.743 

MIIYLTFPVAMFWVSNQAEWFEDDVIQRKREL 
WPPEKLQEIEEFK 

389 

72:207 

0.94 

0.752 

MEIPVFSHICFLFSSLLCVRNPLSTMLHFPAWS 
LPDTSPLPHHTQ 

390 

|1 53:288 

0.931 

0.614 

MVHRALHLMAPHLTKLTFFTCSCSRALRSECI 
CSCHLEYSPQTSP 
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391 

24:159 

0.949 

0.761 

MFCFLVSVLYSKAKLASACGGIIYFLSYVPYMY 
VAIREEVAHDKI 

392 

702:837 

0.966 

0.735 

MLSQWAMLSLTSAFTYTVLYPKCFSFPLSVQQ 
TCLLSTVMXPFQA 

393 

40:175 

0.918 

0.614 

MGRWLRELWNLSCFHVHCVLELDHHEETSKI 
SSFIVIGRRQGGP 

394 

159:294 

0.907 

0.685 

MSLLKSYCKLRANCNLLFLFVARGLHRSLNICF 
PCHPAFRGWPAI 

395 

51:186 

0.959 

0.814 

MVLGLLVQIWALQEASSLSVQQGPNLLQVRQ 
GSQATLVCQVDQAT 

396 

176:41 

0.972 

0.935 

MALRHLALLAGLLVGVASKSMENTDTDVPAPE 
VLTRSTAGVRGAC 

397 

422:557 

o 0.907 

0.703 

MQADRGGVLFLVALPGLWETVLRHPGASPEP 
VSLHTGLAAEPLLG 

398 

325:190 

0.95 

0.791 

MWVFTLTLAASWGMGNGYRKGWRQHGWIG 
CIRWLEMQGFSSLAH 

399 

102:237 

0.92 

0.576 

MPWLLSAPKLVPAVANVRGLSGCMLCSQRRY 
SLQPVPERRIPNRY 

400 

149:284 

0.967 

0.909 

MVDLVLRLSFSALAAPLFGSSLFTWSSQMAQI 
RSLRANPCQVGHI 

401 

184:319 

0.989 

0.705 

MTPSFVSWLGVGKSQTWGPHSFLFFLPTLFL 
CMLLFTQYLPQKSK 

402 

43:178 

0.937 

0.61 

MSWFRYTFSPMPTYLLHLLQQCCSLRNEVHN 
ENSGTPQYHKLLRK 

403 

387:252 

0.955 

0.876 

MMYLGQDLFVLMLFRVCWDSWMSFSSSMFL 
PIISSNSFSILPQPW 

404 

169:304 

0.904 

0.659 

MRKLPLRGSWSCSCHYVRAELVLEWLLVQCF 
FSLCSSSYVSPRRR 

405 

255:120 

0.971 

0.803 

MGSTWGSPGWVRLALCLTGLMLSLYTLHVKA 
ARA RN RD YRALC D V 

406 

342:477 

0.942 

0.626 

MALDFVNVLLCQLAEVTLGVLREEGASLLVAL 
GSALFPSAAAVGK 

407 

118:253 

0.947 

0.796 

MPLALSLPAKILFCALGYNHLGLVDEFGRIFMQ 
GNNRYGQLGTGD 

408 

298:163 

0.964 

0.869 

MPRVALVCLPTSWVLSLAHAIPNGCEGVWLF 
CPCPHLATLLVLPQ 

409 

305:170 

0.96 

0.797 

MCISVPLTPLLLFLNAGTMVSKDTGKCILTTSE 
SEVEPAACLALE 

410 

85:220 

0.954 

0.724 

MILINFREICLKVLHTPLCVSGGCVLLYILALTC 
CYTNSLLISHL 

411 

321:456 

0.957 

0.649 

MGWLYVMLCASLPFDDTDIPKMLWQQQKGV 
SFPTHLSISADCQD 

412 

59:194 

0.932 

0.558 

MHCITGWPNVWLMAGQFLCNTPSCHRWATT 
QQEYWAGICPKPRLH 

413 

172:37 

0.933 

0.742 

MASPPPKVCLLAALWGQIFGLSLPCGLSEDHL 
QLTRNSHCKIWET 

414 

231:366 

0.965 

0.788 

MSCIIPPEANAA/VGALALLFCALRQRTTSLPLGN 
EWKPNFFPLPLKV 

415 

45:180 

0.914 

0.78 

MGERAPCFLIPLLHFTITYALEINYVCCVCMVE 
GTCLRNSTTEIS 

416 

736:871 

0.947 

0.686 

MSLSCTGFALEKRCAGWVWWLTPVIPALLGG 
QGRQIMIMVRSLRP 

417 

265:130 

0.958 

0.832 

MGVIHLKFHCRVLVWNLLTTCQVFNKVTLPVS 


.1. 0 O S Nh& 4» 3 ,,, O 2 2 6 O 2 






YLEEKGLRTSILG 

418 

101:236 

0.977 

0.823 

MINTLWSFLLLKIMLQPTAFYVCAGIPIRLIGYC 
WYKDKLFICT 

419 

78:213 

0.937 

0.617 

MLGTKPLEIILFFLCYLTEETHGRLHVLACPVG 
QGSLEAKPLVRS 

420 

262:127 

0.922 

0.588 

MFCALSLSGHALLNSTLLSTHLGGIGTAQKPK 
LYLAAHQVFIGPR 

421 

183:48 

0.965 

0.812 

MRRRWWGLWASVGILLPAELDSRLWREGTTL 
GACESGGSPARSWA 

422 

1542:1407 

0.982 

0.832 

MACKGLLQQVQGPRLPNATTRLLLLLLVFAVGFL 
CHDLRSHSSFQAS 

423 

82:217 

0.976 

0.87 

MGAFMWAAIAMGVLFLGGVNGKLFGGLVAT 
AVGTFTMLVWLSPW 

424 

165:30 

0.964 

0.617 

MKFWLPGGRGCVLLISVPIVPTPVSHSMYSTN 
ADECRGARGWHAL 

425 

132:267 

0.958 

0.864 

MCVHICLILVKWIFPITYNGYLTIFLCKLFMFSA 
HFSIDVSVFHI 

426 

195:330 

0.99 

0.965 

MLLLFSLCWACLCNVIIYCDVSRLLSGIGHHM 
HVENLFQNKIYW 

427 

304:169 

0.973 

0.845 

MRPLFSWSCFCSCGILLLSLKYQALTRVKCWV 
NIYEVFHLFELKD 

428 

98:233 

0.961 

0.799 

MSTGRFFLVYTFSVLNLVPLACRRHPHDGSQ 
RKFLLNGRWNFLTY 

429 

221:356 

0.967 

0.792 

MTIIILLLAKKALCPLAEVQTLLLPARLGNXKEL 
QKEGQQSISFC 

430 

166:301 

0.924 

0.595 

MWSENTVNLWRNTATSLLQVLFFQSCFVPFY 
LVHFMPKPNRFYLK 

431 

349:214 

0.974 

0.892 

MSTCLPPPLLYHLWSLFWLPLLRGVQAGTRR 
PSGGCCTKEAQSPN 

432 

240:375 

0.99 

0.77 

MMVMTLKCLFPVLNSMSMFLAGFLTFSPGSP 
ENTSSSACSNSDGL 

433 

242:107 

0.989 

0.791 

MAGLWFWAACAEKCLSLPWSLRQLEVAPRS 
RQGRG RAGAASFLL 

434 

208:73 

0.936 

0.785 

MKLMRPLMLLYISQLYMLMKRNSPHIGDCLSL 
LFLQEKKQKEVYT 

435 

140:275 

0.985 

0.814 

MQPSGLEGPGTFGRWPLLSLLLLLLLLQPVTC 
AYTTPG PP RALTT 

436 

402:267 

0.885 

0.73 

MGRNPVFTLLFMISQGFLGYNYIQLQVMVESTl 
REYFYKEKMAKL 

437 

160:295 

0.942 

0.725 

MRKIYTTVLFANIYLAPLSLIVIMYGRIGISLFRA 
AVPHTGRKNQ 

438 

388:253 

0.891 

0.726 

MSRRWGACLAWPCPACMGAHPAPPSRKLRG 
ASLLPREDPPWWVEV 

439 

347:482 

0.933 

0.731 

MKWRRKSAYWKALKVFKLPVEFLLLLTVPWD 
PDKDDQNWKRPLN 

440 

1220:1355 

0.923 

0.565 

MLQAPPLTSLQLSLKYLFAVRWSPVRPLVFAA 
ASGKGDVQLFDLQ 

441 

135:270 

0.907 

0.855 

MFVFMLLVGSGIYAPEAVKVYHYDMETESGQ 
LLLSELKSRPRREP 

442 

91:226 

0.968 

0.582 

MTTRGEDFLYKSSGAIVAAWWVI I IFTWLILL 
KMYNRKMRTR 

443 

32:167 

0.934 

0.678 
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MMVTVIWLSLKLFVCRGENTCMPMLWEGFDV 
YLILGEEEVHTFSV 

1181 

255:390 

0.926 

0.689 

MLWKAH FSLVTAS VWPG SHTWRVSLWG RTG 
HWADGIIYIXKGTRF 

1182 

135:270 

0.954 

0.788 

M ITIAPWLMERWVTVC ITLGALG NLGRSGl F YQ 
LGLRGPPAGPWR 

1182 

431:566 

0.992 

0.96 

MFWTMIILLQVLIPISLYVSIEIVKLGQIYFIQSDV 
DFYNEKMDS 

1184 

\ 129:264 

0.976 

0.832 

MLVCLIVHHSPPYTLFTF1HYFFLLLLGLGTFNC 
PIFKFTCSSAC 

1185 

i 826:691 

0.933 

0.702 

MFLQLLLATLQSGRKKCSCSYCTSNPQGGKY 
AFFSGVRLGLASKM 
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1186 

1197:1062 

0.926 

0.588 

MVLPVAGKAVLFCSVSGEGWMRPLPSSPPRL 
R PALS LG H SAG RG L 

1187 

191:326 

0.997 

0.887 

MTAG VL WG L FG VLG FTG V AL LL YAL FH KISG E 
SSATNEPRGASRP 

1188 

32:167 

0.906 

0.658 

MLGRGYFIHRRELRSILILICLAAGNFIKIGIFSP 
WHLETTILLN 

1189 

1304:1169 

0.919 

0.688 

MTLCCPWATMHPS7VLRMWVSLRSRARRWG 
SVRTGLSWSSSSDSR 

1190 

514:649 

0.931 

0.744 

MSSSPGLLFSSLSHLLLNSSTLALLTHRLSQM 
TCLQSLRLNRNSI 

1191 

33:168 

0.967 

0.668 

MVLSVPVIALGATLGTATSILALCGVTCLCRHM 
HPKKGLLPRDQD 

1192 

1528:1393 

0.991 

0.596 

MSDISPLLYEIWLGDTSAGFFTFCVTVLHVLLL 
LSSVLHFLCPRD 

1193 

289:424 

0.948 

0.76 

MPCTTILRIWTPFPALLFGPHCQFTPVGIFILEG 
GNCSNREARDF 

1194 

250:385 

0.929 

0.676 

MSRGHRSAAGRAAQRLHLLLLLSNVFRREGW 
NLFPSCPGAIPVPS 

1195 

198:333 

0.897 

0.726 

MPLILAFIKLFNKCLECFVSTRGIECYRSFGSR 
HQAAYNQGSKNQ 

1196 

49:184 

0.891 

0.676 

MKFLICLRSHFPFFFNSAIEMCFSKLKIFKIACN 
MNFTSYISTFF 

1197 

93:228 

0.988 

0.954 

MMKSLRVLLVILWLQLSWVWSQQKEVEQNSG 
PLSVPEGAIASLNC 

1198 

95:230 

0.954 

0.637 

MRKRERNAEWRTGFWHSFLSLLVLPViKNPS 
VWTQCPFMAWQESS 

1199 

227:362 

0.972 

0.594 

MFGGSNILFLLPSPGITSINDRTYFLFVMRSNW 
LFLLTCLIAFQK 

1200 

95:230 

0.98 

0.944 

MRDYLVFSLCVLLWFRAVIFGLLKDSISELLSG 
GELCGPCLAHSI 

1201 

49:184 

0.928 

0.837 

MLAPAUNSWLFFIYTLTQSPGSPPDFVTAGLR 
RHIVLSAITSVY 

1202 

117:252 

0.98 

0.944 

MIPAFGIFRLLIIILIIVLDMGFALYRRFFVPEDGS 
PVSFAAHIA 

1203 

191:326 

0.892 

0.723 

MRVWRCRLAHRGGWMVQELWFRDGYMIWR 
LAISYVRIGIQMACRT 

1204 

12:147 

0.946 

0.641 

MLSILLATLTLSLKEKRGERSIHQPEPSEKSVC 
LPVSGADPFRGS 

1205 

49:184 

0.969 

0.888 

MKILVAFLWLTIFGfQSHGYEVFN I ISPSNNGG 
NVQETVTIDNE 

1206 

194:329 

0.959 

0.612 

MCGWCPGFWPLAFTLHP\ATTRYVKPVNEHSW 
VLSIQSNTLNWSLHS 

1207 

68:203 

0.95 

0.872 

MNKIVFCFLLLLGGCREESIVLDGIFKLIRTFFC 
RGRENETKFFK 

1208 

6:141 

0.96 

0.692 

MWWCPXSVFIYFVFSHTHKKKPLPPMHQEAP 
ALPALLTGLTLPQQ 

1209 

18:153 

0.986 

0.769 

MFLYPFHFSSLVITALFFLTYPPHNIIYVRWKIA 
PNFSKPLWGGK 

1210 

4:139 

0.943 

0.803 

MVLFYSILFLKTSYIGGMTILFSGSALYESLTILP 
FVMLELMSTA 

1211 

57:192 

0.979 

0.877 

MHLNAHRALFVLFCLSLFFMWMRIDRHDRLHT 
PCGQEPCSFHFCV 

1212 

125:260 

0.988 

0.862 

MALQWSVLGQASFLLLLGHLIFFYLDVKKIPSL 
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12138 

J5:220 

0.965 

0.856 r 

^WMSVCLSAFICTRILMDYGLLHIVHITRVICM 
n 1 ntKY VorSrlYO 

1214C 

54:199 

0.999 

r 

0.98 1 

^AYFFILLGLFCVALRRIFCFFFLLLCLFLEYSF 

DDI 1 CT^TI 1 Wl 

1215' 

130:265 

0.903 

\ 

0.709 1 

VI AHTSWLI 1 HCCSPRN AF 1 LRAARCLLWFQLTA 

it/OVUoorrAVV 

1216< 

487:622 

0.957 

I 

0.887 ( 

vIVIALIIFLRSPAMAGGLFAIEREFFFELGLYDP 
.jLUIVVoofciNrt 

1217* 

985:850 

0.957 

0.772 

V1FWFLNIFILILSKHSSKSLSLQLPEVLLLFLCQ 

-ULKLnrVKbL 

1218 

31:216 

0.978 

0.897 

MCSILRFIRTCLVILYMFVYSRFPMYFLILLFFHL 

jKLrOrMLMK 

1219 

127:262 

0.98 

0.816 

MTSILSQLPRYQSLFGLLLPGFFFPVFFFPCIS 
VI LAK Y Ivl U o L o L n 

1220 

71:206 

0.956 

0.719 

MGYTLRQKELQRNLKLFL1MWLWYLDWFFE 
TIIYIFYMY 1 Loo 

1221 

35:170 

0.929 

0.808 

MSLFLLPLKRIVFIDHFTCLHEKSPWYKASACC 

. l/>M »\ //~*|/"M \;kIIV 

VGHVGIQLVNIY 

1222 

309:444 

0.921 

0.575 

MCLHFLGSEQHMAAHVRVCMCFHNHILSQLL 
LCLWoNrNoroUU 1 

1223 

239:374 

0.969 

0.855 

MRIHPPLPTPLPWVELLVCRANVLMFLVFLVP 
LTLLGSKbMrWMI 

1224 

229:364 

0.976 

0.822 

MLIYILLLERILVKCVLFRVNAVLIYVNMCNMSF 
hroVK 1 MrLKo 

1225 

70:205 

0.953 

0.865 

MLLNLLIPFVLTVGSGSKDSPWLEQPEVQLLL 

AT\/IM\/I 1 DDDII 

Ql VINVLLPr KM 

1226 

88:223 

0.888 

0.612 

MHFCLVLLITEQSLGKEVHPLRLVSFRFDSKNL 
P Ao M Lo LP o o r r\ 

1227 

69:204 

0.969 

0.885 

MAKWQRAAWPEGMLTLWCYFMATALSATLV 

rjnTi VCI 1 fCMMIDA 
PL) 1 LYrLLotiNlMlr'M 

1228 

99:234 

0.934 

0.588 

MIFGTDCCALSKYMWAFVFFLIKARWREKNPC 
rUUbLKrtU^LLU 

1229 

156:291 

0.965 

0.684 

MFRGLFLLPAPTFDPFYCLLVSQPLNKNNEER 

/~» IT tZ/^\t Tl_J\ A/ A 0 1 Kl^!i 

GttUL 1 nWAoLlMo 

123C 

976:841 

0.996 

0.962 

MQIFLLLYALGRFVLLVTFSPLVLSLSYPVLVSF 

wi dvdc\/i tr\/p 
YLKYroVLrVr 

1231 

30:165 

0.956 

0.614 

MLWIFAYFRIISSMFTETTVLKIVLAIRTGPDRIN 
GfcUK 1 WIInIK 



0.94 

i- 0.792 

MYRSVLFVFPQGFRLPFMLFEASKTFLDYWE 
\ AVRSSKVQNKKHGF 

123C 

$233:368 

0.931 

0.55E 

MCPFMPPPGLLRLFQIVFWVEHPGSVNPFER 
1 STIIGRSAKLKKDL 

123' 

\ 2962:2827 

0.91C 

J 0.73* 

MPYVTEATRVQLVLPLLVAEAAAAPAFLEAFA 
SANVLEPREHALLT 

1231 

5112:247 

0.9< 

\ 0.6* 

MQWKVTLTSRWGLLRHCQVLAGLLHLGNIQF 
) AASEDEAQPCQPMD 

123f 

395:230 

o.< 

3 0.77! 

MDTPSSLGLHLVLFLSVTHLQGLD FAjYNLGEA 
5 LQRAKVL1EGLPI 

123 

7 28:163 

0.97: 

3 0.92I 

MILPLPLFLRCLPQLLHASATFFQALCVWSSLP 
3 APTTPQTVPFQN 

123 

8 295:160 

0.98 

5 0.77 

MALLALPQNA/VCWSHLVLFLQGRTQHPNIDTR 
2 SLQQPASIKDPRVC 
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1239 

3024:2889 

0.968 

0.925 

MAAGVSSVLLLLFTLMESGLKHRVWESWQLF 
TSWLAFCSPSFSW 

1240 

19:154 

0.963 

0.916 

MLSTSVSAILLSfCLRAPCFLCPDLKMLGWQD 
MALGPLRKTG KRT 

1241 

312:447 

0.908 

0.573 

MRRRSCIVSKQLWCPVTVLEGLTSSXNAGGL 
VLRNCMGAFSLLHA 

1242 

1336:1201 

0.988 

0.974 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRT 
ALGDKAYAWDTN 

1243 

74:209 

0.961 

0.721 

MMRSSQIQVLSVFCSAFVACGFVIPSSGNGG 
CSPRHNGPLRKKGD 

1244 

94:229 

0.994 

0.947 

MGLLWLITLLLQKSQFILISLQRTSLATLLKESS 
AFPSSIILVLF 

1245 

251:386 

0.987 

0.902 

MCWLLLWIKEWKIIKTCTVLVLVELRSLQSRGK 
REDTSNQHPLTQ 

1246 

62:197 

0.913 

0.682 

MFFYISIGVLFPFNSLLEQPETFLCHISSAVLCP 
IELAASLVLQL 

1247 

3:138 

0.938 

0.806 

MSSLLCLAGLIGGCCPHFSFHTPVAQPAEALM 
LETLGACREGPGQ 

1248 

29:164 

0.917 

0.833 

MLGGNFLMFLPPLQRLCSNLLSYVIPNDFSVM 
SCFIKASLNYTLL 

1249 

27:162 

0.984 

0.934 

MCVFVCFYLFLRCIGLSRYLSVHSLLCHCCPD 
GCVQVAELNKYEL 

1250 

1381:1516 

0.971 

0.794 

MTVSFCCCWILAVLPSPPLYQDLVGSKLEIQA 
AGDPMPAASRLFH 

1251 

284:419 

0.952 

0.822 

MLKIGMHCIYLFLKSLVNVLMAVRSARKKPNIIT 
WDCVGIWLCG 

1252 

481:346* 

0.998 

0.963 

MHFLATFALFFIFGVFFLFAVLTNLLLAEEVNIR 
GGNFLGSFLVH 

1253 

77:212 

0.942 

0.835 

MSCHCLSPWVPNLCAFHFPPFRVLFWLSLAS 
LSGFIVVLSGEEQG 

1254 

157:292 

0.986 

0.914 

MLKTHLLAPKWGKVLFFLLLLPLHCRAHLPAL 
PRAAAQGWQAGEQ 

1255 

19:154 

0.952 

0.654 

MASHGEEDRHWLRACTWIWALSLTLSVSSSV 
GWRRGGCRWLGRRN 

1256 

191:326 

0.985 

0.937 

MCVCLLYLCIHLLSLSLSHYGWLSSPEPTGRK 
TGRKDQEQAGTPQ 

1257 

403:538 

0.887 

0.553 

MAGLKRRVPLHSLRYFISMVGLFSKPGLLPWY 
ARNPPGWSQLFLG 

1258 

126:261 

0.971 

0.865 

MSLCLAFLLHWGHFRTCPLSHVEMHLYPKRC 
PQRNAESRWSPALV 

1259 

165:300 

0.983 

0.747 

MFVIAFLSPLSLIFLAKFLKKADTRDSRQACLA 
ASLALALNGVFT 

1260 

289:424 

0.949 

0.834 

MMSGWLLRAAICRGLLSSESLTFTSAPHSISIA 
VTCRDGNLQTGY 

1261 

815:680 

0.949 

0.83 

MRIPKVAPPLAVMRCLTSLLVCLGTGSGGTME 
ASIPACLSLQVRL 

1262 

138:273 

0.897 

0.68 

MASEPCWWAGMLPCACAGLRRCSHSRFLQR 
GHGLHSLMGSLPAP! 

1263 

138:273 

0.982 

0.911 

MASSPWGCVCGLLLLLLPLLGTGPALGRGFP 
RPLENSEIPMIPGA 

1264 

1159:1294 

0.992 

0.733 

MSFFLILGVGSCLSYSLVPLIILSFCHFYPESVG 
CPDAPSPRVRG 

1265 

125:260 

0.962 

0.902 

MFKAARKARLRWPFLFWVFSVMECFSLNCDD 
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1266 

13:148 

0.955 

0.786 

MRPLKPGAPLPALFLLALALSPHGAHGRPRG 
F\KCjAKV 1 UlNtrKrL 

1267 

37:172 

0.915 

0.646 

MSFGPYFYVCFIPILLYSLNTKGYGNPYPCFPS 
TPPKr rr vjQPKh 

i 1268 

163:298 

0.974 

0.838 

MLLSLGHLFYCFRSSRFLQCILPCSFPLIPFIILV 
LLYovrAroU 

1269 

296:431 

0.935 

r\ TOT 

0.737 

MQFVLLRTLAYIPTPIYFGAVIDTTCMLWQQEC 
bVUboOWtYlNV 1 

1270 

126:261 

0.962 

0.88 

MKQLSPLPLPWVLCFLWKPSKLSVLSFASPPS 
TKPSQQAGLVGbL 

1271 

523:388 

0.945 

0.704 

MHLFPVLFQPLQNCLPFFIGEPFHSCFLKVFLD 
ITIIVFLGFLYF 

1272 

2025:2160 

0.993 

0.782 

MVFTVTLKLALDTHYWTWINHFVIWGSLLFYV 
VFSLLVVGGVIWF'h 

1273 

1366:1231 

0.979 

0.91 1 

MSFSAYQTAFICLGLLVQQIIFFLGTTALAFLVL 

iin\/l 1 1 r~~y k. 1 1 1 1 

MPVLHGRNLLL 

1274 

1111 :976 

0.998 

0.936 

MMLAFTMWNPWIAMCLLGLSYSLLACALWPM 
VAFVVPbnQLGTAY 

1275 

919:1054 

0.976 

0.757 

MFKNTSGYTERVAVWLGVEIFCLLMMSSVLVP 
LFYFLMLFGNrLQ 

1276 

129:264 

0.888 

0.564 

MTMVRVWCMARVRIQSVTRVRAQCVTRVRT 
QCVTRIKAQCVTRIR 

1277 

172:307 

0.943 

0.798 

MLRHLRLAQCPGPGLLLPALTSLLPLAATQAT 
VEMTvSMAQVGAD 

1278 

12:147 

0.989 

0.586 

MAGSSEWNPDSRPASFAGWRAWLPWMWS 

1 nci 1 1 i_jt*\ /Dpnc d 

1279 

496:631 

0.965 

0.846 

MHWIVLKALVAVQILLSIKEYTLERNHMHVISV 
IKVLVKAQ I oL 

1280 

320:185 

0.95 

0.787 

MHVQWSLTTVLRLPLSFWHKKLHLYLNLEILS 
SENUALNUINNKL j 

1281 

238:373 

0.982 

0.879 

MAMLRVATFLTRLQMIVTSVLLAGSLYLPFGL 
RALKMU 1 AMr VKr 

1282 

225:360 

0.94 

0.625 

MRGFFRSEAQPLCCCWCNLLCCGRVWVSLS 
olhvjiboAOV Aoino 

1283 

265:130 

0.921 

r\ toc 

0.725 

MTCSLLSLLDAVCSSFVQAFCSRDPERWPAIS 

□ uci o/~* a crviri m\ / 
HHoLobArYrLNV 

1284 

187:322 

0.883 

0.568 

MTTTIFTCKMGYWSREFLFFLLPKELANQNRR 

pnriA/OPTC a\/i ic 

SRFWSSTSAVLIF 

1 ZOD 

CQ. A QlA 

oy. iy4 

n ncc 

u.yoo 

U.OOt 

MRGIIHYVLFCDWLCSFSIICLRSTRGMHGAG 
YPFI IfiWFRPAA 

1286 

138:273 

0.967 

0.685 

MNPFFWSHPSLLGSEGRILWLWGLLEIKLFLL 
HMTRLHVLQSLNL 

1287 

107:242 

0.95 

0.698 

MSLRAPSVRIFVYLLFRLHTQRGLLAGRRQW 
GPCPLSFSHFLHLS 

1288 

158:293 

0.97 

0.77 

MSPSLHLCFLPSSFFCKCLFIPVYICKHYFKYW 
GYNDEKNAQSVL 

1289 

88:223 

0.984 

0.874 

MTALILLSSPLAWVSGTLASFLGSPKRVSYPAL 
PSLRIAPVYLSF 

1290 

11:146 

0.995 

0.824 

MKVWLIFPLFLGAGMKLLSLAWGRKNSCVLSL 
RRSTWPGAVAQT1 

1291 

8:143 

0.991 

0.831 

MLWVKIFLLSPVPFLGLLQ1VNKIIVSMKFSLCC 
SFLNDYAPERM 
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1292 

53:188 

0.912 

0.702 

MCSVSAGTLPLLGELLSLTVAKTNSAGCHHC 
QSGEGSLSTHTCIL 

1293 

315:450 

0.939 

0.614 

MHLRPLASLPQCPLYSTLLFLIDSYLPFKNPLR 
WPQSNTFHKASK 

1294 

109:244 

0.954 

0.91 

MLMLILVTGVSSLRNMIMCDYISRAKLKSSHIV 
LSYCTLKQEYDD 

1295 

67:202 

0.912 

0.653 

MEYLLHSLSFHNQLFTYLLPLIHWLLLECTQHR 
ALHLVDPQGMLM 

1296 

179:44 

0.946 

0.867 

MLNTFISLLLWGTARWGDVKKNFLQNKAVAS 
PGKQGPGPALTLMT 

1297 

333:468 

0.963 

0.898 

MMPTNLAHLVFWQALLASGRFSLMEHYPPNV 
QSNRGITHYMLPRG 

1298 

192:327 

0.898 

0.586 

MSVIKLLSERLFLVLEPSGSLVAQSQSPALELP 
MWRWPSSLLTVC 

1299 

70:205 

0.991 

0.963 

MWASLPFILSRLLLLIWCHFGLGQQWWATYL 
MSVPLTSPCLSVY 

1300 

94:229 

0.983 

0.659 

MFFLLCRQAHSLSVLPICLHLPLCPRVPTAPVT 
PDLKPKMVGGAG 

1301 

254:389 

0.953 

0.641 

MQAEHVNSSFVCSPKILIPFSLGSLLALSLLHT 
LAFGPTFNTLYF 

1302 

53:188 

0.98 

0.841 

MHFLACKKAFLCLLVHRCLLFPPAQSSFMWS 
ERIHDSVSSMPLLT 

1303 

72:207 

0.98 

0.907 

MSLCIYTLTCTWLITFPWEMRVLLLYWLMFKF 
VHVCVWRAASQS 

1304 

18:153 

0.963 

0.743 

MLWKRELSSTTWTWNMIHVLLFKAVAPGLPP 
SLAHCCLCLFTIGT 

1305 

73:208 

0.982 

0.88 

MWMIFPFVLMFISCPSISARIRTDEVAlYRLCiF 
TLGNYKPFPGS 

1306 

323:188 

0.933 

0.679 

MNVICLTLCLVSSKCSVGGTASFVLLCFSLPVS 
SRRRAFQESQGW 

1307 

68:203 

0.937 

0.6 

MLKWHCQESICLLSSAMPSSLWASLPGKSRS 
WWQRWLPAAPTIFL 

1308 

457:592 

0.984 

0.901 

MVLLVVGNLVNWSFALFGLIYRPRDFASYMLG 
IFICNLLLYLAFY 

1309 

625:760 

0.89 

0.672 

MDPPCPWLHPAAWPLQTPLALPLLGTGSSPM 
PIFRWRPPVHLLSM 

1310 

125:260 

0.984 

0.618 

MLPPLCWCCVRTMTCCIGTSTGMDGRPPSP 
WRRIPCWTQTCSCRN 

1311 

258:393 

0.912 

0.602 

M G D FAG VD FVFL WC FAQRQG AA E AVG AVL A 
VLLCDTLLGVTRLE 

1312 

74:209 

0.921 

0.71 

MWLIRLGTCTKWRGWCTGIWLHRAGVTKG 
LRIKVAMERARKK 

1313 

143:278 

0.89 

0.713 

MSASVNITITITVLLPFVRSFHKYVQSGCSEPS 
VGSRAAQRRPER 

1314 

52:187 

0.888 

0.593 

MGEVPLLLCTFQNWLFTAHSPVEQNRGPRTP 
TFCHFHITEFAVNL 

1315 

242:377 

0.964 

0.863 

MCLLFLLPRFPVSWRAGVDGAAPSSQDLWRI 
RSPCGDCEGFDVHI 

1316 

49:184 

0.981 

0.945 

MVPGAAGWCCLVLWLPACVAAHGFRIHDYLY 
FQVLSPGDIRYIFT 

1317 

82:217 

0.915 

0.634 

MLVLLPRSKAMPLLSVNVTLAFFPRNKEIVKYL 
LNQGADVTLRAK 

1318 

161:296 

0.987 

0.896 

MALGFGRLLALMVMIWLHTGQTDTHMLQSPA 
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1319' 

781:916 

0.928 

U./ 11 

MLWGSGKCKALTKFKFVFFLRLSRAQGGLFE 
ti rnni I niPHTlR 

1 LwUULLUlr VJ 1 lr\ 

1320 

179:314 

0.957 

n R*1 i 
U.o\ 1 

MEGSRNTPLWVLPIISLWISLTASQAGCVGWL 
<VRLKIELCLARI 

1321 

129:264 

0.952 

u.yo 

METFPtLLLSLGLVLAEASESTMKIIKEEFTDEE 

VI LJ T UIVIMrXOOWC 

1322 

143:278 

0.919 

n 7 

MCDVLKLPSNLLSCFCLPCVFSLHTGMAHGN 

qacmi MPFVMWTHfi 

OMOINLINr i V IVI VV 1 no 

1323 

1402:1537 

0.991 

0.74 

MGLFFFFSGVGSFVGSGLLALVSIKAIGWMSS 

1324 

188:323 

0.979 

O.obi 

MLILISCPCLDFFWALRLIGKIRQTDIKIYQRETV 

1325 

116:251 

0.963 

0.727 

MLFPQLHLFKGPLPCLLRFLTLGLGVPGSSPDI 
K t U o t; r\ IN r u l lj i vi 

1326 

43:178 

0.944 

0./O 

MHWFPPVSPLLCLPSLSLSPAQSVTPSLPLL 

OlA/AX/l \/Ql~»TI IAA/W 
UVVnVLVoU 1 LJWVn 

1327 

137:272 

0.954 

0.86 

MFRFYLWVICHFTSVPSFPHMFLGLISLATLW 

r\CCI UIUIDDUIOME^I 

Ubr LnnPr'riLJIN tL 

1328 

147:282 

0.9 

0.764 

MPVPNPILVLFLIELPTMLMQKWMVKLPLLSR 

DIOTTCDCI 1 Dl D 
Klo 1 1 hroLLKLK 

1329 

279:414 

0.982 

0.963 

MFLLFWFILSEGCPLLEQLNISWCDQVTKDGI 

UALVKbUbbLlVAL _^ 

1330 

182:317 

0.983 

u.y4 

MTWVISSVWLLFHVLXWSKMGQRRNEVKWM 
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NPRNLSVYLLFSA 

2082 

20:155 

0.964 

0.715 

MLSFVPLLSSWLGTWITDRGAAGSCQAEAPR 
LAGETAGQRVWERG 

2083 

171:306 

0.909 

0.691 

MMLCHLELQLPALLMRKMIPTKPGWNSRQY 
HKTPSKMHYIQGNE 

2084 

271:406 

0.945 

0.891 

MGNLGFLILLLEEILALSTMDSVITRLCDLELWA 
IRPSPTFGEGM 

2085 

47:182 

0.944 

0.747 

MKAVWAFILQTGTNSSMLFVLPVQPCWEGLG 
ELPTRRDSCCFGKL 

2086 

241:376 

0.915 

0.616 

MAMSLLRAFSSPCLVPASPRPPHNPLVSGRA 
GNLLHLAAVQPLVS 
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2087: 

30:165 

0.909 

0.657 

MVFLLSVCHFYGMISKDGAPRMRHPPSRRGP 
GCLAVIVRALWFTE 

2088: 

388:253 

0.941 

0.616 

MINTNILRAGAVRVLLWLQLPYPQKDHDHFVRI 
SCCKDQRATFVT 

2089 

95:230 

0.907 

0.67 

MRTPSCLIGFPCPYFASSLISYFLILLIRLPKWP 
QIIFPKPQSAH 

2090 

31:166 

0.888 

0.649 

MIICQLLTGVHVSVSCARPAPGLGAHIRCDYFS 
TASSTQHQPPCQ 

2091 

235:370 

0.896 

0.656 

MLIFICACCFPADSHWINRVPQDKSILEQPFSK 
CSPWTSVGPRDH 

2092 

22:157 

0.939 

0.725 

MYPRVRPRVRPRVRLFILFGLFTRGKFFPVRR 
DKNLKISLNDQVM 

2093 

259:394 

0.904 

0.819 

MLLVCFVFKIVNSTIHRCQECSAKSSYQLCLTV 
KQVTPDSIDKDL 

2094 

61:196 

0.962 

0.66 

MFPLLLFYPEHSPPLSIWCNLLIYFLSLSCLPLQ 
LDFQLYRNRDF 

2095 

69:204 

0.944 

0.664 

MFLCSSTRISVSLHALCYLLMENVPQLPCLIWA 
SVQAQCRCHFTH 

2096 

69:204 

0.927 

0.704 

MTRSKCIGDCKLIITGVWALCMILFSSIYSFKFL 
QCTGIFYSGKI 

2097 

40:175 

0.963 

0.894 

MCYKHLLPICGLSFLFLCGAFQGTKVLHCTVY 
VNFMFSILHHYSF 

2098 

47:182 

0.963 

0.815 

MKMLRLLSLGVFYLCCEIRASVEKIFKYISSLR 
AVRSQGSIEYLK 

2099 

76:211 

0.964 

0.633 

MALFFCKIICTMKKVLLLTKFSISHSTNTRLSFL 
HIRLCCSRCCG 

2100 

204:339 

0.964 

0.772 

MRLHLGFSHFLLWFCDWRLEMTLSIEYLLLGD 
LPECNDGVNLCQD 

2101 

25:160 

0.98 

0.91 

MFVYMYILYVCMSVCMYIYLQFSQTSYHSLEL 
CSHVNHNNSNSTR 

2102 

258:393 

0.952 

0.741 

MPRLFSPLILLHTLSLKSHETFQWSQFLYQNT 
RDACFTWTYIFPR 

2103 

86:221 

0.907 

0.653 

MPVLTLVRPGLVLRSAGRRGAQLTRKLNCDEl 
HTQQAPSQMSWNH 

2104 

1710:1575 

0.946 

0.64 

MGLLNKYASVIIYLYFSLVKSESLFHLMYLPSLF 
IQFFLGIFSLK 

2105 

538:673 

0.9 

0.68 

MPVCVLQLNNCLGAYCVISSFFTNTFLLLNAVI 
NFLYLGYLGFSE 

2106 

206:341 

0.926 

0.721 

MPVPLARICWVSFLHQAPCWELCYYCWLTS 
ALiGAVNSSILQMT 

2107 

305:440 

0.942 

0.799 

MEITKSWFVFLWLIKFLENRDTYVSFFADLKKI 
VREIRCVIIISS 

2108 

187:322 

0.973 

0.942 

MHVLFTLYLVPCCLHFELFIGDLAVKNGTKHSA 
EVLSRVPKHKKA 

2109 

663:528 

0.955 

0.698 

MFRNEFSLTFYCFLWGRLAFWIIDRTQSSSR 
RVSGSQGRRPAQG 

2110 

265:400 

0.964 

0.918 

MFSLVIIRFLFGWISDFFRGKKIGNFNVLNEFT 
FDFTISFHLVP 

2111 

194:329 

0.975 

0.901 

MWLWVLLSHTFVSHLAEGDSATWVAHFTSG 
SQLCGVFPLLACTL 

2112 

! 187:322 

0.992 

0.957 

MLAAFFLLSLKIPFLLKKLPVLMLSHVGGVKRA 
FIVGFTCQPFFL 

2112 

1502:637 

0.887 

0.567|MSLFYPRIPSRSLWCVCPVSLAPPIRIPSRSLW 
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3VCPVSEFHDTD 

2114; 

271:136 

0.883 

0.62 

VITAQHHSIAVLLLNLEVTCECMEYNKVFYSGS 
FASTSFLIGYCSS 

21151 

91:226 

0.983 

0.808 

MVSALLCCAVLLYILYMKWRVHRVGTLKGFSE 
PREAFSKAKARTG 

2116 

147:282 

0.907 

0.74 

MSLLLGRIIWQHIPCTVSGVYCCLTLVTDWEL 
WLAG TAQ H HTK I L 

2117 

241:376 

0.927 

0.663 

MKSLYRRMSLMLLLRRKSSFQTTYWWPFSF 
LYSGCVYVCAHHVH 

2118 

166:301 

0.884 

0.583 

MKCLLCGRYSGSTFYSLSHLILMTALSLSPNP 
PPSHFTAEEVKVQ 

2119 

140:275 

0.884 

0.746 

MLAHLSFERSLILHLIFSGIAVSIKALTKTWMPP 
EMGSSPVYKAF 

2120 

40:175 

0.943 

0.578 

MVMGWHWPQGLGLSLSLCPSDLDGWVSREV 
PLLDRPQALPPCVQ! 

2121 

112:247 

0.985 

0.829 

MFYLFLLYFGSFNCYVFTFTDFIIYIFYSTEQTS 
NKFLISDILIS 

2122 

62:197 

0.934 

0.832 

MTFCMYFTFFSCLNARHIVYPTLVCSQLYVIGP 
LPFTYLLLVSCP 

2123 

217:352 

0.955 

0.786 

MTSRPHFFRYLCSLPPLLFPLLXQSQLLPGSP 
LPIALQSRVGSLL ! 

2124 

23:158 

0.982 

0.897 

MWARLRILLFVPLSSMRSLFFINGCVFTGANT 
HLDFELPSYSLH 

2125 

148:283 

0.974 

0.883 

MLLFLIVITVIYIKHPEFWTACFTCIILFNYQQFIL 
HMNKLSFRK 

2126 

100:235 

0.885 

0.675 

MALLHICVGHPLLSFPKAGDFSFSSQDDPSEL 
TAGAKDKEFSCLL 

2127 

147:282 

0.943 

0.671 

MQVLLLAYKNLIYLFVFRDNTEPLFLYSVFLKF 
KQSVERHFSSSM 

2128 

151:286 

0.897 

0.586 

MIFHFTQNKTKVLTKILLPYTTLLPVNAVTSCLT 
AIHLAYHSCSW 

2129 

323:188 

0.996 

0.952 

MLFFLSLMPFFFLSRSLSLSLSTSLFYSPSSFS 
SSFSSFSFSYPS 

2130 

10:145 

0.926 

0.693 

MVFDERFLFLFTFHFISFYIGCTYTVYMNFAQK 
ILCSCEEKVTLC 

2131 

33:168 

0.985 

0.9 

ML\AA/IWTVAGRSLGFQAVLAHVSGGSVTQV 
GQSPGLHVACVYGY 

2132 

357:492 

0.97 

0.922 

MITSLPAYLFTLPSCLTPLYLLSGFFCLREASLI 
FTLSLRARLSC 

2133 

65:200 

0.948 

0.759 

MHGNEFIKHAFCLFASWFGLVWFCCQTQDPN 
RIRILWGKDEAVEE 

2134 

340:475 

0.991 

0.916 

MWGFSFLLCHWRAAIPLLMRHTGIIHCRSRTE 
HARYLLGSSRPMN 

2135 

131:266 

0.989 

0.913 

MLAWIWWPLCCKMILGTIDQISLSNHYSLPFVT 
CSHCFSCPECLL 

2136 

129:264 

0.916 

0.71S 

MYVLDWVAMCAFPLFQTGAFGSILRAFVYRYI 
SDILEITKLLGHN 

2137 

111:246 

0.93 

0.72S 

MFRSNPGFFFFCCCKSC1LAISLGEIPRNEFTE 
NMSLRESEDLKP 

21 3€ 

212:77 

0.93 

0.702 

MLCRWMISIKCLAVPGMCKHSIPIICLAPWLG 
. DLRNTTKLQSTR 

213£ 

)31:166 

0.951 

0.671 

MLKTKWSIVDFAVWLPPICPIVLLLLHFYEMN 
KYQIECNFFGTK 
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2140: 

£13:348 

0.882 

I 

0.706 1 

V1EAWKALIGLFPLRSSASPFTYHCWEPAQPA 
4QEFHSTIALRGRG 

2141 : 

393:258 

0.983 

I 

0.749^ 

VlFFKTSLCFCLFHCmLECNLTVLTHGLVINS 
✓LPFRDINIHFFL 

2142; 

262:397 

0.966 

0.762 

MNGNRRWQLEAAWGPVRAVQFWCAFFLVLI 
SFCAADRQKAGMWL 

2143 

92:227 

0.937 

0.624 

MCRLYSCARMPLFSTVLFSNVYINDFLLQKPE 
MTTSQPLSNQRW J 

2144 

148:283 

0.973 

0.919 

MWLTKSRILIFVLQSMAICHLVEIYEVYEENLAL 
HRSWGKGRSH 

2145 

143:278 

0.913 

0.569 

MGIQWTCEWPSSLSPGWKFIACLWFSMWGS 
RPPLSQAMSHKQWPM 

2146 

35:170 

0.986 

0.832 

MFAIFSVYFTYASTPFCLLLLLARNA/VKDLEFRA 
LSVLFFSWLESH 

2147 

491:356 

0.97 

0.65 

MYTLFQPYCLSLHPNVSHFFSCLKPLLMLLCH 
LPRCLLLSLHQST 

2148 

396:531 

0.917 

0.782 

MLDYQLLSLMLLPVSVTSNFPSKTPLTYFLKVT 
GNLIGKIQKDFL 

2149 

126:261 " 

0.979 

0.758 

MSLRI RAARN WARDVQKLWTI WLLVLI LI RSA 
VNLLINSRTEDK 

2150 

91:226 

0.987 

0.83 

MLLPLRTWCLLEVLAVLGGPVQAKAIHLLAAL 
QPRPADTMHVLHF 

2151 

269:404 

0.952 

0.668 

MRVLILLCGERMAIIQELLRVSCIATHFSAILSG 
WVIVFLLIVEN 

2152 

229:364 

0.918 

0.82 

MGNTDILLLLSLFCFSYELVAGKTKAQFGVPFA 
EFSVFLILENVT 

2153 

823:958 

0.921 

0.657 

MGPKRGLFFFIFFLDTEPSVLGGGGGGQYGL 
TRTHLWRQGASYLT 

2154 

60:195 

0.911 

0.59 

MHSSCCRHQPRVLCCLYSWLMSLRRQSRRG 
GTGPFPHQPLLQATR 

2155 

1098:963 

0.977 

0.903 

MRWIAFAVMIVLALIRIGHGQGEGHPPLADFS 
GVRNLFGVCVYSF 

2156 

955:1090 

0.892 

0.676 

MLGIIPSLLRRALLQVHGSLCSIGRRPGDTGEA 
RERGQVAPPDSL 

2157 

146:281 

0.959 

0.786 

MGFLCEIGAIPTMVLALWGFLRASSASPPSLN 
RLPGQGSAFSFQI 

2158 

497:632 

0.993 

0.958 

MISWVLLFSLLCLSVNYLLAFEPTYVAFYYGNN 
NIEAMYVFTYPC 

2159 

61:196 

0.996 

0.97 

MAILFIWLFLNYSISFSLLFFSATKLTFSSLPKIIF 
FTLPILVSK 

2160 

308:443 

0.925 

0.654 

MLKASLKQDRLLWKLWKLAVTFCQRGFICFLA 

nox/mrfQCVA/CT 
CjoVLi I I ooovvr 1 

2161 

216:351 

0.956 

0.791 

MSLNPEFLWLKWFSLLLRGRRNSCLIALKGYH 
SVMIFHLPLIPSS 

2162 

106:241 

0.889 

0.6 

MPCFMPNPGAVLGLPPWLLSTQRLTHTRAYL 
NWLASDRWMRRHWR 

2163 

11 07:242 

0.98S 

0.92 

MICYFHVLFTAKLLKRCLFLLSLPPLLPLFLEST 
PSCFCPHQSTE 

216^ 

M 04:239 

0.91 

o.8ie 

MLCNLALKLLNCVSAWNMNIRLKCLLKPKNVS 
» KVCSRGLYFIYVM 

216f 

>51:186 

0.90c 

) 0.66e 

MVTSGRKPHLLPALACLTFAHFPSAQSLAGRS 
i LPLEFDELKCDFE 

216* 

5207:342 

0.967 

0.664 

[ MTLLGLAVCQRLCPLGIIRVPAACAMGNRVWL 






APVHRQNRAYHLQ 

2167 

125:260 

0.922 

0.806 

MFMYVYTYKLAVAMLKLNQLLLKSTKYFLSYN 
FCPPKNQLLFLNS 

2168 

179:314 

0.93 

0.593 

MQTWALSLSHSTLIYFPYCACVKGQNFSTAS 
MFGQGPWARRPWWV 

2169 

110:245 

0.945 

0.636 

MQSLVCFENGLGVFDVLGCPGFLFCLMATMW 
LVSISLYTEVPFGS 

2170 

129:264 

0.998 

0.977 

MVCWWVFVAWLFCFVCLRGPHGSCDMYCK 
NKSVLLTMLLTNIQT 

2171 

77:212 

0.959 

0.65 

MFLIIGEKVRLLLLSFLPTPTRRQEVDRDQIELE 
EDSICGLPNKF 

2172 

78:213 

0.92 

0.72 

MFDCVRLVLTFLNFPLANKPLNTAATLILLPSY 
MHRCKSRQFALY 

2173 

112:247 

0.986 

0.91 

MVWLIPDCTFTPFLLLVLVLCVPCEQGPGTCV 
FKRFSASRWLHP 

2174 

456:591 

0.939 

0.721 

MCDLLLTLLGTSPSSLGLLPPLSHEKVGLHHL 
SKYSLPISLLFRD 

2175 

456:591 

0.956 

0.714 

MGKKFKICLSLCPLLLSAPSLLEKCGKVLTTLF 
VLPGAYWCFSKQ 

2176 

179:314 

0.964 

0.82 

MWYAILHLKQIFJSYLLCISPKLQLPSTVIPKDCI 
LIRPPNQSHI 

2177 

186:321 

0.961 

0.873 

MCSGLSTYCYFLCSLSLILPISQCPLACGPQVA 
WSLGRGJLTQLG 

2178 

441:576 

0.926 

0.821 

MCALLIFSVTIVNIKLNKLPTWLINDLRASWFG 
QTVYPNFYIPS 

2179 

199:334 

0.946 

0.756 

MKPLFLLWQFNLLKLPCRGHVYRPLGSHSTL 
QLLAYDGTHVHSTA 

2180 

195:330 

0.988 

0.651 

MCVALRQEHTWLCYKRCSKWGMFSLIFMHR 
AMFLFLFFAIYLKYS 

2181 

140:275 

0.933 

0.771 

MHFHCTVFFGSCVNSSCLLVIFYFSLRIIFFNLP 
GYLIVFKNYFF 

2182 

363:498 

0.908 

0.72 

MMTVITVTTIDNILFCRGGNQNQEGLFLSPNPK 
LARFPDPPSMP 

2183 

228:363 

0.972 

0.808 

MRVILLWYIEYFFALDPVLVCIYSHVLIVTIFSEY 
YFSGEKILLQ 

2184 

123:258 

0.991 

0.959 

MVLWLVLLISVIEWLMTCHSTTEMLSFGNLLQL 
EGLAVPHLFLGL 

2185 

273:138 

0.919 

0.774 

MLLLSSSLLKCGTCQWQVQPAVAGSLEGGEE 
ESMVSALLISALPF 

2186 

13:148 

0.902 

0.709 

MLNSISLTNWINIFWCYSMWFMVSKCVSYTL 
CSIFHLGGVTYQM 

2187 

44:179 

0.977 

0.851 

MYYSLQLTIILVDVFWRVLFICLNLKEHLTMFMI 
ETVLFLVQINI 

2188 

125:260 

0.934 

0.744 

MHQNFLSPRLWVFLMSFSSFEKIQIEFIPSVAS 
KMNSPAIRLNQL 

2189 

41:176 

0.951 

0.775 

MLRARTVGALVGASLTSGLCVLSADSCISSLL 
PPPLIEKEWPCAL 

2190 

546:681 

I 0.944 

0.744 

MKDPSEPLPVFLSPAVLTFLPFLMLRLHLVSGL 
SQLPSDHSACSA 

2191 

134:269 

0.907 

0.631 

MISVLHGRFCVFGVGLSHSSPALSFTCCHNLC 
QPPWPAFPSCLCS 

2192 

130:265 

0.912 

0J96 

MFLHTRFSQAATLYVFITPFKYDRNSWFLTN 
GAVYLNIKAFSRQ 
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2193 

182:47 

0.972 

0.948 

MWPILVWLVTLCIAEPQTHLILTKGSKSGDK 
DDAELKQRVFLK 

2194 

60:195 

0.958 

0.844 

MERGAGAKLLPLLLLLRATGFTCAQADGRNG 
YTAVIEVTSGGPWG 

2195 

49:184 

0.918 

0.849 

MTQVERVIVFLTLSTLSLAKTTQPIFMDSYEGQ 
EVNITCSHNNIV 

2196 

158:293 

0.96 

0.931 

MAVAFVLSLGVAALYKFRVADQRKKAYADFY 
RNYDVMKDFEEMRK 

2197 

47:182 

0.996 

0.953 

MTKl^QWLWGLAILGSTWVALTTGALGLELPL 
SCQEVLWPLPAYL 

2198 

469:604 

0.998 

0.985 

MLFAFISLLVMLPTWWIVSSWLVWGVlLFvYLV 
IRALRLWRTAKL 

2199 

301:436 

0.902 

0.571 

MSVGLAGAVGRRCHLALAVLHDPLCHHGSLA 
TICKQPEVCLFTIV 

2200 

124:259 

0.909 

0.589 

MQVWLLTLERLRVLPKPRSEFDFFPISHTHPR 
RHPNPRISHVRLP 

2201 

165:300 

0.945 

0.746 

MPFLLNQCGSLLYYLTLASTDLTLAVPICNSLAI 
IFTLIVGKALG 

2202 

154:289 

0.97 

0.807 

MRLSSPWGSLVSCLAILAAFISQILKDPSTEVG 
HGLCVGSLSAGC 

2203 

157:292 

0.969 

0.856 

MRWMVLYFQLVFLPMDECYFCQGFNINSIPCF 
SPYHINGEKHCLS 

2204 

126:261 

0.989 

0.752 

MGESARPGRVSSCECPALLWASLLCFFALGF 
LPTFLNLGHPWDSL 

2205 

74:209 

0.954 

0.848 

MAFFEVLAPLLLDCVFSSMFSSTFYELVTFSTP 
QFKGMNVFYVLC 

2206 

127:262 

0.94 

0.669 

MPIHHILLAIEILPCFVTLLKSVIIKWSKRDTSAT 
SKMLSTRNH 

2207 

59:194 

0.943 

0.803 

MWKFVNFVRISLCMWKSCREKKIHRVISLEVT 
KEKEGYLAPKRL 

2208 

12:147 

0.974 

0.752 

MAVLVLGCVFSRRLLLHSFSSAGDLGGCILED 
CEEPQLLPSSSDQ \ 

2209 

115:250 

0.988 

0.841 

MVGILKIIWYLLRTALFIWKHFCSGGWAGFPEV 
PLYRVLRPMQLP 

2210 

110:245 

0.951 

0.612 

MRLVFFPFTGVTWRSFCSPTYGSPVQFSHSY 
FPIGAHFPCWHEHF 

2211 

66:201 

0.942 

0.619 

MIRVCAQCLPVSIFFLHTNVLVNPRLPPPGTES 
FDSNTLIKIYRD 

2212 

89:224 

0.924 

0.772 

MSRSGIRRPLILVLSALFEASSANHLMSLGLTR 
QSQQSPVHIFNF 

221 3 

25:160 

0.934 

0.659 

MYRRLIKGFILFMLIARENKFDSTLKIYRQFFFC 
VLWKKTSKKKF 

2214 

980:845 

0.947 

0.697 

MLWRLIIILCEALQRKSRLLADCDHFSFPNRYE 
RKLLLDFTVRIW 

2215 

49:184 

0.983 

0.969 

MRVFCVGLLLFSVTWAAPTFQPQTEKTKQSC 
VEEQRQEEKNKDNI 

2216 

11:146 

0.946 

0.82 

MFTFLCLSKINTLFLIYAMRPSQNTTTTQTIPLP 
DNNHRYQT1LF 

2217 

424:559 

0.896 

0.624 

MRSTLLLLELGTYTGNAGALQDCYCTGMDLG 
QGQVKCHKAFLSLH 

2218 

122:257 

0.976 

0.779 

MLLLQILAFAFFQIHSNLGFQPRWEREKEPISS 
SSSLKVAQLALF 

221S 

99:234 

0.892 

0.556 

MLPSFLPQSLGNLIHTLGFLLIIHKYMSAFKNRT 
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DEFMNMGMQPY 

22201 

39:224 

0.933 

I 

0.7181 

V1ESLSLHQYIELILILKIFAWLGFPPVWSSELQL 
=>CANSCVLGLP 

2221 

103:238 

0.925 

0.77V 

MLLPVFLLYLSQDLADSRAPAHCSVNTDLHLK 
yVGSLCVLSHFQVD 

2222 

7:142 

0.985 

0.756^ 

VIEPLLQKWEWFLASLIIYVWLDVGRITRFML 
WSLTWLNVCPVL 

2223 

153:288 

0.933 

0.641 

MLFPLCFTALSFSNFIDKKCISHQPYLRIGGKII 
NLDTNNKRPET 

2224 

205:340 

0.967 

0.763 

MMEWWTMGKGTLILRLVTLFNILKQRLFYLSW 
TRRQHN 1 AARSVG 

2225 

544:679 

0.977 

0.842 

MAGRGGSALLALCGALAACGWLLGAEAQEP 
GAPAAGMRRRRRLQQ 

2226 

516:651 

0.992 

0.966 

MHVWWMLSCCLSHLTLASPCPCHSWFLLMYI 
KGGGGRGFSGAIDK 

2227 

184:319 

0.987 

0.854 

MLiQFTFFFPISQRFWFCLLFFFPQTFKCMKFY 
SLIEVREGVCIH 

2228 

259:124 

0.94 

0.815 

MPTLLQVMSWMLSFGTQTLQLESCTCALHIV 
GAWKVPYPLFSRVL 

2229 

460:595 

0.955 

0.747 

MPACCYRPCLLQPISLLNILLLLMRKPSQEVIN 
DTP KAG KWLS R Y 

2230 

212:347 

0.952 

0.789 

MIRIISFAQICVSSSLLHVALVGYFNWRLKILLY 
RWLIHMANKLL 

2231 

23:158 

0.988 

0.789 

MWLFIASKC1FLLIVPNFIFVFWRKVFSHDRLNI 
AYSFELSSKHI 

2232 

142:277 

0.886 

0.612 

MQVKF1LKYY1SFLWKTVTANGETVNMSLLYIF 
TTMEMRKKSEVG 

2233 

116:251 

0.943 

0.787 

MLLGVRAVPLCSAWQGAVGLVSLTISICKHGL 
SFQQNLVPGKSNV 

2234 

34:169 

0.896 

0.663 

MRiVRRMCMWSAGPAPATVCAVMVAAPKSP 
QSPPRWACVYSLIGC 

2235 

119:254 

0.965 

0.574 

MWSWHVQLQVSAPLHHLLCLHFPPAHRIYMP 
FPSPKRAPAMLNKG 

2236 

198:333 

0.889 

0.779 

MVTRTLVTLNILGTMRLGYLPPVTYWPCDSDV 
KRYLCLAFFGWRH j 

2237 

171:306 

0.913 

0.567 

MSTIFTFLYQPPIILTVLSSTYFINPISTAVHVIPV 
TDRSVECEL 
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f ATRSVIT1NKNG 

3725 

) 251:386 

0.97S 

> 0.887 

MRPALAVGLVFAGCCSNVIFLELLARKHPGCG 
7 NIVTFAQFLFIAV 


/ 
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3730 

114:249 

0.95 

0.7 

MMKCSVRLTLQFFGLGFFFRPQRGISLFIAET 
RRNKIILQHICQD 

3731 

51:186 

0.935 

0.561 

MRRFFWCPSPSFCTSVCSFEVPRPSSQKSG 
GSTSHSPTALPHRRE 

3732 

82:217 

0.969 

0.802 

MLATTNFFSVEVGLGLVFFFLLNYILQNFSYEY 
LRPLNNTSKKLF 

3733 

66:201 

0.981 

0.904 

MPVQLTTALRWGTSLFALAVLGGILAAYVTG 
YQFIHTEKHYLSF 

3734 

304:439 

0.954 

0.856 

MLLLWCNLHLIRCSKFSDFYMLEVWSNHPYR 
MLDNTAYCDLLAW 

3735 

91:226 

0.946 

0.775 

MSLKMLISRNKLILLLGIVFFERGKSATLSLPKA 
PSCGQSLVKVQ 

3736 

275:410 

0.977 

0.914 

MTEATFDTLRLWLIILLCALRLAMMRSHLQAYL 
NLAQKCVDQMKK 

3737 

195:330 

0.933 

0.7 

MLDWGFPMQSTFYLLLMPFRTVPSGLPSKPR 
FPEVLGG IDTTVRV 

3738 

106:241 

0.915 

0.825 

MNNFRATILFWAAAAWAKSGKPSGEMDEVGV 
QKCKNALKLPVLEV 

3739 

61:196 

0.953 

0.6 

MVGDTQWTRPECQAWTGTLLLGTSLLYCARS 
SMPICTVSMSQDFG 

3740 

216:81 

0.983 

0.929 

MLVAVFIILAFNLVLSPATNLLYGSHTLSPSQRL 
LPGPPSPCSHP 

3741 

307:172 

0.987 

0.92 

MADLVLRLTWLGLLVLFLTCYADDKPDKPDD 
KPDDSGKDPKPDF 

3742 

327:192 

0.899 

0.824 

MPLAFLGWVSGSLGTGVGMVEFGKYLLSYDT 
GPQVPVRVSNHLMD 

3743 

189:324 

0.905 

0.659 

MRWPIMIIIDLCLNFPMGSLSALGTDIPQNLM 
GLLIVEKKCKRG 

3744 

168:33 

0.905 

0.579 

MQFKELIFKIFSYFIRFSLVRILWMPESYLLKEL 
YLQIISKHTNP 

3745 

158:23 

0.888 

0.63 

MVTQRLVRMQWLPGLKPRHSDTLLIHIQEQL 
WYNRTKRLPDALLQ 

3746 

508:643 

0.958 

0.856 

MIIAIFLVLVLSQNLATSHQEMGSTALAFESRQ 
DKYNTVEARLLA 

3747 

223:88 

0.976 

0.877 

MKQSTASVFFLTKLFITAVIISIIITVIYASVSHKK 
LELSSLVQP 

3748 

204:339 

0.974 

0.791 

MIKNFSLLSQNSGSVLFVLTYLITFTVISNLALLL 
LTYPKIACLI 

3749 

164:299 

0.975 

0.614 

MKREVTGPLEGSASVCQPFLFLLPGALFIQTA 
LWLLTSKMLYEAL 

3750 

52:187 

0.952 

0.768 

MLTLPQSLWMLTRRTICFVPTIVSCRGLLPSNP 
HHELARLISVSQ 

3751 

359:224 

0.966 

0.762 

MHCWFLASVCPALLPGAYVGSQIPTMSQLAG 
WVHGVQLTFYFLQE 

3752 

162:297 

0.95 

0.669 

M KC RAQLLAG HRH AALLRVLLLTQ PSSKLP EA 
SSMKPKMKISQRl 

3753 

148:283 

0.902 

0.569 

MYLTMISCKPDLHLWNLLLAPLVLQHYPRAK 
ELLLIFFSQFSQC 

3754 

83:218 

0.948 

0.791 

MQFASNEtQIEFCILLLPLLVLIPLSLQERIAVLQ 
SSHCYKQRQF 

3755 

216:351 

0.958 

0.815 

MLGLKRVLALLVRG (TLHGG EVTPRTTWPG AA 
EPACWAGTRKPPQ 

3756 

243:108 

0.905 

0.581 

MGNSCIGRFIRFLCRLSLNLTSNGQLDRGFEF 
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TVLLDPDTRWFMG 

3757 

705:570 

0.965 

0.621 

Mf^riVP>Q1 P\/f AMfiVFMPPPnTPil ATVTAQP 

FLFTVSWALP P ALLM 

3758 

268:403 

0.987 

0.763 

MPPI \A/Wn\AAA/TAnRI 1 X/CMA/I QMAHPHPrPI 
IVIr OLVVI>lkJVV V V 1 MUKLLVoVVLonMUr\UrOrL 

SLFCGRRNPLAWTIF 

3759 

231:96 

0.914 

0.634 

Ml \/PMMPO\A/l VP'K'CM CiTAPni A Wk'PQVA/QWI 
IVILvrlvMNUWVVLVUINOLo 1 Mr\OlMr1l\r\o V V Onl 

S YSRE ERL VAPLP 

3760 

283:418 

0.973 

0.938 

Mf^VI \A/I TVI Wl F^^V/tf AnQKAITTC;! TTUTW/F 
W\\p VLV VL 1 VLVvLrooVr\MUJ5(V\l 1 IOLI 1 F\VVr 

STPLLLEASEFLA 

3761 

199:334 

0.975 

0.878 

IVIV 1 oirivlLLVrirrirxiVIMOV I uLCKLrrrrXULrOr 

CYLRLMRPLGPF 

3762 

702:837 

0.903 

0.631 

GFFVAQGRWAIHRA 

3763 

119:254 

0.994 

0.972 

MOOl 1 Q\/PI \f~T\ 1 1 AVI Ql 1 CKyiUHTMETirM AFI UT 

MrULLovrLV 1 LLLAYLoLLriVlr\ 1 Nfci c --iArLiS.r 
RIDILLFEALM 

3764 

275:410 

0.964 

0.833 

h At CIDU^MCVDII 11 1 II OET/^J nnTD A IMV\A/ A Lll 1 

MurlrnOlNO Y KlLILLiLorOLvJP I PAINYWAriLL 
DLPFLRHFTW 

3765 

296:161 

0.97 

0.862 

mitpi i ii i i \a/wvni ^/icai/mvcdv/i q\/qp»i ai 
•VI 1 1 KLLILLLVvn T nLrWIoArvN YrKV LovoULAL 

LFtLYSNHFJF 

3766 

252:387 

0.975 

0.82 

ka\ CCI 1 ^cnrCATCDTI rnCAPDDCII XAA/lPkA/ 

IVlLrrLLortiVw'rA 1 rr 1 LbUoAUr r'olLVVVIKIN.v 
WASRPDAAQWGL 

3767 

19:154 

0.938 

0.842 

iwipcci-ipii hAi i i CMn\/Di~rir>ci\/i m/i dimiu'mi 
PYPMCEIFSS 

3768 

293:428 

0.945 

0.796 

WlbLoUrUAvVLLoLr 1 AVV Yv^oLALr 1 1 ILnlNV 

FLLYYVDTFVSVY 

3769 

30:165 

0.932 

0.651 

M VVVVAULUoolU 1 LLobWKVUIiKAbbKAUb 

AQGSRWEDEYKWCCDE 

3770 

221:356 

0.946 

0.752 

Mk'M/"*! CT\/n icmi i qi piFiQQin I icri ACAQkTiT 

mfMML/LO 1 VOLIC(NLLoLWrloollLLIr IMoAOrSLy 1 

SCCVIMLPI 

3771 

331:196 

0.957 

0.748 

MPFRI Tl RKRI PFI ^1 FVMII I MPIRMYQK'TAT 
IVIrrrvL 1 Ur\f\r\LrrLoLr T rllLLIVlOIKIN TOr\ 1 A 1 

DFPPDLYSCWFG 

3772 

170:35 

0.96 

0.854 

Ml QX/tfRFR AMX/MFFM ANM/AMIVAU'M kTfinTMC 

AKFCVHMYLHFYFSS 

3773 

224:359 

0.959 

0.754 

hA\A/i 1 II \/AV^ylF^DM\/MQ\/U^)IDTW > A\/^lDPAC^^ 

A/IVVLLILVA Y JVIrwKINI VlMo VriMK J PVAVUr fcArlVI 
NISEilQHQGYPCE 

3774 

126:261 

0.965 

0.907 

MU 1 1\ 1 IrVLVVbbrLLVVlNLYVobbU 1 1 YKollNAK 
ITQRALDYGVQA 

3775 

134:269 

0.927 

0.741 

IVI 1 VVIVII>iLLr tv\oo VrrLo 1 OrrrbrlUObl\Mbo 

AGFSYLSFLNQPL 

3776 

442:577 

0.995 

0.957 

MGFL LGGLFVDWAFGl FVFI 1 TVY^Pl FCRSA 

IVI KJ 1 L» l_ \~s KJ I— 1 V V V rif VILLI V \J 1 I— I vl \<Jr\ 

^VCWRSTPDPVRL 

3777 

173:38 

0.882 

0.72 

MLLKIKFLYAVSRWFADDLPWLKKRSYIEVTR 
MFCLCVWRNIFR 

3778 

379:244 

0.949 

0.78 

MYSPTNPLSFTLSSFFLYSLILIHLLNTYYMLSIF 
IVPRVSKIWH 

3779 

215:350 

0.951 

0.916 

MPRIYLMLFLISQWVTWEWRINSHSNDFISTLL 
ATHPASTPLTFW 

3780 

157:292 

0.942 

0.739 

MGRAALPLQALKTLLFPPTLHGCQQLWTVIL 
GPRCPAQTLFPCP 

3781 

157:292 

0.942 

0.739 

MGRAALPLQALKTLLFPPTLHGCQQLVVTVIL 
GPRCPAQTLFPCP 

3782 

325:190 

0.947 

0.787 

MAPLYLGNYQTLFLKLISISVLAFLFRNVWSNI 
GCHFSSVWTMVI 
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3783 

168:303 

0.923 

0.555 

MrLL 1 LVSQjUVjrr AArL 1 rPVcSCRRC loLL 
WERLGWDREGERHR 

3784 

269:134 

0.93 

0.551 

MoU^VNLLbrVonVVoVbPFLLLbRK 1 AAUN 1 
KPMSLVLFSLSYF 

3785 

239:374 

0.983 

0.916 

MAVLYIuArVLVrLLOKVMCjKVrPGULKAAhM 
EHLLERSWYAVTE 

3786 

108:243 

0.974 

0.821 

K AS*\CltJllklA AI\/OI 1 r~> 1 \ / A T\ IT I IPiACDriLICIA ADI 

MUoMINMIVoLLijLVA 1 VILIPArRGHrlAARL 
GGQSLGKTSRQH 

3787 

282:417 

0.979 

0.934 

MAASLLAVLLLLLLbRGMrSSPSPPPALLcKVr 
QYIDLHQDEFVQ 

3788 

27:162 

0.997 

0.982 

MLLLGLLLLLPLLAGARLLWNVvVvKLRoLHLPPI 
APGFLHLLQPDL 

3789 

139:274 

0.987 

0.958 

MS Ron VALLG LSLLLM LLL YAG LPS PPEQT5C 
LWGDPNVTVLAVS 

3790 

153:288 

0.976 

0.804 

N4WJM\/QPDI API P>\/l/QQVI\A/ACI IAI 1 111 Dl Qll 1 

YHDFYLRLL 

3791 

1355:1490 

0.954 

0.759 

MGRRKFLPPPLLSLLSSSLPLPICHPPAPLTPG 
LGIPPCGWGRE 

3792 

107:242 

0.958 

0.782 

MLQGHSSVFQALLGTFFTWGMTAAGAALVFV 
FSSGQRRILDGSLG 

3793 

592:457 

0.914 

0.687 

MRAVLLQHLFILLDRQTTKKNSNLDIGHVFREA 
LIFLADLKSQLP 

3794 

288:423 

0.986 

0.967 

MRSLGALLLLLSACLAVSAGPVPTPPDNIQVQ 
ENFNISRIYGKWY 

3795 

510:645 

0.981 

0.953 

MFLLLPFDSLIVNLLGISLTVLFTLLLVFDVPAIF 
GVSFG1RKL 

3796 

56:191 

0.974 

0.82 

MLLLLLLPLLWGRERVEGQKSNRKDYSLTMQ 
SSVWQEGMCVHVR 


I 
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SEQ 
ID 

no: 

Me- 
th- 
od 

Predict- 
ed beg- 
inning 
nucleo- 
tide 

location 
corres- 
ponding 
to first 
amino 
acid 
residue 
of amino 
acid 

sequence 

Predict- 
ed end 
nucleo- 
tide 

location 
corres- 
ponding 
to last 
amino 
acid 
residue 
of amino 
acid 
sequence 

AMINO ACID SEQUENCE ENCODED BY THE NUCLEOTIDE 

SEQUENCE OF THE CORRESPONDING SEQ ID NO. 

(A=Alanine, C=Cysteine, D=Aspartic Acid, 

E= Glutamic Acid, F=Phenylalanine, G=Glycine, 

H=*Histidine, I=Isoleucine , K=Lysine, L=*Leucine, 

M=Methionine, N=Asparagine, P=*Proline, 

Q=Glut amine, R=Arginine, S=Serine, T= Threonine, 

v=vaj.xuc, w= 1 1. yptopnan, x = iyrosine, x=UnKnown, 

*=>Stop Codon, /^possible nucleotide deletion, 

\=possible nucleotide insertion) 

1 

A 

2 

424 

LFWSPOTOREPTMVLSPADKTNVKAAWnKVr;zvMaf^T?Yr:ziT^nT p , x?MTrr\ 
LSHGSAQVKGHGKKVADAXTNAVAHVDDMPNALSAiSDLHAJHKLRVD 
PVNFKLSHCLLVTLAAHLPA^FTPAVHASLDKFI^SVSTVLTSKYR 

2 

A 

25 

619 

E FHRLRENPP WCS S PAD KTNVKAAWG KVGAHAGE YGAEAJjERMFL S F 
PTTKTYFPHFDLSHG\ SAOV\KfJHf3\ TCirvnnaT TKrzvtrntn/V nriMDiaV 

ALSALSDLHAHKL\RV\DPVNFQSS * SQLPCLGEPWAAHLP\AEFQP 
IxAVATSSLGTKFPGLSVEAPLL.TFOTTFTCr;WK'PPvr , Mniri?aT wnccn 
PLLPFPAPVPPWSLK 

3 

A 

32 

476 

LYALRNQIKPPPHVTGGQ*QKQERTKPAWRPPRP/GPPPFIALIGFC 
FSSVPVAWVELRVEPPGGHWPLSPLGEVRGEWRRPLSGLAS/VGQWL 
VPR S PEGGGGG WAGS PQVSGGWP PS AHLRNP PLS LGD I L PAPQHPAIj 
SPGGPKLCG 

4 

B 

3508 

3762 

MTPGVVHASPPQSQRVPRQAPCEWAIRNIGQKPKEPNCHNCGTHIGIi 
RSKTLRGTPNYLPIRQDTHPPSVIFCLAGVGVPGLPV* 

5 

A 

105 

1711 

VESNTGRKWTEAAGTGDI QCLAWGS SGDGRGGDPRGRVPAAALGAAV 
VAAMAS PDPWAPGPKQPG/ LMGRPGS SPALWAQ\ PA* PP Y / FPGLTP 
PCPQEVPQSFGPPG/EQGRVLRGGQNVT*GEGRVPQVCGRRPPCPSIi 
MEPFVYTPAS PFPSGQPQGTPGRRQDGEPRLGTGGQTGS PGPGLVSG 
iri^^ji^-rtMJvt'ova a yokjjjVKy v i'Jr / k5r / i-iA J I PLji^ARI *HLtGWVGCP*RMGQG 
NFGVAS KPFLDL/ PGPMLGSTNCDNLGSFGLS / GPMFPKWPTSTGRL 
PKGPSAN*GG/PGMERPPGVMP C ; C ? /RHnnPPnQfiDT nT t < m^l^RT.^^r2^• 

*ALGHK\GGAS*TSQKAESLPQPLGPPGGGCYGITQNPLASGPKQTW 
SMGRPGS FPAIiWAQNRLTFLTPRP\ S PPCPQEVPQS FGP PG \ PSQGA 
ERRARLSPRERGGFLKVCGRRPPCPSMMEPSCLHTC*PLSLPASLRQ 
AKEPPSCGLRAPS*HLRRPCLTPPFRPQGLLGTSHCPQKPAPDSGLL 
HSPLNSPRGFSGQCNPRRLHPQLNPSNRDR 

6 

A 

144 

398 

NMVLSLIFHIQELPQAVCEILGLQGSEVTTPDSDVGEDENWMTPCP 
TSAFQSNALPTLSASGARNDSPTQIPVSSDVCRIiTPA 

7 

A 

46 

1069 

RQKWfiMFPWVTTi'T.PYPTlfK'PDTrkT AVi^VPnTyvrTnuitrr AMv/tTnnrAn 

-Cj 0 r r w v iriiLiv. 1 AJic.KiyjjyKO*vrDEYFRMKLQWx^ 
KRNSRLRD YRSL I EKDVNRTDRTNKFYEGQDNPGLI LLHDI LMTYCM 
YDFDLGYVOGMSDX FFSPFT.YA/MPNP'\7T^zvT?wr , i7nQVTvrnr\ft>mA>jT?T?T?rk 

MQGMKTQLIQLSTLLRLLDSGFCSYLESQDSGYLYFCFRWLLIRFKR 
E F S FLD I LlRLWEVMWTEL PCTN FHLIiL.CCA T T. F «; F KOO T MPKHYH PM 

EILKHINEL\SMKIDVEDILCKAEAISLQMVKCKELPQAVCEILGLQ 
GS \ AVTTPDSDVG * RTKMLVNDFLGPTVCI SKVTALPTLS ASGARND 
SPTQIPVSSRCLQD 

8 

A 

190 

383 

TRM IH ITS VYSH PHQAQLRGCE VLRHKAHLVL PA / WS TS CSG I PRGQ 
QTAQQGYSRCFLAWSLEA 

9 

A 

1130 

1379 

ASHLRSGVRDQPEQHGETPTL\LKIQKSPGCGGRHL*SQLPGRLRQE 
NHLNLGGGGCGELRSHHCIPAWVTETLTKKKKKKRR 

10 

A 

67 

373 

FCDCHHFILMFKSPHIWPVGIFSSWLLCFFWACLHHSLSIALLSCTK 
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R YS GLI L YFLCS S FE ITVS SKS S VS F * RRMVFRNQVLGSRCACCC * Q» 
VAAPRPFP =5 


sua 


11 


366 


796 


AWVEQS KVL I KEGG I QLLLT I VDTPG FGDAVDNSNCWQPVI KYFDSI^, « 
S QD \ YLNAE S QVNRCQMPGNRV\HCCI» YF I AP SGHGPLHN * RLPPSCjjJ ^| 
RIG*YMFVTTWHCLLLRLKPLDIEFTKHLHEKVNIIPLIAKAJDTLMP^ 
EEC 


12 


105 


702 


AGSSVSLGFCPAAAAHKPRGGALRLPVFRRRAQQGPDYALAGVARQP 
AGTCRRRCNRSHCRAEDPQWPTPAAAPAAHSPHMSLGESGLGKLILI 
NSLFLTDLYSPEYPGPSQRIKKPVQVYILVFLIDDKLE*Y*YTQSTC 
CNFHYAS\QSWQPAINYIDSKFEDYLNAESRVNRCQMPGNRVQGCLY 
FIAPSGHGPLHN 


13 


177 


MSVSARSAAAEERSVNSSTMVAQQKNLEGYVGFANLPNQVYRKSVKR 
GFEFTLMWE* 


14 


79 


534 


SSIMTFLESSAVPPHWTGQDGRVCWTGWIPQCQAGSAPE/RS*VFIN 
S AGQKS ADTGWS S S KPQN * HLS S FHQAWGM I QPSHS QFLMKRKAAS 
PRKLEWEH/LQPLHPMTLLYR*DGKPFR*VLLSTYTYCSSRDRPKSS 
GKNARRFPAHGSS 


15 


354 


416 


MKESPGGELPQTGKKPVFLF* 


16 


145 


SSSSSDFAGQTL*STQTVQN*FKKVLKPGRLYPVPIATMQIKEPLIS 


17 


18 


19 


214 


246 


12 


464 


FCGLLLLHPVSADF*PAELINTQEPQERCQLDTGESSRVQHTLPSCP 
VQCGGTAELSRNVMIGASELKCLHPSPKLEYILPGN 


732 


SSIMTFLESSAVPPHWTGQDGRVCWTGWIPQCQAGSAPE/RS*VFIN 
SAGQKSADTGWSSSKPQN*QLSSTGAALPLASLSRERAW\VDDGKHR 
LTTPMTVPQRAVQQL * ETSG * *DWRQKVQI FQQAWGM I QPSHSQFI* 
QREDVIMLRPFGLHLSWEENGS 


288 


FGGGYIPTWGKGEGILALELNHDISREFCSAPAIiASRPPPTPPPLLP 
PT/ PPLPAPRS PADATPRRVGGPLR * AliKPRAPG PGWSRRRCRS WW 


20 


103 


282 


EDTAAQS S PGRGEEAEAS AAEAQGGEQAYLAGLAGQYHIjER YPDS YE 
SMSEPPIAHLLRP* 


21 


918 


1872 


AEKGPPSPGVPSTCLANSP\PPSAGSAAVPHSAPPGAAPRRSSRQRP 
CSPRRCWQEGAGVRAGQDGLGGSATRSAAQQSHQRQTGP/EGSTGKA 
KARGSTGRKR*AMGGSAAGDGA/ phvlgtgqprapppstgdgaragn 
WLAQGSAPTHRRWSTCWEPAS PGLPASSNCCTHTWTRNCPGTAPSGT 
ALPGLPGRPAPHPEPPPPMPPPP/PPRPGL/PLGSCVFIi*VERPIWR 
EVLS VPAPPRACVDTSGHE\ ASGPS \ PVAGTEPSRPWLKQPLPCPPQ 
QPRSWLTRRPE/ AELHMPPTPLPAYLPGS \ GVPQSHS * PPATLLR 


22 


23 


24 


10 


497 


284 


FKLIAWHSHEKIYEGQVEVTGDEYNVESIDGQPGAFTCYLDAGLART 
TTGNKVFGAIiKGAVDGGLSIPHSTKRFPGYDSESSCCYTRESSL 


438 


LFISLLSISEKIIENCWV*LSAARS*ALRKI»AFF*ATRSFF*ARDIL 
GRFHLFF/CNFFLGLLFIDWILSYSSMSFLIHLLHPAGQQQASTICC 
S 1 1 CQANLHT I FWQFVC I RCADYH I PL YTG I SNLTND I S VCHTNYHP 
VIGVW 


753 


DGVLLLLPRLECNSAILAHRNLRLP/GFKRFSCLTLLSPWDYRHIjPP 
RLAIFFVFLVYVGFHHVGYAGLE\LLTSR»SARPRPPKIA 


25 


26 


922 


238 


2057 


XTQTDSPWFMISEKQRNFNAESTIGSHIHGPRIVAGLHAPTLMEEDE 
DALQETVRASIRKEQRNSRHDGGDGIRKAHAAIPRESRSMKRSPRKE 
VKKKRWNRPKMSLAQKKDRVAQKKAS FLRAQERAAES * 


446 


GFWSLDSR*GQWESRSSIHA\VTN*ASSSSSSSSSSS\FSR\VYPR 
F IE YLHKD IQSTGQKS PDAWVAGG 


27 


244 


309 


NP KGQKDREAPLS R S VFVL KI KI KRAFALLLHGR F * P S TPAAS AS PA 
S KS PRG SG KALAS ALF Y I VQLTK 


28 


1230 


1820 


QEYRPSQTPHLALSPERVAPGRRAAGRLAPEARAPRGSP/LPPHRVS 
EKTIRVWFHPGARKAGGTPPRAPRGDTGGAPGA\ PTYS TPLMSLHR 
ARLESSSTGSSFPADSAKPVPLAWSLDSR*GQWESRSSIHA\VTN* 
ASSSSSSSSSSS\FSR\VYPRFIEFLHFDIQSTGQ/RITSRQHPPR/ 
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DLRD AXi F * LNS R I PL 

29 

A 

376 

612 

FPAFWESQVRWIASASGVQDQPDQYGEILFLLKLQKLGGHGG/IR*E 
DHLSLGGQGCKEPRLYHCTPAWVTEHYLVSKK 

30 

A 

147 

766 

G ALHL I RKGS PLHGQAHKHVAQQGRGI LSRVHRPTHVHTQGRKATRL 
GLCSAHGRAPHPSFLPLVPVCCS/WHHCETP\RPCPLLPCPPPSSGA 
STSRTASSPVLSLTD*PYLSETEEQVSP\GLTHKASLEPRSDSDAKT 
SSHPRDWPHPMVDCAAAIiSWTQPLWMFCGSRLAFHDAPAPAHMRAIW 
AS RA I P AAR PG PHLG LAAAS C 

31 

A 

1 

365 

FFFLLL I LFLLL I QYHY I FYTFLLWLRLLVLTGS I * C * KCKI TMNST 
* IE/ TVF KD Y YEHL YVCKLKNLEKTDKVLT I HNFLRLNQ * EPEVLTR 
AKMYKIYNDIE 

32 

A 

603 

855 

GGW I MRSGVRDQPDQHGETLSLLKLQKLAGHGGACL * S QLVRRLRQE 
YHLNPGGGGCSEPRF\GHCAPAWATE*DSVSGKKKKKK 

33 

A 

200 

559 

KNFFFLEMEF/SVXLPRLECNGVISAHRRLRLPLSSYSPASSSQVAG 
DYRACTTTAG* ILYF* *ETGFLHVG\QAGLEFPTSGDQPAliASQSAG 
I TGVSHC PQLKKS I LHETPKGLTG VTS 

34 

A 

2583 

3580 

DRVSLLLPRLECNGAILAHCNLCLSGSSDSPASASQVTGITGKCHHT 
QL I FVFLVEMG FHH I AQ AGLELLTS \ DS PTLAS QS AG I TGVNHHAWL 
FFFCS / RDTVSLCYPGWSRVA* SRI TATS A\ PGDK* FACFSLPS SRD 
YRHVPPHPGNFCIFGRDEVSPCWPGWF*TPDLR\YPPASASQSAEII 
GVSHHTWPQEVFLFLNLFIYLRWSL/DSVAQARVQRRDIiGSLQAPPP 
RFKPFSCLSLPSSWDYRRPPPHPANFFVFLVETGF\TVLARRVLIS* 
PRDLPASASQSAGITGVSHHTR/LIF/NFFETGTHSVTWAAVQWYTI 
/GSLQPRTPELK*SSHLILTSNWDYRCTPPCPPNLFIYL»FIYFHRDE 
GSLCCPGWS *TPELK 

35 

A 

45 

819 

RSLALSPRLECNGAISAHCKLRLPGSRHSPASASRAAGTTGAHHRAR 
LIFFVFLVETGFHRVSQDGLE/LPDL/DDPPASASQSAGITGVTAAP 
SRACSFKLiARGRACPGWDWAEELTGTSHTRRVHPGQALWGWDWRRCS 
CGSRSWQT*LQRCPAVLRGWLACLTAQEEQIGPYVSVSPPVIPGAAC 
CFIDVL.R*KISDKDVFKAGRGAQPKTFSFLLRKSL/NPVTQARVQWC 
DLSSLQLLPPRFK* FSCFSLPSRTIT 

36 

A 

2 

424 

LFWSPQTQR^PTMVLSPADKTNVKAAWGKVGAHAGEYGAEALERHFD 
LSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVD 
PWFKLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR 

37 

B 

44 

307 

mJAVSCRQRPTSRPAWGKVGAHAVRSMCAEALERMFLSFPTTKTYFP 
HFDLSHGFCPGLRATGKKVADALTNAVAHVGRHAQRRCPP* 

38 

A 

3 

203 

HASDKMADKEKKKKESILDLSKYIDKTIRVKFQGGREGE\GLPLSVP 
PL * KFQ IN PATEMMRTHAEA 

39 

A 

132 

370 

SNSPA* PSGEPGTMG/ CRPPCPVNFGILRKNGVPPCGPGGPKSPDLR 
THPLGPPKG/RGIPGGTHCPGPPLGFKKPWPPGRV \ 

40 

A 

25 

412 

I QARKKQQS VLG I KTEDQGTFNLLRKG IMD I EAYLER I G YKKFTSKL 
DFETLTDILSHQIPAVSFDNLTVLCGDAMDLGLEAIFDQAVR*NR\G j 
GWI\LQVNHLLYWALTTIGYETTKIGRYGYQTPAEK 

41 

A 

319 

1002 

IQARKKQQSVFWIKTEDQPTFNLLRKGIMDIEAYLERIGYKKSRNKIi 
DLETLTDILQHQIRAVPFENLNIHCGDAMDLGIiEAIFDQWRRNRGG 
WCLQGNHLJi YW ALTTI GCETTMLGG YVYSTP AKKYS TGM IHLLLQVT 
IDGRNYIVDAGFGRSYQMWQPLELISGKDQPQVPCVFRLTEENGFWY 
LDQIRREQYIPNEEFLHSDLLEDSKYRKIYSFTLKPRTIE 

42 

A-. 

1 

818 

MAGAEGAAGRQSELEPWSLVDVLEEDEELENEACAVLGGSDSEKCS 
YS QDKAKVNSGNKYNDN F FGL Y C I CKRP YPD P ED E VRELEVKPGVTK 
I S TEDDGL VRN IDG I GDQEV I KP ENGEHQDS TLKEDVPEQGKDDVRE 
VKVEQNSEPCAGSSSESDLQTVFKNESLNAESKSGCKLQELKAKQLI 
KKDTATYWPLNWRSKLCTCQDCMKMYGDLDVLFLTDEYDTVLAYENK 
GKIAQATDRSDPLMDTLSSMNRVQQVELIC/GIQ*FED 

43 

A 

46 

512 

QLRMAGAEGAAGRQSELEPWSLVDVLEEDEELENEACAVIiGGSE\S 
EKCSY SQGS VKRQALYACSTCTPEGEEPAGICLACSYECHGSHKLFE 


.1. 0 O IB *4-& «+ 3 . O iS EH! 6, iOl H! 






LYTKR I FRC \ DCGNS KF *NFGMQLLP * QS K/ D WSGNKYNDNFFGL Y 
CICKRPYPDPEDEIPDE 

44 

A 

275 

403 

FSHFWPPEFLRD*MWLGV/VAHACNPSTLGGRGRWIT*GQESET 

45 

A 

2 

343 

NLCQVFIPFLFHSLLYVFKTILKGLFQASFLGHLLLLPLSGTPGRPG 
GGGAGGSGRGDPPAAGASRTIPWPLLLQQLSSSSSHFKA/PSKLSSH 
G*PRALGPPGPYHRPPSPGTT 

46 

A 

105 

328 

RDLHMKTITSKVY*LTI\H*SGSVGRDYELHENPLVGPNIKLGDFVD 
ISHPNDEYSPLLLQVKSLKEDLQKETIS 

47 

A 

1478 

1732 

GGQGRWITRSAVQDPHGQDGEKPSLLKIQKLARHGGRRL*SQLLGRL 
/RQENCLNLGGGGCSEPRSHPCTPAWVTEQDSVSKKKKK 

48 

A 

1662 

1858 

PSFLETEPCSIAQARVGWHDLGSLQPLPP\GSSNSPASASQVAGITG 
A* KHARLI FLFFVEMGCAE 

49 

A 

245 

616 

NPPN*NPPNACVKPEFENQPKLKWSQRGVGGPRREGPPAQPAPEGTA 
GRRADKLVRVGWRPPPATS AQGS AHS PPLFC* S SRSRGPS S PCLEAP 
TPGAS PST * TI TPPRRAWRRACWTW PCSCPTPCG 

50 

A 

96 

547 

ADPAAPVPWFGPSQPGARAQWPGLLGPAAPPPALCSPP * TDSGVQDP 
VSNAPWVNSQQWWRQ/ PAATEKPPATKAPGLLVQEKYKVSR *QKGW / 
PSTKKAKGYFP*NCPPLQFLLPPLLSLPPGDISGQCPHLPKPIiIPPP 
SYPAPCPGFCPR 

51 

A 

702 

1107 

LQERYSDPGLLASFHGESSQDRLDTVQQEK*NAGSASPPGPPHPNTC 
QPRDGRGFTRLPCHRMWGTPGIRWKMPLQ*AAGP\PHAA*WPGSLMP 
PCL* PSASAWTRQQ* * PGSLPTLTSPG *DPGAHILLWRKQSY 

52 

C 

150 

467 

MMGLRLQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxx* 

53 

C 

150 

467 

MMGLRLQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXK* 

54 

A 

644 

906 

CQHFGRPKWADHLRSGVHDQPGQHGE/IPSLLKIQKLARCGGAYL*S 
QLLGRLRQENHLNSGGEGCSEPRFHQCTPAWVTEQDSISKKK 

55 

A 

730 

951 

SFLKDIGLFKI I YFFFFFFETGVSFCHPRLECSGETMAHC\ SLTSLG 
PGDPPTSAS*QAGITGVHHHARIiIFFF 

56 

A 

939 

1605 

ghsanwwlgpeenpalptpelvspprgcrqssrpslrrhkeekak*k 
spqptilrpppapptgslqapkggarppsrprpppprpaapnpXrpl 
ppppspe/pgrsscpappttrttpr\qsqthrppp\ssayekkdaar 
aptatgraprrpglrpkswmpeqekepptdrsidrtpvpspyqtreg 
gra\prassi*gpdspatrqdgssgphrsyrpgcvar 

57 

A 

1 

1513 

MG PLS APPCTQR I TWKG LLLTALL LN F WNLPTTAQVTI E AE PTKVS K 
GKDVLLLVHNLPQNLAGYIWYKGQMKDLYHYITSYVVPGQIIIYGPA 
YSGRETVYSNASLLIQNVTREDAGSYTLHIVKRGDGTRGETGHFTFT 
LYLETPKPSISSSNLYPREDMEAVSLTCDPETPDASYLWWMNGQSLP 
MTHSLQLSKNKRTLFLFGVTKYTAGPYECEIRNPVSASRSDPVTLNL 
LQHS WL IDGKFQQSAQVFFI PQ I TKTYRGVYVCFIHNS ATAGTNIil I 
KRIIVPDTLRKRGNHEKVESSDDIENSNQPFSHPKAFKNIQVQHGQY 
GIHRKLITKLEKWLDSGLPCVISAETSSPRPGCSVSSGFQGHGKSHY 
YSSLECPWKWKLQRNHIYGGKVGWNLEGAHQLHIPAJCLPKPYITINN 
LNPRENKDVLAFTCEPKSENYTYIWWLNGQSLPVSPRVKRPIENRIL 
I LP S VTRNETG PYQCE I QDRYGG IRSY P VTLNVL 

58 

A 

1107 

1365 

HS GS AGMHWNILFLVHCMQAH I *LFDRLLHMM * CNGRGP * FALCTRC 
SQ*DSGAN\WDK*KNILPLRNFGWLGFNRDLGSRGWVPEV 

59 

A 

582 

1303 

GACSRAGLWLLGQGLLGPRIYQLSEDCVSWSC\HQPGLSPQSLIWAR j 
TELSS PDTQSGEDRKTS FVGHQPTALGGLGQS I E SLMS S EAETEERR 
CTWMQLSETNRTLFLFGVTKYTAGPYECEIWNSGSASRSDPVTLNLL 
HGLDALTISSSYTYYHTGEVPKLSCLIDTHPLAEHSWLIGGKFQQSA 

QVFFIPQITKTYRGVYVCFIHNSATGGTNLIIKRIIVPDHSLRSALS 
LEVTGS 
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60 

A 

713 

811 

I PGRPWRL* A* PVTLRLQTQATCGG * MVRASL 

61 

A 

1 

1194 

MGPLSAPPCTQHITWKGLLLTASLLNFWNLPITAQVTIEALPPKVSE 
GKD VLLLVHNL PQNLAG Y I W YKGQLMDL YHY I TS YWDGQ INI YGPA 
YTGRETVYSNASLLIQNVTREDAGS YTLHI I KRGDRTRGVTGYFTFN 
LYHTSYLWWMNGQSLPMTHRLQLSKTNRTLFIFGVTKYIAGPYECEI 
RNPVSASHSDPVTLNLLLKLPKPYITINNSKPRENKDVLAFTCEPKS 
ENYTY I WWLNGQS LP VS PRVKRP I ENRI L I LPS VTRNETGP YECE I R 
DRDGGMRSDPVTLNVLYGPDLPSIYPSFTYYRSGENLYLSCFAESNP 
PAEYFWTINGKFQQSGQKLS I PQITTKHRGLYTCSVRNSATGKESSK 
SMTVEVSEDPYDDTGSHTDKPE 

62 

A 

994 

1108 

EGPHDLCCVHVLLCGDEPRIQKLPEHGWSAVLSFLLQK 

63 

A 

547 

644 

I PGRPWRL* S * PVILRLQPQATSGG* MVRASL 

64 

A 

647 

885 

TGNLQNGRKFSQPTHLTKG*YPESTMNSNKFTRKKQTTPSKSSSPQR 
RTHRQQRPWGSSQPLPAHSASPGRGSCSQHHF 

65 

A 

1586 

1775 

AGGGQLNVNFMGKPLYVKHEPLRTHQNIRHHNNRCSNCKPGQQVDDF 
TLWTVFPKMSEHDS PS 

66 

A 

275 

577 

GGWITRSGVQDQSGQHS VDVSL/ LKKYKKLTKYGGVLL * SQLS / RRL 
RQDNHMNPGGRSCS * PRQHHCTPAWVTVRDS ISKKKKKKNKKKKEKK 
GKKALLEPG 

67 

A 

1016 

1461 

VSHSHLSSPQFNSHLNRRPTRIPLSPQPK*GPAYSRPPVWTPAEPEA 
LRWGRHRG P * KARG SR P S S AHSGWRPRGE APHRTGTAQP PTAVPCRR 
AHLGEDSTPGPPGALGGVGFPSASP/PVPTPT*TQTGRRPETQLRQQ 
TPRGAPPRA 

68 

A 

241 

492 

GGQ I MRSGY * DHPG * /HGETRS LLKI \ QKLAGHGGRHL * SQLLRRPR 
LENRLNLGGRGCSEPSSRHCTAAWATE*DSVSKKKKSRR 

69 

A 

709 

942 

SQHFERSGDQHHPGQHGETLSLLKMQKLAGCGGMRL*SQL\LGRLRQ 
ENLLNPGGRGCS KPRSHHCTPAWVTQQDS I S 

70 

A 

170 

340 

TDSALGSPCLHPTALCAVCGTA*SST/ GLVALALMRLELILKSSVPA 
VMAGIITIYNL 

71 

A 

134 

452 

QSPDSMNAREFRRRGMEMVDYVAYYMEG I EGLLVYPDV * PGYLRALI 
PAAAPH*LDTCEDIISDA\EKIIMPGLSHWHSPDLFVYFDSVSSYAR 
MLEDMQCGALDC 

72 

A 

218 

428 

LSSLPTTISRVFYNLS/ CSHL*YSLKHHFIYLFIYLFIYLFIYLFIF 
ETESPLSPRLECRSGISAHCNLRL 

73 

A 

702 

1282 

AVEMYL WCGTVAI SRVRMSAVCVLRTWS \RNAG\QLI \ CFTVFQPC 
SNVHVL/KGPNYVCFFG\YPSF\KYSHP\HNFVKTINAVCGQ\LVQF 
RFPDTEEGIRKVTV/ KCYVKEGDTVSQFDNIGKVQSDKASVT\ I ASP 
YDGVI/RKL*HSLDDIAYVGKPLVNIETEALKGTVNLFYQIDYCPPF 
VIGSQLKVTFKKKK 

74 

A 

743 

1233 

ECGVFSFPFLFLRQCLVLSPQAGVRWP\HLGSLQPQDSPGSSRPSHL 
TLLSSWDLTGTCPPCLANFRIFFVETGF/TCFPRLVLNS*GSSDPPT 
PQSVGDCRAGSHHTLAVFSF\CFEMGPCSITRAGV\R*HDHSSLQP* 
NPGLKRSPCLSLPSSGDYWSMPPCLA 

75 

A 

3 

300 

VFQRTSLQT FAI LTLS CQGTQAYEERS IRS FANDDRHVMVKHS T I YP 
SPEELEAVQNMVSTVECALKHVSDWLDETNKGTKTEGETEVKKDEAG 
ENYS 

76 

A 

189 

442 

KTMRSIRSFANDDRHVMVKHSTIYPSPEELEAVQNMVSTVECALKHV 
SDWLDETNKGTKTEGETEVKKDEAGENYSKD\NG*RTL 

77 

C 

183 

350 

MYGLPGFFYAHRXXXXXXXXXXXXXRHAPALRVTNRSHS PTGGRVAL 
PQPWLVMV* 

78 

A 

226 

481 

CLAHSRHS INI STCTGAEGHESESFS YWGTRGPSATMACDGLTRTGH 
SSPSRARSPPLLCARSLTKAACQ*AVPVTWCHCWRPQG i 

79 

A 

421 

899 

IASLFLIVKVEEVSQTAIWKKKKCDEYIPGTTSLGMSVFNLSNAIMG 
SG ILGLAFALANTG ILLFLVLLTS VTLLS I YS INLLLI C SKETGCMV 
YEKLGEQVFG\PQGKFVIFGATSLQNTGGKKNMLFFT*LEYTHHTSF 
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FS KWLFVRLLLLSRSE INLLF I 

80 

A 

44 

377 

TFFLYLENLNTAVRFWKLPYIHCFKNPGGAGLICIAPWLTKAFVFGS 
D Y S CLiVTVRVK * VF KP F LKDVE I RR / FTWT I RQPWDAVFLRVPD YHA 
LIKQPMDLETD* MRLANS 

81 

A 

457 

489 

PP * ITLG* KTCSQARWLMPVI PTLWKAEVGRSLELRSLRPAWATRRN 
PISTKN/LKISWGRQGSSELRSRHCTPAWQHSETPI 

82 

A 

999 

1525 

WEMVEIWSYSSSEQEHQLIKLSYQRIKPLPHAHWWPFNPSGLPPEAS 
K* PPG * TI»\LG I FLGKKA* LQTFPLNYIRLENMQPRHGGSCPVI PTL 
WKAEVGRSLELRSLRPAWATRRNPISTKNPKLAGGARAHL*SPGTPE 
AEVGOSPEP/ GGRQGSSELRSRHCTPAWQHSETPIS 

83 

A 

2 

450 

LHCCPLQLKMVACRAIGILSRFFAFRFLRSRGYICRNFTGSSAXiLTR 
THINYGVKGDVAGAR\INSTNSKVNTLITELQSQFSQVMNEILSSYQ 
IRRAVLISSKPGCFIASSDINILTACMTL*EVSHRSSEAQIIVQKLE 
MTTLSIVA 

84 

A 

247 

287 

KRGFVLPPGLECGGPIWVNGR/LRLLGLPPFSGLSFPGSWDYGGLPP 
GR VK I WH FLEKRG FRG VAR VG S TLLTG E P P VLG F * KP P FF 

85 

A 

2 

526 

RGGCDRDGPFSASSPLAMASLTVKAYLLGKEDA\AREIRRFSLCCST 
EPEAQAEVAAGPGPCERLLIRVAALFPAVRPGGF\QAHYRDEDGDLV 
AFSSDEELTMAMSYVKDDIFRIYIKETNECLA\DHRPPDAHERA/PR 
NMVHLNE ICDG *NGPVAGTLYMCSFCPYYYLCTPFNV 

86 

A 

285 

326 

NLPKILGHTQKRLQSTRSWKRMMLSLINLQII I YNPVI PFVGTIPDH 
LDFVS*IVI\RGPVSSDANYIQYDL*NPNKSLSV 

87 

A 

133 

394 

L I T I Y Y I PENCQ PGQH S ETS S L / LKKI KKLAGCGGTHL * S QLLGRLR 
RED YLS PGGQ I CS EP * S YHCTPAWVTG QD PA S KKKKKKKKR 

88 

A 

2 

359 

EVLRMAGTGLVAVEVWDTVLPDFEEGYEGPGGREAAAAIiVEDETRRY 
RPTMNYLS YIiTSPDYSAFETDILN\NDSERLAARQP\ I *LLTMKLYY 
LSSPSCGHYFVLLHVYNCLSLYFPSP 

89 

A 

69 

342 

VGNMSGTNLDGNDEFDEQLRMLELYGDGKDGDTLTDAGGEPDSLGQY 
STDT\ PTEW\DLDI * AWFFMI SEYFLFTYLVQFMAFLLYCGFMFL 

90 

A 

144 

2205 

DRRERCHPFKRALMP/LRPAEPRPAFPIHHPLLPQDANEA/PG/CWA 
EAVQGCPT/ AED P PC VS RAAPGGS VPP FC / PGRLL * GWGGPRGKGDP 
EGLADSGPPCELRFEEESRPPRWGES\PAEKIiAYPP\GPLCFF*PF 
SEF/ PEL*TKSPS*EMTNHMLHSGGGGSWPGSPTS PGPPPGGSLPRA 
AAliPPKTPGQTRAPAGAGCPHADIiAMSPGPTPGSPPVAPTSLVADCS 
SQNV*HPRVPEPTQAATPVQFPKNVITSPGCQSPPKQP/APVPVPRM 
YN/ MPRVTE PAQAATPES LGHVPELRSTQTVATAS PGHXiTPMKGTLG 
SAREIPEVGTGPPWCPVP\PPSPLPLCPAPRGSAGGLTLPLRASQTS 
GSTFLLI S FPFCKLTS PFPGLGVOGTPAGGRSVP TWnfiWQPRRRT 

SVGQLLHLPVPLCPPLSKGQ\VMAPIAGL/PIQCSGPPVQWGDPRPT 
PP*CIQPGQVYCCD/VLPALLCRFPGAAQPRTPHEWACPALAECLPC 
ALRQVPRAGQPG*TMCGAGTGPGCWKGGQKEAGGRCEKliPTPPLNTT 
ALPAPAGPCSQLPLSHGWPGL\PPSPH*DSRQGGPDP/ASPSQPWLR 
KPPNPTMGLRKPRK* LLPAS PKRLRKPKAASLPVRHPKASLGCS * EG 
LIWALP*GW\PCRWGPLGAIiGGTWQPPRHPGPVIiISRA*GLQPQPLT 
K 

91 

A 

122 

876 

SPLLGGLRQKIFFNWEGGGSIKPKLTPSPPARGQGGGPFSSSSPSSS 
SPES*LFPALRTPSRQGGSAIPSASGLSTPACVPSGSTRNLFHAPHV 
GRVSLLLPAS * KAAGPSKKGPGQVNPA* FRKPGVGI *RSVGPRQLPL 
E*LHARKNRAGPVPLPGLGKEGPSLQGPKRTQLASLLMEGPPTHPTI 
HDGTTAQPELWPCRGPQMQSWGSGP\EHTFGPRQPHPPHPSSRHRSD 
SESSSLLQASAHPGKA' 

92 

A 

27 

321 

I INTS I SLWFLGLSSI IHHYQERFL* FN* FGSYMVFIEI FFGISSAF 
FVC* WPK/ Y/ LLYFRYCRCFLILLLLS IFEF/DLCYLVLi* IFMFFLF 
FYMFLCV 

93 

A 

264 

664 

I RLAEFCG I FFQAFS I CNSN 1 WLLPE I PLLG I YP * E\ * NTKTCTQMF 
TAALFI IS * KVEISQMPI INEKINKLWYIHMTCHTKNKVX»IRDTT*M 
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NLEN I KLNERGQTKKKKKAI YCLS LFM * YVQNRQI HTDEK 

94 

A 

2 

605 

SHGTPGRPTRPVEFFSGGS AGS FS PWFPKKAKKRAGGADSNVFSMFE 
Q\TQMQEFKEAFSIMDQNRDGFIDKN\DLRDTFAALGEV\N\VKNEE 
IDEM\IKEA\PGPLNFTVFLTNVWGENLREADPE\ETILN\AFK\VF 
DP*RPKGCMKA\DY\VSGKMLTHAGRRRFSKE\EV\DQMFAGLPPLT 
* LGNLD YKNL\ VH 1 1 THG \ EEKD 

95 

A 

1 

1254 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDT qT,WKFFT<3trYWTT rnaDP 

HRDFI KNM I TGTSQADCAVL I VAAGVGEFEAG I S KNGQTREHALLAY 

TLVAFVP I SGWNGDNMLEPS ANMPWFKGWKVTRKDGNASGTTLLEAL 

DCILPPTHPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVT 

FAPVNVTTEVKSVEMHHEALSEALPGD1WGFNVKIW 

GDS KNDP PM E AAG FTAQ V I ILNHPGQI STGYAPVLDCHTAHI ACKFA 

ELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 

LGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK 

.96 

B 

52 

554 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMG KG S FKYAWVLDKLKAERERG I T I D I S LWKFETS KY YVT 1 1 DAPG 
HRDFI KNM I ILNHPGQI S AG YAP VXtDCHTAH I AC KFAELKEKIDRRS 
GKKLEDGPKFLKSGDAAIVDMVPGKPI * 

97 

A 

1 

1299 

MPSIKLQSSGGEIFEVDVEIVKQSVTIKTMLEDLGMNDEGDHDPVPL 
PNVNAAILKJCVIQWCTHHEDDSPPPKVYENKEKRTDDIPVWDQEFLK 
VDOGTL F E L I LAANYLD I KG L LD VTP KTV AKTM V R P T Q irvw q t t n a o 
GHRDF I KNM I TGTSQADCAVL I VAAGVGE FEAGI S KNGQTHEHALLA 
YTLGVKQLIVGVNKMDSTEQPYSQKTYEEIVKEVNTYIMKIGCNLDT 
AAFVLISVNVTTEVKSVEIHHEALSEALPGDNVGFNVKNVSVKDV^ 
GNNAGD S KND P PMEAAG FTAQ V 1 1 LNH PGQ I S AG Y ALVLDCHTAHT A 
CKFAELKEKIDRCSGKKLEDGPKFLKSGDAAIIYMVPGKPMCVESFS 
DYPPLGHFAVGHMRQTVAMGI IKAEDRKAAGVGKVTKSAQKAPKPSE 
YYPLYLPPQS 

98 

A 

36 

698 

KLPLKAK^GKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIE 
KFE KE AAEMGKG S FKYAWVLDKLKAERERG I TI D I S LWKFETS KYYV 
TI IDAPGHRDF I KNM I TGTS \ QADCAVL I VAAGVGEFE AG I SKNGQT 
REHALLAYTLG VKQL \ I VGVNKMDSTG I AILTGD FPS LEPRHVS TWL 
QHWTI PTRNWHKCYCVG WANFLNVSADFLNNF 

99 

A 

62 

1548 

PLKAKMG KEKTH I \ N I W I GH \ VDS G KS TTT\ GHL I YKC\ GG I \ DKR 
TIEKPEKEAAEMGKGSFKYAWVLDKilCAERXERGITlNDISLWXKFE 
T\SKYYVD*SLIAPG\HQETFIQKHDLQGTSQGLAWFGGPPFWGGSP 
L I VAA\ G \ VGE F * SLVS P KNGAD PTKHALLH T .HTnvw KOTKT , Pnvrrv 

NWIST*/EPPYSPEEDIEGNC*GKVSTYIK\KIGYNPDTSSILCPIS 
GW\NGDNM\LEPS\PNMPWFQGDGKVTRKDGNASGTTLL»EALDCILP 
PTRPTDKPLGLPLQDVYKIGGIGTVPVGRVETGVLKPGl^TVTVTFGPVN 
VTTEVKSVEMHHEALGEALPGDNVGFNVKNVSVKDVRRGNVAGDSra 
DPPMEAAGFPAQVIILNHPGQISAGYAPVLDCHTAHIACKFAELKEK 
IDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPPLGCFA 
VRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK 

100 

A 

404 

1074 

ARACKIPNTASDRPI IVALQRKRSSPKMTQSAAGTCPTSCMIK/ IDS 
HKCGDDRELFAQAPVDQFPGTAVESVTDSSRYFVIRIEDGNGRRAFI 
GIGFGDRGDAFDFNVALQDHFKWVKQQCEFAKQAQNPDQGPKLDLGF 
KEGQTIIOiNIANMKKKEGAAGNPRVRPASTGGLSLLPPPPGGKTSTL 
IPPPGE/PVGCGGIPRPASSCSQFRSTSSQTQPGTGW 

101 

A 

2 

477 

I VS PEVRWAPG VAMEES G YES VLCVKPDVH VYRI PPRATNRG YRAAE 
WQLDQPSWSGRLRITAKGQMAYIKLEDRTSGELFAQAPVDQFPGTAV 
ESVTDSSRYF\VIRIEDGNGRRAFI * I\GFGDRGDAFDFNVALQDHF 
KWVKQQCE FAKQAQNPDQG 

102 

B 

118 

419 

XLFCVDIDECSIMNGGCETFCTNSEGSYECSCQPGFALMPDQRSCTD 
IDECEDNPNICDGGQCTNIPGEYRCLCYDGFMASEDMKTCVDVNECD 
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LNPNICLX* 

103 

A 

59 

451 

CLGVTVKDVNQQEFVRALAA\ F\ LKKSGKLKVPEWlADTVKLLAKHK 
ELAPY\DENWFY/ SREFVRNIxASTARHL»\YLRGGRWGIiAPLTKIYGG 
DVORKRAFIAPAI FSRG\ SKSVAPRVT.OATA EGLKMVFTmnn 

104 

A 

583 

1526 

PHL I LQVTKAMCPFESGRE S FLWS I C I YFKPDS S AS FS FPDPYS VQ 
C*PQLEPPPHCSTSIYFPK/ PTLPHIPFPPPLLTKHPKEDLELAGWT 
SSGCFYFSL»PSTKLGENWSLHPQSHVYRSGDLDVGSF*LLSQKL\HR 
NPSICSLKGPPPREGLGNDPVSTNTAPYPRNLP*DLQRTIFSSPSIF 
YPGPPGAPSGES *NP * ELEG I LEVF * LELCPTE/ VH*HQPGLVFPPP 
SGLFSSFPPPQKILTHRFLLVQASKPTPLPIiRLCPLWSPSHPYPSSL 
APSSSLEFGPGSPQKTLE*PPLPKKQTDRHAPGPE 

105 

A 

1283 

1648 

SSGASVAPTSWTSNRFPP*SWVPSSF*RTHGPRPSGPPRERKPRAPG 
QE PTKGTPRES VCLNDLPCPGLLGI CRP I LQTS P / CHGHHG I LS WN 
VLKGDKPKSPRSLGLPVFHNHHFRDLSVL 

106 

C 

263 

304 

MLKLSVRNRETFL* 

107 

A 

426 

1519 

AWRRRRSGTSGKATWWCSGLRRASPTPQPPVO^WATAVMVJK'P«IP«5<?c2 ■ 
PAS WE/ PALPREPHRAVSEQRQ*GR* PCKPELTAPLCLEPVHRPEGP 
MGT/YSRCCLSPLHRP/PGPVGTFPV/SPEPVHRPQDPWHIPGVPEP 
VHRPQDPWHIPGVPEPVHRPQDPWHIPGVPEPVHRPQDPWHIPGVPE 
PVHAPTG P VALAS VGAS S RDGLL PAHAAACTLiHETAGOTPTS PT.QPQ 
*GLGLPFCSRRSRQPWTPALGHSKTSGEWRGGARPGPCGC*CCMLSP 
TQQPLPGGHPQRARAS PRAGEG * TSRAYLAVHAAATLWKLPLPEDPP 
PLLDARISAHRPLCPSGRHRKVSLFLTLSFSISCEARKIG 

108 

A 

1 

539 

ivi\v^uv«ivn x Aar^.urtJJuuDX v ojjt OI7 /\v jry ^ "Uoo it Li v c X I. V AK iniriJ 

PPSLLPLPAQLWQRTMWTMKLLDYER*MRWETAAGGDGAEAPJ\KKDV 
KGSYVS/LSHSSGFLDFLLLL*VIjPSIVHCAFEHSTHFRHECTSOAT 
LGKDALFPALLD I FLTGVFFLTI FS PVLRVLLRCLFCLS 

109 

A 

1 

457 

AGGGCSPKGRPEAKSGQRDWELVAGGPPGISRREGTCCSRFPSRLSQ 
PFRSAO0L0LAASLPANLSNFCOGSEMPTTSRPALDVKGGTSPA1CED 
ARPEKSTLGQ/YSTLLVIDNQVSSKTR*PDESANQYYASDTFIILSR 
TYNRYILVHLSK 

110 

A 

232 

378 

FPTTKSLG*DSFTSEFCQTFKAELIPILS/RLFQKLEQYVTLPYPFY 
EA 

111 

A 

515 

911 

LPLFIMNMTVELWPDTTSNLPRNSEILSSPTRPNQLFVCLFLGSPS 
LPRLE YKWYSQS SL* PQNPGLK* SS P SAS YVAKTTDMCHHAWLI FLF 
LQTEGL/NYIAQVG/VQTPGFKQSSCLTLPKC*DYRHEPP 

112 

A 

40 

351 

LKIPMQFLHSGFWFSFFVFFGF*KFGFGPQGGRQGG\NKTKGEKLPP 
GSSSLPGPNPQENREKKGPPKTLKKFGNLSSSGGKTRGPRGEKNSDP 
KGTPGQNPGN 

113 

A 

307 

1429 

CTATQSGWLCLHRPCPAWRCTWRTTWFCIRYKGEMVKVSPJ^FSKLW 
LLYRYSCIDDSAFERFLPRVWPTjT.PPYOMMT^nvriT.T.T^r:Tr:T.or;i?nzi 

CCMSLRPSTDSLASVSECFASPIiNCYFRQYCSAFPDTDGYFGSRGPC 
IiDFAPLSGSFEANPPFCEELHGCHGLSL*ETA*ELTGAPVPSSVFIP 
EWAGNPQHQRSPAWKQ/MPLQTPPVDPACL*A*VPQWLPAHLQEGGN 
ALQGRPQHGCAiPTE/PTTiALPSGPJRRIiTGCRS*VLPTGSPGP/PAT 
ALVLPHRS YLGGPRTG IRGREQGPKPRAS PHLTYSCGEEGAPGVLSL 
DLLGLGPLGPQRDPGCH*HMKIMVLPGLPSLPVPKSSPQTPSKSHVY 
RS 

114 

A 

161 

218 

SWDRVLGRSDELTTLSCTP 

115 

A 

116 

565 

eptgtaspjuv™pnfsgi^kiirsenfqellkvlgviwmlrkiava\ 
aaskpaveikqegdtfyiktsttvrtt\einfkvgeefeeqtvdgrp 
ckslvkwesenkkvceqkllkgegpktswtrelt1togeliltmtadd 
wctrvyvre 

116 

A 

343 

528 

KTVSLPGIVPHACNPSTLGGRGGWITRSGVQDQPGQHGETPFILKKK 
KKKI CGRKLI PFSEG 

117 

A 

296 

631 

FKLTSSRNVPPTGPGAVAHACNPQHFGRPRQVDHLRSGV*DQPGQHG 
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HCTPAWTTG *DS ASKKKK 

1 1 O 


/ ft 

J JO 

GO T T , A TjM P KG f5C?nnG T IjTY PN P PT. PG * NN F PG L. T P P R Tfi T NG T .G P PG 
RVNFGIFKKKGGFPPGARGV*NPGPRGASSPGFPKGRG*SPPP/QGP 
FKPLKR FPDLP I FPR 

119 

A 

454 

778 

ADPMSPSSKSPNMEAVLWIPDTNIVL^SLKVYPSS^TIKSWLGT/CG 
SRL * CQHFGRPRRVDHEVKRLRPS * PTWRTPSLL/RTTKI SRAWWHT 
SWPATREAEAGESLE 

120 

A 

133 

359 

PJICSSGMEIPPTl^PASRADLVAQNYINYQHGTPHRVFEVHNAYRVI 
MQD\MSG*GHLVSPSSRFFLHSFATSLFE 

121 

A 

3 

1671 

SSGARWRFRAGSMGLFCQRKHSS/ SHPVLQKPSVFGNDSDDDDETSV 
SESLQREAAKKQAMKQTKJjEIQKALAEDATVYEYDSIYDEMQKKKEE 
NNPK1»LLGKI)RKPKYIHNLL 

EFDDKEAFVTSAYKXKLQEPJVEEEEREKRAAALEACLDVTKQKDLSG 
REMHSFKLM*K*RJENPDADSYFDAXSSPDD\K*KKLELTAEGKRS*R 

PT.PMTQCJTTGVTfTTT.nHT.VKKPGTVPGTTP lfnWPRPPnMRKnKTQT<? 

XT XjX\1 I X O O X iUVA.1 X UVJXli-l V AVXVijO X V C O JL X IvInX^Xi OIvX^IjXJiM JvIXOIvX O X O 

RSNPETKRTITLTVITGKXGILIGTERPVIEIPIGRGMNRKINQGRG 
TKEK^VTEYGKGRKIGRNIPKENKKEIDNKMIRTDPVRKERRKRKAK 
QRKRAYESKGGKVTjKLLINTE I EKNER* VFS / AS ERNQDQKGKQPKF 
QGQRINFLTIS^DFIQNGETWQRTKERNQEKPSNSESSLGAKHRLTEB 
GQEKGKEQERPPE AVSKFAKRNNE ETVM S ARJDR YLARQMAS G 

122 

A 

170 

339 

IMKIjITILFL*CRLLLSLTQESQSAEIDLLDRDLFLAEF^ 
MQIQTNPRI 

123 

A 

23 

307 

RQTRWCPWRLSHYRTLGGCCHLRRGRGVA*VRGPQSGTISSVENTP 
P WRR VS C F P APN I TCKDS S GNETHFTGNEVG F FKP I SCRNVNGY S YK 

124 

C 

35 

91 

MSPPQNKGPFPKSPGWF* * 

125 

A 

50 

230 

NLKGPLRRPVSGIIIHVISLPLYQKCSKNEKKIPWRQMEM/C*NVPS 
ANN P PT.G LL KM T V F 

126 

A 

814 

1292 

GISPFYIFGQDMGLEKNPTSFPSKMCFHCPLESLPSYVGCWKTGNMS 
CXA^CTVNWFLRSVIYF^IFTNILSHFEVlxAXiKPv 

T,*\ CCRRTTGHORVWPSRPPEOTDOTARRPPSWRPTIj / CS PLPLPPP 
PRSGREKGNRARFLKGPRIG 

127 

A 

191 

482 

DSSGQVQWLKPIIPVLGNLRQADHLRSGVQDQHVQHGETPSLLKIQK 
KT,ARHGGACT.*SOTjI.GRIiR /OETH*N^GSRGCqEPRTiRHCTLA*ATE 
GDSI 

X 4Z O 

7V 
r\ 

4. *± ** O 

X / / o 

NT , S R NK F \TL L. FG KN I P WVG W AR W L. V PGN PN T IjGG OG RADH L KLGVOD 
QPGQHGEIPSLLKIQKLTRHGGVCIi*SQ\MHIxNPGGGGCSELRSHHC 
T\ CTPAWAME * DS I PKNK 

129 

A 

3 

370 

PGFPLFSFPEGNGPSKRQTGD*IRCLF*DGKVWECSPPSSSSSPKKR 
KKAVIF/ CVPVQTKCIWEGGEETLVGDV*V* P* \GSFKHWAMFPE 
K/DCLCTLYEASFKTKESRRVDGFVCVRVGT 

130 

A 

412 

616 

VFVCLFVCFETGSCSVTQAGGQWCNHGSLQPQPATSAS * I VG\GVAG 
VYHHFQVFLLLLFNRDEVLLY 

131 

A 

278 

466 

YTH I LRQLPTLRHEQKS RENCLEEMSLDRFQAAKP \ S PTPTHHTYKP 
TLAGH * K I HAMGLTRA 

1 J Z 

fc» 

C 1 o 

1 / IB 

1 v 11J^xjVj1^>\JVx'\^V»OW1\A^^, Jj 1 x Libr yxJXJl v XM.V K^C F o X JjKVoyx/X/Jr 1 VI XrXXO 

SRFPDSTGTPAHSIPLILLWTWPFNKPIALPRCSEMVPGTADCNITA 

DR KVYPQADAVI VHHREVMYNP S AQLPRS PRTRQGQRWI WF SMES PSH 

CWQLKA^GYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPIxlSrLSAK 

TELVAWAVSlWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPOGTi^ 

TLSRYKFYIxAFENSLHPDYITEKiWRKALEAWAVPVVLGPSRSNYER 

FLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLRPRS 

FSWAIiAFCKACWKLQEESRYQTRGIAAWFT* 

133 

B 

1154 

2233 

I^PLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPTVYPNG 
SRFPDSTGTPAHS I PLI LLWTWPFNKPI ALPRCSEMVPGTADCNITA 
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DRKVYPQADAVIVHHREVMYNPSAQLPRSPRRQGQRWIWFSMESPSH 
CWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAK 
TELVAWAVS1WGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMME 
TLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPWLGPSRSNYER 
F LP PD A F I HVDD FQ S PKD L AR YLQELDKDHAR YL S Y FRWRETLR PR S 
FSWALAFCKACWKLQEESRYQTRGIAAWFT* 

X34 

B 

1029 

2108 

MDPLGPAKPQWSWRCCLTTLLFQLLMAVCFFSYLRVSQDDPTVYPNG 
S R F PDS TGTP AHS I PL I LL WTW P FNKP I AL PRCS EMVPGT ADCN I T A 
DRKVYPQADAVI VHHREVM YNPSAQLPRS PRRQGQRWIWFSMES PSH 
CWQLKAMDGYFNLTMSYRSDSDIFTPYGWLEPWSGQPAHPPLNLSAK 
TELVAWAVSNWGPNSARVRYYQSLQAHLKVDVYGRSHKPLPQGTMME 
TLSRYKFYLAFENSLHPDYITEKLWRNALEAWAVPWLGPSRSNYER 
FLPPDAFIHVDDFQSPKDLARYLQELDKDHARYLSYFRWRETLRPRS 
FSWALAFCKACWKLQEESRYQTRGIAAWFT* 

135 

A 

885 

1173 

LSQGPRRHSSAVQPPPHSHRGHHHDDCASPSQVRQNY\AINRQINVE 
LYASYIYLSMSYYFDHNDVALKNFAKYFLHQSHEEREHAKKLMKLLH 
FDC 

136 

A 

961 

1051 

LRVQACTWLVNFLYFFCRDGVSLCGPDWS 

137 

A 

2 

804 

SSGFPASTVLGRNPALVPHPGRPPIAS PPS PLHRTLGLPQGPRRRS S 
AAQPPPPAAASPLVAAMKTAVPPRKVRQ\NYHQGLKRAAHQPARINP 
G S S TAS YVLPC PM S YLLLTRDDGGL * RTFA \ KY FL \HQF SMKEEGNH 
AEKT * * KLAEPNEGGRNL SFKDI QEP\ DC \ SD WGERGLNA\ MECALH 
L\EKNV\NQSLLELHKTGPLTKMTPHLCD\FIET\HY\LNEQGESHQ 
KNLGDHVT\NLR\ KMGAPESG\ CAE\ YLFDKAHPWGDSDNES 

138 

A 

3 

402 

HGKIFYFILFYFYFFIFLRRSLALSPQVRTADCS\GAISAHCKL/RL 
PG FTPFS \ CLSLP S S WD YRRP\ HPRPANFFL YFLVETG VS PC * PGMG 
LDLLNS/SIPPRLGLPKCWDYRREPPRPVETFFLKAENVRVNYI 

139 

A 

210 

1641 

DPARAGTGVGASFRGATWQQGQGRGSACSCTQCPPATCLDTGADAPT 
SRPWSLLSRLLGHCFSVLTVPAAAPPPPGSL/PEPPRAGPQCP*TRP 
PLR*PGCSHLAGPHSGSPRPCSLLGASATLYGFRHFLAGPAAQGGGQ 
AVGSQGDHPTRAQPP * WS S PQTPLNLS AAQE FEPRGTCPQTRW * SRP 
DALPWPRPWEPWSG*AEGWEQE*WRLQPQGTAAPRATSSGYSGSSRG 
QRARPGPARCGDGGGAGRC\GHVRGRWRQLGHRP/RGSSAPAVCCGT 
LSAAPPGPQHSLSSL*SSAPGPQWCPHWHPQSGRQG*/SPLPPPPGP 
G\RPPCAPCSGSPSA*GGCTPGCPTAP*GRRQGRWR*PRTGRLSPLG 
HLPGCRSQAAVSHHRCWPLLPPPR\PSGPPPPLRSGSSPGG/PLPKG 
C * CQGGERPGQQHKEAGQGPAQQTWLHPQAPGGRRPSHTGWGGGAPG 
G * QRRNLA* RPLLP 

140 

A 

885 

1173 

LSQGPRRHSSAVQPPPHSHRGHHHDDCASPSQVRQNY\AINRQINVE 
LYASYIYLSMSYYFDHNDVALKNFAKYFLHQSHEEREHAK3CLMKLLH 
FDC 

141 

A 

170 

737 

IVTATCLWGSLVLTHSVFPFQSYYFDRDDVALKDFAKYFLHQSHEER 
GTCLRNLMKLQNPRG\GLIFLQDIKVNKRS*GCHTSSSGSVRVSEIT 
*TSNCPYKVMIHWAFAFCAFLGLPSKLN*ANSNVPFAKTWWLEMMGL 
LTSRLVGRDALTYSHS\KPDCDDWESGLNAMECALHLGKMVNQSLLE 
LH 

142 

A 

1 

809 

WGVSCFNSAWTEPGARSPPRPAAHSQPSVTSSPHPRTAPRPPPPLQ 
RREATAAGRRLSLVAAMTTASHLAGCAKNYHQGLKRPPINRQDQPWK 
LYGLLRFTLSHVLTNFDPRMMLAFERTFAKYFLHQFSMKEEGNHAEE 
T**KLQN\QRGWPEFFL\QDIKETQTCD\DWEERGLNA\ME\CALHL 
\ E KNVES S HLLELHKT.GQLDKND PP FC VTF I ET\ HY \ LNEQ \ VKGHQ 
R I WGDHVTNL \ R KM G S APNLAW AKYL FDHS TS LG \ D S DNE S 

143 

A 

776 

1115 

APGVPDKPRQQNE/ IPVSTKNM*LGVGDCLRGLRQEDHLNQEVPGCS 
EP * CHDQATALPAWATQQD PVS KKKKKKWWREARKGKPQ * GDGEKDS 
TTHSWL*RWRSLKSRITVSL 

144 

A 

1 

419 

MAAGTLYTYPENWRAFKALIAAQYSGAQGRVLSAPPHFHFGQTNRTP 
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EFLRKFPAGKVPAFEGDDGFCVFESNAIAYYVSNEELRGSTPEAAAQ 
VVQWVSFADSDIVPPASTWVFPTLGIMHHNKQVLEPSFRQAFPNT 

145 

A 

3 

1363 

HASGITMAAGTLYTF\SVNWRAFKALIAAQYSGAQVRVLSAPPHFHF 
GQTN\RTPEVLRKFPAGKGPAFEGDDGFCVFESNAIAYYV\SNEELR 

GSTPEAAAOWOWV^Ptt \ nQHTMDDG Q TWA 7 CD HPT r»\ tmtjii\ vtv/hiw* 
v?gi r&nnfty v vynvorM \UZ>LJ± V ir irMo 1 W Vr r i. LiKj \ XMHH \NK.QAT* 

ECKGRK*GRILGLV\DAYLKTEDFCWGANVERLSGITW\CTLLWLY 
KQVL\EPSFRQAFPNTN\RWFLTCI\NQPQFPAVLGGSETCVRRLAP 
FGA*KVLQRPQPKK\DTPR\KEEGFTGKKKQKPQAERKEEKKAAAPG 
XPEVEEMDECEOAIjAAEPKAKDPFAHLPIC^tfvt np*PirPTrvc>jT?nTT 

SVALPYFWEHFDKDGWSLWYSEYRFPEELTQTFMSCNLITGMFQRLD 
KLRKNAFAS VI LLG \ TNNS S S I SG VWVFRGQELAFPLS PDWQVDYE S 
YTWRKLDPGSEETQTLVREYFSWEGAFQHVGKAFNQGKIFK 

146 

A 

3 

454 

AVPGPGFGLSPTMVTLAELLVI J T,AAT.T.AT\7Qr5Y\ ttwc mnunrppci? 
ERVTSGTKMGL I FEAEDGG FLD ID WI TL PDR / RK I KPRLLKKKGQ * 

TYRSFMDVTFKLCYNLRMS WMNPNIRNHNHWLLLTS I KFLITQFRSS 
LSYLSSCIQSE 

147 

A 

90 

513 

VQGLGVERVPLGSHRGWMGPPRPLLSPQERASCLLLLLLPL\VHVSA 

TTPEPR\ ELDDEDFRCVLQLSPEPQPDWPEALHRASAVQA* ISAGGS 

HLQSSFL IGLRLLKTVTVLLWPLFVIiI CVYLS VYLPFRLCLDTLSCV 
V 

148 

A 

98 

440 

KDDTNTKCW*AWNCSSTRAH\WKRTLTLLG\RLLTMNIPHDSAIPLL 
GAHPT*MWAYVHQNPGTVMLTETLFMI ATNWTLHKCPQ* * KE* CNGA 
VTEWSAALKQNKLQLHVTPRV 

149 

A 

535 

917 

LVSPGKPPEQQGQLP*PRCQII*LVSPGKPPE/PTGTAPRSQPRLSV 
CPSTQDICRICHCEGDEESPLITPCRCTGTLRFVHQSCLHQWIKSSD 

TRCCELCKYDFTMETKT.KPT.PKTdFV'T nMTDDI?DD VTT?nn rrrrvn »iTt-> 

SPVWFGLCMY 

150 

A 

116 

830 

EGFPGRSLSGGLCCRLRRRPPTDnvP Pl*n?DDDucrrDor»tn>mroDMt? 

HKSWIESTLTKRECVYIIPSSKDPYRCLPGCQICQQLVRCFCGRLVK 
QHAWFTASPAMKYLDVKLGDHFNOAIEEW^VFKTITP , nQP r rnavr , VTKi » 

FQGGSHSYRAKYVRLSYDNQP\LVILQLTVKEWQMELPKXVISVHGG 
MQKFELHPR I KQLL * KGL I KAAVTTG AL I LTGGRNTG VG KHGGDAPQ 
RTC 

151 

A 

770 

950 

CHSEHRNYKNNHHS 1 1 KVRPRWR I HFHSNVI S * SLVH I S KVFVAYKC 
NQYFHIRKFRSVT 

152 

A 

1 

253 

RKHHKIiLEAISSLlX3KKRRKIiAEMSDVILMM*EFVVA 

153 

A 

1 

759 

WQTQISKTADELISYWGTSFPPPFAASLTLYELEYCITIDISLWKF 

ETSKYYATVIDAPGHRDFIKlsn^ITGTSQADCAVLIFAAGVGEFEAVT 

FAPVlTVTTEVKSVEMHHEVLSEALPGDmmFDVKNVSVKir\^ 

GQISAGCAPVLDCHMAHIACKFAKLKKKTGSTSGKKLEDGPTFLKSG 

DAAIVDMVPGKPMC^ESFSVYPPLSRFAVCDMRQTVAVGVIEAMDKK 

AAGAGKVTKSTQKAQKAK 

154 

A 

1 

921 

KFETS KYYVS I IDAPGHRDF I KNM I TGTS QADCAVL I VAAGVGEFEA 

GISIO^GQTHEHALLAYTLGVKQLIVGVNKMDSTEQPYSQKTYEEIVK 
EVNTYIMKIGCNLDTAAFVLtISVNVM"rPVK r QVT7TWTJT7&T cprt -orrwi 

VGFNVKNVS VKDVRHGNNAGDS KNDPPMEAAGFTAQVI I LNHPGQI S 
AGYALVLDCHTAHTACKFAELKEKIDRCSGKKXEDGPKFLKSGDAAI 
IYMVPGKPMCVESFSDYPPLGHFAVGHMRQTVAMGIIKAEDRKAAGV 
GKVTKSAQKAPKPSEYYPLYLPPQS 

155 

A 

1 

424 

WQTQISKTADELISYWGTSFPPPFAASLTLYELEYCITIDISLWKF 
ETS KYYATV IDAPGHRDF I KNM I TGTSQADC AVL I FAAGVGEFEAVT 
FAPVNVTTEVKSVEMHHEVLSFJU^PGDNVDFDVKNVSVKEVHHGNVA 

156 

A 

3 

1371 

INIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAAEMGKGSF 
KYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPGHRDFIKN 
MITGTSQADCAVLIVAAGVGEFEAG\ISKNGQTR\EHALLAYTLGVK 
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SGWNGD1WLEPSANMPWFKGWKVTRKDGNASGTTLLEALDCILPPTR 

PTOKPLGLPLQDVYKIGGIGWPVGRv^TGVXI^ 

E VKS VEMHHEAIjGEAIj PGDNVG fnvknvs vkd vrrgnvagdskndpp 

MEAAGFPAQVIILNHPGQISAGYAPVLDCHTAHIACKFAELKEKIDR 

RSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPPLGCFAVRD 

MRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK 

157 

A 

1 

92 

TSHQSHCSTFLTVSKW* *LKTAYCLYHYHS 

158 

A 

232 

552 

S LH * PRMATQRKHLVKDFNP Y I TC Y I CKG YLI KPTTVTECLHTFCKT 
C I VQHFEDSNDC PRCGNQVHETNPLEMLRLDNTLEE 1 1 FKLVPGLRE 
QELERESEFWKENKPQGNGQDDTFKSLTNRK 

159 


44 5 

673 

RECLH* PRMATORKHLVI DFMA Y ITCY T PKf! YT. T KPTTVT\ RPT/HT / 
FCRCMEAFPSLLIxA 

160 

A 

22 

849 

WIERDLLNCIKRLK/PTTNNMLNDEIVNISPKIIKIRQGYLLSMILF 
G I VQKDLTRKLMQGRETKG I E I R KEVKL * KRKR I * I S I CRCHE * I W * 
VPCIKVT^QKAFYDIPAKITOENEILKKQCHFKDPSSA*REKMRLICFE 
ELYPENKITKEERDR I / RRTI SKTjL.L.FPKFHtiOP *NPP OV^TjMTjN * Ci 
ANF * EF I C I FQKS \ KI VKAI L * NGQRGLKFLNI KTCYKAI EIMKVX I 
WHKD\ KKLD * WNS IQVSKVT)PRVYHHLS FEKGDIEV* WGKGCSFQ 

161 

A 

796 

1381 

S LTSDKRQWALGS^KFKKHW I CH/ P I KMLHLGVG PKGLS VTQQFS S 
QN * FPVCAFQS SQYI PLPAQKLMYSQ * PVQNRCQSNQT I PVAHPMSG 
HQVSHHR\PANNPGKEEVPGQEGKN*1S^ 

TWPERNL I IxALKVP I KAELKRS WAS SNCKQT * LRHPQCDDVF I SMKE 
QSMEKCRNF 


TV 


T "1 1 Q 
J. ± J. fc> 

LPKIPPLEELKExTiLTHKDSHKLKIKGWKKAFIjANGHQKQTGEAILI 

PDKTKFKATAVTCRDKEGDYIMVKGLV^ 

KQLIjIDLFJJEIDSNIVIVGNFNTPVTAIDRS^ 

eqmdltdiyrtfhpttaeytfysx^etiskidhmighkmsintfkk 

ieimsstxVtdhsgiklkinsernlqn^^ 

km e 1 1 ql f e lnnnndtt y qnlwdtakwlrg kf i alna y i kkterak 

kti yshtsgi *kqeqtkpkpsrrkei ikiraelneietkkti 


/i 

A 1 

^ JL 


Ji-K 1 x 1Y1 l\XnxjJ\J\.yi\.yo lyAl^oKrlNAv v irtlX o XXj lixN VINoXxN Vi j Xj1vK 

YRVA * W I R 1 YQQT I CCLQETHLT PNT \ KDSHKLKVKGWKK\ AFHANG 
HOKOAGVA T 1YTSD1CTWFKAT AVKKTjKFR HY T MVKT5T .nnDFW TTTT.TJT 
YAPNTGAPKFMKQLLID VRNEIDSNTI IAGDFNTPLTAIjDRS SI QKV 
iSnvliTiTOLl^TLEQMDLKD I YRTLHPTTAE YT I Y S TVHGTFSKIDYMT 
GHKTSlxNKFKKSEIISSSLSDHSGIKLEI^ 

LLNEHWVKNKI KMEI KTF/ FELNDNNGTAYQl^WDTAKAVLRGKFTA 
LNAY I KKYERAQTDNLR SHLKELEKQQTKPKS S RRKKK 

164 

A 

265 

446 

LFCLGDTSLEAILGYLPTOLV^TRSGEPQSDDIEASRMKRAAAKHLI 
ERYYHQLTEGLWK 

165 

A 

283 

398 

NWQEKCTFQI IGGRKRMSFRI ILINFFHN*DRTVCYVP 

166 

A 

283 

397 

NWQEKCSFQI IGGRKRMSFRI ILINFFHN*DRTVCYVP 

167 


2 


SEASSSSTGDSSQGDNlsnJQKx^PD^^ 

I AFLNAliVYLS P1WECDLMYHKVYS QDPNYLNLF 1 1 VM ENRNLJi S PE 
YLE^lALPLFCKAMSKLPIxAAQGKLI^ 

TY KVI SNE FNSQNLVNDDDA I VAAS KCLKM I YYANA YAVTKNLGIiYY 
DNR I RMY S ERRI TVL YS LVQGQQLNP YLR L I VRCDH 1 1 DDAL VRLEM 
ITMENPADLKQFY/RGI *RRTRWVAAFWDRASEPKANS IGFGGSQLW 
MPTPVASYT 

168 

A 

966 

3172 

SVLLEEFLSSAEALVQSFRKVKQHTKEELKSLQAKDEDKDEDEKEKA 
ACSAAAMEEDSFASSSRIGDSSQGDNNLQKLGPDDVSVDIDAIRRVY 
TRLLSNEKIETAFLNAL VYLS PNVECDLTYHNVTSRDPlSrYxxNLFIIG 
MENRNLHS PE YLEI^IALPLFCKAMS KLPLAAQGKL I RLWS KYNADQ I R 
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RMMETFQQLITYKVISNEFNSRNLVNEFNSRNIiWDDDAIVAASKCL 
KMVYYANVVGGEVDTNHNEEDDEEPIPESSELTLQELLGEERRNKKG 
LRVDPLETELGVKTLDCRKPLIPFEEFINEPLNEVLEMDKDYTFFKV 
ETENKFSFMTCPFILNAVTKNLGLYYDNRIRMYSERRITVLYSLVQG 
QQLNPYLRLKVRRDHIIDDALVRLEMIAMENPADLKKQLYVEFEGEQ 
GVDEGGVSKEFFQLWEEIFNPDIGMFTYDESTKLFWFNPSSFETEG 
QFTL IG I VLGLA I YNNC I LDVHFPMVVYRKLMGKKGLFVDLGDSHPV 
LYQSLKDLLEYVGNVEDDMMITFQISQTNLFGNPMMYDLKENGDKIP 
I TNENR KE FVNLYSDY I LNKSVE KQFKAFRRG FHMVT1TE S PLKYLFR 
PEEIELLICGSRNLDFQALEETTEYDGGYTRDSVLIREFWEIVHSFT 
DEQKRIiFLQFTTGTDRAPVGGLGKLKMIIAKNGPDTERLPTSHTCFN 
VLLLPEYSSKEKLKERLLKAITYAKGFGML 

169 

B 

1 

2187 

MAGKASESWRKVKXiTSC^VTRENEKDAKAETPDKTIRSRETYYHKN 
SM WETAPM I Q I I S QG VTPTTHENYGST IQDE I WCLTNFCLDDMLS FV 
LES CTNHCAYCLNVW YRKRAAAKHL I ER YYHQLTEGCGNEACTNEFC 
ASCPTFLRMDNNAAAIKALELYKIN7UCLCDPHPSKKGASSAYLENSK 
GAPNNSCSE I KMNKKGARIDFKDVTYLTEEKVYE ILELCREREDYSP 
LIRVIGRVFSSAEALVQSFRKVKQHTKEELkSLQAKDEDKDEDEKEK 
AACSAAAMEEDSEASSSRIGDSSQGDNNLQKLGPDDVSVDIDAIRRV 
YTRLLSNEKIETAFLNALVYLSPNVECDLTYHNVYSRDPNYLNLFII 
VMENRNLHS PE YLEMAL PLFCKAM S KLPLAAQGKL I RLW S KYNADQ I 
RR^lMETFQQLITYKVISNEFNSRNLVNDDDAIVAASKCLK^f^^YyANV 
VGGEVDTNHNEEDDEEPIPESSELTLQELLGEERRNKKGPRVDPLET 
ELGNHCTLDCRKPLIPFEEFINEPIjNEVIjEMDKDYTFFKVETENKFSF 
MTCPFILNAVTKm»GLYYDNRIRMYSERRITVIiYSLVQGQQIiNPYLR 
LKVRRDHIIDDALVRLEMIAMENPADLKKQLYVEFEGEQGVDEGGVS 
KEFFQLWEEI FNPD IGMFTYDESTKLFWFNPSS FETEGQFTP IGIV 
LGLAI YNNCILDVHFPHGLSTGS * 

170 

A 

270 

498 

MH FLKAGRGGSRL * SQHFGRPRWADHE/ RSGDRDNRG * HGETPSLLK 
NTKKN* PGTVAGALVASTREAEAGEWREPG 

171 

A 

2 

423 

PAWLSRFTCAATYIKMPEE*ETHYQPMEEEEADTFTY/EAKMAPLML 
*IINTFYSSKEISLREL.ISNSSD/AK*LINPQSDFGFRVNLLGVLNA 
* IN I FC * QALDK I RYES LTDP I KLDSGKELH INL I PNKQDRTLTI VD 
T 

172 

A 

1 

267 

CSAGGPWRAPQPRRFHRRRRPAQLPPPLPLPPLPASPRIHNRTF\PR 
PSQRTPPPAALGCPEPGS/RSQGRGHARPPGSGEGDPTVSSPGY 

173 

A 

218 

431 

FDWWWQIQRAEPEFFPPFLAQLGSGKAPLSGLKIAARPDGSDAACLR 
PRLSAPGPQCSADGPWRAPQPRRF 

174 

A 

9 

201 

LLWQELLARQPSSSIPAIPFPPFPFFKTALQLWGLQPASGFLTLLVP 
SFRRGKAVEQLLHSWQ 

175 

A 

1453 

1936 

EVEKHLCQG * ELLRAQHN * AAACRRPRPPAPGPQCSAGGPMARAPAP 
QVPPPPPPCSA\PPPLPLPPLPAS/HAHPQPHFRHGRRSALLPRPPW 
AVR S RGALAG PRTRAAAGLRGGAGAAPAPADARFPAS S PAE * PKFPQ 
NSARALTGFPRCTDPTVSSPGY 

176 

A 

5 

712 

FEALRM I GHLFAKS P YHGKINS AKI VARTNI KLMVKWMWKNGE 1 1 D 
TLQ I VYGDNAPKKS VAVYKC I TSLRRSKVIiDEACSSRPVTS I CKGK\ 
INLVYANISIGSAYTILM/EKLNLSKLSTHWMPQ/PVHPDQLKTRAK 
LSME/ ILNKWDQDPKGFL * KI VTRDRTWLY* YTFEDKAQSKQWLPRG 
GSGPVKAKAGWSRAKVTCA/TTGFWNAQIVIiLVDFLEGQRTITSAYES 
ILLKKK 

177 

A 

1 

152 

PGAMAVLLETTLSDVVIDLYTEERPRGEA*APLTCRRGPRACLPTFP 
SLR 

178 

A 

1112 

2085 

RHSHAVQKKPLSGGGAGAMAVLLETTIiGDWIDIiYTEERP/RCQLYG 
DQASFFEAEKVPRIKHKKKGTVSMVNNGSDQHGSQFLITTGENLDYL 
DGVHTVFGEVTEGMDIIKKINETFVDKDFVPYQDIRINHTVILDDPF 
DDPPDLLIPDRSPEPTREQLDSGRIGADEEIDDFKGRSAEEVEEIKA 
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EKEAKTQAI LLEMVGDLPDAD I KPPENVLFVCKLNPVTTDEDLE I IF 
SRFGPIRSCEVIRDWKTGESLCYAFIEFEKGGGAYGKTPATRPFYGS 
WPFAGLLLTCSFLRYPLILWITVLPPLSELIPLAAAERPSAASQ 

179 

A 

3 

492 

GGGAGAMAVLLETTLFYVAIF\LYTEKRPRACGNFLKLCRIKYYNYC 
L IHNVQRDFI I QTVDTTGTGRG * ES I FGQLYGDQASFFEAEKVPRI K 
HKKJCGTVS1WNNGSDQHGSQFLITTGENLDYLDGVHTVFGEVTEGMD 
IIKNINETFVDKDFVPYQDIRIN 

180 

A 

1 

451 

MGFRHVGQAGLELLTSGDLPASAYQSAGITDVSHCAQPASPLSYFLQ 
ALKHEFWRHIiTPGHLDTQTPDTKKPGHPDTQTLDTQTPSHLTSRHP 
DTQTADTQTPDTQ\ NLTPGPPDT* HPDTWHLTPDTQTPGHPTLRHPD 
TQIPRHPET 

181 

A 

143 

647 

LRSRCVQIQGSPATEPVSGSHCADTGLVIRGGALSAHAIAPGQRLSH 
ALHTASAYINSGRMWDTVHLPQKRCVRPRPQGRVRTPRTRATH/NRV 
VGARRGTPQRYTG/ WGRD * EPSLSQLPQNGDLLAARRREHPACSTGC 
TSGARVRSRVWRAGQALVPGCAGCAYILH 

182 

C 

225 

335 

MLARLGSNSWTSSDPPTSASQTAGITGVSHRAGPLT* 

183 

A 

1108 

1627 

PMDQVMCRT*KG*MGGQRDVSPTASEQVSTARPGPRAVIDYSKADAW 
AVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPPDVRQ 
LVRALLQREASKKPSARVAANG\LHLSLWGEHILALKNLKLDKMVGL 
APPKIGRHFVGQTGSQEEVWLETKNEDALFG 

184 

A 

56 

335 

TGFCFFSRVKCNGTILGPCNL*IS/ GSKYFSGL/SLPSKWDFRFAPP 
RPGNFLFF * * TRFS PVYQDGFDFLTS I CPPLGLPKLLEFRGAPPLPS 
E 

185 

A 

523 

818 

SQHS VG PRQADRLR SG VRDQPGQHGETP S I LK\ IQKLPGRGGACL * S 
QLLGSLRRENCLNPGGRGCSEPRSHHCS PAWMTE *DS IS KNK* INKN 
EIKKK 

186 

A 

49 

1357 

RTPERCLREVGKATGWPEC I LT * QT 1 1 PRP/ YPS VGTAASDTKKKKI 
NNGTNPETTTSGGCH S PEDAQVQTI R I LTCQKTELQMALi YYS QHAVK 
QLEGEARDLISRLHDSWKFAGELEOAT J < ?AVATOKKK'anPYTP*PT.TTrR 
RDAXiSLELYRNTITDEELKEKNAKLQEKLiQLiVESEKSEIQLNVKELK 
RKLERAKLLLPQQQLQAEAI5HLGKJBLQSVSAKLQAQVEENELWNRLN 
QQQEEKMWRQEEKIQEREEKIQEQEEKIREQEEKMRRQEEMMWEKEE 
KMRRQEEMMWEKEEKIRELEEKMHEQEKIREQEEKRQEEEKIREQEK 
RQEQEAKMWRQEEKIREQEEKIREQEKKMWRQEEKIHEQEKIREEEK 
RQEQEEMWRQEEKI REQEEI WRQKE KMHVRTDE I S VCS I FQGF I SVG 
FLCKFAYPFDCFTL 

187 

A 

1 

408 

ASDRPESRATHASGKS PVFSDEDSDLDFDISKLEQQSKVQNTGHGKP 
REKSIIDEKFFQLSEMEAYLENREKEEERKDDNDDESVKSSRNVNNK 
DFFDPVESDEDI ASDHDDELGSN/ EDDEI AEEEAEEGS I SEI 

188 

B 

1 

1995 

MSKSTRQSKLAEAJCEQLTDHHPQTNPSVGTAA5DTKKKKINNGTNPE 
TTTSGGCHSPEDEQKASHQHQEALRRELEAQVHTIRILTCQKTELQM 
ALYYSQHAVTCQLEGEARDLISRLHDSWKFAGELEQALSAVATQKKKA 
DRYIEELTKERDALSLELYRNTITDEELKEKNAKLQEKLQLVESEKS 
EIQLNVKELKRKIjERAIGjIjLPQQQLQAFIA^HLGKELQSVSAKIiQAQV 
EENELWNRLNQQQEEKMWRQEEKIQEWEEKIQEQEEKIREQEEKIRE 
QEEKMRRQEEMMWEK^EKMRRQEEMMWEKEEKmRLEEMMWEKEEKI 
RELEEKMHEQEKIREOEEKROEEEKIREOEKROEOEAKMWROPFK"TR 
EQEEKIREQEKKMWRQEEKIHEQEKIREEEKRQEQEEMWRQEEKIRE 
QEEIWRQKEKMHEQEKIRKQEEKVWRQEEKMHDQEEKIRKQEEKMWR 
QEEKIREQEEKIREQEEKIREQEEKIREQEEMMQEQEEKMGEQEEKM 
QEQEKMRRQEEKIREQEEKIREQKEKIREQEEKIWEQEEKIREQEEM 
MQEQEEKMWEQEEKMCSRKRRCKNRRRKYVFARI IQRVYLLNHI IRN 

RKSVQILVLSCSDFLNSKNSHLLPDTLMLEDKAYLQIESCLIPNEEE 
YQYLAE* 

189 

A 

386 

1321 

RTLERCLTEVGKATGWPQCILTYARRSLAPVLLGSH/HGRGLQTPGK 
LWSWGKSEEQECEEDGSETETGGQEDLEDLQEEEEVSDMGGDNPEVG 


^10 
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KKARNSSKFELRKSPVF\SDEDSDLDFDISKLEQQSKVQNKGQGKPR 
EKSIVDDKFFKLSEMDLYLENIHK\R*EERKDDNDDELRDSPPQTSP 
SVGTGATDTKKKKINNGTNPETTTSGGCHSPEDAQVHTIRILTCQKT 
ELQTALYYSQHAARQLEGESRDLVSCLHDSWKFAGELERALSAVTTQ 
KKKADRRQTGAESAAVGELGDTGDTVGSEHWT 

190 

A 

792 

1061 

GSGV*DQPGQHGKTPSLLKIQKLAERGGGHL*SQLLRRLRQENHLNP 
GGRCC\SEPRLLHCTPAW/VNESKTSSQTNKKISQEWWCVPIVL 

191 

A 

2 

343 

LLFFFFFEMESCSVTRLECSGVISAHCKLC/LPGFKRFSCLSLPSSW 

dyrrm/ pprlanflyf\ vemgfhrvaqaglklls SGNLPASA/ F / PK 

Y*NYRRDDASIiAACSTFLGLGLLWV 

192 

A 

2 

449 

KEEIIPIL*NLFQNIKAEGILRISFNEARITLI/PKPN/RAITRKIN 

PIDQSLMDRHAEILNKISAN*IR*RMKRIIPHGQVRFS*GMWGWFNI 

RKQINVIHHTHSLKKKNHMIISINAEKEFDKIRKLLKLRNIYKRGI* 
LT**VMVRNS 

193 

A 

128 

365 

VHTWMLSSP*GPQPGVFHAQIRGCPFLSP*R\CQFQVFSLFYFDLLW 
VFTILFFFLEAEYHFVARLECSGLISAHCNLC 

194 

A 

233 

598 

PLLLPLTRRLLLQHGLACQRAGLLHGPGRLQCQLHSHGHCHLDQSGP 
AVAGAARPRLLCGP I ADQS PS PFRS PRPLRWGLLQG WRCEDHDRRPS 
AEHWQEGQGGTVI SRRRREKENPKGKE 

195 

A 

38 

.1222 

EPESCSVTRLECSGVISAHCNFR/LPGFKNFPASASQVHGTTGTPHH 
AQLI \ L YF * VEEQGFPPMLiAPGW I LGS P * PLMDPAPSLALPQSAGDP 
QP* AHPHPGPS YLFLFKERENYERPKI I *LNPLPLAQGKKMEFALI\ 
WVMKHTQ*IRNNHIFSKRQK/C*DL*SYMVAYFWVGKK*EKNITSIiT 
GNDTF* *KIL»IiSFPTRMIHCKAKIIY/ IAKFFFWRRSLTSVTLGWSA 
VWHNLSSLQPPPSGLKRLSHLNLPNT/ WDYRPPCPANLCFVVVVVVL 
LFVFW *RWGF \ TMLARL ISNS * PQ/ CDPPTSASQSAEITGMS /HPCL 
AMG FVFHLTL * KPPFFKD YMKS FFQFFKYL I QG * CS LV * GVRYRSS L 
IFFF/ CF FETES CliVTQAGVQWRDLGSLQS 

196 

A j 

526 

835 

FNLNFTVSLRTHSPLIPFFSSNERIKPGKSTIDGPWTRRTRL*RKNL 

WMIQ*LWDFLF/VLFETDSSSVARLECSGAISVHYNFHLPGSSDSPD 
SRSMPIVDRQ 

197 

C 

7 

276 

MQWRDHNYCIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLARPFI * 

198 

C 

2 

28 

MIDTHTPS* 

199 

A 

780 

889 

NKRNQGSSRVGTHL*SQLL/RRLRQENHLNPEGRGCS 

200 

A 

224 

552 

KVLRVARCGGSLLSRPSTLGGQGGRLMRSGVRDQPGQHSETPSLLKM 
QK\ LAGHGG/RAL * SQLLGRLRQENHLNPGGRGCSELRS RHCIP\ SR 
VTV* DS I S KKKRRKKVL 

201 

A j 

41 

320 

VRGI PVTDQGELEKEREPKIQRLVTPRVLQHKRRR I ALKKQRTKKNK 
EEAAEYAKLLAKRMKEAKEKRQEQIAKRRRLSSLRASTSKSESSQK 

202 

A 

80 

518 

VGPLTPNSVLPRGSV/LPTEVGLDALGEGLKGLCGPNPWETTNQGF 

PMKQGVI»THGRVR*ET*PQSTQDSASCYSTCPAAQTAAYCSE/ASSV 

PRKIKKRLQNMLNFWPRE*RRLRRSARNKLRRDADFPLCELLLLSLN 
PVRNKIF 

203 

A 

63 

552 

VGPNQWWNAQPRFPHKQGVLTPWPCRLLLSKGPSLSRPKETEPRRLG 
PKRASRVRKLFNLSKEDDVRQYVVRKPLNKEGKKPRTKAPKIQRLVT 
PRVLQHKRRR I ALKKQRTKKNKEE AAE YAKLLAKRMKE AKEKRQEQ I 
AKRRRLSSLRASTSKSESSQK 

204 

A 

1 


£> V KJMGRG xWG I SGGTTNKVS P * SRVS * PMAVSACY* VKNGRGYW 
G I SGGTTNKVSP 

205 

A 

379 

540 

IKKVRNLGPKHPRFSVLLLHVSCSTNGGVLL*RSSVPRKIKKRLQNM 
LNFWPRE 

206 

A 

287 

449 

IKKVRNLGPKHPRFSVLLLHVSCSTNGGVLL*RSSVPRKIKKRLQNM 
LNFWPRE 

207 

A 

263 

525 | 

IKKVRNLGPKHPRFSVLLLHVSCSTNGGVLL^RSSVPKKNKEEAAEY 
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AKLLAKRMKEAKEKRQEgiAKRRRLSSLRASTFKSESSQK 

208 

A 

68 

280 

LDSDLEVEARREESLRLFAICSWRFSLASFILLAKSLAYSAASSLFF 
LVRCFFRAIRRRLCCRTRGVTRR 

209 

A 

465 

1444 

KKGEKDYSWTDLILTVPSPPGPQKELARI\GQTFPISLKKMMSASML» 
*ESPLNKGG*ET*GPKAPKIQRLVTPTCSSSPNGGRIALKKQRT/TR 
KI KKRLQNMLNFWPRE * RRLRRSARNKLRRDADFPLCELLLLWPCPP 
ATE 

210 

A 

87 

904 

c 

GGWVAPETPSDPWFPVQRGVGDASVPGIGKNWCEKAATSVDAFRMV 
TASRYYPQLMSIiVGNVLRFLPAFVRMKQLISEHYVGAVMICDARlYS 
GSLLS PS YGWFC\ DELMGGGGLHTMGTY I LDLLTHLTGRRAEKVHGL 
LKTFVRQNAAIRGIRHVTSDDFCFFQMLMGGGVCSTVTI4NFNMPGAF 
VHEVMVDVPLLYLKGIWYMVQAIjRQS FQGQGDRRTWDRTPVSMAAS f 
EDGLYMQSWDAIKRSSRSGEWEAVEVLTEEPDTNQNL 

211 

A 

217 

625 

GDELVKREAGMKMLPGVGVFGTG/ELPPEVLVPLLRAEGFT\VEAIiW 
GKT * GGRRKALLKEMNI AF\ HTSRT YD I LLHQD VDLVC I S I PPPLTR 
Q I SVKALG I GKNWCEKAATS VDAFRMVTASR YYPQLM S LVGNV 

212 

A 

858 

993 

LKS PQRPGMVAHACNHSTLGARGG W I T/ R * GQEFKTS LANMAKFHL 

213 

A 

1 

126 

FRAWTRSRSERRRRRKRSSSGSEKFDSKK*KKNSSRKKQHEE 

214 

A 

90 

945 

S PLADTFL S SGLGRTAS PRPKAEARGTGRVRGLRRAVRAS PGRMGRS 

RSRSSSRSKHTKSSKHNKKRSRSRSRSRDKERVRKRSKSREK*\RNR 

RRESRSRSRST/RHWPCPGASGTGSAPRPRPTASTSSGARRASAGSL 

DEKQKARGGGEESGVPSGSEKFDSKK*KKNSSRKKQHEE*KNW*QKR 

VEEELEKRKDEIEREVLRRVEEAKRIMEKQLLEELERQRQAELAAQK 

AREEEERAKREELERILEENNRKIAEAQAKLAEEQLRIVEEQRKIHE 
ERMK 

215 

A 

739 

857 

IIPPSPNSNQLNPP** *FPGNYY*YFNITQINKDNYKTE 

216 

A 

263 

425 

F AFQKFY I YTYF * KHTL/ WLGAVAHGCNPS TLGGQGGQ IT* DEEFET 
TLDKMVK5 

217 

A 

2 

660 

FVTRPDAGVMRLPLVLLLAVLLI^VLCKVYLGLFSGSSPNPFSEDVK 
R P P APL VTD KEAR KKVL KQAF S ANQ V PE KLD VW I G S G FGGLAAAA I 
LAKAGKRVLVLEQHTKAGGCCHTFGKNGLEFDTGIHYIGRMEEGSIG 
RFILDQITEGQLDWAPLSSPF\CIMVLEGP/KWPKGVPHVQWRESLH 
SGPQG/ GSFHRRKLSLTSI*SWLRWYPVEPLMPSC 

218 

A 

151 

416 

ERERESVCVCMCVRVCVCVCLRERERQRQRDKRLCGEMRKIQYVNKL 
HYI/SKYLYIYKYVILQT*TFIHIYIYLYK*ICIHTQMYAVY | 

219 

A 

4 

328 

HG VS LLFPRLECNGT I S AHHNLRLLS * SD S PAS AS QVAG I TGMHHHA 

RLIL/YFLVEMGFLHVGQAGLELPTSGDPTLPWPSPKCWDYRREPTS 
PGPQTYIITQSKSSP 

220 

A 

436 

111 

IYSYGKTGPV*KRKKCQSRIKEQNPIICWIjQATHLKCKETYKLK/MR j 
KI CQANTNQNEE I G I L * NS RL Y S KNY YQG * * * KG S I LQEDMT I LNM Y 
APKNRGSGYMRQKLIELQGE 

221 

A 

212 

275 

FLILNCNSF*KIYPKYKSLIR 

222 

A 

770 

1573 

NHIiEIVNLCNVGISQSFPQI*LKAQATKEKINCIPIKLKTFG/TEKY 
IIKNVKRQYIE*QKISANH\LSDKRLVFRIYFLKTITVQY*KDKPII 
K*VKDLNR/HFSKEDT/YT^TNKYMKRC*KSSTIREM*IKTTMR*TLP 
HYRWNIIKKTDNNNC* *GCG*TGI\LIHC\WL»ECKIMYP/IFRTV*Q 
FPEKVKQHYPSDAAILLLGIYPRNMK/SKCLPKNYTRMFITELLIIL 
KK*KQLKC/PIN* *W/ IK*NDKMW* IHMIKYYSAIKRHRVLINA 

■59-1 


06 

428 

SPQIRKQEISI/ SLLTDGTILYIENPKBSPPPKLLKVIKFNKATRFK 
INMQKSWFLHPRN/ QFENEFKKILSFGTEPS/RSEYLGINITKEV* 
NLNTKNDIVEKMKELIiSKWKDSPC 

224 

A 

35 

397 

FLTGTQWGK/ DS PFNTWC * DH/ WN IHRNLDL YL I P Y I KI NLKQLTG P 
NLRAKTIKLPEQNIGENLCDL*LSRERYSTKSITRIRKLi/DTLGFIK 
I KNKC I S KDTIRKR * ATDWEKI FT^HVLKG 

225 

A 

832 

1043 

LLWW FRLR S S PCGTQQGPWHL VC VYG YAG \ GWAGP P WHTC PWD PSHC 
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PGTE * CP WCVWFCFFCNTYLLVPN 

226 

A 

764 

1016 

LLQAGAHACNPSGFGEAKAGGSTDLRSGDQPGQHGETPSLLKVQKVA 
* RGGAPL * S QL / LRGGGGC S E PRSHD * I P AWMTEGDS V 

227 

A 

748 

1488 

RLTKRPETFPRRHPKLPTLKTPVAGATPLSPSHPDWPVFNPIPLTNR 
T*PPLQLQLRQPSRSCPETTQGPAISSEAELPVRSPPPPFRLFPTPE 
/ PGLSRQW P * EVEGTSRTPFFKAEGQK/ EGKVKEGKTEGPCS PSRRT 
RILPPYSRLESGPVHWGEQGPNRKSWPPPNLPDQGF*QEREEAMGHI 
PSPPGIRQRHRTSVQE*GSEIKPK*CPEPGKAFNPRPWGLGLT/TRP 
LPLHSQKSRQDS *NTG 

228 

A 

34 

478 

LSPRLECSGAI*AHCNLRPPGFKQFSCLSLS\SSW\DYRHAPPTPQL 
I W F W Y FK* R PG FQLVLVQ AGLEL \ LTL * I P P AS AS QKC \ WD YQ V * AP 
H/ wpalykvs Y * EKWFP IMFFAPFS WLSLGTEKFVRKVKVAGLGTLP 
TGWALCSLWKY 

229 

A 

28 

359 

KGKIPWLCGI *PQNEKLM*HSKSLATVFCIYR/TN*KNKRIISINAE 
KAIDQI QHTFMNFLKNF/ MKTVRKLKMDGS ILNIIKEI CER PVTNM* 
DGELLDDFPKIGDEAQRPP 

230 

A 

1216 

2019 

HPRGAKYPETNPGFVTLSYVGPPDSRQLTHSRPSRSENACGHRPGEL 
NWLNCSGCLLVPRGKRVWTETCNLVWKTEASGPPPGDPLPSLSSPIT 
HTQARPT/ DTQAHCPAPCPPH* PLALNPQQNFCPEESAAQVGQGEED 
RTSRGWWHGSQPQVGAGK*PLSTGSKAQGARIjTGGGVPSPRLR*EQA 
LVRCRCRDCRLGSCVRSHSLKF1DEAGEAPSAAGLDGQGTGRAPGAP 
EARCPRPAESGEGPGSGQAPAEQDPQPRAALRA 

231 

A 

5 

399 

DGVSLV/ S PRLE YSGA I S AHCKLRLPRFK* FSCLS I PS S WDYRHVPP 
PPANSVFLVKTGFHHVGQAGLELLTSGDLPTSA/ FPKCWDYRC\DHC 
AWAKD*KFKCVSVGRRGGPGEAVHQNSNRLSYWWEDGGKTG 

232 

A 

481 

527 

AAGTYS L * IGKE * VKLS LS VDSM I L YI DNLKTLPRKTLTTDKFSKI S 
EYNIS\VF*CINDEKAEKEIRKTIPFTIIKR*NTSDLTKKVKGLYKG 
NYKTLMKEINXWKDIPCA* IRRINIIKIVYPK*T*FKAISIKII»LSF 
VTEIE\KKIPKFI*NQKRIRIAKATLSCWNLFPMNSWHNRC 

233 

A 

20 

129 

SAALHLFSAPCSMGAGPSLLLACDSVDLPVAASKGL 

234 

A 

16 

370 

ACRIAAI^YLDIiLRTWFQVGSSGSQRDVNCSVMGPQEKJCVVVYLQK 
LDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKDVTDFISHLFM 
QLGTVG I YDL PHLRNKLAMNRRWG 

235 

A 

1 

1125 

MASRGVHAEDWMQDPWWHQCADEQI VTDCHWVI KGTHRS EPSSMKVK 
RIIPIS TNWFYRI QTS S LLNKLWKD SD YE VLNCRAFEEKG PQEKKW 
VYLQKXDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGS 
KVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNIAHLKNS 
QEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSG 
GHSRKIPSKYYKHLYAKXLENLEEMDICFLDAYTLPRL1NQEEVGSI1NR 
SITSYESEAVINSLLMKKSPGPDGFTVEFYQRYKEELIPFLLKLFQT 
IEKEGLLPNSFYEASIILIPKPGRDTTTKRKLQSNIPDELRRKKSQ 

236 

A 

426 

2173 

MRAQTHNRAVDIALGTAGAHLEDPGAQQGQDRCSWQVCRSARRRRRR 
EAAGGRRAARRAQHPCCRE PS GNQSLS AHLHTG ATKG PAS S LLHEHL 
WKPIARSARLPGNSSPLFSAPCSMGAGPSLLIiAALLLIiLSGDGAVRC 
DTPANCTYLDLLGTWVFQVGS SGSQRDVNCS VMG PQEKKVWYLQKL 
DTAY\DDLGNF\GHFTIIYNQGFEIVLNDYK\WFAFFKYKEEGSKVT 
TYCNETMTGVAm\DVLGRN\WFFTGKKVGTASENVYVNTAHLKNSQ 
EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGG 
HSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQAS 
CGSCYSFASMGMLEARIRILTNNSQTPILSPQEWSCSQYAQGCEGG 
FPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYV 
GGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRD 
PFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIR 
RGTDECAI ES I AVAATP I PKL 

237 

A 

3 

463 

GTRFLHSFLSSLKHGCGPSLLLAALLLLSPGDGA\VRCDTPAN\CTY 
LDLLGTWVFQ\ VGS S \ GSQRDVNCS VMGP\ QRKKK * WYLQK\ LDTA 
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\YDDLG\NSGHF\TIIY\NQGFBIVLNDYKWFAFFKDVTDFISHLFM 
QLGTVGIYDLPHLRNKAGY 

238 

A 

12 

362 

DRVSLLSPRLECNSAIPAHCNPCLP/GFKQFSCLSLPSSWG*RCLPP 
RLANFLY/SLVEIGFHHVGQAGLKLLTSCDPPTSA/FPKCWDYRHEP 
Q/WPDL*LTSF*GYRYEDSFDFLPKVPI 

239 

A 

3 

155 

SHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFTDRANLGTLRGYYNQ 
SED 

240 

A 

3 

423 

GSTHASARMAVMAPRTLLLLLSGAIiALTQTWAGSHSMRYFYTSGSRP 
GRGEPRFIAVGYVDDTQFVRFDSDA\ASQRMEPRAPWIEQEGPEYWD 
QETRNVKAQSQTDRVDLGTLRGYYNQSEDGSHTIQIMYGCDVGPDGR 

241 

B 

1 

540 

M PVNKH KRHDT I AES L I TPNVTTD S VKDC I KELGNRLFJLiWTQGCS SH 
VATVAPPTAPRGLRVRSAVRPLPLGGWAAAMEGEPPPVEERRRLQEE 
LNEFVESGCRTLEEVTASLGWDLDSLDPGEEEAAELHQIRCQESWEF 
ASSPFCAIiKLDTSDWDSTNSRIIKGCHQEWTGNSCGH* 

242 

B 

232 

387 

MEGEPPPVEERRRLQEELNEFVESGCRTLEEVTASLGWDLDSLDPGE 
EEAAE* 

243 

A 

1124 

1396 

ACDDWNRMDKSHLHPPPPPFPRDPDYVGPS PGTRSPPPTSGSRSPRN 
RSAPPAPPAAPPQVQARPP*H\SPPDKRKRPNCRTRGSPPWFRS 

244 

A 

171 

514 

CTVGVRCTVFPTEHKLPAVESHLI VSRSESVINKI *ANQIWEH/ IKK 
IIHHEEVGFILGMQGWLNIRKAPTEIYHINR/SKDKNHMIILT/D*E 
KAFDKI * HYFI LKTLDKLG I * GNF 

245 

A 

1 

293 

FLRWSLALSPSLECSGAISAHRNL/RLPGSTNLPA\STSRVASTVGM 
ATAPGLNFCIFSREG*VKHVGPGCLELP\TSSDPLPLVSPKLRDYRP 
EPLCLA 

246 

A 

466 

744 

LiraAVQEMIVKIELTQHMYINFHGIKLEINKGKKFGKLIHLWKLNNT 
VL SDQ * VKAE I QKGNKE I VLR WNKKEGMM YQNT/ W / D VAJCP VLRAD A 
W 

247 

A 

243 

935 

IPVGELYPLLEDGIAAFIEWNPTHEALWEPQEQQETQGPKSKHGEAP 
VGASWPFRRGFGEARPPPPVRPTLIHAWKSS*PICLFSLRHCHQALP 
GGTLDKWPSSPRG/GCSPRTPGVTQQARRLSPHRVAG*QSSSGACPW 
MSRSSAPFPFLTPSQSPRAA\L*VRKTKSPGTGEHRLWTGLGAGGVS 
ARPWKHSAGPGGIHGGRAPGAEQRSGLWKAGCWGSVRASAPGP 

248 

A 

56 

247 

QEQNPQ I S I LTLNRNG / LNATLKRYRLLNQI K/ S FCKKDQTMCCLQE 
TYLTCKDTHRPKVKG * KNI 

249 

A 

1 

372 

RALAVSMLGAWPGPWALDWGPYSMTARHGTFSL*APYPDPCPAGPPP 
APRQQGKAPSGCGGRS I CSGRLPS WWPLMPAWQRPS * / GGGLHSAP 
NARKAQAYPGCLPFYLP\ LPGLLSGWS AGFLC 

250 

A 

13 

475 

NGSSLRSPRLR*SRTIPQSS*VAGITGAHHHARLIFFFFFFFETESC 
RVSQAGVQWRDLGSLQALLPGFTPFSCLTLQSSWDYRHPPPRPANFL 
YF**RRGFT\n J ARMVLIS*FRDPPASASQSAGIIGVSHRARPRGRLL 
APSYICRNRGLEG 

251 

A 

986 

1401 

LEQGCNLFLHKHS*LGEVFFFVCFRRSFTLVAQAGVKWGDLGSL/HK 
LPSSLSLPSSWDYRHPLPRIiA/NFFVFLVKMGF\TVLARMVSIS*PR 
DPPTSASQSAGIIGVSPRAGPVAGILMFCRWNISNKPKGAVIEKKKK 
K 

252 

A 

303 

551 

QEGIKSCKSDENSGLAPGQHGETPSLLKV\SKLAGYGGACP*SQLLR 
RLRQENCLNRDLGGGGCSEPRSCHCTPAWTTEQDSV 

253 j 

A 

3 

403 

STFGFQGTGWETRWGPT*PGVSAWSLVPSPVLETPDLERGKRAHLGV 
ATAGGPAGPYRLSPHTLSSRKEGKGEAASKGTEAELMTIGIENSTCQ 
SCLS PS FSHYFWS PRGAGPKAMWLVGDRKGATKNDKRG Y 

254 

A 

1833 

2126 

EKRKKRGFFKTKPLLKCPGPWGESRFWGPPKGKSQV*G/HI*MGPNL 

GERWKKGPKRPKKPKTKGPPPFGVHPPKRNRPHLG/WPGGPTLKKGK 
NNLGFC 

255 

A 

70 

380 

YSNNPKDRKGETGINEKTNNKMVDLSSYTSVITLNF*ML*/D/KIN* 
KFVKS P * KKPTLCCLLETYFKYG IGGLKVKRWRKI * HAHTIGKKTLV 
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AILITKHTSELS 

256 

A 

225 

353 

RWG FTQLLRRLRQENHLNRG VGG CS E PRLC \ TP AWATB * DS VL 

257 

A 

238 

514 

HMYVPNLRTSKYMKRTLIELKGEIDSNTILVGDFNTTLSTMGRTFR* 
D I S KETEDLHNTVDPMDLTD / S AELSTQQQQNTLS SQAHG I PSRTE 

258 

A 

167 

409 

DRRITRSGVRDQPGQHGETPSLLKIEK\LlAGHG/ GQAPVCGGRSYLR 
RLRQENRLNPGGRGCSEPRLRHCTPA*VTVRDSVS 

259 

A 

660 

1200 

LTVGKKNRKQQQHQPKSPHGKPHLKGQQPSKVQIEDKFLGR*GKELK 
NPFTTQKGQSGFVYS KL I HNTS YRQGHS PGWRLRWMN * QK* ASEGG * 
*QTSLS*RSTIi*PNILERIPELKDWFSKIR\RQK*KRIKRKEENLQ* 
VWGYI*RPILQITGIPERRVEISMHWVRACSFSSVKFVIP 

260 

A 

1 

682 

EQCRSPARARLQPQPDPASTEAERRTPYPGSIGAPAGEGKVPGLVTY 
LLWPWQNQDEALSCIiPSEAYSLSRQHGLPLAVNTMSAGGGRAFAWQV 
FPPMPTCRVYGTVAHQDGHLLVXGGCGRAGLPLDTAETLDMASHTWV 
ALAPLPTARAGAAAWLGKQVLWGGVDEVQSPVAAVEAFLMDEGRW 
ERRATLPQAAMG VATVERDGMVY/ VSGGNGP * HGPPGPGT 

261 

A 

1 

403 

RTRGQLNRGVYGPRRSLSVPSTLAGAPLAGRLDETSGEEQTLTLCGG 
KMSGPTQPPAEGTEGTAPCGGAPVPPPNMTSNRRLQQTKAQVEEEVD 
1 1 PVNGDNVLGRDHKLRELDDRS \DALQARA* PFESSAAS 

262 

A 

72 

276 

AI QFS VY I CVPG I C I S PDRERKRDRGKDGGRHRTRWRQGE * KGEVGR 
' ERRRGRER\ DRQRDRDR I YS PD 

263 

A 

166 

398 

PQI PPPPGI PRKRENLSREPGG/ PNPWGG * A * FPLLEKFLGKKNLGP 
PPGGGPTPFPPFNPNLLGGPKGGDQKAGKKP 

264 

A 

3 

413 

SSYRWPSFPILKTCRGPMAPSLSFPGSFLSGLSVPLPLiHHPPCPT*L 
KAAYPNCQLWPFLASVAPDTAPPECPSCFFIiLQRAQA/ PPFLLDLFP 
YDC * GLCCT * PHQPRGNPLTPRQGLWLGLKTHCMGTKAGPG I PR 

265 

A 

646 

1333 

EPGMWQS P PS I GCPHRGTGQAS CA\ SPPSTGLS*I PAF PGWR PA\ WH 
LHPLLTSPCTC*L*SLGVFPVPPPQPWSTPSQDLDNDIFPVPPSYRW 
PSFPILKTCRGPMAPSLSFPGSFLSGIjSVPLPLHHPPCPT*LKAAYP 
NCQLWPSSASVAPDTAPPECPLLLCFRGPGSPILLTSPICLLRTLLQ 
LSSPASGESHYPQGRAVGWAEKPSAWSTRSIxALGFPMNNNEN 

266 

A 

268 

926 

ASCTPFPTLPVRGRLGYNPDSPGTSNAHWAGDNEMSLVGLGPVTKPY 
PHGALPQGWGSWWVRGLSRTPAVTLSQGLCSCVLETCWGPGEGAWFL 
SPQSVMCDLESPRSPADPQDGSLGVYGLASSWTNVQPTWNTSRIGSA 
CQA/RSEFNKSHLFIFLEGET*GPQKGGGVPPKPHYKAPMQSPGPIIj 
WTQPRAPPTGAIYFFLQMGGPPWRFSLFGVGG 

267 

A 

599 

1420 

SHLPSLSLLPRIPPEVIVGSERGPWFPCPWGSCWGLSRTPAVTLSQ 
GLCSCVLETCWGPGEGAWFLS PQS VMCDLES PRS PADP / RGWKFRVY 
GLASSGQMCNHLEYSRMLVLiARHSELISPTYSFFEGETEAQRG*GIP 
PKPHSRSNAEPGPYTGTHTWPLHTGTFTFSSQMRGPSGLLDSGGWIK 
TEGDKEKAPGLMGLKDRDTGTCYGCVCSKCPGQGLQAGGVIiYLAQPP 
AlilRLPNGPRPARARERRDWHHREETGFLIiDEATLLKARP 

268 

A 

1755 

2524 

VPWGNPGSESVGLPGGRGLVTAPAVFLPPKPAPGFGAQHVRSRK/PS 
RRGQL I RAAWLTGS PVRPGQEMLGAGRGGASGRAG * GPAP YRKLSS A 
SARGAGHPESGNCSSSGSGSTWSSSWGSGSGSSWGSRGLSQEMVEEV 
L / HF I QGG AG * AGGTLQ W AjLVSNLG FG VRQGDD I VWTL I GLHGS I E 
DGAGVQGRSEAMEPRQAG/RGTEGWLGGHSSETPHPPAGTSFCNLRP 
TVSREAAPAGRRGAESQTSLPSTSA 

269 

A 

3 

389 

GVSPCWPGWS *TPDLK* STRLGFPRCWDYRLLEVDPCTGHSLYL* KD 
PPVLPLPAEGCDYQGKEYALSSCCDKSYLLPDYRTKFLCCHPERGTW 
KLGTVGGCYAPIQS FG I ADEQAWLQHGSGAVYLC 

270 

A 

555 

1182 

RGPMERPKQMTAVHCQGHQRGDSEIRNFRCHGPNSCTRWGSPPHASA 
SFSTCL*NKPKYFPWGISFLSFFLSRRSL/DSVAQARVQWCNLGSLQ 
PLPPRF*QFSCIRLLSS*DYRCMPPCPA/NYL*F**RQGFTI*ARLV 
LNS*PRDPPVLASQSAGITGVSHQARTFFLFFETTFHS\VAHAGVQW 
HDLSSLPPPPPRFKRLSCLSLSSS 

271 ! 

A 

80 

600 

S I QEKRSLGGEQS PTFLKELQEGTS WLLMAKLYEAVTFKDVAVI PTE 
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I EELGLLDPAQRKLYRDVMLENFRNLL/SQWGINHAMEILSTS*GKEK 
LWVMGTS SQ* ERNLGGKI QTEMET\ VPEAGTHEEFSCKQI WEQI ASD 
LTRSQDTTISNSQLFEQDDNPSQIKARLSTVHTR 

272 

A 

1321 

1913 

VEMRRNQCKNSSNSNGLCVICPSNNHTSSPTRVL\TWMNCLELHKYN 
V* IWIGKKI I * TQENGKTQSEENKNNIKVLQELKDKVAGI IKKNLTD 
LNAPKNTLLEFHNAFTSINSRK/DKSDERISELEDWVSNIRWTKVN* 
NEQILHGGWDDIMRP I LQI TG I SEREVEKSNNLENWQDI IFESFPN 
I LARKANSQIQE 

273 

A 

193 

310 

WSLSVLGSGRRKRSQGTEELLCQSHTLLGDTGNSISHSH 

274 

A 

831 

1102 

j HFGRPRQVDCLSSGVQDQPGQHGEVQSVLKL\KKLAGHGGAHL*SQL 
LWRLRHENHLNLGGRGCSEPRWCHFTPAWTTEQDSV*NNNNNNK 

275 

B 

53 

403 

1 ALVTEFQETHSQDAIEHVLANLANFAYDPSNYQYLRQLQVLHLFLDS 
LSEENETLVEFAIGGLCNLCPDRANKEHILHAGGVPLIINCLSSPNE 
ETVLSVITTLMHLSPPGRSFLP* 

276 

A 

842 

2432 

SLISCSEGSSPKASGPSS*EHCPAKPLGPGTWPLLTPGPESGGGG\P 
DVGLR\GQPMNAAPV * s s pcl *N/ PGFLTP SGVS S V* MP VLLKEPAF 
QMAEKAPNGAVCAP/ SFPASTSGLWSPAVGVPRSRSLHGSALPWGHA 
PRQ/SGYPGQSAPAGCGPRPPGGGSRSPPGRSVPGSGAWRWPRGRSE 
ACTAPRAWRSALGESCGPAGSGA*AW* WQTAPSPHWGWIGS* * *VGH 
LLRAGCAPCWPCLGTGCTGLPSRSGHKSLSLSVQQVQWLIRFFFPLS 
* M * KPAACLPS * RERPSCAHWTAGGVMGFLSV * S VI R * SLSHCLAAE 
SS PHPGAPGQGTATQIGGGGSRYVQRG *NHRPCRS * PIQRPCGQAGS 
GGRVASTWCAQ*QTPPGSHSPPTANREINPGPAAAADTRSCWGHKRS 
WRGWRGLAPWRLGFGSPWNSRDQGPAGIPAARRGGPTFGFLAHGPR* 

RRTPRDGREGGKGAGAKPSETLTRSPPVWRGKRGSANGFLSWVQILQ 
Y FAAGNG I WEMGRGLR I h 

277 

A 

721 

1027 

QHPCQPGPAHVPQARDQPQHGQPLGEGELPDGRAHCRAGQAAGAPAA 

RHSEQGFQVLLASELLPQHPHAEGSWAQCRRGFQREESALDCRPGGR 
RGKWESAH 

278 

A 

615 

2183 

PQVIRPPWPPKVLGLQGLLTFRDVAIEFSQEEWKCLDPAQRTLYRDV 
MLENYRNLVSLDTSS KCMMKMFS S TGQGNTE WHTGTLQ IHASHH IG 
DTCFQE I EKDIHDFVFQWQENETNGHEAIiMTK/ YQKVDE \ CTERHDQ 
RHAGNKP\ LKMS LDQAF I RI CLKCTY FTPKGKLVI KLRRLSTMLS /H 
VS AS ORIS CRPKTRI SNKYRNNFLQS SLLTQKRE VHTREKS FQRNES 
GKAFNGSSLLKKHQIIHLGDKQYKCDVCGKDFHQKRYLACHRCHTGE 
NPYTCNECGKTFSHNSALLVHKAIHTGEKPYKCNECGKVFNQQSNLA 
RHHR VHTGEKPYKCEECDKVFSRKS \ TLERHRR IHTGEKPYKCKVC0 
KAFR\RDSHLAQHIVIHTGEKPYKCNECGKTFV\QIHLLVMHKVL\H 
TGEKR YKCNECGKVFNHKSNLACHP * TS I PGRETFT\ RVMECGQGF\ 
NRKSNLiERHHRLHTGKKS * KC/EICKVFRQQSNLACHHRLYTGEKPY 
KCEECDKS FQFQITP 

279 

A 

403 

935 

QQRWQQGWWQAWGGQPVPQEVGFWFLLSWLLSRLLGVPAQGPARGEV 
DPRPGSRGARGGCSASAAPPARASPGRPCCWNGSRPPAQHRRTSPAT 
RVGEAGQGEVRGVRRLPAPLLQAVQPRLSDRSYPAAPARPTATSP/A 
PPPSPAPKAPIVGALS*GTADPGSDSYPPSRSHAPRP 

280 

B 

73 

207 j 

ELGTQLRTCTTLSGIASQVEAHHSLVLSHLKTLLWKKISNTKYS* 

281 

A 

276 

565 

WTSSVSTGQSDQA/ GPAPALIQMQRPGCGPSPVPGGWLE *RDGQAGM 

GAAAAPAWVLLCIPAGQGPLPGPRLPFHIPILKFCYCGILVEKKEPR 
GCF 

282 

A 

1466 

2353 

icim vnxjwjjj_,itir XLtX X K.1WVFHIDFPLGGEKNSGVLGFQAVQKQPRGS 
FFL \ PRI PQ *QNliRIGl * KGRRGPGSGPWPAGDTQQHPS KGKGLPPL 
PACPSL* *QQGRRCSCGPGGRLGS*GCKHPECQWSQRGVRPIiSVPLF 
QP\ PPGTGAGPTSWASASGSGAGACQPGS/ PLGSVHTGS PSQAKSPP 
SRVK*QECQHPTHVSSPVSRRPGFPKL»PG\GNRAGPAAPSPPRDTPA 

APPPRPGPPSPGSLAQDAPGRSLPPPRTRRPAAQSPRPRSASESEAA 
RRG P VRRS RTGREEKSG 
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283 

A 

134 

387 

CQIMDGMLAIQAWPTPQQVASNGTGLNPNAKVWQEIAPGNTDATPVT 
HGTESSWHE I AATSG AHPEGNAE \ PQKI YVKNMK* CI 

284 

A 

305 

1129 

DICKEYEVMYSSSCETTRNTTGIEESTDGMILRPEDLSYQIYDVSGE 
SNSAVSTEDLKECLKKQLEFCFSREKLSKDLYIiISQRDSDQFIPIWT 
AAl^EEIKKLTTDPDLILEALRSSPMVQVDEKGEKVRPTHKRCIVIL 
REIPETTPIEEVKGLFKSEYCPKVISCEFAHNSDWYITFQSDTDAQQ 
AFKYLRKE\VKQFKARPILAGIKAINTFFC*EWSSNNGILESIVHPI 
* TQAQYAS P VFMQPVYN PHQQYSVYS I VPQS WVS KS YTLL 

285 

A 

1053 

1404 

TFTQDSAGSLWHSPMTSIWKTAESVDQGHFPIIILILFVYLLFETG 
SHS VTHATMQWCDHS SLQPQTP / SGSGDPPTLAC * VPGI IGMCHHTW 
LIFTPFAETGSHHVPRLVSNSPLA 

286 

A 

259 

506 

PGYRRGHGGAAEGRQEPAARHNPAVREDADGGD/RPRSAACPGTSTA 
PPGT*GS\GQAHRDGARPAPAPRPSHPSAGPAAGPSG 

287 

A 

1 

2072 

FRRS PS P AAVS LGLGVAWS S LVNG S TFVLQKKG I VRAKRRGTS YLT 
D I VWWAGT I AMAVGQ I GN FIiA YTAV PTVL VT PLGALG VP FG S ILAS Y 
LLKEKLNILGKVGCLLSCAGSVVLIIHSPKSESVTTHAELEEKLDQS 
RVGGLPCASLLVMLAAAHLLDPCRPMGPPTSWSTSASAPCWADTPCL 
PPRASGWRPKTSCITTRPVREPSACAWYSWPCSAAASSSSSGTRNKA 
LECFDSSVFGAIYYWFTTLVLLASAILFREWSNVGLVDFLGMACGF 
TTVSVGIVLMRVFKEFNFNLGEMNKSRDTWRQSSPLIVAPRDHFHLE 
RGVQFQNMNSVMGSHLVMVALSPGVDFIPQPDDPAALLVTELFANRL 
VSGYAPATGLQGTI LQYGNTLMEVMPKI CRLPRHECGS PGP\ PGSCA 
PPPSPHSTR/RRPPRSAD/CGPCKGTGTRRERAPGAG*AEAAGPGEG 
GVGAGAANAGAGAADAGAGGADAGAGGAGAGAGGAGAGAGGAGAGAG 
GAGAGAGGAGAGAGGAGAGAGGAGAGAGGAGAGAGGAGAGAGGAGAG 
AGGAGAGAGGAGAGAGGAGAGAGGAGAGAGGAGAGAGGAGAGAGGAG 
AGAGGAGAGAGGAGAGAGRAGAGAGGAGAGAGGACAGAGGAGAGAGG 
AG AG AGG P VAG AGG AVC S QT I FTRSHGGLHL WR PC 

288 

A 

1 

392 

ITLGPDSIIGIWSGNQHKQDPYIASMEHHTDWVNDIVLCCNGKTLIS 
ASSDTTVKVWNAHKGFCMST/YKDT*ASSLSGNKDSIYSLAMNQLGT 
IIVSGSTEKVLRVWDPRTCAKLMKLIGHTDNVKALLb 

289 

A 

1 

1051 

frvnmqdggpspaehskaeesagmearflglpdaagssgptparrcp 
aprpagvsyvirdevek™rngvnalqldpalnrlftarrdsiiriw 
rvnqhkqd pyiasmehhtdwvnd i vlccngktl i s as sdttvkvwna 
hkgfcmstlrthkdyvkalayakg*\elvasagldrqiflwdvntlt 
altasnntvttsslsgnkdsiysiiamnqlgtiivsgstekvijrvwdp 
rtcaklmklkghtdnvkalllnrdgtqclsgssdgtirlwslgqqrc 
iatyrvhdegvwalqvndafthvysggrdrkiyctdlrnpdirvlic 
eekapvlkmeldrsadpppai 

290 

A 

1 

366 

FPNSSRLIWPECNRIKEESLVSASPGSFCKPHLDLALGLGL/GVGLM 
VELGLKLGLFLGLGFDLGVGLG I GL / GVKLGLSDG Y/GLGFRLGLGL 
* AGLVLGLG I R VGLKPGLGLE I FVGL ILCICI 

291 

A 

49 

420 

FMINPPNQVGRYRNIINLTTNIYQKPLAPVIL/HGEIFKIFLTGIGV 
TQRYPLSPLPFKVFLAALDRK*REGAGEEVNGIENKRIKLSFVSDMI 
VYVENSKEKNGQLLELMRGICKTATKLQKL 

292 

A 

5 

377 

FM I NPPNQ VGR YRN I INLTTN I SQKPLAPV I L/ HGE I FKI FLTGIGV 
TQRYPLSPLPFKVFLAALDRK*REGAGEEVNGIENKRIKLSFVSDMI 
VYVENSKEKNGPLLELMRGLSKNATKLQKLI 

293 

A 

346 

490 

QLQAMAIFEYLKKTFLRPGTVPHSCNPSTIi\GGRGGWIT*GQEIiEAS 
P 

294 

C 

140 

172 

MTVXLRTNGDF 

295 

A 

2008 

2438 

WSLHRPRGPIiLWHMPRVPPPLTGL*SSPHDSVGPQPWKSPPTTSFNS 
PDPPENVQNHPGKPVAKS PFVPQRNGH / PPKR/ PTWGSS TCWGEVIR 
SGSPPPPTXKPVWPTSCEFGSVLPPEDETVFPXPHFPVSLILFAISF 
ALVCGD 

296 

C 

165 

290 

MCHRAG PASLSAGQF IH I FLVTPMAAPRI S AVCGPESRHQG * 
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297 

A 

883 

1097 

GSG V * DQPG \ QHGKPPS LL * LQNLAERGGGHL * S QLLRRLRQENHLN 
PGGRCC\ SELRSRHCTPAWATRAKL 

298 

B 

191 

494 

MKLKRGAGSCKKPGSSGWGPAVGEFHSVQNICARREYQRQDYMDAMA 

EKQKLDAEFQKRLEKNKIAAEEQTAKRRKKRQKLKEKKLLAKKMKLE 
QKKQEGPX* 

299 

A 

2 

478 

SGRLPWLAQPPPRCDHRGPRKSRRRSSSPRMRREEQKLKLERLMKNP 
DKAVPIPEKMSEWAPRPPPEFVRDVMGSSAGAGSGEFHVYRHLRRRE 
YQRQDYMDAMAEKQKLDAEFQKRLEKNKIAAEEQTAKRRKK/RPQS* 
♦KKNLLAKKMKLEQKKQEG 

300 

A 

72 

409 

VCSETGFI LYCRN/ DCKWDAP I LETVWFFVRS *T\ LDMNCTPAI PFL 
FI YPLEMYGHAY*MTCTRIFMAALFI IAT\NQKSPKCS INSIHATKY 
Y I AMKL I QL YP YVATW IN KVF 

301 

A 

92 

617 

DRVLSLLPRLEC$GS ILAHCNVCLP/ GFKRFS \CLSLPSSWDYRHTP 
PCP/V*LVFLVEMGFRHFSQAKSRTL/TGDPPASASQKCVWYRRDNL 
\AWLASALNKVINLLKSEIPSFFLVQLSGFSRTLLLQGFLNFNYHTL 
VYSNNLPIMQTFTQDFGRISGKPFPGIMILCNREILYIM 

302 

A 

367 

620 

ETRCRGHPPRDREPPPS*PGIQSSPGVP/PSRPSNKAYPRLPIRPQP 
PGLSFR\STPPRPSP\KPPCPPHIGRHVQPTKTSAVSILS 

303 

A 

2 

520 

FPLFLFLLLLLAFPVALTTTRRIQGVPQTLVLQAKRVNTSRPHPD/G 
S/ QPPTSASFNQLNPTTNSTPQLRSHVASGYVG* KQMQYPH / PPQHA 
PAPPQPCPWQVDTFLPAPTRPLLGSGPGSPSRRSPGG\PPPAGKGPA 
PGPECPPTAQRPTT/RQYLGPLFKGDPRNSPREKPRS 

304 

A 

444 

920 

NPSLRCWLGLSPGKLAGARGRGTVHVGGPEGPCTHSTSTKRMAKRHP 
E VS TGGGGKAP * / S QG I RGQG AGQG \ GAR S G VAGNG GGG ARS RLRL S 
PAAASATA/ SGPEGS\GPGSGGRSRVPVEAAPPCPRSDWPAARGAA 
AAGPGGTWLPASQVGPSGPWG 

305 

A 

504 

935 

AYLQSSMIAQSRATRTGLATWHQHKALDRHFPRLSSRHPLFETCSSP 
PCKRWGEKSHFRRACKEGRESAPCGIPRPRFKVHTPRGKLRM\RPVP 
RRFGGSRLGPRSRRKRRTKQPR*PAAPRDSPGAPAAPPTPRPSAARN 
GGG 

306 

A 

163 

389 

IRQKKAIFVIVAFKRRGLRKTD/IYYYARGG*FVKNKYRGFIIELGT 
TCGKYFLFCTLAI IDPSDSNI 1 1 SMPEQT 

307 

A 

131 

395 

LWAARAAVLTSLLPTGSPSAALSGCCRPGGQGRYSVAACRGRAAGGV 
APVPAHT * AAAPLGHFS FLLG I VS PAPE PHFELGHY YSGLS 

308 

A 

327 

598 

GPARNPAQCRWAGTAGGPSTLSAAAGPGAKPLMPGAA/GAGQAGWLL 
♦VQGPPSPRPPGTLAGPQARHTALVPARASPSTPPCKLRGGLRP 

309 

A 

1 

630 

MKPWTFTVSVTVLKDGVSGVCYFRCSDVSGVSYFWWVRGLADFRSEA 
MDLVASVTALNGGVDPKRTGFGDKFKPVIGTWDCDTCLVQNKPEAIK 
CVACETPKPGTCVKRALTLTWSESAETMTASSSSCTVTTGTLGFGD 
KFKRPIGSWECSVCCVSNNAEDNKCVSCMSEKPDDFFGGVEHLAQGW 
PCSMTWSDTKGFPQTQGDSLRW 

310 

A 

101 

520 

RR1»QKPGCLIWTRGST*CPLTLLLPSRGAGLGTC/KPAMPEPPTHSM 
GTCAARAS PTSTTPCST/ CAQSHRPGRNKQPKG * GMRAHGAGLAGSS 
TCSPGVGSTG*SQGL 

311 

A 

411 

640 

ADIDSRGPKMSLQSSS*SRGSWMSGN*WSFSSDPTYSGSGGSAPSPI 
\PTQGAEECGRTAPGLAGSSTCSPGVGSTR*SHFRPTHPSLENLEDI 
ALTNVLQNRFVRAEPASLKSPVIALLCMSDLTVGTTVTQLQHLNTMG 
VIGSQDGRDQVAALhmQKQGDLWPFPRVTVLWRKGIDQTFWGLLNTG 
•a x xj x tr \ju xr iv v_ k. v_ cj r tr v iv V rvA±itj»v. y V J. NG VIjAQIQIjTVGLi VGPRPV 
PSTTQGAEECGRTAPGLAGSSTCSPGVGSTR 

312 

A 

375 

519 

GMLFPKQITPQSI*VETNAiagrv*GGDIASFMGSTHAI,FEFPDSHHSA 

313 

A 

640 

975 

VPWEAGGSMSICFYPRQSEQSHNIVKSKRPVTFQQNRCQKGGGHSAD 
VSSTCQCFQCSHREQGFAKPQFSYL*NGGSNSFYLTGCFDNLITSVF 
CSVRFIHGPKAIIKGKYH 

314 

A 

22 

297 

GKEKLSHSKAWAPKSQNSNSSASSWSNQSQAQPRFKGED I/ESHLLM 
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GREALS*IQS/GRLVYLQSLS*T*RFSTASPEQLDTECRIiVHSQTLS 

315 

A 

152 

437 

GLRLCLSRPLTRPGDDSVGGSAMASGAGGVGGGGGG\KIR\TRRCHQ 
G P I KP YQQGRQQHQ VWD PD AAVA WR VG KLG WRGRAVRG PG APRG ADR 
RA 

316 

A 

3 

611 

GRARGARGCSGGGAAREGPPKAERCAASGRASAANMSDTKVKVAVRV 
RPMNRRELELNTKCWEMEGNQTVLHPPPSNTKQGERKPPKVFAFDY 
CFWSMDESNTTKYAGQEWFKCLGEGI LEKAFQGYNAC IFAYGQTGS 
GKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIY 
NEKVRDLLDPTGSR 

317 

A 

1 

949 

MGSCAARASLTSTTPCSTAPSPINHPKAEECERTTAQDWQAAPPAAP 
VRDPLGEPSWAPESGGDVESLYIYLRDCKYTNQHPVFSSRFVNPPID 
TLYLAALVGPWKTFMSSSGIVNTPIGTLYLAQVTVPPPL.HTRFFALS 
PRQPRSFKNGLLFQTGTTLNPLSGYSSDYKGFGRFRNHPQTGFSPAG 
ANQRGPLVATLSGPGGEGQSAV\PVSLVKRKTTLAPNTQTASPRAIiA 
DSLMQLARQVSPTGKRAVSATAINLKVCKHTSQHPVFSSRFVSAPID 
TL YLAAL VRTWRVFM S S SGIVNTP I GTL YLAQGL 

318 

A 

1 

4425 

MASGAGGVGGGGGGKIRTRRCHQGPIKPYQQGRQQHQGILSRVTESV 
KNIVPGWLQRYFNKNEDVCSCSTDTSEVPRWPENKEDHLVYADEESS 
NITDGRITPEPAVSNTEEPSTTSTASNYPDVLTRPSLHRSHLNFSML 
ESPALHCQPSTSSAFPIGSSGFSLVKEIKDSTSQHDDDNISTTSGFS 
SRASDKDITVSKNTSLPPLWSPEAERSHSLSQHTATSSKKPAFNLSA 
FGTLSPSLGNSSILKTSQLGDSPFYPGKTTYGGAAAAVRQSKIiRNTP 
YQAPVRRQMKAKQL SAQ SYGVTSS TARR I LQS LEKMS S PLADAKR I P 
SIVSSPLNSPLDRSGIDITDFQAKREKVDSQYPPVQRLrMTPKPVSIA 
TNRSVYFKPSLTPSGEFRKTNQRIDNKCSTGYEKNMTPGQNREQRES 
GFSYPNFSLPAANGLSSGVGGGGGKMRRERHAFVASKPLiEEEEMEVP 
VX»PKISLPITSSSLPTFNFSSPEITTSSPSPINSSQ7VI»TNKVQMTSP 
SSTGSPMFKFSSPIVKSTEANVLPPSSIGFTFSVPVAKTAELSGSSS 
TLEPIISSSAHHVTTVNSTNCKKTPPEDCEGPFRPAEILKEGSVLDI 
LKS PGFAS PKIDSVAAQPTATS PWYTRPA I S S FSSSG IGFGESLKA 
G S S WQCDTCLLQNKVTDNKCI ACQAAKLS PRDTAKQTG I ETPNKSGK 
TTLSASGTGFGDKFKPVIGTWDCDTCLVQNKPEAIKCVACETPKPGT 
CVKRAIiTLTWS ES AE TMTAS S S S CTVTTGTLGFGDKFKRP I GSWEC 
SVCCVSNNAEDNKCVSCMSEKPGSSVPASSSSTVPVSLPSGGSLGLB 
KFKKPEGSWDCELCLVQNKADSTKCLACESAKPGTKSGFKGFDTSSS 
SSNSAASSSFKFGVSSSSSGPSQTLTSTGNFKFGDQGGFKIGVSSDS 
GSINPMSEGFKFSKPIGDFKFGVSSESKPEEVKKDSKNDNFKFGLSS 
GLSNPVSLTPFQFGVSNLGQEEKKEELPKSSSAGFSFGTGVINSTPA 
PANT I VTS ENKS S FNLGTI ETKS AS VAPFTCKTS EAKKEEMPATKGG 
FS FGNVE PAS L P S AS VFVLGRTEEKQQE P VTS TS L VFG KKADNEE P K 
CQPVFSFGNSEQTKDENSSKSTFSFSMTKPSEKESEQPAKATFAFGA 
QTSTTADQGAAKPVFSFLNNSSSSSSTPATSAGGGIFGSSTSSSNPP 
VATFVFGQSSNPVSSSAFGNTAESSTSQSLLFSQDSKLATTSSTGTA 
VTPFVFGPGASSNNTTTSGFGFGATTTSSSAGSSFVFGTGPSAPSAS 
PAFGANQTPTFGQSQGASQPNPPGFGSISSSTALFPTGSQPAPPTFG 
TVSSSSQPPVFGQQPSQSAFGSGTTPNSSSAFQFGSSTTNFNFTNNS 
PSGVFTFGANSSTPAASAQPSGSGGFPFNQSPAAFTVGSNGKNVFSS 
SGTSFSGRKIKTAVRRRK 

319 

A 

160 

747 

VLNGSGRPSQSKKHFPPKPGTFGGASVHQMFIMSLPA/ GPDLSGFDE 
DDKGWPENQLDMSDYSSSYQDVACYGTLPRDSPRRNKEGCTSKTPHA 
LTVSPFKGIFSSATKVFQAPNACKRGA*GKG* PWEGKAS SKPGRDFL 
REGRKMSWKLFNRKLlSSQPYVPVGVCLTSVFTIASLGEQGMVFPLK 
VDLFKLTG 

320 

A 

1812 

5065 

PNKTGGKTVSDGLHHPSQLHSKIjENDQGFYNVDSSTGFHNKTNPVGP 
FRKSPFLVMGVDWAETRRSMMMQMRRAPMSDDILARDRGSRIrSRGNR 
RNGGGGCRDDDDDGGGAGAVRTGEGATEELPPVGGRVRPQHRGELFV 
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LSSAGDLHRDRDTHRGGAGSGGGSAMASGAGGVGGGGGGKIRTRRCH 

QGPIKPYQQGRQQHQVSSSRPDLSGFDEDDKGWPENQLDMSDYSSSY 

QDVACYGTLPRDSPRRNKEGSGCTSETPHALTVSPFKAFSPQPPKFF 

KPLMPVKEEHKKRIALEARPLLSQESMPPPQAHNPGCIVPSGSNGSS 

\ SQ*NTIANVRRRLTLRRKKMSWKLLTGS * * VHSLMYLWSLLTSVFT 

MPAWRTGNG/LSSKVDLSNSRVLEKEVSRSPTTSSITSGYFSHSASN 

ATLSDMWPCSDSSDQLAIQTKDADSTEHSTPSLVHDFRPSSNKBLT 

EVEKGLVKDKI I WPLKENS ALAKGS PSSQS I PEKNS KSLCRTGSCS 

ELDACPRKISQPAKGFCPRELKVCKHTNQHPVSSSRFVNPPIDTLYD 

AALVGPWRTFMSSSGIVNTPIGTLYLAQGSNGSSMPVEHNSKREKKI 

VSSGTISVILPGGPQLASPSGSHTGATGGLACQ/CSRSAPALLSPWA 

VHGTRRRAAGGGACRGGSGRAGAGGGGGGGHS * QGYR I C * E YCARVA 

TKILQQE\EDVCSCSTDTSEVPRWPENKEDHLVYADEESSNITDGRI 

TPEPAVSNTEGGHNENAAAGPHDSHSCASGRSSSRSPKAPTLSPVHT 

HHKKGALPEGTHRIGF/TYG/ IPVXiESTL*KP*GPPVCSTWGAETSI 

RPS PPAE / CS PPARPRRRWGP I PDAWARAGFAS PKIDS VAAQPTAT 

SPWYTRPAISSFSSSGIGFGESLKAGSSWQCDTCLLQNKVTDNKCI 

ACQAAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGTW 

DCDTCLVQNKPEAIKOTACETPK^GTCVKRALTLTWSESAETMTAS 

S S S CTVTTGTLGFGDKFKRP I GS WECS VCCVSNNAEDNKCVS CMSEK 

PELQVFSGSNS 

321 . 

A 

819 

1244 

KAYASLSLWQAPSGESRPAPAAADLRGSSG / RPP * TFPNPPCGPHTG 
CP \ AGS FP I S FQPRCLLVAKS KPGMPVSDEAAYES PWKGGAGAPALG 

LLGALSSEVADRALQAAPAWAPRDAGRGDLPGQDRPQGPAGASLGPS 
RS 

322 

A 

2 

375 

P R VR VR VDL VS G WGG KMVDL F VL VTVKRV \ I D YAVKI L VKPDWTG W 
SDGVKHSLNPFCDIAVDEAVLLKEKKLVKEGIAVCCGPAQCQETNRT 
ALGMGAHRGIHVEVSP* * AQRLCPLQVARG 

323 

C 

303 

395 

MYSQPQANLGSIiEHKLGRLHIRILGHGSGRL 

324 

A 

268 

393 

DGRRKEKWHKVERRHRPYLLSSLSQHRW*TVTNFGEISGTIAIEVDE 
GTYIHALNNGLFTLGAPHK/ERIALKPGYGKYLSINSDELW/GRSD 
A I GPREQWE P VFQNQEVRNGG P AEMGEEKRNGTKWREDTDHTS F PLF 
PSTGGQPKAHSNWRKVCH 

325 

A 

1 

67 

RHEAYVKSTKLVHKGTKAKRFA 

326 

A 

163 

552 

AGFSLSAQKSPGAMA*YSYVKBSKLVLKGT\SKKKKSKT>KKRKREDE 
ETQLD/ 1 VGI W * TVTNFGE I SGTI AI GMDEGTYIHALDNGLFTLGAP 
HK/ERIALKSGYGKYLGINSDELVGHSDAIGPREQWEH 

327 

A 

289 

818 

EPSRGVWPHEDARINGSKJOCKSKDKKRKREEDEETQFD/lVGIW*TV 
TNFDEISGTIAIEMDEGTYIHALDNGLFTLGAPHK/ ERIALKPGYGK 
YLSINSDELW/ GRS DA I GPREQWE PVFQNGACAAVFTVIGSEKQSE 
CS LLRES RAKYHG CTHGQ I S S SLKQH PRWM Y SHQEDLKV 

328 

A 

213 

905 

YVQSLKQILS/GCI*ESIAIKKKKNKDKKRKREEDEETQLDIVGIWW 
TVTNFGEISGTIAIEMDKGTYIHALDNGLFTLGAPHKEVDEGPSPPE 
QFTA\VKLSDSRIALKSGYGKYLGINSDGLWGRSDAIGPREQWEPV 
FK\MGKMALSTSNSCFIRI \HEAEDIEAKSKTAGEEEMI /KGSPNLC 
QTFTFMTLPYLPEHRSLLLKIRSCAERETKKKDDIPEEDKGNIKQCE 

329 

A 

133 ; 

482 

GYGRRSVKVRAWARCTGFSPKTPNPWVNSQHFGRLRWADHLRSGVRJD 
QPGQHGETLSLLKIQKLPGCGGRCL*FQLFGRLRQEN*I/RLNPGGG 
GCSELRSHHCTPAWVTEQDSVSK 

33 0 

j± 

9 *7 

£t f 


I»QE*SRRPSRSEAAI)LHGECYSS*GSTSGWCSSRWARGIjAGLRSEA 
ADLRSEYYSS 

331 

B 

396 

809 

MAGCRSRALPHGEAAKAQRKVTAAAGPGAKHLTAWGWQGQLATPSVG 
PAEPTHTQNSHWPASAVCSPSSRLRLSLHTYPQAEGAGSGLGQPRKG 
LPQCSSRLKGSSSAAKVGAQAEEVPRASEACEGCQHAVTSHKY* 

332 | 

A 

130 

249 

GAPPPAPWRPVPSTTQGLRSASAQRGTGRQLHLQPGCGIH 

333 

A 

1 

3408 

MSEAPRFFVGPEDTEINPGNYRHFFHHADEDDEEEDDSPPERQIWG 
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ICSMAKKSKSKPMKEILERISLFKYITVVVFEEEVIIiNEPVENWPLC 
DCLISFHSKGFPLDKAVAYAKLRNPFVINDLNMQYLIQDRREVYSII, 
QAEGILLPRYAILNRDPNNPKGVKLQTFAVSVTALKVARLELFVPPO 
GLLVLLGSGVKLQIFAGCRWSCLPVTRRVLALLSPWWDGTGRRGAG 
GGARWGGSGRTGAHGVECNLIEGEDHVEVNGEVFQKPFVEKPVSAED 
HNVYIYYPTSAGGGSQRLFRKIGSRSSVYSPESNVRKTGSYIYEEFM 
PTDGTDVKVYTVGPDYAHAEARKS PALDGKVERDSEGKEVRYPVILN 
AREKLIAWKVCLAFKQTVCGFDLLRANGQSYVCDVNGFSFVKNSMKY 
YDDCAKI LGNI VMRELAPQFH I PWS I PLEAED I PI VPTTSGTMMELR 
CV I AVI RHGDRTPKQKMKMEE VLD I ARQLLMELGQNNDS E I EENKPK 
LEQLKTVLEMYGHFSGINRKVQLTYLPHGCPKTSSEEEDSRREEPSL 
LLVLKWGGELTPAGRVQAEELGRAFRCMYPGGQGDYAGFPGCGLLRL 
HSTYRHDLKIYASDEGRVQMTAAAFAKGLLAJLEGELTPIIiVQMVKSA 
NMNGLLDSDSDSLSSCQQRVKARLHEILQKDRDFTAEDYEKLTPSGS 
ISLIKSMHLIKNPVKTCDKVYSLIQSLTSQIRHRMEDPKSSDIQLYH 
SETLELMLRRWSKLEKDFKTKNGRYDISKIPDIYDCIKYDVQHNGSL 
KLENTMELYRLSKALADIVIPQEYGITKAEKLEIAKGYCTPLVRKIR 
SDLQRTQDDDTVNKLHPVYSRGVLSPERHVRTRLYFTSESHVHSLLS 
I LR YG ALCNE S KDEQW KRAMD YLNWNE LNYMTQ I V I ML YED PNKDL 
SSEERFHVELHFSPGAKGCEEDKNLPSGYGYRPASRENEGRRPFKID 
NDDE PHTS KRDEVDRAVI LFKPMVSE P IHIHRKS PLPR SRKTATNDV 
VSENANYLRTPRTLVEQKQNPTVASTALRSSPIMRKKVSLNTYTPAK 
ILPTPPATLKSTKASSKPATSGPSSAWPNTSSRKKNITSKTETHEH 
KKNTGKKK 

334 

A 

3 

443 

SFQGGLEGEAWAGTGAAPGACGPARVPGGRRIiRGPRTRPAGPTGPGQ 
* G / VLAPGPAAAVL/ RFLTGP * LPSRSRARDREGLQLHS * SQRDHEP 
TGRN/ GTTPDA P P * EL * HS PRRSAAS LCPQAPEVS PQGS QVPALSGN 
SPQRLLLPT 

335 

A 

2 

306 

PCGGIRGGKEMNEIKTQFTTREGLYKLLPHSEYSRPNRVPFNSQGSN 
PVRVSFVNLNDQSGNGDRLCFNVGRELYFYIYKGVRKGSLSSSSQAS 
SPGETW 

336 

A 

243 

748 

PCQSFQPSGFGKDRPTSLEQRSQRERQAAI FAVSQPSLVI PPGTGKS 
EVNADRSRPPAYCSNLGRYSGPGPPSLVIPPGTGKSEVNADRSGPPA 
YCSNHFPSALP* *RSSMRSLSLQQTSAWTCRHFHTSFEIQ/RGGSQI 
LTLAFCAPAGPTPRGSHVLGLAPSEALT 

337 

A 

514 

967 

APACCPAAVSPFLCEPTLIHPPHPPDLKEDTGQKHRGAQSHPTLAGA 
GLAGRSSKQPSPSAISIWHSD\GTSSCDRRQQCDRDLPSALLSPHLR 

CSQPATRQLESS*T\PPGTADLFLASSGPNPRAFSNARSVKKCKATS 
EKSKPKSYQGS 

338 

B 

1 

456 

MKLRTLAVSATALKVARLEFVPFDVRMCSEFLSSGVKLQTFAVSVTA 
LNALRLELFVPPGGLMVSIiASGVKLQIFTVSVTAHKSSVDPKTIiGWS 
MGLGAVEQGAALIGEAWAAQEPMEGVGGSGMAGCRSRALPRGKAAKA 
RREIERSAGH* 

339 

A 

1 

1519 

MAPELSSKEQPGSEWKKACQMEGTVCAKAQRDLLDGFDQNADDDMDN 
EIQAEMVSDGDEELVGNWSKEPTTLGAATMGLGTEKQLGMGWAGRKG 
PIEDLEPLPQAACKLHAVAHTTWEEAVQCGSLPRRNSNLLLEQGVLE 
ETRNLLQEHGGEIDLEKKRWLGTVAHTCNASTLGGRGVKLQTFTVSI 
TAPKAARLELFVP PGGL WLLAS GVKLQ I FAVS VTAHKS S VDPKTLG 
WSMGLGAVEQEWLVGEAPAAQEPMEWVGGSGMGAAGPGAiPRGKAA 
ivj\kke» J\{jLi x IUjJjPHSEYSRPNRVPFNSQGSNPVRVS FVNLNDQ 
SGNGDRLCFNVGREL YFYI YKG VRKTI P * G\ CRTE I L I S VLTEAADL 
SKPIDKRIYKGTQPTCHDFNHLTATAESVSLLVGFSAGQVQLIDPIK 
KETSKLFNEEAFEGRYATFFTI/WYECSHGKLHISSALGLVFFGGDI 
SYSLSSQPAPLEGASRDSGSRGHELWSESSRKRNNKRH 

340 

A 

759 j 

1034 

IQRFSTSPPDSGAQLASPSGSRTRAAGGAACQSWCRAPALLSPWVYD 
GTGRRGAGGGGRRGGSGRTGAHGVGGRLRHGGLQVPSPAPREGS 
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341 

A 

587 

1097 

QKI INKWP I TGRQCCFVRLS PLESLKFQS KEE I RNTSHRANVTKRN 
SANRKTDAILATPGPPPAQPHGA/PGGYAPRLPQHPAPGCFPKCGNP 
APRPWYGPFSGPLLRKEQKRNPRKGTG*MQGVS PVPFPPT*NAGPLP 
GGSLGPPISGHHFLVSHTWPPPPRSPTAPP 

342 

A 

462 

1238 

SFYETKKLLISGSETLDSKDQYRINKYNHYFQTLHYTIHLKFSSGLS 
SATQ*KLSWSKGNG*GERPVSPPPIJMAPSGOLT.ATm/Wc:p\;K > c:vc-c^ , 

FLSCQQGHVLTSPGSSFPQCQPAQCYLIPCSSEKYTNGW/PGPIPPS 
LGIQSS/SSPGS*SGPISSSKYSGDAQVCFSSPNLFPSFRPVTGCIQ 
SPPNISPGAEKGCGLQPIPVHSFLTPSTFSPVSPI*RSHRSTHPETH 
SMSHSSQSSPQAPSALPPSISLPLSP 

343 

A 

2 

384 

LREDDRRGHRVGSDSRASCKMLLILLSVALIiALSSAQNIiNEDVSQEE 
SPSLIAGNPQGPSPQGGNKPQGPPPPPGKPQGPPPQGGNKPQGPPPP 
GKPQGPPPQGDKSRSPRSPPGKPQGPPQGQSPQ 

344 

A 

2 

624 

SDSRASCKMLLILLSVALLALSSAQNLNEDVSQEESPSLIAGNPQGP 
SPQGGNKPQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDK 
SRSPRSPPGKPQGPPPQ\GG*LQGPPPRGGIKPQGPPPPGKPQGPPA 
QGGSKSQSARAPPGKPQGPPQQEGNNPQGPPPPAGGNPQQPQAPPAG 
QPQGPPRPPQGGRPSRPPQ 

345 

A 

85 

194 

FVE P I RRQRTARS PAS PRLQCTRARTGAWARCCETS 

346 

A 

617 

712 

RRRSSTRTARSPASPRLQCTRARTGAWARCC 

347 

A 

2 

343 

SSVRAVEFPEDASGGSSPSGTSKSDANRASSGGGGGGLMEEMNKLLA 
KRRKAASQSDKPAEKKEDESQMEDPSTSPSPGTRAA\TSHLTPQRLA 
GSPGSGATRWRSLCPRFCPG 

348 

A 

2 

2355 

WCDLGSLQPPPPRFKQFSCLSLPRHS*TSQ*PQPPKTQLNFTVAIDF 
TTISNGETRMSEKVGGNPLQPTSLHYMSPYQIjSAYAMAIjKAVGEIIQD 
YDSDKLFPAYGFGAKLPPEGRI SHQFPLNNNDEDPNCAG IEGVLES Y 
FQSLRTVQLYGPTYFAPVINQVASNSWSSVTLGTDSEPAVEVPQYVG 
IRLLVEGFTIKXPMAMCHRRMGVRPAVPLLTQRGSGEGKDSGTPTHS 
LHTKAQLPSPHVXjRHQGQVLRRQHSKLVGTKALSTTGKAIjRTLPTAK 
VF I S LPPNLDFKV APS I LKPR KS I R EDRNGRS QKTVHTEGDMNMN I K 
KIVKQATVLTFTTALLAGGATQAFAKENNQKAYKETYGVSHITRHDM 
LQIPKQQQNEKYQVPQFDQSTIKNIESAKGLDVWDSWPLQNADGTVA 
EYNGYITWPALAGSPKDADnTQTVMyvninzrinMOTnciJVTvTar'DTTOTm 

*-» -i. i. v \_> j. .i 1 v v i r-k i ij jr s\uj^xju A o A. I X y IS. VbUNo lUu WlvIN/\Vji\. V t iSAJ 

SDKFDANDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSL 
TTAQVNVSKSDDTLKINGVEDHKTIFDGDGKTYQNVQQFIDEGNYTS 
GDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQGEESLFI^KAYYGGG 
TNFFRKES QKLQQS AKKRDAELANGALG 1 1 ELNND YTLKKVMKPL I T 
SNTVTDE I ERANVFKMNG KWYLFTDSRG SKMT I DG I NSND I YMLG YV 
SNSLTGPYKPl^KTGLVLQMGLDPNDVTFTYSHFAVPQAKGNNVVIT. 
SYMTNRGFFEDKKATFAPSFLMNIKGNKTSVVKNSILEPGQLAVN 

349 

A 

1129 

1822 

AWG S GKQORROEAOCEVAE FT PAWRRA PDTP A PT.VT.PPaR A * HTQ PK 
KHPPMWCRFFFFRQSFALVPQIGVQWRNLGSLQPLPPRFKA\FSYPQ 
P P P S S WH Y \ RHVP I FVFSANFC I FNKRWG F\ TMLAKL \ VT.NS * PHE I 
HPPGPSQS AG I MG VSHCTWLV\ FS FKWLVDYFFEKFPN * GNRFEFY 
FYLFETGSHSVNQAGVQR/PYHASLQPQPPQAQVSLPSSWDYRHVPF 

350 

A 

325 

609 

LMPCNPRHFERLRQADHLRSGVPRPALVQHGETLSVIiKIQKLGGHGS 
TCL*SQLLRRLRQKNLLSLGGRGCSKLRSHHCTPAWVTEP\DSVSKK 
K 

351 

A 

3 

220 

RGSGRGEKKKGCSGAAPGYLPWRHRGRGPWRMAFLLATCGGLDSGFV 
PSVQDFDKKLTEADAYLQILIEQLK 

352 

A 

319 

1981 

VPGVPAARPERECRAHPASFPPPPGGLDSGFVPSVQDFDKKLTEADA 
YLQILIEQLKLF* *QASK\RKEDEQRKKIETLKETTNSMVESIKHCI 
VLLQIAKDQSNAEKHADGMISTIKSCRCNIPA*SLGTCDQAQCLPRL 
PYLQNPLSCVSQSSVHLS*QVGPVIiSYLGTSSDSNTQIVPGSGNFTN 
LAASLTPSQAMSTLSPNTVPEFSYSRQ*/EDEFYDAD/ENSIKVGSS 
PKRLIDSSGSASVLTHSSSGNSLKRPDTTESLNSSLSNGTSDADLFD 
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SHDDRDDDAEAGSVEEHKSVIMHLLSQVRLGMDLTKVVLPTFILERR 
SLLEMYADFPAHPDLFVS I SDQKDPKDRMVQWKWYLSAFHAGRKGS 
VAKKPYNPILGEIFQCHWTLPNDTEEN\KEIiVSEGPVPGVSKNSVTF 
V\AEQVSHHPPIFSLFMLECFNKKIQFNAHIW\TKSKFLGMSIGVHN 
IGOGCVSCLDYDEHYILTFPNGYGRSTT.TVPWWT r , r r TrnsTTMr , c v-*rr« 

YSANIIFHTKPFYGG^KHRITAEIFSPNDKKSFCSIEGEWNG 

353 

A 

27 

750 

STCCFQELVTL * GCKLCTFHPGRKWETRL\ EPLPQRDPS YRWMLGE 
LRGIVFLTGIVSPRTENDOEISEDTR^wnvV T,T.nRT?OKTiTQnr , T vuw 
EAYEREVSLKRPLGNSPGERLNRKMPDFGQVTVE\RS*PPGEREA/R 
KYMDFGNSFTVNSNLISHQRLPVG/DTRPHKCDECSKSFNRTSDLIQ 
HQRIHTGEKPYECNE\CGKAFSQSSHLIQHQRIHTGEKPYECSDWGK 
TFSCSSDLILH 

354 

A 

69 

378 

KGGSLLGAPRVEGPNFGL\KDSSSPKKREFPALPPPGGGN/KRAGPP 
CPGKFGFLKKKGVPP\GGKRGLNPRPQGDSSS * PPKGGGITGGAPRP 
GKKRSFSLQKIL 

355 

A 

213 

660 

KPVILGYAEPPAPGQFSRGP/WSRAE/WPGTASPLLV/SCATSALL* 
SIPKGPDSHRGHQHGTPKTPSKPEKGTAHGQLSRYITPQEKEIAPLN 
PPALKSPSKENRDPRDLTGPGQTSRAIjTVAPGCALGPQVEPMKALRR 
QMGGAVFPLGS 

356 

A 

629 

1007 

KGTY I KQERLENLRTNHLDKYYNPPMKQNYQKliCQKGMASRTHREES 
R*^GNRCRLRITRNA*FEGENLQNK\AQGEI\TLEKKPEPATYI*EA* 
ES KRDKGV * KS YC VS S FPTMQ * KS D * S E YS YG 

357 

C 

1 

555 

MLYVLIESERARIKKLOEEKTRNLESSPKTiFPTTVc;TrHKr;r;T.T?TPr) r r 
DIDVLGQHLTKEHVSSHSQSPIRDSQWFTLSPQEKYACSYERKDPLI 
KRAFIIFYHPJ^AVVSHGGNGPAVVLPISRFVLTPPTFESTLPFLSSR 
LAWGTS S KDPR I AAGQQS PLEKKI L VS KLS SHHTVWNYTEKES * 

358 

A 

4627 

5052 

LS TKAPRKDTLFNKWCWENW I S I CRRLQLGS YLL P Y * K I NQKW I KGL 
NDRSETMKLLKENIGEMLKNIGLGKDFMVKTSKA*AT\KQKIGRWDY 
I KLKI CTAKKT/ 1 RVKRPTVDGNN I FKLSNRGL T <3 R VY KEHN TVTCK 
KKK 

359 

A 

1 

368 

KKQI FLEL I S I YS RVAG YK\NTQKS VAFL YNSNKQLNL * I K/ DTMPF 
ILAC/ NK* KHLPLNLTK* VQE I YEGNYDTFMK* I ED / DKQRN I S S S \ 
W*KRFIKMPILPNLIYRYNAFPVKISESYLVDI 

360 

A 

241 

583 

NGPPFFFFFEMEFSLLLPRLECNGAISAHRNLRLPGSSDSPASAGGL 
Ij * SQVAG I TRLRHHD * LI L Y / FLVEMRFHRVG * AGLELI/TSGDPPSS 
ASQGAGITGMSHSAGHYGKIF 

3 61 

A 

654 

942 

CHACNPQHFGRPRQVDHLRSGV*DQPGQHGETPSLLK\IQKLAGHGG 
VHL*S*LLRRLRQENRLNLGGGGCSEPRSHHCTPAWVTE*DSVSRRK 
KKK 

362 

A 

1 

2322 

MHHGKRELLPK^RRWHWGRQLLI^^ 

GDSVDI FKGI PFAAPTKALENPQPHPGWQGTLKAKNFKKRCLQATIT 
QD S T YGDEDCL YLN I WVPQGRKQGTCCTGTGDRLETGPG GAS G VTSR 
S P I S VS RDLP VM I W I YGGAFLMGSGHG ANFLNNYL YDGE E I ATRGNV 
IVVTFNYRVGPLGFLSTGDANLPGNYGLR 

DPNN I TLFGES AGGAS VS LQTLS PYNKGLI RRAI SQSG VALS PWVIQ 
KNPLFWAKKVAEKVGCPVGDAARM^^ 

E YPMLHYVGFVPVI DGDF I PADP INL YANAAD I D Y I AGTNNMDGH I F 
ASIDMPAINKGNKKVTEEDFYKLVSEFTITKGLRGAKTTFDVYTESW 
AQDPSQENKJXJCTVVDFETDVLFLVPTE I ALAQHRANAKS AKTYAYLF 
SHPSRMPVYPKWVGADHADDIQYVFGKPFATPTGYRPQDRTVSKAMI 
AYWTNFAKTGDPl^GPSAVPTHWEPYTTENSGYLEITKKMGSSSMKR 
SLRTNFLRYWTLTYlxALPTVTDQEATPVPPTGDSEATPVPPTGDSET 
APVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV 
PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPT 
GDSGAPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDS 
EAAPVPPTDDSKEAQMPAVIRF 
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363 

A 

8453 

9757 

APCLLGWSLPSRSAKTYAYLFSHPSRMPVYPKWVGADHADDIQYVFG 
KP F ATPTG YRPQDRTVS KAM I AYWTNFAKTG * DVG * VQGGGPQPRRA 
S HHEALFPHLPVEGLWASHLTS PC I G IHVCLRMRVTGRAPS PCTCTA 
SAQYAVRGMVPRASSEGGDGSGVQVESRASAPWESPAPAQPLLTLQ\ 
DPNMGDS AVPTHWEPYTTENSG YLE I TKKMGS SSMKRS LRTNFLR YW 
TLTYLALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSG 
APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP 
VPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGAPPVPP 

TGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDD 
SKEAQMPAVIRF 

364 

A 

10 

548 

FRLVLSPRLEGIGAILAH*NLCLLGSNDSPASATRVAGITGTRHHAP 
LIFVFLVEMGFRHVGQAGHKLLISGDPPA/SASQSAGITGVSH/ARL 
AALSNF * GLGQVPTPLKQATLLSLLDTSPSLPAI FFLFFF* DRFS \ S 
VTQAEVQWGDHCSLHP*PTGLK\DPPASASRLGTTRTRHHSR 

365 

A 

252 

934 

LCASLRLLPAPPMLSLSHRISPHPWPGVHYGC*GGRSGSDIGSCRET 
GLCRGFSPGCIKTSTS\PWAPAG*APSTGPSGQETCTAHSPFIPAPA 
* AQGLKF *GVTES ICAGAAGTAGGTETAVSAGS PAGQAQPGPGAGQS 
LS AGSQMA* GSDHPGP I WQTC * S VQGAFALDG *GG * LHPHPP*GPAT 
LPEGGGGVCPAAKNASGATREVPAVLQAVHAPGATWLRPP 

366 

A 

169 

459 

SLEVSLGDHFQKCRFLEKRERFWKGMFSFCSMAVSGFLAAGAGLKPR 

SG/WH*VFHSPRRSWTENFKFQRDEVSASPLGAWRALTKEKGGAQPL 
EGL 

367 

A 

186 

372 

QIRETSGTGT*YTSASQSAGIIEVSHHARQMFQFYSQKYTLPNCCKL 
* ILMFKIKFFLTQK. 

368 

A 

150 

799 

QC * DPETSETV* EKLRAGRHQRCCHGGQSCFDKS ATRALPPPRKWRH 
RGTRCAATKRTRLGEMAVXrGP*E*RQPEEFPRTTSILAEPTVIjSRRK 
CRPSEKRGSATEKFGATSAVTENPPLGAAGERANKTL/RCCHRDTDF 
RTPASDGERRRSHTEGPINRSLAMRMGKGGSIGTENRADWAGPGCKV 
CE S CS / RVLHLCVYGSHHR I WHMEG YQHML V 

369 

A 

1200 

1679 

RYKLLSYRKPETAIEQKENIPFQNLSRFSKNLHFWK*SPKETSRLYH 
ECLQLNS SCLII/TLAYPVQI I / CHLFLDEPGRMSDQRRAQNVPEMGF 
HHVGQ/ ACLNLSTS\DRPASASQSARIIEVSHHARQMFQFYSQKYTL 
PNCCKL* ILMFKIKFFLTQKQN 

370 

A 

1 

1107 

MTLE ATHLL LRVLLVLLAS GG S L FQS P PVKS 1 1 LG VAVTVVLLLDLL 
ALLINRYLQKARGRARKPYTEDVSVDKKTETWEWARGEGESYHVYDD 
VQKEKTTDRM I RS S SMC VHKI NTS VSMCAQETPNS SKKAJCLGQKI P V 
QG KDE I LH KALC FC PWLGKGGME PLRLL I LLFVTELSG AHNTTVFQG 
VAGQSLQVSCPYDSMKHWGRRKAWCRQLGEKGPCQRWS \ STTCGLL 
SFLRRWNWEHSHSQTIPWGGTLTITLRNLQPHDAGLYQCQSLHGSEA 
DTLR KVL VEVLAD PLDHRD AGDLWF PG \ DLRASRM PMWS TAS PGLR \ 
AREGR*WEEKPRRSPTRDQPSLHTCHLATRTPCSALARDYSA 

371 

A 

2 

1310 

EQQDLQDPLEVFHPTVEAAEALHIQEPPGGEPSQPTQLPGGAPNTRP 
PHPQGAIiWSV*RAPEQELHQDGRQPTG*QWLRTCSLGWPEAQLPAHL 
IGQADWPSQRPP*DPE*TPPRPQQNPGPSNWRTCFATPSCTGWA*AT 
TQIRMIENGKPELPGPPSGEVPLGQQQVGPGV\PSGFPGPQAPPGGL 
ILHSNNIHQSGVFNEFLFPWGFGGEAGPYYNGHSASSTTPCPTGRCK 
PATFRRRH* PPGHPVWATTKSRGM/ TAATAGPQWGSLGNTARHPDGE 
AEPGS*ARAQLASNPAPHLRVP*PQLRLPHSPPFPWVPKGCRWTQRP 
EGMTCYAWAHELRPALPPQPPRGTP * S FFLVHS PACPAQALPPNLYC 

PGGGWEVEAQHPAQMTPVFLEAPHPHWPTGG*VSPYPSGPAQGGAAS 
EVGGCLS I KETPCNV 

372 

A 

475 

929 

NLVACSLPISPETLSPVPL\TPKPSGLLALSIiGLAFLSCPGLATPSP 
LPTKPQGTAGPPQPCPCSGLAPQTLYLSRRRLGGWRPSIPAQMTPIN 
RPESQTPVFLEAPHPHWPTGG*VSPYPSGPAQGGAASEVGGCI»SIKE 
TPCNVKKKKKK 

373 

A 

2 

643 

PICLFSRPTLRPSRSKVSLIEGRGANMAARVmFWCVSVTMVVAIiLIV 
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CD VPSAS AQRKKEM VLS E KVS QLM E WTN KR P \ V I RMNGDKFRRLVKA 
PP\RNYSVI\VMFTA\LQL\HRQ\CWCKQADEEF\QIL\ANSW\RY 
S \ S AFTNKD I FFAM\ V\ DFDEGS/DMYFQMLKH* I S APNFPSTFLQK 
GKPKRG\DTY\ elqvrgffqlrqispwi \ADRTDVQY 

374 

A 

562 

767 

GKSISRYPLYPFVIFYFYFLETESCSSAQAGVQWRNLGSLQSPPSG\ 
S SNS PASAS * VAETTGERHHAQ 

375 

A 

23 

483 

NGVLLS SRLKCSGAI SAHCSLCLLGSSNS PTSASQVAG ITGAHHHTW 
ANFCI/YFF*VEMGFHHVG\QAGLELLTSDDP\SALASQSAGITG/G 
EHLRLAVTYLNKHERGSFLSSSCLPSAWWKWLCLCSCFWVPKCLALC 
HFKHYGRRKQVGHTSK 

376 

A 

129 

366 

NY YTHLYANI FDTLEERGKFL/ DKQNLS KLEQTGKAQGFYDLLKKLN 
* * L * SLLTKKASGRGPLLGNS FQF *RKKMTPTL 

377 

A 

67 

224 

NLYLKFSNSWLRTQIPKWKFNSVFLFTEYTDDNALIPKNSSVIVRRI 
PYWRC 

378 

A 

436 

829 

NLREFPWAPCPVLHYKFSSKLNLWIPVTL*GGSHILPSATLKKQIYG 
E\KKLKAADCDLQITNAQTKEEYTDDNALIPKNSSVIVRRIPIGGVK 
STSKTYVISRTEPAMATTKAVCKNTISHFFYTLLLPL 

379 

A 

74 

380 

GCTKDSNI YFI KEHFQS VNNH I KRCLIS FLIRVM * LKPQQVTI I \TP 

IRMAKI*KAENSKCWQECGAAVTLMH***V*LPLKSWHFLLKLSTCL 
F YD LAI PL 

380 

B 

1 

483 

MGQARWLTP V I PALWEAEAGGS RG P S AAGLLE FARG PLQTLFAWVS A 
AVAAEQR I FVNRECCCL I VPLE VLS QRS TRPCEVS VCP YWENAFQLG 
CSGTVTVGSTTVTLA£CLKMLTTFOLVSIGLAAVAWDVt J NTnnv\^T^ 
LNVAAEWNSFVNIPNDSLS * 

381 

B 

119 

955 

MVS ISRPRDPPAS ASQSAG ITGPALGAEGERAFHRHRVM I VTEGAPG 
GLS SGHLRDAMDS EAGTDDGCASHVRWHRDLKPSN I LYVDESGNPE 
CLRICDFGFAXQLRAENGLLMTPCYTANFVAPEVLKRQGYDEGCDIW 
SLG ILLYTMLAGYTPFANGPSDTPEE I LTRIGSGKFTLSGGNWNTVS 
ETAKDLVSKMLHVDPHQRLTAKQVLQHPWVTQKDKLPQSQLSHQDLQ 
LVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL* 

382 

A 

1 

662 

LG S CTVTQTG VQWCGH S S LRPQTQAQA I LL PCL SNCWD Y KHE PLHLA 
YVTQFCFLSFFFGGGGWSLALSPMLECGGVILAQCKLRIiLGFTPFSC 
LSLPSSWDHRHPPPCLANFLNF/SVETGFH\FLAQDGLDLLTS/S/I 
PPAS A/ FPKCWD YRR\ DHRAQPM * CNSD VNS LMLPQMPQVK/ AHQS P 
IGCPRFRHTCRLGCPQTTCTSHQLAANLEVPTMPSSSITH 

383 

A 

110 

784 

AAAAASKVLM* TGGAAPQEQP * TGGVQA* APGS VA/ AEGAS VEGPGF 
GDTAPAHQGLSPTRSHGQGGAGRASAHSRDPPGGTGGWGFLKCGPGA 
LCPERGKDGASASVPRGPYAEAEKGGWALRGLGGVAAPGPPSRAGQA 
PSGLHTGPNARPAPW P I PGQGGGLRRDQAG * VS S WTGSTEPGAHTAH 
RAPGHGGKGGSPQQPHPQGPGQI PTTHTRMHSCR I TAA 

384 

A 

244 

315 

NPKGQKDREAPLSRSVFVLKI KI KRAFALLLHGRF* PSTPAASASPA 
SKSPRGSGKALASALFYIVQLTK 

385 

A 

933 

1511 

QVYRPSQTPHLALSPERVAPGRLiAGGRLAPEARAPRGSP/LPPHRVS 
EKTIRVMVFHPGARRAGGTPPRAPRGDTGGAPGA\ PTYSTPLMSLHR 
ARLESSSTGSSFPADSAKPVPLAWSLDSR*GQWESRSSIHA\VTN* 
ASSSSSSSSSSS\FSR\VYPRFIEFLHFDIQSTGQ/RITSRQHPPR/ 
DFRDALF*LNS 

386 

A 

15 

, •• • 

468 

GVPNNFI I NKNKTPSGWLQPLPRWL * APVGGACH YPG LEAR TEE RAP 
CPSKYTGEPEVLRI KNQGGTPRPGGGPGPS APRGS FLWS T / PGSCPS 
ITPHSLPAAR/PLPPTPRHPAPSSPPCTE/SAPSGPSSSQAPGITVA 
PG I S QVQRAAALR 

387 

A 

50 

481 

FMEYLTLSFIHVLLQ*FIHCFVHLFIHSFFYALMKSLIQ*FIHLYSC 
F/ ICIDLLISSCI YSFIYALVKSLANSLTHSFIHTFLQ* FVHLFTPL 
LICSFTHSS IH* PSHSLTFIQLHLFFHALVSGFIWSFIYIjFCKINVF 
LIDK 

388 

A 

119 

1533 

KKMNLAE I CDNAKKGR E YALLGNYDS SMVY YQG VM \ QQD S RD IAQSV 
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QRSQLSKGKWPTRFGQBLD\EEYEQVKSIVKHFKICVFKIDKPSQISP 
VSLSR*THLEIPAVWPTP\VPAEHRAPPQIRR/RQSRSKTSEERNGR 
SRS PGTCRPSTS Y IKE * /NSLLQVGTRTI EPEGGDDKGRKNMOnnA c: 
DGEMPKFDGAGYDKDLVEALERD I VSRNPS I HWDDI ADLEEAKKLLR 
EAGVLPMWMPDFFKGIRRPWKNDSQNLGKPYSRAQDDIGMNIVGPLT 
HGFSSASAAPETTRQPLLFLLLLSLSKVKTFKGVLMVGPPGTGKTML 
AKAVATECGTTFFNVS S S TLTS KYRGESEKL VRLLFEMARFYS PTT I 
FI\DEIDSICSRRGTSDEHEASRRVKSELLIQMDGVGGALENDDPSK 

riVTWLAATNFPWDIDEALRRRCFIWVGDFPDVPQEKDTSVSPESCLL 
TDWWWGL 1 

389 

A 

293 

805 

KARDKSPFLGDLCLKDTGQQSLLQCRFFPFLGTKISCFQNVHSWQTI 
TSDGLSAGDGGFGPKYQ/RSRGQIPSGLNSLPLNSSESDLLPTPHFP 
LQSQGLGSQVAQIDDVKAPPPFPAEPQPKRLRSPPRPLLGAIESVAV 
SRSLPSGPTGTDAIi*WLSVQKPPTRTIHGNE 

390 

A 

2 

428 

GLPGSTTASAAAASAASASSDAPSFQLGKPRLQQTSFYGRFRHFLDI 
IDPRTLFVTERRLKEAVQLLED YKHGTLR PGVTNEQLWS AQKI K/ SG 

YIPFGTPIVVGLLLPNQTIJVSTVFWQWLNHSHNACVNYANRNATKPS 
PA 

391 

A 

3 

297 

ASASSDAPPFQLGKPRFQQTSFYGRFRHFLDIIDPRTLFVTEEEGCL 

CSKNALNKASEEEVALLDRGEEWCILILNGERRGKEEEEKNSGKESM 
ILR 

392 

A 

1 

885 

RRLREAVQLLED YKHGTLRPGVTNEQLWSAQKI KQAILHPDTNEKI F 
MPFRMSGICQGCLLVATSDAQQGKQEGMRPLSQGSELTRCHVX.PRAV 
S QS KLDDQAE PKS EE I NSFCDEAVARR 1 1 KGTLWCHGQC PGQQLSGK 
CSECYWAVSYDVPGGPSTVRGWGLLLPNQTLASTVFWQPSPASKFI 
QGYLGAVISAVSIAVGLNVLVQKANKFTPATRLLIQRFVPFPAVGPL 
DTDMQQLARETSVDPDMRKGLQELKAKGKLVDCKVSAQKXLSLLEKD 
EFKSGAHVDFYDK 

393 

A 

2 

675 

SCPLSCTSAATGLWVKEGLRIVSSVLLGLSCPWGSPLEAELDLGEG/ 
RLWRTSFYGRFRHFLDIIDPRTLFVTERRLREAVQLLEDYKHGTL.RP 
GVTNEQL* SAQKI SQ\ AI YHPDTNEKIFMPFRMSGYIPFGTPI WGL 
LLPNQTIASTVFWQWLNQSHNACVOTANRNATKPSPASKFIQGYLGA 
VISAVSIAVGLNVIiVQKANKFTPATRLLIQRFVPFPAV 

3 94 

A 

485 

1044 

HLVPLCPCNLSKAGRCTVQERLLVICCSRSWRWRNIiM*RWLNYAPVS 
SGHRH* SSLARETSVDPDMRKGPAGAEGKGKAVGCKGVS QETAEAYW 
KRTSSSLEPTWTLLTNKPMFFGFRNLFAPTFRHTPEALWLPTPCHRG 
SPALHIEAFMPAAIiPSGTASCELPGHWPYQLSGVLCCAPWVIKEA 

3 95 

A 

901 

1215 

SQHFGRPRWADHLRSGV*DQPGQHDETPSLLKIQKLAGYSGEHL*SQ 
LLRRLRHENCLNPGSGGCS EPRIiHHCTPAW / VAE PDS VGGGVGGKEK 
NNTSNMLQTRP 

396 

A 

110 

1014 

KTTRGYTLLHCAAAWGRLETLKAIiVELDVDIEAJLNFREERARDVAAR 
YSQTEC^EFLDWAGLNPGTATHDSDEEAAFLTRVWSTGHNDSGVKQR 
MGQKEIQNATSSNHFTCPSS\TLQRKKGV*FVGLLW*FRNVSTLLMK 
TFLPQARNSPPETPSGTITTQITGKICRMYFWSYMESEGPPESERAE 
FFSQQEEENEEEEAQEPEETGPKNPLLQPALTGDVEGLQKIFEDPEN 
PHHEQAMQLLLEEDIVGRNLLYAACMAGQSDVIRALAKYGVNLNEKT 
TRGDTLLHCAAAWGRLETL 

397 

A 

247 

496 

SLILFFYTFCQVRHFASKQDAPNSQPGSLASPAQTCGPSIPQSQGTG 
PAGGGFMFCFKQKTQPYFSLQKQKRKPKKIQPLNDK 

398 

A 

653 

1140 

RPFFPPHS ES QNP PRQ/ PDS S PTGTS T * PADTHCPR P VS QAQGGTAE 
GL**PRVHGPSPLPHS1GSAPPQCQEGAPAGEWAREPGEDGV\PGAP 
GQN*GKMEFSGAG/GQVPSRWEGVQSHLRSEGPPPACPAVPCAAEG 
PD \ GR PGNTGRGG PGLRNQRR W EE W 

399 

A 

2 

510 

FLANFLCFIRD/ GFHHVGQDGLHLLTS* SACLGLPKCWDYRREPLRP 
TSPLLSRPHQALCCLAFGHSAATCTCVN/SALAWAGFWNPQAPGF/S 
VAPAHLLTHTVSLSPMPLWATCSPGAGHEAPGLMLQDPFGPPWCSCG 
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PVFHEKTAGCKARAFSGRVCRCKAQALPETGR 

400 

A 

249 

434 

IVEYRGMWDYRLLFFFPETESCSVAQVGTQWHDLSSLQPLSPG\SSN 
SPAS AS *VAGITGTH 

401 

A 

532 

986 

PWAKLGLS CVPAWQVQVLAI CRWKEVG I LDFLFF FVFFFLR* S L/DS 
DTQAGVQWCDLGSLQTPLPGFKWFSCPSLPSSWDYRHAPPRSADFCI 
F/M* RWGFTMLAKLVSNS * PQ/ CDLPASASQSAGITGVSQCTRRNLG 
IFIDGTSVSDSSRT 

402 

A 

2 

992 

FVDAAGCPLRCSWTGIWLQKSGNRPEDLRAAOLPGRLELLLNNPC 
SAMS PAPDAAPAPAS I S LFD LS AD AP VFQGL SLVS HAPGEALARAPR 
TSCSGSGERESPERKLLQGVLWTISEKLFCSTCDQTFQNHQEQREHY 
KLDWHRFNrLKQRLQGQ\PLLSALGL*KAELHRRSFPAFSGIRRIQDS 
AS E EDLQTLDR ERATFE KL S RP PG FY PHRVL FQNAQGQF SLCLPLCP 
R/ LL I R I PQKRQNLLLTRTLQSRGPRDCVVLMAAAGHFAGA I FQGRE 

WTHKTFHRYTVRAK/QGHSQGAFG/NARGGPSHSAGANLR\RYNEA 
TLYKVS 

403 

A 

2583 

3827 

DRVSLLLPRLECNGAILAHCNLCL/ SGFK*FSCI>SLPSNWDYREVPP 
H\ QL I FVFLVEMG FHH I AQAGLELLTS / S I / PPTLA/ FPKCWDYRR\ 
DHHAWLFFFCSE\NSFTLLPRLE*GGLISDHCNLRIiLGSSDLPASAS 
Q\EPRLQACATTPR* FL YFW * RRG FTMLAR L VLNS * PQV/HPPASAS 
QSAEIIGVSHHTWPQEVFLFLNLFIYLRWSLALSPRLECSDGISAHC 
KLRPPGS S \ ILLPQPPE * LGLQAPATTPG * FFC I FSRDGFS PC * PGG 
S*SPDLVICLPRPPKVLGLQV*ATTPGYFYFFETGTHSVTWAAVQWY 
TIAHCSLELLSSSDPPTSSSQVIGTTGAPHHAHLIYLFIYTjFIFIEM 
RVPCWQAGLELLSSRDPFAWASQSAEITGMSHRAQLGIIFKAPRMS 
V 

404 

B 

1 

336 

MGRARWLTLVIPALWEAEAGGSREVEEYMGAEAVKSWTSSMFKEWDE 
KVRREKTS SRRARAAQCGGVFES PEYVHLLKVFLEDLDKQQQTNSKA 
SRRQEIIKIRAELEIKT* 

405 

A 

2 

313 

FFFFLRRSLALSPRLECSGVISAHCKLRLPGSR/RFSCLSLSHSWDY 
RRPPPRPANFLYFLVEKGFHCVSQDGLHLLTS/AIPP/TSA/FPKCW 
D YRCDSMNFLMVS KD 

406 

A 

4565 

5027 

LIGYILFCFFFFFFFFLRQSLVLLPGLECSGVILAHCNPR/LPGFK* 
FSCLSLPSSWNYRHAPARP\VSFVFLAEMGFHHIGQAGLKLLT/SV/ 
I PPTS A/ FR KCWD YRRE P PRPTA\ FFVLNKFTVLG SNS G F * T * DHR I 
NHFMRLYCNYTLEYSGSHL 

407 

A 

187 

743 

TRGTS AARTGRS SGIASMAS G I LVNVKEEVTCP I CLELLTQPLS LDC 
GHSFCQACLTANHKKSMLDKGESSCPVCRISYQPENIRPNRHVANIV 
EKLREVKLSPEGQKVDHCARHGEKLS\IFCQEDGEGSFCW\VC*AVF 
REHRGSPHVSSQRRFAREYQVKLQAALEMLRQKOOEAEELEADIR 

408 

A 

1234 

1337 

TRGPRGPADSCRKAWSEKAWQRGPWSWCPAAGASGGCGPLHPWLW/P 
PEPWVERATPT*GPCSRTPAPGSPLPGFLRPGLPAAVCRPPGSPCSE 
TVLF*EAE/PT/P*APVRIGGTFQPCAVLLPARMKVVQSSTVR*/VT 
VTRLPVQKVEVAGGWPGSHEKTLP/AA/PAWGLDRKPLQAPRSQLKP 
PNPQGTAESGQRWAHRYKCCFVSLTPCSSQLPVP*DLASKG*PSHT* 
ILGSF*DLCLPA*PMLQGPCPQPLSAP*VFLFPSL\S*VGALIGPFL 
ARPFSRGQLP*AWATPPWPS*PLAK/SPPDVACSLVPGPSFSPSHPS 
APWEPHL^APGALNVLSPLDLMCIPSPASYVAAKAPQAVNGGSCHT 
SLPELT*P\PATSCIGILRAGPCSLMGPPRPVKPW*RSIACLLGNSQ 
CLTPQPKTGSVPYGSPAVAWVP/ERLPPPGSP\PPAPIKVLPLMQAC 
VDHVFPWPEPPPATGGQLPF 

409 

A 

89 

596 

IILKTGSSLCHPRLEQSGVII,VHRNIiR\LPG\SSNSPASGSPVA*IT 
GARHHDRL I FVFLVEMGFCYVGQAGLELLTSSDPPTSAS QSAGVTGV 
SHCARPSFAL* F* KSA* YNLELCLFYHH* YSKCI IFHFLFLDYNCQN 
tilTINIESSLIFSILLISTFYCYLAEIPR 

410 

A 

372 

639 

RAFWQYFKSLNMFMSFDPAILLIGIYPKEKS* IQK/DTLCPEVSEIL 
LMWKNYN*YICHTRNDQAGLIY/HLYSGIYLALK*VCEFFTLKG 
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411 

A 

136 

251 

I HQEKPPN I FSVKKRHYD * PGQHDPLAS AS QS AG I TGV 

412 

A 

118 

337 

IHQEKPPNIFSVKKRHYD*PGQYGKTLSLLKIQI3oAGYSGTCL\KSQ 
LLRRVGREVIQLALKIRAPIWKIECIj 

413 

A 

160 

437 

KRD I TTS LGQ YGQNP \ S LLK I Q I LAG Y \ S GTCL \ KS QL» \ LRRLRHQN 
RLNLGGRG\GSEQRSCHLHSWGGHSETVSKKKKKRERQQWRQiGTCM 
P 

414 

A 

2 

150 

WLPSLLEPGLHPSNHVQGPAVGIDLGTTYSCVPI\GNRTTPSYVALR 
PG 

415 

A 

1 

1786 

MCLLVITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSD 
EGRYTCKVKNSGRYVWSHVILKVLVRPSKPKCELEGELTEGSDLTLQ 
CESSSGTEPIVYYWQRIREKEGEDERLPPKSRIDYNHPGRVLLQNLT 
MS Y SGL.YQCTAGNEAGKES CWRVTVQSTMFQGTLQLG IDLGTTYS \ 
C VG V FQQRKS PRL I AN\ DQGNR \ TTPS \ YVAFTGH * TG W I GD AAKNQ 
\ VAMNP\ TNTVF \ NAKRSDLDGRF * MNA VCPVT .T MKWW AT^YWVHFMM 
LGRPQGP K* DYKGEDQKAF Y PEEGCLLWVPDKD * KE I CR S PNLGEEL 
VTNAV\ VTVPAY\ FND\ SQRQATKDAGTIAGLNVLRI INE\PTAAAI 
AYGLDKKVG AERNVL I FDLGGGTFDVS I L.TI EDG I FEVKSTAGDTHL 
GGEDFDNRMVNHFIAEFKRKHKKDISENKRAVRRLRTACERAKRTLS 
SSTQAS I E IDSLYEGIDFYTS ITRARFEELNADLFRGTIjDPVEKALR 
DAKLDKSQIHDIVIjVGGSTRIPKIQKLLQDFFNGKELNKS inpdeav 

aygaemrqdkkellldpiaedfleevafdkkrmgfggf 

416 

A 

1 

154 

pqrrvarrkravlpstka*pentpstftsasmewasrsvhlghskrf 

GNLP 

417 

A 

1 

893 

PTRPCKmGEKVEKPDTKEKXPEAKK\VDAGGKVKKG^KAKKPKKG 

kp\hcsrnpvllrgigrysrsamysrkai^k\rkysaakskvekkkk 

EKVXJ^TVTKPVGGDKNGGTRVVKT,RKWPRYVPTPnVPRTn^.T»<?\ WRKTf 
\ PFSQHVRKLRAS I Y/TPGTILI IliTGRHRGKRWFLKQLASGLLLV 
TGPLVLNR\VPLRRTHQKFVI\ATFTKIDISN\VKIPKHIiTDAYFKK 
KKLRKPRHQEGEIFDTEKEKYEITEQRKIDQKAVDSQILPKIKAIPQ 
LQGYLRSVFALTNGIYPHKLVF 

418 

A 

934 

1272 

wcilglctslmsrtalfpsrhtffa/rshsaaskl/ekkkkekvlat 
vtkpvggdknggtrwklr kmpr yy pted vprkllshgkkp fs qhvr 
klrpsitpg\tiliiltg\rhrg 

419 

A 

1 

2607 

l^FAWWPCLILALLSSLAASGFPRSPFRLLGVANGIEVYSTKINSKV 
TSRFAHNVVTMRAVNRADTAKEVSFDVELPKTAFITNFTLTIDGVTY 
PGNVKEKEVAKKQYEKAVSQGKTAGLVKASGRKLEKFWSVNVAAGS 
KVTFELT YEELLKRHKGKYEMYLKVQPKQLVKHFE I EVD I FEPQG I S 
MLDAEASFITNDLLGSALTKSFSGKKGHVSFKPSLDQQRSCPTCTDS 
LLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPWPKNVAFVI 
DISGSMAGRKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHL 
VQATPENLQEARTFVKSMEDKGMTNINDGLLRGISMLNKAREEHRIP 
ERSTSIVIMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNN 
LNYNFLENMALENHGFARRIYEDSDADLQLQGFYEEVANPLLTGVEM 
EYPENAI LDLTQNTYQHFYDGSE I WAGRIiVDEDMNS FKADVKGHGA 
TNDLTFTEEVDMKEMEKAIjOERDYT FGNY TEPLW aYT.TT EOT.T.PKPTf 
NAHGEEK^LTARALDLSLKYHFVTPLTSKVVTKPEDNEDERAIADK 
PGEASYQPPQNPYYYVDGDPHFIIQIPEKDDALCFNIDEAPGTVLRI, 
I QDAVTGLTVNGQ I TGDKRG S PDS KTRKT Y FGKLG I ANAQMD FQ VEV 
TTEKIT\CGTG\P^\STFSWLDTVTVTQDGLSIWINRKNMVVSFGDG 
VTFVVVLHQWKKHPVHRDFLGFYVVDSHRMSAQTHGLL.GQFFQPFD 
FKVSDIRPGSDPTKPDATLVVKNHQLIVTRGSQKDYRKDASIGTKVV 
CWFVHNNGEGLIDGVHTDYIVPNLF 

420 

A 

1 

1842 

MEKALQKRDY I FGN YI ERLWAYLTI EQLLEKRSH ITCRKLSLPPPGF 
LVIPMSPSGLLQHIiAIFWAVITTGIVSTUWALIJ^PIENPDLKLAIV 
MLIVPFFVNGFYSQVAKPLLVDVDLQYPQDAVLAliTQNHHKQYYEGS 
E I VYLHE KH I LHRDLKTQNVFLTRTN 1 1 KVGDLG I ARVL» ENHCDMAS 
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TLIGTPYYMSPELFSNKPYNYKSDWALGCCVYEMATLKHAFNAKDM 
NSLVYRIIEGKWIL.TDSKASCGPWKPDWLGSMTPERASGRKLEKFTV 
SVNVAAGSKVTFELTYEEIjLKRHKGKYEMYLKVQPKQLVKHLSHTAS 
YQPPQNPYYYVDGDPHFIIQIPEKDDALCFNIDEAPGTVLRLIQDAV 
TGEACGLGPGPGGSSWEGLGPPRASGRTMEQFTIHLTVNPQSKVTFQ 
LTYEEVLKRNHMQYEIVIKVKPKQLVHHFEAGAQEAPIKKKRPPVKD 
EDLKGARGNLTKNQEIKSKTYQVMRECGPGAAFELSIGVIDGDPLES 
HMQACKQDRSSKLVSIMDFADFGTTIKQDFRLLGQTSVDRLLQLSQG 
QAVKGNQLLPVSLVKRKTTIAPNTQTASPRALADSLMQLARQVSRLE 
SGQ 

421 

A 

3 

379 

SYQPPQNPYYYVDGDPHFIIQIPEKDDALCFNIDEAPGTVLRIilQDA 
VTGEACGLGPGPGGSSWEGLGPPRASGRTMEQFTIHLTVNPQSKVTF 
QLTYEEVLKRNHMQYEIVIKVKPKQLVHHFE | 

422 

A 

1 

623 

MTKRCLDHRGEWLPGAGGGGHTEGTRCLHHAPVTWVGIEVDIFEPQG 
ISMLDAEASFITNDLLGSALTKSFSGKKPVWLRGRHTPKGNLDSEVL 
AGLSPCPIPI^GLTVNGQITGDKRGSPDSKTRKTYFGKLGIANAQMD 
FQVEVTTEKITLGTG\ RA\ STFS WLDTVTVTQDG * APLQGLQGGLQG 
EGDHSGPQPNPGALSEPELV 

423 

A 

3 

2722 

FSDGLCKVALSHLGSALQIjGSLCFPRSPFRLLGKRSLPEGVANGIEV 
YSTKINSKVTSRFAHNVVTMRAVNRADTAKEVSFDVELPKTAFITNF 
TLTIDGVTYPGNVKEKEVAKKQYEKAVSQGKTAGLVKASGRKLEKFT 
VSVNVAAGSKVTFELTYEELLK^KGKYE^1YLKVQPKQLVKHFEIEV 
DIFEPQG I SMLDAEAS F I TNDLLGS ALTKS FS GKKGHVS FKPS LDQQ 
RSCPTCTDSLLNGDFTITYDVNRESPGNVQIVNGYFVHFFAPQGLPV 
VPKNVAFVIDISGSMAGRKLEQTKEALLRILEDMKEEDYLNFILFSG 
DVSTWKEHLVQATPENLQEARTFVKSMEDKGMTNTINDGIiLRGISMLN 
KAREEHRI PERSTS I VIMLTDGDANVGESRPEKIQENVRNAIGGKFP 
LYNLGFGNNLNYNFLENMALENHGFARRIYEDSDADLrQLQGFYEEVA 
NPLLTGVEMEYPENAILDL.TQNTYQHFYDGSEIWAGRLVDEDMNSF 
KADVKGHGATNDLTFTEEVDMKEMEKALQERDYIFGNYIERLWAYLT 
IEQLLEKRKNAHGEEKFINIjTARALDLSLKYHFVTPLTSMVVTKPEDN 
EDERAIADKPGEDAEATPVSPAMSYLTSYQPPQNPYYYVDGDPH/FS 
IIQIPEKDDALCFNIDEAPGTVLRIjIQDAVTGLTVNGQITG\DKRGS 

pdsktrktyfgktgaspmaqmgfpgwevttekitlleqarcraffsw 
ldtvtvt\qdghflassrrlsmminrjcnmvvs fgdgvtf*vwlhq/v 
cwkjoipvptvdflgfyvvdshrmsaqthgllgqffqpfdfkvsdirp 
gsdptkpdatlvvtowqlivtrgsqkdyrkdasigtkvvcwfvhnng 
egl i dg vhtd y i vpnl f 

424 

A 

1168 

1629 

RAGRGGEGHKLNSYGGRRARSQGHLLSSALSPFVSAASYQPPQNPYY 
YVDGDPHFIIQIPEKDDALCFNIDEAPGTG\LRLIQDAVTG\liTVNG 
QITGDKRGSPDSKTRKTYFGKLGIANAQMDFQVEVTTEKIT\CGTG\ 
RA\ STFS WLDTVTVT 

425 

A 

87 

508 

MG C F I REGGEEGNL 1 1 KKRFVS EAE L \ DVRRKRRQEEWENVRKPBDP 
EECPEEVYDPRSLYERLQEQKDRKQQELRGTVSNCKNM\VRG\LDED 
ETNFLDEVSRQQELIEMHRIEEELIELNEYRI7VLWNVGISYENYEGR 
G 

426 

A 

90 

549 

QGASQTTRSETSCLGIAR* TFLGL\ FVRIERIDGNLKMLHIVEPYVT 
WGFPNLKSVRELILKRGQAKVKNKTIPLTDNTVIEEHLGKFGFICLE 
DLIHEIAFPGKHFQEISWFLRPFHLSVAHHATKNRVGFLKEMGTPGY 
RGERINQLIRQLN 

427 

A 

215 

870 

RAPPNGLTDQQKKGKGLRFKRLESFLHDSWRQKRDKVRLRRLEVKPH 
ALELPDKHSLAFVVRIERIDGVSLLVQRTIARljRLKKIFSGVFVKVT 
PHNLKMLRIVEPYVTWGFPNLKSVRELILK\RGQAKVKNKTIPLTRH 
/ S Q * WRS TWGKFG VI CLEDL IHE I AFPGKHFQE I S WFLC PFHRLSGP 
SCYQK* SGLPQGDGHTWAIGVNASNQLIRQLN 

428 

A 

476 

821 

TFTWPLFVRIERIDGVSFTWCREPLQDFALRKIFSGVFCKGTPQDSK 
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M \ LR I VEP YVTWGFPNLKS VRBL ILKRGQA\ KVKNKTI PLTDNTVI B 
EHLGECYSLGVSWGRKPGSLKL 

429 

A 

33 

371 

ATRSRLGKHTRRSKSSAI QAVR 1 1 SQVLRCKQRCMHCPRTI PATLQE 
HKSVNNPSAQQRTKTWSIHTPAPHPH\PAPHPSTHQ/HPHPST/PAP 
HPSTHQHLIHPHTSISSIRTRG \ 

430 

A 

42 

334 

RRFRAAAVMAPPGCPGFCPNFVLVCSFLERYGPLLDLPELPLPELER 

VLQAPPPDVGNGEVPKELVELHLKLMRKIGKSVTADRWEKYLIKKSK 
RVG ' 

431 

A 

55 

1079 

SERRRGPLCLPSDSVLDCSGPPMYCGIAEPSLRRWDPQRSPLESASC 

RLLGCLAIPRHPNFPRPPIASSSILQIKMATAAAAAAVMAPPGCPGS 

CPNFAVVCSFLERYGPLLDLPELPFPELERVLQAPPPDVGNGEVPKE 

LVELHLKLMRKIGKSVTADRWEKYLIKICQEFNSTWAWEMEKKGYTiE 

MSVECKLALLKYL\CECQFDDNLKFKNIINEEDADTMRLQPIGRDKD 

GLMYWYQLDQDHNVRMYIEEQDDQDGSSWKCIVR2TRNELAETLALLK 

AQIDPVLLKNSSQQDNSSRESPSLEDEETKKEEETPKQEEQKESEKM 
KSEEQPMDLENRS 

432 

A 

1 

568 

ASGPELPGRRFRDRAPWLPARLLRGDLSVWVSLTALGPGSFCRRRVP 
S LAQLGHS EAAPS PDD VRWSR VPDRCPEERDRAWPPPPP PSLPPS FR 
RNMANN S PALTGNSQPQHQAAAAAAQQQQQCGGGGATKP AVSGKQGN 
VLPLWGNEKTMNLNPMI LTNI LS S P YFKVQL YELKTYHE WDE I YFK 
D 

433 

A 

1 

572 

ANNSPALTGNSQPQHQAA7VAAAQQQQQCGGGGATKPAVSGKQGNVLP 
LWGNEKTMNLNPM I LTNI LSS P YFKVQL YELKTYHE WDE I YFKVTH 
VEPWEKGSRKTAGQTGMCGGVRGVGTGGIVSTAFCLLYKLFTLKLTR 
KQVMGL I THTDS P Y I RALG FMYI R YTQPP \ KDLWD WFES FLDDEEVC 
QQG 

434 

A 

6 

223 

TVTNFGE I SGTIAI EMDEGTY IHALDNGLFTLGAPHK/ ER IALKSGY 
GKYLG INSDELVGHSDAIGPREQWEH 

435 

A 

3 

687 

RTQSGNVYFASTSVPPRPGAAPGHILVLSRLQPLRAEVSRSRLLRVE 
YS YVKTTKLVFKGTKAKS KKKKS KDKK * QREEDEETQLD I VG I WWTV 
TNFGEISGTIAIEMDKGTYIHALDNGLFTLGAPHKEVDEGPSPPEQF 
TA\VKLSDSRIALKSGYGKYLGINSDGLWGRSDAIGPREQWEPVFK 
\MGKMALSTSNSCFIRI\HEAEDIEAKSKTAGEEEMIKIRS 

436 

A 

115 

663 

AGFSLSAQKS PGAMA* YSYVKS S KLVLKGT\ S KKKKS KEKKRKRE ED 
EETQFD/ 1 VGI W* TVTNFDE I SGTIAI EMDEGTY IHALDNGLFTLGA 
PHK/ERIALKSGYGKYLGINSDELW/GRSDAIGPREQWEPVFQNGA 
CAAVFTVIGSEKQSECSLLRESRAKYHGCTHGQISSSLKQHPRWMY 

437 

A 

1 

381 

GPPASVRTDSGHPRRRAEGAVSTRTQADPLAPCGPPRPAGGQGAYAR 
LLLLQCLLAAARPSSA\DGSAPDSPFTSPPLREEIMSNNFSLESHNI 
SLTEHSSMPVENNITLERPSNVNLTCQFTTSGD 

438 

A 

3 

217 

RVMSHRKV\ SAPRRGSLGFLPRKRS SRHRGKVNSLPKEDLVHPLHLT 
CYLEDS PAFAD I VRNVYKTGG S AMN 

439 ! 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLWYPWTQRFFDSFGNLS 

SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 

HVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

a a n 


37 

527 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLLWYPWTQR 
FFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIK\HLG*SQGAPF 
AQA*SELHC\DKPAMWDPENFKAPGEMLLVTRFGQSHFRAKNFTPEG 
CKAS WAERWVT \ GV\ AS ALVP SR YH 

441 

A 

68 

398 

GPASNRALG F WLLETRMV S AVSDPPTSTTGS TMCECI S I HVGQAGV 

QMGNACWELYCLEHDIQPSGTMPSHKALWSSDNSFNTFFRETQPGRH 
V\ PGLSVDLEPAVI AQ 

442 

A 

3 

400 

CFRDMAFNFGAPSGTSGTAAATAAPAGGFGGFGTTSTTAGSAFSFSA 
PTNTGTTALGGLFSQPTQAPTQSNQLINTASALSAPTLLGDERDAIL 
AKWNQLQAFWGTGKGYFNNNI PPAEFTQENPFCRFKAV 
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443 

A 

1 

255 

PAVNFN I GN FG VS TTS AT PFNFGNS LAS AE KQ VL YEQQ 

444 

A 

2 

733 

YLVSCFRDMAFNFGAPSGTSGTAAATAAPG\GFGGFGTTSTTAGSAF 
SFSAPTNTGTTGLFGGTQNKGFGFGTGFGTTTGTSTGLGTGLGTGLG 
FGGFNTQQQQQT\AVGYSCMPSNKDEDGLWLVFNKKETEIRSQQQQ 
LVESLHKVLGGNOTLTVNVEGTKTLP\ DDOTPWTY\An?ucDMrTCD 

RVPATTLYAHFEQANIKTQLQQLGVTLSMTRTELSPAQIKQLLQNPP 
AGMFLESQSS 

445 

A 

2 

683 

AREGSAIiRVAGVTYLVSCFRDMAFNFGAPSGTCGTAAATAAPA/GNT 
FLLFGGGFGGFGTTSTTAGSAFSFSAPTNTf2TTf!T pr:r: r ^r>tJT<'^ , T?r , pr , 

TGFGTTTGTSTGLGTGLGTGLGFGGFNTQQQQQQTTL\GGLFSQPTQ 
APTQSNQLINTASALSAPTLLGDERDAILAKWNQLQ\AFWGTGKGYF 
NNNIPPVEFTQENPFCRFKAVGYSCMPSNKDEDGLVVLVFK 

446 

A 

1 

461 

HRRPLYPAGIARMTRGDRTRTGCTRPKGSRTGGAVGAAMQEIIGSVD 
HIKFDLEIAVEQQLGAQPLPFPGMD/KTPDSQAPISTVWGLLLPSLG 
LRSSLPRI FGPLRLFKESGAAVCEFFLKAACnKRGMP'PFP'HT ^nvycr 
WCKHCLRGLCMN 

447 

A 

1 

811 

DGTMEbSEAVQMATALIEQRlxAQEEENEKLRGDARQKLPMDLLVLED 

EKHHGAQSAALQKVKGQERVRKTSLDLRREIIDVGGIQNLIEL\RKK 

RK\QKKRDALAASHEPPPEPEEITGPVDEETFLKAAVEGKMKVIEKF 

IiADGGS ADTCDQ FRRTALHRAS LEGUME I LE KLLDNGATVDFQDRLD 

CTAMHWACRGGHLEVVKLLQSHGADTNVRDKL 

IGEHFLSLGLENNARNREGDTALH\DAGRVNRYPILQ 

448 

A 

100 

1534 

EARRNWGAMGKARRSPPGHHRHCEGCFNRHCHIPVEPOT 

LLCGATFHMCKEAEHQLLCPLEQVPCmSEYGCPLSMYRH^ 

ACPASVVCCSMEWNRWPNVDSETTLHENIMKETPSEECLDTAIiALQD 

OTCVTT.FP QT.1CMVT<*T ■T« , D77 p P"Dt? RTPUT? CP F^rKT/" 1 T 1 7t^ tt»vh tv t t/-»/-»i tt\ t/-*t 

vxvv jjr i^oxjfu , 'i v jc»jjr rDlKriAl lilStjJr XrlXvvj£»X b VfiEMGGAVGGVD I GL 
VPHGLS ATNGEMAELSQEER15VL AKTKEGMDLVKFGQWENI FS KEHA 
ASALTOSSASCESKNKNDSEKEQISSGHNi^ 
QQDVRTAjTOTTGLAPWQDGVLERLKTAVDAKDY 

FGOMPACTPKERDFVYG\ KT.PZXOPVT<r r T T \rv r r'Ptr\7D , \7c vr^r* v"td tvdt rnj\ 
MLSCKPSEHKAVDTSDLGITVEDLiPKSDLIKTTLQCALERELKGHVI 
SESRSIDGLFMDFATQTYNFEPEQISSGTELADLTAATPGGLHVELH 
SECVTKRHN 

449 

A 

124 

602 

WATTPFQPKSNSKMAELFMECEEEELEPWQKKVKEVEDDDDDEPIFV 
GE I S S SKPA I SNI LNRVNPS S YSRGLKNG AT. <3 P G T T A ZX F TCPT<? nHYT 

NPTSNP\VPASPIKF\HPESRSSDSSVIVQPFSKPVSVSKTIRPAQG 
SIGCCLSISTVPSYNSGLS 

450 

A 

1 

369 

I CRFNP YVFLAAKADQLQVRTTQLT/ CKSCWLYHC I NHS TLQTCNI S 
TL/M/ILGHIPGLWILINLSEPWDATPVWHFVKLLLTKLTHHACRAS 
GI\ I IFAIVSLVTLITSAVMSSVALHSSVQMAQ 

451 

A 

126 

374 

GRGTSKI LMALQRTH S LLLLLLLTLLGLGLAYPS YGHDVL Y \ HRWA 
DNGHLEETLVM\DRCLSVMILLRIISRDDCERLIGIxA 

452 

B 

1 

1812 

MPSPSPLFFVATIHDCQELLETTETGPSDLQDVPLEKADATVFTDGS 

SFLEQGVRKAGAAVTTETDVLWAHAL PANTS AQKAELIALTQALRWG 

KDINVDTDSRYAFATVHVHGAICQERRLL 

RSSSSLGNNljRPDGSMYMHRSKGVPAVSYiWNILTPSESL 

TKKLSPEKEIYEMESLQWENMGKRINHHLQYNGL^ 

EASQEGLYMCVKITCEEKATESHSTSSTFHRIIPTKEKIjYKSPNPSQ 

DTGYSQHHTRTTFSKRQPKNCERFPVEGRPLTLPAQTSXGYSGSKPD 

VITLLEQGKEPCWAP^VTRRQCPAAPIWDSLIARVGVMARGNAITL 

PVCGRDVKFTLEVLRGDSVEKTSRVWSGNERDQELLTEDAIJDDLIPS 

FLLTGQQTPAFGRRVSGVIEIADGSRRRKAAALTESDYRVIiVGELDD 

EQMAALSRLGNDYRPTSAYERGQRYASRLQKEFAGNISALADADNIS 

RKNITRCINTAKLPKSWALFSHPGDYLPVPQAKGNNVVITSYMTNR 

GFFEDKJCATFAPSFLMNIKGNKTSWK3JSILEQGQLTVN* 
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453 

A 

305 

1089 

NHWPMHLVTFRDVAI DF\ SQKEWECLDTTQRK\ LSRDVM\ LENYNNL 
GLTG/SDILGSKPWITLLEQGKEPCWARDVTGRQCPGLLSRHKTK 
KXSSEKDIHEISLSKESIIEKSKTLRLKGSIFRNEWQNKSEFEGQQG 
LKERS I SQKKI \ SLKKMS ADRKRPSFTLNQRIHNSEKSCDSHLVQHG 
KIDSDVKHDCKECGSTFNNVYQLTLHQKIHTGEKSCKCEKCGKVFSH 
SYQLTLHQRFHTGEKPYECQECGKTFTLYP 

454 

A 

1 

2088 

PTQSTRRIATVSIAAAVAPLTLFLYRGDGGLSSRRRADAAAGALCGE 
VAVKPPINPFTEFMEKAVNDGSHSEELFCHLKTISEKEDLPRCTSES 
HLSCLKQDILNEKTELEATLKEAELVTHSVELLLPLFKDTIEKINFE 
NANLS ALNLKI SEQKEILI KELDTFKS VKLALEHLLRKRD YKQTGDN 
LSSMLLENLTDNESENTNLKKKVFEKEAHIQELSCLFQSEKANTLKA 
NRFSQSVKVVHERLQIQIHKREAENDKLKEYVKSLETKIAKWNLQSR 
MNKNEAIVMKEASRQKWALKKASKVYKQRLDHFTGAIEKLTSQIRD 
yisAlu_ii>.fc. 1 XoAbNAWKSHYEKIVIEKTELEVQIETMKKQIINliLEDL 
KKMEDHGKNSCEE ILRKVHS I EHENETLNLENTKLKLRFPCRITESK 
NMN I L I VLDMLCY I S S EKTTLAALKDE WS VENE LS ELQEVEKKQKT 
Xj±E*nl J\>luvyi\jjyii>\A±,x v AoKL.c*NljJjHl\JJNy 1 X KTKNKNVEKMRGQ 
MESHLKELERVCDSLTAAERRLHECQESLQCCKGKCADQEHTIRELQ 
GQVDGNHNLLTKLSLEEENCLIQLKCE^QQKLEQMDAENKELEKKL 
ANQEECLKHSNLKFKEKSAEYTALARQLEAALEEGRQKVAEEIEKMS 
SRESALQIKILDLETELRKKNEEQNQLVCKMNSDPETP 

455 

A 

140 

511 

PTQSTRRIATVSIAAAVAPLTLFLYRGDGGLSSRRRADAAGG/APFS 
L»CGEVAVKPPINPFTELM\ENAVNDGSHSEELFCHLKTISQKED\liT 
RCT\ SESHLSWYSHQYQGKSKFPILGFPVLS I I 




1014 

MLLKLQCAHR S PGETFKI NSDS VGLGEAS AD S S YKLLVD ANVAERAA 
GLCCRLWPCHKGMPRLTDLSVKTKDVWEIPRESLQLIKRLGNGQFG 
EWMEKADGLCFNLTVIASSCTPQTSGIiAKDAWEVARRSLCLEKKLG 
QGCFAEVWLEIPVYPLKKGSKRFAKPSFPVGEVPYCACRELAVGHGP 
STPHPNDSSQPSRKEGDWWEARSLTTGETGYIPSNYVAPVDSIQAEE 
RAENEAQVSAQGRKDPGDGRNKALQLVGGSVARVLTATHGSGAGGMD 
TTCRRDIEYEEGRKRKRTKKKRELESQMPRGLCCGLSGGPAVQYGHN 
LTRETMVEK 

457 

A 

577 

2195 

IMGCVQCKDKEATKLTEERDGSLNQSSGYRYGTDPTPQHYPSFGVTS 
I PNYNNFHAAGGQGLTVFGGVNSS SHTGTLRTRGGTGVTLFVALYD Y 
EARTEDDLS FHKGEKFQ ILNSS EGDWWE ARS LTTGETG Y I PSNYVAP 
VDSIQAEEWYFGKLGRKDAERQLLSFGNPRGTFLIRESETTKGAYSL 
S 1 RD WDDMKGDHVKHYKI RKLDNGG YY I TTRAQFETLQQIiVQHYSER 
AAGLCCRLWPCHKGMPRLTDLS VKTKDVWE I PRES LQL IKRLGNGQ 
FGEVWNGT \ WN\ GNTKVA I KTLKPGTMS PES FLEEAQIMKKLKHDKL 
vyjjiAv v t>u, \ j. x xv l ii x MNJv(jSIjIjDFIjKDGEGRAIjKXiPN^ 
QVAAGMA Y I ERMNY IHRDLRS AN I LVGNGLICKIADFGLARIj I EDNE 
YTARQGAKFP I KWTAPERALYGRFT IKSDVW S FG I LLTELiVTKGRVP 
YPGMNNREVLEQVERGYRMPCPQDCPISLHELMIHCWKKDPEERPTF 
EYLQSFLEDYFTATEPQYQPGENL 

458 


561 


Vj/A_L»U>\>vo v JPK<- vWboAuv VALt Ji.bHCAP.Li VwVYTYECCHYMCSAIjIjS 
LSCPCPAPSERAAGLCCRLWPCHK\GMPRLTDLSVKTKDVWEIPRE 

459 

A 

46 

949 

ARRPRPWLSSARTPRRPFWRPPSCCSPMLTTSSETLMMKNIDPSGLE 
TQPF/YEVLQSNIQHVIiVYENPALQEKAIjACIPVQELKRKSQEKIjSR 
ARKLDKGINISDEDFLLLELLHWFKEEFFHWVNNVLCSKCGGQTRSR 
DRSLLPSDDELKWGAKEVEDHYCDACQFSNRFPRYNNPEKLLETRCG 
PWCE 

460 

A 

3 

284 

FSFLVFSSPLLVFLCEQDCFHHCRKHLLSLVGFSELAFCSDQLILSD 
GTLQRCLRGCASAPRSCWFCWLPGPLDRVLSQTALRVHSHPRKAW 

461 

A 

420 

690 

AAEGRGPAQGFHSLQRFLPSVNGIKPKCIIFLAWATLYQPGSREHRR 
SRPIPEPFLPKAEPPTTAWEACFQDLLEPGAIRQDQLIRTES 


JL O O S 4-6. 11+3 . O iS El! i& U 2 


462 

A 

557 

753 

I IEIGIMEQRKPWKKICYFICCASHAKGSL/ CRRMKRAPVYYGRTVP 
SPDGS IAHGEYHVLLELNTE 

463 

C 

137 

274 

MRKQLLLPQKKRRVYLLR I TKKVEM I PQDNENPGNTNCHDVVLVQ * 

464 

A 

1 

2155 

MKKKKMKEEEEKEKKEEEEEEEEEERRKEEDDFCNRTSPAGGSEGPG 
QESEQPAQPPEQAGGRPGASPAPDEDAEAAGAEQGGDSTEATAKPKR 
S FYAARDLYKYRHQYPWKLSQFHS S VSHRRKE IDCKAFKWQDLAFFG 
ELTRGKRSQHNYLR ITRI LKSLGELGYES FKS PLVKFILHEALVENT 
IPNIKQSALEYFVYTIRDRRERRKLLRFAQKHYTPSENFIWGPPRKE 
QSEGSKAQKMSSPLASSHNSQTSMHKKAKDSKNSSSAVHLNSKTAED 
KKVAPKEPVEETDRPSPEPSNEAAKPRNTEKDSNAENMNSQPEKTVT 
TPTEKKESVSPENNEEGGNDNQDNENPGNTNCHDWLPRSPAPAAAS 
SMGNLLGGVSFREPTTVEDCDSTWQTDSEPEPEEPGPGGGSEGPGQE 
SEQPAQPPEQAGGRPGASPAPDEDAEAAGAEQGGDSTEATAKPKRSF 
YAARDLYKYRHQYPQNFKDIRYQNDLSNLRFYKNKI PFKPDGVYI EE 
VLSKWKGDYEKLEHNHTYILWL\LPLTEQGLNFYAKELTTYEIDESS 
KTKEAI IRFPPWAYKNDAR I FWNKT/GLI KLGMFARGVYWQES FQHL 
NESQHNYLRITRILKSLGELGl\ESFKSPLVKFILHEAIjVGGILFPI 
LRQSAL\EYFVYTIRRTGRRKGRKA/LCGFAP/WKHYTAFQGNFIWG 
DPPSEKEQFGGGGQRPREM 

465 

A 

226 

487 

NWKTKIITSMLCWGNAS FGQLGLGGIDEEIVLEPRKSDFFINKRVRD 
VGCGLRHTVFVLDDGTVYTCGCNDLG\QLGHEKSRKKPEF 

466 

A 

674 

1026 

LWAEGTLVWGS I REWLQHT P PNG I RDWAKQRMWRTGQ PQPAPTRVN I 
SRPSPTPFPRFTTKLMWALGTDPFTHTHTHSHSFAHIHSCTHA/HVQ 
EHTYTYSLPHTSRPFLKGSKSTPKP 

467 

A 

153 

380 

PRSWGQDTTPSRGERSPNSSGTPGLPPPGGHPFPSQPVRPQLLPPTP 
DLGPGPGGQAGIQG\PSGSPTQDGANSRL 

468 

A 

140 

755 

TAMSSEEGKLFVGGLNFNTDEQAltED\HHFSS FGP I SEWWKDRGT 
SSGPGGFGFITFTNPEACFKLPMKKPMNRRSLLEWSVRSRCWIMQAK 
SASGEPEGGGLLGAHWGLGSQLTFKKWGGPGLIGELSFAMRAMNGE 
SLDGRQIRVDHAGKSARGTRGGGFGAHGRGRSYSRGGGDQGYGSGRY 
YDS\RPRGYRYRY\GRSRD 

469 

A 

2 

259 

GGI\DLAFPH/HENEIAQCEVFHQCEQWGNYFLHSGHLHAKGKEEKM 
SKSLKNYITIKPSTTVTAPCSKLSSCSWGWALSWRTHVPT 

470 

A 

198 

353 

PRPAGFSTFQETAARLPCIWWWTSWCGSGRRSGSLRWPCPRPRGTPG 
GSSS 

471 

A 

103 

484 

NTAVSTTVAGSCLASESAGAPLPSALSVGLGVPAPWNKGRSCLPGRP 
STGEDLGADFIPNPTLGLYIG\NVQVPIRFLGSHVSLRPESLLTQLR 
LNAFVFVSGHLHAKGKEEKMSKSLKNYITIKKK 

472 

A 

98 

1339 

GLRAAQAWAPRCSRPRWALGGLGGTGLRSLSASGGRGRAWLQPTGRE 
TGVOVYN S LTGRKE PL I VAHAEAAS WYS CG PTVYDHAHTifiHAP c; YVP 
FD 1 1 RR I LTKVFGCS I VMVMG I TDVDDKI I KRANEMNI S PAS LAS L Y 
EEDFKQDMAALKVLPPTVYLRVTEN IPHIISFIE\RI I ARGNAYSTA 
KGNVYFDLKSTGDKYGKLVGEVPGPVGEPADSDKRHASDFALWKAAK 
PQEVFWAS PWGPGRPGWH I \ EC S C\ LASNG I WKVNLD I H SGG I DLAF 
PHHENEI AQCEVFHQCEQWGNYFLHSGHLHAKGKEEKMS KSLKNY IT 
IKDFLKTFSPDVFRFFCLRSSYRSAIDYSDSAMLQ\VSSCSWGLGSF 
LEDARALHGRGAAGLRLRQGSDAVGEALQHQEGREGGLGR 

473 

A 

414 

833 

NVLVICQRNSNSFKKLFKVRDDGEQTQQDFSALQ/AS/SQGRPELVH 
SRGVQGPGWHQPPWVCQNPLAKAAFTALLVLESLREEERTVTAEGAP 
LSPGGPHTSPSPGRDAGGSNLGTFSRSYSQRLEFLSSRKIiCNINCKL 
S 

474 

A 

635 

901 

PHPEPKGPKLRA\ PLSPPDFPPGSRLAGSPGA\ PPPPPHPEDRLHQG 
SSHPHPIPRLLERSLAQGGAAPLGPPRMPHGAAPQGCSPSTPT 

475 

A 

816 

1099 

HFGEPFFFLRQSL/NSVAQAGV\KWRDPSSLQPLPPGFKRFSCPSLP 
SSWDYRRVPPHLANFCIF/MSRRGFIMLARLAQPQ/CDPPISASQSA 
RITGV 
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476 

A 

3 

232 

ooiMMJA>\iDortvjij^i\^v/\V o.t'r ir/v W JKrCJJXM x £»/VK^^xjJsJrKxl}AVVIGAGH 
NGLVAVSTSPLDPTSLALGVWLSAAWEGV 

477 

A 

2 

732 

FVPGAEMAASGRCLCKAVAAS PFPAWRRDNTEARGGLKPBYDAGVIG 
AGHNGLVAAAYLQRLGVNTAVFERRHV I GGAAVTEE 1 1 PGFKFSRAS 

YT.T.QTiTiRPDTYTnT.PT WKUCZT PT UT DXTUVCT?tdm\ r>vn\rr\y\ a mmnn 
x XjAjo uj-irtirs^ in Jl^Ljii. r^xVrlVjilj KJ_iri Lj KN ri lor 1 JfcrM \ KJ\j<. VQAAJCVPRC 

LLLGTDMA\ENQKQIAQFSQKDAQVFPKYEEFMHRLALAIDPLLDAA 
PVDMAAFQHGSLLQKRRALSTLKPLLKAGRILGAQLPRYYEVLTASI 
TKVLDOWFES 

478 

A 

161 

612 

GRRPSPQETPRPTSLSGAPPTPRHSRCPPNHTVSSASLSLPSRHRLF 
LTYRHCRNFSILLEPSGCSKDTFLLLAIKSQPGHVERRAAIRSTWGR 
VGG WAMGRH L KTD / G S S YG WKD A / P TPTDT ,MAYP «5 P v pnn t t ,own t?t 
EDFFNLTLKEL 

479 

A 

15 

405 

GGTGAMAPW/AGGEHSRMNPLRAV\WLTLTAAFLLTLLLQLLPPGLL 
PGCAIFQDLIRYGKTKCGEPSRPVACRAFDVPKRYFSHL\YTISFLW 
NDISAFW/LFLFFLFLTPLFSYLRFLFHSHLASCFHFLV 

480 

C 

80 

370 

MWAPREQLLGWAAEALPAKDSAWPWEEKPRYLGPVTFEDGAVLFTEA 

EWKRLSLEQRNLYQEAMLENLRNLGSLXXXXXXXXXXXL^ 

WL* 

4 81 

j± 

T 

X 


MTnPnMT MrtT.PDTTTMDC\7TDPT WPnr\A X /"« tit I'll tt*t~>t~\i t tv i tt nmi i« wt.i 

n J. JJr yniaJNyxjuK x JL XJMFo V ±x j CJjKY CGDQIGP VTTFEDVA VLFTEAEW 
KPXSLEQRl^YKEVMLENLRl^VSLESKPEVHTCPSCPLAFGSQQFL 
SQDELHNHPIPGFHAGNQLHPGNPCPEDQPQSQHPSDKNHRGAEAED 
QRVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEIQLSP 
>vijXN/\i>i>£.c. vUKl oiVK/vc< I F\3b UAVKr OliCAx^FNQKSNIjFRQKAVTA 
EKSSDKRQSQVCRECGRGFSRKSQLIIHQRTHTGEKPYVCGECGRGF 
IVESVLRNHLSTHSGEKPYVCSHCGRGFSCKPYLIRHQRTHTREKSF 
MCTVCGRGFREKSELIKHQRIHTGDKPYVCRD 

482 

A 

3 

503 

TGTPVSTISWCCYLVCLLLACVRAQMWAPREQLLGWTAEALPAKDSA 
WPWEEK^IYLGPVTFEDVAVLFTEAEWKRLSLEQRISTLYKEVMLENLR 
NI,VSLAESKPEVHTWPFCP\LGFGSQQFLSQDELHNHPIPGFHAGNQ 
LHPGNPCPEDQPQSQHPSDKNHRGAE 

483 

A 

138 

2125 

MRGAGPSPRQSPRTLRPDPGPAMSFFRRKVKGKEQEKTSDVKSIKAS 
ISVHSPQKSTKNHALLEAAGPSHVAINAISANMDSFSSSRTATLKKQ 
PSHMEAAHFGDLGRSCLDYQTQETKSSLSKTLEQVIiHDTIVIiPYFIQ 
FMELRRMEHLVKFWLEAESFHSTTWSRIRAHSLNTMKQSSLAEPVSP 
SKKHETTASFLTDSLDKRiEDSGSAQLFMTHSEGIDLlSTMRTNSTQlW 
LLLSQECDSAHSLRLEMARAGTHQVSMETQESSSTLTVASRNSPASP 
LKELSGKLMKS I EQDAVNTFTKY I S PDAAKP I P I TEAMRND I IARIC 
L»Jt»JJ\jy VL/FWLr VLiAybx. Vr oAMEQEHFSEFIjRSHHFCKYQIE VIjTSG 
TVYLADILFCESAIiFYFSEYMEKEDA\^ILQFWLAADNFQSQIJ\A^ 
GQYDGQEAQNDAMILYDKYFSLQATHPL.GFDDWRLEIESNICREGG 
PLPNCFTTPLRQAWTTMEKVFLPGFLSSNLYYKYLNDLIHSVRGDEF 
LGGNVS PTAPGS VGPPDESHPGSSDSS ASQSS VKKAS I KILKNFDEA 
IIVDAASLDPESLYQRTYAGKMTFGRVSDLGQFIRESEPEPDVRKSK 
GSMFSQAMKKV^QGITTDEAQEELAWKIJVKMIVSDIMQQAQYDQPLEK 
STKL 

484 

A 

1 

2580 

MKTEVGTGLGEDDMFPFRQVELWKMAVKGKEQEKTSDVKS I KAS I S V 
HSPQKSTKNHALLEAAGPSHVAINAISANMDSFSSSRTATLKKQPSH 
MEAAHFGDLGRSCLDYQTQETKSSLSKTLEQVLHDTIVLPYFIQFME 
LRRMEH LVKFWLEAES FHS TTWSRI RAHSLNTVKQS SUVEPVS PS KK 
HETTASFLTDSLDIOiLEDSGSAQLFMTOSEGIDLNNRTNSTQNHLLL 
SQECDSAHSLRLEMTU^GTHQVSI^TQESSSTLTVASRInTSPASPLKE 
LSGKLMKKSQGFRFPILQHTSCSYVNLCGHDFFFLGLQFLLSESRIL 
GAAVIAAVAALMLPGSRRRERNKTKQPRGFPPLSLIRIEQDAVNTFT 
KY I S PDAAKP I P I TEAMRND 1 1 AG P AGG Y ANRHR PPDG LAG S P AP VH 
EVGGTETSGVGS PVRLKAAR ICGEDGQVDPNCFVLAQS I VFSAMEQE 
HFSEFLRSIfflFCKYQIEVLTSGTVYLADILFCESALFYFSEYMEKED 
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AVW xxjv* WJ-iAAJJWFQSQliAAJ^v>y xDGQEAQNJJAMXIjYDKYF SLQAT 
HPLGFDDWRLEIESNICREGGPLPNCFTTPLRQAWTTMEKVTQNFK 
RIKLQEVLLVELIKYKVGKPSRIAQWKLKQSSKVIKSRGLESGLKTV 
NCKTASDRSFRRFPEGGIVIKGDDGSMRVTAPEDLPVGQDVESSVKK 
ASIKILKNFDEAIIVDAASLDPESLYQRTYAGKMTFGRVSDLGQFIR 

go ijri-j r j-> v A.Aox\\joi Jr oyMP'JJVtVrv v yui' xxJX^>\v^£«xitXx^VVVlYX^uNJ v lx. V jUI 

MQQAQYDQPLEKSTKDSKRLTHCTSAQLASASGSHSHQQDCPVGEHT 
PLTTWRQRFLNAHD 

485 


T 

X 

DO/ 

YMF.KEDAVN I LOFW"LAADMFO cjfjTj AA KKGOYDfinP TxCnvinJiM T T .YDMft 
■X I'JEi ivdi/nvin x u\£E r» iin/U/n r uoyUn/uuvO^ x WOyx^/*^i>|X//*I T lXXj X 

AQVSAPRLRGPGARQPJVAGLGQPQPRARADSHAERPERKLGKLPRGA 
RVSEARPGPCRSWPGLPPAAGCLAXRPRRCRARGAVTVTVRAQSRTR 
ALRQYFSLQATHPLGFDDWRLEIESNICREGGPLPNCFTTPLRQAW 
TTMEKVTONFKR I KLQE VLL VEL I KYKELPNS VFLVME I LA 

486 

A 

444 

843 

EKSVYSLIFPCVFFLPSTWEKEDAV\NILQFWIjAADNFQSQIjA^ 

QYDGQEAQNDAWILYDKYFSLQATHPLGFDRCCTIKKIESNICREGG 

PLPNCFTTPLRQAWTTMEKVTQNFKRIKLQEVLLVELNKI 

487 

A 

77 

199 

SKSRPIASNKIEAIIKSLPAKKP\GPDGFATEFYQTFKEEL 

488 

A 

29 

380 

SETPQAPTLPTQGSLLEGEERMEDIRTLTVALFTLICCPGSDEKVFE 
VHVRPxOOiAVEPKGSLEVNCSTTCNQPEVGGL\ETSL\DKVFAGTEQ 
AQ W \ KHLLG S SNLS P \ DTVL \ HCHFH 

489 

A 

27 

406 

HAHLVTl^PISCPGSQSYPSWSPQGTLTVALFTLICWPRSDEKVFEV 
HVP.PKKLABEPKGSLEVNCSTTCHHPEVGGLETSLDKILI^ 

tj VT .D ^ WMYTTDTVT .Dr*W PTf^HKHP 2\ MM T Ml/ Q rz 

Xi X J_»ivOxVl J X rlXJ X Vljyvnr X w O V7i\^I!irU , U.i X JN V i_>kj 

490 

A 

191 

1074 

RLPEMS S FGYRTLTVALFTL I \ CCPG \ SDEKVFEVHGEAKGS WAVEP 
KGSL\EWCSTTC\NQP*SGVGLGDLS*D*GFCWDEQVQI./WKHYIiG 

CNPLLGGFLGKS FHHLKLQGVPTVEPLGQP * PS FLVS VGNETL\ HYE 
TFGKAAP APQEATATFNS TADREDGHRNFS CL AVLDLMS RGGN I FHK 
HSAPKMLEIYEPVSDSQMVIIVTWSVXLSLFVTSVI.LCFIFGQHL.R 
OORMGTYGVRAAWRRLPOAFR P 

491 

A 

3 

405 

SWSGSAAAIjPJ^RPVRLMKVFVTRRIPAEGANLKVIS 1T4SVGIDH 
DEI KKRG IR VG YTPD VL / TR YHRRTR S L PATYHLPPVAGGHRG <5 EEW 
W\WTSWKPLWLCGYGLTQSxVGIIGLGRlGQAIARRLKPFG 

492 

A 

2 

465 

DAAGANL KVI STMS VG IDHLALDE INKRG I RVG YTPD VL / TRYHRRT 
RSLPATYHLPPVAGGHRGSEEWW\WTSWKPLWLCGYGLTQSTVGIIG 
LGRIGEAPTGPlxARPGSHSVVCIPGTTCLKAEKTHMLSGHFACSRDI 
SNEG YS FVKHRQIHK 

493 

A 

217 

428 

TL VNMVEAD VTTR CL I RS G I R VG YT PD VL / TR YHRRTRS LP ATYHLP 
PVAGGHRGSEEVSERRPGAVAHGC 


TV 


1 ACQ 

KtoO Jr/Vj I r r KwK V/UjAK/U\ULb V byWUbUbfl r AJ\£>Lc.K\3 V AuAHvi 
LLCLLSDHVDKR I LD AAG A1ST0KVI S TO S VG I 

YTPDVLTRYHRRTRSLPCSY\HIxA\PLAGGL\QGSEEWW\WTSW\KP 
LWLCGYGLTQSTVGIIGLGRIGPGPLLGVLKPFGVQRFL\YTGRQPR 
S \ EEAA\ EFQAEFVS TPE\ LAAQSDFI WACSLTPATEGLCNKDFFQ 

KT^KRTAVFTKTT^RrjnWNOnnT.V YDZlT.Z\Qf:K'T \ aa7ir5T.'n\7TQPTrT>TA 

JXl'l 1\-Ej X r\ V I AH V VIVV^LALSXJ \ X l^-fn LtM-v? V7JVX \ M_rt>\Lj LjL- ' v X o ir dir Xj \ 

P 1TTOPLLTLKNCVI LPHIGSA TORTRNTMSLlxAAl^ 
SELKL 

495 

/ 

A 

3485 

4056 

DLPSRHRVMEAICHVCCVRRGCCV/CAC/ICVCHMCSVWCWO/LCV 
PQVLCWHALCV\ CVCGLQCARYVWCMCCVCS VLCMRCV /CVCVPRV 
LC^XCVCCGCSRCSVWCVCCTCRV/CLCV/C^CAWCVCCMC/LCVL 
CV/CACACVCCVCRVqSWC^CV/CVCCACSWCVLWCVC/V 
VCYVWCMCCVCCVCLE 

496 

A 

1 

858 

G ATLGGMSEHVE P AAPG PG PNGGGGGP AP ARG PRTPNLNTPNPLINVR 
DRLFHALFFKMAVTYSRLFPPAFRRLFEFFVLLKRVLERRPQLTVGV 
EELRLLVVLPMAVRPQVAHCTPGIGAPA5RSRVTEGPCAPGSLACDF 
ESDQFFPESLGFHVAVGAPLAQLGPQFCGSGQQVDVTLTRLGGDAWT 
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SGWEALFVLFVIAYIHIVFSRSPINCLEHVRDKWPREGILRVEVRHN 

SSRAPVFLQFCDSGGRGSFPGLAVEPGSNLDMEDEEEEELTMEMFGN 
SSIK 

497 

A 

106 

544 

GATLGGMSEHVEPAA\PGPGPNGGGGGPAP\ARGPRHPQFSTPTPLI 
NVRDRLFHA\LFFKMAV\TY\SRLFPPAFR\RLFEFFVLLKALFVLF 
VLAYIHIVFSR\SPIN\CLEHVR\DKWPREGNPALWKVRHNSSRAPV 
FLQ\ FCD\ SGGR 

4 98 

A 

309 

587 

NKNLE I S FFFETESRCCQPRIA ECNGAI S VHCNLCLPGS SNSPTS /A 

SQVAEMTGVCHHAQLIFVFLVAMVFHLH/VGQGGLELLTSGGPPASA 
SQ 

499 

A 

3 

451 

QTQREPT^^^^JSPADKTNVKAAWGKVGAHAGEYGAEALERMFIJSFPTT 
KTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHYDDMPNALSALSDL 
HAHKLRVDPVNFKLLSHCIiLVTLAAHLPAEFTPAVHASLDKFIjASVS 
TVLTSKYR 

500 

A 

2 

493 

GLEFGTSHRLRENPPMVAVSCPTKTNVKGPPGGKVGAHAG\EYG\SE 
ALERMFLSFPT\TKTYFPHFDL\SHGF\AQVKGATAKKVA\DALTKA 
VANV\DDMP\NGAVRPEATLI1AHKLRVEPVNFK\LLKPLACLVDPGP 
AHLPGRVSPLAVARI.PWNKVSWAFCGQI 

501 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKVIiTSLGDAIKHLDDLKGTFAQIiSEIiHCDKL 
HVDPENFKLLGim.VTVLAIHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

502 

A 

38 

557 

APSPDA\MGHFTEEDKATITSLWGKVNVE\DAGGETLGRLLWYPWT 
QR F FD \ S FGNL S S AFCH P WATP KVKAHGKKVL/TS \ LGDA I KHL \ DDL 
KG\TFAQA*SETALVTKLHVGS*RTFKLPGEMLLGDPFLGNPIFGKR 
ISPLEVGRLSWAERWVTWSWPVALVLPRLPLKLNCP 

503 

A 

318 

449 

ERKKKKMATVQKGM/ PHKYYHGKTGS YNVIQHAVGNAVTNRTRGS 

504 

A 

1 

468 

DHSSSPAREQNWMEN\EFDELTEVGFRRWVITNSSELKKHVLTQCKE 
AKNLSKRLGELLTRITSLEKNINDLMELKNTARELHEAYTSINSWIN 
QAEERLSEIEDQLNEIKREDKITEKMKNSTTVRVAASMQSKLLQGVA 
EEGPLRLTRSASFSA 

505 

A 

1 

1140 

MAKGQGESWLLDQEALAE I E KQQQKYKS LE ARAVS PVLDAGWTLPSG 
SLQSAEESAERELNIDHGAARPSLLHDQADHQAFGAPIFLDQQSESP 
KP S YAEVLR I S E KKAAARQGLADKI PTFLGQS TWGKVAL KRAADLPV 
QRSNSDTGQTASSSGSLTPVYPDWETPPSRGRPCIYHLTQESSGWHL 
AGAPFGQSFQRKEQAAIFAVMQPLLVIPRQTGSGVDLQQTPVDLKQR 
ERNNININKKDVLSETSSEDHRLQRSKVDKSTKMGRNQCKKAENSKN 
QNAYSPTKDHNFLPAREQNWTESEFDELTEVGFRRWVITNSSELKEH 
VLTQC KEAKNL E KRLDELL IRITSSEKN INDLMEVKNTAQELREA YT 
GINS 

506 

A 

1 

2640 

TLQSKSHYMGQARQASGASVWRAGCGHGQQQVLEPKWDERGSSIEEE 
HEQEHKQARSGMVNRYHDGPSSTPFSGIYSQGPVNVISGADRHIilQK 
SSSWHLLGALLGQSFQRKEQAAIFAVLQPLLVIPRQTGSGVDLQQTP 
7U5LQKRGLTVRGKTNKQKGIVATSAKRTSTQRPHPKDHNSLPAREQN 
WIENEFDERTEVGFRRWITNSSKLKEHVLTQCKEANNAWLLVEFVQ 
WDLLARDGGWKEKLGYLFLLFYPLWCHSSGSDCVPVQVYLINHTGIP 
FSAHCQEDRHRAFLLKDSDPSALIRSRQLQVFVKTLDIRQGSQLRNV 
CGQCPSQEHSSESWGSVMALTCPSQRACRGARVTGSKVPGPGWEELP 
LEVAMAPQNIHASGRGCGMTRTGETRPCFSPAILVLRPLHTQSTACS 
RRLFSQKNFWEHTEVRISLDPGGTSSYEEKLISNFSKVSGYKINVQK 
SQAFLYTNNGQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKEN 
YKPLLKEIKEDTNKWKNIPCSWVGGINIVKMAILPKVIYRFNAIPIK 
LPMPFFTELEKTTLKFIWNQKRARIAKSILSQKTKAGGITLPNFKLY 
YKATVTKTAWYWYQNRDIDQWNRTEPSEITPHSYNYLIFDKPEEKKQ 
WGKDSLFNKWCWENWRAICRKiKLDPFLTPYTKINLRWIKDLNVRPK 
TIKTLEENLGITIQDIGMGKDFMSKTPKAMATKDKIDKWDLIKLKSL 
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IKLKSTAKETTIRVNRQPKKWEKIFPTYSSDKGLISRIYNELKQIYK 
KKTNNPIKKWAKDMNRRFSKEDIYAAKKHMKKCSPSLAIREMQIKTT 
VRYHLTPVRMVI I KKSGNNRCWGGCGE IGTLLHC 

507 

A 

1 

714 

MD KFLDNYTLRRLNQEEVKS LNR P I TS S E I EAV INS L PTKKS PEPER 
FTAEFFQKYKEELSTIRLVPHFMFRTSPYAPPILSQRLPAKFSTWAL 
DSNRPAVSLERGLPVTISLTEAQLMNPIKGKLLLLNQSMESLPSPVG 
YAI QDHNS S P ARKQNCMENEFDELTEVGFRRWV I TNS SELKEHVLTQ 
CKEAENFEKRLHELLTRITSLEKNINDLMELKNTARELREAYTSINS 
QTD 

508 

B 

1 

546 

MKQPNRKRKLNMDSKERLDQDGRLVKMSFFVFCCVVSSEESEHEETQ 
DLMCS ELFWVLLHM WS LDQQHQHHLEDHN S S PARGQHWMENE SDELT 
ETVGFRRWVI TNS SELKEHALTQCKEAKNLEKRLDKLLTR 1 1 SLEKNI 
NELMELKSTARELREAYTSINSQIDQVEKGYQRLKINLMK* 

509 

A 

1 

732 

MVIAEQWFGYISKSEISSWFGAIAGRHLPVGADRHLIQESTGWHLA 
GAPLGWSFQRKEQTAIFAVLQPLLVIPRQTGYGVDLQQMPADLQQRF 
LTVRRKTNKQKG I VSTSTKRTS SEGHQHQRRKVDKFTEMGRNQHKKA 
ENSKNQNASSPSKDHNSSPAREQNWMENEFDELTEVGFRRSVITNSS 
KLKEHVLTQCKEARKLEKRLDELLTRKTNLEKNINDLMELKNTAQEL 
GEAYTSINS 

510 

A 

2 

1601 

SITHNSQKVETTKCPRMDEWINKRQYIHTMEYYSALKRKESLLHADS 
ENIMLSERSQTQKDKSAIAKIYKKKVNSAKEVLNKEPKRRSEGSRSN 
PLLQKLK1NHKQKANGTWNTSKSEPFTPLERGLKPGSQMWLVGSHP 
HRAQQTKIHWLEILAASTAAVWDRPGVLELEHLGEGAAVRRSFSRIiK 
RPCLTALKRAVDLPAQRSSSAKGQTASSSGSLIPVYPDWETPPSHKH 
RRPKVDKSTKMGRNQRKKAENS KNKNAS FPPKDLNS S PAREQNWMEN 
EFDELTE VG FKMWV I TNS S ELKEHVLTQC KEAKNLEKRLDELLTR IN 
SLEKWINDLMELKNTAWELREVYTTRHHTYSKIDNIIGSKTLLSKCK 
RTE I KTNSLPDHS ASKLELR I KKLTKNCTTTQKLNNLLLNDCWVNND 
MKAE I KMFFETNENKDTTYHNLWDTFKAVCRGKF I ALNAQKRKQERS 
KTDTLTS KLKELEKQQQTNS KASRRQE I TKI RTE PKE I ETQKTLQKI 
NESRSWIFEKINKIDH 

511 

A 

1374 

2220 

QAENSHININKKDVHSETPSEGHQRKRPKDHSSSPAREQNWMEN\EF 
DELTEVGFRRWITNSSELKKHVLTQCKEAKNLEKRLGELLTRITSL 
EKNINDLMELKNTARELHEAYTSINSWINQAEERLSEIEDQLNEIKR 
EDKITEKMKNSTT\TLVSTWGSQLEQNNLPHIISPLFHSTHDLNISFY 
GGCLLQACS SKLLQGLAEGGPLRLTRS AS FFAAKLCLTKHFLPTEM I 
LIKSRKTTPDTQPRRLYLLQGWDQTSRKGPTALNIRQKTFAAAPPH 
R 

512 

B 

1 

765 

MLAGQQREFPVGVGSADPALGAAGWPQRPGQARDLQLAMPEPPIPSM 
GSCAAGASPTSTAPCSKAPSPIDHPRAAECRRRARDWQAAPPAAPVR 
DPLGEASRAPESVKDVCIYDIMFDGGAGRFAAATLGTISELPAFPEE 
RKSWKPKRQAPGALV * 

513 

A 

594 

1300 

LGASEPWQPRSQETPKHSWPSPH/PANQSKEVPENPNYALKCTLVGH 
TEAVSS VKFS PN/ CENGLAS SSADRLI I IWGAYDGKYEKTLYGHNLE 
I SDVAWS SDS SRLVSASDDKTLKLWDVRSGKCLKTLKGHSNYVFCCN 
FNPPSNLIISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSG 
SLIVSGSYDGLCRIWDAASGQCLKTLVDDDNPPVSFVKFSPNGKYIL 
TAT 

514 

A 

188 

480 

QRGVIiLKTANVLVPEDLGREMRIFFSVKTWKTFTCIFTKGTNJHLFIi 
FYFFFETASLSEAQAGAQ\LGSLQPPPPGGSSDSPASASPIAGITGA 
CHH 

515 

A 

355 

612 

S FS I SGS FS AS I ALVCQLLLAVKGS VFEVGG I S YCPLNVFLLD\ LQV 
APEEDERKKRRRERNKIAAAKCRNKKKEKTECLQKVSAF \ 

516 

A 

3 

853 

SQARTAQLSSLAAARAHPSARAPAVSPRCSPAGQTNSPPDPVPTIiAA 
PSGAWSKMMLQHPGQVSASEVSASAIVPCLSPPGSLVFEDFANLTPF 
VKEELRFAIQNKHLCHRMSSALESVTVSDRPLGVSITKAEVAPEEDE 
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RKK\ RRRERNKI AA\ AKCRNKKKEKTECLQLQY * QSHRPPLWHHQGF 

SEEEGLHFPKPSAALPSPIFLSQLDEPRCV\QESEKLESVNAELKAQ 

IEELKNEKQHLIYMLNL\HRPTCIVRAQNGRTPEDERNLFIQQIKEG 
TLQS 

517 

A 

226 

569 

TGMDYLQVGVTSQKTMKIA I PASRHRATQKWIGDHDGWMCFGMKK 
GEAAAVFKTLPGPKIARLELGGVINTPQEKIFIAAASEIRGFTKK\G 
KQFLS FETNLTES I KAM YVKC 

518 

C 

512 

739 

MAEHHWAFP I PSXASXGFAXLQPTKATLKKXEGFKX FXKKKKKKKTP 
EGGAXGTX IXXPXGGAGXNKXPXGXXXG * 

519 

A 

1 

760 

WQGQKDRERERREEKKREGRETRGDTRGETRGETGEGEERTKRERGG 
ERDTREERDERERERERETRHERREREERERERERERERESEREREE 
RERQRERETETEREKKRERERERERERERERERERERERDRERERER 
ERERERERKREREQRERERERDRERERKREREREREKEREREREKER 
GREREREREERERERGRKRERGDRTEREKERGRERESTQAMFPGQPK 
TRALAIVFPANVRQINGD . 

520 

A 

669 

823 

MSRGWAASCRIRHEERERERERERERERERERERAEQCLLLLGALRP 
RASP 

521 

A 

3 

108 

VCREREPHQKVFNKMPK*SYRTVPVTPEGIVIiSIi 

522 

A 

383 

542 

LWS PPNSN I LDTTTAGETPEQDLNREREREKE / RERERERERERERE 
RERERERERES 

523 

A 

389 

549 

ERKRPSRKQ/RERNRKNQNERKSRGDWQRANVGAGQKEEMKCLAVEL 
EIGSQPRS 

524 

A 

1 

495 

EELVPIFLTLFHKIEKEGTLPNSFYEASITLIPKPGKDITKKENYRW 
ISLMNIDAKILNKILANQIQQITKKIIHHDQVGFIPGMQGWFNMHKS 
I^IHHVNRIKNKNHMIISIDAEKAFDKIQHPFMIKTLSK/MGIQGT 
YLS V I KA I YDKPTAN I KQTFQ I QD 

525 

A 

219 

471 

INFKKXNRLFNKWY\WTCVATLEKNKMNSFLMIYPKSNSK*FKNPNV 
RGKKL*NLFILLYFKF 

526 

A 

1 

1423 

MNPNQEEISDLPERIQETEKEGTLPNSFYEAS ITLI PKLGKDKTKKE 
NYRPISMMNIEANILNKILANQIQQHIKKIIHHDQVGFISQMQGWFN 
IHRS INVIHH INR I KNKNHM 1 1 S I DAEKAFDKIQHHFM I KTLS KIG I 
QGAYLNVIKAIYDKPTANIILNGKKVE\AFPLRTGTRQGYPLSQLFF 
NIALEVLAIUVIRQENEIKGIHIGKEKVKLSLFAGDMIFYLENPKDSS 
KKLLELIKELSKVSRWELNNENTWTQGGEHHTPGPVVGSGEERGIAIi 
GDTPNVNDKPLTGAALASTLGVWENENKFTHLFNIHSQFCLPSQGIF 
FLCGTS TYVCLTTNWTS TCTLI FLS PKID I APGNQTLPVPVRAQVHQ 
HRAVQIIPLLIGLGVTNATGTGIAGLSPSLSYYHTLSKDLSDSLQDI 
AKSTLTLQSQIDSLAAVTLQNHRGLRPPNRRKRWAMHLFRGRLFFHQ 
PVWD 

527 

A 

31 

332 

QRRSGCVLRMTEWETAAPAVAETPDIKLFGKWSTDDVQINDISLQAI 
WLLCTGAREAAFRNIKTIAECLADELINAAKGSSNSYAIKKKDELER 
VAKSNR 

528 

A 

27 

178 

QRRSCCVLRMTBWETAAPAVAETPDI KLFGKWSTDDVQINDI SLQVR 
GTW 

529 

B 

110 

719 

XGRQQHQRWHETPD I KLFGKWS TDDVQ IND I S LQD Y I AVKEKY AKYL 
PHSAGRYAAKRFRKAQCPIVERLTNSMMMHGRNNGQEAHDCAHRQAC 
LRDHNTCSQAKNLLEVRGTAIINSGPREDSTRIGRAGTVRRQAVDVS 
PLRRVNQAIWLLCTGAREAAFRNIKTIAECLADELINAAKGSSNSYA 
INKKDELERVAKSNR* 

530 

A 

443 

565 

QPGFGAGARAPNI LTHKHLG/ LLEFKTTGAQQQGF * LNE S S F 

531 

A 

82 

986 

TMNTRNRGVNSGLGASPASRPTRDPQDPSGRQGELSPVEDQREGLEA 
APKGPSRESWHAGQRRTSAYTLIAPNINRRNEIQRIAELELANLEK 
WKEQNRAKPVHLVPRRLGGSQSETEVRQKQQLQLMQSKYKQKLKREE 
SVRI KKEAEEAELQKMKAI QREKSNKLEEKKRLQENLRREAFREHQQ 
YKTAEFL/RQTEHRIARQKCLSKCCLWPTILNMGQKLGLQ\DSLKAE 
ENRKLQKMKDEQHQKSELLELKRQQQEQERAKIHQTEHRRVNNAFLD 
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RL/RRQKSTRWPRAIWRLLEYE 

532 

A 

122 

1342 

EAASKTTKDNICSWKDDTIGCKEMPTSENCPSFALHQKIS\DRNHGE 
CREYGKTLCQDSKPVQHERIHSSEKPNRCKECGKNFSNGHQLTIHQR 
LHVGEKPYKYEKCGKAFISGSAFVKHGRIHTGEKPLKCKQCGKTISG 
SYQLTVHKSIHTGKKPYECGECGKAFLVYGKLTRHQSTHTGEKPFGC 
EECGKAFSTFSYLVQHQRIHTSEKPYECKECGKAFSTSSPLAKHQRI 
HTGEKPYECKECGKSFTVYGQLTRHQSIHTGEKPFESIHAEIKPYRC 
KECGRTFSRASYLVQHGRLHTGEKPYECKECGKAFSTGSYLVQHQRI 
HTGEKPYECKECGKAFISRHQLTVHQRVHTGEKPYECKECGKAFRVH 
VHLTQHRKIHTDVKPYECKECGKTFSRASYL 

533 

A 

456 

824 

DRVSLSPRLECSGVIL\AQCNLHLP/GFNRFS\TLGLPSRLGITGHA 
PLCLGQFFVFLVEDGGFHHVG\QAGLELLTSSDPPTSAVPKCRDYRH 
EPLRLACCS FLKQENIHINCWVR I Y IR I FQTN 

534 

A 

1 

2278 

MDSSIHLSSLISRHDDEATRTSTSEGLEEGEVEGETLLIVESEDQAS 
VDLSHDQSGDSLNSDEGDVSWMEEQLSYFCDKCQKWIPASQLREQLS 
YLKGDNFFRFTCSDCSADGKEQYERLKLTWQQWMLAMYNLSLEGSG 
RQGYFRWKEDICAFIEKHWTFLLGNRKKTSTWWSTVAGCLSVGSPMY 
FRSGAQEFGEPGWWKLVHNKPPTMKPEGEKLSASTLKIKASKPTLDP 
1 1 TVEGLRKRASRNP VES AMELKEKRS RTQEAKD I RRAQKEAAGFLD 
RSTSSTPVKFISRGRRPDVILEKGEVIDFSSLSSSDRTPLTSPSPSP 
SLDFSAPGTPASHSATPSLLSEADLIPDVMPPQALFHDDDEMEGDGV 
IDPGMEYVPPPAGSVASGPWGVRKKVRGPEQIKQEVESEEEKPDRM 
D I DSEDTDSNTS LQTRAREKRKPQLEKDTKPKEPR YTP VS I YEEKLL 
LKRLEACPGAVAMTPEARRLKRKLIVRQAKRDRGLPLFDLDQWNAA 
LLLVDGIYGAKEGGISRLPAGQATYRTTCQDFRILDRYQTSLPSRKG 
FRHQTTKFLYRLVGSEDMAVDQSIVSPYTSRILKPYIRSDPHWTPEP 
DAPLDYCYVRPNHIPTINSMCQEFFWPGIDLSECLQYPDFSVWLYK 
KVIIAFGFMVPDVKYNEAYISFLFVHPEWRRAGIATFMIYHLIQTCM 
GKDVTLHVSASNPAMLLYQKFGFKTEEYVLDFYDKYYPLESTECKHA 
FFLRLRR 

535 

A 

750 

1053 

PSTLSSLI\SRHDDEATRTST\SEGLEEGEVE\GETLLIVES\EDQA 
S \ VDL\ SHD \ QSGG I S LNSDEGD VS WMEEQLS \ YFCDKCQKW I PASK 
ELLKSFDLSIPV 

536 

A 

426 

634 

VFCYSRTNWNKTEINRKKFGKLPSIWKLSSTLLNNQ/L/VSEENKRE 
I KKYPDINENEDTTYQNIiWNELKQC 

537 

A 

1190 

2097 

SSCLKILNIITL7VKSLLL/CLDISVCYYLLDTGICYCLL.I)ISVCYCL 
LDISICYCLLDTSVCYCLLDISVCYCLLNISICYYLLDTGICYCLIiD 
ISVCYCLLDISVCYCLLDISVCYCLLDISVCYYLLDISVCYCLLDIS 
VCCCLLD I SVCYCLLD I SVCYYLLD I S ICYCLLD I SVCYCLLD I SVC 
YCLLD I S I C YCLLD I S VC Y CLLD I S VCNCLLD I S FCYYLLD I S VCYC 
LLD I SVCYCLLG I SVCYYLLD I S VC YCLLDI SVCYCLLD I S VC YYLL 
Uri z> V Y V- LtLiD 1 b ± LLCLLD I SH 

538 

A 

163 

374 

I RQRKAKL WLAFNRRGLRKPD/ 1 F YYAGVGKI VKKKYS GFFIELVT 
TCGKNYLMCALAVIDPRDSNI IRS 

539 

A 

995 

1194 

1 1 LGNLTACS F YTSSLNAVWL I KSAMLQL YF I F\ FFGDRPSLCHPGW 
S A WQS LLT ATS TS W AQ AE 

540 

A 

77 

199 

SKSRPIASNKIEAIIKSLPAKKP\GPDGFATEFYQTFKEEL 

541 

A 

212 

322 

DAFFRTPRVPSVTVPVQCVFGARFKNAPPPFSTPQF 

542 

A 

1 

518 

>LAVARLAAVAAWVPCRSWGWAAVPFGPHRGLSVLLARIPQRAPRWLP 
jjJjFN LiA i KKLi ALr Kb PL> 1 1 F I PG YLS YMNGTKALAIEEFCKSLGHACI 
R FD Y SG VG S S DGN S EES TLGKWR K\ NVLS I IDDLVDGPQ I LVGS SLG 
GWLMLHAAIARPEKVVALIGVATRCRYLSDKV 

543 

A 

3 

606 

VMSWAFTFIHHSLTDSPRATSSPPSPIHEGDQDIIVLKFLKVLGGKN 
SGAWGRNGRPQGRRRGAR\FLRLNVGGCIYTARRESLCRFKDSMLAS 
MFSGRFPLKTDESGACVIDRDGRLFKYLLDYLHGEVQIPTDEQTRIA 
LQEEADYFGIPYPYSLSDHLANEMETY/ CFKALTDFCDS YGLVCNKP 
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TVWVLHYLNTSGAS 

544 

A 

3 

533 

AEGDQDI IVLKFLKVLGGKNSGAWGRNGRPQGRRRGARX FLRLNVGG 
CIYTARRESLCRFKDYNVASMFSGRFPLQTDESGACVIDRDGRLFKY 
LLHYPSWRSSEFPQMRDTRGRPTGKRADYFGIPYPYSLSDHIjANEME 
TYSLRSNIELKKALTDFCDSYGLVCNKPTVWVLHYP 

545 

A 

1492 

1672 

KQWHESHPDYDRSPSQTGEGDEPCSWHRLFAQGDK*YEATACHRNLD 
WQKATDISCNTNG 

546 

A 

138 

257 

IFLLYLKNKVQNK*EIKQHFLGKIMSRDNNTYLTWGTEN 

547 

A 

5 

960 

PTFSRAVATMFSRAGVAGLSAWTLQPQWIQVRNMATLKDITRRLKSI 
KNIQKITKSMKMVATUUCYARAERELKPARIYGLGSLALYEKADIKGP 
EDKKKHLLIGVSSDRGLCGAIHSSIAKQMKSEVATLTAAGKEVMLVG 
IGDKIRGILYRTHSDQFLVAFKEVGRKPPTFGDASVIALELLNSGYE 
FDEGSIIFNKFRSVISYKTEEKP\IFSLNT\VASC*QHGVSYDDIDA 
DVAAKITQEY\NLANIIY\YSLKESTTSEQSA\RMTAMDNASKNASE 
MIDK\LTLTFNRTRQAVITKELIEIISGAASSVKKENSAS | 

548 

A 

1326 

1780 

YLSKLPYFELESIAAARSRPTAQRHPRERKLKPARIYGLGSLALYEK 
ADIKGPEDKKKHLLIGVSSDRGLCGAIHSSIAKQMKSEVATLTAAGK 
EVMLVGIGDKIRGILYRTHSDQFLVAFKEVGRKPPTFGDAS\VIALE 
LLNSGYEFDEG 

54 9 

A 

20 

403 

KVQGPHASAHASGEGSLTIIPVTRSPLATMNHIVQTFSPVNSGQPPN 
YEMLKEEQEGAMLGAPHNPAPPMSTVIHIRSETSVPDHWWSLFNTIj 
FMNTGCLGL I TFVYS VKA\ RDRKMGRRTR PGAQG 

550 

A 

2 

431 

DAWAMNHTVQTFFS PVNSGQPPNYEM\ LKEEHRVGCAG AAPTTLFPP 
TSTLIHNRQRDLPCPTHWWSLFNT\LFMNP\CCLGFQ*EFAYSV\K 
SMDRKMVGDVTGA\QAYASTAKCLNIW\ALILG\lLMTILLIVIPVIi 
IFQAYG 

551 

A 

137 

838 

SSGICSALIWPWPALHIjTNARKRKLLRKRITYWRKGEGSIjRTIPVTRS 
PLVTMNHIVQTFSPVNSGXQAXPhTyEMLKEEQEVAMLGAPQKPCLPE 
CPPVIH\IRSETSV\PDHV\VWSLFNTL\FMNTC/CRLGFIAFA\YS 
SEVSGTRKMVGEVTRAQGLLPPPPKCLNIWA\LILG\IFMNQFLLIN 
HPPPSVGSSRPSDRSGRHHLRPRELCPVE PVSQVKNSIFHSLALPPE 
ARKFCP 

552 

A 

1 

528 

MTTDNXjSDTLKXI>KITAVDKTEDSLEGCLDCLLQALAQNILFTSDFIj 
ISEFKMGRKAAETTSNINNALGPGTANEHTVQWWLKKFCKGDESLDE 
EDCDVQQEVDFIWQLKSGFYTTGDDQFSDWTKKLQSTSQSQTCTKKR 
SWSLSGGLLLIHYSFLNPGETIKSEKYAQQIDEMH 

553 

A 

1 

1682 

MDNLSDTLKKLKITAVDKTEDSLEGCLDCLLQAIiAQNNTETSEKIQA 
SG I LQL F AS LLTPQS S CKAKVAN 1 1 AEVAKNEFMR I PC VDAGL IS PL 
VQLLNSKDQEVLLQTGRAIiGNICYDSHSLQAQLINMGVIPTLVKLLG 
IHCQNAAIiTEMCLVAFGNIiAELESSKEQFASTNIAEELVKLFKKQIE 
HDKREM I FEVLAPLAENDAI KLQLVEAGL VECLLE I VQQKVD S \ E KE 
GGITELKTGSDLMVVLLLGDESMQKIiFEGGKGSVFQRVLSWIPSNNH 
QLQLAGALAI ANFARNDANC I \ HMVDNG I VEKLMDLLGRHVEDGNVT 
VQHAALSALRNLAIPVINKAKMLSAGVTEAVLKFLKSEMPPVQFKLL 
GTLRML IDAQAE AAE P I GERMLKIjV\ ERLVGWCEAKDHAG VMG \ EQN 
RLLSALIRHSKSKDVIKTIVQSGGIKHLVTMATSEHVIMQNEALVAL 
AlilAALELGTAEKDLESAKLVQILHRLLADERSAPEIKYNSMVLICA 
LMGSECLHKEVQDLAFLDVVSKLRSHENTKSVRQQASLTEQRLTVES 

554 

A 

2«* - 

911 

GRPPRVPPWLRERRRLEGGHRAAPPAPYDLRLAPPVPHPRGATWQII 
LSDTLKKLKITAVDKTEDS\LEGCLD\CLLQA\LAQNNTANKCKKSK 

qvailqlfaslltpqssckakvaniiaevak\nefmrip\cvdagli 
splvqllnskdqkvllqtgralgnicydshegrsavdq\eggaqivi 
dhlrslcsitdpaXnekxltvfcgmlmnysnendslqaqlinmgvip 
tlvkllgm\hcqnaaltemclvafgnlaeldaiklqlveaglvecll 

EIVQQKVDSDKEDDITELKTGSDLMV 

555 

A 

1062 

1421 

drvslcc/prlecsgmisahcnlpspgfkqfsclslpngwayrsvpp 


^00 
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CPANFCVF/SVETGFHHVGQAGLELLTSGDLPALASEKCWDYRREHT 
ARGP VTKPS YFQVLFFRLHP I F INHLNT 

556 


A X<? 


GGKSLARRVPKGRMDGLLNPRESSKFIAENSRDVFIDSGGARRVAEL 

LIJUCAAGPELRVEGWKALHELNPRAADEAAVNWV^ 

EQDEHKW 

557 



ft O Q 

FLFYLFIFFETGSHSV\TRLECGDTIMARCSLNFPGSSDPPASASWV 
AGTAGVLYHTWLSFAFFRRDGVSRVARAGLQLLAW 

558 

A 

442 

626 

VEEGGSEAGWALRIKCINGVSPKL/VLQLLHFPQIFNGSFVKLNKAS 
VNTLRIVEPYITWST 

559 

A 

3 

398 

GGIRRKLLTFLLVAGCWTPEAS PI IMATSATS PHAPGFPAEGRCGYY 
VEKKKRFCRMVVAAGKRFCGEHAGAMEVWYRPTLSRVGNKYPGPVGA 
GTRPLPSLTAFCLCPLDSPAFTLINPVLPSLRSDYEF 

c; <c n 

A. 

1 

960 

IMATSATSPHAPGFPAEGRCGYYVEKKKRFCRMWAAGKRFCGEHAG 
AMEEEDARKRILCPLDPKHTVYEDQIAKHLKKCNSREKPKPDFYIQD 
INAGLRGETEIPEQL/CLNSTLTDHIMSHPALHDALSDPINGDSATK 
HLKQQASI \ LGNIE^KLLGPRRCFVEFGAGKGKLSHWVDIALKDAE 
KVHFILVEKVTTRFKVPGKHRKKNSVFERLQIDIQHLCLNKIPVLRE 
EKIJPVVGIGKHLCG^1ATDIJ^RCLVETYAASFEERNEEPLAKRIKND 
iv 1 £, Jvis ± X I iaAi^GNEKjJVPEKWNPVAGI VIALLCHHRCDW 

561 

A 

111 

X J» X 

OCA 

S LDSDSGCKAYVGDEY \ LPGP\ LDLHAQPS LLKEHEEEKMFTLKGNR 
LPAADVKNI I FFVRPRLELMDI I AENVLSEDRRG 

562 

A 

223 

748 

CIGKVYVWLGGVGWLQKSWGTPLFSPHPPGRPRSFAKGHSPPQGILG 
VGLGGENRILVI IFNLIYCTYLGQGLGEVEGGEGSPLSAPPTPFLRR 
FVCV\GPHPLPIPHCCLDWLFFYRSPFPSSPFSPPPHPLLPLPFVS 
CSFLGFCTTQLVYTVYTQPAKRKPNGKHFKKKKK 

563 


1 ~k ft 

O *7 

MQ1NSQMER* 

564 

A 

549 

848 

LFFPHSQSTKVFSTYEQIMCVCVFLFFCFFVFPETGSHSV\TRLGCS 

GVIMAHCSFNLPGPSDPSTSASQVAGTTGVCHHAQLIFKFFVEPGQH 
TEIPSL 

565 

A 

x / 0 

T7C 
J / Z> 

ANFIAGWSQSRFHALALHIIQQGSKKVP*ANFIAGVWSQSRFHAIiA 
LHIIQQGSKK^PIVSKLENDTAARHELFIAEWRKQCSEKKAQAKGWR 

566 

A 

656 

874 

VTHLSKKKFTHQPDAKCGNGWNNPTGSSTVKKMRRSLSTSSPIRSAH 
IVNSSGSRRARGILAKCILMSLVIW 

567 

A 

718 

971 

VPTNTPPAPEEEATGGRQVTAPCSYPEMTSAQPGTYWMCPQETPPGP 
HPPGRNPASQKEAPRCTSAWESCPPSGRPARQALYRA 

568 

A 
/A 

4 ft A 

0 83 

PHHYAVLLFYRLGLSRIAVSLALAICDWLNAIiYQRLHFAKRSGSTLC 
NNCLRCIADRHLHYVGNITV 

569 

A 

811 

1481 

GNG I LLMVHHHCDAGTV IMRADRF I PGEDQKNRVLVNTVIaVRI SRAV 
DGKHLS AG I RHTVFR I CHNACTQHRGL FRG FNVFGEFLR FA FY I L I V 
KENADQ C AG FVG HE LG E I DR VM Q AL PD VEQAVTHACV I NQAAATGG 
DARQLVGYLVSQSGFNINGNRHLTTTINPVGFPADK*GFPLMLPRVS 
GRNQHRIS / RVYLPCTATTIjLRPGNGPPPSSVRPRR 

570 

A 

961 

1627 

LPHPAVAVIQNVRSCRGFHGSPTTiWIPGNPRHSRTITLIKLLHGII 
VI IGRVKFQRHPGCGQAKFI FFTLRQNKPNWSGSTGRPKGVMVGQT 
AIVNRLLWMQNHYPLTGEDWAQKTPCSFDVSVWEFFWPFIAGAKLV 
^1AEPEAHRDPLAMQQFFAEYGVTTTHFVPSMLAAPVASIJTPQTARQS 
CATLKQVF\ADLCREWQQLTGAPLHNGLCVFLMR 

571 

A 

803 

1403 

FNRRILKINGJjTREOTSGVOOOTROAT,c;aT,PT.P\7>iT?T T?T?Tm*zr i T>x?T\w 
RKCQAWLKD I ESARLQHNQAYTEAMLTE YADFFRQVES S PLNPAQAR 
AVWGEHSLLVLAGAGSGKTSVLV7VRAGWLLARGEASPEQILLL\GL 
VAKPLKRWTSGFANGYIPKTLPHARFMR/*ALHIIQQGSKKVPIVSK 
LENDTACRHELFIA 

572 

A 

3 

1642 

SAKEPWILATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLG 
LRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTV 


0^0 1 
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RNRN I LPLLAFHLGPFRS LAD I PRNRVYED FDPHCRPM 1 1 TSLSNLP 
WHSINSCQQPLGVAITCTCSLNVHDSHLRYAASTCTCVLPMSGAQPL 
STRAVAQLYSPDADAKRAAVPARSTDKPENHAHPHSHFTHQYYKTQY 
HHLKKTFPWWLGDVKLT I PLTLGMVAAALTDLDDRLAGRLRNL 1 1 TL 
FCFFIASASVELLFPWPWLFAIGLTLSTSGFILLGALVQRYATIAFG 
ALLIAIYTMLGTSLYEHWYQQPMYLLAGAVWYNVLTLIGHLLFPVRP 
LQDNLARCYEQLARYLELKSRMFDPDIEDQSQAPLYDLT^LANGLLMA 
TLNQTKLSLLTRLRGDRGQRGTRRTLHYYFVAQDIHERASSSHIQYQ 
TLREHFRHSPVLFRFQRLMSMQGQACQQLSRCILLRQPYQHDSAFLS 
ALLRILMLRWSGCAITAHPPIYSKHWDNA 

573 

A 

114 

5270 

SARSASFRPGRPAARRPKCSHPLVPDGVPECCKCVHPASTNFPRRTS 

TAARHSHHNDGSTSTMGRTWQI/VGYLVSQSGLPLDTSALQAQLRET 

LPPHMVPEVLLQLPQLPLNANGKLDRKALPLPELKAQAPGRAPKAGS 

ETIIAAAFSSLLGCDVQDADADFFALGGHSLLAMKLAAQLSRQVARQ 

VTPGQVMVASTVAKLATI IDAEEDSTRRMGFETI LPLREGNGPTLFC 

FHPASGFAWQFSVLSRYLDPQWSIIGIQSPRPNGPMQTAANLDEVCE 

AHLATLLEQQPHGPYYLLGYSLGGTLAQGIAARLRARGEQVAFLGLL 

DTWPPETQNWQEKEANGLDPEVIAEINREREAFLAAQQGSTSTELFT 

TIEGNYADLSSARSWASTSFPLNHDFGTLAGVCIGLSHNRASDAETV 

VLENWRQVHEQCLALHSQQQTLQQQDVLAAQS LQKAQAQFDTALQAS 

VFDDQQAFIJVAIjMDEQTLTQLEQLKQNLENQRRQAQTLVTQTAETLA 

QHQQHRPDDGLALTVTVEQI QQELAQTHQKLRENTTSQGE IRQQLKQ 

DADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQGL 

TLDNLVHLANQQLTRLHGRYLLQRKASEALEVEWDTWQADANENQT 

HAQHEQAPTTGTTHGTTMTHNTEHTERLSPGSGHRRYRHRSTDRNRQ 

CTGTGNDTHFTR FVNVTRLDTDFAFTRSDNAWAVRADH TNACF I QLH 

FHGQH I QRRDAFSDGDNQFDAC VDGFQNGVFAERCWYSNRSGS ASGS 

NGFTHGVEHRQAEAAVSQFLRAEFGVVTFQTYHDRYVNTHFCYRADD 

TVSDQVATNDTAEDVNQNGFHFVVREDDFERFGYTFFGCAAANVKEV 

SRFAAVQFNDVHRTHRQTCAVNHTTDVTVQRNFPLCSLCFTRIFLRIi 

VTHFAQIRL7VEQCVAVSAQFAVEANQIALFGDDQRKRIKRNFMDKVR 

S FFRHF FD FHAAFGGGNKDHTTGTTVNNGTE I QLLS D I GRCFNQNLV 

NRLAIGICLIRYQTLAQPVFCKSTDVFFAVYNFHTASGLFQSSSAND | 

VRYTLTATEMVIAPAGFMDEAISENVLAGNIMSRRAIiYSYGLLLPHN 

AQGNVGNGLGVTIATGDPS 1 1 APTKTI VRTGEKMI IDGL.EFDFLMTP 

GSEAPAEMHFYIPALKALCTAENATHTLHNFYTLRGAKTRDTSKWTE 

YLNETLDMWGND AE VLFMPHTW P VWGNKH IND Y I GKYRDT I KY I HDQ 

TLHLANQVYTMNE IGDM I KLPPALANNWASRG YYGS VSHNARAVYNF 

YLGYYDVYPANLHPYGQVEMGKRYVQALGGSARVINLAQEANKQGDY 

RWSAELLKQVIAANPGDQVAKNLQANNFEQLGYQAESATWRGFYLTG 

AKELREG VHKFSHGTTG S PDT I RGMS VEML FDFMDRQACHQ YNRTRV 

VDQLIEFLQDQHVNAHLAEIVRHHQPNRAGTSDRHLERYEMSINMQN 

NEQTEYKTVRGLTRGLMLLNMI^KLDGGASVGLl^LSGLHRTTVRR 

LLETLQEEGYVRRSPSDDSFRLTIKVRQLSEGFRDEQWISALAAPLL 

GDLLREVVWPTDVSTLDVDAMVVRETTHRFSRXiS FHRAMVGRRLPLL 

KTASGLTWLAFCPEQDRKELIEMIiASRPGDDYQLAREPIjKLEAlIjAR 

ARKEGYGQNYRGWDPGGEDRLYRRTAAQ 

574 

A 

1 

2967 

MKQEVILVLDCGATNVRAIAVNRQGKIVARASTPNASDIAMENNNQA 
VFPGKLQIPRRQTMTEIQRLLTETIESLNTREKRDNKPRFSISFIRK 
nrsaijv ± ul\ ra v ii\j ±&xk>i Ki. UJM liPCa fc, Y AQIjAQIjNAT QVE I P ETTLS ALV 
AEQAAKTPDAPAIjADARYLFSYREMREQVVAIiANIiLRERGVKPGDSV 
AVALPRSVFLTLALHAlVEAGAAWLPLDTGYPDDRIiKMMLEDARPSL 
LITTDDQLPRFSDVPNLTSLCYNAPLTPQGSAPLQLSQPHHTAYIIF 
TSGSTGRPKGVMVGQTAIVNRLLWMQNHYPLTGEDWAQKTPCSFDV 
SVWEFFCPFIAGAKLVM71EPEAHRDPLAMQQFFAEYGVTTTHFVPSM 
LAAFVASLTPQTARQSCATLKQVFCSGEALPADLCREWQQLTGAPLH 
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NX» YG PTEAAVD VS WYP AFGEELAQVRG S S VP I G YP VWNTGLR I LDAM 
MHPVPPGVAGDLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYR 
TGDVARWLDNGAVEYLGRSDDQLKIRGQRIEI^GEIDRVMQALPDVEQ 
AVTHACVINQAAATGGDARQLVGYLVSQSGLPLDTSALQAQLRETLP 
PHMVPVVLLQLPQLPLSANGKLDRKALPLPELKAQAPGRAPKAGSET 
IIAAAFSSLLGCDVQDADADFFALGGHSLLAMKLAAQLSRQVARQVT 
PGQVMVASTVAQLATIIDAEEDSTRRMGFETILPLREGNGPTLFCFH 
PASGFAWQFSVLSRYLDPQWSIIGIQSPRPNGPMQTAANLDEVCEAH 
LATLLEQQPHGPYYLLGYSLGGTWRRVAFLGLLDTWPPETQNWQEKE 
ANGLDPEVLAEINREREAFLAAQQGSTSTELFTTIESECNVYSDLDL 
YLGVQRINWTEMFNPGATFAVTFSGLNDTIRNSQYGAMKVKDAIVDA 
FTRKNLPRPNVDRDAPDIRVNVWLHKETASIALDLSGDGLHLRGIAI 
VL 

575 

A 

1882 

3404 

LDAS FW I PPKATQGMTW IFTFPSSPS YNS S / C YAGWLKG S APLQLSQ 
PHHTAY 1 1 FTSGS TGRPKGVMVGQTAI VNRLLWMQNHYPLTGEDWA 
QKTPCSFDVSQFFAEYGVTTTHFVPSMLAAFVASLTPQTARQSCATL 
KQVFCSGEALPADLCRVWQQLTGAPLHNliYGPTEAAVDVSWYPAFGE 
ELAQVRG\SSVPIGYPGYLGRPDLT\ASRFIADPFA\ PGERMYRTGD 
VARWLDNG AVE YLGRSDDQLKI RGQR IELGE IDRVMQALPDVEQAVT 
HACVINQAAATGGDARQLVGYLVSQSGLPLDTSALQAQLRETLPPHM 
VPVVLLQLPQLPLSAKAGCPAAQVPRVAQVRVCAVLEDLGDIRESEG 
ERKQEADAVLGTQDTPRVTSQPNGLLLKIEPQENVKIEGQHQGLVWL 
CVAAS PVLVRVTR EKEE KKKE EGNNMR FTS L VL S VL, VN I VRALEKKV 
LD IMFNNCRTGVQAITLSHRRALAI AI ALAR Y I KSALYDG 

576 

A 

22 

413 

QVSAPHGAGASGVPPAGESVAPSGTSDGRGRWSGVEWVEEIKSLLLL 
FGGQFASSQETYGKSPFWILSIPSEDIARNLMKRTVCAKSIFELWGH 
GQS PEELYSSLKNYPVEKMVPFLHSDSTYKIKISHF 

577 

A 

1 

2148 

MAEEQVNRSAGLAPDCEASATAETTVSSVGTCEAAGKSPEPKDYDST 
CVFCR I AGRQDPGTELLHCEVGGDARPGNEDL ICFKDI KPAATHHYL 
VVPKXHIGNCRTLRKDQVELV^INIWTVGKTILERNNFTDFTNVRMGF 
HMPPFCSISHLHLHVLAPVDQLGFLSKLVYRVNSYWFITTNLQGNGE 
ISKEMVKFQSNFEEQVLKGCRESWKHTPSGTITWKYTVLLSQAAPES 
HASPLTSYQRWETPNHQAFTGEEAPPTAAGVSDSEREKRRDSACLAE 
LPLPALQALSLWRLGFGREKEGPSAGRAGARWAAAMALSCTLNRYLL 
LMAQEHLEFRLPEIKSLLLLFGGQFASSQETYGKVPFLHSDSTYKIK 
IHTFNKTLTQEEKIKRIDALEFLPFEGKVNLKKPQHVFSVLEDYGLD 
PNC I PENPHN I YFGRW I ADGQREL I E S YS VKKRHF I GNTSMDAGLS F 
I MANHG KVKEND I V FD P F VGTGGLL I ACAHFGA YVYGTD I D YNTVHG 
LGKATRKNQKWRGPDENIRANLRQYGLEKYYLDVLVSDASKPSWRKG 
TYFDAI ITDPPYGIRESTRRTGSQKEIPKGIEKWEKCSGFQQQISNI 
SLIILFGFLDVICGTLGGNLKVFKDDLFSVFLFKPSYLKCFEVIWEL 
P I H P AE SHL YH S VKAP YS PS FKFVCDL I LPACWTRTR VS RGQG VKGC 
HPDS PLSWFNT 

578 

A 

222 

666 

SLCFCFLE VSLPACQETYGKS PFW I LS I PFED IARNLMKRTVCAKS \ 
IFELWGHGQSPEELYSSLKNYPVEKMVPFLHSDSTYKIKIHTFNKTL 
TQEEKIKRIDALEFLPFEGKVNLKKPQHVFSVLEDYGLDPNCIPENP 
HN I YFGRW 

579 

A 

1664 

1846 

WQKLLFLFGTESCSVARVGVQWRHLSSPKPPPP\GSSNS/PASASRV 
A\ TGAHLHARL I FVFL 

580 


3 

A A 1 

rb bllb WDLAAA 1 1 D VS G VRGL S R L P S ARJ^LAjLALAKAVENYGM P F S 
RTEDGKI YQRAFGGHSLKFGKGGQAHRCCCVADRTGHS I LHTLYGRS 
LRYDTSCFVEYFALDLLMESGECRGVFALCIQDGSIHRIRAKNTIVA 
TGLL 

581 

A 

1 

693 

MDQDSASTKTCPGTHRKSGPRSREQHGALGAANAHDTVRDGERYDRW 
SGPRRREQHGALGAANAPDTVRDGERRSLRRDWRDCAEATADVSGVR 
SLSRLLSARRIJOjALAKAWPTVLQTGTRGVHFTVDGNKRASAKVSDS 
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I SAQYPWD \ HE FDAVWGAGGAG \ LRAAFGLS EAEFDTACVTKLFP 
TRSHTVAAQLENYGMPFSRTEDGKIYQRAFGGQSLKFGKGRQA 

582 

A 

306 

458 

I YIFLLNCI FLU 1 1 1 YFETESCS VAQAGW\ VQWHDLGSLQALPPGF 
TPFF 

583 

A 

47 

359 

SLLTPLVSPVPACPCPSKSEVSVPQEHPLPCRPLFRVWRNRQSRGCP 
QGVEHYGAGAWGLLPPVWKSCPPAKTCCKWGLP / CPPCGRSCLHSG\ 
GAALCLLCGGSA 

584 

A 

1248 

1667 

TNTGGWGPRHHAFVQHTLELLSLLCLNCGFLVYPFFLSWTVMYGSVN 
LVNLLKYGFQGIWFYYCVIKKLKKIWILCLPRPR\VPWLPTSDPSL 
RSPSSMVLNQICRVYSATKRGSVTGSWAYPCVLSRGMMSELCKSRA 

585 

A 

334 

698 

I KPEMEAGI S VTLGLDRNTRVQVGCRELRSTKY I SGWASCTS IQPSG 
REL/VCWGEC\LPLPWP\NWIGGGYGTKYWSRRSSQ\EWAVCSMD 
KTRYPEDSQLQCQDGSTRTYKITVSSLPAS 

586 

A 

258 

384 

NKSSESLNPPVCLAEFVKRLRYCEYLGKYFCDCCHSYAESC 

587 

A 

1637 

1834 

VFSTRCGTHPPVCLAEFVKRLRYC\EYLGKYFCDCCHSYAESCIPAR 
ILMMWDFKKYYVSNFSKTA 

588 

A 

1 

856 

MKRSMTLTI FHSERI ILRNRQPPPKESWEDIEDFGETHLDFLKQYGD 
FENGIPVHDTIARWSCISPAKFHECFINWMRDCHSSNDKDVIAIDG 
KTLRHSYDK5RRRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPE 
LLNMLD I KGKI I TTDAMVNCI VKVTGVFAINGDQRVAE PDSGALCQV 
RYPGNHR1TOQRYLFRHSDSELLHWMVALDAAVKRVKCLWMSLHKVYS 
CIELPELPQPESAG\HSGSQYA*CNRTRPHGQKPGTSAPGSSGVWRI 
TSV 

589 

A 

148 

333 

KDQDMRLRFWLLIWLLLGFISHQPTPVINSLAVYRHRETDFGVGVRD 
HPGPTWQNPISTKIG 

590 

A 

311 

577 

FFLSEILSVSLTFSLFQLLLFQVINTLADHHHRGTDFGGSPWLHVII 
AFPTSYKWITLWIVYLWSILTFCRGILVIANKNKLYKTI | 

591 

A 

279 

603 

FFLSEILSVSLTFFIISNYYYSQVINSLAVYRHRETDFGVGVRDHPG 
Q/ PWQNPI SHEKLDNLI III IGFLRRYTFNI LFCTSCLCVS ILTFCR 
G I LV I TNKNKL YKT K 

592 

A 

66 

1474 

TAPPPSPPSPSSLLSPCNGVHHVSTQPSCPGRGKMSKLLNPEEMTSR 
DYYFDSYAHFGIHEEMLKDEVRTLTYRNSMYHNKHVFKDKVVLG\VG 
SGTGILSMFAAKAGAKKVFGIECSSISDYSEKIIKANHLDNIITIFK 
GKVEEVELPVEKVDI I ISEWMGYCLFYESMLNTVIFARDKWLKPGGL 
MFPDRAALYWAIEDRQYKDFKIHWWENVYGFDMTCIRDVAMKEPLV 
D I VD P KQVVTNACL I KVWTTj I RGG PGCAG S PAVP PWS PGNAGS CTAP 
TWARAKVVRLRYEPEVRLEGHATLVTEGTKKERYQGPWKEPGAWLIG 
PEKVMEKSVLTPNVILDASYCWELAYQKIGEGVVWFIYITGGPDKTQ 
IAVLVQQEVDIYTVKTEELSFTSAFCLQIQRNDYVHAliVTYFNIEFT 
KCHKKMGFSTAMEEPC I P I FVLS KEKVNNWWTAQVES VLDEGEAG 

593 

A 

379 

475 

DLQGVNHLRP\GVQDQPGQHGETLSLLKIQNN/LARRGGGRL/QSQL 
LRRLRQDNRLNPGGGGCSEPRSRHCTPAWVTEQDPISKQNKRQQANF 
LIWEDFPY I PQGTKLI FTAQKLPRCK 

594 

A 

2 

307 

QAGV * * WDLGSLQPLPPRLKQFS / CILNPGNLSKEF * STKBTKQNI F 
VGHIQSQTSKFAISLIQIHPINMRSGTKTFMMV*GNKQRSKFPIWTF 
KIFPDMLPS 

595 

A 

374 

665 

GAQGL SLS PRLECNGA I LAHCNLCLPGS SNS PGS AS * VAGTIGMHHH 
ARLM FVFL V ES G FHHVGQAGLELLTS SDPPAS ASQS AG 1 RG ISRRAG 
LDF *» 

596 

A 

1 

687 

EIKYHSLPRLECRGEISAH*NLCLPGSSDSPATAS*VAGITGMRHYA 
QLI FLFLVET* FHHVGQGWSRTPDSNDPPAS ASQGAGDYRRD 

597 

A 

3 

198 

G I PGS S FCGLCGDVPGKPV * RADGSC *DGVAPRLLRPRGFRGGRCGP 
VLDSLAGQRGAESGCRG 

598 

B 

188 

246 

MAWLPGSCARVAFPAGAAAR * 

599 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
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SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 
HVDPENFKLLGNVXiVTVXiAIHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

600 

A 

3 

496 

HSLFGTSEVINKLRSPDAMGHFTEEDKGTITSLWGK\VNVEDAGGET 
LGRLLWY PWTQ\ RFFD S FGNLS S A S A I MGNP KVKAHG \ KKVLTS LG 
DAI KHLG * SQGAPFAQA* S ELH\ CDKAALLDPEELSSFLGEMLLGDP 
FLGNP I FGQKNFTPEGCKAFLGQKDG 

601 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 
HVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKIWTGVASALSS 
RYH 

602 

A 

38 

538 

APS PDA\ MGHFTE EDKAT ITS LWG KVNVE \ D AGGETLGRLLWY PWT 
QRFFDS FGNLS S AS AI \ MGNP \ KVKAHGKKVXiTSLGDAI KHL\DDLK 
GTFAQADVNCTCDK\ LHVD \ PENFKLLG\NVLVTVLGNP I FGKEFTP 
EGCKASWAERWVTWSWPVPCSSRIPLKPLCP 

603 

C 

83 

349 

MRSEAQSGPHRREWISHRYGTRPPGFKYQLQHXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXQNHTCVPWHACAYVSFNRVPCSC 

604 

A 

295 

448 

NQTCVCWHACAYVSSNRVPWSIGSAIRSVCCYPLPFP7VLCAPCL*IS 
TALD 

605 

A 

43 

208 

ARTLIEVIGLSHHNQKAHAMLLNAF*SSIRASP*RSGY*SSLWNQTS 
RFQIPAPT 

606 

C 

892 

1269 

MGQS QEVRVQGPC VWHWRRTRGPRLGQG WATVHLELDGE FAGAKCEA 
GREEHSALLLCDRLSALPLTPLDLLEKQVLEQGRAAVPRNAVLQGHS 
VVDDFSSAFSARGGQS VNELI SLSSGVASAWV 

607 

A 

187 

363 

MPLRLP IQDVY * I GG IGTALVGRVESGAVN YGLVGS S VH * AVQSVAL 
DFRFNSWMLYSQ 

608 

A 

514 

792 

DS RC PG AMACALRAP PR I TNP VS AQ WD PETG S VAGE P W I WTQRERNQ 
S ALQRGNVPFCRCRDHEFTLDLGGQGGQ IT* GQEFETS LANMAKPH 

609 

A 

1169 

1336 

NFASDDKFSLP*DYTTFKVNWPGAVAHA*NPSNLGGQGGQIT*GQEF 
ETSLANMA 

610 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 
HVDPENFKLLGNVLVTVI^IHFGKEFTPEVQASWQK1WTGVASA1.SS 
RYH 

611 

A 

3 

552 

HSLFGTSEVINKLRSPDAMGHL/TPEEDKATITSLWGKVNVEDAGGE 
TLGRLLVVYPWTQ\RFFDSFGNLSSASAIMGNPKVKAHGKKVLTSLG 
RCHKSTWDDLKG/TPFAQA*SELHCGQACMWDPEELSSFLGKCCWVT 
RFWAIPFSAKEFHPWRLQGFPGKSRRWQKMVTGV\ASALVPSRYH 

612 

A 

251 

488 

GGWITRSGDRDHPG*HGE/IPVY*KKLAGRGGGRLKSQLLGRLRQEN 
GMNSGGRACSEPRSHHCTPAWVTERLLLKKKKK 

613 

A 

5987 

6461 

GVNKSWNHWLFHLGVREAEHAGVVRTPHHDLVLLPRGMAACALGACS 
LGAVALPGHPVAGGVSHGCSSSGTTYRGERHRVGRMRQKNRLNPGGG 
GCGEPRSRHYTPAWAT\EQNSVSKQNKTKQNKKHRLVHWNRKKWIYL 
LFALKKYKMSLMLCRKS 

614 

A 

180 

347 

GLSFHMLKKKVYKYMCIYMYICTHTHTHIjI*MYINRPSLYDYQFLFY 
YLSQLDSC 

615 

A 

173 

339 

GLSFHILKKKVYKYMCIYMYICTHTHTHLI*MYINRPSLYDYQFLFY 
YLSQLDSC i 

O X D 

7\ 

JO J 

A T A 
414 

t X4AHHVHGGLCC VPQR 

617 

A 

540 

820 

KIGPQPGQHSKTLSL*TIKKKLARCGGTCLWSQLLGRLRWEDCLSAG 
S *GCGEL * S YHCTPAWVTERDLVS KTQKNRA I TGKLLLD VLSLLIR 

618 

A 

45 

366 

HHLHFLQRHKSQMRTMNTGLFEILTMLLTTMNLVDIFHEKADVVDMA 
YNAFDDEYLNCTDRMDI KYWQLLKEEQASHHH * DTECENA* ARWAA 
RMTHIFLPMDYN 

619 

A 

112 

322 

NSLIAGYILVYVKTPRTQKEKSFLKVLSTYIKVAKYKV* ILKSITFL 
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FTTKEYSKINIKMPFTTPFTTPN 

620 

B 

180 

372 

MAKDAGLIEANGELKVFIDQNLSPGKGVVSLVAVHPSTVNPLGKQLL 
PKTFGQSNVNIAQQWIX* 

621 

A 

39 

462 

GVYLKLSINTVGKDKBKVRRHHPITTQLQCGIQGHNIiVATVTAASEIR 
SFHLSPSSRFRRFKSIFCHVTRSGKSMLQMFTFPEKKNDFSGCPVRP 
RGRGRGEGAVRSAGRRG*ARAGAGPGGPGRGGVPALGRVAPGLVPN 

622 

A 

138 

322 

GFFQGAQKKRVHPKGPPGKSRF * KFFRAGPGGS I WNPHHLGGPGGPN 
CWPPGFQTPPNNPG 

623 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLWYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 
HVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

624 

A 

60 

548 

AP S PDAMGHFTEEDKAT I TS LWG KVNVEDAGGETLGRXiLVVY PWTQR 

ffdsfgnlssasaimgnpkvkahgkkvltslgrchkstwddlkgtfa 
qXlselXhcgqacmwdpenfXkllgXnvlvtvlVaipfsgkefhpwr 
lagfpgqkdg* lgv\ asalvpsryh 

625 

A 

32 

452 

SRRHGSSLWGKVirVEDAGGETLGRLL.VVYPWTQRFFDSFGNXSSASA 
IMGNPKVKAHGKKVLTSLGDAIKHLDDIiKGTFAQLSELHCDKLHVDP 
ENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSSRYH 

626 

A 

3 

570 

HSLSGTSEVINKLLVQTAMGHFTEEDKATITSLWGKVNVEDAGGETL 
GRLLVXnfPWTQRFFDXSFGNLSSASAIMGNPKVTCAHGKKVLTSLGDA 
I KHLDDLKGTFAQlA S ELH\CDKLHVDPENFKLliG /NKLLVTVLGNP 
IFGKEFHPWRLQ\ASWQKQKMAEDGDCSWPVPCSSRLPLSSLAHDGR 
VFQG 

627 

A 

62 

281 

SGVYFSFVQRLKTLTQHLILIFGFLP*EYPTLTTFFEGEIISKKHPF 
LTRKWDADEDVDRKHWVSKSDLCWVI 

628 

A 

1 

395 

TLTTFFEGE IIS KKH P FLTRKWD AYED VDRKHWG KFLAF YQ YAKS FN 
SDDFDYEELKNGDYVFMRWKEQFLVPDHTIKDISGASFAGFYYICFQ 
KSAASIEGYYYHRSSEWYQSLNLTHVPEHSAPIYEFR 

629 

A 

27 

121 

APTPDAMGHLTYED*ATITSLWCKVNVDDSG 

630 

A 

128 

380 

FNSRPSLRIGLFCITCSLWDCFWCCWIHASLYSVIRSLLSFGTLVCF 
SLFCVAFLSFSF*WLFFFLLFISSGFFSFVFLVFLF 

631 

A 

1 

371 

RQGPQGGRAAPRPPMPGPPLSSARHALPLGSWESWCCCCLIPADRPW 
DRGQHWQLEMADTRS VHETMMQASRKE IQSL.PKNGS FPS TNEMRLRY 
YSSYNQASE*P*KLSRPGLWDPIGRAKWD 

632 

A 

323 

586 

SQHFGRTKPEDSLRPGI*DQTGQPRETLFLQNLKKKLAGHGGACLQS 
QLLGRLRWEDRL I SGV * G CSDT * LYHCTPAWVTEQDPVS E 

633 

A 

1 

231 

FRGAVRAGLGFSCGFGVPDSGIGGAMSSFEGQMAEYPTISIDRFDRE 
NLRARAYFLSHCHKDHMKGLRPPTLKRRLE 

634 

A 

1 

686 

ARAGLGFSCGFGVPDSGIGGAMSS FEGQMAEYPTI S IDRFDRENLRA 
RAYFLSHCHKDHMKGLRAPTLKRRLECSLKVYLY*SPVTKEV1,LTSP 
KYRFWKKRIISIEIETP.TQISLVDEASGEKEEIWTLLPAGHCPGSV 
MFLFQGNNGTVIiYTGDFRIJVQGEAARMELLHSGGRVKDIQSVYLDTT 
FCDPRFYQIPSREECLSGVLELVPKLDHSEPVPCCVAEIjQ 

635 

A 

52 

264 

GLLVGVGAAAVMPGIVELPTLEELKVDEVNISSAAIiKDACHDYGDEW 
DKPNVDIML* *LEQLHPSMGQGAL 

636 

A 

1 

380 

KHSVCGRARASCSSFASVLFLISKMVREQYTTATEGICIERPENQYV 
Y K I G I YGWRKRCL YL VGLLLLM I LWN I TLTI W I LNVTW VS S TS TAP 
HR V * H * NS LCAETQLQLRL PKRGRHVNFE VP Y 

637 

A 

23 

360 

LCQSQATLLSIFSQEYQKHIKRTHAKHHTSEAIESYYQRYLNGVVKN 
GAAPVLLDIxANEVDYAPSLMARIjILERFLQEHEETSPFKGIINMMLR 
YPSQIPDGALAY*VYHGK 

638 

A 

1 

77 

ESYYKTEGDEEAEEEQEENLEASGK 

639 

A 

34 

421 

AVANMSGWESYYKTEGDEEAEEEQEENLEASGDYKYSGRDSLIFLVD 
AYKAMFESQSEDELTPFDMSIQCIQSVYISKIISSDRDLLAWVFYGT 
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EKDKNSVNFKNIYVLQELDNPRCKRILKLCQVKGP 

640 

A 

3 

347 

DAWAWPLNRG F FRMMMD I G AAGVMQGGD S VELLD I P I LKLPDG SD I P 
LPPILLGRLGFHPLKNTVCIYGHLDVQPAAIiEDGWDSEPFTLVERDG 
TLHGRGSTDD * VPLAGW INA 

641 

A 

1 

1254 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRDFIKNMlTGTSQADCAVLIVAAGVGEFEAGISKNGQTREHAIiLAY 
TLVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEAL 
DCILPPTHPTDKPIiRLPLQDVYKIGGIGTVPVGRVETGVXiKPG>5VVT 
FAPVNVTTEVKSVEMHHEALSEALPGDNVGFWKNVSVKDVRRGNVA 
GDS KNDPPMEAAG FTAQV 1 1 LNHPGQ I STGYAP VLDCHTAH I ACKFA 
ELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 
LGR F AVRDMRQT VAVG V I KAVD KKAAG AG KVTKS AQKAQKAK 

642 

B 

52 

554 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRDFIKNMIILNHPGQISAGYAPVLDCHTAHIACKFAELKEKIDRRS 
GKKLEDGPKFLKSGDAAI VDMVPGKPI * 

643 

A 

36 

697 

KLPLKAKMGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIE 
KFEKEAAEMGKGSFKYAWVX.DKLKAERERGITIDISLWKFETSKYYV 
TIIDAPGHRDFIKNMITGTSQADCAATLIVAAGVGEFEAGISKNGQTR 
EHALLAYTLGVKQLIVGVNKMDSTGIAILTGDFPSLEPRHVSTWLQH 
WTI PTRNWHKCYCVGWANFLNVSADFLNNF 

644 

A 

62 

1548 

plkakmgkeicthiVniwighvdXtgkstttXghXliykcggiXdkr 
tiekfek\eaaemgkgsfkya*vldklk\aerergiti\dislw\kf 
edqqviimwtiidcprdtetfiqkhdlqgtsogiiawfggppfwggspl 
ivccwclvefeswysskhwgrprehallgft\tgvwkqtklfgvnkn 

WIST*GHPTAQKXILKEIVKGKSAI»TLRKIGYNPDTSSILCPISGL»N 
GDHM PGAQMPNLP \ W FKEWK\ I TRKDGNASGTTLLEALDC I LPPTRP 
TDKPLGLPLQDVYKIGGIGTVPVGRVETGVLKPGNTVTVTFGPVNVTTE 
VKSVEMHHEALGEALPGDNVGFOTKNVSVKDVRRGNVAGDSKNDPPM 
EAAGFPAQVI I LNHPGQI S AGYAP VLDCHTAH I ACKFAELKEKIDRR 
SGKJCLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPPLGCFAVRDM 
RQTVAVG VI KAVD KKAAGAG KVTKS AQKAQKAK 

645 

A 

2281 

2431 

KLFFLKNVLLLNFFLPLLPSPVLVCYCCFSFFFLYLCLFFTVVIjGSA 
RNK*KLFFLKNVLLLNFFLPLLPSPVLVCYCCFSFFFLYLCLFFTVV 
LGSARNK 

646 

A 

203 

372 

HTDYIPNGPRTNEPTAHELIG*H**PSHFRT*VEQYNINADLKS*YQ 
HSMDRMLSG 

647 

A 

12 

357 

LFSSLSGGMQSLSLGQTGISEG1*N*LTSMAPGNLWHMRNNFLFGSRC 
WMTRFSAENIFKSVSFRLFGVKCHNTDSEPLKNEDLLKNLLTMGVDI 
DMARKRQPG VFHRM I TNEQD 

648 

A 

1134 

1633 

PNPTGTLMLARKWCPDGLQSLPPDVXjGS * GEHLLSCLCQERACSSNS 
RRPRQAWTGS QGS RV I NF F F F AKVGTCHQGPNTiQKAPKAPGTPPTPS 
YPGTPSRQLLWQWVQPRPALPASSPCSRHQLYIiPRQAMSWLLSPAPS 
VPLDFSGASPVWATLCFPHPRLPHR 

649 

A 

1 

370 

QRLAEMSNAI ERKHATKMTYQPTNGTLTSG YVADRRVKHHSGGEEPF 
QAQKQEPHPGTSRQ*QTRVNLHSLPDPEMHEQTYFRGMYRKLrGGwKA 
G PEGTF YD I PR YVTAYT F AHTRP AD I RAM 

650 

A 

1 

222 

UT T T nnT TfVTiVTTyWBTW^TDtlAPrMiVXrVTVtiMWDVDDPT 

HIjLIjGIMAAlaRPljVKPKlVlU^TKKt IRHQSDKYvKxKRI^WRiNJrKUX 
DNRVRRRFKGQCAVKKMSRQLMCTFSV 

651 

A 

3 

556 

LPTEVAAISFSLAIGKHSATVFSPWAGKPPSRILPHLGPLLPQKASW 
PPLRPLVKPKFVKKRPKKF I RHQSDRYVKI KRNV?RKPRG IDNRVRRR 
FKGQILMPNIGYGSNKK\T}QiMLPSGFRKFLVHNVKEQG\LEVIiLMC 
NKSYCAEIAHNVSSraRKAIVERA\AQLAIR\rraPNARLRSEENE 

652 

A 

2 

396 

PRVRAEHSIVGTRLVSGQLQPSQPSADHGKLTTMRIAGICFCLLGIT 
CAIPDKQADSGSSEEKHNAESSEETNDFIQETLPSKSNESHDHMDDM 
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DDEDDDDHVDSQDS I DSNDSDDVDDTDDAHQSDEAHH 

653 

A 

1 

1055 

TDCRVDPRVRPRVRVEHSIVGTRLVSCQLQPSQPNADQGKLTTMRIA 
VI CFCLLG I TCAI P VKQAES \ GS S EEKQL YNKYPDA\ VATWLNPDPS 
/QVRQNLLGPTGPFPSKVQRKGHGPHGMIWDGWKVDDDPC/VDSQDS 
IGLGTTLDDVDDTD\DSHQSDESHHSDESDELVP\DFPTDLPATEVF 
TP VVPTVDTYDGRGDS VVYGLRS KS KKFRRPD IQY PDATDED I TSHM 
ESEELNGAYKAIPVAQDLNAPSDWDSRGKDSYETSQLDDQSAETHSH 
KQSRLYKRKANDESNEHSDVIDSQELSKVSREFHSHEFHSHEDMLW 
DPKSKEEDKHLKFRISHELDSASSEVN 

654 

A 

2 

1103 

PGRPTRPPTRPGGGRAQHRLYQTLSQASCSLLSQTPTRGKLTTMRIA 
VICFCLLGITCAIPFKQADSGSSEEKHF\YNKYPDAVATWLKP*PIF 
RRQNLLG PTE WLCPLKETNGL * TRRTLS KLRSNGKAMDHM\ DDMG * W 
KVDGWTMWDSQDS IDS \N\DSDDVDDT\DDSHQSDESHHS \DESDEL 
VTGFSTDLPATEVFTPWPTVDTYDGRGDSWYGLRSKSK\KFRRPD 
IKY PDATDED I TSHMESEELNGAYKAIPVAQDLNAPSDWDSRGKDSY 
ETSQLDDQSAETHSHKQSRLYKRKANDESNEHSDVIDSQELSKVSRE 
FHSHEFHSHEDMLWDPKS KEEDKHLKFR I SHELDSAS SEVN 

655 

A 

3 

267 

TI NPKLLKNHARGTPEVPNQKHHWS PKQKPLLTKNPKNNLPRWPVP I 
IPPFPEEKPRKFP*TPKRRIP*TKIGPLPSNPGYPTKPPF 

656 

B 

197 

404 

EQQQLQAQHLSHGHGPPVPLTPHPSGLQPPGIPPLGGSAGLLALSSA 
LSGQSHLAI KDDKKHHDAEHHRX * 

657 

A 

2 

652 

LSRVTjEFEEFQFI^AQYHSLKLECEILASEK^EMHRHYVMYYEMSYG 
LN I EMH KQTE I AKRLNT I C AQ V I PFLS QEHQQQ VAQAVERA I Q VTMA 
ELNAIIGVRGLPGLPPTQQQLQAQHLSHGHGPPVPLTPYPSGLQPPG 
\ I PPLGG\ SAGLLALT\ S ALDGQSHLAIKDDKKHHDA* HHRGDRPGK 
PD * DFVL I LLQCCKWC IAKES PNL YREQT 

658 

A 

455 

1033 

GLYFGKVGEVIEQVKAFDELKTUjIAFFPLSSLSFLAPFVCLIiNLLFC 
FRLPLLSCLFSPGCRIiHRHHCHHYLLLLLLFLSHHHHQQQQQLQAQH 
LSHGHGPPVPLTPHPSGLQPPGIPPLGGSAGLLALSSALSGQSHLAI 
KDDKKHHDAEHHRGERPGKPD*DFVLILLQCCKVVCIAKESPNLYRE 
QTMR 

659 

C 

178 

360 

MERPERKIXXXXXXXXXXXXXXXXX^^ 
XXXXSYYSFKNLI* 

660 

A 

365 

801 

INVNFLEF YMTD I TCFLFS YLSTLM S P I YLD VLLLSLLIjFLFH I AGM 
HILTFINHDI*VVHSFYLRI*HAMEITVYEFICLRIRYPFKASKIFS 
MYNATVI J FHI*CDR*LGS*IGILERSVGVCSVMVCIVIYIRYVISSS 
VDAP 

661 

A 

2 

58 

AGRCAQHGQPPAAPACRE 

662 

A 

218 

342 

QPKMKYSGMPGLLPSVRSVQAILDGSLRPPKKTCHLKNFGA 

663 

A 

527 

665 

AGLLQNTAGFLGMPGLLPSVRSVQAHIGMGSLRPPKKTCHLKNFGA 

664 

A 

1233 

1706 

CLVFWLLYYPGLQLPFHLLPHSLGSGQQSSEESAATPSLAEGRREAA 
FRCGPAGLARQAWRPHARLPGPAAPLAAPSLAPLPTGRSGRWLPMLC 
AAS C * S PS PAMS VYPQRRLGQGG AEDS EGGP VSRGCRAPGPQRPLAA 
PGEQTAARTQAICLIW 

665 

A 

598 

1159 

CRQTGVKIYDVWFFWLLYYPGLQLPFHLLPHSLGSGQQSSEESAATP 
S L AEGRREAAFRCG PAGLARQAWR PHARLPG PAAPLAAP SLAPLPTG 
RSGRWLPMLCAASC*SPSPAMSVYPQRRLGQGGAEDSEGGSVSRGCR 
APGPQRPLAAPGEQTAARTQASVYLLLPHSPPLLSALHIHHLLPPS 

666 

A 

358 

644 

RRSLALSPSPECSDVISAHCKLCLPGFTPFSCLSLLSSWDYRCPPPR 
LANFFFVFLVETGFHRVSQDGLDLLTS*SARXiSIiPKCWDYRCEPPRP 
A 

667 

A 

55 

288 

PFKKGGFFNGNLPTTFGKLGNKKFPLNLFPGGPPLPHGGLKKGPGAP 
QVPPLFIiKKPQTSPP*GLREGPPPLKKFYIiF 

668 

A 

2 

386 

ARGEPTMELSHADTTNVKAAWGWVGAHAGE * GADVLVRMVLCLPTIW 
TYFPYFDLSHGCAQVNGHGKKVADALTNAWHVDDMPNSLIGLSELH 
AH* LLVDPVNFKLLNHCLLGTLADHLLDEFTPAV 
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669 

A 

787 

896 

LLYNI YCGEKKREERKD * PGQHGETPS LQKI QKLAG 

670 

A 

539 

787 

SIGRLRRVGHLRSGVRDQPGQHGGTPSLLKIQKFAGHGGAFLYSQVL 
GRLRQENRLNSRGGGCSESGSQHYTPAWVTE*DSI 

671 

A 

24 

452 

APS PDAMG/ HSLWGKVNVEDAGGETLGRLIjVVYPWTQRFFDSFGNLS 

sasaimgnpkvkahgkkvltslgdaikhlddlkgtfaqlselhcdkl 
hvdpenfkllgnvlvtvij\ihfgkeftpevqaswqkmvtgvasalss 
ryh 

672 

C 

336 

728 

MGLPKTGHQQHFPQELESSSRIPHAGLSQVQVHFRLGQRCPLRSSQV 
LLWHLPRKVSTFLPCALTLGVAHGWQRQRTGCQSCQRTSGSMGRQPG 
AFPGFLLQHLSTFTLPHRLCDSSLVLLCEMTHGRLD* 

673 

A 

1 

1254 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGS FKYAWVLDKLKAERERG IT ID I SLWKFETS KYYVTI IDAPG 
HRD F I KNM I TGTSQADCAVL I VAAGVGE FEAG I S KNGQTREHALLAY 
TLVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEAIj 
DCILPPTHPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVT 
FAPVNWTEVKSVEMHHEALSEALPGDNVGFNVKNVSV^ 
GDSKNDPPMEAAGFTAQVIILNHPGQISTGYAPVLDCHTAHIACKFA 
ELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 
LGRFAVRDMRQTVAVGV I KAVDKKAAGAGKVTKS AQKAQKAK 

674 

A 

1 

921 

KFETS KYYVS 1 1 DAPGHRDF I KNM I TGTSQADCAVL I VAAGVGE FEA 
GISKNGQTHEHALIAYTLGVKQLIVGWKMDSTEQPYSQKTYEEIVK 
E VNT Y I MK I GCNLDTAAFVL I S VNVTTE VKS VE I HHEAL S EAL PGDN 
VGFNVKNVSVKDVRHGNNAGDSKNDPPMEAAGFTAQVIILNHPGQIS 
AGYALVLDCHTAHTACKFAELKEKIDRCSGKKLEDGPKFLKSGDAAI 
IYMVPGKPMCVESFSDYPPLGHFAVGHMRQTVAMGIIKAEDRKAAGV 
GKVTKSAQKAPKPSEYYPLYLPPQS 

675 

A 

659 

1601 

SRASSSWPLALPSLQVTFHPLNEGTLAVASSMLSSFQPEIGTNATV 
LGL*PIFLM*VLTSLTISSYLFWL/RGWLSGTTLLEALDCILPPTRP 
TDKPIX3LPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFGPVNVTTE 
VKSVEMHHEALGEALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPM 
E AAG F P AQ V 1 1 LNH PGQ I S AG Y AP VLDCHTAH I ACKF AE LKE K I DRR 
SGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPPLGCFAVRDM 
RQTVAVGVI KAVDKKAAGAGKVTKS AQKAQKAK 

676 

A 

1 

1335 

MSAEVIHQVEEALDTDEKEMLLFLCRDVAIDWPPNVRDLLDILRER 
GKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLVSDYR 
VLMAEIGEDUDKSDVSSLIFLMKDYMGRGKISKEKSFIJDLVVELEKL 
NLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSYRNVLQA 
AIQKSLKDPSNNFRSIPEERYKMKSKPLGICLIIDCIGNETELLRDT 
FTS LG YEVQKFLHLSMHG I SQ I LGQFACM PEHRD YDS FVCVL VSRGG 
SQSVYGVDQTHSGLPLHHIRRMFMGDSCPYLAGKPKMFFIQNYVVSE 
GQLEDSSLLEVDGPAMKNVEFKAQKRGLCTVHREADFFWSLCTADMS 
LLEQSHSSPSLYLQCLSQKLRQERKRPLIiDLHIEIiNGYMYDWNSRVS 
AKEKYYVWLQHTLRKKLILSYT 

677 

A 

1 

811 

MSAEVIHQVEEALDTDEKEMLLFLCRDVAIDWPPNVRDLLDILRER 
G KLS VGDLAELL YRVRRFDLLKR I LKMDRKAVETHLLRNPHL VSD YR 
VLMAE I GEDLDKS DVS S L I FLM KD YMGRGK I SKE KS FLD LWELE KL 
NLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSYRNVLQA 
AI QKSLKD PSNNFRS I PEER YKM KS KPLG ICLIIDCI GNETELCG VR 
GP AGGQQPLGGG W ASDEECG I QG S E ARAVH S S PR S 

678 

A 

269 

561 

LSFVGSLSRXiFTAGFAECPRFRFRPAPNPRTPSTRNSAPDSQKGATW 
ATAVQAAPRWT * ETGSNPWPTLAAPCSRS PRGPPAS FSGGPS AS CKL 
CLK 

679 

A 

415 

1094 

SKSRMSAEVIHQVEEALDTDEKEMLLF\LCRDV\AIDWPPNVRDLL 
D I LTERGKLS VGDLAELLYRVRR FDLLKRI LKMDRKAVETHLLRNPH 
LVSDYRVLMAE I GEDLDKSDVS S L I FLM KD YMGRG KI S KEKS FLDLV 
VELEKLNLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSY 
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RNVLQAAIQKSLKDPSNNFRMITPYAHCPDLKILGNCSM 

680 

A 

551 

1706 

SKSRMSAEVIHQVEEALDTDEKEMLLFLCRDVAIDWPPNVRDLLDI 
FT/ RERGVLS VGDLAELLYRVRRFDLLKR I LKMDRKAVETHLLRNPH 
LVSDYRVLMAEIGEDLDKSDVSSLIFLMKDYMGRGKISKEKSFLDLV 
VELEKLNLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSY 
RNVLQAAIQKSLKDPSNNFRLHNGRSKEQRLKEQLGAQQEPVKKSIQ 
ESEAFLPQSIPEERYKMKSKPLGICLIIDCIGNETELLRDTFTSLGY 
EVQKFLHLSMHGISQILGQFACMPEHRDYDSFVCVLVSRGGSQSVYG 
VDQTHSGLPLHHIRRMFMGDSCPYLAGKPKMFFIQNYVVSEGPAGEQ 
QPLGGWMGQR 

681 

A 

76 

158 

KDKRVGKMYKEVPLSKGSKKYNDINI I 

682 

A 

1 

2313 

MVYRQNQLPSSCAPEMGREDGLRALHPAQPPPPRAQQSPPPPLSAQK 
SLHLPVHSKALHRPCSPPTVAILNPSQPHRPPVHSPGSGLGFTQLSP 
SLEASLGHGLQHARPARHPLPLHMGWATFTLTLVSAPWKX.RRRPRLQ 
VAGVPHVQRMEPWAPHAPAELLETQAQLWGHRRGHWIPDGRACNTEF 
THPGLEIGARESGSACKGPPPGSLVPFPASPLGSKINGGSCYQILTG 
SSGKICRINGCSCQNKWSMPTSATASLQTCPARAVPDLGIARRSWSP 
HRRPSWTHEKSSCPSQKAPQGRAVPLQPGKGGEPVSFWSPGQAPSKA 
RGRRAGSAAMRGRAAKLLSRSRSTGGLCVTjCGQGTVGLQMQGRAAPP 
APPPAHPLFPRPVSPLDFSASKRAIiQHPALLGGKDSWRPECGHLNLS 
HLPSGVKDSWGSECGHLNLSHLPSGVKDSWGSECGHLNLSHLPSGGK 
DSWGPECYVFCMLHRLPEQHDCTFDHMGRGREEAIMKMVKLDRKVGR 
SCQRIGEGGQEDDWASILVSMQPHSPRTPSPHLPYSKFKDFSSCNC 
ARQGWKEMGACVQRPSAVLGGTRPVSVQAKQPLPCFLLGASAliGSCG 
TGLG VKDRD I DR I VVMS VMKI LRGNQRPSGDEQARGQRAREGLGGLG 
VRRACREQWAVLCPL VHRNWVCVS SFLIASQSNS PQQQHNCC YANNS 
PQQEEGMVGRQAETFLLGGVRNRKPWVGVGRGLQGSHQQHIQGAVSS 
PSSPKKGVSKREGPRWAQQ 

683 

A 

788 

926 

PSPELPEGDFEGFFPQKIjQ* SCLPTLQKKKNNNNNNN1TO 

684 

A 

242 

438 

KSCFFCLIRCYDSLPLPLLLNNFFFLGTESCSVAQAGVRWLNLSSSA 
TSGLPGS INS PASAS * AA 

685 

A 

1 

1254 

MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGS FKYAWVLDKLKAERERG IT I DI SLWKFETS KYYVTI IDAPG 
HRDFI KNM I TGTSQADCAVL I VAAGVGE FEAGI S KNGQTREHALLAY 
TLVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEAL 
DC I LPPTHPTDKPLRLPLQDVYK I GG I GTVPVGRVETGVIiKPGMVVT 
FAPVNVTTEVKS VEMHHEALS EALPGDNVG FNVKNVS VKDVRRGNVA 
GDS KNDPPMEAAGFTAQV 1 1 LNHPGQ I STG YAP VLDCHTAH I ACKFA 
ELKEKIDRRSGKICLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 
LGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK 

686 

A 

73 

1553 

PLKAKMGKEKTHI\NIWIGH\VDSGKSTTTGH\LIYKC\GGI\DKR 
TIEKFEKEAAEMGKGSFKYAWVIpDKLKAER\ERGITIDISLWKFET\ 
SKYYVTYPFDAP\GH\RDFYPKTWITGDIFRLDWFGGPPFWGGSPLI 
VAAG\VGEFEAGYLPRNGADPERHA\LLAYTLGCETTKLSGVNKNGI 
PLEPTPTAQKKILKEIVKGKSALTLRKIGYNPDTSSILCPISGW\NG 
DNM\LEPSA\NMPWFQGDGKVTRKDGNASGTTLLEALDCILPPTRPT 
DKPLGLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFGPVNVTTEV 
KSVEMHHEALGEALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPME 
AAGFPAQVI I LNHPGQI S AGYAP VLDCHTAH I ACKFAELiKEKIDRRS 
GKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPPLGCFAVRDMR 
QTVAVGVI KAVD KKAAG AG KVTKS AQKAQKAK 

687 

A 

67 

408 

KAAGKRGDFKTEGA YQRQRT I FKNKKRARRE KTGRENIiRGNYKNMGR 
GLKTPRGALGGPYLDKKGPLRGKGPFQGGTLLSSSPKIKIQGTL*IR 
RNYWPYIRRYNRFKKRQKNM 

688 

A 

10 

386 

AGAAAIGEPMA*RGEGHPPWSVDERADSTNVNNWHV?TERDASHWSTD 
KLKTLFIi/mHVQNEERNGEATEVSKLDGEASINNRIGKL.MFFYEWGV 
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TT *T\WTTZ A C * QnVDVVPinfl? T DVT Ti "PitTXTr"*! 7TVC 

J. xjUW ** ouv xrxxsAjri vktXir x Lu\jJctV4\^VUEM [ 

689 

A 

2 

323 

REMDLQVHNAMDQLEQRVS E FFMNAKKNI PEWREEQMAS IKKDYYKA 

LEDADEKVQLADQIYDL/RSLELDTPSQPVNNHHAHSHTPAEKRKYN 
PT^HHTTTDHTPPK 

690 

A 

2 

495 

AKKNKPEWREEQ^SIKKDYYKALEDA\DEKVLLSNQIYDLVDRGGR 
KLDQELAKFKMELEADNAG I TE I LERRSLELDTPSQPVNNHHAHSHT 

PVEKRKYNPTSHHTTTDHIPEKXFKSEALLSTLTVRCLLRENTLGCR 
\ 1STNN S TA S S NN A YNVN S S O P TjG YN 

691 

A 

50 

392 

GSQKIAPVFWSGFECDSEIRLTDLESWLYILPATQPWSLTYFYFQFS 
FILTFLRYDKLFFFFTLYLYLNLYFFLYVFLFIV*ICIFCYIYLVFI 
FKCVV*FFIFFFVPFMT.FFT, 

X. IVw VV X X. X. x I I V 1 iU C ± XJ 

692 

A 

3 

548 

PAPPGRVFAAWPGSEEARVLWliKGCACRAGLFKMYPLLPMTEVCTK 
SSTVTWFCFAAYESTiPPTPF^ATnVQVT.VPQMnT TT AT Q *W*i?A npn 

♦LLASLFSQDPLVSPSAAMWASPACFCAEPSSAATGV^ ^P^^^VPTO 

QS * *GSPLAGSASASAKNMARARSVAPPRNGRPGNTNADS 

693 

A 

3 

391 

DAWVDPRFRIiI FLKTLTGKTMAVDLEPNDTI ANAKA * TODKFOT PPD 
QQRIil FAGKQLENGRTLPD Y 1 1 HKESTLLL VXRLKGGMQ I FVKTLTA 
*TISLQAEPSDTIHNV*EKIQDKESIPPDQHTVIL 

694 

c 

70 

225 

MN 1 0 Y I OHE S VYONVR PS ORE S TGR W QHCPP OOP <3 p p t p FTVF F T n 
QHGV* 

695 

A 

277 

396 

EAP * HPDF I KNMITGTS * ADCAGLI VAAGGGVFKS WDLQK 

696 

A 

549 

7 09 

QDNPWP PMW AnT4T.PT.r2\TPT^014f^01Jr'T? r riOCT T V T nvT APUrTPDT * C?i^ 
LVRKTjR 

XJ V 1\ 1> I IXy 

697 

A 

82 

417 

AFQEMELKRPQEEDVAALPEGMRPHS1JI»YGFSWE*VICAAVVGLFAA 
L F FLWR S FRTVR I R I HVGRERKXiALML S GLNQ * KS E LLGHC I LAQNE 
YDGYELDE * FVDLRVDKE 

698 

A 

751 

1063 

I * KIxAGHGGAHV * S * LLRR PRQDNPLS PGGQGC S VL * SHHCTPAWAT 
QGDPGSKSKN 

699 

A 

3 

1 7 

ILQR 

700 

A 

3 

443 

DLEGTKMFQVRNAWAMEWDKKERRKE 

GRKEKKERKKERKKKRKETKEGKJ^ 
RLFRV 

701 

A 

147 

437 

WYVKXERKKERKQERKKERKKERKKERi^^ * REREREGR 

KEGRKEGRKEGRKEGRKEGRKKGRREGREKER 

EK 

702 

A 

505 

637 

r»T,T.T.Krc t YRPHOPHP ♦nniCPPK'Trnp'irirPTJTrvwD vttpd vvi?d vvt?d vv 
E 

703 

A 

220 

476 

HNSLVVTYFEIWSRILFWSCSIHTSWTH*SLLSLGHLKYIWAGHG 
VSHL*FQHFGRPRWENHLRPGL*DQPGQHSETPSLLKK 

704 

A 



xJxjxxjixixixjr r v iKyyr or xir r r rof r tlx* r xArlJj Inyb x Vir LbVi X 
VIIHLDFYISLFLTYDI^F*TKIlTr*AXLLLIYElWLIVl J * 
SNYTVL I VQ 1 1 ASL IHS STK* I SQCANI IKI*IILH*L*S S VE I NS Y 

JJ ur uijjv ion! xxjxaxj v Xj\^ro\^Xr ox Z 11 X XxiXvX IvJjLT JUJN X2» JM*U1 X 

705 

A 

1039 

1560 

SLVLDLTGLPRPTSPSHERGLDSSSSGPGLSTGP*ECSPDPNPMCPK 

VLPGPDOPCWN^HQTAPTAPPPPVPQP * QnCUTD^/DfiTr' ATT MDDfA 
vurvjr J^vA-V^niMono irurxrVT XrXZ*xrvi\.OXv OooW X Xr V lrj-\ X X XjJM ir rv 

RGLAIPNSLVSSP*GPAGWRELPSSQQAHPHPKR*GALRSENRQEGP 
DQGCGNCFVKYGGRVGRVGYSGPVRTGCSRASN 

706 

A 

644 

931 

EGSHGLGWAHACNPSTLGNRCGVDHLSSGVQGQPGQHGKTPFLQKI 
QKLAGHGGTRL*SSYERLRWEDEGGRGCSEPRSRHCTPAPWATE*DP 
V 

707 

C 

185 

397 

MMWLSYIQPAGSEESRLQ1TOAXQPDWIFEWKKCSRIYGRTVAPAAK 
LPRKHRGNPFCFPRTXERRNKTKT 

708 

A 

3 

521 

ckmdagff\rgtsaeqdnrvsnkqk\ kllkqlkpaeclekkvdmskv 
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NLE V I KP \ W I TKR VTE I LG FEDD W I E F I FNQLEVKA* TKMEVI PCV 
*SIAPSPY*VTAE*VSQCRSDPTPLMMQCVFG\NPDSKMMQINLTGF 
LNGKNAREFMGELWPLLDSAQENIAGIPSAFLEL 

709 

A 

730 

983 

GAVAH ICKSQQFGRPRQAHQLRSRV* DQPGQNDETPS ILKIQKLAGH 
GGTRPLS QLLRRLRH * NHLNPGGGGCSELRSRHCI PG 

710 

A 

108 

420 

GWGGLLIARPPFKRGKSSGPVGHLKPRGVGGRRG*NVNLCPKSSNPQ 
GGLGERPRELRDKATVS ALANFTARKGC * LRGPEGGQPGPRGKLFPK 
GVPGVQGQP 

711 

A 

274 

394 

SPFPKEVGGGKFLNLGGQRFWGPKFGPLFSAKTKKGTSFLK*GFKNP 
PAQGGKPPFFLKIQN*PTLGGGPLNPHFLRRLGGENF*TSEGKGFGD 
PNLALCS PPKQKKGLLF 

712 

A 

59 

306 

FGTDRTAVQTSSSQRLCLPWAQKTYWLLVPSSLLKDLKEKKEWEE 
A*NGRDAPANGNAVSVCFAPEPWQLPPHKIFPVLIi 

713 

A 

171 

565 

TLAFFLIPCIGSPACPTMSDAAVDTSSEITTKGLKRRRRKLWEEAEN 
GKRRPCLTGNAENEGKWGS REA\ DNEVHQKEE * CGEEQEEEEEGDGE 
EEDGDEDDETLSATVTRATEDEDDDAVAPTKQTTRLDE 

714 

A 

1 

1497 

MSLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSGDS 
VKTIAKLWDSKMFAEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIV 
DANNLTVE I ENELNI IHKFIRDKYS KRFPELESLVPNALD YIRTVKE 

EACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVA 
GGLTNLSKVPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQS 
LPPDLRRKAARLVAAKCTIAARVDSFHESTEGKVGYELKDEIERKFD 
KWQEPPPVKQVKPLPAPLDGQRKKRGGRRYRKMKERLGLTBIRKQAN 
RMS FGE I EEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKARI SKT 
LQRTLQKQSWYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEK 
KVAEANQKYFSSMAEFLKVKGEKSGLMST 

715 

A 

32 

3730 | 

N S ERRGLGM S LADELLADLEEAAEEEE GGS YG EEE EEP A I EDVQEET 
QLDLSGDSVKTIAKLWDSKMFAEIMMKIEEYISKQAKASEV\MGPVE 
AAPEYRVI VDANNLTVEIENELNIIHKFIRDKYSKRFPELE\SLVPN 
AL\DYI\RTVKVSAEKELGNSLDKVARAIEEPWQQILTNATIM\WS 
VXTASTPQGQQVLSEEELERLEEACDMALELNASKHRIYEYVESRMS 
FIAPNLS 1 1 IGASTAAKIMGQLPSLSEPPLHLPIiAEWARSPAWQGHR 
GIQPELHVKVPSTRRAPRQIYSPHSLLSSDRPLPPSLPRRCGGGLTN 
LSKMPACNIMLLGAQRKTLSGFSSTSVIjPHTGYIYHSDIVQSIiPPDL 
RRKAARLVAAKCTIiAARVDS FHE S TEG KVG YELKDE I ERKFDKWQEP 
PPVKQVKPLPAPLDGQRKKRGGRRDEKRGGEASSAAAILDRARSASA 
EPSADAAASLEPRPPSSEQPRAQSNKPTIRHTRRYRDSARARGARKR 
RRALAS PACRDTLLSGGASS CARGSRRGAAPS LS PSRSCAS FTFLS A 
SRGSFREPGAPGPARRRLPAWHRVGPPGEIDRCLKKVSEGVEQFEDI 
WQKLHNAANANQKEKYEADLKKE I KKLQRLRDQI KTWVASNE IKDKR 
QLIDl^KLIETQMERFKVVERETKTKAYSKEGLGI^QKVDPAQKEKE 
EVGQWLTNTIDTLNMQVlDQFESEVESLSVQTRKKKGDKDDARVQTSR 
LFL S KDPGS PS PNPQPDARVLASNLLVCQ I QQS KPLTFS SLRMTPVH 
KRVLSDSLLSRIGPQRDVSHFLLAAYRSLRLVPGLGMSLADELLADL 
EEAAEEEEGGS YGEEEEE PAI ED VQEETQLDLSGDS VKT IAKLWDS K 
MFAEIMMKIEEYTSKOAKACJPnPTPQT TDPVTdpdcct ddt it at dot 

CLL PLPD PTGH KMMWS NFFLQE ENRRRGAAGRRRAHGQGRSGIjTPER 
EGKVKlaALLIxAAAPCAXRPHLGLWKACTKRLWQADVPVDRDTCGPAE 
LPGEANCTYFKFFTTGENARIFQRTTKKEVNLA7VAVIA\njGLAVMAL 
GCLCIIMVLSKGAEFLLRVGAVCFGLSGLLLLVSLEVFRHSVRALLQ 
RVSPEPPPAPRLTYEYSWSLGCGVGAGLILLLGAGCFLLLTLPSWPW 
GGGTGPPRATLPPGTTSSNYSLGNFVRLGLAAVIWIMGAFLVEAWY 
SRNVSPGESEAFKPE 

716 

A 

3 

117 

HKR PLMAS YLL F FC S FVRVE * T 1 1 TCLT * * ECHRYNR 

717 

A 

1007 

1119 

APGPCPNQTLARDSGWVG * KNHQLCTCVCVCVCVCVC 
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718 

A 

1692 

2123 

SQPRFSWTQERLRLFPGMLLNSLIVSQATQGPLK5CLGSCLMAFTFK 
KMEPSPMSCSPQCFGKPMMGRLSGGQKFKNQAGQYHKTPSLRKSKKL 
ASHGGVHLWS*LLRRLRQEDCLILGIRGCSEL*LHLCTPPSATQRDP 
LS 

719 

A 

1 

371 

HRHCHPSHTVDGRMESIDVATDGG*RTGVLAVSGLSVSLRTAELSTR 
KL LMN S EQR I NR I MG FHRLG S GAVE * RHTQ S MQHD S D KLN S L> 1 1 P S V 
SKRWLVDSVCTGTTDHQGGVLERKVTTL 

720 

A 

3 

409 

DAWANARSRVRLEVAGLRRDL*R/CSERWPLARATPFCFGSAMPMYQ 
EESNLSLQALESRQDD I LKRLYELKAAVDGLS KM IQTPDADLDVTN I 
IQADEPTTLTTNAIiDLNSVLGKDYGALKDIVINANPGFPPLF 

721 

A 

25 

494 

PLTVS ERW PLARATPFCFG S AMPMYQVKP YHGGG APLRVELPTCMYR 
LPNVHGRS YGPAPGAG \ HVQEESNLS LQALESRQDD I LKRLYELKAA 
VDGL\SRRIKPPRATLD\ETNIIQADEPTTLTTNALDLNSVL»GKDYG 
ALKDIVINANPGSPPLS 

722 

A 

358 

445 

NSFGRTIFFFY*FFFFFFFFFTIQTFKGGPLSPFFFLLFFINPFPRF 
FFFF*TPFLRALAFFKPFFFFKIVLAEPYFFFTNPQILMRGGGGFSG 
RGS 

723 

A 

452 

765 

LLFSKCRAVEKNLSKEDPHMETIKVAFVMTSGERDTGLTEPMAG*RG 
S*GS**DGPARGHETSLGHRGVSGDGGSMGPSLTANALPSISRPFPD 
SVTDSGSAPL 

724 

A 

209 

385 

KYETIRYRDSVKGRFTIF*HLTKNCLHLRAIGLGADDSHFYSCANDV 
ADSGTTLFFDS 

725 

A 

68 

362 

SLPTIRECIYIHWHAGDHIGNACWELYCL*HGIHPDGHMPRDLTIG 
GGEDSFNTFFTETGASEPVPRAAFADLEPTVIDEVGTVTYGQYLQL* 
QLIK 

726 

A 

1 

1254 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRDFIKNMITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALLAY 
TLVAFVPISGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEAL 
DCILPPTHPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVT 
FAP VNVTTE VKS VEMHHE AL S E ALPGDNVG FNVKNVS VKD VRRGNVA 
GDSKNDPPMEAAGFTAQVIILNHPGQISTGYAPVIiDCHTAHIACKFA 
ELKEKIDRRSGKKliEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 
LGRFAVRDMRQTVAVGVI KAVDKKAAGAGKVTKSAQKAQKAK 

727 

A 

65 

1552 

PLKAKMGKEKTH I \NI WIGHVDNGQVS PLLGHL I YKC\ GG I \DKRT 
IEKFEK\EAAEMGKGSFKYA*VLDKLK\AER\ERGITI\DISLW\KF 
ET\ SKYYVTYPFDAP\ GHQRLLS KNM I YRGHLKA * PGLGGPLFGG VL 
P*LFAAGVWWNLKAGISKNG\Q\TREHALLGLHTWVWKQTKLFGVNK 
NWIST*GHPTAQKKILKEIVKGKSALTIiRKIGYNPDTSSILCPISGW 
\NGDNNAWSPNA\NMPWFQGDGKVTRKDGNASGTTLLEALDCILPPT 
RPTDKPLGLPLQDVYKIGGIGTVPVGRVETGVLKPGMVVTFGPVNVT 
TEVKSVEMHHEALGEALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDP 
PMEAAGFPAQV 1 1 LNHPGQ I SAG YAP VliDCHTAH I AC KF AELKEKI D 
RRSGKKLEDGPKFLKSGDAAI VDMVPGKPMCVE S FSD YP PLGCFAVR 
DMRQTVAVGVI KAVDKKAAGAGKVTKSAQKAQKAK 

728 

A 

344 

803 

KRDPSLSAYPFFKCNLLSTLHFMTAYASLYFIYTEPVFVQHDWLHSK 
AL * I PFH ISPS TT 1 1 PLTMVSQ P YTEH Y I VTKLGE FQVS KGNY I HKR 
FCFI AQFTI * LVITQPTMLR* I SDYLDKY I GFCPQPQMKQERHTYS P 

729 

A 

209 

373 

KRKKILRINKTKS*FFEKINKIYKMLTHLRRDKRQKKSEIKKGDITT 
DTTEMKK 

730 

A 

24 

452 

APSPDAMG/HSLWGKA/NVEDAGGETLGRLLVVYPWTQRFFDSFGNIiS 
SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSEIiHCDKL 
HVD PENFKLLGNVL VTVLA I HFG KE FTPE VQ AS WQ KMVTG VASAL S S 
RYH 

731 

A 

99 

641 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLLWYPWTQR 
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FFDSFGNLSSASAIMG/NTPKVKAHGKKVLTSLGDAIKHLDDLKGTF 
AQLSELHCDKLHVDPENFKLLGNVl,\rrVIiAIHF\GK\EFTPEVQASW 
QKMAED\VTGVASALASTYPLSSLAHDSELSRDRLLFCKQIKI 

732 

A 

154 

411 

AGQPG PEG I QLG * PQ I RDKGDQMGPHPCW * VCGTLTPNTS F * VFCCV 
AGRDGAPGVKAHFPAAQKPWDGGFPGP*SAGVGPPPIPR j 

733 

A 

233 

353 


734 

A 

1017 

1882 

TLAKWWTRREQRSQLSTQASSYHQWQRVGACGQAARMRSAPAVIPTR 
PGEFVHOSLESASNSRLOPVTSTnHP^APT.PZir:! T TnmjT mdppcac 

QASMELHSWSPCPDGFCTYRVLRSCPLCNCSRPDGPEKENHGQEPEA 
AVRSQPEGPEGGTRAGRPEPGGGPHLQEPKASETHNPGIPSREGAAA 
QGHLQGSGTHVRGGKQPSAAGPGRIIARVLSGPPGPRARGRGCADGS 
*ASPRARSSCRSGGRSRAAAPRRRRGGAGAARACARGCAGG*AGSAA 
RGSSGR 

735 

A 

1000 

1211 

VPSTVLDALHILFCLISFLKNKLGWCSGLTPVIPGLWEAEAGGLLES 
QEFGTRLVNMAKPCLY* KYKKLA 

736 

A 

109 

346 

PRSLSPHRPAPPPSVSVGFRPRGKRGTQGDKGKCLGYRGPQQL»*PHG 
TPGPTQGRRARGCYLQPGAWGPLLS PPWECDV 

737 

A 

134 

387 

PRSLSPHRPAPPPSVSLGFRTRGKRGNQGRQGEVSWGTGGPQQLWPH 
GTPGPTQGRRARGCYLQPGASGPFLRPPWECDV*ERG 

738 

A 

33 

509 

VSVSVGLRPPWELQVWVIjCVPRGQAKTRLEPAGRAPISAPLPALSVIiA 
GGSDSDSRFRQSLSSAAAPGADSLGARTELRVEPGPSRPRPRPRPRP 
SWTTASTADASRRAAGARGRGNESAGSGAEVRSEQKVKRESIGNR*K 
RKNKTHSKRKKTLQQKG 

739 

A 

2 

350 

PASSIQQGGRRACLEVTWSCSIWNPTATGGLDRGSVRHSREGGGEAP 
VRVQGPGEMIPAVPKSSWTLHKMQLPGHVCSTLGRRGDEKGEAGPEA 
HAYDPS TLGGRGQR IT* GQEFK 

740 

A 

15 

239 

RSTISTSSISPSQEVEDRWQNHNTQHAQSALAADWMVSIHTACGSSS 
PRPPTQLSVFSGNTLTDTPRNSTPPAI 

741 

A 

2 

311 

FQIjDTGSRSAPML*CSGVIMAHSIAKSLQTPGLMGSFNX.SLPSRWDY 
RHIPPYLANFLIFFIEMGSCYITQGSHFPTLNGLSLSSIiANKIQGNI 
GSDLVEDTG 

742 

C 

398 

574 

MLQVAVEELPRRPSRPGELLQTHAAKVLAVDVELLERDTIGIQLWF 
XRCWSHSHTCYL 

743 

A 

3 

862 

RNERHAYDKGTWKTRRAIMTVQ * SGETNVDRAYNLLYQTLDWTDSYV 
TC * ENKEIAAGQRNESHSNDWTKYQVWEWLQHLLDTNQLDANC I PFQ 
E FD I NG EHL C S M S LOE FTRAAGTAG OL.L. Y "5 Ml »OHT ,TCWMf?nHTr? T <? k Q 
CKETRHLFPFFYLGPGHTLATCRAVSKRLQPKLQCAVHSFVTPGIPY 
LDEIGYRARDLREPGQGVAALLPLSSDGNPRRRPGLPKLKAQRRVRF 
PSNKGRGGPSSPPFHSAFLNWQHKimmVKKELSSYLGPLQYYPGSF 
GGAI 

744 

A 

15 

404 

RNFRVDDLVSPRCCGILWFFTQGQASCWRPLATLPSRQKPGSSPGKA 
GCWPA/WDLGLVPHGVSGVSIAASSTPQGQAVCSPSVAAPSTLLLLR 
THLLG AASLQGCGHQ I LLHHL I GMDL YVAAS P SHRF W 

745 

A 

1001 

1459 

GVYVVPHTLVMMPLQAlTESLITTGClxREKPENSTAPWRVIiPGCDRAS 
GREDVPMAPPSVGGADNALARAARTACDQPWNIiRLRLGPWDGSSQGL 
QLWTSDALPPGKKPPRRLSLPAATKAPSSNSFPTSVAGSTRLAPVEL 
YSAERTSKRKE 

746 

A 

3 

359 

LFSPRQKGGEGNGAYFLLPGLLKKDSGCGCAAGDKRCNTGAAPDTAT 
PSHRHGPICGSKSISSLLVKVSPVCKDSHSSLDLALLRRHKTPCHSA 
VAAAAS AS ACAG L S G Y AG SR FALR 

747 

A 

461 

792 

ESLQTGLTFTKSDEMDLLPHIGPCR*DGVAVSGAPSSNSFPTSVAGS 
TRLAKERGDHHLRKQPRWEVHGAVRRAVHRPFIFHLSRKEAACCLFP 
PPTLLPSRQQTLPLSK 1 

748 

A 

40 

317 

LRVTAQWASHTLPDPWGNPGSGWKEKI^RDSEWGWRDLKSKTWVEVE 
AEGGLSGDTFQWLPQGSSQRPGLPLVNQGAGVQETKGAGPL*SPAPL 
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PPGSPEGAQASGCSPVGAIGMYPLTGPLPPPPQPRSWISGPSTPILS 
LCPSPSTRSQGFPPQGLEVCVTPIELLPEVRLKIRECFPSFL 

749 

A 

2 

335 

QKRGKTLPDF* SDFG* QLNGRHTHFQ/HPVVGNPGSGWKEKETKGAG 
PLC YPPSHS KVWGRGATS S VS EGGS QRAGHLP / TS PAAQQGLECRNT 
RLPKVRAVAVRLCYPPLGG 

750 

A 

444 

1001 

HSLTATALTLGSLVFLHSSPCCAAGLWASDLPSVSLPLTQRRWLPFP 
KP * KWLGG * QRGPAPLVS CTP APWFTQRERMKDGS I SHAPS PTLRHP 
GVGGRSPRPLAWSPVGAIGMYPLTGPLPPPPQPRSWISGPSTPILSL 
CPSPSTRSQGFPPQGLEVCVTPIELLPEVRLKIKECLSLVSVKK 

751 

A 

172 

431 

IGRPTKQGLGQWTAVWERMERTNKSRAWPNACEGPGAVGNYIAWASG 
LP * MLKTRAARTWVQAGVRESAQGFLNKNRRI LEMGLSR 

752 

A 

253 

429 

QASGVSDLQ*KLCAC*AQSQDLRGGGFREARKTST*QPGFGLLFCLL 
YSANKLQESVTFKDAAIDFSWEEGIQAESAQRMTVYGEVMVENYQEP 
GFLGKYLFQTRGGLP 

753 

A 

963 

1497 

NTALSNEEILVMALVVISFLSLTFFQLMEGTRLNKLWGFIPIRTL*F 
SE*TLSSIKAKLLTILTPSLKFNREVKNEVCHFVEWFWSYHNNLIKM 
VPFISMFTPLNKMKAFRTFICKMTVNINWMHQSTIYAKGEEHLMGCY 
WCKLRTACFMAFAVKKKLLAFNMYFKIYSCLGENF 

7 54 

A 

1 

363 

WRESPHSARPRLEPQCYDLSSLQPPGLK*CSHISLPSSRDYSCCATM 
PS * I FCVFFGVEMRFHHVAKAGLKFPGS SNPPASG I PKCWDYRC I LG 
QFSSFEAKPRMLIFTLFIICSGHVMMS 

755 

A 

1 

433 

FFKNFYRINRDSYYKAAKFFPVLCISSFHSYCSTLPQAPLPQAWLNL 
RRSSGVFLLLISIVRADVPTPFQILGLILEAPPSQSYFLPFSSMDGL 
MLHPNRKVGQLTFSRPGEVAHTCNPSTLGGRGGRIA*GQEFETSLGS 
LIS 

756 

A 

362 

567 

PAPTPGTWDYRHKSPCLANFLYFY*RQGFHHIGLAGLELIASR*STH 
LGLPKCWD YRCEQHVQPEN I F 

757 

A 

204 

391 

DMG YDVTR FQGDVDEDL I C P I CRG VLEE PVQPCQTKV I C 1 1 LGQV * G 
EAHDYHAPYCEHAFC 

758 

A 

381 

1335 

DMG YD VTRVQGDVD *DLICPICSGVLEE PVQA PHCEHAFCNAC I TQW 
FSQQQTCPVD\RSWTVAHLRPVPRIMRNMLSKLQIACDNAVFGCSA 
VVRLDNLMSHLSDCEHNPKRPVTCEQGCGLEMPKDELPJSTHNCIKHLR 
S WQQQQTR I AELEKTS AEH KHQLAEQKRD I QLLKAYMRAI RS VNPN 
LQNLEET I EYNE I LEWVN S LQPARVTRWGGM I S TPDAVLQAVI KRSL 
VESGCPASIVNELIENAHERSWPQGLATLETRQMNRRYYENYVAKRI 
PGKQAVWMACENQHMGDDMVQEPGLVM I FAHGVEE I 

759 ! 

A 

1 

1236 

MNKQ I QGRTGAC S QRDT ATL I KTE S S D V I HL I MQ YLKENGLHRALAT 
LOEETTVSLNAVDSiESFVADTNSGHWDTVT.O 

YEQIVLELIELRELAAVKDVEEEKFPTQLSRHIKFGQKSQVECARFS 
PDGHYLVTGSVDGLTEVWNFTTGKIRKDLKYQAQDNFMMMDDAVLCM 
CFSRDTEMLATGAQDGKIKVWKIQSRQYLRRYERAHSKGVTCLSFSK 
DSSQILSASFDQTIRIHGLKSGKTLKEFCSHSSFVNKATFTQDGHYI 
ISASSDGT\^IWNMKTTECSDTFKSLGSTAGTDITVNSVILLPKNPE 
H5WCNRSNTWIMNMQGQI VRNFS SG KREGEDFVLYC FSTATGKLE 
RTLTVHEKDVIGIAHHPHQNLIATYSEDGLLKLWKP 

760 

A 

55 

792 

GRAGACSLIDTAMSIEIESSDVIRLIMQYLKENSLHRAXiATLQEETT 
VS LNT \ VDS I ES FVA\ E LKSGHWDTVLQA I QS LKLPDKTL IDLYEQV 
VLELIELR/ENLGAARSLFETRPDPMIK\LKQTQPERYIHLENL\LA 
RSYFDPREAYPDGSSKEKRRAAIAQALAGEVSWPPSRLMALLGQAL 
KWQQHQGLLPPGMTIDLFRGKAAVKDVEEEKFPTQLSRHIKFGQKSH 
VECAPRAE FVQRRSN . 

761 

A 

85 

379 

KKPKTKKRTGPVRQNVRWGRSLSPGRAPAEGPQGGSTESPPLLVGRC 
LILGLREPGVPP7^AYGRFLRDYMNSIQGKLEKQRPQ*GLiPSLTGAAE 
LIKH 

762 

A 

209 

394 

WAVYCTSFMLS *CYKI * CKPALSLLGI * TTEVKTYVYTWRCCEMSTS 
ALL I VAKN * KHS KC 
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763 

A 

348 

640 

VQGKPWGMGRAVCPAPGPVRIiEERPVCGTPCLCLHPARMTPAWGSIH 
CLPSHGGCPCLPVNAPSWLGQPSPWARALLQ*WWLGAEGLHPTLRT 
TVG 

764 

A 

746 

1280 

I SHMELQFG A I FAE I L FRG P FHHLHN IN FTWAS CC PHAS OPGP OP a <5 
SWNEYHSFREFNQEKEKIWRAGSESAPLPLAGAAACPWRGGASNPIC 
VTGASAAGGRAGGGSGEETGSAPGAACSST*ESYEAFSGFSSSPGNS 
V*TFSTSSTYTASPPTSPSPSVSPSPPCYYYSSSSSL 

765 

A 

1 

493 

KGPVSKEPRSGMNRNSALRVNVRPLRPCPQEGGLPLQSCVPPLGIjCR 
VPSRGKQPKHLLPTG * * AI AGLPNLHVIDF I PLKLSFGLi* GVPQGPP 
PLGRGPQSQGSLVTQAPMGGGPRKGLPIDRPCE*GACTLGPWAAPVP 
LSSKGPFPPTVGPRSLSDSFSAR 

766 

C 

82 

201 

MKSKHFINLISRQHKNYNLIYTFIRSRDFKTAIFLSILS* 

767 

A 

88 

372 

QGMDSKRRGVKLNDGQFMPVLGFGTYAPS*VPRSKALEVTKLAIEPG 
FRRIDSGHLYTNEEQAGLAILNMIADGCVKREDIFYTSELWATFY*P 
E 

768 

A 

177 

313 

WFQNPCYY YLFFLETESLSPRLERSGTVSAHCSL * LPSS SDSPAS 

769 

A 

193 

363 

I PLAL S TGNGTLLFF F * DR VS LLS PTLECSG V I S AHCNLCLPGS SYS 
AAL AS QVAG I 

770 

A 

260 

361 

SPN * S PNVIFDS * DS PEKTAATVRGDF YVTGDRGMMD 

771 

A 

80 

358 

GPFFFFFGDRVLLCHPRLECSGMITAHCSIjKX»LCSQ*PSHNSLIiNSW 
EHRHMPPHLAKFFFHQQTKXi * KK*NKNRSLESAPLS VFI KKVTTT 

772 

A 

80 

358 

GPFFFFFGDRVLLCHPRLECSGMITAHCSLKLLCSQ*PSHNSIjIjNSW 
EHRHMPPHLAKFFFHQQTKL*KK*NKNRSLESAPLSVFIKKVTTT 

773 

A 

810 

1077 

slrssktqprqhgktssllkiqkisrvw*rtlllgklrqenclnpgg 
ggcseprlhhctpawate*dsgsqkkkjcgqgrkschfsvrt 

774 

A 

499 

946 

lwlrprgagiagagiclrregtwllplrshggkceaaspvwsqrlel 

LQKPSALWWPLGLLHVHIFWKYTCAGSFIiAGKLLRITFNFFLFFFLP 
FPISCIFYLFLiFLFFFRDGVSLCCSGWP*TPGLKQSSCLSLPSSWDF 
KCTPLCLL 

775 

A 

56 

426 

GQSPPGHWGHVAQARELLRPGRLPCWVPDGHTPVRPVSGPEPPTGSR 
ILSGEESPSRPPSSQAARPQPGQPRDPSSLRIPLLLPRP*LSQSGLA 
SSGSGLRAGALPRRPPGPALLS PTPCTS I 

776 

A 

3 

385 

QGFRKLPLKTTTGKEKTHSCIVGLGHVNSGKTTTTGHLIYKCGGFDK 
RT I D KFETEAAD KAKG S FKNAWDLDKLKAERERG I T I D I S LWKYETS 
RY YVTI I VAP * LIDFI INMSTGTSQADCAGLI V 

777 

A 

120 

399 

DGPGCWGFRRRWPEASWRDSARWHGCRSPGGPGPARPPCPA*GGCA* 
CGLTARPPQTAYYTWCRTPPAWRSATVTRTPC*GCPGPAGAPSAA 

778 

A 

213 

500 

S KRC LS LG RGAS S LEAAEHLS CC POADGLSR S POWA S AP PROLL S C P 
SAGPRGPLQLARSPGSRRGPWVPPQGPPHQHSALTI*APAQRKRRKN 
G 

779 

A 

1 

615 

MEQVKTAAL INGKI LPR I PNEQKENADTSLAFS F I SQQDGELTMLRE 
R VR I E I ASTH I ALQHAIVTGDAVGMDDI PQEARQYRHNQAYAYS IQG 
DGAEDDDERIVRFHTRVTVDSDTLASDAAJILTCRHAGVTGYFVHITV 
RKMLCDKQDTAAIHSRHTTAHRGEHIDYCGYSVLPMAVEQDVLIAVE 
PVKTYALQLANTNPLYP 

780 

A 

140 

460 

RSSLLPCLGLWTHLIPVKVRDSLGPEAFRGEKADMPVLYDRLLKLME 
MLYFKSGTIPMFYG*APRRVYIIGERIHYCGYSVLPMAVEQDVIiIAV 
E P VKTYALQLAN 

781 

A 

595 

1060 

ARFLPKYQSPTGPHPSFNAFLSLGAMCVLPRPNFHSLAVRSPIVPGG 
PAPLVPGKE * EGLGVTSQSRPDCPSPMSGTPRCRS * LRPAAGRGCSP 
LLGGCPWTVTHPLRTKAGCTLSWAQQPWLPVGGEGPRCASPVAVLG 
FSPCALCHLDPPVK 

782 

A 

71 

237 

EQPGMVTHTCNPSTLGGRGGWIT*DQEFKTSLAJ^1WKPCLCV^YKKL 
AGHGGACL 

783 

A 

236 

405 

EDGHHSGLHSKQQKQPAWPSMVTHTCNPNTLGGRGGWIT*GQEFETS 
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LTNMEKPCL 

784 

A 

2737 

2915 

KNWHTHQG I LYSHKKE QD * LG WAHTCNPS TLGGRGG W I T * GQEFE 
TSLTNMVKPCLY 

785 

A 

361 

711 

DLFWLF*MDKFLDTYTLPRLNQKEVK 

786 

A 

2 

424 

L F W S PQTQRE PTM VL S PAD KTNVKAA WG KVG AHAGE YG AE ALERH FD 
LSHGSAQVKGHGKKV7VDALTNAVAHVDDMPNALSALSDLHAHKLRVD 
PVNFKLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR 

787 

A 

1 

462 

MVLS PADKTNVKAAWGKVGAHAGE YGAEA1.ERMFLS FPTTKTYFPHF 
DLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRV 
DPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASTQREPTMVL 
VSCRQDQRQGRLG 

788 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTnpppnQpriMT Q 
SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKIj 
HVDPENFKLLGNVLVTVI^IHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

789 

A 

38 

485 

APS PDA \ MGHFTEEDKATI T\ S LWG KVNVE \ DAGG ETLGRLL WY P W 
TQRFFD\SFGNLSSASAI\MGNP\KVKAHGKKVLTSLGRCHKSTWDD 
LKGTFAQPDVNCTCDKL\HVDP\ ENFKLLGKCCW* PVFGNPI FGKEF 
TPEGAELPGQKDG 

790 

C 

128 

274 

MMDRLAVS CTLRNWAFS ATTANTNP I YNVTLLGLVAQS ARF IRPGGV 
QL 

791 

B 

72 

524 

LPAPPGRIKRADLATRPSKVIRVRRGGRKRPVPKGATLRKPVHHGVN 
QLKFARSLQSVAEERAGRHCGALRVLNSYWVGEDSTYQWITKPVHKH 
REMRGLTSAGRKSRGLGKGHKFHHTIGCLAGQLGEGAIIiSSSTVTAN 
ISKVCKIHT* 

792 

C 

56 

289 

MNFTNFTYISGSGGAGEYCAFSKLPGESHRIVWWNLW|>FPRPRLLSA 
CRCQPQRHLPCALWELVLVNPTGCSRISSE* 

793 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKJCVT.TQT.nna TKTJT nm vt-tpapit cpt unnvr 

HVD PEN F KLLGNVLVTVXiA I H FG KE FTPEVOA S WO KMVTfiVR qnrcq 
RYH 

794 

A 

38 

604 

APSPDA\MGHFTEEDKATI\TSLWCKVNVE\DAGGETLGRLLVVYPW 
TQRFFDXSFG^SSASAIXmGNPXKVICAHGKKVIjTXSLGDAIKXHLD 

dlkg\tfaqa*selh\cdkaamwipenfk\llg\nvlvtrfgqshfr 
qknftpegckaswaerwvtcswpvalflqdtteaqlpmnaelfkdka 
filasnyk 

795 

A 

23 

153 

apspdamghfteedkatitslwgqcpvlqiplsslpmmqsfqg 

796 

A 

84 

359 

apradtvmghfteedkatitslwgkanvedaggealgrllwypwtqr 

FFDSF/ GQLSELHCDKIiHVDPENFKIjLGNVLVTRFAIHFRKEFTPE 

797 

A 

1 

428 

APSPDAMGHFTEEDKATITSLWGKVNVFnAfifiPTT m?T t wvDWTno 

FFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTLHV 

DPENFKLLG1WLVIVLAIHFGKEFTPEVQASWQ 

H 

798 

A 

2 

476 

V1WEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPKVXAHG 
KKVLTSLGD / ASSASAIMGNPKVKAHGKKVTYT^T.tTnz* T TTHT.nnT.KriT 

FAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQ 
KMAED \ VTG VASAL S S RYH 

799 

A 

23 

349 

APSPDAJ^GHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQR 
FFDS FGNLS S AS A I MGNPKVKAHGKKVLTS LGDAWGKVNVEDAGGET 
LGRLLVVYPWTQRFFDSFGNLSSASAIMGNPKVKAJIGKKVIjTSLGDA 
TKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVQPVLCLCHHGQP 
QSQGTWQEGADFLGRCVGQGECGRCWRRNPGKAPGCLPMDPEVL 

800 

A 

17 

535 

APSPDAJ^GHFTEEDKATITSLWGKVIWEDAGGETLGRLLVVYPWTQR 
FFDS FGNLS S AS A I MGNPKVKAHGKKVLTS LGDATKHLDDLKGT FAQ 
LSELHC\IKMLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVL 
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AIHFGKEFTPEVQASWQKMVTGVASALSSRYH 

801 

A 

17 

533 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLLWYPWTQR 
FFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQ 
LSELHCDKLHVDPENFKLLGNVLVTVIAIHFGKEFTPEVQASWQKMV 
TGWGQGECGRCWRRNPGKAPGCLPMDPEVL 

802 

A 

160 

434 

LISVIPALWEAKAEDHMRSGVQDQPGQRGKTPSLRKIHRLARHGGMH 
L*SQLLGRLKHKNRLNPGGKSCSEPRSHHCSSAWATDQDSVSKK 

803 

A 

1223 

1339 

RW PLW S F 1 1 ETKRRTL * S QLLRRLRQENLLNLGGGGC S 

804 

A 

1057 

1354 

SQHFGRLWQVNHLRSGAPAQPGQQGKTPYLLKIQKLAGRDGARL *SQ 

LLRRLRQENRLNLGGGGCSEPRSCHCTPAWVTKRDSFSKTNKQTENN 
KKRIL 

805 

A 

831 

1084 

SQHFGRPRRAlDHLRSGDRDHPAQHGETPSLLKRQKLASHGGRHIi* s * 
LLWRLRQENHLNPGWGGCSELRWRHCTPAWATSKNSF 

806 

A 

144 

443 

GYQLPLQDELVLGQFLFQVSKIRFFSARKITCNSHFLLLKFF*FPHI 

ick*vsfsseraaaadragashprihhnlqrlqensirgpkysshtv 

SADAW 

807 

A 

165 

1222 

QS KRLNAKRYLLTUN wklke fqqkkvavacnl s gtket yfrnlkkk 
ltqnklilkgelitllhlcesrdhvelaknviyryhaenknftlgey 
♦fgplfvrlcyqldleesagelmkdqhiirgffsdstlfnilmdmlfi 
kgkyks alqvl iemknqdvkftkdtyvlafa i c yklns pes fki ctt 
lreeallkgeilsrrascfavalalnqnemakavs i fsqimnpes i a 
cinlniiihiqsnmlenliktlknaaegnlskfvkrhvfseevlakv 

REKVKDVPALVAKFDEIYGTLHITGQVTTDSLDAVIiCHTPRDRKSHT 
LLLNKRMVSRRTFQPLSQSLLAE 

808 

C 

324 

500 

MGNWGRSS FLKLQNLYLFLKMRS FDQAQWLTPKI PALWEAEAGGSSE 
VRSLRSAWSTW* 

809 

A 

1734 

1910 

NWVGGQQEGRE FFQ V* WATDLKS A* GEE I EPD I PKLKPGGQKWETGL 
RFLHFDSKWFF 

810 

C 

226 

306 

MTTSPPPSALQKKEMEILPVILRVPS* 

811 

A 

168 

258 

GHLGPRQKQ ARGL. * GHRAVQVL.HNYV I PM 1 

812 

A 

229 

436 

TEELKRCRQTLPSGCGVLPWRARLAVGVS/GRRGRSSSPMLTPSAAP 
CGNPGSLTSNLPAAGCKPSCQAW 

813 

A 

1060 

1542 

GRDMVAPDASWKEASGCNAPARICPGQAFRANGCSDPTLGIQHQRLF 
RSRLSPKVLGPVGQPWELGGE*DEAPSASFFSHWEVLGRLLVGASGS 
PSVKQEPAALERARVPGDAARRQPRDRKGPPGPPCCSEHPAPVRAPL 
VAGLGDAGSGAPRAKLGVMM 

814 

A 

166 

437 

ADHLKSG V * DQPGQHGE I LSLLKLQ * FPGRGGAHL * SQDLGRLKQEN 
HLNPGGGGCSEPRLCHWTPVRATVGDSVQKK* KSQDGPRAKLG 

815 

A 

3 

238 

SGDRDHPG^HSETLSLLKIQQXIAGRGGGRL^SRLLRRLRQENGVSP 
GGGACSEPRSHHCTPAWETERDSVSKKKKKKL 

816 

A 

4005 

4346 

SQHFGRPRRADHLRSGVQDQPDQHGETPSLLGGRGGRITKSGDRDHP 
G*HGETPSLLKMQ/EKLAGRGGGRLWSQLLGRLRQENGVSPGGRACS 
EPRSCHCTPAWLTEQDSVSKK 

817 

B 

1 

9234 

MGAPTLPPAWQPFLKDHRISTFKNWPFLEGCACTPERMAEAGFIHCP 

TENEPDLAQCFFCFKELEGWEPDDDPIEEHKKHSSGCAFLSVKKQFE 

ELTLGEFLKLDRERAKNKIAXETNNKXKEFEETAJQCVRRAIEQLAAM 
D* 

818 

A 

1577 i 

2233 

SGCLLS PPS VGRQNS PVELGGAGLSRAGWAPQERGRAALLLI S PGPN 
VRGGPDWLPSVLQMRGLPLWDLGGRPDVGRMS PGGRPGS CWATQLRF 
HISSLAPLFSWAGRSGSRLNPSTLGGRGGPITRSGDRDHPG*HGETL 
SLLKIQKISQACWR/CACSPSYGRLRQENGVNPGGGACREQRSGHCT 
PAWATEQDSVSKKKKKKSGSTIRLKHILHKI I 

819 

A 

151 

455 

FQKIGPGAVAHACNPSTLGGRSRRITRSGGRDHPG*HSETPSLLKIQ 

\KLAGRGGGCL*SQLLWRLRQENGVNPGGGACSEPRSRHCTPAWVTE 
RDSVSKKK 
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820 

A 

1362 

1649 

ECVHWRTAPRPFALCLAKLTVBTCPRIiVSDQSLWHRKTWPPKKKFWN 
WKPDNFGRLRQENGVNPGGGACSEPRSHHCTPAWVTEQDSVSKKKKK 
K 

821 

A 

10289 

10708 

SQHFGKLRQEDHLRSGVREQPGQHGKTPYLLKIQKLARRSGACL+SQ 
LLRRLRQENRLNPGGVGCSEPRLHHCTTAWTLQ*DPVSKXLKKKYIE 
RQRYHQHMKHPWSTKI QYVCMDG* HRS VEKQI IQTLCMFVFTHTY 

822 

A 

662 

766 

AHFLHIVLVLMPVIPATREDEAGESLEPGRWRLQ 

823 

A 

749 

853 

AH FLH I VLVLM P V I PATREDEAGE S LE PGR WRLQ 

824 

B 

1 

855 

MDWDPDIFNRDPRDHYDLLQRLGGGTYGEVFKLFAKCYTCIHTYDA 
MRVLFLMTKSGSQLPRLKEKGKWYERVTGGRSANVHYIIVYCESVHF 
LPILLPTLSTKPLSSPELNAAASYVPFPPIIHIDTAASMCYLFHMYV 
GVRAGGGIGDEIEDPAGDEYELYRWFDITFFFFVIVILLAIIQGLI 
IDAFGELRDQQEQVKEDMETKCFICGIGSDYFDTTPHGFETHTLEEH 
NLANYMFFLMYLINKDETEHTGQSRALCRAFRGMRETGDLVRALKEL 
AV* 

825 

A 

65 

290 

HLVSSLAKGKLAVRQAPFARVPPQHP*VKVKNWDPLLQPPPCRGLAL 
KVTGCRVLGTEGLPHLKDFP IRAATQPL 

826 

A 

230 

653 

GLSAPVAADPSLEPTPALiHAS IRPEADLQP * PRTPPLRS WGS / PGVR 
S PRTGDPSGVTRLHRAIHQ\ P ITI SS STWVTCTGPLCS VLISPSQQE 
SAPKRTVXEEHRLPRPPGAHADAHSGQLRAWPGQQREHGQEASQWEQ 
HP 

827 

A 

108 

1102 

RMTQCLSECLICKFLSGKVTLGQAQWLTHVIPALWEAEAGRSPEVGR 
SRPSLAGIWKSRL/VLKIQKKXKKKXISRAWWHMPVVPATWEAEAGE 
LLEPGKAEVAARRDRATALPAWGSQADGK*G*GLSAPVAADPSLEPT 
PALHASRRPEADLQP*PRPPPLRSWGSQVCEVPGLGTPPGSPASQSH 
PS TP S P YLPL PGS PARVPCARCS S RPAS KRARQS APCGRAPTS PAAR 
GSRRCTQRAAQSLARSAARTRTEASAVGTAPAAGRTGRGGPDKQEFV 
HGLVPLKQDNGRLQCTAQERAPRARTLRHTARPAAFGVLDLGGNHVL 
PCQ 

828 

A 

241 

1104 

QNILLGPKTLTSLGPLQFRDVAIEFSLEEWHCLDTAQRNLYRNVMLfe 
NYRNLGFLG I WS KPDL I TCLEQGRKPLTMKRNEM I AKPPVMS SHFA 
QDLCPEQS LKDS FQKV I VTRYE KRE YGNLELKKGCES VDEGKVHKRG 
YNGLNQCLTATQSKVFQCDTYVKVSHIFSNSNRHKIRDTGKKPFKCI 
ECGKAFNQSSTLATHKKIHTGEKPYRCEECGKAFNQSANLTTHKRIH 
TGEKP YKCEECG KAFNWS SDLNKHKKI H I ERKP YI VKNVTDLLNVP P 
LLISIR 

829 

A 

95 

463 

PAG I RRVTARTPGPPGS LEMGPLQFRDVAI EFSLEEWHCLDAAQRNL 
YRD VMLENYRNL I FLG IWSKPNL I TCLEQGKKPLTMKRHEM IAKPP 
VMYSHFPQDLSSEQSIKDSFQKVILRKI 

830 

A 

2 

648 

FVKMAAPAGGGGSAVSVliAPNGRRHTVTCVTPSTVLLQVXiEDTCRRQD 
FNPCEYDLKFQRSVLDLSDQWRFANLPNNAKLEMVPASRSREGPENM 
VRIALQLDDGSRLQDSFCSGQTLWELLSHFPQIRECLQHPGGATPVC 
VYTRDEVTGEAAIiRGTTLQSLGLTGGSATSQVCHEVLRPRGRRPQEA 
WARQRRLARQPPALHF PWNLG SSAAAT 

831 

A 

90 

401 

AGMLPAVG S VDE EEDPAEEDCPELVP I ETTQS EEEEKS GLGAKI PVT 
1 1 TG YLG AG KTTLLNY I LTEQHS KR VAVI LNE FGEGS ALiEKS LS VS Q 
GGELYEEWP 

832 

A 

276 

490 

TN FLSKTMY F KRAARAFPVLLTGAG KTTLLNY I LTEQH S KRVAV I LN 
EFGEGSALEKSLAVSQGGELYEEW 

Q *5 *3 

o 33 

A 

186 

537 

IWFPLRRRKARQEEKSGLGAPRSPSHNYPPGYLGCLGKTNTS*TYIL 
DQSNIGKR VA\ AILN * I LGGRKLRLEKSLSCQPKVEEL YERVAW /IP 
* KPGCLLLVS VKVRNVFDWCTWVY 

834 

C 

79 

405 

MIGGTPQMFFISGAKGQWSPSLQPPPRAHRSSPWAPSSKSTSGGTAA 
LGSLGSKDYFPRTGDGWELRRSDQRRAHLPGCPTVLRTLLPQQRGD 
RDLQQLRHHELRSL* 

835 

A 

1 

321 

MDLLCKNMKHLWFFLLLVAAPRWVLSQVQLQESGPGLVKPSETLSLT 
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CTVSDVSITSGNYFWAWIRQPPGKGLEWIGSKFHTGTNLYNESLKSR 
VTFSIDTSGSTRP 

836 

A 

101 

290 

FSSEHINQACEMHPPPCPSYFKPSVVHSTVCTTSLFKHSSVFLAPKQ 
IWVRFVYAETTYEEGP 

837 

A 

21 

215 

PSATPRQTGPATTMNSKGQYPTQPTYPVQPPGNPVYPQTLHLPQAPP 
YTDAPPAYS EVLPVCQ I 

838 

A 

158 

955 

KMTSSSEQEEDEKNNQSATPRQTGPATTMNSKGQYPTQPTYPVQPPG 
NPVYPQTLHLPQAPPYTDAPPAYSELYRPSFVHPGAATVPTMSAAFP 
G\ASLYLPMAQ\SVAVGPL\GSTIPMAYYPVGPIYPP\GST\VLGGK 
GGYDAGARFGAGATAGNI PPPPPG\ CPPNAAQLAVMQGANVX,VTQ\R 
KGNFFMGGSDGGYTHLVRNQGHLCAREKTSHTLQHFSQCNCFSHINIi 
KLQFRHMLLGCLSGAQTFRHFSNLIRNHVMVAVPP 

839 

A 

217 

491 

LQWVSNRWTSPCCIVTSTYGWTANMERIMKAQALRDNSTMGYMAA 
KKHLEINPDHSIIETLRQKAEADKNDQSVKDLVILLYETALLSS 

840 

A 

1789 

2411 

KT YWRKKVEKVWSNR\ LVTS PC \ C I VTSTYGWTANMGENH * KLQAL 
KETTSTMG/YYMASQRKHRGIKPLTLSIIEYLKAKRPEGLIRTDKS\ 
VKDli\ VI L»IjY\ ETALLSSGFQSWKI PRHHA*QVS YRMI KL\GLfGIDE 
DGPYIjLDDTSAXAVNLKELPPXLEGDDDTFTHGKEVGLILLGLRGWT 
LPVSVLYNSSDNIFFQGCFPLFLVNI 

841 

A 

225 

338 

RS S $ S S PS LGLCTCAAGDPPLMFNED YQKS LLEQ YHL 

842 

A 

610 

834 

IS I PQGYLYCRGIjQVPSDLPHHPSAIjCCCPS PGS IGFSLLTKQTRQV 
PASGLAWLLAGGWGNSCSTCSLTSFYTC 

843 

A 

36 

353 

APRPGAMGHFTQDDMATITSLSGQVNVEDPRGETLAWLLFVYPWTHR 
LFDS FGTLTS AS A IMGNS KFKAPGKMVLTS L I DAI SHLDDLMGS FYQ 
LIELHCDKLPV 

844 

A 

3 

259 

HEPQTQR^PTMVLSPACKTNVKAAWGKVGAHAGEYGAEALERMFLYF 
RMTRTYLWYFCLCHCCRQAWGPGFNFWTTCDGCQTRRH 

845 

A 

3 

663 

EGKWLAWKHNPLSLFEFVLSSFLVPCDSRPTCCDVCRGRNWSATQPG 
S YHS LDMTEALQWAR YHWRRL I RGATRDDD SG^ YNYS SLLACGRKS S 
QIPKLSGRHRIWPHIQPFKDEYEKFSGAYVNNRIRTTKYTLLNFVP 
RNL FEQFHRAANL YFLFL WLNWVPLVEAFQKE I TMLPL WVIiTI I A 
IKDGLEDYRKYKIDKQINNLITKVYSRKKKK 

846 

A 

33 

382 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLLWYPWTQR 
FFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQ 
LSELHCDKLHVDPENFKLLGIK 

847 

A 

21 

371 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLLWYPWTQR 
FFDSFGNLSSASAIMGNPKVKA/HPENFKLLGNVIiVTVLAIHFGKEF 
TPEVQASWQKMVTGVASALSSRYH 

848 

A 

24 

474 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLIjWYPWTQS 
FGNLSSASAIMGNPKVKAHGKKVLTSLGDATKHLDDLKGTFAQLSEL 
HCDKLHVDPENFKLLGNVLVTVIAIHFRKEFTPEVQASWQKMVTAVA 
SALSSRYH 

849 

A 

24 

415 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLIjWYPWTQR 
FFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDATKHLDDLKGTFAQ 
WPVPCPPDTTELTAHDAELSRIGFILQAIQI INLFC 

850 

A 

24 

432 

APSPDAMG/HSLWGKVIWEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
SASAIMGNPKVKT^GKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 
HVDPENFKLLGNVHFGKEFTPEVQASWQKMVTGVASALSSRYH 

851 

A 

323 

737 

QAACGLWGKVNVEDAGGETLGRLLWYPWTQRFFDSFGNLSSASAIM 
GNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPEN 
FKLLGNVLVTVIiAI HFGKEFTPE VQAS WQKMVTGVAS ALSSRYH 

852 

A 

24 

589 

AP S PDAMGHFTEEDKAT I TS LWGKVNVEDAGGETLGRLL WYPWTQR 
FFDSFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQ 
LSELHCDKLHVDPEVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVD 
PENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSSRYH 
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853 

A 

100 

684 

MWKMLEEKPWEGSWLSTHGPRGSLTALATCPLPLPSWATPKSRHMAR 

RATITSLWGKVNVEDAGGETLGRLLWYPWTQRFFDSFGNLSSASAI 

MGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPE 

NFKLLG1WLVTVLAIHFGK33FTPEVOASWO 

PMDPEVIi 

854 

A 

135 

851 

APSPDAMGHFTEEDKATI\TSLWCKVNVEDAGGETLGRLLVVYPWTQ 
RFFDSFGNLSSASAIMGNPKVKAYGKKVLT\SLGDAIKHLDDLKG\T 
FAOA * S ELH \ CD K\ LHVDPEN FK\ LLGKCAGD P FT . A T P P <3 a vp FTDfJ 

GCRLPGQKDG * LGVGQCPGPPDTTEAQLPMNAELFKDMALFLQAITN 
NKSISAKPSNFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVAS 
ALSSRYH 

855 

A 

40 

505 

OPOTVS ILHLTPEEKSAVTAJLWGV KVNLDEVDVFALfiPTA T.WYPWT 
QRI,LESFGNLSTPDAVMGNPKVKAHGKKVIiGAF\SDGLAHLDNLKGT 
F ATLS ELHCDKLHVD PENFRLLG QRAWS V/ LAG P I T FG KEFTP PVAG 
LP I RKLVAG \ VANALAHK 

856 

A 

41 

536 

APSPRRPWGHFTEED\KATI\TSLWGK\VNVE\DAGGET\LGRLLVV 
YPWTQRFFD\ S FGNLSS AS AI \MGNP\ KVKAHG\ KKVLT\ SLGDA\ T 
KHLDDLKG\ TFAQA* SELH\ CDK\ LHVDPENFK\ LLG \NVLVT\ VL\ 
AI PFS AKEFTPEGCRASWAERWVTWSWPVPCS SRYH 

857 

A 

241 

435 

GARLQKAGSGFEAVAGEALRKSEIEx^AMIxAJCIXjVHHYSGNNIEx^T 
ACGKYYRVCTLAI IDPGD 

858 

A 

243 

451 

FCYFLKALTTLTLVKEWTFSLSVTWYRIYLFAHFGSQRKSEIEYYA 
M LAKTG VHH Y S GNN I ELGTACG 

859 

c 

275 

397 

MLCWLKLGVHHYSGNN I ELGTAMRENTYRVLHTGLFIDPG * 

860 

A 

2 

424 

LF WS PQTQRE PTMVLS PADKTNVKAAWGKVGAHAGE YG AEALERHFD 
LSHGSAQVKGHGKI^ADALTNAVAHVDDMPNALSALSDLHAHKLRVD 
PVNFKLSHCLLVTlxAAHLPAEFTPAVHASLDKFT^ 

861 

A 

73 

608 

SVMTPVAPTLKAASKSDSSGPHRLRENPPWCLSPADKTNVKAAWGKV 
GAHAGEYGAEAL\ERMFLSFPTTKTYFPHFDLSHG\SAQV*GATGK\ 

kvadal\tnava\hvgrtlpnalsgpeiatlhahk\ LRGGPGSTFKLL 
\SHLPCLGEPWAAHLPGRVSPLGGCKASLGTNFLGFLLKQPC 

862 

A 

7 

330 

LREPYTRKETHQNIKPTGRANTQIKKRKNLYVTTTEK^PTTOK^^ 
ERKEEMLYKNQKS I TKMTG ISPVISIII LNTNELNLSLKR YRGAERI 
FLNDPTTCCLQEIH 

863 

A 

2 

380 

APSCQPTMSSQIRQNYSTDVEAAVNSLVNLYLQASYTYLSLGFYFDR 
DDVALEGVSHPTSACNRPnT.T.T.PnWWT.T.PT.PPT T OX/OTTV/nMWT.T.nPR 

v rujuu v uiir x Q/twi vj Xr %^xjxjxjIW3 YV lnl xjxjIvJjiX.2vJ_»xj%/ V V ^liniJjjyulj 

LLLLLPPGLCQVCPGLHLQRGVREVQLLCLMS 

864 

A 

3 

722 

HERGSVSCFNSGWTDHIRGHLPOPPTAT.RFPT.RT.nPPnP^QRP^PA 4 ; 

GTCQHQFLWLAPSCQPTMSSQIRQNYSTDVEAAVNSLVNLYLQASYT 

YLSLGFYFDRDDVALEGVSHFFRELAEE\ KREGYERLL\ RMQTQRGG 

PJVLFQDIKKPAEDEWGKTPDAMKAA^lAiEKKIlNQALxxDL 

DPHLCDFLEraFLDEEVKLIKKMGDHLTNLHRLGGPEAGLGEYLFER 

LTLKHD 

865 

A 

40 

264 

SGYKLRHSSREWQPPLGPLPSYGSSVFTLLNLATARSSGLCLLRLEL 
SFCSLSTTAVCRCYRPAIDSHPGPPLLF 

866 

A 

3 

1231 

ISLPDLNINSMLEQRREPWSGESEVKIAKNSDGRECIKGVNTEMSSS 

VKTPTFlTRNnPDDQ^PT.PnPnKTTHT.PPlfDVTrrMPHCirUPWVQCQT.TK' 

HQVIHx^KPYKCNSCGICWSRNSHlxAEHCRIHTGEKPYKClWCGKV 

FSYNSNFARHQRIHTREKPYECNECGKVFSlNTNSYlxARHQRIHAEEKP 

YKCNECGKGFSHKSSLANHWRIYTGEKPYKCDECGKAFYRIALLVRH 

QKIHTGEKPYKCNECGKVFIQNSHIxAQHWRIHTGEKPYKCNECGKVF 

NQLSNlxARHRRIHTGEKPYKCNECGKAFSEYSGLSAHLVIHTGEKPY 

KCSECGKAFRHKLSLTNHQRIHTGERPYKCNE 

KIHTGEKPYKCNECGKAFSR I S YLAQHWTIHMG 

867 

A 

3 

498 

GHAAARLTSKESCWDEKAQKRKKRKAYESGMALPQGHLTFPJDVAIEF 
SQAEWKCLDPAQRALYKDVMLENYRNLVSLGISLPDLNINSMLEQRR 
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EPWSGESEVKIAKNSDGRECIKGVNTGSSYALGSNAEDKPIKKQLGV 
SFHLHLSELELFPDDRVINGCNQ 

868 

A 

1121 

1442 

RSLIiAYPLPALSPGCSWPLLTAHQGSLSNSEKNAPHSVPGAGGLDF 
Y LPSRVC I LYRGNCMKNWGGGGRARWLTPVI PAL.WEAEAGGSRGQE I 
ETILANMVKPRLY 

869 

A 

2380 

2714 

EETEGQARWLTPVIPTLWEAEAGGSPEVGSSRPA*PPLSAFLYPAFF 
LRSARFL 

870 

A 

3260 . 

3533 

HFKQNRKLQGKCLFKRAGQARWLTPVIPALWEAEAGGSRGQEIETIL 
ANTVKPVSTKNAKKISRAWWQAPWPATREAEAGDWREPRRRTL 

O / J. 


6330 

6802 

R S KGRAK I K S R RAG O VOW LM P V I P AL. W F A K AG H <5 PF AR S Li * PA W P I W 
* NPALHSSL/ V* * SKALSQKQKKQKKNGHDGQHDETVSLLKIQKS AG 
AHL*SOLLREAEAGELLEPRRQTLQ*AEILPLHSGLGDKSKTPSQKK 
KKKR S QGQVG VR EPQG P 

872 

A 

1 

500 

MPTLAS PGALPLAS PS PTWALAEKSGGKKPAEC PPLLCGRVRAGAHT 
HTHACTFPKQEAALCQDRMEFRFPEGVLGLQLYGCMMKKGPSVCAGG 
WERGASSSAHIYLGPKKNPQLAVliSPATPSLGHIPSIiPEPTLGGPGK 
PASTPPSPPAGRPSPSPHGAASKYK 

873 

A 

63 

377 

QNLCLGKGLMALSLGWKAHRNHCGLLLQALRSSGLLLFPCGQCPWRG 
AGS FLDPE I KAFLEENTE VTS SGSLTPE I QLRIiLtTPRCKFWWERADL 
WPHSDPYWAI 

874 

A 

27 

376 

DRRSVANMHLSVRGEELHTFEVTGHETVAQIKAHLTSLEGIAPKDQV 
ELLADAPLKI^EATIjROSGA^AMTTTjPVA^RMT.nRKAHn^LARAGKVR 
GHTPKEANQDWKKSMTGRAKRC 

875 

A 

1 

801 

MATWRRDGRLTGGQRLLCAGl^GTLSLSLTAPLELATVLAQVGVVRG 
HARGPWATGHRVWRAEGLRALWKGNAVACXRZiFPCSAVQIJVAYRKFV 
VLFTDDLGHISQWSSI^GSl^GWSTIVTYPTDLIKTRIilMQNILE 
PSYRGLLHAFSTIYQQEGFLALYRGVSLTWGAJJPFSAGSLLVTMl^ 
EKIWNGPRDQFSLPQNFANVCIjy^AVTQTLSFPFETVKRKMQNIQLF 
L F I GVS ISSNI KPS VS LLLYTPVDFDP I TAHK 

876 

A 

1286 

1389 

WQRMAAVGL I MQN I LE P S YRGLLHAF S T I YQQEG 

ft 7 7 

A 
r\ 

X 

i n ^ 

X \J 3 D 

DHIQLDASIPVTFTKTDEVFLEAQIQITOTTSPMFMEKVSl J EPSI^fx r N 
VTELNSVSQAGECVSTFGSRAYLQPMDTRQYLYCLKPKNEFAEKAGI 
I KGVTVI GKLD I VWKTNLGERGRLQTSQLQRMAPG YGDVRLS LEA I P 
DTVl^EEPFHITCKITNCRYKILNILGFAGRKVSVTl^ELCIVLQEQ 
PVAKKY S G VTKEQLTREAEQKS LNRRRDNR VH I GPKME I RWTLGLD 
GAG KTT I LF KLKODE FMOP I PTI AWFWD S SHRDR T S EAHS EIiAKL 
LTEKELRDALLLIFANKQVFAAG 

878 

A 

651 

1469 

RCETVLNHFQIWMLSHTViyrVCTVDLLEFQPSLiaCQYL™ 
NSGNLVMRLTKPTLFTN I P VTCEEKDLPGDLFNQLMRDD PSTVNGAE 
VLMLGEMLTLPQNFGNI FLGETFSS YI SVHNDSNQVVKD ILVKADLQ 
TSSQRLNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYT 
TQAGEKMYFRI^FKFQVLKPliDVKTKFYNAESDLSSVTDF^FLEAQI 
QNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVS 

879 

A 

3 

416 

HAWQPLRTCESFLPLHSPLQSLHAGGGFGACTCTLTSRRHLTTLVPL 
RKDPEGAXAEGEMGNPLSPHTPVASSGMKHTGEKPFECPRCGRRHFL 
KENLLQHDARICMSRSEQVFTCSACQVTFRLKKKLRVHRSAHT 

880 

A 

164 

412 

PQQTSGPPGMASSRAVQVKLELGHRAQVRKKPTVFIGFTHDWMVFVRG 
SEHSDVQHFAEKCCFLLHESGTTLLRVYSHPSYRV 

881 

A 

29 

490 

GEFFHMERASTITYLKGETIGTEPEGKGEGECASTGIiVNPRFDFPSK 
VSFPKYRINKRKQKQNTTLSGTKQNPQILFYYTRHEVKSLSTVSPLD 
HGRKASTQSLSRNFHSEASEHAEDGALRVPLPSWRPPSGVETVKSRS 
HMPEDHQPTTHW 

882 

A 

44 

847 

IKENSFIWIRASTITYLKGETIGTEP£GKG\EGECASTGIj\VNPRFD 
FPSK\VSFPKIPV*TSLHITPL\AIVCPFQNIWSLHGKGQRGKG\QN 
INKRETKNKNTTLFREPKQNPQIPFF*TTQGH\EVKSLSTSVPL*IT 
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grka\stqsl*qefstleasehaedg/ ALLRVPLPSW\ RAP SGVG TV 
KSRFSH/VPRDHQPTTHWWPWWSMAGRQRPSPTLSHFLSWMTSCLGP 
SHFRRELAFWKPTFPIHHRTRQSAHILVPLVFKGHFQQTL 

883 

A 

40 

431 

ESP EAGTG S WLG RTRNQG I NTLADHRHRETDFGG S PW 1 1 1 1 1 E FLGR 

YKFTILFCTIYLWEPFLKTIFWSRNGHDRSRDVQQRAMRSNRCLQEG 
I K I GPED I CTLW KQAETKVQAKI RKM KVTKKVNHH Y 

884 

A 

122 

342 

WWSLETRMRTDRVPMKPSWTRTPSFARALKTSTHSWGPLIARVILRV 
ATT ARG YRAM I RAKFRS I L PAN A I GH 

885 

A 

23 

420 

WDLSTMEPKKAMKRAGGANSNGVSMFEQTQIQEFKEAFTIMDQNRDG 
F IDLNDLRDTFAALGRGNVDNEE I DEM I KDAPGPNN FTAI LTMFGEK 
LMGADPEETILNAFKVFDPVLKRELKADYLRKMLTTQA 

8 86 

A 

1 

598 

LARNPGSTHASGGILLGRQCVA^FTMAPKKAKKRAGGAN\SNVFSMF 

EQTQIQEFKEAFTIMDQNRDGFIDKN\DLRDTFACPWRKLNVKNEEI 

DEMI\KEAPGPINFTVFLTMFWGRNLKGADPEGKPFLNRISKVFDPE 

GKGVAEGLDYVREMLNNAGRRGFPRKEVDQMFAAFPP\DVTGNLDYK 
NLVHIITHGEEKD 

8 87 

A 

174 

454 

ERNPPRGGGGGFPRGALFFFSPPFLRGGFSGGPPKPRAPPPPPFLKG 
GPPPGGPGGVAPLFFPPCPGGGGGFLKIKPPPPSGPPPNKPFFLGG 

888 

C 

60 

245 

MAXPXGPXLFYSPXLXXXQXGFXPXWAXXGXPPXLQXXQGXAXXGXQ 
XFPVSXXERKTPLGP 

889 

A 

1878 

2295 

SFSPAWPSRWCASSSWCTDCLVPSTCAALQHPWFIiDKGNHFLIFFY 
FFFFVILYTSSSHEVED*PCVMFSKIKWYLLE/VRYILWAYRLSFC 
KERKKTIKLCQIILLCLAARTWLPCLSLSLPPFLSFSLCMDFFEKK 

890 

A 

3218 

3469 

GFTMLARLVSELLTAGDPSSLGSQSAGITGMSHHAQPSMHFFYFFFE 
TEFRTSCPGWSTMVRSWLTATSTSWVQALLQPQPPE 

891 

A 

618 

1238 

GQRRYSGERDEGRDPLCKDEPQGASSEAPLLVLLFSRGVMMLHQGSC 
SPSCQSRQGGKAANIWSTSSLENLSAWWSISRTVISLQHPFAFRVK 
HFECVENGFLRVRSLKFLPKHCEEHCIOmWTRSLLDHFINFFIFHVH 
S PKG S KGVSQ I VLVNEAI P VLVHDSEGLLKFLDLGL FEHGEHVGVG A 
PGSLLCFLRCHGGKDPAL 

892 

A 

12 

138 

TDH I MREY KLWLGSGGVG KSALTVQ FVHV I SWLYYLL I L 

893 

A 

1 

1002 

- 

MKRPDVGVLLDNAAEAQLDSHMKPKHGRNPLSSLMRKACSRGKAMTE 

QCMEVRIMNFCSRTIAGINQESQENPQRSEGSRLLLQDPGDTPNTGS 

LRKAARGTGKGHCRRKPPAEVCNNIiNRQRISWPELGGGHESSVQTSQ 

EATSLGKEGEYYIKGTPCGTKESEQQPSALDLPSDKAYPNEKEPENQ 

LCSPAMDPNQEEIPDLSEKEFRIiVNKLIREAPEKVIAINTKRTYINN 

FLKEQLIINHSKGLGATNTTCGQAVFQIMACFGLKVGSEKAGTGRAD 

FFPLALHESYFRVGVGCLALAKSDTPAMGARETAYGDQPFETLRRLG 
EEHRT 

894 

A 

66 

295 

RLLFRFFCPTGCSLDWIjStiFS *GCGFI»* SLHTGFMPSPEFWPGDSL 
SVQIVTNFSWRFSSAVAFSSASGSFPQGS 

8 95 

A 

2202 

2337 

GLPNHASHTACRFSSSFATLKTATLTSPLPLPPQPTQHEDTQIR 

896 . 

A 

343 

906 

YDKTRLFNTVPLPPPHKKI IKKI IQDIKGEILRAFPLRTGTKHRCPL 
SPLLFNILLEVLARAIRQEKEIKS IQIGKEEVKLSLFADDI I IYLES 
PKYSSRKLQELIKEFSKVSRYEINVHKSVALLYTNSNQAENQIKKST 
PFTIAAKNKIKYLGIYLTKDAKDGYKENYKTLMKEIIDDKNKQKYIP 

897 

A 

1455 

1604 

LGVLAQLGKNGGLWSQAMEFQTMQAQALLCLPRWLGSSLPPLVCLAP 
QA 

898 

A 

370 

807 

v^L-Jjo vv x t til VKQKVGLrljCLGSLSQGATIGVDFMI KTVEINGEKVKEK 
GGEYHIKGTSCGTKESEQQPSALDLASLTEPTQIRRNQKTNSGNMTK 
QGSLTPP KKI TIjAHQQW I QTNKKS L I YLKKNS ARHQR KAKPDARKS K 
KQYEK 

899 

A 

1 

1548 

MLPERVPDPDPKKG FWDLVEER I QGES I S CRKR IQRAS VEAGG YCKV 
QS KDDEGQRRGGKGGNGEKLSDSALLPRLEKLSDIRKKP PRYPLHRA 
CSVLRPGFDCGTPASGPLRASGVRVPLNRAGSLLGNRAARAGAALRW ' 
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j-i&trEtir MiLRunvjvjyoKl ubCKiiK V V V V r bij£*Kr vjJjoK-Kxii'Kr Ql JLEEV 
PG E MD ARR KHW KENM F T P F F S AOD VT* E ET S E P E S pmr JX D Q o trp m 
EE IYNLSS RKFQEE S KFKRKKY I FQLNE I EQEQNLRENKRN 1 S KNBT 
DTNSASYESSNVDVTTEESFNSTEDNSTCSTDNLPALLRQDIRKKFM 
ERMS PKLCLNLLNEELEELNMKYRKI EEEFENAEKELLHYKKEI FTK 
PLNFQETETDASKSDYELQALRNDLSEKATNVKNLSEQLQQAKEVIH 
KLNLENRNLKEAVRKLKHQTEVGNVLLKEEMKSYYELEMAKIRGELS 
VIKNELRTEKTLQARNNRALELLRKYYASSMVTSSSILDHFTGDFF 

900 

A 

3 

127 

VLCQFS KGMLPVFAQS V* YWLWVRYVPS I PSLLRVFSMKGC 

901 

B 

1 

1926 

MFSDSLTPMHKERIVTLLKREGHWGFMGDGINDAPALRAADIGISV 

DGAVDIAREAADIILLEKSLMVLEEGVIEGRRTFANMLKYIKMTASS 

NFGNVFSVIiVASAFLPFLPMLPLHLLIQNLtiYDVSQVAIPFDNVDDE. 

Q I QKPQRWNPADLGR FM I FFG P I S S I FD I LTFCLMWWVFHANTPETQ 

TLFQSGWFWGLLSQTLIVHMIRTRRVPFIQSCASWPLMIMTVIVMI 

VGIALPFSPLASYLQLQALPLSYFPWLVAILAGLATNSDRFDLNLHV 

FROTGNFNARTCREGGVVFGEEGFA^fiFnnRPVVTnPT.Tr'T.PQnvr'p 

YFARFRIYRDLTGGDDHIAQINXLYVRPIAAGPAPGTFRLWHSRRAM 
LLTRADMEDRERFLNARDTLRALLDNN I VPV I NENDAVATAE I KVGD 
NDNLSAIiAAILAGADKLLLLTDQKGLYTADPRSNPQAEIilKDVYGID 
DALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPGVI 
GDVMEG I S VGTL FHAQATPLENRKRW I FGAP PAGE I TVDEGATAA I L 
ERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALR 
RIAGHHSQEIDAILGYEYGPVAVTVMHDYR* 

902 

A 

1 

1602 

MFFETNENKDTTYQNLWDTFKAVRRGKFIALNAHKRKQKRSKIDTLT 
SQLKELEKQEQTHSKAYRSLEITKIRAELEEIVKQKTIQKINESRSW 
FFENINKIDRPLARLINKKRDKNQIDAIKNDKENITTNPTEIQTTIR 
EYYKHLYANKPENLEEMDKFLDTYTLPRIjNQEEAESIjNRPITGSEIE 

ATTN c 5tjPTKK c iPRPnf!PTAPP , VnPYirV"l?T "\7DTT t vt rncTTrvrnTT r> 
/■vx ii^Qur x -i vivo rvjri/Vjr i. ruir i W-*v Jt JvIxJLIj V Jr Jr J_j1_i xVL» 17 yo J. Jfc» JVtitj JL JLtir 

NSFYEASIILIPKPDRDTTKKENFRPISSMNIDAKILNKTLTNQIQQ 
YIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKDKNHTIISID 
AEKAFDKIQQPFMLKTLNKLGIDGTYLKIMTAIYDKPTASTILNGQK 
LEAFPLKTGTRQGCPLSPLLFNIVL1EVI1ARAIRQEKEIKGIQLGKEE 
VKLSLFADDMIVYLENPIVSAQTLLKLISNFSKISGYKINVQKSQVF 
P YTNNRQTESQIMSELLFTI ASKR I KYVG IQLTRDVKDLFKENYKPL 
LKE IKEDTNKWKDI LCS 

903 

A 

1 

972 

MPWADKRNPSGAQS LNG YAP PDL I WG PS HRAFRAQVE YS I G I S S I YN 
SGTHTLPILNQEDFESLNRPITGSEIEAIINSLPSKKSTGPDRFTAE 
FYQKYNKELPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVE 
RDQWFQAIiIHLVKGNMGTGILGLPHRCEERGHPGQSRLIQLWEQLF 
Si YGIHVGQMLLTRADMEDRERFI^ARDTLRAliLDNNIVPVINENDA 
VATAE I KVGDNDNLS ALAAI LAGADKLLLLTDQKGLYTADPRSNPQA 
ELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVFRAQ 

904 

A 

280 

358 

SHFSCYQQNDSPYQGGVFFLTIHFPT 

905 

A 

141 

152 

QCPPQPRPPGRCPTPQSGHTGCSTGG*GLNPLYYNLCDRSGAWGIVL 
EAVAGAG I VTTFVLT 1 1 LVASLP FVQDTKKR S LLGTQLRGRCHHTAG 
TMGSC 

906 

A 

177 

247 

S WQEAGARSSGPS PNPTAS PRHR 

907 

A 

574 

737 

AWEGAHVFTTSPSSCHSWVRDYARVGLPPLPLPCPQRAIjLGLWEVWK 
GAYSPAI 

908 

A 

221 

576 

GVGGDTHDTHPNTGREGGPSRSGAHVAGPARGRRSLESPLPGARSSG 
PSPNPTASPSPPLTQSHT*GFPKRPESSCRLSRGVAAWWLWAEYIVF 
KPAGS S WHCPS P WLLAVARGRMMS 

909 

A 

1 

157 

TKTWTLKKHFVGYPTNSDFELKTAELPPLKN/GRVAAKRLKEGDTMM 
GQQVAK 

910 

A 

3 

484 

VRLREDDRRGPSLGTCHKSDPGRPAAQSQPPSPGSGTFGLLSFRMVR 
TKTWTLKKHFVGYPTNSDFELKTAELPPLKNGEVLLEAXiFLTVDPYM 
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RVVESKNVALPKGTIXO^ASPGWTTHSISDGKDLEKLLTEWPDTIPLS 
LALATVGMPGL\TAYFGLLE 

911 

A 

1 

1148 

MLLCQKAPSLKTTYNHPPAADSAGTALNLETTVKQTRETQLEYNNVG 
TDLSPEPKSFNYPLLSSSGDOFEIOLNOOI J W«;L.TPT<njnVT?PT irctn/T 

RTLKTDCTETHLQLACAKLISRTGLLMKLLSEQQELRTVSMTAWKPR 
MNRKSRSRMRQSHFASHAGRWWHNHSTLQPQSPKLQMAELSEARRRS 
FRMVRTKTWTLKKHFVGYPTNSDFELKTAELPPLKNGnT.PPT TnvrM 

LYFVEVLLEALFLTVDPYMRVAAKRLKEGDTMMGQQVAKVVESKNVA 
LPKGTI VLAS PGWTTHS I SDGKDLEKLLTEWPDTI PLSLALGTVGMP 

G\LTAYFGLLEICGVKGGETVMVNAAAGAVGSVVGQIAKLKGCKVVG 
AVGSDEK 

912 

A 

214 

540 

ASGWVRTKTWTLKKHFVGYPTNSDFELKTSELPPLKNGEVLLEALFL 
TVDPYMRVAAKRLKEGDTMMGQPVAQVIISICPQCLENMSPLYLLLG 
PVSVHCSYNLFSLCC 

913 

A 

495 

686 

TDQQNEVTLGSHEVTLQPKHVNIHTHDTIYPLFGFFFETQSRSLSQP 
GLQWRNHGPLQPPPPE 

914 

A 

1 

1911 

MGRKQSRKAENSKNQSASSSKDCSSLPATEQSWTENDFDELTEVGFR 
RSVITNFSELKMYVRTHCKEAKNLEKRSTQFIKQVLRDLQRDLDSHT 
IIMGDFNTPLSILERSTTQKVXWDTQELNSALHQVDLIDIYRTLHPK 
STEYTFFLAPHHTYSKIDHIVRSKALFSKCKRTEITTNCItSDHSAIK 
LELR I KXLTQNHTTTWKLNNLLLND YVA^NNEMRAE I KMFFETNENKD 
TMYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDNLTSQLKELEMQE 
QTHS KASRROETTKIRAELKRTKTnKTT.DK'TTPQTJCurTrpvTMWTnD 

LLARLIMKKREKNQIDTIKNDKGDIXTDPTEIQTTISGYYKHLSTNK 
LENtiEEMDKLLDTYTLPRLi YQEQVE SLNTP ITG PE I EAI VNS PPTKK 
IPVSDEFTAEFYQRTNDKNHM 1 1 S IDAEKAFDESQQHFMLKTLNKLG 
IHGTHLKILRAVYBKPTANIILNGQKLEAFPLKTRTREGCPPSPLLF 
NIVLEVLTRAIRQEKEIKGIQLGKEEVKLSLFADDVIVYLENPIVSA 
QKLLKLI SNFS KVSGLRVLANVYGS FQDMETQGLRFFGQNS VPQQLR 
PYNGASLQTLESGEKASAPAKDHCLV 

915 

A 

3 

1381 

KTPFVG YQHQRPKVDKTTKMGRNQSRKAENSKNQSTSS P PKDRSSLP 

ATEQSWTENDFDELTEVGFRRSAITNFSELKEHVIjTRRKJSAXNIjEKR 
LD E WLTR I N S V E KTLND LMET . KTM A O P T . P n n p t c: ttm c * T?*sr\\rr?\rx} r a 

VIEDQIDEIKQDDKAAETC^^WSESLRQLRASVDAFHARASHYNA 
GECLHQLAAIiN S RLNCAQEMARRDS I GE VP PVPWRTWG S G I AGE AK 
LDHLRLVS L I GRE VNKENS PAATRWL FS FQAG ALAGGQ I VLQAAKP1T 
AHGQPWATRNLLNPAGIVAVLKQRPRLVAAAVQAFYLRDPIDLRAC 
RVFKTFLPETRIMTSYYAIEWSLWTLVDVLHAENSESAHMSHNSYDP 
PSDVARHLTHLPMSPIPDYIIARFRYRHRSRVRLTARKLSPAFMRCS 
SEIQRYSTMLERGFNGGYCRSTTVRWTEATRMLSF 

916 

A 

1 

1191 

MAHACWPSTLGGQGKKVNVSNLETLIJaNIEIEVGKEEYEDLLiraLPV 
DENEMVDVNVVMDEAJO^TGEKVNV^NT.n^lPT trktact vt ttyppupvt 

LKTLPIRRPRVOTKKVEI FLENMGT^ 

N INEKV I YTKTPS VGHQHQRP KVDKTTKMGRNOS RKAENS KNO^T*^ 
PPKDCSSSPATEQSWTE1TOFDKLTEVGFRRSVITNFSELKEHVRTHL 
KFJUCSLEKRLDEWLTR INS VEKTINDLMELKT>1ARELHD ACTS FNSR 
FDQVEEK^KRATDTYNLGIALEHRKEMLNLWQKIRGDLIGMDSRNES 
F YDTFST YTWS WNVCQELLS PKDLRL YDAYVNRNSSHNS RS S SS SDT 
SECYTDSGRKRKRKGLKGFQQ 

917 

A 

1 

227 

GTRDPGVWVHSLLLSESQESDQQPLPPPGPRSPELIPMELCRSLALL 
GGSLGLMTWLIACGTDHIjPWTSLSLVRL 

918 

A 

1 

392 

CTSGTMPLVSVFMEADESEKAAREQEPLEGTEQTLDAEEEQEESEEA 
ACGSKKRVVPGIVYLGHIPPRFRPLHVRNLLSAYGEVGRVIFHAEDR 
FLRR I KNAYS I CRR I I AF YT YD FTNQLY IYIIYFIK 

919 

A 

157 

388 

PLMHILKYTMVYLVCPLLSSSHNLPFTPVKLLSYSLFYPYFHFSRFL 
RLCCLRLFLSFFYSSSLLHSLYSFPFLLSP 
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920 

A 

451 

593 

DQFRPLRPKNPTPPRTCEHTLTATPVTRAATHTHTHSHPHPHAHPRR 

921 

A 

467 

1038 

KRCFQVFDYALPPKRWFKDNYNADFLEDRQIiGLQAFLQNIiVAHKDIA 
NCLAVREFLCLDDPPGPFDSLEESRAFCETLEETNYRFTERTTWKNK 
RRWIHLKKLLSEKQLHIDTLENRIRTLSLEPEESLDVSETEGEQILK 
VESSALEVDQDVLDEESRADNKPCLSFSEPENAVSEIEVAEVAYDAE 
ED 

922 

A 

233 

346 

QLHTLDS FSQDVKIRFH I IMRLFPLLMP I FFTWQDNV 

923 

A 

1 

1386 

MDIKKGITDISASLRVESGWEARTRKEKTHINTVIIGHVDSGKSTTT 
GHL I YKCGGVDKRTI EKFEKEAAEMGKCS FKYAWVLDKLKABREHG I 
TIDISLWKFETSKYYVTIIDAPGHRDLIKNMITGTSQADCAVLIVAA 
GFGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPYSHK 
RYEEIVKEVSTYIKKIGHNTDTVAFVPVSGWNGDNTLEPSANMPWFK 
GWKVTRKDGNASGTTLLEALDC I LP PTRPTDKPLCLPLQDVYK I GG I 
VNVATEVKSVEMHHEALSEVLPGDNVGFI^VKNVSVKDVRRGNVAGDS 
KKD P PMEAAG FTAQV 1 1 LNHPGQ I S AG YAPVLDCHTAH I ACKFAELK 
EKTDRHSGKKLEDGPKFLKSGDAAIVDMVPGKPVYVESFSDYPPLGR 
FAVHD I RQTVAVG VI KAVDKKAAGAGKVTKS AQKAQKAK 

924 

A 

1 

1254 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRDF I KNM I TGTSQADCAVL I VAAGVGE FEAG I S KNGQTREHALLAY 
TLVAFVP I SGWNGDNMLEPSANMPWFKGWKVTRKDGNASGTTLLEAL 
DCILPPTHPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVLKPGMWT 
F AP VNVTTE VKS VEMHH E AL S EAL PGDNVG FN VKNVS VKDVRRGNVA 
GDSKNDPPMEAAGFTAQVIILNHPGQISTGYAPVLDCHTAHIACKFA 
ELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 
LGRFAVRDMRQTVAVG V I KAVDKKAAGAGKVTKS AQKAQKAK 

925 ■ 

A 

36 

697 

KliPLKAKMGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIE 
KFEKEAAEMGKGSFKYAWVXiDKLKTVERERGITIDISLWKFETSKYYV 
TIIDAPGHRDFIKNMITGTSQADCAVLIVAAGVGEFEAGISKNGQTR 
EHALLAYTLGVKQLIVGVNKMDSTGIAILTGDFPSLEPRHVSTWLQH 
WTIPTRNWHKCYCVGWANFLNVSADFLNNF 

926 

B 

52 

554 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYY^VTIIDAPG 
HRDFIKNMIILNHPGQISAGYAPVLDCHTAHIACKFAEIiKEKIDRRS 
GKKLEDGPKFLKSGDAAIVDMVPGKPI * 

927 

A 

63 

1545 

PLKAKMGKEKTHI\NIWIGHVD\TGKSTTT\GHIjIYKCGGID\KRT 
IEKFEK\EAAEMGKGSFKYA*VLDKLKAERERGITI\DISLW\KFET 
\SKYYVD*SLIAPG\HRDFYSKT*FTGTISRLDLVWGAPFLGG/CLP 
* LFAAGVWWNLKAG I SQEWGRTREHALLGLHTG VWKQTKLFGVNKNW 
IST*GHPTAQKKILKEIVKGKSALTLRKIGYNPDTSSIIiCPISGW\N 
GDNM\LEPSA\NMPWFQGDGKVTRKDGNASGTTLLEALDCILPPTRP 
TDKPLGLPLQDVYKIGGIGTVPVGRVETGVXiKPGMVVTFGPVNVTTE 
VKSVEMHHEALGEALPGDNVGFNVKNTVSVKDVRRGNVAGDSKNDPPM 
EAAGFPAQVIILNHPGQISAGYAPVIjDCHTAHIACKFAEIiKEKIDRR 
SGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPPLGCFAVRDM 
RQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK 

928 

A 

2 

246 

VTKELLQMDLYAXjLGIEEKAADKEVKKAYRQKALSCHPDKNPDNPRA 
AELFHQLSQALEVTjTDAAATAAYDKVMYASMHVT 

929 

A 

318 

422 j 

LCGRLWWLMPVIPALWEAKAGGLLKLRSMRPCLY 

930 

A 

205 

400 

QSVAQLPRLECNGMILAHCSLHLLGFKRFSCLSLPSSWDYSRLRELP 
REGRE VGAKHTKPGCP I I 

931 

A 

234 

427 

ICKWI ILPCGLAGS YKIAKWSGFTLLLTHMVGAAWV* LSLFGYTI L 
F*FPKMSSFLTMKKDIF 

932 

A 

405 

862 

DCRMGLRIHFWDPHGWCCMGLIVFVWLYNIVIiIPKIVLFPHYEEGH 
I PG I h 1 1 1 FYG I S I FCLVALVRAS I TDPGRLPENPKI PHGEREF WEL 
CNKCNLMRPKRSHHCSRCGHCVRRMDHHCPWINNCVGEDNHWLFLQL 
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CFYTELLSCTH 

933 

A 

2 

413 

PRVRPRVRLLILLLENVSGYRFPVIFQLDASPGHKILPDCMIYSITV 
S LMFP WD Y I STL YGLGLHLGMSLR I VYAL I VLNLCS VS YSTLS FYY 
RSISYYTLTIDYIYITILSVWIWFFFFFFYLDLFLFHFSLTSS 

934 

A 

110 

431 

RKIiVH I RAGQCGNQ I GAKFWEV I S DEHG I D PTGTYHGD S DLQLDR I S 

VYYNEATGGKYVPRAILVDLEPGTMDSVRSGPFGQIFRQDNFVFGQS 
GAEPPSPQACQFP 

935 

B 

754 

1544 

MREIVHIQAGQCGNQIGAKFWEPWKASSIELSQCRNSPSKVFRSKEH 
DGLPVTPPTRR* 

936 

A 

1 

451 

NQMTSKQYTGNSNMPGGLPRQAYTFKTSQLEHKSQMYQVEMNQGQSQ 
GTVDQHLQFQKPSHQVHFSKTDHLPKAHVQSLCGTRFHFQQRADSQT 
EKLMSPVLKQHLNQQASETEPFSNSHLLQHKPHKQAAQTQPFQSSHL 
PQNQQQQQI 

937 

A 

1 

258 

S VALNTRKLWSMHLHVQAKLLQE I VRS FS GTTCQP I QHMLRR I CVQL 
CDLAS PTALLIMRTVLDL I VEDLQR YFHFALKI FSSNYF 

938 

A 

3 

1906 

VTYVELCDIASPTALLIMRTVLDLIVEDLQSTSEDKEKQYTSQTTRL 
IJUjLDALASHKACKIjAILHLINGTIKGDERYMIFQDLLALVRSPGD 
SVIRQQCVEYVTSILQSLCDQDIALILPSSSEGSISELEQLSNSLPN 
KELMTS I CDCLLATLANSESS YNCLLTCVRTMMFLAEHDYGLFHLKS 
SLRKNSSALHSLLKRWSTFSKDTGELASSFLEFMRQILNSDTIGCC 
GDDNGLMEVEGAHTSRTMSINAAELKQLLQSKEESPENLFLELEKLV 
LEHS KDDDNLDS LLDS WGLKQMLES SGDPLPLSDQDVE PVLS APES 
LQNLFNNRTAYVLADVMDDQLICSMWFTPFQAEE I DTDLDL VKVDL I E 
LSEKCCSDFDLHSELERSFLSEPSSPGRTKTTKGFKLGKHKHETFIT 
SSGKSEYIEPAKRAHWPPPRGRGRGGFGQGIRPHDIFRQRKQNTSR 
PPSMHVDDFVAAESKEWPQDGIPPPKRPLKVSQKISSRGGFSGNRG 
GRGAFHSQNRFFTPPASKGNYSRREGTRGSSWSAQNTPRGNYNESRG 
GQSNFNRGPLPPLRPLSSTGYRPSPRDRASRGRGGLGPSWASANSGS 
GGSRGKFVSGGSGRGRHVRSFTR 

939 

A 

1 

585 

ECLCVSTAVRIRHELKTMLTELLRGGSFEFKDMRVPSAIiVTLHMLLC 
S I PLSGRLDSDEQKI QND I IDI LLTFTQGVNEKLTI SEETLANNTWS 
LMLKEVLSSILKVPEGFFSGLILLSELLPLPLPMQTTQVSLPYNMHL 
INDCSNTF*KASDSVKKQKPSSPFLPKRHKITG*GGDKTETSWSLRK 
YGGKNFK 

940 

A 

625 

2115 

YI RGHSTPRKLWSMHLHVQAKLLQE I VRS FSGTTCQP I QHMLRRI CV 
QLCDLASPTALLII^TVIiDLIVEDLQSTSEDKEKQYTSQTTRLLALL 
DALASHKACKLAI LHL INGTI KGDER YAE I FQDLLALVRS PGDS VI R 
QQCWEYVTSILQSLCDQQSSHIDVVRFPCVVYINEVRVIPPGVRAHS 
SLPDNRAYGYLHSHHFLELVTLLLSIPVTSAHPGVLQATKDVLKFLA 
QSQKGLLFFMSEYEATNLLIRALCHFYDQDEEEGLQSDGVIDDAFAL 
WLQDSTQTLQCITELFSHFQRCTASEETDHSDLLGTLHNLYLITFNP 
VGRS AVGHVFS LEKNLQSL I TLMEYYS KEALGDS KS KKS VAYNYAC I 
LILVWQSSSDVQMLEOHAASLLKLCKADENNAKLQELGKWLEPLKN 
LRFEINCIPNLIEYVKQNIDNLMTPEGVGLTTAIiRVLCNVACPPPPV 
EGQQKDLKWNLAVI QLFS AEGMDTFI R 

941 

A 

180 

359 

GSLQRDRWPTSSWRTGRQGSSLLRPRWISFLAQLWRV*RIPAGTFVI 
R IEDGNGRRAF I 

942 

A 

1286 

2189 

HEEKPCSGQNWTQPVPGWVWLEVDPVDLVKSPQMSAVAGLGCGQGTG 
APPELGATAG WTRDPPTANCS PGGG I RVE VFPPGGGGSRLS PP VLAG 
RTRGFPAAPSFFFMYAHVEIDGLALLEAQVQFRALVWVXiGMFCKFTL 
LFHPLEMVLQCNI EVKG ITS VPEANSNKRPSP I S I FDADHE VP AG I R 
HTLHSCARKLIHRGLSKELPIVAALMTVFFIMQLVGQVPAALWVIFG 
EDR FRWS ATM IGLS LAVFG I LHALAQAFVTGPATKRFGE KQAI IAGM 
AADALG YVLLAFATDMAI F 

943 

A 

1 

2537 

FRAAATWPGPPCARRAVNKWYDDHESEEEEEEEDMVSEEEEEEDGD 
AEETQDSEDDEEDEMEEDDDDSDYPEEMEDDDDDASYCTESSFRSHS 
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TYSSTPGRRKPRVHRPRS P I LEEKDI PPLEFPKSSEDLMVPNEHIMN 
VIAIYEVLRNFGTVLRLSPFRFEDFCAALVSQEQCTLMAEMHVVIiLK 
AVLREEDTSNTTFGPADLKDSVNSTLYFIDGMTWPEVLRVYCESDKE 
YHHVLPYQEAEDYPYGPVENKIKVLQFLVDQFLTTNIAREELMSEGV 
IQYDDHCRVCHKLGDLLCCETCSAVYHLECVKPPLEEVPEDEWQCEV 
CVAHKVPGVTDCVAEIQKNKPYIRHEPIGYDRSRRKYWFLNRRLIIE 
EDTENENEKKIWYYSTKVQLAELIDCLDKDYWEAELCKILEEMREEI 
HRHMDITEDLTNKARGSNKS FLAAANEEILESIRAKKGDIDNVKSPE 
ETEKDKNETENDSKDAEKNREEFEDQSLEKDSDDKTPDDDPEQGKSE 
EPTEVGDKGNSVSANLGDNTTNATSEETSPSEGRSPVGCLSETPDSS 
NMAEKKVAS ELPQDVPEE PNKTCES SNTS ATTTS I QPNLENSNS S S E 
LNSSQSESAKAADDPENGERESHTPVSIQEEIVGDFTSEKSTGELSE 
SPGAGKGASGSTRIITRLRNPDSKLSQLKSQQVAAAAHEANKLFKEG 
KEVLVVNSQGEISRLSTKKEVIMKGNINNYFKLGQEGKYRVYHNQYS 
TNSFALNKHQHREDHDKRRHLAHKFCLTPAGEFKWNGSVHGSKVLTI 
STLRLTITQLETTSLHPSFIPTGHHIGQIGSRQFRCVANPENLHWL 

944 

A 

2 

437 

RTTAARRAVNKVXFDDHVSEEEEEEEDMVSEEEEEEDGDAQETQDSE 
DDEEDEMEEDDDDSDYPEEMEDDDDDASYCTESSFRSHSTYSSTPGT 
HPAQLLQTPSPPPLPSPLAHSCAVHPAPISPQPRLPPKQRGNARAHQ 
VAKN 

945 

A 

298 

581 

GARAEARAGSGQGAGLGVSSPVRWRGWSDKGAERPGGPLSPLREVSR 
AGPSG/HAARGQQPGRPRFPPPGPRPPRRRCPCTRARPGGAAGGGPC 
L 

946 

A 

377 

640 

ITSSLLFPSS I FKCRHAHTGLLI SKHGDHRVSKES PLNS PASQAGS A 
TFATTPLTPPKLQLRAPWQPLHSFRPCPGVTSRKRTARAR 

947 

A 

1279 

2253 

PLLTSAAS S KLLTS S WPTS KC S WRTS SGPCTS S KDTMQ S PERPCLM 
PLKKWQELSPETSGKRKKRKQMNQYSYIDFKFEQGDIKIEKRMFFLE 
NKRRHCRSYDRRALLPAVQQEQEFYEQKIKEMAEHEDFLIiALQMNEE 
QYQKDGQLIECRCCYGEFPFEELTQGANAHLFCKECLIRYAQEAVFG 
SGKLELSCMEGSCTCSFPTSELEKVLPQTILYKYYERKAEEEVAAAY 
ADELVRCPSCSFPALLDSDVKRFSCPNPHCRKGCCSPSGFSPRNSQR 
QLLTPRQVAEPTPTCISEPRKSRVSVGSENPAATWGLRCRRPP 

948 

A 

268 

369 

ELLILTLIFFSFQEFSGYVQQVKYAMTRIKAGMP 

949 

A 

15 

442 

CRDDPRVRLSRSRPVGPHFLSPTMFRRKLRGLDYHNPAAFNCKDETE 
FRNFIVWLEDPKIRHYEIEDRGNLRNIHSSDWPKFFEKYLRDVNCPF 
KIQDRQEAIDWLLGLSVRLEYGDNAEKYKDFRPDNSKTADNGTKNAE 
P 

950 

A 

118 

376 

VAWQIIFLPVFIAEKYKDLVPDNSKTADNATKNAEPLINLDVNNPD 
F KAG VMALANLLQ I QRHDD Y LVML K\ A I R I LVQERLTQD 

951 

A 

1 

969 

AATVLTTIGEAPSFRSDSAPARPLAASPVPAPPAPPRFFSPGRGPGD 
QSEKRWTMFRRKXjT\SLDYHNPAGFNCKDETEFRNFIVWL»EDQKIRH 
YKIEDRG\NLRNIHSSDWPK\FFEKYFKRC*TCPFKIQDRQESYLTG 
FFG\LAVRLEYGDNAEKYKDLVPDNSKTA*QLQLKIAEPIiIN\LDVN 
NP\DFKAGWGFG*TWLQIQRH\DGLPGQMLKANSGFWVQERLDHQG 
CQFA*GQIKQKRGLPVA\LDKHILGFDTGDAVLNEAAQIIiRIiLHIEE 
LRELQTKINEAIVAVQAIIADPKTDHRLGKSLEDEHLRTSASHLL 

952 

A 

1258 

1429 

ILFWIIQLNLECSEFSWRLRQGRGIMLFFFPFLFFFETESHSVAQTG 
VQWCDLGSLQ 

953 

A 

189 

405 

GLPRCASCTACGMLPKTLEEVAMDITWEKWKQMDTVQKSIYRNMLEK 
YRNLVLLEKPYEYHDWENFQLLFR 

954 

A 

51 

1702 

GLPRCASVQPVECYQRRWKRWLWDITWEKWKQMDTVQKSIYRNI3J\E 
KYRNLVLLDGKLAAGAKTS R VEQQD I S KED * S QRLEMEE LAKRKS VP 
EEI WKSRGQFKNQQLNKENNLGQE I atctki ptrkrdi esnef/ rek 
FYCKINTCC/ MNR * I LWKR IVINMVHVERCSNKTQI * LYKESMMEKK 
KPCKYSECGRTFRGHITLVQHQITHCGERPCKCTECRKGFNQSSHLR 
NNQRKTLSGEKPYKCSECGKAFSYCLVLNQHQRIHSGEKPYEGTECG 
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KTFIQS\SYLTQHQR/TSHTGEKPYTCLECGRLFSQNTHLTLHQRIH 
TGEKPYECNECGRSFSQTAHLTQHQRMYTGEKLYECNECEKAFHDHS 
AL I QHH I VHTAEKP YD IMTGKTFS YCSDL I QHQRMHTGEKPYKCNEC 
GN AF S D C S S L I QHQRTHTGEE P YECKQCG KAFS RS T YLTQHQRS HAG 
EKQYKCNECEKTFSLSSFLTQHMRVQTGEKPYKYNEYGKAFSDCSGH 
FQRTHTGEKPCECNDCGKPFSFCSALIQHKRIHTRKKP 

955 

A 

2 

270 

REEKEWPPHTQPWYPEGCFKDGQHYEEGSVIKENCNS*WTAQNYSQF 
WGMTLEDGFKFRLGTLPPSPMLLSMNEMTVSVPSDSRMCMYL 

956 

A 

1 

999 

MWTGYKFSIFSYLTTEIWMEKQYLSQREVDLEAYFTRNHTVLQGTRF 
KRAIFQGQYCRNFGCCEDRDDGCVTEFYAANALCYCDKFCDRENSDC 
CPDYKSFCREEKEWPPHTQPWYPEGWTAQIYSQFWGMTLEDGFKFRL 
GTLPPSPMLLSMNEMTLVSHACYPLFKDQNATNNGCAMASRSDGRGK 
RDATKPCPNNVEKSNRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIM 
QVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLTGWGTRRGAQ 
GQKEKFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAAWGQLTS 
SDEP 

957 

A 

1 

1518 

VHCSEFPGRPPRPRTRGRKVEAIPVFQRMWTGYKILIFSYLTTEIWM 
EKQYLSQREVDLEAYFTRNHTFLQGTHFK\RAIFQ\GQYCRNFGCCE 
DRDDGCVTEFYAANALCYCDKFCDRENSDCCPDYKSFCREEKEWPPH 
TQPWYPEGCFKDGQHYEEGSVIKENCNSCTCSGQQWKCSQHVCLV\R 
PE\ LI EQVNKGDYGWTAQNYSQFWGMTLEDGFKFRLGTLPPSLMLLS 
MNEMTASLPATTDLPEFFVASYKWPGWTHGPLDQKNCAASWAFSTAS 
VAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWYLR 
KRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVEK 
SNRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGI 
YRHVTSTNKESEKYRKLQTHAVKiTGWGTLRGAQGQKEKFWIAANSW 
GKS WGENGYFR I LRG VNESD I EKL I 1 AAWGQLTS SDEP 

958 

A 

316 

411 

CSFIPGMEDRFNIRKTINVIHHINRTKNKNHI j 

959 

A 

314 

391 

ECTFFSSVHGTFSKTDHIIGHKTSL 

960 

A 

112 

1154 

SCGLGHRKTFSFVSLPARNETQPKACREQNMEGDFSVCRNC*RHWS 
ANFTLHEAYCLRFLVLCPECEEPVPKETMEEHCKLEHQQV\GCTMCQ 
QIMHKSSLEFHKANECQERPVECKFCKLDMQLSKIiELHESYCGSRTE 
LCQGCGQF IMHRMLAQRRDVCRS EQAQLGKGER I S APEREI YCHYCN 
QMIPENKYFHHMGKCCPDSEFKKHFPVGNPEILPSSLPSQAAENQTS 
TMEKDVRPKTRSINRFPLHSESSSKKAPRSKNKTLDPLLMSEPKPRT 
S S PRGD KAAYD I LRRCS QCG I LL PL P I LNQHQEKCRWLAS S KRKTS E 
KFQLDLEKERYYKFKRFHF 

961 

A 

103 | 

538 

NETQPKACREQNMEGDFSVCRNCKRHTVSANFTLHEAYCLRFLVLCP 
ECEEPVPKETMEEHCKLEHQQAYGSGIGKRFWFQERLAVLLRSVKRG 
CEKGRSWKAVRWSPVSRLSMSIPPGAPDLSLWRYSSSGSRCQHKTGL 
MNNS 

962 

A 

1 

405 

SGFMDHLEEKADLSELVEKEELGFFQYYRERCHQKVYHPITKPGGSA 
KDAAPGGGHHQAGPGQGGDEGIRMMFCVSPNRATFVVSVVPLVSIID 
V I YGRYLRKLTKVTQDS LAQATQEER I GNVKNCS S FWE S PP 

963 

C 

334 

471 

MRNIFHFTRGSGTTSVKRGYVQAAGGPVPPRRSQEGRLAFRLHVPG 

964 

A 

1 

1137 

MGRGAGAALGRWSRAPLEELLPGRGSGRLGGPRGPRTAPGAVGLGPA 
AAGEEAWRRGRAAPSRDDQRLRPMAPGLSEAGKLLGLEYPERQRLAA 
AVGFLRCPIIDAIYTNPTVDYSDNLTRLCLGLSGVFLCGAAANAIRV 
YLMQTSRQRWKRLRTSLFSSILGQEVAFFDKAGTGELINRLSSDTA 
LLGRS VTENLSDGLRAGARASVGIRMMFCVS PNRATFWSWRLVS I 
I D V I YGRYLRKLTKVTQDS LAQATQEERFNTWGEKGVL LSGGQKQR 
IAIARALLKNPKILLLDEATSALDAENEYLVQEALDPLMDGRTALVI 
AHHLSTIKNANMVAVLDQGKITEYGKHEELLSKPNGIYRKLMNKQSF 
ISA 

965 

A 

1 

451 

GDRPFRV*DPRAGQAATFWTDPSLS*SLLLGVMFPAAPSPRTPGTGP 
RRGPLAGLGPGSTPRTASRMGLPLGSAVSSPVLFSPGGRRSSLISRG 
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TPTRMLPHHSITESVNYDAKTIGSSLPVKVMEALTLAEVDDQLTINI 
NEGGWACLV 

966 

A 

301 

1867 

ECLCFIaAAVGFLTMSRVISMSAPFFLGKIIDVIYTNPTVDYSDNLTR 
LCLGLSAVFLCGAAANAIRVYLMQTSGQRIVNRLRTSLFSSILRQEV 
AFFDKTRTGELINRLS SDTALLGRS VTENLSDGLRAGAQAS VG I SMM 
FFVSPNLATFVLS WPPVS I IAVI YGRYLRKLTKVTQDSLAQATQLA 
EERI GNVRTVRAFG KEMTE I EKY AS KVDHVMQLARKE AFARAGFFGA 
TGLSGNL I VLS VL YKGGLLMGS AHMTVGELS S FLMYAFWVG I S I GGK 
QEC I PSQLGTNDFLVFVS LL VFENQDGKVRRG YA WEGKLGMRN I FH 
FTRGSGTTSVKRGYVQAAGGPVPPRRSQEGRLAFRLHVFKFRAQARL 
QPDGQIPRCPSRCSLESCSQPPLLRGPRTTIDHRYLKLQIKGGNTVY 
AKFSPVYRVDQVYEVSRETPKLAPHVIGFNNTVKSSPTSISGSYVAS 
QRLKDLGHAQMWDELFSSHSDLDSDSELDRAVTQISVDLMDDYPAS 
DPRWG 

967 

A 

57 

383 

GWLAHTANERPVRCFPWLEGFGLENIRAPTLNF*QIQQSLAIFVEPI 
KTILLPFCFFRIFSTRVRICTYSSHSFPGPGAGLLHFCSSSLVRFKI 
S FGDTVAVAPVPDF 

968 

A 

507 

904 

GKRAGECWRRSLRPRGPRGETRAASPSSARPSLALASAPRPKSGPRP 
RQGPRRPLWPRGLKGQVCARGPGRGGPGGVGGRRSASARAPRVPGDQ 
PPPPPAPSS*ED/WT*EIGRLSSHAPLAVLHTRSALIiNS 

969 

A 

379 

858 

KWG FCRG FLKWLTS KKRAFGLRATRD I KAE E L FLWVPRKLLMTVES A 
KNSVLGPLYSQDRILQAMGNIAIiAFHLLCERASPNSIWQPDIQTLPS 
E YDTPL Y FEEDE VRYLQSTQAI HD VFSQ YKNTARQ YAY F YKVI QVS S 
CLQFKY I FHRVGRKEELS P 

970 

A 

259 

483 

NKMDDLTLI^LLECPVCFEKLDVTAKVIiPCQHTFCKPCIjQRVFKAHK 
ELRCPECRTPVFSNIEALPANLLLVRLL 

971 

A 

175 

588 

CTDCCRGCRVSQLLETALLITRHPSHVGIEKKAEGPSHQISIPETDT 
SASILVASSCJ^RTNLKNQLVCKYLNHIMGLLDRLSVIjIjGLKKKEVH 
VLCLGLDNSGKTTIINKLKPSNAQSQNILPTIGFSIEKFTSSSL 

972 

A 

1 

383 

TAKAALYPLVGKGGGAGGPVLAMRPNQAPVSGRVYIHRDYSRVTRCQ 
FQTRFPAQLDNRIAGHPFEKTVLTLNNLYAEAEKLGGQAYLEGCWAC 
LTPYT I FLCMETNYEKVLKKD FQ Y I PQLTE S I Y 

973 

A 

79 

445 

RRQLG VAL I PSHRMD YKS SL I QDGNPMENLEKQL I CP I CLiEMFTKPV 
VILPCQHNLCRKCANDIFQAANPYWTSRGSSVSMSGGQFRCPTCRHE 
VIMDRHGVYGLQRNLLVENI IDI YKQEC 

974 

A 

1163 

1402 

TETKINFQNVKAMDLIHLFIYLFYFVFETESCSVARLGCSGMVXiARC 
NLRLPGSGDS PAS AS RVAGTAG ACHHARL I FA 

975 

A 

80 

407 

LRRRPRRGKRKECRVQAGSWCGRMAVSHSVKERTISENSLIILLQGL 
QGRVTTVDLRDESVAHGR IDNVDAFMN I RLAKVTYTDRWGHQVKLDD 
LFVTGRNVRYVHI PE 

976 

C 

70 

240 

MITGTSQADCAVX»IVAAGVGEFXAGISKNGQTRRDMRKXLRKSALTL 
RKLATTPTQ* 

977 

B 

68 

490 

MGKEKTH IN I WI GHVD S GKS TTTGHL I YKCGG I D KRT I EKFE KEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRDF I KNM ITGTS QADCAVL I VAAGVGE FE AG I S KNGQTREHALLA* 

978 

A 

261 

408 

TRRPTARGMIjEETNPADSKPGTIRGDFCIQVGRTFFYSPVFFKNSFF 
Y 

979 

A 

1 

678 

MPEPPTHSMGSCAARASPTSTTPCSTAPSPIDHPRAEECERTAQDWQ 
AASPAAPVRDPLGI^ISKLDLVTFLEGKKEPWTVKSEETIAAQPEVG 
GKLSDLVARGTALGASSTRGLQGGPSGFLVTPSGDQMWASLWPVLVA' 
AGHTFGL I RDS KLRKEEG I LTY PLAQ YQQVW FF FHHKFC WQGRWRH I 
VELYANVTLLFALAWTSKGIVTYCWAQQHGDMSLLYGP 

980 

A 

181 

340 

DVRTSWKLGKGLAISKLDLVTFLEGKKEPWTVKSEETIAAQPVQMYN 
LIPEHF 

981 

A j 

1 

375 

HCSQGLLPTPLLRVSTLLFKLASFTMGNLPPSISPSSPLACVLKSLrK 
PLHLLPDLKS KHL I FFSNTAW PQY KLDNG S KWPENDTFD FS I PRDLD 
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TFCRKKG KW S EVP YVQAFFTLC S LLS YMTHT 

982 

A 

1 

2583 

MWFTVDQIRAIMDKKANIRNMSVIAHVDHGKSTLTDSLVCKAGIIA 
SARAGETRFTDTRKDEQ\ ERC I PIKVNAIS \ LFYEL\ SENDLNFI QQ 
SKDGAGFLINLIDSPG\HVDFSSE\VTAALRVTMGALVVVDCVSGVC 
VQTETVLRQAIAERIKPVLMMNKMDRALLELQLEPEELYQTFQRIVE 
NVNVIISTYGEGESGPMGNIMIDPVLGTVGFGSGLHGWAFTLKQFAE 
MYVAKFAAKGEGQLGPAERAKKVEDMMKKLWGDRYFDPANGKFSKPA 
TSPEGKKLPRTFCQIilLDPIFKVFDAIMNFKKEETAKLIEKLDIKLD 
SEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCEL 
LYEGPPDDEAAMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFS 
GLVSTGLKVR I MGPNYTPGKKEDLYLKP I QRTI LMMGR YVEP IEDVP 
CGNIVGLVGVDQFLVKTGTITTFEHAHNMRVMKFSVSPWRVAVEAK 

DLEEDHAC I P I KKSDPWS YRETVS EESNVLCLS KS PNKHNRLYMKA 
R P F PDGL AED I D KG E VS AR OE L KOR AR YT . A F KY F WH V a F a P JC T W P Pfi 
PDGTGPNILTDITKGVQYLNEIKDSWAGFQWATKEGAIiCEENMRGV 
RFDVHDVTLHADAIHRGGGQIIPTARRCLYASVLTAQPRLMEPIYLV 
E I QCPEQ WGG I YGVL1TOKRGHVFEESQVAGTPMFVVKAYLPVNESF 
GFTADLRSNTGGQAFPQCVFDHWQILPGDPFDNSSRPSQWAETRKR 
KGLKEGI PALDNFLDKL 

983 

A 

1326 

1498 

VQLKA^TI CQI SRI LSRVIxACLMRESSSEPKEVSGVCVQTETVLRQA 
IAERIRPSSK 

984 

A 

60 

966 

RPJWSDGNACQPATHLLLXSARFITGHGQKLKSELKJCTLQVIMLNIQ 

L S C C S E RKD AN P KS WC S F FM OEOC TKG E KP FT. T P GTT i <3 VDP HT.P Y F 
GILPTVSNAAVVKEVPTVSNAAVVTFA 

VMEVPTGSNAAWTE VPTG SNAAWTEAPTG SNAAWKEAPTG SNAA 

RVMEVPTGSNAAVvlCEAPTGSNAAJ^V>1EVPTGSNAAVVKEV^ 

AWKEVPTGVTLRWSRKRPRE 

985 

A 

1393 

1625 

EGSNKFANDIQLLS I PYWKKRFLHMKKMPLFSKSHKNPAEI VKILKD 
NS AI LEKQDKKTDKAS EE VSKSLQAMKE I C 

986 

A 

358 

475 

LCEETPS I VKKCKI SWEWWHI PVI PATRE AEAGELLEPR 

987 

A 

232 

329 

VGIKMSISSDEVNFLVYRYLQESGFSHSAFTF 

988 

A 

127 

461 

FREDRL PADLRAKVWK I ALNVAGKGDS LAS WDG I LDLPEQNT IHKDC 
PASLLDQLSVPEEKAAELLLDIESVITFYWKSRNIKYSTSLSWIHLL 
KPLVHLQLPRSDLYNCF 

989 

A 

562 

897 

KGLVSNGPEPJ^TLCPCSTLLEEGSAAVlSGSGAWGALRPVIiVEPLDHG 
IRF>EVVT^WNSKOACTGTHOEACPVSL.HKCCGPPOGTC^FCSHC , GO«; 
GRIRPEKGPSRSSVRAPP 

990 

A 

282 

507 

TMGLEKPQSIQjEGGMHPQLIPSVIAWFILLLGVCFIASCLv^^ 
SRWKRGTG VTiKLKHHAKLKC IKEKSEL 

991 

A 

7 

494 

IRLDFQLLRTDRFSRIQAJiEDKARGERTLHAAVPTPGYPESESIMMA 
P I CLGENHEEQLTVNS KALE I LDLI SQPAVGVAI VGL YRTGKS YLMN 
RI^GKPJJGFPLGSWQTETKGIWMWGVTQLFKPigHTLGLLDTEGLGD 
VEKSNPKNDSWIFCLAVLLSS 

992 

A 

206 

357 

RILGGAKVYGGLQGIIFFLIGGRFKTYLGIFGKDLFFGGGTYWANFL 
PEF 

993 

A 

1 

101 

LRSRYGRAQHSTMIxHHAPLQKILPPPVI^TIIjMS 

994 

A 

256 

407 

RILGGAJCVYGGWEGNIFFLLGGRIKTYLGII^KDPYFGGGTYWANLP 
TEF 

995 

A 

254 

432 

RTLGGAKFYGGLQGNSFFLIGSRIKTYLGILEKNLSLGGGTYWANLP 
PKFKGFRENINF 

996 

A 

249 

402 

R I LGGAQVYPGLQGNS FFL IGGR I KTYLG I FGKNP YFWGGTYWANLP 
SEI 

997 

A 

37 

465 

GRARPGQNSYDLFLLAAAARGAGGHYAPPCTEDCRKSTYPPSGPTNR 
GAGPWYTINLDLPPYKRWHELMLDFJ^PMLKVIWSLKl^INTFVPSG 
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KVMQVVDEKLPGLLGNFPGPFEEEMKGIAAVTEIPLGKVHLEALKKK 
SN 

998 

C 

127 

216 

MRLRKQAEKNVEKKIDKYTEVLKTHGLLV* 

999 

A 

1 

1258 

LFLPLLEPGERRGLL YRSRPGGAVALI LLAAAVS \ CAV\ SQHAPPWT 
EDCRKSTYPPSRPTYRGAVPWDTIQCLTYHPYKRWHELMLDKAPV/L 

DKVTVN^LKNMTNTFV VD\ QnTfTMnmmirVT \ DOT t r->riT?T>\ r»rn7«T?T7c» 
\ji\.v jl v iv o xjjvivi ixivir \ vr \ oox\.irjy v v L/CjisAj \ irvjxjxjvjN r tr \brf hifco 

MKGIGRCLLDIPLGEIISFNIF\YE\LFTICTSIVAEDKKGHLIHGR 
N\MDFGVF\LWNINNDTWITEQLKPLTVNtiDFQRNNKTVFKASSF 
AGYVGMLTVGFKPGLFSLTLNERFSINGGYLGTT.FWTT.rsKKTjaiwrwTa 

FLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCVITRD 
RKES LDVYELDAKQGRWYWQTNYDRW KHP FFLDDRRTPAKMCLNRT 
SQENISFEmYDVLSTKPVLNKLTVYTTLIDVTKGQFETYLRDCPDP 
CIGW 

1000 

A 

2 

333 

ACPFNKSAEDLLNL/ROGLTAGOLPFTjPWWNTVT.n<;p / QIYT^TPAT^ 
FSTSLATARARSASEVPIWKTTLFCSLSQVRKTTRSSGATKSTVLST 
SLTTLLH*IILRSISSSQ 

1001 

A 

292 

945 

LPPLPACPALHNMQPSGLEGPGTFGRWPLLSLLLIiLLLLQPVTCAYT 
TPGPPRALTTLGAPPJ\HTMPGTYAPSTTLSSPSTQGLrQEQARALMRD 
FPLVDGHOTLPLVLRQWQKGLQDVl^RNFSYGQTSLDRIiRDGLVGA 
QLHTCTSTITHAYATTYTDTCPQPCSCSEEVLSTVDFVAPDDRVVFR 
TCEREQNRWFQMPLVNTYKVNSHSVPTNS 

1002 

A 

46 

493 

GQEEMAAGETHLYAKVSNKLKSRSSPSLLEPLxxAMGFPVHTALKAH^ 
ATGRKTAEEALAWLHDHCNDPSLDDPIPQEYALFLCPTGPLLEKLQE 
F WRE S KR Q CAKNRAH E VF PHVTLCD F FTC EDQ KVE CL YE ALKRAGD R 
LLGSFPTA 

1003 

A 

2 

407 

QTQREPT^T^^JAPADKNNVKAGWGKGGAHAGEHGAEAIJDRKILYFP 
KJSTx'FPHFYLSHGSAQGKGHGKKEDDALTNAVGHGDIiMPNALCALSDL 
HAHKLGVD PDNCKLL IHCLLVTLCAHL PAE FTPAVHATLDK 

1004 

A 

192 

548 

GLFERKIDI IKIIxNI FSLFLINSKNTNKCVTjRYMYRNCCNI I ILNNL 
KOTEL PKVG YNER KGGL F I LWN I FOELKRMR I HTHTHTHTHTHTHTH 
THSLGIPNPEIQMLQSPKLSDPPT 

1005 

A 

1 

52 8 

DNELLDYEDDEVETAAGGSMS/EAPAKKDVKGSYFSIHSSGFRDFLL 
KPELLRAIVDCGFEH\PSEV\QHECIPQAILGMDFIiCQAKSGMGKTA 
VFV1J\TLQQLEPVTGQVSVLVMCHTREIJ\FQTR*KTRPFW1WTVCPS 
GEDTRVGVKETLLPPPLTAPTPWLPSFASPPLt»NPHF 

1006 

A 

807 

1036 

RNSDVRPLVMAENDVDNELLDYEDDEVETAAGGDGAEAPAKKDVKGS 
YVS IHS SGFRDFLLKPELLRAI VDCGFEH 

1007 

A 

136 

1520 

LFTPCFHLFCENPSRSPFPSSPAGPVMAENDVDNE\LLDYER*MRWE 
TAAGGDGAEAPAKKDVNGLLCLPSHSSGFRDFL\LKPELLRAI\VD\ 

lepv \ tgogvcag * cchtrelafoi ske yerfs kympnvkvavffgg 
LSIKKEEEVIjKKNCPHIWGTPGRILAI^^ 

DKMLEQLDMRRD VQE I FRMTPHEKQVMMFS ATLS KE IRPVCRKFMQD 
PMEIFVDDETKLTLHGLQQYYVKLKDNEKNRKLFDLLDV^ 
FVKSVQRCIAIiAQLLVEQNFPAIAIHRGMPQEERLSRYQQFKDFQRR 
ILVATNLFGRGMD I ERVNI AFNYDMPEDSDTYLHRVARAGRFGTKGL 
AITFVSDENDAKILNTDVQDRFEVNISELPDEIDISSYIEQTR 

1008 

A 

92 

191 

YHWMGMSNLKLLKYVLFLFNLLFWVRASLLSTG 

1009 

A 

219 

424 

RKTTSPGAFYLGGKGGYPGRVRGAVKKPPGEFWGGPQLRGGDPGGTP 
PSGIKGSRGKKRFFGASGGNP 

1010 

A 

215 

417 

RVEVHFKSCDCQKAEEGGCSQKNFTRGPLSDTCIGILKGSHICRYSE 
CGKIFPDPRYFCVSRNIHTR 1 

1011 

A 

261 

486 

GFLGKVLQS PATTWRTLNDRS S I VMGEP I SQS S SNSQ * FFFFKENS 
RILFCLG*CAVEQHFSSIFHCVDFYII 

1012 

A 

31 

2697 

DSPCEPAGGGRERQRRVGLQDGGSVWTAAAGEQASPGGQRDGRVGV 
CAGEWGNGASGRRGLSGSCHPFSRLNLWSHGEFGPLRSRRRRHRHY 
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CLYRIiLRVRSADEVSEEAAARAVETAATMSDTRRQVEGLYPETKTGN 
GTTEAPGTSPPLTWRSSRGCRCWFGQSPTDHYSWESKINPNTAYQKQ 
QDTLIVWSEAENYDLALSFQEKAGCDEIWEKICQVQGKDPSVEVTQD 
LIDESEEERFEEMPETSHLIDLPTCELNKLEEIADLVTSVLSSPIRR 
EKLALAIiENEGYIKKLLQLFQACENLENTEGLHHLYEIIRGILFLNK 
ATLFEVMFSDECIMDWGCLEYDPALAQPKRHREFLTKTAKFKEVIP 
ITDSELRQKIHQTYRVQYIQDIILPTPSVFEENFLSTLTSFIFFNKV 
EIVSMLQEDEKPLSEVFAQLTEEATDDDKRRELGMDDLQVRSAATDI 
FSYLVEFSPSMVREFVMQEAQQSDDDILLINWIEQMICDTDPELGG 

AVOIjMGIjIjRTLTDPENMTJ\TTN>ITFTnPPVr;pc:T KTKT T T DDT VQT.OT? 

TRLLKTDIWPTDSGMLTAIjSVPLPHSGPWLQDWLCPTAASHRVVWDG 
CPPPAEGKGPQAKFLGTSHTGAIQQHDDVRNRSPLPPSGLSRRRRGR 
GDRENLPPNCDPAPRGGAERGGSRAERGGSLAEVLPPQSGTPPRESA 
EHSGLGSACRSGAPARTARASLLEEEEKGRESRVTRWRICSSYRGGG 
RASEPWGAKRRESGCMRGVRSADEVSEEAAARAVETAATMSDTRRRV 
KVYTLNEDRQWDDRGTGHVSSTYVEELKGMSLLVRAESDGKCLRVNS 
VEGELENTHLVSNMELIDYWWGEILTLLGDEGDSSDLCCDSW 

1013 

A 

306 

441 

WFWLIELATMSKCNIKPMGKSEPLFFFFESESCSVAQTGVEW 

1014 

A 

149 

405 

PRQEAKMGKKLDLSKLTDEEAQHVLEVVQRDFDLRRKEEERLEALKG 
K1KKESSKRELLSDTAHLNETHCARLLEPLTLVWERQK 

1015 

A 

2 

344 

LRVSTLLFKLAS FTMGNLPPS I S PS S PLACVLKSLKPLHLLPDLKSK 
HLI FFSNTAWPQYKLDNGFKWPENDTFDFS I PRDLDTFCRKKGKWSE 
VP YVQAFFTLC S LLS YMTHT 

1016 

A 

172 

341 

S R W PG P F PQS GE VS AAPEMS KL S FRARALDAS KP L» P VFRCEDLPDLH 
EYAS INRAR 

1017 

A 

73 

408 

FLQLQERQWFLTQEDSLRNMWNDIELLTNDDTGSGYLSVGSRKEHGT 
AliYQVDLLVKI S SEKAS LNPKIQACSLSDGF 1 1 VADQS VI LLDS ICR 
SLQLHLVFDTEVDVVGLW j 

1018 

A 

53 

237 

FLQLQERQWFLTQEDSLRNMWNDIELLTNDDTGSGYLSVGSRKETSIi 
HFLQASNFYPSAKK 

1019 

A 

51 

482 

ESLRWKLSLTKVVNGCRLGKIKiajGKTGDHTMDIPGCLIiYTKTGSAP 
KLTHHTLHN I HG VP AMAQLTL S S LAEHHE VLTE YKEG VG KF I GM PE S 
LLYCSLHDPVSPCPAGYVTNKSVSVWSVARRVEMTVSKFMAIQKALQ 
PD 

1020 

A 

49 

399 

NTKHPNLAIVLPVEQAALLHNCVKTVDEVFPNEKDLTDRPLRDPDVK 
YFTGGSSFVljEEFGHARYTVVTSDSVVEAQPLPTGTSAQKAELIALT 
RALLLAKEKKVS I YTNS KYASRA 

1021 

A 

564 

651 

KIGCSAYGVAILLFLYFFNKLAFTLWKK 

1022 

A 

2 

368 

SLPASDRPPISSPLATSGTIFSAISCFWDLPAPFLWLAPSCQPTMSS 
Q I RQNYSTDVEAAVNSLVNLYLQAS YTYLS \ LQD I KKP AEDEWGKTP 
DAMKAAMALEKKLNQALIJDLHALGSART 

1023 

A 

18 

781 

EICPSRPKNSARRGGPAGLSLASTVFGRNRSGDWASSLRPPSDFLLR 
LQ P PG PS YRPS PAS GTCQHR FLWLAP S CQPTMS S QI R\ QNYS TDVEA 
AVNSL\VNLYLQASYTYLSLGFYFDRDDVALEGVSHFFREL7^EEKRE 
G Y* RLLKMQNQ\ RGGRALFQD I KKPAEDEWGKTPDAMKAAMALEKKL 
NQAIjLDLHALGSARTDPHLCDFLETHFLDEEVKLIKKMGDHLTNIiHR 
LGGPEAGLGEYLFERLTLKHD 

1024 

A 

217 

608 

DKKLKKDS FTATSLCKTSLLI ITNQGKKSGSREHRNTAKGNI AKCFP 
SFIRQTSCNATTGCICTQFREMPTLLFSYPQFSFFLSFFFSFFETES 
LLPRLECSGAISAHOOJDLLGSTDPPTLAPQITCTT 

1025 

A 

147 

430 

GMAAVGNLLGRTOQSTVKVTGPAIiHRLHTSFWRADSNRASLTRVHRQ 
AYARLYPELLEKQDGCTIHIRYREPRRMLAMPIDLDTLSPEERRARIi 

1026 

A 

489 

702 

TFICSRLRQSTVKATGPALRRLHTSSWRADSSRASLTRVHRQAYARIi 
YPVLLVKQDGSTIHIRYREPRRM 

1027 

A 

899 

1097 

QPAGPSLTRCATAQALCTTLPPCCIjVKQDGSTIHIRYREPR/QQCWR 
MPIDLDTLSPEERRARLRKR 
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1028 

A 

59 

427 

GGALGRAGSPPSVDPASGVAVSEAARSPLGSGFTSREGVKTENNDH1 
NLKVAGQIX3SWQFKIKRHTPLSKLMKAYCERQGLSMRQIRFRFDGQ 
PINETDTPAQLEMEDEDTIDVFQQQTGG 

1029 

A 

1 

2148 

MVKG S I QQEELT I LNR YASNTGAPRF I KQVLSDLQRDLDSHTL IMGD 
FNTPLSTLDRSTRQKVNKDTQELNSALHQVDLIGIYRTLHPKSTEYT 
FFSAPHHSYSKIDHILGSKAIiLSKCKRTEIITNYLSDHSAIKLELRI 
KKLTQNRSTTWKMNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQN 
LWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSK 
ASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARL 
I KKKKRE KNPIDAI KNDKGD I TTDPTE I QTT IRE Y YKHL YANKLENL 
EEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGP 
DGFTTESYQRYKEELVPFLLKLSQSIEKEGILPNSFYEASIILIPKP 
GRDTTKKENFRPISLMNIDAKILNKIIiANQIQQHIRKLIHHDQVGFI 
PGMQG W FN I CKS I NVI QHRNRAKDKNHM 1 1 S I DAEKAFDE I QQCFML 
KTLNKLEPSNHTALAGVGVHQEDS I AALAYDLDENHNFSQALAAPQD 
SHIIQSCRKNQLSAPFSAAGSAVHAAPMHSCMASSPYRVRTRHLFCE 
AAAEETPVFTMANEKPTEEVKTENNNHINLKVAGQDGS VVQFKI KRQ 
TPLSKLMKAYCEPRGLSMKQIRFRFGGQPISGTDKPAQLEMEDEDTI 
DVFQQPTGGVY 

1030 

A 

193 

564 

GDSGGSPWPDEKPKEGVKTENNDHINLKVAG\Q\DGSWQ\FKIKRH 
TPLSKLMKAYCE\RQGLSMRQIRFRI*PGNPIHETDTPAPVGKWKDE 
RYQLMVFQQARPGGVYLKKGTCFFYSKNS VS F 

1031 

A 

216 

372 

SG I C S CTDLTMTTVEEDSDTVTVKTVNS VTLTQDTEGNL I LHCPQNG 
RRTC 

1032 

A 

566 

788 

EPKSQPDQHGETPSLLKIQKLAGRGGTCLYSQLLRRLRQENHLNPGG 
RGCSEPRSHHCTLAWATVRIiPLKKKKK 

1033 

A 

2 

402 

QTPDLLPEALEAQVLPAKRLRSSEEPTEKEPPGQLQVKAQPQARMTV 
PKQVRSRDISREEWKGSETYSPNTAYGVDFLVPVMGYICRICHKFYH 
SNSGAQLSHCKSLGHFENLQKYKAAKNPSPTTRPVSRRC 

1034 

C 

379 

453 

MNIYLYACLLYVSVYRARGSTRKFE 

1035 

A 

2465 

2796 

APRPVSWEVSRGSPDPRPTLACFQVRSRDISREEWKGSETYSPNTAY 
GVDFLVPVMGYICRICHKFYHSNSGAQLSHCKSLGHFENLQKYKAAK 
NPSPTTRPVSRRCAIN 

1036 

A 

1264 

1385 

I ISSRGAYHNQDLPKNSQAWWLVPAVPAAWEAEVGGSLEA 

1037 

A 

1249 

1457 

YWLSKSLLLLLLIiLFLRQSLAVTQDRVQWHDLGSLQPLLPGFKQFSC 
LSLPSRWDYRCPPHSRLIFVLL 

1038 

A 

146 

43 9 

PGLIQEPSGSMDTAVIVAIRGIiATIFLASFANLGLVFRQRYCRPRDL 
LQRYNS KPI VDL I GAMETQTEPS E I ELDDVG 1TNPHIDAPLENEKRD 
QIC 

1039 

A 

2 

2387 

SMHQQLCQELQRDNVDLFVQSSLSAKERHLAAVASALWRHFFSFLKS 
QRMSQVVPFSQLADAAADFTLLAMDMPSTAPSDFQPQPVI S I IQLFG 
WDD IIC PQ WARYLSHVLQNS TLCEALSHSGYVS FQALTVRSW I RCV 
LQMY I KNLSG PDDLL IDKNLEEAVEKE YMKQLVKLTRLL FNLS EVKS 
I FSKAQVEYLS I S EDPKKALVRFFEAVGVTYGNVQTLSDKSAMVTKS 
LEYLGEVLKY I KP YLG KKVFS AGLQLTYGMMG I LVKS WAQI FATS KA 
QKLLFRIIDCLLLPHAVLQQEKELPAPMLSAIQKSLPLYLQGMCIVC 
CQSQNPNAYLNQLLGNVIEQYIGRFLPASPYVSDLGQHPVLLALRNT 
ATIPPISSLKKCIVQVIRKSYLEYKGSSPPPRLASILAFILQLFKET 
NTDIYEVELLLPGILKCLVLVSEPQEIQTTIGEYYKEliYTNKLKNLE 
EMDKFLDTYILPRLNQEEVEHLNRPITGIEIEAVINSLPTKKSLGPD 
RFTTDFYQSYKEELETFLVKLFQSIEKEGILPNSFYEAS IILILKPG 
RDTTTKKNFRSISI^INAKILNKILANRSQQHIKKLIHHDQVGFIP 
GMQGWFNICKS INI IHHINRTNDKNHMI ISIDAEKAFNKIQQYFMLT 
TLNKLGIDGTCLKIIRAVYDRPTANIVIiNGQKLETFPLKTGIRQGYP 
LSSLLFNIVLEVLARTVRQEKAIKRIQLGKEEVKLSLFADDMIVYLG 
NPI VSVQNLLKLI SHFSKVSGYKI SVQKSQAFLYTNNRQRDKS 
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1040 

A 

1 

1329 

MNRGS TKDFGGSGNAli YD I AM I DAE I WD YVKRPNltRL I G VPDSHGEN 
GTKXENTLQDIIQENFPNLARSWFFEKINKIDRPLARLIKKKREKNQ 
I DTI KNDKGD I TTD PTE I QTT I RE Y YKHL YTNKLENLEEMDKFLDTY 
TLPRLNKEEVESLNRPITGSEIEAIINSLPTKKSPGPDRFTDEFYQR 
YKEELVPFLLKLFQSIEKEGILPNSFYEASIILIQKPGRDTTKKENF 
RPRSLfWIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQGWFNIR 
KSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLOIDG 
MYLKI IRAIYDKPTANIILNGQKLEAFPLKAGTRQGCPLSPLLFNIV 
LK\HLiARTIRQEKEIKGIQLGKEEVKLSLFADDMIVYPENPNVSTQNL 
LKL I SNFSKVSGYKINVQKS 

1041 

A 

1 

283 

RQLLEQSRHSWVNTTVLITGCTNAAGLLVVGNFQXLVLGGVFFVHES 
SQLQHGAALCEVA^CVIDIIilFYGTFSYEFGAVSSDTLVAALHLAPGP 

1042 

A 

1 

1296 

MLFPKETSGFGPGQFPCHLFLMLCTKWEKGEDGDCSNLKALDDQGVS 
S S QMVE S I S FPP PPNDC VPPAAQDFGGLLRG PQLES APRGGRAGCGG 
GGALRGGRRWSGGCLRAADSPPLHWAGTQPLLGVASSWQPFSQPWS 
LRRRVTPFQMLRSACVSQHAGGIWVDRGGPQCQRVFTFCRGLSPNFG 
RSETQRERWIRSYNESCPPDPAEQGGPKTCCTLDDVPLISKCGSYPP 
ESCLFSLIGNMGAFMVALICLLRYGQLLEQSRHSWVNTTALITGCTN 
AAGLLWGNFQVDHARSLHYVGAGVAFPAGLLFVCLHCAIiSYQGATA 
PLDLAVAYLRSVLAVIAFITLVLSGVFFVHESSQIiQHGAALCEWVCV 
IDILIFYGTFSYEFGAVSSDTLVAALQPTPGRACKSSGSSSTSTHLN 
CAPES I AMI 

1043 

A 

301 

888 

WLWQKHLKVAGPPPLSHLPLVLPPTPPPPLAPFLMTAWILLPVSLSA 
FSITGIWTVYAMAVMNHHVCPVENWSYNESCPPDPAEQGGSKTCCTL 
DDVPLISKCGSYPPESCLFSLIGNMGAFMVAXrlCLLRYGQLLEQSRH 
SWVNTTALITGCTNAAGLLWGNFQVDHARSLHYVGAGVAFPAGLXiF 
VLLHRALF 

1044 

A 

3 

581 

AS AHR LLQHYQLH SAAVNGLS FH PS GNYL I TAS SDS TLKI LDLMEGR 
LLYTLHGHQGPATTVAFSRTGEYFASGGSDEQVMVWKSNFDIVDHGE 
VTKVPRPPATLASSMGNLPEVDFPVPPGRGRSVESVQSQPQEPVSVP 
QTLTSTLEHIVGQLDVLTQTVSILEQRLTLTEDKLKQCLENQQLIMQ 
RATP 

1045 

A 

1 

741 

PLTRAAGIRHEDSQSVGNSSPEIPVLPEPAYQLGPLCQVLPRRAGSS 
CLPVMTRTVKLWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGM 
DNTVKVWDVRTHRLLQHYQLHSAAVNGLSFHPSGNYLITASSDSTLK 
ILDLMEGRLLYTLHGHQ\GPATTVAFSRTGEYFASGGSDEQVMVWKR 
NFDI GDHGEVTKVPRPPGTLAS SMGNLTVS I LEQRLTLTEDKLKQCIi 
ENQQLIMQRATP 

1046 

A 

526 

1272 

L S T P IGTC I AAAGMDNTVKVWDVRTHRLLQH YQWLLELKPTWGFGLW 
RQLEPFVLRWLTLLVHSAAVNGLSFHPSGNYLITASSDSTLKILDLM 
EGRLLYTLHGHQRHIQCAGLVGLFSLGQHIQRGWLLALVCPRGFALP 
GALVYMSMCTALCQACPSSHLKGLTPGGWLVDQETRWALVPVPYLR 
LWKVLGPATTVAFSRTGEYFASGGSDEQVMVWKSNFDIVDHGEVTKV 
PRPPATLASSMGNL 

104 / 

A 1 

57 

307 

LF I S KFKY S I YYFVQMLWENG I NG I LADEMGLGKTVQC I AT I ALM I Q 
RGVPGPFLVCGPLSTLPNWMAEFKRFTPDVRHASLC 

1048 

A 

1669 

1820 

I FK I CNLKYL F I TG ETLE I PDTVMGLTLLAAGTS I PDT I AS VIiV AR K 
GKN 

1049 

A 

212 

548 

IQGQSSHPSCPMSSQANKPGAAHTPDPCKVQTTAETRRIALRSKKSA 
j\s»ztct v iKjtXjiuwk' ± 1oUI^lj±QETTF PPLPSCVLSSHSPISSSDHSGC 
STWVGEQGSHRPEGTTLN 

1050 

A 

670 

1296 

FHMLFQSNEAHVMEFPSEKKNNFLVSEDHGQKILSVLQ1TFREQNVFY 
DFKIIMKDEIIPCHRCVLAACSDFFRAMFEVNMKERDDGSVTITNLS 
SKAVKAFLDYAYTGKTKITDDNVEMFFQLSSFLQVSFLSKACSDFLI 
KSINLVNCLQLLSISDSYGSTSLFDHALHFVQHHFSLLFKSSDFLEM 
NFGVLQKCLESDELNVPEEEK 
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1051 

A 

243 

335 

TCMFISAQFAIAKSWNHPKCPS INEWIKKL 

1052 

A 

1 

1170 

MEHVYNIFSSNIKKEWGRKMYCDKKHHTTKRNCPNKLRQNSGKWSK 

DTAFCTLAAVAFCRRGFLPPRL.SAPPPPPPLLFAPPPPWLFARRGFL 
PPAVTAFCPPAAATFSSPRLFAAAAVAAAFCARGFLPLLPPPFAPAA 
AAFCPRRRGFLLPRLFAPVAAAFCHRDFLPSPPRIFVPAAAALRAGA 
AGSAASSTG^^ J ARAAPRGAPGPCGLHRAAI J PAEAMGEKKDGGGGGDA 
AATEGGTGAAASRALQQCGQLQKLIVIFIGSLCGLCTKCAVSSDLTQ 
QE IQTLENSVEI KPAS PRFS AVXiMPNKLSPLHAKIGLKP ICYPS YTS 
S I KVQKCDFEYARV 

1053 

A 

1 

1122 

MNTR KKEG KKE KE RE R ERKKG E RE EG KE E ERM E F I D Y S S FTCHGG AY 
LRLFYVSASVLFRLAGFLPHRRHSFLHPRRRGFLPPRLSAAAAFCPP 
AATAAAFCPTAAVAFCPPRLFTRHGVLPAAAFCPPLSRLFAPPPQRL 
FPRRGFLPPPLLPPLFAPAAFCRCCRRLLPPPQRLFVPAAAAFCYRD 
FLPRSP PLFATATFC PRRRG FLS PPAAALRAGAAGS AAS STGVLARA 
APRGAPGPCGLHRAALPAEAMGEKKDGGGGGDAAATEGGTGAAASRA 
LQQCGQLQKLIVIFIGSLCGLCTKCAVSSDLTQQEIQTLEGRDDPEK 
AQVAAGTKSDIVPPVHTPWLCHHAEDRFKGLASLPAKLKEHKLRN 

1054 

A 

1 

392 

SRRPDPPGCRTMANLERTFIDVKPDGGHRSLVGEIINRFEQKGFRLL 
AIWFLRAFEKHLKHHYIDI^RPCFPGLVKYMNSRSVAAMVWEGLNA 
LKTGPGMLVETTPSHSMPGS IRGDFCIQVGTNI IHG 

1055 

A 

47 

296 

PEDTGKS I AKM PG P P E S LDMG PLTFRD VA I E FS LEE WQCLDTAQQDL 
YRKVMLENYRNLVFLGENNFNTKSLIYPKVFISLFS 

1056 

A 

1 

315 

RLHCSVSSAPRGPASGPVTCRYWESTAKTPGHPGSLEMDNLRYGVYP 
VKGASGYPGAERNLLEYSYFEKTEFRSCCPGWSAMARSQLTSTSASR 
VQAILLPQPPE 

1057 

A 

1 

579 

MA I LGRAQWLML V I PALWEAEAR SLSGEDG FRD VAGFVR F FVFGCRW 
NSGSRLHCSVSSAPRGPASGPVTCRYWESTAKTPGHPGSLEMDDLRY 
GVYPVKGASGYPGAERNLLEYSYFEKVWGLVRFTLEIiFHTNEEEREY 
SKVTEEVTEHVYLPAKEKAAKEGEVHPYPSAHPPYYFEEKDPPDLSF 
PEDTG 

1058 

A 

87 

509 

PAGIGRSTAKTPGTPGSLEMENLKSGVYPLKEASGCPGADRNLLVYS 
FYEKGPLTFRDVAIEFSIiEEWQCLDTAQQDLYRKVMLENYRNLVFLiA 
GIAGSKPDLITCLEQGKEPWNMKRHAMVAKPPVICSHFPQDLWAEQ 

1059 

A 

866 

1002 

ARFMLFIFNKAGIAVSKPDLVTCLEQGKDPWNMKGHSTVVKPPGR 

1060 

A , 

385 

573 

DPFQKMAPKVKKEAPGPPKAEAKAKAIjKAKKVV^ 
. SPTFQRPKTLRLWMPP 

1061 

A 

328 

530 

NPSILVTEARECLYAQAVTKHKPHTILLLNFLFRAPPRLECNGMISA 
HCNLHLPGSNDSAPASHIKL 

1062 

A 

6 

666 

LPGRPTRAPTRPAEHSIVGTRLVSCQLQPSQPNADQGKLTTMRIAVI 
CFCLLGITCAIPVKQADSGSSEEKQLYNKYPDAVATWLNPDPSQKQN 
LIJVPQTLPSKSNESHDHMDDMDDEDDDDHVDSQDSIDSNDSDDVDDT 
\ DDSHQSDESHHSDES \D\ ELVTDFPTDLPATEVFTP WPTVDTYDG 
RGDSWYGLRSKSKKFRRPDIQYPDATDEDITS 

1063 

A 

2 

613 

PRVRPRVREEAEHSIVGTRLVSGQLQPSQPNADQGKLTTMRIAVICF 
CLLG I TCA I P VKQAD S G S S EEKQLYNKYPDAVATW \ LN PDPSQKQNL 
LAPQN\AVSSEETNDFKQETLPSKSNESHDHMDDMDDEDDDDHVDSQ 
DSIDSNDSDDVDDTDDSHQSDESHHSDESDELVTDFPTDLPATEVFT 
PWPTVDTYDGRGDSVG 

1064 

A 

1 

1073 

TDCRVDPRVRPRVRVEHSIVGTRLVSCQLQPSQPNADQGKLTTMRIA 
VICFCLLGITCAIPV^QAES\GSSEEKQLYNKYPDAVATWLNPDPSQ 
KQNLLAPQ\TLPSKSN\ESH\DHM\DDM\DDED\DD\DHVGTARDSI 
GLGTTLDGCRMDTGWIFHQF*WSLHHFVmESGWNWSLDFSPRDLAQA 
TEVFQFQ F VP\ TVDT YDGRGDS WYGLRS KS KKFRRPD I QY PDATDE 
\DITSHME\SEELNGAYKAIPVGPDPDAPSDWDSRGKDSYETSQLDD 
QSAETHSHKQSRbYKRKANDESNEHSDVIDSQELSKVSREFHSHEFH 
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SKEDMLWDPKSKEEDKHLKFRISHELDSASSEVN 

1065 

A 

1 

1128 

LETPIDSPRNRPGNPGATHASGRRQSTASSGPDSVSGQLQPSQPNAD 
QGKLTTMR I AVI C FCLLG I TCA I P VKQ AD S G S S EE KQL YNKY PD AVA 
TWLKP * P I FRRQNLLGPTEWLCPLKETNDFKQGGPFPS * GPTERPWT 
TWDDMG* WKVDGWTMWDSQDS IDS \NDSDDVDDT\DDSHQSDESHHS 
\DESDELVTGFSTDLPATEVFTPWPTVDTYDGRGDSWYGLRSKSK 
KFRRPDIQYPDATDEHITSHMESEELNGAYKAIPVAQDLNAPSDWDS 
RGXKDSYETSQLDDQSAETHSHKQSRLYKRKANDESNEHSDVIDSQE 
LSKVSREFHSHEFHSHEDMLVVDPKSKEEDKHLKFRISHELDSASSE 
VN 

1066 

A 

514 

1000 

WYVQPAAQEGSQCWHLEGPRPCMGAPKSMGRCHCSRQHRPTVCYHSL 
ALCNKQVNKVIRRLHCCLNLLSLTKHLFMLKSLLHVFFCEFFCVICS 
FIYIiYGFCLEENVNFNLVKCIKFYLHSLWISFFLSSPEIDLPYYMII 
LKILQFFSSTFIVSFHFVFVL 

1067 

A 

1 

1098 

MG FHHVGQ AGLELL I S S D P P ALAS QG AG I TG S W KW I LAP V I L Y I CER 
ILRFYRSQQKVVITKVKNMHFPLAVIRVIFYRTTELNFYVVTLKEEL 
GNWAQWVVVMHPSKVLELQMNKRGFSMEVGQYIFVNCPSISLLEWHP 
FTLTSAPEEDFFSIHIRAAGDWTENLIRAFEQQYSPIPRIEVDGPFG 
TASEDVFQYEVAVLVGAGIGVTPFAS ILKS IWYKFQCADHNLKTKKI 
YFYWICRETGAFSWFNNLLTSLEQEMEELGKVGFLNYRIiFLTGWDSN 
IVGHAALNFDKATDIVTGLKQKTSFGRPMWDNEFSTIATSHPKSWG 
VFLCGPRTLAKSLRKCCHRYSSLDPRKVQFYFNKENF 

1068 

A 

2388 

2658 

FYKVTFMWKSKVKSLGDNFVLGSVVPCFLFFFFFFLRWSLALLPRLE 
CSGAISAHCILCLPGSSNS/PASASQWGITGTCHHAWIjIFLYF 

1069 

A 

788 

1001 

DSFMPLSQLYLIYFFSRPSLALLPRVECSGVISAHCNLRLPGSSNSP 
ASASQVAGITGTCHHAWLMFCIFS 

1070 

A 

306 

1195 

LKSPSRRSWEPQRLTWHATMEKSPKSEVVITTVPLVSEIQMMAATGG 
TELS CYRC 1 1 PFGVGAF I AG IWTALG I CGRFRENLE YKMKKGNEGQ 
FWGTSVRHRTPSASWLEKSGKKQSQCHGIHAMIKRYPQSIPDYTAW 
ASSRIWEEMDDTEGVLERLPWQQTPDLKPDMQSDFLPVRVCQYQSWL 
LFAHNSKTEGLLGKFGHPSCGIVQRFFGEKKETWELYEQECNGETW 
YWEVKYPFPMSNRDVSFPQAAVYCREVELVSQQAQITLSLILAAAFA 
KDIASCMVFGELRV 

1071 

A 

3 

715 

FVAHTKGVRGLPSMRRS PDCGRMELAAGS FSEEQFWEACAELQQPAL 
AGADWQLLVETSG I S I YRLLDKKTGL YE YKVFGVLEDCS PTLLAD I Y 
MDSDYRKQV^QYVKELY\EQECNGET\VVYWEVKYPFPMSNRDYVYL 
RQRRDLDMEGRKIHVILARSTSMPQLGERSGVIRVKQYKQSLAIESD 
G KKG S KVFM Y YFDNPGGQ IPS WL INW AAKNG VPN FL KDMARACQNYL 
KKT 

1072 

A 

1 

1128 

MHEIWCRDSDQETSLGRSIPCPAAIiCSVRKIHVQPQVLRPTSPRNIS 
PISNLRLKSDAASSPWKPPGPSRTPSFGLLCVFLSYPPTLTTSIGHL 
YSFPGNRSHAHYHPTKPNHPYPTQRQYPIPPHALKGLKPVITDLLQH 
RLLKP INS PYNS P I LPVQKPDKS YRLVQDLRLTNQI VLP IHP WPNIi 
YTLLSS I PS STTHYS VLITLKDAFFTI PLHPSSQPLFAVTWTDPDTQQ 
SQQLTWAVLPQG FRDS PHYFSQALSHDL»LS FH PS ASHL I QY IDDLLL 
CSPSFESSQQDTLLLLQHLFSKGYQSDDRPAFTSQITQAVSQALGIQ 
WNLHIPYHPQSSGKVEWTNGLLKLLLLLLTANIDDTSYLQIPHKGLQ 

1073 

A 

3 

862 

RLHPGE INSHVAHTKPVWFHTDAHE I WCCDSDRGTS LGRS I PCPPV 
LCSVRNIYLQPQILRPTSPRNISLILNQTKETRFICGPKTPAPVTDW 
DGSLPLVFNHCRDASLIVHPRFKGVRARRDACLGPSPLAASPAFLGK 
GQAAPRQAEU3PNASSASAPPPYNLFIASPPHTWSGLQFRSVTSPPP 
PAQQFTLKKVAGAKG IVKHALKRLKPV I TR LLQHGLLKP INS PYNS P 
ILPVLKPDKPYKLVQDLRLINQIVL.PIHPVMPKHTDGYQNVGPNITQ 
CVIH 

1074 

A 

1 

912 

MTDNILELAQNMDKYTKYEMTTTSILSQPSSSQREQDGQGFQELTVT 
SEMFRKGKGS FCSHPHPEKFLRTFNE I ET YLI GNFQDLELES SDDLP 
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RGCTNEKARKTYDPKKLPLPYT/VRPCWILASKLHI*ESYG*RQ*A* 
*CHQ*TPWPRPVWWSLHTEAHEIWCRDSDQGTSLGRSIPCPPVLCS* 
ERS TYDLR\ PQTDQPS KHLTNLKS ASTPPP YNPF I TS PPHTRSGLQF 
RS TS S PPAPAQQFTLKKVAEAKG I VKVNAPFSLSDLSQ I SVRLGS FI 
KYEKS SPVHGS FGSNPETLYS PRP 

1075 

A 

611 

817 

LSQWKHALKRLKPVITCLLQHGFLKPINSPYNSLILPVLKPDKPYKL 
VQDLRLINQIVIiPIHPMVSQA 

1076 

B 

1 

1500 

MRTREVSITGADFTALLVDIIGNSTSYLTEIFKSTSILSVNQSNESD 
CIFICVMTGKSGRNLSDFWEIEEKYPIINYTFTSGLSGVLALLLTQS 
LFGGLFTRTRMKFGAVTRIGGPPLGNQSPSSCSLLHEKDPPTTSGPQ 
TDQPKKHLTNFKSGTEEAMNTTSLLAPAAEIMATPGSPSQASPTSGA 
FTHGTQTPSPTKATAPRYPQTGAQSRRPRRFRRPPAGAAPKAAAPRH 
PHPRGTAPPPRRISPESIRPHPPPPLDRGPRPPVTPFLIVLGCLILA 
VLTTFKEYETVSGDWLLLLETFAIFIFGAEFALRIWAAGCCCRYKGW 
RGRLKFARKPLCMLD I FVL I AS VP WAVGNQGNVLATSLRS LRFLQI 
LRMIiRMDRRGGTWKLLGS AI CAHS KEL I TAW Y I GFLTL I I>S S FLVYL 
VEKDVPEVDAQGEEMKEEFETYADALWWGLITLATIGYGDKTPKTWE 
GRLIAATFSLIGVSFFALPAGILGSGLALK* 

1077 

A 

606 

1065 

LVARTERLSVSQGFLPWCTGR IRSHVGLENECKVLLSGS SSQKMGKP 
EGRWFSPGVGPPPCLAAPALLRLPWKSPPHPTGGWPASVPVPVGVLF 
CQRAPLDDQLLVCWPARVLEKRYWQPPLS * LCPSCAHHMNGYGSGAY 
VWVLTELTVD FAG F W A 

1078 

A 

1 

822 

MWNAVTLWQQRES C IEEESEI GTLETKETH F I RG PKTLAP VTDWEGS 
LPLVFNQCRDASLIIHPRFKGFRPRRDACLGPSPLAAS PAFLGKGQA 
APRQAELGPNSSSASAPPPYNPFITSPPHTWSGLQFHSMTSPPPPAQ 
QFPLKKVAGAKGIVKRLKTDTARLPWKPPDHHRRRASGNSHSGRVQP 
P/ CPAAFVGSC * VSQAFPGARCKLS VDLP FWDLED\ RVFMCVR VKRP 
PNRLCVSNMAVYFTWVQLLQAI WAYTCKSQGMRWLGLGS EA 

1079 

A 

38 

639 

MTLIKSPIVIWTIRSRLKWSQMEMRNLLGTGAKRQRRHVLSVDPKLR 
RWSRTGKAAFPWCLI IAGRP7V1,VLHP*QQVLLSWGRGKI *LTSPSRC 
TIIEKSCNSWPPL*DKPQPHLQHTRTSKRLNRSGQAFLQNLLPQELA 
TSTRN/ PDHQAKECLQPRIPPKPCPICAGPHWKLDCSTHLAATPRAP 
GTLAQGSLTDSFSA 

1080 

A 

1 

1611 

MGLVHLQRQS YWTVQRS S INHFLGCNPHTGKDKTL YMRWTKS FRKVE 
DPPTTSGPQTDQPKKHLTKFKSETKETRFIHGPKTPAPVMDWEGSLP 
LVFNHCRDASLIIHSHFKGVRPRRDACIiGPSPLAASPAFLGKRQHAL 
KRLKPVITHLLQHGLLKPKNSPYNSPILPVLKPDKPYKFVQDLCLIN 
QIVLPIHPMLYSLFVESPTITIVSGPDFNPASHIIPDTTPDPHDCIS 
LIHLMLSPFPHISFFPISHPDHTWFIDGSSTRPNRHSPAKAGYAIIS 
STSIVEATTLPPSTTSQQAKLVAIjTQT^LTLAKGLRVNILLILNMPFI 
FGTTMQSYGLKEGLFRSPPFPTHQARGFAPAQDWQIDFTHMPRVRKL 
KYLLVWVD/VFTGVA^EAFPTGSEKATAVISSLLSDIIPRFGLPASTQS 
DNGPAFTSQITQAVSQALGIQWNLHIIYRPQSSGKVERTNGLLKTHL 
TKLSLQLKKDWTVLLTLALIiRIRACPRDATGYSPFELLYGHTFLLGP 
ML I PDTS PLGNYLPVLQQAR 

1081 

A 

17 

1173 

MADSRIKRTWMRMKFGAVTRIGGTSLGRS I PCPPALCS VKKIHLRPR 
VLRPTSPRNISPILN/TARFKRIKACYYSPATAWPFKAYKLPLQFPH 
FT/ WS *NQTRLTA* FS * KHTCSP * LSSPANLPNPNPFYKTTTPFLPR 
HG**GQ\lLTQELGPRPIAFLSKQIiDLTVLTQPSCLHAAAAAAIiILL 
KALKI TKYAQLTL YS SHNFQNLFS S S YLMH ILS APWLLQLYS LFVES 
PTITIVPGTDFNPASHIILDTTPDPHDCISIjIHIiTFTPFPRISFFPV 
PHPNHIWFIDGSSTRPKCHSPAKAGYAIVSSTSIIEATALPPSTTSQ 
QAELVALTQALTLAKGLCVNIYTDSKYAFHIVHHHARS FLTKQGSSI 
INPSLIKTLLNAA 

1082 

A 

324 

548 

SFYHLPSSHWV1,LTVSFRD*PSPTCPAIYS*KGGWSQRHSQGACYKC 
QKSGHWAKECPQPRIPPKLRPICVGPH 
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1083 

A 

760 

1260 

HTDGVLVWMSFLFVSFPSNSQDPQLQLCWSLLEVHSRSCLPGYQQWR 
LSWKCRNHSSSASLTLRAVDWSCSYSAILEPRWYCLLYFIQSIILKK 
HRGRRW I FLMEQRTGGQR I DLPRGGP P I * VTA PNLMHVRVKRP PNRL 
CVSNKAVYFTSKSGPLSQDWTWIH 

i084 

A 

908 

1192 

LFSGNFIPKSFFSLISPTVGSHAAPNPTRSSPEKHRPLSLHTTPQKF 
RCPNTSPLFCFVFLCPSAGMLTFHVKRGCDFTSCASLNPEGTPPVEM 

1085 

A 

1 

420 

WVLGLADFKNEAADPRETKETHFIRGPKTPAPVTDWEGSLPLVFNHC 
RDASDYSPTFQRCQTMQGRLPWSFTLSGKSRFSRGGARACYKCQKSD 
HQAKECLQPRIPPKPCPSVRDPTGNRTVQLTWQALPEPLEFWPKAL 

1086 

A 

287 

515 

LFTHVSKELATSARNLTTRPRTAGSPGFLLSHVPSVWDPTANRTVQL 
TWQPLPEPLE\SGPRLSD*LLPRSSRLSG 

1087 

A 

1 

5127 

MLFWAFDKIQQPFMLKTLNKLGIDGTYLKIIRAVYDKPTANIILNGQ 

KLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKB 

EVKLSLFADDMIVYLENPIISTQNLHKLISNFSKVSGYKNNVQKLQA 

FLYTNNRQTES Q IMSELP FT IAS KR I KYLG I QLTRD VKDLFKENYKP 

LLKEIKEDTNKWKNIPCSWIRRINIVKMAILPKRTPLLSLPSTLYEE 

LWRLRSYFPLCPQLYNMEQNTTDWVIYKEQCIWLTALETGESKSTAP 

AFGKGYPMAEYIIWGRLTPHMAGYSSETKLPEERSGSNICCSPISAV 

LQPLLL I PRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKGHPHQNPI 

CTSPSSKTKGSNSHITILTLNWGLNAPIKRHRLANWIKSQDPSVCC 

IQETHLTCRDTHRIKIKGWREIYQANGKQKKAGVAILVSDKTDFKPT 

KI KRDKEGHYMMVKGS IQQEELTTLNI YAPNTGAPRFI KQVLRDLQR 

DLDSHTLIMGDFNTPLSTLDRSTRQKVNKDIQDLNSALHQVDLIDIY 

RTLHPKSTEYTFFSALHHIYSKIDHIVGSKAIiLSKYKTTEIITNCLS 

DH S A I KLELR I KKLTQNRS TTWKLNNLLLND YWVHNKMKAE INTLFE 

TNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK 

ELEKQEQTHSKASRRQEITKIRAKLKEIETQKILQKINESRSWFFEK 

INKIDRPLARPIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYK 

HLYANKLENLEEMDKFLNTYTLPRLNQEEVESLNKPITGSEIEAIIN 

SLP I KKSPGPDGFI AKFHQRYKEELEAS 1 1 L I PKPGRDTTKKENFRP 

I PLMNIDAKILNKILANRIQQHI KKLIHHDQVGFI PGMQGSFNICKS 

INVI QD INRTNDKNHM 1 1 S I DAEKAFDKI QQPFI LKTLNKLG I DGTY 

LKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLE 

VLARA I RQE KE I KG I QLGQEE VKLS L F ADDM I VYREN P I VSTQNLHK 

LISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIK 

YLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVK 

MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSIL 

SQRNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIM 

PH I YNHL I FDKPDKNKKWGl^S LFNKWC^ENWLA I CRRKXjKLDPFLT . 

PYTKINSRW I KDLNVRPKT I KTLEENLGNT I QDI GMGKD FMSTTPKA 

MATKAKIDKWDLIKLKSFCTAKETTITVNRQPTEWEKIFTIYPSDKG 

LISRIYKELKQIYKKRTNNPINKWVTOMNRHFSKEDIYTAKRH^ 

SSSIiAIREMQIKTTLRYHLI PVRMAI IKKSGIJKRPSSNVFSVKIjSEI 

QSLALIETRCLMNADEQGVPLIQEMLESWASICKVPKYPNAEAVRPE 

EQTPHLHPGEINSHVAHTKLVWWSLHTDTHEIWCRDSDRGTSLGRSI 

PCPPALCSVRKIHLRPQVLRPTSPRNISPISNPHTRTSKRLNRSGQA 

FLQNLLPQELATSARNLTTRPRNACSPGFLLSRVPSVRDPTGNWTVQ 

LTWQPLSEPLELWPKAJL 

1088 

A 

3 

721 

TRMKFDAMTQLGDLPWEINPLSSCSLLREKDPPTTSGPQTDQPKEHL 
TNFKSTTNHAPLTI PLKPNHS YPAECQYPVPQHALKGLKS I ITRLLQ 
HGLLKPINCPYNSPILPVLKPDKAYRLLQDLRIjINQIVIjSIHPVXiPK 
PNTLLSSIPPSTTHYSVLDLKHAFFTIPLHLSSQPLFAFTWTDPDTH 
QAQQ I TWAVL PQG FTD S PH YFNQAQ I S S S S VTYLG 1 1 LM KTHVLS LL 
IVSG 

1089 

A 

144 

410 

GYEIWCRDSDRGPSLGRSIPRPPALCSVRKIHLRPQVLRRTRPRNIS 
PISNLVSGLFLLSSPTSLTIPQPLSSFNLDDTLQSLPSLNF 
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1090 

A 

265 

769 

RQKRHVLSVDPKLRRWSRTGKAAFPWCLIIAGRPALVLHP*QQVLLS 
WGRGKI*LTSPSRCTIIEKSCNSWPPL*DKPQPHLQHTRTSKRLNRS 
GQAFLQNLLPQELATS TRN/ PDHQAKECLQPR I P PKPCP I CAGPHWK 
LDCSTHLAATPRAPGTLAQGSLTDSFSA 

1091 

A 

69 

634 

KQKRSTYNLRSSDPPAQETSHQFQIRDKGDTFYLWTQNSGAAHGLGR 
QPSLDV*SLQGHLSDYSPMFPRCQTMQGRLP*SFTLSGKSRFSGEGA 
STPQPLLHP*WQVPLFWGRGKYPSTPSSPLVASPAFLGKGQKPPRPS 
RMPSFG 

1092 

A 

1 

655 

MGATHPFELLTKMTSQGSDISGDLPWEINPLSSCSLLHEKDPPTTSG 
PQTDQPKKHLTNFKSETKETHFIRGPKTPVLVTDWEGRLPLVFNHSR 
DASLIIHPRFRGVRPRRDACLGPSPLAASPAFLEEGQVPQPLLSMSL 
TPSLLFWRRGKKPSTPSSPLAASPAFLEEGQVPQPHIS/ GA/LDPLF 
LHPNLL*LCTPTPFPLFWKTVRKYSSNNQKGE 

1093 

A 

756 

880 

LSQWRSDNGPAFISQITQAVSQAPGIQ*NLYIPYHPQSSGK 

1094 

A 

781 

1194 

F P KG G P P I * V I APN FMR VR VKR P PNR LCVS N KAVY FTWVQ VG ALCRL 
GA/PAPCIPAAPVP/VHGSEGPRYNSSRCLAELKP*ALAASMWYLSL 
KALGIESGRVSITAILINISSARKA/ SCVPLGSRILESLMLSTVRAL 
R 

1095 

A 

400 

687 

RQVLLFWGRGKYPSTPSPSPLAASPTFLGQGQELVTSARNLTTRPRN 
ACGPGFLLSHVPS VRDPTGNRTVQLTWQPLPEPLE\ SGPRLSD * LLL 
RXi 

1096 

A 

1 

883 

MASSAQLLGSSQETYNHSKRQRGGEMSHMAGARRKRERGEMLHTFKQ 
PDLMRWSSVCRKNKEKVGNSRKRRNVRYCFSRKFNGTSKVFESWQW 
WGEINSHyAHTKPVRWSLHTDAHEIWCRDSDRRTSLGRSIPCPPVL 
CSMRKIHLQAQVLRPTSPRNISPILNRRKRRHVLSVDPKLRHRSWTR 
EGSLPLVFNLCRDASLIIHPGFRGVRPRRDTCLGPSPLAASPTFLGB 
GARACYKCQKSGHQAKECLQPRIPPKLCPI\WRDPAGNRTVQLTWQP 
LPKPLELWPKVL 

1097 

A 

2 

551 

CGKVVmFLETFSMALTKMIilMIWTMKFRLRSSQMEMRNLLGTGISLE 
TWCPAS * PL* P * LKGDKI QLRPWLQRVQGQS IGS FQE VLGPWVLRNQ 
ELRFGNLCLYFAGCMEKPVCPGRSLLQGQGF/PWYVPVAVVGAKVHD 
VNLHMLSFPSKWKLHTCMKFGAVTWIRGPPLGDQSPVLLLFAP 

1098 

A 

1436 

1699 

RQVPLFWGRGKYPNPFSPCLYPFPAFPGTGSLLQVPEIYLTTRPRNA 
CSPGFLLSRVPSLRDPTGNRTVQLTWQTLPEPLELWPKAF 

1099 

A 

1099 

1250 

LVYLKVTGRMEPSWKTLCRILSRRTS PI * QGRPTFRFRKYREHHKDT 
PRD 

1100 

A 

303 

1413 

VRRQRSDRER SDARMVR FCNL YM * RKNP F I LH * LFR * TLRQTKPDS S 
A/V*MCQNLMTHSKSTEWKITK/QIFDGDGKTYQNVQQFIDEGNYTS 
GDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQDSAHLHPGEINSHV 
AHTKPVWWSLHMDAHEIWCRDSDRGTSLGRSIPRPPALCSVRKIHLQ 
PQVLRPTS PRN I S P I SNPGFCRFRNHHQTGFS PAGANQRGPLAATLS 
GPGGEGQSAVARLTGEKKNHPGAQYANRLSPRVGRFINAAGTTGFPT 
GKRAVSATQLMDFADFGTTTKQDFRLLGQTSVDRIiLQLSQGQAVKGN 
QLLPVS LVKRKTTIiAPNTQTAS PRALAD S LMQLARQVSRLES GQ 

1101 

A 

846 

1825 

APCDPYSCPPENNPPFSFTYPNPIKQPHPYLPSLTFFSDSAHLHPGE 
INSFIAHTKPVWWSLHTDTHEIWCCDSDRGTTLGRSIPCPPVLCSKR 
KIHLRPWSSDRPAQETSHQFQIHLLCVLLLYSPTLTLKYISTPSLAT 
DHAPLTISLKPNHPYPAQCQYPIPQHALKGLKPAITRLL.QHGLLKPI 
NSPYNSPILPVLEPEKIYRLVQDLRLINQIVLPIHPWPNPYTLLSS 
I P P S T I HY S VLDLKRAFFT I PL YPS S QPL FAFT WTD PDTLQAQQ I TW 
AVLPQSFTDSPHYFSQAQISSLSVTYLSIILIKTHTLSLLIMSD 

1102 

A 

2141 

2384 

AEQWPSVKILRQELATSARNLATRPRNARSPGFLLSCVPSVWDPTGN 
QTVQLTWQPLPEPLE\ SGPRLSD* PLPRCSRLSS 

1103 

A 

305 

1148 

TVLLLTQSLFGGLFIWTRMKFGAVTRIGGPPLGDQSPVLLLFALERQ 
RRHVLSMDPKLRCWSRTGKAAFPWCLIIAEMPDYSPTFQRCQTTQGR 
LPWSFTLSSKSRFSGEGATQRQYPIPQQALKGLMPAITRLLQHGLLK 
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P INS P YHS P I LPVLKPDKP YKLVQDLRL INQ I VLPIHPMVPNP YTLL 
SSI PAS TTH Y S VLDLKQAF FT I PLH P S S QPL FAFTWTD PDTHQAQQ I 
TW A VLPQAFTD S PH YFS Q AQ I S S S S VT YLG 1 1 L I KTHVL S LLFVFD 

1104 

A 

2779 

3182 

DKTQPHLLHTGTS KCLNCSGQAFLQNLLLQELATS ARNLATRPGNS C 
SPGFLLSHVPSVPDPTGNRTVQLTWQPLPEPLELWPKVLSRVMDYI* 
M\TC*STIPQNSAIVLTDLLLGVYIPSESKHARPKVVLWAH 

1105 

A 

2286 

4921 

PQVLRPTS PRN I S P I SNPGETKETRFI RGPKTPAPVTDWEGSLPLVF 
NHCRDASLIIHPRFKGVRPRRDACLGPSPLAASPDFLRKGQVPLNPF 
SPCLYPFSAFLGQGQEPLNPFSFTLSGKSRFPGAGAKGDVSGSQDNC 
GERVSRQTREQRSLHHRQDSAHLHPGEINSHVAHTKPVWWSLHTDVH 
EIWCRDSDWGGDLPWEINPLSSCVLCSVRKIHLRPQVLRPTSPRNIS 
PILNWWLKTDAARLPRKPPRPSLMPASRNFHSGGLQPHLIVALLPNP 
KPPS CLPLVS PHLNPQVWDI SNPSLATDHMP I T I PLKPNHPYPAQCQ 
YRIPQHTLKGLKPVITPLLQHCLLKPINSPYNSPILPVQKPDKSYRL 
VQDLRLINQI VLPIHPWPKPYTLLSS I PSS ITHYSVLDLKDAFFTI 
PLHPSSQPLFAFTWTDPDTHQSQQLTWAVLPQGFRDSPLYFSQALSH 
DLLSFHPSASHLIQYIHDLLLCSPSFESSQQDTLLLLQHLFSKGYQV 
S PSKAQI S S PS VTYLG 1 1 LHKNTRSLPADRVQIi I SQTPTPSTKQQLL 
S FLGMVGYFRL WI PGFAFLTKPL YKLTKGNLAD P IDPKS FPHSS FHS 
LKTALETAPTLALPDSSQPFSLHTAEVQGCAVGILTQGPGWHPVAFL 
SKQLDLTVLGWPS CLRAAAAAAL I LLEALKI TNYAQRTL YI AHNFQN 
LFSASFLRHILSAPQLLQLYSLFVESPTITNVPGPDFNPASHIIPDT 
TPDPHDCISLIHLTFTPFPHISFFPVSHPDHTWFIDGSSTRPNRHTP 
AKAG YAI VQANS PPLRTS H I L S I VEMYPQGNNFS VFHLLF YYS SG 1 1 
QAPSLP YTS S SG I CHRLGLANQR YSTCPESGN 

1106 

A 

1 

761 

HPLPSASLGLPSVSLGVSLCVRSALLEAVVPMLPKRRRARVGSPSGD 
AASSTPPSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFRVLDACS 
SEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPALLDISWLTESLG 
AGQPVPVECRHRLEVAGPRKGPLSPAWMPAYACQRPTPLTHHNTVLF 
GPSLQEALE I LAEAAGFEGSEGRLLTFCRAAS VLKALPS PVTTLSQL 
QGLPHFGEHSSRWQELL 

1107 

A 

1 

969 

MVNPDTGY INYDQLEENARLFHPKL 1 1 AGTS C YSRNLE YARLRKI AD 
ENGAYLMADMAH I SGLVAAGWPS PFEHCHWTTTTHKTLRGCRAGM 
IFYRKGVKSVDPKTGKEILYNLESLINSAVFPGLQGGPHNHAIAGVA 
VALKQAMTLEFKVYQHQWANCRALSEALTELGYKIVTGGSDNHLIL 
VDLRSKGTDGGRAEKVLEACSIACNKNTCPGDRSALRPSGLRLGTPA 
LTSRGLLEKDFQKVAHF IHRG I ELTLQ I QS DTGVRATLKEFKERLAG 
DKYQAAVQALREERGNAHVHSLYTLPPHYTJVIQWHAKGELN 

1108 

B 

1 

2175 

MVNPDTGYINYDQLEENARLFHPKLIIAGTSCYSRNLEYARLRKIAD 
ENGAYLMADMAH I SGLVAAGVVPSPFEHCHVVTTTTHKTLRGCRAGM 
IFYRKGGMAPLGTATLLQALFSLLFLSKSRDVPGTGADPGVMYVKRR 
PRGTDSGCGVLEPRRFLPSGMAFTKEEEYEEEPYNEPALPEEYSVPL 
FPFASQGANPWSKLSGAJCFSRDFILISEFSEQVGPQPLLTIPNDTKV 
FGTFDLNYFSLRIMSVDYQASFVGHPPGSAYPKLNFVEDSKNTVLGDS 
KEGAFAYVHHLTLYDLEARGFVRPFCMAYISADQHKIMQQFQELSAE 
FSRASECLKTGNRKAFAGELEKKLKDLDYTRT\TLHTETEIQKKANDK 
GFYS SOAI E KANE LAS VEKS 1 1 EHODLLKO I R S Y PHR KX. KGHDLC PG 
EMEHIQDQASQASTTSNPDESADTDLYTCRPAYTPKLIKAKSTKCFD 
KKLKTLEELCDTEYFTQTLAQLSHIEHMFRGDLCYLLTSQIDRALLK 
QQH I TN FLFED FVEVDDRMVE KQES IPS KP S QDRP PS S S LE ECP I P K 
VLISVGSYKSSVESVLIKMEQELGDEEYKEVEVTELSSFDPQENLDY 
LDMDMKGSISSGESIEGLGTEKSTSVLSKSDSQASLTVPLSPQWRS 
KAALLQPHPNPTHRLRSSSMEYKPDPPHRFSEPWPNLEPEANLSSPA 
VKDSVTKELQTARPSREQT* 

1109 

A 

2 

964 

DI I PLVMVNPDTGYINYDQLEENARLFHPKLI I AGTSCYSRNL\ EYA 
RLRK\ I ADENGAYLMADMAHI SGLVAAGWPS PFEHCHWTTTTHKT 
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LRGCRAGM I FY \ RKG VK\ SVG S PRLGKE I L YNLES L INS AVFPGLQG 
GPHNHA\ IAGVAVAlA KQAMTLEFKVYQHQWANCRALS E ALTELG Y 
KIVTGGSDNHLILVDLRSKGTDGGRAEKVIiEACSIACNKNTCPGDRS 

HT .PPCJfST.PT/^TPZXT.TQPfST.T.PTniPnirVJZlWPTWPriTPT TT ATnCnTPW 

RATLKEFKERLAGDKYOAAVOALREEVES FAS LFPL.PGL.PDF 

1110 


172 

375 

I WHHS F YNELRVA PE EH PTLLTE APLN PKANRE KMTQ VSML PQHTTb 
WGLGKLLLEGGY7VAQQREGRS 

1111 

B 

827 

1276 

MATAAWSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAG 
IHETTYNSIMKCDIDIRKX)LYA1>JNVLSGGTTMYPGIADRMQKEITAIj 
APSTMKIKI IAPPERKYSVWIGGS ILASLSTFQQMWISKQEYDEAGP 
SIVHRKCF* 

1112 

A 

144 

261 

FPKKINNFYFTFLQDPTGIFSLDKTIGLGTYGRIYLVS 

1113 

C 

122 

253 

MGWVGTATSPHPVAWRTRRPSSLRSLPSVRALWRTERRVPCG* 

1114 
nil 

/a 


J D O 

PRPRRRWAARRPRGRRAQVARRTARR I C PCGRPP PVRAPAADPWARR 
AWSTSRSPAGTE 

1115 

A 

336 

689 

PLRPPAAPRSRGIRAPGASALPPTPSLWLRASASSHHRVGTRCGPYP 
AHTCTTW S LLQPR PRRRWAARR PRGRRAQVARRTARR I CP CGRPPPV 
RAPAADPWARRAWSTSRS PAGTE 

1116 

A 

164 

370 

SPQDPQRQNQPESSQPTLRDGTAALGPASRSAGSWCSRVLKSLRSLR 
IKSVGNRIGPWKLRNVDTSGP 

1117 

A 

974 

1112 

NPE I PVCTTWSLLQPRPRRRWAARRPRGRRAQVERRTARRI CPCGR 

1118 

A 

179 

406 

FS SS IGS LRRQRRGMKTP FGKAAAGQRSRTGAGHGS VS VTM I KRKAA 
HKKHRSRPP\SQPRGNIVGCIIQHGWKDG 

1119 

A 

1 

1698 

MVNG P I QQDELTI LN I Y APNTG APRF I KQ I L SDLQRDLD SHT I KI GE 
FNTPLSTLDRSMRQKVNKDIQEVNSALHQADLIDIYRTLHPKSTEYT 
FFSGLHHTYSKIDHIVGSKALFSKCKRTEITRNCLSDHSAIKLELRI 
KKLTQIWSTCWKLNI^LL 

LQDTFKAKKKEETAAQQRLVGVRCGRGAAATATAAATRKAEDAGAKS 
AbKRHboLKRQKKGVRFVHLxPAIRDGGESAPP 

ASYFTEPQILARLSWGGRLLQALSQGMKTPFGKAAAGQRSRTGAGHG 
S VS VTM I KRKAAHKKHRS R PTS QPRGN I VGCR I QHG WKDGDE PLTQW 
KGTVLDQ V P VNP S L YL I KYDG FDC VYGIiELHRDERVS S LEVL PNR VA 

YITYEK1^PVLYMYOIj1VDDYKT)GDL.P. T LODSND QPT.ARR EPGEVTDSIi 
VGKQVE Y AKDDGS KRTGMVI HQVE AKP S VY F I KFDDDFH I YVYDL VK 
TS 

1120 

A 

264 

799 

WESDVGEGLRPPPPPPPPGRRRTQEPRARDAATVIFACPAALLETLI 
AYG SSSPS FCKHRAARPL I FLLHRLTAEATARCP I CALEARNPGRWG 
ICASWPGMKTPFGKAAAGQRSRTGAGHGSVSVTMIKRKAAHKKHRSR 
PTSQPRGNIVGCIIQHGWKDGDEPLTQWKGTVLDQLL 

1121 

A 

231 

353 

RMERKGRWNKKRKEEERRKTKGRQLFRIEGVKSVFFGPDF 

1122 

A 

1 

3654 

MDKFLDTYTLPRIiNQEEVESIjNRPITGAEIVAIINSLPTKKSPGPDG 
FTAKFYQS KI LANQIQQH I KKLTHQDQVDF I PGMQGWFNIRKS INVI 
QHINRSKDKNHMIISIDAEKAFDKIQQHFMLKTLNKLRIDGTYLKII 
RAIYDKPTANIILNGQKLEAFPLKTGTRQGCSLSPLLFNIVLEVIJVR 
AIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISN 
FSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYIiGI 
OIjTRDVT^LFKIST^KPTjT.KT^ TNTVKMATL 

S^XJ X xVL/ V 1\±JUE JVJ-iAM X lUT XjXJXVC X IVtLX^ X *M JVri IrLwr? V OK X, X¥ X V WlttJ. J-l 

PKIIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKAILiSQKN 
KAGG I TL PDFKLYY KATVTKTAWYWYQNRD IDQWNRTE P S E I TLHI Y 
N FL I FDKP E KNEQWGKD S L FNKWC WENWLA I CRKLKLD P FLT P YTKI 
NSRWIKDLNTVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKA 
KIDKWDLIKLKNFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRI 
YNELQQIYKKKTNNPIK3CWAKDMNRHFSKEDI 

IREMQI KT 1T1RYHVTPVRMATI KKSGNNS WRS CLQPWAAHS TTYRHP 
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GTMEKKTSSGDTPTVFPSTLGRKKSTSAQLTELTALTLELSKGKVRN 
APSKLDLSPFEMLYGWPFLTNDFLLDQETSELGKHVNSLARFQQELT 
QVAEAQPQEVGPPLFNPGYLVLPKQGGPYTILLSMPSAVKVTGIDSW 
IHHAQVKAWKTERAI PGS PAECLGYQCEEIRDLKLKI KRDKKMSKGY 
SSYSVPVIHSVIQSSSLCTTGTQKSRIEVWEPLPRFPKMYGYAWIPR 
QKFAAGAGPSWRTSVKAVQKGNVGSKPHHRDPTGALPSGAVRRGPPS 
SRPQNGRSTGSLHRVPGKARPQHQPVKAARREAVPYKATGVELPKTM 
GTHLLHQCDLDLPATFSLSLHIKYLKHRHSYSPHLHLLFTVMGPHAR 
QM I R I S G I D VDTRNL YKLNLKVKWLE VLDQE E PE Y S RTTS RKKQG FP 
NSDEEDVLWMEKQLSCFCDKYQKWIPTRQLKGQLSYVKEADLIPDVM 
PPQALFHDDDEMEGNGAIDAGMEYVYNIDTLLRWSDGEEEGEGGEEG 
RGGGRGGEGGGKREEGEGKKEEEQGGWGGGQEVEEEGGGGGGN 

1123 

A 

1376 

3462 

TKPKTKTTLLSQ * MQKKPLTKFNNPS C * KLS IN/ I VLE VLARAIRQK 
KE I KG I QLGKEEVKLSLF ADDM I VYLENP I VS AQNLLKL I SNFSKVS 
GYKINVQKSQAFLYTKNRQTESQIMSELPFTIASKRIKYXGIQLTRD 
VKDLFKENYKPLLKEIKEDTN1WKNIPCSWGRINIVKMAILPKVIY 
RFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKVGGI 
TLPDFKLY YKATVTKTAW YWYQNRVI DQWNRKEPSE ITPHTYNYL I F 
DKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWI 
KDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAl^TKAKIDKW 
DLIKLKSFCTAKETTIRVNRQPTTWEKIFTTYSSDKGLISRIYNELK 
QI YKKKTNNP I KKWVKDMNRHFSKED I YAAKKHMKKCSS SLAIREMQ 
IKTTMRYYLTPVRi^IIKKSGNNRQTGSGVDLKQTPTDLKLRDLTVR 
RKMNKQKEIASTSTKRTSTPNPTCRSVGPKDCSSLGAMEQSWTENDF 
DKLTEKKAIiEENQEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIEA 
I IDS IPTKKYPGPDGFTAKFYERI KVFCTESLAKWI KWTHTKTFIME 
FHT IGNAK I LQAS S FTE VKTKTKTLEHKRLES I MALTS Q 

1124 

A 

485 

2347 

TE PKTKTT * LS Q * MQKRPLTKFNNTS C * KLS IN/ 1 VLE VLARAI RQE 
KEIKGIQLGKEEVKVSLFADDMIVYLENPTVSAQNLLKLIGNFSKVS 
GYKINVQKSQAFLYTNNRQTERQIMSELPFTIASKRIKYLGIQLTRD 
VKI)LFKENNKJPLLKEVKEDTNEWKNIPCSWVGRINIVKMAILPKVIY 
RFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSIFSQKNKAGGI 
TLPDFKLYYKATVTKTAWYWYQNRDIAQWNRTEPSEIMLHIYNYLIF 
DKPEKNKQWGKDS LFNKWCWENWLAI CRKVKLDPFLTPYTKMNSRWI 
KDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKW 
DLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK 
QI YKKKTNNP I KKWAKDVNRHFS KED I YAAKKHMKKCS S SLAIREMQ 
I KTTMRYHLTP VRMA 1 1 KKSGNNRKI Q/ GG I WCDR I L * R * TTCRVAK 
EIQSL*RRI/WKRLQRTLSIPVLDAV*PPMF*ASVIDTMTI*CFEAR 
DTCFTLTLESFVTOMHRCIAASKGIGLLLC*PLIWHMSLMGVKSPPFV 
FS CLWTS AVRPTT 

1125 

A 

1 

2784 

MQTTIREYYKQLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPI 
TGSEIVAIINSLPTKKSPAPDGFTTKFYQRTEDKNHMIISIDAEKTF 
DKIQQLMLKTLNKLGIDGMYLKI IRAI YDKPTANI ILNGQILEAFSL 
KTGTRQGCPLS PLLFN I VLE VLARAIRQEKEI KGVQLGKEEVKLS LF 
ADDMITYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNR 
ETENQIMSELPFTIASKRIQYLGIQLTRDVKDLFKENYKPLLNEIKE 
DTNKWNNIPCSWIGRINIMKMAILPKVIYT^FNDIPIKLPLTFFTELE 
KTILKFIWKQKRARIAKSILSKraKAGGIMVPDFKLYYKTTVTKTAW 
YWYQNRDINQWNRTEPSEIMPHICNYLIFDKPDKNK3CWGKDSLFNKW 
CWENWLATCRKLKXJDPFFTPYTKINSRWIKDLNVTSKTIKTLEENLG 
NTIEDIGMGKDFMFKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRV 
NRQPTEWEKI FATYSSDKGLI SRI YNELKQI YKKKTKNP IKKWAKDM 
NRHFSKEDIYAAKGHVKKCSSSMAIREMQIKPTMRYHLTPVRMAIIK 
KSGNHRCWRGCGEMGTFLHCWWDCKLVQPLWKS VWRFLKDLBLE I PF 
DPAIPLLSIYPKDYKSCCYKDTCTRSPGRQYLLELQAQWYCFCVNSP 
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AEC I FLLS LETWKHLDS R I FATLRSRDPLMS PHHQGLEPETELCG I S 
AEQLLRHAETQELYIPWPRYPQQGGSHSSLQALESPSSPDEEGSPKL 
STAALPECVQIASLNETPSIPPHWWNEIKAQIKXFFETNESKDTMC 
QNLQDAAKAVLRG KF I PLNAH I KKLERS EVNNLTLQQKELENQEQTN 
SKASRRQEITKIRAELKEMGTHKPLQKSMNVGAVF 

1126 

A 

1 

3000 

MGKKQSRKTGNSKKQSASPPPKECSSSPATEQSWTENDFDELREEDF 
RRSNYSELQEEIQTKGKEVENFEKNSDECITGITNTEKCLKELTELK 
AKARELHEERRSLKSRCDQLEERVSVMEDKMNEMKREGKFREKRIKR 
NKQSLQEIWDYVKRPNLRLISVPESDRENGTKLENTLQDIIQENFPN 
LARQANIQIQEIQRTPQRYSSRRATPRHIIVRFSKVEMKEKMLRAAR 
EKEIQTNIREYYKHRYANKLENLEEMDKFLNIYTLRRLNQEEVESIiN 
RPIRGSEIVAIINSLPTKKSPGPDGFTAEYYQRYKEELVPFLLKLFQ 
S I EKEG 1 LPNS FYEAS 1 1 L I PKPGRDTTKKENFRTI SLMNIDAKILN 
KILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKD 
KNHVIISIDAEKAFDKIQQLFLLKTLNKLGIDGTYLKIKRAIYDKPT 
ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEV1»ARAIRQEKEI 
KG I QLGKEE VKLS LFADDM I VYLENP I VS AQNLLKL I SNFS KVS GYK 
INVQKSQAFLYTNNRQTETQIMSELPFTIASKRIKYLGIQLTRDVKD 
LFKENYKPLLKEIKEDTKKWKNIPCSWVGRINIMKMAILiHKV'IYRFN 
AI P I KLPMTF FTE LEKTTLKFTWNQKRAR I AKS I LS QKNKAGG IMLP 
DF KL YYKATVTKTAW YW YQNRD I DQWNRTE PS E I TPH I YNYL I FDKP 
EKNKQWGKDSLFNKWCWENWLAIWRKLKLDPFFTPYTKINSRWIKDL 
NVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAIATKAKIDKLDLI 
KLKS FCTAKETT I RVNRQPTEWEKI FATYS SDKGL I SRI YNELKQID 
KKKTNNPIKKWAXDMNRHFSKEDIYAAKKHMKXCSPSLAIREMQIKN 
TMR YHLTPLRMAI I KKSGNNRWELNNENTRTQEGEHHTLGTWGWGE 
GGG I AVGD I PNAK 

1127 

A 

1 

3045 

MEYYAAIKKDEFLSFAETWMKLETIILSKLSQGQKTKHRMFSLIVLE 
VLARAIRQEKEIKGIQLGKQEVKLSLFADDMIVYRENPIVSAQNLIiK 
LISNFSKLSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKIIK 
YLGIQLTREVKDLFRENYKPLLNEIKEDTNKWKNIPCSWVGRINIMK 
MAILPKVIYRFSAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSIL 
SQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRNIDQWNITEPSEII 
PHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWIJVIRRKLKLDPFLIP 
YTKINSRWIKDLNGRPKTIKTVEENLGITIQDIGTGKDFMYKTPKAM 
ATKAKIDKWDLIKLKSFYTAKETTIRANRQPTKWEKIFATYSSDKGL 
ISRIYNQLKQIYKKKTNNPIKKWAKDMRRHFSKEDIYAAKKHMKKCS 
LSIiAIREMQIICrTMRYHLTPVRMAIIKKSGNNRCWRGCGEIVTLLHC 
WWDCKLVQPLWKS VWRFLRDLELE I PFDPAI PFLGI YPKDYKSCCYK 
DTCTLLE VTiARA I RQEKE I KG I QLG KQE VKL S L FADDM I VYRENP IV 
SAQNLLKLISNFSKLSGYKINVQKSQAFLYTNNRQTESQIMSELPFT 
IAS KI I KYLG I QLTRDVKDL FKENYKPLLNE I KEDTNKWKNI PCS W 
GRINIMK^ILPK^ICRFSAIRIKLPMTFFKKLEETTLKFIWNQKRA 
HIAKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRNIDQWNI 
TEPSEIMPHIYlWLIFDKPDKNKQWGKDSLFNKWCWENWIjAICRKLK 
LHPFLTPYTKINSRWIKELNVRPKTIKSLEENLGNTIQDIGMGKDFM 
S KTP KAMATKAK I DKWDL I KLKS FCTAKETT I RVNRQPTKWEKLFTT 
YSSDKGLISRIYNQLKQIYKKKNKQPHQKVGEGHEQTLLKRRHLCSQ 
KTHEKML I ITGHQRNANQNHNE I PS HAS 

1128 

A 

1 

2736 

M 1 1 S IDAEKAFGKVQQPFMLQTLNKLG IDGS YLKI I RAVYDKPTANI 
TLNGQKLEAFPLKTGTRRGCPLSPLLFNIVLEVLARAIRQEKEIKGI 
QLGKEEVXLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINV 
QKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIHLTRDMKDLFK 
ENYKPLLNEIKEDTNKWKNIPCSWGRINI\n<MAILP\RFNAIPIKL 
PMTFFTELEKTTLNF I WN/ QKRAR I AKS I LSQKNKAGG I TLPAFKLY 
YKATVTKTAWYWYQNRDIDQGNRTEPSEITLRIYNYLIFDKPEKNKQ 
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WGKDS LFNKWCWENWLA I CRKLKLD P FLTP YTT INS RW I KDLNVRP K 
TI KTLEEKLG I T I QD I VMGKDFMS KTPKAMATKAKI DKWDLI KLKS F 
CTAKETTIRVNRQPTKWEKIFETYSSDKGLISRIYNEL.KQIYKKKTN 
NPIKKWAKT^lNRHFSKEDIYTAKRHMKKCSPSIAIGEMQIKTTMRra 
LTP VRMA 1 1 KKS GNNRS AM I QM FVFPQD S YVE I LT I NVWVGVLPSQ 

LQRTLKDSSKEALETPSPPSFPSSPFCFPRKLHTDFSFGPKAKAFMH 
KRLNQEEVESLNRPITGSEIWIINSLPTKKSPGPDGFTAEFYQRYK 
EELR I KYLGIQLTRDVKDLFKENYKPLI^NE I KEDTNKWKNI PCSWVG 
RINIVKMAILPKNWKKTTLKFIRNQKRARTTKSILSQKNKAGGITLP 
DFKL YYKATVTKTAWYL YQ YRD I DQWDRTE PSE I TPH I YNYLI FDKP 
EKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDL 
NIRPKT I KTLEENLG I TIQD I GTGKDFMS KTPKAMATKAKI DKWDL I 
KLKS FCTAKETT I RVNGQPTK 

112 9 

A 

1 

2955 

MGKKQNRKTGNS KTQS AS PPPKERS S S PATEQS WMEND FDELREEGF 
RRSNYS ELRED I QTKGKEVENFE KNLEEC I TR I INTEKCLKELMELK 
TKARELREECRSLRSRCDQLEERVSAMEDEMNEMKPNLRLIGVPESD 
AENGTKLENTLQD I IQENFPNLARQANVQI QE I QRTPQR YSSRRATP 
RHIIVRFTKVEMKEKMLRAAREKDFKPTKIKRDKEGHYIMVKGSIQQ 
EELT I LNI YAPNTGAPRF I KQVLSDLQRDLDSHTL IMGDFNTPLS TL 
DRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHT 
YSKIEHIVGSKALLSKCKRKEIITNYLSDHSAIKLELRIKNLTQSHS 
TTWKLNNLLLND YWVHNEMKA£ I 

CRGKFIALNAYKRKQERSKIDTLTSQLKELEKQEQTHSKAGRRQEIT 
KIRAELKEIETQKTLQKINESRSWFFERINKIDRPLMLIKKKREKN 
QIDTI KNDKGD I TTDPTE I QTT IRE YYKHL YANKLQNLEEMDTFLDT 
YTLPRLNQEEVESVNKPITGSEIVAIINSLPTKKS PGPDGFTAEFYQ 
R YKEELVP FLLKLFQS I EKEG I LPNS FYEAS 1 1 L I PKPGRDTTKKEN 
FRP I S LMNI DAKI LNKI LANR I QQH I KKL I HHDQ VG F I PGMQG WFNI 
RKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGID 
GTYFR 1 1 RA I YDKPTAN 1 1 LNGQKLEAFPLKTGTRQGCPLS PLLFN I 
VLEVLARAVRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQN 
LLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASK 
RIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRIN 
IVKMAILPIOIGWGCLVAPHWPTVI^TIQLRSPRYPCTVVNVLAGV 

1130 

B 

1 

3105 

MGKKQNRKTGNSKEQSTSPPPKECSSSPAREQSWTEND FDELREEGF 
RRSNYS ELWED I QTKGKE VENFEKNLEEC I TR I TNTKKCLKELMELK 
TKARELREECRNLRSRCDQLEERQINETESQQGYPGIELSSAPSGPN 
THLQNS PPQ INR I YI FS APHHTYS KTDH I LGS KALLS KCKRTE 1 1 TN 
YLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKM 
FFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTS 
QLKELEKQEQTHSKASRRQEITKIRAELKAMETQKTLQKINESRSWF 
FERINKIDRJjLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIRE 
YYKHL YANKLENLEEMDKFLDTYTL PRLNQEE VE S LNRP I TG S E I VA 
IINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIGKEGILPN 
S FYEAS 1 1 L I PKPGRDTTKKE S FRP I S LMN I DAKI LNKI LAKR I QQH 
IKiOjIHHDQVGFIPGMQGWFIICKSINVIQHINRAKDKNHMIISIDA 
EKAFDKIQQ P FMLKTLNKLG I DGTYLKI I RA I YDKPTAN 1 1 LNGQKL 
EAFPLKTGTRQGCPLSPLLFN I VLEVLARAIRQENE I KG I QLGKQEV 
KLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL 
YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKSLL 
KEIKEDTNKWKNIPCSWGRINIV1CMAILPKVIYRFNAIPIKLPMPF 
FTELEKTILKF I WNQK&AR I AKS I L S QKNKAGG I TLPDF KLYYKATV 
TKTAW YW YQNRD IDQWNRTE PS E I TPH I YNYLI FE KPDKNKKWGKDS 
YLINGPQSTMSAQPWMHGYSAQPRHRFSRRQCTALAQADGQMQQLLL 
I S TDS I TAG YGRAPPNACE VW I KVW I PHLTSAD I HGEDT PHYLCPG P 
DSLASKIKKSYLWGNEEERPPDMENGGSQIHRIPATGKGIQAPLEGA 
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1131 

A 

1 

2827 

MEYYAAIKNDEFMSFAGTWMKLGTIILSKLPQGQKTKHHMFSLTAPH 
HTYSKIDHI IGSKALLSKCKRTEI ITNYLSDHSAIKLELRIKKLTQN 
RSTTWKLTNIiLLNDYWVHNEMKAEIKMFFETYENKDTTYQNLiWDAFK 
AVCRGKF I ALNAHKRKQKRS KI D I LTSQLKELEKQEQTHS KANRRQE 
ITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKRB 
KNQ IDA I KNDKGD I TTDPTD I KTTI RE YYKRL YTNKLENLEEMHKFL 
DTHTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKS PGLDGFTAKF 
YQRYKEELHHPDTKAGQRHNQKENFRPIPLMNIDAKILNKILANRIQ 
QHIEKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRTKDKNHMIISI 
DAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQ 
KliEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRKEKEIKGIQLGKE 
EVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQA 
FLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKP 

TFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKA 

TVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKQWGK 

DSLVNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNW 

TLEENLGNTIQD IGMGKDFMS KTPKAMATKDKI GKWDL I KLKS FCTA 

K^TTIRVIWQPTEWEKIFATYSSDKGLISRIYl^LKQIYKKKTNNPI 

KKWAKDMNRHFS KEG I Y AAKKHMKK\ CS PS LAI REMQI KTTMRYHFT 

PV 

1132 

A 

1709 

2974 

TEPKTKTT*LSQ*MQKRPLIKFSNASC*KLSIN/ IVLDVLARAIRQE 
KE 1 KG I ELGKEEVKLSLFADDM 1 VYLENP I VSAQNLLKLISNFNKVS 
GYKINVQKSQAFl^TNNRQTESQITSELPFTIASKRRK^LGIQLTRD 
MKDLFKDNYKPLLNEIKEDTNKWKJJIPCSWGR 

TETAWYWYQNRDIDQWNRTEPSEIMPRIYHYIiIFEKPDKNKQWGKDS 

LFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIK^ 

EENLGNTIQDIGMGKDFMSKTPKJUYLATKA^ 

TTIRVNRQPIEWEKIFANYSSDKGLISRIYNELKQVYKKKTNNPIKK 
WAKD>ll>raHFSKEDIYA\ANRHMKKCSRSIJ\IR 

1133 

A 

1 

2856 

MVKGSIQQEELTILNIYAPAGAPRFIKQVLSDLQRDLDFHTLIMGDF 

NTPLSTIaDRSTRQKA/NKI>TQELNSALHQADLIDIYRTLHSKSTEYTF 

FS APHHTYS KI DHLLGS KAFLSKCKRTE 1 1 TNYLSDHS A I KLELRI K 

NLTQNRSTTWKPNNLLLNDYWVHNEMKAEIKWFFETNENKDTTYQNL 

WDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQVKELEKQEQTHSKA 

SRRLNQEEVESLNRPITGSEIVAIINSLPTKRVQDQMDSQPNSTREK 

EGILPNSFYEASIILIPK^GRDTTKKENFRPISLMNIDAKILNKIIiA 

NQIQQHIKKLIHHDQVGFIPGMQGWFNIHKSV1W 

IIS IDAEKAFDKIQQPYMLKTLNKLGIDGTYFKI IRAI YDKPTANI I 

IJtfGQKLEAFPLKTGTRQGCSLSPLLFNIVLEVLAR^ 

LGK£EVKLSLFADYMIVYLENPIVSAQ1«iLKLISNFSKVSGYKINVQ 

KSQAFLYTNNRQTESQIMSELPFTIASKRIKHLGIQLTSDMKDFFKE 

NCKPLLNEIKEDTNKWKNIPCSWGRINIVKl^ILPKLPMTFFTELE 

J\X IXjJvt XW.Nyi\J</\uX livh>NJjbQJ^KAGGITljPDFKLiYY KATVTKTAW 

YWYQNRDIDQWNRTEPSEITPHIYOTLIFDKPEKNKQWGKDSLFNKR 

FWENWIJVIFRKXKLDPFLTPYTKINSRWIKDLHVRPKTIKTLEENLG 

I T I QDTGMG KDFTS KTPKAMATKAKI DKWDLI KLKS FCTAKETT I RV 

NRQPRKWEKI FATYSSDKWLISRI YNELKQI YKKKTNNP IKKWAKDM 

NRH FS KED I Y AAKKHMKKCS PS LA I RETQ I KTTMR YHLT PGH P I LNL 

KHS ELHHAPQS S S WKREGLDLEERVE FGQ I KKNAGTLQKRRG CE PRH 

GDTNIGDIFVRQ 

1134 

B 

1 

3786 

MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGD 
FlWPLSTLDRSMRQKVNKDTQELNSALHQVDLIDIYRTIjHHKSTEYR 
FFSAPHHTYSKIDHILGSKALLSKCKRTEIITNYLSGHSAIKLELKI 
KNLTQNRSTTWKLNNLLL1TOYWIHNEMKAEIJCMFFETNENKDTTYQN 
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LWDAFKAVCRGKFIAIiNAHKRKQERSKIDTLTSQLKELEKQEQTHSK 
AGRKKE I TKI RAQLKE I BTQKTL KKLMNPGAE IQTTI REYYKHL YAK 
KLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTK 
KSRTRWIHSRILPEEAS I ILIPKPGRDTTKKENFRPISLMNIDAKIL 
N K I LAKR IQQHI KKL I HHDQVGFI PGMQG WFN I HKS INVIQH INRAK 
DKNHI X I S IDAEKAFDKIQQPFMLiKTLNKLGIDGTYFKI IRAI YDKP 
TANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVIiEVLARAIRQEKE 
IKGIQLGKEEVQLSLFADEMIVYLENPIVSAQNLLKLISNFSKVSGY 
KINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVK 
DLFKENCKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRF 
NAI PTKPPMTFFTELEKTTLKFIWNQKRARIAKS ILSQKNKAGGITL 
PDFKLYYKATVTKTAWYWYQNRDLDQWNRTEPSEITPHIYSYLIFDK 
PEKNKQWGKDSLFNKWCWENWLPICRKLKLDPFLTPYTKINSRWIKD 
LNVRPKTIKTLKENLGITIQDIGMGKDFMSKTPKAMATKDKIDKWDL 
IKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQI 
YKKKTNNP INKWVKDMNRHFS KEDI YAAKKHMKKreSStJVTREMOTTC 
TTMRYHLTPLRMAI I KKSGNNS AS PTARNKTARNQRTKMI AVTAPRN 
RAPLELEL I L YRQNRQS KTH I LE TNNTS AELLVP FEED YL I E I RTVS 
DGGDGSSSEEIRIPKMSMIDHILPKSIPEELQNGEGFGYIIMFRPVG 
STTWSKEKVSSVESSRFVYRNESIIPLSPFEVKVGVYNNEGEGSLST 
VTIVYSGEDDGYVFLWMVEPQLAPRGTSLQSFSASEMEVSWNAIAWN 
RNTGRVLGYEVLYWTDDSKESMIGKIRVSGNVTTKNITGLKANTIYF 
AS VRAYNTAGTG PS S PP VNVTTKKSR YL ITTAYLEVPE I * 

1135 

A 

1 

3276 

MGDFNTPLSTLDRSTRQKVNKHTQELNSALHQ7U3LIDIYRTLHPKST 
EYTFCSAPHHTYSKIDHIVGSKALLSKCKRSEIITNCLSDHSAIKLE 
LR I KKLTQNRSTTWKLNNLLLNDYWHNEMKAE I KMFFETNQNKDTT 
YQNLWDTFKAVCRWKFIALNAHKRKQERSKIDTLTSQLKELEKQEQT 
HSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKTDRPL 
ARL I KKKRQNNQ I Y A I KNDKRD I TTDPTE I QTT IRE Y YR YLYTNKLE 
NliE EMDKFLDT YTL PR LNQE EGD S LNR P I TG S E I VA I TNS L PTKKS P 
GPDGFTAEFYQRYKEELVPFLLKLFQS IEKEGILPNSFYEAS I ILIP 
KPGRHTTEKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVG 
FIPGMQGWFNIRKSRNVIQHINRTKDKNHMIISIDAEKAFDKIQQRF 
MLKSLNKLGVDGTYLKI IRAI YDKPTANI I LNGQKLEAFPLKTGTRE 
GCPLSPLLFNIVLEVXARAIRQEKEIKGIQLGKEEVKLSLFADDMIV 
YLENPIVSAQNLLKLISNFSKVSGYKIHVQKSQAFLYTNNRQTESQI 
MSELPFTI ASKRI KYLG I QPTKDVKDLFKENYKPLLKE I KEDTNKWK 
NIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTQLEKTTLKF 
I WN Q KRAR I AKS I L S QMNKAGG I M L PD F KL Y Y KATVTKT AW YWYQNR 
D I DOWNR I E PS E I M PH I YNYL» I FDKPEKNKOWfS KD T . PTsI KWPWPMWT* 
AIWRKLKLDPFLTPYTKTNSRWIKDLlWRPKTIKTLEENIiGITIQDI 
GMGKDFMSKTPKAIATKAKIDKWDLIKLRSFCTAKETTIRVNRQPTK 
WEKI FTTYS SDKGL I SRI YKELKQI YKKKTNS P I KKWAKDMNRHFS K 
ED I YAAKRHMKKCSPSLAIREMQI KTTMR YHLTPVRMAI TKKSGNNR 
CWRGCG E I GTLLHCWNPTYKGCEG PLQGERQTTAQGNKRG YKQMEEH 
SMLMGRKNQYRENGHTAQERVQQLMVIiNQANSCFECLEICLGVERKG 
PCGTS I YVREG 

1136 

A 

1 

3042 

MGKKQSRKTGNSKNQSASPPPKERSSSPAMEQSWMENDFDDLREEGF 
RQSNYSELKEEVRTHGKEVKNLEKIO^DEWLTRITNAEKSLKDLMEIiK 
TTAREIJiDECTSLKRVSVTEDQMNEMKREEKFREKRIKRNEQSLQEI 
WD YVKRPNLHL I GVPES DRENGTKLENTLQD 1 1 QENFPNLVRQANI Q 
IQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTHK 
GKPIRLTADLLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFID 
RSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTY 
SKIDHI VGSKALLSKCKRTEI ITNCLSDHSAIKLELRI KKLTRNHST 
TWKLl^LLLlStDYWVHNEMKAEIKMFFETNENKDKTYQNL 


W7 
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RGKFIALNAHKRKQERSKIDTLTSQLKGLEKQEQTHSKASRRQEITK 
IRAELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIKNKREENQ 
ID AI KNDKGD I TTDPTE I QTA I S E Y YKHL YANKLENLEEMD KFLDTY 
. TLPRLNQEEVESLNRPITGSEIEAI INSLPTKKSPGPDGFMAEFYQR 
DKEELVPFLLKLFQS IEKEG I LPNS F YEAS IMLI PKPGRDTTKKENF 

KS LNV I QH INRTNDKNHM 1 1 S I DAEKAFD K I QQPFMLKTLNKLG IDG 
MYLKIIRAIYDRPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIV 
LEVLARAI RQEKE I KG I QLGKEE VKL S LFADDMTVYLENAI VS APNL 
LKLIGNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFRIASKR 
I KYLG I QLTRD VKDL FKQNYKPLLNE I KEDTNKWKN I PCSWVGR IN I 
VK^ILPKGTQRPYAKVRVQLNSGEELHFLGLLDTSTQVTVIPTNPC 
IKMRRKQIVFSDFGLLLTSWMAPTTE 

1137 

A 

1 

3663 

EHSSSPATEQRWTENDFDELREEDFRRSNYSELWEDIQTKGKEVENF 
EKNLEECITRITNTEKCLKELMELKTKARELRKECRSLRSRCNQLEE 
RVSAMEDGMNEMKQEGKFREKRIKRNKQSLQEIWDYVKRPNLRLIGV 
PES DGENG TKLENTLQD 1 1 QEN FPNLAR QANI Q I QE I QRTRQRYS S R 
RATPRHI I WFTKVEMKEKMLRAAREKGRVTHKGKPIRLTADLSAET 
LQARREQKPS FFETNENKDTT YQNLWDAFKAVCRGKF IALNAHKRKQ 
ERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTI 
QKINESRSWFFERINKIDRPLARLTKKKREKIQIDAIKNDKGDITTD 
PTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLN 
RPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQ 
STEKEGILPNSFYEASIILIPKPGRDTTEKENFRPISLMNIDAKILN 
KILPNRIQQHIKNLLHHDQVGFIPGMQGWFNICKLINVIQHINRTKD 
KNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPT 
AN 1 I WNGQKLEAFPLKTGTRQGC PLS PLLFN I VLE VLARA I RQEKE I 
KGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSAYK 
I NVQK S Q AFL YTNNR QTE S Q I R S E L P FT I AS KR I KYLG I QLTRD VKD 
LFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKIIYRFN 
AIPIKLPMTFFTELEKTTFKFIWNQKRARIAKSILSQKNKAGGITLP 

EKNKQWGKDSLFNKWCWENWLAI CRKLKLDPFLTP YTKINSRWI KDL 
KVRPKTIKTLEESLGITIQDIGMCKDFMSKTPKAMATKAKIDKWDLI 
KLKS FCTAKETTI RLGVELHKHPCGHHHYDTTGSDLKPAQHWVSPKA 
YYNHSQAIANVHSRPCDFTISMWQSQSGLCPSLQDGKVSHALGRLAP 
LWPREGPEMPSKYQVLESGTPSILLMLYPSVAMLVPKVQDKVTFTFP 
STFLKQMEFLHAASTAGEGRGGVASEIQDCFFYLFSASLSDMKLKPG 
TVDAHWTFGSYEAIIAKNSGKSMVSLPLASTSLIISRNSASDPVTS 

1138 

A 

1 

3144 

MGD FNTPL STLDRS S RQKVN KDTQELNS TLHHADL I D I YRTLH PKS T 

E YTFFS APHHTYS KI DHWGS KALLS KCKRTE 1 1 TNCLSDHS AI KPE 

LRIKKLTQNRSTTWKLNNLLLNDYVfVHNKMKAEIKM 

YQNLWDTFKAVS RG KF I ALNAH KRKQKRC K I DTLAS QL KE VE KQEQT 

HSKASRRQEITKIRAELKEIETQKTLQKINESRSWFLERINKIDRPL 

ARLIKKKREKNQIDVIKNDKGDITTDPTEIQTTIREYYKHLYANKLE 

NLEELDKFLDTHTLQRLNQEEVESLNRPITGSEIVAIINSLPTKKSP 

GPDGSTAEFYQRYKEELLPFLLKLFQS I EKEGILPNS FYEAS 1 1 LI P 

KPGRDTAKJ<ENFRPISLMNIDAKILNKILANQIQQHIKKLIYTMIKW 

AS S LGCKAVLEVIjARA I RQE KE I KG I QLGKEEVKLS LF ADDM I VYLE 

NPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSE 

IPFTIVSKRIKYLGIQLTRDMIOjLFK^N^KPLLNEIKEDTNKWKNIP 

CS WLGR I NNVKMA 1 LP KV 1 YRFNAIL 1 KLAMTFFTESEKTTLKF 1 WN 

QKRACIAKSILSQKNKVIGITLPDFKLYYKAIVNKTAWYWYQNRDID 

QWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAIC 

RKLKLDPFLTPYTKINSRW I KDFHVRPKTI KTLEENLG I TIQDIGMD 

KDFMSKTPKAMATKAKIDKWDLIKLKSFCRAKETTIRVNRPPTEWEK 
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IFVTYSSDKGLISRIYNBLKQIYKKKTNNPIKKWAKDMNRHFSKEYI • 

YAAKRDMKKCS S S LAI REMQI KTTMRYHLTP VRMVI I KKSGNNRCWR 

GCGEIGTLLHCWWDCKLVHPLWKSVWRFLRDLELEIPFDPAILLLGI 

YPKDYKSCCYKDTCTHEKAEIQKSKGIRDYGEIGTRTEDMDSQFRPL 

SWRAQIIPFEFPWEGIPLGGPFPRHSCSYPVGLALLYHSSSAAGPV 

RGHLSLQWNSRMTS 

1139 

B 

1 

3380 

MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGD 
FNNPLSTLDRSMRQKVNKDTQELNSALHQVDLIDIYRTLHHKSTEYR 
FFS APHHTYS KIDHI LGSKALLSKCKRTE I ITNYLSGHSAI KLELKI 
KNLTQNRSTTWKLNNLLLNDYWIHNEMKAEIKMFFETNENKDTTYQN 
LWDAFKAVCTRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSK 
ASRRINKIDRPLARLIKKKREKNQIDAIKNDKGDITNDPTEIQTTIR 
EYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIV 
AIINSLPTKRVQDQMDSQPNSTRVLEVLARAIRQEKEIKGIQLGKQE 
VQLSLFADEMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAF 
LYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENCKPL 
LNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPTKPPMT 
FFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKAT 
VTKTAWYWYQNRDLDQWNRTE PS E I TPH IYSYLIFD KPEKNKQWGKD 
SLFNKWCWE1JWLPICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKT 
LKENLG I TIQD I GMGKDFMS KTPKAMATKDKIDKWDLI KLKS FCTAK 
ETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIN 
KWVKDMNRHFSKEDIYAAKKHMKXCSSSLAIREMQIKTTMRYHIjTPL 
RMA 1 1 KKSGNNS AS PTARNKTARNQRTKM IAVTAPRNRAPLELEL I L 
YRQNRQSKTHILETNNTSAELLVPFEEDYLIEIRTVSDGGDGSSSEE 
IRIPKMSTGGEEGMAAVFKNKCRCSWSRVVIAYHSSSGNQMGTNPEQ 
DPGQHAI PLEX * 

1140 

A 

1 

3345 

MGDFNIPLSTLDRSTRQKVNKDTQELNSALHQADLIDSYRTLHPKST 
EYTFFSAPHHTYSKIDHILGSKALLSKCKRTEIITNYLSDHSAIKLE 
LRIKNLTQNCSTTWKIjNNLLLNDYWVHNEMKAEIKMFFETNENKDTT 
YHNLWDAFKAVCRGKF I ALNAHKRKQER S KI DTLTSQLKELEKQEQT 
HSKASRRQEITKIRAELKEIETQKTLQKINESRSWVFERINKIDRPL 
ARLTKKKSEKNQIDAIK2TOKGDITTDPTEIQTTIREYYKHLYANKLE 
NLEETDKFLDTYTLPRRNQEEVESLNRPITGAEIVAIINNLPTKKSP 
GSDGFTAEFYQRYKEELVPFLLKLFQS I EKEGI LPNS FYEAS IILIP 
KPGRDTTKKENFRP I S LMN IDAKI LNK I LANQI QQH I KKL IHHDQVG 
FIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPF 
MLKTLNKLGIDGTYLKI IRAI YDEPTANI ILNGQKLEAFPLKTGTRQ 
GCPLSPLLFNIVLEVLARAIRQEREIKGIQLGKEEVKLSLFADDMIV 
YLENPIVSAQNLLKLISNFGKVSGYKINVQKSQAFIiNTNNRQTESQI 
MS EL PFTI AS KR I KYLG I QLTRD VKDL FKENYKPLLKE I KEDTNKWK 
NIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKF 
IWNQNRACIAKSILSQKNKAGGITLPDLKLYYKATVTKTAWYWYQNR 
DIDQWNRTEPSEITLHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWL 
AICRKLKLDPFLTPYTKINSRWI KDLNIRPKTI KTLEENLGITIQDI 
GMGKDFMSKTPKAMATKAKIDKVHDLIKLKSFCTAKETTIRVNRQPTK 
WGKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKFAKDMNRHFSK 
KDI YAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAX IKKSGNNR 
TGRGLHPGRDWS VTCARS PGDPEPSGI KGGCGSGTPSPS PSKQTVRR 
PWTQIPEMKPLLLAVSLGLIAALQAHHLLASDEEIQDVSGTVfYLKAM 
TVDREFPEMNLESVTPMTLTTLEGGNLEAKVTML 

1141 

A 

1 

3429 

MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKST 
EYTFFS APHHTYS KIDHI LGSKALLSKCKRTE 1 1 ANYLSDHSAIKLE 
LRIKNLTQNRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKDTT 
YQNLWDAFKAVCRGKFIALSAHKRKQERSKI DTLTSQLKELEKQEQT 
HSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPL 
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ARLIKKKREKNQIDAIKNDKGDITIDPTEIQTTIREYYKHIiYANKLB 
NLEEMDKFLDTYTLPRLNQEEVKSLNRPITGAEIVAIINSLAIKKSP 
GPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP 
KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVG 
FI PGMQGWFNIHKSINVIQHINRARDKNHMI I S IDAEKAFDKIQQPF 
MLKTLNKLGIDGTYLKI IRAIYDKPTANIILNGQKLEAFPLKTGTRQ 
GCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIV 
YLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQI 
M S ELPFT I AS KR I KYLG I QLTRDVKDLF KENYKPLLKE I KEDTNKWK 
NMPCSWVGRINIVKMAILPKVIYRFNAIPIKX.PMAFFTELEKTTLKF 
IWNQKRARIAKSILSQKNKAGGITLPNFELYYKATVTKTAWYWYQNR 
YIDQWNRTEPSEITPHIYNYIilFDKPGKNKQWGKDSLFNKWCWENWL 

GMGKDFMS KTPKAMATKDKIDKWDL I KLKS FCTAKETT I R VNRQPTK 
WEKI FATYS FDKGL I SR I YNELQQI YKKKTNNP I KKWAKDMNRHFS K 
ED I YAAKKHM KKYS SSLAI REMQ I KTTMR YHLTP VRMAT I KNSGNNR 
SLCFYKENLSLCLAILLSVPVQEGGARSFLAPAGGVAGRATSHYGGH 
MATAERLSVELSGQALKEFQKGPLSPKQFKGHHSQQLRWTHRLNPSY 
GQSKTPKNTLLKMRGPTTWALGGTDVAASTDNNGNAHFETLQVMDMP 
RPWCCLLDGALLVSV 

1142 

A 

1 

3030 

MGKKQSRKTGNSKKQSASPPPKECSSSPATEQSWTENDFDELREEDF 
RRSNYSELQEEIQTKGKEVENFEKNSDECITGITNTEKCLKELTELK 
AKARELHEERRSLKSRCDQLEERVSVMEDKMNEMKREGKFREKRIKR 
NKQSLQE I WDYVKRPNLRLI S VPESDRENGTKLENTLQD I IQENFPN 
LARQANIQIQEIQRTPQRYSSRRATPRHIIVRFSKVEMKEKMLRAAR 
EKE I QTNI RE YYKHRYANKLENLEEMDKFLNI YTLRRLNQEE VESLN 
RPIRGSEIVAIINSLPTKKSPGPDGFTAEYYQRYKEELVPFLLKLFQ 
SIEKEGILPNSFYEASIILIPKPGRDTTKKENFRTISLMNIDAKILN 
KI LANR I QQH I KKLIHHDQVGFI PGMQGWFNI RKS INVI QHINRTKD 
KNHVI I S IDAEKAFDKI QQLFLLKTLNKLGIDGTYLKI KRAI YDKPT 
ANI ILNGQKLEAFPLKTGTRQGCPLSPLLFNI VLEVLARAIRQEKEI 
KGI QLGKEEVKLS LFADDMI VYLENP I VS AQNLLKL I SNFSKVSG YK 
IlNTVQKSQAFLYTNIJRQTETQIMSELPFTIASKRIKYLGIQLTRDVKD 
LFKERYKPLLl^IKEDTKKWKNIPCSWVGRINIMKl^ILiHKVIYRFN 
n.L r ±. xvuxri ixrr x aucti^x X XjJVt X ni>(viv-K>\X<.±/\ivoXXjoyi>XN 2\JAIjL3 XrLLi.r' 
DFKLYY KATVTKTAWYWYQNRD I DQWNRTE PSEITPHI YNYL IFDKP 
EKNKQWGKDSLFNKWCWE1WKAIWRKLKLDPFFTPYTKINSRWIKDL 
lWRPKTIKTLEENIiGITIQDIGMGKDFMSKTPKAIATKAKIDKLDLI 
KLKS FCTAKETTIRVNRQPTEWEKIFATYSSDKGLISRI YNELKQIY 
KXKTNNPIKKWAKl^MlNrR^ 

TMRYHLTPLRMAIIIvKSGNOTtASRSARGESLGPSQSFPEHVHCPGPL 
HTALHRPMTFQIPRDMLEVSSAH 

1143 

**■ 

A 

1 

4171 

^SLEQNPRSKWELIVHRGTMELWPTIWADEEEQGLKAVIjALS 

PGARS KTGLTDCMGVGGGLLLPAFPHI PGKRDSHNPLWN I TPENQS P 

TPWAMERSSSPATEQSWMENDFDELREEGFRRSNYSELQEEIKTKG 

KEVKNFEICNLDECiraiTNTEKCLKEVMQLKAKARELREECRSLRSR 

WNQLEERVSVMEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPN 

LRLIGVPESDGENATRLENTLQDIIQENFPNIjARQANIQIQEIQRMP 

QRYS SRRAS PRH 1 1 VRFTKVEMKEKMLRAAREKGRVTHKGKP I RL I A 

DLSAETLQARREWGPIFNILKEKNFQLRISYPAKLSFISEGEIKSST 

DKQMLRDFVTTRPALKELLKEAL1TOERNNRD 

GKQKKAGVAILVSDKTDFKPTKIRRDKEGHYIMVKGSIQQEELTILN 
I Y APNTGAPRF I KQVLSDLQRDLDSHT I IMGDFNTPLS TLDRSTRQK 
VNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKTDHI 
VG S KALLS KCKRTD I KTNCLSDHSAI KLELR I KKI»TQNH S TTWKVNN 
LLLITOYWVHNEMTAEIKMFFETOENK^ 
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LNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQBISKIRAEIiK 
EIETQKTLQKINESRSWFFERINNINRLLERLIKKKREKNQIDTTKN 
DKGDITTDPTEIQTTIREYYKHLYANKLE^TLEEMDKFLDTYTLPRLN 
QEEVES LNRP I TG SEI VAI INSLPTKKS PGPDG FTGEFYQRYKEELV 
PFLLKLFQSMEKEGIPPNSFYEASIILISKPGRDTTKKENFRPISLM 
N I DAK I LNK I LTKR I QQH I KKL I HHDQVG F I PGMQGWFN I R KS INVI 
QHINRTKDKiraMIISIDAEKAFDKIQQHFMLKTLKKLGIDGMYLKII 
RATYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLAR 
A I RQE KE I KG I QLGKEE VKLS L FADDM I VYLEN P 1 1 S AQNLLKL I SN 
FSKLSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGI 
QLTRDVKI)LFKENYKPLLNEIKEDTNKWKNSPCSWVGRINIMKMAIIi 
PKLPMTFFTELEKTTLKF I WNQKRACIAKS ILSQKNKAGGITLPDFK 
LH YKATVTKTAW YW YQNRD IDQWNRTEPS E I TPH I YNYLI FDKPEKN 
KQWGKDS LFNKWCWENWLA I CRKLKLD PFLTP YTKINS RW I KDLNVR 
PKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIK\S 
FCTAKETT I RVNRQPTKWEKI FATYSS 

1144 

A 

1 

3921 

MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGF 

RRSNYS ELWED I QTKG KE VENFEKNLEEC I TR I TNTEKCLKELMELK 

TKARELHEECRSLRSRCDQLEERVSAMEDEMNEMKREGKFR5KRIKR 

NEQS LQE I WD YVKRPNLRL I G VPE SD VENGT KL ENTLQD 1 1 QENFPN 

LARQANVQI QE I QRMPQR YS S RRATPRHI IVRFTKVEMKEKMLRAAR 

QKAPHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIK 

NLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNENTTO 

WDTFKAVCRGKFIALNAYKRKQETSKIDTITSQLKELEKQEQTHSKA 

SRRQE I TKIRAELKEI ETQKTLQKINESRSWFFERINKIDRPLARLI 

KKJGIEKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENIjEE 

MDTFLDTYTLPRLNQEEVESLNR P ITGS E I VAI INSLPTKKS PGPDG 

FTAEFYQRYKEELVPFLLQLFQS I EKEG I LPNS FYEAS I ILI PKPGR 

DTTKiCENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPG 

TQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKT 

LNKLG IDGTYFKI IRAI YDKPTAN 1 1 LNGQKLE AFPLKTGTRQGCPL 

SPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLEN 

PIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEL 

PFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPC 

SWGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQ 

KRACISKSILSQKNKAGGITLPDFKXYYKATVTKTAWYWYQNRDIDQ 

WNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICR 

KLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGK 

DFMS KTPKAMATKAKIDKWDLI KLKS FCTAKETTIRVNRQPTTWEKI 

FTTYSSDKGLISRIYNELKQIYiaCKTNNPIKJCWAKDMNRHFSKEDIY 

AAKJOIMKKCSSSIAIREMQIKTTMRYHLAPVRMAIIKKSGNNSKMVE 

ADGPGNLFTGVLHLETNEKSLKAWGKYGPILEDNFLDGKAMKVEQV 

NKPSFQSGGNVEITIFLEKQEPFRKSETGKKFFKLFLLTLANFFKFS 

DHDDYSGGLGRDHgEHPNGSSYRDAYQSYGTFYGAPPA 

1145 

A 

1 

2884 

MVKGSIQQEELTILNIHAPNTGAPRFIKQVIiSDIiQRDIiDSHTLIMGD 
FNTPLSTLDRSMRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYT 
FFSLPHHTYSKIDHIVGSKALLSKCKRTQIITNYLSDHSAIKLELRI 
KTLTQSRSTTWKLNNLLLNDYWVHNEMKAEIKMFFETOENKDTTYQN 
LWDAFKAVCRGKPIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSK 
ASRRQEITKIRAELKETETQKTLQKINESRSWFFERINKIDRPLARL 
IKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHIiYANKLENLE 
EMDTFLDTYTLPRLNQEEVESIiNRPITGAEIVAIINSLPTKKSPGPD 
GFTAEFYQR YKEELH INRAKDKNHMI I S IHAEKAFDKI QQPFMLKTL 
NKLG IDGTYFKI IRA I YDKPTAN 1 1 LNGQKLEAFPLKTGTRQGCPLS 
PLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENP 
IVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP 
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FTIASKRI KYLGIQLTRDVKDLFKENYKPLLKEI KEDTNKWKNI PCS 
WVGR I N I VKMA I L P KA I YR FN A I P I KL PMTF FT EL E KTTT ,KF T WW nV 
RAR I TKS I L S QKNKAGG I TL PDFKL YYKATVTKTAW YWYQNRD I DQ W 
NRTEPSEITPHTYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRK 
LKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLEITIQDIGVGKD 
FMS KTPKAMATKAKI DKWDL I KLKS FCTAKETT IRYNRQPTTWEKI F 
ATY S SDKGL I SR I YNELKQ I YKKKTNNP I KKWARDMNRH FS KED I YA 
AKKHMKKCS S S LA I REMQ I KTTMR YHLTP VRMA 1 1 KKS GNNRN I \ RL 
CSYSGSHSRAWFLPPAGRHIQ 

1146 

A 

1454 

3917 

SIAYQRKRVQDQMDSQPNSTREKEGILSNSFYEASIILIPKPGRDTT 
KKENFRPISLMNIDAKILSKILAKRIQQHIKKLIHHDQVGFIPGMQG 
WVNIHKSINVIQHINRA103K3mMIISTDAEKAFDIIQQPFMLKTI^ 
LGIDGMYFKI IRAI YDKPTANI ILNGQKLEAFPLKTGTRQGCPLSPL 
LFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMTVYLENPIV 
SAQNLLKLISNFSKVSGYKINVQKSQGFLYTNNRQTESQIMSELPFT 
I AS KR I KYLG I QLTRD VKDL FKENYKPLLKE I KEDTKKWKN I PCS WV 
GRINIVKMAILPKVIYSFNAIPIKLPMPFFTELEKTTLKFIWNQKRA 
RITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNR 
TEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLK 
LDPFLTP YTKINS RW I KDLNVRPKT I KTLEENLGNT IQDIGMGKDFM 
S KTPKAMATKAK IDKWDL I KLKS FCTAKETT I RVNRQPTKWEKI FAT 
YSSDKGLISRIYNELKQIYKKlCn^PIKXWVKDMNRHFSKEDIYAAK 
KHM KKC S S S LAI REMQ I KTTMR YHLT PVR VA 1 1 KKSGNNRCWRGCGE 
IGTLLHCWVTOCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPKD 
YKSCCTKDTCTlblFIAALFTIAKTWSQPKCPTTIDWIKKMWHIYTME 
YYAAI KNDE FMS FVGTWMKLE 1 1 1 LS KPLQEQKTKHRMFSL I VS SKV 
IRRLSQKLPLKIVAMCIPVKRN 

1147 

A 

11537 

15574 

AKSPANIIMTGSNSHITILTLNINGLNSAIKRHRLASWIKSQDPSVC 
CIQETHLTCRDTHRLKIKGWRKIYQANGKQKKAGVAILVSDKTDFKP 
TKIKRDKEGHYIMVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQ 
RDLDSHTLIMGDFNTPLSTLDRSMRQKVNKDTQELNSALHQADLIDI 
YRTLHPKSTE YTFFS APHHTYS KI DH I VGS KALLSKCKRTE 1 1 TNYL 
SDHSAI KLELR I KNLTQS R S TTWKLNNLLLND YWVHNEMKAE I KMFF 
ETNENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQL 
KELEKQEQTHS KAS RRQE I TKI RAELKE I ETQKTLQKI1STESRS WFFE 
RINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYY 
KHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAII 
NSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSF 
YEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIK 
KLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEK 
AFDKIQQPFMLKTLNKLGIDGTYFKI IRAI YDKPTANI ILNGQKLEA 
FPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKL 
SLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYT 
NNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKE 
IKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFT 
ELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKXjYYKATVTK 
TAW YWYQNRD IDQWNRTE PS E I MPH I YNYL I FDKPEKNKQWGKDSLF 
NKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEE 
NLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKS FCTAKETT 
IRVNRQPTTWEKI FATYSSDKGLI SRI YNELKQI YKKKTNNPIKKWA 
KDMNRHF SKED I YAAKKHMKKCS S SLAIREMQ I KTTMR YHLTP VRMA 
1 1 KKSGNNRVGD YHS YENDDDNLG ATAAQS CTLLFS L I AHLLLQDP Y 
T I PA YMTQVQNS AAGNYWKNTD AAKKLET 1 1 LGKLTQEQKTKHRMFS 
LISGS 

1148 

A 

115 

450 

VWQPEAWTPFLDSLSGGCLRPSSWFAAFQRWEGVPLPGVQSVKLSPL 
SRDPAPPAGTLPCVALEADWFLDMPGGRRGPSRQQLSRSALPSLQTL 
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- 

VGGGWGKGTGLRNRKGRA 

1149 

A 

278 

886 

NWLTTINLGIiQLQVALEADWFLDMPGGRRGPSRQQLSRSALPSLQTL 
VGGGCGNGTGLRNRNGSAIGLPVPPITALITPGPVRHCQIPDLPVDG 
SLLFEFLFFIYLLAALFIQYINIYKTAWGYPYiraPASCTSLNFHLTD 
YDIJVAFITKMLAKRLVWALISQATKAGAASMIHYMDLTSARLGLLTL 

cgwvlcwtlvnIjFr j 

1150 

A 

2 

379 

CGLPLEAPRCPTFRTPLPALSRHRRRRLTEAVSTEEKESDRMSAIQN 
LHSFDPFADAREGDDLLPAGPEDYLHIKIQHRIGGKTLTTGQGIADD 
HDKKKL WAS KKRSGCDGTVTGP PD YGE V I E 

1151 

A 

172 

464 

ASHRVGLLQPFLNLWPSGCSTVIAKMKSVLVATEGAEVLFYWTDQEF 

EESLRLKFGQSENEEEEVGLLML*AR*PHPHTPPVLSGLNEGKKKSN 
FIT 

1152 

A 

164 

529 

ASHRVGLLQPFLNLWPSGCSTVLAKMKCVLVATEGAEVLFYWTDQEF 
EESLRLKFGQSENEEEELPALEDQLSTLLAPVIIFSMTMLEKLSDTY 
TCFSTENGNFLYVLHLFGECLFIAING 

1153 

A 

1 

1122 

MVIjLTLDGDYTHADRNLEWWGLKVLSATPQADKKKLASADNQPETVS 
LLLAKNALFPCRSVGVSTTELVSAQIHAGPGPEVSLVEPVGAMWTMV 
LRAVLKQSQVEAPSRALKXPSLECAASLGQLHTPADLTYYCQEKVLF 
VDFLLQGGIGFKTTGSWLISNKEHHKTFFKAQQALEEEGISAFQFLH 
YKVQS PVNNKNNKEG I KKGKEHSS S PATEQS WTENDFDELREEGFRR 
SVITNFSELKEDVRAHCKJ^KNFERRLDEWLTRINSKEKTLNDLMEL 
KTMARELCDACTSFSSQFDQVEERVSVIEDQMIEMKQEEKFREKRVK 
TNEQSLQEIRDYVKRPNLRLIGVPSSTIVEDSVAIPQGSGTRNTI 

11-54 

A 

1 

558 

MPGNSEAKSYYNAKTVLESCTLLWEQFKKVTKPYWSQGKLLREALES 
DDTCLWLCSILPDEEQEEKHSFAEVSVSPIGLSLTQPPLLYLLLKPW 
HNYHLLTQITAKHNRNTAPPATEQSGTENDFDELREEGFRRLVITNF 
SK1KGEVRTHLKEAJCNLEERLDEWLTRINSVQKTLNDQMELKTMA 

1155 

A 

1 

495 

MKPPKGPPSLQVPSAPPKKLRTDGRITTPSTHSLHWQLIGWCPFTLR 
VGLPLLGHQLNCQS SLATPSQTHPETVLHQPSRHPS IQS SGHLI LTI 
TSPGTMASAASWMRQTGSDHFPSLHLSLRECSSSPAMEQSWTENDF 
DELREEGFRRSVITNFSELKKLKP 

1156 

A 

51 

581 

LRSSSPATEQSWTENDFDKLREEGFR*SNYSELQEEIQTKGKEVENF 
EKNLEECITRITNTEKCLKDLMELKAKARELHEECRSLRSRCDQLEE 
RVS VMEDEMNEMKREGKFREKR I KRNEQS LQE I WD YVKRPNLHL IGV 
PESDGENGTKLENTLQDIIQENFPNLARQANIQIQ 

1157 

A 

286 

456' 

FCHLSSTSWGGADGTCREGGPLGGFMGPSHQ*ESSPVPEAASSFRTF 
KSSAVSQSPL 

1158 

A 

1 

1760 

MDDIPQEARQYRHNQAYAYSIQGDGAEDDDERIVRFHTRVTVDSDTL 
ASDAARLTCRHG LGNQDRS SS PAMEQS WTENDFDELTE VGFRRS VI T 
NFSELKEDWTHRKEDHSAIKLELRIKKLIQNLTTTWKLNNLLLNDY 
RVNNEMKAEIKMFFETNENKDTTYQNLCDTFKAVCRGKFIALNAHKR 
KQERSKIDTLTSQLKELEKQEQTHSKACRRQEITKIQAELKEIETQK 
TLQKINESRSWFFEKINKIDRPLARLIKKXREKNQIDAIKNDKGYIT 
TNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVES 
LNRPITGFEIEAIINSLPTEKRPVPDGFTAKFYHSSADCTRSMAPAP 
ASGEALRLLPLMGEGEGELTCRDH I AREEAQECS S S PATEQS WMEND 
FEELGEEGFRRSVITNFSELKEDVQTHFKEAKNLEKRLDEWLTRINS 
VEKTLNDLM* LKTVA* ELRDTYTS FNSRFDQVEERVS V I EDQMNEME 
REEKFREKRV* RNEQS / LQE I WNYVKRPNLHL I GVS E I DRENGTKLE 
NTLOD I F OENFT YT . A J? O AM TOTn 

1159 

A 

138 

326 

GLYYTGCYGI SRMPDLSRAPWKDS FYLLPVPMVEPG I P P I SGFRGPK 
GQMGEPLLPGYTGKK 

1160 

A 

1 

525 

MKTILSNQTVDIPENVNVTLKGRTVIIKGPRGTLRKD 
LGKXKjKRLRVDKWWGNRKEIxATVRTICSHNGSLVEIRNFLGEKYIRR 
VQMR PG VACS VPQAQKDEL ILEGNDTELV SNSAAL I QQ P TTVKNKD I 
RKFLDG I YVS EKGTVQQVDESDLR V I QLQKEDAR 
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1161 

A 

319 

1035 

EWSSVRRSLVEKRALRRPHPQCLCFRMKTIL\SNQTCRPFPE\NVDI 
TLKGRTVI VKGP/ REGTLRRDFNHI \NVELSLLWKEKKRGFRVDK\ W 
WGNRKEL\ATGRD*FVSHVQNMIKGCYTGASGYKMKVLWYAHFPIQR 
LLFQGELGPSLLKSRNFLGGWKNTSRRVSG*GPGCLLVSVSQGPRKD 
EINPLKG\NDIELVSKFQRALIQQ\ATTVKNKG\IRKFFGWVSMSLE 
KGTVQPGLIE 

1162 

A 

232 

338 

VGIKMSISSDEVNFLVYRYLQESGFSHSAFTFGIE 

1163 

A 

474 

64 7 

EPSLSQLPQNGDLLAARRREHPACSTGCTSGARVRSRVWRAGQALVP 
GCAGCAYILH 

1164 

A 

1 

413 

QLSHKSDSWLVSQSRQRTCRRKYPNYGRRNRSWRELSSGNESSSSV 
RHETSCDQSEGSGSSEEDEWRSDRKSESYSESSSDSSSRYSDWTADA 
GINLQPPLRTSCRRRITRFCSSSEDEISTENLSPPKRRRKRKK 

1165 

A 

2 

2545 

LSMSEE I ILHP WLSLALGGPLQCI S SRAALA1»VFGPYAVGEGLWEK 
IWLSNXHVAPDHLLQICQRIGPMLDKEIPPSISRVTSLLGAGRQSL 
LRTAKDCRHTVWKGSAFAALHRGRPPEMPVNYGSPPNLVEIHRGKQL 
TGCSTFSTAFPGTMYQHIKMHRRILGHLSAVYCVAFDRTGHRIFTGS 
DDCLVKIWSTHNGRLLSTLRGHSAEISDMAVNYENTMIAAGSCDKII 
RVWCLRTCAP VAVLQGHTGS I TS LQFS PMAKG S QR YMVSTGADGTVC 
FWQWDLESLKFRFLSGSRDGTAR I WRFEQLEWRS I LLDMATR I SGRI 
QWFWYHSLDKGHVGVNIQIMVEEIVAGETSCDQSEGSGSSEEDEWRS 
DRKSESYSESSSDSSSRYSDWTADAGIRLQPPLRTSCRRRITRFCSS 
S EDE I STENLS P PKRRRKRKKENKPKKE I WQDRRQLG I ATWWDVAM 
VLSTHLEMGAVELLGCPAVKLGAGIGLMGIVAALLGAHVTVIDRKNL 
RRMTPAE LANMEH L YE FH P PVW I TDTTT/R K ^ P FV P DM RDF V T Y FR OG 
HEAY I E AVRRNN I YELN PNKE P WRKMDLRL KKE FH I LASG KD VGLI/T 
VIQIAGGDQELVKIVGIRYEVGPPTLCCLKLAFIDPATGKLMDKSFS 
IRYHDMPDVIDFLVLRQFYDEARQRNWQSCDRFRS I IDDAWWFGTVL 
SQEPYQPQYPDSHFQCYIVRWDNTEIEKLSPWDMEPIPDNVDPPEEL 
GAS I S VTTDELE KLLYKPQAGEWGQKS RDEECDR 1 1 SG I DQLLNLD I 
AAAFAGPVDLCTYPKYCTWAYPTDLYTIRMRLVNRFYRLVETCFQN 
V 

1166 

A 

1364 

1618 

SQHSGRPRQADHLRSGVRDQPGQHGEILSLLKIQKLAGRAGSRL*SQ 
LLERLRLYHRTPA* VTE * DMAS KNKKKPHR I QARKYF 

1167 

A 

3 

342 

LTQELPGAEAHACNPSTLGGQGGQIMRSGARDQPGQHSGTPSLLKIQ 
K\LAGRGGTHL*SQLLRRLRQENRLN1 J GSGGCSELRLRHCTPAWVTD 
SVSKKNELEKESYLIFSSLT 

1168 

A 

2 

232 

WAGRGGSRL*SQHFGRPRRADHERWKNTWELRQLNLGQAPCSRNGMR 
RYGERRHHPDEPGQPSVEGFLRVLSMCIC 

1169 

A 

1294 

1624 

GQLYEKLGRRGPGAVAHACNPSTLGGRGGWITRSGDRDHPG*HGETP 
SLLKIQKK\LAGRGGGHL*SQLLGRLRQENGVNPGA/RGCSELRSCY 
CTPAWGTERDSVS KKKKK 

1170 

A 

427 

730 

SQHFGRPRRADHLRSGVRDWPGQHGEALSLLKIQKLAECGGVCLWSQ 
LLRRLRQGNRLNIiGDRGCSEPRSYRCTPAWVTEQDSVSGKRKEKGMK 
RKKKKKK 

1171 

A 

6497 

6788 

SQRFGRPGQANCLSSGV*DQPGQYGETLSLLKIQKLGGCGGTCL*S* 
LLGWLRQENHLNLGDGGCSEPRMCHCTPPWTTEGGSA*KLKKKIOCKR 
KYL 

1172 

A 

173 

396 

VNSHPKGATSEHJjEPRGPTSKPLGRTGWTLSLRIiGYSPSLQKAVTHS 
LGCLLVRALEKARSSA^QTNYPSSDN • 

1173 

A 

239 

405 

HSPTPPTSPRASTMEDVKLEFPSLPQCKEDAEEWTYPMRREMQLPVIi 
QKHGITHI 

1174 

A 

126 

915 

SACVSCNPAALLLALRSAGPPSFLPPHPARGSAGCVTIiSHPTHQPAG 
QHHGWTVKLE\FPSLPQCK\EDAE\EWTYPMRREMQ\EILPG\LFLG 
P YS SAM \ KKQD P S LLAVS SHGQENE FLPLLTRVQS CHGG F YPHDLNL 
KLIIFQRLHHHILGLPVLQKHGITHIICIRQNIEANFIKPNFQQLFS 
AAFVIAYIMETFGMKYRDAFAYVQERRFCINPNAGFVHQLQEYEAIY 
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LAKLTIQMMSPL/ RDRKVI ICSFWYHRQFEENT 

1175 

A 

1 

924 

MLRARPRPGSPWRPAREEPTGTGRALWAGVAAPHQSRAPRLGRRRV 
RVLLSCTRCLGSARRGPPPGFEDGGVATGSDPAMASDGNAKACALHL 
PGASTCPVARATGSSATPTLRTTPGRETSYPKNNPNAHGDFLKINRA 
YEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGKDGLVNV 
GWMDCATQDNLCKSLD I TTSTTAYFPPGATLNNKEKNS I L FLNS LDA 
KEIYLEVIHNLPDFELLSANTLEVGRFDCSSAPDICSNLYVFQPSLA 
VFKGQGTKEYEIHHEYLANSKLPFKT 

1176 

A 

441 

709 

I RTC IRKRMGVWLNKDD YIRDLKM 1 1 LCCL I VYMA ILVGTDHDYYS L 
LGVSNTASSRDIRQAFKRLASKLHPDKYPNNPYAHGDFLKIK 

1177 

A 

109 

437 

NQRRKWRRSRTQLQTLQEALKAEIQGHQKIiAAQMKQDPQNADL*KQL 
YELQAXITALSEKQKRWEQLRKNLIVKQEQPDKFQIQPLPQSDNKL 
RTAQQQPLQQLQQQQ 

1178 

A 

343 

670 

RMELQTLQEALKVEIQVHQKLVAQMKQDPQNADLKKQLHELQAKITA 
LSEKQKRWEQLRKNLIVKQEQPDKFQIQPLPQSENKLQTAQQQPLQ 
QLQQQQQYHHHHAQQ 

1179 

C 

130 

390 

MAEQSLISGGPKPKSVNSLRWINLXXXXXXXXXXXXXXXSXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSWVDK* 

1180 

A 

238 

435 

KKYLKSWVMEGGACSEPSLHHCTPAVATTERDSVSKKIKNKMGLTNYL 
I KSDNTI SQGGRGGYSHAR 

1181 

A 

615 

947 

QQQQQGWGTEQKDPYASCLCWEAPLPQGPTQVSQERHWFPLSLLTC 
FSALRRGAWSPSPRRTHPQEQCGLWGRPPLEEPPHPLSERTRRPPVT 
PAEAEPCLPTPKHSVY 

1182 

A 

232 

564 

GLQIIQSLCLVAFWKKRVFGGQAGWLAESWLMIX3WLVFPLILASQMY 
VPGKLHDVEHVLIDVGTGYYVEKVSESMLDAPLNREGKFRGSSIAQH 
GQRESPFSPLFTSDLVE 

1183 

A 

3 

487 

LPNMAQSINITELNLPQLE\MLKNQLDQEVEFLSTSIAQLKWQTKY 
VEAKDCLNVLNKSNEGKELLVPTDGVI,CMSPGKLH\DVEHVLID\VG 
TG YYVKKTA\ EDAKDFFKRKI \ D F \ LTKQMEKI QP ALQE KHAMKQAV 
M \ EMM S \ QK I QQLTA \ LGEAQ ATAKA 

1184 

A 

187 

423 

NLRQRKQGSSSYSLRRAHGQFSRKFVGQFLLLLFFRHSLAVAQPTSF 
SGTISAHCSLHFLGSSDSPASASQAAGTTAPA 

1185 

A 

539 

871 

S PW YLWRT YLCREAVDS EVTCRKNVS FPLHMLL I FP I KFVS FLEKI E 
RTVQIEASTVEIEERGVKLRLTVVDTPGYGDAINCRDVfYAPPCPGIC 
ICLLNILHKFIDSVNE 

1186 

A 

158 

1330 

SVDLVIHPLWPPEVLGLQQQPTQFINPETPGYVGFANLPNQVHRKSV 
KKGFEFTLMWGESGLGKS/TLS*NSLFLTDLYPERVIPGAAV*FFS ! 
RKN*KELVQIEAST\VEILRARGSSLRLASG*DTPG\YG*PLFN\CR 
DCF*GQFISYYLMRQF*GGYL\HDESG\LNRRHIIDNRVHCCFHFIS 
PF\GHGLQPLDVAFMKAIHNKVNIVPVIAKADTLTLKERERLKKRIL 
DEIEEHNIKIYHLPDAESDEDEDFKEQTRLLKASIPFSWGSNQLIE 
AKGKKVRGRL YPWG WEVENPEHND FLKLRTML I THMQDLQEVTQDL 
H YENFRS ERLKRGGRKVENEDMNKDQ I LLEKEAE LRRMQEM I ARMQA 
QMQMQMQGGDGDGGALGHHV 

1187 

A 

405 

632 

PSKRDFADGINVFWFFGVFFETESHSVTQARVQRCKHSSLHPRVPSP 
RPSSHVSCPVAGTIGTRHYVQLILKNFL 

1188 

A 

111 

377 

PINMSCERKGLSELRSELYFLIAJRFLEDGPCQQAAQVLIREVAEKEL 
LPRRTDWTGKEHPRTYQNLVKYYRHIAPDHLLQI CHRLRPL 



^* 4. O 

1 r n r 

575 

CLKEKAQGS P FALP FCHVRMQQKG 1 1 FEADGKPS PDTE S AGTL I LD F 
PA 

1190 

c 

216 

458 

Ml^niDRTSRNRCVCDVFRNAVKGQTVLGRQLFIGRVRVPVRPGEGFLP 
WGFLPVSPWCPSWGLSTHALWWAEAVPGRALVH* 

1191 

A 

1 

594 

RTRGRTRGLLPSGAFEPEAAGSATAPRGCKNPGAKGGLL AAMAGRQD 
I FHAI VKADERFHGEGYREGYEKGS ILGVKEGRQHGTLHGAKIGTE I 
GRYQRFAFAWKCLLH\IAPLRRTA*T*RS*HAWIGRIQIFPYDDPTY 
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DKLHEDLDKI RGKFKQVRALCVSSC * EHL I APFPGGAAHSRGRCGNG 
LGLRAPRNCP 

1192 

A 

482 

789 

VGGPGQFLAISLTHLIGEAPSPACtiAHSTQQGLPGEALHFHVTFELQ 
LQSPCPHGGLRQVPQGPLQGPPGLLSLPGPPPHCLGPHPAPGWWVGG 
PALGSACA 

1193 

A 

572 

677 

SLWILPFLFVFFSFETRSRSWQAGVQWCNHSLLR 

1194 

A 

1 

396 

KGGSMHM YAKNF YGGNG I VGAQGQ I FEAYNMAALWKX.PC I F I CENNR 
YGMGTSVERAAPST\DYYKRGDFIPGLRVDGMDILCVREATRFAAAY 
CRSG KG P I LME L \ QT YR YHGHSMS DPGVS YRTREE I QE 

1195 

A 

641 

723 

LLPG PD I S KLYNMAALWKLPC I F I C EN 

1196 

A 

354 

1145 

KQISCINRKLFVVSVTCVMVRKLAVWAWRPASTPQTISSQPTGLTAL 
LSPGAFPSEKFSQSLQDEKEWLKGKEDRCTCMPRTSTGAMASWERR 
CPWAAG I ALACKYNGKDEVCLTLYGDGAANOGO T FP AYMMAALWK7.P 
CIFICFJWRYGMGTSVERAAASTDYYKRGDFIPG\LRVDGMDILCVR 
EATY\ IGRAYCRSGKGPILMELQTYRYHGHSMSDPGVS YRTREE I QE 
VR\ S KSDP I MLLKDRMVNSNLAS VEELKG I 

1197 

A 

2 

5Q6 

ELSTPLCWDSKTilGYGSAAQHLPYLTSIiLPMKLLQYCLADAWVAEIG 
LAFNILEMWFAFATITPLMFTGYGLI.FLSLVITPVIRAKLSKLVRE 
TEQGALFS AEACVNSLTMLTASG I FNS LYPATLNFMKGVPFLLGAGL 
MLIPAVLIEMLEQADPHIiEFQQVPQSP 

1198 

A 

1 

1455 

MEGSASPPEKPRARPAAAVLCRGPVEPLVFLANFAIjVLQGPLTTQYL 
WHRFSADLGYNGTRQRGGCSNRSADPTMQEVETLTSHWTIjYMNVGGF 
LVGLFSSTLLGAWSDSVGRRPLLVlASLGLLLQAIiVSVFWQLQLHV 

GVAGMLASLLGGHWLRAQGYA^PFWIiAXjALIjIAJ^TLYAAFCFGETLK 
EPKSTRLFTFRHHRS I VQLYVAPAPEKSRKHIiALYSLAI FWITVHF 
GAQDILTLYELSTPIiCWDSKLIGYGSAAQHIiPYLTSIiIjAX.KLLQYCIj 
ADAWVAEIGLAFNILGMWFAFATITPLMFTGSIYRFQGYESDAIGH 
HYSASYASEPSCDEGLIIFLIQGVRIREGYGLLFLSLVITPVIRAKI* 
SKLVRETEQGAX.FSAVACVNSLAMLTASGIFNSLYPATLNFMKGFPF 
LLG AGLLL I PAVL I G 

1199 

A 

509 

840 

QVAKPRQGQSPFLLDYCLPTLTIKEVLPGSNLSPTWGMDRSSAAFPL 
DAPLPS P I FLMS VCFSTGYGLLFLSLVITPVI RAKLSKLVRETEQGE 
CQNPPVCGSSCPGIRR 

1200 

B 

109 

267 

MEETPCRELEEEEEWGSGSEDASKKDGAVESISVPDMVDKNLTCPEE 
EDTVKV* 

1201 

A 

856 

1030 

VP1^QVGDEKQDSPNGEHWHGQE\DSTAEPAEVKAMMSVAVV*KNLT 
TPPPLLGRSSV 

1202 

A 

67 

264 

PTWPVGTWHESAPTQMPLPSLSSECCVPPGMACSWTGLNRSTRRKQE 
RQVPPLWRWPSARRCLRWP 

1203 

A 

3 

1253 

SFNIRGPHSLCFPNHHLPHATYRDKAKMKLPLLLALIiFGAVSALHLR 
SETSTFETPLGAKTLPEDEETPEOEMEFTPPPFT.RFFFFWG^R^FnA 
S KKDGAVES I S VPDMVDKNLTCPEEEDTVKWG I PGCQTCRYLLVRS 
LQTFSQAWFTCRRCYRGNLVSIHNFNINYRIQCSVSALNQGQVWIGG 
R I TG SGRCRR FQWVDGSRWNFAYWAAHQPW S RGGHCVAXi CTRENDAP 
HLESLETQADLGQDLDSSKEQERDLALTEEVIQAEGEEVKASACQDN 
FEDEEAMESDPAALDKDFQCPREEDIVEVQGSPRCKTCRYLLVRTPK 
TFAEAQNVCSRCYGGNLVSIHDFNFNYRIQCCTSTVNQAQVWIGGNIj 
RGWFLWKRFCWTDGSHWNFAYWSPGQPGNGQGSCVALCTK 

1204 

A 

1403 

3362 

PGVGLSVIQLWHYLFIGTLNSLLTNMAGGDMARVSTYTNAFAFTQFG 
VLCAPWNGLLITORLKQKYQKEARKTVPPLLPRRQRRRGHRAVRLKIiR 
GGRVLHAEKTPAPARLRAGPTGLRTRRESVAIiPEASAAAGCSFAGDR 
RLGASMAPTLQQAYRRRWWMACTAVLENLiFFSAVIiLGWG SLL I ILKN 
EGFYSSTCPGKSEKEDRDGKREGLIHWPLWLKAVGRRQYSPPTFNR 
APLPARWVFLHEKAYQVRDTAI ES S WTKVKGSGL YANRVMDVSD YV 
TPPQGTSVFVIITKMIVTENHMQGFCPESEEKYRCVSDSQCGPERLP 
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GGGSETSTFETPLGAKTLPEDEETPEQEMEETPCREIiEEEEEWGSGS 
EDAS KKDGAVES I S VPDMVDKNI»TCPEEEDTVKWG IPGCQTCRYLL 
VR S LQTF S QA W FTCRRC YRGNL VS I HNFN I NYR I QCS VS ALNQGQVW 
I GGR I TG S GRCRR FQ WVDG S R WNFAYWAAHQP WS RGGHCVALCTRG S 
STLAVALCSTVPSLALTSLLCLGFALCASVPILPLQYLTFILQVISR 
SFLYGSNAAFLTLAFPSEHFGKLFGLVMALSAWSLLQFPIFTLIKG 
SLQNDPFYVNVMFMLAILLTFFHPFLVYRECRTWKESPSAIA 

1205 

A 

388 

1711 

ACALGMAPQRKRRKQLSMKTGSYSQRSSFQRRKRPQGQGRSRNSAPS 
G ATL S LGALAWHL E WLS V I QL WHYLF I GTLNS LLTNMAGGDMARVS T 
YTNAFAFTOFGVLS APWNfiT.T .MTTRT .KOKYntfP Zi o tern o pmt t> n tvljot 

YFPNHHLPHATYRDKAKMKLPLLLALLFGAVSAIi\HLRSETSTFETP 
LG\AKTLPEDEETPEQEM\EETPCR\ELGGRGREWGLWEVEDASQER 
WGLFES ILQCPDMGGTKTFTCS *GKEGHQLKVGWGIPGGCPRPARYL 
PG*ESLSRRFSSKLWFYFARSC\YRGQTWVFHSTTFQY*L»SESKCSV 
\SALN\QGQVWIG\GRI\TGSGR\CRRFQWVDGSRW\NF\AYWG\AH 
QPW\SRGGHCVALC\TRG\GYWA\RAH\CLQKNFPFICFPLSWSQPA 
VQWPSPGQLPPLLCLPSLPPPPCNNNGFY 

1206 

A 

97 

441 

MAPEQD P KPKVQEGERVLC FHGPLL YEAKS VKVA I KDKQGKY F I HYR 
GWNKNWDEWLPESRVLKYLDTNLQKQRELQKANQKTKNNKQKTPGNG 
DGGSTRETPQPPRKKRARVDP 

1207 

A 

19 

432 

I^PKQDPKPKFQEGERVLCFHGPLLYEAKCVTCVAIKDKQVKYFIHYS 
GWNKKSAVRPRRSEKSLKTHEDIVALFPVPEGAPSVHHPLLTSSWDE 
VfVPESRVLKYVDTNLQKQRELQKANQEQYAEGKMRGAAPGKRHL 

1208 

A 

165 

257 

TVR I GAMLAYTPLDEKS LALLLNYLHDFLK 

1209 

A 

80 

1239 

GAWGGGGRRRRRRITYKCLPKLDPKPKFQEGERVL.CFHGPLLYEAK 

LQKANQEQYAEGKMR\GAAPGKKTSGLQQKNVEVKTKKNKQKTP\GN 
\ GDGG \ STSETP\QP PRKKRARVD \ PTV \ ENE ETFM \ NR VE VKVKI P 
EELKPWLVDD\WD\LITR\QKQLFYLPAKKNVDSILEDYANYKKS/R 
VGNTDNKEYA/VLNEWAGD*KNTFNV\MLG\TQLLYKFERPQYAEI 
LADHPDAPMS PG VWSAQHLLE 1 1 WYR IGAMLAYTPLDE\ K\ SLALLL 
NYLHDFLKYLAKNSATLFSASDYE\VAPPEYHRKAVWKSTIjTHLCLD 
LRKHIFVLSLSPWYKPMCFEDVSV 

1210 

A 

31 

1029 j 

WDRV\ k\ dlatrvp wtvl keqrtetyvs q feglrlgens * tlkliAd 
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RNERQFKEEKLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGRD 
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KDLIKSMLRNERQFKEEKLAEQLKQAEELSMVVSAGPWSSEKAEMNI 
LEINEKLRPQIJ^NKQQFGNLKERCFVTQIAGFIJ^QQKKYNYEECK 
DIjIKFMLRNERQFKEEKLAEQLKQAEELRQYKVIjVHSQERELTQLRE 
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ESNVNAKKQLPVGIKCEPMDQECIPGSRACTLLVRFSNGGPPMDPGS 
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GYLELTDSCQPYRSAFYILEQQRVGWALDMDEIEKYQEVEEDQDPSC 
PRLSRELI.DEKEPEVLQDSLDRCYSTPSGYLELPDLGQPYRSAVYSL 
EEQ YLGLALDVD / E * VP Y YEDE ETE AQRS EDTGAG SHS W 

1237 

A 

3 

1270 

RTRVSFNIiHVPIGPPCGNDISQLSNGHLEQEYDLYMEDSVDPYHLHR 

JSJSJjAo 1 AJ\.VV JSJtvl IS. 1 tlrU\}r\jN r IiyijWMJ_iI\l£> VJU I FCjDAREVQKAElioKV 

PPn^T.FRPATTPQMQHnPr , nQTJOPHTOJTT^TT , T?T?P , riT?'\7TvTQ r PT \7\mDT?C 
f c*Lj<juE*Ei\—j-\jL x vol* ox»vjt'^i^oi>iyjrrii\i > i ± xvi j. c ,e»jc*j~jjc» vxn o il V v ukdo 

SHDEWQDALNILPEIEKYQEVEEDQDPSCPREVQKAEESKVPEDSLE 

ECAITCSNSHGPCDSNQPHKNIKITFEGYRARIRIMGALGGPRITDG 

IAQCMEMAQQDEGPACGNDVSRVSKGHLEQEYHPYLEDQVEAYHLHS 

KVXiTVH VKS QAKVP S MG PD P S FS E A P KL S R ELLDE KG P E VLQD S LDR 

CYSTPSGCLELTDSCQPYRSAFYVLEQQHVGx^VDMDDPRQYLGLAL 

DVDSEYLTVKVISLHWFQIGVIFLFQVPLTDPRDVIAAGRTYGRI 

1238 

C 

160 

267 

MGPLTLSS S S LHTETFLC P I LTQGHQS CQCENRRL * 

1239 

A 

1 

1233 

MLQIPKQQQNEKYQVPQFDQSTIKNIESAKGl^VWDSWPLQNAIXSTV 

>vl1i IlMoinv v r J±±jM\j& c m\±J£+±J1J ± o lint jl ^ IV V kjJjrJo _L Do i\T*/\LjK V r lv 

DSDKFDANDPIIiKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQS 
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LKGPGGLFCTTGSSRLGAAQAEERSNPGEGPSAAGTGKSLSPLRLSF 
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RSLNEHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQKLSPEND 
NDDDEDVQVEVAEKVQKSSAPREMQKAEEKEVPEDSLEECAITYSNS 
HGSYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEWEDAVHIIPENES 
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CQPYRSAFYVLEQQRVGFAFDMDEIEKYQEVEEDQDPSCPRLSRELL 
\ GEKEPE VLQDSLDRC YS I PSG YLELPDLGQP YS S AVYS LEEQYLGL 
ALDVDRIKKDEEEEEDQDPPCPRLSRELLEWEPEVLQDSLDRCYST 
PSSCLEQPDSCQPYGSSF\YALEEKHVGFSLDVGEIEKKGKGKKRRG 
RRSKKERRGGRKEGEEEENPPCPRIiSRELLDEKGPEVLQDSLDR*YS 
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WDEGYSTLSIPPEMIiASYKSYSSTFHSLEEQQVCMAVDIGRHRWDQV 
KKEDQEATGHRLSRELLDEKGPEVLQDSLDRWYSTPSGCP*LTDYCQ 
P YRS AFLHYWSQQRI GLAVDMDE I EKYQEVEEDQDPS C PRLSRELLD 
EKEPEVLQDSLDRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLAL 
DVDRTKKDQEEEEDQGPPCPRLSRELLEWEPEVLQDSLDRCYSTPS 
SCLEQPDSCQPYGSSFYAIiEEKHVGFSLDVGEIEKKGKGKKRRGRRS 
KKKRRRGRKEGEEDQNPPCPRLSRELLAEKEPEVLQDSLDRWYSTPS 
VYLGLTDPCQPYRSAFYVLEQQRVGLAVDMDEIEKYQEVEEDQDPSC 
PRLSRELIiAEKEPE\njODSLDRWYSTPSVYL.GL.TDPrnPYP<?aPVVT. 

EQQRVGLAVDMDEIEKYQEVEEDQDPSCPRLSRELLDEKEPE\VIiQD 
SLGRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLAIiDVDRIKKDQ 

eee/eqdqgppcpr\lsrellewepevlq\dsldrcystpsscleq 
pdscqpygssfyaleXekhvgXfsldvgXeiekkgkgkkrrgrrstk 
krrrrgrkegeedqnppcprlsgvlmeveepeilqdsl\dryystps 
my felpds fqh * rs vf ys feeqh i s faldmdnrf ftltvtslhlvfq 

MGVIFPQ 

1244 

A 

148 

197 

DPLGFL*QKRNQQEDD 

1245 

A 

1333 

2383 

RMKKEHVXiHCO FS AWY P F FRGVT T TC 9 V T T , PT .POMWDYT .T .nr>f3TT A7V 
SGRDDPPTHSQPDSDDEAEEIQWSDDENT\ATLT\APEFP*SLPLKV 
QGSYQIPLGGQVSFPKV*FGS\APRD\AYWIAMNSSLKCKTIiSDIFI. 
LFKSSDFITRDFTOPFIHCTDDSPDPXCTEYFT^VTjFKWPPMTPrinX P 
FR C FVKENKL I G I S QRD YTQ Y YDH I S KQK\ EE I RRC I QD F FK\ KH I P 
VQIL*MKDLVFDIYR\DSRG\KVWLIDF\NPFGEVTDSLLFTWEELI 
\SENNFKRRFLVKVDAQEQDSPSFSVAQTSEVTVQPQPLICSYRL\P 
KDFVDLS\TGEDAHKLIDFLKLKRNQPEGR 

1246 

A 

1 

322 

SSCHGWILGEHGDSSVAV>JSGViWAGVSIK3ELNPEMGTAraSENWKE 

VHK1WVT3SAYEVIKLKGYTNWAIGLTC 

LWDIGCRNLKTTM 

1247 

A 

68 

1224 

LCLLRTTVTEVSRAFSLLCKMATLKEKIiIAPVAEEEATVPNNKIT\V 
VGVGOVGMA\ CATSTPfiKVciWPnPT.V 21T >\m\7T unia ktcmmtit nunc 

LFLQTP * NCGQDKD Y / S LWTANS * DL * WVTAGS / RRQPRKGE S G SNP 
VAEKWLMS FKFI\ I PQDRPSTVPD\ CI 1 1 GG FN PQLD I PTYVT \ WET 
KVGLPKH\RVIGSGC1^DS\ARFRYIJ4AEKTWHFIPASC\HGW/IFW 
GEHGDSSVGLV\vmGvWAGVSLQELNP\EMG\TOOTSEKW\KEVHK 
DGGLKS P YEVI \ KLKG YP \NWGYLDLS VA\ DLI E\ SMLKNP IPGFHP 
VSTNG*RGMYGH/VENEVFPGAFPCIL\NA\RGLTSVIN\QKIiK\DD 
EVAQL\ KKS ADTLW\ D I QKDLKRPVTSEL 

1248 

A 

66 

703 

RRRRLPSVAIMIILPGPSSSHDEMF\SDIYK\IRGDRGRGLCLEGGR 
GRWVSRTE\GTID\DSLIGG\NA\SAESPRGAKGTPPHK* c ;TGV\nT 
VWJHHLPGNKFSQKEAS KKVHQR I TMKS I \ KGKP * KNRR PRKS KTFL 
* QG AAEQ I KH I LANF KKLQF/ YF I GENMN PRWA WVLLLD Y\ * RDGVP 
P \ YM I F FK\ DG \ LEMEKMLTNVA I I LDLS PCH P 

1249 

A 

1 

521 

mkitosrnkgiwewqaegirskrsvgkqrpskakipsgdkngvslt™ 
eviiwdnples1toekegqeatcsrpqivp/efqq**lfrpe*wrras 
gnlqli pkkrvsa* gtrgs ps kkgervrrdaqqtatw * trspasgcf 
qicegnkqdeacdvrglqhcerhslagpreymp 

1250 

A 

3 

170 

SCEQIAVQVNNGDAGREMPCPLPCDEEIISSEDSEGSTDVDEPLEVF I 
ISAPRSEP 

1251 

A 

3 

251 

YYSDLVPLSYIES^gj^LLSAVIGQDHDFSESSEEEAPAEASSGAIx 
RSKHGEKGFSSSDFSDLSNGEELQETCSSSLRRGS 

1252 

A 

23 

2669 

GG KMAGGGGDLSTRRLiraC I SPVAN^ 

DDRLLYDIVFKHFKRNxWEISNAIKKTFPFLEGLRDRDLITN)^ 

SQDSCRNLVPVQRWYlWLSELEKTFNLPVLEALFSDVx^ 

HIYKGFENVIHDKLPLQESEEEEREERSGLQLSLEQGTGENSFRSLT 

WPPSGSPSHAGTTPPENGLSEHPCETEQINAKRKDTTSDKDDSLGSQ 

QTNEQCAQKAEPTESCEQIAVQVNNGDAGREMPCPLPCDEESPEAEL 
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HiraGIQINSCSVRLVDIKKEKPFSNSKVECQAQARTHHNQASDIIVI 
SSEDSEGSTDVDEPLEVFISAPRSEPVINNDNPLESNDEKEGQEATC 
SRPQIVPEPMDFRKLSTFRESFKKRVIGQDHDFSESSEEEAPAEASS 
GALRSKHGEKAPMTSRSTSTWRIPSRKRRFSSSDFSDLSNGEELQET 
CSSSLRRGSGSQPQEPENKKCSCVMCFPKGVPRSQEARTESSQASDM 
MDTMDVENNSTLEKHSGKRRKKRRHRSKVNGLQRGRKKDRPRKHLTL 
NNKVQKKRWQQRGRKANTRPLKRRRKRGPRIPKDENINFKQSELPVT 
CGEVKGTLYKERFKQGTSKKCIQSEDKKWFTPREFEIEGDRGASKNW 
KLSIRCGGYTLKVLMENKFLPEPPSTRKKRILESHNNTIiVDPCEEHK 
KKNPDASVKFSEFLKKCSETWKTIFAKEKGKFEDMAKADKAHYEREM 
KTYI PPKGEKKKKFKDPNAPKRPPLAFFIiFCS EY RPKI KGEHPGLS I 
DDVVKKLAGMWl^^AAADKQFYEKKAAKLKEKYKKD IAAYRAKGKPN 
SAKKRVVKAEKSKKKKEEEEDEEDEQEEENEEDDDK 

1253 

A 

97 

1609 

GGKMAGGGGDLSTRRLNECIS PVANEMNHLPAHSHDLQRMFTEDQGV 
DDRLLYD I VFKHFQR\NKVE I SNAI KKTFPFLEGLRDRDL I TNKMFE 
DSQDSCRNLVPVQRVVYN\VLSELEKTFNLPVLEALFSDVNMQEYPD 
FKFHIYKGFE\NVI\HDKLPSPRKVEEEEKGRRGLGLPTKSLNKGTG 
ENSFRKPDFGPPSGFPHPPCLGTTPPE/NMGLSEHPCETEQINAKRK 
DTTSDKDDSLGSQQTNEQCAQK71EPTESCEQIA\VQVN\MGDAGREM 
PCP\ LPCDE \ ES PRGKS LHNHWNPKFNSLVXiCELVD I KKEKPFSNS K 
VE\ CQAQARTHHNQASDI I VI S S ED SEGSTDVDEPLEVF I SAPRSE P 
VINNDNPL\ ESNDEKEG\ QEATCSRPQI VPE\ PL»I FRKLFT\ FRES F 
RKRVIG/QKTHDFSESQ*GGGAPQEASSGGTEEARHGEKAPIDF*EV 
HLTWEEYPSR\ KETFPAVSDFSDPE * MGERAFQETC \ S S S \ LRRGLG 

1254 

A 

3 

274 

FFASLLESPVSPRXiAMDPNCSCTUVGVSCTCAGSCKCKECKCTSCKKS 
ECEAISMVWGCG*GCCSCCP/AAASKCAQGCVCKGASEKCSCCD 

1255 

A 

788 

1174 

PSCTRLLRRRQNQRSLKTNI S PRKRATVLiKLPSQMLGLVPGVDGRSP 
RGGRGGLGWRSCFLSDGEWILRTGSVGSGLVGSRGSAGGPRLEMDPN 
CSCTTGGSCTCAGSCKCKECKCTSCKKSCCSCCP 

1256 

A 

80 

231 

IRPLPPRFKTESRSLPGPCLQPGTFLWSRNRRVLGFPSMNGEDMGLL 
FLCSEWERS S EGWLCTEKGEVDTQLNPTAVPS C I SIiTAHCVFLFLVG 
GSCTCAGSCKCKE\CKCTSCKKSECG/CH/PPGIWGCG*GAWFSQHE 
WRGHGASLPLI* 

1257 

C 

81 

476 

MGPRWYKKKKSRFVTCLSVYIWEKGCPWHQFAAAALSLTPLCKHNPW 
GTLANNPRGGREQAALS CRRVHLH S FALAGNQRRVQGGHPAGAGSS W 
GVHLQARGRDWDFSQGEEAKKHVGQVEGWDPNSRPLL 

1258 

A 

3 

452 

QTQRE PTMVLS PADKTNVKAAWGKVGAHAGEYGAEAIiERMFLS FPTT 
KTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDL 
HAHKliRVI)PVNFKLLSHCLLVTLAAHLPAEFTPAVHAS LDKFLASVS 
TVLTSKYR 

1259 

A 

9 

486 

NS ARATD S ERTHHGACLLPDKTNVKA\ AWGKVGAHAGE YGAEALERM 
FLSFPTTK\ T YFPHFDL \ SHG \ S AQG * RAHGK\ KVA\DAIjTKRRGAT 
WDDM/ PQTALSALSDLH\AHKL\RVGPGSTFKLIi\ SQLPCLGEPWAA 
HLPA\ EFQPLAVARLPWNKVS WGFC 

1260 

A 

3 

497 

PTLLVPTDSERTHPWLLS PADK\TtlVKA\ AWG\ KVGAHAGEYGAEAL 
ERMFLSFPTTKTYFPHF\DLSHG\SAQV*GPRARKVADAIi\TNAVAQ 
RGTDIAQRACPPLSDLH\AHKL\RVGPGSTFKLLKATC/HCLGEPWA 
AHLP\AEFQPLAVARLPWGQSFLGFLLKQRC 

1261 

A 

1 

1077 

MLSGVGGFVLGLLFLGAGLFIYFRNQKAEESFVSAI.SIDLSGGGNMA 
LLSMVCLKFPGG\ SCMAALTVTLMVLS S PLALAG\ DTR\ PPVRLRKT 
EDEPI/5CVLSGLRVGPDSVFPGGRFCNRIVLVPPARFLEQVKHECHF 
FNGTERVRFLDRYFY&QEEYVRFDSDVGEYRAVTELGRPDAEYWNSQ 
KDLLEQKRAAVDTYCRHNYGVGESFTVQRRWPEVTVYPAKTQPLQH 
HNIiLVCSVNGFYPGSIEVRWFRNGQEEKTGWSTGLIQNGDWTFQTL 
VMLETVPRSGEVYTCQVEHPSLTSPLTVEWRARSESAQSKMLSGVGG 
FVLGLLFLGAGLFIYFRNQKGHSGLQPTGFLS 
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1262 

A 

3 

825 

LFS SMVCLKLPGGS SLAALTVTLMVLS SRLAFAGDTRPRFLELRKSE 
OTFFNGTERVRYLDRYFHNQEEFLRFDSDVGEYRAVTELGRPVAESW 
NSQKDLLEQKR\AAVDNYCRHNYGVGES FTVQRRVHPQVTVYPAKTQ 
PLQHHNLLVCSVSGFYPGSIEVRWFRNGQEEKAGWST\GLIHNGDW 
TF\HTIiVMLEWPR\SEEVYTC\QVEAPRA*QAPLTVE\WRARSESA 
QS KMLSGVGGFVLGLLFLGAGLF I YFRNQKGHS GLQPTGFLS 

1263 

A 

11 

885 

DLP AS LAPG P VL FS SMVCLKL PGGS CMT ALTVTLMV\ LS S PLALAGD 
TRPRFLWQPKRECHFFNGTERVRFLDRYFYNQEESVRFDSDVGEYRA 
VTELGRPDAEYWNSQKDFLEDRR\AAVBTYCRHNYGVGESFTVQRRV 
QPKNTTVYPSKTQPLQHHNLL/VFCSVSGFYPGSIEVRWFLNGQEEKA 
GMVSTG\Ll\QNEDGPF\QTIiVMLETSFFGVERVNT\SQVEHPKCAR 
P\IiTVE*RARSESAQSKMLSEVGG\FVLG\LLLPLGPGIiF\lYFRNQ 
KGHSGLQPTGFPELKCR 

1264 

A 

25 

628 

EFHRLRENPPWCLSPADKTNVK/APAWGKVGAHAGEYG\SEALER\M 
VLFPPPTPKPYFPHF\DLSHG\SAQV*GPRARKVADAL\TNAVAQRG 
TDIAQRAVPPIiSDLH\AHKL\RVGPGSTFKLLKATC/HCLGEPWAAH 
LP\AEFQPIiAVATSSLGTKFPGFLVEAPLLTFQITFKGWKLWLAIVF 
LPFGLPPSPSSPFLHPYPRGL 

1265 

A 

1 

625 

CKF IRVMAHTRLRLLPLRRKKAHLME IQVNEG TVAEKLDWARERLEQ 
QVPVNQVFGQPEM I DV I GVTKGKGYKGVTSRWHTKKLPRKTHRGLRK 
VAC\KDGKLIKNNASTDYDLSDKSINPLGGFVHYGEVTNDFVMLKGC 
WGTKKRVLTLRKS LLVQTKRRALEKIDLKF IDTTS KFGHGRFQTME 
EKKAFMGPLKKDRIAKEEGA 

1266 

A 

1 

1251 

MASKLRLSDMTGEKGLNLAPI KI ILNSLFEDGSS IGLQGAQGAVDKP 
CGLEQPVWPTKCDCYTASPKCSSGHCGKVKSLPKDDPSKPVCLTGF 
LGYKAGMTH I VQEVDRPG S KVNKKEVVEAVTI VETLPMMVVG I VG YV 
ETPQGLRTFKTVFAEHI SDECKRRFYKNWHKSKKKAFTKYRKKWQDE 
NGKKQLEKDFS S MKKYCQKKAHLME I QVNGDTMAKKXiDWARERFEQQ 
LPVNHVFGQDEMIDI IRMTKGKGYKGVTSHWHAKKLPRKTHQGPRKV 
SCIGAWHLVHVAFSVARTWQKGYCHHTEINKKIHKIGQGYIilKDDKL 
IKNNASTDYDLSDKSINPLGVFGHYGEVTNDFVMLKGCVTKQRALQK 
TDLKFTDTTSKFGHGRLQTMEEKKASMGRLKKDQIAKEERA 

1267 

A 

1 

903 

KCSSGHCGKVKSLPKDDPSKPVCLTGFLGYKAGMTHIVQEVDRPGSK 
VNKKEVVEAVTIVETIjPMMVVGIVGYVETPQGLRTFKTVFAEHISDE 
CKRRFYKNWHKSKKKAFTKYRKKWQDENGKKQLEKDFSSMKKYCQKK 
AHLMEIQWGDTMAKKLDWARERFEQQLPVNHVFGQDEMIDIIRMTK 
GKGYKGVTSHWHAKKLPRKTHQGPRKVSCIGAWHIiVHVAFSVARTWQ 
KG YCHHTE INKK IHKIGQG YL I KDDKL I KNNAS TD YDL SDKS INPLG 
VFGHYGEVTND FVML KG C V 

1268 

A 

1 

1131 

MSHRKFSTPRHGSLGFLPWKRSSRHCGKVKSFPKDDPSKPVHIiTAFL 
G YKAGMTH I VRE VDR PG S KVNKKEWEAVT I VERPP VG I VGCVETPQ 
GFRTCKT^AEHISDECKRRFYKNWHKSKICKAFTKYCKKWQDEDGKK 
QLEKDFSSMKKYCQVICVIAHTQMQLLPLCQKKAHLMEIQVNGGTVA 
EKLDWAGERLKHQVPWQVFGQDEMIDVIRVTKGKGYKRVTSRWHTK 
KLPRKTHQGLCKVAC I GAWHPARVGFS VG YLI KDGKL I KNNAS TD YD 
LSDKS INPLGGFVHYGfiVTNDFVl^KGCVVGTKKWVLTLRKSLIjVQT 
KQRALEKIDLKFIDTPSKFGHGHFQTMEEKKAFMGPLKKDRIAKEEG 
A 

1269 

A 

1 

J 345 

WALPAGFDGVMSHRKFSAPRHGSLGFLPRKRSSRHRGKVKSFPKDDP 
SKPVHLTAFLGYKAGMTHIVREVDRPGS KVNKKEWEAVT I VETPPM 
VFVGIVGYVETPRGLRTFKrVFAEHMSDECKRRFYKNWHKSKKKAFT 
KYCKKWQDEDGKKQLEKDFSSMKKYCQVIRVIAHTQMRLLPLRQKK\ 
AHLM\EIQV\N\GGTCARESWDWGPREGLKQQVPVKPSVLGRDE\MI 
ELHRG*PKGQKAYKGGHPWWHTQESCPRKD/HHPRACAKVACIGA\ 
FHPARV\ AFSVARAGQ\ KGYHPRT\ EINKKI \ YKIGQQ\ YIjIKGG\K 
LIKNNASTDYDLSDKSI\NPLGGFVHYGE\VTNDFVMLKG\CWGTK 
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K\RVLT\LRK\ SLLVQTKAAGLWRRJTLKF\ IDTTSKFGHGRFQTME 
EKKAFMGPLKKDRIAKEERSLMPGTDFAVGGVSIKVIFH 

1270 

A 

3 

451 

QTQREPTMVLSPADKTNVKAAWGKVGAHAGEYGAEAIjERMFLS fptt 
KTYF PHFDIj SHG S AQVKGHGKKVADAIiTNAVAHVDDMPNALS ALSDL 

HAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 
TVTiTSKYR 

1271 

A 

9 

487 

NSARATDSERTHHGARLLPDKTNVKA\AWGKVGAHAGEYGAEALERM 
FLSFPT\TKTYFPHFDL\SHGFCPRLKGPRQRRWPDAIiTK\AVA\HV 
DGHAQTALS GPEATLHGAQS FG VDPVQLSS S L SH \ CLLG * PWAAHL P 
RPSSTPGGWNAFPGTKFPWVSC 

1272 

A 

197 

821 

RLFHSNQTVDHSQKNVDITLKG\RPSNRVRAPKGTLR\RDFNPHQM* 
NSALLGKEQQRGFRVD\KWWGYQKGNWPTRSGLFGSHVQD\MIKGWIi 
PLGLPVTKMR\SVYAHFPHPTLLSRENGVSLLKSRNFLG\EKYIPQG 
FRMKTRVLLCQYLKAQKR*N*SLEGND\IGLVS\NFSRLLIPASPTR 
LKTKGIRK\ FLDGI FCLLEKGLFRQA 

1273 

C 

22 

282 

MSEGPSVRSEEAICLYYEELGGGARQTHVRRPLSECSPGDWSHSGVA 
EGPXCIQFLHITSHGAKEALSTWIjGLXjTSGPATTAAVLP* 

1274 

A 

60 

1576 

GYLGAPVALG\LW\ALCWSLAIATPLPPTSAHGNVAEGETKPDPDVT 
ERCSDGWSFDATTL\DDNGTMLFFKGEFVW\KSHKWGPGV*SSERWE 
GFSPS PCGMLAF PFKVHNS VLS * SKGGDKVLGY * PS LKKKGRKGLPK 
VCS KIDFPGI PHS P\ LD\ AAVECHRG\ ECQAE\ G V\ LFFQGDRE\ W F 
WDFA\ TGNHGRERSW\ PAVWGTCSS \ ALRWAGPLTYWLSRGNQF\ LR 
FRPCQGEEVPPR\YPRDVRE\YFMPCPGKGH\GTQEWGIX3HGNSNHH 
GP*IYAACSP\HLSLCLALTSDNH\GANFCLSVGTHYWR\liDTS\RD 
GWHSLAPLLIKWPQGPSAVDAAFSW\EEKLYLVQGHPGYYVFLTKGG 
Y\TLS *GGYPKRLGEREV\GDPSWGFIL\DSVDAAFI\ CPGVFLRLH 
YSWAGRRLWVGW/ VP* KSGSPKPTW\TELSFGPHE\ KVDGALCMEKS 
PFGPKFMFPPNGPGL \ Y\ LIHGPNFTL\ YS WEK\ LNA\ AKAIiPQPQ 
NVTQSPGACTH 

1275 

A 

1 

3431 

MLPHERGLETTPRGECIPVRIDTKLFEMLVPQCHKEIAIjEHKFIYSF 
LVTLNTKPPGYSHSHPEALLDPEVGDPNGTNAQLIKCFLIiPLCPSFP 
LCPEECMHCSGENYDGKISKTMSGLECQAWDSQSPHAHGYIPSKFPN 
KNLKKNYCRNPDRELRPWCFTTDPNKRWELCDIPRCTTPPPSSGPTY 
QCLKGTGENYRGNVAVTVSGHTCQHWSAQTPHTHNRTPENFPCKNLD 
ENYCRNPDGKRAPWCHTTNSQVRWEYCKIPSCDSSPVSTEQLAPTAP 
PELTPWQDCYHGDGQSY11GTSSTTTTGKKCQSWSSMTPHRHQKTPE 
NYPNAGLTMl^CRNPDADKGPWCFTTDPSVRWEYCNLKKCSGTEASV 
VAPPPWLLPDVETPSEEDCMFGNGKGYRGKRATTVTGTPCQDWAAQ 
EPHRHS I FTPETNPRAGLEKNATECGGASTELCSTSLC7VFTMLMDYE 
GQGEPLDDYVNTQGASLFSVTKKQLGAGSIEECAAKCEBDEEFTCRA 
FQYHSKEQQC^IMAENRKSSIIIRMRDVVLFEKKNTYLSECKTGNGKN 
YRGTMS KTKNG I TCQKWS STS PHRPRFS PATHPSEGLEENYCRNPDN 
DPQGPWC YTTD\ PEQRYD YCDI PECEBECMHCSGENYDGKI SKTMSG 
LECQAWDSQS PHAHGYI PSKFPNKNLKKNYCRNPDRELRPWCFTTDP 
NKRWELCDI PRCTTPPPS SGPTYQCIjKGTGENYRGNVAVTVSGHTCQ 
HWSAQTPHTHNRTPENFPCKmiDE^CRNPDGKRAPWCHTTNSQVRW 
EYCKIPSCDjSSPVSTEQLAPTAPPELTPWQDCYHGDGQSYRGTSST 
TTTGKKCQSWSSMTPHRHQKTPENYPNAGLTMNYCRNPDADKGPWCF 
TTDPSVRWEYO^KKCSGTEASVVAPPPVVLLPDVETPSEEDCMFGN 
GKG YRGKRATTVTGTPCQDWAAQEPHRHS I FTPETNPRAGLEKNYCR 
NPDGDVGGPWCYTTNPRKLYDYCDVPQCAAPSFDCGKPQVEPKKCPG 
RWGGCVAHPHSWPwOvSLRTRSPRPSSYKVIIiGAHQEVWLEPHVQE 
IEVSRLFLEPTRKDIALIiKLSSPAVlTDKVIPACLPSPNYWADRTE 
CFITGWGETQEHFYFN 

1276 

A 

111 

2785 

VNNVLGLGHTFWALLAS PKMEHKEWLLLLLFLKSGQGE PLDDYVNT 
QGASLFS VTKKQLGAGS I EECAAKCEEG\ EEFTCRAF\ QYHSKEQQC 
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VIMAE\NRKSS \ 1 1 1 RMRDWLFLKKKVYSLQS AKT\ GNGKNYRGTM 
SKTKNGIT\CQKW\SSTSP\RRPRFSPATHPSEGL\EENYCRNPDND 
PQGPW\CYTTDPEKRYDYCDILECEEECMHCSG\ENY\DGKISK\TM 
SWDWECQGLGTFQEPHTVHG\YFPSQIFPNKNLKEGITVRNPDRGL.\ 
RPWVFSPPDPNKRWGTF/ CDI PR\ CTTP\ PPSSGPTYQCL/RRGTGE 
NFRGNVAV*PCSGKTVQHWECTGPLHTHNR\TP\ENFPWQNLGWKTT 
CRNP\ DGKKGPLGCHYNQQAKLRVGS YC * GYRSC\ DS S PVSTEQLAP 
TAPPELTPVVQGLATHG* WDRAYRGNI PPTHPPTGKES VKSWS IL*H 
PHRHPEDPQKTTPNAGL\TMELLSGIPDGRLKAPWCFYHRTPAFRWE 
YCNLKKC\ SGTEAS W\ APPPWLLPDV\ * TPS EEDCMFGNGKGYRG 
KRATT\VTGTPCODWAAOEPHRHSIFTPETTJ\ ppam irK"Mvr T PKiPnr» 
\DVCGPWCYTTNPRKLYDYCDVPQCAAPSFDCGKPQVEPKKCPG\RV 
VGGCV7VHP\HSW\PWQVQSLEQGFGLHFCGGTIiISPEWV\LTAAHCL 
EKSPRPSSYKVILG\AHQEVNL\EPHVQEIESV/SRLFL\EPTRK\D 
ICLAKAKAVL»PFIHI»TKIiIPA\ CI1AIPOIFWRL1TGTECFITGL/WG 
ETQGTFGSWAF\LREAQLPC*FENKVCKSAI**SFPENGKKSQSHPK3j 
LCLGHFGPEGHIiNKLPPRVNSGRAPLVCPPKRDQNTIL\QGSPPPWG 
VLACARPP I TPGVLCS VFPR FVYLGLKGS 

1277 

A 

29 

455 

EFQFSVVYAKCDSSPDSAEDVRKVCQDCPLLAPIiNDTRVVHAAKAAL 
AAFNAQNNGSNFQLEE I SRAQLVPVTSQPQPEGANEAVPTPWDPDA 
PPS PPLGAPGLPPAGS PPDSHVLLAAPPGHQLHRAHYDLRHTFMGW 
S 

1278 

A 

1 

1368 

MKSLVLLLCLAQLWGCHSAPHGPGLIYRQPNCDDPETEEAALVAIDY 
INQNLPWGYKHTLNQIDEVKVWPQAVEGDCDFQLLKLDGKFSVVYAK 

\.i/i3i9r v xv JV V V-V 1 - , ^-'- t ^ XjAaZ-Ut A4IMU A K. V V nMAftrtrtliHAh Dif\\Jvt£i\jiy 

NFQLEEISRAQIiV\nCTEILLTGEPHVDGEPGDLRFRIKVVKHPIFER 
RGDDL YTNVT I S LVES LVG FEMD I THIiDGHKVH I S RDKI TRPG AKLW 
KKGEGLPNFDNNNIKGSLIITFDVDFPKEQLTEEAREGIKQLIjKQGS 
VQKI TF YEDKNFQGRR YDCDCDCADFHTYLSRCNS I KVEGGTWAVYE 
RPNFAGYMYILPQGEYPEYQRWMGLNDRLSSCRAVHLPSGGQYKIQI 
FEKGDFSGQ^ETTEDCPSIMEQFHMREIHSCKVLEGVWIFYELPNY 
RGRQYLLDKKEYRKPIDWGAASPAVQSFRRIVE 

1279 

A 

1 

1269 

PPTRPPTRPAPGLVPKPSTTCTPACQGLSGAAMKSLVLLLCnjAQLWG 
CHSAPHGPGLIYRQPNCDDPETEEAALVAIDYINQNLPWGYKHTLNQ 
TD\ EVKVWPOOPSC5RAVT.T? PRTPTPWfJT'^T.pr*r , LJn , D , rT.'\7f2ri'ri7iCT.'Rri 

s lkehavegdcdfqllkldgkf \ swyakcd \ s sqds aedv\ rkvcq 

h\cpllapl\ndtrwhaakscpgppfnaqnngfqffsiiebisr\aq 

lvplpps \ tyv\ eftvsg \ tdllfa* kkateaakcnlsgqksnmgfc 

k\atlsek1^sgqrlqltcwf\qtqpvt\sqpnpeganeavptpv\ 

vdp\dappspplgapgllpswltpkt™vll^ 

lchtfmgws \lgs pfrrkcshprknt/rtwea* wlgaaagatgfl 

plfrggirhfky 

1280 

A 

1 

1254 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMG KG S F KYAWVLDKL KAERERG I T T D T T . WKFPT <5 vvvVT T T r> APf* 
HRDF I KNM I TGTSQADCAVL I VAAGVGEFEAG I S KNGQTREHAIjIiAY 
TLVAFVP I SG WNGDNMIjE PS A1JMPWFKGWKVTRKDGNAS GTTLLEAIj 

dcilpptoptdkplrlplqdvykiggigtvpvgrvetgvlkpgmvvt 
fapvnvttevksvemhhealsealpgdnvgfnvknvsvkdvrrgnva 
gdskndppmeaagftaqvi i lnhpgq i s tgyapvldchtah i ackfa 
elkekidrrsgkkledgpkflksgdajvivdmvpgkpmcvesfsdypp 
lgr favrdmrqtvavgv i kavdkkaagagkvtks aqkaqkak 

1281 

A 

1 

921 

KFETSKYYVSIIDAPGHRDFIKIWITGTSQAI^CAVxjIVAAGVGEFEA 
GISKNGQTHEHALl^YTLGVKQLIVGVNKMDSTEQPYSQKTYEEIVK 
EVNTY IMKIG CMLDTAAFVL I S VNVTTE VKS VE I HHEAXj S EAL PGDN 
VGFNVKOTSVKDVRHGNNAGDSKNDPPMEAAGFTAQVI I LNHPGQIS 
AGYAiVLDCHTAIITACKFA^LKEKIDRCSGKKLEDGPKFIiKSGDAAI 
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IYMVPGKPMCVESFSDYPPLGHFAVGHMRQTVAMGIIKAEDRKAAGV 
GKVTKSAQKAPKPSEYYPLYLPPQS 

1282 

B 

70 

572 

MGKBKTH INI W I GHVDSGKS TTTGHL I YKCGG IDKRT I EKFEKEAA 
EMGKGS FKYAWVLDKLKAERERG ITIDIS LWKFETS KYYVTI IDAPG 
HRDFIKNMIILNHPGQISAGYAPVLDCHTAHIACKFAELKEKIDRRS 
GKKLEDGPKFLKSGDAAIVDMVPGKPI * 

1283 

A 

1 

1410 

MTSPNELNK7^PWTNCGETGWIKDLNGKPKTIKILEDNLVNTILDIGT 
GKDFMTKTPKAIATKAKIDKWNLIKFKSFCTAKETINSINRQPTEWE 
KI FTNYAFDKG FDLLTP VMTTVS I KNV T TGT O VDP A VT . T V A ZMWHF 

A V _1_ J. A. AT J, fXt J,WJ A. !_/ J_l ±J J. XT villi v wX i vLl V X X VJ X O 1^ V XsV_/"\ V 1 * JL V V V3.G 

FEAGISKNGQTSEHALLAYTLDVKQLIVGVNKMDSTEPPCSQKRYEE 
IVKEVSTYIKKIGYNPNTVAFVPISGWNDDNMLEATANVPSFKGWKV 
TCKDGNASGTTLLEAIJDYILPPSGPTDKPLRIiPLQDVHKIGGIGTVIi 
GGQVKTGVLKPGMWS FAAVDVTTEINFI PGNNEGFNI KNVS VKDVR 
CGNIAGDSKNGPPMKAAGITAQVIILNHPGQISAGYTPVLDCHMAHT 
ACKFAVLKEKIDHRSGKKLEDGSKFLKSGDAAIIDMVPGNLMCIESF 
SDYPPLGHFGVCDMRQTVAVGVIKAVDQKAAGAGKVTKSAQKAQKAK 

1284 

A 

1 

1366 

MDIKKGITDISASLRVESGWEARTRKEKTHINTVIIGHVDSGKSTTT 

GHLI YKCGGVDKRTI EKFEKEAAEMGKC S FKYAWVLDKLKAEREHG I 

TIDISLWKFETS KYYVTI IDAPGHRDT, T KNM TTGTSOADPAVLT VAA 

GFGEFEAGISKNGQTREHALI^YTLGVKQLIVGVNKMDSTEPP 

RYEEIVKEVSTYIKKIGHNTDTVAFVPVSGWNGDNTLEPSANMPWFK 

GWKVTRKDGNASGTTLLEALDCILPPTRPTDKPIiCLPLQDVYKIGGI 

V1TVATE VKB VEhTHHEALS E VL PGDNVG FNVKNVS VKDVRRGNVAGDS 

KKDPPMEAAGFTAQVIILmiPGQISAGYAPVLDCHTAHIACKFAELK 

ECTDRHSGKiCLEDGPKFLKSGDAAIVDWPGK^VYVESFSDYPPLGR 

FAVHDIRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK 

1285 

A 

70 

1538 

KPKWERKRLIFNIWIGHVDSGKSTTT\GHLIYKCGGIDKRTIEKFE 
KEAAEMGKDSFK\FAWVLDKLKA£RERGITIDISLWKFETSKYYW 
IDAPG\HRDF\ IKNM\ ITGDISRLDCAVLDLFA\SGVGEF*SLVSPK 
NGO\ TREHAI»LAYTLG\ VEOL I VGVNKM\ D^TRPPY^OKR YKEMWT.T? 
EVSTF \ I KKI WLQPPTQ *HFVP I SWFG IGDNM\ LEPSA\ NMPWFKGW 
\ KVTRKDGNASGTTLLEALDC I LPPTRPTDKPLR\ LPLQGVH\ KLGG 
I GTVS S AP^^ET\ GFSNPGIWVTFAPS PR* QRKVKSCRKCTHEALK* S 
SFI^TxWGLQLSGIVSCQGMFRPWQTVAG*PAI<2TOPTQWESSWASLV 
RGDYP * PIPGQNKAPGYAPCIGIGAHGIiTFACK\ FAEIjKEK\ IDRR\ 
SGKKLEDGPK\ FLICS \ GDAA\ I VD I VPG \ RPMCVES FSD YPPLGRFA 
VRDMRQWAVGVIKAVDKKVCWSWARSPKFAQKG 

1286 

A 

41 

601 

APS PRRPWGHFTEEDQGLL ST S LWGK\ VKCGKNAGRKKPLGKAPLW 
L/HPWDPK1RSFEQAIjGNPVPLPSA\IMGNPPKSRAHGK\ kvlt\slg 
EMPIimPG*SSKGTFAQA*SELH\CDK\LIiVDPENFK\LLG\NVLVT 
\VL\AIPFSAKEFTPGGCRASWAERWVTWSWPVPCSSRIPLSSLAHD 
CRAFQG 

1287 

A 

2 

287 

KNMS 1TVNDLTVTQDGRKI YFTDS S S KWQRRD YLLL.VMEGTDLX5RLLE 

YDTVTREVKVLLDQIxRFPN^ 

R 

1288 

A 

36 

427 

GKQWDFAFCFGSHVWSVDLLAFFECELLSl^HHLHLAYLKCFKSKQ 
KTCVFQGSCAGCDLLFSQETVMKFVjPRNSLVLELSDSGAFRRSLITOP 
DGLVATYISEVHEHDGHLYjLGSFRSPFLCRIjSLQAV 

1289 

A 

247 

1183 

CO^SPVDPQPLRQISGIALFCSFKEPPLLLGVLHPNTKJoRQGRKGC 

lkinllgpes\mahiggcdwwg^^ 
plakpeddepc^gdpx^irgrgpkwgllfw^ 

TVKVKLLLSSETPI *GGRNMSF\ VNDLTVTPGWKEDLISPNPKQQNG 
QRRKLPAFWVMGGHKMTGRLLEV*YCDPGK*KFYWTQLRFPNGVQPV 
S * QKTLSLVAETS \MAR I RRVYVSGPDERR\ ADLFVENMPGFPDNIR 
PSSSGGYWVGMSTIRP*P\GSSMLDFLSERPWD 

1290 

A 

2 

217 

LGVRI P WALHVMELFDTTVEQLYS I FTVKELTNKKI IMKWRCGNWP 
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EGMLLITAIiHIKGPKTLRLGFSYG 

1291 

A 

1 

2283 

MAASIALEAIiEALYGIVKESGVKHKGLIEIPNLSEENEVDDTESSFC 
YRHWAFQIVIilRLCGSCVVVRVPCGGPVCREGLPYCGGSVLCGAGVV 
QERESTLSKKKGDGVILKDLMKTAGTAKVREALGDYLKALKTEFTTG 
MIL PTKAMATQELTVKRKLSGNTLQVQAS S P VALGVR I PTVALHMME 
LFDTTVEQLYSIFTVKELVQKFSKSTAVLETEKGGKFQMFDGNITGE 
YLGLLTNKKIIMKWRCGNWPEGLNCTDPLVSGFSPNSGTPKAARPTP 
HLPPPLLLPDQCEDHEDEDLYDDPFPLEEQTERDATSLSKGKFQELL 
VGI\A^ENDAGRGEINELKQVEGEASCSSRKGKLIFFYEWNIKLGWKE 
RTVPFCP S I AFVTAS PLQGTPLS CFAQAKS TLGQEEB YGCTCPTLLS 
GRRTADS SCPLLGHGLPDLS S VAAQGALEWWNPVRAAELFS I VKESG 
VKHKGLIEIPNLSEENEVDDTESSFCYRHWAFQIVLIRLCGSCVVVR 
VPCGGPVCREGLPYCGGSVLCGAGWQERESTLSKKKGDGVILKDLM 
KTAGTAKVREALGDYLKALKTEFTTGM I IiPTKAMATQELTVKRKLSG 
NTLQVQAS S PVALGVR I PTVALHMMELFDTTVEQL YS I FTVKELVQK 
FSKSTAVLETEKGGKFQMFDGNITGEYLGLLTOKKIIMKWRCGNWPE 
EHYAMVALNFVPTLGQTELQLKEFLSICKEENMKFCWQKQHFEEIKG 
SLQLTPLNG 

1292 

A 

249 

433 

KWRCGNWPRRTLMPWLH*NFVPTLGQTELQLKEFLSICKEENMKFCW 
QKQHFEENKKVPAS 

1293 

A 

778 

1578 

PRQVDPSWGFHSRIiSGPW/H/WTERDATSLSKGKVPGAPGGHPLWKK 
DAGRGEINE \ LKQVEGEAS CS SRKGKL I F FYEWNI KLGWKG I VKESG 
VKHKGLI E I PNLSEENEVDDT/ EEFTTGMI LPTKAMATQELTVKRKL 
SGNTLQVQASSPVALGVRIPTVALHMMELFDPT/S *SSLYS IFTVKE 
EERVCVLFSLTNKKI IMKWR\WGTWPEEHYAMVALNFVPTLGQTELQ 
LKEFLS I YL * RKVP *NFCWQKQHFEEIKGSLQLTPLNG 

1294 

B 

46 

386 

X IRHESGSRSHSHCSTLSS IGDVAKKLGEMWNNTAADDKQP YEKKAA 
KLKEKYEKDIAAYRAKGKPDAAKKGVVK7VEKSKXKKEEEEDEEDEED 
EEEEEDXEDDDEEEDDDDE * 

1295 

A 

263 

484 

CFFLPYCFILPLLWKDIAAYRAKGKPDAAKKGVVKAEKSKKKKEEEE 
DEEDEEDEEEEEDEEDEDEEEDDDDE 

1296 

A 

157 

886 

TWGKGDLKKPRANMSSYAFFVQTCRGG\HKKKHPDASVNFS/ESFSK 

kcserwktmsa*r/ekgkfedmaka\dkaryer\ EMKTYIPPQRGRQ 

KRKFKDSQLHPRGPPSGLLSSSCSEYRPKIK\GEHP\GIj\SIGDVAK 
KLGRDVG INTAAD\ DKQP Y\ ERRAAKLKEKYEKDI AAYRAKGKPDAA 
KKG\WKAEKSKKKKEEEEGEEDEEDEEDEBIEE\DEEDEEDEEEDD 
GLMNKLGSGAVFFFSCIi 

1297 

A 

1 

450 

CKSRGSNLRVHFKNTRETAQAIKGMHIRTATKYLKDVTL.QKQCVPFR 
RYNGGVGRCAQAKQWGWTQGRW PKKS AE F\ LVI EH I QVNKAPKMRRR 
TYRAHGRINPYMSSPCHIEMILTEKEQIVPKPEEEVAQKKKISQKKIi 
KKQKLMARE 

1298 

A 

1 

1725 

MAHKEGDMPAEQGFGAGYGFKRFYVPMKEGLKGSWSFLKYQRRRKYS 
LSAVFHSATMLQDVGEAIQFEVSIGNYGNKFDTTCKPLASTTQYSRA 
VFDGNYYYYLPWAHTKPVVTLTS YWEDI SHRLDAVNTLIJ^lAERIiYP 
QEKNNGPKVPVELRVNIWLGLSAVEKKFNSFAEGTFTVFAEMYENQA 
LMFGKWGTSGLVGRHKFSDVTGKIKLKREFFLPPKGWEWEGEWIVDP 
ERS LLTE ADAGHTE FTDE VYQNES R YPGGDWKPAEDTYTDANGDKAA 
SPSELTCPPGWEWEDDAWSYDINRAVDEKGWEYGITIPPDHKPKSWV 
AAEK>fVBTHRRRRLVRKRK3CDIiTQTASSTARAMEELQDQEGWEYASL 
IGWKraWKQRSSDTFRRRRWRRKMAPSETHGAAAIFKLEGAIiNTCET 
AQA I KGMH I RKATKYLKDVTLQKQCVP FRRYNAGVGRCAQAKQRGWT 
QGRWPRKSTEFLLHMLKNAESNAELKGIiDVDSLVIEHIQVNKAPKMR 
RWTYRAHGRINPYMSSPCRIEMILTEKEQIVPKPEEEVAQKKKISQK 
KLQKQKLMAWE 

1299 

A 

278 

879 

S VKMVRYSLDPGGTPRKSCK/ SQRGSNLRVPFKDHS * KXjPQAHQGVC 
H I RKS PTKY \ LKDVHLTRNQCVP I PDR Y\ NGG I CKVCRR PKQWG \ WT 
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QG\RWPQKGVLNFLLHMLKNAE\ SNAELKG\LDVDFLV\ I EHIQVNK 
AP\ KM\RRRTYRAHGRINPYMSSPCH\ IEM\ HVTEKEQIVPKPEEEV 
AQKKKI S \ QKKLKETPTLWHGE 

1300 

A 

85 

266 

LSHCLPLQTTEVGGFGNLLGYWIACSIGCVLSTGMLSHQLKQHVIDG 
EKTI IQNPTDQQK 

1301 

A 

1 

1776 

MSATGDRHPTQGDQEAPVSQEGAQAEAAGAGNQEGGDSGPDSSDMVP 
AAEWGVAGPVEGLGEEEGEQAAGLAAVPQGGSAEEDSDIGPATEEE 
EEEEEGNEAAN FDLAVATRR YPAAG I GFVFLYLVHS LLRRLYHNDH I 
Q I ANRHLS RLMVGPHAAVPNL WDNPPLLLLS QRLGAGAAAPEGEGLG 
LIQEAASVQEAASVPEPAVPADLAEMAREPAEEAADEKPPEEAAEEK 
LTE EATE E P AAEE PT S E E AV AP EGKG AS G S S GE AG I HVS KG SGQGCG 
AGTAGEAGELB 1 1 SRFGWE I YWKKGKYSVPVEM t KQQQQVTKSQPEK 
WDEEAQDAAGEEEKEQEKEKDVENKVFKRKTSSMALLPHTKYQCAPB 
SHQLEAS S SGVLVLS LCEKDDAM IMEE IGKI FKKKI EMRKGI AFPTS 
I SVNNC I CH FS PLKSNQD Y I LKEGDLVKI DLGVHVDGF I ANVAHTFV 
VDVAQGTQVTGRKADVI KAIHLCDEAVLRLVKPGNQNTQVTEALNKA 
AHSLNCTPIEGMLSHQLKQHVIDGEKTIIQNLTDQKKKDHEKAELAV 
HEVYAVDVLVSSGEGKSKDAGQRTTIYK 

1302 

A 

207 

1645 

LSQRALRLSPRARSFSLSPACPLPCLLALSLALSSRIEGTIiTTACGW 
GRETEAAAAQGKRGCSGGSRKMSGEDEQQEQTIVD/DSIiWTKYKMG 
GDIANRVLRSLVEASSSGVSVliSLCEKGDAMIMEETGKIFKKE\KEM 
KKGIAF\ PTSI S\VNNWC34/CHFSPFERSDPGLYSSKEGDI*VK\ IDL 

gvpcwmgfianvash/sfvvdvaqgtqvtgrk\advikaahlcaeaa 

\ LRXj VKPGN QNTQ V T \ EAWW KV AH S FNCT P I \ E GML S H S 1* KQH V 1 Dvj 
EKP * FQNPTDKQK\ RAHEKADFEV\HEVYAVDVLVKPQERARPKDAG 
QRTTIYKRDPSKQYGLKMKTSRAFFSEVERRFDAMPFTIjRAFEDEKK 
ARMGV\ VECAQT * TCWQ PFNVXjY * EGRVI LFAQFKFTVIiAHGPNGPM 
Rl\TS\GPFEP\DFYK\SEMEV\QDAELKAIiLQ\SSAKSEKPQKKKK 
K\ KASKTA\ ENATGGILCS LGNIRRK 

1303 

B 

29 

200 

MSRTRLVCPSLIPFCIYWDVGFSPGPQSCTSHEPKDIHAKCELAFL 
HHQRFYKNEGX* 

1304 

A 

138 

1908 

ASRTAVARWECVLQNVRRE PS PSRAWPSQLRP I ASTATKCRE \ CGPG 
YS TPLEAMKGPREE I \ VYL \ PCI LPETQGTEGPRLS WATVDVDPKS P 
QYCQVIH\RLPMPN\LKDELHHSGW\NTC\GSCFG*LAPSRGTK\LV 
LPSFHLLFGIWGGtWGSEPRAPKLAQGSLSPRDIHAKC\NWAFLHT 
SHCLASGEVMI S SLGDVKGNGKGGFVLLDGETFEVKGTWBRPGGAAP 
LGYDFWYQPRHNVM I STEWAAPNVIiRDGFNPADVE AGLYGSHajYVWD 
WQR I E\ I VQTLS LKDGL I PLE IRFLHNPDRCPKAFVG\ CALQAPNIQ 
RFLQRTRGGTLFSGRR*FQV\PPRKLKGW\LLPKMPGL\ ittilasp 
WNDGF\LYFS\NWLAWGP*GKYDISDPQRPAIiTGQLFLGGSIVKEGP 
\VQVLEDEEL/TSPSPEPLWKGKRVGEGP\QMl\QIiSLDGKRLNNH 
HGRCTS ALGQS S F YP * S QS GERLLVNAGRVE W * DNS KKGGA * KLNPQ 
LSWVDFGEGAPLPKPLPH*ARYP\GG\DCSS\DIWILNSPPSHPHSL 
FWALHFPGGPGLSFCISLGTRTIX3KHVPTTAKLRLWQCVES 

1305 

A 

1 

1000 

STRAPS PGPFPS SKIxAGAYKSWCRRDPRTHSAGAQAAAARSVPIRCP 
APTAS ATM SHHWG YGKHNG PEHWHKDFP I AKGERQS PVD IDTHTAKY 
DPSLKPLSVSYDQATSLRILNN\GHAF\NVEFDDS\QDKAVLKGGPIi 

KYGDFGKAVQEPDGLAVXiG I FLKVGS AKPGLQKVVDVLDS IKTKGKS 
ADFTNFDPRGLLPESL\DYWTYPGSLTTPPLLECVTW\IVLNF\PFS 
VS S \EQVFE IP * TLTFNGGGVNPEELMVDNWRPA\QPLKNR\QIKAS 
FQIRWSHSLYSK 

1306 

C 

127 

435 

MAASXNPEVLDITEETLHSRFLEGVRNVASVCLQIGYPTXASVPHSI 

INGYKRVLALSVETDYTFPLAEKVKAFI^ 

SXCCCSPS* 

1307 

A 

1 

689 

KCF I / VGADNVASKQMQQI RMS FRGKAVC * WGKNTMMRKP I RGHX»EN 
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NPAIiEKLLPHIRGNVGFVFTKEDLTE XRPMLLANKVPAAARAGAIAP 
CEVTVPAQNTGLGPEKTS FFQALGITTKISRGTI E I LGVRNVASVCIi 
QIGYPTVASVPHS I INGYKRVLALSVBTDYTFPIjAEKVKAFIiADPSA 
F/VAAAP/VAAATTAAPRAAAAPAKVEAKEESEESDEDMGFGLFD 

IJUo 

TV 

c 0 
00 

1111 

DT&\7MODPr»D ft r T r LJVO'KTV"l7T V T T CW T T^i*Pl"V D "L r /^« T?Tt Tf* \ tv r\VTt CP VAMAn 

K ± /W ri fKE/JJrOi 1 WlvoiN 1 rljJ\.± J.yjjJjJJJJx irlvL-r J. VVar \AJuJN VCioKQMQv 

irivpwgeacvlmgqktm\mgqahpkgtl*nnpslwrkllp\hirg\ 
nlgfcftqggpsleikghvclankglpswlpwgancphgeshwwpa 
pehwssgpektsffpgl*aittk\isqgapienpevnvpaesrtgdq 

Cf2'PPCp , 2\MA A *TT , QMT QP1?\ QT?f2Af3QQf , »Prj\7C r rKrr*CMT DDT VPT riTU 
&\3Etir Z> &rXT*jr\n. ILJlix jrr ^ OJ7 OrVVJOOyru Vg xxVOdixJUxr ItxjIVV^xxIJXIx 

rrnlcifwlswegvrkccpvsvcqig\ YPTVA\SVP\HS I INGYKRV 

LA\ LS V\ EPDYTFPIiAEKVKAFL\ ADPSAFWAATCGLLPPQLLLW 
VAAPAKVEAKEESEESDEDMGFGLFD 

1309 

A 

60 

569 

STDLEELPT1iGWF*KQELIILSCPFVSLTYRERLPANFFKFQFRNVE 

yssgrnktflcyvveaqgkggqvqasrgyledehaaahaeeaffnti 

rjPAFDPALRYNVTWYVSSSPCAACADRT TKTT,<3KTKTTLRIYLTI.VG\ R 
LFMWEEPEIQAALKKLKEAGCKLRIMNLV 

1310 

A 

62 

858 

QLRW^SGARAWPRPACLSPLPQRLLSHSPSMAQKEEAAVATEAASQN 

GEDLENlA DDPQKLKEL I ELPPFE I VTGERLPANFFKFQFRNVEYS S 

\GRTKTL\LCYVV*STGARGGKVQASWG\YLEDE\HAACPLQKESFS 

FNTIIiPAFRPKPIxAVTNVT/ WGYVSSSPC\ AACADR\ I VKTLSKTKN 

I.RI.T.TLVGRljFMWEEPETOAATjKKT.Knpa\ rtnrr.X PTMKPfjTj'PPTT. e ?W 
xxxvxjxj J.U v urujr nriDDr i-i J- \^./-v_f-vXJ jvixXj AXVn \ V7V»I\_u \ JvXI'lxVr \£Uk Kiuon 

B\NFVEQ\EEGESKAFQPWEDIQENFxYYYEEKIxADILK 

1311 

A 

1 

729 

NTEDQRNEEKAQREANKKIEKQLQKDKQVYRATHRLLLLGAGESGKS 
TIVKQMRIIxHVNGFNGDEKATlWQ^ 

VELANPENQFRVDH I L S VMNVPDFDFPPEFYE11AKALWEDEGVRACY 
ERSNEYQLIDCAQYFLDKIDVIKQADYVPSDQDLLR\CRVLT\SGIF 
\ ETKFQV\ DK\ VNFH I V * RG VGQRDERRKW I QC FND VTA 1 1 FWAS S 
SYNMVIREDN 

1312 

A 

2 

1271 

PWSSAPRRGHSVASAPRSGLRQVAGRRGAAIiPCSIaAPGCGAAAGAS 
PCPGAGRRRAAGGRCLACECTSLTCAGESGKSTIVKQMRILHVNGFN 
O Jb^OliliiJFyAAKbXSJ biJtjliKA I K.VQD I KNNliKKA.1 ETI VAAMoNJbVPP 
VEIJWPENQFRVDYILSVMNVPDFDFPPEFYE11AKALWEDEGVRACY 
ERSIJEYQLIDCAQYFLDKIDVIKQADYVPSDQDLLRCRVLTSGIFET 
KFQVDKVNFHMFDVGGQRDERRKWIQCFNDVTAI IFWASSSYNMVI 
REDNQTNRLQEALNIiF KS I WNNRWLR \ T I S VI L FLNKQ \DLLAE KVL 
AGKSKIE\DYFPE\FAR\YTTPE\DATPEP\GEDP\R*TRAK\YFIR 
\DEFLRISTASGDGGHYCYPHFTCAVDTENIRRVF1TOCRDIIQRMHL 
RQYELL 

1313 

A 

235 

1571 

- 

GRPRPPPPQPGRAPPPPPPRMGCLGNSKTEDQRNEEKAQREANKKIE 
KQLQKDKQVYRATHRLLLLGAGE S GKS T IVKQMR I LHVNGFNGEGGE 

NPENQFRVDYILSVM1WPDFDFPPEFYEHAKVLWEDEGVRACY\ERS 
N\EYQLIDCAQYFL\DK\IDVIKQAD\YVPSDQDLLR/CARVLTS\G 
I F\ETKFQVDKVNF\HMF\DVGGQ\RDERRKWIQCF\NDVT\ AI IFV 
VGSSSYN\MVIREDTGHNGIJVGRL*TSPKGIWDNRWAAAPSLVILFIi 
TKQ/EILLA*ESPLAGNSK\ IKDYFPE\ FAR\ YTTPEGCYSRRPGEG 
phgvyrgovtpfedeft»rssncpvednrhypyphf\ tcavdteni\r 
RVFNGLAVDIHFSGMHLSFSYGAGFKEGEPPKFNIjKA 

1314 

A 

3 

786 

PLFWFPLS PD I AGRPGGWGGGPGGGPAWAAPAMNGLS LSELCCLF 
CCPPCPGRIAAKLAFLPPEATYSLVPEPEPGPGGAGAAPLGTLRASS 
GAPGRWKIxHLTERADFQYSQRELDTIEVFPTKSARGmVSCMYVRCV 
PGARQGHQAQGGHPQIxAWGRLGDSNNPAPGGCLbGKSWGTGAALAC 
GYIHLIxARYTVLFSHGNAVDLGQMSSFYIGLGSRLHCNIFSYDYSGY 
GAS SGRPS ERNLYAD IDAAWQALRT 

1315 

A 

3 

2231 

PAIWGLSLSELCCLFCCPPCPGRIAAKIxAFLPPEATYSL,VPEPEPGP 
GGAGAAPLGTLRASSGAPGRWKLHIiTERADFQYSQRE\RSTIEVFPT 
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KS ARGNRVS CMYVRCVPGARYT\VLFSHGNAVDLGQMS S FYIGLG SR 
LHCNI FTYDS SGYGAS SGRPSERNIiYADIDATWQALRTR*GRPl»VGR 
VRARWRPRIiTLLRRRQVRHQPGQHPSCTGRSIGHGAPPWDWASRYEC 
AAVILHSPLNLGHARRIPRHPRKTYCFDAFPLHSRKVSKINVSPCSS 
SHGHEGRGDRLSRTGWALYEALPPRRVEPLWVEGAPGTTDIEPLQPV 
PGAAGVAFILPGAAQPARLAAAPNRPDLSNKAAPGLHPAPAPTQGLA 
CGPPGRPRWRPGLGGARHEWAVDDVQATRRTHSFPFGSKKKIRENGQ 
LKI 

1316 

A 

1 

1347 

MTSTVLLQRGSRMKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAH 
SPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRIiYE 
SLCQYRRTSLSKFFMVKGVERTP I FGGIiDPS SVIPEGDbGNES SLGN 
LVWFVDILQGNS PARS PGKVHRQYSTLRAS SLFFS PYSS IQQTIGLP 
ELRESLAIDRSQSEHQQKPIWTCAEAFLWIFGQKSQVAIAAENWRE 
EE PQAG PSGGVPEG I AVRDDS SMRVWPEDLLVGEDVEVKDDRVQTN 
PKAPI SRS FKASES KNFQKQHEMTEAEGRE YKG YARI SC FQKFGNLV 
P S RGH I ALARAPEE PQAGPFGGVPEG I AVRDDS S LRVWPEDLLVGE 
DVEAEQQKKVAI I S AEG YS KAAEL I ANS LATAR I RLMEL CKLEAAED 
I AYQLSRSRN ITYPPAGQS VLLQLPQ 

1317 

A 

53 

1027 

NFRVEAGVRGVQQKETCAFKVLES IGKLG\LALSVAGGAENSAIiYNV 
DAGHRAVI FDPI PGQK* QDI WGEGTHF\ LI P\ WVQKPQLSNDCRSR 
PRNCCQSITGSK\DLQNV\NITLRHPSSGPVRQPSFPRIFTSIGED\ 
YDERVLAVPSQLENIiK\ SVWARFDAG\ EI*ITQR\ ELVSRQV\ SDDIi\ 
TERA \ ATFGL I LDD VS LTHLTFGK\ D FT * AV\ EAKQ VAQQG KQRRAR 
r V/ VJL.Jbi\AJBUQKK\A/VIISAEGDSKAAELIANSIj\ATAGDGLI 
LGKLQRTFAYQLSTLSGTS P YLPAGQS \ VLLQ\ LPQLRAHPCLAPPA 
GLTWGHSPD 

1318 

A 

424 

598 

MTKGRSASAKEGQGGIGIGGGHPRSWRLCGQMCRDIKGI1WI1REIKPC 
PIiTNRAGAGPS 

1319 

A 

3 

371 

SIKPSVKEFTETSAVFEDGTMFEAIDSVIFATGYDYSYPFIiDETIMK 
SRNNEVTLFKGIFPPLMEKPTLAVIGLVQSLGAAIPTADLQAWWAAK 
VFASRWAILSFIHFINEHLLNTCYKLKW 

1320 

A 

1 

1596 

MSKRVGIIGAGVSGLAAIWCCLEEGLEPTCFERSDDVGGIiWKFSDHT 
EEGRASIYQSVFTNSSKEMMCFPDFPYPDDYPNYIHHSKLQEYIKTY 
AQKKELL.RYIQFETLVSGIKKCPSFLVTGQWVWTEKDGKQESTIFD 
AVMICSGHHVYPNLPTDSFPGLDQFRGNYLHSRDYKNPEAFKGKRVL 
VI GLGNSGSD I AVEL SRLATQVI IS TRSAS WVMSRVWDDG YPWDMMY 
VTRFASFLRNVLPSFISDWLYVQKMNTWFKHENYGLMPLNGSLRKEP 
VFNDELPSRILCGTLS I KPSVKEFTETSAVFEDGTMFEAIDSVT FAT 
GYD YS YPFLDET IMKSRNNEVTLFKGI FPPLMEKPTLAV IGLVQSLG 

AAJ.F1 AIJJLjUAWWAAAVFANSCTLiP A IWRMMnn7T)PKMGKKT'KWFGQ-J 
QTLQTD Y I T YVDELG S F IGAKPN I P WLFLTDPRLALEVY FGPCS P YQ 
r K!jML>fij>in.wjjl>aKi>i Ax Li 1 y KTV KJPTRTRVv SEVQRPHPFYNXiLKM 
LiS FPLLLIiAVTLTFY ' 

1321 

A 

2 

557 

WDMMYVTRFAS FLRNVLPS FI SDWL YVQKMNTWFKHENYGLMPLNGY 
LKMEIFFIQKRGALI* * I YLS I KPS VKEFTETS AVFEDGTMFEAIDS 
VIFATGYDYSYPFLDETIMKSRNNEVTL^ 

\7(~\ Q T ,(~2 Zi TDTBFlT A IaILJ 7\ 7\ V\ J"C A C? O T«J7V TT ptjt T-Ttt' T"vtt7TTT t 
vyoiioiiHi ir i/iLijjyaw WAAlv V r A0KWA1 Jbor 1 Mr XNEHIj JLiN TC* x 

1322 

A 

955 

1187 

IFFFFFFKMESCPFAQAGVQWCDLGSLQALPPGFTPFSCLSIiLSSWD 
YRRPPPHIJ^FLYF**TWFTVIiARlWSIS 

1323 

A 

6345 

9041 

ERNCHERRESVAl^l^FKCIJ^LFQCYGLLQVFLGNELFTARVFASHC 
CSSLRKEPVFm)ELPSRILCGTLSIKPSVKEFTETSAVFEDGTMFEA 
IDSVIFATGYDYSYPFLDETIMKSRNNEVTLFKGIFPPLMEKPTLAV 
IGLVQSLGaAIPTADLQAWWAAKVFASRWAILSFIHFINEHLLNTCY 
MQSTVX.GIQ 

1324 

C 

182 

433 

MHLDLFLNKTLPQIRGVESEQSSRLHPLPDPRGDRHRMACNLPMEIH j 
GS SATS SGKPSDFNQAAVDGAGERGGKEVIiGGTIiDVL 
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1325 

C 

72 

254 

MVSTQLRQASDPRTTIGRERFELLLiRR\^KLMSPRIiPTGTIxt^HHFW 

1326 

A 

225 

735 

GELRVNSLHVSTHFQIPEBTDIGWLVSPGQOPARPFEDIQLWPPGSL 
MAAEPTDQSLEESH*DRWITTFFTFAR\QEGRK\D*PQRSNEFKE\L 

WPKE IRKEEKTLKI QERS KAAWLEDGVGQGRT 

1327 

A 

55 

391 

PAGRDAQPGLHHWFLRAQRHHPATGDPHGPLWPPTRAAGWQCLLHCV 
LHPHGPGLPGRGSSVSVDIPGAVPEWLWWHLPNVHFTHGTGLLPNPS 
DHATARSKSSPMPSVPSP 

1328 

A 

996 

1334 

WASVGLSGPRSPPSSRPQ*ARPRPGAPASLRQADLGRGWRDRLGAPR 
PRPPRTGGWRSCCRGRGPGSRPRGARAGLGPGAPGGWRRSRRSWTRA 
RAATRPRAAARGSRTPRG 

1 J 2. 3 

TV 

A 

1 

993 

MDWR I KDCVDAPTQGTVLGDARP I LFFCAPLHPTQDDL FADLANLSH 
LFLHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQGVHRAAFRGLSRLT 
ILYLFNNSIASLPGEAIiADLPSLEFLRI^ANPWACDCRARPLWAWFQ 
KARVSbbDV rL.ATPPfc.KQGRDIjRAIjREGRFQACPPAGPTRPGSRARG 
NSFSNHLYGPSRGFTVYISTHVSFPIFIGLQNRFTQFRLSETKEITNr 

"DY7AMT? T .VPQT.PriYP VDFin QCI y\7CT DimUT TT7DVHT T5l^C"VrM3M'DT*i"C r D"D 

RFLQDKAKMKLPLLIiALIi FGAVS ALHLKTWFQVS PFRPELS PLMPQA 
QV 

1330 

A 

72 

496 

PPWARGSARRPPAWRTVRMPSCHPRMFGAPQKTFLRVSVWSRCRPWG 
I VMRMM * PMRGQ VRRHNS CMAPKTEE * NPTVS ATFCCC S FVS CS WPP 
VTRYSSILFTAAM 

1331 

C 

202 

378 

MTPYLTLFLSPLPPKGEIWGLLLFLTPLGFLLPSLPLLLPCPAPAGV 
RRQWDGPTEGA* 

1332 

A 

1 

1541 

l^PLRAQQLAAELEVEMMADOTYtn^T^ 

SVCLDRCVSKYLDIHERMGKKLTELSMQDEELMKRIHFIQAQLRRPL 
APGGFPVPPLLPRRQRRRGHRAVRLKLRGGRVLHAEKTPAPARLRAG 
PTGLRTRRES VALPEASAAAGCS FAGDRRLGASMAPTLQQAYRRRWW 
MACTAVLENLFFSAVLLGWGSIiLIILKNEGFYSSTCPAESSTNTTQD 
EQRRWPGCDQQDEMLNLGFTIGSFVLSATTLPLGILMDRFGPRPVRL 
VGbALr 1 AoC ri^MAlxASKlJ VEAX»S PL 

LPNMFGNLRSTLMALMIGSYASSAJTFPGIKLIYDAGVAFVVIMFTW 
S GLACL I FLNCTLNWP I EAFPAPEEVNYTKKI KLSGLALDHKVTGDL 
FYTHVTTMGQRLSQKAPSLEDGSDAFMSPQDVRGTSENIjPERSVPLR 
KSLCSPTFLWSLLTMGMTQLADHIiLHGCCEQDAGVPCDWWPGA 




1 A C 

34b 

i LiSE VCj Vb V <j1 V ± KPRQW XRRPEGDPFHGGRLKr^PIjRAQQLAAELE 
VEMMA\ DMYNRMTSACHRKCVPPPF\ KEAELSKGES VCLD\ RCVSKY 
L\DIHGA* WGKKFDRWLLQG 

1334 

A 

170 

842 

EHVYKLPSKAKITRPLMLSSARGGAEESERAEPPLRWAFLLLGIX3TV 
VGVDESTAFSWPVCDMCGNGRLEQRPEDRGAFSCGDCSRWTSPVLK 
RHLQVS \ LDCRSRPQCRVKVKLLQRS I S S LLRFAAGEDGLYSQWIiIR 
SIiLRIWKEADRRWVPEGPG*RC*LRBTQYGFCQFSGATK*RSVFGKE 
VGLLNCFVQSVTAHPTSCIGLEEIELLSAGGASAEH 

1335 

A 

2 

2925 

FVLRRRCAQAALPEMPRGSRARGSKRKRSWNTECPS \ FPGERPLQGR 
RAGLRTAGAAASLSEAWLRCGEGFQNTSGNP\ SLTABEKT\ ITEKHL 
ELCPRP KQETTTS KS TSGLTD I TWS SSGSDLSDEDKTL S QLQRDELQ 
F IDWE I DSDRAEASDCDE FEDDEGAVE I SDCAS CASNQS LTSDEKLS 

IMERLIDSRTKSPETILHTPQNPTVKFPRTAI^SAKKICLLRGGLAER 
LNGLQNRERSAISLWI^QCISYQKTLSGRKSGVIjTVKILKIjHEECAM 
QVAMCEQLLGS PATS S SQS VAPR PGAGLKVLFTKETAG YLRG PPQDT 
VRIFPPWQKLIIPSG\SCPVILNTYFCEKWAKEDSEK\TCEVYCPD 
I PLPRRS I SLAQMFVI KGLTNNS PE IQWCSGVATTGTAWTHGHKEA 
KQRIPTSTPLRDSLL\DVVESQG\AASWPG\AGVRWVQRVYSIiPSR 
D S TRGQQGAS SGHTD PAGTRACLL \ VQDAC\ GMF \ GEVH \ LE FNM\ S 


JL O O S M-3 m O 2 i2 iB, O ici! 






KARQ\LEGKSCSI#/VWGMKVIiQKVTRGRTAGIFRL\lDTL\WPPAIP 
L\KTPG\RDQPC\EEIKT\HLATPAFGyNIjTAHPNI#GQIDIIDEDPI 

YKTjYOPPVT'RPTiP , nTT«OMIJnT.n r r\ PrQPVttTVTVnVDT /ot xroT t t / 
x *uj x r v x nuuxu/x xj\^i 'inuuu x r Inl V 1 1 X / l^xjxvoxji-iJ-i / 

LWEQREIWLLVTOVT\LQTKEE\RDPRLPKlXLVYVAPLCVIiGSEVL 
\ EALAG AAP\ HS LFFK\ DALRDQGR \ I VGA* TNWS S LLQKPLLS WS 
GASSCELPGP\VMLDSLDSATPVNSICSVQGTWGVDESTAFSWPVC 
DMCGNGRLEORPEDRGAFSCGDCPRVGPFPV\ LKRHPP FFTA nrpQif 

pq/ carvrvk\ llqrs i sslvkvcppgedgs yevkevslgkgsggll 
ncfvqsvtahp\tscigweei\ellsa\ggasartlavapriwgtlp 
mwpakgwwvwvigg 

1336 

A 

1 

480 

NFALEAKNSARAISSWQTPMGHFTEEDQ/AIiTITSLWGK\VNVE\D 
AGGETPGKGSLWYP\WT\QRFFDSFGNLVLLPSCPSMGNPQKSKAT 
WPRKVLT\SLGDA\TKHL\DDLKGHLLPKPEWLHLLTSLHVGS*GT 
FKLPGEILLVTRFWAIPFSAKEFHP 

1337 

A 

52 

454 

S 0T0 R E P TMVL S P ADKTNVKAA / WGM FL S F P TTKTY F PH FDT.QHGQA 
QVKGHGKKVADALTNAVAHVDDMPNAI^SAIiSDLHAHKLRVDPVNFKL 
LSHCLLVTLAAHLPAEFTPAVHASLDKFIJ\SVSTVLTSKYR 

1338 

A 

3 

616 

PTLLVPTD S ERTHPWLLS PADKDOROG P A WC5 * O * n Qhpp QNVlxTCrT .T3 
R\MVLFPPPTPKPYFP\HFDL,\SHG\SAQV\KGHGK\KVADALTNAV 
AHV\ DDMPNALSA\ LSDlxHAHKIi\ RV\ DPFNFKLLX SHCLLG* PWAA 
HLPRPSFTPCGCKASL\DKFPGLFVEAPLLEPSKL^ 
LCPFGPFPQPLLPFPAPVPPWSLK 

1339 

A 

2 

390 

GWDWNCVWEPHHWLCQSL/NSVTQAGVQLCNLSSLQPLPLGFKQFSC 
LSLPSSWDYRNPSLKQQLFSYAILGFALSEAMGLFCLMVAFLI\SLP 
CEGAVSTSHSSPASGWPRVFLFT.YT.PPOPfiTTPriWT RV 

x^jc*vj^> v «j x oiik/w r^y\»j vj r » t x\. vrur Xj X XJX^*\\^x^\jXS*vwr\fxjXv V 

1340 

A 

13159 

14007 

VSLSPRLKCSGTISAHC^CLPGSNDSPASASQVAGITGAHHHARLI 
FYFFILFYFIFLR/HESDSVTQAGVQLCNLSSLQPLPLGFK\HSIxAS 
ASQVAGITGTHRYPQLIFVFFFFLF/SFLRQSL/DSVAQAGVQW\RG 
LGSLHPLPPGFTPFSCLSLLSSWDYKP'LPTPT.a'KrFT.YF* ♦POfTVTVT. 
ARMVS IS* PRDLPTSASQS AG I TDMSHCAQ/ LI FVFLVETGFHQVGQ 
AGLE/PPDLKQSTHLGLPKCWDYRREPPRLA\NFCIFSRDGVSPCWP 
GWSPTSGFK 

1341 

A 

1 

532 

DSGTP^TVxJKxJLREWYMIISREMFNP^fYAIJFRT^P^^PV^^TTl^^ c ; 
H<^P1WLSYFKFVGRIVAKAVYDN\RLLECYFTOSFYKHIxjGKSV\R 

ytdmesedyhfyqglvyllendvstl\gtoltfstevqe\fgvcevr 

DL \ KPNGG. \N I LVTEE \ NKKE YVHLVCQMRMTGAI RKQFG 

1342 

A 

1 

532 

AEADAI IQMVREGQRARRQQQAATSESSQSEASVRREESPMDVDQPS 
PSAQDTQSIASDGTPQG\EKEKEERPPELPLLSEQL\ALDELWDMLG 
ECLKELEBSHDQHAVLETHRTVIjNQILRQSTTHxxADGPFA 

vldfdvkrkyfrqelerldeglrkedmavhvrpjdhvf 

1343 

A 

262 j 

589 

PVSKESRVAPIiCDFCLPFIQSESSQSEASVRREESPMDVDQPSPSAQ 
DTQSIASDGTPQGEKEKEERPPELP/LLSEQLSLDELWDMLGECLKE 
LEESHDQHAVLVLQPA 

1344 

A 

1 

2502 

MTPPHLPPRRASDDEFENLRIKGPNAVQLVKTTPLKPSPIjPVIPDTI 

K^VIYDMLNALAAYHAPEETVGFTSPMLro 

liNALFETFlTOALSMGGKVPVSEGLEHSDLPDGTC^ 

WPTTVLESPHSLPAKLPGGVQNFPQFSALRFLVVTQKAAFTCIKNL 

WNRK^LKVYGGFJvlAESMxxAILCH^ 

TCQEEGGSRREPQWQQQLQQL^MGFTREHA^ 

YLLTHPPPIMGGVVFJDLSMSEEDQM*!^ 

EVACRKEEEERKAREI^EEEEAKCLEKFQDADPLEQDEIjHTFTDTML 
PG CFHLLDE L PDTVYR VCDL I MTA I KRNG AD YRDM I LKQ WNQVWE A 
ADVLIKAALPLTTSDTK l^SEW I SQhlATLPQASNIiATRI LLLTLLFE 
IEVRSWSYPPFQDKDHCKKEKENFEAIAAALAAERESKPPVl^ 
QLAHSKDEPPPLSPAPLTPATPSSLDPFFSREPSSMHISSSLPPDTQ 
KFLRFAETORWLNQILRQSTTHxxADGPFAVLVDYIRVLDFDW 
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FHQELERLDEGLRKEDMAVHVR1UDHVFEDSYHTASQSLTHTNDWMYP 
GFSAQLFSASAFLCRYIVFEGEEGQDAGGIiLREWYMIISREMFNPMY 
ALFRTS PGDRVTYTINPS S \ HCNPNHLS \ YFKFVGR\ I VA\ KAVYDN 
RLLEC YFTRS FYKH I LGKS VRYTDMES EDYHFYQGLVYLLENDVSTI> 
GYDLTFSTEVGQEILITAHPSQSGRSNSQVHLRTSTA 

1345 

A 

1 

2219 

MPQLPGISLPEGVDPSFLAALPDDIRREVLQNQLGIRPPTRTAPSTN 
SSAPAVVGNPGVTEVSPEFLAAIiPPAIQEEVliAQQTAEQQRPELAQN 
ASSDTLMDPVTLIQTLPSDLRRSVLEDMEDSVLAVMPPDIAAEAQAIj 
RREQEARQRQLMHERLFGHSSTSALSAILRSPAFTSRLSGNRGVQYT 
RLAVQRGGTFQMGGSSSHNRPSGSNVDTLLRLRGRLLLDHEALSCLL 
VLLFVDEPKLNTSRLHRVLRNLCYHAQTRHWVIRSLLSILQRSSESE 
LC I ETPKLTTS EEKGKKS S KSCGS S SHENRPLDLLHKMES KS SNQLS 
WLSVSMDAALGCRTNIFQIQRSGGRKHTEKHASGGSTVHIHPQAAPV 
VCRHVLDTLIQLAKVFPSHFTQQRTKETNCESDRERGNKACSPCSSQ 
S S S SG I CTDFWDLLVKLDNMNVSRKGKNS VKS VPVS AGENKVS EAQA 
NSGSGAS STTTATS TTS TTTTTAASTTPTP PTAPTPVTS AP ALVAAT 
A I S T I WAAS TTVTTPTTATTTVS I S PTTKG S KS PAKVS DGGS S S TD 
FKMVSSGLTENQLQLSVEVLTSHSCSEEGLEDAANVLLQLSRGDSGT 
RDTVLKLLLNGARHLG YTLCKQ IGTLLAELRE YNLEQQRRAQCETLS 
PDGIiPEEQPQTTKLKGKMQSRF\SGLGSASSIQAAVRQLEAEADAII 
QMSES SQSEASVRREESPMDVDQPSPSAQDTQS I A 

1346 

A 

59 

6349 

KI SQYYMHTP I S PHPRLL ISPS I APRKVE WTGLKVKSQDRLFAQQLQ 
VELVALPLVLCLAASALGRSTTSFVSLGQPHAAIQTYSQKWPTAVSA 
SPFIiPLRGSGTGNGSSRIPRESAPEMATAESLVEELSEDAAGGASPG 
VELPALGCSELPAAEVS PTAS S KNLET I CE YAYCMAMLPETGLDP YP 
KRGFLDLTQERIWTDIPPSPGNIPTTHPLMVRHADHSSliTLGSGSST 
TRLTQG IGRSQRTLRQLTANTGHT IHVHYPGNRQPNPPL I LQRLLGP 
SAAADILQLSSSLPLQSRGRARLIi\VGNDDVHIIARSDDELLDDFFH 
DQ\ STATSQAGTLS S I PTALTRWTEECKVLDAESMHDCVS VVKVS IV 
NHLEFLRDEELEERREKRRKQLAEEETKITDKGKEDKENRDQSAQCT 
ASKSNDSTEQNLSDGTPMPDSYPTTPSSTDAATSESKETLGTLQSSQ 
QQPTLPTPPALGEVPQELQSPAGEGGSSTQLLMPVEPEELGPTRPSG 
EAETTQMELSPAPTITSLS PERAEDSDALTAVSSQLEGS PMDTSSLA 
SCTLEEAVGDTSAAGSSEQPRAGSSTPGDAPPAVAEVQGRSDGSGBS 
AQPPEDSSPPASSESSSTRDSAVAISGADSRGILEEPLPSTSSEEED 
PLAGISLPEGVDPSFLAALPDDIRREVLQNQLGIRPPTRTAPSTNSS 
APAWGNPGVTEVSPEFLAALPPAIQEEVLAQQRAEQQRRELAQNAS 
SDTPMDPVTFIQTLPSDLRRSVXiEDMEDSVLAVMPPDIAAEAQALRR 
EQEARQRQLMHERLFGHSSTSALSAILRSPAFTSRLSGNRGVQYTRL 
AVQRGGTFQMGGSSSHNRPSGSNVDTLLRLRGRIiLLDHEALSCLLVI* 
IiFVDEPKI^SRLHRVLRNLCYHAQTRHWVIRSLLSILQRSSESELC 
I ETPKLTTS EEKGKKS SKSCGSSSHENRPLDLLHKMESKS SNQLS W/ 
LSSVSMDAALGCRTNIFQIQRSGGRKHTEKHASGGSTVHIHPQAAPV 
VCRHVLDTLIQLAKVFPSHFTQQRTKETNCESDRERGNKACS PCS SQ 
SSSSGICTDFWDLLVKLDNMNVSRKGKNSVKSVPVSAGGEGETSPYS 
LEAS PLGQLMNMLSHPV I RRSSLLTEKLLRLL SLIS I ALPENKVS EA 
QANSGSGASSTTTATSTTSTTTTTAASTTPTPPYVHPPRVTSAPA\L 
VAATAI ST I VVAASTTVTTPTTATTTVS I S PTTKGS KS P AKVSDGGS 
S STDFKM \ VS SGLTENQLQLSVEVLTSHS CS EEGLEDAANVLLQLSR 
GDSGTRDTVLKLLLNGARHLGYTLCKQIGTLLAELREYNLECX5RRAQ 
CETLSPDGLPEEQPQTTKLKGKMQSRFDMAENVAIVASQKRPLGGRE 
LQLPSMSMLTSKTSTQKFFLRVLQVIIQLRDDTRRANKKAKQTGRLG 
SSGLGSASSIQAAVRQLEAEADAIIQMVREGQRARRQQQAATSESSQ 
SEASVRREESPMDVDQPSPSAQDTQS IASDGTPQGEKEKEERPPELP 
LLSEQLSLDELWDMLGECLKELEESHDQHAVLVLQPAVEAFFLVHAT 
ERESKPPVRDTRESQLAHI KDEPPPLSPAPLTPATP\ SS FDQFFSGE 
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PSSMHISSSLPPDTQKFLRFAETHRTVLNQILRQSTTHIjADGPFAVIj 
VDYIRVLDFDVKRKYFRQELERLDEGLRKEDMAVHVRRDHVFEDSYR 
ELHRKS PEEMKNRLYIVFEGEEGQDA\ GGLLREWYMI I SREMFNP\ M 
YALFRTSPG * SESPNTINPS \ SH\ CNPNHLS \ YFKFVGRI V\ AKAVY 
DNRLIA ECYFTRSFYKH\ I LGKSVRYTDMESEDYHFYQGLVYLLEND 
VSTLGYDLTFST\EVQEFG\VCEVSWTSNPMGANIL\VTEEIKEGVL 
LHLV\ CQMR\ MTGAF\ RKQLPAFIA EGFYE 1 1 P\ KRL I S I F\ TBHEL 
ELLISGLPTIDIDDLNPNTEYHKYQSNSI\QIQWF\WKKHCRSFRIK 

'IM I'Df'DV'E'D'D QT.T? r TYlVT .DTDXT^TTDfTI Vf* ~C 7\ PDCVOMPUn t^i^-ci rnr> -ct^ r» tt>t rrt 
J. L/r'^r'JVr rroLr IVjILirKV X JrblVurALrolVuMEWGFQjEVrArPSEVD 

RS\TDR\LPSAHTCFNQLDLPAYESFEKL,RHMLLLAIQECSEGFGLA 

1347 

A 

1 

746 

maaagafrlrraasalllrsprlparscrprpds itrs ppdvrlple 
kqlkna inqrgtkgp y i ry ype wdhyenprnvgs ldkts knvgt\ g 
lvgapacgdvm klq i \ qvg * krgrfvgc * gfktfs avg s a i \ as s sl 
shlngvtcgkt\veealtikntd\ia\kelcl 
afkaaladykietrtqkkgeaekkwspplgeassrptpavpqpavpv 

1348 

A 

2 

832 

SARGSTVAAI ICS PRLTPPRTRDAKAAC ERLRRVGVE PQLSRGLALF 
WSPRPNPPEEMSGGLAPSKSTVYVSNLPFSLTNNDLYPDIFQSIGKS 
Jj^lS-V UMy^lvolS-KjC / HKl\JiViv(jVAr ILr LDKDSAQK\CARA\INNKQL 
FGRVIKASIA\IDNGRAAEFIRRRNYFDKSK\CYECG\ESGHL\SYA 
CPKNMLGEQ*/RLPKKKEKKKIGaCAPEPEEEIEEVEESEDEGEDPAIi 

DSLSQAI AFQQAKI EEEQKKWETQFQGVP SNIRMI PRRTR I KKSTY F 

r> 
W 

J- JtJ 

n 

J. O 3 


AlMJ-»AXiJ-tUiM w u o V X o 1 r o ISJjKJblJiAs V lv^.c».r WJjNJjfc,KEx JbvjJjKvkrJSiN. 
DLEEVKAKVQPYLDDFQKKWQEEMELYRQKVEPLRAELQEGARQKLL 
PVLESFKVSFLSALEEYTKKLNTQ* 

1350 

J\ 

3 


Ivor Xi^iJV^^*^^AJJljoK/\i»lrJUurJjI\J^ lb I r oi\XjK.E»yjjljr' V X 

QEFWDl^njEKETEGLRQEMSKDLEEVKAKVQPYI^DFQKKWQEEMELY 
RQKVE PLRAELQEG ARQKLHELQE KLS PLGEEMRDRARAHVDALRTH 
LAPYSDELRQRLAARLEALKENGGARLAEYHAKATEHLSTLSEK 
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AAYDKFKDKRVGTKGLDFSDRIGKTKRTGYESGEYEMLGEGLGVKET 
PQQKYQRLLHEVQELTVILYNCF 
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A 

204 

433 

QHKCGTI I VT I KQLHEQAMQFGQLLTHLDTTQQM I ANS LKDNTTLLT 
QVQTTMRENLATVEGNFAS I DERMKKLGK 
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3 

392 

GNFARNEPDVYETSDLPEDDQAEFDAEELTSTSVEHIIVNPNAAYDK 
FKDKRVGTKGLDFSDRIGKTKRTGYESGEYEMLGEGLGVKETPQQKY 
QRLLHEVQELTTHVQYCFAWCGEGLGIiVPQHSFCG 

1405 

A 

1 

1314 

MADPKYADLPGIPILRPRAPAARSPIGIjSTPRTLLTRLGRAAGRHQA 
PFPAVAVARPRNRLRARNEPDVYETSDLPEDDQAEFDAEELTSTSVE 
H 1 1 VN PNAAYDKFKDKRVGTKGLD F SDR I GKTKRTG YE SGE YEMLGE 
GLGVKETPQQKYQRLLHEVQELTTEVEKIKTTVKESATEEKLTPVLIj 
AKQIJUVLKQQLVASHLEKLLGPDAAINIjTDPDGAIiAKRLLLQLEATK 

nskggsggkttgtppdsslvtyelhsrpeqdkfsqaakvaelekrlt 

ELETAVRCDQDAQETVELLQAKVSALDIAVIjDQVEARIjQSVLGKVNE 

iakhkasvedadtqskvhqlyetiqrwspiastlpelvqrlvtikqii 
heqamqfgqllthldttqqmianslkdnttlltqvottmrenixatve 
gnfas idermkklgk 

1406 

A 

2 

279 

fvfplpehrprlfrcpgpparlaaamadpkyadlpgiarnepgvyet 
sdl peddq ae fd afaqvrp fn as s lhvff i chlacnal plc yccs 
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TSCSSAAPFAAALARDPNPASPLPEHRPRLHRGPGPPARLAAAMADP 
KYADLTG I ARNDPDVYETSDLPEDDHAEFD AFAQELEELTSTS V\ EH 
I \ I VNPNAAY\ DKFKDKRVGDKG1 * FSQI VLGKTKRTG YESGEYEML 
GEGLGVKETPQQKYQRLLHEVQELTTEVEKIKTTVKE\SAAEEKIiTP 
VLLAKQ\LAALKQQ/VWVASHL/EGKLLGPDAAINL\TDPDGRPGLR 
RLLLQ\ LEATKNSLGGSRGKTPGTPPD\ SSLVTYELHSR\ PEQDKFS 
S SLPKSQKLEKRL\ TEL\ ETAVTLL IQDAQNPLS AGLQGACLMETVE 
LLQAKVS ALDLAVL\ DQV\ EARL\ QS VLGK\ VNE I \ AKH* SLC * EGA 
GYTKARLHQLIIETITAIiGAPLPSTLPGAGCRRLVTHQSSLHEASPC 
QFGQLLDT/ HLDTTQPM \ I ANFLGRTNTNPL * PQVADKPLRENLAHQ 
LKGNFAQQ*RNG*KKLGKSEAHLGKLEEPGG 
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MAPPLAAGPSPPLHPSAAGLEAGRLCKRVLGPCGDGEATPRPLNRIP 
PRVTDVQLCCVQETDIHS FSQS VNQCFLTPEHMADS S PLVGS TPTG Y 
GTLTI GTS IDPLS S S VS S VRLSG YCGS PWRVI GYHVWWMMAG IPLL 
LFRWKPLWGVRLRLRPCNLAHAETLVIEIRDKEDSSWQLFTVQVQTE 
AIGEGSLEPSPQSQAEDGRSQAAVGAVPEGAWKDTAQLHKSEEAKRV 
LRYYLFQGQRYIWIETQQAFYQVSLLDHGRSCDDVHRSRHGLSLQDQ 
MVRKAI YGPNVI S I PVKSYPQLLVDEALNPYYGFQAFS I ALWLADHY 
YW Y ALC I FL I S S X S I CL S L YKTRKQ S QTLRDMVKLS MR VCV CRPGGE 
EEWVDSSELVPGDCIiVLPQEGGLMPCDAALVAGECMVNESSLTGESI 
PVLKTALPEGLGP YCAETHRRHTLFCGTIj ILQARAYVG PHVLAWTR 
TGFCTAKGGL VS S I LHPRP INFKFYKHSMKFVAALS VLAIjLGTI YS I 
FILYRNRVPI^IVIRALDLVTVVVPPALPAAMTVCTLYAQSRLRRQ 
GIFCIHPLRINIiGGKLQLVCFDKTGTLTEDGLDVMGVVPIiKGQAFLP 
LVPEPRRLPVGPLLRALATCMALSRIjQDTPVGDPMDLKMVESTGWVIj 
EEEPAADSAFGTQVLAVMRPPLWEPQLQAMEEPPVPVSVLHRFPFSS 
ALQRMSVWAWPGATQPEAYVKGSPELVAGLCNPETVPTDFAQMLQS 

LLKPQTTP VI QALRRTR I RAVMVTGDNLQTAVTVARGCGMVAPQEHL 
1 1 VHATHPERGQPASLEFLPMES PTAVNGVKDPDQAAS YTVEPDPRS 
RHLALSGPTFGI IVKHFPKLLPKVLVQGTVFARMAPEQKTELVCEliQ 
KLQYCVGMCGDGANDCGALKAADVGISLSQAEASVVSPFTSSMASIE 
CVPMV I REGRCSLDTS FS VFKYMAL YS LTQF I S VI* I L YTINTNLGDL 
QFLAIDLVI TTTVAVLMS RTGPALVLGRVRPPGALLS VP VLS SLLLQ 
MVLVTGVQLGGYFLTLAQPWFVPLNRTVAAPDNLPNYENTVVFSLSS 
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FQYLI LAAAVSKGAPFRRPLYTNVPFLVAIiALLS SVLVGLVLVPGLIi 
QGPLALRNITDTGFKIjLLLGLVTLNFVGAFMLESVLDQCLPACLRRL 
RPKRASKKRFKQLERELAEQPWPPLPAGPLRKGKKEPLVVEASHGPE 
LS S VGEEGAGLREAAGLAGEAEGKPRQES VLEAGRRQRE I EAYFLAB 
GPRDAGESGVGEAAAVEEARAVLSDTTPACLFVAMRAAYLFLLFLPA 
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RRRPRQPTMAAAWGGRGPEVWSAGTLFDVGPRYTNLSYIRRGRLTT 
CVCSAYDNVNKVRV\AIKKISPFEH\QTYCQRTLREIK\ILLRFRHE 
NI\ IGINDI I\RAPT\ IEQMKGCI * * xT^DLMETDLYKLLKTQHL\SN 
D \ H I WY Fit YQI LRGLKY I HS A \ NVLHR \ DLKP \ SNLLLNT\TCDLKI 
CDFGL\ ARVADPDHDHTGFLTEYVATRW\ YRAPE IML\NS KGYTKS I 
\DIWSVGCIL\AEMLS*QGPSFQGKHYL\DQL\NHIFGVFLGSPSQB 
\ JJ J_tN LII Kr * JvUGN x LiLiS SliTS KIMVP JjEHACPQNADS KS S GTLLGQ 
NCLTFNP\HKRIEVEQALA/HTPYLEQYY\DPSD\EPIAEAPIQVRP 
WKLDDLPKEKLKGTNFLK3lLARF\QPGVQILNFVQTKGSEGLDVIiRH 
RCSSSQFIiTPGPVFQPCLGLSTLTPLNPFGRGRFLGSCGFYGFSKIS 
FSPGGFLLGQPCCCVHPLVTLRAVCTSVHLTAYCCFSH 
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GNKKAAPKGSLWVRTQLGLPPLLLLTMA1^GGSG\TASAEAFDSVLG 
DTASCHRACQLTYPLHTYPKVGPVRSGLRPFPCSPFLGSPHVCRLiWQ 
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QRGTRWPJlERGSLWWTQLGLPPLLLLThlAIiAGGSGTASAEAFDSVI< 
GDTASCHRACQLTYP\LHTYPKEEELYACQRGCRLFSICQFVDD\GI 
DLTRTKLECESACTE\AYSQSDEQYALPFIiGCQNSACHFAE\LRQEQ 
Jjx VPKWPJvMAP xtr JLi*1jIjGEGSFWE1jT* WDSAQ\SFITS SWTF xXiQA 
\DDGKIVIFPV*SQKSQYAPHFGAREPTNFEENHIiLSKMSSDLQMGK 
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RSKTE\DHEEAGPSYLPKVNLAPFLEI 
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RDADDAVYELDGKELCSERVTIEHARARSRGGRGRGRYSDRFSSRRP 
RNDRRNAPP* 
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SRSRSRSRSRSRKSYSRSRTRSRSRSRSKSRSVSRSPVPEKSQKRGS 
SSRS KSPAS VDRQRSRSRSRSRSVDSGN 
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ILIEYKCGKCHVCTLSNIFSFSSLVPFISCDCLCVPFPIjIjCLTQLSC 
VKBLKDFMRPAGE\VTFA\DAHRPK\Ioi^GVVEFASYGDIjIWAIEK1j 
\SEKEINGRKIKLIEGSKRHSRSRSRSRSRTRSSSRSRSRSRSRSS\ 
KSYSRSRSRS\RSRSRSDVPVLLSRSPRA*EEPRNRGSSSRSKSPAS 
VDRQRSRSRSRSRSVDSGN 

1419 

A 
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PDIMSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLKRGFGFVEF 
EDPRDADDAVYELDGKELCSERVTIEHARARSRGGRGRGRY\SDRFS 
SRRPRNDR/RVCEGWMAALNNx^*G*PFKIQESLAVMILGPAV*SVL 
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LFPR*PIVLDESI*VIEHKSIDGSH*NGL*YLMA*TCPQLNTSAVIA 

FLPL*IRIFFLRNAPPVRTENRLIVENLSSRVSWQDLKDFMRQAGEV 

TFADAHRPKLNEGWEFASYGDLKNAIEKLSGKEINGRKIKLIEGSK 

RHRSRSRSRSRTRSSSRSRSRSRSRSRKSYSRSRSRSRSRSRSKSRS 

VSRSSPCPEKS\QKRGSS\SR\SKSPSHLWNRPEVPGPRSRSQIQLD 
QWPIKPVK 
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PDIMSGCRVFIGRLNPAAREKDVERFFKGYGRIRDIDLKRGFGFVEF 
EDPRDADDAVYELDGKELCS\ERVTIEHARARFTRLGRGRGRYSDRF 
NNSAELRNDRRNAPPVRP\ENRLIVENLSSRVSWQDLKDF\MRQAGE 
\VTLPDTTRLNIi\NEGWEFASMGDLRNAIEKLSGRELNGRKIKLIE 
RPAKRPQ*VQQSRSSDPGTQKSPLGPRSRSPSPW7UsTIiNSRSKK/RR 
GSREPGSPEPSPRSC*VGSSPVP*ERFFKGYGRIRDIDLKRGFGFVE 
FEDPRDADDAVYELDGKELCS 
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1 

1317 

MTQRFLIVLGCLILAVLTTFKEYETVSGDWLLLLETFAIFIFGAEFA 
LRIWAAGCCCRYKGWRGRLKFARKPLCMLDIFVLIASVPWAVGNQG 

IGFLTLILSSFLVYLVEKDVPEVDAQGEEMKEEFETYADALWWGIjIT 
LATI GYGDKTPKTWEGRL IAATFSL I GVS FFALPAG I LGSGLALKVQ 
EQHRQKHFEKRRKPAAELIQAAWRYYATNPNRIDLVATWRFYESWS 
FPFFRQVGTHLNAQGVTSHLSCEESNFDPEGNEKIJDEPENDISRSTF 

QEDDSKECGGWIYGLKSKQEDQLANIAGFRVRGNEGLDQGKNSWRQH 
PDFSAYICEMKELiKOMVP'K*ar , nr3WWf2T?DT caktmdtt addt ccmtjoit 

QL YE FNMNAE KEPNGN 
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MTDEDLVFRKHRVTVWQKECSFPFQQPQHHDLHTAPSFKDKVTEAQK 
GHRFAEGQPDGKGGLGLKSSQKLGLLDRVRLSNPRGSNTKGKLFTPIi 
NVDA I EES PS KEPKPVGLNNKES LRTAFRMKAYAFWQS S EDAGTGDP 
MAEDRGYGNDFPIEDM I PTLKAAIRAVR ILQFRLYKKKFKBTLRPYD 
VKDVI EQ YS AGHLDMLSR I KYLQTS I CGWRVNI WTSDC I S PAMLTLR 
GKllERKCRQWRSSWVSKEETEGAMTVSGCIILMVLDL 

1423 

A 

19 j 

2867 

PPDPLPGLPCPPGGPPLPAFGGGWGGARGS WHWS S RACS RRRRLVHA 
PRAPLLPRAAAEKAKRPAGARQMGLKARRAAGAAGGGGDGGGGGGGA 
ANPAGGDAAAAGDEERKVGLAPGDVEQVTLALGAGADKDGTLLLEGG 
GRD EGQRRTPQG IGLLAKTPLSRPVKRNNAKYRR I QTL I YDALERPR 
GWALL YH \ ALVFL I VL.G \ CL I LAVL \ TT FKE YETVS GD WLLLLETFA 
I F I FGAEFALR I WAAGCCCR YKGWRGRLKFARKPLCMLD I FVLI AS V 
PWAVGNQGNVLATSIiRSLRFLQILRMLRDGPGEGGTWKLIjG\SAIC 
AHSKELITAWYIGFLTLILSSFLVYLVEKDVPEVDAQGEEMKEEFET 
YADALWWGLITIiATIGYGDKTPKTWEGRLIAATFSLIGVS FFALPAG 
ILGSGLALKVQEQHRQKHFEKRRKPAAEIilQAAWRYYATNPNRIDLV 
ATWRFYESWSFPFFRKEQLEAASSQKLGLLDRVRLSNPRGSNTKGK 
LFTPLNVDAIEESPSKEPKPVGLNNKERFRTAFRMKAYAFWQSSEDA 
GTGDPMAEDRGYGNDFP I EDM I PTLKAAIRAVRI LQFRL YKKKFKET 
xjrtr- iljv inj-j v xcy x oavjJ1XaUI*lLjoK.L Ki XjQTRIDM IFTPG\ PPSTPKHK 
EVFRKGQHFTFPSQQSSRGLNHM * ARPST\ S£I \ EDQRH * WGKFVKS 
LKGQV\QGLGRKLDFLVDMHMQHMERLQVQVTEYYPTKGTSSPAEAE 
KKEDITOYSDLKTIIO^SETGPPEPPYSFHQVTIDKVSPYGFFAHDP 
VNljPRGGPSSGKVOATPPSSATTYAn^PPTVT dtt tt t ncuvcrwcna 
DLQGP YSDRI S PRQRRS I TRDSDTPLSLMSVNHEEIjERS PSGFS T SO 

DRDDYVFGPNGGSSWMREKRYlxAEGETDTDTDPFTPSGSMPLSSTGD 
GISDSVWTPSNKPI 
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A 

2 
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pdssgphrlrenppmvavscptktnvkgppggkvgahag\eyg\sea 
leXrmflsfptXtktVfpXhfdlVshglcprlkghgxkkvadaltXn 
avahvd\dmp\ngvvrp*sdlh\ahkl\rvdpvnfkllsh\cllvtl 
a\ahlp\aef\tpavprppwdkfpwlsvkhrcltfkyr 
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M I LL VFLPXHQVFLERXQESE I LHHLNTLADVL * 
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R WKRR VNAIjKNLQVKCAQ I EAKFY \ EEVHDLOR KYAVT/VOPT , PDTTR 
FEIINAIYEPTEEECEWKPDEEDEXSBELKEKAKIEDEKKD 
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MDVTVSELLELFLQSPLVTWKTFGPFGSGSQDNLT>rmDLVr)GIPIi 
NQ I MLQ I D PR PTNQR I NKHVNND VNLR I QNLTI LVRNI KT YYQDMG Y 
GQTQAWGAGILESFLGGNVCLRLVGLTGREGVSSGEERHLQSFGDIK 
QPEGWGVSCGYRFRKGELEPLEVAVFVGTGSVGAAVMVAMGAATLEA 
PE ALE S I NQE S S CGC I KPCGNS LEWROLARALLVR wann Ynppr: t?f a 
DEEGEEGGDEENDPDYDPKKDQNPTECKQQ 
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A 

1 

836 

MDVTVS ELLELFLQS PLVTWVKTFG PFGSGSQDNLTMYMDLVDG I FL 
NQ I MLQ IDPR PTNQR I NKHVNND VNLR I QNLTI LVRN I KTYYQDMGY 
GQTQAWGAGILES FLGGNVCLRLVGLTGREGVS SGQERHLQS FGDI K 
QPEEAQGRMVGS FERTPQI CGLG YRFRKGELEPLEVAVFVGTGSVGA 
A VMV AMG AATLE A PEALE S INQE S S CG C I KPCGNS LEWRQLARALLV 
RWGDDYDEEGEEADEEGEEGGDEENDPDYDPKKDQNPTECKQQ 
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ARAKTYRMRSEPDDSDPFPFDGPEIMGCTGCQIDWKKGKNVTLKTIK 
\KKQKHMGRGTVRTVT\KTVSNDSFFNFF\APPEVPESGDL\DD0AE 
AILAADFEIGHFLRERIIPRSVLYFTGEAIEDDDDDYDEEGEKSG 
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LQKQKQANKQQIT* K* ACQMV/ SNS S FPGKQKVDPTTKKRCLVNGGL 
NLKIQL/ 1 IQANF* KSRFIHLTWPVTILSQVTLQLTMSPKTQ 
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- 

VTACAAPAAWLPILVADIWSSYNMADIDNKEQSELDQDLDDVEEVEB 
EETGEETKLKAR\ Q\ LTVQMMQNPQILAA\ LQERLDGLV\ ETPTGYI 
ESLP \ R WKRR\ VNALKNLOVKCAOKFTOV FVRTrvunT .PPwavTvn 

PLFDKRFEIINAIYEPTEEECEWKPDEEDEISEELKEKAKIEDEKKD 
EEKEDPKGI P * IW\ LTVFK\NVDLLSDMVQEHDEP ILKHLKDI KVKF 
SDAGQPMS FVLEFHFE PNE \ YFTN\ EVLTKTLR\ M S S I PD\ DSDPFF 
F*MEPEH*GCTGCQIDWKKGKNVTLKTIRK\RPKPQGTWGQFRTVN* 
NQFPNDSFSNFFCPLLKFLESEDRND\DAEAILAADFEIG\HFL\RE 
RIIPRS\VLYFTGEAIEDDDDDYDEEGEEADEEGEEEGDEENDPDY\ 
DPKK\DQNPAECKQQ 

1432 

A 

1 
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PLKRSDGCNDGRPTRPPTRPDTTVFTSNLKQTLLVHLTPVE\KSAVT 
ALWGKVNVDE \ VGGKALGRLL WLPWD PKR SFQS PLGESVPTP * MAK 
EKVLGCPLVVGLASPGTIXKGHLCPHWSELALLTSLPRGIPEEI^GS 
WGKRAGSCVAWAQSTFGQKNFNPNKLQGLPNQENWLAWCWLNALGPT 
SNHLSLAFLAGP I SN 

1433 

A 

240 

463 

GLVFSRL S PE YYDLARAHLRDEE KS C PCLAQEG PQGDLLTKTQELGR 
DYRTCLTIVQKLKKMVDKPTQRSVSNA 

1434 

A 

2 

206 

TMSTWLLLQRS VSNAATR VCRTGRS R WRD VL S CGRS TG S CPQ I PGT 
SATS AGS S LPRS RIHS PVSLP 
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2940 

3296 

PVC TVSG PMA PGAS S SHPVLPLLGLL BPAGLAELGRTH P I TMS TWL 
LLQRSVSNAATRVCRTGRSRWRDVCRNFMRRYQSRVIQGLVAGETAQ 
QI CEDLRLC I PSTGECRGDSRDTS 

1436 

A 
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736 

ENKISSVFKADFLPPAPCSLPGLEVSVSPKGKNTSGRESGFGWAIWM 
EGLVFSRLSPEYYELA\RPHLRDEEKS\ CPC\ LAOE\ GPOG\ DLLTK 
TPELG P * I TRTC \ LT I \ VQKT * RKMVDKP \ TQRS VSNAATRVCRTGR 
S RWRDVCRNFMRR YQSRVTQGLV\ AG \ ELAQQNLVSTS RLCI PSTGP 
L 

1437 

A 

3 
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QTQRE PTMVLS PADKTNVKAAWGKVGAHAGEYGAEALERMFLS FPTT 
KTY F PH FDLSHGS AQVKGHGKKVADALTNAVAHVDDM PNALSALSDL 
HAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 
TVLTSKYR 

1438 

A 

3 

485 

PTLLVPTDSERTHPWLLSPADK\TNVKGPPGGKVGAHAVRSMCAEAL 
ERMFLSFPTTKTYFPHF\DLSHG\SAQV*GPRARKVADAL\TNAVA\ 
HVGRTLPNALVPPLSDLHAHKL\RVGPGSTFKLLKATCLAGLTL\AA 
HLPARVQPLAVASL^WDKVSWASC 

1439 

A 
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RTCASLKSLHRPHGSHGHGQRGGFL*VFSSSFDSSDGVWQVCSPGGQ 
IPPTCPHHCCDPESPDSSSGPPVPWHLPCAVQGPSPGGL 
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REHPVAGLQEHLQGGGSGQQDLRG * WAYFS / HR * SRKVPTS W * RRWK 
MVAPWAARRV 
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S WPQVPKTNKI E PRS YS 1 1 \ TS CG I \ RRLG PALNTLI F S K*NASGPA 
GHSAKSIEGAPRGKGRGRAVARLAADRPPAPKIQLPSFV/LRSTL*Y 
Fli Lit. Jbt» \ 1j FKJjJLiA 1 Hit PSWG S \ S JLi K \ D IjKWT\H5NYRASKEP\CIVI 
F\VTTSPG\REWVICAPAAFLG\CGSRFSGSPLPE\SNP*FPV\TRG 
HHGRHGDYHRKLIGQTFEWVWRRHGG\RAIGPRLSRVTKAAGARPP 
AGAGEG/IiDRVGFDLINA\PSPPAKGVSARRHVLALELPQIiSK 

1442 

A 

2 
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RKTQTTRRGPGLWAGPGG * RGGWWSRRLLLAAGFLGTHPGSTHPGLQ 
QPRFKWDHTRS S QGAF I FT FFPRGGQEHS FTS 
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VMQRD I SAGD F I EHAE FSGNLYGTRWAMRGCGWAPRVAV 

1444 

A 
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890 

PRSIGEGLGPSLLCGSGRARFSSGGMSGPRLWLSGPSGA\GKSTLL 
KRLLQEHSG\IFGFSVSRESRALVEG\TTRNPRPGEGRRQKITYFVT 
REVMQKD 1 AAGD F \ I EHAE F \ SGNL YG \ TS KV \ AVQAVQAJj * PHR VC 
LD V\ DLQG \ VRN I RPTDLRPNLHLLFS P P FTATC WKQPGFRQPQQLE 
TEGRAWLKAGLLL AQA\ DMEE PAKEAPALF \ D WHPLNDQPGTQA\ Y 
AELKEALSEEIKKAQRTGALRLAVCSR 

1445 

A 

978 

1440 

AGVGVRGTTGRL WRKFLTL I FGNPLFLVAPPKPHS EWSQRLTY\ RR 
RFSPYNTA \ SNKT \RIjSPNPDGNR I CLPF I PKKVGKAPQSIjHVVCAP 
GRLRG \ VRAVR PKV/ LL * RLS KTKKHVS RAYG W FHCVXi KCGRD \ R I K 
ACFSLIGGSRKIRCGKSV 

1446 

B 

43 
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MDWTWS I LFLVAATTGVHSQVHLVQSGAE VKKPGAS VKVSCKAS FNS 
FDTYGFNWVRQAPGQGLEWMG WVS AFNGDTNY I RKLQGRVTMTTDS S 
TS TAYLELRS LKSDDS AI YYCAATNSDKYFWGQGTLVTVS AAS PTS P 
KVFPLSLCSTQPDGNVVIACLVQGFFPQEPLSVTWSESGQGVTARNF 
PPSQDAFGDLYTTSSQLTLPATH* 

1447 

A 

3 

1637 

SPGIFRGFQSVIRTEQRELTMESGLNWLLLVAVLKGVQCEVQILESG 
GGQVQPGGSRTLSCAASGFIFSNYVMTWVRQAPGKGLEWVSSTAASG 
ANTFYAES VKGRFTVSRENS ENMM YLQM S SLRDEDTG I YYCAKDGDV 
PNLGVAWIVAGPGNVRPRKWFDAWGQGTTVTVSSASPTSPKVFPLSI* 
CSTQPDGNWIA\SCLVQGFFPQEPLSVTWSESGQGVTARNFPPSQ\ 
MASGDIjY rTbbQJbTLiPATQCIiA\ PKSVTCHVKHY TNPHPDVDG\PCP 
VPSTPPTP/ CSLNSTYPISLMLPPPTVTAPTGPSKDLFIiGSEANLTC 
TLTGLE\NASGCHFQSEGLQVGKSAVQGP\ PEA* PSVAAYS WQLSC 

T3f2TjJ"D TT BMTTDT A7DDT?TPTA TV T CD O T"3D T'D 7\ XTO n T1 C P VCniTU T cr»TiT?PTJO 

K<jWKii/U w l.E»F.UVKFr ILIAALyKbPKIKANRPPbbJ^GwllobrbGi'r 
CCRPPSEEI^NELVTIiTCLAR\AFSPQGPCWVRWLQGSPKLPPRKS 
T * LG/ PFPAGAQARAPTTFAVTS ILGR / VQPEDWKKGDTF\ SCMAGH 
\ EALALAFTQKT I DRLAGKPTHVNVS VVMAEVDGTC Y 

1448 

B 

113 

249 

XAAMTTASTSQVRQNYHQDSEAAINRQINLGALRLLRLPVHVIiLL* 

1449 

A 

846 

1193 

VMGPKPLPGIVPEFLKNWPRPSGLLIEFCPHWDTTDMTSN/CLV*BB 
NYSEQCLBLLNPVGMDLILRGDCESYHGHKPNRKLGSQHLSDQAALT 
GRLS S PCLMKRRRS AS FRFTQAG 

1450 

A 

2 

1087 

AIEHCQSGDNPESRRGFLLQQWIrGRNPALVPHPGRTGHSQPPVTFHR 
HPSDCQRSPAGRFKGGPSHRGQPPPFHKSPPMTTARPTSQ\VRQN\Y 
HQ\DSEAAINR\QINLE\LYASYVY\LSMSYYFDRD\DVALKNF\AK 
YFLHQSHEER\EHC*ENLMKACRTNEGWPNLSFQDIKETKTCDD\WE 
SGAC.L.Q WKCAJjH \ JjEKNVES SHYWNIjHKIj \ ATDKN * PP I CVDFX * DT 
FTLNEO /V*KAIKRIWGDH/ V* PKLWRKM(3^APNT,GFGEYIiF*OSTP 
W\ETVIMKAKPRANFPNSRGVTFLVTKAVHACWGFLYLFYKLYQNIH 
LSSLICTIPSNKEIWXPGWFEVLDESEIYPGYIiPDSIiSAWQF 

1451 

A 

24 

452 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDIjKGTFAQLiSELHCDKL 
HVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

1452 

A 

41 

542 

APSPRRPWGHFTEED\KATI\TSLWGK\VNVE\DAGGRKPLGKAPWIj 
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STPWTQRFFDSFGNLS SAS AIHGQTPKVKAHG\ KKVLT\ SLGDA\TK 
HLDDIjKG\TFAQA*SELH\CDK\LHVDPENFK\IiLG\NVIjVT\VL\A 
IPFSAKEFTPEGCRASWAERWVTWSWPVPCSSRYH 

1453 

A 

1 

1233 

MGEAFAQENMAENP Y I S VWSH I LNNWEFPKWSLDDSTKVQIiNSMKL 

fSfST'TM A DDDUnT.A/PTTrSTT R PDPr , OPPCtJ'l'L'UP1\r!nT?r>TiT c<rtiiT nrtnm 

l liPlrtlrKilvvyij 1 i-iAFKOColrr oWl fc/KPACJPFPPLiSPALRQRT 
ASTHS AGAR I APHHTRLRLEE P VTETRSGNE I LPPGEEARGGCS AVG 
AAPPSPSRPGPPPHAAPMHPFYTRAATMIGEIAAAVSFISKFLRTKG 
LiTS ERQLQTFSQSLQELLAGEQGEG VGGTRAPHPWVRVRPSGLREPR 
QRPGAGRAWAPPRDRWQNGEEKTTRTTKKLKPTEAGAAVRGPSADLG 
TKE P WG W EH Y KHHW F PE KP C KG S G YRC T P T NHKMH P T . T no ZV a OP T fi T, 
S S QELFRLLP S ELTLWVDP YE VS YR I GEDG S I CVL YEAS P AGGS TQN 
STNVQ^m^SRISCKEEIlLLGRTSPSK^^YNMMTVSG 

1454 

A 

233 

884 

ESPGVGCSARRGPRPRSPGPPPAAPGTPRPHGIPLYTRAGHO**GEI 
RRRPCTFISKFLRPQGGSASERQLPDLQARAWQELLGRPFNKH\HWF 
PR*SPCKGIGVTRCI\RINP*KWIPLIGPGQHSAIGLSSQELFRLLP 
\ S EL \ TLWG * P P I EVS YR I GEDG S HL.C ACM KP S PA\ GG S T\ QNQT\ N 
VQMV\ D S \ R I S CKE ELLLGRTE P F P KTTNMMTVSG 

1455 

A 

363 

487 

GRFLSLSHFCPPAPTSQVRPWALELRDSPHPSPTPAPSIiML 

1456 

A 

742 

990 

GTPRGDMRGRLYCSLPGQCLCGFIKCGAPLRPGLPFSSVPAAGNAFC 
QAAQLHLQLQSKHDAATCFVDAGNAFKKADPQGEGL 

1457 

A 

1 

198 

LFPAFSDSRECKLMKKLLEAHEEQNVDS YTESVKEYDS I SRLD\ QWL. 
TTMLLR I KKT I QGDEEDLR 

1458 

A 

281 

584 

GRFLSLS PLLS PSQAYKAHVAAETCCFCLGAGSSS PSPGSLPQLPAA 
PTSQVRPWALELRGFSPPQPHTCSFPNALRFFWLEAAAGPRKKIKNH 
PPPKKKK 

1459 

A 

120 

1069 

KPFEAAMYNSGKEAEAMAIpLAEAEPKVKN\SQSFFSGLFGGSSKIEE 

ace\ iyaraanmfkmajqw\saagn\afcqa\aqlhlqlqsk\hdaa 
t\cfvdgwqlhsrkadpqggpfncfdcvafeiy\tdmgr\ftiaakh 
hi s i c * dllrqklvd i ekai ahlrgqs ady \ ykgrgs sn s ftnkcli* 

EGGLVTPALLGAVIQKAIGHFTEQVGGPNAHWTSPPPSKY\SAKDYF 
F\KAALCHFCMDMLQA\KLAF\QKYEELFPVFSDSRECKIJ4KK\LLK 
A\HGEQNVGQLTPS S VKG ITTP I SRL\ DQWL\ TTHCLR I \ KKTIRGD 

1460 

A 

255 j 

2154 

IiAEPEVATDSGQQADLPAEGGDPRAEASCSVLHSKPHAMADSRDAAS 
DQMQHWKEQRAAQ\ KYIiGVDLLGHKXADVLTTGAGNPVGDKLNVITV 
GPRGPLLVQDVVFTDEMAHFDRERI PERVVHAKGAGAFGYFEVTHD I 
TKYSKAKVFEHIGKKTPIAVRFSTVAGESGSADTVRDPRGFAVKFYT 
EDGNWDIiVGNNTPIFFIRDPILVSFLFSDRGIPDGHRHMNGYGSHTF 
KLVNANGEAVYCKFHYKTDQGI KN1»S VEDAARLSQEDPD YGIRDLFN 

2XT AT > f^KA r PQWTPYTriVMT , T?Mr»2iPT , T?D'C r KroirT^T TVTrwDtivnVDT T 'D\in 
aiMxwM row ir x lyvnji c vt\jJ\rt x r kr r vt c r ULj 1 IV V WlrrliviJ X krLixxrVl** 

KLVliNRNP VNY FAEVEQ I AFDP SNM P PG I EAS PDKMLQGRLFAYPDT 

HRHRLGPNYLH I PVNCPYRARVANYORDRPMPMOnwonr; APNYYPNQ 

FGAPEQQP S ALEHS IQYSGEVRRFNTANDDNVTQTSLYLTYRFMEWI 

EKVRAFYVNVLNEEQRKRLCENIAGHLKDAQI FIQKKAVKNFTEVHP 

DYGSHI QALLDKYNAEKPKRSIjAF I RVTRS S LDEDS PVLDVQMQASG 

FKIENPTFYSRLCNNFNAISPGENEALTIEQMGSTEANCIiLRNSVQL 

VAFVI E I TSRKSD I VERHKCAWT 

1461 

B 

129 

321 

XYWMLCSKAEGCCSGAPKAVGWIWSTTLIVLHAHRAVTIiVVGHSST 
GRDS SQVYEDNWVPGRWR * 

14 62 

A 

70 

2954 

RMLITGSPALGSAI PGTPGRRGEALQLLGQSGTLiPFRTIjIjGI SGLQA 
SFGEPCSEPIGLPILACLAPSSQSEGSPPEGAGRGGGADWLSLKSPR 
TRGNRQI CLLRVETHEPRPPAVFCTANRTLWLTAGI PPATRCSTGRS 
SGPRRIEMRACTTSIQGNNVQDKWKSNCEVPDVECACTFUESLKPGW 
IiGQS KQGAKRSGGDECLQS SHLWEFVRDLLLS PEENCG I LEWEDREQ 
GIFRWKSEAIJUClWGQRKKNDRMTYEKLSRAIiRYYYKTGIIaERVDR 
REKLFNSAIRDTSNAPI PKVQKFSNNCGSS PKVRAFYVNVLNEEQRK 
RLCEN I AGHLKDAQ I F I QKKAVKNFTEVHPDYGS H I QALLDKYNAEK 
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PKVWPHKDYPLIPVGKLVLNIUJPVNYFAEVEQIAFDPSNMPPQIEAS 
PDKMLQKADVLTTGAGNPVGDKLNVI TVG PRGPF \ liVQDWPTDEMA 
HFDRERIPEEKLCMLKG\AGAFWYF*RSHIDITKYSKGKRYFE\HIW 
KRRLPIAV\RFFHCLLGESGFSLTQFR\DPRGF\AVKFYTEDGNWDIi 
VGNNTPIFFIRDPILFPSFIHSQK\RNPQTHLKDPDMVWDFWSLRPE 
SLHQVS FLFSDRGI PDGHRHMNGYGSHTFKLVNANGEAVYCKFHYKT 
DQG\IKNLSVEDAARLSQ\EDP\DYG\LRDLF*TPLATGK\YPSWTF 
YIQVMTFNQAETFPFNPFDLTKVWPHKDYPLIPVGKLVL/NTRNPVN 
YFA* GWNR * PFDP\ SNMPPGM * GPVP \ DKMAFRGRLFCLFLDTHRHS 
F _*U*FI\l ibM JjF V £, Jj V P YKG5 EWGQLTRRDG PDVALQDNQGWWPKF I 
NPNSFG\AREQQPSALE\HSIQYSGE\VRRFNTAN/EMDNVNSGGGH 
FYVERAELREQE/RKRLC * GTFAGHLEGMAQI FI PEGKRVKNFH*GS 
HP * PTGAH I PRLFWTKYKA\ ENPKNAI \HTFVQSG I S P 

14 63 

21 

-a 


UiyKiiFiMvLb FAD KXN V KAA WG KVGAHAG E YG AE ALE RM FL S F PTT 
OTYFPHFDLSHGSAQVKGHGKKVADAIiTNAVAHVDDMPNALSALSDIj 
HAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLiDKFIiASVS 
TVTjTSKYR 

1464 

a 

•r- 



rUi> £> <j FiiKliKxvrJ FFWCbb PAD KTNV \ KAAWG KVG AHAVRSMCAEALE 
RMFXSFPTTKTYFPHFDLSHGF\AQV*GPRARKVADALD/TNAVANV 
GRTLPNALVRPL\SDLHAHKLSGGTRFNFKAPKGHLPCI.GEPWAAHL 
PRP S FNP WRLQRLP WGQS FLG FLLKHRCLNL PNYR 

1465 

A 

154 

678 

PPLHLRDCFSPPGRALSPVGLYPYR\RSVP\TWLKIiT\SDDVKE\QI 
YKLi \ AKKG \ LTPS \ Q I G \ VI LRDSHG\ VAQ VRLG \ TGHDTFKI \ LKS 

kgldpdlpedlyhlikk\avavrkhlern\rkdkdakf\ri»ilie\s 
rihpfwi^yyk\tkr\vlpp\nwei*nhltasalgrin_vwctpsk 

1466 

A 

23 

636 

FSYLPPGPSHGTWGLWELQFKMRFGVCRHLMEDSMDMDVSPLR/PQN 

YLFSCELKADKDDHFKVDNDEl^HQLSLRTPTVSLGVFEITPPVXiIiW 

LKCG SGPVH I SGQHIiVAVEEDAES EDEEEEDVKLLR I SGKTKTFMAT 

NGKEYKHYKISSEKSIJDNKYICra 

SLRVTPPVFLVLRIJJDFD 

1467 

A 

3 

618 

AKD/ELHIVEQGHDIRGRSIKIT/IATLKMS/VQPTFS/LGGFEIQP 
TVV*GLKCVSGPCHISGQHLVA/VEEDAESEDEEEENVTCLLSISERR 
SAPGWSMVPQKKVK\LAADEDDDDDDEEDDDEDDDDDDFDDEEAEE 
KAPVKKS IRDTPAKNAQKSNQNGKDS KPS STPRS KGQES FKKQEKTP 
KTPKGPSSVEDIKAKMQAS IEKAH 

1468 

A 

1 

1689 

MEDS^MDMSPLRPQNYLFGCELKADKDYHFKVDNDENEHQLSLRTV 
SLGAGAKDELH I VEAEAMNYEGS P I KVTLATL KMSVQPTVSLGGFEI 
TPPVVLRLKCGSGPVHISGQHLVAVEEDAESEDEEEEDVlJRVIiFMKP 
KGRGLKHMFGDLVCSWKLAAIETQSSSSEEIVPSPPSPPPLPRIYKP 
CFVCQDKS SGYHYGVSACEGCKGFFRRS IQKNMVYTCHRDKNCI INK 
VTlttTOCQYCRLQKCFEVGMSKESVRN^ 

PE^GELIEKVRKAHQETFPAIiCQLGKYTTNNSSEQRVSLDIDLWDKF 
SELSTKCIIKTVEFAKQLPGFTTLTIADQITLLKAACLDILILRICT 
Kx I F_.yD I m L r oUlj_li_NK 1 QI^lHNAGFGPLTDIjVFAFANQLLPLEMD 
DAETGLLSAICLICX3DRQDLEQPDRVDMLQEP | 
RPHMFPKM^KITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLEN 
SEGLDTLSGQPGGGGRIXKX3IiAPPPGSCSPSLSPSSNRSSPATHSP 

1469 

A 

3 

535 

DS VLRGCSLEQRSFI S VRLIiS YLSACRHPMEDSMDMDMS PLRPQNYL 
FGCELKADKDYHFKVP^sTDENEHQL S LRTVS LGAGAKDELHI VEAEAM 
NYEGSPIKVTLATLKMSVQPTVSLG^ 
GQHIiVVYRRKHQELQAMQM\DCRALSTS*ASSAPRPS 

1470 

A 

1 

1025 

SVIiRGCSLEQRSFIYGRLLSYLSACRHPMEDS^MDMSPLRPQNYLF 
GCELKADKDYHFKVDNDENEHQLSLRTV^ 

Y\EGKSN*T*HI_\TLKMS\VQPTVSL/GGAFEITPPVVI J RLKCX5SGP 
VHIKWTSTYVAVEGKMQKSRRI,KKEGRIVKI J LKVYLGK\RSAPWKVG 
SKVSTEKKVKTLIjA*WKDDDRMI^EE\DDD\EDDDDDDFDDEEA^ 
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AP\ VKKS 1 \ RDTPAQKCSESQNQNGKRLQKPS FTPKNQKGQES FQET 
RKKLLKTPKG\ P\ SSVEDIKAKMQAS IEKGGSLPKVEAKFINYVKN\ 
CSRMTDQEAIQDLWQWRKSL 

14 71 

A 

3 

490 

SSGPTRLRENPPMVAVSCPTKTNVKA\AWGKVGAHAVitSMCAEALER 
MFLSFPT\TKTYFPHFDLEPRFLPRFKGHGKKVADALTNAVAHV\DD 
MPKRAVPPLSDLHAHKF\RVGPGSTFKLLKPLAIjIiG*TLGRPTSPSE 
FQPLAVARLPWGQSFLGFLLKQPC 

1472 

A 

35 

1288 

KSSHCIKMGPQIFHKTSELFLPATSCPSCPDQNEEDVSQTQYKECCG 
GGWCSHS I FAVWHFAALGAGFSDKTpAHTVTMAC I S ANQAMTTG VGL 
lASGQCTVIVAGGVELMSDVPIRHSI^KMRKLMLDLNKAKSMGQRIiSI* 
ISKFRFNFLAPELPAVSEFSTSETMGHSADRIAAAFAVSRLEQDEYA 
LRSHSIiAKKAQDEGLLSDWPFKVPGKDTVTKDNGIRPSSLEQMAKL 
KPAFIKPYGTVTAANSSFLTDGASAMLIMAEEKALAMGYKPKAYLRD 
FMYVSQDPKDQLLLGPTYATPKVLEKAGLTMNDIDAFEFHEAFSGQI 
LANFKAMDSDWFAENYMGRKTKVGLPPLEKFNNWGGSLSLGHPFGAT 
GCRLVMAAANRLRKEGGQ YGLVAACAAGGQGHAM I VEAYPK 

1473 

A 

1 

1641 

MLPRSPTPNPQDPQDLRRNARQPARSHAAQGDTDRAARPLSRIiGEAG 
SRERGIRRPAVPCTYQSQPAAPGSPSHDDPKTERPLGSDSRLQGCDT 
QAPHRLGKDGKDWAAREALFVAS FAS VKIGELRGLLHRTSVPKEWD 
YI I FGTVI QEVKTSNVAREAALGAGFSDKTPAHTVTMAC I SANQAMT 
TGVGLIASGQCDVIVAGGVELMSDVPIRHSRKMRKLMLDLNKAKSMG 
QRLSLISKFRFNFLAPELPAVSEFSTSETMGHSADRLAAAFAVSRLE 
QDEYALRSHSLAKKAQDEGLLSDWPFKVPGKDTVTKDNGIRPSSLE 
QMAKLKPAFI KP YGTVTAANSS FLARGVESG WPTG WNTTLLS PGGE 
ELVRLVICKTELDDRAMWHGLHLPDLNPFDYFQGMLKVQVYLTDGAS 
AMLIMAEEKALAMGYKPKAYLRPTYATPKVLEKAGLTMNDIDAFEFH 
EAFSGQILANFKAMDSDWFAENYMGRKTKKFNNWGGSLSLGHPFGAT 
GCRLVMAAANRLRKEGGQYGLVAACAAGGQ 

1474 

A 

212 

371 

HPVTVYLLLGYLLFQLPCGSEFSTSETHGHSADRLG\AAFAVSRLEQ 
DEYAPG 

1475 

A 

63 

255 

VLMFSSSHG*GYQSS\RLQCKLQIVQLIQDILLFFSF*IPE*LLS*I» 
TPLKIFPLHQNGPSDFVS 

1476 

A 

169 

391 

FLKHISVEKIYKACGSIELKKFIFLIGKDTVTKDNGIRPSSLEQMAK 
LKPAFIKPYGTVTAANSSFLELAMLFV 

1477 

A 

85 

1534 

KSSHCIKMGPQIFHKTSBLVLPATSCPSCPDQNEEDVSQTQYKECCG 
\ GGWCSHS I FAVW\ HFI * RPDAT* FGLEQRLTGLLASGP VSLREW* 
LYSSLGTVISGK*KTSNVG*RGLALGSWAFSDKYSWFTMFTWACISG 
PTKAL\TTGV\GIiIAF\GQCDVIVAGGVELMSDVPIRHSRKMIKLMIi 
DLNKAKSMGQRLSLISKFRFNFIiAPEIiPAVSEFSTSETMGHSADRLA 
AAFAVSFL\DQDEYALRSHSLSKKAQDEGLLSDWPFKVPGKDTVTK 
DNG IRPS S IiEQMAKLKPAF I KPYGTVTAANS S F/ LLTDGASAML IMA 
EEKALAMGYKPKAYL/RRDFMYVSQDPKDQLLIiGPTYATPKVLEKAG 
L\TMNDID7^EFHEAFSG\QILANF\KPMDSDWFAE\NYMG*KKPRF 
GLPPLWRRFNNWG\GSLS\LGHPFQT\TGCR\LV\MTAANRIiRKKGG 
Q YGL\ VAACAPG\ GQGS ATDYVEAYPK 

1478 

A 

2 

359 

LDQQCKQTSEYLRYEDTLLLEDQPTGENEMVIMRPGNKYEYKFGFEL 
PQGPLGTS FKGKYGCVDYWVKAFLDRPSQPTQETKKNFE WDLVDVNT 
TPDLMAPVSAKKEKKVSCILNNNVQ 


a 

A 

X 

585 

PRGV IGHG PLGTS FIGKYGCGDYWVKAFLDRP SQPN\ QG PKKNFE \ V 
WDLVDVNTP\ DLMA\PVSAKKERKVSCMFI PDGRVS VSARIDRKGFC 
EGDEISIHADFENTCSRIWPKAAIVARHTYLANGQTKVLTQKLSSV 
RGNHIISGTCASWRGKSLRVQKIRPSILGCNILRVEYSIiLIYVSVPG 
SKQVFIKAL 

1480 

A 

218 

1677 

seiifckgvssiwsffflppsft.tiiktnsvpswvmfk\kiksfev\v 
fndpgkgvtavgekva\grvnsgrcvevtrv\kavrip\acrsglkc 
lwmpgi psrckqts \ eypryedtvfl\ edqptgen\ emv imrpgnqy 
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EYKFGFEJjP \ QGPLEHSFKGKYGCVDY\ WVK\ AFLDRPSQPTQBTKK 
NFEVVDLVDVOTP\DLMAPVS/ALKKEKKVSCMVIPDG\RVSVSARI 
DRKGFCEGDEISIHADFENTCSRIWPKAAIVARHTYLANGQTKVLT 
yiUjboVKbWnllobiLAfaWKUltbLRVQKIRPS 

IYVSVPG\SKKVILDLPL/V*LGSRSGLSSRTS\SMGSRTS/SLRMS 
WVDLNIP\DTPEVPS\CILGCSFPEGSTVWESPTTP\LLDDMDGSQN 
SP\IFMYAPEFKFMP\PRTYTEVDP\CIFNHQCAVSMWKKRSSFTLL 

1481 

B 

21 

410 

MPS KVRCX S VQVFDAMKTATAVAHCKRGNGLI KLLEPVLLLGKERFA 
GVDTRVRVKGGGHVAQI YAI RQS I S KALVAYYQKYVDEASKKE I KDI 
LIQYDRTLIiVADPRRCESKKFGGPGARARYQKSYR* 

14 82 




mdc /KnPT.n^vnvT?f2PirFrT , 7iT7i\77AU / r , iror r Ktr , T nnmpDm t?m / ttto 
riczs/ i^urDyavy vruKJUVlRl/iVArl/ l-iS_K\jINJoij X 1\. VrvUixIrljc*!*! / xi&jr 

RTLQYKVLGSGTGVSGWRTLGDRDWALESWGAGISNGMFRSCVGCR 
QWAAGASSASRQERFAGVDIRVRVKGGGPWPRFMSKKFGGPGARARY 
QKSTDKPIVTQNSLV 

1483 

A 

172 

661 

LLE PVLLLGKERFAGVD I RVRVKGGGHVAQ I YGE S QELGAWRRWIiWE 

VCGDQVKDSV*LSKSSSLLFLPDVDEASKKEIKDILI\QYDRTLLVA 
DPRRCESKKFGGPGARARYQKSYR 

1484 

a 

1 

X 


A\NGL IQG * TGGPLEMI EARARLQ YK\ LLEP\ VLLLGK\ ERFAG\ VD 
IPCPV*KGGWSTWPQIYAIRQS\lSQKPLVAYYPEM*VSMGPSHE/Y 
VDEAFQRREIKDILHPSY\DRNPAGLAGPFVRCE\SKKF\GGPGA\R 
ARYQKSYR 

140c 

p 

X£, / 

1 O D 

MAAoaW ir Ji V JjU X X hjh*XLdi SKr LEG VRNVAS VCLQ I GYPTXAS V PHS I 

INGYKRVLALSVETDYTFPLAEKVK^ 

SXCCCSPS* 

1486 

A 

1 

689 

KCFI / VGADNVASKQMQQ IRMS FRGKAVC * WGKNTMMRKPIRGHLEN 
NPALEKLLPHIRGNVGFVFTKEDLTEIRDMLLANKVPAAJU^AGAIAP 
CEVTVP AQNTGLGPEKTS F FQAJLG I TTKI S RGT I EI LG VRNVAS VCL 
QIGYPTVAS VPHS I INGYKRVLALSV15TDYTFPLAEKVKAFLiADPSA 
F/VA7VAP/VAAATTAAPRAAAAPAKVEAKEESEESDEDMGFGIiFD 

140/ 

7V 

M. 

OO 

1104 

Hi A VMFKkJJKAl WKbNYFLiKI IQLLDDYPKCFI VGADNVGSKQMQQI 
RIVPWGEACVLMGQKTMNGPGPSEGHLENNPASEELIiPH*\RGHLGF 

PI?T\ PPn\ T.TPT \ DTIMT T 7\ * /"* /"•/""'i 7\ A 7\ id r~*\j/T\T nnnmTfmrnn \ /*wnry« 
1 \ rCr*U \ Jj 1 E» J. \KlJrUjJjAy *^UyAAAKv^CQljPPCErv 1 V PA \QNTG 

LG\PEKTSFFPGL*VSPTKNLPGGTH*KS*SYVQL\IKT\GDKMGSQ 

TKAKAAEKMLKNIiPPS PFG AGQPKQGV\ RKNGKHPTNPE SA* I S TRG 

KLCHSRF\ LGGCPANVAKCLSCKIGYP\TVASSTPI PI I \NGYKRVP 

GPCLWTPDYTFP\I^KVTO^L\ADPSC^^ 

LLQPPAKVEAKEESEESDEDMGFG\LFD 

1488 

B 

98 

264 

XQVVCKKYRGFTI PEAFRGVHRYLSNAYAREEFASTCPDDEB IEIiAY 
EQVAKALK* 

1400 

TV 


X£ X / 

Urrb Ir VFAPFbofKiXylli* 1jVPDATMAEEQPQV\EIjFVKAGSIX3AKIG 

ncpfsqrlfmvlwlkgvtfnvttvdtkrrtetvqklcpggqlpflly 
gtevhp\dttkieefleavl\cppr\ypk\laalnpevqhswgwdif 
axfflpniqefqtpaln* qsgrrgflesp * kvldnyltx s ppspeev 
dets c * ki eg vsq\ rkf\ ldgqrrphpwldiiqtccpkvth * vqw\ c 
krk* pgns phppkafpgkchrvp * skmp yapgknspshpvpddee i e 

LRPMSKVAKALOI SPSLGLPSTPS I FSTKAPGGFHI ATPMGHTPKLA 
SGQG I LGD IEPAKGWEEGMRERNGGPGSDF 

1490 

A 

2 

746 

AVIhlANLGCWMLVLF^ATWSDLGljCKKRPKPGGWNTGGSRYPGQGSP 

GG1TOYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQW 

NKPSKPKTNMKHI^GAAAAGAVVGGLGGYMLGSAMSRPI IHFGSDYE 

DRYYREimHRYPNQVYYRPiyroEYSNQimFVHDC^ITIKQHa^ 

KGENFTETDVKMMERVVEQMCITQYERESQAYYQRGSSMVLFSSPPV 

ILLISFIiIFIiIVG 
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1491 

A 

217 

1007 

LNHNRLAVIMANLGCWMLVLFVATWSDLGLC 

PGQGSPGGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPH 
CjOCj W(jQC?GGTH 5 QWNKPS KPKTNMKHMAGAATAGAV*GGLCSYS\LiG 
SAMSRPIIHFGSDYEDRYYRE\NMHRYPNQVYYRPMDEYSNQNNFVH 
DCVNITIKQHTVTTTTKGENFTETDVKMMERWEQMCITQYERESQA 
YYQRGSSMGLFSSPPVILLISFLIFLIVG 

1492 

A 

32 

487 

SRRHGSSLWGKVNVEDAGGETLGRLLWYPWTQRFFDSFGNLSSASA 
XMOW rKv aAHVj lUvvi* TbliGDAI KHLiDDIjKGTFAQLSELHCDKLHVDP 
ENFKLLGNVLVTVLAIHFGKEFTPEVQASW/ QEDGDWSGQCPVLQI P 
LSSLPMMQSFQG 

1493 

A 

41 

597 

APS PRRP WV I S QRRTKAT I \ TS L WGK\ VNVE \ D AGGET \ LGRL \ I» W 
\wi \yKr rU\brL>\NljSSASAIHGQPPKVQGTWSKKVLTFLGEMP 
* KHL\ DDLKGHLLPKPEVNI*HC\DKPAMWDP \ ENFKAPGEMLLVTRF 
WAI PFS AKEFHPWRLAGLPGQKDG * LGVGQCPCS FQIPLKPLGP * IQ 
\SFQG 

1494 

A 

1 

478 

DTRFLERLRLS I S SLVPDA\MGHFTEED\KATI \TSLWGK\ VNVE\D 
iWjKjc* 1 \iajkjj \ Jjv v x froufKbr r JJort^WljSbASA\IHGQTPKVKATR 
AKKVLTSLGKMPIKHLGLIFKGTFCPSLS*TCTC*QACMWDP*GTFK 
Ij PGEMLL VTRFWA I PFS AKE FHP 

1495 

A 

3 

370 

SVC\mAHESVVTCSEDFSLPAYMDRRDHPLPEVAHVKHLSASQKALKE 
Js-Ci ivrt.o rvooijo i'llj oi\.vjcj1jx.K-L iv±' ivxsor AJirlJJKvjroNiiWKvL VVCaGAMFFI 
GFTALVIMWQKHYGLXASKWDYEKNEWKK 

1496 

A 

24 


KA VAA<j£>VjOKr9JU 1 i VI* b JjVOKRAI STS VCVRAHES WKSEDFSLPA 
Y24DRRDHPLPEVAHVKHLSASQKALKEKEKASWSSLSITOEKVECGY 

1497 

A 

197 

745 

IASEQFSTSWCTSSMKVFVTCSEDFSLPT\YMESAVTHPLAGRWPHV 
i\Afv~ fc> Ay rKK F Jj ivhi isJk. aALi JjG AAFSMG*GKFELIjFAXjKFKE BALLED 
*TRGLRTELGKTGLFGPVPLFPSIGFSPRLVIHVGQKHYVLTAPFP\ 
QS F * TKS WVGPSRTKRMLGQ* R* TP I QG\ LAS K\ WDYEKNE\ WKK 

1498 

C 

78 

281 

MELSNNHQLSMSLVELSMSLVWDANLLGWGKSCELTKPSWSLVRSTS 
RHSRKKGSSWHLPAKLCSTC* 

1499 

A 

302 

688 | 

TI AI FSCWRRLRFSGPKTS I ICSPMTSLKTSDMVPSLFSSPGDRVA 
IESGAPRENDEFCKMGRYNLSPSIFFCATPPDDGNLCRFPEIVKLSL 
EEAKCLPFCVAPHGPSRGALGCRSAAFPLTQPRC 

1500 

A ! 

1 

1622 

MSKPKCLVILVGIQKSSQMPQFPPPAGSERRRGIRQWEMGWGGEMG 
VWWPPGVQGSPPQAHRRLGSREQLFAPPLKEKSHIPPWGYSRENYPI 
PEPGPNGRPVTEVWGSPGPTVQWGWEGLHEALSPALINQAQKDPEVR 
LGAMWVSVT)THPPPLTPRQSGPLSALEELGVSFPTLSYLTPEPQLL 
GEHQ I KGS F PGRS TALLLKEVLLRMHS VG I CG SDVH \ YWE YLS / RFG 
NFIVKKPMVLGHEASGTVRKSGIIGKSTLKPG\DRVAIEPGCSPEEN 
* WNS CQDG / RRYNIiS PS I FFCATPPDDGNLCRFYKHNAAFC YKLPDN 
VTFEEGAL I E PLS VG I HACRRGG VTLG\ NKVL VCG S WANRGWVTLLV 
AKAMGAAQVVVTDLS ATRLS KAKE I GADLVLQ I S KES PQE I ARKVEG 
LLGCKPEVTI ECTGAEAS IQAGI YATRSGGTLVLVGLGS EMTTVPLL 
\ HAAJ. RE VX) I KGVFRYCNTW \ PVAI SMLAS KS V/ DMS KPIjVTHRVSS 
WRKVXEAFETFKKGLGLKIMLKCDPSDQNL 

1501 

A 

1 

804 

MPLRAFNPVS S FRWARGMT I VAALMTVFF I MQIiVGQVPAALWVI FGE 
DRFRWS ATM IGLSLAVFG I IiHALAQAFVTGPATKRFGEKQAI XAGMA 
ADALGYVLLAFATRGWKAFPIMILLASGGIGMPALQAMLSRQEQEYS 
CVVKMPSGEFARICRDLSHIGDAVVISCAKDGVKFSASGEIjGNGNIK 
LSQTSNVDKEEEAVT I EMNEPVQLTFALR YLNFFTKATPLS STVTLS 
MSADVPLVVEYKIADMGHLKYYIiAPKIEDEEGS 

1502 

A 

178 

1093 

TFLLPACLLAALLPLRHHVRGRAWVQGSILNEGVG*ALKD\LINE\A 
CWGY*APAGVNLQSMGHRPTVSL\VQLTLRV*GASTPYRC\DRNIiGH 
GR* NLTS \ MSKI LKMAAGNED/ 1 SLTLRAEDNAGYLGR* YFEGTKPG 
RKFSDYEMKLMDLDVEQLGIPEQE\YSCVVKMPSGEYA\RICRESQP 
ILGDAW\ISCA\KDGSENFSASGELGNET\IKLSQTSNVDKEEEA\ 
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VP I KMNEP\ VQPNFCH * G YLNFFTK\ ATPLSSTVDTPVCS ADGTPLV 
GRS I KI AGYGDHLKYLLGLPKDPRI EEGSLGHS 

1503 

A 

32 

487 

S RRHG S S LWGKVNVEDAGGETLGRLL WY P WTQRF FDS FGNLS S AS A 
IMGNPKVKAHGKKVLTSLGDAI KHLDDLKGTFAQLS ELHCDKLHVDP 
ENFKLLGNVLVTVLAIHFGKEFTPEVQASW/QEDGDWSGQCPVLQIP 
LSSLPMMQSFQG 

1504 

A 

1 

591 

NFALEAKNS ARAI S SL VPDAHGVI SQRRTKATI \ TSLWGK\ VNVE\ D 
AGGET\LGRLLVVYPWTQRFFDQLLANLSSASAHHGQPPKVQGHMAK 
KVLTFLGEMPIKHLDDLKGHLLPKPEVNCTVDKPAMWDPENFKAPGE 
MLliVT/LFWAIPFSGKEFTP*RLQASWAERWVTWS\GQCPCSFQIPL 
KPLGP * I Q\ S FQG 

1505 

A 

49 

273 

IRMTEKAPEPHVEEDDDDELDSKLNYKPPPQKSLKELQEMDKDDESL 
IKYKKTLLGDGPWTGMCVQRTGWVEDR 

1506 

A 

81 

720 

LFKAPEPHVEEDD\DDELDSKLNYKPPPQKSLKELQEMDKDDESLIK 
YKKT\LLGDGPVVTDP\KAPNV\VVTRLTL\VCESAPG\PITMDLTW 
KI WKAIiKKGNHIiVLKGRFWN I RS S KFHF PKLNRG * LLF RA* NYVQHT 
YRTG\VKVDKATF\MV\GSYGPRPEEYEF/LSLPVEEASQRAVfLARR 
HVTTKSPFFTDDDKQDHLSW\ EWNL\ S IKKEWTE 

1507 

A 

5 

290 

FNLTHIESRPSRLKK/ DE YE/ FFTHLDKRSLPALTNI I KILRHDIGA 
TVHELSRDKXKDTVPWFPRTIQELDRFANQILSYGAELDADHPVSPW 
PVG 

1508 

A 

68 

312 

PLQSPGVLSILNPFSLLVPCLPSSLLGSYSTLNPYHLLVSCLYPQLL 
PSILTQQPEGVLGNTSEIMTIYLPIKIVTSIELL 

1509 

A 

317 

917 

TSSPPSSLCFLSFSDICHELLGHVPLFSDRSFAQFSQEIGLASLGAP 
DE Y I EKLATI YWFTVEFGLCKQGDS I KAYGAGLLS S FGELQYCLSEK 
PKLLPLELEKTG I QNYTVTE FQPLYYVAES INDAKEKVGNS AAT I PR 
PFS VRYDPYTQR I EGLDNTQQAHDLG * FHLTVE I G I LC S ALQKNKVK 
AMDRMWCQAVE 

1510 

A 

389 

1881 

nlqphvlfanlpvpealksqrphsrgasmstavlenpglgrklsdfg 
q\ etsy i \ edncnqkwvp isldpphlkerklgalgpkycalf\ eend 
vnlt\hies\rpsrlk\kdeygffppfgikrslp\altniikilrhd 
igatvhelsrdkkjodtvpw\fprtiq\eldrfanqilsygsgnwda\ 
dhpgfkdpvyrarrkqfadiaynyrhgqpiprveymeeek3ctwgtvf 
ktlkslykthacyeynhi fp\ llekycashedni pql\ edvsqfl\ q 

TCTGFRLRPVAGLLS SRDFLGDLAFRVFHCTQ YI RHGS KPMYTPEPD 
ICHELLGHVPLFSDRSFAQFSQEIGLASLGAPDESIEKLAPIYWFTV 
EFGLCKQGDS I KAYGAGLLS S FGE FQ YCLSEKPKLLPL / ESLEKTAI 

qnytvtefqp\lyylae\sfndaqgei*gtfaati\prpfsv\rhdp 
htpqriggswdntqql\kila\dsi*q*igipfavalqnik 

1511 

A 

1 

191 

MQK* ITAWAPAPMKI KI IAS PERKYS VWI GGS I WPQLS T/ FQQMW I S 
KQEYDESGPS I VHRKCF 

1512 

A 

1 

360 

SGACPAFLVDRNLRHHETTFNLIMKCDVDIRKI>LYANTVLSGG\TTM 
YPG I ADRMQKE I TAL/ APPS TLRFRF I AP P / ERRKYS VW IGG \ S I LA 
SLSTFQQ\MCLGKQEYDESGPSI\VQRKCF 

1513 

A 

13 

1277 

INPPPLSRRCQLSHSVLPPLRRRVSLPVAMEEEIAALVIDNGSGMCK 
AGFAGDDAPRAVFPS I VGRPRHQGVMVGHGPRTDS YVGDEA/ QRS KR 
GILTLKYP IEHGIVTNWDDME\ KI WHHTFYNELRV\ APEKHPVL\ LT 
EAPLNPKANREKMTQ/ILCFETFNTPGHVPWPIQAVLSL*SLWAQPI 
G I VMDSCj \ DOVTHTV \ P I LRGATTLLHAI \ LRLGPGLARDLTD YLMK 
ILT\ERGYS FTTHGPSGKTFRNI KGEACATS PLDFEQ\ EMGT\ AASS 
SSLEKSYELPDGQVITIGNERFRCPEALFQPSFLGMESCGIHETTFN 
SIMKCDVDIRKDLYANT\ALSGGTTMDPG\IADKIAEGRSTALAAPA 
P*KIR\IIAPP\ERK\YSVWIGGSILASLSTFPARFWI\SKQE\YDE 
SGPSI\VHRKCF 

1514 

B 

93 

366 

XTSVVRPFAKLVRPPVQVYG IEGRYATALYSVl^NPYVKR S IKVKSLN 
DITAKERFSPLTTNLINLLAENGRLSNTQGVVSAFSTMMSVHRGE* 
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1515 

A 

1 

785 

FRRQRARAPLRVPLFP3JGFDLQPPGRRWAAPAV\SGLS\RKVRCFST 
S W\ RHFAKLVGPS \ VQVYGI EGR YAT\ AI/YS AAS KQNKLB \ QLE KE 
LIiRVAQNPEGNPKVAAS\VLNP\YVKR\SIKSEKALN\DITSKKRRF 
SPPSTTQPWIKFALPE\NGSD*SKYPQGSPFPALFPTHDEVSHPR\G 
GYPCTVDLWHLLLEGSQTPPGI*KLSLKSLP*VKGQVLKLEAKTDPS 
I \ LGGMI VR IGE \ KYVDMS VKTKIQKLG\ RAMRE I V 

1516 

A 

1 

2199 

MAQFSIVGMCLGLFNQSPTDGSGVQLFVIMNQVMAANLAETHAAIjCP 

TLSSGTSARFRGKNQFSGGLPQITLSPUVQPCGRLAAMYSNVIGTVT 

SGKSLPSTSPLVPLLLSGFAPLFLKGLQNWTQTTQEGKSPSAAPASS 

QPQACPTDHVRESALGACLSLPGEDGKIQKVYLLSLLLIGFWDCVTC 

HGSPVDICTAKPRDIPMNPMCIYRSPEKKATEDEGSEQKIPEATNRR 

VWELSKANSRFATTFYQHLADSKNDNDNIFLSPLSISTAFAMTKLGA 

CNDTLQQLMEVFKFDT I S EKTSDQ IHFFFAKLNCRL YRKANKS SKLV 

SANRLFGDKSLTFNETYQDISELVYGAKLQPLDFKENAEQSRAAINK 

WVSNKTEGR ITDVI P S EA I NELTVLVLVNT I YFKVLRMALERPQGLP 

LALQLTPFFFKWRDRSPERANGLPKATQGLWKSKFSPfiNTRKELFYK 

ADGESCSASMMYQEGKFRYRRVAEGTQVLELPFKGDDITMVLILPKP 

EKSIJUCVEKELTPEVLQEWLDELEEMMLVVHMPRFRIEDGFSLKEQIj 

QDMGLVDLFS PEKS KLPG I VAEGRDDL YVSDAFHKAFLEVNEEGSEA 

AASTAWIAGRSLNPNRVTFKANRPFLVFIREVPLNTI IFMGRVANP 

(TVT^SEAIjAVLVVNLTTLMKRTHGESLDFME 

CQRTKTGQGLTVRMNIDRFEELAGYGGD 

1517 

A 

9 

1618 

PALCPTLSSGTSARFRGKNQFSGGLPQITLSPLAQPCGRtiAAMYSNV 
IGTVTSGKRKVYLLSLLLD5FGDCVTCHG\ S PVD I CTAKPRDI PMNP 
M\CIYRSPEKKATEDEGS\EQKIPEATNI^R\RVW\ELSKAN\SRFCIi 

pls ys apgqns kd * h * ltffcs pls i fqgfll wtkv\ gacndtlqql 
mevfkf\dtis\ektf*srshfffa\kln\crl\yrkankssklvsa 
nrlfgdksltfnetyq\diselvygaklrpl\dfk\ enaeq\sr\aa 
inkw\vsnktegrihrcsfpsgrpfneltvlgggfntiyfqgacwk\ 
skfspentrkelfykadg\esc\sasmdvtregkfrysgawlegt/q 

VLWC PFKGDD I T\ MVIj I LP KP * EGAWAKVEK\ ELTPEVLAKSGWD * 

fwremmlvvhmprf\ri \edglqv*reql\ qrhgplsdlfsp* kspk 
l/lpgivaegrddlyvsdaf\hkaflevneegseaaasta\vviagr 

oijiMJfWK v i r\j\j\j\jijr ir\jt ijljLKii V jfLt \NTI x LHG AELAN PCV 

1518 

A 

2 

363 

ELDTLCDL YE P*PSPSIIF INTRRK/ VDWLTEKMHARD FTVS AMHGD 
MDQKERDVIMREFRSGSSRVIjITTDL\RIGRGGRFGRKGrVAINMVTE 
EDKRTLRDIETFYNTSIEEMPLNVADLI 

1519 

A 

201 

64 8 

GPCGHGRVFPLPSI^UVHMDA*GLLLRDRVSSVHMLKRIiSFIjT*ARGI 
DVQQVS LVINYDLPTNRENYIHR * A* IWNTPLPLHTWPSLGLKLLIF 
LIPFLVFQ\IGRGGRFGRKGVAINMVTEEDKRTLRDIETFYNTSIEE 
MPLNVADLI 

1520 

A 

49 

1720 

GSTISSASQDSRSRDNGLDGIEPEGVIESNWNEIVDSFDDMNLSESD 
LRGIYAYGFEKPSAIQQRAILPCIKGETSQSQKTLWTVPDLGRVASG 
W*CPSHISQGQSNLLVHPSLAFDPCPCLVHAIiDT*VSFKEWIi*PAY 
ICIYSHVSSPAWC^II^ELITGPCF*SGSHCVTVIjIGKWCEP*NAW 
FIGCLLTSLT*DLNI PKGMLFTTFNS * F I CLG YDV I AQAQSGTGKTA 
TFAISI LQQ I Q \ KWMALG\DYMG\AS \ CHAC I RGAPTCVAE VQKL\ 
QMEAPH IHRGVP PGRVF * YALPEDTLS PKYI \ KM FVLDEA\ DEM\ LS 
RGFKGQI YGHI QKAQAAPPRVVLLSATMPFD\ VLEVTK3CFNIRGPPFR 

ilvqkgelt\ legirq\ fyi\nvepee \ FNLDTLCDLYENLDHHPRP 
V I FHQPPGGKVDWPHPRRMHAADF/ TLYSAMHWRFWTQKERRT* L*R 
EF\RSWLLARI FDTQLDLLGQRALMCQQV\ SLVIQTYDPFPPTRGKL, 
LIHR\IGSRVDRFGRKGVPINMLTE/EKTKRNLEDIETF\YNTSIE\ 
EMPLNVA 

1521 

A 

3 

607 

FCPRGQEFGEGNKLLSPRRPWVISQRRTKATI\TSLWGK\VKCGKNA 
GKEETPGKGSLVVL/HPWTPRGSFEQLWQTCPSALCPSMGNPQSQGT 
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MAKKVLTSLGRCP * STLD\DLKGHLLPKPEVNLHLLTSIiHVGS *RTF 
KLPGEMLLVT/LFWAI PFSAKEFHPLKVAGFPGQKDG * LGVGQCPCS 
FQI PLKPXiG P * I Q \ S FQG 

1522 

A 

23 

437 

KTPGKGSLWL/HPWT\QRFFDSFG\NLiS\SASAHHGQPPKSKAHGK 
KVLTSLGDA\IKH\LDDLKGTFAQLSELHCDKLHVDPENFKLLGNVL 
VTVIiAIHFGKEFTPEVQASWQKMVTCSGQCPVLQIPLSLLPMIQSFQ 
G 

1523 

A 

23 

249 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQR 
FFD S FGNLS S AS A I MGNPKVKAHGKKVIjTS LGDA I KHL \ DDLKGTFA 
QLE * TCPLPLPS WATPKSRHMARRC 

1524 

A 

46 

379 

SQTPMGHFTEED\KATI\TSLWGKGEMWKKCWKEKTPGKGSLWL/H 
PWTPRGSF\DSFGKPVPLPSAHPWATPKVKAPWPRRCLTSLGEMPIK 
\HLGLIFKGTFCPSLK*TCTC 

1525 

A 

21 

457 

NPRVRGALTMELSESVQKGFQMLADPRSFDSNAFTLLLRAAFQSLLD 
AQADEAVI*\ DNKNS LE I LLG S I GR S LPH I TDVS WRLE YQ I KTNQLHR 
MYRPAYLVTLSVQNTDSPSYPEISSSCSMEQLQDLGGKLKDASKSLG 
KSTQL 

1526 

A 

1 

455 

RKGSKMLADPASFDSNAFTFSSGAAFQSLLNASPDEPRVSTYLEDCN 
FDRERIELFCTEYQNNKNSLEILLGSIGRSLPHITDVSWRLEYQIKT 
NQLHRMYRPAYLVTLSVQNTDPPSYPEISFSCSMEQLQDLVGKLKDA 
SKSLERATQL 

1527 

A 

3 

470 

VSRGVRVEGHGAIiTMELSESVQKGFQMLADPRSFDSNAFTLLLRAAF 
QSLLDAQADEAVLGKPLGSARSWIRRGGGALGEEEPAEREGESPGKR 
KSPGFALRSGRSVGDVALPGLGADSEHCFGPTWHLHRANSTFLARFP 
SQSLCSFVKPVFKS 

1528 

A 

50 

895 

THASIX5ALTMELSESVQKGFQMLADPRSFDSNAFTLLLRAAFQSLLD 
AQAD\EAVLDHP\DLKHIDP\VVIiKHC\HAAA\ATYILEAGKHRA\D 
KSTL\STYIi\EDCKILTEKRIELFFAREYQ\NNK\NSLEl\LLGKY* 
GRSLPSYNRVFSWALWIIQVKDQSTFHRM\YRP\AYLGDLKWQNTG 
IPPS\YPRELVFSCQPWNQL\QDL\VGETLKDASKKPWKRATSWTL 
GKVl^SPPSSRRKTQKPPLPFSWNHRLCRAGCPFSVEKNFSLLNLYP 
FIHFGHFKNV 

1529 

A 

1404 

1586 

ENESRFSDRNQASAGLGYLSDSL*QWIVGNGHATDLWQNCSTSSSGN 
VHHCFSSSPNGSG 

1530 

A 

187 

701 

ABLAARMLLLLLS 1 I VLHVAALVLLFVSTIVSQWIVGNGHATDLWQN 
CSTSSSGN\VHHCFSSSPNEW\LQSCSRGTMDPVDSSFSILSIiFLFF 
CQLFTLTKGG\RFYITGIFQII^GLCV\MSAAAIYTVRHPEW\NLNS 
GY\S*RFA* I\LAWAFPL\ALLSGVIYVILRKRE 

1531 

A 

1 

485 

PPKLYNKVEPLRKKARAPEAQFEMPYVVRLHNSTQLSAPKSCFTFSH 
PNRDPMIDITNRYCTLEFPVEVNTVIJiGFAGYFETVLYQDITLSIRPE 
THSPGMFSWFPILFPIKQPITAAHIPLASSPACQVSRAIiEAASGSAP 
WQKGS SHQSMKTSGQGVRN 

1532 

A 

2 

460 

SARRKMAAMAVGGAGGSRVSSGRDLNCVPEIADTLGAVAKQGFDFLC 
MPVFHPRFKREFIQEPAKNRPGPQTRSDLLLSGRAFLLPLNQEDNTN 
IJU^VLTNHIHTGHHSSMFWMRVPLVAPEDLRDDIIENAPTTHTEEYS 
GEEKTWMWWHN 

1533 

B 

1 

2784 

MAAMAVGGAGGSRVSSGRDLNCVPEIADTIiGAVAKQGFDFLCMPVFH 
PRFKREFIQEPAKNRPGPQTRSDLLLSGRALEIGADLPSNHVIDRWIj 
GEPIKAAILPTSIFLTOKKGFPVLSKMHQRLIFRIiLKIiEyQFIITGT 
NHHSEKEFCSYLQYLEYLSQNRPPPNAYELFAKGYEDYXQSPIiQPIiM 
DNLESQTYEVFEKDPIKYSQYQQAIYKCLLDRVPEEEKDTNVQVLMV 
LGAGRGPLVNASLRAAKQADRRIKLYAVEKNPNAWTL.ENWQFEEWG 
SQVTWSSDMREWVAPEK7UDIIVSELLGSFADNELSPECLDGAQHFL 
KDDGVS I PGEYTS FLAP ISSS KL YNEVRACREKDRD PEAQFBMP YW 
RLHNFHQIiSAPQPCFTFSHPNRDPMIDNNRYCTLEFPVEVNTVLHGF 
AG Y FETVL YQD I TIjNSRNYP S S TPHNI LKGTERTGTPCGDPE LDEQQ 
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RKEVKKDYPSNTTSSTSNSGNETSGSSTIGETSNRSRDRDRYRRRNS 
RSRSPGRQCRHRSRSWDRRHGSESRSKDHRREDRVHYRSPPLATGEP 
VI)NLSPEERDARWFCMQLAARIRPRDLEDFFSAVGKVRDVRIISDR 
NSRRSKGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRL 
AAMANNLQKGNGGPMRLYVGSLHFNITEDMLRGIFEPFGKIDNIVLM 
kDSDTGRSKGYGFITFSDSECARRALEQIiNGFELAGRPMRVGHVTER 
LDGGTDITFPDGDQELDLGSAGGRFQLMAKLAEGAGIQLPSTAAAAA 
AAAAQAAAIiQLNGAVPLG/UjNPAALTGKSVAQYTASLCIjWILPLHIY 
SSMAPFLHFVDQAILRAWTLSLRKLGPPLRYQEKLLPMVPLEMDSAE 
UoxiWJjivrC x r»o 1 lr /\J_dM 1 l_ J? oMLIVoisi irxjzrr 1 r'i.r' w 

1534 

A 

2 

608 

FGPRDFLCMPVFHPRFKREFIQEPAKNRPGPQTRSDLLLSGRDWNTL 
IVGKLSPWIRPDSKVEKIRRNSEALEVQFIITGTNHHSEKEFCSYLQ 
YLEYLSQNRPPPNAYELFAKGYEDYLQSPLQPIiMDNIjESQTYEVFEK 
riDT Trvcnvnna t wtt t 'H'd\7'dt?ci7 vt^ttvttj^ttt matt pbpdpdt VKFRCT. 

RAAKQADRR I KLYA 

1535 

A 

1 

207 

PETHSPGMFSWFPILSAIKQPITVREGQTICVRFWRCSNSKKVWYEW 
AVTAPVCSAIHNPTGRSYTIGL 

1536 

A 

232 

755 

LS CCADDGVS I PGEYTS FLAP \ I FS PKLYNKVRACRKKARDLKAQFE 
MP I WRLHNSNQLS APQPCS TFSHPNRDPM IDNN\ R YCTLEFPVEVN 
T\VLQCFAGYFETVLYQDITLSIRPETHSPGMFSWFPILFPIKQPIT 
VREGQTICVRFWRCSNSKKGSSHQSMKTSGQGVRN 

1537 

A 

35 

2271 

lcdwllvsrnpgvdsarrkmaamavggaggsrvssgrdlncvpeiad 
tlgavakqgfdflcmpvfhprfkrefiqepaknrpgpqtrsdlllsg 
rdw\ntlivgk\lspwdfvpd\skv\ekirrnseggpciiqelnfg\a 
ylglpafllplnqedntnlarvltnXqihtghdfymfwrXrihlkkp 
edl\rddiienaptthteeysgeektwmwwhnfrtlcdyskriava1# 
eigadlpsnhvid\ rwlgep i \ kaailpts i fltnkkgfpvls kmhq 
rlifrliiklevqfiitgtnhhsekefcsylqyleylsqn\rpppnay 
elfakgyedylqsplqpiimdniiesqtyevfekdpikysqyqqaiykc 
lij5rvpeeekdtnvqvliwlgagrgplvnaslraakqadrgiklyav 

EKNPNAVVTLENWQFEEWG\SQV\TVVSSDMREWVAPEKADIIVS\E 
LLGIiIC*PIEI*SP*VPWIGAQHFP*KMIGVKHPPGSYTSFlAPISSS 

tw~t matrix m tv ^t^tt*tti \r>r>y\w *rv f%nr»*«n\w mi it t t*vt nit /^t n/%rtnruni90\ t TT"> 

KLYNEVT^CREKDRDPEAQFEMPx \hr 
NRD\ PM\ IDNNRY\ CTLG\ FPVEVNTVLHGFAGYFETVLYQDITIiS I 
RPE IHb rbMr bW \ r JflFLiPx *C*bP±j 1 VKi^KAJvPFL. V \Rr W \Kv-Kyr lr 
RKVWVWSGGC*QAPVCSCLIQKPPKGPPQYTHWPLLSPCRAPSVPEA 

GGPRDYN 

1538 

A 

137 

303 

LSGGLELSGFDRAGPETLPPLPSVSVREAGSCHSHSGSEWLELLVCV 
QVQDTTNW 

1539 

A 

122 

385 

YPALEHILKAQAIQSRCGCDSCLPPSAPWDHPGPTTP\SPGRRAAAD 
PWHLSPIDGREHLR*VPVLPVTPPSPTLGHWrDPSPGVGG 

1540 

B 

277 

480 

GTGHFYGRTPSDTNCQEQYTHRKLCQIKSKADLVIiMKNSKSLTRVIR 
NILAPQDQNHQQNPLNSQFLQ* 

1541 

A 

32 

1386 

VliLGPKAERTNSRRNYQRRD YFS APR S I TSNQS AKS S S S RGVY S AYQ 
APDIHECCHFRSASFFLDKMATPAVPVSAPPATPTPVPAAVPASAPA 
SVPAPTPAPAAAPVPAAAPASSSDPAAASATTAAPGQTP\ASAQAPA 
QTPAPALPGPALPGPFPGGRW\RLHP\VIIiASIVD\SYERRNEGAC 

xro OVj L J-iV VjtINXj VUI\. \xlo VJ2» \ V XiM \ V-.r* o V ±r JrLW JC» o E*UJC V \ /-v V uri H r >U\ \ 

NMYETGIKKVS\PNKLIIjG\WYATGHDITEHSVL\IHEY^SREAPNP 
IHLTV\ DTSLPGTGRMS I KA\ YVS T\ LMG I PLGRT / LWG VMFTPLTV 
\ KY \ A YYDTER I RRLTL IMK\ TC F * PPTR V I WTS QVDLQQEGGG I QL 
RNPGMPLSTSVAlWPEGCTCLGKVSADlTriRKVGHFLMSIi\VN\QVP 
E1TOKPMTFE1^I^SNI1TOLF\WVTYLANLTQSR\ ialneelvnl, 

1542 

A 

1 

3399 

MLVGQGAGPIiGPAWTAAWLLLSGVGPAHGSEDIWGCGGFVKSDV 
EINYSLIEIKLYTKHGTLKYQTDCAPNNGYFMIPLiT)KI)HIJ^SRI)V 
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FVVTSVAVRYSHPGDFILKIEPPIiGWSFEPTTVELHVDGVSDICTKG 
GDINFVFTGFSVNGKVLSKGQPLGPAGVQVSLRNTGTEAKIQSTVTQ 
PGGNSCLSPLSNPVECSVTGPCCATRIGFMPEILEIKGDVSTPLSDQ 
HMEHPSWHTASTTVRVTNSNANAASPLIVAGYNVSGSVRSDGEPMKG 
VKFLLFSSLVTKEPQDESLVYLCYTVSREDGSFSFYSLPSGGYTVIP 
FYRGERITFDVAPSRLDFTVEHDSLKIEPVFHVMGFSVTGRVLNGPE 
GDGVPEAWTLNNQIKVKTKADGSFRLENITTGTYTIHAQKEHLYFE 
TVTIKIAPNTPQLADIVATGFSVCGQISIIRFPDTVKQMNKYKVVLS 
SQDKDKSLVTVETDAHGSFCFKANPGTYKVQVMVPEAETRAGLTLKP 
QTFPLTVTDRPVMDVAFVQFLASVSGKVSCLDTCGDLLVTLQSLSRQ 
GEKRSLQLSGKVNAMTFTFDNVLPGKYKI S IMHEDWCWKNKSLEVEV 
LEDDVSAVEFRQTGYMLRCSLSHAITLEFYQDGNGRENVGIYNLSKG 
VNRFCLSKPGVYKVTPRSCHRFEQAFYTYDTSSPSILTIiTAIRHHVI, 
GTITTDKMI^VTVTIKSSIDSEPALVLGPLKSVQELRREQQLAEIEA 
RRQEREKNGKEEGEERMTKPPVQEMVDELQGPFSYDFSYWARVLCFV 
GTGPAJCbKY INY FKSGEKITVTPSS KELLFYPPSMEAVVSGESCPGK 
LIE IHGKAGLFLEGQ IHPELEGVE I VI SEKGAS S PL I TVFTDDKGAY 
SVGPLHSDLEYTVTSQKEGYVLTAVEGTIGDFKAYALAGSPGQYYFK 
PMMKE FRFE PS S QM I EVQEGQNLKIT I TGYRTAYS CY GTVS SLNGE P 
EQGVAMEAVGQNDCS I YGEDTVTDEEGKFRLRGLLVGNND IDDVNI I 
VFRQINQFDLSGNVITSSEYLPTLWVKLYKSENLDNPIQTVSLGQSL 
FFHFPPLLRDGENYWLLDSTLPRSQYDYILPQVSFTAVGYHKHFTL 
IFNPT 

1543 

A 

1 

3126 

MASTTVRVTNSNANAAS PLI VAGYNVSGS VRSDGEPMKGVKFLLFS S 

LVTKEPQDESLVYLCYTVSREDGSFSFYSLPSGGYTVPVFHVMGFSV 

TGRVLNGPEGDGVPEAWTLNNQ I KVKTKADGS FRLEN I TTGTYTI H 

AQKEHLYFETVTIKIAPNTPQLADIVATGFSVCGRISIIRFPDTVKQ 

MNKYKVVLSSQDKDKSLVTV^TDAHGSFCFKAKPGTYKVQVTWPEAE 

TRAGLTLKPQTFPLTVTDRPVMDVAFVQFLASVSGKVSCLDTCGDLL 

VTLQSLSRQGEKRS LQLSGKVNAMTFTFDNVLPGKYKI S IMHEDWCW 

KNKSLEVEVLEDDVSAVEFRQTGYMLRCSLSHAITLLLRCLCSPYLL 

SLSSSLTYSGLILIVSLIRGYCLLVLIYRCFDDSFNADRALVLGFGG 

GGPWGLVACPS LS WQDS ECQAPQVCNRAQGDLVGRDVS HNLLRALDV 

GLLANLS ALAELD I SNNKI STLEEG I FANLFNLS E INLS GNPFECD C 

GLAWLPRWAEEQQVRWQPEAATCAGPGSLAGQPLLGIPLLDSGCGE 

EYVACLPDNSSGTVAAVSFSAAHEGLLQPEACSAFCFSTGQGIiAALS 

EQGWCLCGSAQPSSASFACLSLCSGPPPPPAPTCRGPTLLQHVFPAS 

PGATLVGPHGPLASGQLAAFHIAAPLPVTATRWDFGDGS PEVDAAGP 

AASHRYVLPGRYHVTAVLALGTGSALLGTDVQVEA^ 

VQSDESLDLS I QNRGGSGLEAAYS I VALGEEPARAVHPLCPSDTEI F 

CPWPUPVDT lATPTf RUT O 7\. I'M? /"W^D HUTU" 1 ATT TV M\ 7T^ CD1\X 7TVD 'CT VODU 

TRSLDMWIGFSTVQGVEVGPAPQGEAFSLESCQNWLPGEPHPATAEH 
CVRLGPTGWCNTDLCSAPHSYVCELRPGGPVQDAENLLVMVFPGLRL 
S REAFLTTAEFGTQELRRPAQLRLQVYRLLSTAGTPENG SEPESRS P 
DNRTOLAP ACM PGGR WCAG AN T CTj PT^TP PATPRT .P OPCTEVT T STPL 
EKEEEKIP 

1544 

A 

115 

348 

EFNLALVSPSHPQIKAEDDQPLPGVLLSLSGGLFRSNLIjTQDNGILT 
FSNLVTCSAIYHLPVFPEREPGCSMRDLRVA 

1545 

A. 
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Ia^PARKLPEQDIAQGSYIALPLTLLVLIaAGYNHDKLIPLLLQLTSR 

I^GVGAiGQAASDNSGPEDAJa*QAlCKQICl^ 

WRGTLGIHLFSSLPFASEILLETTATCIHY 

1546 

A 

1 

888 

MASLIMMSLPAJlRTl^WAlviAVSGN^^ 

IEQFRAQRTLIIGITOVGTVTGQREFEIYTDERRPDKTRERRIRQSAL 
LPDAGVNALSGLRRVREFVGLIRRASVASGSRYHCRMRRKRLIRPTE 
GAGIRRPDKTRKRRIRQSAPMPELPGPPVIETSTRNFKENWEENFCY 
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LEFTIAKLGTTRKRIQENCKGQPLGPAGVQVSLRNTGTEAKIQSTVT 
QPGGKPKVDKTTKMGKKQSRKTGNSKKQSTSPPPKEHSSSPATEQSW 
MEDDFDKLRKEGFR 

1547 

A 

1 

4710 

MG S TGVYKVTPRS CHRFEQAF YTYDTS S PS I LTLTAI RHHVLGTITT 
DKMMDVTVT I KS S IDS E PALVLGPLKS VQELRREQQLAE IEARRQER 
EKNGNEEGEERMTKPPVQEMVDELQGPFSYDFSYWARVIjCFVGTGPA 
KLKYINYFRSGEKITVTPSSKELLFYPPSMEAWSGESCPGKLIEIH 
GKAGLFLEGQIHPELEGVEIVISEKGASSPLITVFTDDKGAYSVGPL 
HSDLEYTVTSQKEGYVLTAVEGTIGDFKAYALAGVTLHSQDVLMLPG 
DLVGLQHDAGPGALLHC S PAPGH PG PQAPYLS ANAS S WL PHL PAQLE 
GTWACPACAIiRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGV 
SRHNLSCSFDWSPVAGLRVIYPAPRDGRLYVPTNGSASVLQVDSGA 
SATATARWPGGSVSARFENACPALVATFVPSCPWETNDTLFSVVALP 
WLGEGEHVMD VVVENS ASRANLS LR VTAEE P I CGLRATP S PEARVLQ 
GVPVVLLAGSSGYLVGFKFLESHGSDSGSANSFHRLISRNEFKTPLP 
DLTRVPR YS P WEAGSDMVFRWT INDKQS LTFQNWFNV I YQSAAVF 
KL S LTASNHVSNVTVNYN I TVE RMNRMQGLR VSTVP AVL S PNATLAL 
TAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVL 
VEHNVTHTYAAPAALGGGAVLTRQPSVLLHLCSVPHVAWBPGTLKAG 
PQVSTVLTVIJVSNAFENRT<MVPVSVCASLPSVSVC^LTGACWYPR 
VLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGRCLNCSSGSKR 
GG YTFTLTVLGRSGEEEGCAS I PLS PNRPPLGGSCRLFPLGAVHALT 
TKVHFECMGWHDAEDAGAPLVYALLLQRCRQGHCEEFCVYKGSLSGY 
GAVLPPGFRPQFEVGLAVVVQDQLGAAVVALNRS LAI TL PE PNGS AM 
GLTWLHGLTASVLPGLLRQADPQLVIEYSLALVTVIjNEYERAIiDVA 
AE PKHERQRRAQ I RKN I TETLVS LRVHTVDD I QQ I AAALAQCMRKLP 
EQDIAQGSYIALPLTLLVLLAGYNHDKLIPLLLQLTSRLQGVGALGQ 
AASDNSGPEDAKRQAKKQKTRRTLATSINTSREPSTDDQLPAHNQTM 
PQRHARRSAPPRAYDRKTRQEENPHQTRSHAAAKRRERPPHDLQKQA 
TTRL I PAG PRRRDGTS PRRTQPPPNTRRPAAAGHLAR FRRAAPGARG 
ARPPTARRGREELDPAHIYAAAPGLPTPPRAGRTPPTPERRDRNTRR 
RRTREEGEGE FRP VS FLKT I FP S QNGHDG S TDVQQRARR SNCRRQEG 
I KIVXiED I FTLWRQVETKVRAKI RKMKVTTKVNRHDKINGKRKTAKE 
HFSTEKLRPDDDICRRSTSKHYRKFTKGSTEVLSKWQPLGPAGVQVS 
LRNTGTEAKIQSTVTQPGGNSCLSPLSNPVECSVTGPCCAARIGFMP 
EILEIKGDVSTPLSDQHMEHPSWHTVPSLFPKPSSKSLKKNRQFEFA 
KTLDMHRTFTTFSCFRQQR 

154 8 

A 

37 

3683 

LGLGLSMLVGQGAG PLGPAWTAAWLLLS GVGPAHG S ED I WGCGG 
FVKSDVE I NYSL I E I KLYTKHGTLKYQTDCAPNNGYFM I PLYDKGDF 
I LKI E P PLG WS FE PTTVELHVDG VSD I CTKGGD INFVFTGFS VNGKV 
LS KGQPLGPAGV QVSLRNTGTEAKI QATATQ\ PGGKFAF FKVLPGD Y 
E I LATH PTWALKEASTTVS VTNSNANAAS PL I VAG YNVS GSVRSDGE 
PMKGVKFLLFSSLVTKED\ALGCNVSPVP\GFQPPIREVWCNLCYTV 
SREDGSFSFYSLPSGGYTVIPFYRGERITFDVAPSRLDFTVEHDSLK 
IEPWHVMGFSVTGRVI^GPEGD\GVPEAV\ VTI^NNQ IKVKTKA* WA 
HSRLENITTGTYTIHAQKEHLYF\ETVTIKIAPNTPQLADIVATGFS 
VCGRIS I IRFPDTVKQMNKYKVVLSSQDKDKSLVTVETDAHGSFCFK 
AKPGTYKVQVMVPEAETRAGLTLKPQTFPLTVTNRPMMDVAFVQFLA 
SVSGKVSCLDTCGDLLVTLQSLSRQGE\KRSLQLSGKVNAMTFTFDN 
VLPGKYKISIMHEDWCWKNKSLEVEVLEDDMSAVEFRQTGYMLRCSL 
SHA I TLE F YQDGNGRENVG I YNL S KGVNRFCLS KPGVYKVTPRS CHR 
FEQAFYTYDTSSPSILTLTAIRHHVLGTITTDKMMDVTVTIKSSIDS 
EPALVLGPLKS VQELRREQQLAE I EARRQE\ S VYYGKEEGEERMTKP 
PVQEMVDELQGPFS YDFS YWARSGEKITVTP\ SSKELAS FIPLSM\E 
AWQWEKACPGKLI\EIHGKAGLFLEGQIHPELEGVEIVI\SEKGAS 
S PLD/ ITVFTDDKGGLQCLGPLHSDRE YTVTSQKEG\ YVTjT\ avegt 
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MGDF\ KAYAIA AGVSFE I KAEDDQPLPGSPFYP* TGGL\ FRSNLLTQ 
GQRAFIiTFSNLKPLAS I YFKPM\ MK\ E FRFEPS SQMI EVQEGQNL\ K 
ITITGYRPAYSC\YGTVS\SLNREPEQGV\AM\EAVGQ\NDCKHFTE 
KTPVTDEEG\KFRL\RGLLPG\CVYH\VQLKA\EGN\DHIERALPHH 
RVI\EVGNNDI\DDVNII\VFRQINQFDLSGKCDSLPLEYLP\TLWG 
SSFYKSE\NLDNPIQTVSLGQ\SLFFHFPPLLRDGRELCLCFWDSTL 
PRSQY\DY\ILPQVFFSPQLGLTIKHITLDFYFPRREAAWNRDIRTK 
DPYIALAIDRCWFCWAGYNP 

1549 

A 

1 

474 

mdnlsseeiqqrahqitoeslestrrilglaiesqdagiktitml.de 

QKEQLNRIEEGLDQINKDMRETEKTLTELNKCCGLCVCPCNSITNDA 
REDEMEENLTQVGSILGNLKDMALNIGNEIDAQNPQIKRITDKADTN 
RDRIDIANARAKKLIDS 

1550 

A 

1 

1554 

MLS CYKQKFPLAADALVTMDVDFYQAKENAELKNAINQPHHLAQWT 
IKQLQTLPDSWGQICNKNQSLSVSLTSQDAGIKTITMLDEQKEQLNR 
IEEGLDQINKDMRETEKTLTELNKCCGLCVCPCNRTKNFESGKAYKT 
TWGDGGENS PCNWS KQPGPVTNGQLQQPTTGAASGG Y I KRITNDAR 
EDEME ENLTQ VG S I LGNLKDMALN I GNE I DAQNPQ I KR I TDKSHLGQ 
PSVLKLKDTSFLRRFHMSAERTVSHPSAAAQVPAGRKRCCHRPVHPV 
GLPAGAPAPALNSRRSLAPGARPKSDKEESEEHTARPPSPAAFAALR 
RPGASGSLRiaREPRTRLPPLSCRKAPQEMLNMSKKTVSCFVNFTRL 
QQ I TN I QAE I YQKNLE I ELLKLEKDTADWHP FFLGKNPTDAYLDAT 
MNEAPGPINFTMFGEKLNGTDPEDVIRNASACFDEEATGTIQEDYLR 
ELMTTMGDQFTDEEVDELHREAPIDKKGNFNYIEFTSILKHGVKDKD 
D 

1551 

A 

87 

736 

FIMDNLSSEEIQQRAHQITDESLESTRRI\LGLAIESQDAGIKTITM 
LDEQKEQ\ LNR I \ EEAWAQ 1 1 KDMR \ ETEKTLTELNK\ CCGtCVCPC 
NRTKELLSLGQGF I KTTWGRWWEKTSPWQC * YSKQPGP/ VWTNGQLQ 
QPTTGAASGG Y I KRI TNDAREDEMEENLTQVGS I LGNLKDMALNIGN 
EIDAQNPQIKRITDKADTNRRFVLDYCPMPEQK 

1552 

A 

99 

362 

FSHIYSLFILFQPLEIRGPYDVANLGLLFGLSESDAKAAVSTNCRAA 
LLHGETRKTAFGIISTVKKPRPSEGDEDCLPASKKAKCEG 

1553 

A 

184 

360 

YVKPPHFKLCLNFKHSGFPVCFTETRKTAFGIISTVKKPRPSEGDED 
CLPASKKAKCEG 

1554 

A 

3 

403 

ISSALNLMQICKGKNVIISSAAERPLEIRGPYDVANLGLLFGLSESD 
AKAAVSTNCRAALLHGETRKTAFGI ISTVKKPRPSEGDEDCLPASKK 
AKWSHSVTQAGVQWHNLGSLQPLPLGLQAIPPTFSLPRN 

1555 

A 

1 

872 

EFGTRWDFSMAVFADLDLRAGSDLKALRGLVET\AAHLGYSWAINH 
IVDFKEKKQEIEKPVAVSELFTTLPIVQGKSRPIKILTRLQIMLSDH 
\SPAKVLKNTLKRGAGL*D\VGAGFPKAEKAFFILLCTHIj\DVDL\V 
C ITVTEKLPFYFKRPP INVAI DRGLAFDLALI PLLSKDSTMRRYTI S 
PVPLQF * CKS CKGKNVI I S \ SACKKRPF* KIRGP ILTWAN\ LGLPVW 
GFSESERQGFGCP PNCRAA\ LLHGETRKTAFG 1 1 STVKKPRPSEGDE 
DCLPAS KKAKCEG 

1556 

A 

46 

584 

SRR P WVI SQRRTRLLS TS LWCKVNVE \ DAGGETLGRLL WYPWTQRF 
\FDSFGNLSSASAIHGQTPKVKAHGKKVLTFLGEMPLKHIi\DDLQGA 
PFAQA * SELALVDKP AMWDP * GTS KLPGE I LLVTRFGQSLFRQKNFT 
PGGARVSWGRKMGDLELASALVPSRLPLSSLiAHECRAFQG 

1557 

A 

1476 

1747 

GNFNSLRLSKTQLCAHCLYPHTFGRQRWVDHLRLGVRD*PGQHGETP 
SLLKNNNNNTKI S WAWWHEPV\ I PA\ MGEAEAGE S LEP \ GRRRLQ 

1558 

A 

1 

507 

MSMLRLQKRLASSVLRCGKKKVWIjDPNKTNEIT^NANSHQQIWKLIKD 
GLIIRKPVTVHSRAR^QKNTLAHWKGRHIGIDKRYCESKKIDRHMYH 
SLYLKVQGNVFTNKPILMEHSHKLKADKAHKKLLADQAEARRPKTKE 
ARKRSEERLQAKKEEII KTLFKEEDTKK 

1559 

B 

15 

400 

MSMLRLQKRLASSVLRCGKKKVWLDPNETNEIANANSRQQIRKLIKD 
GLI IRKPVTVHSRARCRKWTLARRKVRHMGIGKRKGTANARMPEKVT 
WMRRME I LRHLLTRYRECETINRAMHHLLNLKVMS * 
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1560 

A 

15 

710 

INPPPPAFLSLLRPQPCSMLRLQK\RLAS\SVLR\CGKKKVWLr>PNE 
TNEIANANSRQQI \RKLIKDG\ LI IRKPV*RVHSRARCRKNTI*\ARR 
KGTAH/ CGIGKREGYSPMPRMP/TRKVTWMKENEGFWRRLASERYR* 
NLKKIRFATLLSQAL YPEG * RGNWS KTRRVFHGNTFHKLEGRQRPR 

KKAPWLDQA*G/RRRS*DQGKHGKR\REERLPGQRKEEINQRLYSKE 
EETKK 

1561 

C 

1791 

2163 

MNRTVENS TPLQDFVFTS S FTNPKPNPLLKI S LAALPRS AYDGG IH I 
GRAQVCPWVAMARPRAERCLLHFALQLLQLQPDRERAKRLQQLAERW 
RRPPSGHHQI PLFPRPRHGDPRTDLRASNVYSS I IC* 

1562 

A 

2 

421 

MASGRARCTSNLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAG\ 
WAIFKGKYKEGDTGGPAVWKTRXiRCAIjNKSSEFKEVPERGRMDVAEP 
YKVYQLLP PG I VSGQPGTQKVPS KRQHS S VFS ERKEEEDAI ANCTL 

1563 

A 

20 

432 

GRLSPATAWTATQAWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEF 
KEVPERGRMDVAEPYKVYQLLPPGIVSGQPGTQNVPSKRQHSSVSSE 
RKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIGSSSS 

1564 

A 

107 

401 

FQ\^LTLQLLLWQDPLLPLSIMGSSLLPPCLWPSLFFLFFEPLTLSL 
SDLVRYFQGLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFAR 
YLLE 

1565 

A 

191 

353 

PHSSTTCPPAPMLVF*KRDPPSLGPHDALVPPCPVPVEILRSSAKTR 
CGKKASS 

1566 

A 

553 

690 

SRRRFPMGSGTKLV*G*GEMESLEEQAKGKGTESCALHPVDLFSSPG 
PLFNSLCLSKPMAPPTL 

1567 

A 

2406 

4031 

GGRAGDGPLSATCTYAPSLWLDEGSPCLPGPLVTEADRRGTLGTEYP 
PQAEVAEGKGPDEGPMACSLRNSSSTNKEASYHPGFLVVLLPEFDWY 
LKSPNMYQVGTVGECRCTGVHSSPEVPGLTPGNWPPWGSHGVTQRMA 
SG\ RARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFR I PWKHAGKQ 
DFREDQDAAFFKAWAI FKGKYKEGDTGGPAVWKT\ SLCCALNKSFDF 
KEVPERGRMDV7VEP YKVYQLLP PG I Ii \ SGH\ PGTQKVPS KRQHS SVS 
SE\RKE\EKDAMQNCTLSPSVLQ\DSLNNEE\EGGSGGAVH\SDI\G 
SSSSSSSPETTRKITDTTE\APFQGDQ\RSLEFL\LPPEPD*SLLLT 
FI\YNGRWGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQ 
RLLSQ\LRRGILVASTPQGLFRCSAFCPIPIFWGIAPQ\APPG\PGP 
HLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKF\QVTLNFWEESHG 
S SHTPQNL I TVKMEQAFARYLLEQTPEQQAAI L SLV 

1568 

C 

77 

325 

MSLIQEALHLWiTDPDAPAGDDPKYREWHHFLVVNMKGNDISNGTVL 
SDYXCAAPPKAPSHVPQFSVACI IDFSSSCPPWHG* 

1569 

A 

60 

287 

ISMFPQQKNPPPSWFFQGWFFKGPEGHFYAFLGILNALCWGLTPNPP 
FGFFGEGPPPKIPLGSLLGSGFQRQRPP 

1570 

A 

3 

703 

VEFFSS QRAELYATPLTPAPGPNGG I PGWTLWLALPRPGNLRKGPGP 
LS LQE VDEQPQH PLHVTYAG AAV/ DDELGKVLTPTQVKNRPT\ SISW 
DGLD/ S KGKLYTLVLT\ DPDAPKQGKDP \ KYRE \ WHHFLGWSTLKGQ 
MTSATGTVLS \DYVGLGGLPKGTGLHR\ YVWLV\ YEQ\DRPLK\ CDE 
PHPS ATRS \ GDHRGKI QRWASLP VKK* * SSRAPGGWAPC YPQPEVGM 
NQCAPKL 

1571 

A 

1749 

2411 

apsclvsehsapgpqrelpqplltfqayeqilgitc\gscp\aqgwg 
awssdavpqllarrpplphglpacgewgrgelgvkpsglpshagpaw 
ghqvrtvcatahpqdc i s pegaveee i vgg * gc \ tegqs qrvlq i wp 
\ sqgvs slsalvpln\mf\ tell i eyyeki fstp\ eapgehglapwe 
qgsraaplqeavprtqatgLtkptlppsplmaarrrl 

1572 

A 

1 

3408 

MKLMETLNQCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAAD 
LVGVSSQAIRDAEKAGRLPHPDMEIRGRVEQRVGYTIEQINHMRDVF 
GTRLRRAEDVFP PVI G VAAHKGG VYKTS VS VHLAQDLALKGLRVLLV 
EGNDPQGTASMYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCW 
PGLDI I PSCLALHRI ETELMGKFDEGKLPTDPHLMLRLAI ETVAHDY 
DV I VI DS APNLG IGT I NWCAADVL I VPTPAEL FD YTS ALQFFDMLR 
DLLKNVDLKGFEPDSAHDPPAVHLSNGPGQEPIAVMTFDLTKITKTS j 
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S S FE VRTWDPEGV I FYGDTNPKDDWFMLGIjRDGRPB I QLHNHWAQLT 
VGAGPRLDDGRWHQVEVKMEGDS VLLEVDGEEVIjRIiRQVSGPIiTS KR 
HPIMRIALGGLIjFPASNIjRLPLVPALDGCIiRRDSWLDKQAEISASAP 
TSLRSCDVESNPGIFLPPGTQAEFNIiRGSSGSKSVGPLRGLVAVCEA 
LGLWSQTAGIRMASVRIREAKEGDCGDILRIjIRELAEFEKLSDQVKI 
SEEGPCWGYGIYYFIYSTWKGRTIYLEDIYVMPEYRGQGIGSKIIK 
KVAEVALDKGCSQFRLAVLDWNQRAMDLYKALGAQDLTEAEGWHFFC 
FQGEATRKLAGNWQSERQI PFGDVRLPPPADYNKE I TEGDEVEVYSR 
ANEQEPCGWWLARVRMMKGDKVAQVIHNFSFFTMCQFYVIEYAACDA 
TYNEIVTLERLRPVNPNPLATKGSFFKVTMAVPEDLREACSNENVHK 
EFKKALGANCIFLNITNSELFILSTTEAPVKRASLLGDMHFRSLRTK 
LLLMSRNEEATKHLENd I RVRNI ETKLVTGKVGTQNTQLIjGNTIiTGS 
TAAKGVGVLIEGLATSKNPLMTLKPNDTNSVYIDYETEDDTSDGVYP 
NQGNVTSQPLHFQATLKQDGNIAIEPGEFKATSTFQAEAKLPSKHPI 
CSKFRGSLQRMGRGYWIQLFFSPPVHPDYGKQEAKADLSFSFVANEV 
LSERREAFIS IDSEEGALLEDNGMLIiAHWI I YVSEICDYCNIjRFQGS 
SVTGMISMDGSYCFDIHVSSSELFSSVTEHLLRTFSSHI I PRLECRS 
DATTSEKD 

1573 

A 

2 

1136 

MPGQRIASLAYSLLPYGAGIPAPLTRPRLPRRPPRLPPPRSLRLFRI j 
QVRRPFKGSSCRLRGPGTLVPDSGDPDGFRADPRGQGGRLWRYPEAD 
S GENCRP VTRGS EWRGQS GVI*RGGGRGRGS P WE PARGAAGP FCSNLA 
SGPTP * ELAEFEKLSDQVKI SEEALRADGFGDNF FYHCLVAE I LPAP 
GKLLGKSTWPLEVHLSSRPQRSPHRLPEALLPFSLSQG/HCWGYGI 
YYFIYSTWKGRTIYLEDIYVMPEY/RGYWAEAEAGVGAKHPTDPTVI 
PCPQSPAGVI F* SPSAQTIAI FCLFLSTTGQGIGSKI I KKVAEVALD 
KGCSQFRJ^VIiDVWQRAMDLYKAtiGAQDLTEAEGWHFFCFQGEATRK 
LAGK 

1574 

A 

182 

797 

LPFARPWDFGPRQRGSGWLPCGSREAKEGDCGDIL\RLIRVKTAGPE 
LAEFEKLFGSGERSVEEAIjESRLAFGDNPFYHLFW*AEILPAPGESY 
WG/HCWGYGIYYFIYSTWKGRTIYLEDIYVIAGNIGVQGIGSQK*S 
KKV\AEVALGIRGCSPNFRLARPLD\WNPEGPWDLYKAPRSPKI*RK 
LEGWHFFCFQGEATRKLAGK 

1575 

A 

1 

254 

STISLCRRCGSKAYHLQKSTCGKCGYPAKRKRKYNWSAKAKRRNTTG 
TGRMRHLKIVYRRFRHGFREGTTPKPKRAAVAASSSS 

1576 

A 

3 

308 

RTS EMTKGTS FFGKRRNKTHTLCRRCGS KAYHLQKSTCGKCG YPAKR 
KRKYNWSAKAKRRNTTGTGRMRHLKIVYRRFRHGFREGTTPKPKRAA 
VAASSSS 

1577 

A j 

1 

380 

IPTPLIGNFGPRGPRIRHERPQKRDDRREPSSFGKRRQ*DGTIiLC\R 
RCGS \ KA\ YHLQKSTCGKCGYPAKRKRKYNWSAKAKRRNTTGTGRMR 
HLKIV\YRRFRAWDFREGTTPKPK*GSIiLQHSSSS 

1578 

A 

41 

544 

APSPRRPWGHFTEED\KATI\TSLWGK\VNVE\DAGGE\TPGKGSLV 
VYP\ WTQRFFDS FGNLSSAF\AHHGQTPKVKAHGK\ KVLT\ SLGDAI 
K\HLDDLKGTFAQA*VNLHL*QSCNVDP\ENFQAPGEMLLVTR/VLA 
IHF\ GK\ EFTPGGCKAS WAEDG * LAVGQWPCS SRYH 

1579 

A 

1207 

1369 

I LETYI KTIiQR YADGS YKSEKVKS VGPRGGAI EKEKNCLTVPFQEKS 
LGSGIW 

1580 

A 

1 

1716 

TCIAAVKMEGPLSVFGDRSTGETIRSQNVMAAASIANIVKSSLGPVG 
LDKMLVDDIGDVTITNDGATILKLLEVEHPAAKVIiCELADLQDKEVG 
DGTTSWIIAAELLKNADELVKQKIHPTSVISGYRLACKEAVRYINE 
NLIVNTDELGRDCLINAAKTSMSSKI IGINGDFFANMWDAVLAIKY 
TDIRGQPRYPWSVNIIiKAHGRSQMESMLISGYALNCVVGSQGMPKR 
IWAKIACLDFSLQKTKMKLGVQVVITDPEKLDQIRQRKSDITKERI 
QKI LATGANVILTTGG IDDMCLKYFVEAGAMAVRRVLKRDLKR I AKA 
SGAT I LS TLANVEG * ER FEGAMWDQAEE WQER I CDDEL> I L I KSTKA 
\RTSASIISRVPIDSMCDEMERSL\HDALC\WK\RVLESK\SWPR 
\GGAVEAALSIY\LENYA\TSMGSREQLAIA\EFARSLSGYSPIPLA 
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VNAAQDS TDLVGKN * R/ RLFHNEAPGLTPE \ RKN\ LKW IGLDLSNGK 
P\ RDNKQAGVFEP/ TP I VKVKSLKFATEAAIT\ I LRIDDLI KLHPRK 
*R* \KHGSYEDAVHSGALND 

1581 

A 

1 

514 

PRVRNLSREWLCDRHLREKMFSSVAH\LARANPFDTPH\LQLVHDGL 
GDLRSSSPGPTGQPRRPRNLAAAAVEEQYSCDYGSGRFFILCGLGGI 
I S CGTTHT\ ALVPLD\ LVKCRMKV\ DPQKYKG I FNG\ FS VTLKEDGV 
RGLAKGWAPTFL\ G YSMQGLLQVIiAF YEVFKVLY 

1582 

A 

2 

1296 

ALCEPQPFQGSGCVIAILGRKMFSSVAHLARANPFNTPHLQLVHDGL 
GDLRSSSPGPTGKPRRPSQ/HMAAAPVEEQYSCDYGSGRFFILCGLG 
GI ISCGTTHTAIiVPLDLVK\ CRMOVDPOKYKO\ TPNrJPQVTT.K-vrifiv 
\ RGLAKGWAPTFLG YSMQGLCKFGFYEVFKSIA YSNMLGE\ ENTYL * 
RTS L YLAAS AS \ AE FFAD I ALAPME AAKVR I QTQP \ G YANT * EG I S F 
P KC I KE EGLTS I LQGGLLPL WMRQI PYTMN * S S PCLERTV\ E A\ LYK 

fv\vpk\prre*fkrqsrlwtiw*qvtiarvfcancfsplpeflg* 
pvi»d*gkkvsqcflwvlqrdlgfk\gv\wkglfa\rii\migt\lt\ 
alqwf i yys vkg yfr\ lprp \ ppp\ emqe s \ lkkklgvns wri kan 

CGIiNLLVDP VFEES AKGTF IYLTV 

1583 

A 

127 

434 

RPLES W IGLVRCNI CRS P I AEAVFRKLVTDQNI S KNWRVDSAATSGY 
EIGNPPDYRGQSCMKRHGIPMSHVARQ\DLNRKSNRVKTCKAKIELL 
GSYDPQKQL 

1584 

A 

2 

535 

QRGGIRKMAEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENW 
VIDSGAVSDWNVGRSPDPRAVSCLRNHGIHTAHKARQITKEDFATFD 
Y I LCMDESNLRDLNRKSNOVKTCKAKI ELLGS YDPOKOT , T T PDPYYf3 
NDSDFETVYQQCVRCCRAFLEKAHLGRFVPCCGQPD 

1585 

A 

551 

1299 

PADPPRPSYYRHRTPPQAHWSRLRRSRLRRRGSHTRCPVGVGAGLRR 
RAGARLAVRLRASACGTPRCLGASARGKMAEQATKSVLFVCIjGNICR 

S P IAEAVFRKLVTDQNI s kn/ wegrqrgnfrwvi dsgavsdwnvgrs 
pdprav\sclrnhgihtahkarqit\kevfptfdyilcmdesni>\rd 

LNRKSN\RVKTCKS * KFELPWEIi * SPQKQLI IED\ P YYGE * LWTLET 
VYQQ\ CVR\ CCRAFL\ EKAH 

1586 

A 

1 

1239 

MERAEFVFTGIDAYSRYGFAYPACNASAKTTSVDSWNALSTVMVFHT 
AL P Li TKALTLRXj KICLVTV S T . VT> Q twt . wt . wdt .OTOT . fC\T 7i imuapoDP 

PS VGEENPPYESLOI S PDVPLRRTRSRCOVHNYTKPPAWGnVPAHAR 
VRGEWADGQLPQPILDQSTPMILTRVLWPYSKGGKKVSLPLSAPGEK 
GGPCRQGSKSGAAVGGVLQAIjLWRKSCCPEWRQGWMWVDNQIjIGTTQ 
P FML YVTP LS NENEV I ETG PAVQ VNAVKF PS KS AJL.TN I Y KHLM I TAQ 
RFTVQIEEKLLLKIiLSFFGYDQAESEVEKYDENLHEKTAEQGGTPIR 
YYFENLKISIPQIKLSVFTSNKLPLDLKEIiLSQAARILGSVDFLGNP 
MGLLNDVSEGVTGLIKYGNVGGLIRNVTHGVSNSAAK 

1587 

A 

298 

408 

SILRCSVISYFSFSVSKISPSWCLGFRSFFFLKRTRG 

1588 

C 

92 

244 

MRCE I ILVLI PYVYFYSNKLLCSRLXXXXXGGAVLKNPWGGQSLPGL i 
AR* * 

1589 

A 

33 

191 

RDDPRVRPPPNSHT* PQQEPGIj* LI KCTS PPQAPAPRTVHGPYFYMR 
LIKMF 

1590 

A 

3 

1285 

CFSFVFVRHSSRHCREYETLITADFPPEASYLMSLCSWSVTTMVAFL 
AQSRPSGSAGAHRHSRVRS STALGHS KRLADTDLALMKPAWTPAFQH 
CMQHPNQPCEEKRYEOMVTOOSDTHTOKATrRGRTrWATjOOPOT.TjVQ 
HVIMKREQILYSGPCCQMTSPSCQLILSCDKTEFYVLFIPALMDETA 
FLVKFAGTLSDGLGKTMDNRHQS ERE Y I RYHAATSGEHI/VAG IHGLA 
HGKSWVTSGSAAAGPQRRLCINLIAAIKKERIiQQPLSFSLGIIGGIiT 
SVITSTVEGVKTEGGySGFISGLGKGIiVGTVTKPVAGAL.DFASETAQ 
AVRDTATLSGPRLKYFQVPLLVGVRWHSKRQPPNFPLKFPGNPIVLR 
KADEAiKSQLGALGLRLRGPYCGMIKWGPADSGLLLRGAJCLNGDERG 
DLKXi 

1591 

A 

3 

3469 

QPGHTIYLLPTVVICinjLPCELDFYVKGMPINGTLKPGKEAALHTAJD 
TS QN I ELGVS LENFPLCKELL I PPGTQNYMVRMRL YD VNRRQIjNLTI 
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RI VCRAEGSLKI F I S AP YWL INKTGLPIiI FRQDNAKTDAAGQFEEHE 
LARSLSPLLFCYADKEQPNLCTMRIGRGIHPEGMPGWCQGFSLDGGS 
GVRALKVI QQGNR PGL I YN I G IDVKKGRGRYI DTCMVI FAPRYLLDN 
KSSHKIAFAQREFARGQGTANPEGYISTLPGSSWFHWPRNDYDQLIj 
CVRLMDVPNCIWSGGFEVNKNNSFHINMRDSI»GK\CFFLRVEITLRG 
ATYRISFSDTDQLPPPFRIDNFSKVPWFTQHGVAEPRLRTEVKPMT 
SLDYAWDEPTLPPFITLTVKGAGSSEINCNMNDFQDNRQLYYENFIY 
IAATYTFSGLQEGTGRPVASNKAITCAELVLDVSPKTQRVILKKKEP 
GKRSQLWRMTGTGMLAHEGSSVPHNPNKPSAARSTEGSAIIiDIAGLA 
AVTDNRYEPLMLRKPDRRRSTTQTWSFREGKLTCGLHGLWQAKGGL 
SGLFDGAEWLGPDTSMELLGPVPPEQQFINQKMRPGSGMLSIRVIP 
DG PTRALQ I TD FCHRKS S R S YE VDEL P VTE QE LQKLKNPDTEQELEV 
LVRLEGGIGLSLINKVPEELVFASLTGINVHYTQLATSHMLELSIQD 
VQVDNQLIGTTQPFMLYVTPLSNENEVIETGPAVQVNAVKFPSKSAL 
TNI YKHLMITAQRFTVQI EEKLLLKLLS FFGYDQAESEVEKYDENLH 
EKTADOGGTPIRYYFENIjKT S T POTKT.^VFT<?NKT J PT»nT J KAT J K<3TT J (3 
FPLIRFEDAVINLDPFTRVHPYETKEFIINDILKHFQEELLSQAARI 
LGSVDFLGNPMGLLNDVSEGVTGLI KYGNVGGLI RNVTHGVSNSAAK 
FAGTL SDGLGKTMDNRHQS ERE Y I R YHAATSGEHLVAG IHG PGLMG I 
I /GVGLTS VI TSTSGRV * KQKGGVSGFI SGLGKGLVGTVTKP VAGAL 
DFASETAQAVRDTATLSGPRTQAQRVRKPRCCTGPQGLLPRYSESQA 
EGQEQLFKLTDNIQDEFFIAVENIDSYCVA/LISSKAVYFLKSG\DY 
V\ DREA I FIj \ E V\ KYDDLYHCLVS KDHGKV\MCR 

1592 

C 

398 

655 

^FPLAFSLPLKNAFHISVCRVCPGYTGFAKRALTAIxNIiDTSLSAN^ 
CNTP PAEX PNVHNPCYMGLS KPARX S KLG S MC KG S SXH * 

1593 

A 

\ 

• 93 0 

MAMD ALAYTOM FLLLMNG GD WV T AT. AMLNTT.TNH < 5T J OT.VSODANSRE 
IQLRFTANDTLPE IQADPDRLTQWGWEGQI IKKYICKNI IGVGAVE 
KKMSEWNRKCFKERLQEVKECAIJ^ 

CTSLSQFPNSIAGVRWERTLGAIVRQKNHPMTPEAVKANV?KKICDFE 
NASKPQSIQVYSYSEKELICHNQFSLFLVGSGGFGGKRTSDKVKDHR 
SSPAIEQSWTKMDFDKIjTEVGFRRIVVITNF 
EKRLDEWLTRINSVEKTLNDLMEPKTMA 

1594 

A 

1 

134 

SRVDDFVSLQALFMGSPLRFDGRVVLVTGAVAGEHAKVGGRAPC 

1595 

A 

3 

2455 

HASVCPAVGVQRLCLFPCVSIjQALFMGSPLRFDGRFF\LiVTGAGAGIj 
GRAYAIiAFAERGAXVVVlTOLGGDFKGVGKGSLiAADKVVEEIRRRGGK 
AVAITxTDS VEEGDKVVKTALDAFGRIDVVVNNAGI LR/D INSFARISp' 
EDWDIIHRVH\LRGSFQVTPAAWEHMKKQKYGRSIMTSSASGIYGNF 
GQAITCSAAKLGLLGLANSLAIEGRKS 

EDL^AiKPKYVAPLVLV^CHQSCEENGGLFEVGAGRIGKLRWERTI* 
GAIVRQKIOTPMTPEAVKAITOKKICDFENASKPQSIQESTGSIIEVLS 
KTDSEGGVSAOTTSRATSTATSGFAGAIGQKLPPFSYAYTELEAIMY 
AJ^VGASiraPKI^LKFIYEGSSDFSCLPTFGVIIGQKSMMGGGLAEI 
PGLS INFAJvVLHGEQYLELYKPLPRAGlCL 

I MDVYS YS E KEIj I CHNOF S "L FT.VG <5 O d PGG ICR T<5 DKVICVAVA I PNRP 
PDAVLTDTTSIxNQAAiYRLSGDWNPLHIDPNFASLAGFDK\PILHG\ 
LCTFG I FCQGVLLQQFCR * MDWQGFKGN * RARF \ AKPVYPGANFYQ 
T * ECWKE\ GNRNS FFKPKVQGNIiETLVI SICWHMWDLGTQHSGYFSLR 
TPSEGPGSFRVPLVFEE\IGRRLKDIG\PEVVK\KVNAVF\EWHITK 
GGNI\GAKWTIDLK\SGSWEKLYQGPS/KKGAAI)TTIH/ ITjSDEDF/ 
LWEVVLGQA*PSRKAFFSGPG*RPC^GTS^*AQ\KLSDGFLKDYAK 
LLKGTPTLLIKMESIKIPPPHPQICIxDYSAKS 

1596 

A 

2 

289 

QHAMFGOT>TITCTTHGl\WTKL^ 

LYYKDKATFGCHDGYSx^GPEEIECTKLGlTOSAMPSCKEHEFSGFLE 
N 

1597 

A 

1 

397 

PTRPRTRGVCLPFIxAMFGNDTITCTTHGlWTKLPECREVKCPFPSRP 
DNGFVNYPAICPTLYYKDKATFGC^GYSIxDGPEEIECTKLGxTOSAMP 


1 0,600 
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S CKAS CKVPVKKATWYQGERVKI QEKFKNGMLHGDKS 

1598 

A 

1 

412 

STMISPVIilLFSSFLCHVAIAGRTCPKPDDLPFSTVVPIiKTFyBPGE 
EITYSCKPGYVSRGGM\RKFICPLTGLWPINTLKCTPRVCPFAGNLR 
KMGAVRL I TDFLNYS PTRFS FS LLTWG F I LEWALDS \ AKCI EGG 

1599 

A 

19 

1215 

CQCDSSTMIFSRCSSLFSSFLCHVAIAGRTCPKPDDLPFSTWPLKT 
FYE PG\ EE I T Y S C KPG YVS RGG I E E S L S C PL\ TGTVGP FNTS GNVTP 
RVCPF\AGIFRKMGGRTLITTF*NYPNTDPVFSLLTLGF*FWNGALD 
FWPSCTGGKGKW\SP\ELPGLVAPII\CPP\PSIP/TGFATLHVLLR 
PFRLGNNS PP I GDTAVFECLAHNMAMFG \ NDTI T\ CTTHGKLDLNYP 
ECRGSKMPPFPHQDPDNGIW*TYPCQNPNTLFTRVKAPHLGLPHDGI 
FSGMGPRJCEI\EC*PQTWGKPGSWPIiAPSW*KPSLVKGTPVKKRPTV 
V/YPQGERVKDSREKFJCEWECLHG**KFLSFCKNKEKKCSYTEDAQC 
IDGTIEVPKCFK\EHSSIiAFWKT\DAS\DVKPC 

1600 

A 

1 

282 

RCGSQQLGRREEWQRQGSPVSRRLSARRGPQAPGTRLPRRHPARAFP 
AATMPKRKVSSAEGAAKEEPKRRSARLSAKPPAKVEAKPKKAAAKCR 

1601 

A 

1 

453 

EFGSOQLGRREEWQRQGSPVSRRLSARRGPQAPGTRLPRRHPARAFP 
AATMPKRKVS SAEGAA* LEPNSRS ARLS AKPPAKGEAKPKKAAAKDK 
SSDKK\VQTKGKRGAKGKQ\AEVANQETKEDLPAENGETKTEESP\A 
SDEAGEKEAKSD 

1602 

A 

146 

824 

TWGKGDPKKPRGKMS S YAFFVQTCR\ EEHKKKHPDAS VNFS / ESFSK 
KCSERWKTMSA*R/EKGKFEDMAKA\DKARY\EREMKTYIPPQRGRQ 
KRKFKDSQIiHPRGPPSGLLSSSCSEYRPKIK\GEHP\GL\SIGDVAK 
KLGRDVGINTAAD\DKQPYEKK\AAKLKEKYEKDIAAYRAKGKPD7VA 
KKG\VVKAEKSKKKKEEEEDEEEG\DEEDEEEEEDEEDEEDEEEDER 

1603 

A 

1 

223 

PIRTSRVDPRVRVATRDNCCILDBRFGSYCPTTCGIADFLSTYQTKV 
DKDLQSLEDILHQVENKTSEAKQAGDV 

1604 

A 

1 

400 

FADD/ PSDK/ FFTSNNGMQFSTGHNDNDKFEGNCAEQDGSGWWMNKC 
HAGHLNGVYYQGGTYSKASTPNGYDNG I IWATWKTRWYSMKKTTMKI 
I PFNRLTIGEGQQHHLGGAKQVRPEHPAETE YDSLYPEDDL 

1605 

A 

18 

365 

NIL I KVYFNS KNDFKI FHELFFKQNYMKNM YKS VINVI D I FMNKFQ / 
SEKYPI I /DKGSIiNK^KLTILALKSNTTVRLIRBTAFYYVREHI INV 
SSKRARYWVCVGFI*ASC*QPPLF 

1606 

A 

212 

1645 

HYKARSSGHSDIMSWSLH\ARNIiILYFYALLFLSSTCVAYVATRDNC 
C \ I It YERFG CYC \ PTTCG I ADFLS TYQTR VD * DJ^QSlA ED I LHQVEN 
KTS\EVKQLIKAIQL\TYNPD\ESSKPNMIDAATLKSRKMIiEEIMKY 
EAS \ I LTHDSS IRYLQE II* FQI IQKI VNI»\ KEK\ VAQLEAQCQEPC 
KDTVQIHDITGKDCQD\IANKGAKQSGLYFIKPLKANQQFLVY\CEI 
DGSG\NGWTVFQKRLDGSVDFKK\NWIPYKEGFGHLSPTGTTEFLAG 
EMRKIHFD*GTQS\AIPYGI*GVGTGKTWEWARNQYCRSMPLFKWH 
E VD \ KYRFTYAYFAGGDAEDAFDG YDFG \ DD PSDKF FH I P IMAMQFT 
YLGTMDNDKV*KANCA* / QQGWDPGWWDGNKC\HAG\HSSMGVLFTQ 
GWALYFQKASYLPNGLWIMGI I WA\ TWK\ TRWVFR * RKPTMKI I P \ F 
NRLTIGEGQQH\HLGGSQTGLETF 

1607 

A 

2 

531 

GTVAACGAC YWIjIiGIjMAVRAS FENNCE I GCFAKLTNTYCLVAI GGS E 
NFYSVFEGELSDTIPVVHASIAGCRIIGRMC^G\TEEIIJU5VI,KVEV 
FRQTVADQVLVGS YCVFSNQGGLVHPKTS I EDQDELS S DLQVPLVAG 
TVNRGSEVIAAGMWNDWCAFCGLDTTSTELSWE 

1608 

A 

18 

889 

GVQGTVAACGACYWLLGLMAVRAS FENN \ CE I GC FAKLTNTYCLVA I 
GGSENFYS VFEGELSDTIPWHAS I\ AGCRI \ IGRMCVGN\RHGLL\ 
VPNNTTDQ\ EL\QHI SATGLPRHSGRFRAG WKERFLSLWGNFFNHLA 
IDYVGLGSNQD\LDK(?RQEEISGQMLFKGWEVFRQTV\ADQV\LVES 
\ YCVFSNPGRAWVPS PRPFQ * RPRNELS S I S FKVPL \ VAGTC * TKGS 
EVICLLGMGGEMNWCA\ FCGPGTPNPAQSCQWEECLQS *NEAPALA 
PIANRACGNSL\ IDSLT 

1609 

A 

1 

248 

GPLIWEWPASPEPPPLPWGKPRMQ/SG*YG*TP*IPKIRFPKKPFPP 
FPQALEPQQKGPN * AHP * EPTPAKKYS PQRVQKVPK 
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1610 

A 

290 

1414 

NKRHPSRVYMSLPQG\EKVQAMYIWIDGTGEGLRCKTRTLDSEPKCV 
EEL PEWNFDG S S TLOSEGPOTVNMYLVPAAMFRT)PFRK\ DPNKLVLC 
\EVFRYN\RRP\ AETNLRHTCKR\ IM\DMVSNQHPWFGMEQ\ EYTLM 
GTDGHPFGWPSKG\FPGPQGQYLLGVG\GDRSLGRDIR\EAHYRACF 
Y\AGTRIPGTKAGVSPLQ\WEFQIGPCEGIKQGG/HIFWVARFILH\ 
RVCEDLG\VIATFDP*/RPLLGNWNGA\GCHTNFSTKAMRKENGLKY 
IEKAI*KLTNR\HKSHIRAYDPKRGLDNARRLTGFHETSNINDFSVG 
VANRSAS I R I PRTVGQEKKGYFE\ DRRPS \ ANCDPFSVTEALI \RTC 
LLNETGDE PFQYKN 

1611 

A 

38 

396 

S S VHDLLVAAMS ATS EIQDPRGRGDGPGGLPDVVAI PW PPR * I LS QE 
E/HHPTDNQDIEP\GQEREGTPPIEERKVEGDC\QEMDIi\BKTRSER 
G\DGSDVKEKTPT* I LKHAKTKE AGDGQP 

1612 

A 

129 

1182 

APP S PPS SG CS PQPQLS ALTPGTRVLAPS FAS FL P S FFL PPLAPALP 
LQVALPG PDCLG S PLPARAL PRLS LALPES PAAAVADS PRE PQPNPS 
PTATAPAPAPAPQPAAPARGS PGARGRLQWASAPS PSPAPQPCPARR 
GRTGKMNNGGKAEKENTPSEANLQEEEVRTLFVSGLPLDIKPRELYIj 
LFRPFKGYEGSLIKLTSKQPVGFVSFDSRSEAEAAKNALNGIRFDPE 
IPQTLRLEFAKANTKMAKNKLVGTPNPSTPLPNTVPQFIAREPYELT 
VPALYPSSPEWAPYPLYPAELAPAX.PPPAFTYPASLHAQLCEGQTV 
RRSHPLSAPS PDS ASLAWFPV 

1613 

A 

1 

353 

FGTRSFDSRSEAEAAKNALNGIRFDPEIPQTLRLEFAKANTKMAKNK 
LVGTPNPSTPLPNTVPQFIAREPYELTVPALYPSSPEVWAPYPLYPA 
EIJ\PALPPPAFTYP\ASLHAQETL 

1614 

A 

129 

1238 

APPSPPSSGCSPOPOIjSAT.TPnTPVT.APQPaciFT.PQFPT.PPT.ttPM.P 
LQVALPGPDCLGSPLPARAX#PRLS LALPES PAAAVADS PREPQPNPS 
PTATAPAPAPAPQPAAPARGS PGARGRLQWASAPS PSPAPQPCPARR 
GRTGKMNNGGKAEKENTPSEANLQEEEVRTLFVSGLPLDIKPRELYL 
LFRPFKGYEGSLIKLTSKQPVGFVSFDSRSEAEAAKNAIiNGIRFDPE 
IPQTLRLEFAKANTKMAKNKLVGTPNPSTPLPNTVPQF\ IAREPYEL 
\ TVPAL YPS \ S PE VW\ APYPLL PRRELGACF YLPPG / S FTYPASLAW 
PRCAGSLPPRLLLRAGSPVSSAEYYTLQQWLLEGRTIALLWKYG 

1615 

A 

129 

1143 

AP P S PP S S GC S PO POLS ALT PGTRVLRP S FAS PL P S FFL P PLAPAL P 
LQVALPGPDCLGSPLPARALPRLS LALPES PAAAVADS PRE PQPNPS 
PTATAPAPAPAPQPAAPARGS PGARGRLQWASAPS PS PAPQPCPARR 
GRTGKMNWGGKAEKENTPSEANLQEEEVRTLFVSGLPLDIKPR\ELY 
LLFRPFKGYEGSL*KLTSKQL*GFVSFDSRSEAEAAKNALNGIRFDP 
EI PQTLRLEFAKANTKMAKNKL\ VGTPNP\ STPLPNTVPQFI AREPY 
ELTVPALYPS\SPE\VWAPYPL\YPAELGPA\LPP\PAFT\YPASLR 
CPGNPVEKEIQDSV 

1616 

A 

1 

669. 

MRWLVSPIEDTGHKALLFACLALHRACVSGHCLSPKCVGVDGRSHEM 
VAPGLFLDWLLDGTGVWDGTSTNVEGAQEHAVLILLIITDGVISDME 
ETGMPWCRLPSCPCPSSSGRGQCGLRCHGVPGWGRRMLRSHTGEEAA 
RD IVSS FPFESSATMSEEVYDPRSLYERLQEQRTGSSRTARQEAWG 
S KVTVS QPGHRQETTVLEHLVQ YVGF S I CCKELMV 

1617 

A 

267 

1057 

GRTMMFGAKRRQEEWEKVRKPEDPEECPEEVYDPRSLYERLQEQKDR 
KQQELRGTVSNCKNM\ VRG\ LDEDETTFLDEVSRQQELI EKQPKR\ E 
ELERT*ROTEITSRRLEFSQENKKEVEKKLTCE/VL*KPRTKFSQAN 
VLGQEL*SMKSSESGHQCEKT*NRTPEPD*QESRALILAKSLGNNLP 
LSGP FHPLAPS AASM YRHPPRpGWPTLGAATPS PASDS EGT I QCHRK 
DCSPPCFRTNTFLRGPLSFFRSLHREAPPQG 

1618 

A 

1 

641 

DTRFLERLRLS I S S YVQTPMGHFTEEDKAT I \ TS LWCKVNVE \ DAGG 
ETLGRLLWYPWTQRFFDS FGNLSSAS AI \MGNP\ KVKAHGKKVLTS 
FGRLP*KHL\DDLKG\TFAQA*SE\LH\CDKAALLDPE\NF\KLPGE 
ILLVTRFGQFHFRAKNFTPEGCQA\SWQBRWVTWSWPVPCSSRLPLK 
LNCP*MQSFSRIRLLFLQAITNNKSISAKRSP 

1619 

A 

1 

194 

KEQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQIFLGGVDKHTQF 
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WRYFAGNliIiPVDGRGRIi 

1620 

A 

98 

1136 

asdafhslsapglrlgsrsaarpatmteqai s fakdflaggiaaai s 
ktavapi ervklllq \ vqhas kq i aadkq\ ykgi vdc i vr i pkgagg 
vlsflegqplptviryfpXtqafnfrpfkd/ksykqifpggawdkht 
qf \ w\ r * fgganwasgvaagadhpscfvypldfartglgkgrwksqa 
qsasfrglg\dclvkihqvqtasrglyqglpvspftatiiypa\ay\ 
ygvy\dtakghasptpsnt\hillsrmnrartvtargavgcpnplkt 
. vrr\ rmi dai rgakgad i mytg / tlvdc * rknls edeggqrpffkgc 
wqtllrghggr p s vl vp vrts s rr vi 

1621 

A 

4 

101 

. VQWNDFGSLQNPPPGVSPFFWLSLPNNLGFKGSSSRPGPFLKF* *KP 
/EVFRVNPDGFPFPAP*FGPPGPPKSWGFPP*PPPGVSPFFWLSLPN 
NLGFKG 

1622 

A 

3 

1728 

GDRTDGVWWWLGLSRRYAI PFNSLEPSSIiGSCPLS IGTHK\ AJjHALE 
LTTTD I QCGKAFN S S S S FRYHERTHTGEKP YECKQCGKAFRS ASLLQ 
THGRTHTGEKPYACKECGKPFSNFSFFQIHERMHREEKPYECKGYGK 
TFSLPSIiFHRHERTHTGGKTYECKQCGRSFNCSSSFRYHGRTHTGEK 
PYECKQCGKAFRSASQLQIHGRTHTGEKPYECKQCGKAFGSASHLQM 
HGRTHTGEKPYECKQCGKSFGWASRLQMHGRtHTGEKPYKCKQCGKA 
FGWPSNLRRHGRTHTGEKPYKCNQCGKKWKDQNIEYEYQNPRRNFRS 
L IEGNVNEI KEDSHCGETFTQVPDDRLNFQEKKAS PEAKS CDNFVCG 
EVGIGNSSFNMNIRGDIGHKAYEYQDYAPKPYKCQQPKKAFRYHPSF 
RTQERNHTGEKPYACKECGKTFISHSGIRRRMVMHSGDGPYKSSFLR 
SHPKVPGHGRRFAMNPTLLNKEDERGNKNVVTREHLDRMKNSCIVCN 
MGHSNTEIDVNWMGNYERLERDARMTIRKKRKRYSDIRFRIMGRYT 
VRVEVLTAVNSE 

1623 

A 

1637 

5763 

KPGNGACAGREWCDGGGAAWNWRDPGLPVGDSGVWDRVLIiELLGPRS 
PRPLDVGGPAAGTPGVLSRPCPSTAAIAPKPFCAAPRPQPDAPACAG 
ATGGSCADFDSGVDFVRRRSSGLWGPQPPLSPVKNYTEMFQDPVAFK 
DVAWFTQEEWALLDISQKNLYREVMLDTFWNLTSIGKKWKDQNIEY 
E YQNPRRNFRS VTEEKVNE I KEDSHCGETFTPVPDDRLNFQKKKAS \ 
LK*NHVTALCVQKLA* VTHli/ CNMS IRGDTGHKAYEYQEYGPKPYKC 
QQPKNKKAFRYRPS I RTQERDHTGEKPYACKVCGKTFI FHSS IRRHM 
VMHSGDGTYKCKFCGKAFHSFSLYLIHERTHTGEKPYECKQCGKSFT 
YSATLQIHERTHTGEKPYECSKCDKAFHSSSSYHRHERSHMGEKPYQ 
CKECGKAFAYTSSLRRHERTHSGKKPYECKQYRGRAYHMLIiGLQPH\ 
KRMNSGERPYKCKICGKGFYYAKSFQTHERTLTGERRYKCKQCGKAF 
NliSSSFRYHERIHAGEKPYECKQCGKAFRSASQLRVHGGTHTGEKPY 
ECKECGKAFRSTSHLRVHGRTHTGEKPYECK\ECGKAFRYVKPLQIH 
ERTEKHIRMPSGERPYKCSICEKGFYSAKSFQTH\KKTHTGEKPYEC 
NQCGKAFRCCNSLRYHERTHTGEKPYECKQCGKAFRSASHLRMHERT 
HTGEKPYECKQCGKAFSCASNLRKHGRTHTGEKPYECKQCGKAFRSA 
SNLQMHERTHTGEKPYECKECEKAFCKFSSFQIHERKHRGEKPYECK 
HCGNGFTSAKILQIHARTHIGEKHYEFLFDIMKGLTLGRKPINIHER 
AHTGEKPYECRECGKAFKWPISVRRHERTHSRKKPYECKHRGKVLSY 
LTS FQNHLGMHTGE I SHKCKI CGKGFYS PRSLQRHEKTHTAEKPYEC 
KQCGKAFS S S S S FWYHERTHTGEKP YECKQCGKAFRS AS ILQMHAGT 
HPEEKPYECKQCGKAFRSAPHLRIHGRTHTGEKPYECKECGKAFRSA 
KI^RIHERTQTHVRMHSVERPYKCKICGKGFYSAKTHTGEKPYECKQ 
CGKTFRSTSHLRKHGRTHTGEKPYECQECGKAFRSMKNIjQSHERTQT 
HVRIHSGERPYKCKLCGKGFYCPKSLQRHEKTHTGEKLYECKQR/CS 
SSFSSSSS F * YHERTHTGEKP YKCEQCGKAFRAVS I L * MHGRTHPEE 
KJPYESTRPPAMQGHKLCYAIDYVTYHCMITFILLLWPPIAQSSSRRS 
AD S E KLKS PGKQKTHMHTQARTMAQACPTCQHVPGVI P VEGCNPQGIi 
APNEIWQMDVTHIAAFGCEKAIATCMPAGI*LEGLPRQL1^IDCLIRQLM 
ACWCAS AYQCALLLACS S 

1624 

A 

2 

658 

APTPTGQRVVRATPAQSAPWLRRRSYDVNNPIPSNLKSEAKKAAKI 
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LREFT\BITSRNGPDKI\XPGSTVIAKAK\GLANSCLLNQSPGSLVT 
FQRGGPGVLVARL\PDGK\WSSPFS/AIiGIAGFG\GGFEIGI*GIQT 
LVIILEF/DDPCC*EAFAKGGNIiTLGGNLTVAVGPLGRNLEGNVALR 
SSAAVFTYCKSRGLFAGVSLEGSCLIERKETNRKFYC 

1625 

A 

141 

1307 

FHVNNPIPSNLKSEJAKKAAKILREFTEITSRNGPDKIIPAHVIAKAK 
GIAILSVIKAGFLVTARGGSGIVVARIjPDC5KW«?apc;aTaTanT nnnv 

E I GI / EDTFTATYSL * ALPWLPS I S VECHS S FLRLPSA*HI FLHPFT 
VNLMVSDLVI I LNYDRAVEAFAKGGNLTLGGNL TVAVGPLGRNLEGN 
• VALRS S AAVFT Y C KSRGLFAG VS LEG S CL I ERKETNRKF YCQD I RAY 
D I LFGDTPfcPAQAEDL YENLDS FTEKYENEGQR INARKAAREQRKS S 
AKELPPKPLSRPQQSSAPVQLNSGSQSNRNEYKLYPGLSSYHERVGN 
FYQPIDLTALYSFEGQQPGDLNFQAGDRITVISKTDSHFDWWEGKLR 
GQTG I F P ANYVTMN 

1626 

A 

167 

402 

KYSLPRVTSSLWDLPTHVHSQATSPLPSWPFCPYPTHSPTLPPLGKM 
EQVFGKLPHPPLPLLTLSESWGETEAVFLAP 

1627 

A 

133 

312 

VRVGEKLKPCFWPPEANPDP*CYLHLWILDSQTKSKPVLTSP*RPPN 
GILGTSVCFCTFY 

1628 

A 

1759 

1967 

SGCKPLTFPPP*SDSPVKEDPCR/SPPSHPRLPPHSLPALPFPTPNP 
PPPKI YTAVSRIWEMKDRCNI P I 

1629 

A 

167 

1378 

GNTLVTNMTEFWLISAP\GEKT\CQQTWEKLHAATSKNNNIiAVTSKF 
N\ IPDFKGWARWDVLVGlA SDEL\ AKT^Or , TT»*Pnc;MT.irpcpT.nT.WfZT.. 

DVLEDSKDKVQENLLA\NGSGLGLPYITKVPSGDMAKYPIKQSLKNI 
SEIIAKGVTQIDNDLKSRASAYNNliKGNLQNLERKNAGSLLTRSIiAE 
IVTCKDDFVLDSEYLVTLLVWPKL^ 

VLSEDODSYLCNVTLFR/RRAVDDFODTKPGFMKP\ TVPDPDYNFPF 
\MKADKK\EMDRAST\DKKKQFGP\LVRW\LKVNFSEAFIAWIHVKA 
LRVFVESVLRYGLPVNIi\QHMIFHRNNNPFKPLRQVIiHESYIHLDSS 
A\AAIIDAPMDIPGL\NLSQQBYYPYVYYKIDCNIj1iKFK 

1630 

A 

569 

1050 

PLESRRLARSSGGWAGITGTPMNIFTGPDPGPSERSAQPRVWDSTCC 

lksncwfrkvkattppmssmmrqiprmymna/wekvqvvtegrqhtn 
egdhehdd aqededdg ws qegtfkg f i fl p lnlc idahrqdqg pnqt 
cnpsti>ggrggqitrpgdrdh 

1631 

A 

1 

595 

MGTDSRAAKALLARARTLHLQTGNIiLNWGRLRKKCPSTHS EELHDC I 
QKTLNEWSSQINPDLVREFPDVLECTVSHAVEKINPDEREEMKVSAC 
SVLGVAQLDSVIIASPPIEDGVNLSLEHLQPYWEELENLVQSKKIVA 
I GTSDLDKTQLEQL YQWAQVKPNSNQVNLAS CCVMPPDLTAFAKQFD 
IQLLTHNDPK 

1632 

A 

2 

1121 

ARGCGRSSRSRSRSHRCLPFPPPPSRRPASLGPERRPGSSRAAPAAS 
RSLSGLSRASGTASCGRPPAPPPaPQPT.paf2PwaapaMf2 r rnQpaaifa 

LLARARTLHLQTGNLLNWGRLRKKCPSTHSEELHDCIQKTLNEWSSQ 
INPDLVREFPDDL\ECTVSHAVEKINPDEREEMKVSAKLiFIV\ESNS 
XSSSTRKVQLTWACSVLGS/VAQLGFLWIHWLSPPI *KDGGLIFPWE 
HLQPYWEGI * KNLVQRQKGLLP*GTSDIiDKTQLEQLYQWAQVKPNSN 
QVNIJ\SCCVMPPDLTAFAKQFDIQLLT\HNDPKELAFWKASFQGRLF 
QGKAFP\D I S SGTEWGAAVGLLR YS V I VKS RGI I KS KG Y I LQAKRRG 
S 

1633 

B 

63 

458 

SLENTVSTAISKAQNGAJ>SWGGYPSIHAAYQLPGTVKPLPAAVQSVQ 
VQPSYAGGVKSLSSAEHNALLHSPGSLTPPPHKSNVSAMEELVEKVT 
GKVNIKKEERPPEKEKSSLAJCAASPIAJCENKDFPKTEE* 

1634 

A 

3 

2303 

EMEGKEDAQKVLKCMYCGHSFESLQDLSVHMIKTKHYQKVPLKEPVP 
AITKLVPSTKKRAIjQpIAPPCSPEPAGMAAEVALSESAJGDQKAANPY 
VTPNimYGYQNGASYTOQFEAJ^KAQILKCMECGSSHDTIiQELTAHMM 
VTGH FLKVTTS AS KKGKQLVLDPWEEK I QS I PLPPTTHTRLPAS S I 
KKQPDSPAGSTTSEEKKEPEKEKPPVAGDAEKIKEDSEDSLEKFEPS 
TLYPYLREEDLDDSPKGGLDILKSLENTVSTVINKAQNGAPSWGGYP 
SIHAAYQLPGTVKPLPAAVQSVQVQTSYAGGVKSLSSAEHNALLHSP 
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GSLTPPPHKSNVSAMEELVEKVTGKVNIKKEERPPEKEKSSLAKAAS 
P I AKENKDFPKTEEVSGKPQKKGPEAETWEAKKEGPLDVHTPNGTEP 
T , TCA KVTNGCNNTjG T T MDHS PF. P S P TM P Q AT »C> «5 T MNTWT *Cl KM Q TCP V <5 
PSL»DPIiAMIjYKISNSMLDK\ PVYPATPVKQADAIDRYYYENSDQPID 
LTKSKNKPLVSSVADSVASPLRESAL^ISDMVKNLTGRLTPKSSTP 
STVS EKSDADGS S FEEALDELS PVHKRKGRQSNWNPQHLlg I LQAQFA 
SSL\SETTEGKY IM\ SDLGPQERVHI \ SKFTGLSMT1*ISHWLANVEV 
TSLRRTGG\TKFLKEPGTQGHPVFF\CNDCASQFRTASTYISH\LET 
HLG\ FSLKDLS KLPLNQ I QRQQ\NV* KVLTNKTLGP \ LGATEEDLGS 
TFQCKLCNRTFAKQARSQTAP 

1635 

A 

133 

500 

YNTVNYKSHPEGQSMCWSMPVITATFGNPRRVDQPLRSGVQD\QPGQ 
HGKIPSLLKIQKLAGHG\GACIi*SQLLGRLRKENCLN/SPGGGGCSE 
PSSRHC\IPAWAIA*DTI*KIKK*KPPKMRN 

1636 

B 

1568 

1588 

MGDQQLYKTNHVAHGSENLFYQQPPLGVHSGLNHNYGNAVTGGGMDA 
PQASPISPHFPQDTRDGLGLPVGSKNLGQMDTSRQGGWGSHAGPGNH 
VQLRGNLANSNMMWG APAQAE PTDGYQ YTYSQAS E IRTQKLTSGVLH 
KLDS FTQVFANQNLR I QVNNMAQVLHTQS AVMDG APDSALRQLLSQK 
PMEPPAPAIPSRYQQVPQQPHPGFTGGLSKPALQVGQHPTQGHLYYD 
YQQPLAQVPVQGGQPLQAPQMLSQHMQQMQQHQYYPPQQQQQAGQQR 
ISMQEIQTQPQQIRPSQPQPPPQQQQPQQLQLQQRQGSMQIPQYYQP 
QPMMQHLQEQQQQQMHLQPPSYHRDPHQYTPEQAiTrVQIjIPLGSMSQ 
YYYQEPQQPYSHPLYQQSHLSQHQQREDSQIiKTYSSDRQAQAMLSSH 
GDLGPPDTGMGDPASSDLTRVSSTLPHRPLLSPSGIHLNxvTMGPQHQQ 
LSPSAMWPQYLLSAYC^PSIELGFENAAWSTDLLPHPMGMVNKDLA 
KPDKLLKRAVRRLPQELAQMSLSGEAFGGAGLRLTLEGGQPKGAFGE 
QFDAKNKLTCS I CLKEFKNLPALNGHMRSHGGMRAS PNLKQLP PEAK 
SLTPMVMPVSVPVKLLPPKPSSQGFTNSTVAAPSAPJDKPASSMSDDE 
MPVLTPLLEAFJ4APWKAJ5KX.REKQ 

PRKHQPSVPKAEEPLKTVQEKKKFRHRPEPLFIPPPPSYT^PNPAASY 

SGATLYQSQLRSPRVLGDHLLLDPTHELPPYTPPPMLSPVRQGSGLF 

SNVLISGHGPGAilPQLPLTPLTPTPRVLLCRSNSIDGSlWTVTPGPG 

EOTVDVEPR INI GLRFOAE I PELOD I SAIiAODTHKATLiVAIjEMLIjLR 

KPVRLKCHPLAlTiraADGPGSYITSGVA^ 

GTEHCFWGQKSGSDKWTSLERKLFNKAXiATYSKDFIFV^ 

AQCVEYYYTWKKIl^LGRKHRTRloAEIID 

PEEDRKSTKEEESEVPKSPEPPPVPVLAPTEGPPLQALGQPSGSFIC 
EMPNCGAWSSRQALNGHARIHGGTNQVTKARGAIPSGKQKPGGHPE 
WHEKEYYLGEMLPKWFLKLYHENI PS YI SKARVFFKI KSRNAHMKTH 
RQQEEQQRQKAQKAAFAAEMAATI ERTTGARGGAGAAVPWTS * 

1637 

A 

2 

251 

FFFFFLINKTKRLFTP*AIiQWGYPSGSCGSVSQSCKCIL»RGRSRATI 
SIEAEMVDL 

1638 

A 

36 

531 

NKVLPPAASEHSDCQISKHQVQLCP/PNIITLADIVKDPVSRTPAI#V 

FEHVNim)FKQLYQTLTOYDIRFY^EILKALDYCHSM 

HNVM I DHEHRKLRXi I DWGLAE F YH PGQE YNVR VAS R YF KG PELLVD Y 

QMYDYS LDTWRMGCMLAS M IIQKEP 

1639 

A 

1 

1833 

MPJITQPAVTGG0QIESVRRGMQAASSRKGQSQLTTSKEVGTQSYSCK 

EI^SANSI^CGSKPSPEFPAGNPPTGYPHIJU.RVIASGVKFHIVTL 

FCLYGWKRCIDDDYI VRDSTGI PI SLRNQDDYQLVRKLGRGKYSEV 

FFAINITXWEKVVVKILKPVIOCKKIKRBIKILENLRGG 

VKD P VQL YQTLTD YD I RFYMYE I L KALD YCHSMG I MHRD VKPHNVM I 

DHEHRKLRLIDWGLAEFYHPGQEYIWRVASRYFKGPELLVDYQMYDY 

SIJ^MWSLGCMlxASMIFRKEPFFHGHDlsrYD 

DKYNIELDPRF1TOILGRHSRKRWERFTO 

RYDHQSRLTAREAMEHPYFYTVVT<1DQARMGSSSMPGGSTPVSSANMM 
SGGWYLVLFSRDYKNVASQGRQEERKEGKKEGRTQSIGAVDCLLVAY 
ITLLHKRFSGVILVALVEVCLSYIKMLKIYPKCRQILKTSVHCDWFR 
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S GYTGRGLLEKVFWRAE I RRQST6PKS KERQAQNGRDDMGTGYL»VNT 
S GHWTQPGTRNEGNVPAGPTAIjDDAAS sddts prhylgdaeednsgg 

1640 

A 

3 

432 

GWYLVLFSRDYKNVASQGRQEERKEGKKEGRTQSIGAVDCLIiVAYIT 
LLHKRFSGVILVALVEVCLSYIKMLKIYPKCRQILKTSVHCDWFRSG 
YTGRGLLEKVFWRAE I RRQS TGFKS KERQAQNGRDDMGTG YLVNTSG 
H 

1641 

A 

137 

1368 

FHISLFEENRVLKQGgiiPPPAAFLNTVTAQISQTSKSQALSRQPCSD 
HVGDPVPKAGHRV\YTDVNTHRPREYWEYE\SHWEWGNQDDYQL\V 
RKLGRGKYSETVFEAINITNNEKVVVKILKPVKKKKIKR5\IKIIjGGI 
WRGGPNI ITLADI \VKDPVSRTPAL\ VFEHVNNTDFKQLYQTLTDYD 
I RF YM YE I LKALD YCHS \ RG I MHRD VKPHNVMI DHEHRKL * L I D WGL 
AEFYHPGQEYNVRVASRYFKGPELLVDYQMYDYSLDMWSLGCMLASM 
IFRKEPFFHGHDNYDQF/VCRMAKVLGTEDLYDYIDKYNIELDPRFN 
GYLG\RHSRKPMGTALSNSENQH/LLSSPEALDFLDKLLR\YDPPSH 
GLLAREAMEHPYFLHCCEGPRPRNGFHLAMPGCSTPVQQRPI 

1642 

A 

874 

1183. 

TPMLEQLGNRYLQNIACYPFRNMCTYQLGCSGSRL*SQHF\GRPRQV 
DHLRSGVRDQPGQQGETPSLLKVQKFSWAWWRTPVIS\ATWEAEAGB 
VLEPGRRRLK 

1643 

A 

2 

498 

APSPDAMGHFTEEDKATITSLWGKVNVEDAGGETLGRIiLVVYPWTQR 
FFDS FGNLS SASAIMGNPKVKAHGKKVI/TSLGDAI KHLDDLKGTFAQ 
LSELHCDKLHVDPENFKLLGNVLVTVLAIHFGKEFTPEVQASWQTIV 
TGLGNALCFTKHLDFMCMMQSFKE 

1644 

A 

49 

538 

SQTPMGHFTEEDQGLLSKSLWGK/VliNVEKMLGRKKTPLGKGSPLVV 
PTP\WDPKRFL*TSFGKTLSPPAIiLPHQWANPPK\VKGHHGKEGCLT 
FPWEDAHKAPLDDPQRAPFAPA*SELHCDKXiHVDP\ENFKLLG\NVIj 
VTV\LAIHF\GKEFTPGGCRASWAEDG 

1645 

A 

53 

394 

RETGLPALGTCAVPLARAAPILQRLQQSPRTSLKARPKTRITGALPM 
DHTEGSPAEEPPAHAPSPGKFGERPPPKRLTREAMRNYLKERGDQTV 
LILHAKVAQKSYGNEKRFF 

1646 

C 

116 

379 

MYLCLSQERIIQFQIiNGRWGRRQCLNLQGQNFTSKFTECGFGGCRKA 
GNYVTGVGGGVWLLCS S PD I SG I SEEGWEMGSRXTS SGVPV 

1647 

A 

3 

202 

CIASGTKVALFWLRSQTVSTRYLHVEGVDDDESEGEEFTVRDGYIH 
YGQTVKLVCSVTGMALPRL 

1648 

A 

1 

639 

MRSAARGPRQSCSAFNRFRAANSSSPGFGAPCGRQCWIWESLGKERA 
KEGKDGGLQSPRTSLKERPKTRITGALPMDHTEGLPAEEPPAHAPSP 
GKFGERPPPKRLTREAMRNYLKERGDQTVIilLHAKVAQKSYGNEKRF 
FCPPPCVYIjMGSGWKKKKEQMERDGCSEQESQPGAFIGIGNSDQEMQ 
QL/NTLEGKNYCTAKTLYISDSDKR 

1649 

A 

111 

750 

GKREGAGERDQGRRRGESREGWSFGESLWKMAPWTGKFGERPPPKR 
LTREAMRNYLKSRGDQTVLILHAKVAQKSYGNEKRFFCPPPCVYLMG 
SGWKKKK\EQMERDGCSEQESQP\CAFIGIGNSDQEMQQLNLE\GKN 
YCTAKTLYISDSDK\RKHFMLSVKMFYGPTS\DDIGVFIi\SKRIKVI 
SKPSKKKQSLKNADLCIASGTKVALFNR 

1650 

A 

1 

519 

I ISTD/ KAEYT/ FYEGMGPVFTP VTPVP WE S LQLNAGG D VAMLELT 
GQNFTP \NLRV/ WFGDVEAETMYRCEQSMLCVVPD I SGFR\ EGWRWV 
RQPVQVPVTL\VR\NDGI\IYSTSL\TFTYTPEP\GPRPHCSAAGAI 
LRANSSQVPPNESNTNSEGSYTNASTNSTSVTSSTATWS 

1651 

A 

238 

1713 

STMAW I KRKFGERPP PKRLTREAMRNYLKERGDQTVL I IjHAKVAQKS 
YGNEKRFFCPPPC^UyiGSGWKKIKIiQMKCDGCSEQGSQPCAFIGIG 
NSDQEMQQLNLEGKNYCTAKTL YI SDLDKQKHFMLS VKVFYGNSDD I 
GVFLSKSSKPSKKKQSLKNADLCIASGKERWAIiFNRLLSQTVXSTRY 
LHV/BKEGNFHASSQQWGAFFIHLI>DDDGSEGEEFTV*JDGYIHYGQT 
VKLVCS VTGMALPRL 1 1 RKVDKQTTIiLDADDPVSQLHKCAFDLEDTE 
RKYLCIiSQERIIQFQATPCPKEPNKEKINDGASWAIISTHKTlKHTFY 
RESGPLSLAP/VSCPPALSVESLKLNGGGDEPSLKLNRTEFQLPKFY 
KVWFGDVE\AEAMYRCG\ESMLRVVPDVS7VF*EGWR\YSQQP I QVSV 
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TLVRNDGIIYSTSLTFTYTPEAGPRPHCXSVAGAILKASSSHVPPNB 
LNTNSDGSYTNASTNSTSVTSSTPTWS 

1652 

A 

1 

309 

FF*DRA*L\CPPGWSARSQHTWSTFLGSSKFSHIiGPPELPGDHRHA 
PPCPANFFYFS*RWGLPMLLVSNSQAQAILLPSPFQKGWDYRAWGHH 
TWGSYLNFNE 

1653 

A 

259 

331 

RNE * LLTR FS PLS LSH * VLGVSMF 

1654 

A 

569 

4150 

1 1 SliNKGPAEKGKGGNGGGKPPSGPNRMNGHHQQNGVENMMLFEVVK 
MGKSAMQSVVDDWIESYKHDRDIALLDIiINFFIQCSGCKGVVTAEMF 
RHMQNSEIIRKMTEEFDEDSGDYPLTMAGPQWKKFKSSFCEFIGVLV 
RQCQYS 1 1 YDEYMl^WISLLTGLSDSQVRAFRHTSTIiAAMKIiMTAIi 
VNV7VLNLSINMDNTQRQYEAERNKMIGKRANERLELLLQKRKELQEN 
QDEIENMMNAIFKGVFVHRYRDAIREIRAICIEEIGIWMKMYSDAFL 
NDSYLKYVGWTMHDKQGEVRLKCLTALQGLYYNKELNSKLELFTSRF 
KDR I VSMTLDKE YDVAVQA I KLLTLVLQS S EEVLTAEDCENVYHLVY 
SAHRPVAVT^AGEFLYKKLFSRRDPEEDGMMKRRGRQGPNANLVKTLV 
FFFLESELHEHAAYLVDSMWDCATELLKDWECMNSLLLEEPLSGEEA 
LTDRQES ALIE IMLCTI RQAAECHPPVGRGTGKRVLTR\ KEKKTQLD 
DRTKVTELFAVAIiPQLIAQYSVHAEKVTNLLQLPQYFD*EIYTTGRIi 
EKHLDALLRQIRNIVEKHTDTDVLEACSKTYHAIjC!NEEFTIFNRVDI 
SRSQLIDELADKFNRLLEDFLQEGEEPDEDDAYQVLSTLKRITAFHN 
AHDLSKRDLFACNYKLLKTGIENGDMPEQIVIHALQCTHYVILWQIA 
KITESSSTKEDLLRLKKQMRVFCQICQHYTjTNVNTTVKEQAFTILCD 
ILMIFSHQIMSGGRDMLEPLVYTPD\SSLQSELLSFILDHVFIEQDD 
DNNSADGQQEDEASKIEAIjHKRRNLL7UVFCKLIVYT\A^EMNTAADIF 

ngynfdrssstfsgikelarrfaltfgldqlktreaiamlhkdgief 
afkepnpqgeshpplnlafldilsefsskllrqdkrtgyvylekfmt 

FQMSLRREDVWIiPLMSYRNSLLAGGDDDTMSVISGISSRGSTVRSKK 

skpstgkrkvvegmqlslteessssdsmwltreqtlhtpvmmqtpql 
tstimrepkrlrp\edsfmsv/yypmqtehhqtpldynrrgtslmed 
deep/ilwedvmmsseg\riedlnegmdfdtmdidlppsknr\rert 

E\LKADFFDP\ASIMDESVIiG\VSMF 

1655 

A 

1 

2325 

mgkkrtkgktvpiddssetlepvcrhirkgleqgnlkkalvnvewni 
cqdcktdnkvkdkaeeeteekpsvwlclkcghqgcgrnsqeqhalkh 

YLTPRSEPHCLVLSLDNWSWCYVCDNEVQYCSSNQLGQVVDYVRKQ 
AS ITTPKPAEKDNGNIELENKKLEKESKNEQEREKKENMAKENPPMN" 
SPCQIT\nCGLSNLGNTCFFNAVMQNLSQTPVLRELLKEVKMSGTIVK 
IEPPDLALTEPLEINLEPPGPLTLAMSQFLNEMQETKKGVVTPKELF 
SQVCKKAVRFKGYQQQDSQELLRYLLDGMRAEEHQRVS KGI LKAFGN 
STEKLDEELKNKVKDYEKKKSMPSFVDRIFGGELTSMIMCDQCRTSG 
KKSVNDK^KKTVEDEDQDSEEEKDNDSYIKERSDIPSGTSKHLQKK 
AKKOAKKOAKKORROOKI OCKVlVHTiNn tptt nuppn cr? VT?2iT?MQT.nr! 
EVNI KSNH I S QEG VMHKEYCVNOKDLNGOAKM I ES VTDMOKS TEKVD 
MKNTNMDNDLEVLTS S PTRNLNGAYLTEGSNGEVD I SNGFKNLNLNA 
ALHPDEINIEILNDSHTPGTKVYEVVNEDPETAFCTLANREVFNTDE 
CS IQHCLYQFTRNEKLRDANKLLCEVCTRRQCNGPKANI KGERKHVY 
TNAKKQMLI SLAPPVLTLHLKRFQQNVAEENTRVLYSLYGVVEHSGT 
MRSGHYTAYAKARTANSHLSNLVLHGDIPQDFEMESKGQWFHISDTH 
VQAVPTTKVLNSQAYLLFYERIL 

1656 

A 

3 

128 

ATSHNAAFRKLLLSRGHSGVIRRSQFVTRRKTELAAQPKGP 

1657 

A 

203 

2698 

SANMGKKRTKGKTVPIDDSSETLEPVCRHIRKGIiEQG\NIjKK\AIjVN 
VEWNICQDCKTD\NKVKDKAEEETENKPS \ vwlcl\ KC\GPQGFGRN 
FSR/ EQHALKHYLTPRS EPHCLVLS LDNWS VWCYVCDNE VQYCS SNQ 
LGQWDYVRKQAS ITTPKPAEKDNGNIELENKKLEKESKNEQEREKK 
ENMAKENPPMNSPCQITVTCGLSNLGNTCFFNAVMQNIiSQTPVliRELL 
KEVKMSGTIVKIEPPDLALTEPLEINLEPPGPLTIiAMSQFLNEMQET 
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KKGWTPKELFSQVCKKAVRFKGYQQQDSQEIiLRYLLDGMRTVEEHQR 
VS KG I IiKAFGNSTEKLDEELKNKVKD YEKKKSMPS FVDR I FGGELTS 
MIMCDQCRTVSLVHESFLDLSLPVIJ)DQSGKK5ANDTP\LTKTVTYA 
ECYSEEKYHTDCCI Y* RSDI SS GTRKHLQKKATKQAiOCQAKNQRRQQ 
KI SGPALHLND IRTIDHPEDSEYEAEMSLQGEVNITSNHISQEGVMH 
K\ E YCVNQKDLNGQAKMI ES VTPNQKSTEEVDMKNINMDNDLEVIiTS 
SPTRNIiNGAYLTEGSNGEVDISNGFKNLNLNAALHPDEINIEILNDS 
HTPGTKVYEVVNEDPETAFCTLANREVFNTGE\CQIQHCLYQFTRNE 
KLRDANKLLCEVCTRRQCNGPKANIKGE\RKHVYTNAKKQMIiISLAP 
PVLTLHLKRFQQAGFNLRKVNKHI KFPE I LDLAP FCTLKCKNVAEEN 
TRVLYSLYGV\VEHSGTMRSGHYTAYAKART\ANSHLSNLVL\HGDI 
PQDFEMES KG\ QWFHI SDTHVQAVPTTKVLNS \ QA YLLFYERI L 

1658 

A 

992 

1863 

G I WRNVHRQPQLE S CEPTPAC S GRACACCPVS CGWSHGQDWMVPVAG 
RCTRAQRCCTGGASLPTVHKSTLSSCSAPPADSAACVFIYFIIIF*R 
QSLNSVAQ\AGVQWR\NLKLLQPLPPAFKA\FSCLSLLCNWDYRRVP 
PGLANFC I F S RDGGFTML WRLVSNS * PCDLPAS AS QS AG I T\ AL SH 
HAW\LLFFETESRSWQAGVQWCDIiGSLQAPPPGFTPFSCLSLQSSW 
DYRRPPPRPANF/CVFIiVETGFHC*PGWSRSPDLMIPPGLSLPKCWD 
CRRDTKHPASKF 

1659 

A 

318 

1681 

SPRMHALVLLLCIGALLGHSSCQNPASPPEEGSPDPDSTGALVEEED 
PFFKVPVNKLAAAVSQTSAYDLYRVRIQA*APRPNVLPVSLFKCGPT 
ALSA\LSLGKRSKRNKNPIIHR\ALYYDIiIKQAPDl\HGYLIRKLP* 
HGHFPPQKNLKSASR\ I VFEKKLR I KS S FVAP \ LEKS YGTRPR\ VLT 
GNP\RLDL\QEINW\WQA\Q\MKGKLARSTKEIPDEISIVIiL/GV* 
AHFKGQ\WETKFDSRKTS\LEGFLLGMKERTREGPP*LSDPKGCFYA 
MGLGFRFSACKICPACPLTG\SMSIIFFIiP\LKVTQ\NLTLI\EESL 
TLRS*FMTIDPRT*KTVAGGPSLSPKLKL\SYEGEEHPKFLAGR*SL 
QSLV*FHPDFSKI\TGK\PIKLDFRVEHPRLAFEWNE\DGAGNHPPS 
PRGLQPAHlA TFPLD YHLNQPF I FLLRDTDT\ GALLFI GKI \ LDPRG 
P 

1660 

A 

3 

340 

DLDLRGGQPVCRGGTQRPCYKVIYFHDTSRRLNFEEAKEACRRDGGQ 
LVSIESEDEQKLIEKFIENLLPSDGDFWIGLRRREEKQSNSTACQDL 
YAWTDGS I SQFRETS S S F 

1661 

A 

2 

501 

GKPDPSTKKQHTIWPSPHQGNSPDLEVYNVIRKQSDVSIjAETRPDLK 
NISFRVCSGEATPDDMSCDYDNMAVNPSESGFVTLVSVENEIYGY*D 
I*KTETDNNGKEMISKILLFSIRKIHRRSMNKLRSGPVDEHVVPTTS 
CWTPTFWLYPLSQPVI QLDU4RRYL 

1662 

A 

3 

52 

GRSKESGWVENEIYGY 

loo J 

r* 

<L 

92 

244 

MRCEIILVLIPYVYFYSNKLIiCSRLXXXXXGGAVIjKNPWGGQSLPGL 
AR* * 

1664 

A 

336 

413 

GPWTVLGAGACGGEVHFISYRPGSC 

loo J 

A 

233 

401 

GGAVLKNPWGGPSLPGLAR**FFPYRGAY*NLPGNFWKEPLFLGGDI 
LGQPPFGNL 

1666 

A 

194 

360 

GGAVLKDPWGGQSLPGLARK* FFPYGGPN*NIiPGNFWKGPLLWGGDI ! 
LGQPPYRN 

1667 

A 

319 

391 

FFPYGEPYYNQPGDFWEGTTiFLGG 

1668 

A 

313 

542 

ALKQPT/ PQTKEERAFDPRVHAE * I P YVFE IHI RST* KTT*NGNPTA 
PLPVRAPTPARVRTWPNPGHSCAGSHHSSR 

1669 

A 

143 

1316 

ERLEIGKELQLVWDEPHLTPGNDSLLPSSCCVTAASDLDLRGGQPVC 
RGGTQRPCYKVI YFHDTSRRLNFEEV\ KFSCRRDGGQLG S IBSEDEQ 
KLIEKFIENLLPSDGDFWIGLRRREEKQSNSTACQDIiYAWTDGSISQ 
FRNVTYTOEPSCGSEVCVVMYHQPSAPAGIGGPYMFQWNDDRCNMKNN 
FICKYSDEKPAVPSRRS * \GEETELTTPVLPEETQEEDAKKTFKESR 
EAALNLAYILI PS I PLLLLLWTTS \ VCWVWICRKRKREQPDPSTKK 
QHT I WPS PHQGNS PDLE VYNVS KKTNAKS FLS ETRPDLRK I S FR VCS 
EES \ PPDDMSCDYDNMAVNPSESGFVTLVS VESGFVTND I YEFSPDQ 
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MGRSKESGWVENEIYGY 

1670 

A 

18 

688 

SPPPPPPAREMNFVRAANRRPRRVSRPRPVQQQQQQPPQQPPPQPPQ 
QQPPHQQPSSPPQ*QQQQHPPASSPPPPPLPQERNNVGERDDDVPAD 
MVAEESGPGAQNSPYQLRRKTLLPKRTACPTKNSLEGASTSTTENF\ 
GHRAKRARVSGKSQDLSAAPAE\QYLQEKL\PDEWLKIFSYLLEQD ■ 
LCRAACVCKRFSELANDPNLWKRLYMEVFEYTRPMMH 

1671 

B 

155 

310 

MAAIRKIO.VIVGDGACGKTCLLIVFSKDQFPEVYVPTVFENYVADIE 
VDGKQ* 

1672 

A 

162 

877 

AMAAIRKKLVIVGDGACGKTCLLIVFSKDQFPEVYVPTVFENYVADI 
EVDGKAGRSLACGDTAGQEDYD\RLRPLS *PDT\DVIL\MCFS I DSP 
\DSLE\NIPRKSWTPEVK\HF\CPNGGPSILVGELRRVLSGIDGATQ 
GRGLRPRLKAGS PVET * RKGRDMGKQGLALFG YI GS CS SQRPKDWEV 
EEEVF * KWATE S LLWQA * TLGKKKSGVPLS LVKPLLQAQPLMRL I FE 
VLFINLSV 

1673 

A 

1 

711 

MREEARSRELWAGASRHVPVPVYKNLQLFMENKDPRDDLFDRLTTTS 
LNKHLQELMDGLTAKVFRTYNAS ITLQEQLRALTRGLPGGGAEAGCS 
GALTLGMCFACTRGPQRPVLCCPGDNLGAGAALGLWDAEELAGIQVR 
APGEETPPPLREAEDSGPCLSAEDS IAAKILS YNRANRVVAILCNHQ 

RATPSTFEKSMQNLQTKIQAKKEQVAEARAELRRARAEHKAQGDGKS 
RR 

1674 

A 

1 

2647 

MGVTSAASGLVGSAPQCVALPSEGWTLAVWPVAACTCSGVGSSPKLT 
PGSFVHCPWFLLLTEATRAEIKRPFSKAELKASVRPMEGSHCWGGEA 
RRTSQGHTEQKGELRARRQAQNEDEDVKEVWVGKTKKEESDKLGCQG 
AWVPPRVPSWIGRFELWWGYWEQPAVSWQMRVVRLLRLRAALTLLG 
EVPRRPASRGVPGSRRTQKGSGARTDSAWRRAIiTVISTSPGTSRMDP 
VALVAVGGPRRFPGGHTLQRLPVALRTL I PADQAHQAPNSTTWIjG SR 
AGLLALAAGLGGIRDSAAHGPLQVGGMGGYTGMGSEVRWEKEKHEDG 
VKWRQLEHKGPYFAPPYEPLPDGVRFFYEGRPVRLSVAAEEVATFYG 
RMLDHE YTTKE VFR KNFFNDWRKEMAVE EREV I KS LDKCD FTE I HRY 
FVDKAAARKVLSREEKQKLKEEAEKLQQEFGYCILDGHQEKIGNFKI 
EPPGLFRGRGDHPKMGMLKRRITPEDWINCSRDSKIPEPPAGHQWK 
EVRSDNTVTWIiAAWTES VQNS I KY IMLNP\ CSKLKGETAWQKFETAR 
RLRGFVDEIRSQYRADWKSREMKTRQRAVALYFIDKIALRAGNEKED 
GE7UUDTVGCCSLRVEHVQLHPEADGCQHWEFDFLGKDCIRYYNK\V 
PGEKPVY*NLQLFMENKDPRDDLFDRLTTTSIiNKHLQELMDGLTAKV 
FRTYNAS I TLQEQLRALTRAEDS IAAKILS YNRANRWAILCNHQRA 
TPSTFEKSMQNLQTKI QAKKEQVAEARAELRRARAEHKAQGDGKSRS 
VLEKKRRLL * KLQE \ HLAHLS VQATD \ KEENK\ QVALGTSQLNYLDP 
RISIAWCKRFRVPVEKIYSKTQRERFAWALAMAGEDFEF 

1675 

A 

2002 

2238 

KGDFTKLPCL/C*SVPAFY*RSLKICCSIYLV*YMSVSVIESICYKY 
TVFCSRG 

1676 

A 

3 

430 

S AAEVNI QQ VS QKEAKHKI TS ADGH I E S S ALLKEKQRHRLHKFLCLT 

VGKPMRKTFVS QAS ATMQQYAQRDKKHE YWFAVPQERTDHL YAFF I Q 

WSPEIYAEDTGEYTREPGFIWKKIEESETIEDSSNQAAAREWEWS 
V 

1677 

A 

263 

J*. 

899 

ISYQEGTSAIQRK*QEVTLRK*TQESE/ SAGNDSASTAPRSTEESLS 
EDVFTESELSPIREELVSSDELRQDKSSGASSESVQTVNQAEVESLT 
VKS E S TGTPGHLR SDTEHS TNE VGTLCHKTDLNNLEMA I KEDQIADN 
FQGISGPKEDSTSIKGNSDQDSFLHENSLHQEESQKENMPCGETAEF 
KQKQSVNKGKQGKEQNQDFTGQRAG 

1678 

A ] 

1165 

1530 

VKNGGNEVIICHFHLTFGIYIiLFFETEFCSCRPRLEC\NGAILAHCN 
LRLPGFKRFSCFS LPCC * DYRHLPPRPVKFFVVTjVETG FHYLGQAGL 
KLLTPGDL/PPPLGLPKCVSHCAQPRVSTF 

1679 


197 

843 

RLFSSNQTVDHSQKNVDITLKGTPQ*SCKGPRGT\LR\RDFN\HIKC 
GTSALLGKEKKRGFRVD\KWWG\SRRELGYPFGTI\CSHVQDHDPRG 
VTTGASRYQDEGPVYASPSPHPTVGLSQENGSSLLKSRNFFGVKKYI 
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PQGFRMRPGCLLVS VSQGPRKE * INPLKGNDI * ALLQI PAAL\ I PAS 
PTRLKTKGIRKFFGWVSMSLEKGTVQPGLIE 

1680 

C 

27 

218 

MIiMADIRKEERNHLCRSSRRTWTIIjDRAEYSDHVVLQGAGVGWGTSX 
q p ft. Y ^ pr t p vcz a nvn * 

or r j_j i or jz« l r i vj>\y va" 

1681 

A 

490 

773 

**« V-*- v r vtut\. ± Lj]\\jf v xi.ij jj v VljKiirtn,KHo i^o JjRJjHIjEATQRIj 

RHPGLLRLRGELLLWLIIRPIQTLPLPFAAKGPTRGAGIYPRGKQPV 
EGLWLREHERHSPST.Rt.HT^PJXTnPT.r'WPnT.T dt dppt t t mt t tout 

QTLPLP FAAKG PTWGAG I Y PRGKQPVEGLVVXREHERHS PSLRLHLE 
ATQRLCHPGLLRLRGELIiLWLIIRFIQTLPLPFAAKGPTWGAGIYPR 
GKQP VEGLWLREHERHS PSLRLHLEATQRLCHPG / LPQAQGRAPSL 
AHHQIHPNTPSSFCSQGTYAGGWDLPQGQAAC 

1682 

C 

48 

80 

MGLWLFEHIY* 

1683 

A 

858 

1055 

PSVPVMPLLEPEGQEEPLSDSPVPETSVPVLLFHERQGLGFPGGVGG 
EHRTQGRCGFSAGTLFQC 

1684 

A 

1 

103 

VFFLFGGDGVSliCHPGWSAVA*TQEAEEPFVQI I 

1685 

A 

451 

785 

CSQDGGLRWDLVRHPQTFVPGQESFNDRILKQPVEGLWLREHERHS 
PSL/PASS*GHTASPPPRSPQAQGRAPSVAPPPTTPQICPTTPSSFS 
SKRTYAGGWDLPQGQADC 

1686 

A 

1 

1335 

MNDD I RDSDLPD WRDRTPCVQKKAMDRTKTRFRKRGQ I TGKI TTSRQ 
PHPQNEQSLQRSTSGYPLQEWDDEVLGPSAPGVDPSPPCRSLGWKR 
KKEWSDESEEEPEKELAPEPEETWWEMLCGLKMKLKQQRVSPILPE 
HHKDFNSQLAPGVT)PSPPHRSFCWKRK3*EWWDESEESLEEEPRKv^ 
PEPEEIWVAEMLCGLKMKLKRRRVSLVLPEHHEAFNRLLEDPVIKRF 
LAWDKDLRVSDKI PSEPTI /HGS ITQNPS SGFSDLHPTFBYPS I PQF 
PNEYSHPNTEGGCQS*ARDSPLPS/VSGKLTSAGGLSWWCP*APT*F 
LSSATWPMTWRRTTRTPNKTSSTSCMGRPALAYPWSVTVGSSYAVA* 
TPCVQKKAMDRTKTRFRKRGQ I TGKI TTSRQPHPQNEQS LQRSTSGY 
PLQEWDDEVLGPSAPGVDPSPPCRSLGWKRKKEWSDESEEEPEKEL 
/vPisfiifc. iwv V£,M JjL.l>Li KMKLKQQRVS PI LPEHHKDFNSQLAPGVDPS 
PPHRSFCWKRKREWWDESEESLEEEPRKVLAPEPEEIWVAEMLCGLK 
MKLKRRIIVSLVLPEHHEAFNRLLEDPVT KRFLAWDKDLRVSDKI PSE 
PTILGASPKTLPLASQICIRPSNTPPSRNFQMSTVTPTLRVGASPEL 

^T p V7PYT.PT.WT<*ZiT*lT CDDDT T \r\/OT c ttktt tpift pvt a \mMt?c>r\rvDnnv/\ 
v> a v r" x JjJrijvv&MX^JUoKKlrJjJj V VJrJjo 1 WJjX o V Ijo YlxAJvDrlEEDDEDPKQ 

NIFYFLYGKTRSRIPLVT^nmFQLCRCl^PRARKNRSQIAIiFQKLRF 

QFFCSMSGRAWVSREELEENTGPRGDVDFQQELYSNANGRQQERGEE 

PFVQII 

1687 

A 

385 

889 

LEPTLTEQGYARAVLPIQEVEDVLFGVTjVVLLHVIGQVESQEEMNAIj 
VLPGGEAG PAE I RYDHSQE I LVRHPQ 1 FVPGQES FNDRI LKQPVEGL 
VVLREHERHSPSL/PASS*GHTASPPPRSPQAQGRAPSVAPPPTTPQ 
IGPTTPSSFS SKRTYAGGWDLPQGQADC 

1688 

C 

1 

1869 

MDS PTPHDPAAPLLVTVLES VQKKTKDRTETS FGE IGQI LGKIMTSH 
QPQPQEEQSPQRSTSGYPLQEVVDDEVSGPSAPGTVDPSPPRRSIjGCK 

rkrecldesddepekeij\pepeetwaetlcglkmkakrrrvsi.vlp 

EYYFJVFNRLIxAPGVDPSPPRRSI/SCKRKRECLDESDDEPEKBLA^ 

EETWAETLCGLKMKAKRRRVSLVXPEYYEAFNRLLAPGVDPSPPRR 

SLGCKRIO^ECLDESDDEPEKELAPEPEETWVABTLCGLKMKAKRRRV 

oxj v Ljfr* x x &*\r JMKijijAf t» Vl^l'oFPl^oLGCKRKRECLDESDDEPEKE 

LAPEPEETWAETLCGLKMKAKRRRVSLVLPEYYEAFN^ 

SPPRRSLGCKRKRECLDESDDEPEKEL^EPEETWVAETLCGLKMKA 

KRRRVS LVLPE Y YEAFNRLLEDPVI KRFLAWDKDLRVSDKIPSEPTI 

LGASPKTLPPASRICIRPSNTPPPRNFHMST\rTPMLSYLANDMEEDD 

EAPKQKIFYFLYGKTHSHIPLRPKHWFQLCRPMNPRARKNCSQIALF 

QKRRFQFFCSMRCRAWVSPEELEENTGPRGDVDFQQELYSSANGRHQ 

EGGEEPFVQI I * 

1689 

A 

520 

1235 

WTDFRSIGLMALAGSVLEFSARSKDATPDPP/LGTGKVPSTAPTGAP 
PPGLPTAAFDWLHPFRAGRKKYFPSLLFA*WLCQRSSP*RGADPVI 
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Gli Y li VHRGG ACQTFTIiGNKQTPRIjG IHARPRRRATTS LLTLLIiAFGK 
NAVRCALIGPGSLTSRTRPLTEPLGBKBRREVFFPPRPERVEHNVES 
SRWEPRRRGACGSRGGNFPSPRGGSGVASLERAENSSTEPAKAIKPI 

J-'ivxvo vnviv-ovjr v v xr o Xjr\.xrlv.M. v i\Jc*Xj v iztvi o XjjL//\\j>\rl 

1690 

A 

2176 

2641 

RKTIEEKADPKLQGPFVCLFVLETESCSASQAGMEWPNIiNSLQPPPP 
GFTQFLC/ SQPPE*LGLTGVPPHQAHFCIFSRDGASPC*PGWSPTPG 
LiKFSTCFSLLiKCWDYPHFPPPT.AVFTA T.FMPnprJT fiMT oddvomoud 

QVILLLWPPSVLGFQA 

1691 

A 

112 

410 

LLPPPPMPLRCPFQTLRASRVKEANSLPTGPAGVGRPNCPGSPTQTM 
TLLPVQSRPRQGPGMEHMLRAMGVRAKATEQDAEEPRLGPVLPEHEP 

1692 

A 

2148 

2510 

SQHFGKLRQEDHLRSGVREQPGQHGKTPYLLKIQKL\AGHGGMCLYS 
QLLMRLRQENGVNPGGGACNEPRLRHCTPAWVTEQDSVSKKKTVHKK 
KLNWGSH\VRGET*RTSPCVALDTAHPL 

1693 

A 

178 

730 

IFFFFFFKMESCSVAQAGVQWRDLGSLQAPPRGFTPFSCLSIjPSSWD 
YRRPLPRPANFFYF* *RRGFTVLATMVS IS * PHDLPTLASQNAGITG 
v oriri x v irLbr r or £> JL Hi? \ VAyAGGUWJKJJJjGoFyr'lrxrxrK.r J\.yro 

HLSLPSSWDYRHAPPSLANFFCIFSRDRVSPSWSGWSRTPDLR 

1694 

A 

2 

780 

CWGLRRQRS QDVTTMAWALL I LTLLTQGTGS WAQSALTQPPS ASGSL 
GQSVTFSCSGTSSDIGNYNYVSWYRQHPGKAPKLMIYEVTKRPSG\V 
Fx5\Kr oooKb(iN 1Ao1iXVSGIjQ\SEDEG\DYYCCSMARHHS/ VGWVF 
GGGTQVDPSLGQPKRAALGSLCFPPSLGEASSQPRPTLVCVISDFLP 
GKP * TVA* KA\ DS SP VKAG\ VETTPTPSKQ\ SNNNYAATS Y\ LTLTP 
EPLKVPQEATACRVTP\EGGTLEKTVAPTECS 

1695 

A 

103 

532 

NETQPKACREQNMEGDFSVCP^CK3mTVSANFTLHEAYCLRFIiVLCP 
ECEEPVPKETMEEHCKLEHQQAYGSGIGKRFWFQERLAVIiLRSVKRG 
CEKGRSWKAVRWSPVSRLSMSIPPGAPDLSLWRYSSSGSRCQHKTGL 
TT 

icq/: 

A 
A 


1158 

S CGLGHRKTFS FVSL PARNETQPKACREQNMEGDFS VCRNC * RHWS 
ANFTLHEAYCLRFLVLCPECEEPVPKETMEEHCKLEHQQV\GCTMCQ 
y lrlrlls^oijlir HJvAJNjbuyj^KP 

LCQGCGQFIMHPJ^LAQRRDVCRSEQAQLGKGERISAPEREIYCHYCN 

ynx xrxuiMivx r xxxn T i\j»jw- Lrl/oCir xvlvrl r lr VuWrril Lroo lilr O UAAJSJNVc * ^ 

TMEKDVRPKTRSINRFPLHSESSSKKAPRSKNKTLDPLLMSEPKPRT 
S S PRGDKAAYD ILRRCSQCG I LLPLP I LNQHQEKCRWLAS S KRKTSB 
KFQLDLEKERYYKF\ KRFHF 

1697 

A 

343 

586 

KQQKTSFSSLPRRVNC^SHLVLL/RCDFKNCNIAFETKICQFIIKST 
♦EYMFGFIFLCFFLLYNIPFHICGPRVKSSFCYRH 

1698 

A 

217 

3 60 i 

H 

1699 

A 

1 

390 

WFF! TDETiNRVARHR PR O TT A/MriT OPlCXTD Pi T IP T?T PlPlTTTsnPCT TP TOT T7T71T 

QSALELIMSKYREQMFRL1J4ASKKDDPGIIMKLKEQHSKELQA 
ITEMAAVMRK\ PLKLTSNRVARNKNEYFNIjNKKTKA 

1700 

A 

386 

790 

SL\EEHQSGLGNL* * AKYREQM\LRLLMASKK\DDPG\ I IMKLK\EQ 
HS YD * HWYI VTS PKD S S LMHLDTS LKHLNMDWREGTWKQ I RMYTK j 

1701 

A 

1089 

1296 

CPPPLFFFETEFRSCCPGWSNSSIiHRPPPGFKQFILNFIiG*K/PSYP 
YIjFAOSCAPMCVCVPVP t t ftft 

1702 

A 

2 

367 

RDNTSPISVILVSSGSRGNKIjLFRYPFQRSQEHPASQT/RFSDVUjA 
TIlxATKSEMCGQKFELKIDNVKFVGHPTLLQHALGQISKTDPSPKRE 
APTMI LFNWFALRANADPS VINCLHNLS 

1703 

A 

1 

452 

RCQYSTREAKLI/LALQD/ EVSAMAD/GNE/ GPQSPFHHILPKCKLA 
PJDLKEAYDSLCTSGVVRLHINSWLEVSFCLPHKIHYAASSLIPPEAI 
ERSLKAIRPYHALLLLSDEKSLLGELPIDCSPALVRVIKTTSAVKNL 
QQLAQDALLPPRLP 

1704 

A 

1 

468 

LSPNASVCLYSPIx^QFSHQFPSHDLPSVI^AKFSLPVSIjSEFRNPIxA 


loon 


.1. 0 O O 4-3 . O H 5! & O 2 






PAVQETQLIQMWWMLQRRLLIQLHTYVCLMASPSEBEPRPREDDVP 
SMDNS SAELLPSGDSPLNORMTEN"LLASL.SKHERAATT.<?VPaan>iDi? 
DLRMFARLLHYFRGR 

1705 

A 

2 

118 

NENTRRSQLLMLTOKFRSVLVVTTHEDPVIAVFQAIiLP 

1706 

A 

1 

672 

DADSRFSEVF/LATILATQFEMCGQKFELKIDNVRFVGHPTLHRHAIi 
GQISKTDPSPKREAPTMILFlWGFALRANAnpciVTKrrT T-tmt qp-dttvt 

VLQHEERRCQYLTREAKL I LALQDEVSAMADGNEGPQS PFHH ILPKC 
KLARD/ PQLWL PNQQR * HD PHQPQHGQLQRRATS QRGLATE PEDDGE 
PAGQPVGA*TRSHPQCTRSPEP*GPPHVCQAPSLLPRP 

1707 

A 

230 

345 

TWALKDAAS PRDRS PGR PPS SVEM PTRAARTGGKRRNT 

1708 

A 

464 

763 

LEHSVYGPLLVASl^in^NQLLFRLVAKSEYNSQQCFFNKGIiKRFMYN 
ENTRRSKLLMLFDKFRSVLVVTTHEDPVIAVFKALLPWAPGGGRKAA 
RPGNH 

1709 

A 

3 

156 

GRHHL/EEEIMYNENTRGSQL1^IjF*QSFAACWMVTTHEDPVIAVF\ : 
QALLP 

1710 

A 

- 

1 

1878 

frgtwapsasgsvllrlppppapsssgplrprprphggmrdntspis 
vilvssgsrgnkllfrypfqrsqehpasqtskprsryaasntgdha\ 

DEQDGDSRFSDVILATIIiATKSEMCGQKFELKIDNVRFVGHPTLLQH 
\ALGQISKTDPSPDEGKAP\TMILF*CWLFAL\RANADPSV\INCLH 
1^SRRIATVIA5HEERRCQYLTREAI<LIL\AIX3DEVSAMADGNE\DPK 
PPFHHIL\ PKCKLARDLKEAYDSLCTSGWRLHINSWL\ EVSFCLPH 
i/wwo Jj J. r'*'ii.AXCiKi> XjIVAXKF YJ1A1j1j1jL»SDEKSLLGEI#PIDCSP 
ALVRVIKTTS ACERTCRQLAQDADLA\ LLQVFQLAAHL\ VYWGKAI I 
IYPLCENNVY\MLSP\NASVCLYSPAGPSSSSHQFPFSWTWPSVIAK 
FSLPVFLCQNFRNPIxAPRC^U^TQLH/lQMVW\MLQRR\l,IiIQIi\H 

TYV\CLMASPSE\EEPRPR\EDDVPFT\ARVGGRSLSTP\NAIiSFGS 
PT\SSDDMTLTKPOAWTTP t 5AP , T.T.PQr:nc:DT \ mhomtumt t? /tvoot o 

* * i x xj a ivrynn x x xr 0/A_C»xjxj ir o uUu tr JLi JL bWIjf / nrbJjb 

EHERAAIL\SVPAAQNPEDKlWFA/RGILHYFRGRJiHLEEIMYNEOT 
RRSQLLMLFDKFRSVIjV\VTTHEDPVIAVFQALLP 

1711 

r 

A 

1 

613 

PLKRSDGCNDGRPTRPPTRPDTTVFTSNLKQTRMVHLTPVE\KSAVT 

7VLWGQA*TWMIWGGKALGK/RCWVVLPWDPiaiSFEVLWGx^SQLP\D 
AVNGANP*R*KASMAKPTOTT.nr , PT ct?wt>t tv ut \ tykt\ t vr»TT»T ntiTtnr 

NxxNCDK\LHRGSLKNFR\ LLGQTCLVCVPGPINFWQKNS TPTSCKAC 
L I KKS WIxAWC WLNALGPTS IT 

1712 

A 

208 

748 

ENGTTGEREGSLRTHPGYPRSPLGHHDPHCCKPFLLSTAGQPPNY13M 
LKEEQEVAK\ LGAPQKPCIiPECPRVINIRNE\ TSWPDHVRLVPVQ\ H 
PLHEHLLRI/GFIRFAV YSVKST,RDT?K\ M\/RnvMPr , Difa*rT irorA iro 

E\NLGPDFGHL\HDHFCSS / hh/ hpvlgrp\gpaidqeaslrp\gal 
PVTC I PRTLS S I PRPAPRGQE\ LCP 

1713 

A 

243 

850 

WliTV^OTIiFFPCHOWPAPPTMKMIannP^nnT.avT^appTrPT t ddtcV t 

VIPHPASENLRCPD\HVVIiVP\VSTPFFNETPGCLGLQ*DSPYSVK\ 
SRDRK\ MVGDVNRGARAYAS PAKVPEHLG P * F\ GHPHDHSAHRHPS A 
DLARPI / AIDQEASLRPGALPMT\ CIPRTPTS I PRPAPRSRVLYQPF 
ILTOFS1T4AF1WVHVFLVLL 

1714 

A 

97 

424 

S PALWEAYDGW I TLRSGVQD\ QSGQHGEMPS LLKI QKLAGHDGECIj * 
SQLLRRLRRENHI^LGGRGCSELRSRYCIPAWAPEIAPLHSSLGDLN 
KTLSQKKTKTVSFYI 

1715 

A 

83 

1259 

kmaeevvvvakjfdyvaqqeqeld i kknerlwlldd sks wwrvrnsmn 
ktgf\vpstyvgr\ensarkasivi<1\jl 

paddsfv\dpgerly\dlnmpa\yvkfn\ymaeredels\likg\tk 
vivmekcs\dgwwrgxyngqvgwfpsnyvteesd\splgdhvgslsb 
klaavvltoiintgqvlhvvqalypfs 

endpewwkcrkingmvglvpqklw * pllqs i qltsglgtltfqcgf i 
rpyipgkfvgnpwyygkvtrhqaemalnerghegdflirdsesspnd 
fs vs lkaqgknkhfkvqlk\ etvyc i wqrks ctme * lvehyk\ kapn 

FYlWNKGEKLYIjVKHLSMIIiLTQK 
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1716 

A 

18 

367 

SPPPPPRTTRWWPLRRPRLSLGTRAASLRFSSRKPCQNKPDYGLRSB 
KFR*SRK/A*GRQRP/PREKFPLPFPKKPIEPGEAKPGEIVNGSVRP 
PNMPLYIPTS IALPYFTFLLAVLTL 

1717 

A 

47 

409 

NSYIYMCIYSYINTIYIHIYLESNISLPLNIYISTPT/HIY*RHTV* 
VHTKAYVHML * HVY IH FCLCVHKS FKGTI YRDAS FLES CS KVNTECH 
KLRKVKRKYSRIHHTGIHQSSFIMRKMS 

1718 

A 

89 

1560 

IMKHTNPEPGSFSRFYSLKVAPKVKATAAP\AGAPPQPQDLEFTKLP 
NGLVIASLENYSPVSRIGLFIKAGSRYEDFSNIX3\TTHIiLRLYIQS* 
RTKGASSFQDNPVGIGRQLGGQIKC*PQQGENMGLYWWECL\RGDVD 
I\LM\EFLLIV\TT\APEFRSL<^K*VNLQPQLK\IDKAV\AFQNP\Q 
TH\ AIGNLHAAA/ YTGNAI>ANPLYCPDYRIGKVT\ SEELHYFVQNHF 
TSARM7UjIG\LGVSHPVLKQVA\EQFLNMRG\GVGLSGWQRANYRGG 
E I REQNGDS LVHAAF \ VAKS AVAGS AKPNAFS VLQH \ VLGAGATMS R 
GAATTTS \ HLHQ\ AVSQATQQP\ FDVS AFNARYS \ DLGLFG I \ YTI S 
QGHQLAGDCIK\AA\YNQVKPIRSKKPFFPTQGVSAA\KNKG*KAGI 
PLMVQWKSFLKCSPGRKSGSPGLLV/GIiVPYMPTHPTVPSSQMDSSG 
LMLDIIN/ARAKKFVFWARSSMGSKFGKFGDITPFCIiMEL 

1719 

A 

53 

421 

DTMALTSDLGKQIKLKEVEGTLLQPATVDNWSQIQSFEAKPDDLLIC 
TYPKAASGTTWI QE I VDM I EQNGDVEKCQRAI I QHRHP F I EWARPPQ 
PSVLVLRRCFLLSYFKGVEKAKAMPSPR 

1720 

A 

1 

1260 

MALTSDLGKQIKIiKEVEGTLLQPATVDNWSQIQSFEAKPDDLLICTY 
PKAGTTWIQEIVDMIEQNGDVEKCQRAIIQHRHPFIEWARPPQPSAP 
LA^IJVFEEEKMVTSYCPMEETNADDITTSMEGYWQGEGNMANEPFLE 
LVGLGMGPAAWAVSCTLSTLCFSTLSLSNHRGIPGGICRCVINRKDL 
HNVQATMSPAVKVIXSETLEECFLLSYFKGVEKAKAMPSPRILKTHLS 
TQLLPPSFWENNCKFLWARNAKDCMVSYYHFQRMNHMIiPDPGTWEE 
YFETFINGKWWGSWFDHVKGWWEMKDRHQILFLFYEDIKRDPKHEI 
RKVMQFMGKKVDETVLDKI VQETS FEKMKENPMTNRSTVSKS ILDQS 
I S S FMRKGTVGDWKNHFTVAQNERFDE I YRRKMEGTS INFCMEL 

1721 

A 

403 

1334 

DTMALTSDLGKQI KLER/ EVEGTLLQPATVDNWSQI QS FEAKPDDLL 
ICTYPKAGTTWIQEIVDMIEQNGDVEKCQR\AIIQHR\HPFI\EWAR 
PPQPSGVEKAKAMPSPRILKTSPFHFSWLPPSFW\ENNCKF/LFMLA 

seiakd\cmvs*\yhf\qrmnhmlp\dpvtwkeyf\etfingk\vfw 

GSWF\DHVKG\WW\EMKDRHQI\LFLFYEDIKRDPK\HEIRK\VMQF 
MGKKVE*TVLDKIVQETSFEKIKENPMTNRSTVSK\SILD\QSIFPP 
FMRKGTVG\DWEN\HFTVA\ QNERFDE I \ YRRKMEG \ TS \ INFCMEL 

1722 

A 

2 

645 

HGIQAHGQIPSYKTIGGRDDSFHTFFSETGAGKHVPRLLL*NWKPTV 
MDEVRTGTYCQLFHLEQFITARKI AANNYARGHYTIGKE I IDLVLDR 
IRKLADQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKLE 
FS I YPAPQVSTAWEPYNS ILTTHTTLEHSDCAFMEEGEFSEAREDM 
AALEKDYEEVGVDSVEGEGEEEGEEY 

1723 

A 

87 

1531 

s latmrec i s ihvgqagvq i gnacwelyclehg i qpdgqm / tq vtrp 
lgggodsfntffsetgagkhvpravfvdleptvidevrtgtyrqlfh 
pbqlitgkedaannyargn\ ytigkei idlvldr irkladqctglqg 
fiivfhsfgggtgsgftsiilmerlsvdyckksk\lefsiyp\apqvct 
avvepynyiltdhtt\lehsdcafmvdneaiydicrrnldierptyt 
nlnrl i sq i vs s i taslrfdgalnftltnfgtkl vpf p \ r i plp fcp 
i mpp vhfa * erp pmns fs v\ re i tqmlcfe ps ptrl vk\ cd pr p wvk 
swpcclvatggdvvpkrcqmlpiahpskpkr\tiqfvdwcptgfkv\ 
ginyqpph\ wvpggnla\ kvtreavcmls khhs pfaeawarpgpts f 
dlmlcqacpfvhw ylg\ egmeegefsegrer * gcpfrkd yekv\ gvd 

SVEGEGEEEGKGIIiI t HSLFG 

1724 

A 

84 

1560 

EAriSPLRIiRHQLGSREAATMRECISIHVGQAGVQIGNACWELYCLE 
HGIQPDGQMPSD/RKPLGEGDDSFNTFFSETGAGKHVPRJVVFVDIjEP 
TVIDEVRTGTYRQLFHPEQL I TGKBDAANNYARGH YT I G KE I IDLVL 
DRIRKLA\DQCTG\LQGFLVFHSLGGGNWVLVPPPPCLQLGGGGAGK 
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RIiSVDYGQEVPSWEFSIYPGAPRFPQPWBPYNFHPTNPTPTLGAL* 
LiCPb W VDIsI fciAJ. i D I CRRNLiD I ERPTYTNIiNRIilGQIVSSITASLRFD 
GAIiNVDLTEPQTNLVpypRIHFPLGHIMPPVIFAE\KAYHEPAFL*Q 
RS QMLCFEPAN\ QM\ VKCD PRPG\ KYMLCC \ LIi YP \ GD WPQR I S FL 
trLttr x 1J\1\J/ KJj 1 Anr ijJLJWb.fr' XL/Jr IvuVXJM iyPJ? J7VVFGG \DL»TKVQRA 
VCMLSNT\TAIAEAWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSE 
AREDMAALEKDYEEVGVDSVEGEGEEEGEEY 

1725 

A 

153 

380 

EEYKTQNRFELRSPRLDCSGAISAHCNLCLPGS\SNSHAS\ASK*AG 
ITGMHHHAWDNFCILFSRRWGFCHVGQGWP 

1726 

A 

14 

322 

I FS SE PLEGRPGRPGGARAACGQEGAGK\ AGAAGD * PS P/ PG* GHAA 
APKCREFGHNQIDAGWNQRP/GKPGLVPMWEPCPQPSCPLELSEFPP 
GAHSSWTSNSIY 

1727 

A 

605 

3135 

DSRGQEG * RTGAPHMGDKG PGVSGPPGFQAS I KFGCGQN\ FS P ITLG 
PGPG/PWGGCGQALSPSGVPGLEGVSPTRAKGW\RFPPKAPETLNER 
QI YP\NAPPS * AG* \GHADTEGQDRTPHLLGANSSGHLCGQLPF* SA 
S I GGAGRD \ S QG L SRAF S S AS KHS VPAS AGTF * HS FS KG * VS KTTTT 
WAGNALFPMPGSSKTKKPNSHQRGQMGS*GRNPPSLGRAPAPLPERB 
AP I PAPQLG PS AAGTSRQVGQKS S TS P/ PPGRGGN I EP * TQEERRKE 
KM KKATGLS KHQPAGF I QNE *NLKGAGE F/ GP / SGLAGSQNP PS S KL 
QGLGGKC * EQRL * * GAGPDCS PLGKHTP * RS PS PLPRTGDASRG\ S * 
GFSGKEASFGPGQPQSTCLSGIRPSLGS*PLGQ*RTL/PCSNLPAGK 
/ RNCLG/ PPGLGRGHGRCSDLSQHGSQT*AGANWRKRQ/ PPVPAGLL 
DPG PLTAQQAVTRS P WEGAQGRGGEG P VGLC WG * AC AKCRLQG SGRT 
FAGGRQG SRSGVWVGS EVMAPRKRPPAGPPGHKEGTAEAVS S QTVTG 
GRlPEAVWPHHHQGKGTTEQEPCC*DVTKASAPGVSGDTG/MRGPLQ 
PQASPNI*GAAACPFSQRAGSSLQQRRSLPAPSCPQAA/RGPPGLPG 
LPSSGSEENIHSGAWALVGQEGPSMDGRGNGMMLRGVWTGVHGGGMD 
EMVJRRGDLKGKVPHGMI QVWTP/ G/DKQDS SPARTPAPQWLS I TTGS 
* TPE/ GDPGGKIjDAAQRGRAI AAHEQPEVAVLGVA/GHLi* S PGSARS 
SPRWHPHRSACRPPRSGGS PPSPSSA* KSDRTDAGAGVAAAAS PGAG 
APAHCPQGPPRSCQGPQRR 

1728 

A 

1096 

1748 

ELFPPTSTSIAI1AQLRAI1TQAGQPLTQVNQGAFSMPLWVI1DPRERGD 
LKIKPS FLLLWGWAHLGFQHEALWALGCAF I EERGGEREAFLGPEMF 
bWLjCjJb AHPLP* IHOtWPGEPQ/ EV*GGRHCGKAPREKWPAIiAPTFQK 
EKPWPVTPEAI P VCQEG AP PGTAKS IHCPPE IHTKEACPVPGKEEN 
VPGKRKIWSKKRDRQGRAQESRIQGSEEIP 

1 TOO 

± / Zy 

7V 

A 


TOO 

788 

y KXir r J-iAJbJN X 1Kb r bKTVKTLiS FVLiNQMMCFI S VFDVFS FFS PGFTS 
FVISLCFGFAANLIGLGLAAKALDSGAFFSFVVIiSPSFPLPSCPHHF 
TLLKVIMNTRSEIPFLAPSTLGFFEMESHCVTQ/CSGA/ isahcslh 
LPG * SNFPVS AS * VAGTTGASHDNWLI FLFLVETG FHHADQGGLKF * 
PQI IHPLGLPKWX.GLQCEPCGWL 

1730 

A 

158 

468 

TGAGHGGLMPVIPSHFGRPWRADHLRSGVRDQPGQHG*NPVSTKNTK 
IGWA* WRAPVI P\ AT* EGLRQGESLEPGRAEGARRCHYI PAGGDRVR 
LCLKKKKLN 

1731 

A 

1 

1161 

MS S FEKCLFMS FAHFLMDAEKAFDKI QQPFMLETLNKLG IDGTYFKI 
I RAI YDKPTAN IMI^GQKLEAFPIjKTGTRQGC PI*S PLLFNIVLEVLA 
RAI RQEKE I KG I QLGKEEVKLS LFADDM I VYLENP I VS AQNLIiRI. I S 
NFSKVSGYKINVQKSQAFLYTNNRQTESQIMNELPLTIASKRIKYLG 
IOIiTRDVKDLFKENYKPLIjKE T KEDTNKWKNTPr<5 WVGRTNTVKMAI 
LPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQKRAHIAKSILSQK 
NKAGGITLPDFKLYYK^TVTCTAWYW^ 

EDIHAAKKHMKKCSSSLAIREMDIKTTMRYHIjTPVkMAIIKKSGNNR 

cwrgcgeigtl 

1732 

B 

1 

1380 

MDKFLDTYTLiPRINQEELESIiNRPITASEIVAIINSLSSKKSPGPDG 

ftaefywsvgsagqairqekeikgiqlgkeevklslfaddmivylen 
pivsapnllklisnfskvsgyki^qksqaflytnnrqtesqimsel 


10014 
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PFTIASKRIKYLGIQLIRDVKDLFKENYKPLLKEIKEDTNKWKNIPC 
SWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTILEKNTLKFIWNQ 

WNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICR 
KLKLDLFLTPYTKINSRWIKDLKVRPKTIKTLEENIiGITIQHIGMGK 
DFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNREPTEWEKI 
FATYSSDKGLISRIYNELKQIYKKKQTTPSKSGRRT* 

1733 

A 

1 

293 

MI IS IDAEKAFDKIQQPFMLKTLNKLGIDGTYFKI IRAI YDKPTANI 
IJ^GQKLEAFPLKTGTRQGCPLSLCTSFHCQLGRASERRNISIGMHE 
ETF 

1734 

A 

1 

1185 

MKLKRNNEMSGKALDPREGFCDASYEIQTTIREYYKHLYANKLENLE 

iL-riLj x c xjxj x i x xjFX\J*ivi\4Et& VlioJ-UMKJrx lbAbl VAl XlMoJjir 1 I\J\or*sJr'L; 

GFTAEFYQRYKEELVPFLLKLFQS IEKEGILPNSFYEAS I ILIPKPG 
RDTTKKENFRP I S LMNTD AKI LNKI LANR I QQH I KKL I HHDQVGFI P 
GMQG WFN I RKS INV IQHINRAKDKNHIil I S IDAEKAFDKI QQPFMLK 
TLNKLGIDGTYFKI / Y/ RDRHFSKEDIYAAKKHMKKCSI»SLAIREMQ 
I KTTMR YHLTP VRMAI IKKSGNNRCWRGCGEIGTLLHWCWDCKLVQP 
LWKSVWRFLRDLELEIPFDPAIPLLGVYPKDYKSCCYKDTC/H/IMF 
IVALFTIAKTWNQPKCPTMIDWI 

1735 

A 

66 

1394 

QVLL S FGTPL VLTTKREKNQ I DAI KNDKGP I TTDPTE I Q I TS I E YYK 
HLYANKLENLEEMDKLLDTYTLPRJ^NQEGVESLNRPITGSEIEAIIN 
^ij^JrxoJ^ T iJ>ixxi>ijvXiji\li\.j.jjvjj>l J. yyxlJL isJvJj J.HJxL)y V(jr XirL»MQGWr N 
IRKS INVIEHINRTKDK2JHMI I L IDAEKAFDKI QQPFMLKTLNKLG I 
DGTYLKI IRAI YGKPTVNI IliNRQKLEAFPLKTGTRQGCPLSPLIiFN 
IVLEVLAKAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQ 

NTiT*KTjT , fi'MP' < ?TrV c 5f2Y'K , 'T7J\70"K"Q<^2i TTT VTMNTDnnnucnTMOPT DCTTne 
j.mxjujvxj x uivr ojvvooz X\ J.1M V vXvo^/AJc Jj X irJlMKy 1 x?>oQ 1Mb JaJjxrl* 1 IAi> 

KRIKYLGIQLTRDVKDLVKENYKPLLKEIKEDTNKWKNIPCSWVGRI 

NILKI^ILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIA 

KS I LS QKNKAGG I TL PD FK 

1736 

A 

1 

432 

MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWK 
NIPCPWVGRINIVKMAILPKVATITVLTLQVRFRDLCNMPKVTQLDT 
AJBPRFKSAQLSVLHVYVVWPLKEFVFEALGK>?LGKDQLQATMRVCLI 
SST 

1737 

A 

1 

413 

MSELPFTIASKRIKYLGIQLTRDVTODLFKENYKPLLKEIKEDTKKWK 
NIPCSWGRINIVKMAILPKIIYRFSAIPIKLPMTFFTELEKTTLKF 
I WNQKRAC I AKS I LS QKNKAGG I TLPD FKLYHKATVTKTAWYW 

1738 

A 

1 

1401 

MGQHDKQQYLEVQSNKIIAIWISDDHEDIISELGCLHIPERQKMSIL 
FPSSATSGWAJ^QRDFRVGAGSAJ^PALLSGRLAPPAJLGSERLSTRAS 
oi^VjijJjlrlOil r'K^KAJrJr'i'/il KrlFVP5> x AQ K JL»K o AAAR QRJLi AG o S ARG R 
DALGKASWAPESVLEVLAJRAVRQEKEIKGILJ^KEEVKLSLFAGDMI 
VYLENPIISAQNLLNKINVQKSQAFLYTSNRQTESQILSELPFTIAS 
KRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRI 
NI VKMAILPKVI YRFNAT PT KT.PMPFPT»i?T.T?TrT^T.Tri7TT?Mnirp ap t a 

KS I L I QKNKAGG ITLPDFKLYYKATVTKTAWYWYQNRD I DQWNTTEP 

SKIMPHIYNYLIFDEPEKNKQWGKDSIiFl^K>TYWENWLAICRK^ 

FLTPFTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFM 

1739 

A 

1 

512 

MLEVlaAWAVRQEKEIKGIQLGKEEVKLSL*l^^ 
SIiAFVYTNNRQSESQIMSELPFTVATKRIlQ(xjGIQLTRDVKDL 
YKPLlxNEIQEDTNKWKNIPCSWGRINIVK^ 
LPMTFFTELEKTK\FIWNQK\RAHIAKTIL 

1740 

A 

1 

2052 

MI I S IGAEKAFDKVQQPFMLKTLNKLGIDGTYFKI IRAI YDKPTANI 

ILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGI 

QLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINV 

QKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFK 

ENYKPLLKEIKEDTNKWKNIPCSWGRINIVKMAILPKVIYRTO 

IKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKSKAGGITIiPDFK 
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LADNMTVYLENPIVSAQNLLKLISNFSKVSGYKVSGQKSQALLYTNN 
KyiDoyinofiiirr x xm^ivkxivx XjVjXriXj IKDVKDLiFKENx KPLLKEIK 
KDTNKWKNIPCSWGRINIVKMAILPKDIIQENFPNLARQANIQIQE 
IRKTPQRYSSRRATPRHI I VRFTKVEMKEKMLRAAREKASHHTYSKI 
DPILGSKPLLSKCKRTEIITNYLSDHSAIKIiEFRIKWLTQNCSTTWK 
PNNLLLNDYWVHNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGK 
FIATNAHKRKQERSKVDTLTSQLKELEKQEQTtiSKASRRQEITKIRA 
E LKE I ETOKTLO K I NE S R S W F FE R T MKT .ni? P T . a D T a/ v virp T? VTark t n 7v 
I KNDKGD I TTNPTE I QTT I RD Y YKHL 

1741 

A 

1 

764 

MNIDAKIIJ^KILAKQIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINV 
I QH INRTEDKNHM 1 1 S IDAEKAFDKIQQPFMLKPLNKLG IDGTYFKI 
IRAIYDKSTPNIILNGQKLELMSNFSKVSGYKISVQKSHAFVYTNNR 
QSESQIMSELPFTVATKRIKYXGIQLTRDVKDLFKENYKPLLNEIQE 
DTNKWKNIPCSWVGRINIVKMAIIiPKVIYRFNAIPIKLPMTFFTELE 
KTK\FIWNQK\RAHIAKTIL 

1742 

A 

1 

1057 

LRPYPIIGWVWGKGLRNPQDFANLNSTNCLYGYTFQQKPRNAPFPW 
KHINRAKDKNHM I I S I DAEKAFDKI QQPFMLKTLNKLG I DGTYFKI I 
RAIYDKPTANIII^GQKLEAFPLKTGTRQGCLLSPLLFNIVIiEVIJ\R 
AI RQEKE I KG IHLGKEEVKLSLFADDM I VYLENPI I S AQNLLICLI SN 
FSKVSGYKINVQKSQAFLYTNDRQTESQIMSELPFTIASKRIKYLGI 
QVTRDVKHLFKENYKPLLKEIKEDTNK>?KNIPCSWVGRINIMKMAIL 
PKVI YGFNAI PI S FNWYGVQPGH 

1743 

A 

1 

1380 

MGDFNTPLSTLDRSTRQKN^KDTQELNSAIjHQADLIDIYRTLHPKST 
EYTFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLE 

YQNLWDAFKAIIESLIFHNRAEIVAIIDSLPTKKSPGPDGFTADFYQ 
RYKEELVPFLLKLFQS I EKEG I LPNS F YEAS 1 1 L I PKPGRDTTKKEN 
FRPISLINIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNI 
RKSIl^IQHINRAJCDKNHMIISIDAEKAFDKIQQPFMLKTLNKLAQN 
LLKLISNFSKVSGYKINVQNSQAFLYTNNRQTESQIMSELPFTIASK 
KI KYLG IQLTRDVKDIiFKENYKPLLKE I KEDTNKWKNI PCSWVGR IN 
I VKMAI LPKAMLCLDVAYPTDDTRWYKLYTQKWTHFS 

1744 

A 

1 

862 

l^TFx^x^TLPRIilQEEVESLNRPITGSEIVAIINSLPSKKSPGPDG 
FTAKFYQRYKEELVPFLLKLFQS IEKEGILPNSFYEGS I ILIPKPGR 
DPPKKENFRPTSIoMNIDAKILNKIl^TOIQQHIKKLIHHD^ 
MQGWFNIPKS INVIQHINRAKDKNHMI I S IDAEKAFDKIQQPFMLKT 

SPLLFNIVVEVLARAIRQEKKIKGIQLRKEEVKLSLFAX>DMIVYLEN 
PIVSA*RLNQEEVESLNRPITGSEIVAIINSLPSKKSPGPDGFTAKF 
YQRYKEELVPFLLKLFQS IEKEGI LPNS FYEGS I ILI PKPGRDPPKK 
ENFRPTSLMNIDAKI LNKILATR I QQH I KKL IHHDQVG 1 1 PGMQGWF 
NI PKS INVIQHINRAKDKNHMI IS IDA^ 

IDGTYFKI I RAIYDKPTANI I LNGQKLEAFPIiKTGTRQGCPLS PLLF 
NIVVEVLARAIRQEKKIKGIQLRKEEVKLSLFADDMIVYLENPIVSA 

1745 

A 

1 

1551 

ATATSLPTSSENFSGEALCSLKPLPKQLPKRLIRKVQHENGQELSKV 

LRPYPIIGVRWGKGLRNPQDFANLNSTNCLYGYTFQQKPRNAPFPW 

KHINRAKDKNHMI IS IDAEKAFDKIQQPFMLKTLNKLGIDGTYFKI I 

RA I YD KPT AN I ILNGQKLEAFPLKTGTRQGCLLS PLLFN I VLEVLAR 

MRQEKEIKGIHLGKEEVKLSLFADDMIVYLENPIISAQNLLKLISN 

FSKVSGYKINVQKSQAFLYTNDRQTESQIMSELPFTIASKRIKYLGI 

QVTRDVKHLFKENYKPLLKE I KEDTNKWKNI PC SOT 

PKVIYGFNAIPIQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTE 

SQIMSELPFTIASKRIKYLGIHLTRDVKDLFKENYKPLLKEIKEDTN 

KWKNlPCSWVGRINIVKmiLPKVIYRFNAIPIKLTMTFFTELEKST 

LKF I WNQKGAR I AKS I LS QKNKAGG I TLPDFKLYYKATVTKTAW YW Y 
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1746 

A 

1 

947 

MDTFLNTYTLPRLNQEEVESIjNRPITGSBIVAI inslpskkspgpdg 
FTAKFYQRYKEELVPFLLKLFQS IEKEGlLPNSFYEGS I ILIPKPGR 
DPPKKENFRPTSLMNIDAKII^ILATRIQQHIKKLIHHDQVGIIPG 
MQGWFNI PKS INVIQHINRAKDKNHM IIS IDAEKAFDKI QQPFMLKT 
LNKLGIDGTYFKI IRAI YDKPTANI ILNGQKLEAFPLKTGTRQGCPL 
SPLLFNIVVEVLARAIROEKXIKGIOIjRKFFVTCT.QT.PanriMTVVT ttkt 
P I VS APAAFLQS LWS STMAGTLEKGEPLSGKEQ 

1747 

A 

179 

887 

LLKRDVKDLFKENY I PLLNE I KEDTNKWKN I PCS WVGR I NI VKMAI L 
PKVIYRFNAISIKLPMTFFTELEKTTIiKFTWNQKRAHIAKTILSQKN 
KAGGIMLPDFKLYYKATVTQTAWYWYQNRDIDQWNRTEPSEIIPHIY 
IWLIFDKPDKNKKWGKDSLFNKWCWENWl^ICRKLKLNPFLTPYTKI 
NSRRIKDLNVRPKTIKTLEEl^GNTIQDIGMGKDFMSLLNDMSFTEC 
K 

1748 

A 

1 

1074 

MDKFLDTYTLPRLLNOERV1^T.TJPPTTV^I\t?T\71\T THCT TJISTDT'C'DC VTM 

PHT YNYL I FDKPDKNRKWGKD S LFNKWCWENWLA I YRKLKLDP FLTP 
YTKINSRWIKDl^IRSKTIKTLEENLGNTIQDIGMGKDFMSKTPKPM 
ATKAKlDKWDLI KLKS FCTAKETHS SWRPACLCRLHLWGQRTGKQKD 
SSNLCRLKCPCLTALKRAVVLPACSWRSEKGQTASSKQSWTENDFDE 
LREEGFRRSl^SELREDIQTKGKKVENFEKNLEECITRITNTEKCLK 
EIaMELKTKARELREECRSLRSRCDELEERVSAMEDEMNEMKREGKFR 
EKRIKRNEQSLQEIWDYVKRPNLRLIGVP 

1749 

A 

x 

1060 

MDTFLDTYTLiORLiNORRVT!^T.>JPPT r rriQlTT'\7Ji t tktot DTinfODTDnn 

* ixy x xr x x a xj\^rvxxi.ii ytjlj w mi i IM Jvxr X X VjOCL V.M.X XIvo Xjxt X AAMfUrJ-Aj 

FTAEFYQRYMEELVPFLLKLFQS IEKEG ILPNS FYEAS I ILI PKLGR 
PTTKKENFRP I SLMNI DAKI LNKI LAKRIQQHI KKL IHHDQyGFI PG 
MQGWFNI CKS INVIQHINRAKDKNHMI I S IDAEKAFDKI QQRFMLKT 
LNKLGIDGTCFKWKNIPCSWIGRINIVK>1AIL 

MTFFTELEKTTLKFIWNQKI^CIAKSILSQKNKAGGITLPDFlCliHYK 
ATVTKTAWGPYDR/DID/ SWNQTDlaMCAVX»PSRYVTLQDSSIL*KMR 

* VKKLQKKS LQLADLVQGLRKD VS I T 

1750 

A 

1 

1402 

MSELPFTIASKRI!OfLGIQLTRDVKDLFKElSrYKPLLKEIl<EDTO 
NIPCSWVGRINIVKMAIMPKVIYRFNAIPIKLPMPFFTELEKTTLKF 
I WNOKRAR I AKAT 1j S OKNK A aa T TT . P n "PTTT .wv 2\ t\ 7 r vv T r a tJVuvrixrD 

DIDQWNRTEPSKITPHIYNYLIFDRPEKNKQWGKDSLFNKWO^*^ 

A I CRKLKLDPFLTP YTKINS RWI KDLNVRS KTI KTLEENLGNT I QDT 

GMGKDFMSKTPTAMATKDKIDKWDLIKLKSFC^ 

WEKIFATYSSDKGLISRIYITOLQQIYKXKTI^ 

EDIYAAKKHMKKCSLSLAIREMQIKTTMRYHLTPVRMAI IKKSGNNR 
CWRGCGEIGxXLHCO^DCKLVQPLWKSVWRFLRDLELEI PFDPAI PL 
LGWPKDYKSCCYKDTCT/R/MFIVALFTIAKTONQPKCPTMIDWI 

1751 

A 

1 

1410 

MS ELPFT I AS KR I KYLG I QLTRD VKDLFKENYKPLLKE I KEDTNKWK 

NI PCS WVGRINI VKMAI LPKVIYRFNAIPIKLPMPFFTELEKTTLKF 

IWNOKRARIAKSILSOKNKA.GGTTT,PnFKT.YYTCAT 

DIDQWNRTEPSEITPHI YNYIil FDKPEKSnCQWGKDSLFNx^ 

AI CRKLKLDPFLTP YTKINSRWIKDLlTVn^ 

GMGKDFMSKTPKAMATEDKIDKI^LIKLKSFCT 

WEKIFATYSSDKGLISRIYlTOLKBIYKKiaTCxTOI*^ 

EDIYAAKKHMKKCSSSxxAIREMQIKTTMRYHLTPV^^ 

CLRNTFR S VP SD S AVI AQGTGTHPRGDTPSHICAG VLVVGMS FPEE PV 

LQDRSDDFSLAHGSKQGNSVXCVGQVnCTQGPCSRNEQYRPITGPGSH 

1752 

A 

1 

1559 

MDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDG 
FTAE F YQR YKEELVPFLLKLFQS I EKEGILPNS FYEAS 1 1 L I PKPGR 
DTTKIGSNFRPISLMNIDAKILNKIIxANQI 

MQGWFNIRKS I1WIQHINRAKDKNHMI IS IDAEKAFDKI QQRFMLKT 
LIKLGIDGTYFKIIRAIYDKPTANIIxxNGQIOjEAFPLKTGTRQGCPL 
SPLLFNIVXEvIxAPJIIRQEKEIKGIQLGKEEVTCLSLFADNMIVYLEN 
PIVSAQNLLKLISNFSKVSGYKJNVQKSQAFLYT^ 


ioon 
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PFTIASKRIKYLGIQLTRDVKDLFTSVISQVWGSLDTSLILQLWVG 
SLDISVILQLWVGSLDTSVISQLWVRSLDTSVISQLWDIAFLSHVPG 
MLS * KSQVS LATLMQRMSSHGLGQLQPCGSAGYS SHGCFHRLALNAC 
GSSSAQCKLLVDLPFWGLDGGLLLTAARGHSPGALCVRVPTPHFPSM 
LP 

1753 

A 

1 

1576 

MNTDAKI LNKI LANRIQQH IKKL IHHDQVGF I PGMQGWFN IRKS I NV 
I QH I NRTKDKNHM I VS TD ABKT FDKI QQP FMLKTLNKLG I DGT YLKI 
I RA I YDKPTAN I I LNGQKLEAFPLKTGTRQGCPLS PLHKFLDTYTLP 
. RLNQEEVESLSSPITGSEIVAIISS/FTNEKESRTRWIHSRILPEV* 
GGT/ RI KYLGIQLTRDVKDLFKES YKPLLKE I KEDTNKWKNI PC * WV 
GRINI VKMAILP/ KELEKTTLKFI WNQKRAC I AKS ILIQKSKAGGIT 
LPDFKLYYKATVTKTAWYWYQNRDIDQWNSTEPSEIMPHIYNYLIFD 
KPEKNKQWGKDSLFNKWCWENWIJVICRKLKLDPFLTPYTKISSRWIK 
DLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKANIDKWD 
PIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNBLKQ 
IYKKK\TNNPIKKWAKDMNRHSSKEDIYAAKKHMKKCSSSLAIREMQ 
I KTTMR YHLTPV 

1754 

A 

468 

4080 

RVRSGTDS IAS GPRVLCS TRTERRRRR S YLVHRRCVPCGP \ AVDGVF 

NLTNIDRWFLHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLG 

IDGTYFRI IRAI YDKPTAN 1 1 LNGQKLEAFPLKTGTRQGCPLS PLLF 

N I VLE VLARAI RQEKE I KG IQLGKEE VKLSLFADDM IVYLENP IVS A 

QNLLKLISDFSKVSGYKINVQKSQTFLYTNNRQTESQIMSELPFTIA 

SRRIKYLGI QLTRDVKDL FKENYKPLLKE I KEDTNKWKN I PCSWVGR 

INIVKMAILPKVFYRFNAIPIKLPMTFFTASCFSAKSVEQYYGWNIG 

KRRAAEWQRAMTMRKVLQEILEKNPRFHHLTPLKTKHIAHWCRCHGY 

TPPDPESLRNDGDSIEDVLTQIDSEPDHGSLGDGKSSSVGPTAWTL 

LACHLECPSSFSSADNLCRKLEDLQQFQKREPBNEEEVDILSLSEPV 

KINIKKEQEEKQEEVKFYLPPTPGSEFIGDVTQKGGMLVWREAVSKA 

A I RTNG FLLL FQ I G ITLQP VALHRNVYAS WEDM I LKATEQL VND I L 

RQALAVGYQTASHNRYYYLCSLWCEAYTS FLTVELPMKVFYTYINPV 

GDGGS KTCDTMVFNHPAI KKFLES PSRSS S PANQRAETP S ANHSESD 

SLSQHNDFLSDKDNNSNMDIEERLSNNMEQRPSRNTGRDTSRITGSH 

KTEQRNADLTDETSRLFVKKTIWGNVSKYIPPDKREENDQSTHKWM 

VYVRGSRREPS INHFVKKVWFFLHPS YKPNDLVEVRYKI VVTRYVLG 

SGRDKGTVKMSTGPWLWETKKVFLPLRIKYLGIQLTRDVKDLFKENY 

KPLLKEIKEDTNKWKNIPCSWGRINIVKMAILPKTERGEEASEEKY 

EASRCWFVRFKEGCRLHNIQLQGEAASADGEAAAS YPEDPAKI IDEG 

GYTKQH I FTVYWGG YDGQNRLS S VECYDS FSNRWTEVAPLKEAVS S 

PAVTSCVGKLFVIGGGPDDNTCSDKVQSYDPETNSWLLRAAIPIAKR 

C I TAVS LNNL I YVAGGLTKA I YCYDP VED YWMHVQNTFS RQRGELGV 

FWPSAGFPASTEAERHTPFYALPSSLERTPTKMTTSQKVTFCSHGN " 

SAFQPIASSCKIVPQSQVPNPESPGKSFQPITMSCKIVSVLLSSVMF 

NKRKFITVDWQSCHKAKTKLIHTEMNGGDK 

1755 

A 

1 

2882 

MDKFLDTYTLPRLNQEEVESLNRS ITGSE IVAI INSLPTKKS PGPDG 
FTABFYQRYKBELVLLLLKLFQS I EKEATLPNS F YEAS I ILIPKPGR 
DTTKKENFRPISLMNIDAKILSKILANQIQQHIKKFVHHDEVGFIPR 
MQGWFNIHKSKNVIQYINRTKDKNYMIISIDAEKAFDKIQQLFMLKT 
LSKLGIDGTYLKI IRAI YDKPTVKI ILNGQKLEEFPLKTGTRQGCPL 
SPLLFNIVLEVLARAIRQEKEIKGILLGKEEVKLSLFADDMIVYLEN 
PIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEL 
PFTIASKRIKYLGIQFTRDVKDLFKENYKPLLNEIKEDTNKWKNIPC 
S WVGRINI VKMAI LPKPPLL I PRQTGSGVDLQQTPTDLQLS VLTVRR 
KINKQKGHPHQYLIWTSPSSKTKGRQLLASNRTTNGKEIRPBD*KGE 
VS S CCWRRI * YTAENKI PRNPTYKGCEGPLQGELQTTAQGNKRGYKQ 
TEEHSMLMGRKNQYRENGHTAQ\VIYRFNAIPIKLPMTFFTELEKTT 
LKFIWNQKRARI AKS I LSQKNKAGGITLPDFKLYYKATVTKTAWYWY 
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QNKD I DQWNKTErSE I TPH I YNi L I FDKPEKNKQ WGKDolir W K.W x WE 
NWLAICRKLKLDPFLTPYTKINSRWIKDIiNVRPKTTKTLBENIiGITI 
QDIGMGKDSMSKTPKAMATKDKIDKWDLIKLKSFCTAKDTTIRVNRQ 
PTKWEKI FATYS SDKGL I SR I YNELKQ I YKKKTNNP I KKWVRDMNRH 
FSKEDIYAAKKHMKK\CSPSLAIRETQIKTTMRYHLTPVRMAIIKKS 
GNNRD I DEG VNHH S QQT I TRTENQS JLHVLTHTWE FNNENTWTQGGEH 
HTLGPVMGYVFISSVKMDQYTKLVSEEWGVAENQATNAFLHRIGKNY 
FKVHMEPKKSPHCQVNPKPKE 

1756 

A 

1 

1746 

MI ISVDAEKAFDKIQQPFMLKTLNKLGIDGMYFKI IRAI YDKPTANI 
ILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVtiARAIRQEKEIKGI 
HLGKEB I KLSLFADDM I VYLENP I VS AQNLLKL I SNFS KVSGYKINA 
QKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVJC^IiFK 
ENYNPLLNEIKEDTNKWKNIPCSWVGRINIVKM\AILPKNWKKTTLK 

V X VvJ>iyiSJrti\V* A/AJVo X JjoyiVN JVftoul 1 LjtrLJr ISJjI X JVH.X V 1 IS. JLi-VrY X KiXyiN 

RDIDQWNRTEPSEIMPPIYNYLIFDKPEKNKQWGKDSLFNKWCWENW 

LAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENIX5ITIQD 

IGIX5KI)FMSKTPKAMATKAKIDKWDLIKLKSFCAEKETTIRVNRQPT 

KWEKI FATYS SDKGLI SRI YNELKQIYKKKTNNPIKKVA/TODMNRHFS 

KEDIYAAKJOiMKKCSSSLAIREMQIKTx^ 

RDMDEIGNHHSQQTIAMTKNQTPHVLTHRWE 

GPWGNLKLLRKPKISLS 

1757 

A 

1 

2866 

MDIDAKILNKILAKRIQQHIKKLIHRDQVGFILGMQDWFNIQKSINV 
IHHINRTKDKNHMIISVDAEKAFDKIQQPFMLKTLNKLGIDGTYLKV 
TRAIYDKPTANIILNGQKLEAFLLKTGTRQGRPLSPLLFNIVLEVLA 
RAIRQEKKIKGIQLGKEEVKLSLFADDMTVYLENPIISDQNLLRLIN 
NFSKVSGYKINVQKSQAFLNTNNRQTESQIMSELPLTVASKRIKYLG 
IQMLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRN 
S VEEEIjVPFLLKIiFQS I EKEG I LPNS F YEAS 1 1 L I PKPGRDTTKKEN 
FRP I SLMN I DAKILNKI LAKR IQQH I KKL I HHDQVGF I PGMQGWFNI 
RKS INVIQH I NRAKDKNHM IIS IDAEKVFDKI QQPFMLKTLNKLG ID 
GTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTKQGCPLSPLLFNI 
VLEVLARAIRQEKE I KGIQLGKEEVKLSLFADDM I VYLENPI VSAQN 
LLKLISNFSKVSGYKKKINVQKSQAFLYTNNRQTESQIMSKLPFTIA 
SKRI KYLG I QLTRDVKDLFKENYKPLLKE I KEDTNKWKN I PCS WVGR 
XNX VlSJxA.lJbFiS.Xvjx v_VyJL/I loAJNl IxjVHUI 1 FoHVMFFJNxxWUJjAt JN X 
QELEHLQTV1CNISPLQILPPSGDSEQLSDGITVT1HPSGDSDTTMLES 
ECQAPVQKDIKIKNADSWKSLGKPVKPSGVMKSSDELFNQFRKAAIE 
KEVKARTQELIRKHLEQNTKELKASQETHRDI^NGLTVESFSNKIQN 
KCSGEEQKEHQQSSEAQDKSKLWLLKDRDLARQKEQERRRREAEGQW 
APPPPPNGEAAGLPAPPDAHAPVPALLVPGLLLLRALRLHPPAQTFV 
EPRQGAGADLPAARPGSTLPACLSDPSLGTHCYVRIKDSGLRFHYVD 
GAGICLASGEASGSF 

1758 

A 

1 

1285 

MLEVLAWAVRQEKE I KG I QLGKEEVKL S LFADNMTVYIjENP I VS AQN 

LLKLISNFSKVSGYKVSGQKSQALLYTNNRQTESQIMSELPFTIASK 

RIKYLGIHLTRDVKDLFKENYKPLLKEIKKDTN^ 

IV1CMAILPKDIIQENFPNIARQANIQIQEIRKTPQRYSSRRATPRHI 

ITOFTKTOMKEKMLRAAREKASHHTYSKIDPILGSKPLLSKCKRTEI 

ITNxXSDHSAIKLEFRIKNLTQNCSTTO^ 

I KMFFETNENKDTTYQNLWDTFKAVCRGKF I ATNAHKRKQERSKVDT 

ltsqlkelekqeqthskasrrqeitkiraelkbietqktiiqkinesr 

swfferinkldrplarlvkkkreknqidaikndkgdit\tdpteiht 

tireyyknlyakkle^eemdkfletytlprpnqeev^ 

e i vai ins l ptkks pg pdg ftaef yqr ykeel vp fllkl fqs i e keg 

i lpns fde as 1 1 l i pklgrdttkkenfrp i plmntdaki lnki lanr 

iqqhikklihhdqvgfipg\mqgwfnicksinviqhinrtkgknhmi 

i s idaekafdkiqqpfllktlnklgidgtylki irai ydkpiani lit 
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NGQKLEAFPLKTGTRQGCPLSPLLFNIVLBVLARAIRQEKEIKGIQL 
QKEEVKLSLFADDMIVYLENPTVSAQNIiLKLISNFSKVSGYKISVQK 
SQAFFYTNNRQTESQIMSELPFTIASKRIKYIiGIQLTRDVKDLFKEN 
YKPLLKE I KEDTNKWKN I PCS WVGR I N I VKMAI LPKPPLL I PRQTGS 
GVDLQQTPTDLQLS VLTVRRKINKQKGHPHQYL I WTS PS S KTKG\ GS 
S S PATEL YYKATVTKTAW YG YQNRD I DQWNRTE P S E I MPLTYNYL I F 
DKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWI 
KDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKW 
DLIKLKSFCTAKETTIRVNKQPTTGEKIFTTYSSDKGL\ ISRIYNEL 
KQ I * KE IKG IQLGKEEVKLSLFADNMTVYLENP I VS AQNLLKLI SNF 
oivVoUx JVVi>ijyiv5>yAJjiji lISJNKyaE&QIMSEIjPFTIASKRIKYljGIH 
LTRDVKDLFKENYKPLLKEIKKDTNKWKNIPCSWVGRINIVKMAILP 
isjj J. xyi^n* 1 iTMiaMKyAW xy xyn»XKis.x PsiK x SSRKATPRHI IVRFTKVE 
MKEKMLRAAREKASHHTYSKIDPILGSKPLTjSKCKRTEI ITNYLSDH 

^ATKTjRFR TKNT»TONPQTTWtf PNTNJT.T.T TJT^VT#J\7TJ>Jl?MTraT?T'lfMI?T?T?TrKl 
OrtJ. ajjcjT x\. J. X\XM JJ A \^lMV~o X X W IVrTM XM Xj XiXjIN U X W V tXM£!«ril\Ai& X .WIX* 1* A XIV 

ENKDTTYQNLWDTFKAVCRGKFIATNAHKRKQERSKVDTLTSQLKEL 
EKQEQTHS KASRRQE ITKI RAELKE I ETQKTLQKINES RS WFFERIN 
KLDRPLARLVKKKREKNO IDAI KNDKGD T TNR SHRNTTHYttOR Tt, 

1759 

A 

2 

231 

PPSAS/ CVQTGPPCHSLAFPPSAPGQEQEGHQIjPSHVI PCHLALGTA 
FPQPAAMAGWGVS QATYCQELE PQFPPVS S S 

1760 

A 

1842 

2096 

CHSQKPQVPPPKPWGSLERP\PNT*VPACVLPAPAPAQRPGQIARQ* 
PWVAPGTSGOSRVGRTPGVSSGHGOTTjTPPMAT.T.nPT.T. 

1761 

A 

37 

288 

WGOTGSQVMTTVLNTAxjLPPKPSPMLPIKHYAIPPPSY/SPHP*PPI 
PSPTANLESAPPASAP/PPPLPP\PAQLGEAHAPPE*YAIPPPSYSH 
TOSHPSPAQQQTLSQRHQPLHLSSSASTLPSWGKLMHHLNNIPQ 

1762 

A 

397 

506 

RVCI FNIiVGFGI 

1763 

C 

291 

491 

MGADRQTHPQDRWFS LHL I KLLRR SYRI EQP ASHSRGRLDNTSLHRL 
SLCPCHPPLFLTLFLGIMF* 

1764 

A 

630 

1159 

SFYSALMLSDRKNGRERGRHGLSRRIKRQRKLVEAEAASQACLAGKP 
GPTP ASHP / PGDGLPCPLS APTLS PAT/ ALPLP/ S P * RS PNHS AN/ P 
NPLSPTSSVKHSRCSSPHSALPLSTPKATPPSPNHRAGLFSSLPLAP 
MSTS*LLQKNCVLVRAPAFTSYIPVFFMTPAQCPVFLSAQ 

1765 

B 

1 

1359 

- 

MGIKNSKRGSLPliAAPALLKPDTLWSCIFGEGDLGEEDMPPDPQAPS 
PAILPPNHAFGLWPFRYTQTHPQEAAADPAAAYHPEPPPLILAGVRF 
KbW FAijPhjlNiUKGAWRGDGVAEGHAALWVLGL 

HDGRWREGAEKGPRRRWVVRAIQTPRLGKAIilHCWTTRGSLWVNSSV 
SAGSERDQRADHAAAQTSCAAGAVACGLLLIjLVRGQAWVKKQNIiSPE 
TRTKELWSl^PLPHPGQGHNLASRAKARTQPVPSGPTHGAGAGESC 
PCEGRQAGVQTFIiG I PFAKPPLVRCDLHPLS PLNLG WWGWNHPSGH 
x x xir/\rion.i}vjoi>ixjt^llvvit^/vr<X^ 1F1 XjxvA XL» 
TVSPFLASLRWHECVFACCVPHIPRTFHGAIMESGVALIiPGLLPAQL 
MSSPRFPGPCVFYEFQHQPSWLKNIRPRT* 

1766 

A 

2 

597 

RWLIPKVMRI xT)TQKKMDREASQAALQKMLTLLMliPPTFGDLLREEY 
IGDNGDPQlTjQAQFQEMMADSMFVIPALQVAHFQCSRAPVYFYEFQH 
QPSWLKNIRPPHMKADHVKFTEEEEQLSRKMMKYWANFA1WGNPNGE 
GLPHWPLFDQEEQYLQL1^QPAVGRALKAHR1^LW\KXA1jPQKIQEL 
EEPEERHTEL 

1767 

A 

3 

1867 

IHPSAPRLGKALIHCCSFPGQPLGEQQRVRRQRTBTSEPTMRLHRLR 
ARLSAGACGLLLLLVRGQGXQDSASPIRTTOTGQVLGSLVHVKGANA 
GVQTFl^IPFAKPPL\GPL\RFAPP*SPLESWSGVRDGTTHPA\MCIi 
QDLTAVE\ SEFLSQFNMTFPSDSMSEDCIjYLS I ytpahshegsnlpv 
MVW I HGG ALVFGMAS L YDG S MLA \ AI» \ ENWW I IQL.PPGGVLGFFS 
TCDKHATGIWGYI^QVAALRWVQQNIAHFGGNPDRVTIFGESAGGTS 
VS SLWS P I SQGLFHGAIME SGVAL/ LLPGL IAS S AD \ V I STWANL 
SACDQVDSEALVGCLRGKSKEE ILCN* TSLFKM I PGVGGMGVFLAQG 
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TPRELLASADFQPVPSIVGVNNNEFGWL\IPKVIEDLIDNPEGKLGQ 
ERAS QG\ VLQKMLTLLMIiPPTFGDLLREVY IGDNGDPQTLPKRKF \ Q 
KMMADS\MFVI PALQVAHFQ\CS \RAPVYLPTSSQH\QPSWLKNIRP 
P\HMKADHG\DELPFVFRSFF\GGNYIKFTEEEEQLSRKMMKYWANF 
AR\NGNP\NGEGLP\HWP\LFDQGGAITLQLNL\QPAVGPGL*KAH\ 
K L» \ y b WK \ KAJjFyjs.XQ \ EL»EE PEERHTEIj 

1768 

A 

2 

288 

CPNS S PG S AS EVGCARS GQfe S LLRS LPRCDGW PWAEAG A/MCAGRNL 
TS CSVGRY \ YS SR * QDEES * TARHLLCAPQTGHQRRRPCRGQRNFCH 
IPC 

1769 

A 

1017 

1463 

PRG PWSQGE PKWLIiARQAAGCC P PG ACL WGHS P AGACS PLCAAKG S R 
YSRVPASSGTPAGK\GWQIiLAREGEQEAGLCIEASQIGVPNSSPRKR 
PQR * DVPDRGS PVC * EVFPRCDG WP WAEAGAS VQGGTS PPVS FNRMT 
SASTPNIW 

1770 

A 

1 

1181 

MAAYKLVLIQHGESMWNPENRFSSWYNTDLSPAGHKEAKCGRQHSGG 
LIGLNKAETAAKHGEAQVKIWRHSYDVPPPLMEPDHPFYNISKDRRF 
AOTjTEDQLPSCESLKDTIAKAIiPFWNEEIVPQIKEGKQPKYPFEKRL 
E WNHYFTTDDG YR 1 1 S ARFG VPRTQVRTWVAI»YEKHGEKGL I PKPK 
GVSADPELRIKWKAVIEQHMSLNQAAAHFMLAGSGSVARWLKVYEE 
RGEAGLRALKIGTKRNIAISVDPEKAASALELSKDRRIEDLERQVRF 
LETRLMYLKKLKAJLAHPTKKAAE I PRSTFYYHLKALS KPDKYADVKK 
RI SE I YHENRGRYGYRRVTLSLHREGKQ INHKAVQRLMGTLSLKAAI 
KVKR YR S YRG E VG QTAPNVL QRD F KATR PNE KWVTD VTE FAVNG RKL 
YLSPVIDLFNNEVISYSLSERPVMNMVENl^DQAFKKLNPHEHPVLH 
SDQGWQYRMRRYQNILKEHGIKQSMSRKGNCLDNAWECFFGTLKSE 
CFYLDEFSNISELKDAVTEYIEYYNSRRISLKLKDLYASCLTVQLFG 
VSTLEGLSEEAIMEIiNIjPTGIPVVYEIJDKNLKPIQFLGDEETMRKAM 
EAVAAQVEDEYNLYGDVTTVSSLTPEKIiAPEVKElT^ERIjN ILLQVG 
GRSRTNIJ^VRVPIHKHPSIRDPNSQASLNKLTDKMDSLWLKSKNRP 
KGPWKGEGAVKLFVSLENAAKPVFLQPSQRRVPGNNGSWEFPS*SRK 
SGISIGAVKRPTH*GS*KASSIS*N7U\YVSKKAESLSSSHEKGCGDT 
AQYVLLSS KGSQQA* QVCGR * KAY * * DLSRE * RP IR I P * GNAVS S SR 
RETD * P * S C S APDGNPLT * SSD *GQAI PLLQRRGRANRP 

1771 

A 

362 

552 

DRIJ^PHSAAH*GAKSSPLAATSDQWHIiALSRGSGSLYKCTEVKQMSN 
S*PASRSACPPPSPPR 

1772 

A 

2318 

3201 

FMPLHLDGGYCSPAEGFSSRYEHGLMKDLSRGSLSPGGERACEGVPS 
APQNPPQRKKVSLLEYRKRKQEAKENSAGGGGDSAQSKSKSAGAGQG 
SSNSVSDTGAHGVQGSSARTPSSPHTKFFPSHSSMSHLEAVSPSDSR 
GTS S SHCRPQEN I S S RWMG SH I S RTTP/ S KEGAS PRS S EAA* G / S AQ 
KGEPSPTWESNITEKDSDPADGEGPETLSSALS*RSNSFTALSRYSY 
QT/PLAPFTGTPGYFSSQPHSGNSTGSNLPRRSCPSSAASPTLQGPS 
U£>P l£>JJb Vi>y£>t> ICj ill 

1773 

A 

53 

1025 

GTRHLEAVSPSDSRGTFLSHCRPQENISSRWMVPTSVERJLREGGSIP 
KVLRSSVRVAQKGEPSPTWESNITEKDSDPADGEGPETLSSALSKGA 
TVYSPSR\SATSSCSVIVLGQNHKASFSRVPPPSEDIPTQSPGYSYR 
TTALRPGNPPSHGSSESSLSSTSYSSPAHPVSTDSLAPFTGTPGYFS 
SQPHSGNSTGSNLPRRSCPSSAASPTLQGPSDSPTSVFSFSVPAQEL 
*ASTS/SSSEl/PRSSLAIRLTD*SVCPVLGQSAGYQGIiQGICGFQF 
lAiilrllJL VvjVijr a iy xKJ.FJbFljQGSRSQDSRRGIjFX>GIiLGFGGKQN | 

1774 

A 

1 

415 

AENT I LSLMFSGKS WGSLS S ECGALIiPGTPTATRIjOSIjTRRSSIiKRG 
T * GPQI PAARPREGSRI GTCTS S * \ PGLRTEAHRSLRNRQAGVS S P F 
QLS PALKPRKS P SQATGQRPG * GQWGQKGSGSALL PTNTTHVEA 

1775 

A 

1 

458 

ENTILSLMFSGKSWG^LSSECGAIiLPGTPTATRLQSLTRRSSLKRGT 
*GPQ I PAARPREGSRI SQNQVGDGGKENRYRGFGGPPGTKSDPGHQH 
G\ * GGEGLWGS WRS KSGPAI ATGAIASPATQIiLRVNPDTGDWDVYFL 
LTLVSGLRLTGA 

1776 

A 

1 

1387 

HSMGWKEHVDRGGHTKGMVFVSLQPACWLPRERPFQLRLCLIEDPLL 
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RLCGQGRGRGQETTSGLILLSHVGTFLRDRWTLTSLWPCPRAIjLVFL 
SHSCEVMGAGAPGVSLPEGQLS PLPWLQVGRS SRRAPDSGQGPLGPP 
GLADVSMRGRAPGTSMCGS / RS TPVPPQS PGG * VSMPATPG IGFLCR 
LPHKSAPEGP/ GGFGFLFFIKHLKQHCSLPSPRGLSITASTCWLVG 
SKADPDPFCPHCPHPGPLTSSLRGAFSR/PSEQGRAGRGLESSRLSV 
SQGSCEASVLRPESRGSTHP/ISPVSGFTLRSWAGEAIAPVAIAGP 
AGAGLR \ GLRGALRPAHGHPRL * S PGNP ISPSSTS P PCLMPQGHS CL 
PAGPPNPRCLSSFPPSPTLVLTNPASLSGPGCRDLGSLKSLSLKNFC 
GSDS EAELLVGVPGSRAPHS LLKLPQLFPENI KLR I VFSAKKKKK 

1777 

A 

46 

591 

LSPPKPQKQSQEQNPFCQRWALGKQLGPPQGGQEG/QGPPDSLVLPG 
V/ PPVPLFVGGNLPHPPPP\ PQPQRNKGRQTEGSCSPPFKGQHNS/P 
CGPQPS * AHPLRHG SGDQAQPTS AP / PPCR/ PES * PQADEMPTPCCH 
NTG KAL \ G P P S QEGME PGG PQP G P S R S TQ S S VAHLTS GTAVRPGLG S 
PP 

1778 

A 

1412 

1673 

KRCPINRFPLECLPLPHLMGIPPEGHFH\HPLMGE* *NPPCSIQDPH 
CVTYFETPPVNLCPSTRPPEVGWEGGPSSPPAFEAPGLKG 

1779 

A 

646 

1100 

^nnSLLVTFFPFSQLQLYFQLTYASFTELLKVPIJWIiFYGMPMKKHFI. 
KPTVSDQWENNRKLKRRI \ SR* FSCRKV* V* IAHCHKFCLVLREA/ H 
MHVJL I LCAKKRF / P PLKKLTWLGMVSHTCN PSTLGG * GR * TA* TQEF 
KTSLDNMVKPHLF 

1780 

A 

109 

943 

WAKLGKGPAKR* PQALGAS APS AL YPRHVAPARAPGRTKGAGS S CRN 
SPRQV\RPQPWQWGGGAQSDVSP*GQTPRGGSFERSSSCSGHTGLVG 
KERRPLSLEGP * / S PEDPPRARHGGPQGCREHPPWFSRPLCPEAGPE 
PRAPAWTSDS IGERSTGG/ PSRPASKGPVPS AQRAGQTPGNPEAAGS 
LS LPCRALPQGREAPQPQPPPYLPLKLERG I LVFALSKI FKN 

1781 

B 

98 

195 

GLCLGQEVGQEDLVMQTLPYGVLGLDRDEEVTG* | 

1782 

A 

2 

1556 

KHSSGEQDTSTLPSPPLLTTVEDVNQDNKTKTWPPKAPWQHPSPLPS 
TLPS PS APL YAVTS PGSQWNDTMQMLQS P VWAATNDCS AAAFS YVQT 
PPQPPPPPAHKAAPKGFKAFPGKAMYNGENVFKHLAKDSRGSGNEAP 
KRQDWIAQGRANTRDKPPASWGRAVSRVPGGRERTLDKTVAVESPAM 
AALTRDPQFQKLQQWYREHRSELNLRRLFDANKDRFNHFSLTLNTNH 
GH I LVD YS KNLVTEDVMRM L VDLAKS RG VEAARERM FNG EKINYTEG 
RAVLHVALRNRSNTPILVDGKDVMPEVNKVLDKMKSFCQRVRSGDWK 
G YTG KT I TD V I N I G IGG S DLGPLMVTEALKP Y S SGG PRVW YVSN I DG 
TH I AKTLAQLNPES SLFI IASKTFTTQETITNAETAKEWFLQAAKDT 
KVKEFGIDPQNMFEFWDVSTSTSALGELSVLCQVMAVERPMRSGLAP 
QFMAPFTETQ/^ENLRVILDSSYLTHTPLPIPHQLLPHFVQLSAHQKP 
H 

1783 

A 

2 

1928 

ARGAPRLLRAAGAPSSSARVSLSVPSSPAMAALTRDPQFQKLQQWYR 
EHRS ELNLRRLFDANKDR FNHFS LTLNTNHGH I LVD YS KN/ LLVTED 
VMRMLVGLWPKSRGVEAARERMFNG\EKINYTEG\RAVL\HVA\LRE 
TGFKTHPILG*NGKDV\MPEVNKVLDKMKSF\CQRVRSG\NLKGYTG 
QRPFTD\VINIWIGGSDLGPLMVTEALKPYSSGGPRVWYVSNIDGTH 
IAK\TLAQL\NPESSLFIIASKTFTTQ\ETITNAETAKEWF\IiQ\AA 
KDPS\AVGEDFLFALSTNTTKVKEFGIDPQNMFEFWDWVGGRYSLWS 
AIGLTS I ALHVGFDNFEQLL\ SLAHWMDQLFRTDAPGRKNAPVLLAL 
LG I W YINC FGCETHAML P YDQYLH\ ALLRTS SRA\ TWS PMGNTS PNL 
E PVWTTRQ APLCGG S \ QGPMASML F \ TS C I HQGTKM I PC \ DFLI PVQ 

PED\ FERLL PHKGL * KGNRP \ TNF YVFTK\ VTP I QLGAW S PM YEHKI ! 
LRFRVIIW\DINSF*PSGELELGK\QPGLRKLEPELDG\SAQVTF\Q 
DVSTNGL INF IKAAARGPRVP INSVL I CSLLCDSHFFFS SLFS PKPE 
LIVP 

1784 

C 

127 

435 

MAASXNPEVLDITEETLHSRFLEGVRNVASVCLQIGYPTXASVPHSI 

INGYKRVLALSVETDYTFPLAEKVKAFLADP 

SXCCCSPS* 
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1785 

A 

112 

1161 

RTAVMPREDRATWKSNYFIiKIIQLLDDYPKCFIVGTVDNVGSKQMQQI 
RMSLRGKAVVLMGKNTMHAQAPFEGTIi*mPSLWRKLLP\HIRGEFG 
LLFHPGRTLTE I \ RDMLLAQ * GCPAAA\ RPGAI CPHVKVTVAS PRTL 
GLG\PEKTSFFPGL*VSPTKNLQGAPIENPEVNVPAESRTGDQSGEP 
SESHGWLNML\NISPFFLWGWVIPARCSTNGQHSTKPLKVIiGLFTGG 
KLLQFSAFLGGVSRKCLPSVCLPELAYPNCCNQYPDSIING\YKR\V 
LALSCGDGITPFPLAEKVK\AFLADPSAFVCCCNLWVAATTACFA/A 
AAAAPAKVEAKEESEESDEDMGFGLFD 

1786 

A 

2 

355 

RSITCKTKEARMLLAWVQAFLVSNMLLAEAYGSGGCFWDNGHLYRED 
QTSPAPGLR\CLNWLDAQSGLASAPVSGAGNHSYCRNPDEDPRGPWC 
YVSGEAGVPEKRPCEDLRCPGGRI 

1787 

A 

1 

771 

MliLAWVQAFLVSNMLLAEAYGSGGCFWDNGHLYREDQTSPAPGLRCL 
NWLDAQSGLASAP\VGYCRNPDEDPRGPWCYVSGE/AGVPEKRPCED 
LRCPETTSQALPAFTTEIQEASEGPGADEVQVFAPANALPARSEAAA 
VQP V I G I S QR VRMNS KE KKDLGTLG YVLG I TMMV 1 1 1 A I GAG 1 1 LG Y 
SYKHGKDLKEQraQKVCEREMQRITLPLSAFTNPTCEIVDEKTVVVH 
TSQTPVDPQEGTTPLMGQAGTPGA 

1788 

A 

48 

375 

KGL I KP FGHRTPERKK * LAQGRKQATGMARAQLPDGAQHFS TALC * Q 
LSRASNL*CHTQEALAAPSHKASFSEPFHLPMGRRVNGAFYGAIWFG 
DLNLKWS S CGNDAG 

1789 

A 

6 

902 

LQGWDWEAEPPPRGPRLNTGRS I TCKTKEARMLLAWVQAFLVSNMLL 
AEAYGSGGCFWDNGHLYREDQTSPAPGLRCLNWLDA\QAGLASAPVS 
GAGNHSYCRNPDEDPRGPWCYVSGEGG\VPEKRP\CEDL\RCP\ETT 
\SKALPAF\TTEIQGNVLKGPSADE\VQVFAPANALPARSBAAAVQP 
VIGISQRVRMNSKEKKDLG\TLGYVLGITMMVI I IAIGDGI ILGYSY 
KRGKDLKEQHDQKVCEREMQR I TLP * S AFTS PTCEI VNEKTVWHTS 
QTPVDPQEGSTPLMGQAGTPGA 

1790 

A 

1 

- 

1995 

LGRPTRPAPTFWAVAVRTRCLAEKRRQELMGALCYPPQGDRFLQKSW 
IFFRPVMADKLTRIAIGNHDKCKPKKRRQECKKSCPWRMGK\LCIE 
VTPQSKIAWISETLCIGCGICIKKCPFGALSIVNLPSNLEKETTHRY 
CANAFKLHRLP I PRPGE VLGLVGTNG IGKS TALKILAGKQKPNLGKY 
DDPPDWQEILTYFRGSBLQNYFTKILEDDLEAIIKPQYVDQIPKA\A 
KGTVGSILDRKDETKTQAIVCQQLDLTHIiKERNVEDLSGGELQRFAC 
AWC I QKAD I FMFDE PS S YLDVKQRLKAAI T I R S L INPDR Y 1 1 WEH 
DLS VLD YLSDF I CCL YGVP S AYG WTMPFS VREG INI FLDG YVPT\ E 
NLRFRDASLVFKVTVETAITOEEVKKMCMYKYPGMKKKMGEFEIjAIVAG 
EFTDSEIMV1^LGENGMGKTTFIRMIjA\GSLKPDEGGEVPVLNVSYK 

pqkispkstgsvrqllhekirdaythpqfvtrl**kplqieniidqe 
vqtlsggelqrvt\ lal * lgqnlpdvyl 1 \ de ppa\ ylds \ eqrlma 
arwkrfiphakkta\fwgtwtfim\atyl\adrvivfd\gvpstk 
ntvans pqtllg w a * infws s awkftfqeelqtnyw \ pr i \nkl i s l 
edvdqkksgn\yffldd 

1791 

A 

3 

367 

WGLLD\WPLRVRADPVSGCGGKMAELRVL\ELSRGSSTTP*RSE*S 
LTADLRLNEPR YATLPN IMKAKKKKI EVI KPGDLGVDLTS KLSVI S V 
EDPPQRTAGVKVETTEDLVAKLKBIGRI 

1792 

A 

3 

404 

RLKEKKLVKEVIAVSCGPAQCQETIRTALAMGADRGIHVEVPPAEAE 
RLGPLQVARV/ AWTADLRLNE PCYATLPNIMKAKKKKI EV I KPGDL 
GVDLTSKLSVISVEDPPQRTAGVKVETTEDLVAKLKEIGRI 

1793 

A 

1 s 

1044 

MEQCWTETLLHLHQLWGFLPGSHILSQIPCGEMRETIRTALAMGAD 
RG I H VE VP P AE AER LG P LQ V AR VLAKLAE KE KVD L VTjLG KQ VA WS P 
PCAPLLIPAREEGGGDTSTPPLLPPYGPDPPTITVSSDRDAAPARFV 
TAGSNVTLRCAAASRPPAD I TWS LADPAEAAVPAGSRLLLPAVGPGH 
AGTYACLAANPRTGRRRRSLLNLTVAGGPGKRVHGWER I RCVPMST 
ARTCEERHEHCHPAQDVGPGKAPVPKAAAQEADGDSSVDGQGQHDEK 
GWGERAGGCSPITSAQGYPQTAPECPYMTVQVLDITHQPFYVDLHHQ 
I PDPW I LI LRVFETHGASH 
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1794 

A 

11 

. 364 

SGASRVGAFCSAS WPTS S WCS PRPPTTGPANKRATVACGRNATTASA 
PASPARVRLRPPRCQRAFQTPRARDTQLGPWRDAHPFHPGLSCSCPL 
SAEAATNPAPRLVGLSPTPSPSW 

17 95 

A 

54 

868 

WDCGFGATRASRCGARPRAPSSSSAIRVKPDRTGVVTDGVKHSMNPF 
CEIAVEEAVRLKEKKLVKEVIAVSCGPAQCQETIRTALAMGADRGIH 
VEVPPAEAERLGPLQVARVLAKIiAEKEKVDLVLIiGKQA\lACTDCNQ 
\TGQMTAG\FLDWPTGQIRPPQVTLEGDKLKVEPGKSIGGLETPCA* 
KLP\AVGDSLTLRFNGGPRTPRVPNIHESPRRRRSKVIKP\GD\LGV 
DLT\SQLSVISVEDP\PQRTAGVKVETTEDLG\AKLK\EIGRI 

1796 

A 

1217 

2829 

GARS EAAEFQQSASCRRLRGGGGPGTPGRGGALLAS LLP PCRTPPDP 

PDGSCRCTRPLLSPLGKLSAPPRPRPLFWAQAGHAPQGLLPTSRPA 

APATAGSRNMSTTLLSAFYDVDFLCKTEKSLANLNI^NNMLDKKAVGT . 

P/VAAAPSSGFAPGFLRRHSASNLHALAHPAPSPGSCSPKFPGAANG 

S SCGS AAAGGAVGGRRTALLNKENKFRDRS FSENGDRSQHLLHLQQQ 

QKGGGGSQINSTRYKTELCRPFEESGTCKYGEKCQFAHGFHELRSLT 

RHPKYKTELCRTFHTIGFCPYGPRCHFIHNADERRPAPSGG/ASGDL 

/RPTSRTPPPPSCSSASSCSSSASSCSSASAASTPSGAPTCCASAPA 

AAAAALL YGTGGAEDLLAPG APCAACS S AS CANNAFAFG PELS SLI T 

PIAIQTHNFAAVAAAAYYRSQQQQQQQGLAPPRAP/APPSATLPAGA 

AAPPSPPFSFQLPRRLSDSPVFDAPPSPPDSLSDRDSYLSGSLSSGS 

LSGSESPSLDPGRRLPIFSRLSISDD 

1797 

A 

731 

2553 

GAR S E AAEFQQS AS CRRLRGGGG PGTPGRGG ALLASLLP PCRTP PD P 
PDGSCRCTRPLLSPLGKLSDPPRPSPVIRGGSSPATPPQGLLPTSRP 
AAPATAGSRNMWTTLVS AFYDVDFLCKTEKYLANLNliNNML\ DKKAV 
GTPV\AA\APSSGFAPGFLRRHSASNLHALAHPAPSPGSCSPKFPGA 
ANGSSCGSAAAGGPTSYGTLKEPSGGGGTALLNKENKFRDRSFSENG 
\DRSQHLLHLQQQQKGGGGSPDQIPTRYKTELCRPFEEERARAQYGE 
K\CQF\AHGF\HELRSLTR\HPEVQDRSCAAPFHTIGFCPYGPRCHF 
IHNADERRPAPSGGASGDLRAFGTRDALHLGFPREPRPKLHHSLSFS 
GFPSGHHQPPGGLE\SPLLLDSPTSRTPPPPSCSSASSCSSSASSCS 
SASAAST\ PSGAPTCC\ ASAAAALRLLYGTGGA\ *DLLAPGAPCAAC 
SSASCANNAFAFGPELSSLITPLAIQTHNFAAVAAAAYYRSQQQQQQ 
QQQGLAPPAQPPA\PPSATLPAG\AA\APPSP\PFSFQLPRRLSDSP 
VFDAPPSPPDSLSDRDSYLSGSLSSGSL\SGSES\PSFDPGRRLPIF 
SRLSISDD 

1798 

A 

146 

405 

RKLDVYFEYEEKIMSKTTLDKSLLDIISDPDAGTPEDKMRVFLIYYI 
S TQQAP S E \ AFTKMAS APAS YGS TTTKPMGLL S RVMNTG 

1799 

C 

47 

235 

MXVXCNQIKXLVSYRAINRPDITDTEMETVMDTIVDSLFCFFVTLGA 
VPIIRCSRGNSSKKW* 

1800 

A 

46 

2100 

PAPGLPVLPRVEVFLEEPGSGSWEPRWRRRRQRQQQQ\QPSFRKDSQ 
LLSCVYCLSMCLILKTAQEHPYGRYSIJ4HRFGQDIFSPLLSVRELRD 
MGITLHLLLHSDRDPIPDVPAVYFVMPT\EENIDRMCQDLRNQLYES 
YYLNFISAISRSKLEDIANASVRGLSA\VTQVAKVW\DQYLNFITLE 
DDMFVLCNQNKELVSYRAINRPDITDTEMETVMDTIVDSLFCFFVTL 
GAVPI I RCSRGTAAEMVAVKLDKKLLENL \ RDARNS LFYRVDTLGAG 
HF \ S FQRPLLVL VDRN IDLATPLHHTWTY * ALVHDVLDFHLNRVNLE 
ESSGVENSPAGARPKRKNKKSYDLTPVDKFWQKHKGSPFPEVAESVQ 
QEL\ ES YRAQE\DEVKRLKS IMG\ LEGEDEGA I SMAF/ SDNTAKLTS 
AVS S LPELLEKKRL I DLHTNVATAVLEH I KARKLDVYFE YEE K I MS K 

ttldkslldiisdpdagtpedkmrlfliyyistqqars\eadlqqyk 
kaltdagcnlnrsryijcq*raft\kmasa\ * agyg \ stttktmglls 
rvln\tgsqf^m\egVknlvlkqqnlpvt\rild\nlmek\ksnprk 
lmd yryfdpknacggnds s \ vpqklki pfprghs ffvggeegnyi \ e 
yqnl\cdyikgkqgkhil\ygc\selfn\atqfikqlsqiigqk 

1801 

A 

1 

3 94 

AGGRAASCLQSLTARGPSLQSRPGAFAGCGGISFCLRLGLLFLFADV 
DPLS FRDSGTPWFSSSNDPEGMEFS S PSREECPKPLSRVS IMAGSL 
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TGLLLLQAVS WAS GARPC I PKSFGYS S WCVCNATYC 

1802 

A 

3 

538 

RIYIFRV/PMASCD/FSIRT/YTNADTPDDFQLHNFSLPEEDTKLKI 
PL IHRALQLAQRP VS LLAS PWTS PTWLKTNG AVNGKG S LKGQPGD I Y 
HQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGLLSGYPFQCLGFT 
PEHQGGSLKAAAGVPRHPDDSYGTSQEKWQLLKEKMFEPKK 

1803 

A 

192 

2035 

GRYLHPCFCLVDPLSFRDSGTPVVFSSSNDPEGMEFSSPSREECPKP 
LSRVSIMAGSLTGLLLLQAVSWASGARPCIPKSFGYSSWCVCNATY 
CDSF\DPPTFPALGTFSRYGE/STRSGRTGWSLSMG\PIQANHTGTG 
LLLTLQPEQKFQKVKGFGGAMTDAAAIiNILALSPPAQNLLLKSYFSE 
EGIGYNI IRVPMASCDFS IRTYTYADTPDDFQLHNFSLPEEDTKJUQD 
TPGFHRALQLAQRPVSLLASPWTSPTWLKTNGAVNGKGSLKGQPGDI 
YHQTWARYFVKFLDAYAEHKLQFWAATAKNEPSAGLLSGYPFQCLGF 
TPEHQRDFIARDLSPTLANSTHHNVRLLMLDDQRLLLPHWAKVVLTD 
PEAAKYVHGIAVHWLDFIAPAKATLGETHRLFPNTMLFASEACVGS 
KFWEQSVRLGSWDRG\MQYSHSIHQQTSWYHVVG\WTAGNLAIiNPEG 
\ GPNWVRNFVDSP 1 1 VDITKDTFYKQPMFYHLGHFKQSS I PEG\ SQR 
VGLVA\ SQKNDLDAV\ ALMHPDGS AWWIiNRSSKDVPLTIKDPAVG 
FLETISPGYSI\HTYLW\RRQLDGADYSRRHWGSAGAFKGTESAHTL 
SVTKEGTAGPV 

1804 

A 

113 

1799 

PSAYSYGRYLHPCFCLVDPLSFRDSGTPVLFSSSSDPEVMBFSSPSR 
EECPKPSGRVSIMAGSLTGLLLLQAVSWASGG/RPCIPKSF/SYSSV 
VCVCNATYCD\FDPPTFPALGAFSRYKSRSSGHWMELSTG/PIQANC 
TGTGLLL I LQPE\ FQKVKGFGGAVTDAGALNILALS PPAQNLLLKWY 
FSEEGIGYNIIWVPMAS*DFSIRTYTYADTP\DDFQLHNFSIiPEEDT 
KLK I PL I HRALQL \ AQRPVS LLAS P W \ TS PTRLKTRGAGNGKGPLKG 
QPRD I YHQTWAR Y I VKFLDAYAEHKLQFWAVTAENEPS AGLLSGYPF 
QCLGFTPEHQRDF IARDLGPTLANGTHHNVRLLMLDE>QRLLLPHWAK 
VVLTDPEAAKYVHGIAVHWYLDFIoAPAKATLRETHHLFPNTMLFASE 
ACVGSKFWEQSVRLGSWDRGMQYSQ\ S 1 1 KKLPVPMWVGWEPNW/ IT 
PSL*NITQATRFNKQP\MFLPPLANFSKFIP\EGS\QRVGLVA\SQK 
ND \ LDAVALMHPDGS PWWLNRS S KDVPLTI KDPAVGFLET I S PGY 
SIHTYLWRRQ 

1805 

A 

22 

424 

ALGMAHITLFFFFLLLFCDSLALSPRLQCSGTISAHCNLVPPGFKQF 
S CLS LLGS WD YRCMP PC / RWLTFVFL VETG FHHVGQAGLELLTS GDP 
PALA/ FPKC * D YRR\ DPRAWALFVFLT * F FSKLKYHKAKEKWS 

1806 

A ; 

14 

253 

LIPCGPQLFNCLSL*PGFWAMVKFAWVQYVRSCLSSSGCLKESRSSC 
SESGGDHHPLSSTSLPLSLFMLCKEVLELSGR 

1807 

A 

318 

455 

GGLPLGVETLAGEHLRPCSSPATSLPGASLSQEWWSPPDSETRRS I 

1808 

A 

1960 

2150 

CFVT\ SNLKCSK* GRAWWF I PVI STLWEAKVGGSLEPRSLRLQCAMI 
APLYCSLGDRVRPYLLK 

1809 

A 

2 

1345 

GWPPGLLAGEGV CQLLRHS SPGRCLLKSRARGSVIMSRYGRYGGET 
KVYVGNLGTGAGKGELERAFSYYGPLRTVWIARNPPGFAFVEFEDPR 
DAEDALRGLDGKVICGSRVRVELSTGMPRRSRFDRPPARRPFDPNDR 
CYBCGEKGHYAYDCHRYSRRRRSRAENLRR*SP*Q*IWLTIGAPLRF 
RKRNTRLTTFPTRKI FLRQ S S LTWLI WLS V * S S LD * RKHLDAAIGYR 
YF*IV*IY*TMGQRGSTCKLARFMLNTHYSVLYYVMLSCNSAFNKSF 
FR* * KKKYSTTNRPRVYFQMRH* I * IVLRFDFSRGTQTLKNBLLSDI 
LFFL*LEK*\SRSRSHSRSRGRRYSRSRSRSRGRRSRSASPRRSRSI 
SLRRSRSASLRRSRSGSIKGSRYFQSP\SRSRSRSRSISRPRSSRSK 
SRSPSPKRSRSPSGSPRRSASPERMD 

1810 

A 

1 

840 

WPPGLLAGEGVCQLLRHSSPGRCLLKSRARGSVIMSRYGRYGGETK 
VYVGNLGTG AGNGELY\ RV I R * YGPLRTVW I ARNPPGFAFVEFEDPR 
DAEDAVRGLDGKVICGSRVR\VELSTGMPRRSRFDRPPARRSFDPND 
GCYECGEKGHYAYDCHRYSR\RRRSRSRFRSH\SRSRGRRYSRSR\S 
RSRGRRSRS AS PRRSR/ S P I SLRRSR\ S AS LRRSRSG S I KG\ SRYFP 
IPRRRSRSKIPGLFHGPRSSRSK\SRSP\SPKRSRSPSGSP\RRSAS 
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P\ERMD 

1811 

A 

20 

701 

DHASGQSTASSGPDSVSGQLQPSQPNADQGKLTTMRIAVICFCLLGI 
TCAIPVKQADSGSSEEKQLYNKYPDAVATWIi\NPDPSQKQNLIAPQM 
GCVL*RNQ*L*TR\TLPSKSN\ESHDHMD\DMDD\EGD\DDHVDSQD 
SIDS\N\DSDDVDDT\DDSHQSDESHHS\DESDELVTGFSTDLPATE 
VFTPWPTVDTYDGRGDSWYGLRS KS K\ KFRRPD I KYPDATDED I 

1812 

A 

121 

1206 

LIAGSTHACAHASGRAQHRRDQTRLKASCSLLSQTPTKENSLPRELP 
VICFCLLGITCAIPVKQADSGSSEEKQLYNKYPDAVATWLNPDPSQK 
QNI>LAPQ\TLPSKSN\ESH\DHM\DDM\DDED\DD\DHVGTARDSID 
SNDS\DDVDDT\D\DSHQS*WSLHHS*WNliD\ELVTGFFPTGPCPGN 
RS FS I PVCS PTVDT YDGRGD \ SWYGLRSKSK\ KFRRPD I QYPDATD 
EDITSHMESEELNGAYKAIP\VAQDLNAPSDWANRGKDSYGTSK\LD 
*QSAETHR\HQQ\ SRLYKRKANDESNEHSDCDW * artfpkvsre\ fh 
S\HEFSS\HGDFACL*PPKSKEEDNTPLEFRYSPGIRMWHFWGSI 

1813 

A 

171 

461 

G S LDLWRGAELS PGHS TLFTLCACAKVKAAVKYALSVGYRH IDCAAI 
YGNEPEIGEALKEDVGPGKAVPREELFVTSKLWNTKHHPEDVEPALR 
KT 

1814 

A 

1 

235 

EEAPRNSRKVIS IPKS ITPSRILQNIMVFDFTFSPEEMKQLN70.NKN 
WRYIVPMLTVRMYQPPRLGECEIWGGILAQV 

1815 

A 

2 92 

1396 

AQATGPYSRICACAKGAMAAS\CVLLHTGQK\MP\LIWSWGTWK\SE 
PGQVKAAVKVLPLALGYRHI\DCG\VIYGNELEIG\EALKGRTVG\P 
GKAGCLRGRKPGFVTSKLW\NTKHHPEGMWSIiPIiRKDSGLTFQIiE\Y 
LDLYLMHW\ P YAFERGDNP FPKKCDWN I WLDS PH \ YKET * RALKACV 
AKGLVQA/ VWGL SNFN S RQI \ DD I LS VAS VRPAV\ LQ\ VECHP I TGL 
KMRLI A\ HCQGTWAWR * TAFNPLGLPLNRAW\ RDP\ DEPVPAGGNPV 
VLGI GLKKYGRS PS SESCLRWPGPSGKVI \ CI P\ KS ITPFSNPFRTF 
KVFDF\TFSPEE\MNQLNA\liNKNWRYIVPMLT\VDGK\RVP\KDAG 
HPLYPF\NDPY 

1816 

A 

133 

402 

LLTSLVNSRILILFINSKKIFAIIIFSTRGGILRITAVIW\NNISVT 
HGNGDMALAQYSMP VPA * AIGRR I LVML YPSRTEAFEKFL I RC 

1817 

A 

356 

463 

NFFFFSFRRPDAPVPDGESEKTVEESSDSESSFSD 

1818 

A 

81 

728 

TRGPPPAEEMDEDGLPLMGSGIDLTKXVPAIQQKRTXVAFIiNQFWH 
T\VQFLNRFS\TVCEEKLADL\SLRIQQI\ETTLNIL\DAK\LSSIP 
G\LDDVTV\EVSPFKLSPSVTNG\AHP*RPIiSE\QPQPEPVPPGLLD 
LQES*SIQAGNFL*L*PKDP\RYARYLKMVQ\VGVP\VMAIRN\KMI 
SEGL\DPDLLERPDAP\VPDGE\SE\KTVEESSDSESSFSD 

1819 


1214 

1565 

LKEITDEMVYRTLLHSHRIKMVSPIFPFSTNTVPFFPCYNPFFMNIQ 
EMTKVTASRLFLFFVDLLQGVQPCFLCCCLCS IWFCNEHI.DL* * ASD 
FVMCMCVYMHIYTPIHV*YIHYIYVDTY/MEVCIHLYII*CV 

1820 

A 

1 

1044 

MAEKFDCHYCRDPIiQGKKYVQKDGHHCCLKCFDKFCANTCVECRKPI 
GADSKEVHYKNRFWHDTCFRCAKCLHPIiANET/ FCGQGQQDP\CNKC 
TTREDSPKCKGCFKAIVAGDQNVEYKGTVWHKDCFTCSNCKQVIGTG 
SFFPKGEDFYCVTOTETKFAKHCVKCNKAITSGGITYQDQPWHADCF 
VCVTCS KICLiAGQRFTAVEDQ Y YCVDC YKNFVAKKCAGCKNP IT/ GEK 
DCVKSEPPSL* S *EAPS VPRETIiASHPVSQRQPPGQASGWREDLSLV 
GGGSL*KKSKLSSSSWPGFGKGSSWAYEGQSWHDYCFHCKKCSVNL 
ATKRFVF PQEQVYCPVCAKKL 

1821 

A 

235 

1119 

GPSSYKVGTMAEKFDCHYCRDPLQGKKYVQKDGHHCCLKCFDKFCAN 

TCVECRK^IGADSKEVHYKNRFWHDTCFRCAK\CLHPL\ANETFCG\ 

QGQQRSCATSCTT\REGLPPSAKGCFKAIVA\GDQNVEYKG\TVW\H 

KD\CFTCSN\CKQVIGT\GSFFP\KGKDFYCVTCHETKFAKHCVKCN 

KAIASWGVTY*DEPWlb^GFVCVTCSKKIiAVQHFTTVEDQ*YCVD 

K\NFVAKKCAGCKNPITGFGKGSSWAYEGQSWHDYCFHCKKCSVNL 

ANKRFVFHQEQVYCPDCAKKL 

1822 

A 

222 

622 

KCSSSKHFTKEDSQITNKHIEKCSS*LLVREMQIITTKSVSAIHQNG 
* NENTKQTCQ / D I DNDMQQWE FMR * E WANW * N * KTNWQ * LLRLDKCV 
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S \ YDPAI PFLD I S PTERHI YAYHKTC IRMFKATLFKI APNI 

1823 

A 

75 

1402 

VRRRTLS SRRWHRLSHGPRWL. PQVLTAS PPLQARGAFRS FPHS WGBD 
FLASLMFKIQLEPLKLRAWTLNGFVKFRNKETSAGPVAVMGKDYYKI 
LGIPSGA\NEDLIVCAYRTMALKYHPDKNKEPNAEEKFKEIAEAYDV 
LSDPK\KRGLYDQYGEEGLKTGGGTSGGFRGF\FHYTF\YGDPHATF 
AS FFGGSNPFD I FFASSR\ ST\ RPFSGFDPDDMDVDEDEDPF/GALF 
GRFGFQWG*VGGPR\RAPGTIVTLGRQGCRDPPW\HELRVSLEEIY 
HGSTKRMKITRRRLNPDGRTVRTEDKILHIVIKRGW\KEGTKITFPK 
EGDATP\DNIPADIVFVLraKPHAHFRRDGTNVLYSALl\SIiKEALC 
GCTVN\ I PLFD \ GRG I PLPCND V I KPGTVXRLRGEGLPFPKVPTQRG 
DL I VE FKVRFPDRI/TPQTRQ I L \ KQHL PCS 

1824 

A 

245 

487 

TPPISQLLKEEEDLSKAMSQDGASQFQEVIRQELELSVKKELEKILT 
TAS SHE FEHTKKDLDLTVQQ I EHLNKT YNTD VP 

1825 

A 

63 

1820 

RVDKGGLAAGLRPLGRGSRCACVREEREREGRLRGDFQPASLLSRGA 
INAPNFPACLKEEEDLSKAMSQDGASQFQEVIRQELELS\VKKELEK 
ILTTASSHEFEHTQKKTWMDFRKLFHR\FLQRKGAFLWNIjGEKFQRP 
P *DS IQPYEKI KARGLPDNI SSVLNKLV\ VVKLNGGLGTSMGCKGPK 
SLIGVRNENTFLDLTVQQIEHLNKTYNTDVPLVLMNSFNTDEDTKKI 
LQKYNHC\RVKI\YTFNQSRYP\RINKES/LYFPVAKDVSYS\GEN\ 
TEAWYPP \ GHG \ D I Y\ AS F YNS GLLDTF I GEGKE YI FVSNI DNIX3AT 
VDLYILNHLMNPPNGKRCEFVMEVTN\KT\RADVKGGGDNSLQYGRA 
NLRTWWEIAQVPK/AHHVDEFKSVSKFKIFNT\NNLWISL\AAVKRIj 
QEQNAIDMEIIVNAKTLDGGLNVIQLETXAVGAAIKSFENSLGINVP 
RT\RFLP\VKTTSDLLLVMSNLYSLNAGSLTMSEKREF\PTVPLVKI 
R P VL FRKVQD \ YLKG FE K\ I QNRLE LDHI* \ TVSGRCDHIiEKNG F I KG 
EPVIHPLQIHGDR\MDIPTWEPVFREPRIiVSGNIi\RILDH 

1826 

A 

48 

239 

GRAETMSDI EEWE\ E Y\ EEEGQEEAAVEEEEDWREDEDEQEEAAEE 

DSEAEA*D/ tretraeede* yedasd\aedgpmeeselkp\ws *rpn 

LVL\P*VLI*FIWYVDLHRLCME*DE 

1827 

A 

78 

359 

RSNVTITRGCEPGASACSRCCPQGAPALLTLDPQKPPTFTMSDEEVE 
QVEEQYEEEEEAQEEEEKPRPKLTAPKIPEGEKVDFDDIQKKRQNK 

1828 

A 

3 

327 

CEKTTEGEKVDFDDIQKKRQNKDLMELQAIilDSHFEARKKEEEELVA 
LKERIEKRRAERAEQQRIRAEKERERQNRLAJBEKARVQMIAKFSKKA 
GTPAKGKVGGRWK 

1829 

A 

1 

429 

RAEVALKKKKAIiSSMRAHYS\SYLAKADQKRGKKQTAREMKKKILAE 
RRKPIiNIDHLGEDKIiRDKAKELWETLHQLEIDKFEFGEKLKRLKYDI 
TTLRSRSWDSTSSSPHNPVRGSLTVLGVERPSRGVPRVCVLAAFIPW 
GL 

1830 

A 

3 

986 

HTPATQSIiANGIiLGRSNVTITRGCEPGASACSRCCPQGAPALI.TLDP 
QKPPTFTMSDEEVEQVEEQYEEEEEAQEEAAEVHEEVHEPEEVQEEE 
KPRPKLTAPKIPEGEKVDFDDIQIOCRIiNKDLMELQALIDSHFEARKK 
EEEELSALIjERIEKRRAE\RAEQQRIRAEKERERQNRIi\AEEKARRE 
\REDATRRAEDDL\KKKKALSSMGAN\YSSYLGPRIiDQKRGKK\QTA 

\remkkkilae\rrkplqpsihlgerqn*gdqgqrsfwetlh\qled 

LTKFEF\GEKLKR\QKY\DITTLQEPECRCWPESTSKEGLGPPAKGk 
V\GGRWK 

1831 

A 

7 

1382 

PPGLEARPAPARLAGSGVCSGGRGRGAGRRSRRQSMRGAARAAWGRA 
GQPWPRPPAPGPPPPPLPLLLLLLAGLIX5GAGAQYSSDRCSWKGSGL 
THEAHRKEVEQVYLPCAAGAVEWMYPTGALIVNLRPNTF S PA\ RHLT 
VCIRSFTDSSGANIYLEKTGELRIiLVPDGDGRPGRVQCFGLEHGG\L 
FV\ EATPQS / QDIGRRTTGFQYELVRRHRASDLHELSGECPARSS SS 
SSSSSSSSSPPARAANSHLKWRWSQRCLDVDTLPDLALIiSVRIL*RW 
WAAFQSLSRPGCFLTLPFFS\APCRPCSDTEVI J LAVCTSDF/A*VSP 
RQLLSSSSSSSSSSSSSSPHLLPRTAVRGSIQQVTHEPERQDSAIHIi 
RVSRLYRQKSRVFEPVPEGDGHWQGRVRTLLECGVRPGHGDFLFTGH 
MHFGEPR\ LRCAPKASRTFQRM YR\ DA\ QERGLNPLVGWQRN 
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1832 

A 

1 

433 

NNPDFKAGV / M AT.PTL / T iO TOUHnnvT xtmt waTDTTunroT T>/"vr*7\"i7-A 
iviM cue £\n\j v f ri/Uir xxi/ uvAylUlUJUXJj VWJUivftXKJ.JjVyiiKJUiyjL'AVA 

KANQTKEGLPVALDKHI LG FDTGDAVIjNEAAQI LRLLH I EELRELQT 

KINEAIVAVQAIIHFWHVWKSKCHILGGGSPENWVCSRDLPPLLIAF 
FFNKV 

1833 

A 

1 

459 

YLRDVDCPFKIQDRQEAIHWLLGIAWLEYGDNVNNPDFKAGVMALA 
NLLQIQRHDDYLVMLKAIRHiVQERLTQDAVAKANQTKEGLPVALDK 
HILGFDTGDAVIaNEAAQILRIiLHIEELRELQTKINEAIVAVQAIIAD 
PKTDHRLGKVGR 

1834 

A 

2 

1108 

SFRSDSAPAJRPLAASPVPAPPAPPRFFSPGRGPGDQSEKRWTMFRRK 
LTGSSTTYV SPFVFMRPDFTPPT?MPT \ VTMT pnnvTDUWTpntjri\ vrr 

RNIHSSDWPKVL\EKYFK\DVNCPFKIQD\RQE\TIDWLLGLAV\RL 
EYGR* WLKNTKDLVP * * FQNLLDNTATKN\AEPFDPFWDVNNP* F*GL 
VLLALG * TWLQI \ QRHDD\ FLV\MLKANSGFWVQEPP * PPGMPVCLR 
ANSNKRGA*PVAFRQTHILGFD\TGDASSLMKLLEILRIACT*EELR 
ELTDQKSTKAIVAVQAIYC*SKRQDHRLGKSLEDEHFEDIiQLLTYFR 
Y^GNHTLLACFGKSKCHNSRGEGSPEKLGYGSRGFYHHWLIASCFF 
L 

1835 

A 

1 

891 

MYQYQNQATQQECS I CLEHGMKI KSES QVLAKGRKVAKGGE S EWVEG 
GEGREEKKHSRNIYQRPAVSQWAVYGTLSDLLSVASSKLGIKATSV 
YNGKGGIilDDIAIilRDDDVLFVCEGESFIDPQTDSKPPEGLLGFHTD 
WLTLNVGGRYFTTTRITPTRQIFAVQILNELISGSVLNCANLQGVKM 
LCSNAEGASLKLCNFEDPSGLKANLEGANIiKGVDMEGSQMTGINLRV 
ATLKDAKIiKWCl^RGATIiAGTDLENCHDLSGCDLQEANLRGSNVKGAV 
FEEMLTPLHMSOSVR 

1836 

A 

1 

984 

MAAEPVEDNCINFVAMKFIDNTLYFIE1STLESDYFGKLESKLSVIRNL 

NDQVLFIDQGNRPLFEDMTDSDCRGQKGGEGRRIQVGGGGEGREEKK 

VGGGPGGRWAHSGPSGGSVMRRMTLFLNGSPKNGKWVVYGTLSDIi 

LSVASSKLGIKATSVYNGKGGLIDDIALNRDDDVLFVCEGEPFIDPQ 

TDSKJPPEGLIXJFHTEWLTL^ 

DKGVWGTNLEGVl^KGVDMEGSQMTGINLRVA 

TLAGTDLDNCDLCGOTLQETUJLRGSNMKGAIFEEMIiTPLYMSQSVR 

,1837 

A 

1 

1917 

. MKLMETLNQCINAGHEMTKAIAIAQF1TODSPEARKITRRWRIGEAAD 
LVGVS S QAIRJDAEKAGRLPHPDME I RGRVEQRVGYTI EQ INHMRDVF 
GTRLRRAEDVFPPVIGVAAHKGGVYKTSVSVHLAQDLAl.KGLRVLLV 
EGlTOPQGTASMYHGWPDI^IHAroTLLPFYLGEKDDVTYAIKPTC^ 
PGLDI I PSCLALHRI ETELMGKFDEGQLS TDPHLMLRLAI ETVAHD Y 
DVIVIDSAPNLSIGTINVSSRCTRCCIVLYVWSTAIiDYDSSSSSSTY 

1 * V W V xxv xx/ v x v iijiviHiJloUrooANL/t'o VKL IH VHWHrLb I x bRKF 
DLRSINPCALDRIELSYLAIAHKINSIjGLQQCSKESLIDTRIiATREK 
KFLLKSVPLKAQRLRFFKDNLHSRNIYQRPAVSQWAVYGTLSDLLS 
VASSKIX3IKATSVYNGKGGLIDDIALIRD0DVLFVCEGESFIDPQTD * 
S KPPEGLLGFHTDWLTLNVGGR YFTTTR I TPTRQ I FAVQ I LNEL. I SG 
SVLNCANIjQGVTC>1IjCSNAEGASLKLC^ 
MEGSQMTGINLRVATLKDAKliKNCl^ 
EANLRGSNVKGAVFEEMIiTPLHMSQSVR 

1838 

A 

2 

1411 

FVGKGPRQAEDSRCGAGRRTGRTLGEGQRACVWCVPKGRKVAKGGES 
EWVEGGEGREEKKVGGGPGGRVAAHSGPTGGSAMRRVTLFIiNGSPNN 

nTCACJAdYdTT^ DT.Ti^nJfSQQTTDnT VH. r nn\r\rnr , vnr i T tt\t\ t tv t t Dnnn\; 
vj^vrvvj/-vv? x vj a JJ0xyxJXJOV7V3OOJXtrutX JVA.X J>l V jl rJ LjiV»oo Lj1.ULJX/\xjX i\-IJULJ V 

LFVCEGEPFIDPQTDSKPPEGLLGFHTDWLTlxNVGGRYFTTTRSTLV 
NKEPDSMLAHMFKDKGVWGNKQDHRGAFLIDRSPEYFEPIIJjJYLRHG 
QLIWDGINLLGVLEEARFFGIDSLIEHLEVAIKNSQPPEDHSPISR 
KEFVRFLLA/TPTKSELRCQGLN 

\ FAHANLC *A1^ERTDLYGSVLDCANLQGVKMLCSNAEGASLKLCNF 
EDPSGLKANLEGANLKGVDMEGSQMTGIi^^ 

ATLAGTDLENCDLSGCDLQEANLRG SNVKGAI FEEMLTPLHMSQS VR 

1839 

A 

1 

320 

P RRRW ALLLS ESS GLGGVHVVLSLGRVRAAARVG P AVG VC S VEMGRK 
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LEPGSSSHSLPAPAPRHHIiliVGSALMPQLSLQRDLDFTIDLDFKGQL 
CELSCSTDYRMR 

1840 

A 

1 

1430 

MAAAEAANCIMELPRAFGIRPSGGSYPPSHVEEGWGRFRPGPHVAAA 
RPPPRGPGHTPWGVIDLGPSTMWGVSWEEQCFSALYQPPSELRGHLL 
GYRTRFCAFWVSCGQAESSEKPNAEDMTSKDYYI*LHTHTFGIHEEM 
LKDEVRT\LTYRNSMFHN\RHLFKDKVVLDV\GSGTGID\CMFCCQG 

OTjIT3DTT\rTrfTPPC!CVOe#T T*TD * 7f\ TV \ TaTVT DTiCOrDTtri t vn vn/fninm 

]?WFKKVlUlLLobxoo w LWR w KtVQA\NKLRP^ 

LPVGERWD I IHQRVGWGYCLFLiTESM\ LQHRALMPRDKWLAP\DGL\ 
IFPD\RAQLYVTAIRGTGRYKDSRSHLIiGENVY\GFDMS\CIK\DVP 
IKEPLVDWDPKQLVTQRLAFIKEV\DIYT\VKVEDL\TFNIiPRFCP 
CKLKRN*LTCTALVTLLSTFEFTHCHKRTGGtGFFHQPPRSP\YTHW 

LDFKG\ QLCELS \ CSTDYRMR 

1841 

A 

1 

45 

RIDYDEFLEFMKGVE 

1842 

A 

2 

580 

GRVGGRVGCE PP AW IDI YKAAGRS S FEQE * ARKMS S * AAFRTS FVLG 
A\EDGCISTQGSWGKVMRMHGPEPHPMRELQEMIDEVDEDGSGT\VD 
\FDEFLVM\MVRCMKDDSK\GKF*GRSLSDLLPACFDQKMLDGYI\D 
L\DELED*LLQATGRDPFTEDDDIEEL\MKDGDK\NNDGRID\YDEF 
L \ E FMKGVGVDA 

1843 

A 

481 

1479 

EERLG ED I FRSGTL WPNAGGRAFS QAVYCLNGF I DTS VRLLDKI ERN 
TLTRQSSLPKDRGKRSRAFVFELSGEHWTELPDSLKEQTHLREWYIS 
NTLIQIIPTYIQLFQAMRILDLPKNQISHLPAEIGCLKNLKELNVGF 
NYIjKS I PPEIA5DCENIjEkIjDCSGNIjK1^EL 

NKFS S VP I CVLRMSNLQWLD I HRHNIiADLAED I DRLEELQS FLL YKN 
KLTYLPYSMI^LKKiTLLWSGDHLVELPTALCDSSTPLKFVSLMDN 
PIDNAQCEDGNEIMESERDRQHFDKEV/TESLY*SP*RKRICSQLYH 
yo v v 

1844 

A 

2 

369 

VKDLEHTKM I TPTVKMHTMS S SHLF YLALCLLTFTS S ATAGPETLS S 
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSPFRSCDL 
RRLEMYCAPLKPAKSARSVRAQRHTDMP 

i a a c 
lo4 j 

A 

28 

479 

P I H r QT Jj o S KAMlj Jvl S S L» PTQIj FKCC FCD FL* KVKTTHTMSSSHL»r xJjA 
JLGLLTFTSSATAGPETIiSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSSSSSSSSS PFRSCDLRRLEMYCAPLKPAKS ARS VRAQRHTDMP 
KTQKYQPPS 

1846 

A 

4 

458 

NHS I FRLCTSEAMGKI S SLPTQLFKCCFCDFLKVKMHTMS SSHLFYL 
AXjCJLiJjTr I poAi AQjPhf I LbbSbbbbbSbbSSbbbbbbbbbbbblbbbb 

SSSSSSSSSSSSSPFRSODLRRiEMYCAPLKPTUCSARSVT^QRHTDM 
PKTQKYQPPS 

1847 

A 

90 

769 

HCSQHLPSLWISFCFIIPANQFIFRLCTSEAMGKISSLPTQLFKCCF 
CDFLKVKiraTMSSSHLFYLALCLLTFTSSATAGPETLCGAELVDALQ 
r V LoL'Ror i r WKr lOibbboRKAPQTGiVDECCrKSClJbR 
PLKPAKBARSVRAQRHTDMPKTQKYQPPSTNKNTKSQRRKGWPKTHP 
GGEQKEGTEASLQIRGKKKEQRREIGSRNAECRGKKGK 

l n a a 

TV 

^£ J Jl 

Q A Q 

nLbynijrobWlDrLrlir/irjyr Xr KXjI_1oe»/U^oJvXoo1jP XijJj.ri\A-L,r 
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ESRCQENFFLVLGIDAAQKNLSGGQGHIMEDCDLKEPEGLLTVSSFC 

LQHCKAL IQTKPQGQAFVPHGCGQEVAPLACE S LWVRHKGS AG AE I R 

BGRCGEAPLDCEQAQVRGQVRSTVGVQGQTGPASTKMSATSWFLVSS 

SG ARHRLPREL I FVGREECELMLQCPCAQCKPGTEL VP S EQALNDAG 

TWTQAGLPAVAVTPWAQAWQETTAWAGPDLLCPVHRAGLLRVPVA 

ANGELGDCHWGGQEPCSCGCQLRRRNLNRVDLESRSVDKQHAVINYD 

QDRDEHWVKDLGSI^GHACLMKRLSI^GCTASGRSSAC^ 

ERTFVNDMRIPDQKYVTLKLNBVIRFGYDILPLSVPPPGLLLSPQDH 

SHPVTPE^GLSLLGKRPLLCLPRWALELAHLLLGVDFSAGLLALLLC 

VRIOTCPHSNMYVX.ERVQHRVPEEALKHEKYTSQLQVSVKGLAPKRS 

EALPEHTPYCEASNPRPEKGDRRPGTEAASYRTPLYGQPSWWGEDDG 

STLPDAQRQGEPYPERPKGPVQQDGELHGFRAPAEPQGCSFRREPSY 

FEIPTKETPQPSQPPEVPAHEMPTKDAEAGGGGAAPWQSHASFTIE 

FDDCS PGKMKI KDH I TKFS LRQRRPPG KE ATPGEMVS AETKVADWLV 

QNDPSLLHRVGPGDDRHSTKSDLPVHTRTLKGHKHEIX5TQSDSEDPL 

AKAAS AAG VPLEASG EQVR LQRQ I KRD PQELLHNQQAFV I E F FDEDT 

PRKKRSQSFTHSPSGDPKADKRRGPTPADRDRPSVPAPVQAGGRSSG 

PQRAGSLKREKTEERLGSPSPASRTPARPFGSVGRRSRLAQDFMAQC 

LRESSPATRPSPEKVPPVLPAPLTPHGTSPVGPPTPPPAPTDPQLTK 

ARKQEEDDSLSDAGTYT IETEAQDTEVEEARKM IDQVFGVLES PELS 

RASSATFRPVIRGDRDESDDGGVAQRMALLQEFASRPLGAAPQAEHQ 

GLPVPGS PGGQKWVSRWA3LADS YSDPGLTEDGLGRRGGEPEGS LP V 

RMRRRLPQLPSERADSPAGPESSRRSGPGPPELDSEQPSRLFGQEEL 

DPDSLSDASGSDGGRGPEPGVEPQDSRRRSPQEGPTWSRGRRSPRAP 

GEPTPAS FF IGDQNGDAVLSRKPLAAPGDGEGLGQTAQP SPPARDGV 

YVSANGRMVI QLRPGRS PEPDGPAPAFLRQES FTKE PAS GPPAPGKP 

PHISSHPLLQDLAATRAARMDFHSQDTHLILKETET^^ 

NSVDAECEGGSTPRPPED7VLSGDSDVDTASTVSLRSGKSGPSPTTPQ 

PLRAQKEMS PSPPAAQDPGGTALVS AREQS SERQHHPLGPTDMGRGE 

PVRRSAIRRGHRPRGSLDWPSEERGPVLAHLPSSDVMASNHETPEAT 

GAGRLGSRRKPAAPPPSPAAREEQSRSSASSQKGPQALTRSNSLSTP 

RPTRASRLRRARLGDASDTEAADGERGSLGNPEPVGRPAAEQAKKLS 

RLDILAMPRKRAGSFTGTSDPEAAPARTSFSGRSVELCCASRKPTMA 

EARAVSRKAANTATTTGPRQPFSRARSGSARYTSNTRRRQQGSDTTS 

TSEEEYGSRHGSPKHTRSHTSTATQTPRAGSSSRARSRAPGPRDTDD 

EEEEPDPYGFIVQTAEXAEIARVSSLPCPSRLAVPTCSDSSLGFLAL 

GSHLQRQLPARPGSRFPPAASAPCPSWLAVPTCSDSSLPVLADGSHL 

QRQLPARPGSRFPPAASAPCASWLAVPTCSVSSLPVLADGSHLQRQL 

PARPGSRFPLAASAPCPSWLAVPTCSDSSLPVLADGSHLQRQLPARP 

GSRFPPAASAPCPSWLTVPTCSDSSLPVLALGSHLQRQIiPARPG* 

1892 

C 

377 

463 

MVGENNTQWGRSGSRWLGFSXPRPGXQPV 
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1893 

A 

753 

3000 

KLEPCGGTPGPRAGSGPCQDPHIPPEGVGGSPDGPGAWPAPNTHSAS 
WSWGPHPECPPPA*AADCRGTDRSES\PASPVCPLSPNSQRPPHLEA 
PGVTjPGGSSRTQPAV/ PTCSPPGI P/ AVPAIiCP * PG * KTT/ PI YTEA 
GKLATSAPGT/PPRQLSPGNAVYCLPPDTHTQPGRAGLAESAPYPEA - 
QPAGVGQEGDAGTE / PG PSCQTQH * PPQEPLLGPPPQL/ PPTSQQPE 
GGALHSWPQRWPRKPFPDP/PL/PVRPVWSAKPWNT*RRSPPGAPSP 
TRTAMGT\ TTPC PRP S PDS KHVPLALPG AP / VPAQ I LDLLPRQALRT 
EPSAPEPIAGYGDSPWLCGMAVSTGPVLPWIJINGPRTNHSC/CRSRN 
FAS ALHGAA\ PSKS F IRR* WGRTP * PAGCP F/ PGLG I VPRSTS S APT 
QLGSGNVAWGHVRVLPS FHSDGFLQPMS S SGSIiAQQSVWASLGSEQ 
QQVPMLPPASPGTPAARAGPVQFEQCLLQS PTWKPYP/ PARAFQAVP 
*SRMVCPEAGPRSQQ/PA*QPPGIiPPPPPATRVG/PPDVKGRGRLGS 
PGRPA/ PPACMS PEQLP IGPLLAWGLQDARPAGAGGRAFE* PPGGS S 
QRFPPSGT*EAE**QF/VNGDGPAPTFSCPMGKPASPCPVRAVPPAIi 
PDVL,TLGNTAALGGSLKAFGNL,QPSLDNS\PVP*\LLEGPTEPQPPR 
TTS APKECS S S PTMPGVGPWVTGTSGRPTFLS PFFS YEEHFKVLLFK 
EITVQQAAEDGFHHRPFPRNKQ 

1894 

A 

3 

576 

LTRI PFLGAKYAPVI FA/ EGA/ YQ * QRS * TEI QMACFKQATRWVKC/ 
DPRHG KYMA\ CCLL YRGD\ VFPKDVNAAI ATI KTKRS I QFVDWCPTG 
FKV\GINYSPPTWPGGDLAKVQRAVCMLSNTTAIAEAWARUDHKFD 
LMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGVDSVEGE 
GEEEGEEY 

1895 

A 

58 

1636 

LVGDGNPGPGVCSCRLRIiL I PYPLCGEC I \ S I \HVGQAGVQ I GNAC \ 
WELSCL*HGIQPDGQMPK*PKPLGEGDDSFNTFFSETGAGKH\VPRA 
VFVDLEPTVIDEVRTG\TYRQLFHPEQVITGKEDAANNYARGN\YTI 
GKEIIDLVLDRI\RKL\ADQCTGLQG\F\LVFHSFGGGTG\SGFTSL 
LDE\ RLSVDYWOESPSLEFS I YPGAPRFPOPWEP\ YNS ILTTHT\ T 
L\EHSGI*CPSWVENEAI YDICRRNLDIERPTYTNLNRLI \ SQIVSS I 
TAS L\ RFDGALNVDLTE FQTNLVPLPRHS TS LRPTYAP VNPP S * EKP 
TH\ EQAFCSRRS PKCFAFEPS QPRWLKCDPS PMGKY\MAC\ CLL YR\ 
GDWPK\DV\NAAIAHPSKPKRS\IQFVDWCP\TGFK\VAINYQPP\ 
TWPVGDLA\ KVQKTV\ CMLSNT\ TA I AEAWARLDHKFDLM\ YAKRA 
\FVHVTYLG\EGMEEGEFSKA\REDMAAL\RKDYEEVG\VDSVKG\EG 
EEEGKGILI IHSLFGPCSMSCSQNFSFSLTDRR 

1896 

A 

491 

1068 

VSRIRQNISTRR*RQACLTALV\NLYLQGLPPPTSLWGFYFD\RDDV 
\ ALE \ GVS \ HFFRRIGPRKKR\ EGYERS S * KMQKTSRGRPALFFQDN 
QDSQLED\ EWGKTPQTPMK\ AAMALEEKA* TQALFGS F I ALGFLPAR 
TPMLL*ASLETHF\LDE\EVKLIK\KMGDHL\TNIiHR\LGGPEAGAG 
AKYLF\ ERLTLKHD 

1897 

A 

2 

350 

SQVDR*QSEPESIRICREDHMERLQAFDANSRKQEAEWKEKAIKELB 
EWYARQDEQLQKTKANNRVA\ EKLSTNNPS LT * LVMSEEAFVND IDE 
SS PGTEWERVARLCDFNPKSLD 

1898 

A 

2240 

2492 

AVACVARQP PGCS KGVLHKPT YFWAS RAAEEAFVND IDES S PGTEW 
ERVARLCD FNPKS S KQAKDVS RMRS VL I S LKQAPLVH 

1899 

A 

1 

906 . 

ATAWS VGRLVFVSTGCVRAVQLPAMAELDP\ FGAP\ AGAPGGPALG 
NGVAGAGEEDPAAAFLAQQESE I AGIENDEA\ FAILDGGAPGPQPHG 
EPPGGPDAVDGVMNGEYY\QESNGPTDSYAAISQVDRDQSEPESIRK 
WREEQMERLE\SIiDANSPESKKQSWKEK\AIKGA*KEW\YARQDE\Q 
LQ\ KTKANNRVADESFLjJ^LR* RDWLCHKHKPSLLQPRTGQPEEAI* 
FK\DLEGLSPSNEWE\RVARIiCGIi*PPSIiSKQA\KDVSPHGASVLI\ 
SLKAGPRWGHLKSHPVETLHIiQYIiNPTQ 

1900 

A 

29 

852 

PSRSLVRWEFAPQRWLPGWSVGRLVFVSPVGVRAVQLPAMAELDP 
FG APAGAPGGPALGNGVAGAGEENPAAAFIaAQQE S E I AG I EN\DRAF 
AIL\DGGAPGPQPHGEPAGGPDAVDGVMNGEYYQESNGPTDS\YAAI 
SQVDRLQ\ SQPES \ IRKWREEQMERFGKPFDANSRKQEAEWKEKAI K 
ELEEWYAR\QDEQLPENQKANNR/AQTE7VRPL*NDID\ESSPRPLKW 
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GNGWPRAV*TLNPPKS * AKQAQKMS PPHDASVL ILPLKAGPRWCH 

1901 

A 

1 

585 

MKTIRSNQTVDIPENVITLKGCRVIVKGPRGTLWRDFNHISIELSLL 

FVEIRNFLGEKYIRRVRMRPGVACSVCQTQKHELILEGNDIELVSNS 
AALIQQATTVKNKHIRKFLDGIYVSEKETAQHIRKFLDGIYVSEKGG 
SYRATRO 

1902 

A 

2 

537 

GTLRRDFNHIN\VELSLLVKKKKRLRVDTMIiGQQKRNWPTRSGLFGS 
HVQDHDQGVLPLGFPL PRMRVLCM PH FPQS TW I PGRMG SSLLKI \ R 
NFLGVKNTI\RRVR\MRPGCCLVQYPQAQKDELILEGN\DIEL\VSN 
S AG\ L I QQATTV * KQG I SGNFLDG I YVS EKGTCSCRLMNKI 

1903 

A 

560 

898 

KCNTECFGSLMHFVWIiF 1 1 1 1 1 FLRQQGRSVT\RIjECSGAILAHCN 
LRLPGL S NS PAS AS RVAGTTGTCYR IQL I FVFLVETG FHYVGQAGHK 
LLT\ * VIHPPQPPKVLGLQV 

1904 

C 

224 

379 

MYLHMPLPPGSTQXXXXXXXXXXXXXXXXHADLTRVGKI I INRMING 
LGVRV 

1905 

A 

1249 

1642 

LGCYPGPFIHPVKWMIFPDPCEISVTGVCVC/G/GVCSCGVCG/CG/ . 
CGCLPGGKGICKYI * ICSQIL 

1906 

A 

415 

656 

SLPRSPLGRGTPSPQHLSSNLNLASLYHPHEITPWIIFSSSGSSI*T 
PI/TPSFYPSPTNCDPQIFDPQTPVSGCRLASSQGPSLSNPTSNLSG 
PQIPFASYPCLLLAPHPSLPASRPQSCPSPKTWAPPS 

1907 

A 

1 

419 

TISWNTGPRARSRARGSSTGLDGCVGGGSGGNSGLPCPDIiEPLGGLQ 
S KCRLCAPTEARGLWS / KVPLFRQVRHLALHACGCREAWPPPGPPPL 
LVALCFHLKALPSRGSRAGREAVSKHLKFAMLAGGRVCGSRRVLSM 

1908 

A 

1 

438 

QCTPSSA7U3CPELTACYGFSS*PS*GPSPLPWRPRMCESWVLHLPPT 

SPHRSGTGFDRAAGGRGRRRCNRSEEGVIPFAAPRLLPPHAFWRVFP 
HWETT 

1909 

A 

113 

704 

MAASGAGAEVSGRYGREPPPALPPAPCGPRRRRSPPYPAKTYRFFSL 
WQRGRPDRSSAPIGNSGLPCPDLEPLGGLQSKCRLCAPTEARGLWKK 
VPLFRQVRHUVLHACGCREGLGLPQGPPPLIiVLCGFHLKALPTRGSR 
AGREAVIKLPLSWPQDMYRKSHAAHLREMSSKAAAQLSW10NAYFGS 
SKGLSCVWP 

1910 

A. 

317 

4 7 0 

NYPMSWPXODVMWRKJSHTUV/HTTjRKM^cJTCV taaot.*wtjnayfgssk 

Vt X tr 1 1 k-> V v xr \\^JL/ \ 1 innivOXlnn/ n X UlvurlO O I\ \ Innyu rtl» lr4 r\ i .r VJOOIV 

GLSCVWP 

1911 

A 

147 

850 

MAASGAGAEVSGR*GREPPPAiPPAP\CGP\RRRRSPP*LPKTYRFF 
SLWQRGRPDRSSAPI\GNSGLPCPDLEPLGRAARSKCRIiCAP\TEAT 
KACWK*GPSCFRQV\RH\LALHACGCR/EG/LGLPQGPPTSAGLLCA 
FTLKALPSRGSPCKEEKliSASTSNFSHAGCGRVCGSRRVI,SML*FSG 
LKLP\LSVVPQDM*RK\SHAAHLR\EMSSKACRSAF1X?NNALFLEVS 
KGT.SCVWP 

1912 

A 

119 

1001 

G SRTKGRAVPECACAPVGAGEGRPAGVAVSDGVI KVFNDMKVRKS ST 
\PEEVKKR10CAVLFLP*VRTKlWIIL\EEGKEIIiVG\DVGQT\ VX>DP 
YATF/VSKI^PDKDCRY\ALY\DATY\ETKE\SKKEDIi\VFIFLGPP 
ESAPPLEQNGFMPSSQGRPSKKGSWTGDSSHEFAKPNCLPKEGQGTA 
CTLAREAGGQVPVYS PGRAKPFVS PFWPPCLGASGS PQHLAPWGFAG 
CPPFLQRPGRAWGGSPAGGGEIPLNPSCPKQTPPNPPGNFPSPSNPIi 
DGFWPFPKLLFESFDSSWG 

1913 

A 

361 

1785 

LPLPRWKVLLPRDILGPRKINEVSSSDDKDAFLCEQTSGDILKKTSE 
WKSS PIiGVTPLFMQSNVINSTAI VKTLAATGTGFDCAS KTDIQLCA 

e srgvpperi i ygksfvkqvsq i kyaanngvqm\ mtf\ d s evelmkv 
\argtfpkakvwlri;a\tddskavcrlsvkpgatlrtsrlllerak 
eln idwgvs fhvrsgctdpetfvqai sdarcvfdmgaevgfsmyll 
digggfpgse\dvk\lkfeeitrp*stqpldkyfp\snswn*ks*lb 
pg \ r y yvas tfd alql i s fakki c i krnqtgs * * pki es elsrplmy 
\yvn\dgvygsfncilydhahvkpllqk\rpkpr*rryysssiwgpt 
cdgi^riv^rcdlXpeXmhvgdwXmlfeiwgXaytvXaaasnvpmlp 
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RGPTIYYVMSG\PAWQ\LIQQFQNP\GFPPGSRRQPGCPAPCPVFCA 
W\ ESGMKRTRASPVPSG 

1914 

A 

501 

746 

ARS S LVLL F I Y I FRDRVLLCH SGWS AVVQS WFTAAL I SQA* VI LK* F 
SHLSLPSSWDYRQVSPHPANF/ syilfcrdr/sftmlprvgwnswaq 
VLLLI QPPKVLQL * AGCHGS CL 

1915 

A 

156 

166 

VLFLKRPYLVQDAVLWLFT\ AI I SHSTLELLGSSYPTTSAS * VKHSN 
SN I MKFKVPVLNECTMQLGTKTM 1 1 KVLVKS Y F 

1916 

A 

452 

1017 

SLHGISRPHLPTGRLLGPETCAGFSRFGQNESLTPFVTSDR\SKNR\ 
KRHFKAPFPHFEGKIMSSPLSKE\LRQDVQLCGSDARS*KD\DEVQV 
V\ RGHL * GS AKLAKWQ\ VYRKKYVI Y I ER\ VQREKA\NGTTV\ HVG 
HFTPSKQWIT*G*NWDQRPAKRSLRTGKPKSR\QVGKGKRGKYKER 
T\IEKMQE 

1917 

A 

3 

474 

PLHSHARSHFLFAIMNRPAPVEISYENMRFLITHNPTNATLNKFTEE 
LKKYGAATLVRVCDATYDKAPVEKEGIHVLDWPFDDGAPPPNQIVDD 
WLNLLKTKFREEPGCCVAVHCVAGLGRKRRGAFNSKQLLYLEKYRPK 
MRLRFRDTNGHCCVQ 

1918 

A 

1 

246 

MNRPAPVEI S Y ENMRFL I THN PTNATLNKFTEELKKYGVTTLVRVCD 
ATYDKAPVEKEGIHVLKKKGSVQFQTAALFGEIPT 

1919 

A 

373 

560 

S QL KNAKFNL S R PLPPNVY * DKGQYVTS QL YQNAAQP I FKQAFAETC 
AHHTNTIQDQAPRRI 

1920 

A 

837 

1441 

IFFFHIiSPSHSHARSHFLFAIMNRPAPVEISY^\DMRFLITHNPTNA 
TLNKFTEGT*GSMGVTDFGFGVCGVn J HMDK\APVWKKEGFHVIi\DW\ 
PF\DDGSSTPLIQIVGWIGFKPV*KTKFSCKSHGCCV\AVHCVGRVG 
EGAPVL/ VLALALDWNVGMK\ YEDAVQFIRQKRRGAFNS KQLLYLEK 
YRPKMRLRFRDTNGHCCVQ 


A 

Z 


HR VR F F FAGN PG S D S FGG LL LG LT P VLR WVADGGT I PKRHE\ LVKG P 
KKVEKVDK\ ETELVAQWNYCTLS \QEILRRPI VACELGRLYNK\ EPV 
I E FLLDKS AEKALGKG S I SH * NALTNC * QS * KLSDNPCPGKG I KGNT 
KGDK\HDDLQAGASFl/CP/LVGGPGRWNGRHRFLLPSGGCGIiCCFS 
*AEPWKEIKAEFCHTCGAGLSRRMMIIVIxNGTKEDVDVLKTRME/AE 
KAVERS FKRIS KKPKAAES VS KTQMSVEG S PRAHQKLRPGKP * RS PA 
IJDSR^KKTNLAPKSTAMNESSSGKAGKASVWSHKEVHR* PVKNRKPN 
KSLFTTHSFRQSAPKEGVCPTGVHPTRPTCF 

1922 

A 

272 

468 

GEIEFASCSQASLLLSFVWMPALLPVASRIiLIiLPRVLLTMASGSPPT 
QPSPASDSGSGYVPGSVS 

1923 

B 

131 

268 

XFVTCPNEKVAKEIARAWEKRLAAC^b^ 
* 

1924 

A 

2 

353 

GSPPT\QPSPASDSGSGYVPGSVSAAFVTCPNEKVAKEIARAVVEKR 
LAACVNLI PQITS I YEWKGKIEEDSEVLM\ SVHPYEVAEVIALPVEH 
GNFPYLQWVRQVTESVSDSITVLP 

1925 

A 

70 

357 

DQLSCTSVCRMSGGRAPAVLLGGVASLLLSFVWMPALLPVASRLLLL 
PRVLLTMASGSPPTQPSPASDSGSGYVPGSVSAAFVTCPNEKVAKEI 
AR 

1926 

B 

120 

83 9 

MSGGRAPAVX.LGGWSDRPRPAPSGPRSLDRYPHPKSQRRFAKEIAR 
AVVEKRLAACS/NLIPQITSIYEWKGKIEEDSETVn^MMIKTQSSLVPAIj 
TDFVRSVH P YEVAE * 

1927 

A 

259 

935 

GASLLLSFVWMPALLPVGLPAFLFANPESLLTMG/SIiGSPSDPSPRP 
ASDSGSG\YVPGSVSA\AFVT\CPNEKGSPRBIAR\AVGGRRRLA\A 
CVQPS S PQITI PSMKWK\ GKDSREDS * GCWMDGFKTQKFPWVPS FWT 
DFV\RSVAPYEV\AEVl\ALPVEQG\NFPYTiQWVRQVTESVSDSITI 
LAMMSPVPAHHEDPRDTSKAFLTFQVMTWAPNKSRLWVKKK^ 

1928 

A 

285 

476 

LLKHLINNMMVSKTTWLGVLAHTCNPS/TNFLGGRGGRIS*GQEFEA 
SLGNMGRPCLYKNRQKTN 

1929 

A 

218 

602 

NGGQAVAHACNPSTIXX^WRVDHLRSGVRDQPGQRGETPSLLKIQKIr 
AGRGGARLWSQLLRRLRQENRLNLGGGGCSEPRWH\HCIPAWGNKKE 
*NGNYAQRMGERWLTGLKHQRNGEDRTVRELSGR j 
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1930 

A 

1 

628 

FFYKSVLAQKEVRCVLDE /IS PLGPPELKPVRMFADL / LPHESRRDS 
SVPELAEKMSRSVDVPTPPSLLMAASIYLHDQNPDAALRJUJIQGDSIj 
ECTAMTMQDAYY I FQEMA0KCS PTLLLLNGQAACHMAQGRWEAAEGL 
LQEALDKDSGYPETLVNLIVLSQHLGKPPEVTNRYLSQLKDAHRSHP 
FI KEYQAKENDFDRLVLQYAPS A 

1931 

A 

1 

354 

TFLLMAAS I YLHDKNPDAALRALHQGDSLECT/ ARKELKRMQDLDED 
ATLTQLATAWVSLATDSGYPETLVNLIVLSQHIiGKPPEVTNRYLSQL 
KDAHRSHPFIKEYQA\RLVLQYAPSA 

1932 

A 

29 

1061 

EDSEMAPPAPGPA\ SGGSGEV\DELFDVKNA\ FYIGSYQQCINEAQR 
\ VKLS S PERDV\ ERDVFL YRAVPG AEES SGVALDEDQSPP WTLDFQG 
RAQLFADYXLAHENXRRDKHRWPSLDREXMSRSVGR^PKPTFIiXLMA 
PF I * FQEPEPR I GPLRGLHQGN\ SLE FTPMTVEVIjAESGTALNFAGK 
ELKRMQDLDE\DATLTQ\ LATAWVSLATG\ GEKLQ/ EMP YYIFQGDG 
CTSASPQPLLLAQLGKAA\CHNGPRARW\EPAEGLIjQ\EAIi\DKDSG 

\ypetl\vnl,irpvpsnlgkppe\vtn\rylsq\lkdah\rshpfik 
evprpkendf\drlvlqyasqapeagpelsgp 

1933 

A 

2 

355 

TSMLGCTVLFLR/YC^S\YCNVLATVWSSLV*RSLRICIiHLVS/WS 

fvtdckacyijtgmlfysdy**fvyyf*yycfflcslffsicllmyfn 
ifff/cnfmfdcyiliilsfyfiilyhyf 

1934 

A 

194 

9967 

kmwptrrlvtikrsgvdgphfplslstclfgrgiecdiriqlpvvsk 
qhckieiheqeailhnfsstnptqvngsvidepvrlkhgdvitiidr 
sfryenes\lqngrkstefprkir\drrparrvsrssfssdpdekaq 
DS kayskitegkvsgnpqvh I knvkeds tadds kds vaqgttnvhss 

EHAGRNGRNAADP I SGDFKE I S S VKLVS R YGELKS VPTTQCLDNSKK 

NESPFWKLYESVKKELDVKSQKENVLQYCRKSGLQTDYATEKESADG 

LQGETQLLVSRKSRPKSGGSGHAVAEPASPEQELDQNKGKGRDVESV 

QTPS KAVGAS FPL YE PAKMKTP VQYSQQQNS PQKHKNKDLYTTGRRE 

SVNLGKSEGFKAGDKTLTPRKLSTRNRTPAKVEDAADSATKPENLSS 

KTRG S I PTDVE VLPTETE I HNE P FLTL WLTQ VERKI QKD SLS KPE KL 

GTTAGQMCSGLPGLSSVDINNFGDSINESEGIPLKRRRVSFGGHLRP 

ELFDENLPPNTPLKRGEAPTKRKSLVMHTPPVLKKIIKEQPQPSGKQ 

ESGSEIHVEVKAQSLVISPPAPSPRKTPVASDQRRRSCKTAPASSSK 

SQTE VPKRGGERVATCLQKRVS I SRS QHD I LQM ICS KRRSGAS EANL 

IVAKSWADWKLGAKQTQTKVIKHGPQRSMNKRQRRPATPKKPVGEV 

HSQFSTGHANSPCTIIIGKAHTEKVHVPARPYRVLNNFISNQKMDFK 

EDLSGI AEMFKTPVKEQPQLTSTCH I AI SNS ENLLGKQFQGTDSGEE 

PLLPTSESFGGNVFFSAQNAAKQPSDKCSASPPLRRQCIRENGNVAK 

TPRNTYKMTSLETKTSDTETEPSKTVSTV^TRSGRSTEFRNIQKIiPVE 

SKSEETNTEIVECILKRGQKATLLQQRREGEMKEIERPFETYKENIE 

LKEaTOEKMKAMKRSRTWGQKCAPMSDLTDLKSLPDTELMKDTARGQN 

LLQTQDHAKAPKSEKGKITKMPCQSLQPEPINTPTHTKQQLKASLGK 

VGVKEELLAVGKFTRTSGETTHTHREPAGDGKSIRTFKESPKQILDP 

AARVTGMKKWPRTPKEEAQSLEDLAGFKELFQTPGPSEESMTOEKTT 

KIACKSPPPESVDTPTSTKQWPKRSLRKADVEEEFLALRKLTPSAGK 

AMLTPKPAGGDEKDIKAFMGTPVQKLDLAGTLPGSKRQLQTPKEKAQ 

ALEDLAGFKELFQTPGHTEELVAAGKTTKIPCDSPQSDPVDTPTSTK 

QRPKRS IRKADVEGELLACRNLMPSAGKAMHTPKPS VGEEKDI 1 1 FV 

GTPVQKLDLTENLTGSKRRPQTPKEEAQALEDLTGFKELFQTPGHTE 

EAVAAGKTTKMPCESSPPESADTPTSTRRQPKTPLEKRDVQKELSAI* 

KKLTQTSGETTHTDKVPGGEDKSINAFRETTUCQKfjDPAASVTGSKRH 

PKTKEKAQPLEDLAGWKELFQTPVCTDKPTTHEKTTKIACRSQPDPV 

DTPTSSKPQSKRSLRKVDVEEEFFALRKRTPSAGKAMHTPKPAVSGE 

KNIYAFMGTPVQKLDLTENLTGSKRRLQTPKEKAQALEDLAGFKELF 

QTRGHTEESMTNDKTAKVACKSSQPDLDKNPASSKRRLKTSLGKVGV 

KEELLAVGKLTQTSGETTHTHTEPTGDGKSMKAFMESPKQILDSAAS 

LTGSKRQLRTPKGKSEVPEDLAGFIELFQTPSHTKESMTNEKTTKVS 
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YRASQPDLVDTPTSSKPQPKRSLRKADTEEEFLAFRKQTPSAGKAMH 

TPKPAVGEEKDINTFLGTPVQKLDQPGNLPGSNRRLQTRKEKAQALE 

ELTGFRELFQTPCTDNPTADEKTTKKILCKSPQSDPADTPTNTKQRP 

KRSLKK7VDVEEEFLAFRKLTPSAGKAMHTPKAAVGEEKDINTFVGTP 

VEKLDLLGNLPGSKRRPQTPKEKAKAIiEDLAGFKELFQTPGHTEESM 

TDDKI TEVS CKS PQPD PVKTPTS S KQRLKI S LGKVG VKEEVLPVGKL 

TQTSGKTTQTHRETAGDGKSIKAFKESAKQMLDPANYGTGMERWPRT 

PKEEAQSLEDLAGFKELFQTPDHTEESTTDDKTTKIACKSPPPESMD 

TPTSTRRRPKTPLGKRDIVEELSALKQLTQTTHTDKVPGDBDKGINV 

FRETAKQKLDPAASVTGSKRQPRTPKGKAQPLEDLAGLKELFQTPVC 

TDKPTTHEKTTKI ACRS PQPDPVGTPT I FKPQS KR S LRKAD VEEES L 

ALRKRTPSVGKAMDTPKPAGGDEKDMKAFMGTPVQKIiDLPGNLPGSK 

RWPQTPKEKAQALEDLAGFKELFQTPGTDKPTTDEKTTKIACKSPQP 

DPVDTPASTKQRPKRNLRKADVEEEFLALRKRTPSAGKAMDTPKPAV 

SDEKNINTFVETPVQKLDLLGNLPGSKRQPQTPKEKAEAIjEDLVGFK 

ELFQTPGHTEESMTDDKITEVSCKS PQPES FKTSRS S KQRLKI PLVK 

VDMKEEPLAVSKLTRTSGETTQTHTEP*TGDSKSIKAFKESPKQII*DP 

AASVTGSRRQLRTRKEKARALEDLVDFKELFSAPGHTEESMTIDKNT 

KIPCKSPPPELTDTATSTKRCPKTRPRKEVKEELSAVERLTQTSGQS 

THTHKEPASGDEGIKVLKQRAKKKPNPVEEEPSRRRPRAPKEKAQPL 

EDLAGFTELSETSGHTQESLTAGKATKIPCESPPLEWDTTASTKRH 

LRTRVQKVQVKEEPSAVKFTQTSGETTDADKEPAGEDKG I KALKES A 

KQTPAPAASVTGSRRRPRAPRESAQAIEDLAGFKDPAAGHTEESMTD 

DKTTKIPCKSSPELEDTATSSKRRPRTRAQKVEVKEELLAVGKLTQT 

SGETTHTDKEPVGEGKGTKAFKQPAKRNVDAEDVIGSRRQPRAPKEK 

AQPLEDLASFQELSQTPGHTEELANGAADSFTSAPKQTPDSGKPLKI 

SRRVLRAPKVEPVGDWSTRDPVKSQSKSNTSLPPLPFKRGGGKDGS 

VTGTKRLRCMPAPEEIVEELPASKKQRVAPRARGKSSEPWIMKRSL 

RTSAKR I EPAEELNSNDMKTNKEEHKLQD S VPENKG I S LR SRRQDKT 

EAEQQ ITEVFVLAER I E INRNEKKPM KTS PEMD I QNPDDGARKP I PR 

DKVTENKRCLRSARQNESSQPKVAEESGGQKSAKVLMQNQKGKGEAG 

NSDSMCLRSRKTKSQPAASTLESKSVQRVTRSVKRCAENPKKAEDNV 

CVKKITTRSHRDSEDI 

X935 

A 

194 

1026 

KMWPTRRLVTIKRSGVDGPHFPLSLSTCLFGRGIECDIRIQLPWSK 
QHCKIEIHEQEAILHNFSSTNPTQVNGSVIDEPVRLKHGDVITIIDR 
SFRYENESLQNGRKSTEFPRKIREQEPARRVSRSSFSSDP/G*E*GN 
TFEKKACVLWWAPKT * TI * *KLAS * YASQKGRSPNQKKVSGNAHSTC 
PEENHQGTASTIRKTRVRFRNPCGSEGTKLGYKPSSS *S * ENSSCQ* 
STP*VIiQNSPCFQQQISDKGS*ERRR\KSGNLPSKESVYQPKST*YF 
TDDMFQKKKWCFGSKSDCCKIMGRCSKTWCKTNTN* SHKTWSS KVNE 
QKAKKTC YS KEACGRS SQS I * YRPRKLS L YHNNRES S Y * K\ STCACS 
TLQSAQQLHFQPKNGL*GRSFRNS*NVQDPSEGATAVDKHMSHRYFK 
FREFAWKTVSRN*FRRRTSAPHLREFWRKCVXjQCTECSKTAI * *MLC 
KPSLKTAVY*RKWKRSKNAQEHLQNDFSGDKNFRY*DRAFKNSIHCK 
QVRKVYRVQEYTEATCGK* E * RNKYRNC * VHPKKRSEGNTTTTKERR 
RDEGNIlKTF*DI*GKY*IKRKR*iaDESNEEIKNIjGAEMCTNV*PDRP 
QELA*YRTHERHGTWPESPPNPRSCQGTKE*ERQNH*NALPVITTRT 
NKHPNTHKTTVEGI PGESRCERRAPSSRQVHTDVRGDHAHAQRAS RR 
WQEHQNV*GVSKADPGPSSPCNWNEEVAKNA*GRGPVTRRPGWLQRA 
LPDTRSL*GIND* *ENYQNSLQISTTRISGHSNKHKAMA* EKSQESR 
CRGRILSTQETNTISRBSHAYAQTSRR* * ERH* S I YGNSS AETGPGR 
NFTWQQKTATDS*GKGt>GSRRPGWIi*RALPDSWSHRGISGCW*NH*N 
TLRLSTVRPSGHPNKHKATTQEKYQESRCRGRTLSVQESNAISRQSH 
AHA* TISR*RERHHHICGNSSAETGPDRELN\ RQQETATNS * GRGPG 
SGRPDWIi*RALPDPWSY*RSSGCWQNY*NAIiRIFSTRISRHPNKHKK 
AAQDTFGEKGRTEG ALS PEEAHTD I RGNHTHR * S TRR * G * KHQRV * G 
N\ CKTET\GPS SKCNW\ EQEAPKN* GKGPTPRRPG WLERAX.PDTSMH 
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♦QAHDSRENYQNSLQITTRPSGHTNKLQATVQEKSQESGRRRRILRT 
QETNTISRQSHAHTQTSSKW^EKHLRIYGNSSAETGPDRELNWQQBT 
ATNS*GKGPGSRRPGWIi*RAIiPDTRSH*GIND*R*NCQSSLQIFTTR 
PRQKP S KLQATAQD I PGESGRERRAP S S WQAHTD I RRD YTHTHRANR 
RW * EHE S I YGVS KADLRL S S KSNWQQEAAENS * G KV* S P * RPGRLHR 
AIiPDTKSH*GIND**KNYQSILQSFTARPSGHPNKLQATAQEKSQES 
RH * RR I FS I * ETNAI SRQSHAHTQTS SR * RERHQHVFGNS S AETGPA 
RKFTWQQ*TATNS*GKGPGSRRTDWLQRAFPDTMH**PHS**ENYQK 
NTLQISAIRPSGHPNKHKATAQEKPQESRRRGRI FS IQETNTI SRQS 
HAHA*SSSR*RERHQHICGDSSGETGPARKFTWQQETATNS*RKGQG 
SRRSGWLQRALPDTRSH*GINDR*QNHRSILQISTTRPSQNPNKLQA 
TTQD I LGE S RCERRG PTS RQAHTDVREDHTDTQRDS RRW KEHQS V* G 
ICKADAGPSKLWNWDGEVAKNT*GRGPITRRPGRIiQRALPDTRPH*G 
INN* * QNYQNSLQ I STTR INGHSNKHKEAAQNTFGEKG YSGRALS PE 
AAHTDHTHRQSTRR * G * RHQRVQGNCKTETGPS S KCNW * QEAAKNS * 
GKSPTPRRLGWLERALPDTSMH*QAHDSRENYQNSLQISTTRPSGYP 
NNLQATVQEKSQESRRRGRILSTQETNTISRESYGHTQTSRR**ERH 
ES I YGNSSAEIGPARKFTWQQKMATNS *GKGPGSRRPGWLQRALPDT 
RH*QAHD**ENYQNSLQISTTRPSGHPSKHKATAQEKPQBSRRRGRI 
FSTQETNTI SRQSHGHPKTSS K* * EKYQH ICGNS S AETGPARKFTWQ 
QETATDS *GKG * GSRGPGWLQRTLPDTRSH *G IND * *QNHRSIL* IS 
TARVIQNLKKLQAKAQDTPGESGHERR\SLTSQQAHTDIRGDYANTH 
RANRR* *EHQSV*GVSKADPGPSSKCNW*QEAAENS*GKGPCSRRPG 
* LQRALLSTRSH * RVNDY * QKHKNS LQIS PTRTNRHCHEHKEMPQDT 
SQERSKRGALSS *EAHANIRAKHTHTQRTS\KR* * GHQS I PATCKEE 
TKPSRKGTQQEKAKST*GKGPTPGRPGRIiHRAL*NIRSHSGITDCWQ 
SH * NTLR I S P \ TR SGRHHS KPKE AS QDTC AEGTS KRRAFS SQVHTN I 
RGNHGCRQRTSR * / EIKGHQS I EG I CKTDTG\ S S S \ QCNWQQ\ KTAK 
ST\QGKCPSHRRPSWLQRPSSRSH*RIND**QNH\*NTLQIITRT\R 
RHRNKLKETAQDTCPESRSEGGAVSSWQAHTNLRGDHAHRQRAGR*G 
Q\ KHES I * ATCKAEAGRRRCNWQQETAKST *GKGPTPGRSGQLPRAL 
SNTRPH*GTGKWCC**LYKRSKANT*QWKTSKNIQKSSSGP*SRTRG 
RRGKHQRPCKITKQKQHFPAPTALQEGRWQRWKRHGNL/KQAALiHAS 
TRGNCGGAAN\QQEA\RGCSQGKRQIIRTRGHHEEKFEDFCKKN*TC 
GRAEQQRHENQQRGTQ I TRL S P * K * GN I P ALQ\ TPK* D * GRTANN\ * 
RT.PT QP KNPNTCOK* "fCEAP * PTQ pAP UTT PPFPHDRQ PnKTPHPIjETKV 
H * E \ QKVLEVL/ A* TE * EL« \ PA\ QGGRGERRAEECEGSHAESERERR 
SRKFRLHVPE I KKDKKPACS KHFGEQI CAE SNAECQE VCRKSKEG * G 
QCVCQENKNQKS*GQ*RYENESLQNGRKSTEFPRKIREQEPARRVSR 
SSFSSDPDESEGIPLKRRRVSFGGHLRPELFDENLPPNTPLKRGEAP 
TKRKSLVMHTPPVLKKI I KEQPQPSGKQESGSEIHVEVKAQSLVISP 
PT^PSPRKTPVASDQRRRSCKTAPASSSKSQTKVIjKRGGEKSGNLPSK 
ESVYQPKST 

1936 

A 

139 

784 

GLHHGCSLGMEEAAGRGDRSRSRCRAPQHHRPPPPLSCQQPRIiliGEA 
GRGGVRGKHGSL* KQAPPPRGRAETPGIANHTLPPRVPP / SEGQQHP 
REGQGLHGGPGEKGKPHRRKLKAS VPCV SAERVNGPKGS SLQTARIH 
PTGGHRKPTGAVCVCACAAHTSAARGPLRPHHTACPAHVCTRRCRRE 
HTPPSLCTRVPLSGPGGSSLLHWLSRA 

1937 

A 

i 

1878 

MVSQAHCATLHAPTPAAATLTGGRRSLGTAGRAPPGLGTRDHAPAPG 
RRVSGGEGSKfcAAAAALAAVAAAPGPVRRCSSQSCFSSSGSSHYSAR 
TS PVRVRPRRS LS SRS AAGNRAEATE S AMEKTLETVPLERKKREKEQ 
FRKLF I GGLS FETTEES LRNYYEQWGKLTDC\A7MRDPAS KRSRGFGF 
VTFS SMAEVDAAMAARPHS IDGRVVEPKPJVVAREESGKPGAHVTVKK 
LFVGGIKEDTEEHHLiRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTF 
DDHDPVDKIVLQKYHTINGHNAEVRKAIiSRQEMQEVQSSRSGRGDGY 
GSGRGFGDGYNGYGGGPGGGNFGGSPGYGGGRGGYGGGGPGYGNQGG 
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G YGGG YDNYGGGNYG S GNYND FGNYNOO P SNYQPMKSGNPCSn S PNMG 
G P YGGAEAS QVI I NKAFLG S FSGVGS KMQKMGKKQNGKS KKVEEAEP 
EEFVVEKVLDRRVVNGKVEYFLKWKGFTDADNTWEPEENLiDCPELIE 
AFLNSQKAGKEKDGTKRKSLSDSESDDSKSKKKRDAADKPRGFARGL 
DPERIIGATDSSGELMFLMKWKDSDEADLVIiAKEANMKCPQIVIAFY 
EERLTWHSCPEDEAQ 

1938 

A 

345 

558 

LYMLIRMRLKEGRAKMVESIFR* * FILE*SVLS/RIMKPGMYP\TLNR 
WVKCGNSSSVSYPEEKWGWLLKFI 

1939 

A 

345 

511 

ARDATFVNGLDYL I TLPYCGWKDCKKKCPLRQFP * PFNCCFFL WFV 
RV*KHSLP 

1940 

A 

827 

2660 

CCHCFQDLIS INGEVVEQKRAVAREESGKPGAHVTVKKLFVGGIKKN 
TEEHHLTDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDPVDKI 
VLQKYHTINGHNAEVRKALSRQEMQEVQSSRSGRGDGYGSGRGFGDG 
YNGYGGGPGGGNFGGSPGYGGGRGGYGGGGPGYGNQGGGYGGGYDNY 
GGGNYGSGNYKDFGNYNQQPSNYGPMKSGNFGGSRNMGGPYGGGIWK 
NTS ITERKKSRKLDLIQS KKGSRTKEAPQPPVASLCMHLGHWSRLMV 
S PGAQLTGKNSHGLS VS S VRKSNVGPRRLCAAMKATGPDNAQSQVS P 
PGHAPSAEDPTGSRTVSSPCTDRPHPFLSWPTWISIJ^LIjKTDGALE 
RMPQQLPSLHPSQGTQSIHPDPSSTSSFLLPFQPPTLKRAAFPCPPS 
IVNPAVWDTSTPSVAEHHTPIRITLKEPTQFLSQKQYPIPQAALVGL 
QPI I SHLLASHLLRPTDS PFNTPILPVKKPNGTYRLVQDLRIilNQAV 
LPRHAGLGRDLGSRRAARGPASASVAARGRLSRGCGDRPVNVAAAGQ 
KMGKK<JNGKSKKVEElAEPEEFVVEKVIiDRRVVNGKVEYFLiKWKGFTE 

1941 

A 

478 

1150 

* 

SMPWQIGRSSVSAPPTITPTSSTASWTIVSSTIWSPHVPATTKVSTL 
HWTVVRRLLVIISKIIVISTSIISSSIVVITTSVAPTLVAISRSSTT 
ISSSSSSI TGATS KI ATS RS S S SAGS RAEVLLAELFIjEQRQFS LQRQ 
DESGSCSSAEISLISLGCGKSG*SD*VRDGERKRNSSVSSLLVA*AL 
KPQKKV*GTTTNNGESLQTVW* * G I LQ AKDQEDLiVI* 

1942 

A 

123 

734 

LFKSAIKNGLQHELHCRKWEKKQNG\KSKKVQ\EAE\PVESGVEKVIi 
DRRV\ VNGKVE\ YFLKWKG FTDADNTWEPE \ ENL\ DCPELD * SRFLN 
FSRKAGQRKRWVPKRKSLSD\SESDDSQIHRRKRDAADQPKEDFARG 
\ LD P * KK* LGAHRPASGE \ LMFL \ MKWKDS \ DEADLVL\ AKEANMKC 
P\QIVIAFYEEKPTW\HSCP\EDEAQ 

1943 

A 

1092 

1286 

IDVCVCLALLLRLECSSVISAHCSLCS/SGSSDPPTSAS*VAGTTTA 
CHHAQLIFGFFFFLKRWGF 

1944 

A 

76 

533 

LPRPRSRLTALPPPPSFLQTPKSRALMAGLEVLFASAAPAITCRQDA 
LVCFLHWEVVTHGYCGLGVGDQPGPNDKKSELIjPAGWNNNKDLYVLR 
YEYKDGSRKLLVKAITVESSMILNV\ RTYKNSEELRSRIVSGIITPI 
HEQWEKANVSSP 

1945 

A 

109 

1008 

AIiPPPPSFLHTPKSRALMAGLEVLFASAAPAITCRQDALVCFL\HWE 
WTHGYFGLGVGDQPG\ PNDKKSELLPAGW \ NNNKDIiYVLRYB Y * GW 
GPESFLVESHSPWESSIDSSMLLGIMGSQQSWQI*PLNLG*IiFSMAE 
HLGDFHRTYKNSEELRSRIGA\GIL\TPIHEQWEKANVSSPHREFPP 
ATAREVDPLRIPPHHPHTSRQPP\WCDPLGPFWGGEDIiDPFGPRRV 
G\MNV\DPLRSGLP\RAFNDP\SSGLPNRL\PPG\AVPQGAGFDPFG 
PIG\TSPPG\PNP\DHLPPPG\YDDMYL 

1946 

A 

152 

991 

RKTKCVTRPAWFQSPIjTSRSSRASACEVAFPRGQPRKGPKRDNWIL 
GTRPS WVAVCSS PRLGLSR\ E YKLVMLGAGGVGKS AMTMQFI SHRFP 
ED«DPTIEDAYKIRIRIDDEPANL\DILDTAGQAEFTAMR\DQYMRA 
GEGFIIC*LLSRIRRSF\HEVPESLNQLIY\RVRRT\DDTPWIi/VW 
GNKSDLQTA*DRFTKGRKGLALAPENSSCPLFWRTSGCHTR\YYIDG 
CFPHAP\VREI\RRKEKEAV\LA\MEKKS*APKTSVWKEAKNHPFRK 
KKDSVT 

1947 

A 

305 

406 

VTLNKQSNLLSSHFLSCNSWIKSPRSGKNQETS 

1948 

A 

372 

501 

RPGAVAH S CNS \ S TLGGRGRW IT* GLEFETCLANMVKLCL FHLY 

1949 

A 

441 

812 

ITTHLYISKPLLCTPMKTYNYYLS IAKIKF* FSLLRQGLA1.SPRLEC 


\W\\ 


:li„ O O El: 1+3 „ O 2 El! 6 0 K 






TSTI TAHCS LNLPGFKOS SHS OPSE * IX5TTDTHHH I OLVFTj I L / AET 
EFCHVAQGGL/NIS * VQLIHLPQTSKVIiGLQM 

1950 

A 

2 

370 

RVAHASGAS QLPDYS I S PPSL PPR I S FHPS PTLARVAMAEPS AATQS 
HSISSSSFGAEPSAPGGGGSPGACPALGTKSCSSSCAGTDLGNSSDY 
KCHTDQLEESLLIGAVLKGLLVFGIiHSI 

1951 

B 

209 

524 

MLLSLAAFSVISWSYLIIiALLSVTISFRIYKSVIQAVQKSEEGHPF 
KAYLDVDITLSSEAFHWYMNA7U4VHINRALKLIIRLFLVEDLVDSLK 
LAVFMWLMTYVX * 

1952 

A 

3 

465 

GRIPCRRFRLGAGGVKGLERASCRIILFPLPLSRPVSLFTLLPPFAR 
VAMAEPSEATQSHSISSSSFGAEPSAPGGGGSPGACPALGTKSCSSS 
CADS FVS S S S SQPVSLFSTSQVHDL I FWRDVKKTGFVFGTTL»IMLLS 
LAAFRVI S VGS Y 

1953 

A 

49 

1129 

RDLIEFSCRIILFPLPSLPPP TfiFHP^PTT.aRVaMAPPQaATn^H^T 
SSSSFGAEPSAPGGGG\SPGSLPRPWGPKSCSSS\CAVHDLIFWRDV 
KKTGFVFGTTLIMLLSLA\AFSVISV\VSYLILAIj\LSVTISFR\IY 
KFVIQAVQKSEB\GHPFQKPNWNVD1TLSSKSFS INNMNAAILHINM 
FLKL 1 1 RLFLVEDLVDSLKlaAVFMWljMTYVGAVFNG I TTjIi ILAELF I 
FS VP IVL * RKYKTQ ID\ HYVGI ARDQTKS I VEKI PSKTPLG IAKKKG 
RIKYWETRNATSYLKHIILISYNVVTCTMKEI^QCQLEPAFQAFFLI^ 
CFLPSFPFNPQSSSTKIDGLIKDLFLDLRRRKNQIS 

1954 

A 

46 

519 

SQTPMGHFTEED \ KAT I \ TSLWGK\ VNVE\DAGGET\ LGRIi\ LWYP 
MGPQRFL^PALGNLSSASAIHGQPPKSRAHGQEGC^RSLGXDAIKAP 
GIDLORAPFAOA* SELiATjVDKT.AMWnQT.PTJ'PKZi QWf3TCP*r , T.VTJD"FT.ZV T 
PFSAKEFHPLRCQVFLGQKDG 

1955 

A 

1 

747 

MNSLLGLIIFJvTFIEHQPYTTNDVLl^ 

AGP PNGAQATRYLNPGTRLQRRRS PAIiRARRAVLTAPRTLVS S LFS P 
RPSLGIUVGTLAFPVPHGAAPTPDGECARGPIjSEASPSAPQIiPCRFSH 
LLRASPSQPGLLFSSPELTASPASLTULRPVIiSGPPFPVWIPPSPNPS 
L PRGGRS VG KRFRDG ACO VS S D LEW T DTG VTJ fnTK<5 WVT TPR TVL.F B 

jjxi\uui\k> » fvuuri^,\^ V OkJlVliuiv i,U X \J VIM O XV V XVO XI V X -L XvXv X V 1 rr>f% 

I GNR VTTRAAQVAK 

1956 

A 

1 

813 

MKEENLCQAFSDALLCKIEDIDNEDWEl^QLCSDYVKDIYQYLRQLE 
VG\LQSINPHFI^GRDINGRM/RAILVDWLVQVHSKFRLLQETLYMC 
VGIMDRFIiQLSLPAEDREALGTSSPQHSGALGDVGYKSGFIIiSPHPC 
MS KIEPEDEKLS FLF IGPFLKNPSPRANGDPMFLCLNEDEAQQLEET 
KWTGCQKQLCDPLS EEVKTGEKLVQTKGERTSR I RE VQFIiAQNHTTR 
RWQSWDLGTSSLTPEPVFSLEINVREQRDEDNIQVLRG 

1957 

A 

1 

1390 

EATASKIPSAAGSESSPNGASYASVPPFSWVPPWAGLALLPSPSLM 
ALIjRRPTVS SDLEN IDTGVNS KVKSHVT I RRTVLEE I G\ NRVTTRAA 
QVAKESSGTPKFQVQPTKTTiMVNKQIi\ KPTASCQTQYQMGKVWLPKG 
PSPTP\ EDVS\MKGRRST.PKT.E<?Tjx\T.T.\ r , TfTETlTrilsJP T 'nW13 t Mt>\ PT.PC! 

dyvkdiyqylrqlevlqsinphfldgrdingrmx^^ 

S KFRliLOETLY\ MCV\ G IMGSDF* OVOPVSRKKLOIjVG T TALIjIiAPK 
YEKMFSPNIEDF\VYITDNAYPS\SQIRE^TLILKELKFELGRPLP 
LHFLRRAS *AGEVDVEQHTLAKY\LMELTLIDYDIWHYHPF*G* PAA 
AS \ CLS QKVLDKGKMEL * S TO YYHKDTQENE VLEVHAS TMAQECGAK 
♦MENLNLNS IGHQRI KYAKQQTP* KI SMI PQIxNSKAVKDL\ASP\LI 
GRS 

1958 

A 

3 

269 

VGGGTGF I GTAIiTQLLNARGHEVTLVSRKPGPGR I TWDELAASGLP S 
CDAAVNIiAGEN I LNPLRRS ARALKL I PTRAQGGQCPTDENL 

1959 

A 

3 

253 

CRTLVHP PQEAGGRAELGGARRGRGTVCFLRQDELAAS GLPS CDAAV 
OTjAGENILNPLRRSARALKLIPTRAQGGQRPTDENL 

1960 

A 

283 

1208 

TIFHFPIQVAGTGFIG/RQP*PSCLNARGHGITFGFPESPGPAGFTW 

DE1xAASGLPSL\0AAV1mX.\AGENII j NPLRRW1^ 

TQLLG * SHS PKAPQP PK\ PGVLVTGVS FITS PV* LRS ID* RQPREGT 

LTFFSNLLTKW/EKLAAKAFLGDSTRQVGWCRSRGLCLGPLGAGSMG 

HMLLPF\SLGLGGPIGSGHQFFPWIHIGDIxAGI^ 
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NGVAPS SATNAEFAQTLGAALGRRAF \ LPLPS AWQAVFGRQRA IML 
LEGQKVI PRRTLATG YQ YS FPELGAALKE I VA 

1961 

A 

698 

1240 

GLKDSHTAHFCYRPKVSMGRVRKGKQKPGGAVGSFKAEKTPVPSLFP 
v v ijvjiwwwim^iipijjjj v t KIjOLiGGP I GSG\HQFFPWKHIVAIjARN 
LDPGP * SKPRARGPEW/ MLAPSSATNAEFAQTFGAALGPPSL\ LPLP 
SAFGASCLWGDSVPSCLSGGPER*SPRAEHLATWLPSIPSQS 

1962 

A 

494 

1068 

EFIFPSPLVLFPIRIMKMGLLCEVSAAVPAVLFLTDLVGLGPCLKAK 
NVSVFFFFVFVLVMCNLSLITVEQISLTQITREGKGASARNTNLFLF 
v-r V3&vr^vvorvoA.VKiJJ\JjJ>il>Jij VJjr 1J i\A I 1 iJ K I , (J KEVPNYKTj X X PAWS E 
RLKI RGS IxARAALQELLS KGL I KL VS KHRAQV I YTRNTKGGD APAAG 
EDA 

1963 

A 

1 

573 

KYSQRTWFG PVS EQRFLFVPT I LTRLRCLLL I LPELRNAALRTDQE 
v vj i^i^iv^yKi^y " 1 iN^VjvayGQKEGRNKTSIjKETREVVQKAKV 
RDKLNN\ LVLFDKA\ TYG * TLVKGSS PTY * TLLTPAVGLLRRL * R I S 

EAPWGQGPALSR\GFL*LKGFYSKLGFFKRHRSSKLFYTRNYQGWKM 
LPAAG 

1964 

A 

33 

515 

SAWWNSWDK^^P'R'PZ^T.VfST.VTi'KT OAT TPXTUrtDXroT/WTTv-rkT t r\/->x irn 
«-> n j_/rvoo ciDH1jA.u Li VUl\Xiy/vJj J. VjW livjR.VS VEN X KQLJjQCliVP 

GSTTLHS AE I LAE I AR I LRPGGCLFLKE P VETAVDNNS KVKTAS KLC 
SALTLSGIiVEEKELQREPLTPEEVQSVREHLGHESDNLIiFVQITGKK 
PNFEVGSSMQLKLSITKKS 

1965 

A 

1 

503 

GHESDNLLFVQITGKKPNFEVGSSRQLKLSITKKSSPSVKPAVDPAA 
AKLWTLSANDMEDDSMDLIDSDELLDPEDLKKPDPAS\liRA\ASCGE 
GKKRKACKNCTCGLAEEL\ EKEKSREQMSSQPKSA\ CGNCYRGAMPS 
GCASCPYLGMPAFKPGEKVtiLSDSNLHDA 

1966 

A 

29 

1270 

F P FWPAVFQVCQYCTARMADFG I S AGQFVA WWDKS S PV\ EALKGL \ 
VDKLQAF / TPGNEGRVSV\ENlKAAVAIliPTKNPSFGHYFVQ/CLVP 
^1UU^JjWH^A*DFWAGNPPGFLRPGWMFFFLKEPVETAVR*Q*AKWKT 
ASKL\ CSAL\ TLSGLV/ EKLKELQREPLTPEEVQS VREHLGHESDNL 
LFVQITGKKPNFEVGSSRQLKLSITKKSSPSVKPAVDP7V7^AKL\WTL 
S ANDMEDDS MC I FCGCS LTHRW PLEHWQ VE / 1 MMDQPKRRTRVDT\ 
r v ± rKiir'Af f c>Kt>i'As>iii»i>r olKQKT/TRPVSLIAIiNTLi\QDLIDSD 
ELIJDPEDLKKPDPSLLRAASCGEGKKRKACKNCTCGIiAEELEKBKSR 
EQMSSQPKSACGNCYU5DAFRCASCPYLGMPAFKPG\EKVLLSDSNL 
HDA 

1967 

A 

3 

498 

LANRAIMSHKQIYYSDKYDDEEFEYRLVI^ 

ISNRN*GDRIGACVRDMSCCPKDIAKLVPRTH\liMSESBWRNI#GVQ/ 
yftoyvjwvn \ xm.riinr' \zSt , rl±ljLir \RRPJjP\RKPKEMKLGKIjTFQPSS 
FYTAGPYIiPNIFLDNI I YVGLL VFFTFDI 

1968 

A 

1 

690 

RRKAFPKRLPKMAEVQVLVL\DGQG\HLI*G\RIjA\AIVAK\QVLLGR 
KGGCS YACEG I H I \ S GNFLQNQVCSTLAFPLQA / RMNTNP \ SQGP\ Y 

HFG\ APSRIFX WRT\VRRM\ T.DUVTVn T77\ T>r>T T t\\ t>t inrcnnrnnnv 
nrvj \/ir^n.xr \ mrt x \ viwjn \±j.f.tt.l\.l JSJ^Ai<ir'ljJjD\RIjKVlTlX3IPPPY 

DXKIOOlXNrWPXAALKXWRFEAYTESFAYLGRLXAPEVGWNAIRPV 

TAPPGGERGKRJCAKIH\YllKKK*L\MRL\RKQAREETWRKKI 
VLKTHGLLV 

1969 

A 

2064 

2561 

KRFWSFALFYYLILKLL/CIDSIVRIGTILYSTVLFFIFIiKFKV*LV 
LITFIQIF AI FFGS ETF * QVGV * FLL I PNFFSRVLL I LS EGKVI * VC 
vjjiiuuwrnivr 1 v xt?r.VVtyJ. Y xSIJuNK/ AVIHFFIICV/ YFHVKF 
LFL YVLLLS YITQFLF * KS S FVEVL.VKN 

1970 

B 

49 

754 

MAGAAPRL PWSRPHGS CGWWPMQPLQGPGRRAVAAAWEVATPVFM PV 
GTQATMKASRPNSWTLWVAASAWEYLPSGSKAGGTAAFPDGVAGVSV 
RGDGGGLRFRS P YDGNETLLS PENPCRSRMRWAPGWTGALtiQVNPL 
AGPVHAAHQRPDKQNLFAIIQVGWTQISGPPALKRFGSAWHSRAFLT 

HCCTVTTRPRCTTSRPQHRLPT^ELMSAVRTSIVEKRFPDSCGTSWAP 
X* 

1971 

A 

1 

1498 

RQSRWREQLPRLSWRLVPRIMRLVAECGRSRARAGELWLPHGTVATP 
VFMPVGTQATMKGITFXEFLDT^GCRICLGXNTYHLGLRPGPELXIQ 
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KANGLPRLS * IWPSLILLNGTAAVFQMVSLVSLSEVTEEGVRFRSPY 
DGNETLIiSPEKSVQIQNALGSD\ 1 1 MQLDDWS S TVTGPRVEEAMYR 
S IRWLDRCI AAHQRPDKQNLFAI IQGGIjDADLRATCLEEMTKRDVPG 
FAIGGLSGGESKSQFWRMVALSTSRLPKDKPRYLMGVGYWDREARA 
LPVGSGFLGTPYPAWGGGIWGKRTQV*SEETRKTWLSLGGHSEGIWP 
SLGQCEGWEGPWEAPEVLCPLPSWLQPQLi\ATDLWCVALGCDMFDC 
VFPTRTARFGSALVPTGA\LQLRKKVFEKDFGPIRPGVHLPPRPKST 
£>KAr LnAi AAyywbFKH rjj jlVHN \ I AYQLQIjMSAVRTS IVEKRFP 
DFVRDFMGAMYGDPTLCPTWATDALASVGITLG 

1972 

A 

786 

1502 

PPPTKKEMFVPYSPEQRIETS IPPPFKGTGRP/ PGQGRTWERPPFSL 
noivrtVj/vur'r'jjiJoi'K. 1 1 JMjrSJJir'oJi IljAyUbGQEGDCliNRWQHIjH*SAF 

RPSSAEPFTRKRLEGGPAPLRYPGAGNEPGRDAEGRP*GALAGRPRW 
PPSHGR.PPPAPOHPJVQPfV3 , T , ZlPTf r PD /fPCTVDDDPnnnm mRTenv 

AIPANGRKGGAMSPHRGAGPASPSRFFSHIKQGRAIPHVSSRLHFSP 
SSSGSR 

1973 

A 

8 

234 

SAQMAVTTADPRVRPRVRTQLCSLAS L IQTLLVHLT\ PEEKS AVTAI* 
WGKVNVDE\VGGKA1X3RLLVVLPWDPKRSFQSPLGESVPTP*MKVGG 
KALGRLLWLPWDPKRSF 

1974 

A 

1 

169 

NLYISNLPLSMDEQELENMLKPFGQVISTR\IIjRDSSGTSRGVGFAR 
MESTEKCEG 

1975 


2 


XKJlJUii2,£> FKIajN rltv b ATLiHGML PS S FMG KVWKQQMYPQYATYYYPQ 
YLQAKQSLVPAHPMAPPS PRHKLQF 

1976 

A 

17 

796 

HSTAKLY*HSTFAKRSHRNPQHPYV\ P * I SKSLTS S I \NSSTSSNSG 
WDQLSKTNLYIRGLPPHTTDQDLVKLCQPYGKIVSTNAILHKTTOKC 
KG YGFVDFDS PVAAQKAVS ALKASG VQAQMAKQQEQDPTNLY I SNLP 
LSMDKQELENMLKPFGQVI STRI LRDASGTSRG VGFARTESTEKCEA 
V\ MVQS PS WTQPQPYI LQHPGAVLTPSMEHTMS LQPASM I S PLAQQM 
SHLSLGSTGTYMPATSAMQGAYLPQY 

1977 

A 

3 

1421 

CLRRLHEFGTRPPPTPPLPSSPERERQESPRLGNPKVSATIiHGRLPS 
S FMGKVWKQQMYPQ YATYYYPQYLQAKQTLVP AHPMAPPS PSTTSND 
NHR\SISSNSRWDQLSRTNLYIRGL\PPHTTDQDLV\KLCQPYG\KI 
V\ STKGNFGIRTTNNCKGYGFLFDF *QPMQQLKKAVS \ ALKASGGSS 
SKMAKQQGQD\ PTNLYI \ SNLALSMD\ EQELENMLKPF\GQVI STR\ 
ILR\DSSGTSRGV\GFARMESTEKCEAVIGHFNGK\FIK\TPPGVSC 
PHRNLL / CV3JS FAGWEGQEKGDRTPNKY I PNG\ RP WA* EEGRGETLL 
GLTLT/ YTDPTTAAIQNGFYPSPYS IATNRMITQTS ITPYIAST\VH 
AFQVQSPSW\MQP\QPYILHDPGAVLTPSMEHTMSLQPASMI\SPLA 
QQMSHLSLCSTGTYMPATSAMQGAYLPQYAHMQTTAVPVEEASGQQQ 
VAVETFNDHS P YTFQPNK 

1978 

A 

2 

1440 

NSGPS S FMGKVWKQQMYPQ YATYYYPQ YLQA/ KVW KAFGNTK * KGRV 
KNRKML VG WQTL * TLLFLVGQKNQS LVP AHPMAP P S PSTTS SNNNS S 
SSSNSGWDQLSKTNLYIRGLPPHTTDQDLVKLCQPYGKIVSTKAILD 
KTTNKC KG YGFVDFDS P AAAQKAVS AL KAS G VQ AQMAKQQEQDP TNL 
YISNIiPLSMDEQELENMLKPFGQVISTRILRDSSGTSRGVGFARMES 

IPNGRPWHREGEVRXiAGMTLTYDPTTAAIQNGFYPSPYSIATNRMIT 

QTSITPYIASPVSAYQVAKETRENKYRGSAIKVQ\SPSWMQP\QPIY 
FYOHPGAVI/TP\ 53MTCWTMQTA nDfiCMTnDT?WDnnMCDC^ru*pnuDPO 

YHALQP S S LCKGALLGPQ YCTYCRLTAVPC * GRQSGQTGRLAVRGRL 
NDPFS I YRFNPNK 

1979 

A 

104 

340 

TGQSYKKRKFKWIiKQINRYLTSFLF/VFKEIRSHSVA*TIiEHSGAIM 
AHCSLKFLGLSNPPASASQVAGTSGVQYHTSLI 

1980 

A 

118 | 

528 

TFFSVLISYYPLSSWAFELYLMLLWDNVYKSSSMNPTETKAVKTEPE 
IGCSQSTKLSVVHEKKSQEGKPKEHTEPKSLPKQASDTGSNDAHNKKA 
VSRSAEQQPS EKS TEPKTKPQDM I S AGGE S VAG I TAI SGKPGD 

1981 

A 

1 

864 

MFPGKDKRASRRAAANS PQG ARGQGEAHR PGR VHAQAG P RAVAEGQA 
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RAQGAQARPKRPKIjEGGGRMGGVTWSGSQQGLGRNGGKPGVHTGRPW 
LATGTHATWGS CTRAWQVMQVTQAAPPAGAEGSAHLAPPWDVRSSVP 
ARSRPSECEPGASNATPRAIiAGSPPCGSEPAGCCHGIRQWWYKTVNR 
SGCAWAGPEGSVLEGSVFALPFWAWDEALSGVWRGRGADRVDLLADL 
QRAI VGRTS PTQRRGSCNVRQGENPNI KGQKQLRSETKTLERAVEGW 
RKESEK 

1982 

A 

3 

4258 

S DKTS R S EAQALVHTFFVVEWSDALFVG I DEN I LAVQR I VRKAVETD 
LQTSRESIGVAAGASPKLHQQREKKRRRGEAVGCQRLQLDIGDKQPS 
TDSKAKRALTGTFLSGASGIAGTSLDAAMAPTEFASWWRPKAIjASAP 
APPTCAPRPTRDLAAIPDFQSGAMENWGLTTYRESALLFDAEKSSAS 
S KLD ITM TVAHEIiAHQWFGNLVTME WWNDLWLNEGFAKFMBFVSVSV 
THPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFDDV 
SYDKGACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMA 
SICPTDGVKGMDGFCSRSQHSSSSSHWHQEGVDVKTMMNTWTLQKGF 
PL I T I TVRGRNVHMKQEHYMKGSDGAPDTG YLWHVPLTF I TS KSDMV 
HRFLLKTKTDVLILPEEVEWI KFNVGMNGYYIVHYEDDGWDSLTGLL 
KGTHTAVSSNDRASLINNAFQLVSIGKLSIEKALDLSLYLKHETEIM 
PVFQGLNELIPMYKLMEKRDMNEVETQFKAFLIRLIjRDLIDKQTWTD 
EGSVSERMLRSQLLLLACVHNYQPCVQRAEGYFRKWKESNGNLSLPV 
DVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFAIiCRTQNKE 
KLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNK 
LVQKFELGSSSIAHMVMGTTNQFSTRTRLEEEIKMKLNIKILIDDIS 
GVRRPWGCKRAPLESYRIAAVKTEPEKKSQSTKPKSLPKQASDTGSN 
DAHNKKAVSRS AEQQPS EKS TEPKTKPQDM I S AGGES VAGITAI SGK 
PGDKKKEKKSLTPAVPVESKPDKPSGKSGMDAAIjDDLIDTLGGPEET 
EEENTTYTGPEVSDPMSSTYIEELGKREVTIPPKYRELIiAKPIGPDD 
AIDALSSDFTCGSPTAAGKXTEKEESTEVLKAQSAGTVRSAAPPQEK 
KRKVEKDTMSDQALEALSASLGTRQAEPELDLRSIKEVDEAKAKEEK 
LEKCGEDDETIPSEYRLKPATDKDGKPLLPEPEEKPKPRSESELIDE 
Lit> bU b DKb bCKEKPS KP TE KTEES KAAAPAPVS EAVSRTSMCSIQSA 
PPEPATLKGTVPDDAVEALADSLGKKEADPEDGKPVMDKVKEKAKEE 
DLEKLGEKEETIPPDYRLEEGKEKDGKPLLPKESKEQLPPMSEDFLL 
DALSEDFSGPQNASSLKFEDAKLAAAISEVLSQTPASTTQAGAPPRD 
TSQSDKDLDDALDKLSDSLGQRQPDPDENKPMEDKVKEKAKAEHRDK 
LGERDDTIPPEYRHIiLDDNGQDKPVKPPTKKSEDSKKPADDQDPIDA 
LSGDLDSCPSTTETSQNTAKDKCKKAASSSKAPKNGGKAKDSAKTTB 
ETSKPKDD 

1983 

A 

44 

1360 

VWDCPGLRWASFYLRLSWRAHRPQCGTISFVTVNAEEQEKQFVSSRT 
KQKAKEEKLEKCGEDDETIPSEYRLKPATDKDGKPLLPEPEEKPKPR 
oJboJ&JU±uriJb£3E*iJr Uij £5 £, C isJl KF b KP TE KT.E E S KAAAPAPVS EAVSRT 
SMCSIQSAPPEPAT\LKGTVP\DDAVEALADSLGKKEADPEDGKPVM 
DKVKEKAKEEDREKLGEKEETIPPDYRLEEVKDKDGKPLIjPKESKEQ 
LPPMSEDFLLDALSEDFSGPQNASSLKFEDAKLAAAISEVVSQTPAS 
TTQAGAPPRDTSSDKDLDDALDKLSDSLEQRQPDPDENKPMEDKVKB 
KAKAEHRDKLGERDDTI PPEYRHIiLDDNGQDKPVKPPTKKSEDSKKP 
ADDQDPIDALSGDLDSCPSTTETSQNTAKDKCKKAASSSKAPKNGGK 
AKDSAKTTE ETSKPKDD 

1984 

A 

159 

2467 

EKKGSYEKKAASLGSSQSSRTYAGGTASVTKVSAFSGATSKSYSMNP 
TETKAIPVSQQMEGPHLPNKKKHKKQAVKTEPEKKSQSTKLSV\VHE 
KKSQEGKPKEHTEPKSLPKQASDTG\SNDAHNKKAVSRSA\EQQP\S 
EKSTEPKTKPQDMISAGGESVAGITAISGKPGDKKKEKKSLTPAVPV 
ESKPDKPSGK\SGMDAGL\D\DLIDTLGGPEETEEENTTYTGPEVSD 
PMSSTYIEELGKREVTIPPKYRELLAKKEGITGPPADSSKPIGPDDA 
IDALSSDFTCGSPTAAGKKTEKEESTEVLKAQS\AGTVRSAAPPQEK 
KRKVEKDTMSDQALEALSASLGTRQAEPELDLRS I KEVDEAKAKEEK 
LEKCGEDDETIPSE\YRLKPATDKDGKPLLP\EPEEKPKPRSESELI 
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DELSEDFDRSECKEKPS KPTEKTEESKAAAP\ APVSBAVSRTSMCS I 
QSAPPEAGTLKGTVQDDAVEALADSLGKKEADPEDGKPVMDKVKEKA 
KEEDREKL\GEKEETIPPDYRLEEVKD\KDGKATPCPKSLKEQLPPM 
SEDF\LLDALSEDFSGPQNASS\LNFEDAKLAG\AISE\WSQNPRF 
S TT / TQ AG A P \ PRDTS Q SD \ KALGD \ ALDKL S DS LG \ QR \ QPD PG * G 
TNQWEG * S * REKAKAEHRDKLGRKRWTLFPPEYRHLL\DDNG\ QDKP 
v haj« u l b*N QKNPG \ DDQ \ DP I DAL S EDL \ DS \ C PSTTETSQNTA 
K\ D KCKKAAS S \ S KAP \ KNGGKAKDS AKATEGTS K\ PKDD 

1985 

A 

2 

/ o o 

£.r x vuj.i_inFW>UjJbJbIjTIjLiTHSAV 

VDPSLRVRQGMPPWVNFVPGPSSEEASSPNKAHTGWFISDFLTPGN 
P * QVA\ WKADSQPPFKAGVETPPHPPKQSNNKYTDS \ SYLSLTP* PG 
GS PHRS LQLARVTH * RGAPLEKTVAPTRMFH 

1986 

A 

226 

401 

TSGDHWNIAVAPHENSDLLLVQGHDYKYRYFGLIVCVL^QAIVTPEE 
PQS I VPRLRTR 

1987 

C 

211 

2 85 

MFYPFFNPRYFSVGFT AMKTOHTn* * 

1988 

A 

107 

192 

VPR VT I MLRG VR SCI RQG S S ED PRG WKG 

1989 

A 

1 

285 

YIYIYIYIYIYIYIYIYIYIYIYIYNI»DQSWSFAPLLVCGP**AQW 
TSELGNRN*SEKTFTWRICMRNIGRCSLGMLQDYFKPFDFRSKVMLV 
V 

1990 

A 

27 

437 

I AEGNWCVYMPDI I WVFPPQAEAEEDCHSDTVRADDDEENES PAETD 
LQAQLQMFRAQWMFEIxAPGVSSSNLENRPCRAARGSL\QKTSADT\K 
GKQEQAKEEKLS I IDIVTNYI FFFWHMEIFTDTVSRHYNQTFGT 

1991 

A 

25 

1488 

GGSSAAASGVSSRADAPVLAQSPASAGNGRPSTPRVPGSRRHPSAPR 
SGPLPREDGCRTPGPQLLPLPGAXLLRPRTXLLSSAAEDKARSRHPD 
TQHPSSGGRCRGGTESPSSAAGRPASMAEAE\EDCHSDTVRADDDEE 
NES PAETD VQAIiI P\ MIQAQWKLEPAPRVTS SRLENRPCRAARGSLQ 
KTSADTKGKQEQAKEEKARELFLKAVEEEQNG7VLYEAIKFYRRAMQL 
VPDI\EFKITF\TRSPDGDGVGNSYIEDNDDDSKMADLLSYFQQQLT 
FQESVLKLCQ\PELESSQIHISVLQMEVLMYIFRWWSSDLDLTSL\ 
EQLSLVC\RGFLTSCA\RDP*KYARPGPALKVWGQKACIKLVSVTRP 
\jK&uvr Ztit.Frv z> / Kr jlajV x 1SKTTYIRQGEQSIjDGFYRA\WH\QVE 
YYRYIRFFPDGHVMMLTTPEEPQSIVPRLR\TREYQGLDAIPTGVTI 
RLSPRHRTIRTQSIWLLITKEKRKEKPL 

1992 

A 

242 

433 

j^rn^ijivrKKVAHiiLVljAaMLDEPMYVKIjVKAT 
LRIVLGLMFSDACIWG 

1993 

A 

243 

549 

CLLHS PRTGLAE I ENS Q YVCLRTAS I RKQKXiPRPGTAGATSFEEWK 
TALIHDGLARVIREAAKALDKYVYQSQYCGFLQPEQNCHPRBEGMEF 

1994 

A 

81 

436 

AKQG I S PEG VNTO VNTALQE \ VL KIA^ 

RQAH\LCVLASN\CDEP\MYVKLVEAL(^H\QINlxN*GLIDNNET/ 
MGEWG * GLLLKFDRGGGKPRKS WFG\ CSCC 

1995 

A 

3 

*± ZJ 

W I yKh.F ll^vjjbPADATNVKAAWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHFDLSHGSAQVKGHGKKVADAIiTNAVAHVDDMPNALSALSDL 
rLttxixvufc vuf vwr JVL»iji>±n^jjrijV 1 IjAAHIjPAEFTPAVHASIjDKFIjASVS 
TVLTSKYR 

1996 

A 

3 

484 

PTLLVPTDSERTHPWLLSPADKTTVK/APAWG 

ERMFLSFPTTKTYFPHF\DLSHG\SAQV*GPRARKVAD7Uj\TNAVAQ 
RGTDIAQRAVPPLSDL\HAHKL\RVGPGSTFKLLKATGLtiG*PWPGP 
PSPAEFNPWRLQRLPWDKVSWVSC 

1997 

A 

797 

1049 

SEEEECCSCLPSPAGSDQRLLCTHLPC^IVNSEWGVARY*LNVCSC* 
PVFFSVG\PKQYPYNNLYLERGGDPSKEPERVA7HYEI 

1998 

A 

652 

1456 

EGPAEENMAAKVFESIGKPGLA1jS\VQDSMVNSA^ 

DQFRGVQNIVVWEGTHFLIPWVRKPIMFDCCSQPRSPPVITGSKDLQ 

NVNITLCILFRPITSQLPRIFTSIGEDYDECVLPFITTEILKSLVAR 
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FDAGELITQRELVS SQVSNNLME * AATFGLI LDDVSLTHLTFLKELT 
/ J5yvHry£Jin.bAKr V VriJSAAXXISA-bGDSKAAE \XjIAjNSXiATAGDGQS 
ELCKLEAAEDIAYQLSCSRNITCLPAGQSVIiLQLP 

1999 



All 

"PT.T r PI?'OP , T A7QDOT7QT»T"lT T'TTTD 7\ A \ t>t /~»t tt \ nn\iPT t»V tit ■m?nvm rr><^\ 
xvux J- xvjKXijjvoiv^voiJl^ijl CiKAA \ 1 Jj^liXJb \t/J-)Voiix\rlljDr GK.GVSQ 

KRVEAKQVAQQEAERARFWEKAEQQKKAAI I SAEGDSKAAELI ANS 
LATAGDGL I ELRKLEAAED I AYQLS RSRNI T YLPAGQS VLLQLPQ 

2000 

A 

546 

882 

CHVPPTLGTRGQGTHQDPFPFPHSPGAPAPCPPPA/LLGPPRPFPSP 
PVPGPGGENRKPFTSAP*A*VFPRTPLGALSRG*GSPWGRGATPSP 
PTPS PACGPLKCWWHIj t toj 

2001 

A 

43 

311 

TMRWNSLVQTAAQQVAEDKFVFDLPDYESINHVWFMLGTIPFPEGM 
GGSVYFS YPDSNGMPVWQLLGFVTNGKPSAI FKI SGLKSGNY 

2002 

A 

178 

817 

SPASGHCRLNGAAVAMFGCLVAGRLVQTA\ * QQVAEDKFVFDLPDYE 
£>ximxiv v vri v ujvji\iwi'\j* FtiOMvjVjo V i F \ S YPDSNGMPVWKXiIjG \ S V X N 
GKPSAIFKISGLKSGEGSQHPFGA/NEYCPNLHLLLRLGISGEIiLDS 
MAQQTPVR*CCWYPQFDSFTQFTQKMLDNFYNFC\SSFAVSQAQMTP 
S/ SI* KCS FRPNWLKWYEAFS VSNI KAFYN 

2003 

A 

283 

840 

TLPAGFTDVISIHKTGENFCLICGINGRFAVHCITLEEAKYKLCKVR 
KI WFHDAHTIHYLDS FVKVNDTV* TGKITDF I KFDTGNLGRIGVITN 
RKKHRGSFDVVHVKDANGNRFAPWLSNIFVTGKCNKPWISLPRGKGI 
RLTI AFHSDKWFCSDI S VQNLCKRFSQGDTG I KNSEARRCSNTjQV 



X 

yfao 

GR P APEDGG P L S L PNAAMARG P KKHIjKR VAAPKHWMLD KLTGVFAPR 
PSTGPHKL\ RECLPFI I F\ LRNRLK\ YAIiTGD\ EVKKI \ CMQR\ FIK 
I\DGQVRN*YNLPLLGFMDVI\SI\EKTGENFPF*SN*HPRGPLLL* 
HKX 1 F \ liKAJvY K/ VCAKMRKIFCGPTKGIP\HLVTS *CPAPHPAYPR 
NPL I QGEMNTH S RL I LE TGQD / 1 TD FHS KFDHL VTLC \ MVT\ GGA\ N 

T ,(~2'D'hTT/J\7~V . T T*0<^D D fZT Df* C CFiT) rjnfri j* tah 7\ VT/^xt T/"T t t\ fnf»nt?<?vrT ctjt ▼ 

iiuKWWVJji 1 V vrtKVj X rbor JJK W r i w JvU/VNGN IvJjJLiAxoDrSJN J. r WxjXjG 

KGNKPW\ISL\PRGKGIPPHHLLEERDKRLAAKQSSWVKWGPWVTWS 
DLLVP , 

2005 

A 

1 

383 

R I RKLCLN I CVGESGDRLTRAAKVLEQLTGQTPVFS K\ VRE YELRKN 
NFSDTGNFGFGIQEHIDLGIKYDPSIGIYGLDFYVVLGRPGFSIADK 
KRRTGCIGAKHRISKEEAMRWFQQKYDGIILPGK 

2006 

A 

3 

226 

GGRFKEPLGGPKLTRAGKEKVFSLWGADLKPGPGPCFNNAGRGKFVT 

2007 

A 

176 

384 

C^SLVHLLLTLSWLGDCWL^^ 
I S KEE AMR WFOOKYDC? T T T .PttK" 

2008 

A 

2 

669 

IMAQDQGEKENPHAGNFAFRKLVLNIC\VGESGD\RPTRA\AQVIiEP 
AQQGKTPCVFPKA*HTPVKSLWASRR\NEKIAFPTATSSEG7UCARRN 
xj\_xv*vvjxjivrv.r oDiovyuw llr ou I Wii x r Gi* GX \yi!»HxlJljGXIvxDPS XG 
IYGPGTSYW/LrGVRPGFQHPQTKKPQGQGCIG\AKHRl\SKEEA\M 
RWF\QQKYDGIILPGQINSPFLSKKSNKKFSVNKKNNKKITL 

2009 

A 

688 

884 

uo^ v ni^uo r noiMnr nijJM r I oKiiVAARn/ WMAnvibri I XjGGKGGWX 
T*SQEFETSLSNIIKPCLY 

2010 

A 

442 

780 

I E FV I KKI FFPFSHCLI CLA IDLQLQ * YMG VPR \ MQLKTC YVRGKYR 
GWLEEQNILWK*IHIPLQVREDGRPCLPLRKLGGKGGGGGEPLNQH 
GDPVQGNPPYFLFCHKRNP 

2011 

A 

136 

219 

DPAREGHPVCCAGQRPCVDGHGQEKGFIiKAALEVELGASNPSW*DTE 
vi\xryo/\xj^U£Vc>/YXi i X w vt> xy^^VJjSATlJjYEIIjIjGKATxjYAVL 
VLMAMVKRKDF 

2012 

A 

3 

762 

TLRGAVLRGAAGRLGGGLLVLGAGRAMGLSAVGRTRAESGTAERAAP 
VFVLGLQAVSTDTQMFGPGTRLTYLEDLKNVFPPEVAVFEPSEAEIS 
HTQKATLVCLATGF\FPDHVELSWWVNGKEVHSGVSTDPQPLKEQPA 
LNDSRYCLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAK 
PVTQIVSAEAWGRADFGFTS\VKSYQQGVLSATILYEILLGKATLYA 
VLVSAJjVLMAMVKRKDSRG 

2013 

A 

1 

1010 

RYSFFKAVMGIRLLCRVAFCFLAVGLVDA/l^VTQSSRYLVXRTGEKVF 
LECVQDMDHENMFWYQRQDPGLGLRLIYFSYDVKMKEKGDIPEGYSV 


I WW") 
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srekkerfslilesastnqtsmylcasslfnsgyqetqyfgXpgtrl 

LVLEDLKNVFPPEVAVFEPS EAE I SHTQKATIjVCIjATG\ I FPDHVEL 
SWWVNGKEVHSGVSTDPQPLKEQPALNDSRYCLSSRLRVSATF\WQN 
PRNHFRCQVQFYGLSGNDEWTQDRAKPVTQIVSAEAWGRADCGFTSV 
\SYQQGVLS\ATILYEILARGRPTLYCLCW*APIiVLMA\MVKEKGFL 
KAALEVELGAF 

2014 

A 

344 

463 

ITSKRWFHLKFSTQIKKRVIIAPHLLTLSALKLIFCFRT 

2 015 

A 

1075 

2102 

QQKPGQPLFLGSISPKKSFKTRKQKSSSKAEYNLTACKCLLCKRKYS 
SQIMLKRHMQI / RPQDNSFWNKL* KRKRP * * YCQQFRNKS * S *TSRF 
CRIFTPFHYPFSTE/CN*REQIIQMKKRTHRQHRKIKLNKTLK/CPK 
STSPSAAGGQQKTRKPKLSAGFDFKQLYCKLCKRQFTSKQNLTKHIE 
LHTDGNNIYVKFYKCPLCTYETRRKRDVIRHITVVHKKSSRYLGKIT 

7\ OT.T7 TI? JH T VVT3 TnTTTTT XTT^X 7 A VTJnDOOncJMnjcnpvTmoTPVTPncvirv 
/\oJL»il J.JtvrtX JVlVf ±Ur VijJ>llYVA2\J<VjPoKJJiiAi\JT 

EVADVGIEVKVTKNFSLHRCNKCGKAFAKKTYLEHHKKTHKANASNS 
PEGNKTKGRS TRS KALV 

2016. 

A 

1 

624 

GPAVPRWMGQRPGSGGWTAVAKPGELGLPHPLSTGGLPVASEDGALR 
APESQSVTPKPLETEPSRETAWSIGLQVTVPFMFAGLGLSWAGMLLD 
Y FQANTGQ I DDPQEQHRV I S SNLAL I Q VQATVVGLLAAVAALLLGVV 
S REEVDVAKVELLCAS S VLTAFLAAFALGVLMVC IV IGARKLGVNPD 
NIATPIAARPGDLITLSILA 

2017 

A 

216 

1520 

EPKRGPPSHWRTEPSRETAWSIGLQVTVPFMFAGLGLSWAGMLLDYF 
QHWPVFVEVKDLLTLVPPLVG\LKGNLEMTIjASRLSTAANTGQIDDP 
QEQHRVI S SNLGLI QV\ QATWGLLAAVAALLLG WSREEVDVAKVE 
LLCASSVLTAFLAAFALGVLMVCIVIGARKLGVNPDNIATPIAASLG 
DL ITLS I LALVS S FFYRHKD SRYLTPLVCL S FAV\ LTPVWVL IAKQS 
P P 1 V Kl LilU? Li W if P 1 1 LAM V I S S FGGL I LS KTVS KQQ YKGMAIFTPVI 
CGVGGNLVAIQTS\RISTYLHMWSAPGVLP\SR*RNSGPTRVLLFVR 
QKLNSMSCSRLLLLLGGSQGHLI FFYI I YLGGGVSQS * TSQTFVGAL 
TCWAGPDPG*QILLYL/VKK*MVRLTWHQAL\DPDNHCIPYLTGLGD 
LLGTGPPGDSAFSLTGY 

2018 

A 

1 

408 

SNPRVRGGGTHRSGQGAFANMCRGGRMFAPT/KTWRRWHRRVNTTQK 
RYAI CS ALAAS ALPALVMS KGHR I EEVPELPLWEDKVEG YKKTKEA 
VLLLKKLKAWND I KKVYASQRMRAGKGKK/ RCGR * ERKEGCCWC 

2019 

A 

28 

1437 

EERGCFSLPPLACARPL\ISVY\SEKGESSGKNVTLP\AVFKAPI\R 
PD I VELC SNPNLRKNNRQP YAVS EL\ AGHQTK\ AE\ S WGTGRAGGS K 
FPEVRGGGTHRS\GQGAFG\NMC\RGG\RMF\APTKTLGRRWN\RRV 
JM 1 1 P/ OiN-Rx AlC-o \AliA \ ASALTSNWVMSK\GHRIEE VPE\ LPLV \ V 
EDKVEG\ YKK\ TKEAV\ LLLKKL\ KAWND I KK\ VYASQR \ LRA\ S KG 
KMRNRRRIPGAGGPCIIYNEDNGI\IKAFRNIPGNYSA*M*AKLNIL 
K\LAPG\GHGG\RFCI\WTEKCFSGKL\DELYGTWR*S/RASPQRSN 
YHSFPLHK\MINTD\LSRILEKPQRSQRAPPGHHAKKIHR\RVLKKN 
PTEKTLR\ IMLKLKPHMQKTHAAGTTHSSPRPRNHK\LR\VDKGKLL 
HQRHLQAKSDEKAAVAGKKPAVG\ KKGKKAP\ VGVKK\ QKKPLVGKK 
\AAATK\ KPAPEKKPAEKKP\ TTEEKKPAA 

2020 

A 

1 

2196 

MSRKGPRAEVCADCS APD PGWAS I SRGVLVCDECCS VHR SLGRH I S I 
VKHLRHSAWPPTLLQMVHTIjASNGANSIWEHSLLDPAQVQSGRRKAN 
PQDKVHPIKSEFIRAKYQMLAFVHKLPCRDDDGVTAKDLSKQLHSSV 
RTGNLETCLRLLSLGAQANFFHPEKGTTPLHVAAKAGQTLQAELLW 

GRKPDHKNGI1YIIPQMADSLDLSELAKAAKKKLQALSNRLFEELAMD 
VYDEVDRRE1TOAWLATQNHSTLVTERSAVPFLPVNPEYSATRNQGR 
QKLARFNAREFATLltDILSEAKRRQQGKSLSSPTDNLELSLRSQSD 
LDDQHDYDSVASDEDTDQEPLRSTGATRSNRARSMDSSDIiSDGAVTL 
QE YLELKKALATS EAKVQQLMKVNS S LSDELRRLQRE I H KLQ AENLQ 
LRQPPGPVPTPPLPSERAEHTPMAPGGSTHRRDRQAFSMYEPGSALK 
PFGGPPGDELTTRLQPFHSTELEDDAIYSVHVPAGLYRSKLSRHGSG 
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ADSDYENTQSGDPLLGLEGKRFLELGKEEDPHPELESLDGDLDPGLP 
STEDVILKTEQVTKNIQELLRAAQEFKHDSFVPCiSEKIHIiAVTEMAS 
LFPKRPALEPVRSSLRLDNASAYRLQSECRKTVPPEPGAPVDFQLLT 
QQVI QCAYD I AKAAKQLVTI TTRE KKQ 

2021 

A 

1 

207 

SMDSSDIiSDGAVTIiQEYLELKKALATSEAKVQQLMKVNSSLSDELRR 
LQREELEDDAIYSVHVPAGLYR 

2022 

A 

1 

1062 

MTMflPrtM P Q fi > \/'C > Pf"ZOT"t'T'CJ B\nyM'C l TTMir>PPAWn/Tr*riiT tit ti <^Ar\n t-«t t t 

rix nyKowAor vr kowu x o/W V Ma ±UkiL)K±l v V YTETmAIjAGQDREIjIj1j 
AAAQPTEEQVLSRLTAPWTTQLDTKNISFERNKTGILGWRSEKTEM 
VNG YEAKTGR VDKNYPL VTGHTAP VLD I D WC PHNDNV I AS ASDDTT I 
MGAASSAEQRPEPGPKERRESNHPAAPTjPFPPnnnawanPT cDaiiTT 

FGTASLYRGVGSEGAGPERLSQLEGKRFLELGKEEDFHPELESLDGD 
LDPGLPSTEDVILKTEQVTKNIQELLRAAQEFKHDSFVPCSEKIHLA 
VTEMASLFPKRPALEPVRSSLRLLNASAYRLQSECRKTVPPEPGAPV 
DFQLLTQQ VI QCAYD IGKELGNNGF 

2023 

A 

171 

476 

LPRFQTLRNTETEDHSCCCSGRRRFAAAVASRSQ*GNPASLAGSSTT 
GARS CAPVARTORCPR CAAPGPWP MP TT? W t f Q a w r»w i? a w cni? c a x> t r> 
WLQFLEW 

2024 

A 

1 

781 

G I LD YLDN I S PQQ I RKL FYVLS TLAFS KQNEAS SHI QQLHS S VRTGN 
LETCLRLLSLGAQANFFHPEKGTTPLHVAAKAGQTLQAELLWAAKN 
PBKRBYTTENGS P AETPTCS AGKRGTPRTKMSD I S S EDLRRE I QAVXi 
KDADLATISAKRVREQVEGKLNCSLLSRKKEFDKIVMEVINEQQDEE 
DEDDDEGKDPDADPDDESEPSEEEDPSSSEEEAAKKKKQTPKKRPQP 
TKHKAPKKKRKTLNADDSGTADERL 

2025 

B 

1 

1602 

MGYRARIRNHVWl^TQNHSTLVTERSAVPFLPVNPEYSATRNQGRQK 
LGRFNAREFGNIillDILSESKRRQQARAEQPHSAAAADGVTFSVPVT 
PHTFRHSYAMHl^YAGIPLKVLQSLMGHKSISSTEVYTKVFALDVAA 
RHRQPTRRKAPAVLGQCRAIATRERPGWFHVAEVRASRRGRSPTPQC 
aufowrto j. ^ rev? v jj v L_ur,L-^o VrlKSs JbGRH I S I VKHLRHSAWPPTLLQS 
GFPGPSRRAAPRAARGPTPRTEEAAWAAMALTFLLVLLTLATLCTRL 
HRNFRRGES I YWGPTADSQDTVAGS PDHGLLAFAYHRLVRFLLWVLC 
PGWAFFLVNSSRGGVFNPILHPCPRHGQARFAGVGRAEDVTFLYHPC 
AHPWLKLQLALLAYACMANPSLTPDFSLTQDRVDIEQLDPRGRTPLH 

LRYRDYQRWKRl^SIPVLLEKLRKQRSYIiPGRPLARLRRSVNSSGA 
VGM I KS ESMGRWFVGP * 

2026 

A 

2 

692 

ENRSGFQSRRRI YS I SKQKKKLTFFDVKDNT* S WNAVASREECYIX3V 
IT*SRTLSGRS*TKDRLRRT*SMPCYSDSM/ISMQELEFRHI*NTIQK 
MRCELIRLQHQTELTNQLEY1WRRERELRRKHVMEVRQQPKGLKSKE 
LPNKKSSFQGYLQNSQTRQYKALRNHLLETTPKSEHKAVLKRLKEEQ 
TRKLAILAEQYDHSI1STEMLSTQAVSLLFLGQNKFSAPFLPPPE 

2027 

A 

1 

2933 

MDD I PQEARQ YRHNQA YAYS I QGDGAEDDDE R I VRFHTR V I NHKKR K 
NSPRIVQSlTOLTEAAYSLSRTOKRiVlLYLFVDQIRKSDGTLQEHDGIC 
E IHVAKYAE I FGLTS AEAS KD I RQALKS FAGKEWFYRPEEDAGDEK 
GYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSET 
KEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRM 
PDFRRRFLQHIFVLRERPETVLIDLIQRTKDAWEI^l^QYRKMKKL 
LFQEAHNGPAVEAQEEEEEYGWNLDRAYDYVKERRTVTKPNPSFMRQ 
LEEYQGILLARPKRTAYIYGSVYRRCGYADHKVEIKHPGWASISRGV 
L VCDECCS VHRSLGRHI S IVKHLRHS AW\ PPTLLQM\ VHTLASNGAN 
SIWEHSLIi\DPAQVQSGPALKOTPKDKV\HPFKLEFHSGAKLPDCWD 
FVAQVFLCRDD*WELTAKDL\SKAN*HSSVR\TGKPWKTCLRLLS\L 
GA\Q7^FFHPEKGT\T*PLHVAAKAGQTLQAEL\LVVYGADPGSPDVN 
GRTPIDYA\RQAGHHEL71ERIjVEC\QYE\LTERIiAFYLCGRKPDHKN 
GHY 1 1 PQMADRSRQKCMSQSLDI,SELA\ kaa\ kkklqalsnrlfeel 
A\MDVYDEVDRRENDAVWI*ATQ1TOSTLVTERSAVPFLPVNPEYSATR 
NQGRQKLARFNAREFATLIIDILSEAKRRQQGKSLSSPTGNLELSLR 
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S \ QSDLEDQHD YYS VASDEDT\ EHEPLRSTGATRSNRARSMDS SDLS 
DGAVT \ LQE YLELKKALATS EAJCVQQLMKVNS S L SDEL \ RRL\ QRE I 
HKLQAENLQFRQPPG\ PV\ PTLPFPS * TGREHTPLG ARRGEQHRR I R 
QAFSMYEPGSALKPFGGPPGDELNTRLQPFHSTELEDDAIYSVHVHA 

OTiYP PVf5RP.POfT\/T.rt Q\7T.KTCr2T.VTwiT't>T ttjot tt irTVPCrmr /-i r»ni-» nnuv 

uuinrvvjonryuviioo v jlun o <j Jj JvM l K JUT WSLiXXi VATS TliGSRRSPMK 
TRSPP 

2028 

A 

6994 

7054 

LISTSDLYFL*DL*VRCKQTGVIiPWFHTTTIPRSSLEVNPRFPTYHL 

CEL/SASYliTSWY , LSPTjTWTMn*T.T.WT.trHT , T?T T *n* CKrvcnA-trr pmc 

PDTVTAHSAVCTQFW/ SNVDKNNNVI SLiRKSFLFF*QSFALVTQAGV 
QWRDLGSLQAPPPGFTPFSCLSLPSSWDYRRPljPRPAKrT?T.YP**pnr: 
FPETGFHHVHHVGQADLELIiTSGDPPTLASQSAG\ITGVSHHVQPGE 
GFCNCCCWESGPFKERVG 

2029 

A 

411 

713 

TNS L YGLN I YLE I DQH FRNMKI Y \ RSNS L I L I L I LLHL YC I FALSDL 
/ CSH * PQVA I * KE INWPGAVAHTCNP S TLGNRGRW I V * GOE FETSLV 
NMAKARLY 

2030 

A 

3 

84 

SGS THAS GGLQASRWRRAGTSRCS VS SLTSTS TA T ,P OT. PTVATfJT .P 
SGSWTSHHGVAG*GRPTRPVGSRRRDGGERVRVGALCQA 

2031 

A 

3 

519 

SRWRRAGTSRCSVSSLTSTSTASLPGLPTVATfinYPPPnon q q-ktt.tt. 
SPPPGHTHCGNRGQPWPVLGRGGANAAASSSSRPEQGGGS/SSFARL 
S S THGTPGGGEGVKAVS S A* GP \GGLPGTLNLVGGGDVTDAS LSAI/T 
GGLEG ILEKP * AGRCP 1 TDGNPREEGRE PRAG 

2032 

A 

212 

394 

RWRGHSLHE I FKRSVTQNNLI E YR I P IMSLCCASDTRQ/ S FNTTSVP 
AVKKLTVTVNTI S I F 

2033 

A 

726 

908 

LGSFGYGLNHISSPAHSEPQTSPLVGLSVTCN*KSLVHPHSRGHQGS 
EK*GSLDPCISWR 

2034 


1 


iUU<.j\i\lWyjAloDj\Jij\xNN 
PAIKEDVSTLIGDDLASCKDETDESSKEi^ 

HCFTSNEAAPJ^FGGAVQDYFKWKADMTNFDVEVX^ - 

LTEESLHPJWITHFGPTTLRSTl^YGMLRIiCDPLPYDIIVxDPMCGTG 

AIPIEGATEWSDCFHIAGDIWPI^VNRAANNIASLLTKSQIKEGKPS 

WGLPIDAVQWDIO^PLRTGSVDIIVTD^ 

CLREMSRVCTPTTGFAVLLTQDTKCFTKAL^ 

GGLRAAVYVLIRTPQAFVHPSEQDGERGTLWQCKE 

2035 

A 

711 

2257 

grrpj^glqadiivaepillplpglyplpripgpcrwelgddstafket 
vlgileaqptwcdieeat* \hlld vnlienqkbvqvtesdl^sesel 
lvtigatvptgfeqtaade^eklgssckisrdrgkiyfvisvesiia 
qvhclrsvdi^fvwqefqdyqfkqtkeev^^ 

eftshaldshildyyenpaikedvstlig\ddlasckdetdesskee 
tepqvlkixrvtcnragekhcftsh^ 

FDVEXHjLNIHDNEVIVGI^TEESLHRRNITHFGXPTTLRSTLAYGM 

LRLC \ D PL PYD I \ I V\ DPMCG / TLGE I P I EGATE WSDC FH I AGDNN P 

xxAWP^AlWIASLLTKSQIKEGKPSWGLPIDAVQWDIO^ 

DIIVTDLPFGKRTIILTRCK^QKITLCT 

ATPRDPS 

2036 

A 

247 

1294 

MPNTAMKEKPC*TiTjMf5PPQf2Qf2TrKl07iM\ DCTTfavT n?ADnTDD\ t r« * 

* -i-m J^>^x\x^^JjXJl'l>JXTiJvOOO\JiXL>iyiHJ ,, l \KOX 1 r AMit-'KAKI J X KK. \ i_#\jr A 

tidvehshvt^flXgnlvxi^wdcggqdtfmei^ftsqrXdnifXriw 
gsfdfkvfdvesreii\ekdmhyyqscleghpprtlpdakl\fcl\vh 
kmd/llvqedqfjd\lifkereed^ 

RFYKAWVQAFV\YQLDFPNVSAAGRL1^PJDF\AQII\EAD\EVTjL 
RS\TF\LVISHY\QCKEQRDVT4RFEKISNINK\QFKLSCSKL\PASF 
QSMEVR\NSNFA\AFMRHLFT *JTTY^ 

SQCPGNHFEETGRELNGPQATVFPYGVEYCQNALSEKA 

2037 

A 

32 

959 

ALRQS HS QKLRRL WTAAS QGSMPDKACVVTIiAT I AW \ MVS F \ VSN\ Y 
SHTANILPDI\ENEDFIKDCV\RIHNKF\RSEVKPTSQVICLYMTWD 
PAx^QIAKAWA\SNC\QFSHirrRLK\PPPQAWHPNFHFHWGENl\WD 


lot>£t> 
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LGLLPIF\ SVSSAITNWX YDEI\ODY\ TjPKT\ PTP\ tCKVP<"3\ HYTHV 
\ VWAD\ SYKVG \ CAVS I LALKVSGFDRLS PNGAHFYMQIiPGPGGNYP 
/ TTWPYKRGATL/ CSACPNNDK\ CLDNI»CVNR\ QRDQVKRYYS WYP 
GWPIYPR*QRR/SSLPLIGNSVIIjILCVIITILVQHKYPNLVLLD 

2038 

A 

311 

459 

HEHMLLSFLFFFFVFLVEMAFLHVGQAGLELPTSGDLPTSASLSVGI 
TG 

2039 

A 

1271 

1540 

ARGCKR PCRQ I AS LMLQLC I LHAC WAL VPQLHAAPGLA I QRLC I C P Y 
LGSWPSTLGPRVPPLSPSTSPRPSPAFGASLNPPRHSSGPLR 

2040 

A 

176 

552 

MRS S KQM KPN F PRCS S I TV°LS VRGMVL FLTLAC PRFKM S S RTDFRFG 
NPQVT*GSTICSIFRLPSILSMRTTK/DQGMFIWQFQGMRFHF*SSE 
THLVTFLPPGIM*E*FQSLKPEPFSFLLSCYCH 

2041 

A 

204 

401 

WHEFTGAENHCKTLPKENF*WCLCKSHPSEPKNAAYNRTLCDLGISK 
SEVS LGTS FEMWTS QGQE 

2042 

A 

123 

311 

PSRRLLGHPRWAIiSPHLLLPQRFCSWVSFAPASCPC/DLFISRDRSS 
VSVGLMKHRMLGLAELL 

2043 

A 

2459 

3031 

TSPSTSLRPAPIPSPSPSSRPPSSSQTTSAASSPSSAWPKAASRQGA 
*RCRE*LPSWTSQSSPPLKSWGLDSAPPPPLSTCLSASTTRGAGAAA 
TPQCS APCLHR/ CDKVPGFAVAQC INQHS S P SL\ PHSRH PPPAGAPA 
AAGAPATATLEAPARPP PP PQPRVQQS GVAGKDADECS ACRRPTVMA 
ATGF 

2044 

A 

1125 

1695 

DFFSKIDDSHLQVLFENQLLILFFPHFHLNDCSNSFLFFFSEMESRS 
VAQLECTG/AILAHCNLRLLGSSDFP7VIASRVAGTTGTCHHTQLIFV 
FLVETGFHHVGQAGTCMQSQLIKGLRENHLNPGGGGCNEPRSRPLRS 
SLGNRVRLSQREKQNP*DAQVSEKGLNTLELQAKF*GVSHVFPGER/ 
VSLISQNP*PFWKKVIKQVSRLRFKKPQDPFRASQSLRIFQSHH*KS 
WNTYSQQRQVIILFLCLSSYVYFKSFSKQGPRTLQIQLCPWAWSTCC 
KoJjKlir'uiW/iLKV lrlJ-iirIi-iNPKXBDATi\.QDSF IjHGMVPX VRQGQHRM* 
ENAGEFFPWELGLWMRGCFAHGTI RLLS FL»* PFLSSF * R * SRK* S SL 
KMLD KNS QNNKTHAS TS L I KNLP \ HLAPKSMATATKSNFKELRFLQQ 
NR*FTSSSP/CLKISY*YYSSLISI*MTAAIVFCF/CFSEMESRSVA 
/QAGVHWCNLGSIiQSLPPGFKRFSCLSLSSSWDYRHAPPHPAHFCIF 

TRTS LS P I TLRLC I APC PNQAFTDPSRVLHLFVP AS SIP PNLKVTTjK 
Y 

2045 

A 

1 

577 

PTj KP S DGCTTORR PTR P P TP PTYrT\7~FT CUT . VOTT? M\7TIT .TOT?PVQa \7 r T 2V 

LWGKVNVDE\VGGKALGRLLVVYPW\TQRF\FESFGDLSTPDAVMGN 
PKVKAHGKKVLRGAFSDG\ LAHL\ DNLKGTFAHTEVSLHC * QACTWD 
PG ELQG S WGNVLVCVAG P I TFGKR I S TPPVAGL PNQENWLAWCWLNA 
LGPQVIT 

2046 

A 

16 

370 

ACRIAARAYLDLLRTWVFQVGSSGSQRDVNCSVMGPQEKKVVVYLQK 
LDTA YDDLGNSGHFTI I YNQGFE I VLNDYKWFAFFKDVTDFISHLFM 
QLGTVGIYDLPHLRNKLAMNRRWG 

2047 

A 

x 

1 IOC 

ru\&ti\3 vnj\&vriM\jLJt']Nyirl\2 \~J\xJi!t\4± V lJJLnWV 1 t\\j I nKohro orQKV i\. 

RIIPISTNWFYRIQTSSLLNKLWKDSDYEVLNCRAFEEKGPQEKKVV 
VYLQKLDTA YDDLGNSGHFTI I YNQGFE I VIiNDYKWFAFFKYKEEGS 
KVTTYCNETMTGWVHDVLGRNWAC FTGKKVGTAS ENVYVN I AHLKNS 
QEKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSG 
GHSRKI PS KYYKHLYAKKLENLEEMDKPLDAYTLPRLNQEEVGSLNR 
S I TS YESEAVINSLLMKKSPGPDGFTVEFYQRYKEELI PFIiLKIiFQT 
IEKEGLLPNS FYEAS I ILI PKPGRDTTTKRKLQSNIPDEL.RRKKSQ 

2048 

A 

1 

1943 

MWKHLWNWIGSGWKTLDVLEEDRKTKENLVLLRDFSSGCDQSIDRN 
MDSEVQADEFSDRNEEVIGNWSKGQPCYTLAKDLSALCPYPKALWKV 
ELKSDDLGE I PLELQRLVHGGQVNLDMEDHQDQEY I KPRLRFKAFSG 
EGQKLGSLTPEIVSTPSSPEEEDKSILNAWLIDDSVPTTKIQIRIiA 
DGSRLIQRFNSTHRRRAVNGPSNQLPDKPGSPSPFCLAPLITVCPAA 
DD PS PALWLHLVAARS TS QRNGQVLC YLPRKWGAVAVCTS VEACCWV 
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FWIiLERQNDSGMQLGGPPPSAI FRWLAAPAAD APAATS GAGCLATAG 
GAKTRKLALAELYEDEVKCKS S KSNRPKATVFKS PRTPPQRTVKNWF 
QLIAPNAGEDAEQQELAFIAGGTAKWYSHFGRPFGNIiLQRATWNVSR 
KLLRTTQISGSLRYWGFPGFQAKVAATLAKQEVIPPYLSLGRGAEFK 
GLSQNSLQAPLHGTSQGKKNDYPETAVIiPKLPREKVSRREKRLASSR 
CLSLGTKCMSEEAFLVITNSSSCSCEGGFPYLIAGKYAQDFGLVEEA 
CFPYTGTDSPCKMKgDCFRYYSSEYHYVGGFYGGCNEALMKLELVHH 
GPMAVAFEVYDDFLHYKKGIYHHT\GLR\DPFHPFEL 

2049 

A 

X 

2077 

OTYPENIQIKKITGVQWMVPLKYKEEGSK^TYCNETMTGVnTHDVLG 
RNWACFTGKKVGTASENVYVNIAHLKNSQEKYSNRLYKYDHNFVKAI 
NAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPSSVPL.CSSAPLD 
VQLLVSVPAKILGLYGGRMGGMAGQKAPFLRAPVIGVSFADSFQPSR 
LQDS PQLFNQGQLL S AAQVG S S GS QRDVNC S VMGPQEKKVWYLQKL 
DTAYDDLG\NSG\HFTIIYNQGFEIVLNDLQVGLPFFKYKEGGAARV 
TTY\ CNETMTGWGA* CVWGRNWA\ CFTGKKVG \ TAS E \NVYVNTAHL 
K\NSQEK\YSNR\LYKYDHNFVK\AINAIQKSLTATTYM\EYETLT\ 
LAjDM iKKivWWATSKKI PR \ PKPAPLTAE YS QKI / LLHLPTSWDWRNV 

HGI \nfvs pvrnqascg\scy\ sf\ asmgmle/ ARGIRILTNNSQTP 

ILS PQE WSC\ SQ\ YAQGCEGG\ FPYL I AGKYAQDFGLVEEACFPYT 
o x uoxri-ivPiiN^CtUi^r xtx io \oJCf xxlx Vij\vjr ib \0<^WEAXjM\1\Xi\EIjVHP 
WGPWEVCFIiEVYDDFLHYKKGDPTPHTGIiKRPFSTPFE\LTNHAV\I, 
LVG\YGT*LKPLGMEY\WIVKNKLGAPAWGLRNGYF\R1PRGTDECA 
IES I AVAATPQFLNFRGRLPS I FINGS ASSCKGGIGI FTDCRLSTS I 
FRSLQIDFP 

2050 

B 

59 

466 

MSAAMRERFDRFLHEKNCMTDLLAKIiEAKTGVNR 

YLVI WLPSLHLCGFLLWCMAPS PSNGAELLYKRI IRPFFLKHESQMD 
2d V VlUJJji^KAJvb rADAXTKEAKKATVNIjIjX* 

2051 

A 

1 

915 

MVDQVKWADDQAPAEQSLRRl^TNRHIQLIAIGGAIGYFTGWTYWF 
CWVVTGMAJDWAITAYAQFWFPDLSDWASI^VI 

FGVIGLVALYLVFGYGASLLCNIilGFGYPAYISECPPTYGVKPGLW 
GLIAGGEYAIQHAVEGAEDSREGGVNDLKNINLTAQDVWGIAASGR 
x ±* x v x aijIjxS x AKyiAsUK X Vui SCNPGSAVSTTAEFAITPI VGAEWT 

QVN I VKNATGCS AEQAEAAL I E W 

2052 

A 

1 

3180 | 

rxuviVftMr v x .tr^-i-i-iv? X-rt^r ouC* 1 V zriL \n X A/VrOKXLKr c>K.X IjiiJJAINJ l—Pi J. JJJ_i 

LAKLEAKTGVNRS F I ALG V IGLEAL YL VFG YG \ ASLLCNIj I GFG YPA 
YISIKAIESPNKEDDT\OWlATYWVV\ Yr?VPc?HP*KrQpc;nT\ PT.Q«\ 
FPF*LHG*KCGFLVGWC\MAPEPLLNGG*NCSKRRIIRSFSFPGSTK 
SQIGQVLVKD P * RPRPKETADANH * KKPKKS LP * I LLG * KKKKKGGR 

2053 

B 

171 

256 

XVDPASSQAMELSDVTLIEGVGNEVMVVAX* 

2054 

A 

187 

437 

EPTELPHPSEGNDEKAEEAGEGRGDSTWGGWSWGWC 

2055 

A ; 

2 

165 

NQHHHTLP PPS PHYPHPQ* G * HQRAPWPGCW QG PRCRJNKGGQHQAE P 
EGLQPGL 

2056 

A 

1 

506 

SPASQHCHHYPLMMGSSVVRGAFGLPPVGPHAPGVGRMCRLLTQ*RP 

ZVf3f* VP C*T .T .1? T? * "Q 2VT£T.^!T>13T t?C! 7A 'DTDITT ntlOPTlIIVniJTlAir'+tlATJUTltJ 

o/iuV- x K.v,i_«j_>2\.jK iCttJ\jJoJt\KJjxi»oAlrirnJjir^ 

PGCWQGPRVTAR*GGVPHPFSEAKVPLLDGPIPSALDWGPWAHPNRR 
AAWPQPP* *DQLSRPVRSRVG*GGHKP 

2057 

A 

496 

1092 

QPAPGQLLCHLATHP\ PPPGACGPPG\ AGQGKPEPTELPHPSEGNDE 
KAEEAGEGRGDS TGEAGAGGGVE P/ RP * AS P * HPRPAQKTSRCRQQQ 
SRGP PE I * G * HLPPS QPWPHHCAAQ I PQ * YRGAGCG * ARG YRGCPEE 
QILPWTRKPDETDLPGPPATRPSPHTAFSEHYRQLCDSLPPIiTPRVS 
CSRPLS LLGPLGH 

2058 

A 

155 

701 

PGLFRRGLPHPGTGPVQSPFNFP* / PPR/GPEAPPP/RSPAPPRSLR 
GVDGL* PP * PTLLLTGRDNWS Y \ PEAG ARQPFGSGGPMD PS PAPRE * 
DHPKAE P S PQKKGAGPLLTVRS RYGQGRS QAEGQS KDP * GRTPGVLP 
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GPSPREFPVSVPIVFVFIiCRRFFSCKSWIMTIiTFLPSGVSCGCDK 

2059 

A 

1 

506 

DRDAEFYKFLQENDQSQLNFSDSDSSEEEEESFHSLPYVLEBA\AFR 
AAVATT/RGDQESAEANKFQVTDSAVFNALVVFNALVTFCIRDLTGC 
LQKLLIGKVAKDSSRMLQPSSSLLWGKLRVDIKAyiiGSVIQLVSCVA 
ETWLAAVLWHISVLVPCFLTFPKQCSRE 

2060 

A 

9 

2295 

VGCTLRVGVMAAAGSRKRRLAELTVDEFLASGFDSESESESENSPQA 
ETREAREAARSPDKPGGSPSASRRKGRASEHKDQLSRLKDRDPEFYK 
FLQENDQSLLNFSDSDSSEEEEGPFHSLPDVLEEASEEEDGAEEGED 
GDRVPRGLKGKKNSVPVTVAMVBRWKQAAKQRLTPKLFHEWQAFQA 
AVATTRGDQESAEANKFQVTDSAAFNALVTFCIRDLIGCLQKIjLFGK 
VAKDSSRMLQPSSSPLWGKIjRVDIKAYLGSAIQLMSCLSETTVIiAAV 

lrhisvlvp*fltfpkqchmllkrmvvvwstgeeslrvlaflvlsrv 
crhkkdtflg p \ vlkqm y i t yvrnckfts pg al p f i s fmqwtlt\ el 
laiiepgvayqhaflyirqlaihlrnamttrkketyqsvyl^wqyvhctj 

FliWCRVXiSTAGPSEAIjQPD VY PLAQVI I GC I KL I PTARF YPLR \ MHC 
IRALTLLSGS SGAF I PVLPF I L\ EMFQQVDFNRKPGRMS SKPINF\ S 
VILKLSNVNLQEKAYRGGLGWSSCYDLNPGSYLHSQAHCIGFPELVIi 
P WLQLKS FLRECKVANYCRQVQQLLGKVQENS AY I CSRRQRVS FGV 
SEQQAVEAWEKLTREEGTPLTLYYSHWRKLRDREIQLEISGKERIiED 
LNFPEIKRRKMADRKDEDRKQFKDLF\DIiNSS\EEDDTEGFSERGIL 
RPLSTRHGVEDDEEDEEEGEEDSSNSEDGDPDAEAGLAPGELQQLAQ 
GPEDELEDLQLSEDD 

2061 

A 

203 

1383 

MSSFGYRTLTV\ ALFTLI \ CCPG\ SDEKVF\ EVH\ VRPKEAGRLSPK 
GSLEV\NC\ STTC\ NQP * SGVGLGDLS * D * GFCWDEQVQLGNNYIiGIj 
KHFPIDTVLQ\CHFTL\SGKQE\SMNSNVSV\YQPPRQVILTLQPTL 
\VAVG\KSFTIECRVPTV\EPLDSL\TIiF\LFRG\NEILALWRTFRG 
RAAPAPQGGPQPHSNST\ADRRGLAHRKLSPSLAYL\DLMS\RGG\N 
I FHK\HS AP\ KMLE \ I YETCVRTS PDWS SLVTGGVGVACPLFVTSV\ 
LIiC\ FI FG\ QHLRPASGMGTYGVRAAWRRLP\ QAFRPI ATHGVAWPH 
TTVVTGTQCDSSGLRYQPWLKDCDRQQRLGTLPFLARIQTPGLSPVP 
TVS PPG I TMAREGSHHGG PLSLDAS LPTLS FTGWHEA 

2062 

A 

503 

1808 

TTVQEESTVDSQPVPNIDHLLTNIGRTGVSPGDVSDSATSDSTKSKG 
S WGSGKDQYSRI$LLVS S I FAAASRKRKKPKEKAQPS SSEDEIJDNVFF 
KKENVEQCHnDTKEESKKESETLGRKQKI 1 1 AKENSTRKDPSTTKDE 
KISLGKESTPSEEPSPPHNSKHNKSPTLSCRFAILKESPRSLLAQKS 
SHLEETGSDSGTLLSTSSQASLARFSMKKSTSPETKHSEFIiANVSTI 
TSD YSTTS S ATYLA* PGLQSTEP * GA\ S VAES KGDEADDERSBLI SE 
LjKP Vr* I JJbr*br*r tfv rif LoLiUt Kh.Ai* PRKTARb \TKSoRKxNlb>JiObx*Xj£» 
CTEGSLTSSLDSRRQLFSSHKLIECDTIiSRKKSARFKSDSGSLGDAK 
ix r*i\JC*M.*' o xj x xvv v uv i'lrvjvo JVo X vjoXjXj irl ±\.VjXjOHii\.yE>X^ IWxVi i\.X/\Xy.rtXj 
KLRPRAPADDMYE 

2063 

A 

2 

527 

DPEADSIRAEMSRSVALAVLALLSLSGLEAIQRTPKIQVYSRHPAEN 
GKSNFLNCYVSGFHPSDIEVCLLKNGK\RIEKV\EHSDLSFSKDWSF 

\ YIj\ LP ytefhpqlkkde yalp v* thvtls paqr* fkwdrghvkqhh 

GGLKMPHLGWNDSKFLLACFFNIDMLITLTLLCTKM 

Z. U D 4 

7V 

A 

3 

lo3 

iVK-Cilvi hvjKFo XKJ.\2wy-L»«U-L* lxjliiN JjyiYlxjx^xijoii*xJi2.Cio xJJ ibobr 1 £*£ JL 
EDEVSL 

2065 

A 

611 

1434 

TPLllLAVITOQPEIAEALLGAGCDPELRDFRGNTPLHLACEQGCIaAS 
VGVTtTO S CTTPHLH S T LKATMYW / RYVPT .PAT. PMPT .RRDVTWP PGPW 
ANLRS PGKSLNS * HLEG * ENMC AKCLLD AF I KFFQNPDCGFLKFRR * 
DYLFHKRIGERSEG*l^WVYFFPFVLPGHTCLHlxASIHGYLGIVELL 
VSLGADVNA/ QGWCFLPPTH* VRLxjVLML * AE I PNAAI S I SNS FWFQ 
EPCNGRTALYl^VDLPNPDLVSLLLKCGADVITOVTYQGYS P 

2066 

A 

1 

1332 

MVQLSPHYRGDSEDGRKRMVRQRTKPVLFFWSDWLGNSPSLTPPQRN 
PQPAFIGRRGGAAEPTAVRAAVPPASAPARKQRAARGPAHPQQRPQL 
VRAMF\QAAERPQEWAMEGPRDGLKKERLLDDRHDSGLDSMKDEEYE 


1005^ 


.1. 0 OS M-6 H-"J£ „, O IS! Ei! & ill E! 






QMVKELQEIRLEPQB\VPRGSEPWKQQLTBDGDSFIiHLAIIHBEKAIj 
TME VI RQ VKG \ DLAFL \ NFKTNLQQTPLHLAVI TNQPE \ I VEALLGA 
GC\ DLELRDFRGNTP/ L* SLACEQGCLAS VGVLDS SLAPPPQL\HS I 
LKG YPTTMGHTCLHLAFYPMG YWGHRGS FLVFLGVLDVKCSRS PCKW 
PGLPIHLAV\DLQNPEPGCSLLFESVGADVQQSLPTQGYSPPTQPHP 
GPPQAPR I QQQLG \ QLTLENLQMLPRE / S PDEES YDTRVQS STGVS P 
EDELAPMDDCGVFGGPRSDVMSAKGLKET 

2067 

B 

169 

251 

XPQKEPYVI PFTGCVTQKGQVAGDAGWQ* 

2068 

A 

33 

445 

SVARFTGIKTAILVGGMSTQKQQRMLNRRPEIWATPGRLWELIKBK 
HYHLRNIiRQLRCLVVDEADRMVEKGHFAELSQLLEMLNDSQYNPKRQ 
TLVFSATLTLVHQAPAVPATVYGEPENQVSHSVWQAAPACTTS 

2069 

A 

1 

1428 

MGMPGC^rrRHGRIKTVTiTNMNGPGSFTRACKDLHGVESWNGKMARGIi 
DVMAFQEQGGFRPCAELKFSGARVAYPMLPMPWNRLRFEPISSGGAN 
SITVPLRRAWLPRCSGGRCKDSARVRHNTGLHFASSGSGLPSWIPK 
ERAGC PTVAEVS RRRVRTVRLLVAHRDP YDLR VSOKVPPT Q E T YVHR 

SGRTARAT^GLSLMLIGPEDVINFKKIYKTLKKDEDIPLFPVQTKY 
MDVVKGRKVFPKPLGPGVKAQALDSERIPLFPMYVXjQHGRLGIjGPLR 
RVS LAL VR PPGMVGS I LGEKDDVTYA I KPTCW PGLD 1 1 P SCLALHR I 
ETELMGKFDEGKLPTDPHIjMLRLAIETVAHDYDVIVIDSAPNIjGIGT 
INVVCAADVIiIVPTPAELFDYTSALQFFDMLRDLLKNVDIjKGFEPDV 

RILLTKYSNSNGSQSPWMEEQIRDAWGSMVLKNVVRETDEVGKAACA 
EVSKRG 

2070 

A 

1268 

2807 

L.PLESLAVQVKQH IDAVARFTG I KNC YFG FGGM S TQKPARG C * TRRP 
\EIWATPGRLWELIKEKHYHL\GNLRQ\LRCL\WDEA\DRMV\EK 
GPFWLELLHSCFRDASNDS\QYNPKRTKRLFFLATLTL\VHQAPARN 
PFHK\KHTKKMDYTAQIi\DLLMQKIGHEGASPKVIDLTRE*GPRWET 
\ LTETKI PL * DL I EKDFLLVTTFLMO Y£ RAAA * rPAN ctq\ CT TCP T A 

SGLLKV/Lx^lhtALGPWHGLVWHQEGRGLRNLGSILPRL\EDCVLL\ 

ATDVGSSGVWD I PKV\ QHVHPFTRVPRT\ SE I YVHRS WSNLLRS YQIi 

TGLKS*LLlWGLIODVINFKKIYKTLKKDEDIPLFPRADINTWDVV\K 

ERI R\ LARQ I \ EESEY \ RNFQ\ ACLHNS W\ I EOAAAAL\ E I ElA EED 

MY*GEGKA\DQQEERRRQKQMKVLKKELRHLLSQPLFTESQKTKYPT 

QSGKPPLLV\LAPSKERVCFLSCLSQGRKEEGRTKEARGGPQPGTAT. 

AKLQVQFNCPGQVCQ 

2071 

A 

3 

495 

ASG S V I TSGI WASRPR I RT I A/ PS ETP S S S FWFKRRKRRRKSHLAS 
LKRDDNMNTSEDEDMFPIEMSSDEAMELLESSRTIjPNDIPPFQDDIP 
EENLSLAVIYPQSASYPNSDREWSPTPSSLVDCKRTAPHIiAVAAEGG 
LSSSCPPQSSLFHPSESPSGSRPS 

2072 

A 

36 

481 

RQGEIFLVCLFFEMESCSVTQAGVQWCNLGSLQRPPPGFKQFSCLSF 
WS I WDYRCAPPCPANFC I FS KGGVS S CWRGWA* TPG PQM ICPPWSPK 
VLGLLV*ATTPGQRCFFVIiFCFVLFFFQQTCLEEDLVQGITPSSLCL 
ECYKLFA 

2073 

A 

2 

2737 

RADDFVARRRS \ VQTMNYVGQLAGQVFVTV\ KEL YKGLNPATLSGC I 
DIIVIRQPNGNLQCSPFHVRFGKMGVLRSREKVVDIEINGESVDLHM 
KLGDNGEAFFVQETDNDQEVIPMHIATSPILSEGASRMECQLKRGSV 
DRMKGLDPSTPAQVIAPSE\TPSSSSWKKRRKRRRKSQLDSLKRDD 
NMNTSEDEDMFPIEMSSDEAMELLESSRTLPNDIPPFQDDIPEENLS 
IxA\VI\YPQSASYPNSDREWSPTPSPSGSRPSTPKSDSEIjVSKSTER 
TGQIQTPE^WLWGELPQAAKSSSPHKMKESSPLSSRKICDKSHFQAI 
HSESSDTFSDQjSPTLVGGALIiDQNKPQTEMQFVNEEDIiETIiGAAAPL 
LPMIEELKPPSAS WQTANKTDS PSRKRDKRSRHLGADGVYLDDlA T 
DM\ DPEVAAL YFPKNGDPSGLAKHASDNGARS ANQS PQS VGS SGV\ D 
SGVESTSDGLRDLPSIAISLCGGLSDHREITEDAFL\EQAV\SYQQV 
LGNPAI \ IGDP\NFWKIGSKYYNWTTAAPLLL\ AMQAFQKPLPKAT 
VESIMRDKMPKKGGRWWFSWRGRNTTIKEESKPEQCLAGKAHSTGEQ 
PPQLSLATRVKHESSSSDEERAAAKPSNAGHLPLLPWSYKKTLRLT 
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S EQLKS LKLKNGPND WF \ S VTTQ YQGTCRC \ EGT I YLWNWDDK\ VI 
I SD I * WGQLPR * DTLGHI LPTLGKDWTHQG I AKL YHKVS QNG YKFL Y 
CSAIUVIGMADMTRGYLHWVNERGTVLPQG\PLLLSPSSbFSAIiHREV 
I E KKPEKF \ KVQCLTD \ I KNLFFPNTE P FYAAFGJJRPADVYS YKQVG 
VSLNR I FTVNPKGELVQEHAKTNI S S YVRLCEVVDHVFPLLKRSHS S 
DFPCSDTFSNFTFWREPLPPFENQDIHSASA 

2074 

A 

1 

408 

SNPRVRGGGTHRSGQGAFANMCRGGRMFAPT/KTWRRWHRRVWTTQK 
RYAICSAIJUVSALPALVMSKGHRIEEVPELPLVVEDKVEGYKKTKEA 
VLLLKKLKAWNDIKKVYASQRMRAGKGKK/RCGR*ERKEGCCWC 

2075 

A 

3 

1434 

FPVAAAGLRGAWLS PLS AMACARPIt\ I S VYSEKGESSGKNVTLPAVF 
KAPI\RPDIVNFVHTNLRKNNRQPYAVSEL\AGHQTK\AE\SWGTGR 
AGGSKFPEVRGGGTHRS\GQGAFG\NMC\RGGRMF\APTQNLGRRWN 
\RRVDT\TQNRYA \ ICS \ ALA\ ASALTS1WVMSK\GHRIEEVPE\LP 

lvXvedkveglqeeprklflllkklXkawndikkvxyasrrmrggqr 
qngetvrripgaggpci iynedngi \ ikpfrnipgnysa*m* aklni 

T DTVT T T>/"~' \ r^XII^T* T , T?T AOPT*VT npn T<r\ T riDT \r/-lPpT.T"0 * O /DH GOOD OM 

YHSFPLHK\MINTD\LSRNLEKPQRSQRAPPGHRGKREDPIRRV\LK 

JVW ir \Xii\JN JjivXrLLiIS-LiiM Jt* Xl-I\Ajr'WV»K/ ivTi.tr r X? AVAydOVooU VX'ZWJlvlJXt 

HQRHLQAKSDEKAAVAGKQPWG\ KTGKKGCLLVFKNQKKPLVGKK\ 
AAATKKPAPE\ RSLQRRNATTEEKKPAA 

2076 

A 

275 

652 

TVNLCFSTFLWTVIjYFTFFFAKCFQIHLLRIYLR**LKTTNSWFMLY 
A\IMHVLLVQGQN*NKIFSVKQIYLLCTFIYTGIF\W*YHTKG*NKT 
FTM*NSMYIFGYLHTLKPKGKIKRKHLFATTIC 

2077 

A 

63 

292 

GG ILLS ISRPYKTKPxHGIGKYKHIjIKAEEP\ kkkkgkvavrainlg 
TDYNYGVLN IH \ LT\ AYDMTLAES YAQYVHN 

2078 

A 

315 

1092 

RPRSSKRMSGTSEKVLCLRNNTIFKQAFSLIiRFRTSGEKPIYSVGGI 
LLSISRPYKSK\PTHGIGKYKHLIKAEEPKKKKGKVE\VRAINLGTD 
YEYGVLNI\HLTAYDMTLAESYAPLFSTTFCNSLSH*KSEESYAMPT 
QNHKKWLPVCRDQGQPKCLLGLKCI,PTHE\RV\VQISGLSATFARKF 
SWKIIPKPVLP*RESGLFS*REHTERKTSRGRFQRFDPELGRTFWAK 
FEVATVD PFHCQQW S YL S AKEKSLLGS 

2079 

A 

127 

993 

SNIjILRNLFCPHLAIjASSAFDKCQEEETVEKTELIjAERRGLiTENPSR 
LPP I TAGLRSDTMNH I VANLLLSCQQRPRLPNYEM1jKEEHEVAV\ LG 
APQKPCL P EC P PV I HY PQ AR P P WPDHVRLVP \ VQTPS S MNPCMPWAS 
*AFA\YSVKSLRDRK\MVGDVEPGPQGLMPPPAKVPEKSGPVII*GHIi 
HEPILAH\HHPPPSVG\RPGPNG*IKEGITEGQG\ALPR*TOTPTVL 

at OTDT?I>2VDPr!0'P\rr.DT,'Mr , T'DT. r r , DT3CT , Dt>D\/T> , r7VT7Cr , T CDT.QCIUaFT. 
V/JuO X fr(.jrJ\TriK\j\/sZt V l_t Jtr XiIM v_ X JtXj 1 Jr xr o X frCxr V ir I /-VJtjO L- X o riioon/iT Xi 

QWHSIKCICFW 

2080 

A 

61 

346 

LSLFFETEFCSCYPGLECSGVILAHCNFR\LPGFK*FSCLSLPSSW 
YRYPPPDPANFLVLVETGFHHVGPGLVIiNDPLI FNLDTCYSAKRQFL 
E 

2081 

A 

88 

386 

PHRFGKRRNKT\ HTLCRRCG S \ KAYHLQKSTCGKCG YP\ AKRKRKYN 
WSAlUUUiPKYHPlLPCaRMRHljKIVY \RRIQGMGF\REGI 1 P w lyJbvJUA 
VASIQVHL 

2082 

A 

187 

913 

STNKRSTRPPLGFEMAS PHQE PKPGDL I E I FRLGY \ BHWALYIGDGY 
VIHL \ AFP \ SE x PGAGSSGVFS V\ LSNS AEVKRG \Rxi \ ExJ V VtjKjl- v. X 
RGQQTAWDHGVPS PRALEG * S FSSAKEMVWFRRIiK\ YS I V\ SRNCEA 
LCS P S LR YGQVPAVKQVGKG PRFE VG / VLATALG I LWAGC S FAI RR 
YQKKSDS IiKQPQNPVLEAAVGVPVEMS IjPRCLQQPDPRAXiS QAFSRS 
FPLFPSLAGKSMI 

2083 

A 

1 

423 

MGIRLLCRVAFCFI^yGLVDVKVTQSSRYIjVKRTGEKVFLECVQD^ 
HE1WFVTYRQDPGLGLRLIYFSYDVKMKEKGDIPEGYSVSREKKERFS 
LILESASTOQTSMYLCASSLYGATYGYTFGSGTOLx^AnBDLNKVFPP 

2084 

A 

2 

390 

PRVR YRGV I TD PQPLMEQPGLGDS R YWLS SRLRVS ATF W QN PRNHFR 
CQVQFYGLSElSTOEWlliDRAKPVTQIVSAEAWSKADCGFTSGN*QQGV 
LS AT I LYE I LLG KATLYAVLVS ALVLMGMVKRKDF 


\oo5< 
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1 ATT 
XUZ 1 

xlJjlrl^/\MnVjJr'yjj JjVj X V V Li\~L> JjtjAvjFijJciA.y V 1 yiM JFK 1 JL* J. 1 V X L/JVUUVPB 

GYK^SRKEKRNFPLILESPSPNQTSLYFCARSFEQGYEQYFGPGTRL 
TVTEKRNFPL I LESPS PNQTS LYFCAS S S S QGRLS PGFNYGYTFGSG 

VELS WWVNGKEAHSGVS TDPQPLKEQP ALNDS R YCLS SRLRVS ATFW 
\QNPRN\HFRCQVQFYGLSENPEWTQDRAKPVTQIVSAEAWGRADCG 
FTS \ VKS YGQG VLS \ AT I LYE I LARGRPTL YCLC W * A PLVLMA \ MVTC 
EKGFLKAALEVELGAY 

2086 

A 

1921 

1991 

S F V I FD AN I F FQF I G * S LL S L WM * ALLCLL VTEHL YL VS IGHLWCVC 
VCVCVCVCVYDFLIPSHFSIDCFAKAIYILRILIIF\VIFDANIFFQ 
FIGCLLILCFR 

2087 

C 

105 

392 

MQISPPPGHTQNQPYLLFSIASRKCLHPVLLVHYFFQLFLIPIIHSI 
FFKFFISCINVNTSLFTFSLQSQVMGELPLISFSTLSLLEESTYYRIj 
GI 

2088 

A 

38 

230 

LYWQKRKGKLL1TOLTQGIVADPVR*KKFHFKCPSNCPKIHNTESAYA 
VSFMKYSAHVLGSLYEM 

2089 

A 

487 

747 

LPSGPEYSAGLSTSRLKARHRGSRSPERRSGRPRGPRRSRAAAEKPS 
PKCARDAKPGAVATSPAPARRPVPGPRLRAPKLAGRSLLG 

2090 

A 

1 

1350 

MVGAADTAVS IMVLALVGLDESBFRDKLTPITI FMEYRLDYRTAADT 
TGLQPILNQFTPANISRQSGFVLLTCSDQKKIYIGDDNPLTLIVKAQ 

VCDLGNPMKAGTQLLAGLRFSVHQQSEITOTSVKFDLQIQSSNLFDKL 
RNNGPSSFSKAMl^LQWPYKYNNNTLLYILHYDIDGPMNCTSDMEI^ 
PLRIKISSLQTTEKNDTVAGQGERDHLITKRDIJ\LSEGDIHTLEIEA 
T C YDW YVG S V T G K T ft HVT .n T ,MTCT .M EV Q T .T .TT . c Q ut? T c vo wnwun c n V 

DDGFFRRQDIAIAAPYGGEDKKGIVYIFNGRSTGLNAVPSQILEGQW 
aars m p p s fg ysmkg atd i d kng y pe e la v «5 afi t , fcr km c <3 we kr hr 
GQYGLVEQNMKQInTNPGYSPKFEIKVTF 

2091 

C 

179 

649 

MFLLVGAPKANTTQPGIVEGGQVLKCDWSSTRRCQPIEFDATELHRE 
LKENQNH S YS LKS S AS FNV I E FP YKNLP I ED I TNS TL VTTNVTWG I Q 
PAPMPVPVWIILAVIxAGLLLIxAVLVFVMYRMGFFKRVRPPQEEQER 
EQLQPHENGEGNSET* 

2092 

A 

274 

423 

VCraFLEAIENNKYGWFWVCPGGGDICM^T^ALPPGFVFNKDKJOCEE 
KKD 

2093 

A 

1 

3849 

MCGGVFATEELAVSAGLKJlia^CSVVEKRHGDFQVELLLDKXKQKGAI 
RRALFLYSRSPSHSK1WTISRGGLMQCEELIAYLRDESEFRDKLTPI 
TI FMEYRLDYRTAADTTGLQP I LNQFTPANI SRQSGFVLLTCSDQKK 
I Y IGDDNPLTLI VKAQNQGEGAYEAEL TVS I PLQADF IGWRNNEAL 
ARLSCAFK^ENQTRQWCDLGNPMKAGTQHLLSPCTQFSMQVFLFHY 
TAS S S WALDSHRS KP YCVNPAVKCACKGSRX*HAS YENLIPGDPSWT 
VSSRKHPPLCPGKESRMFLLVGAPKANTTQPGIVEGGQVLKCT 
RRCQPIEFDATGNRDYAKDDPLEFKSHQWFGASWSKQDKIIxACAPIj 
YHWRTEMKQEREPVGTCFLQIX5TKTVEYAPCRSQDIDADGQGFCQGG 
FSIDFTKADRVLLGGPGSFYWQGQLISDQVAEIVSKYDPmnfSIK^ 
NQLATRTAQAI FDD S YLGYS VAVGDFNGDG IDDFVSG VPRAARTLGM 
VYIYDGKNMSSLx^FTGEQMAAYFGFSVAATDINGDDYADVFIGAPIj 
FM10RGSDGKXQEVGQVSVSLQRASGDFQTTKLNGFEVFARFGSAIAP 
I^DLDQDGFNDIAIAAPYGGEDKKGIVYIFNGRSTGLNAVPSQILEG 
QWAARSMPPS FGYSMKGATDIDKNGYPDL IVGAFGVDRAILYRARPV 
ITVNAGLEVYPS I LNQDNCTCSLPGTALKVSCFNVRFCLKADGKGVIj 

prkjl^fqvellxjdklkqkgair*^ 

ceeliaylrdesefrdkltpitifmeyrij^yrtaadttglqpilnq 

TPAN I SRQAH I LLDCGEDNVCKPKLEVS VDSDQKKI Y I GDDNPLTLI 
VKAQNQGEGAYEAEL I VS I PLQA* SGKCAQWLRNGMEI S RWNFFWIN 
SSKREQ/YRRALFLYSRSPSHSKNMTISRGGLMQCEELIAYLRDESE 
FRDKLTP I TI FMEYRLDYRTAADTTGLQP I LNQFT PAN I S RQSGFVL 
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LTCSDQKKIYIGDDNPLTLIVKAQNQGEGAYEAELIVSIPLQADFIG 
VVRNNEALARLSCAFKTENQTRQVVCDLGNPMKAGTQHLLSPCTQFS 
MQVFLFHYTAS S S WAIiDSHRSKPYCVNPAVKCACKGSRLHASYENL 
IPGDPSWTVSSRKHPPLCPGKESRMFLLVGAPKANTTQPGIVEGGQV 
LKCDWSSTRRCQPIEFDATGNRDYAKDDPLEFKSHQWFGASVRSKQD 
KILACAPLYHWRTEMKQEREPVGTCFLQDGTKTVEYAPCRSQDIDAD 
GQG FCQGG FS I DFTKADRVXiLGGPGS F YWQGQL I SDQVAE I VS KYDP 
NVYS IKYNNQLATRTAQAI FDD S YLGYS VAVGDFNGDG I DD FVS GVP 
RAARTLGMVY I YDGKNMS S LYWFTGEQMAAYFGFS VAATDINGDDYA 
DVFIGAPLFMDRGSDGKLQEVGQVSVSLQRASGDFQTTKLNGFEVFA 
RFGSAIAPLGDLDQDGFNDIAIAAPYGGEDKKGIVYIFNGRSTGLNA 
VPSQILEGQWAARSMPPSFGYSMKGATDIDKNGYPDLIVGAFGVDRA 
I L YRAR PV I TVNAG LEVY P S I LNQDNKTCSLPGTALKVS CFNVRFCL 
KADGKGVLPRKLNFQVELLLDKLKQKGAIRRALFLYSRSPSHSKNMT 
ISRGGLMQCEELIAYLRDESEFRDKLTPITIFMEYRIiDYRTAADTTG 
LQP I LNQFTPANI SRQAH I LLDCGEDNVCKPKLEVSVDSDQKKI YIG 
DDNPLTLIVKAQNQGEGAYEAELIVSIPLQADFIGVVRNNEALARLS 
CAFKTENQTRQ WCDLGNPM KAGTQLLAGLRFS VHQQS EMDTS VKFD 
LQI QS SNLFDKVS P WSHKVDLAVLAAVE IRGVS S PDH IFLPI PNWE 
HKENPETEEDVGPWQHIY^LRNNGPSSFSKAMLHLQWPYKYNIJNTL 
LY I LHYD I DGPMNCTSDME INPLRIKI S SLQTTEKNDTVAGQGERDH 
LITKRDLALSEGDIHTLGCGVAQCLKIVCQVGRIiDRGKSAILYVKSL 
LWTETFMNKENQNHSYSLKSSASFWIEFPYKmPIEDITNSTLVTT 
NVTWGIQPAPMPVPVWVI ILAVLAGLLLLAVLVFVMYRMGFFKRVRP 
PQEEQEREQLQPHENGEGNSET 

2094 

A 

1126 

2034 

RCWRPRTLGFASPPTRPCSPTHAPGCEMTPRDWHRVLSFASPLRIEB 
I FQGYDKTFGLKNKKGAKQQKFIKAVTHQVKFGQ/ SKSTSGSTE* SW 
KRKLKKDDKKKELQELNELFKP WAAQKI / RVKGADPKS W\ CAFFK 
QGQCTKGDKCKFSHDLTLGEKM\EKRSVYIDARDEELEKETKSSLIQ 
ADTMDNWDEKKLEEVWKKHGEAEKKKPCT 

WFWVCPGGGDICMYRHALPPGFVLKKDKKKEEKEDEISLEDLIERER 
SALGPNVTK I TLES FLAWKKRKRQV 

2095 

A 

1 

213 

ERAAQRRLKAELG I PLEE VPPEE INYLTR IHYKAQSDGI WGEHE IDY 
ILLDGEQRPSHVTPTVMPDVARCP 

20^6 

A 

2 

17 0 

MPEINTNHLDKQQVQLLAEMCILIDENDNKIGAETKKNCHLNENIEK 
GAALKQSFL 

2097 

A 

3 

764 

PAWCGRRLISVLEQIRHFVMMPEIimraLDKQQVQLIiAJEMCILIDB 
NDNKIG AETKKNCHLNEN I EKGLLH\ RAF S VFLFlJTENKLLLQQ \ RS 
DAKVTFPG/ LVLRNTCCSHPLRQSSRAIjRESDAIjGS * /GARQHRGRIi 
ESLSLG I PLEEVPPEE\ L 1 1 * HRIHYKAQSDGI WGGHEIDYILWWRK 
NVT\ LNPDPNE I KS YCYVSKEE LKELLKKAAS GE I KI TPWFKI I AAT 
FL F KWWDNXiNH LNQ FVDHE K I Y RM 

2098 

A 

1 

299 

G I PDQEV I GTG YG I LDQKASGVKYTKS / D FRS PRPDDLS S FQL * VTS 
HWDLRLIEVTETICKRLL\ 1 1 ACTRRGPAAIDLPRACQRPLRHYTTW 
YTKGSRW 

2099 

A 

1 

631 

MDSMPEPASRCLLLLPLLLLLtiLLLPAPELGPSQAGAEENDWVRLPS 
KCEVCKYVAVELKSAFEETGKTKEVIGTGYGILDQKASGVKYTKSDIi 
RIiIEVTETICKRLlA I IACTRRGPAAIDLPRLDSGLSFIRSGVRPAC 
ILVCLLVHRQKCSTVAAPVGT*RH*NGENSARSTMCRKVGV*ARCQR 
TPRACQR PLRHYTTWYTKGSRW 

2100 

A 

171 

825 

RANPTKPIGCAVRGPWSAIiKAGCGCQRRAGGRLAWWARRHVCSPG 
SGEGKLLRARPREEEPPGPLGSGPGRAMDSMPEPASRCIiliLLPLLLL 
LLL»LLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVEL»KSAFEET\ 
GKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLL\IIAC 
TRRGPAAIDLPRACQRPLRHYTTW YTKGSRW 

2101 

A 

2 

527 

PGRAMDSMPEPASRCLLLLPLLLVMLVLLPAPELGPSQAGTDENDW 


|oo51 


.1 0 O 3 ■+ 6. Ml- 3 „, O S 5 6. 0 S 






RIiPS KCEVCKYVAVELKS AFEETGKTKEVIGTGYG ILDQKASGVKYT 
KSISDPPDQDDLSSFQL*VTSHWDLRLIEVTETICKRLIi\lIACTRR 
GPAAIDLAKVGFGIVLHPLWGPGLQCNLSVSAGIi 

2102 

A 

3 

1203 

SSAAJE^UraVLVRRCWGPPLAHGARRGR^ 

VREKPPWRVliFFGTDQFAREALRALHAARENKEEELlDKLEVVTMPS 
PSPKGLPVKQYAVQSQLPVYEWPDVGSGEYDVGWASFGRLLNEALI 
LKFPYGILNVHPSCLPR\WRGPAPVIHTVLHGDTVTG\VTIMQIR\P 
KRFDVGPILKQETVPVPPKSTAKGIGKAVLFKDWGANMLI\SVLEKF 
GLESLSNGRQQPMEGATYAPKI SAGTSCIKWEEQTF\ EQKFRLYRAI 
GNI I PLQTLWMANTI KLLDLVEVNS S VUVDPKN *REQAL,IPGS \ VI Y 
HKQS Q I LLVY CKDG W I G VR S V\ MLKKS LTATDF YNG YLH PW YQKNS Q 
AQPSQCRFQTLRLPTKKKQKKNCCYATMH 

2103 

A 

3 

215 

EELHEKFDRLKKLHQDEKNKLEDKKKSLDDEVNAFKQRKTAAELLQS 
QGSQAGGSQTLKRDKEKKNLGFL 

2104 

A 

2 

341 

S KP F S LQET YEAKRNE FLGELQKKEEEMRQM F VQRVKEKEA/ ELKEA 
EKELHEKFDRFKXLHQDEKKKLEDKKKSLDDEVNPFKQRKTAAELLQ 
SQGSQAGGSQTLKRDKEKKN 

2105 

A 

1 

130 

EVN AFKQRKTAAELLQS QG S Q AGG S QTL KRDKEKKNNP WLCTE 

2106 

A 

2 

3449 

YKPIVEFIDAQFEAYLQEELKIRRVLHTYHDSRIHVCLYFIAPTGHS 

LKSLDLVTMKKLDSKVNIIPIIAKADAISKSELTKFKIKITSELVSN 

GVQIYQFPTDDESVAEINGTMNYTRDVFVITITILDKSGSRAHLPFA 

VIGSTEELKIGNKMMRARQYPWGTVQVENEAHCDFVKLREMLIRVNM 

EDLREQTHTRHYELYRRCKLEEMGFKDTDPDSKPFSLQETYEAKRNE 

FLGELQKJCEEEMRQMFVQRVKEKEAELKEAEKELHEKFDRLKKLHQD 

EKKKLEDKKKSIiDDEVNAFKQRKTAAELLQSQGSQAGGSQTl»KRDKB 

KKNASKARSRHWREKSTSREILRRDLEVREQPRRKRTGLTTPDGTRS 

ASSLGSLLGGGEDGWRTSAVGGRLPVAPPLPPLPPPPLPPLPPPPPE 

PVLEQWRYSHESDWQWAIiRRSFICRHLHSYPGAALDQLLAIiSAAWTN 

HVFLGCRYSPRLMEKILQMAEGIDIGEMPSYDLVLSKPSKGQKRHLS 

TCDASSSKDERQEDPYGPQTKEVNEQTHFASMPRDIYQDYTQDSFSI 

QDGNSQYCDSSGFILTKDQPVTANMYFDSGNPAPSTTSQQANSQSTP 

EPSPSQTFPESWAEKQYFIEKLTATIWKNLSNPEMTSGSDKINYTY 

MLTRC I QACKTNPEY I YAPLKEI PPAD I P KNKKLLTDG YACE VRCQN 

IYLTTGYAGSKNGSRDRATELAVKLLQKRIEVRWRRKFKHTFGEDL 

WCQIGMSS YEFPPALKPPEDLWLGKDASGQP I FNAS AKHWTNFVI 

TENANDAIGILNNSASFNKMSIEYKYEMMPNRTWRCRVFLQDHCIiAE 

GYGTKKTSKHAAADEALKILQKTQPTYPSVKSSQCHTGSSPRGSGKK 

KD I KDLWYENS SNP VCTLNDTAQ FNRMTVE YVYERMTGLRWKCKV I 

LES EVI AEAVGVKKTVKYEAAGEAVKTLKKTQPTVINNIjKKGAVEDV 

ISRNEIQGRSAEEAYKQQIKEDNIGNQLLRKMGWTGGGLGKSGEGIR 

EPISVKEQHKREGLGLDVERVNKIAKRDIEQIIRNYARSESHTDLTF 

SRELTNDERKQIHQIAQKYGLKKKLTATLHYSFHTVWMCSTSKATTA 

WELGKRLKIHWCFDEKINQV 

2107 

A 

2 

1609 

FVP PP FVEAYGWGCRKNWNWAGHRGKRRVKS APSPYPTFKGWAVEGR 
I/3LEQRRGGPSI^PPPTGLGRARSVTLSLCRRSSSVSC^VAPVAAAP 
VAALADAGAMAATD I SRQVGEGCRTVPLAGHVGFDSLPDQLVNKS VS 
QGFC\FNILCVGETGLGKS\TLM\DTLFQHQISKGEPATHTQPGVQL 
QSNTYDLQESNVRLKLTIVSTVGFGDQINKEDSYKP\IVEFIDAQ\F 
EALLQ\EDPKVR\RVLQTYHDSRIHVCLYFIAPTGHCLKSLDLVTMK 
KLDSKVNI IP I IAKADAISKSELTKFKIKIT\ SELVSNGVQI YQFP\ 
TDDESVAE IQLNP * TPHLP\ FAVIGSTEELKI GNKMMRARQYPWGTV 
QVENEAHCDFVKLREHLIRVNMEDLREQTHTRHYELYRRCKLEEMGF 
KDTDPDSKPFSLQETYEAKRNEFLGELQKKEEEMRQMFVQRVKEKEA 
ELKEAEKELHEKFDRLKKLHQDEKKKLEDKKKSLDDEVNAFKQRKTA 
AELLQSQGSQAGGSQTLKRDKEKKN 

2108 

A 

1 

618 

^nmTQCLLGLRTFVAFAAKLWSFFIYLLRGQIRTVIQYQTVRYDIIiP | 
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L S P VS RNWLAQ W I D KH SVRF FVR KR PHVD F F L EWS Q WYEL WFTE 
SME I YGFAVAGKLDNSRS I LKRRYYRQHCTLQLS SY I KDI*S WHSDL 
SSIVILDNSLGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRF 
TADVHSVLSRNLHQHRLW 

2109 

A 

1 

427 

HPVRFFVHRGPH/VDFSLEWSQWYELWFTASMEIYGSAVADKLDN 
SRSILKRRYYRQHCTLELGSYIKDLSWHSDLSSIVILDNSPGAYRS 
HPDNAI PI KSWFSDPSDTALLNLLPMLDAIjRFTADVRS VLSRNLHQH 

DT 
KJ_» 

2110 

A 

2 

835 

WNSAELGRGGPGAGGAGVIGMMRTQCLLGLRTFVAFAAKLWSFFIYL 
LRRQ\lRTVIQYQTVRYDILPLSPVSRNRLAQVKRKILVt,DLDETLI 
HSHHDGVIiRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFL\E 
V\VSQWYELWF\TASM\EIYG\SAVGRLNWDNSRSIL*GGRY\YRQ 
I IjE J_>(j> \bil KDPLiIjWFHRDLS \ G IV IL\ DNS PGAYRSHPGYGGR\ 
DNAI P I KS WVS VTPS \ DTALLNLL PMLDALRFTADVRS \VLSRNLHQ 
HRLW 

2111 

A 

91 

.744 

TPHFHQGTGKRAAWPTCFKPAGPGKIRQT*RPRKAQRRRRIAPRPRV 
GVPIRP\IVRCPT\VRVHTKV\RAGPRLPALRKLRVAGIHKK\VART 
FG I \ S VDPRKPDQVPRS S FKA\NVQRLKE YRS K\ LI LFPQRS PRPPK 
KGDRS \AEELKI*A\ TQL \ TGPV\ MFVRNVQ\ KKEKARVI TEQ\ EKNF 
l\AJ? A£> JjKJ^1/\J<-AJNJ AJLKxiTGQ KE PREAAE PG C * KAHNK 

2112 

A 

* 

2 

846 

RFPLGCFPAQEPQGPVGQPMGAPAGNGHGS * S PHFHQGTGKRAALAT 
WF\NQPAPERSPDVKAPAKPRARRIAP\RPRVGVPIRPIVRCPTV\R 
YHTKV\ RPGRGFS LEELRVAG I HKK\ VARTIG I S VD PRRRNKS \ TE S 
LQAKRAAG*REYRSKLIIiFPQRSPRAPKKGDSS\AEELKLAHPG*PG 
*HaiiAP£>ij 1 V i \ isJsJi KAK V I TE E E KN F KAFAS L \ RMARAN AP ALS A YR 

AKKSPRKPQKTDVEKPTIKPCLGTWNQSGSHAGSPRGVFRGNNWAWD 
GASLL 

2113 

C 

714 

1010 

MKSLNPHLDKIFPHKPLSSLASPSKTLVPGHLCDSFTPFKFSPNYLL 

r Jb'yiNtiFttibFHJjPbPFi^KGISPIYTPHCL 

HLSY* 

2114 

A 

2 

227 

ERFRPPIARVIDVSNGKVHVAE/SCLEETGGLGVDIVLDAG\LDPPD 
SHCLFLKGATLAFLNDEVWNLSNVQQGRYL 

2115 

A 
/A 

-> 

ADO 

KVQAVGAAQRQ S S TDE E I TFV FQE KEDL P VTEDN FVKLQVKACAL S Q 
INTKLLAEMKMKKDLFPVGREIAGIVIiDVGSKVSFFQPDDEVVGILP 
i-iiJoc*uf kjijUJCi v VKVttc.n jl la VHi\_PEKVTWTEAAGS IRDGVRAYTAIiHY 
LSHLSPGKSVLIMDGASAFG 

2116 

A 

1 

489 

ri ivi'j jvivuij r* f voivCi VLUVuoKVor 1? yPUIJE V VGILiPZjDSEDPGLi 
CEVVRVHEHYLtARVIDVSNGKVHVAESCLEETGGLGVD I VLDAGVRL 
YSKDDEPAVKLQLLPHKHDI I TLLGVGGHWVTTEENLQVKKQKHRDV 
DTDSRTQQVAEQESHSGSLSPF 

2117 

A 

2 

221 

GRVGDEPAVKLQLLPHKHDIITLLGVGGHWVTTEENLQLDPPDSHCI, 
FLKGATLAFLNDEVWNLSNVQQGKYL 

2118 

A 

353 

512 

QSFQLRILKDVMEKLSTGVFRPQLDEPIPLYEAKVSMEAVQKNQGRK 
KQWQF 

2119 

A 

1 

1706 

MPRAAIAQTPAPTPAPAAAARGPGRRLAAIiSSPHLRAAEHDGRCSSE 
TGCHGVLDSLASPTPPFRSDERPDQWEQGAGSLTKDWKVAWKVQAV 
GAAQRQVSPGKRVLCLGKRAFKRKVERCPTEPCSPGEDGGSRRTLTW 
DLFGPRPSLAEAAEGHFFLPHHFAVFGKQLWIRCMWLLSQIGPLHY 

AKRRJ^LERDVTMKGLYFOOSSTDRR TTT^VPOPTrFTiT .PVTPDMTTVTf T. 

QVKACALSQ I NTKLLAEMKMKKDLFPVGRE I A\ G I VLDVGSKVSFFQ 
P\DDEWGILP\LDS*KTPGL\CESLLRVHEPLLGFINPEKVTW\TE 
AAGS I SGMG VRAY YS FLHLSF FI FS PGGNQCL I MDGAKCHLRTI AI Q 
LAHHRGAKVISTACSLEDKQCLERFRPPIARVIDVSNGKVHVAESCL 
EETGGLGVDIVLDAGVRLYSKDDEPAVKLQLLPHKHDIITLLGVVGG 
HWTTEENLQLDPPDSHCLF\LKGATLAFU^EVWNLSNVQQGKYLS 
* QS FQLRI LKDVMEKLSTGVFRPQLDE P I PLYEAKVSMEAV\ QKNQG 
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RKKPSWFQF 

2120 

A 

230 

535 

KLFFPPSLLFTLATGIFLDLMHLKKPGGPDISLFYRDIISIABDEDL 

RVHFEESSKLEDLLRKVRAKETRKRALSRCALSPGQLPTLPLNQKQA 
GRGGSHL 

2121 

A 

344 

2631 

GDREGTGRAHTEEG AGAGMHF YDVEGVDWDRVQGPRMRGLS S SVQVG 
RFPAPKVSGPLSICWRQLGLASSRFARSVPFGVWKKDLPCLSLLSQ 
SEDELTPFDMS IQVRLPFNLRQI * KQYWLGMVT^HAGTEKDKNSVNF 
KNI YVLQELDNPGQ* YFKIS FSLI YENI STLI KEDMGHGSDYSLSEV 
LWVCANLFSDVQFKMSHKRIMLFTNEDNPHGNDSAKASRARTKAGDL 
RDTGGHI S PFS * IGT* LLFF YFLFF * 1 1 S I AEDEDLRVHFEESSKLE 
DLLRKVRAKETRKRALSRCALSPGQLPTLPLNQKQAGRGGFKPPPIK 
LYRETNEPVKTKTRTFNTSTGGLLLPSDTKRSQVGRDAFCCCCF*DR 
VSLCRPGWRGLMLMGFKPLVLLKKHHYLRPSLFVYPEESLVIG\SSV 
un\D v i J_i inDrtv Hj/\/Ujv-/ Ki 1 FRKNIPPYFVALiVPQEEELDDQKIQ 
VTPPGMWQRYFQGF * TLP ILWNHPWTPQGGGTVRLALDVSKKLETGT 
NCRQVKGLCWGNTGRNSKYRRAATTVLLSAVSVTHSATPEQ/VWGKM 
KAI VEKPRFT\ YR\ SDS FEN P \ VLQQHFRNLR\ ALALDLMEPEQAVD 

K* KGLGKKKRPKGWK* FKKEGA * RPH I KKGVRWGKVHCAPC * KEACR 
GLRG * KSGLK\ KQELLEAI*T\ KHFPGLDQRPARPAALPAVLPGCPGL 
VLSQLKCVSPELGRVYPT 

2122 

A 

599 

1063 

SCSEGLNAVNNLKASRKPKCLSLKWLFLFYFP**IDSQAFYPLISKI 
VKRKKMTHFWTLYFMYLFLRQSLAIiLRPGWSAVAQSQLTVAIiGLL/V 
LRQSSYVSPPGTWNHRYMPPCPAF*NYFCRDGGLPILFRLVLNSWV* 

2123 

A 

559 

1450 

ACAPLVAASGSEVRRSSAAWAMVQLRPRASRAPASAEAMVDEGQLAS 
EEEE7VEHGLLLGQPSSGAAAEPIj\EED\EEGDDEFDDGGPEEHNLFA 

R A\ OARAPT?PPPP\7T?TT r r r \rD'Dr*V T PT T VfTTDVUT /nt>t?T tt> t Tjnvtm -wt \ 
fc - > ' rt \ V^»^^v-C»c»-C»-K-rc vivc. X VKKJJJvl IxUJv&JvKlvK 1 / RKEIiFT KQKKKKTf \ 

LPDTILE\ KLTTASQTNI KKF\ PRKR* KKVNLQKKNED\ CEKGN\ DS 

KKVKVQKVQS \ VS PE * KLASAVRLKGPRFWRDSR\QQAAQAF\ I HNS 

lygpgtXnrttvnk^lslXankpxpvkrdcctsfi^jnxawgiqkkNq 
n\akrfkrtvgwvrkgwktkevnpmlk 

2124 

A 

142 

461 

KTVSEGFIFSHRTPSLPTFLVLPSSPNFYILDEPTNHLDMETIEALG 
PJ\CNNFRGGVTLiV < ?Hrr)PPPTPT.'\7r , DTrT wwr , T?r^i^\rn^xrc*/r^ , n*r\rvv'T5 7v 

LLQEQFRREGFL 

2125 

A 

145 

904 

VIMLVYLISSPRPELKPVDKESEWMKFPDGFEKFSPPILQLDBVDF 
YYDPKHVIFSRLSVSADLESRICVVGENGAGKSTMIiKLLIiGDIi\APV 
RG I RHAHRNLKIG YFSQHHVGAAGT * TFS ACGNLLGTQVFLGRPEEE 
Y\ RHQLGFGMG I SGELGHAS SLPACLGGQKEAEVAFCSDGLLPCPNF 
L\IL\DEPTN\HLGHGRAIEALGPCLQTISGVGVILVSHE*SALSR1j 
VCRE\ LWVC *GRRRHPCGKEGF 

2126 

A 

39 

2220 

GASFPGVPEWNMATCAEILRSEFPEIDGQVFDYVTGVIiHSGSADFES 
VDDLVEAVGELLQE VSGDS KDDAG I RAVCQRMYNTLRLAEPQSQGNS 
QVLLDAP I QLS KI TENYDCGTKLPGLLKREQS S TVNAKKLE KAEARL 
KAKLRKRSEKDTLKTSNPLVLEEASASQAGSRKESRLESSGKNKSYD 
VRIENFDVSFGDRVLLAGADWLAWGPJ^YGLVGRNGLGKTTLLKMLA 
TRSLRVPAHISLLHVEQEVAGDDTPALQSXVLESDSVREDLLRRERE 
LT \ S Q I AAGRAEGS E AAE LA\ KI YAKLEE I EADKAPARAS VT LAGLG 
FTPKMQQQPNPGVS QVAWRI^IjAIjAGAIj\ FARPDLLLLDEPTNMLDV 
RAI LWLENYLQTWP STI LWSHDRNFIaNAI ATDI IHLHS QRIDGYR\ 
GDFETFIKSKQERRAQ\QQREYEAQQQYRQHIQVFIDRFRYNANRAS 
QVQSKLKMLEKLSELKPVDKES * VVWKFP* WV*EALrRRPFCSIiDEVD 
FYYDPK3iVIFK\RLSVSADLESRICWGENGAGKST\MLKLIiLG\DL 
APVRG\LRHAHRNLKIGY\FSQHHVEAAWT*TSVLVELLARKFPGRP 
EEEYRHQLGRYGISGELAMRPLASLSGGQKSRVAFAQMTMPCPNFYI 
LDE PTNHL \ DMETI EALGRALNNFRGGVI L VSHDERF IRLVCRELWG 
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MRRRR\VTRVEGGFDQYRALLQEQFRREGFL 

2127 

A 

45 

339 

PPPRTPSVPAG*LRAS/STPPRSSSSSSSSSPRKGLSLRPQEALPPP 
PCPSCRSSAPSAPRGGRGETPNKGTGGSLCQSCHHPPVDVQAVLNKC 
FEADE 

2128 

A 

484 

1879 

F QGQRMAGEIiKPS CNLLEQS I LLAKGTS \ GS S LT AL I S Q VLKAP * VY 
VFGELLELCNVHELAE*ANAAVLQLLNIiFAYGTYPDYIANKESLP*L 
STAQQNKLKHLTI VSLASRMKCI PYSVLLKDLEMRNLRELEDLI IEA 
VYTDIIQGKIiDQRNQLLEVDFCIGRDIRKKDINNXVKTLHEWCDGCE 
A\ ALLG IEQQVLRANQYKENHNR/ TLQQQVEAED VY ILPQSGNDCRG 
LRFVSSLIGGVRCraLKHKGDAVVWEAIEEGEALPGKYRLKGGRSVW 
AAAVRAVCMDQVTN I KKTLKATAS S S AQEMEQQLAERECPPHAEQRQ 
PTKKMSKVKGLVS S S PLGPAGAAGTHQAWVRWGGDTKGPFPPLSTCS 
EFQTCPSPHQRLPTLIiVLFQKNCYSPSPTPSFPSCSLQTQGLHQCHP 
KTGSDTAQLPSRRFLSLCKGLVSLPVFLLLPRHFVRLVISRRQL 

2129 

A 

225 

415 

RMAWWESNNYQSFCLPSEESEPEDLENGEESSAELDYQDPDATSLKY 
VSGDVTHPQAGAEDAL 

2130 

A 

356 

1024 

GRAEPEDLENGEESSAELDYQDPDATSLKYVSG*CHPPSGWG/HEDA 
LIVHCVDDSGHWGRGGLFTALGKA/ SPLKPRKIYELAGENEKTLSLG 
GVLLFPV***KIQDTKGKICWP*LWLSIVNRSNVLSGI*DGSPIRRA 
* RR Y / CLAAKKAGKS S VQPPHVLDMPRKGFKRVGVPEPTLFGKHPGL 
QRGHPQPYHILFPLEGKSACPFHSQFFILPSLKDSWCP 

2131 

A 

3 

205 

ERSAAETVTKGGIMLPEKSQGKVLQATWAVGSGSKGKGGEIQPVSV 
KVGDKVLLPEYGGTKWLDD 

2132 

A 

3 

205 

ERSAAETVTKGGIMLPEKSQGKVLQATVVAVGSGSKGKGGEIQPVSV 
KVGDKVL»LPEYGGTKWLDD 

2133 

A 

1 

428 

SFTSAGRLEKPRNSSFFLPPPSLRVSVLRRALGANRGARRGDRPPSL 
GGARGLAATAGASAPARWVIFSRVARAGLRVSLRRYTRAEYESEAEG 
VMGGEIQPVSVKVGDKVLLPKYGGTKWLDDKDYFLFRDGDILGKYV 
D 

2134 

A 

36 

417 

QAFRKFLPLFDRVLVERSAAETVTKGGIMLPEKSQGKVIjQATVVAVG 
SGSKGKGGEIQPVSVKVGDKVLLPEYGGTKWLDDKVCKP**F*KEV 
RYLOIiWIiTNGFFHLQDYFLFRDGD ILGKYVD 

2135 

A 

496 

751 

ISLLNKHPSFFKGGEIQP\VSVKVGDKVLLPEYGGTKVVIiDDK/DV* 
T* * F*KEVRYLQLWSTKGFFHIiQDYFLFRDGDILGKYVD 

2136 

C 

92 

265 

MSXiVIIRFII KGE IWE I 1 YMTER I SGTS SGMGRHHVDAI SNSDL FQS 
TVLSKNCNNIi* 

2137 

A 

330 

875 

AYAAGFGDSCLIAMSGRGKQ*GKVF\AKAKSRSSRAGLQFPGGRVHR 
LLRKGNYAERV\GAGAPV\YL\AAVLEYLTAEIL\ELAGNAARD\NK 
KT\ R 1 1 PRHLQLGHPATTEELNKLA* GRVT ICFRGGVPS * TSRAVIj\ 
LPKKTESSPQGQGQSDFDRYLELPRKRYSNPKGSFSEAPLPFQRKS 

2138 

A 

25 

414 

LKLVAPCRPSRSLRSSPISRRASTTRCLTIEKAKPIKCGTFGATSLQ 
QRYLATGDFGGNLHIWNLEAPEMPVYSVKGHKEIIUAIDGIGGL.GIG 
EGAPE I VTGSRDGTVKVWDPRQKDDPVANMEPVQGE 

2139 

A 

1 

65 

RTRGRTRGNLSFI *NHM*QIA 

2140 

A 

293 

445 

SPSGYGR*SEASRRPTWCSQFQLRQPPHPEPPKSATWNQTPNIGNLS 
LPL 

2141 

A 

544 

1553 

CQMRSCDRTECSFTTARICLSPLPTAFRTHTYGPKSCRSPACDVLGK 
GRSCRGGWLRLASAWCSCARVSAGSALRFPGMESEMETQSAGAEEGF 
TQVTRKGGRRAKKRQAEQLSAAGEGGDAGRMDTEEARPAKRPVFPPL 
CGDGLLSGKEETRKIPVPANRYTPLKENWMKIFTPIVEHLGLQIRFN 
L\KSRNVEIRTCKETKDVSDSDKRQTDFVK\AFILGLSGWKE*HLTL 
IRL\DDLFLE\SFEITDVKPLKG\DHLSRAIGRIA/GAKEGKTKFTI 
E\NVTRTTI VLAD VKVH I LG\ S FQNI KMARTAI CNL I LGNPPS KVYG 
NIRAVASRSADRF 

2142 

A 

1055 

1223 

SSPGNLNVQEGFRITGLICY*KQYQY*MCFLFLL/SLYVCWFDFLL 
LLLFCGHPLH 
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2143 

A 

2 

258 

SPGRTAMRSWSCRPCGGGGLG/ SSQGRSEPGSSRLGPAPS/CGLTHP 
CRRGPSCPHPCPAGCRTCGGICPGPHPTSRCPPSRNCGQT 

2144 

B 

1 

284 8 

MARRPRNSRAWHFVLSAARRDADARAVALAGSTNWGYDSDGQLLQVT 
CLSLTGSWLWDAGVPAGSAVFDTCAACPVSPNGPAVCCGQEHSDSDS 
DPEYSTLPPSIPSAVPVTGESFCDCAGQSEASFCSSLHSAHRGRDCR 
CGEEDEYFDWVWDDLNKSSATLLSCDNRKVSFHMEYSCGTAAIRGTK 
EIX3EGQHFWEIKMTSPVYGTDMMVGIGTSDVDLDKYRHTFCSLLGRD 
EDSWGLSYTGLLHHKGDKTSFSSRFGQGSIIGVHLDTWHGTLTFFKN 
RKC I R I KEAWS PLTRLPQPCLPCMQMAJLRKGNVS FGGRRSRVPTCPP 
TAVPRSPPAAPDQGLDAGRRPGPGAAAAVLGGTGPPQLEGNKPPKRG 
ATMTTRMSSASSAAPGTSRACMMESRAATQTOiWLDGNNLSSVPPAAF 
QNIi S S LG FLNLQGG QLG S LE PQ ALLGLENL CHLHLERNQLRS LALGT 
FAHTPALASLGLSNNRLSRLEDGLFEGLGSLWDLNLGWNSLAVLPDA 
AFRGLGSLRELVIiAGNRLAYLQPALFSGLAELRELDLSRNALRAIKA 
jm v j? vyjjFKLiUlNJbi IjIJKJNIjIAAVAPGAFLGLKALRWLDLSHNRVAGLL 
EDTFPGLLGLRVLRLSHNAIASLRPRTFKDLHFLEELQLGHNRIRQL 
AERSFEGLGQLEVLTLDHNQLQEVKAGAFLGLWVAVMNIiSGNCLRN 
LPEQVFRGLGKLHSLHLEGSCLGRIRPHTFTGLSGLRRLFLKDNGLV 
GIEEQSLWGLAELLELDLTSNQLTHLPHRLFQGLGKLEYTjLLSRNRIi 

LRTFTPQPPGLERLWLEGNPWDCGCPIiKTVLRDFALQNPSAVPRFVQA 
ICEGDDCQPPAYTYNNITCASPPEWGLDLRDLSEAHFAPC* 

2145 

A 

3 

630 

RSGGSASSGRAEARDCWGGRRSLPGDRTLEPCGGCNSCSGPSPACCV 
APA*GPAAAADPPACLPSLAAAGAGGGSTGTGGK\CAGTG/GG*PSC 
CSG PPWSTPSGRI SC/ CCSLVAATPGREGPSLYTSCS * KR* VCRARC 
j^Lj*lku "Wb ItajK iai Ki> Wfa v_KPCGGGGPCRRGPS CPHPCPAGCRTCGG 
I CPGPHPTS RCPPSCRCRRRERS S 

2146 

A 

16 

716 

SVLESKGQRNKGQRKGPVPAQEGSSTGKSVRSGGSASLWQGQGSRTA 
GVGGGRCPGDRTLEPCGGCNSCSGPSPACCVAPA*GPAAAADPPACL 
PSLGRS\CAGGGSTGTGGKWRWHRSG*PSCCSGPPWSTPSGRISC/C 
CSLV/CCHTY/ITSCS*KR*VCRARCPGARQ*WSPGRTATRSWSCRP 
CGGGGPCRRGPSCPHPCPAGCRTCGGICPGPHPTSRCPPSCRCRRRB 
RSS 

2147 

- 


260 

1 A A 

X O 1 

Il.ijVirKoJL»OISIljJjKlr , lr'l w lF I PHEAJ3KQITGPEEADRPPSMSSHDTASPAA 
PSRSP\CCLCWC*CCNCSWNQQRRRAWQASRESKLQPL\PICEVCAT 
PSPEEVQSWAQSFDKLMHSLAGLSVFRAFLRTEYSEENMIiFWLACEE 

LtKJVJRANOIT\7Vr)PK'APT,TVT?T»VT.7CTT CDVPUOT T^ooT7T3cr«TvrtrKTiui/*vc»r» 
ui\n£*mvyn v v i/ciuuvux X ILU X V o JL Jjo Jtr JVCi V bixUoK V K_fc»v7 J.W JvWMyi2#r' 

SAHTF\DDAQLQIYTLMHRDSYPRFLSSPTLPVPCLLQG£>FTVIiPSR 
GLRPPPSSTDPAASYGRLWPFRCWCRQGCQGTCl^GWGPHPRKTVL 
RPSWEVLCLLGLiPTRGOAnTPnzvnp a QTjrD&rmvDDiroDARruDCR r> 

GWGLLRGES KG S PQE PCWTHEPRWGLLHRQPMGPRQAFGGEQNLR IV 

2148 

A 

123 

489 

LKCNFRLDKYCGGTAMSGGEQKPERYYVGVDVGTGSVRAALVDQSGV 
LLAFADOP I KNWE POFNHHPn Q c pn T w a a pp / \rpuirpn OTjroxrtr t mwt 

DHRAVSQVNRINETKHSVLQYVGGVMSVE 

2149 

A 

2 

321 

AS L I DAHAGGLG VTG ADVRGHGL I CEGQP VTS RLAV I CGTS SCHMGI 
ELQVKATARSPD 

2150 

A 

257 

565 

I NKDP I FVPGVWGP Y FS A\ MVPGFWLNEGGQS VTGKLD I LSTNTPHP 
ITILGNTNSILKTQLQHQKPFLITSPHIPKTPQTSRTDQASFLCSPS 
SDSLQILSL 

2151 

A 

1 

291 

VADLTLKGMVTGL KL FQDLE YLAI L YLATVQA I ALGTRF 1 1 EAMEAA 
GHSISTLFLCGGLSKNPLFVQMHADITGSNGKNEQSWESCVPETTG* 
KIL 

2152 

A 

1 

1690 

FRDDPRVPJU^LRAESVYLKFNFRLDKYCGGTALSGGEQKPERYYVGV 
DVGTGSVPJUUiVD\QSGVLl^FADQPIKNWEPQFNHHEQSSBDIWAA 
CCWTKKWQGIDLNQIRGLGFDATCSLWLDKQFHPLPVNQEGDSH 
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RNVIMWLDHRAVSQVNRINBTKHSVLQYVGGVMSVEMQAPKLLWLKB 
NLREICWDKAGHFFDLPDFLSWKATGVTARSLCSLVCKWTYSAEKGW 

PGIAVAASLIDAHAGGLGVIGAHVTvGHGLHL*GAASDVT\LAVICGT 
S S CHMGI S KDP I FVTG VT^GPYFS AMVPGFWLNEGGQS VTGKLI DHMV 
QGHAAFPELQVKATARMPE Y I LH I * TVTWDLD * GRVQPVG FSFTV * F 
YM FG PDFHGNR S PIiADLTLKGMVTGLKLS QDLDDLA I LYIiATVQAIA 
LGTRFIIEAMEAAGPL/ESSTLFLCGGLSQE/YPLFVQMHADITGMP 
WPVARGGSPFLCGAAV1X5ACASGDFASVT\EAMAKMSKVGKW 
LPE 

2153 

A 

1 

479 

AATS WGAAQT I S S P I LDKS L P AD I S I EDQQCL VFRD VAPQAP VH FLV 
IPKKJ?IPRISQAEEEDQQLLGHLLLVAKQTAKAEGLGDGYRLGE*LL 
ALGPSP^SHSYFSI*SSFFDLCHDPDSDTPSSISVINDGKLGAQSV 
YHliHIHVLGGRQLQWPPG 

2154 

A 

284 

604 

RERRDIGGCWSSSSAWGNPREWGFLGNDQGRCTGSIiRGPTSREHKTIi 
AGPHKRIVSWGGSCPRIRGEDGWGCSPRSWLAGALATFIP*KAPFPR 
ITPG*RRRPAATYVPPLSL*LLGHIjTPMWLSRQQRLRGLGDGYRLVI ! 
NDGKLGA\QSV^ILHIHVLGGRQLQWPPG 

2155 

A 

496 

1579 

N CLNS FS C SMNOMYLLi I ALMT I E YNL A YD <5 YY T P T HF T F K T VYKLTD 
FFLQMRL* IIiMQIFW*GStiFLL* *FQLIRDLPFAFSLR*KLCPHLLK 
YKGILWKOIILVXC*R* ILI ILMCDLFALiSGI IVFLSFIMRKFSHL.* 
VTVKFIYMSIPRRPVQIL*EHLGLQDIIGI*ILISCY**ENRYSIVI 
PLF/ CF/VILITHFGFFSLF* I *LLVFV* * F* FNNPNQQYI * FI / CF 
LSNLTTFFLCFLVGG*/ CFVKGIIV*DHCFHTCFTLHVF*STFIY*A 
F\GLE*FHQMNTFW*FVMSSSNCYTLIiGT*Q* ICKGKRNNFLYSANL 
NFI * * IIHFFLKI WIDCF*NTNLWLWFFFSSAFPASK 

2156 

B 

302 

451 

X E F L F KH P KRTATL S MRNT S VM KKGG I F S AE FL K"V FI^PSI^IjIjSHTjLiA 
IGLG* 

2157 

A 

1 

765 

PPLSPGAHVGRGAPTSAFPPPAAEAHPAARRGLRSPQLPSGAMSQNG 
APGMQEESLQGSWVELHFSNNGNGGSVPASVSIYNGDMEKILLDAQH 
ESGRSSSKSSHCDSPPRSQTPQDTNRASETDTHSIGEKNSSQSEE\D 
DIERRKEv^SILKKNSDWIVroWSSSAGKIFPPlvEFLFraPEAARPTL 
QA * G \ NRA VMKKGG I FS A/ ENFLKVFLPS L\ LLSHFAG PSGLG \ I Y I 
GKGVLTTSTSTFWMKELGVTiTLVR 

2158 

A 

1 

381 

HRQHPCIPHTTPTLSHCRRGGHKARFPRAQNPKREIQKILEFVGRSL 
PEETVX>L>T\/EHTS FKEMKKNPMTNYTTVRRE FMDHS I S PFMRKGMAG 
DWKTTFTVAQNERFDADYAEKMAGCSLSFRSEL 

2159 

A 

2 

312 | 

TGR I PGHHASGDS KI LE FVGRS L PEETVD FMVOHTS PKEMKKNPMTN 

YTTVT>QEFI^HSISPFI^KGl^GDWKTTFTVAQNERFDADYAExxMA 

CSLSFRSEL 

2160 

A 

1 

576 

MRRAARAMAFG P PR PQGTAQVPLLAP S P S L SRTRG PRGL CALRALEL* 

QGR1WELIQDISRPPLEYVKGVPLIKYFAEALGPLQSFQARPDDLLI 

S TYPKSGTTWVSQ I LDM1 YQGGDLEKCHRAP I FMR VPFLEFKVPG I P 

SGMETLKOTPAPRLLxCTHLPIx^ 

TLGG 

2161 

A 

2 

313 

VS YYHFHRMEKAHPEPGTWDS FL»EKFMAGE / VWA* LEEGGCEAEGWW 
L*RTAx^CRCPLPASLVSYGSWYQHVQEWWELSRTHPVTiYTiFYEDMK 
EVRPIVMLPPM 

2162 

A 

901 

1848 

RRxTOELIQDTS\RP\PLEYV\KGVPLIKYFAFJ\LGPLQSFQARPDDL 
LINTYPKSGTTWVSQIL\DMIYQGGDLEKC^ 

P\GIPLRl^TL\raTPAP\RLIxv5HLPxxA\LLPQTLIxDQKv^^ 

aqrkrsvavsyyhfyh\makv\hpepg\tw\dsfl 
adswyqhvqewweixnrthpvl\ylfy\edmkenpekgrfqkdpwslg 
grslqektgdfq\vq\htsfkemkknp\mtnlhptvppgssmdh\si 
splp*gkgmagdwkt\tft\vaqner\fdadyaekm\agcs\lsf\r 

SEL 
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2163 

A 

3 

287 

HEEGSIEIDIPVPKYLSSVSSQETQGGPLAPMTGTIEKVFVKAGDKV 
KAGDSLMVMIAMKMENKLKSSQKDKVRQFMIFTQSSEKTAVRIiSFSK 

2164 

A 

75 

301 

AHKKLRAAPLAPMTGTH*K\VFVKAGDKVKAGN\ PSWVMFPMKMEHT 
I KS PKDGTVKKVFYREGAQANRHHS FS RV 

2165 

B 

*3 

368 

MGKGRSPASPAKADGSHODPKTAPKAKEAPAPPKAEAKAKALKAKKA 
VLKGVRSHTQKRRSACHSPSGGPRHCDSGGSPDILGRAPPGETSVAT 
MLSSKFR* 

2166 

A 

153 

372 

LI AGAFRKASGGNNIFRRPKTL.*LRRQPR* PQKSTPRRNKLGHYAI I 
KFPLTTES AVKKI EETNTLVFTVLV 

2167 

A 

842 

1770 

CTPEKCLPAVHRRNWLRNAEPPVWKRHLSPICRKRQGRATKPKRRPW 
YTTPPTRRVRD VALRP Y SRH / GROPHAKKKI RM S LTFRRP KTLRLRR 
QPRYPRKSTPRRNKLGHYAI I KFPLTTES AVKKI EENNTLVFTVDVK 
ANKHQIRQAVKKLYDSDVAIOITEERRHDEQNNILTREAKNRNDNTRN 
RARTETNTETQKNSGYQALISFHSYYSDTSDNKAHLVSYQSGQCTYD 
RLGWNLL I RMTCR S LQHT YD I QSTQ I AS CFH P VS VRVNLRL YC ANYG 
GISAVFMPLRKELLILTRSSNSKVSQLRI 

2168 

A 

3 

281 

KAKKEAPAPPKAEAKAKALKAKKAVLKGVHSHKKKKIRTSPTFRRPK 
TLRLRRQPKYPRKSAPRRNKSVLPPVPMKRFGYSPLVIWKFTHSA 

2169 

A 

353 

605 

EMAPKAKEAPAHPKAEAKAKALKAKKAVLKGVRSHT/ RKKKIRMSLT 
FRRPKTLRLRRQPRYPRKSTPRRNKLGHYAI IKFPLTT 

2170 

A 

301 

946 

PGSYLKPLSWTVALSLRPGFMSRWPCQRLTOQRRSVTQASGASGASL 
CASRLPRDKTAPKAKEAPAPPKAEAJGUCALKPKI<AVVERCPHTHAKK 
RIRMS PTFRRPKTLRLRRQPRYPRKSTPRRNKLGHYAI I K\ FPLTTE 
\SAGKKNKKNNTPLVHCGC*RPTRHQI\RQAVKK\LYDSDV\AKVTT 
L I CPDKEKKAYVRLAPD YDALD VANKI G 1 1 

2171 

A 

1 

1602 

MHLI PPATNCDTLVECCLPGKLTRDLVCTAFTGFGTS YS PQENSHNH 
SALHSSNSHSSNPSNNPSKTSDANILSQTSRHNDRDYRLPRAETHSS 
STPVQHPI KPWHPTATPSTVPS SPFTLQSDHQPKKSFDANGASTLS 
KLPTPTSSVPAQKTERKVLQQLSKHNRKVMIRSALITS I ILDLLYIE 

VSPVPQSPIPPLLQDPNLLRQLLPALQATLQLNNSNVDISKINEAQP 
SNQSPMSLTSDASSPRSYVSPRISTPQTNTVPIKPLISTPPVSSQPK 
VSTPWKGGPVSOS ATOOPVTADKOOGHEPVS PRSLORS S SORSPS P 
GPNHTSNS SNASNATWPQNS S ARS TCSLTPALAAHFSENLI KHVQG 
WPADHAEKQASRLREEAHNMGT I HMS E I CTELKNLRS L VRVCE IQAT 
LREQRILFLRQQIKELEKLKNQNS FMISASKTGSTKGLFFMLDINVN 
VTTFSDANINFE IALVL 

2172 

A 

243 

620 

LLTS SKVMNLSLLEVFS / VS S SQRS PS PGPNHTSNS SNASNATWPQ 
NSSARSTCSLTPALT^AHFSENLIKHVQGWPADHAEKQASRIiREEAHN 
MGTIHMSEICTELKNLRSLVRVCEIQATLREQR 

2173 

A 

3 

688 

QRSLFLRVPQSPIPPLLQDPNLLRQLLPALQATLQLNNSNVDISKIN 
EVLTAAVTQ AS LQ S I IHKFLTAGPS AFNI TSLI SQAAQLSTQAQPSN 
QSPMSLTSDASSPRSYVSPRISTPQTNTVPIKPLISTPPVSSQPKVS 
TPWKQGPVS QS ATQQPVTAG/ PG PRS / S EPVS PRRS S ALK* PEKS I 
TWVPIHTS\NSSNGIQMATVVPQEFFIiARSTCSLTAGTSSTLQ 

2174 

c 

1 

1038 

M ED FTTP L S TLAS S M R OKVMKD T OD T ,D ^ A T .Of) Z^D V T H T V"R TT ,T? P Jf Q T 
EYTFFSALHCTYSKSDHVIGSKALLSKCKRTEITTNCVSDHSAIKLE 
FR I KKLTQNRTTTWKXilSTNLLLNDYWVDNEMKAE I KVFFE TTQHDQNF 
TKARCIQEHRTPYSQSFKLKAETRDPSSFTKxWQKMKRVRDADIA 
7U^SRLRGKVWVKAWPPLRGCRC\TLGCCFl^FCIRSLhlDVKHSCPVCQ 
RELFYYHRLPRPSPRSAYIJWRDIGPKMARDTV^ 
PALLQTHAEKVLVRRVGSGLGTTLPRRGDDPTPERQISFLKCLLKKL 
SHGFGSEGAMPEE I SH * 

2175 

A 

1 

1312 

MGRNQSRKAENS KHES TYS PPKDHS S SQAMEQS WTENGFEKLGFRRK 
EALYYLDLTASCQSQELFQLYAGMSWGTSMPVQAVCPYCGNRIITV 
TTFVPGALTWLLCTTLFLFGYVLGCCFLAFCIRSLMDVKHSCPVCQR 
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ELFYYHRLRQMELAS SARTVVAPSRRGAGREGRGDANGKNS S PTAGS 
AMSSEPPPPPQPPTHQA\GVGLLDTPRSRERSPSP\LRGNVVP\SPL 
PIKKIKlrb \ATVR\ASQGPVYKGV\CKCFCRSQRAIGFNYPPAGWA 
GPDIF/LWHIS*LWKGGVIVPSGKGDEGHPNKMCSHPHPKE*R/IAA 
RPVKVVHHLTLAPQAPKHET\WS\GHVISSIiGGLVESTPLSLCFVGD 
FAGRRQQTLEDDI LPHETGLQRGNGPSQVS PGGKGYGGAGVGCGVFP 
AISTAYGPLQQPLHHLKSIKSI \ 

2176 

A 

75 

413 

PIITKEHLFYVILVPSSGLDGIQG*SQKEGRGIJWPTPVSVAAGLPA 
At>ri/Ui c»/\/\L/y VbKAALA W Jj v.. PAGRRRGFGGEVMRALVASRPDLLTLS 
ASSGWALGSEGRVKASAD 

2177 

A 

1 

361 

GI KRRSGREATRALITS PPNPFRRPAGQSQARAARPTWSAAS SAMEA 
ACj£>£'AA1£» iUKi ±AiTyKPDGTWRKQR^VKEGYVPQ\RRSQYMKTSM 
*SFSRVNQSCPQG*ALRPLLLSPHPGPEGGEPGPLQGQPNVNLEGER 
EKEAAAGKRKERAEGLSKDS*LRCPWEETAQLPSAPQGSRAAPTAAS 
DQPDSAATTEKAKKIKNLKKKLRQVEELQQRIQAGEVSQPSKEQLEK 

2178 

A 

24 

405 

GGKNN * NCKRD * WNCKL * * CQF * QL * AKD I * KSNREHK\ LTNGSNGR 
TS PS FFPKAATTS SSSPS FTPREKAH YQKHCRSH 

2179 

A 

1 

456 

SRFD PR VRQKQQEG YDGG PNF I TKVWQRGT PS S S P YVS PRHS PWS S P 
K/RPPPPPSSQRLPPPPPPPPPPLPEKKLITRNIAEVIKQQESAQRA 
LQNGQKKKKGKKVKKQPNS ILKE IKNSLRS VQEKKMEDSSRPSTPQR 
SAHENLMEAIR 

2180 

A 

215 

369 

TLPPPPPPSSQRLAPPPPPPPPPLPEKKLITRNIAEVIKQQESAQRA 
LQNG 

2181 

A 

1 

3405 

^TITODPDEDHLTSYDIQLSIQESIEASKTALCPERFVTLSAQNRKL 
VEAIKQGHIPELQEYVKYKYAMDRAJDEKGWFPLHEAVVQPIQQILEI ! 
VLDASYKTLWEFKTCDGETPLTIiA\nCAGLVl?NVRTLLEKGVWPNTKN 
DKGETPLLIGICVNILTVVFNNHFQAVKKGSY 

PCVK1^WSAMHEAAK(^RKDIVALLLKHGGNVHLRDGFGVTPLGVAAE 

YGHCDVLEHL IHKGGDVXAXiADDGAS VLFE AAGGGNPDC I SLLLEYG 

GSGim > NRAGHLPIHRAAYEGHYI^KYLIPVTSKNAIRKSGLTPIH 

SAADGQNAQCLELLI ENGFDVNTI*LADH ISQS YDDERKTALYFGVSN 

ITOVHCTEVLLAAGADPNLDPLNCljLVAVPJU^nfYE I VRLLLSHGANVN 

CYFMHVNDTRFPSVIQYAIjNDEVMLRLLLNNGYQVEMC 

GNS FVWS E IQEEVIiPGWTS CVI KDNPDLES S KVTWMTLRRE P YQGD 

RCLGCSPSTSFFLILQRTTLSKLWPTLGVSGQRKGLSSEDLFETAP 

SGSDKAGTMSTFGYRRGLSKYESIDEDELLASLSAEELKELERELED 

IEPDRNLPVGLRQKSLTEKTPTGTFSREALMAYWEKESQKLLEKERL 

GECGKLSWARHDEQELADGGCKSNSHNREl^RAAVGWLGEQGSCVA 

EDKEESEEELIFTESNSEVSEEVYTEEEEEESQEEEEE/DRQ*RRGK 

NN* NCKRD * WNCKL* * CQF * QL * AKD I * KSNREHK\ LTNGSNGRNTE 

S PAAIHPCGNPTVI EDALDKI KSNDPDTTEVNLNNIENI TTQTLTRF 

/u^A_L»rdJJM I v Vls.1 r fc> LAN THADD S AANA I AEML KVN JbJH I TNVNVESNF 

ITGKG\ I LAI LRALQHNTVX.TELRFHNQRH I MGSQVEME I VKLLKEN 

TTLLRLGYHFELPGPRMSMTSILTRNMDKQRQKRLQEQKQQEGYXX3G 

PNLRTKVWQRGTPSSSPYVSPRHSPWSSPKLPKKVQTVRSRPLSPVA 

TPPPPPPPPPPPPPSSQRLPPPPPPPPPPLPEKKLITRNIAEVIKQQ 

ESAQRAI PNGTNNE I REQGQETAKQ YS KGNI KFSEVS AREENGRQFP 

TFYPTEISS j 

2182 

A 

3 ! 

140 

GHFPCALK5VVPPDEKHWNDLALEFHYMRWVLASFISMRKIQRRI 

2183 

A 

365 

2078 

NLVHCHCLDIFINQAFDMQRDLQITPKRLEYTRKKE^ELYESLMNIA 
NRKQEEMKDMIVETLNTMKEELLDDATNMEFKDVIVPESGEPVGTRE 
IKCC IRQIQELI I SRLNQAVANKLI SSVD YLRES FVGTLERCLQSLE 
KSQDVSVHITSNYLKQILNAAYHVEVTFHSGSSVTRMLWEQIKQIIQ 
RITWSPPAITLEWKRKVAQEAIESLSASKLAKSICSQFRTRLNSSH 
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BAFAASLRQIiEAGHSGRL\ BKTEDLW\LRVR\ KDHAPRLARLSLESC 
S FTGCLCLHRKP \ KLGQELGRGQYGWYLCDNWGGHFPCALKS WPP 
DEKHWNDIiALEFHYI^SLPKHBRLVDLHGSVIDYNYGGGSSIAVLLI 
MERLHRDL YTGLKAGLTLETRLQ IALDWEG I RFLHSQGLVHRD I KL 
i\JM v J_»±jJJlvyiMKAAJL iUJjUr CKPEAMMSGSIVGTPXHMAPEIjFTGKYDN 
SVDVYAFGILFWYICSGSVKLPEAFERCASKDHLWNNVRRGARPERL 
PVFDEECWQLMEACWDGDPLKRPLLGIVQPMLQGI/L*NRLCKS\NS 
EQPNRGLDDST 

2184 

A 

2 

468 

GPEAVPHRACSLWLPTPSGPTCPATRHSEHEFTFQMWPGCASSSLLC 
VGTPGEARADHRCMRVTPVHLGLPGWVGGAGARPHCCESATPGGPRG 
CPTP/DDWVAAHIPPSFHK*GNPGCIRGDPRGLLWPLCPMTCVASDK 
APAIiLGSACCSVWSVR 

OIOC 
^ XOj 

TV 

r\ 

-L / v 

1 9 9 A 

Din*IDnDBPDTDf3TDOTTI7VTJ r r'DT T> c TV vnurmrt tt inrV "e»T7»t i vwt/iiT\ /"i 
lr V WKvixrAVjxr lxrUXlro I iEJ\JiTRljPSANMrTvX*KV\ 

EAGEFLPEDFRDGEY*AAVTL\ERQEDLKTIiLAHPVTLGEPQLRKRE 
l JliiiY>vjiJuJVA.ivJ\iiJiyKoi\X^ Xyjb JiJtv/ JA.lvivoK.Jci 1 IvV Jr VLi/v 
DPEP* IHYGTCWECPTFLKA/ SLWENNLPWEKFLSDKNNPDVCDEY 
TCRT AIjHRACIjEG HTJX T VP KT .MP 7A R 71 0 T P PP HMT.PQTZiTWWaQPnnWT. 
DVLK\ LLLNKGAKI S ARDKXL STALHVAVRTGHL * GCAEHL IACEAD 
LNAKDREGDTPLHDAVRLNRYKM I RLL IMYGADLNI KNCAGKTPMDL 
VLHWQNGTKAI FDSLRENS YKTSRIATF 

2186 

A 

158 

429 

TSSGRFJTI^SWLTAGEENAiFLWQASVGSVIKMMAGCGEIDHSINMli 
PTNRKTU^SCSNTAPSLTVPECAICLQTCVHPVSLPCKHVFCY 


TV 

J. 


jiK Ji J\K jvy y y li p jjKiUvAi^Qx^ 
NHRRPYTSRPEGPRVKA 

2188 

B 

1 

819 

MTKWKSNPQIHMEIiQRTLSNPNNLEKSNVKTYYKAKIFDDNLIQIiTF 
QQHRQDRDTEQEKKKEEEREKRXQQPETLREKAl^QTVRPGFDLTQG 
PNCLTPHGKTWSRiPPKNHRRPYTSRPFAGAAPAQGRPSTYRPIiTPP 
PPL SGPERKADPE YVAAGGDTAS YAS S PRVR I LEWPAFTLLPREGLA 
P i ir* i brAlAAAl t>yt>if IjoIjXAjARSCRERARPSQCACFvIjGPRGSERV 
QFQQKE I WGGG S KTC VGP AGG AAQKS S FAI LRKLWGA * 

2189 

A 

694 

1482 

PVPECAICLQNM\VHPVSLPCKHVFCYLCVKGASWAWESGCALCRQG 
NbKvjlfeljUlvANJbVVHQEGTQGQASRRErw 

ALVGELEDAFS KG * K\ ATEML IAGFLYVADLEN1WQYRRNEHGRRRK 
IKRJDI ID I PKKGVAGLKtiDCDANTVNIiARES SADGADSVS AQSGASV 
QPLVS S VRPLTS VDGQLTS PATPS PDASTS LEDS FAHLQLRWRQHSL 
TGGHRGEGEEDHGITIFQGRVPGTRHLH 

2190 

A 

93 

458 

LLLVTITOSVIFFACjyVRVRDGLPLSASTO 

KJjAyirvjKvjDAlibLDrbl \jio XXQKVKWHFNYVSSSQMECSIjEKIQE 
ELKLQP PAVLTLEDTDVANGVMNGHTPM 

2191 

A 

1077 

1818 

LLLWTMSVIFFAC\ATCVRxX3LPLSASTO 

R\LAQYPGRGSAEGCDFSIHFS\SFGDVACMAICSCQCPAAMAF\CF 
L\ETLWWEF\TASYDTTCIGLASRPYAFIiEFDSIIQKVKWHFNYVSS 
SQMECSLEKIQEELKLQPPAVLTLEDTDVANGVMNGHTPMHIiEPAPN 
FRME P VT ALG I L S L I LN I M CAALNLQ P VQHVYLGQLDLG AH / HRKS F 
IGREKGN I LVF * VDM I Q 

& JL 7 « 

TV 

i 

X 

o o o 

MKAXiKAvjJLi I i-iAXjUAvjijUAVVEGWRRRREDAP^AAPGIj 

AAELPPVPGGPRGPGELAKYGLPGLAQLKSRESYVLCYDPRTRGALW 

VVEQLRPERLRGDGDRRECDFREDDSVHAYHRATNADYRGSGFDRGH 

LAAAANHRWSOKAMDDTFYLSNVAPOVAPAPP^GRGMRGI^LASRGL 

GLFFCRTGQLPNAPPVPHLNQNAWNmjEKYSRSLTRSYQNVYVCTGP 

LFLPRTEADGKSYVKYQVIGKNHVAVPTHFFKVLILEAAGGQIELRT 

YVMPNAPVDEAIPLERFLVPIESIERASGLLFVPNILARAGSLKAIT 

AGSK 

2193 

A 

1544 

1928 

QKVT^GQIREGCLEEVALGSSRPPAHVCIX5LPTGQTADGKSYVKyQVI 
GKNHVAVPTHFFKVLILEAAGGQI\ELRT\YVMPN\APVDEAIPLEA 
FL\VPIESIERASG\LLFVQNILARAGSLKAITAGSK 


\00VVf 
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ARGWVPRPKPLAVFVSGREVTLSSAAPAALCPRWIPRRGSFKSLAGR 

pplgrs pamralragltlasgaglgawegwrrrredaraalgll.gr 
lpvlpvaaaaelppvpggprgpgelakyglpglaqlksresyvlcyd 

PK i KCxALW WEQLRPERLRGDGDRRECDFREDDS VHAYHRATNADYR 
GSGFDRGHLAAAANHRWSQKAMDDTFYLSKVAPQVP\HSNQIAWNNL 
\EKYSR\SLTRSYQNVYCLH*GHFFLPRTEAD\GKSYVKYQVIGKNH 

VAVPTHFFKVLILEAAGGQIELRTYVMPNAPV\DEAlPLERFIj\VPI 

PQTPRAQRT.T.PUDMTT BT5ar*»eT \ iri.TV /-*r»Tr 
c»o i£iAMDuuijf V cvt X±jAK/\v>oJj \ KAJL \TAGSK 

2195 

A 

204 

507 

GSPGTRSETREPGAMSQPPAPPPPLPPPPPPPEAPQTPSSLASAAAS 

,GGL\ *SGETGESFPGAARIRSVRRVYFSRPPVLSASSVPSAASGTSS 
NSCWGLRR 

2196 

A 

1 

460 

EFKQLIHIEHQPNGGASVIHAYSNELSHLSPMEMERFARKSLVGS*C 
£> iuv £><j£>r v i w wu X V \HGAATY3jP \ DFLDYFS FNFP\NS P\ VKME 

ILGKKDIETTTMSNFHAQSLTVLQPGRQSETPSQKKRKKFMIMLSSR 
WEGETKNKYRRLTN 

2197 

A 

199 

769 

APAQLRLRAMQpRVLGAL PLCLLLPS PSSSRSSFS FS AGTAVPS S AS 

ASLSQPVPRKLLVPPTLLHAQPHHLLLPAAAAAASANAKSRRPKEKR 

EKERRRHGLGGAREAGGASREENGEVKPLPRDKIKDKIKERDKEKER 

EKKKHKVMNEIKXENGEVKILLKSGKEKPKTNIEDLQIKKVKKKKKK 
KH 

2198 

A 

182 

2889 

TEKGAAAGTAGSCSRRQHASPQSYGSPASWSFAPLSAAPSPSSSRSS 
FSFS AGTAVPS S AS AS LSQPEPRKLLVPPTLLHAQPHHLLLPAAAAA 
ASANAKSRRPKEKREKERRRHGLGGAREAGGASREENGEVKPLPRAN 
. DKTRSFDDFSPDQAAAECLSIRKRRILKGSCQTGYLRKALFSDTVTL 
GI KAS TYGFWGGM I QS I VKGLAETFNFW IH I WELNDLQY I SAL FACS 
EEMPDYHLEKQLADK IKDKI KERDKEKEREKKKHKVMNE I KKENGEV 
KILLKSGKEKPKTNIEDLQIKKVKKKKKKKHKENEKRKRPKMYSKS I 
QTI CSGLLTDVEDQAAKGF\ LNDNI KDYVGKNLDTKNYDS KI PENS E 
FPFVSLKEPRVQNNLKRLDT\LEFKQLIHIEHQPNGGASVIH\AYSN 
ELSHLS PM/ EGWER FAEE FVG/ L * VF S ENENS AAFYVMG I VHGJ^AT Y 
LPDFLDYFS FNFPNS PVKME I LGKKD I ETTTMSNFHAQVKRTYSHGT 
YRAGPMRQISLVGAVDEEVGDYFPEFLDMLEESPFLKCTLPWGTLSS 
LKLQSRKDSDDGPIMWVRPGEQMIPVADMPKSPFKRKRTTNEIKNLQ 

VT.T5D r rOT?D'D17MT T?Pr>P -m tlittl mtmr lj-i wi l ■ l _r_i _ _ _ _ _ __ _ _____ _ 

X LiVK 1 bhFKhMLt EDRTRAHADHIGQGFERQTTAAVGVLKAVHCGEW 
PDQPRITKDVICFHAEDFLEWQRMQLDLHEPPLSQCVQWVDDAKLN 
QLRREGIRYARIQLYDNDIYFIPRNVVHQFKTVSAVCSLAWHIRLKL 

YTJSEEDTSOTJTATTTRTf2T'QQ'nC r PCC.'\7T nr>I7Tn\TMTr>mrnvik <-»t noTrn 

inj&Duiijvw i/iinc iksL £>£>ut> 1 b 0 V LU rn I DNM I CAVS KASLDS VF 
SDKLHS KYELQQI KHEPI ASVR I KEEPVNVN I PEKTTALNNMDG KNV 
KAKLDHVOFAEFKTDMTJQ If PPMQWirnT ttptpt r»Dr»KTT or YmTDouccn 
HSNQDKKDDDILC 

2199 

A 

2 

122 

LEYCLCWILFAGFMSFSGLCTPSFCWSIFSNSFPKKGTWR 

2200 

A 

1130 

1437 

LSRIMKAWFPLFFKIPFRDKVFFLSPPAWSAVAQSRLHCSLELVGSI 
NSS I SAS *LGWEY\RCAPQCPG* FL\ FFCRDGALPCCPSWSQVICLP 
*PPKALGLQA 

2201 

C 

279 

449 

MHPVSCWVIMSCRPLCTRLILSDSPPGPPSDICVSRKNPSNYPAMVL 
PQCFTICII* 

2202 

A 

1 

1173 

MSELRLCCDLLVQQVDKTKEVTTTGVSNSEEGIDVGTLLKSTCNTFL 
is. 1 Jj-b£.CMQXANAAFTSELLYHTPPGS PQLAMLKSSKMKHP 1 1PIHNS 
LERQTELSTCENGSLNMEINGEEEILMKNKNSLYLKSAEIDCSISSE 
ENTDDNITVQGEIMKEDRMENLKNHDNNLSQSGSDSSCSPECLWEEG 
KEVI PTFFSTMNTS FSD IELLEDSG I PTEAFLASCCAWPVLDKLGP 
TVFAPVKIWLVENIKKSWQKYITNKEEFTTLQK^ 

N SATEALLWLKRGLKFLKGFLTEVKNGEKD I QTALNNAYGKTLRQHH 
GWWRGVFALALRATPS YEDFVAALTVKEGDHRKEAFS IGMQRDLSL 
YLPAMKKQMAILDAL 

2203 

A 

1 

2112 

MGVELFTPASVMGVKRCCCSIPGQLSGNGEPRFSEIDSSVSVRGGWQ 
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PRWFLLCGG I LS YYDS PEDAWKGCKGS I QMAVCE IQVHS VDNTRMDL 
1 1 PGEQYFYLKARSVAERQRWLVALGS AKACLTDSRTQKDKEFAENT 
ENLKTKMSELRLCCDLLVQQVNKTKEVTTTGVSNSEEGIDVGTLLKS 
TCNTFLKTLEECMQI ANAAFTS ELLYRTPPGS PQLAMLKS SKKVNQK 
YITNKEEFTTLQKIVLHEVEADVAQVRNSATEALLWLKRGLKFLKGF 
LTEVKNGEKDIQTALSFSSN/SCT/CIWRLGPGDVLTL*RKRRL*RA 
NAAL * LQ * SQPVSAFS SLSSP/ CGT I FNGKEKGGCKFD I FVLMFRLI 

AFISRTDTLFEMREDNAYGKTLRQHHGWWRGVFALALRAAPSYEDF 
VAALTVKEGDHQKEAF S I GMQRDLS L YL PAMEKQLA I LDTL YEVHGI. 
ESDEVLYPLGCLDCCPEGEQSWFQKKNVFGCLSQLPVGRGVQLPCGH 
EG PAKENRVWLCM PLAI TPES AEAQNRDNHLWS S WRVDGELMNDDG 
LTSAl^TQITFGMICCVNALSSENKVVASLLTLVRFTAAPSLPTLHC 
VSRHLQVEDPHLVMSVCECGTTISLPEPRSQLrQCSTAVSQLEDSRRM 
AEH 

2204 

A 

1 

348 

GMAEEKTVRDKEYKALQIKLERLEKLCRAL^ 

VSIKAAIKAANRDI^TPVMQPCTALDSHKEIjNTSSKRALGAHLEAE 
KSQRSAVHKPPSTGSAPANRVG 

2205 

A 

62 

398 

KTVRDKEYKALRIKLERLEKLCRAL^^ 

AIKAANRDl^TPVMQPCTAIiDSHKEI^SSKRALRAHLEAEPKSQRS 
AVQKARSTGSAPVIESVD 

2206 

A 

595 

712 

NIiNVQLYSQMTKKIKKLEKETI IWRTKWENNNKALSANG 1 

2207 

A 

1 

1621 

ARGGAEEATEAGRGGRRRS PRQKFE I GTMEEAGI CGLGVKADMLCNS 
QSNDILQHQGSNCGGTSNKHSLEEDEGSDFITEITONIiVSPAYCTQES 
REEIPGGEARTDPPDGQQDSECNRNKSKTLGKEVLLLMQALNTLSTP 
EEKLAALCKKYADLLEESRSVQKQMKILQKKQAQIVKEKVHLQSEHS 
r\jTi -L o ivi_i o ju rv Cj i_i ^rvjux in. x Xjj\i^iij>ii^yy/\ivlii5iiJCil\.r\.il^ V-L 
TLNEIQAQLEQHDIHNAKLRQENIELGEKLKKLIEQYALREEHIDKV 
FKHKELQQQLVDAKLQQTTQLIKEADDKHHFLERVFIKRSDRIEAQI 
RT\ KXOOEVOLKOOLSIjYMnKFPPFnTTMaTf Qlsn7T.T?T^l?POPMT?TO^T 

\iu\^yu v yunyyu jLii riL/i\r dor y 1 X I'LrtJVOIM CiLjc X X P r\^\^Eil T lCtt\X*l X 

KKIKKLEKETI IWRTKWENNNKALLQMAERENSP* *RVQGPS\KKLE 
RJLiEKLCRALQTERNELNEKVEVIjKEQVS I KAAI KAANRDLATPVMQP 
CTALDSHKELOTSSK^ALGAHLEAEPKSQRSAVQKPPVPRLCSGHRV 
RLTKMRCDHC I ER YI LC I TFSVS S 

2208 

A 

132 

509 

NMVQTTCGPGLMSTPCTG PAQARR * K/ WLWECLS LTGMGPGEGRGQ 
GAAEAE PN * VLGTPAGGSGPC PPG I QQTPP S PHQPHSQVLS SLS PTP 
RICPGFLAYLLSLFTPREELQLTQNDTFLAPRV 

2209 

A 

214 

427 

QLVQACDPGCIAHI^QPRGLRHAGLPQRKEGQRWLIjHARSRLKGGAHP 
VEGRVQV * HHLQVEEVR I LPAVG WGRRPLPVTEGLVEVPJLSDS W/ SQ 
ACDPGWIAH\YSHVGCGMLGFPSEKRVNAGFYTPVAV*RAGLTL*RA 
VSKF * TASHLAQCGTPVGF I 

2210 

A 

155 

384 

SPGPALFSQPLGSCSAKAFPAMRPVSVWQWSPWVLLLCDIjCSSCLGS 
PSPSTAPEKKAGTQGLRFRLAGFPRKPYK 

2211 

A 

177 

445 

SPGPALFSQPIXSSCSAKAFPAl^PVSWQWSPWGLLLCLLCSSCLGS 
PSPSTGPEKKAGSQGLRFRLAGFPRKPYEGRVEIQRAGNHSR 

2212 

A 

157 

417 

S PGPALFSQP VG S CS AKAFPAMRPVS VWQWS PWGLLLCLIjCS S CLGS 
PSPSTGHEKKAGSQGLRFRLAGFPRKPYEGRVEIQRAGNV 

2213 

B 

301 

488 

MRPVRVWQGSPWGLLLCLLCS SSLGS PS PSTGPEKKAGSQGLRFRIA 
GFPRKPYEGRGRS ASX * 

2214 

A 

2 

476 

SFPGSTISSDSNVIEV\EHHLQVY*VRIRPAAGWGGRPLPVTEGIiVE 
VRLPDGWSQVCDKDWSAHNSHWCGMLGFPSEKRVNAAFYRLLAQRQ 
QHSFGLHGVACVGTEAHLSLCSLEFYRANDTARCPGGGPALVNCGPG 
PVYAASSGQKKQQQSKP 

2215 

A 

3 

1018 

PGRRFRERPQQPRGLRDAGLPQRKEGQRGLLQARVRIKGGALHGE\G 
RVVVPMASTWGAVCGREWDLHTASVVCREVGFGSARQPVIGARMGQG 
MGAIHLSETOCSGQE\GSLWKCPHKNITAEDCSHSQDAGVRCNX»PYT 
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GAETRVSQS SLP WLRS LS S YHRAWS PGLRGNVPQP IAPRPCPPAAPT 
* P ITC * P I MCLSHASATS VRJDSHYLAPKLS PALPFCVT\ KIRLRGGR 
SQHEGRVE VQ I GG PGPLRWGL I CGDDWGTLEAMVACRQLGLG YANHG 
LQETW YWD SGN I TGWM S GVRCTGTELS LDQCAHHGTH I TCKRTGTR 
FTAGVICSESE 

2216 

A 

1 

1389 

MGQEPRDLLLALRPQEVDGWMAAIHVHPRTQADGLGIFPITVAESRE 
QHEFLKCS SMVFSRRWMRKI F I FP I PMLPTKGLSQKQKNNP VPFLRH 
J.l~JNrl I Jj I vxjx-'or'.r L.<oJrJ. IIYXjIW JftJr/vfc'KV VK^FHi\i<JLxjAQRQQHSFGL 
HGVACVGTEAHLSLCSLEFYR7\NDTARCPGGGPAWSCVPGPVYAAS 
SGQKKQQQSKPQGEARVRLKGGAHPGEG/RRVEVLKASTWGTVCDRK 
WDLHAASWCRELGFGSAREALSGARMGQGMGAIHLSEVRCSGQELS 
IiWKCPHKNITAEDCSHSQDAGVRCNLPYTGAETRRFHSVSLQIRLSG 
GRS QHEGRVEVQI GG PG PLRWGL I CGDDWGTLEAMVACRQLGLGYAN 
HGLQETVTYWDSGNITEVVMSGVRCTGTELSLDQCAHHGTHITCKRTG 
TRFTAGVICSETASDLWLHSSLVQETAYIEDRPLHMLDWA 

2217 

A 

154 

2576 

SPGPALFSQPLGSCSAKAFPAMRPVRWQWSPWGLLLCLLCRSCLGS 
P S P S TG PE KKAG S QGLRFRLTG FPRKP YKGRVE I QRAGEWGT I CDDD 
FTLQAAHILCRELGFTEATGWTHSAKYGPGTGRIWUDNLSCSGTEQS 
VTECASRGWGNSDCTHDEDAGVICKDQRLPGFSDSNVIEVEHHIiQVE 
EVR I RPAVG WGRRPLPVTEGLVEVRLPDGWS QVCDKGWS AHNSHWC 
GMLGFPSEKRVNAAFYRKLRKRAAKVSARHPKPLGRLLAQRQQHSFG 
LHGVACVGTEAHLSLCSLEFYR7UTOTARCPGGGPAVVSCVPGPVYAA 
SSGQKKQQQSK^QGEARVRLKGGAHPGEGRVEVLKASTWGTVCDRKW 
DLHAASWCRELGFGSAREALSGARMGQGMGAIHLSEVRCSGQELSL 
WKCPHKNITAEDCSHSQDAGVRCNLPYTGAETRIRLSGGRSQHEGRV 

WDSGNITEWMSGVRCTGTELSLDQCAHHGTHITCKRTGTRFTAGVI 
CSETASDLLLHSALVQETAYIEDRDXliHMIiYCAAEENCDASSARSAN 

wpvhhppt.T.p P*^ QOTVTMT.r^T? JiT^T7PoV7vr , T?ucij^rt»rtri7rujr , wvTrcMr\Ti7 
x vairivivLjxxxvr oovinrtiionAUf KJrlVMJjKilo w V Wir£«v^xloxlx n&nu Xr 

THYDILTPNGTKVAEGHKASFCLEDTECQEDVSKRYECANFGEQGIT 

VGCWDLYRHDIDCQWIDL\TDVKPGNYILQWITPHFQVAESGFTSN 

AMKCNCKYDGHRIWVHH\CHIGDAFPE*TRRRGGLILSLPQPDIFLS 

QRAFLEPRWHL 

2218 

A 

491 

813 

VYNLYLAPTAGWFSEVEDRWVFGEHGGNLGALTNLMGAWFIiFQPAEV 
GVLVGKDRSSFYVNGLTLGGQKCSVIRDSLLQDGEFSMDLRTKSTGG 
APTFNVTVTKTDK 

2219 

A 

2 

948 

ifkiafakfgwagewgregmgsrkwsaraaaqraeprsgrqrvpra 
lrlpparqprqlepsprppaaapesspsssamagwnXayidni^madg 

T\ C*VTfiAT VRYODSRPPFWHAVPnif /TPy\7MTTDaPVnrPWT.2iKTPQ 
x \^ v lunj. v x\.x v^Jiii f X' nnnv xrUIV/ X Xvir VIM X. X Jr>LCi VVaV« JrrVUMXVXIvO 

S FIiR*MGLTLWGGQK\CSVNRGLHCLQEWGI IACDLRYQEPPGGAPH 
LS I VTCHPRLNKDAKSWLMGQRKVSHGGLT\NKKCY\ EMASHLSGFP 
STDLRLVPFPLSPFPTAFAPLSLPIHTQTNFYFLGPLPHTPYCCQNH 
MGLGGQGLMGQTPSPTHIPSRVWLEKLLFFGVFFFSE 

2220 

A 

3 

417 

GRAGSRKWLLLTGSLASTS PSI.Tj^CiOnPWAPT.nR AMKPP<WRq«5KTLF 
\GFPEEATPSSRPNRMSSNIFGPTEEPQNILKRTNPPGV*GSGIFDE 
STPVQTRQHLNPPG\ GKTSDI FGS PWATSRlxAHPNKPKDHVFLM 

2221 

A 

259 

941 

PVSWSLNSCRFFFFF*DQSLPSW/QAGSGQ*RNLDSL\QPLASRFK 
*FSSSFJJL\SSW\DYFJIMATMARLIFI^ 

S SDPPTSAFPKWLGLQGVKPNTRAVGFN* * LGYYS 1 1 L YHSNS PGTD 
LVFILFIYLFx^L.FLRQEQNSAAQARVQ*WHNLGSLQSPPPGV\H*F 
LCLS LPS S WDYRCAPPHQANFC IF * \ RDGVFPCWPGWSRTPDLR 

2222 

A 

343 

600 

MSPLSQFVDGTLLKIQC YYGFTEFHS VA\ RLEC IGMI SAHCNLCLPG 
SSNSPASAS* IAGIIAP/ CHHAPLIFVFL*RQGFTRIYFFK 

2223 

A 

142 

583 

ILTMREIVHIQAGQCGNQIGAKFWEVISDEHGIDEQMLI^QNKNSSY 
FVEWIPNNVKTAVCDIPPRGLKMAVTFIGNSTAIQELFKRISEQFTA 
MFRRKAFLHW YTG EGMDEME FTEAE SNMNDLVS E YQQ YQDATAKE EE 
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DFGEEA 

2224 

A 

163 

1603 

IQAGQCGNQIGAKFWEV I \ S DEHGID PTGT\ Y\ HGDSDLQ\ LDRI S \ 
VYYNEATG \ GKYVP \ RA I L \ VDLE PGTMDS V\ R SG PFGQ I FRPDNFV 
FG QVWGQ VTTW AK\ GHY TEG \ AE L VD S \ VLD VVRKEAE \ S CDCLQG F 
QLTH\SLG\GGTGSGMGTLLYQQGFEKEYPDRIMNT\FSWP\SPKC 
LDTWEPYNATLSVHQLVENTDETYC IDNEALYD\ ICFRTLKLTTPT 
YGDLNHLVSATIE/ CWTTCLRFP\GQLN\ADLRK\LAVNMVPFPRIi 
\HFFMPGFAPL\TSRG\SQQ\YRALTV\PELTPARSFDAKN\MMAAC 
\DPRQGPINFNRGLLSSRGSGCPMKEGRMKQML*RCKNK\NSSYFCG 
NES PNNVK\ TAVCD I PTS WAS KMAVTF \ I GQ * HS P S QELFK\ R I \ S E 
QFTA\ MF\ RRKAF\ LHW \ YTGEGM\ DEMEFTEA\ ESNMNDLRL * S I K 
QLPRMPTQEEEEDFGEEARKRRPKGRAPNQLKASQFP 

2225 

A 

234 

1374 

KSGGLRRRQRPGRSAAVGEEELPPGMEKFKAAMLLGSVGDALGYRNV 
CKENSTVGMKIQEELQPV\GGLGHLV^ 

LTTD YWCLDDL YREMVRC YVE I VE KL PERR PDP AT I EGC AQLKPNNY 

T T Z\ tjJWT*L5TTXTTT V^Ol^T?/* 1 TV A'"PV"TVMr" , TOT tSVtJVnPDT r.»ivr Tt?tTC^n?r'PDM 

xjl/AWrl 1 ±rr JWJC/iSAjovjr LjAAI J\ATOl.lUJbKx WJvJrliKL»fc»l LilliVo VJiv-taKM 
THNHPTGFLGSLCTALFVSFAAQGKPLVQWGRDMLRAVPLAEKYCRK 
TI RHTAEYQEHWFYFEAKWQFYXiEERKI S KDSKNKAI FPDNYDAEER 
E * TYRKWS SEGRGGRRGHDAPMIAYDALLAAGNS WTELCHRAMFHGG 
uca ATT! \ T T A A C*T • Ff3T .T . YrJT .TIT ATD'KTjT . C fMYT .PnVT?VT ,T?r»T .naiT .VOT. 

DO/inlv? \ X i,nn\« Jjr VJUU 1 UiJvJu V Jr IvVjlJOy l/ij£(lyj\Jj J\Jj£iUlJV3/Uiu X KiJ 

STEKK 

2226 

A, 

J- -3 -7 

**fifi 
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REVTEL I KVDKVKIVACDGKNTG\ SNTTE 

«o ^ / 

A, 

i fin 

CDO 
300 

H M Zl VOT . "CT? TT.DT.'PTCTrriPCTA Tl*T THCAH TTDDTPDUVUVT rvr nOTi\ 

INAALA\QRVR1TO\WFRGSPKYGTRFC\DIWGDFWYRITO\VEFQRG 
*PEPIKRG*K*KLVA\CDG\KNTG1jPIPTEWNRKKYDFFYTHLLLFI 
AFEREA 

2228 

A 

918 

1085 

QEKLFI*VLCFNFFSFFFFFFCFL/VD*F*FLFGKGAV*TLFVFSSA 
SHRYRQGKTR 

O T O Q 

A 

14 5 

3 65 

KDG r A r» QPJUE EATIj S E L» KTVXi KS F L S QG Q VLKIjEAKxT)PSI1jGGMIV 
R IGEKYVDMS VKTKI QKLGRAMRE IV 

2230 

A 

2 

750 

ILGFT)LQPPGRRWAAPAV\SGIiS\RKVRCFSTG*\VRPFA\KLVGPS 
\ vyviulbbKlAl \AtiX bAAi>KyX>lK\ljE \QxjEKEL1jRVAQNPRRNPK. 
VACFLFLNPLCBSRSIKSEKALN\DITSKJCRRFSPPSTTQ/liIjNQFC 
LAGKMGPI * ANNPKGW\ SGLFL TMDGVFHSRERVPLHSGPLASSFR 
KESHTLLNLKTV\ LKEFPKVKGQVLELGAKX TDPS I \ LGGMIV\ RIG 

H \ XV X v JUI'lD V i\ J. i\ \ x\^*VXj\j \ivMi ¥ lrClix V 

2231 

A 

100 

470 

ATEPLAPSERREVSIPLLVPAPLPPPPRPPVPSRTRTRPWRRP*ACF 
PMTLAASCGPTRS PWPSQQP I PQPAPVG / PGNI KTLGDAYEFAVDVR 
DFSPEDI IVTTSNlffllEVRAEKliAADGTVM 

2232 

A 

228 

721 

LWQLHAAPLGAPGLPRQVIRALRGGEVHREGTSAPGRHRRRCGCPVG 
riJ&U V JUHo IjU Pi_ v A i rv I L> V A IblD V AMKTGN I KTIjGDAYE FAVDVRDr 
SPEDI I VTTSNNHIEVRA/ REA/ ERPDGTVMNTFAHKCQIiPEDVDPT 
S VTS AT.REDGS LT T RAVTRHPMTFWVfi 

2233 

A 

3 

605 

FHLPSGEKIPFLFLLLLLFTPPPRPPVLSRPRTRPWRRP*ACFPMTL 
AAICGPTRSPWPSQPAPV/VAGNIKTL*DAYEFWDVRDFSPEDIIV 
TTSNiraiEVRPEIC^AADGTVMNTFAHKCQLPEDVD 
RIiTIRARGHPHTEH 

2234 

A 

229 

510 

KGI IWYINMEEQFVI * IHFL* TRSQFYFVNITFKYI YTHIYVQLYIH 
VH/MICKFIFIYNIFI*IDICRQVIDYRDPRKQLNNCSKFSSLTAGV 

2235 

A 

203 

334 

S RRDHS F I L * I LHL S I Y IHT YMYN Y I YMC I YM YQRKQ YG KTNP 

2236 

A 

1023 

1470 

TPKTKHNSNPFSVPHAHKPACSYFFFCLCVFMGSKLNLl^ 

IOjYLLPJWIFF*FIFTQLFILIYFFYTVx J NTQTYF\LFMIFLSCSHG 

ICFPILFSLISITCWDLTONNVRGRKYFAO^/D 

YASKKKKKA 
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C 
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MSQNTGNI I S I P YQDGLPVI SMI VLKRYMKEQWPS FRLKLKRLNXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXX 

2238 

A 

3 

449 

LLRRVSVTAVAALSGRPLGTRLGFGGFLTRGFP/KAAAPVRHSGDHG 
KKIj FV 1 R P S R F YDRR r Li KLIjR F Y I ALTG I P VA I F I TLVNVFIGQABLj 
AE I PEGYVPEHWE YY\ RVKELEVRKLMHVRGDGPWYYY\ ETIDKELI 

2239 

A 

3 

635 

HASAHASAMSLLRRVSVTAVAALSGRPLGTRLGFGGFLT\RGFPKAA 
APVRHSGDHGKRLFVIRPSRFYDRRF\LKLLRIH\LALTGLPVTIFI 
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GPEKDIMERNNGPSFQIGSLKKAGIYGVKGSWEVRKI*WHVEEGDG\ 
PVATLLWRPFDQGTLLDHSSRKPTPWTN 

2240 

A 

811 

1005 

SNFTGKIHK*LTSFKIVNLFIYL/FFETGFHSIiTRLECSGMILAHCN 
LRFPESRKLFFFGAIDSF 

2241 

C 

256 

540 

MS PAKATEML I FGKKLTAGEACAQGLVTEVFPDSTFQKEVWTRLKAF 

CKLSPNCLEIFKEVXRKRERXKLXAVNAEECNVLQGRWLSDECTNAV 
* 

2242 

A 

602 

2067 

S S S SGLALGMQRPLLARLRGTPGRALSAPAATG FAELRAAQGMAMAY 
LAWRLARRSCPSSLQVTSFPVVQLHMNRTAMRASQKDFENSMNQVKIj 
\ LKKDPGNE VKLKL YAL YKQATEGTCKMP\ KPG\ VFDLINKAK\ WDA 
WNALGSLPQGSCPG * NYVDLVS S \ LSPS IGNPPS \QVEPGTERDNQL 
GFETLWTS \ EDG ITKI MFNRP \ KKKNAINTEMYHE IMRALKAAS KD 
DSI X I VXjTCjWCjJDx YSSGNDLiTNFiUiPP/ GVGVEEKAN IN A \ VX»LiR \ 
EFVG\CFI\DFPKPLDCSGSMGPSCGASPVTLLGAYSMPVLLHLDRG 

REEACASRDLFTEGFPWIALFRKKVWDQG*RAFCKSFPPQMALBEFS 

KIKNCDDHLQQHQSMSQGRMCCSLCLSSTGTK 

2243 

C 

2163 

2402 

MSQAAASHKSKLTPKNQRAPFTLLRKSQCQDPSRGESEEGWWPIQRA 
CRSMETFLPQMFQSWHQIYKNVSRDQFFTLSK* 

2244 

B 

125 

304 

E I PLPTGCLDELLKAAECPAAGSVDLGVCLDTS S SGLDLPMKWDMF 
RSCLPACAVNFK* 

2245 

A 

3 

181 

DQACGTVGDDGRALGECDRA/ CPRAANS SGGPGWTPWPSGYPPALHT 
ARPKALCGQPRAF 

2246 

A 

13 

1188 

LYGKADKTFIGLECLENVKFQIRNHKLRSNKTSAI*ACGFPLSASSS 
GS * ENHHKSDMKI LAPKI AFSLCK* KRVYQV* CS PQRGHTRFLTKPS 
PLQLPHLWTHSVPHKAKPAAAATS/DGHTRFLTKP/MPAAAATSLDT 
LGSSQSQAHCSCHIPGHTRFLTOTSPLQLPHRWTHSVLHKAKPAAAA 
TS VDTLDS SQSQAHCS / WPHPWTHS VxjHKAKPAAAATSLDTLGSSQS 
QARCSCHIPGHTRFLTKPSPLQLPHPWTHSVPHKAKPAAAATSVDTL 
G S SQS QARCS CH IGGHTRFFTKPS PLQLPH/ P / DGHTRFLTKPS PLQ 
LPHPWTHSVLHKAKPAAAATSLDTLGSSQSQARCSCHIPGHTRFLTK 
P S PLQLPHPWTHS VPHKAKP AAAATS VDTLGS SQSQARCS CH I GGHT 
RFFTKPSPLQLPHRWTHS I PHKAKPTAAATSl^TLGS SQS Q7VRCSCH 

NAG 

2247 

A 

1088 

1858 

LRLAGTWGVKNRRRQEAFPSLQRRNQGQPKLPGSQNQFFYQAVPLPS 
FQLLATGRRCSKGFALRLPRRSCGKDSRGPGVI J IPLAPGVMCP*PKG 
AQSAPSSHPIRAAATPAGSSGNWFSLPAGSGKF^EKHLAKLLKGELT 
SLPVPTPI PRSPPG* POVPGPNNKrPFP < ?P*T.PTTT.'nT.T?PF , TTPIjTWT* 
DPS /HRDSSRPSSA/ RASHLTVMLSGNFPVCPTFSHTLVR/ WEHLPA 
SNVLYVPL * APS SRPLANTVPP I CG 

2248 

A 

1100 

2076 

YASRGMGREEQKKRGKRFPSLPREEIKANQKLPGSQNQFFLPGSTTA 
VI S ASGHWEVLLERGLP * DSKKKLREGQQGS WGF I ASGPGVMCP * PK 
GAQSAPSSHPMGAAATPAGSSGTGLACPQDQESLRKASAVELPVPLS 
PQRPGGFLHVS/ASSCKHLEVHGTHRQTAPEHVHHLSWGRARPTWRT 
CADRHPGPPTLEAGGSPRPSSPGDADLDVLEMLLVDEAAVILPQRPF 
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ACPA/ WCC* PAGLAARGDASPARSQNPAP * RPTALCLGPQVLR* RSR 
KKPGP * MES \ GAAQLHEEP VLQVQRCLFKERCRLVIDGHS PACS S SH 

2249 

A 

761 

1178 

LS TPLTS WCWMRRTTPGS PTTRCS CS S A\ PGTQPLGGGHQ * RGCHQC 
VAS S PR * AWPAS G P AKLO * LOOPKTOORA P A TRPHVP P\/np Q zv n * t>u 

SRFQSSETSRPADGSEQQPARASGPGARRPPHQPRQVINCWLRRC 

2250 

A 

135 

408 

PH S R FQSSETSRPADGSEQQ PARAS GPGARRPPHOPRFHPPPSA /PA 
ISIECPA*AVMRSGSEPRARQEYSLGLWTPSQFLQGILYLARSPT 

2251 

A 

123 

578 

TT1MN I FKWAKY FKRH FS RD I *MANEHMKRCLTSLTIMEMQIKTTIRC 
HLT / PRM P 1 1 KKI I KC * * G WRETGTL I / YAWWECK I VQ / P AFR / TVW 

QFLKRLSQHPTIPFP/GMCPRETKTHIHSKTY/TQMFMAALFIEARK 
WKQFKCLSTNEWTNWRP 

2252 

A 

623 

724 

PYGTWMKLETIILSKLTQEQKTKHCMFIjLISGS 

2253 

A 

1061 

1156 

GTWMKLEAIILIKLTQEQKTKHHMFSLISGS 

2254 

A 

365 

640 

RPFSMLAYLILNSLVLQMWKPHFREAACSNHIDGRW/VR*AWFS/YT 
VRSWAAIKKNEFMSFAGTWLKLEAI ILSKLTOEOKTKX HHMF«?I.TRfi 

s 

2255 

A 

209 

532 

RVFLSYQLTLFSSQDDIIDDVESFVAAAEILKERGAYKIYVMATHGI 
LSAFIAPRLIEESSVDEVVVTNWPHEVQKLQCPKIKTVDISIilLSEA 
IRRIHNGESMAYL 

2256 

A 

1 

308 

AJCRAQSYAERLRLGLAVIHGEAQCTELDMDDGRHSPPM\VKNATVHP 
GLELPLMMAKEKPPITVVGDVGGRIAIIVVWLYVIHVQRWHSWLLVS 
RLSTLGLYF 

2257 

A 

2 

1161 

VRKGTDLGALPVPPPAMNAARTGYRVFL\ANSTAV\CSELAKRITER 
ijVviUSijoJvo v v xuci iWLyfc. IK Vh.XKESVRGQDI Fix QTI PRD VNTA VME 
LL IMAYALKTACARNI IG\ VI PYI P\ YSKQSKMRKRGS I VCKLLASM 
LATAGLTHIITM\DLHQKEIQGFFSFPV\DNLRSLHLFLAFSIIQGR 

SCTKLi\DMDDGRHSPPMVl<TJATV\ HP^T.PT.dt MMavT?\ ifDDTnA/m 

VG\GRIAI ivddi id\dvesfvaaaeilkergayki YVMATHGILSA 

EALRLIEESSVDEVWTNTVPXHEVQKLQCPKIKTVDISIjILSPAIX 

rrihngesmaylfrnitvdd 

2258 

B 

80 

402 

XVPTVDTY1X?RGDSVVYGLRSKSKKFRRPDIQYPDATDEDITSHMES 
E ELNGAYKAI PVAQDLNAP SDWDS RG KDS YETSQLDDQS AETHSHKQ 
SRLYKRKAYDESH* 

2259 

C 

272 

409 

MXXXXXXXXXXXXXXXXXXXXXXXXXKJOCXXKKXXXQ 

2260 

A 

2 

628 

EHSIVGTRLVSGQLQPSQPNADQGKLTTMRIAVICFCLIjGITCAIPV 
KQADSGSSEEKQLYNKYPDAVATWI^PDPSQKQNLLAPQTLPSKSNE 
SH\DHM\DDM\DDED\DDGPCGDSQDSIDSNDSDDVDDTDDSHQSDE 
SHHSDESDELVTDFPTDLPATEVFTPVVPTVDTYDGRGDSVVYGIiRS 
KSKKFRRPDIQVNPLTDTPDGSD 

2261 

r A 

67 

1276 

SHA^GRRQSTASSGPDSVSGQLQPSQPNADQGKLTTMRIAVICFCLIi 
QN\AVSSEETlTOFKQETLPSKSNESHDHjViDDiyro 

DS\ITDSDDVDDT\DDSHQSDE\SHHS\DESDELVT\DFP\TALPATE 
VFTP\VVPTVDTYDGRGD\SVVYGLRSKSK\KFRRPDIKYPDATDED 
ITLHMESEELNGAYKAIP\VAQDLNAPSDWDSRG\KDSYETSQLDD\ 
OS AETHR\ HOO\ SPIYYICP KfiMnPQMPwenrnw* a"DTT?r>Trv7CT5TT?TJO\ 

HEFS SPWRFCLFVAPKSREEDNTPLEFRYS \QELDSASSGGQLKGGK 
NTISPLCHLVQKKKGFIAKMKK^EMLLSPVYWLKCVSI 

2262 

A 

1 

651 

MTAFNSGKVD I VA I ND P F I DLNYMVYM FL YDS THGKFHGTVKAENGT 
LVINGNPIT\ IFQDQDPSKIKWAP/LAJCVira 
TATQTINGPSG\NCHWLAAGL^ 
LTGMAFHVPTA2JVSVAI)LTCRL^ 

I VLKDHS VKL I S W YDNEFG YSNR WHLMAHNAS KE 1 

2263 

A 

2 

5264 

TRSAPRAASWLEDPREVRSVCLSATFFTLSLQKPRLLLFSPSVVHLG 
VPLSVGVQLQDVPRGQVVKGSVFLPJJPSRxWVPCSPKVDFTLSSERD 
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FALLSLQVPLKDAKSCGLHQLLRGPEVQLVAHSPWLKDSLSRTTNIQ 
G INLLFS SRRGHLFLQTDQP I YNPGQRVR YRVFALDQKMRPSTDTIT 
VMVENSHGLRVRKKEVYMPS S I FQDDFVI PD I SEPGTWKI SARFSDG 
LESNSSTQFEVKKYVLPNFEVKITPGKPYIIiTVPGHLDEMQLDIQAR 
YIYGKPVQGVAYVRFGLLDEDGKKTFFRGLESQTKLVNGQSHISLSK 
AEFQDALEKLNMG I TDLQGLRIiYVAAAI I ES PGGEMEEAELTS WYFV 
SSPFSLDLSKTKRHLVPGAPFIiLQALVREMSGSPASGIPVKVSATVS 
SPGSVPEAQDIQQNTDGSGQVSIPIIIPQTISELQLSVSAGSPHPAI 
ARLTVAAPPSGGPGFLSIERPDSRPPRVGDTLNLNLRAVGSGATFSH 
YYYMILSRGQIVFMNREPKRTLTSVSVFVDHHIiAPSFYFVAFYYHGD 
HPVANSLRVDVQAGACEGKLELSVDGAKQYRNGESVKLHLETDSUOi 
VALGALDTALYAAGSKSHKPLNMGKVFEAMNSYDLGCGPGGGDSALQ 
VFQAAGLAFSDGDQWTLSRKRLSCPKEKTTRKKRNVNFQKAINEKLG 
QYASPTAKRCCQDGVTRLPMMRSCEQRAARVQQPDCREPFLSCCQFA 
ES LRKKSRDKGQAGLQRALE I LQEEDL I DEDD I PVRS FFPENWLWRV 
ETVDRFQUjTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREF 
HLHLRLPMSVRRFEQLELRPVLYNYLDKNLTVSVHVSPVEGLCLAGG 
GGLAQQVLVPAGSARPVAFSWPTAAAAVSLKWARGSFEFPVGDAV 
SKVLQIEKEGAIHREELVYELNPLDHRGRTLEIPGNSDPNMIPDGDF 
NSYVRVTASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYIiAPT 
LAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAA 
WLSRDSSTWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADG 
SFQDPCPVLDRSMQGGLVGNDETVALTAFVTIALHHGLAVFQDEGAE 
PLKQRVEAS I SKANSFLGEKASAGLLGAHAAAITAYALSIiTKAPVDL 
LGVAHNNLMAMAQETGDNLYWGSVTGSQSNAVSPTPAPRNPSDPMPQ 
APALW I ETTAYALLHLLLHEGKAEMADQAS AWLTRQGS FQGGFRSTQ 
DTVIALDALSAYWIASHTTEERGLNVTLSSTGRNGFKSHALQLNNRQ 
I RGLEEELQFS LGS KINVKVGGNS KGTLKVTjRTYNVLDMKNTTCQDL 
Q I EVTVKGHVE YTME ANEDYE YDE LPAKDD PDAPLQP VTPLQLFEGR 
RKRRRREAPKVVEEQESRVHYTVCIWRNGKVGLSGMAIADVTLLSGF 
HALRADLEKLTSLSDRYVSHFETEGPHVLLYFDSVPTSRECVGFEAV 
QEVPVGLVQPASATLYDYYNPERRCSVFYGAPSKSRLLATLCSAEVC 
QCAEGKCPRQRRAIiERGLQDEDGYRMKFACYYPRVEYGFQVKVLRED 
SRAAFRLFETKITQVLHFTKDVKAAANQMRNFLVRASCRLRLEPGKE 
YLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQ 
LNDFLQEYGTQGCQV 

2264 

A 

56 

1357 

IRLSVCSSCSTVSRIFFCVASRATSLRTPMGKVKVGVNGFGRIGRLV 
TRAAFNSGK^IVAINDPFIDLNYMVYMFQYDSTHGKFHGTVKAENG 
KXiVINGNP I T I \ FQERD P S KI KWG \ DAG \ AE YWE \ S T\ GVFNQPWK 
KAGAH\LQGGAKR\VIISAPSCLMPPMFVMG\VNH\EKY\DNSIiKII 
SNAFLQP PTCLA\ PLAKG * FHDNFG * S WEGL IDHSSMAI TGNPRKTV 
D\GPSRG\NCGRGWAAGL\SRTSSIjGLYWALPKA\VGKGHP*G*TGK 

egppqgailg\yteaprwvssdcnsgppvfppfdaga\gia\lnd\h 
fvkliswydnefgysnrwdlhgphglqgsktpgttepqargtrgrd 

2265 

A 

34 

476 

lsggprraascassagllvllpfmpm\fiv\ntklhraslipeg\fls 
kl\tqqlaqanrqappn*fafnvfpdqv\ysfggssepcal\cslhs 

IGKIGGAPDPAPYSKLL\CGPAGRSRIiPIQARTRV\YINY\YDMNRG 
QMWAGNNFHLSP 

2266 

A 

63 

470 

QQPPPERPAHQFARPPAPFTMQPASAKWYDRRDYVFIEFCVEDSKDV 
NWFEKSKLTF\SKIiS>LSNDVK*SFTQIiNNFDWLVFSLKLLLIFVS 
LKRLNSFFIILYSCIiGGSDNFKHLNEIDLFHCIDPNVSSLYA 

2267 

A 

2 

854 

RAFCCVTSRRPASPPLVCTHARSFSVXiAPLFIiHFPLLPDRRSRSFRA 
VHFWGPRSSPPHAAVRLREARRGRDRREKAESPTGEKSTSPSSSRRQ 
RGPPTKVRRPPAPFTMQPASAKWY\DRRDYVLKWEFVFLNTLRDV\N 
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VNF\EKSKTYNFSCLREGSDNF*GI*MEI\DLFH\CID\PNDFQGLK 
GTDRS I FMFVLRKRRI WAS SWPKGIiTK\ ERGKGL T WV\ SVt)FN\ NW\ 

KDWEDDSDEDMSNFDRFSE\MMNNMGGD\EVVDYPEVDG\ADDDSQD 
SDDEKMPYLE 

2268 

A 

117 

390 

RRYRPTPGVHRADGLLHVRGREHVLQRRPEDLRCHSHCHPSHPTDRL 
LLLVIWTHLGGIWAGHSPWTVIQTAGRPPRDLSPSARPISSPP 

2269 

A 

66 

354 

LPGLSGRDEASLWMGGGRGLLGRETLGPGGGCSGEGPLCYWPPPGSP 
PAPSLRASLPLEPPRCPLRSCSLPRSACLCSRCSWQPNRVSHFLLWT 

s i 

2270 

A 

148 

398 

RP7VKRG7W4DGRVQLIKALLALPIRPATRRWRNPIPFPETFDGDTDR 
LPEFIVQTASYP\PWARTRSSARPISSPPPETSCVIjA 

2271 

A 

356 

1241 

KVTFLITRLTGPALQVGDPLHQEGEPPPQLITGAFWPEMKRVFGWGG 
GRGL \ LGRETLG \ PG \ GGCSGEGPLC YWPP PG S PPAPS F \ RAS PPLE 
PP\RCPLRSCSLPRSACLCSRNSAPGSCCRPWASLWSEPPPSPSSQP 
AP PMYI WTLS CAP VAQS * APVTHWTDHPLP PLPS PLLPTRLPDDLHH 
FGPQTLRCQ\SHR\HPSHPHRQDCWLL/M*IWT\HLGG\IWAG\HSP 
WT\VIQ\TAGR\PPRDLSPSARPISSPIPETFCAPATWLPGLSCAWQ 
AKFESLI FSLWTKLVFAHKRS I S 

2272 

A 

2 

178 

EKLRKFRFRKETDNAAI I MKVDKDRQMWLEEEFQVFE I RTTDDLTE 
AWLQEKLSFFR 

2273 

C 

88 

228 

MKVDKDRQMXVIiEEEFQNISPEELKMELPERQPRLQRXTTDDVCRE* 

2274 

A 

3 

382 

VDRTIMSDSLVVCEVDPELTEKLRKFRFRKETDN7UVIIMKVDKDRQM 
WLEEEFQ \ FWYS YKYVHDDGRVS YPLCF IFSS PVGCKPEQQMM YA 
GSKTRLVQTAELTKVQTGMWLQSVRERWCGSWV 

2275 

A 

89 

636 

KTAELRKEEACGQNNHALPGSLRQSDSIiWCEVDPELTEKLRKFRFR 
KETDNAA I IMKVDKDPARWWCLEGKNFKNI S PQREL\ KMGFPRRDKP 
R\FWYS\YKYVHD\DG\RVSYP\LCFIFPPRPVGCKPE\QQMMY\A 
GS\KNRIiV\QTAELTKGVPKSRTT*LTSLRPWL\QEKLSFLSLISGA 
GD 

2276 

A 

2 

2453 

EILLCPSLPFNCQYILVLLGSRIRAREPPMAVQAVTQGPPKYTKSVL 
KKGDKTNFPKKGD WHCW YTGTLQDGTVFDTN I QTS AKKKKNAKPLS 
FKVGVGKVIRGHVARKFIiDDEAELSEEDAEYVSSDENDESENEQDSS 
LLDFLNDETQLSQAINAWFACKSPSIICVDDEWSSVCAFFSSGSSD 
LYKTSEWKLSKLPQFKA£HKLLGNIKNVAKTANKDHLVTAYNHLFET 
KRFKGTES I S KVS EQVKNVKLNEDKPKETKS EETLDEF I QLRRELAS 
VNGHSGDDGPPGDDLPSGIEDITDPAKVTSLILKFVSGQIilTEIENM 
RHR 1 1 E IHQEMFNYNEHEVS KRWTFEEG I KRP YFHVKPLE KAQLKNW 
KE YLEFE I ENGTH ERVWXi FERCVI S CAL YEE FW I KYAKYMENHS I E 
GVRHVFSRACTIHLPKKPMVHMLWAAFEEQQGNINEARNILKTFEEC 
VLiGLAMVRLRRVS LERRHGNLEEAEHIiIjOD A T KMAK^TsTNTP S S F YAVTC 
LARHLFKIQKNLPKSRKVIjLEAIERDKENT^ 

NEENIIxNCFDKAVHGSLPIKMRITFSQRKVEFLEDFGSDVNKAVTEV 
REVTRKSVPRNSLESAEYIiKLITGLLNAKDFRDRINGIKQLLSDTEN 
NQDL WGNI VKI FDAFKSRLHDSNS KVNLVALETMHKMI PLLRDHLS 
P I INML I PAI VDNNLNS KNPG I YAAATNWQALSQHVGDTSRMFRRT 
KSYDSLLTTLIGAGIRILFSSCQEETADLLKELSLVEQRKNVGIHVP 
TWNSNKTHFKKS PCMHK 

2277 

A 

1 

1008 

MVDKCRYHORSLHKVSNS E S KI S SGIiLT. <5 RLOR R PBGRRD AELRRAG 
SVKRRQRGKMAAAVPQRAWTVEQLRSEQLPKKDIIKFLQEHGSDSAV 
QQVITNLLIGQIELRSEDSPDILTYSHERKVEKLIVPLGEELAAlQK 
TYIQIPEQDETYLEDSFCVDEEESCKGQSSEEEVCVDFNLiITDDCFA 
NS KKYKTRRAVMLKEMMEQNCAHSKKKLS RI I L PDDS SEE ENNVNDK 
RESNIAWPSTVKKNKQQDHCLNSVPSGSSAQSKVRSTPRVNPLAKQ 
SKQTSLNLKDTISEVSDFKPQNHNEVQSTTPPFTTVDSQKDCRKFPV 
PQKVLSE 

2278 1 

A 

1 

884 

MVDKCRYHQRSLHKVSNSESKISSGLLLSRLQRRPRGRGD AELRRAG 
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S VKRRQRGKMAAAVPQRAWTVEQLRSEQLPKKD 1 1 \ KFLQEHGSDSL 
LAEHKLLG\NIKNVGKTANKDHL\VTA\YNHIiFETKPP*GYLKV*SK 
VSEQVKKCERLMKINPKETKSERDPWIXROV PPTC\ VTifc;pc;*vT>nnifT 

QLFPKRGDV\ VH\ CWY\ TGNTTKMGTVF* YL IFQTSAKKKEKCPSL* 

GFKVRSRPKLSRGW\DEALLTMSKGEKAR\LEIEP\EWAF\GKKGLP 
DAKI P \ PNAKLTF \ EVE \ LVD I D 

2279 

A 

3 

510 

AI RAVAYCADMAQ S MNHTTLHQS R I WHR IG I KKPRSQTDQS LTGVD P 
MFLKNMRFAKKHNNMGLKRMQANNAKAMSARAEAI KALVKPKEAYL * 
IPNGVSRKLDPIAYIAHPKLAKRALARIAKGLKLCRPEALAEARAKD 
QTKVQG VAP A* A\ PAHAPKRTQAPTKASE 

2280 

A 

9 

674 

I ERLQVR I ACS CRDAI PG S SHAS AHAS EAL.P F PT? T /NrnanM a k* Q vktwt 
TNNQ\SRKWP\RNGIKKP\RS\QRYEFS*RGWPPQVSLRN\MRFAQR 
STNKKGP*RRLQANKLPRAMSATCPRLIKAPPLKAPRRFKPQDPPKG 
\VSRKLDR\LA\YIAQPQGLGSVARCP*LPRGPRL\CPPKAKGQGQG 
QAKAKDQTKGFKPRPRVQAPVS VS V\ QAPKRTOAP\ TKASE 

2281 

A 

417 

822 

RENC ILLS TRAQS VFNAQGTALW S FQD PATGEGGAGHGGRLMRPSLL 
PSQGASAC * / PRFGES KLAPVGKFPKMCLLELPFLSHHLSRGCSGAS 
EGQLCLLEVPATARVGVHKPSQSRGLEFSWVLMSDLCSGPPL 

2282 

A 

1 

239 

HYQQQRQQI^CLERHIKPPKELDTASSDEENKDGDFTVFECPGLAPT 
GEMEVRNPLLDHS PPAPRPRGPRGGAGPGASH 

2283 

A 

129 

1471 

RMATPLPPPSPRHLRLLRLLLSGLVLGAALRGAAAGHPDVAACPGSL 
DCALKRRARCPPGAHACGPCLQPFQEDQQGLCVPRMRRPPGGGRPQP 
RLEDEIDFLAOELlARtTP^VTT.TJQZvpv /t imTjnDT dpoatt j~ , ircn'Dr , r\ 

GLDLGLPSTPGTPTPTPHTSLGSPVSSDPVHMSPLEPRGGQGDGLAL 
\VLILAFCVAGATALSGALLC\WSGLQREIRLTQKA0YATGEGPPGS 
PAAPRISPGDOPJliAOSAEMYHYOHPTATTv^caf:*RWTnsflPPTrPT riTTD 

PSDEENEDGDFTVYECPGLAPTGGKWKVRQTLLF\DHA\ALSAPLPG 
PQAHPPATAMTWRQNRRATCSPALEGPRGGAGPGASPLKTCFDAVCF 
WLGPRAPGPGTPCQGRPPNLCAKDTSWSPAPLLFGLDPQTGGGMENR 
RAQERVGNSRDKSQLKS ISDLRKKKKST 

2284 

A 

85 

409 

TCGLPMCQKQTVSWGGKTPSLRRFPPQRPRPPRGQPPSGARWEH/PP 
PGTTAPPJ^SRRPSPPARLAGPEP*AKSPLHAANSRGQTKGHLWKTLV 
LVFKKPNFFFFNFHN 

2285 

A 

193 

609 

NQINFCLNGKYTYICIDTLPLYMFNIHTLKHINTSVI ISLiEFAI *HK 
GQVELHI KI TYRSN* MWLGHNQRRN/ LCPQEGEE I PNBA* IFSIIKR 
QSWPGTVAHAC1^ITLGG*DGRIS*DQVPKTSLINIVBTPSI»LKK | 

2286 

A 

117 

435 

imPFKXRNPGGPGWPVAJ^IPGMRGHKPFWLTLPLGRAQLSLDTLIS 
LQSFLGSRGSYSSNRRKSISGSRQPHVSDSLLNSRQNYKNYSMESRT 
HFR I QAATLQA 

2287 

A 

2 

1923 

\^TPPQGSVHSGHXiGSVVGDPHTGTGNAGERGPRGKGARVLALDSGG 
MDSSPSLPLIRTPESSLHEALDQCMTALDLFLTNQFSEALSYLKPRT 

kesm™sltyatilemqamotfdpqdili^gniwix^qml 
ssvtdsfsslvnrptlgqfteeeihabvcyaecllqraaxtflqgss 

HGGAVRPRALHDPSHACSCPPGPGRQHLFLLQDENMVSFIKGGIKVR 
NS YQTYKELDS LVQS SQYCKGENHPHFEGGVKLGVGAFNLTLSMLPT 
RILRLLEFVGFSGNKDYRTjTiDT.PFnaQnwcppcT/T mnwrr t t r»vTJTi?T 

TFVLGTGNWIEEAEKLLKPYLNRYPKGAIFLFFAGRIEVIKGNIDA 
VSDGGPGRGWGSLGVSQTSRKSGTCDILRDRIDWGRGGGPRENQPES 
RGRRG PS GRAAWEDKGGGG I CGAWDFDWE 1 * DCS I AVEGGGGRCLRE 
AEVRKAHLFRA* RLGWSLVPPLHYSSLLLFHFVTKNQSPRRGLYLS P 
TSCKT*EVKPGLEARSPGSWGPT*A*HRAGPLCPGGVPVCCGVGRFG 
GCRGVGAGWAPVRLTRRCLQAI RRFEECCEAQQHWKQFHHMC YWELM 
WCFTYKGQWKMSYFYADLLSKENCWSKVG 

2288 

A 

1 

1818 

LGEGGGKKGLWDLVAGLHPLGGQQSPMGQKGHKDSLYPCGGTPESSL 
HEALDQCMTALDLFLTNQ\FSEALSYLKPRTKESMYHSLTYATILEM 
QAIWTFDPQDIL1^GN1WKEAQMLCQRHRRKSSVT\NSFSSLVNRPT 
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IX3\QPTEEEIHAE\VCTAKCLIiQRAAIiTFLQDENMV\SPIKGGIKVR 
NSYQTYKELDSLVQSSQYCKGENHPHFEGGVKLOVGAJmTLSMLPT 
RI LRLLE FVGFS GNKDY GLLQLEEGAS GHS FRS VLCVMIiLLCYHTFIi 
TFVLGTGNVNIEEAEKLLKPYLNRYPKGAIFLFFAGRIEVIKGNIDA 
AIRRFEECCEAQQHWKQFHHMCYWELMWCFTYKGQWKMSYFYADLLS 
KENCWSKATYIYMKAAYLSMFGKEDHKPFGDDEVELFRAVPGLKLKI 
AGKSLPTEKFAIRKSRRYFSSNPISLPVPALEMMYIWNGYAVIGKQP 
KLTDGILEIITKTVEEMIiEKGPENEYSVDDECLVKLLKGLCLKYLGRV 
QEAEENFRS I SANEKKIKYDHYLI PNALLELALLLMEQDRNEEAI KL» 
LESAKQ\NYKNYSMESRTHF*IQAATLQAKSSIENSSRSMVSSVSL 

2289 

A 

486 

713 

SKFYLFIYLFYFFVYTYVINIVYIKI*QQIDLNFIYLFIYTjFYFWRQ 
SRAFLS KL * CSGANMTHLSLNLPGSRNTPAPALQAAGTTGM*HHTWL 
IFWFVETEFHHVGLKFLSLD\ IHPPWPSKVLG*QGGWIV 

2290 

A 

1 

4137 

MDESTFAGGQEYGGWVPEHHKSNEPKSGHIEKGNTDQCQERTPQPMI 1 

YPTGE I E S TVNAWMLSHEKEQS Q I LL I SNCLD ALKHRS CD YGLKI AIj 

DIKHRLKIQPKIFIYTRTHKYAVTLHMLAHTITSERDDIEEWLNAWV 

VDQCCP PEMS VMMEMFYI YAVHYGRS LLS WSQEEEKEKGGREEEEEE 

EEEEEEEEEEEEEEEEEEEEEAEEEKLiGEAAVPEGVQLGWIjTALSLV 

VNFHPWLVS PWTWYRPCAI LIS LDRARAAAQGGSRKGWAE FQARGT 

ERREQCSAALVREMEQRLLCGLLHPRPGCTARIFDLSVARWIiAARWG 

SACPGGAARAPRGALAPGLRQSEAVPAPIiPPRVADAAGASGKRSGRR 

ARSCPSPMARAARVAVAAVGPHAGHRTRRLAGRGREGRAALAAGAQIj 

PAIGDARRHLVSRKRPSEMGFDPACGRRVFRRRGCGQAVPPLWFPGL 

LLAPSWPDGRPGRRAHLKFALLYGEGKFESPEGKYFAKRQARIRGWT 

QEDPEASGTSSNRARSEPSVLTVAGILQTESRKTAKDICKIRCLCEE 

KENVLNINCENKGFTTVSLLQPPQYRIYQLFLNGNLLTRIiYPNEFVN 

YSNAVTLHLGNNGLQEIRTGAFSGLKTLKRLHLNNNKLEILREDTFL 

GLESLEYLQADYNYISAIEAGAFSKJiNKLKVLILNDNLLLSLPSNVF 

RFVLLTHLDLRGNRLKVMPFAGVLEHIGGIMEIQLEENPWNCTCDLL 

PLKAWLDTITVFVGEIVCETPFRLHGKDVTQIjTRQDIjCPRKSASDSS 

QRGSHADTHVQRLSPTMNPAliNPTRAPKASRPPKhnWRPTPRVTVSK 

DRQS FGP I MVYQTKS P VPLTCPS S CVCTS QS SDNGLNVNCQERKFTN 

ISDLQPKPTSPKKLYLTGNYLQTVYKWDLLEYSSIjDLLHIjGNNRIAV 

IQEGAFTNLTSLRRLYLNGNYLEVLYPSMFDGLQSLQYIiYLEYNVIK 

EIKPLTFDALINLQLLFLNNNIiLRSLPDNIFGGTALTRLNLRNNHFS 

HLP VKG VLDQLPAF I Q I DLQENP WDCTCD I MGLKDWTEHANS P VI IN 

EVTCESPAKHAGEILKFLGREAICPDSPNLSDGIVIjSMNHNTDTPRS 

LSVSPSSYPELHTEVPLSVLILGLLWFILSVCFGAGLFVFVLKRRK 

GVPSVPRNTNNLDVSSFQLQYGSYNTETHDKTDGHVYNYIPPPVGQM 

CQNPIYM/ SRKGRETQ*AYIiPKPGKEFQIiLGNLEEKKEEPATPAYTI 

SATELLEKQATPREPELLY*NIAE/PSQGTSQRRP*STITFVPYLKG 

Q/ CCPFPMEFSDGQNQGQDSIKTVIiYGTPRKCFVGQSKPNHPLLQAK 

PQSEPDYLEVLEKQTAISQL 

2291 

A 

1 

774 

MNRAFSRKKDKTWMHTPEALSKHFIPYNAKFLGSTEVEQPKGTEVVR 
DAVRKLKFARHI KKSEGQKI PKVELQI S I YGVKI LEPKTKAEE ITI/T 
IGQAFDLAYTKFLESGGKDVETRKQIAGLQKRIQDLETEIOMELKNKV 
QDLENQLRITQVSAPPAGSMTPKSPSTDIFDMIPFSPISHQSSMPTR 
NGTQPPPVPSRSTEIKRDLFGAEPFDPFNCGAADFPPDIQSKLDEMQ 
EGFKMGLTLEGTVFCLDPLDSRC 

2292 

C 

214 

483 

MTPKSPSTDIFDMIPFSPISHQSSMPTRNGTQPPPVPSRSTEIKRDL 
FGAEPFDPFNCGAADFPPDIQSKLDEMQEGFKMGLTLEAQYFV 

2293 

A 

701 

1034 

TFIWLILIMNIiAFSRKKDKTWMHTPEALSKHFIPYNAKTCL^ 
CHDYKTWMHTPEALSKHFI PYNAXFLGSTEVEHPKGTEVVRDAVRKL 
KFARH I KKSEGQKI PKV 

2294 

A 

440 

1383 

SSSWNRAFSRKKDKTWMHTPEALSKHFIPYNAKFLGSTEVEQPKGTE 
WRDAVRKLKFARHI KKSGGQ\ KI PKV\ELQI S I YGSKNS *DPKTKR 
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EV\ QHNCQLHR I S PCADDKTDKR\ I FTP I CKDSBSNKHLCYVFDSEK 
C AEE I TLT I GQAFDLA YRKFLE SGGKDVETRKQ I AGLQKR I QDLETE 
NMELKNKVQDLENQLR I TQVS AP PAG SMTP KS P S TD I FDM I PFS P I S 
HQSSMPTRNGTQPPPVPSRSTEIKRDLFGAEPFDPFNCGAADFPPDI 
QSKLDEM/ QAPEMEGFKMGLTLEGTVFCLDPLDSRC 

2295 

A 

412 

842 

GKVSPPGL*GP*<3EQGPLG\PSGAATGLPHSLCPEAFASPRSVEYR* 
DQAQLALWAVPL*EGPL/PDLVRGVVPGSPVKVPGDEVRT/CHSCGR 
FPRALGQS KTLPPNLPLTTRRGVNSEDLPS PVYRAADRGNRFPGAVG 

2296 

A 

1036 

1429 

GRAWRTCPCRAHQPRRSHLRPSSCSTLQRG*SPPPGRSSSPVQP*CA 
* ARGWG SGPG * HWTCLFLAWQS PQCTGPQ I CQY PRPLRS PRPAGPPG 
GTS GHS PPDARAPRPRRP APCGRECS S S SGRNAAPLG 

2297 

A 

44 

397 

VSWLPQES SHHYFKYCKI SALALLKMVMHARSGGNLEVMGLMLGKVD 
GETM 1 I MDS FALPVEGTETRVNAQAAAYE YMAAY I ENAKQVGRLENA 
IGWYHSHPGYGCWLSG IDVSTQM 

2298 

A 

298 

409 

HHVMCVFFSSCKTTIEAIHGLMSQVIKDKLFNQINIS 

2299 

A 

123 

1221 

KRQLFPLSWAMAASGSRMAQKT\WE\LANNMQEAQSIDEIYKYDKKQ 
QQE I LGGEALD * GVRSHHYFKYCKI \ S ALALL \ KMV\ MHARS / GEGN 
LEVMGLMLGK/ VWDGET\MI IMGKFFFCLVEGT\ETRVN / SLQAAAY 
EYMAAYIENANQV\GSPENAIGWYHSHPG\YGCWLSG\IDVSTQMI*\ 
NQQFQ\EPFVAV\VIDPTRTISAG\KVNFGAFWAHPK/GGYNPPD\E 
GP\SGVPGLFPLNKIEDLGVHCKQYYALE\VSYFKSLLGSQNCLELL 
WN\ KI WGGIRWSS /SLAGLLNADYPTGQVL * FVLKKLEQSRSPSLGR 
GEFPCWGLETHD\RKSEDKLAKA\TRSS\CKLPIEAI\HGLMS\QVI 
KDKLFNQINIS 

2300 

A 

2 

434 

QRGGIRGNGDPFPWVGLSVGLSVPTTGSERRIjSGDLSSMPGPGTLSV 
R VS P PQP I LiS RGR PDSNKTENRR I TH I S AEQKRRFNI KLG FDTLHGL 
VSTLSAQPSLKERAGLQEEAQQLRDEIEELNAAINLCQQQLPATGVP 
ITH 

2301 

A 

1 

2423 

MAGALAGLAAGLQVPRVAPSPDSDSDTDSEDPSLRRSAGGLLRSQVI 
HSGHFMVSSPHSDSLPRRRDQEGSVGPSDFGPRSIDPTLTRLFECLS 
I^YSGKLVSPKWKNFKGLKLLCRDKIRliNNAIWRAWYIQYVKRRKSP 
VCGFVTPLQGPEADAHRKPEAVVLEGNYWKRRIEVVI^EYHKWRIYY 
KKRVSGGGPGRPQSFPPAAAGYRPPRKIPGKGILTPELAPLGPSIQS 
RADSATVWPQRLLAASLPRGRLRKPSREDDLLAPKQAEGRWPPPEQW 
CKQLFSSWPVLLGDPEEEPGGRQLLDLNCFLSDISDTLFTMTQSGP 
SPLQLPPEDAYVGNADMIQPDLTPLQPSLDDFMDISDFFTNSRLPQP 
PMPSNFPEPPSFSPWDSLFSSGTLGPEVPPASSAMTHLSGHSRLQA 
RNSCPGPIiDSSAFLSSDFLLPEDPKPRLPPPPVPPPLLHYPPPAKQE 
TVPEFPCTFLPPTPAPTPPRPPPGPATLAPSRPLLVPKAERLSPPAP 
SGSERRLSGDLSSMPGPGTLSVRVSPPQPILSRGRPDSNKALLGSFL 
GSPNSLLPETENRRITHISAEQ\KRRFNIKLGFDTLHGLVSTLSAQP 
SLKVSKATTLQKTAE Y I LML \ QQERAGLQEEAQQLR\ DE I EEL\NAA 
INLCQQQLPATGVP I \ THQRFDQMRDMFD\DYVRTRTLHNW\ KFWW 
SSIiKPMAGGLQGLWQGSSLTWAQFSILIRPLFESFNG^lVSTASVHTL 
RQTSLAWLDQYCSLPALRPTVL\NSLRQLGTS\TSIL\TDPGRIP*/ 
EQATRAVTEGTLGKPL 

2302 

A 

1 

535 

S VTVEW I ENGDTKGKE I DLES IFSLN PDLVPDEE I EPS PETPPPPAS 
S AKVNKI VKNRRTVAS I KND P P S P DNTR Wf? A. p A p P *3 n P P pn q q q a o 
QNARRKSNCVKEVEKLQEKREKRRIjQQQELPJEKPJ\QDVDATNPNYE I 
MCMIRDFRGSLDYRPLTTADPIDEHRICVCVRKRPLN 

2303 

A 

155 

276 

YNTYSLFLFFFQDKVKSFRAALQEEEQASKQINPKRPRAL 

2304 

A 

1 

477 

MRAFLAAIiVTDWYQYRLAANALFTFLYCDTFKLEI LKAFKRYKES IR 
WLEDEKALLEMTEEAHDYDVDSYATQLEAIIiEQKIDILTELRENETLV 
PKTTKTNKTHKDCGARGGANQKTTEGKAIHSEKRAPNKSATYTAKEE 
HNKTGQERREPEKYRESE 
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2305 

A 

221 

2323 

LGLQMHTTSGRIHQAMVTSLNEDNESVTVEWIENGDTKGK\EIDLES 
IFSLNP\DL\VPDGEIEPSP\BTPPPPASSAKVNKIVKNRRTV\ASI 
KNDPPS \ RDNRWG\ S ARARPSQFPEQS S S AQQNGS V\ SDI SPVQAA 
KKEFGPPSRJU^SNCVKEVDKLQDKREMKRLQQQELREIRAQDVDATH 
PN YE I MCM I RG FTG S LD YRPLAS AD P I DEHR I YVCVRKR PLNKKETQ 
MKDLDVITIPSKDWMVHEPKQKVDLTRYLENQTFRFDYAFDDSAPN 
EMVYRFTARPLVVTIFERGMATCLAYGQTGSGKTHTMGGDFSGKNQD 
CSKGI YAIiAARDVFLMLKKPNYKKLEIjQVYATFFE I YS GKVFDLLNR 
KTKLRVLEDGKQQVQVVGLQEREVKCVEDVL\KLIDIGNSCRTSGQT 
SANAHSSRSHAVFQIILRRKGKLHGKFSLIDLAGNERGADTSSADRQ 
1 KJjji. Va/Ui X IM Ivb JUJ-lAJj IUj, C I RALGRNK P HTPFRASKLTQVIiRDSFIGE 
NSRTC\MIATISPG/MWASCENTLNTLRYANRVKELTVDPTAAGDVR 
PIMHHPPNQI\DDLETQWGVGSSPQRDDLKLLCEQNEEEVSPQLFTF 
HEAVSQMVEMEEQWEDHRAVFQESIRWLEDEKALLGD*LEEVDY\D 
VDSYATQLEAILEQKIDILTELR\DKVKSFR\AALQEEEQASKQINP 
KPPRATj 

Xn-Tvxt x\j-vxj 

2306 

A 

264 

451 

VLPNLTVHAS PTT I KI WAG \ KVTHAYNPS TLGGQRGR I A * AQE F KTS 

T . CM T MOT .P T . Q V \CWT%C\C 

2307 

3 



r*]±jr,r*wij vkimXiI Kb VMULiX 1 VCCVbKKGADHSSAPPADGDDEEMMATE 

A/T'P Q AMZiRTi'l'l )T f/~iTf f'T.M VT-TC\rT.C7*" ,,r PT T 7\ A x rnn.rr frpiTT w*t /^Murrv 
v a inciLi xiyjJvsxvx^jji'iXvriJCi v bou i. XiXjiAL/A V X WLir X V ijix^j Jjx^rLHvjy 

HDGCMASLVHLAFQIYEALRPRYLEIRAVMEQIPEIQKDSLDQFDCK 
KKTKPMLETE VLDNDGGGLATI FEP * 

2308 

A 

3 

101 

STHASESNFSICSILCFCLEAPISLMYSLSASLEDQPWAWAIiCGGGR 
APSTT*PPLSL.*HIj 

2309 

A 

360 

485 

SMCLTVSMLVTACPADPCVT*A*NRSSFEVPPPWTSDLRFF 

2310 

A 

100 

190 

TLSTYFIPVIDSCPSLGIFSWQPPEHNKER j 

2311 

A 

2 

193 

WRRRRRRFCPQPQSLIWSGMRSPKKLTPPR*NYLRQALRR1^LSTLRR 
PFAQETAWYQWCFLGIi 

2312 

A 

103 

237 

FLCLGRYPHIiPPSSPGQRQVGISGPEVAWWYTHPSPSNPAFLL 

2313 

A 

3 

1361 

VETQEGKTTIIEGRITATPKESPNPPNPSGQCPICRWNLKHKYNYDD 
VLLLSQFIRPHGGMLPRKITGLCQEEHRKIEECVKMAHRAELQRELQ 
RLSPLPATQVYYQITGLGFLKELFRRANPNSTGPLAKAQGGYLTRWA 
PGSVKPIYKKGPRWNRGEANRKAHFSTVQWPAPSAIAHAKDDVSPLQ 
PTQCLiLiDNI KKKVVPIjSGRPGNTDSTCQESRS I YliKSQAXiAVTFTHF 
LTATAEQAEPSQLLSPPPGLGEKPPADLVLQW/SEAPRAKQHYCTSP 
SHPAQRRVSSSDS\EHPAAASHPRRPPRPGTNGWSRRGSCRASWSRD 
CSGVRS RRVALGKAl^QVNALCEQLVKAVTVMMD PNS TQRYRLEALK 
r U£i li r i\-c# ^ Jf x u v trs^ UXjKXjAe* KTQVAIVRHFGLQILEHVVKFRWNGM 
SRLEKVYFEEQWSWNLIAKGPFEHFGRGEPY 

2314 

A 

13 8 

397 

SHCPLSMHVGGTRPSWWGGTGV* LWPRAC/ RVGLLLPSTGAGGTLEA 
LLSVSDTASKVFLELARHCPSEGVNPTGHAHPVPSRGPFIj 

2315 

A 

JL O 

Jib 

QXjNGRSIRHBVmSHRKFSAPRHGSI^FLPRKRSSRHRGKVKSFPK^ 

pskpvhltaflgykagmthivrevdrpgskvnkkevveavtivetpp 
mvwgivgyvetprglrtfktvfaehisde/crllplrqkkahlmei 
hvnggtvaekldwarerleqqvpvnpvfgqdemidvi 

2316 

A 

2 

451 

prakaqkgspvlwaqkscppkttralakv\acigawhparvafsvar 

vvjyj\AJXJi.f KXisxx \ivjvx x JxxLjQ(jY1jIK/DGKLIKN\N7VSTDYDLSD 
S INPLSLLVQTKRRALEKIDLKF IDTTS KFGHGRFQTMEEKKAFMGP 
LKKDRIAKEEGA 

2317 

A 

1 

625 

CKFIRVMAHTPXRLLPiRRKKAHLMEIQVNEGTVAEKLDWARERLEQ 
QVPVNQVFGQDEMIDVIGVTKGKGYKGVTSRWHTK-KLPRKTHRGLRK 
VAC\KIX3K1^IKNNASTDYDLSDKSINPLGGFVHYGEVTNDFVMLKGC 
VVGTKKRVLTLRKSLLVQTKRRALEKIDLKFIDTTSKFGHGRFQTME 
EKKAFMGPLKKDRIAKEEGA 

2318 

A 

1 

1251 

MAS KLRL SDMTGEKGIiNIAP I KI I LNS L FEDG SSI GLQG AQGAVDKP 
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LG YKAGMTHI VQEVDRPGSKVNKKEVVEAVT I VETLPMMWG I VGYV 
ETPQGLRTFKTVFAEHISDECKRRPYKNWHKSKKKAFTKYRKKWQDE 
NGKKQLEKDFSSMKKYCQKKAHLMEIQVNGDTMAKKLDWARERFEQQ 
LP VNHVFGQDEM ID 1 1 RMTKGKG Y KGVTSHWHAKKL PRKTHQGPRKV 
SCIGAWHLVHVAFSVARTWQKGYCHHTEINKKIHKIGQGYLIKDDKL 
I KNNASTDYDLSDKS INPLGVFGHYGEVTNDFVMLKGCVTKQRALQK 
TDLKFTDTTS KFGHGRLOTMEE KKA QMRR T . TCfmn T & \c i?xro iv 

2319 

A 

1 

903 

KCSSGHCGKVKSLPKDDPSKPVCLTGFLGYKAGMTHIVQEVDRPGSK 
VNKKE WEAVT I VETL PMMWG I VG YVETPQGLRTFKTVFAEH I SDE 
CKRRFYKNWHKSKKXAFTKYRKKWQDENGKKQLEKDFSSMKKYCQKK 
AHLME I QVNGDTMAKKLDWARERFEQQLPVNHVFGQDEM I D 1 1 RMTK 
GKGYKGVTSHWHAKKLPRKTHOGPRKVS C IGAWHLVHVAFS VARTWQ 
KGYCHHTEINKKIHKIG<^YLIKDDKIiIKNNASTDYDLSDKSIlJPLG 
VFGHYGEVTNDFVMLKGCV 

2320 

A 

1 

1341 

MKAAGRGDVPCKARDAEI PKAMGAHLLHQHDLDVRHGVXEPFWVTPF 
LLQVCPGFCHLLP YHRYFQSLMVT YKI F IDVARKF S APRHGSLGFLP 
OKRS SRHRGKVKS F P KDD P Q KPVWT.Tli "PT .nvvn riMTH t mp T7\m cone 

KVNKKQTVFAEHISNECKRRFYKNWIKSKIOCAFTKYCKKWQDEDVKX 
QLEKDFSSMKKYCHVIRVIAHTO^ 

EKLDWVCERLEQQVNVNQVFGQDEM I DVI GVTKGKGYKGVTSRWHTK 
KLPCKTHRGLCKVACIGAWHPARV7VFSVARAGEKGYRHRTEINKKIY 
KIGQGYL IKDGKIil K1WASTDSDLSDKSTNPLGGFVHYGEVTNDLVM 
LKGCVVGTKKRVLTLHESLLVQMKRQALEKIDLKFIDTTSKFGHGCF 
QTMEQKKAFMGPLKKDGIAKEEGA 

2321 

A 

1 

1131 

MSHRKFSTPRHGSLGFLPWKRS SRHCGKVKS FPKDDPSKPVHIjTAFL 
vjiAnwnini vivo vjLfrciroo l\ VjNJ\JVe» V VISA V X JL V ortr'jr Vlj A Vvjv» Vb i PQ 
GFRTCKTVFAEHISDECKPJIFYKNWHKSKKKAFTKYCKKWQDEDGKK 
QLEKDFSSMKXYCQV I WIAHTQMQLLPLCQKKAHl^E I QWGGTVA 
EKLDWAGERLKHQVPVNQVFGQDEMIDVIRVTKGKGYKRVTSRWHTK 
KLPRKTHQGLCKVACIGAVTOPARVGFSVGYLIKIX5KLIKNNASTDYD 
LSDKS INPLGGFVHYGEVTNDFVMLKGCVVGTKKWVIiTLRKS 
KQRALEKI DLKF IDTPS KFGHGHFQTMEEKKAFMGPLKKDR I AKEEG 
A 

2322 

A 

10 j 

1304 

ESDGVTSHRKFSAPRHGTLGFLPRKRSSRHRGKAKSFPTDDPPKPGH 
LTGFIxAYTAGMTHIVRE^RPGSKANTKEAGRAATIVETPPMVSVG\ 
I VGYVETPRGLRTFKTVFA'RRT cn'Fr'Trpp PVTfTtfwwvciririf APTVvr , v* 

K\WQDEDGKKQLEKDF\SSMK\KYCQVI\RVI\AHTQ\MRLLPL\RQ 
K\KAHLMEIQV\NGGPCGPRSWDWAREKLE\QQVPVNQVF\GQDE\M 
ID\VIGGDPGAKGYKRGSPVRWHTKKAAPARPHPPJVCAKVGLVLGAW 
\HPG\RVAFSV\ASRWGRKGLPFHRTEINQGRIFKDWPRATLIKD\G 
KLIKDOCLHLNYDLS * PRASNPT.GGFVX HYW* ^DPKmPVMTA KCl\ PV 
VGTKK\RGAHPFRKSLLVQTK\RRAIiEKIDLKFIDT\TSKF\GHGRF 
QTMEEKK7VFMG\ PLKKDRIAKEEGA 

2323 

A 

138 

737 

TTTMSSKKAKTKTTKKRP\ ORATSNVFAMFV\ HSOTOfiVOPGPFNMT 
DQ\NRDGFHSDK/ EKVlxHDMlxAFSREESPI*D^ 

S PWFLTMFW * ES LNRTDPLKI VIQKPPFA\ CFD * KKATRHPFRED YL 
/ REELLTNPWGDRVYQI EE VG * TCTGEAPI * QKKGNFQ I TSEFHHGI 
LTGRPPKHKDDLKELLA 

2324 

A 

2 

307 

KPSSNPJ^SLQWTAVATFLYAEVFVVIiLLCIPFISPK^ FKSRLL | 

ELLVSYGNTFFWLIVISALVHTIjQAHYRGPDCYTFPJ^Ri 

QACTALA 

2325 

A 

2 

215 

KLEEENRSLKAIDLQKLKD^ 
YDRLLEEHAKLQAAVDGPMDKKEE 

2326 

A 

1 

213 

PSSNRMSLQWTAVATFLYAEVFVVIiLLCIPFISPKRWQKIFKSRLVE 
LLVSYGNTFFWLIVILVLLVIGE 
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2327 

A 

2 

935 

FRPRYEGRGRGCCGRYRLLRRGLHVDCGKLGNKLTSSCGKPSSNRMS 
LQWTAV\ ATFL YAEVFWLLLC I PFI S PKRWQKI FKS RLVGWSVPM 
GNTFF/VXiVIjIVILVLL\VIECRAAKFRBVWMDVDGKRVNIiPGTIPG 
GHGEALSHMEAFSVAPGGNLLHWLGFSLLIi\SFLLRRL\VTIiISQQA 
\TLL\ASNEAFKKQAEKC*VKAARKYMEENDQLKKG\APFDEGKIjNF 
ENAELKLEEK\NRELK\ANLQKLKDELGQALSKKLEKG*KTSFWPMR 
KQFMRGLTQ/ ER YDRLLE \ EQPKAAGLQVDGPMD \ KKEE 

2328 

A 

2 

1217 

QARGASLRPSLLRIPSEYAFTMQLRLFRRLIiliAALLLVIVWTIiFGPS 
GLGEELLTLSIiAFLI»PAPALPGPPLALPRIjLlfc > TQEACCGPGAPPFIf 
LILGGTAPENLNQRNAIRASRGGLREARGLRVQTLFLLGEPNAQHPV 
WG\ SQGE * TGPSESAAQGDILQAAFQDS YRNLTLKTLSGLNWAEKHC 
PMARYVL \ RTDEDG I ANVPELVS EL VLRGGRWGQWER S TEPQREAEQ 
EGGQVLHSEEVPLLYLGRVHWRVNPSRTPGGRHRVSEEQWPHTWGPF 
PPYASGTGYVLSASAVQLILKVAS\RAPLLPLEDVFVGVSARRG\GL 
APTQCVKLA\ GA\ THYPLDRCCYGKFLLTS HKLDPWKMQEAWKL VGG 
S\DGEKDCAPFAPWFPGKSLGILRCR\AIA\WLQS 

2329 

A 

1 

395 

AINYNEKI YELRVMETKP\DKAVS I IECDMNVDFDAP/LGYKEPERQ 
VQHEESTEGEADHSGYAGE/ljGFRAFSGSGNRLDGKKKGVEPSPSPI 
KPGDI KRGI PNYEFKLG\ EAGGRFVAFSGEGQSLRKKGRKP 

2330 

A 

1 

1494 

GTRKSGRRGFFVALPERSGVCQWSIMFSFNMFDHPIPRVFQNRFST 
QYRCFSVSMI^AWPNDRSDVEKGGKIIMPPSAIiDQIiSRLNITYPMIiFK 
LTNKNSDRMTHCGVLEFVADEGICYLPHWMMQNLLLEEDGLVQLETV 
NLQVATYSKSKFCYIiPHW^IMQNLLLEEGGLVQVESVNI J QVATYSKFQ 
PQSADFLDITNPKAVLENALRNFACLTTGDVIAINYNEKIYELRVME 
TKPDKAVSIHECDMNVDFDAPLGYKEPERQVQHEESTEGEADHSGYA 
GELGFRAFSGSGNRIoDGKKKGVEPSPSPIKPGDIKRGIPNYEFKLGK 
ITFIRNSRPLVKJCVEENRTTDIQATPTPLTPATHRLHTFDRPl^VLG 
LHSQPPLQACVPWGGHSTLDPPYVMPNQSQLLCEFTSILSGRSSKTA 
GAADLGDMADGSGWQPPRPCEAYRAEWKLCRSARHFLHHYYVHGERP 
ACEQWQRDLASCRDWEERRNAEAQEKDE 

2331 

A 

2 

1062 

^ 

GRAVGGVSSLHCPERSGVCQWSIMFSFNMFDHPIPRVFQNRFSTQY 
\RCFSLS \MLAGPNDR/ SS IGEKEGKI IMPP\ SALDQ\LSRI»*HYRI 
PMLFKLTN\RNSDRMTALLQCLEFVADEG I CYFPTL/ WMMQNLLLE\ 
EGGLV\QVKSVNLSSATLLPNFQPQSP\DFL\DITNP\KAVFEN\AL 
RNFACL\ TTGDVIAI \ NYNEKILRNCWMETQTPTRPVSHH* SVNH* 
TV\DFDCFPLGLPKNPGKTKSHHEESEQKVKADPTVAYAWKSLGFPR 
FSRGSG\NR\IJDGKKKGVEPSP\SPIKP\GDIKRGIP\NYEFKLGKI 
TF/VSRNSRPLVKKVEEDEAGG\RFVAFSGEGQSLRKKGRKP 

2332 

A 

1 

214 

APARSALCSGSAPRTSGQPWGTAPAPPSFHQRNTPCPWSAPPSGAAP 
WGSTKGQRLDGPCCAPFPPRGLSM * SKVPWWPA* TAPAP PS FHQRNT 
PCPWSAPPSGAAPWGSTKGQRL.DGPCCAPFPPRGLSM 

2333 

A 

1 

1516 

MAPLQQH I AVL PDTTADQVTTEDDG YTWTALQHVATAKE KRKS SQTM 
EESQDFPLHI PRPAQGTQRAVARPS PGPKAWTEAGPAKDAARSEKPG 
GRKAPGGRGQLTCRRSLSPVRQPPVSVPVSDSLRRRSPCSPADWREA 
G ATSLDRI RP I LVLMVDKKLVWFG S TGAQGGS VAR I LLEDGTFKVR 
VVTRNPRKKAAKELRLQGAEVVQGDQDDQVI MELAIjE\ WAYATFIGT 
NYWESCSQEQEVTPA/ GSLDRIRPILVLMVDKKLGLHYVVYSGL\GT 
SRS * RQGDWPPRTLTAKGRWRNI SGTLAFP * PVWG\ FLANLENLLLQ 
LFFPRKTPEGKELLCGACPQVTFPWMACPCLTWGPVVLSIiFEDARKI 
RRPEHRG*AIiAGTRPRSTLPCSPSTPARSCTMPR*LPEDYEKLGFPG 
ARDLGQHVPFLCPETPTRD I / PSLTLRLNPKALTLDQWLGTAQRGLQ 
PAVTLPASRPLVGIRGHQRGKKAPTSEIKPICSPKKKKK 

2334 

A 

3 

1037 

I LR S VARAQRGRRS LRFHWS PGGHGGRS EAG AAG P / VEMRRCTAAPQ ! 
GRVPGL/PSKLGGIPDALPTVAAPRPVCQRCGQPL\ALWQVYCPLE 
GSPFHRLLHVFACACPGCSTGGARSWKVFRSQCLQVPEREAQDAQKQ 
GNS LAAEDWCEGADD/ CGK* Y * GGAFTTVYLG FWE * CQQC / HKT* TG 
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LLGSKTS / CliQDAVLGAAHPVPPGIiPIiFLP YYI CVADEDDYRDFVNL 
DHAHSLLRD YQQREG I AMDQLLSQS LPNDGDEK/ S * EDHN * KVGGSR 
PF*QIP*KRNWLLVREQIIiRVFPGVESHSF*PAPTSEVTEIiPSLQPV 
WEAKRIFEFQLMPALGQHASRVLI 

2335 

A 

156 

450 

LQ VE I Y KTVKMHC I WHKQMVS S Y FF YQKKNELSNFE E VFLKTAS GM F 
FSTVGCHPTRCGEFEKNNPDLYLKELLNLAENNKGKW\AIGECGLD 
FDRLQ 

2336 

A 

349 

1187 

LHGGNLQD S KDAL.HLAQTNGM FFS TVG \ I H PTRCG * / YFEKNNPDLY 
LKG VAK I FA/ IlKTIjKGKIjWQIjGEC \ GIjDFD \ RIiQFCPKDTQLQYFEK 
QFE\LSEQTKLPMFF\HGRNSHGWNFLDIMKRNRDRCVGGVVHSFDG 

\DAP\WCGVKSTHA\GSKYIRTAFPS\KKKWESGHCLKDRNEPCHII 
QILEIMSAVRDEDPLELANTLYNl^IKVFFSWNIIGICLPLSIMYVK 
FHSKTS 

2337 

B 

240 

387 

NGKERGTGDVKLLKHKEKGAIRLl^IRRDKTLKICANHYITPMMELKP 
NAX* 

2338 

A 

1 

152 

AAAKDTHEDHDTSTENTDESNHDPQFEPIVSLPEQEIKTLEEDEEEL 
FKM 

2339 

A 

1 j 

830 

MRRPRRRLCWLRCLVECGPLQSRSAVGNSEEQRKAYVFFSSLSLVDT 
rLbJJriUl o 1 \rJN JL 1J JtioJNIjIJJFyr bFlvo \ JjirlSyr»± \ Js.1 ±jr*x^IJx^x^£f J-ir JVW 

TjAVT TTPT?2i QTJ*KTnT.D\ TTUTTROf^TV nn\ \TVX T TfWin7'Ki" ,,, x\T\ 13T T.\ MT7P\ 
KJAJYXj Jr K. r M.O JTjIv ir \ C W JVE»,Klj J. \bU \ V JVUXJlVJtlJVufVVvAX \ xvxjIj \ 

DKDP*KIC\ANPLPSRPLMGAESPTAGRRSVPWVLETPHA\DFRPNE 
CPPSPELPGPPRF\LNA*RMPQKFQNQSFEELQGKRSEGEEEKESRF 
QGKNDSL/HRKKCAEKPGSSLR*REETKEDAEAEAINRIjIIiLSFSSIi 
SLS 

2340 

C 

392 

631 

MRTAESCRRLSWAALLGSHWRCCSHPCAAGQGPYPRAAVNGPRTPtA 
YLRFKEMFSSGLINVEFPQFLYLSHFHSLLLR* 

2341 

A 

3 

121 

FCLIGLFLFPQALSGFRQIHVIDMDTIDVSNIiNRQFLFR 

2342 

A 

1 

2970 

MRKKFHI I VCGLDS 1 1 ARRWINGMLISLLNYEDGVLDPS S IVPLIDG 

GTEGFKGNARVILPGMTACIECTLELYPPQENCPACSQLPQNIQFSP 

SAKLQEVLDYLTNSASLPFIATSFSHEDIDNHTEDSVQYVGYEPRYL 

RLLAIRHIVLRLFPGAHARQGGEERDWQSGEGRGWRSGEGRGQKHNR 

QGADRLPPPPPPDSGGEERKAENMDVNIAPLRAWDDFFPGSDRFARP 

DFRDISKWNNRWSNLLYYQTNYLWAAMMISIVGFGCCVRGRSKRM 

RTWAPFVSAVRERPRLS S FAKQAIiS QAQKS IDRVLD IQEEB PS I WAE 

TIPYGEPASKHNLPEPEPRIASRIiGHCWDPRSEVRKSSGKGRTIANS 

GPTPVFFRISSPVSGGWDTSTWGLKSNTEPQSPPIASPKAITKPVRR 

TWDES ENF FS AFL S PTD VQT I QKS PWS KPPAKS QRPE EEVKS SLH 

ESLHIGQSRTPETTESQVKDSSLCVSGETLAAGTSSPKTEGKHEETV 

NKESDMKVPTVSLKVSESVID\nCTTMESISNTSTQSI.TAETKDIALE 

PKEQKHEDRQSNTPSPPVSTFSSGTSTTSDIEVLDHESVISESSASS 

RQETTDSKSSLHIiMQTSFQLLSASACPEYNRLDDFQKLTESCCSSDA 

FER I DS FSVQS LDS RS VSE INSDDELSGKGYAIiVP 1 1 VNS STPKS KT 

VESAEGKSEEVNETLVIPTEEAEMEESGRSATPVNCEQPDILVSSTP 

INEGOTVIiDKVAEQCEPAESQPEALSEKEDVCKVTLTVEFLNEKLEK 

REAQLLSLSKEKALLEE7^FDNLKI)EMFRVKEESSSISSIiKDEFTQRI 

AEAEKiafQLACKERDAAKKEIKN^ 

IRGIxMEEDWAAVFELYNCEDERCYLDIxAJRL^ 

DKGHHPTLGEHPKFTITY'SFDVTCRFMYT.VT.O^ 

uivvjnnr x jjvj&ii c ivi x in x x x** v Xj jlj x t j x j_i v xj^^rixAXxxx v x/\^xix *\nx x iuuvi * 

DEL 

2343 

A 

48 

1493 

DGQYRIWWLIEWHV1XK3CGDTETWKD 

FEP\STESLQFLLDTCK\VLVIGAGG\LGCELLKNLALS\GFRQIHV 
ID\MDTI\DVSNS*IRQFLFRPKDIGRPKAEVAAEFLNDRV\PNC\N 
WPHF\NK\ IQDFN\DTFY\RQFHI\ IVCGTGTLSCARRWING^4LIS 
L\LNYERMVS*DPSSIVPLID\GGTEGF\KENARVIti\ PGMTACIEC 
TLELYPPQVNFPMCTIASMPRLPEHCI\EYVGMLQWPKEQPFGEG\V 
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PLDGDDP\EHIQWIFQKSLERASQYNIRGVTYRIiTQGWKRIIPAVA 
STNAVI AAVCAT * / EVFKI ATS AY I PL * I ITWVFNDV\DGLYTYTFE 
GRKGK\ENCPAC\SQLPQNISVFLHQAKLQ\EVLGIIIiTNSASLQNE 
ILPAITATLGGEKIGTLYIiQSVTSIEERTRPNLS\KTIGRIGGLLDG 
PKTGRLLDVTTP\QTVIjFQTS illlkgkspt 

2344 

C 

166 

312 

MXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS 
CR 

2345 

B 

355 

487 

MLGLRSGIPAPVRLQLAVVLGNSLGNFGTEGPGLKGRTMVTGDPX* 

2346 , 

A 

754 

1035 

GKVRTVKI KYWMLARS S GS SL * SQHFGRLRQADHEVQEI KHHPG \ KH 
SANP\SLLKIQKVS*LVSQAWWWVPWPAIREVEAGEWCEPGRQSLQ 

2347 

B 

80 

432 

MANGPRAXFCENFQAALALSRVGLHKNPEKEPYKSKYSARALLEEVK 
ALLG PAPEDEDERPEAEDGPGAGDHALGLPVAQRAVRLAVI EFHLGV 
NHIDTEELSAGEEHLVKCLRLLRS * 

2348 

A 

2 

1060 

HWLSTANVISDYYD\ KS SPTQDTAETV* DTPELYHQGKGEIARCWIK 
YCLTLMQNAQLSMQDNI*ELDLDKQSELIALRKKELDEEESIRKKAV 
QFGTGELCDAISAIEEKVSYLRPLDFEEARELFLLGQHYVFEAKEFF 
Q I DG YVTDH I EVVQDHS ALFKVIiAFFETDMERRCKMHKRRIAMLEPIj 
TVDLNPQ YYLLVNRQ I QFE IAHAYYDMMDLKVAIADRLRDPDSH I VK 
KINNI^KSALKYYQLFLDSLRDPNKVFPEHIGEDVLRPAMLAKFRVA 
RLYGKIITADPKKELENLATSLEHYKFIVDYCEKHPEAAQ\EIEVEL 
ELSKEMVS \ LFPTKM\ ERFRTKMALS 

2349 

•v- 

A 

11 

2571 

VKCRKAEGRRESRLQTFEESQAVEAAMANVPWAEVCEKFQAALAIjSR 
VELHKNPEKEP YKS KYS ARANM\ EE VKALLGPAP\ EDEDERPEAEDG 
PGAG \ DHALGL PAEWE PEGPVAQANRIiRL \ AVI \ E FHLGVNHIDTE 
E LS AGEEHLVKCLRLLRR YRLSHDC I SLC I Q AQNNLG I VWS EREE X E 
TAQAYLESSE^YNQYMKEVGSPPL\DPTERFIiPEEEKLT\BQERST 
RF\SKVYTHILYYLAQVYQHLECFETA7UTY\CHSTLKRQLEHNAYHP 
IEWAINAATLSQFYINKLCFM\EARHCLSAANVIFGQTGKISATEDT 
PESEGEVPELYHQRKGE I ARCWI \ KYTiF * LLMQNAQLSMQDNIGELD 
LDKQSELRALRKKELDEEESIR\KKAVQF\GTGEL\CDAISAV\EEK 
VSYLRPLGFEEARELFLLG\QHYVF/ERAKEFFQIIX5YVTDHI\EVV 
QDHSALFKVXiAFFETDMERRC\ KMH\ KRGLAMXjEPIjT\ VDLNPQYYIi 
LVNRQIQFEIAHAYYDMMDLKVAIADRLRDPDSHIVTQCINNLNKS^ 
KYYQLFLDSIiRDPNKVFPE\HIGE\DVIiRPAML\AKFR\VARLYG\K 
IIT\ADPKERAGKIWATSL\EHYKFIVDYC/EKRHPE\AAQEIEVEI* 
ELSKEMPGRRSETPSLKERNKKDKIQTAKTTSAPPAQSLALATPQPV 
RMCRQLTRVI S LG S S KGKAS FAGPDRPPYEDPHI VRLQL YHPQRERS 
CVTSSRRSCAKSPWTLSRRWPHGSSWEKSYELSDGQVITSSNKRFHC 
PEALFQPSFIX3MESCGIYKTTFKSIVKCDVDIHKDLYANTVLSGSTT 
MYPGIALQDAEGDHCPGSQHEEDQDHCSF 

2350 

A 

1 

423 

GTWMGLPKYFISLSTKFPIiSFSLSADEDHIASLFLWPRSKPFGVIQ 
RSCASQ^ALPEVLKNPGDPDKMLRFAESPRNLPAGVL/GDIWTDGK 
GSVSLQEQVYHFPLIRPTPGS*ARASFLSCG*EEFRRHGLTFSSTRE 

2351 

B 

211 

1632 

MSTELFSSTREEGSSGSGPSFRSNQRKMLNIiLLERDTSFTVCPDVPR 
TP VGKFLGDS ANLS I LSGGTPKCCUDLSNLS SG E I TATQLTTSADLD 
ETGHLDSSGLQEVHLAGMNHDQHLMKCSPAQLLCSTPNGLDRGHRKR 
DAM CSS S ANKENDNGNLVD S EMKTYLG S P I TTVPKLD KNPNLGEDQAE 
EISDELMEFSLipQEAKVSRSGIiYRSPSMPENLNRPRLKQVEKFKDN 
TI PDKVXKKYFSGQGKLRKGLCLKKTVSIiCD I T I TQMLEEDSNQGHIi 
IGDFSKVCALPTVSGKHQDLKYVNPETVAALLSGKFQGLIEKFYVID 
CRYPYEYIjGGHIQG/UjNIjYSQEELFNFFLKKPIVPLDTQKRIIIVFH 
CEFSSERGPRMCRCL&EEDRSLNQYPALYTPEIiYILKGGYRDFFPEY 
MELCEPQSYCPMHHQDHKTELLRCRSQSKVQEGERQLREQIAIiLVKD 
MSP* 

2352 

A 

200 

1525 

IMASAHLATSAFCTRSSSSGRSGSCKRGSWDSEQALIRAWTTLPSSL 
HAADGSFAGGRGIPSKRWSWRAGQQSSIiCLIMQTSSNCGNSFQLVSE 
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GASWRGLPHCSCAELQDSLNFSYHPSGLSLSVRPSSPGNSPKEQPFS 

QVLRPEPPDPEKLPVPPAPPSKRHCRSLSVPVDLSRWQPVWRPAPSK 

LWTPIKHRGSGGGGGPQVPHQSPPKRVSSLRFLTSSQCLFSMCPSSQ 

TLQPSFLQPGPGP\SSSRPCAASPQSGSWESDAESLSPCPPQRRFSL 

SPSLGPQASRFLPSARSSPASSPELPIAT*G/LSATFPESRSQPCDL , 

DARKTGVKRRHEEDPRRLRPSLDFDKMNQKPYSGGIiCliQBTAREGSS 

ISPPWFMACSPPPLSASCSPTGGSSQVLSESEKEEEGAVRWGRQALS 

KRTLCQRDFGDLDLNliIEEN 

2353 

A 

2 

280 

RRRGFCFITYTDEEPVKKLLESRYHQIGSGKCEIKVAQPKEVYRQQQ 
QQQKGGRGAAAGGRGGTRGRGRGNQSTYGKASRGGGNHQNNYQPY 

2354 

A 

2 

971 

RQRRAQSAFFRSRLAAVLVGSSGRRSGSAIjANRGQKKAGLTAFCSIQ 
RSAAAAAATRTARQHPPADSSVTMEDMNEYSNIEEFAEGSKINASKN 
QQDDGYRTLLPPLPLPQRAARALRSSKMFIGGLSWDTSKKDLTEYLS 
RFGEVVDCTIKTDPVTGRSRGFGFVLFKDAASVDKIENIELPMDTKT 
NERRGFCFITYTDEEPVKKLLESRYHQIGSGKGQNWNQGFNNYYDQG 
YGNYNSAYGGDQNYSGYGGYDYTGYNYGNYGYGQGYADYSAGTSLQA 
VCHPDHVYLWGSHVAGRAQGIHQKTLSCGMVSSNFMYQRKD 

2355 

A 

604 

878 

SLHHKDFTGMLLLQFAHHLGEIGFSLLISPKSRQPRLFliPSVRQCGA 
AAA/RRCYRLAPRPVCSGKRRTAPGAGD *GGERARPPRARVFCPR 

2356 

A 

1 

1042 

DLFRRHFKSS S I QRSAAAAAATRTARQHPPADS S VTMBDMNEYSNIB 
EFAEGSKINASKNQQDDGKMFIGGLSWDTSKKDLTEYLSRFGEWDC 
TIKTDPVTGRSRGFGFVLFKDAASVDKVLELKEHKLDGKLIDPKRAK 
ALKGKEPPKJCVFVGGLSPDTSEEQIKEYFGAFGERENIEFSMDTKTN 
ERR\GFCFITYTDEEPVKKLLESTYHQIGSGKCEIK\VAQPKEVYRQ 
QQQQQKGGRGAAAGGRGGTRGRGRGQGQNWNQGFNN\YYDQGYGNYN 
SAYGGDQNYSG\YG\GYD\YTGYNYGNPMEYGTRGYGQTYQWPQQST 
LMGK\ AS \RGGGQSTQNNYPAHT 

2357 

A.y ,;. • « 

A 

f -*. - 

586 

1970 

PQEFTPLRFSLLILLFSQARVPSRVASLLAGRAAMEVPPRIiSHVPPP 
LFPSAPATLASRSLSHWRPRPPRQL\APLLPSLAPSSARQGARRAQR 
HVTAQQPSRLAGGAAIKGGRRRRPDLFRRHFKS SS IQRSAAAAAATR 
TARQHPPADSSVTMEDMNEYSNIEEFAEGSKINASKNQQDDGKMFIG 
GLSVO)TSKKP\liTEYLSRFGEVVDCTIKTDPVTG\RSRGFGFVIiFKD 
AX\ is'VDKVLGT * KGHKLDGKLIDPKRAKALKGKEPS PKRFL^WSGfLSP 
DTS\EEQIKEYFGAFGEIENIELPMDTKTN\ERRGFCFIT\YTDEEP 
VKK\ LI»E SRYHQIGSG\ KCE IKGCTTPKRYIiGQQQQPPKGCK/ RGAA 
AGGRVGTRGRGRS \ QGQNWTQGFNNYYDQGYGNYNS AYGGDQNYSGY 
GGYDYTGYNYGNYGYGQGYADYSGQQSTYGKASRGGGNHQNNYQPY 

2358 

A 

592 

608 

WLRWAT I YKAGQPQLEAQNPYS FRECQPFLRLDHLRTAG AEQNCAAA 
VWVRRL*RimGFVLLTGA\ASFI\lWAHLSHQCSQGPAKKVPKWEYP 
YPCTATEPLNNGHHPSTCIQAKPHPRNTVGKV*PSPSLFFPSLLGGV 
YHPAV*ASWPWAWPWICWTEFLYA\YGLLHGENPSKRSPRSPWGSI\ 
ALLAFGWGTTVCSGFPS ILVWVK\ SGLGQVEPKC\ CHLKKL 

2359 

A 

857 

1343 

S FRI S S FG 1 1 TFKI VS FG I MAQTHAMGQFFERNS VP * LTQGEI ENLN 
RPIPVKL/ESITNNLL**KAPGLDGFTDD*SNI*EINL*PRNISLKF 
FFLRKKSYQFSTIASKKIEAKGTHPN*L*ETSITLILKSDKDMTRKG 
KLQSLINIDAKILNKILANQMQ 

2360 

A 

84 

326 

DKAYQCSDCDK/ ELHSVTLVSR* SVLSMHQS IHIWKKKLICFLK/ CH 
KAFSQ* SALI IKQRIHVAQI I * ICCI WESLQPE WL 

2361 

A 

635 

1170 

RLFKCILGLLTFPPDSSHPQLPGESPGSFEKSRCPGCWPHPAQPHQS 
RW A\GPACS S Y SGVLQG* EPQI * SLQHGRGGGGRTGGLEKETRTGSR 
ISDTALGG/ EAGPCPCGQLPHLPGPPPAAHDAGPYIP/ PKGPGGSGY 
LHPHG S AKLG SNQG AMKQ AGRT\ GS AGSQG S AE FHKAALWT 

2362 

A 

2 

424 

LFW S PQTQREPTMVLS PADKTNVKAAWGKVGAHAGE YG AEALERHFD 
LSHGSAQVKGHGKKVADAIiTNAVAHVDDMPNALSALSDLHAHKIiRVD 
PVNFKLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR 

2363 

A 

3 

452 

QTQRE PTMVLS PADKTNVKAAWGKVGAHAGE YGAEALERMFLS FPTT 
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KTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDL 
HAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 

2364 

A 

3 

522 
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VPKLHSNPLLPLPVVRTNDYPEQGGGVWERNQPYWFWFIiYFSQ 

2418 

A 

298 

532 

MGSWSLPGRTLWKEKVRFLKSKERTNPGSNFFSFPGSQHTREERSQE | 
CLSLPVLPS YIiATRLSLPDSCPLKGRKAH * QRHSE I * LMGFEPGWPG 

KL^GWIELAFLLGCLHLTIiPLPCPTPLPPKSTKSGCFSLSIAIiCFNK 
QFAVTK 

2419 

A 

5 

977 

v vjxxrtM.vaix vj Kvj 0 A bAAV v V V AAMLG AW AVEGTAVALLRLLLLLLPP 
AI RG PGLGVAGVAGAAGAGLPES VI WAVNAGGEAHVDVHG I HFRKC I 
PL*GRVGRASDYGMKLPILRSNPEDQSCYQTE\RSN\EETFGYESPI 
QREGGLTCWSLKFAJB\VYFAQSQQKVro 

HSTAHDEIIPMSIRKGKT.QVOnPlA CTT vtpxtoxtmt? /mrvovr * nT n 
KVCAL\YIMAG\TVDDVPKLQPHPG\ *ERKKGEEEEEE\YDEGSIPK 
KNR P I KNGGQSG P \ RHNPTP * ASDNS SLMFP I LV\ AFG VF I PTLFCL 
CRL 

2420 

A 

1 

454 

RNSEKEAHSKRKVALQEAKLKAKGLNPDGTPAXSl^ 

REVKAEEKS P I S INAKTVKKEPEDROOAS \ KS P YNR YTTKTR PQPQr-TV 

SSRSRSRSRSHSESPRRHH1TOSSPHLKAKIITRDDLQSSNRH 
SRSRSQSKSR 

2421 

A 

19 

434 

QRGl^SFEIVAMACIl^xxA^KIEEAPRRIRDVINVFHm 

S YI YQSLGDVEWYIEMKLS S WQQTPS PLI LDQNYINTKNQVI KAERR 

VLKELGFCVHVKHPHKI I VMYLQ VLE CERNQTL VQTAW WHDG I T 

2422 

A 

171 

437 

TSNFYYLSx'lTNRRSRSGTYSSRSRSRSRSHSESPRRHHNH 
KHTRDDLKSSNRHGHIRKXSR\SRSQSK5RDHSDAG*KHRLE 

2423 

A 

21 

1855 

FPGRRFRLRWACVRLAKTMASGPHSTATAAAAASSAAPSAGGSSSGT 
TTTTTTTTGGILIGDRLYSEVSLTIDHSLIPEERLSPTPSMQDGIiDL 
PS ETDLR I LGCEL I QAAG I LLRLPQ VAMATGQ VLFHR FF YS KS FVKH 
\SFE\IVAMACIl\n^KIEEAPRRIRDVIlWFimLRQLRGlvRTPSPI, 
ILDQNYINTKHQVIKAERRVLKELGFC^T^KHPHKIIVMYLQVLECE 
PJ^QTLVQTAWNY^lNDSLRxWFVRFQPETIACACIYL\AARALQIPL 

ptrphwfllfgcteeXeiqeicietlrlytrkkpnXyelXlekkvek 

RKVALQEAKXKAKGLNPDGTPALST/ LGVGFS PAS KPF IHQENVKAG 
RRKSP IFHLMWKDSSKKEPGG*D/ RQAFPKALYNGVKEKDS /RRRSR \ 


loons 
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NSRS AS \ RSRSRTRSRSRSHTPRRHYNNRRS \RSG\TYSSS PRSRSP 
\SHSESPRRHHNHG\SPHIiKAK3m^DLKSSimHGHKRKKSRSRSQS 
KS RDHSDAAKKHR\ HBRGHHR\ DR\ RERS \ RS FEEVP * KARHHGWPV 
R S GTWARAQGADIiS S SLWS PAIQFLGFGP I YQCDGMGLNPKTI KRKT 
GLGFDFLETPPRSS 

2424 

A 

- 

1 

1566 

MLYEKFSPAGPILSIRVCRDMITRRSLGYAYVNFQQPADAERALDTM 
NFDVIKGKPVRIMWSQRDPSLRKSGVGNIFIKNLDKSIDNKALYDTF 
SAFGNILSCKWCDENGSKGYGFVHFETQEAAERAIEKMNGMLLNDR 
KVFVGRFKS RKEREAELGARAKEFTNVYI KNFGEDMDDERLKDIiFGK 
FGPALSVKVMTDESGKSKGFGFVSFERHEDAQKAVDEMNGKELNGKQ 
I YVGRAQKKVERQTELKRKFEQMKQDR I TRYQGVNL YVKNLDDGIDD 
ERLRKEFSPFGTITSAKVMMEGGRSKGFGFVCFSSPEEATKAVTEMN 
GR I VATKPL YVAIiAQRKEERQAKLTNQYMQRMAS VRAVPNPVINPYQ 
PAPPSGYFIAAIPQTQNRAAYYPPSQIAQLRPSPRWTAQGARPHPAV 
HVQGQEPLTASMLASAPPQEQKQMLGERLFPLIQAMHPTLAGKITGM 
LLEIDNSELLHMLESPESIiRSKVDEAVAVLQAHQAKEAAQKAVNSAT 
GVPTV 

2425 

A 

328 

889 

GLRTPAAYLYCGTVRTAGVAAASRYQGDNIYVKNLDDSIDDERLQKE 
FS PFGTI * S AKGMMEGGHS K\ G FGFR/ ICFSS PEEAS KAGTELNGRT 
VATKPS YVALAQPKEECQAHLSNQHMQRMAS VRAVPNHVMNP YKPAP 
PSGYLMTGIPETQNRPPRPGAVGHPCNPSTLGGQEGKNTGSGDRDHP 
G 

2426 

A 

1 

2262 

MKPKTSEEVCSPLVWQYSLFPSHTIXSNPVHIQTU^TTMTFSKSLSPTS j 
T I FRVLGQNSAKTAPVKG IHDHHAVKGNGLFS AS S S GAGVQRAGRLG 
GPGAGGQDPRSQLVSASAAAVGDSSSQQAPLLRPTRVAAVPAGTRSA 
VRVAACG P AGS RAEMNP SAP S Y PMASL YVGDLHPD VTEAML YEKFS P 
AGP I LS I RVCRDM ITRRS LG YAYVNFQQPADAERALDTMN FDVI KGK 
PVRIMWSQRJDPSLRKSGVGNI F I KNLDKS IDNKALYDTFS AFGNI LS 
CKWCDENGSKGYGFVHFETQEAAERAIEKMNGMLLNDRKVFVGRFK 
SRKEREAELGARAKEFTNVYIKNFGEDMDDERLKDLFGKFGPALSVK 
VMTDESGKSKGFGFVSFERHEDAQKAVDEMNGKELNGKQIYVGRAQK 
KVERQTELKRKFEQMKQDR I TRYQGVNL YVKNLDDG I DDERLRKEFS | 
PFGTITSAKVMMEGGRSKGFGFVCFSSPEEATKAVTEMNGRIVATKP 
LYVALAQRKEERQAHLTNQYMQRMASVRAVPNPVINPYQPAPPSGYF 
MAAI PQTQNRAAYYP PS QI AQLRPS PRWTAQGARPHPNTS TQTMGPR 
PAAAAAAATPAVRTVPQYKYAAGVRNPQQHLNAQPQVTMQQPAVHVQ 
GQE PLTAS MLAS APPQEQKQMLGERLFPL I QAMHPTLAG KI TGMLLE 
IDNSELLHMLESPESLRSKVDEAVAVIiQAHQAKEAAQKAVNSATGVP 
TV 

2427 

A 

365 

2419 

FSIPVDFCPSALAPRSPPPGSGPQPRHSLSSSHGKVAACGPAGSRAE 
MNPS APS YPMASLYVGDLHPDVTEAMLYEKFS PAGPILS IRVCRDMI 
TRRSLGYAYVNFQQPADAERALDTMNFDVIKGKPVRIMWSQRDPSLR 
KSGVGNIFIKNLDKS IDNKALYDTFSAFGNILSCKWCDENGSKGYG 
FraFETQEAAERAIEKMNGMLLNDRKVFVGRFKSRKEREAELGARAK 
EFTNVYIKNFGEDMDDERLKDLFGKFGPALSVKVMTDESGKSKGFGF 
VSFERHEDAQKAVDEIWGKELNGKQIYVGRAQKKVERQTELKRKFEQ 
MKQDR I TR YQG VNL YVKNLDDG I DDERLRKE FS P FGT I TS AKVMMEG 
GRSKGFGFVCFSSPEEATKAVTEMNGRIVATKPLYVAIoAQRKEERQA 

YPPSQIAQ\LRPSPRWTAQGART\HPFQNMPGAIRPAAPRPPFSTM\ 
RPAS SQVPRVMSTQRVANTSTQTMGPRPAAAAAAATPAVRTVPQYKY 
AAGVRNPQQHLNAQPQVTMQQPAVHVQGQEPLTASMLASAPPQEQKQ 
MLGERLFPL.IQAMHPTLAGKITGMLLEIDNSELLHMLESPESLRSKV 
DEAVAVLQAHQAKEAAQKAVNSATGVPTV 

2428 

A 

2 

238 

ATMATAP YNYS Y I FKYII I VMADCPHTIGVEFGTRI IEVSGQKIKLQ 
I WDTAGQERFRAVTRS YYRGAAGALMVYD I T 
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2429 

A 

68 

866 

GS TGAVTAATATAATAAQ AS APLGLPVYLPAPRSTAPAPS PAASALL 
FLVLTPPSRPERIjPSHQQPPSATMATAPYNYS YIPKYI I IGDMG\ VG 
KSC\LIjHQFTEKKFMG*FVPHTIGCWNLGTRI\IESLVGQKIKIj\QI 

UT/TST T> Onn\ TTDT.D1WP / 13 7\ WO i 01 A 7\P7\T MTrVTlTTDD OTVXt\ ttt cpmt 
nuLKUwy \£KLKAV 11/ KAI X KvjArtAj AUM v iJJl I KKo 1 X JN \ xlXj b> b W L» 

trcqgispnpkyw**fsign*sqiwrpqrdvtyee\akqfaeengll 

FT . F A Q AKTGP R T VED A P P T .P A f* PP K T YON T OTV3 KT .f3 Q 

2430 

A 

22 9 

593 

slicfavgvgdca^fwvtrlapraekrrlapavpphhqderggtstp 
hfh1^shgfwvtfpallpqydilplnsflheiqepslg\sglgpfsc 
ntyl i s fs stchlapvinlwalpgas im 

O A "i 1 

TV 

A 

O J** 


FI-AXj VXjlrXjiXX^X AX J\ w oriXjX^WXjXjXjvan w VIj/ v*ilv 

* I PS I LTKI FENGYFVI S VLLMRNI LS S S ALL 

2432 

A 

1479 

1734 

LKVi^ISLGMVGHAraPSTLGSQE*ATVPLILYFLNKLAFTSHCGITL 
NSFLHKIQEPSLG\SRLGPLSCNYHATALQPGQQSKALS 


A 

30 

CIA 

510 

T Wr*T J^''Kr'DT? p r'WT?r , i , T T \7"'17T VT3 17V r P'C > T5r>UT OMPUCOVT T XTDXTDT^'DWO'D'KrC V 

LtV4\3Li\3Vi fti 1 c^CjU 1 Vr Jj1S_K£»x if rKnLoJNuvooXuJUJ>iKJNKJDKVbFr«i^ 

HLVLLRDGRTL I G FLRS I DQF ANL VLHQTVER I HVG KKYGD I PRG I F 

VTOGEIWVXLGEIDLEKESDTPLQQVSIEEIL 

LKVQALKDRGLS IPRADT 

2434 

A 

338 

853 

EGRCjFuKAOUPPljClAAljr Q FKMNYMPGTA \TLiIEDI \DlUt\xil»VJjL» 

pj>grthigfl\rsidqfanlvlhqtve\rihvgqk/ytgdiprgifv 
vti\geiw\vllg\eidlekesdtp\lqqvsh*rkfleeqkgwnq\qt 

TTT \ 'P7APVT.TrV/f^\ AT.VTiPfZT.CI T 'DT? A"nT , T?nVT*kT T?T?^T3I7 Q 
xn_Li \ Ij/^CiIVXjXVV V \ AXjJ\X'x\VjXjO J.±rrCAJJ X " UJ\UXjJC V LrCtnVuo 

2435 

A 

357 

893 

TVFQIWANHPGQRPARQLCPVHLLGGQSHQEPRH\TSNSEPSRASAS 
LRRLTPLKPTG I GPG S Q I LEDRDFRAFT * WGD YF WGKGAP ACL I QDG 
VKGQWAGPHS GVGGVGW P APRACP PVFS PQGPQS S ACGCQYYLS PRP 

V7\ voor!M<^'Df2r2T~v'DDr2r^Kr\ro c ouf'rvT ■Q/-»r ,, iv/n»J r PT mptpct? 
iSAx x^yoWl^lxvjvalJi^l^oUlN VKoo V^LJXjK{_1_I v iW iJjJMLlLor 

2436 

A 

233 

567 

RLLFLVRKRMI S FSAPPLMLPFS F YFFVCPAARTARKRKPS PEPEGE 
VGPPKINGEAQPWLSTSTEGLKIPMTPTSSFVSPPPPTA*PHSNRTT 
PPEAAQNGQS PMAALIL 

2437 

A 

219 

702 

ARKVPJUtAAGLESRTGSTGPRPCATRCWRCTSTATRGPSRASFKKEP 
ALTAVARTARKRKPSPEPEGEVRAPLRSTEKAQP\GCPHSTEGLKIP 
MTPTSSFVSPPPPTASPHSNPGPHRLESGPRMGQFPIGSPDLSSRQF 
RGQ S C S KD AN Q VH S TTRRN S N 

2438 

A 

173 

1136 

ALEGAGKSRGFWRAGLGQQAQDRAPJIaAGRCTSTATSGPFESKFKKE 
PAIjTACjKJj fviVr bCjOKUQFGIiKAVCKNSKGKGSPIjPEPRlWKSUFljK 
IQRRGPSRRL/ SPHPQRGSRSP*L1^PLLCLRHOTLPHLIPTGPHRL 
KRPRMAS PPWQP * S * * QTMQGASHAS KDANPG\ SLPTTRKE * PTVGP 
SSVLL*TKERLGPKRGGGPREQATQEDWEPVHPASLPDFtiFGQPVAP 
IiCCTLCHERLEDTHFVQ\ CPS VPS \ HKFCFP\ CSSQGI KQQGS \ SGE 
V^\CPSG\DKCPSCGLPMSPWA\FMQGEIATI\ 

2439 

A 

1 

206 

PTRPRTS YEKQGYLLLPPVFS I VLEVLARAI SQETE I KSVQMGKEEV 
KLSL\ FVCVYI YA*ENPVESTK 

2440 

A 

611 

1204 

ERLLRVCLALG S AGTADDVKLHP VS CADTARS TPGTS A P S TS GTS AP 
FTPGTSAPSTPGTSAPSTPGTSAPSTPGTSAPSKPGTSAPSTPGTSA 
PSKPGTSASSTPGTSAPSKPGTSAPSTPGTSAPSKPGTSAPSTPGTS 
APSKPGTSAPSTPGTSAPSTPG\PQHPPHRGPQHPPSRGPQHPPHRG 
FynJrFoKOFyii.FFnJ^xr'yWFFbKO r* 
PGTSAPSKPGTSAPSTPGTSaPSKPGTSaPSTPGTSAPSKPGTSASS 
TPGTS APS KPGTS APSTPGTS APS KPGTS APSTPGTS APS KPGTSAP 
S TPGTS APSKPGTS APS KPGTS AP S KPGTS APS KPGTSAPADPGVL 

2441 

A 

447 

920 

GWHWFLASQKSLLKPPIiSRNPCVLVS CHMAQS WARTQEFLCPMCPTT 
/CAQSGTGTESCFPNPN*VLGAGAPQPDQRCHKAGPFSS \ PGFRLPS 
SRH\GGPVLPGTLGPPGPPGRPPHKPSNEQRDAGQQLQLPPPACPQG 
KLPQLPFCLKFYSLPELPH 

2442 

A 

1 

376 

LRKDDTQIP^AFHIFVQSWRPALPSLPSLPLARALAETLN^ 

ITPSGILlxNSQMLDFSWPNRTANHSAPS/x^AmPAEGLCGTYx^ 

NGAARGLSGLTQVX1aNA^N*NRNLSDSLARGR 


IT>0jO 
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2443 

A 

1 

315 

AAQVL I MG PDDFI VAMVS S LNQPFGSGIj I TPS G I LLNS QMLDFS WPN 
RTANHSAPSLENSVQPGKRPLSFLLPTVVRPAEGLCGTYIjALGANGA 
ARGLSGLTQGC 

2444 

A 

3 

2027 

NS RVDD FVARARMAAENEAS QESALGAYS PVDYMS I TS FPRLPEDEP 
APAAPLRGRKDEDAFLGDPDTDPDSFLKSARLQRLPSSSSEMGSQDG 
SPLRETRKDPFSAAAAECSCRQDGLTVIVTACLTFATGVTVALVMQI 
YFGDPQ I FQQGAWTDAARCTSLG I EVLSKQGSSVDAAVAAALCLG I 
VAPHSSGLGGGGVMLVHDIRRNESHLIDFRESAPGALREETLQRSWE 
TKPGLLVGVPGMVKGLHEAHQLYGRIiPWSQVLAFAAAVAQDGFNVTH 
DLARALAEQLPPNMSERFRETFLPFG\RPPLPGSLLHRPDLAEVLDV 

GVYRGHLVLSPPPPHTGPALISALNILEGFNLTSLVSREQALHVfVAE 
TLKIALAliASRLGDPVYDSTITESMDDMTjSKATRAAYT.PnHTlJnQriAA i 
P \ A PLR P VYELDGAPAAAQVL FMG PDDF I VPMVS S LNQP FG SGL ITP 
SGILI^SQMLDFSWPITOTANLSAPSLENSVQPGKRPLSFLLPTVVRP 
AEGLCGTYLALGANGAARGLSGLTQVRFTPWLAFFSREPSCGLDCRC 
LS*QSNLLQVDSECRAETSWGGHRDR*RKDSSQGCPWVHGSRRTNNF 
1 1 AVKD PRS PDAAGATI L 

2445 

A 

83 

310 

IFQCTKFTWDSQITPRLFHPAPPTWCRPLPVSG*RRVRGW\TTRQGS 
GGGRGPAPLSSQAPPKGPVPFQWHIjRAEG 

2446 

A 

1 

1833 

MGLAMFAVPPGHTTPAGAPHSASSASLHSERPLSSSAWAGPRPEGLH 

QRS CS VS S ADQ WS E ATTS LPPGMQHP \TLP AD I KLREGQDS ELBQKTA 

VCAAQASx^GEKGRx^VSAPGASPIjMLSVAANAALGW1X3VGVPD<5WG 

SGSPPSAVCVQTQARASTRRQASTVTRTHFEMRPPWSPPGAENQKRS 

CGESSWPAAPQQFALSNSAAIRAEIQRFESVHPNIYAIYDLIERIED 

LRLywyiKHrtv x^XJiWAKKWLKij(j(jPGPDljSRPGRGQVGAAKWL 

GPSGIPSQQLAGGPPAVRAIiQLRGHPGEIQRFESVHPNIYSIYDLIE 

RIEDLALQNQIREHWSIEGKRTGGCGNGPQWPLRRRPRAPQAVLAM 

LSLSGVIiT^RLRTRPRGLPICVAEPAGSFAISRSGTLWGWGYYADCVP 

GREGGVAWGGRDQICPGLAGEEAEESFEFWVSLTGQTWHFEASTAE 

ERD55WVOSVOAOTT»A^T.onAr* c ? AirnTr\A7T Q^;prnri?T?T vpn/nnrDDTM 

EVPAEGPFPGRSDIIALWWNSHIAQGSSATPIALAICKESRKQDKES j 
RKQDKESRMGTGRSSGEWKEKRCGEGEEPGQRPRYT.TjLTT'ARQFtSEA 

2447 

A 

3355 

3951 

ratr pp cdg s ns aa i rae i qr fe s vh pn i y a i ydl i er i edlalqnq 
IREHVISIEGERSRGLREPGRSGLSGAGRGLHRPCLAMLSLSGVIjAR 
LRTRPRGLPICVAEPAGSFAISRSGTLWGWGFSADCVPGREGGVAWG 
GRDQICPGLAGEGGRGRGLQGDPRLLGLEAAAGHAGADRPPGTPHPE 
SGPETPGHCAA 

2448 

A 

699 

977 

LPWLRWAGMALGSLTAGAVGPRPQRCALRPRQGPAPEGRQAPRKGRS 
PHHPPLQSGSWRLPCPPRYRASRPPKPQRARRSRPDRARPAPPAP 

2449 

A 

3 

132 

GRFSSESSPRA\GPPPAPHRGHVQRCCVSPTGLLIWKELBRPFC 

2450 

A 

2 

1735 

TDLKRCTYYETCATYGIjNVERVFQDVG I IDSAVPCS \ AQKWALRKK 
QQLAI GPASHCPNS PK\ HWAVS AAS I PAVH I \ TQAT \NGGGKAFSDY 
SSSVPSTPSISQRELR\IETIAASSTPTPIRKQ\SKRRSNIFTSRKG 
ADLDREKKAAECKVDS IGSGRAI PIKQGI LliKRSGKSLNKEWKKKYV 
T\LCDNGLLTYHPSLHDYMQNIHGKEIDLLRTTVKVPGKRLPRATPA 

TAPGTSPT? ANnT.RVT ; *PQM r rOT.f3rir!TsrDr , Dr , \7\r* TPPi»nTOT Tr*T\r\\Tr tr r'B 
*« r v3 x ijrAm^ojjo v Ciivoii x. rvLsrCvj V V ILunvo Xj luALIVo X if 

EGFNGRRERELLGFQSVQAQILASLQGCLQCQGQDFDLGNQNAAI*AV 
QAVRTVRGNSFCIDCDAPNPDWASLNLGALMCIECSGIHRHLGAHI*S 
RVRS LDLDDWP PELLAVMTAMGNALANS VWEGALGG YS KPGPDACRE 
EKERWIRAKYEQKLFLAPLPS SDVPLRQQLIiRAWEDDLRLLVMLLA 
HGSKEEVNETYGDGDdRTALHLSSAMAITVVFTQLLl/WGESRASSLP 
* PRS S * PCSLKATS S FLP YNQSLSSHVLPTVNMCFLLQYGVDVRSRD 
ARGLTPLAYARRAGSQECA 

2451 

A 

329 

3294 

ESEEILWGEQLAGGPPQQFALSNSAAIRAEIQRFESVHPNIYAIYDL 
IERIEDLALQNQIREHVISIEDSFVNSQ\EWTLSRSVPELK\VGIVG 
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NLSS \GKS A\ LVHRYLTGGPI VQGGS PLKGGGLRSE I AADGHS YI*LL 
I RDEGG P\ GELQYAAW\ VDAMVF / VWS PCEDG I TFS KTVYNYFLRLC 
S FRNAS EVPMVL VGTQD AI S AA\ NPRVI DDS RAR \ KLS TDLKRCTYY 
ET\CATYGLQCGSVSFQ\DVAQKWG/TLRKKQQL\AIGPCKSLPN\ 
S PSHFGRVRRLHS / SAVYQSTS I SQRE/ LRIETIAASSTPTPIRKQS 
KRRSNIFTVPLPPAAAAASSSSWSSSFSSSSLSFSSSSSFSgSSSSS 
SSRCRRSPGAWVSALTGADRLSRKGADLDREKKAAECKVDSIGSGRA 
I P I KQG I LLKRSG KS LNKEW KKKYVTLCDNGLLT YHPS LHDYMQN IH 
GKEIDLLRTTVKVPGKRLPRATPATAPGTSPRANGLSVERSNTQLGG 
GTGPMLFQGPSFHTRFPRFQSPWLPIAFPQDFSIPSWLDHWISCSL 
GPVTAALLVPFLCPVPAVSAWQHRSAAGAHTQELADGAHPARVPLPH 
ADADTDPRAPHSASSASLHSERPLSSSAWAGPRPEGLHQRSCSVSSA 
DQWS EATTSLPPGMQHPASG PAEVLS S S PKLDPPPS PHSNRKKHRRK 
KSTGTPRPDGPS SATEEAEES FEFWVS LTGQTWHFEASTAEERELW 
VQSVQAQILASLQGCRSAKDKTRLGNQNAALAVQAVRTVRGNSFCID 
CDAPNPDWASLNLGALMCIECSGIHRHLGAHLSRAPHNNGPFLSVIS 
REEKERWIRAKYEQKLFtAPLPSSDVPLGQQLLRAVV/DKNELRLIiV 
ML\LAHGSKEEVNETYGDGDGRTALHLSSAMANVVFTQLVI\WTGWN 
*GTPDARGLTPLAYA\RRAGSQEC7UDIL\IQHGCPGEGCGLAPTPNR 
EPANGTNPS AELHRS PSLL 

2452 

A 

493 

955 

TLCTYILISEVISLPGTAFIQRKGLGLRPSQNNAYSIiSQTSPGLIIF 
LRVQGKFDLNIiALVIiFFSGGTESRSIA\QLECSGMTSAHCNLRFIjGS 
S YSPASAS * VAGTTGACHHAQL I FLVFLVEKG FHRVSRDGLHLL /NL 
VIRPPQPPKALGLQA 

2453 

A 

158 

720 

PPPCRAKQHQTKTKKRP\QRATSNVFAMFD\QSQIQEFKEAFNMIDP 
DQDGfr\ ID\REDLHDMLASLG\ KNPT* WS ILDAMMMRAPGPPSNFTP 
WFPHHGFGEE/RLNGHQIPEDVHRKMPFACFE*KKPLGTIR\EDYlA 
RELLTTHGGIGFTDEEVDELYREAPY*PKRGNFNY\SEFTRILK\HG 
A\KDKDD 

2454 

A 

2332 

3329 

VAGATGTHHHAWLILSFLRWSLTLSPRLECSGTFSAHCNFRLLGSSN 
SPASAS*VARITGAHHHTRLIFVFLVETRFHHGGQVGLKLLTSGDPP 
ASASQSTGITGVSHHAWPYFKLFCRDGVLLCCPG*SQTLGLKQ\SSC 
QSFPK*LDYRCEPPRPTNL*RCL\RDRVPYLTTSPVRSRGGTRM/EK 
EVPEQRIiAGSRMGQELTQQPSTETSWSLGRRRTKSPLF*IFFMFLRR 
SL/NSVTQAGVQ*HNLGSLQPPPHGFKRLSCLSLLSSWDHRCTPPCL 
DNF* I FGRDG/GFTMLARQVSNS *PQ\GPPTLASQSSGNTGLSHCNQ 
PVAPIFAPR 

2455 

A 

566 

1038 

GSTLSPSLPGFCPENSNKCKFNTLLTAESLKLPQDLSFDPSQKPTQV 
RHL / ES P PGEG P P \ S RAPQKNSHE I R\ NCLM \ KC I FI» * LGRMKAQTP 
TLLF \ S PGLGMS PAARPRS FPGGLGEVGAGT I S VPS TLTPSTSETTL 
PQPRYGMKLADADVGKKKKK 

2456 

A 

349 

679 

ELGHWGNRVCLSEALGLAELHPCPGNEVGPQRQG*GLAVX.II J AIILL 
QGTLAQS I KGNHLVKVYDW\ QEDGS VLLTCDAEAQN/ 1 STWFKDGKM 
IGFLTEDKKKWESGEVMPRDPRG \ MYQC * RDHRNKSKTTS QVLFTGM 
CSGTCI * /DLNAATISGFPLWLKTVSLF \VLAFGV\YFIA\GQGWSS 
ASPRASDKQTLLP\NDPAPTQPLKDPRKMTQYSHLQGN\QLRRN 

2 457 

A 

356 

505 

WTGGKKKAGWMPKKE/ SVKLAQ * CMPVI PATQE AETGKLLEPRSSRS 
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QGAAEQIQAHPLPNFQKLTSFFI\GENMNPRWAWVLLLDYRE\DGWC 
PP*YDFLLRDGLEMEKMLTNVGNYFGSITLSSITGLLPCHPTQPPGT 
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NSLAAWLTDARTIiAS PNIWILCGNKKDLDPEREVTFLEASRFAQEN 
DGARDTVLETALGPTFMEIiTVQ WVRVEGQTHFQTVTPQEAHVQEE IN 
LWGHVRLPALKDRAAMPYTDMVIHEVQRFADI IPMNLPHRITRDTA 
FHGFLIPKGTDVITLI^TVHYDPSQFLTPQEFNPEHFLDANQSFKKS 
PAFMPFSAGHRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDL 
TPLS SGVALANGERWRILRRFSLTI LRDFGMGKQS I KER I QEEAS YL 
LEEFQKTKTLCNVYNSMTLSKQHTHFEDDETSSEAKIHEEINQVIGP 
HRLPRVDDRVKMPYTDWIHEIQRLVDIVPMGVPHNIIQDTQFRGYL 
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\RKLHSIKWLAFPLSSPILELHSEDTLQMASDCIC\DGHILDSKRYA 
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PR YY I S ANVTGFKCVTGMS CLMRKDVLDQAGGL I AFAQYI AEDYFMA 
KAIADRGWRFAMSTQVAMQNSGSYSISQFQSRMIRKTITRMVTQINC 
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\PKKEIPASPSRTKIPAEKESHRDVLPDKPAPGAVNVPAAGSHLGQG 
KRAI PVPNKVl^E.I FADS VFSHFAP\LGIiKKTLLKAPGSTSin*ARKS 
SSGPVWSGASSACTSPAVGKGLCCTPVCPARPARPLHSCCCKMTPSR 
PWLMWQCREGLECP*GSGGTWV\RHSCDRRAGRGSGQSPRPHFGSSB 
VLG*TWRPGERTGDSPDSSTPKLSRAQRPQSCTSVGSSRH*CCVC\R 
VTVHSTPVRRSSGPAPQSLLSAWRVSALPTPASRRCSGLPPMTPKTM 
PRAVG S PLCVPARRRS S E PRKNS AMRTE P TRE SNRKTDSRLVDVS PD 
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CTSVGRVTVHSTPVRRSSGPAPQSLLSARRVSALPTPASRRCSGLPP 
MTPKTMPRAVGSPL\CVPARRRSSEPRKNSAMRTEPTRESSRKTDSR 
\IiV\DVSPDRGSPPSRVPQALNFSPEESDSTFSKSTATEVAREEAKP 
GGDAAPSEALLVDIKLEPL\AVTSDAASQPLIDZjPLIDFCDTPEAHV 
AVGSESRPLIDLMTNTPDMNKNVAKP\SPWGQLIDL.SSPLI\QLSP 
EADKENVDS PLLKF 

2509 

A 

1 

355 

DVQLEGAKIGSTEITFTPEKIKGGIHT\AYTKTAGSVCLLMQVSMPS 
VLFAAS PS QLHLKGGT \ FKVAQDMATAAVRC I KKE IRDL YVNI QPVQ 
EPKDQAFGNGNG III IAETS TGCLF 

2510 

A 

29 

• 

1262 

TKVSELLCGSQRLFFLPLWRRLCRCGLGPRVSPMAGPRVEVDGSIME 
GGGQILRVSTALSCLLGLPLRVQKIRAGRSTPGLSIMT*RPQHLSGL 
EMIRDLCDGQLBGAEIGSTEITFTPEKIKGGIHTADTKTAGSVCLLM 
QVSMPCVLFAAS PSELHLKGGTNAEMAPQI D YTVMVFKP I VEKFGF I 
FNCDIKTRGYYPKGGGEVIVRMSPVKQI^PINIiTERGCVTKIYGRAF 
VAGVLP FKVAKDMAAAA\ VRC I RKE I RDLYVN I QPVQEP KDQAFGNG 1 
NG 1 1 1 1 AETST\ GCLFAGS S LGKPGFNSDKW I EA\ AEWLLANLRHG 
GT\VDEYPARPSLIVFHGH*PNGVS\RIKTG\PV\TLHTQT/APIHF 
AEQIAKG * I LL*RNPEDEEDA\ AKDTYI I ECQG IGMTNPNL 

2511 

A 

109 

743 

RLFHSNQTVDHSQKNVDITLKGTPQ*SCKGPRGNLCGRDFIHI\NVB 
PQPFLEK*KKRLPGLTKWWGNRKGNWPTRSGLFGSHVQDHDPRGVTT 
GASVTKMRVLCMPHLPPIQPLVYPQENGSSLLKSRNFLG * KNYIGRV 
RMETRVLLCSVSQGQ\ KDELI \ LKGNDI \ ELVSN\ SAAL I QQAPP\ V 
KNKDIRK\FLDGI *VS\EKGTVQPGLIE 

2512 

A 

900 

1189 

SQHFGRPRRADHLSSGVQDQPGQRGKTPSLLKIQKLAGRGGSLL*SQ 
LPRRLRQRENRLNPGGGGCS ELRS RDCTPAWATEGNS VS \ KKNK*KK 
KKK 

2513 

A 

1710 

2221 

SPLLDVYFKTRLiLSRG^REPACTLRLiNTIGTLLSHCPLGIQGPHPGP 
SGKTCGPFKPRGPGHSCPPPDKKQVPTVLPLLPAPLRWAQAGLPGYA 
GISGSPGRPLGAQAG*GSGVIGESNYFVHCMHPRGPGSPHL/PGG*G 
LQPKPPRPRLPSSSGSTTGLGDRKPCSYSQ 

2514 

A 

205 

295 

GCNGNKDSCGPIAPSSGDILLLTVKRSLPR*NFSP\CA\PCSSSSKR 
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AMCSRVCSPWQLi*CGP TAPS SOD TT.T T.twdct t>t> 

2515 

A 

1 

927 

MAYLQDRTLRSRYSNYSRPKSTGLERLFPPQIKTRIPTYDEDDNTIL 
YAYETKPEFVNKVIYDALQNLDKKIDVIRRKVSKIQRFHARSLWTNH 
KRYGYKKHSYRLVKKLKLQKMKKNEVYETFSYPESYSPTLPVSRREN 
NSPSNLPRPSFCMEEYQRAELEEDPILSRTPSPVHPSDFSEHNCQPY 
YASDGATYGSSSGLCLGNPRADSIHNTYSTDHASAAPPSGMLAQGEP 
NLVHAPEMMSYSALMENRYTGSSTjPFPSnT.PTnnwaT ttt Tcmn t v 

HLGVKLGTAVKLCYYIDRLKQGKCFEN 

2516 

A 

932 

1688 

KGFKNPTFPCEIPAGOTHKPHKTThIKTTc;or>t>\7r»VT ttt nv-MvirKrcxrvA c» 
TFSYPESYSPTLPVSRRENNSP\SNLPRPSF\CMEEYQRAEL\EEDP 
I/LSARTPSP\VHPSDFSE\HNCQPYYAS\DG\ATYGSSSG\LCLGN 
PRADS IHNTYSTDHASAAPPSVTRSPVENDC5YT ppnc TT7TWPQTW Q\7 

eawlflkqtdplalcplvdlfrsh^ 

vklgtgi* * s lc \ ylh * pdlkqgkcl j 

2517 

A 

3 

179 

ARIPGICGSGSSSCRLTRSSKAQLCWWNMGLGIFRICITRVSEEAQT 
TSSRRCSQSTQ 

2518 

C 

140 

490 

MAWFHDMNPQSIALlPPATTEISADSQLPCIKDGSEGVKDVEIiVIiPE 
X SMFEDAS VS EGRGTQI E ENPLEEN I LAGEXAS QTGDSGNEXANRGD 
GSDVS SQTPQTS SDWLEXXXLX * 

2519 

A 

194 

405 

KKILWKKIFWRGKQHLKLVTVVTTAANRGDGSDVSSQTPQTSSDWLE 
QVHL V * TAH I WG S KW I YR YRS VC 

2520 

A 

1 

2046 

MRQNS GGNVHLPAAHLVLCGL VPNRPWTGKNKNKRHHLGC I PQP PTL 
AKIAKAAKGSAGNFRYPIjILGS FPANWVFMNFFQS KATLDFNLI csl 
NVKDVAEVFQKWLKIEGK3CCHCLSEKTKQNMGNTTTKFRKALINGDE 
NLACQIYENNPQLKESLDPNTSYGEPYQHNTPLHYAARHGMNKILGT 
FLGRDGNPNKRNVHNETSMHLLCMGPQIMISEGALHPRLARPTEDDF 
RRADCLQM I LKWKGAKLDQGBYERAAI DAVDNKKNTPLHYAAASGMK 
ACVELLVKHGGDLFAENENKDTPCDCAEKQHHKDLALNLESQMVFSR 
uf x^\r»r» ± c*>us i /v/AJjjJiVKii f I JbUijKPQDIjRRLKDML I VETADMLQAPL 
FTARALLPJUnDWDREKLLEAWMSNPENCCQRSGVQMPTPPPSGYNAW 
DTLPS PRTPRTTRS S VTS PDE I SLS PGDLDTS LCD I CMCS I SVFED P 
VDMPCGHDFCKGCWESFLNLKIQEGEAHNIFCPAYDCFQLVPVDIIE 
S WS KEMDKRYLQFD I KAFVENNPAI KWCPTPGCDRAVRLTKQGSNT 
SGSDTLSFPLLRAPAVDCGKGHLFCWECLGEAHEPCDCQTWKW\WLQ 
KI TEMKPEEL \ VGVS EAYEDDRQ/ C VC * VMN * TAKPCANCKS P I QKN 
EGCNHMQCAKCKYDFCWICLEEWEKT 

2521 

A 

2 

1431 

GSAHIRSPGSPARHRASLPAPRRPQPASKDQCGMSAPGAYTGVNFEG 
ALFSGYAPDGGLFMPEELPQLDRETLCQWSTLSYPGLVKELCA1»FIG 
SELLPKDELNDLIDRAFSRFRHREVVHLSRLRNGLNIjIiEIiLHGVTYA 
* x w A ^ xjc»ivivc»rvxi v i v v v Vj 1 ovjJJ X boAAi bp VyX3AJvN 
MDIIVLLPKGHCTKIOELiOMTTVT.rX T7Kn/\ npt r-XTOPMcntnc TUDT 
KTVFADVAFVKKHNLMSLNS INWSRVLVQMAHHFFAYFQCTPSLDTH 
PLPL V^rVVVPTGAAGNLAAGYI AQKIGLP IRIjWAVNRNDI IHRTVQ 
QGDFSLSEAVKSTIJ\SAMDIQVPYNMERVFWLLSGSDSQVTRSLMEQ 
FERTQS VNLPKELHS KLSEAVTS VS VSDE A ttotmrp rwDTTwrvvT .t .r 

PHSAVAVNYHYQQIDRQQPRYRQW/VAWATEGPFEFQGPSSSPSSGR 
DRTTKKWli 

2522 

A 

2 

119 

ESTVSCCEKPTGAMP*MHTACGRPCPS I WRxYTKLSMR LR 

2523 

A 

159 

627 

LGRMQDPQGGQMRWAYSLTFAKDypQIiGLSHYFAGAASGGCAQARCG 
PGPRR*GPPAPVPSPSPIxALPPMGTRKLHPGMWTALH/VRIP/VPGK 
EGCVQL I GE I S CVLLQKAS WS EAS TLCP AKG A I PNG AAFHQRCGRPE 
AHLGHRLSIxAPPREPVET 

2524 

A 

3 

307 

ISSAPQIxAGHKRQFGSVCHQDPRVCDEPSSEDPHEWPEDITKWPVSC 

SGGWREPSVSCGRECHTPPTGDTRDSHHGTTSRFQRLCTCPLYSLS 
INQRLPL 

2525 

A 

1053 

2813 | RTWEKAPEQADLTGGALDRSELERSHLMLPLERGWRKQKEGAAAPQP 


1 00% 



KVRLRQEWSTAGPRRGQRIAVPVRKLFAREKRPYGLGMVGRL.TNRT 
YRKR IDS FVKRQ I EDMDDHRPF FTYWI/TFVHSLVT I IAVC I YG I APV 
GFSQHETVDSVLRNRGVYENVKYVQQENFWIGPSSE\DLIHLGA*IS 
PCMRQDPQGLS F IRSAREREKHSACCVRNDRSGCVQTSEEECSSTLA 
VWVKWPIHPSAPELAGHKRQFGSVCHQDPRVCDEPSSBDPHEWPEDI 
TKWPICTKNSAGNHTNHPHMDCVITGRPCCIGTKGRYVCVPASAAGIi 
WAQSIALGCEITSREYCDFMRGYFHEEATLCSQVHCMDDVCGLLPFL 
NPEVPDQFYRLWLSLFLHAGQVTIilLHCIiVSICFQMTVLRDLEKLAG 
WHRIAIIYLLSGVTGNLA\SAIFLPYRAEVGPAG\SQFGILACLFRG 
SSSRAGQILGAALACLLQACWLWVLFLFHLWGLLPWDLTTFAHIFGG 
S S VALFLS FAFLP Y I S FGKVRPVPETLPDHHLSGGLPRAS WLAWWSS 
STSILSAVSRVSSSPASPFTDKLL 


2526 


936 


MAALYACTKCHQRFPFEALSQGQQLCKECRIAHPWKCTYCRTEYQQ 
ERliECNGTISAHCNLHLPGSSDSPASSSRVAGITGIKTNTICKKCAQ 
NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQ 
CAFDRKDDRKKVDGKLLCWLCTLS YKRVLQKTKEQRKHLS SS SRAGH 
QEKEQYSRLSGGGHYNS FS PDLALDS PGTnuw km— feVATLKK 
MLHQKDQM II ,P.^- V - TAT " " . iKBVTEQ 

lqaknrel! 


2527 


1023 


2528 


157 


512 


MAALYACTl 
ERLECNGT3 
NTVQIiYGTPi 
CAFDRKDDR 
QEKEQYSRL, 
SPDLALDSPl 
LKADFQYQEi 
KKSEKSGAI'l 
LG*LQPPPP<j 
SASRVAGITG 
LGPPKSAGI 
SYDKEIFPIL] 
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2529 


11 


133 


mfcslqptd: 


2530 


3$7 


678 


KWTIiIILIL< 
ADGRLGLAWG 
VI PATAYDNG 



JRTEYQQ 
CKKCAQ 
EQCKQQ 
3 SRAGH 

Vngdsf 

5KKITE 


lALSKS 


dialog.com 


LYT 

.^oi^isRKHKVSGFLVK 



2531 


156 


683 


MKLRQRWRHPAHFSGTGKMIGEGTQVSVGTGVTVDTGVGCLSTSSHY 
ANRNADTFRCLCRKRCQRLLKSQRRGVKKSCISVPHRQLSQALQAMR 
DSS^PWALS I KSLFREDEALASKHYRRN7VPRAALRQFLSDFHRDCLN 
VFKTKYHTLMYMGRTIMRCEPCACGGEGKKNVLF* 


2532 


427 


458 


KNSQYTPLTL IQMMGF * FFVF/ ERQSHSVVEAGVQSCSHGSLQPIiPP 
GSS^PPTSASRVAGTTGAHHHDLLIFLFIi^RRGFALLHRLVSNSWRS 
RDSAWPSAFPKWLGLWDVEATTPRPGLFSHLVAG 


2533 


57 


463 


ARTPLQGAGGLVLSRLLSLPLPGLSPLPTVSCARWPLCPRPWSSLPG 
FWDS WHCNHRTGHRTGPGGLDTQSA*VNKLI WGKGNHL * PQATAQLW 
P\QLGGWQGWTRPS/RWWPLSSPHC * PGYRS LKDS FANTFLLF 


2534 


469 


RKTTTATMTS LCRKVNGGHAG * ETLRRLLWY PWTQRFFDS FGNLS S 
ASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLH 
VDPENFKLLGim.ETVIiAIHFGKEITPEVQASW/QEDGDWSGQSPVI, 
QIPLSSLAMMQSCQG 


2535 


41 


676 


APSPRRPWGHFTEEDQGLLSTSLWCKV\NVEKCWKEKTPGKGSLWY 
P\WT\QRFFD\SFGN\LSSAP\AHHGQTPKVKAHGK\KVIjTFLGRCQ 
QSTLDDLKGTFAQL\ SELHCDKLHVDPENFKIjLG\NVLVTV\ LAIHF 
GKEFTP/ERLQASW\QKMGD\*SGQCPVLQ\IPLSSL\PMMQSFSRI 
RLLFLQAITNNKSISAKRSHMIFFSSFFYIFLNI 


2536 


107 


332 


SRMATLIYVDKENGEPGTRWAKDGLKLGSGPSIKAIJDGRSQVSTPR 
FGKTFDAPPALPKATRKALGTVNRATEK 
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2537 

A 

386 

1074 

NNPEWRTLI YVDKENGEPGTRWAKDGLKLG\ SGPS I KALDG\ RSQV 
S\TPRFGKTFD / APHPALPKAT*KGFnPT.<?TT?2iTl7Tf <3\7Trnt)r , riDT va 

KNGPAFSGPKKMDLKKDCLKQKS \ SVSWPSGLMPNPRNKTKPFPLQI 
LLDF*GVFD/ LCPEEHPDLRHLPPLS \GV\ PLMIPI/TREREL* KSWF 

QLGPPFHLWKDGPLPPWGIPICLQFSFQGISVRTWGV\DLPPVCCX)I 
DI 

2538 

A 

1 

359 

GTRVIKR I S S S REGLAVPGLNATPAP S AFVLS ALQPS PTHS S SNTQR 

LPHRVTGGFSLNGQLIGNKARSPGQHDGTYFGRLGIANAATDFQLEY 
TPENITLNPGFGAPAFSWRDQAVLR 

2539 

A 

1 

483 

FQLEVTPQNITLNPGFGGPVFSWRDQAVLRQDGWVTINKKRNLWS 
VDDGGTFEWLHRVWKGSSVHQDFLGFYVLDSHRMSARTHGLLGQFF 
HPIGFEVSDIHPGSDPTKPDATMVG\LQKDYSKDPWHGAEVSCWFIH 
NNGAGL I DGA YTD Y I VPD I F 

2540 

A 

3 

310 

EVRANLSSQALRMWLDYGFVTPLTSMSIRGMADQDGLKPTIDKPSED 
SPPLEMLGPRRTFVLSALQPSPTHSSSNTQRLPDRVTGVDT*WT*P 
YVLCWNLC 

2541 

A 

1 

1482 

TKEALLR I LEDMKEEDYLNFI LFSGDVSTWKEHLVQATPENLQEART 
FVKS MEDKGMDNFLMRKGKTKAKLEERG ANQDS RNG S KVR YRRAAS H 
EESESEAAACIRCAMAYSVHHNIiRCLVSVDSSVGEGFIDNWEQELPV 

QQPVQGVMSQKARHTYLVAKPPSSGKPDGASGRTMEQFTIHLTVNPQ 
SKVTFQLTYEEVLKRNHMQYEIVIKVKPKQIiVHHFETKEALLKILGD 
MQPGDYFDLVLFGTRVQSWKGSLVQASEANLQAAQDFVRGFSLDEAT 
NLNGGLLRGIEILNQVQESLPELSNHASILIMLTDGDPTEDPPVFIR 
DKLMERRNRRTGRTEKARIWEVTDRTVRTWIGEAVAAAAAIX3VTFSV 
PVTPHTFRHS YAMHMLYAG I PLKVLQSLMGHKS I SSTEVYTKVFALD 
VAARHRVQFAMPESDAVAMLKQLS 

2542 

A 

37 

2885 

SMDGAMGPRGLLLCMYLVSLLILQAMPALGSATGRSKSSEKRQAVDT 
AVDGVF I RSLKVNCKVTSRFAHYVVTSQVVNTANEAREVAFDLB I PK 
TAF I SD FAVTADGNAF I GD I KDKVTAWKQYRKAA I S * EKA\ GLVRAS 
GRTMEQFTIHLTVNPQSKVTFQLTYEEVLKRNHMQYEIVIKVXPKQL 
VHHFEIDVDIFEPQGISKLDAQASFLPKELAAQTIKKSFSGKKTSNG 
SHSRRFQGHVTjFRPTVSQQQSCPTCSTSLLNGHFKVTYDVSRDKICD 
LLVANNHFAHFFAPQNLTNMNKNWFVID I \ SGS \ MRGQKVKQTKEA 
LLKI LGDMQPGDYFDIiVLFGTRVQS WKG SLVQAS EANLQAAQDFVRG 
FSLDEATNLNGGLLRGIEILNQVQESLPEIiSNHA\SILIML\TDGDP 
TEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNG 
RAQR I YEDHDATQQLQG F YS QVAKPLLVD VDLQY PQDAVIaAIjTQNHH 
KQYYEGSEIWAGRIADNKQSSFKADVQAHGEGQBFSITOLVDEEEM 
KKLLRERGHMLENHVERLWAYLT I QELLAKRMKVDREERANLSSQAL 
OMSLDYGFVTPLiTSMSTPnMft'nn , nr;T.V' , D , rT'n'ir'DOT7r»ODT5T ttmt pdd\ 

RTFVLS ALQPS PTHS S SNTQR\ LPDRV\ TGVGHRPFNFIHSTCPRKR 
DTLCF\NINEEPG\VILEPGYRDPNTGFFSGMGQLIGQQGPGSLGEG 
IDGH\YFGAAGEFANPATELFSWEVTPSEHLR*NPGF\GGPVFSW/R 
RDQVFLSGRDGVWTINKKRNT\ WS VDDGGTF\ EW\ LHRVW\ KGS 
S\VHQDFLGLLMCWDKSIGMSSPGRKGCWGQIFPTPIGFE\VS\DIP 
PQGSDPHKS PNA\ TMWRNPPGLTVT\ RGLQKD\ YSKDPWHGAEVSC 
WFI\HNNGAGLIDGAYTDYI\VPDIF 

2543 

A 

159 

1222 

NNSGVMPEMPEDMEQEEVNIP**GGFWVTGCHWGFIjGRAVHKBF\QQ 
NNL/WHAVGCGF\RRARPKF*TG*ICWDSNAV\HHII\HDF\QPHVI 
VHCAAERR PDW\ ENQPRCLPLQLNV\ DASGEF* AKGKAAACLGS I S 
PFYI* AQVFVF * WEPtfPP Y\ REED I PAPLN\ LYGQTK\ LDGRKGCP W 
RNHLGAAVLR I P I L YGEVEK\ LEKKCCELLMFE * KCQFS QQS PAKQW 
IHWQARGSPPHMSKDVAPLCARQ\LAEKRMLDP\SIKGTF\HWSGNE 
QM\TKYEKGMCQLPDA\FNLPSSHL/RDPITDSPV\LGAQRPRNAQL 
DCSKLETLGIGQRTPFRIGIKESLWPFLIDKRWRQT\VFH 
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2544 

A 

211 

463 

STI SAHAYALELLFDQLHEGAKALDVGSGSG I LTAC FARMGGCTGKV 
- L ^ J - LA -'- ri - Lr >- c '->-< vliuo vimjn vkjvUL/PTIjIjSSGRIQIj WGDG 

2545 

A 

188 

366 

SCQSACQKNGKIILRLQLLGP*DRLEDTLIiNTHGLLQSQILCQFCFL 
CTCQQKVLR I S K 

2546 

A 

1257 

1441 

VHFCRIFFMAFLLFTWRSSELSVDMYKESRTGKVFVTATARACSGVC 
LPICPKAQAAAALI 

2547 

B 

74 

733 

MAWKSGGASHSELIHNLRKNGIIKTDKVFEVMLATDRSHYAKCNPYM 
DSPQSIGRVQLWGDGRMGYAEEAPYDAIHVGAAAPWPQALIDQLK 
PGGRIj I Ii P VG P AGGNPNVG AS I PG SRMAS I QNE AL WMGGD I RAI* * 

2548 

A 

45 

972 

TVQPHVSGSGGDGSNSGSYSGDASGAVTVWEWSLLGKLLGTVVALK 
WIi YLLRVCLAMAWKSGG ASTS \ EL IHNLRKNG 1 1 KTDK\ VFEVMLA 
TD\RSHY/SQKCNPYMDSPQIK*GFQ\ATISCSTHCNAYALRNFLFW 
xyij \rt \liUbljiUjrovGSGKWESIiIiACF\ ARMVG\CTGK\ VIG\ IDAH 
* R S * * DDS VKLMFREGT I PTLL S FRGG VQLVCGGMGRMG \ YA\ EEAP 
YDAIHVGAAAP\WPQ\ALIDQLKPGGRLILPVGPAGGNQMLEQYDK 
LQDGSIKMKPLMGVIY\VPLTDKEKQWSGGMIVKATSA 

2549 

A 

2 

479 

RPPRVPSWRGWPAGALRGAPARGIRSCGSRAGRSRMAAAAAAGEARR 
VLVYGGRGALGSRCV QAFRARNWWVAS VD WENEEAS AS I IVKMTDS 

FTEQADQVTAEVGKLLGEEKVDAILCVAGGWAGGNAKSKSLFKNCDIj 
M W I KSJ N I MD S Vlj VS F S CV 

2550 

A 

3 92 

1245 

CTFTKVFLLLRIWYRNMLLKNSVTPLPSCHPSAQCLGTLMRALSQVF 
RDAW/ WWVAS VDWENEEAS AS I IVKMTDS FTEQADQVTAEVGKLLG 
EEKVDA\ I LCVAGG WAGGNAKS KS LF \ KN \ CDLMWEAEHYWTSDHLP 
SUL/ AIjTKHLKEGGI/L\TIAG\AKAAL»DGTPGMIGYGMAKGAVHQLC 

qsi*agknsgmpp\gaaaiaalp\vtldtpmnrks\mpe\adfsswtp 

Lt^ tLiVt.it tiDwl JTGKNRPSS \GSLIQ\ WTTEG\RTELTPAI FLGLI 
SVPMRWPAQKSH 

2551 

C 

60 

266 

MSWLSCLSNITVWNISPVLLSFFGGNSLISHLXKIPEEEPFLTPLLS 
SSPSGFXPHVPRVLRSQSATV* 

2552 

A 

1 

513 

MDLEPVQTEQSGAGVKENRGRNTAWRTPGDLHRRIHTGEKPYKCNEC 
GKVFNQISHLAQHQRIHTGEKPYKCNECGKVFHQISHLAQHRTIHTG 
EKPYECNKCGKVFSRNSYLVQHLIIHTGEKPYRCNKRKGTEIKGRER 
LKGGAEIERRKMMRDSDKVGEDSKKTTYP I 

2553 

A 

345 

2343 

LLGHEEQTRKGKRKAKAAGMVLSQGLLTFRD\EAIEFSQEEWKCLDP 
AQRTLYRDVMLENYRNLVSLGIFSKCEIKELPPKKESN\KEKYSRQ* 
CWKDMKSHRHTKI FCFRETQKNVHDSQCIiWKHD *RHYKRVRVTYKGK 
SH/CVEETMPG*KG*CHKSQPC*KSSLGLNPAVT/CLPRTCQPISKP 
EGKI YKYDHMEKS VNS S SL.VS P PQRI S STVKTH I SHTYECNFVDSLF 
TQKEKANIGTEHYKCSERGKAFHQGLHFTIHQIIHTKETQFKCDICG 
KIFNKKSNIASHQRIHTGEKPYKC^ECGKVFHNMSHLAQHRRIHTGE 
KPYKCNECGKVFNQISHLAQHQRIHTGEKPYKCNECGKVFHQISHIiA 
QHRTIHTGEKPYECNKCG\KVFSRNS YLVQHLI IHTGE\ KPYRCNVC 
GKVEHHI SHLAQHQRIHTGEKPYKCNECGKVFSHKS SL\H^HWR IHTG 
EKPYKCNECGKVFSHKSSLVNHWRIHTGEKPYKCNECGKVFSRNSYL 
AgHLi X XHAGEKP YKCDECDKAFSRNSH\ LVQHHR IHTGE KPYKCDEC 
GKVFSQNSYLAYHWRIHTG\EKAYRCNECGKVFGLHSFLAHDPKIH\ 
TGEKPFKCNECGKAFSMRSSLTNH\QRLHTGEKHFKCNECGKLFRDN 
SYLVRHQRFHAGKKS 

2554 

A 

3 

517 

strtqnvlgekgrrireltawqkrfgfpXaesi^ykllggjlavrr^ 
cygvlrfimesgak/gcevwsgklrgqraksmkfvdglmihsgdpv 
nyyvdtavrhvllrqqvixsikvkimlpwdptgkigpkkplpdhvsiv 
epkdeilpttpiseqkggnaeppampqpvpta ! 

2555 

A 

3 

495 

erggkmavqiskkrkfvadgifkaelnefltrelaedgysgvevrvt 

PTRTEIIILATRTQNVLGEKGRRIRELTAWQKRFGFPEGSVEIiYAE 
KVATRGVLGIKVKIMLPWDPTGKIGPKKPLPDHVSIVEPKDBILPTT 
PISEQKGGKPEPPAMPQPVPTA 
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2556 

A 

24 

861 

INPPPPRLSFQRSAAARWPVQISKKRK\FVA\NGILSKLNLNEFIi\T 
REL\AEDG\YSQ\VEVRVTPTRNRKSFIFSHQKHRMVLGEKG\RRIR 
\ELP\AWQKSF\GFPRGAAVELYA*KRWPT*/RGLCAICPGQSL.CV 
YKPS *GG\ LAVRRAL/ CYGVI*/ RGSFIGEVGPKGCEVCGCLGKLRGQ 
RA*NP * S FVDG\ LM I \ HSGDPV\NYYVDT * \ VRHVLLRQG\ VIiG IKV 
KIMLAWDP\TGKIGPKKP\LPDHVSIVEP/RRDEILPTTPISEQKGG 
KPEPPAMPQPVPTA 

2557 

A 

116 

718 

IFLQAGCLKLVIRSKAGDISHVDTHCQTVHPKAVSISTMTENTPFLT 
LSPALSPIFNNLPFQCIEK*YIiILTLIFt,IAYEVESLCIi*SVFLIYK 
I FL * PLSNFCCL * V/ LLY 1 1 LNLL F YYMYFPTPLFMKSNF I KVT, * KV 
KGFTF*KCSQIYLFFHGYWVLCLGWECLSYTFITIFFQYFIiLVIFNF 
HV*FILIFFIHDMK 

2558 

A 

88 

811 

GSGGNHSVCCDTMEGGGGSGDKTTGVIiAGFFGAGGAG\YSHADIA 
PLTRlWPLCPY\liNV\DPRYLVQGYKMKFIFTLPELIKTRGQDI,KLA 
FFTIGGCCMTGAAFGAMNGL\RLGLKETQNMAW\SKPRNVQILNNG* 
LGKG AL \ WA\ NTLGLWALLY \ S AFGV\ 1 1 EKTRGAEDDLNTVA\ AGT 
PAG \LiiJi IC 1<j\(jJjRGIA\RGGLiTGIjTIjS \sllctiyntwgahgkgs 
LASNSSLWKVFAQPP 

T r r q 

A 

1 

210 

VIIRIjPIGKKIjFRCKPG*TWAPPGIL*SGRPRSSSSSPF*N*MHFLF 
ERESSFHRCKWKAGITCVKSWYP 

2560 

A 

128 

687 

THVLWAFPGARVHLQTGS WRAGVAAAALALGGQASLS YARGKES *GH 
S PPPTHTHKAASGGREKAVS PS VAWVSHG PS PLLS E YTS SQVKGHSG 
PGPDPWTHL/WAPFPDQIiPPSPCPPLSRKEPEPSPLSHPPQGVTPG 
QAQIGQLRTSGLWGHKRGGWARAEGAPVSLLLRGPFDGPATVSEAQR 

2561 

A 

246 

412 

RLGDIjPRRDRQI^GRERHGGHRJIPGERRVCRAAGLEAHYVAGDLPRC 
RHGRRTRG 

2562 

A 

1 

414 

AWHEETHKVDLGLPEKKKNXKVVKEPETRYSVIjNNDDYFADVSPIiRA 
TS PS KSVAHGQAPEMPIjV1QQCKKKKK\ VSAALGKEVKRSQSKIjRIjRN 
TSP* VMTLRPPQRKR* SPKRR* SSQSSRSQL*KGRKRRRGKRGG 

2563 

A 

198 

531 

RNNPLLSLLALCLVHSSHLICWALPDEGGGQGKERALNSRKIATFFL 
SSQGTQFGQWDTAGFENEDQKLKFLRLMGGFKNLS PS FSRPASTI AR 
P\HMALGKKAADSLQQNIi 

2564 

A 

190 

510 

KSISSTGATVITGSCLTLYPSGKGGCSASYSLQICSHRFLLDGSVTC 
HDETGKIiAGRERHGGHRRPGERRVCRAAGLEAHYVAGDLPRCRHGRG 
FHGNRVRRKAANS 

2565 

A 

1 

1977 

MEVDFKIRKVGQWTITLQEHPVSVLLFIEETHPENKPTSTAVEESHI 
SRDTVMWFQCQLDWSKGYLEAYLTKINSICSHRFLLDGSVTCHDET 
GS S RDARDTVGTGVRVNDE S VEQLGLRRTMWLG I CRGAAMAAVSTVT 
AFAGRPRPGRSRNPRGWAGDSKWTSGSRRSWLSRGGGEISPTGMITK 
THF\VYIX5LPEKKKKKRVVKEPETRYSVIjNNDDYFADVS PLRATSPS 
* \DVAHGQAPEMPLVKKKKKKKKGVSTLCEEHVEPETTIiPARRTEKS 
PSLRKQWGHLEFL\RGKRKNKKSPLAMSHASGVKTSQ\DPRQGEEE 
TRVGKK\SKKHKKEKKGGPGPHSI*\ PVQDPWFCEAREARDVGDTCSV 
GKKDEEQAALGQKRKRKSPREHNGK^nCK3CKKIHQEGDALPGHSKPSR 
SMESSPRKGSKKKPVKVEAPEYIPISDDPKASAKKKMKSKKKVEQPV 
I EEPALKRKKKKKRKES GVAGD PWKEWF PSEMD FAVGELRETDTDL 
EWLEKKGNMDE AH I DQVRRKALQEE IDRES GKTEAS ETRKWTGTQF 
GQWDTAGFENEDQKLKFLRLMGGFKNLS PS FSRPASTIARPNMALGK 
KAAD S LOONLORD YDRAM S W KY S R G ART *Ci P <3 TA PNK T P Y I DRN AS KS 
VKLED 

2566 

A 

1781 

2150 

VRRFAAAVATLPPGAAPSG/PAPGT/PGPQTRIPGFPWPPLCPLMSG 
SLGP*TRLLLVPPAPQGPAGSPGPIQAGRGLAAPPPQPLSPSRGPCA 
GSSWELRLEGLSDWGTCPVPCPCHYIHPPKN 

2567 

A 

1 

345 

GRGWSLLGATAMDALEGESFALSFSSASDAEFDAWGYLEDIIMDDE 
FQLLQRN FMDKYYLE FEDTE ENKL I YTP I FNEY I S L VEKY I EEQLLQ 
RIPEFNMAAFTTTIiQPNRLSS 
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2568 

A 

1 

582 

SGRALHAS WAAGGVGAYSGRIiRSDAIjEGES FA\ LS FS S ASDAEFDAV 
VGYLEDI IMDDEFQLLQRNFMDKYYL\EFBDTEENKIiIYTP\ IFNEY 
I S LVEKYI EEQLLQR \ I PEFNMAAF \ TTTLQH/H IRDEVAWVTI FGN 
AGSTFHKI FLGF * RKCFLD YRAEKES \RGL\ DLSSGLV\ VTSLCKS S 
Sl>\ PASQNNLRH 

2569 

A 

2 

705 

GS VGAATVRPQPNALRCRLRSRAGRS PVLVFGVG I KQPPCP S KKAKT 
KTTKKRP\QRATSN\VFAMFDQ\SQIQGVQRGPFNMIDQ\NRDGFHS 
DK/EKVLHDMLAS\LGKNPH*WHYLDA\M\MNEGPKGPFKFHQCSLT 
MFWLRKLNG\TRSPEDCHPETAFCFAFD*KKAT\GTHFRKDYLRRSC 
*PTHGGIRFPD/EREVDELYRGGTLLDQKKGNFQITFEFHTASWJCHG 
SQRPKMT 

2570 

A I 

1006 

1022 

TRNSFLFFIYIYLFLRMGSCSVAQGWKCSGTILG\HCSLCLRIiPG\S 
SNS PTS ASHVAGD / SHGMCRHARLI FAFLVEMG\ FHPCWPGLGLELIi 
TLRVIHPTLGLPKCWGL*AGGSAKIGRRAQIG 

2571 

A 

668 

863 

TFCF FSDLKYS LASCENHL VCFLFVCFAFL V* T * NYFCL * TVNL I \ F 
S VRVTELCPF I F I VL I I T 

2572 

A 

1 

1282 

MRAVSLLVIiVLGVTMSYVMPWPQTENYTIYFPGSEAFKIiGRAISSIiP 
SIQYPGSSAYRWPVMELDSLYOTARRALMGNIiANVIPWGWSEKMTIB 
YLNPGTGELLNNSRGDAPHSEHVHHRRVPRAGNAAASDCCVGAAAPP 
GQDRRGGASGVdKTGSRCPLGVGLAWYATRLIiPQTGETRIKNGQAA 
FWGHRHRRHQNFGVFEPLQCIiAQKGELVKIiAPLGGLEALVVRFLTKR 
FIGDYERNAGNLYTRQVQI EGETLALQVQDTPG\ FRSMRTA* AAVNS 
L I * C IGCAHAGGMGYS I TDY KS YEL I SQLHQHVQKLHLGTRXiP VWA 
ANKADLLHIKQVDPQLGLQLASMLGCSFYEESVSENYNDVYSAFHVL 
CKEVSHKQQPSSTPEKRRTSLIPRPKSPNMQDLKRRFKQALSAKVRT 
VTSV 

2573 

A 

258 

579 

FKQICLFLLLFFEI^SCSVT\RLECSGTISAHCNIiHI»P\GSSDSPAS 
AS PVTGI TGTCHHAQIj I FL YF YYMTG FRHVAQVGVELLG \ * AI HLPW 
LPKVLGLQAPAPDL 

2574 

A 

157 

371 

GGPGNGLHRLGFSPYNVSKTALLGLNKTLAIELAPRNIRVNCLiAPGIj 
IKTSFSRMLWMDKEKEESMKETLR 

2575 

A 

1 

1431 

IWKAGLLGLCARAWNSVRMASSGMTRRDPIJUm^ALVTASTDGIGFA 

IARRliAQDGAHVWSSRKQQNVDQAVATLQGEGLSVTGTVCHVGKAE 

DRERLVATAVKLHGG ID I LVSNAAVNPFFGS IMDVTEEVWDKTLD IN 

VKAP ALMTKAWPEMEKRGGG SWIVS S I AAFS PS PGFS PYNVSKTA 

LLGLNNTLA I ELAPRNIRVNC / LAPGL I KTS F SRM\ LGE PEDCAGI V 

SFLCSEDASYITGETWNLSVMFTGGGVCRAASWKEGGTGTPRTPRE 

SPRQREPGETSSTDTQENKVWNGLPANPQRPAAEGPVRRKTNKQKGI 

ASTSAKDS INIRTGKDIHTKTPS IGHQHQRPKVDKTTKMERNQSKKA 

ETSRNQNVSSLPKEYKSSPAREQNWMENKFDDLTDVSFRRSVITNYT 

QLKEHVLTHCKEAKhTLDKMLNEWL^ 

HEGYTSFNS 

2576 

A 

201 

398 

SLRLS SHQT YS PSS FVPLLVRTC I PLEEKRRSDFWKFQP FCSGFS PS 
LWFYLPLVFDVGDLWMGF 

2577 

A 

195 

434 

TSVPSCVRCRYIILRSSSALTNILGTHSNSSFFHASSSAIiHSCCFPF 
FSWIiQTIiDINVKAPALMTKAVVPEMEKRGYRE 

2578 

A 

150 

284 

SMHKAGLLGLCARAWNSVRMAS SGMTRRDPLANKVALVTASTDG 

2579 

A 

1 

625 

MHKARLRGHCARAGKS VRLAS S GMTRRDPLTNKV ALVT AS TDWIGFA 
VAQRLAQDGAHWVSRRKQQNVDQAVATLQGEGLSMTGTVCHVGKMK 
DWERLVATGFSPYNVSKTALLGLAQT\LPIEL\APRNIRV\NCLAPG 
\LIKTSFSRMLWMDKEKEESMKE\TLRIKKV*ASPEDCAG\IVSFLC 
SEDASLHSLGKTVWDG\ GTPSRL 

2580 

A 

84 

1413 

ARKS VRMAS S RMTRRDPLTNKV ALETASTDG I G FAI ARRLAQDRAHV 
FVISRKQQNVDQ\AVATL\QGEGLSVTGTVCHVGKAEDRGAAWWPPA 
VKLHGGIDILVSNAAWPFFGSIMDVTEEV\WDKTLDI\NVKGPKP* 
MTKAWPEMEKRGGGS WI VSS I AAFSPS PGFS PYNVSKTALLGLTK 
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TLAIK\LAPRNIRVNCIAPGLIKTSF\SRI^W\MDKBKEBSMKETLR 
\IRRLGEPEDCAGIVSFLCSEDASYITGETVVN1iSVMFTGGGVCRAA 
SWKEGGTGTPRTPRES PROREPGENOAP <5 PPMar.T?PPT?PDT? q a ctc 

WMGPEP VGNRERPCKPARKVGAS PRALRRLGS ATRMLSRAALRAGVG 
ARAPRPGGGRGAHATATVWSGLASAAALTARALLKRSLACAAAPQRC 
GWRRLLCSSLGIRGSGWPCPPWSSI 

2581 

A 

1 

560 

MVYRQNQLPSSCAPEMGREDGLRAIiHPAQPPPPRAQQSPPPPLSA/P 
EEPPPPRAQQSPPPPLLTPHRRHPOSFPAPSAP^AT.QPnpPnT.YQAV 

TFSGSLSGPRTSTCTPSPASSPSAHGLGHIHTDAGVCSLEAQEEAKA 
PGGRGPSCSAHGALGPTCTSGVAGDSGPWGSQERALDPRRQGLQH 

2582 

A 

775 

1408 

TIPVAYVWADPPLPPKGHSCPHPPGSD/DGTRTTGTA/PPCLGFQGD 
PDPVHPAPAppDW**FPNMG0PGPPPPFPTWRAOT.OTjT.K < 5SVTJMRGW 
RWRNVPSGPHLPEQISLSELGHSHKPLFLRQNPNPTVPWDVMLPFPL 
LNLLYGKMWNWSLDLWGOIRNEEWRPTNI ILLAPWRHGRLQLKGAG 
ARGRQLPQLLLPALLLILVGYEMSF 

2583 

A 

3 

560 

GVPPKSASKFLCTRNGKGRWAOSPPPSTA\ PPPPAOOQPPPPT.QAnTf 
/KPPPPRAQQSPPPPLLTPHRRHPQSFPAPSAPSALSRQRPGLYSAV 
TFSGSL SGPRTSTCTPS PAS S PSAHGLGH IHTDAGVCSLEAQEEAKA 
PGGRGPSCSAHGALGPTCTSGVAGDSGPWGSQERALDPRRQGLQH 

2584 

A 

189 

608 

AFLTFNKLHRTDTCRGNKPHPPPPPGSKVRKRTWGFAPPGPGSPWAS 
LPHGLPPTAGRQGRGR/EGGGPRGGPGKGSAQPSSGTRAWRKPMGGC 
LTEDGPYGGPKGLWP*TTNG*K/PPGARPWPPLPLGCPLEPRHPVLG 
p 

2585 

A 

2 

847 

LKMAAEEPHQQKQEPLGSDSEGVNCLAYDEAIMAQQDRIQQEIAVQN 
PLVSERLELSVLYKEYA£DDNIYQQKIKDIjHKKYSYIRKTRA\DGNC 
FYSGFRILPLGRHCVJDDSKELQRFKAVS/WPRARVDLGCSQ/WGFT* 
ISQF*GIFHNKFHGT*LSRWRKQTSVADLLASFNDQSTSD\YIjWYL 
RLL\TSGYLQRESKFFEHFIEGGRTVKEFCQQEV\EPMW\KESDHIH 
I \ MALAOGLNGAI OQEDMDRGERmTNPW T P oprj q t? t>v n vt .t.vt? pnu 
YDILYK 

2586 

A 

468 

1191 

DVSRVGCLPPSRAMGWTLAiLRRGSAVSAiASGLVEEPMLGPPFHPT 
PRFKAVS AKS KEDL VS OGFTE FT T ED FTTMT PMDT .T pnVF KYVTQVZV m. 

LA5FNDQSTSDYLVGYLRIJ^TSGLL*GPRASSSSTSSKGGRDCPKEF 
CQ\QELEPM\*KESDHHSHHCAGPGPSACPSRVGVHGTAAEGGNHQS 
AHLSLRASEPKVYLLYRPGHTOILYKIGLGSSPLLPCCPPLPGARHV 
TEVFLWL 

2587 

A 

1 

326 

HS VNRKR LNRNARRKAAPR I E / CVP a T if? fp v A VfiT .np r>n pp nnfiQAM 
SLFAGS SHI RHAWDHAKS VRQNLAEMGLAVDPNRAVPLRKRKVLTWQ 
GLGPCLLYSSPAILT 

2588 

A 

879 

1528 

AAGAVVSAMPKAKG\KTR*\OKFGYSTL.RKR\Mc;PTJARRKAA/PAr;T. 
ECSHIRHAWDHAXSVRQNxxA*MGIxA\VDPN^ 

dieeea*knlyrkpx^ndleaepslpekkgntl\spj5lhltmyrym 
v\e^jhge\dykaia\rd\bknyy\qdtpkqiselrsijvy\krfypae 

WQDF\ LDS1jQ\ KRKMEVKVPGLHHSCPSAJBASPGPSEAG 

2589 

A 

1 

310 

EKKPKPRPPERSPRPLWVPEPQLEIRPJRRPTPAMLFRLSEHSSPEEE 
AS PHQRASGEGHHLKS KRPNPCAYTP PSLKVI QR I AESHLQS I SNLN 
ENQASEEED 

2590 

A 

1 

716 

SALQRLEKKPKPRPPERS PRPRWVPE PQLEVGG ACS AIxAQS PSEKLD 
PACLKPLS * S * IRRRRPTPAMLFRLSEHSSPESEASPHQRASGEGHH 
LKSKRPMPCAYTPPSLKAVQRI \ AES \HLQS YSAILNENQGFRGRED 
ELGEFW\ELGYPREEDEEEEEDDEEEEEEEDSQAEVLKVIRQSAGQK 
TTCGQGLEGPWERPPPLDESERDGGSEDQVEDPALSEPGEEPQRPSP 
SEPGT 

2591 

A 

39 

370 

RGWSLVQTPNPRFKNVPGPPTPQVIAECEESQCGGLGTLGGL*LSPG 
P\G*TDSRGPPMMPSEAQGGGAASKGPLGPPQAAGIHQGTPLRMGEA 
LEGQAGQTAG PR VQDGL 
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2592 

A 

229 

358 

RS IMWFLQMAQLSTTI SQAQRATAFHFFTS KRFLPAPLLVPLG 

2593 

A 

257 

387 

RSIMWFLQMAQLSTTISQAQRATAFHFFTSKRFLPAPLIjVPLG 

2594 

A 

217 

398 

KAWWNLLRKSEPKSCGS *GGLSTSRERLVECSSPLGLGYCG* *LCHL 
QEPHYGSLHRMSS 

2595 

A 

1 

3386 

MAILPNVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSIIi 
SQKWKAGGITLPDFKXYYKATVTKTAWYWYQNRDIDQWNRTEPSEIM 
PH I YNYL I FDKPEKNKQ WGKDS L FNKWCWENWLA I CRKLKLDP FliTP 
YTKINSRW I K0LNVRPKTI KTLEENLGI TIQD IGVGKDFMSKTPKAM 
ATKAKIDKWDL IKLKS FCTAKETTIRVNRQPTTWEKI FATYS SDKGI» 
ISRIYNELKQIYKKKT^PIKKWAKDMNRHFSKEDIYAAKKHMKKCS 
SSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHC 
WWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYIiKDYKSCCYK 
DTCTRSHPGYHIAFSHQVFLGFANIFLILDDLESSEYWSGLCMIiCDE 
FVATVQCLGEKTYAQCGIAADGAHQNALNSNPSPHEAI ISKAFPTDG 
VGPAHGGVGPVAWMIWFVGEPERWQESQTGSPTLRDGLADRPSLKGS 
SQQPQAEGEKKEAAQTPVEIALIWPDLNSGANLWQSCECQAQGLRSC 
LWPHSQGRHASDGRTQPQPTPAVEPTLEVKGSLRAGTYCMVDPEQPV 
LQGRRECSTWLLESKSFTCWLFDRDFLSAYSRETEPIEGVCVDRKRE 
REREREIVLKELADAVIAAGKSNICRVPLLGPPAGPSHSIVNIPSGS 
YI/3KENIiVGKCALNVSVNVWQFIADVVm 

HDFS IDAKKAFGKIQQRFMLKAIiNKLGIDGTYLKI IRDSGTPGDAAG 
PLLQER VNATTLP S AH I PMAPS TLNQGNDE KAGDQRRARACVKSNNL 
YTEGCESWKSEGVWVPTLHSSAWSVSCGGACTDKELRNIiASRLKDWF 
GALHEDANRVIKPTSSNTAQGRFDTSILPICKDSLGWMFNKLDMNYD 
LLLDPSEINAIYLDKYEPCIKPLFNSCDSFKDGKLSNNEWCYCFQKP 
GELIAFGPLKVKGPVlLTNSQPSPSSAADPAADIIiTWAFIPRCNEEG 
YYKATQCHGSTGQCWCVDKYGNELAGSRKQGAVSCEEEQETSGDFGS 
GGSWLLDDLEYERELGPKDKEGKLRVHTRAVTEDDEDEDDDKEDEV 

2596 

A 

442 

2093 

CGFLQMGTVINHNIPGPEGYWHFHFFNFKAIiLARAAVGAARGIHIHR 
CGLTKRPDARRRRVAGAQGREPAISLPGDRAAGARATRTRGPGPAPK 
MPAIAVLAAAAAAWCFLQVESRHLDALAGGAGPNHG\NFL,\DNDQ\W 
LS TVS Q YDRDKY \ WNRFRDEVEDD YFRNW\ NPNKP FDQALi \ DPS KDP 
CLKVKCSPHKV\ CVTQDYQTAL \ CVSRKHLLPRQKEWGTWAQKHWGL 
DL * ILVK\ CKPC\ P V\AQS AMV\ CGSRWATPNS I PSCK\ LEFHACST 
GQK/SFATLCDG\PCPCSSQSLEPPKHKGRKGVPCTDKELRNIiASRIi 
KDWFGALHEDANRVI KPTS SNTAQGRFDTS I LP I CKDS LGWMFNKLD 
MNYDLLLDPS E INAI YLDKYEPC I KPLFNS CDS FKDGKPFLNNEWCL 
LPSQNPGGLP/CAQNEMNRIQ\KLSKGKSLLGAFIPRCNEEGYYKAT 
QCHGSTGQCWCVDKYGNELAGSRKQGAVSCEEEQETSGDFGSGGSW 
LLDDLEYERELGPKDKEGKLRVHTRAVTEDDEDEDDDKEDEVGYIW 

2597 

A 

1 

844 

LHQQAAVAWLPTSFLPSQEHCCSTEWRAPQPAASGVHGICFA/RPFA 
SSTVSTEVSGSFCYLPAP\WVYGFLL*TERPLAWWCLGKYWLQELLG 
CHPVPLAML*DCLALLKSSLVPMGSPTHCKAIVTEEPIiHIjPEPAAAA 
AHTS S PS S S CRCKGCLG WVWPGGSAPGPLGPLS FDPWTVSHQSGPS P 
AAPGPTTSSS *RPSAPSHGMQGLADSQGHLCRHSRKGLRSFLLSHTH 
HSTCPPHLLQTGMPRSPLISIDADSSPPRIHRLRGQGHFSGLAKGGQ 
R 

2598 

C 

145 

342 

MQLXXWPPPLLXPGPLQFLIQRSRGHHAPPAXASSRPGQDDXXXTC 
GAXLLRPRGHFLRAAXVPtt 

2599 

A 

4 

213 

AGKFLHRCPPCWSHEFTIP*KCLPQGGRTGRISEACL/HRPSHSLCP 
PQCRLEHLASCDPSTLASQSMIC 

2600 

A 

2 

589 

RPREPPEQELQRRREQKRRRHDAQQLQQLKHLESFYEKPPPG\LIRE 
DELKPEDCIPDVPGNEHAREFLAHAPTKGLWMPLGKEVKVMQCWRCK 
RYG \ HRTGDKEC P F F I KGNQKLEQFRVAHEDPM YD 1 1 RDNKRHE KDV 
R I QQLKQLLEDSTSDEDRS S S S S S EGKEKHKKKKKKEKHKKRKKEKK 
KKKKRKHKS 
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2601 

A 

3 

1105 

IWRPRPGLQGPSRASRWAAAGRGAARRKCPLGRSQSRSAGAAMSSCP 
GREDARAGGAWWPREPLDQELQRAREQKRRRHDAQQLQQLKRLESLV 
QGGHSGSQLPFGTTRNSGPDLGGLALPWLQFKEGPDDPGECGVKKIQ 
S YEKPPPGLIKPYLAQSIiSPSCVGFLS I S SSESSQLRGDPTAHPVQL 
PGGG I LPAAPGY&TREDE AKPEDS I PD I PGNE YARE FLAHAPTKGLW 
LPLGKEVRVMQCECWRCKR*GHRTGGKEWPFFFQSNPSLKQVRRAQ* 
NSPWYHSIQGNKQHERDVR\IQQLKQ\LLED\STSGEDRSSSSSSEG 
KEKHKKKKKKEKHKKRKKEKKKKKKRKHKSSKSNEGSDSE 

2602 

A 

144 

522 

IGKEE I KliLL FPDNLMEYTVNF * MYKPLELI S * FS KVTE YQVNTEK/ 
SNCIYt*QLQIENEIAKTI*FMIASKSIKYLEISLTKCN\KWRDILC 
I C IGRLS 1 1 KVXiVLPKLMCGGEKI FNPNPTGFW 

2603 

A 

105 

440 

RSTPDPVCLGIRSSGCRTADFREP*MLLSDRSSGSFVSEEYPAV*SV 
SLPliLGECLPVRLLRDGHSW*HDSDSS*VFKMADNFPAGFHRLGRSG 
DGRAEHRRCCLASQRCC 

2604 

A 

1 

1047 

MVSISWPRDLPASASQSAGITGLIGALVLSVGIYAEVERQKYKTLES 
AFLAP A 1 1 L I LLGWM FMVS F I GVLAS LRDNL YLLQAFMY ILGICLI 
MELIGGWALTFRNQQLLLRQQVLSHTLGCADLSDGPGSGPVKMFMG 
VPVI PAQPPELLASLRLS RGYGLVLS WLEPRYEKMI SGMYLGEIVRN 
ILIDFTKKGFLFRGQISETLKTRGIFETKFLSQIESDRLALLQVRAI 
LQQIiGLNSTCDDS ILVKTVCGWSRRAAQLCGAGMAAWDKIRENRG 
IlDRIiNVTVGVDGTLYKLHPHFSRI^1HQT^^CELSPKCNVSFLLSEIX3 
GKGAALITAVGVRLRTEASS 

2605 

A 

1 

690 

MVLISGPRDLPASASQSAGITAIiPSASSIMPSLQFLKYIKLILANPV 
LGKSYTVRSFCMFTSQFPSPLPLCRYFAAAVDDMNLGLSRAPANPFD 
P S VPLDCIiEATHVLDKQANCWSMQLAVLGTAAAG S QFHVRCAYFPFT 
FHHDC I TGLVPDWHQNPRLGS PLEADALKGFLADRC I PQTTHFCS PL 
TIPDSAATSSKCLSMKPGESNRYRADYRSGLNRGNVKGRERL 

2606 

A 

210 

426 

RGAPSSAARGFSAMPKSKRDKKVSLTKTAKKGLELKQNLIEELRKCV 
DTYKYLFIFSVANMRNSKLKDIRN 

2607 

A 

651 

835 

PRWSRFPDLVICPPRPPKVLGLQA* AIVCAS IS ITKMWLRNAGLTSS 
MNWPRRSARVLSVS 

2608 

A 

198 

757 

RPVGSGERRLVLPIFRCSCKANWVGRQLIiNTAPGYACCRFGPLFPTS 
ALSHLPALPPPWLPFPSFSTVSLTKTAKKGLELKQNIilEELRKCVDT 
YKYLFIFSVANMRNSKLKDIRNAWKHSRMFFGKNKVMMVALGRSPSD 
EYKDNIiHQVSLWPSRGGAKAKPPSLPLLPCDSLPQPLAGQRLRES 

2609 

A 

1 

1305 

MMRAHAP PTAVPACTAPRAPPPRDHRAG I AREALGRTGRSGFRDCLG 
TREVALQAREKAQAAAWQGEAGCS AHWVRHTRGAAPEGR PRVPRAAG 
VLTGCTVFRRTW I QRDAQ I QARQERWAKGE SGPWGELRGLATQPAVR 
AS GRRGRRLERQD I IiEVFRCLAAS WNGGFGAVKPPEVADARSGLGSG 
DSFCLVSVSLTKTAKKGLELKQNLIEELRKCVDTYKYIiFIFSVANMR 
NSKLKDIRNAWKHSRMFFGKNKVMMVALGRSP\SDEYKDNLHQVSKR 
L\RGEVGLLFTNRTKEEVNEWFTKYTEMDYARAGNKEGFHC*AWDPG 
\PLEQ\FPH\SMEPQLRQLGLPT\ALKRGV\VTIiLSD\YEVCKEGDV 
ADPRARASRS * KLFGVMRWLNFKVTI PNYMWDSQSGRFQQIGKTD\ L 
PESA\ SEFPEESDSEDDD 

2610 

A 

319 

632 

GEPNISGMSLPWGGPSGSTGMPAPAMPLPAMTPQC/C*GQGPV\GPC 
AS/CASA/CFRCGL*AGRFC/G/PCPGHC*CFYFSNSVFPNVECMGL 
REPAVLPAFHLGEEAGRI S SGQGRSQNLPA 

2611 

A 

156 

621 

LHVPMELaQSHS KDEVS PTYLGMSLPGVAPLGSTGDAS PCHAPPRPWT 
PQCSKGQGPCCAP/WCP/CCP/CPPCPWSHGCL*LLCDPCPPPALPA 
AEGQQPLCKLAS AVSQVVS RLGNFATCPWPLLMRPAS S PRWKAGRTA 
GSLRPIHSTLGNTEFEK 

2612 

A 

1 

585 

MAELLLQPLLQQDCDGSCSIPVGPRMQPAEASLTLNVQMKATRPPDG 
TERNI SRLNKR I LYKRWS LLCLSDAKLKVTQPKGEHYGI SHERFPPV 
PSRQSDREPPFDGEDEDELFQSIMEHNVSYPKSLSKEAVSVCKGLMT 
KH P AKRLG CG PEG ERD VR EHAF FRR I D WE KLENRE I Q P P F KPKVLD S 
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CRAP PHD 

2613 

A 

1 

1047 

MAALRAMGVS SLWTAAQSLEWS EWNPQVFAGVQDRKLRCLQVCRTES 
SGVRRCAGQKAQVFAGVQDRKRRCSQKAKLGPAGNKVISPSEDRKQP 
SNNLDRVKLTDFNFLMVLGKGSFGKWLADRKGTEELYAIKILKKDV 
VIQDDDVECTMVEKRVIJVLLDKPPFLTQIiHSCFQTVDRLYFVMEYVN 
GGDIJ^YHIQQVGKFKEPQAVDLKLDNVMLDSEGHIKIADFGMCKEHM 
MDGVTTRTFCGTPDYIAPEGLLPRSILPVNTALLAYALLWPVEESVD 
GCFQGQWDAVRHAGAIGLTTNVIDVFVFTQIIAYQPYGKSVDWWAYG 
VLLYEMLAGQPSTDQLHRIS 

2614 

A 

1 

1674 

MRSQGEVRRYLKRQSCRFGLGSREKDMVTPSASSVCLIDEQIPAAGE 
SYDIIDPRSKHKFKIHTYGSPTFCDHCGSLLYGLIHQGMKCDTCDMN 
VHKQCVINVPSLCGMDHTEKRGRIYLKAEVADEKLHVTGKACSSRSS 
IVGRHLDEGYTDIKAGPQLDAFLCISSQPPFGKNAIIITSSSSSIIT 
LLKHGPRRFYSFASCSCMSSTVSVTNPKILGKGFIDSVWVTCHPGPM 
DCDWRADMGWGPVRLCIWFSTRNHQEQEKFMGPVREPKGPSHNTWGL 
GELQDEIWVQTQGQTTSRNHIQKPPTQGQVNNMKAKRTKIVSSLIFP 
YSLQIDCNFQRKiOa^VVVRDAKN^IP^PNGLSDPYVKIiKLIPDPK 
NESKQKTKTIRSTLNPQWNES FTFKLKPSDKDRRLS VE IWDWDRTTR 
NDFMGSLSFGVSELMKMPASGWYKLLNQEEGEYYNVPIPEGDEEGNM 
ELRQKFE/RDDLG*VP*TPSASSVCLIDEQIPAAGESYDIIDPRSKH 
KPKIHTYGSPTFOTHCGSLLYGLIHQGMKCDTCDMNVHKQCVINVPS 
LCGMDHTEKRGR I YLKAEVADEKLHVTGKACS SRS S I VGRHLDEG YT 
DIKAGPQLDAFLCISSQPPFGKNAIIITSSSSSIITLLKHGPRRFYS 
FASCSC^SSTVSVTNPKIIXSKGFIDSVWVTCHPGPMDCDWRADMQWG 
PWLCIWFSTRNHQEQEKFMGPVREPKGPSHNTWGLGEIiQDEIWVQT 
QGQTTSRNHIQKPPTQGQVNNMKTOaRTKIVSSLIFPYSLQIDCNFQR 
KKKRAVVVRDAKNL I PMDPNGLS DP YVKLKL I PD PKNE S KQKTKTI R 
STLNPQWNESFTFKLKPSDKDRRLSVEIWDWDRTTRNDFMGSLS FGV 
SELMKMPASGWYKLLNQEEGEYYNVPIPEGDEEGNMELRQKFELMIIj 
DKFLKLSDLCFPCLRSGDDNRDCRIGLLGGFTQFTYWIIHMAVPHL 
C I LSHWKVFRG SNTHGAVITPDNNAFWNTS 

2615 

A 

3 

2089 

RPQLPGGGKRWLGGTMADVFPGNDSTASQDVANRFARKGALRQKNVH 

EVKDHKFIARFFKQPTFCSHCTDFIWGFGKQGFQCQVCCFWHK\RC 

HEFVTFSCPGCG*GDPTLDDPRRQATRF\KIHT\YGSPTF\CDH/CV 

GSLLYGLIHQGMKCDTCDMNVHKQCV\liNVPSLCGMVHTEKRGRIYI* 

KAEVADEKXHVTVRDAKNLIPMDPNGLSDPYVKLKLIPDPK\NESKQ 

KTKTIRSTLNPQWNESFTFKLKPSDKDRRLSVEIWDWDRTTRNDFMG 

SLSFGVSELMKMPASGWYKLLNQEEGEYYNVPIPEGDEEGNMBLRQK 

FEKAKIX5PAGNKVISPSEDRKQPSNNLDRVKLTDFNFLJWLGKGSFG 

KVMLADRKGTEELYAIKILKKDVVIQDDDVECTMVEKRVLALIJD 

FLTQLHS C FQTVDRLYFVMEYVNGGD\ LM YH J QQVGKFKEPQAVFYA 

AEI SIGLFFLHKRGI IYRDLKIiDTVML\DSEG\HIKIA\DFGDVQGN 

T*WDGSHGPGTFCGTPDYIAPBIIAYQPYGKSVDWWAYGVLLYEMLA 

GQPPFDGEDEDELFQSIMEHNVSYPKSLSKEAVSICKGIiMTKHPAKR 

MGCGPEGERDVREHAFFRRIDWEKLENREIQPPFKPKVCGKGAENFD 

KFFTRGQPVLTPPDQLVIANIDQSDFEGFSYVNPQFVHPILQSAV 

2616 

A ' 

1 

1136 

MTQISNIKQPFAMQHRFFISLLHWTLQIVYPALFLGLCERGRNGRCD 
QVPEGSVLSLLVVTVA^PGLAFPEVAPGTCGTSYVPDAGLHGPEVESKIj 
GLDPGSMTGTQPTRTPAQPAAAE PPALTGPGRVHQEQAVLACGFLG I 
YHLGAAS ALCRHG KKLVKDVKAFAGAS AG SLVAS VLLTAPEK I EECN 
QFTYKF AEE I RRQ S FGAVTPG YD FMARLRSGMES I LP PS ARELAQNR 
LHVSITNAKTRENHLVSTFSSREDLIKVLLASSFVPIYAGLKLVEYK 
GQKWVDGGLTNALP I \ LP VGRTVTI S PFSGRLDIS PQDKGQLDLYVN 
IAKQDIMLSLTU^VRLNQALFPPSKRKMESLYQCGFDDTVKFLLKEN 
WFE 

2617 

A 118 

1018 

VTARPRASRLKGLVQHG S VPGLHCATARMKH I NLS FAACGFLG I YHL 
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GAAS ALCRHGKKLVKDVKAFAGAS AG S LVAS VLLTAP / EKKLEECNQ 
F\ TYKFS \ EEI RRQS FGAVTPGYDFMA\ RLRSGMES \ ILPPSAHELA 
\ QNRLHVS I \TNAKTRENHLVSHFFS S \ REDPQLRSLLS PAS FVP\ I 
YART* S LVE YK\ GQKWVDGGLTNALPHPCPVGRTVTIS P \ FSGRLD I 
S PQDKGQLRS VCL I SAKQD I \ MLS LANLVRL\ NQALFP P SKRKMESL 
YQCG F\ DDT \VKFLL\ KENW FG I KC I KVL 

2618 

A 

60 

388 

GLDSFSATETRRTHTNTEPHSQSQEPSNGEPQKEEPAAESRDPTPGQ 
QTEEDQDTAE I PVRDMEGDLQELAS VKHRG * ILDLGS \ S VKVKI I PK 
EEHCKMPEAGEEQPQV 

2619 

A 

1 

4123 

MEEVEEDRFKENLEGTUiAGQLLGDEATQALQVLAVELDWVPGALHP 
QRLHRLGAALVERQPVREVDHLVLPAVDDEHGRRDLGHLLDVREGVE 
AVGLLGVAEGDAHARGERRVQHHRGTLVARGQVHGGHRADALPVQDD 
AVRADAVPGGAGAGSAAASNARAPFPPAGVPGPSSGCDPPVSPLSQV 
SAHWELCGPHILNASYLPARVRKPFLVHWPGQRTLFLPAALAHPLGH 
EEFRQLCPQMSPPNFGLSESPRPVRCQCNPGQHRGWWLRRWHPLPPA 
PSLGSGQVLGHLSTTSSHPGAPSPPGHWCAAPDPADPAPVTRPPRAQ 
SQARGTHLPPCPCRDPTTLLPHALGSDPRQTPSCKAGAWAGRSPQLP 
PGCHHSNERDTSPVEALGTLWPPPHGSGPRFLQDKGAAGQMAEQTEIi 
RAGHGRMAKLRSHRASWASPPDLDAAASPHLAPSAASADGLPATRAQ 
TPRPPPTPSRQAELPPGSPSPGAQGLPGGVDVGIEVPLGRPARAGTV 
AGG WGEDVAVEAGAQANVEAAHLAQVHG I AVREEDRVPGTRHAANI 
HAGDTVAAGALGGEDLDGVQLALAVXiEVGTLRQGFWWTLRGTDVETY 
PFSAPRAASHGVGRHEELPDPTGPCGGRLLSLTIHGVTIRYHALLWA 
RGPIMSKSQVLGEWEPVQGGKSSENDKWTMSDPGAEAPTCSRAASGV 
DKEQQGRWQGLWNSHIKPLKIRMVKQNNIIPGETQILLRFTGWESKV 
NAKKQLPVGI KCEPMDQENEQTGGHETDGHRI VSVLIHFPLIS ILS Y 
ATWGLSLLECIPGSPVCTLLVRFSNVGTRWSLEVRGSPCGFGSNKVC 
GVMTPEIKMVCVCEGKAGKAVGSGGVEGTKEVSTGNAEGPVRHEAVD 
GGVHLAFALLQGLLWSLLLGPPGLAGWGGGELDAVPDSTSSATNVSM 
WSAGPWSSEKAEMNILEINEKLRPQLAENKQQFRNLKERCFIiTQLA 
GFLANRQKXYKYEECKDLIKFMLRI^RQFKEEKIiAEQLKQAEELRQY 
KVLVHSQERELTQLREKLREGRDASRSLNEHLQALLTPDEPDKSQGQ 
DLQEQIAEGCRLAQHIiVQKLSPENDEDEDEDVQVEEDEKVLESSAPR 
EVQKAEESKVPEDSLEECAITCSNSHGPCDSIQPHKNIKITFEEDKV 
NSSLVVDRESSHDGCQDALNILPVPGPTSSATNVSMVVSAGPLSSEK 
AEMN I L E I NE KLC PQLAE KKQQ FR S LKE KC F VTQ VAC FLAKQQNKYK 
YEECKDLIKSMLRNERQFKEEKLAEQLKQAEELRQYKVLVHSQEREL 
TQLREKLREGRDASRSLNEHLQALLTPDEPDKSQGQDLQEQLAEGCR 
LAQHLVQKLS P 

2620 

A 

71 

547 

KQIPSLPISRPS\OiAVMVTMTIiKVGAPPMLFIQRSVVNSPSCSSESR 
TTVYFWPPSVSPFMTKLDQSLPTRTPTSAGVMLRKVFPPTSREKPTC 
FS SNA* KELP *GWQESGCSRQCPLSSRBTCRASCINESANARGEAVC 
VLGARWFLFTSETGNS 

2621 

A 

241 

452 

ENVMLC I IRAKEKNHMITLS YTVKAFG\ KLKTFFMI YKNKPQKTLSM 
IRREGNTFNPIKGRFEKNS/AGNIILNGIGLNAF/PIKSEKKVEYLL 
LFFQFS 1 1 LEI * INAI K* EK* I KGRFEKNS EVTLY 

2622 

A 

1014 

1432 

S ARLSLPKCWD YRREPPCPAS FLYLKNI ILYI IRAKEKNRMITLS YT 
VKAFGKIENFF\MIYNNKPQKMLYMMRTEGNTSNPIKGRFEEPTGNI 
1 LNRH K 1 GC F S I KS E KKVE YLLL L F * F S 1 1 LEL * VNAI K* EKLIKY 

2623 

A 

55 

316 

NFWGGGAPKAPPPKKGFFPKIPRGVLNRPPQKEKKLFFPPPVKLGPP 
KDFLKRAPP*TPKKFFFPPKP/YKFLGGGGPKSPPPKKRVFSQNPPR 
GFKSPPPKRKKIIFPPPRKIGPPQGFFKKGPPPLFIFFCLFPAFMET 
TCPFSKSLSKKFTKTK 

2624 

A 

1 

3669 

MGCTPSHSDLVNSVAKSGIQFLKKPKAIRPGCQGGSERGSIPLLVKN 
STCYDAGEGLAEEQPSPRRNQTTAKGLCQLMGDPASGKRKDMEGLIP 
GTKTSSSQLNKSQSHMAKDIPFKTQGSHGSQGADFSGDESEESSTQD 
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TSKWKRTAKCHTSSTQSHCYQTIHPAHEPEGKVDFPEPLVKAHQQAY 
TYLHSSLSKYBAILCIIHQATQTRELLQPMVSFLLLCFEEISQLLGE 
ISKDGEVLLQ.EVREDLAWPLKKREPQEQPNLLQQLLQYTVSKLQVLN 
GWASLTGSFLEGSSSYLHSTATHLENKLSTKRNVDERLLRALRQLE 
S LAS GCGD PG VQGLPLCS EDSG I G ADNE S VQS VDKLGKQTS WDLAPE 
PEEWKSVTSPHTEARQSGHTWQQSPFCLGSGRPQDCLLSGAPMAKVQ 
PRAQDEARSPCLSSTSPENITSPPLKLGTSTPCDSFGIGVSVEPHLS 
KTSRPMDASSLSDSEDSSPEEEEEDKMSSMSLCAWQEKTPHSRPQSS 
PADRESPFQARTRRLRSLQAQEMILKMKESISERIKFVPVPCGHQDW 
SEEEEGRTWPPRPSTVSGSRRAPERQTRSQSESCLQSHVEDPTFQE 
LRRVQRDL S QKLEAF YALGAKGQGQS QEQ I LQPRAAAVW PNGTCRVS 
PSNTTSRLKASLTKNFSILPSQDKSILQKCNPHPEDEQGKAGKLPNA 
IPSGEVSEAAKATDWNVRGCBTRTSVKKLIETFSPTESLRMLGDSKD 
AGASPCLRNCIMPPRFPKYTGLAPLYPKPQISPASGRES/PQNGHRL 
EALSTYLSPSA* SRSSQE *GAQL *NGGEPRAPPS \ PPMEVLMDKSFA 
S \RS PQKAAS PQRTPPRKPRSQGRERIiAPPGEHGLPQS * GPfc * APWT 
^UKaK>VFPA w PSJjTAQGQGVAGAAAS PGSQPWT * AAHQPPAKAPR *R 
VGLGVRQRRPPASTGSPGRPSPGTTPA\PIWTKQDLRVQPGQTKAEP 
\ KRS PPVGRKAS PTRTHWVPQADKRRRSLPS S YRPAQPS PS AVQTPP 
SPPVSPRVLSPPTTKRRTSPPHQPKLPNPPPESAPAQCKVPSPPTQH 
PEASPPFSIPSPSPPMSPSQEHKETRDSEDSQAVIAKVSGNTHSIFC 
PATSSLFEAKPPLSTAHPLTPPSLPPEAGGPLGNPAECWKNSSGPWL 

RADSQRRAALCALNPLPFLRRTASDRQPGGRPQPPTLDPTSTSYESQ 
LGQNR 

2625 

A 

1 

JO J 

PTRPPNFHSPHPTSPAAVGSAAIiRCSGRRRSLEVATSLPHPSPGPRP 
AADVKMSSSEEVSWIS\WFCGLRGNEFFCEVD\EDYHPGTNFNLT\D 
LNKQVP\HYRTSSKP*SLDPEPD\EEL\EDNPN\QSDL\ IEQAAEML 
ijWLUj in \AKi J-P1jPTRRHRPRCLGKVPSKGDFG\ YC\ PRVYCENPAN 
CFPIGLSDIP\GEAHGESSYCPQ/CAMDV*HTPSQSRQPSTRMGA*L 
IU1A ronuori i v^±t'j^xjt'Kjt'l\~Ktr/\\jkri>b VA/ KGFLiRGr KIH\ PMAY 

PAGSFQGRPATFKKPQFKTDSLIPSPTCPAVFCLFLSFLPTLSGTLY 
GF 

2626 

B 

44 

403 

MKAL I L VGGYGTRLRPLTLSTPKPLVDFCNKP I LLHQVEALAAAG VD 
HVILAVSYMSQVLEKEMKAQEQRLGIRISMSHEEEPLGTAGPLALAR 
DLLSETADPFFVLNSDVICEFPFQAM* 

2627 

A 

159 

311 

IGPSLALGSKVFLQKXQSLLFIRCWLIQALQDLWPQGWNTAEMGWA* 
QMG 

2628 

A 

545 

2302 

AACAP S PRTE S S TVS S LPLLQECLLGAMKAL I LVGG YGTRLRS / LTL 
S TP KPL VD FCNKP I LLHQVEALAAAG VDHV I LAVS YMSQ VLE KEMKA 
QEQRLGIRISMSHEEEPLGTAGPLALARDLLSETADPFFVLNSDVIC 
DFPFQAMVQFHRHHGQEGSILVTKVEEPSKYGVWCEADTGRIHRFV 
EKPQVFVSNKINAGMYILSPAVLQRIQLQPTSIEKEVFPIMAKEGQL 
YAMELQGFWMDIGQPKDFLTGMCLFLQSLRQKQPERLCSGPGIVGNV 
LVDPSARIGQNCS IGPNVSLGPGWVBDGVCIRRCTVLRDARIRSHS 
WQESCIGGSGGSWKLLTIDQDLMVAQFSTPSLPPTLKVGFLPSAGKE 
QSVLWVS LEEAEPIPD IHWG I RVLQPPPRARECAEVRMENVTVLGED 
VI VNDELYLNGAS VLPHKS IGESVPEPR 1 1 1 HDGG F AE EAVLWHQV 
LVDTGSTTGEGMRTALHVPTALPTTKLWGPLLADLWAQGWDTAEMGG 
GSCCLGRCRGQVRQDLFQLGQLIGTDIIRIWGWGSVKSEVAEDVMP 
LEY S QPTTLL FLRLLTRP S PRFP 

2629 

A 

124 ; 

1315 

RARGRARRWECAAGTGQAGCRHLRARDTPSTSAGAMKAL ILVGGYGT 
RLRPLTLSTPKPLVDFCNKPILLHQVEALAAAGVDHVILAVSYMSQV 
LEKEMKAQEQRL* IRI SMSHEEE\ PLGTVGPLPLARDLLSETADPFF 
VLNSDVICDFPFQAMVQFHRHHGQEGSILVTKVEEPSKYGVWCEAD 
TGRIHRFVEKPQVFVSNKINAGMYILSPAVLRRIQLQPTSIEKEDLP 
IMGKEGQLYAMELQGFWMDIGQPKDFLTGMCLFLQSLRQKQPERLCS 
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GPGIVGNVLVDPSARIGQNCS\IGPNVSLGPGVWEDGVCIRRCTVIj 
J^ARIRSHSW\LESCIVGWRCRVGQWVRMENVTVLGBDV\ IVNDEL\ 
YLNGASVLPHKSIGESVPEPRI IM 

2630 

B 

162 

276 

MGRRANESLNVLIVYHFLFQKGVCQRLQLVQVVLQHLLX* 

2631 

A 

2 

292 

HVFIDLPTLQERREIFEQHLKSLKLTQSSTFYSQRLAELTPGFSGAD 
IANMCNEAALHAAREGHTSVHTLNFEYAVERVUVGTGGAPWVKALLS 
RA 

2632 

A 

3 

2111 

S S CGLGGQHGLGAARVR S DFKEARARAP C I VY I DE I DAAG KKRSTTM 
FGFSNTEEEQTLNQLLVEMDVFYAKIEELKIiVNRRAETGVSLSRCST 
QK* KS *NWCILITVFYAAIEELKLMYPYHGVLCSSRRKIjS SCSNKEM 
TQ S E KS Q A * YRM TP LGMG TTDHV I VL AS TNRAD I LDG AL.MR P GRLDR 
HVFIDLPTLQRKGVALSPRLECSAAITTHCGLNLQGSS*GARTTMGV 
RRRAG*FLTESRCVAKAGVQAAISAHCNHHLPGEA/RKIFEQHLKSIj 
KLTQSSTFYSQRLAELTPGFSGADIANICNEAALH7U\REGHTSVHTL 
NFEYAVERVLAGPHGFQLWESSGFIiPEEWGQLLCQTLQNTPGFSPTA 
LRGREDWRCLCLTENLFSCNHLIiLCGNRKAFYNSSKAGRTEWEPQAS 
*KKIHDVLECSGPRADSSCTEWTLTDVWKI*DRK*SLTRTAKKSKIL 
S KEEQKWAFHE SGHALEGWMLEH/ TEAVMKVS I TP \ RT\ NAALGFG 
Q\ML.PKDQHLFTQ/ESSFERRFIALGGRAS\EHFSLNEVTS\GAQDD 
LRKVTPIAYSMVKQVGMAPGIGPISFPEAQEGLMGIGRRPFNQGLQQ 
MMDHEARLLVA\ KAYRHTEKVLQDNLDKLQALANALLEKEVINYEDI 
EAilGPPPHGPKKMIAPQRWIDAQEEKQDLGEEETEETQQPPLGGEE 
PTWPK 

2633 

A 

2 

2416 

FQANMAVLLLLLRALRRGPGPGPRPLWGPGPAWSPGFPARPGRGRPY 
MASRPPGDLAEAGGRA1/QS LQLRLLTPTFEG I NGLLLKQHLVQNP VR 
IiWQLLGGTFYFNTSRLKQKNKEKDKSKGKAPEEDEEERRRRERDDQM 
YRERLRTLLV\ IAWMSLLNAL\ STSGGS I SWNDFVHEMLAKGEVQR 
VQWPESDVVEVYIJIPGAWFGRPRLALMYRMQVANIDKFEEKLRAA 
EDELNI EAKDRI PVS YKRTGFFGNALYS VGMTAVGLAI LWYVFRLAG 
MTGREGGFSAFNQLKMARFTIVDGKMGKGVSFKDVAGMHEAKIiEVRE 
FVDYLKSPERFLQLGAKVPKGAIiLLGPPGCGKTIiLAKAVATEAQVPF 
liAMAGPEFVEVIGGLGAARVRSLFKEARARAPCIVYIDEIDAVGKKR 
STTMSGFSNTEEEQTLNQLLVEI^GMGTTDHVIVIiASTNRADILDGA 
LMRPGRIiDRHVFIDLPTLQERRB I FEQHLKSIjKIjTQSSTFY\ SQRLA 
ELTPRFS VAD I ANI \ CNEAALHAAREGHTS \ VHTLNFEYAVERVLAG 
TAKKSK\ILSKEEQ\KVVAFHESG\HPLVGWMLEHTEAVMKVSITPR 
TNAAIiGFAQMLPRDQHLFTKEQL FERMCMALGGRASEAL S FNEVTSG 
AQDDLRKVTRIAYSMVKQFGMAPGIGPISFPEAQEGLMGIGRRPFSQ 
GLQQMMDHEARLLVAKAYRHTEKVLQDNL\ DKLQALANALLE KEVIN 
Y* / EDI ETOjIGPPPHGPK\ KM I \ APQRW/ VSDAQREKQYLG\ EEETE 
ETQQPSLGGEEPTWPK 

2634 

A 

1 

395 

CEGNSTCSENEVCVRPGECRCRHGYFGANCDTKCPRQFWGPDCKELC 
SCHPHGQCEDVTGQCTCHARRWGARCEHACQCQHGTCHPRSGACRCE 
PGWWGAQCASACYCSATSRCDPQTGACLCHAGWWGRS 

2635 

A 

1 

3040 

MEGAGPRGAGPARRRGAGGPPSPLLPSLLLLIiLLWMLPDTVAPQELN 
PRGRNVCRAPGSQVPTCCAGWRQQGDECG I AVCEGNSTCS ENEVCVR 
PGECRCRHGYFGANCDTS ERGVGPVLiVGGAES WRDGAGS KVGRGRIR 
IiRGGSPEVAAGVRDAGRFRIiAGGTYSSTGAFHPLRSSPAJSCPRQFWG 
FLiU ISJiJjU tt ViitjO^ ED VTGQCTCHA 

SGACRCESGWWGAQCASACYCSATSRCDPQTGACLCHAGWWGRSCNN 
QCACNSSPCEQQSGRCQCRERTFGARCDRYCQCFRGRCHPVDGTCAC 
EPGYRGKYCREPCPAGFYGIiGCRRRCGQCKGQQPCTVAEGRCLTCEP 
GWNGTKCDQPCATGFYGEGCSHRCPPCRDGHACNHVTGKCTRCNAGW 
IGDRCETKCSNGTYGEDCAFVCADCGSGHCDFQSGRCLCSPGVHGPH 
CNVTCPPGLHGADCAQACSCHEDTCDPVTGACHLETNQRKGVMGAGA 
liLVLLVCLLLSLLGCCCACXPARTLRADPAPQGVSLGRKKAPHRLCG 


lotvo 
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RFSRISMKLPRIPLRRQKLPKVWAHHDL.DNTLNCSFLEPPSGLEQP 
SPSWSSRASFSS FDTTDEG PVYCVPHEEAP AESRDPEVPTVPAEAPA 
PSPVPLTTPASAEEAIPLPASSDSERSASSVEGPGGALYARVARREA 
RPARARGEIGGLSLSPSPERRKPPPPDP/GHQA*GVLDPRQAQRRCS 
WPCALTTAARLRGRAQPQQEETDAQRQIGAYGRTRQ/SPGPATQRRA 
PRGCPRRRQPSLRPRRPGPER\GRAPASWSPRTPAVPREARRGCLHV 
GR*RARQDSQPGPRRGGPGRAGPQGKAGAPTESQALRAASLARPRAP 
SDRNPG A* EGGDRLARA * D P PEEDPH PEAAAQE EPGRLVRAGQGGRT 
HP VAGCG S S ARS P S AS QRR PP PHTS HATGHGGLLLAGHRAAS GGCVS 
LAQVLQPLLDWSSVLSAGRAHPVGRGLTGA 

2636 

A 

350 

1256 

LCLFPLPARKMATNFLAHEKNLVDKFKYDDAERRFYEQMNGPVAGC/ 
AS RQENG A\ TV I LRD I ARAREN I HKS LAGS S G PG AS SGTS GDHGELV 
VRI7VSLEVENQSLRG\WQELQQAISSLEAR\LNVLEK\TWPG\HRA 
TGPQ\TQHVSPMRQ\VEPPSPRSPPTPAEG/DARDDDIALF\GSDNE 
\EEDKGGGTSLREE/RGFREFPGEKRPKKPRLV\AQLSFLLG*NPW\ 
DR*KDMAQLEACVRSIQ/LGTGRSWGASKLGAPWATGIRK\LQIQCV 
VEDDKVGTDLLEEE I TKFEEHVQS VD I AAFNKI 

2637 

A 

1 

304 

GKTFDTFCPLGPALVTKDSVADPHTLKICCRVNGKWQSGNTNQMVF 
KTEDL IAWVS Q \DV I LTGTP PGVGVFRKPPVFLKKGDEVQCE I EELG 
VIINKW 

2638 

A 

1 

927 

MALGSSVSWILHFSLQALMWSGRRLLTALLQAQKWPFQPSRDMRLV 
QFWAPHLVGPHLGLETGNGRGIINLNAFDPTLPKTMTQFLEQGEATL 
SVARRALAAQLPVLPRS EVTFLAPVTWPDKVVCVGMNYVDHCKEQNV 
PVPKEPI I FSKFASS IVGPYDEWLPPQSQEVDWEVELAWIGKKGK 
HIKATDAMAHVAGFTVAHDVSARDWLTRCNGKQWLLGKTFDTFCPLG 
PALVTKDSV?VDPHNLKICCRVNGEVVQSSNTNQMVFKTEDLIAWVSQ 
KPPVFLKKGDEVQCEIEELGVI INKW 

2639 

A 

3 

849 

PSGSDFAAAWPRLDAVTGASAQAPAALDLPPREP*LSAGGPVAAAA 
AARALAAQLPVLPRS EVTFIJVPVTWPDKWCVGMNYVDHCKEQNVPV 
PKEPIIFSKFASSIVGPYDEWLPPQSQEVDWEVELAWIWKRKAST 
SR*SGKGGLPVQSAAEAPAPASPSSDLTHQHTVPAVPDTRKQSASLL ' 
PDTAFPFTHLWIALATNMG* QLRDPLP * \flLVS PMI * PPVWPNPLPP* 
ATDAMAHVAGFTVAHDVS ARDWQMRRNGKQWLL * KTFDTFCPLGPA 

2640 

A 

2 

562 

ITPRFHLCISDPHNLKICCRVNGEWQSSNTNQMVFKTEDLIAVTVSQ 
*VTRAVIiSAPIiHLPHMWRLT*ALHLWLWPLSQPLVSLGPFALLQ\FV 
TFYPGDVILTGTPPGVGVFRKPPVFPQGRFSEKQREQGPPKPLAGLP 
SDLRSTGLCMCV*RKGLTLSWPALLQKGDEVQCEIEELGVIINKW 

2641 

A 

867 

1476 

LVPVLRRPLPLTSRPASPNSAQEDRLPPP\RRQEVNWEGELAGVIGK 
KGKPI KATDAM\ AHVGG\ FTVAHDVSARDWQMRRNGKQWlitiGKTFDT 
FCPLGPAIiCDQVTGVADPHNLK\ ICC\R\ INGEMVPERQTPNQMVIS 
RQRT* IALGLPSLVTLLPPGDVIPNIjGPPHLHPGVGVFRKPPVFL\ k 
KGDEVQCEIEELGVI INKW 

2642 

A 

3 

846 

LNLS CADVVWHQMDENLIJVT/ AATNGVVVTWNLGR PSRNKQDQLFTE 
HIOlTVNKVCFHPTEAHVLLSGSQDGFMKCFDLRRKDSVSTFFG\QSE 
SVRDVQFSIRDYFTFASTF*ERAMLQLWDIRRPDRCERMFTAHNGPV 
FCCDWHPEDRGWLATGGRDKMVKVVTOMTTHRA^ 

EW/RPECRHHLATCSMMVD/YNIYV/WDVRRP/YVPAACLRHRTSHG 
I / AWRTPRPS SVFGQTVVASLFGAAAGRAPGPCTPWDCSRKNPGRWG 
SYLDGPPF 

2643 

A 

1 

2693 

SPAALRADSVDGGSLLAPLLGLTDRAFSDCPDLADG7VMEKMSRVTTA 
LGG S VliTGRTMHCHLDAPANAI S VCRDAAQVG\ LAG\ RS I FKIYAI E 
EEQFVEKLNLRVGRKPSIaNLSCADVVWHQM^ 

NLGRPSRNKQDQLFTEHKRTVNKVCFHPTEAHVLLSGSQDGFMKCFD 
LRRKDSVSTFSGEATEAGPREWAMAGCVPILPVLSCRILRLHHSFAH 
GPMQDAESTANDARESWGCPLYPLGLCSGPQAGQSESVRDVQFSIRD 
YFTFASTFENGNVQLWDIRRPDRCERMFTAHNGPVFCCDWHPEDRGW 
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IoATGGRDKMVKVWDMTTHRAKEMHCV'QTIASVARVKWRPKCRHHLAT 
CSMMGDHNIYVWDVRRPFAPAAMFEEHRYVTTGIAWRHPHDPSFLLA 
GSKDSFLFQHLFRDARQAVKRANPEGLCYGLFGDLAFAAKESLVAAE 
SGRKPYTGDRRHP I FFKRKLDPAEPFAGLAS SALS VFETEPGGGGMR 
WFVDTAERYALAGRPLAELCDHNAKVARELGRNQVAQTWTMLRIIYC 
SPGLVPTAN/H/ RTTVWARVAPVASRS *TVST*RIWPQGWAVRRGWT 
AAKEMHGATQFCS / HS S ATL I TNEDNEETEGSDVP ADYLLGDVEGEE 
DELYLLDPEHAHPEDPECVLPQEAFPLRHEIVDTPPGPEHLQDKADS 
PHVSGSEADVASLAPEDSSFSLLSVSHALYDSRLPPDFFGVLVRDML 
HFYAEQGTVQMAVS VL I VLGERVRKD I DEQTQEHWYTS Y I DLLQRFR 
LWNVSNEWKLSTSRAVSCLNQASTTLHVNCSHCKRPMSSRGW\FCD 
R\CHTCASMCAVCHHWKGLFVWCQGCSHGGHLQHIMKWLEGSFPLF 
PQAAAHLCEYS 

2644 

A 

33 

222 

NGPM\CFVKQLEIPQYGYRNNVPTTTPRSNIiAKELEKYSKTSFEYTI 
NDNHTYGGLGCMRPRLV 

2645 

A 

1 

1377 

MRTAVLGQAPTQVQPAAVREDQNE I FYLLNVE YSES PFVAI SLRPAP 
MSLSHFPQLPVGIENDEEIKQLDEEIKELNESNSQMEADMIKLRTQR 
LRLGKVEGLSHSQPGVWGSAQQWSTHWTQFSQGWRHPLGLGPHQA 
TERDFQAPRRGPDLNLIKNKSKRRKKETPSKPLNFTPVPDGEDRHAC 
AGWRETAAPVPDGGRPPRLCRRRGRPPRLCRMEGDRRACAGWRETAA 
PVPEKRETAAPVPDGGRPPRLCRRRGRPPRLCRMGGDRRACAGWGET 
AAPVPEKGETAAPVPEKGETAAPVPEKGETAAPVPDGGRPPRLCRMQ 
ERPLLPLSAEGVACYLSSPGVFLLKLGKITTMESNLKTIEEENKVIE 
QQNESLiLHELANLSQSLIHSLANIQLPHMFLRIFVEQAFWIVTRDSD 
QDSCTSDLQIAGGHPGLLLAGDTRSSCGLQQESCVL 

2646 

A 

1 

2232 

MQDTEKDDNNlTOEYDNyDELVAKSIiLNLGKIAEDAAYRARTESEMNS 
NTSNSLEDDSDKNENLGRKSELSLDLDSDVVRETVDSLKLLAQGHGV 
VLSENMNDRNYADSMSQQDSRNMNYVMLGKPM^ 

VCLSSLECLRNQCFDLARKLSETNPQERNPQQNMNIRQHVRPEEDFP 
GRTPDRNYSDMLNLMRLEEQLSPRSRVFASCAKEDGCHERDDDTTSV 
NSDRSEEVFDMTKGNLTLLEKAIALETERAKAMREKMAMEAGRRDNM 
RSYEDQSPRQLPGEDRKPKSSDSHVKKPYYGKDPSRTEKKESKCPTP 
GCDGTGHVTGLYPHHRSLSGCPHKDRVPPEILAMHESVIjKCPTPGCT 
GRGHVNSNRNSHRRDIQVFNLQKQEMIDRLESLSGCPIAAAEKLAKA 
QEKHQSCDVSKS SQASDRVLRPMCFVKQLE I PQYGYRNNVPTTTPRS 
NIiAKELEKYSKTSFEYNSYDNHTYGKRAIAPKVQTRDISPKGYDDAN 
GYCKDPSPSSSSTSSYAPSSSSNLSCGGGSSASSTCSKSSFDYTHDM 
EAAHMAATAILNLSTRCREMPQNXiSTKPQDLCATRNPDMEVDENGTL 
DLSMNKQR PRDS CC P I LTPLEPMS PQQQAVMNNRCFQLGEGDCWDLP 
VDYTKMKPR\RIDEDESKDITPEDLDPFQEALEERRYPGEVTIPSPK 
PKYPQCKESKKDLITLSGCPLADKSIRSMLATSS\QELK 

2647 

A 

623 

4268 

LWACVLPGVTLPPFLGEKTSCKMEVDTEEKRHRTRSKGVRVPVEPAI 

QELFSCPTPGCDGSGHVSGKYARHRSVYGCPLAKKRKTQDKQPQEPA 

PKRKPFAVKADSSSVDECDDSDGTEDMDEKEEDEGEEYSEDNDEPGD 

EDEEDEEGDREEEEEIEEEDEDDDEDGEDVEDEEEEEEEEEEEEEEE 

ENEDHQMNCHNTRIMQDTD\ LDDNNNDEYDNYDELVAKS LLNLGKI A 

EDAAYRARTESEMNSNTSNSLEDDSDKNENLGRKGELSLDLDSDVVR 

ETVDSLKLLAQGHGVVLSENMNDRNYADSMSQQDSRN^ 

NNGLMEFMV*ESDEEVCRSSLECLRNHCFYLARKLSETNPQERNPQQ 

NMNIRQHVRPEEDFPGRTPDRNYSDMLNLMRLEEQLS PRSRVFASCA 

KEDGCHERDDDTTSVNSDRSEEVFDMTKGNLTLIiEKATALETERAKA 

MREKMAMEAGRRDNMRSYEDQSPRQLPGEDRKPKBSDSHVKKPYYGK 

DPSRTEKKESKCPTPGCDGTGHVTGLYPHHRSLSGCPHKDRVPPEIL 

AMHES VLKCPTPGCTGRGHVNSNRNSHRS LSGCP IAAAE KLAKAQEK 

HQS CDVSKS SQASDRVLRPMCFVKQLE I PQYGYRNNVPTTTPRSNIjA 

KELEKYSKTSFEYNSYDNHTYGKRAIAPKVQTRDISPKGYDDAKRYC 
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KDPSPSSSSTSSYAPSSSSNLSCGGGSSASSTCSKSSFDYTHDMEAA 
HMAATAILNLSTRCREMPQNLSTKPQDLCATRNPDMEVDENGTLDLS 
MNKQRPRDSCCPILTPLEPMSPQQQAVMNNRCFQLGEGDCWDLPVDY 
TKMKPRRIDEDESKDITPEDLDPFQEALEERRYPGEVTIPSPKPKYP 
QCKES KKDL I TL»S S \ CPLADKS I RS MLATS SQELKCPTPGGDGSGH I 
TGNYASHRSLSGCPRAKKSGIRIAQSKEDKEDQEPIRCPVPGCDGQG 
HITGKYASHRSASGCPLAAKRQKDGYLNGSQFSWKSVKTEGMSCPTP 
GCDGSGHVSGSFLTHRSLSGCPRATSAMKKAKLSGEQMLTIKQRASM 
G I ENDEE I KQLDEE I KELNESNS QMEADM I KLRTQ I TTMESNLKT I E 
EENKVIEQQNESLLHELANLSQSLIHSLANIQLPHMDPINEQNFDAY 
VTTLTEMYTNQDR YQS PENKALLEN I KQ AVRG I Q FR S E QL L 

2648 

A 

1 

568 

MPQQVQNHGNRNSLSDHSAIIOjEIjRIKKLTQNHTITWKLNNLLIiNDY 
WINNEIKVEINKFFDTNKNKDTTYQNLWDTGFPISKPDGLSQLEQDL 
QVFDLETKTREVLRDDFSEFRQITISKEAFTSEKNNECNKPEKSFSL 
DSTIDADQKVLRIQNTDDNDKNDMSFSQNSASGKHEHINLTQDFQSS 
I 

2649 

A 

157 

700 

WSLSSRAVMEEIPAQEAAGSPRVQFQSLETQSECLSPEPQFVQDTDM 
EQGLTGAPPVPQVPALPREGSPGDQAAALLTARYQEFVTFEDVAVHL 
TREEWGYLDPVQRDLYREVMLENYGNWSLGILLRLPTTRIHSVNSC 
PAL.SHTQASAFSGETLAVLTAGISKRWPKYRLPHRYCSSL 

2650 

A 

1 

2674 

MEGSGTGKRRGKAAKTSLRIMDARAQLLLRVPHPGPSLTSGALTHIR 
DPHPGLSPTSGTLMPGRRRGGPHSGPCTPSPEVPPRSAGIiGAVYSGP 
GEAWAPSASVAVMEE I PAQEAAGS PRVQFQSLETQSECIjS PEPQFV 
QDTDMEQGLTGAPPVPQVPALPREGS PGDQAAALLTAR YQS SQDVLF 
QEFVTFEDVAVHLTREEWGYLDPVQRDLYREVMLENYGNWSLGFPI 
S KPDG I S QLEQDLQVFDLETKTRE VLRDPCSDGETRE ENKLL IPKQK 
ISEEVHSYKVRVGRLKHDITQVPETREVYKSEDRLERLQ/ANSKEIS 
VPGERV*ANNNQQGNL\TSEKNNECHEPEKSFSLDSTIDADQRVLRI 
QOTDDITOKYDMSFNQNSASGKHEHLNLTEDFQSSECKESIjMDLSHLN 
KWE S I PNTEKS YKCDVCGKI FHQS S ALTRHQR IHTREKP YKCKECEK 
SFSQSSSLSRHKRIHTREKPYKCEASDKSCEASDKSCSPSSGIIQHK 
KIHTRAKSYKCSSCERVFSRSVHLTQHQKIHKEMPCKCTVCGSDFCH 
TSYLLEHQRVHHEEKAYEYDEYGLAYIKQQGIHFREKPYTCSECGKD 
FRLNSHLIQHQRIHTGEKAHECNECGKAFSQTSCLIQHHKMHRKEKS 
YECNEYEGSFSHSSDLILQQEVLTRQKAFDCDVWEKNSSQRAHLVQH 
QSIHTKENS*M**R/CGRYLSNSGFIQHLRVHT\GRNHVCTACGKAF 
SHSSAIAQHQIIHTREKPSECDE/CKKRY*C*TLNRLLQIYTSEKSY 
KCIECGKFFMLLVFSYLSHIWRIHMGIKFHCCNECEKAISQRNYLG* 
HQIHAMHKDYKC/N*ACMCVRRFSHNPTLIQHQRIYT*ENLFGCSKW 
QIFQ 

2651 

A 

28 

352 

VRVWVGNAGCRLC FRADGVAS GVGCAWDGDRMCL PES \ FIRAPVLSV 
STAGPQSS.QGSSGRSATPGEGAQGINRTLPSPALPSPPMAPPPSYLV 
GTS PAPHSDDGC WP 

o *r cr o 
2652 

A 

1198 

1889 

LKLPIFRDLKVQCGWVELENRLTKMLLSTSVLSCFSSRPNMSEVSCN 
KRYDYLEWPEYFMAVAFLSAQRSKDPYSQVGACIVNSENKIVGIGYN 
GKPNGVHN\SLFP*RRTPK\NKLDTKYPYVCHAELNA\ILNKKFRPD 
VKRPEVCMFAIiVPPGK*NGGLSLI I / LQAGI KRKL TFHVL, INYPG * h 
TGATA\ARLLF*YGPGVTF\RKFIP\KCSKIVIDFDSINSRPSRKLQ 

2653 
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TTA7T.PCT.2i VT^ A T«.7<riKTr^/ri V^TD TncnDTT T>T»t tv t7»t trfrtnntif't rr TVT T'/^i 7 

QDGQAMIiWDLNEGKHLYTIjDGGD I INALCFS PNR YWLCAATG P S I KI 
WDLEGKIIVDELKQEVSVQQQADHPRHLLPVCDAKLCCATRHL 

2654 

A 

45 

1111 

aiqchprrcsdtalaAamteqmtlrgtlkghngwvtqiat\prssrk 
iispppgekaiilrkpargwsqpmnfqrrsrrihshllrmwis*dg 
QF\ ALiSG\ s w\dgtlr\lgdi»tngh/ STRGRFCGP IPKDVLS VAFSS 
DNRQRLSLGSRRLNHQSLWKYPGVCANTTVPG * EPTQS WVLCVPLPR 
PNNQQTPINRSPCGIiGQSWSKLWK/ LWAKLQS *RTKPTLGHTGLSEP 
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R*LFSP\DG SLCASGG\ KDGQA\ LLWDLQRRPTPFTTLEWWGHQSTP 
LCFSPN\RY\WIiCGCHR/HPGIKIWDt.EGKDPL*MN*SKKLSVPAAK 
QNQPQCTSLAWSADGQTLFAGYTDNTjVRVWQVTIGTR 

2655 

A 

1099 

1712 

CALGEGNSGHRAFSKDRARLRPGWWSGRAKDELGKPKPLGPS*kRAD 
RR*GQSACDSSPPSGPGP/TGDGARGRC*AARPGPRARRRRLGPSGE 
DRSCQMGTGEEHHGRLWGLSVGPAPKPESAPSSLTCPREETPGRGRP 
PAPQFQSLARSHGIRGAGDKPTEEKSLLPQQRHRGGASSVGRGPLGF 
FPLEEAPLP YPWP PNLQ j 

2656 

A 

1 

900 

MDKFLDTYTLPRLNQEEVESLNRPTTGSEIEAIINSLTTKKSPGPDG 
FTAKFYQRTNDKNCTI I S IDVEKAFDKIQQPFMLKTLNKQGTDGTYL 
KIIRAIYDKPTPNIILNGQKLEEFPLKTGTRKGCPLSPLLFNIVLEV 
LARVIRQEKEIKAPAPDATAAEMLMPKKNRIAIYELLFKEGAHVAKK 
DVYTPKQPELADKNVPNLHVMKAMQSLKSRGYMKEQFAWRHFYVTYLT 
NEGIHHLRDYLHLPPEIVPATLCHSRPETGRPRPKGLEGSEEHLCAA 
AHRLRCGEHLCPTALSGM 

2657 

A 

465 

1269 

TSSSCSCHPPPNACCRGTSSVLSQKAGQAPHLPGPTGKPLRSSGSLR 
AVRAPG P IGGGQAPWS D P P VLQAPPRP PQAPRGS WRGAGPRQPGRDA 
APRMLLPFQPRYRTLQPQRLLMPKKNRIAIYE\IiLFK\EGVMVPKK\ 
DVP I PKH PE LAD \ KNAPNLRVMKPMQVS QGRGG YVKGQF \ AW\ RH F \ 
YWYLTKTGS KCQLiHTEAARIjGQP/ DI I PT * RTRAL * PATGHLSMRAR 
SLEVQRPTAAPSGPSWLSLliQKGMWTGKNEDLHFRRCR 

2658 

A 

1 

433 

LNLLMSPQGQVLSAHVSGRWMKSYLSGMPECKFGMNDKIVIEKQGK 
GTADETSKSGKQSIAIDDCTFHQCVRLSKFDSERSISFIPPDGEFEL 
MR YRTTKD 1 1 LS FRVI PLVREVGRTKLEVKW I KSNFKPS LLAQS S P 
KGQ 

2659 

B 

140 

480 

^FEFLYKMCDVMAAYFGKtSEENIKNNFVLIYELLDEILDFGYPQN 
S ETG ALKTF I TQQG 1 KS QHQTKEEQ S Q I TS QVTGQ I GWRREG I KYRR 
NELFLDVLEKVNLLMS PQGS * 

2660 

A 

100 

1674 

SAAMI GGLF I YNHKGE VL I SRVYRDD IGRNAVDAFRVNVI HARQQ\ V 
RSPVTNIARTSFFHVKRANIWIJ^VTKQNVNAAMVFEFLYKMSDEMA 
AYFGKI S\EEN I \ KNNFVVHYMELLDEILEFGYPQEFRD/ SGALKTF 
I TPAGHS RS SG FRQKEG S SQFT SQVTGQ IGWRREGI KYRRNELFLDV 
LESVNLLMS PQGQVLSAHVSGP/VWVMKNYLSGMPECKFGMNDKIVI 
EKQGKGTADETKQEPGSNQLLIDDCTF/HTQCVRLS\KFDSERSISF 
IPPDGEFELMRYRTTKDI ILPFRVI PLVREVGRTKLEVKWI KSNFK 
PSL\LG\QKIEVRIQPPLNT\SGVQVICMKGKAKYK\AGENSFVWKI 
K\RMAGMKESQIQRRGFELLLTNDKKKWGC/RPPFPMNFEVPF\APS 
GLKVG\ YLK\VLNPKLNYSDHDVIKKVR* \ IGR\ SGI YENS ACKATR 
QLAQL P Q P P FLQQVQVP L L PQTTHQ VS P S L P AL L P S PLHQAR S LRSG 
PKQHYKVGPVEPALGLPGQGEF 

2661 

A 

1 

356 

HPNRDGFRAPRECLCLVTRISTMFLSLQAENSSYFVSRELCAHSIRK 

LQESPGGGPGMRIaQVIjCVSTTSLRliKIFFGKPPW^ 

GSGSRGWAGAVPCLSTGVSGGMPR 

2662 

A 

303 

774 

DPKSCGRWRRGRNGNGELADLQAESHRARAAGRGLPGALARVQPEAA 
AAE VGARCGRGA \Aii.No 1D1? VSREJ-iCAHS IRKJjQAn V JjJj J.KAVrUj I r 
D PRENYS DKE S LS FM I DTM KSTLKER FQ F VE VPGNHC VHMS E PQHVA 
S I ISSFLQCTHTLPAQL 




7w* 

G A LARA * P POORWVS C PR / G VP / C3AL * R E FTM FLS LOAENS I DF I SR 
ELCAH S IRKLQAHVLL I K* VAQAGAQRS QLTGYGS LTWACPPLTTVK 
SHGTAPPHPLLHEPSPTSDLPPSMTLLPGGLPKKDLES 

2664 

A 

1 

400 

PNPLSTQMGTSGGKGHQELLEYSPGGGPPPGPRKQFHCSPPPPNPAS 
GKKGH / S P P PL / S PHS F S KVKFNQP AP PPP \ TPTPKQERKS * LKMLF 
NFSAPVDPTPAQMAPPGLFVPSPSPPLHRLPPQAWGQRHSQQ 

2665 

A 

119 

496 

TGAVS FQMPEETQTQDQPMEEEEVETFAFQAEI AQLMSLI INTFYSN 
KE I FLREL I SNS SD / GEPMGRGTKVI LHLKEDQTEYLEERRIKE IVK 
KHSQFIGYPITLFVEKERDKEVSDDEAEEKEDN 
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2666 

A 

1 

235 

VDSEDLPLNISREMLQQSTILKVIRKNLVKKCLELFTELAEDKBNYK 
KFYEQFSKNIKLGIHEDSQNRKKLSEIiYI,DV 

2667 

A 

1 

2424 

RRVREASGKSAARWAGPSLYKAGAGVARQLLQRPGVAVPLVLCGHLA 
KMPEBTQTQDQPMEEEEVETFAFQAEIAQWISLIINTF*LETKRSFL 
RELISNSSDALDKIPVLNA*HDPHSIRPLGKEL\HITLLPDQTKIRT 
\ LTI VDTWNLE * PKAD\ L INN\ LGTIA/ RSLGTKAFMEALQAG\ AD I 

SMIG\QFGVGF\YPA\YLVAEKVTVITKHNDDEQYAW\ESSAGG\SF 
TVRTDTG * TYGS VGTKVI LHLKEDQTE YLEERR I KE I VKKHSQF I G Y 
PITLF\VEKER\DKEVTDDAPQEKEDKEEEK\EKEEKESEDKPEIED 
VGSDEEEEKKDGDKKKKKKIKEKYIDQEELNKTK\PI\WTRNPDDLT 
N\ EE YGEFYKSLTNDWEDHLAVK/ HFFQFEGQLEFRALLF VPPTVLP 

fd\lfenkkkkn\niklyvrrvfi\mdnceelipeylnf\ IRGVGDS 

EDLPL\NISREMLQQKPKF*KFSRKNFGPKKCLEDFT\ELAEDKENY 
\ KKF Y \ EQFSKNIKLGIH \ EDSQNRKKLSELLRYYTS \ ASGDEMVSL 
KD FCPR\ MKENQKHI YY ITGETKDQVANS AFVERLRKHGLEVI YM I E 
P J. 1JH it V PQU \ KEF \ EGKTLV \ S VTKEGLELPEDEEEKK\ KQEEKKT 
KFENLCKIMKDIK\ EKKVEKVWSNRLV\TS PCCI \ VTSTYGWTANM 
ER I MKAQALRDNSTMG YMAAK\ KHLE I NPDHS HE PLGPKA\ EADKN 
DKSVEGILVIWLKETAPPS\SAFSL\ENPRTHATR\IYRMIKP\GLG 
\IDEDDPTADDYQWDLVTEENPP\LE\GDDDTSRMEEVD 

2668 

A 

524 

707 

AVSPSEGEQERGEKRAGA\CGSRL*SQHFVRPRWADHLRSGV*DQPG 
QHGETPVSTKNTKISWAWW*ASVIPATREAEAGEWLEPRRRRLQ 

2669 

C 

102 

323 

MTGRSREGKWSTVPASGCVHVCVNREKATLSHVFPVSLVFLGKPEG 
LPEVAPSCPGSVGLGPHPPPL.VRSRVP 

2670 

A 

146 

804 

TETRFSDSVAQQAGPGTFGPLLLLALPRLTFSPLLSFPTVGTKCGNP 
LWAPKJPCPAIxAAPSGCRMTWKEQGRKMGVNC/PRPVAACMCV*TGKR 
PPSPMFLPSPRFSSETRRAARGSSELPWVAGAWSASSSDS PL.PRSTA 
PRRSVCTHCGLTQPTRHACQKGAGVCVRGPRSEAKGGWRGRALLLLA 
ATPCPLSCPNLGLPAKVTPEQGLVRRGGRPAGP 

2671 

A 

1 

787 

LNSRVEPRVRSRTMETKPVITCLKTLLIIYSFVFWITGAILLAVGVW 
GKLTLGTYISLIAENSTNAPYVLIGTGTTIWFGLFGCFATCRGSPW 
MLKLYAMFLSLVFLAELVAGISGWLRHE I KDTFLRTYTDAMQT\ YN 
GNDE\RSRAVDHVQRSLE/SCCGVQD\YT\NWSTSPYFLEHGIPPSC 
crown \ i \iJUNPODIiHinjTVAATKV\NQKGCYT)LVT*FSWETN^ 
AGSGRLGI\AFSQLIGHAGWACC\LSRFHHGPIQY 

2672 

A 

1 

670 

xiwxjxr ■«■/ i>rtb/ v uy Lit. £dA1iCSCFRPKDTER/LRG/APEGFSRTDL 
HLA/WPVLTALISYHNYLDKTKQREMVYRLEQGLIHRCP/RQCWA 
LS ICS VEMPDI 1 1 KALPVL WKLTH I S ATASMAVPLLEFLSTLARLP 

nuixuirrtADVino vrHiolif I XIMr'oivr N{J Y X vLLAHHVTA.KPTiHK''Q 

P S PHLAMRW ASGS AS S PRWRTS P S LCE AGAL P P ALALDG I ACQ 

2673 

A 

1 

154 

LSHIPEDKDHQTOKLATQLLVDLAEGCHTHHFNSLLDIIEKVRAVVP 
GAGC 

2674 

A 

237 

410 

VGD VDS AS S QTAAAS PP AD F VLQWMD VGLS S E FLIiVLVNIj VKFNS C Y 
LDEYIAKMVQ 

2675 

A 

1 

414 

NYLDKTKQREMVYCLEQGL IHRCASQC WALS I CSVEMPD III KALP 
VLVVKLTHISATASMAVPLLEFLSSESPPCLRMHPRGSGCVTCAGFS 
GALSQDSLGNLGVS PSAAS RPGRLCPAAMS AJjLPGFEGRLIiS PL 

2676 

A 

1 

6371 

MPDSCGLHIP7VAHAAPSRDAP T AnT.Qapr2T?i4T3T , Tr««T»TDcirr dddbca op 

RYRKCGSRFRRRPGARGVRLSPRRGGPERGGAAREGFSGASWSTMAK 
PTSKDSGLKEKFKILLGLGTPRPNPRSAEGKQTEFIITAEILRELSM 
ECGLl^IRMIGQICEVAKTKKFEEHAVEALWKAVADLLQPERPLEA 
RHAVLALLKAIVQGQGERLGVLRALFFKVIKDYPSNEDLHERLEVFK 
ALTDNGRHITYl^EEEl^UDFVLQWMDVGLSSEFLLVLVNLVKFNSCYL 
DEYIARMVQMICLLCVRTASSVDIEVSLQVLDAVVCYNCIiPAESIiPIi 
FIVTLCRTINVKELCEPCWKLMRNLl^THLGHSAIYNMCHLMEDRAY 
MEDAPLLRGAVFFVGMALWGAHRIiYSLRNSPTSVLPSFYQGRWGDW 
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FVHSLQPNSGQLSVAAGLAHRVPRIQLLAMACPNEWSYEIVXSITR 

LI KKYRKELQVVAWD I LLN I I ERLLQQLQTLDS PE LRT I VHDLLTTV 

EELCDQNEFHGSQERYFELVERCADQRPESSLLNLISYRAQSIHPAK 

DGW I QNLQALMER FFRS ES RGAVR I KVLDVLS F VLL INRQF YEEELI 

NSWISQLSHIPEDKDHQVRKI^TQIiLVDLAEGCHTHHFNSLLDIIE 

KVMARSLSPPPELEERDVAAYSASLEDVKTAVLGLLVILQTKLYTLP " 

ASHATRVYEMLVSHIQLHYKHSYTLPIASSIRLQAFDFLLLLRADSL 

HRLGLPNKDGVVRFSPYCVCDYMYAGPRPRPMRLRASEALGLWEPER 

GSEKKTSGPLSPPTGPPGPAPAGPAVRLGSVPYSLLFRVLLQCLKQE 

SDWK\^KLVLGRLPESLRYKVL.IFTSPCSVX>QLCSALCSMGLPSPSG 

YPVTWPLGRGFMPGFGHQLSGPCTLERLRGAPEGFSRTDLHLAVVPV 

LTALI S YHNYLDKTKQREMVYCLEQGL IHRCASQC WALS ICS VEMP 

DIIIKALPVLWKLTHISATASMAVPLLEFLSTLARLPHLYRNFAAE 

QYASVFAISLPYTNPSKPNQYIVCLAHHVIAMWFIRCRLPFRKDFVP 

F ITK\ GLR SNVLL S FDDT P E KD S FRARS TS LNE R P KR YGLRG WE YG I 

FQRDLNSEAWSSPWGPGAKLGHYGQRLLSGVLVSFPSDLQGLATEA 

S VP PRLR I ARPPKQGLNNS P PVKE FKES S AAEAFRCRS I S VS EHWR 

SRIQTSLTSASLGSADENSVAQADDSLKNLHLELTETCLDMMARYVF 

SNFTAVPKRSPVGEFLLAGGRTKTWLVGNKLVTVTTSVGTGTRSLLG 

LDSGELQSGPESSSSPGVHVRQTKEAPAKLESQAGQQVSRGARDRVR 

SMSGGHGLRVGALDVPASQFLGSATSPGPRTAPAAKPEKASAGTRVP 

VQEKTNLAAYVPLLTQG WAE I L VRRPTGNTS WLMS LEN PLS PFSSDI 

NNMPLQELSNALMAAERFKEHRDTALYKSLSVPAASTAKPPPLPRSN 

TVASFSSLYQSSCQGQLHRSVSWADSAVVMEEGSPGEVPVLVEPPGL 

EDVEAALGMDRRTDAYSRCGAQLVAVLLRQRAGAVAWCSGWSVALGW 

RTDRASPPPGQPQSSSVSSQEEKSLHAEELVGRGIPIERWSSEGGR 

PSVDLS FQPSQPLS KSS SS PELQTLQD I LGDPGDKADVGRLS PEVKA 

RSQSGTLDGESAAWSASGEDSRGQPEGPLPSSSPRSPSGLRPRGYTI 

SDSAPSRRGKRVERDALKSRATASNAEKVPGINPSFVFLQLYHSPFF 

GDESNKPILLPNESQSFERSVQLLDQIPSYDTHKIAVLYVGEGQSNS 

ELAILSNEHGSYRYTEFLTGLGRLIELKDCQPDKVYLGGLDVCGEDG 

Q FT YC WHDD I MQAVFH I ATLM PTKDVD KHRCD KKRHLGNDFVS I VYN 

DSGEDFKLGTIKAATSVTLFLEVGWVGQCHSTVRGQFNFVHVIVTPL 

DYECNLVSLQCRKDMEGLVDTSVAKIVSDRNLPFVARQMALHANMAS 

QVHHS RSNPTD IYPSKW I ARLRH I KRLRQRVGNMGLPQRGVLAAQAV 

GRICEEAAYSNPSLPLVHPPSHSKAPAQTPAEPTPGYEVGQRKRLIS 

SVEDFTEFV 

2677 

A 

1 

5405 

makpts kd sglkekfki llglgtprpnprs aegkqte f 1 1 tab i lre 
lsmecglnnrirmigqicevaktkkfeehavealwkavadllqpert 
learhavlallkaivqgqgerlgvlralffkviqg/dypsne\dlhe 
\ rle vfkaltdngrh ityleeeij\dfvxiqwmd vgls se fllvlvn\ l 
vk™klvtldevqsqrngfkmicll\cvrtass\vdie\vsl7uilld 
avglanncl\paeslplfivtlcrtinvkelcepcwklmrniilgthl 
ghsaiynmctlmedraymedapllrgavffvgmalwgahrlyslrks 
pts vfps f yqamacpnews ye i vls itrl i kkyrkelqvvawd ill 
nil erllqqlqtlds pelrtivhdllttveelcdqnefhgsqeryfe 
lvercadqrpessllnlisyraqsihpakdgwiqnlqalmerffrse 
srgavrikvldvlsfvllinrqfyeeelinswisqlshipedkdhq 
vrklatqllvdlaegchthhfnslldi iekvmarslspppeleerdv 
aays as ledvictavlgllvi lqtkl ytl pashatrvyemlvs hi qlh 
ykhsytlpiassirlqafdflfllradslhrlglpnkdgwrfspyc 
vcdymepergsekktsgplspptgppgpapagpavrlgsvpysllfr 
vllqclkqesdwkvlklvlgrlpeslrykvli fts pcs vdqlcs alc 
smlsgpktlerlrg^egfsrtdlhlawpvltalisyhnyldktkq 
remvycleqgl ihrcarqcwals i cs vempdi 1 1 kalp vlwklth 
i satasmavplleflsttiarlphlyrnfaaeqyas vfai slp ytnps 
kfnqyivclahhviamwfircrlpfrkdfvpfitkglrsnvllsfdd 
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TPEKDSFRARSTSLNERPKSLRIARPPKQGLNNSPPVKEFKESSAAE 
AFRCRSISVSEHWRSRIQTSLTSASLGSADENSVAQADDSLKNLHIi 
ELTETCIiDMMARYVFSNFTAVPKRSPVGEFLLAGGRTKTWLVGNKLV 
TVTTS VGTGTRS LLGLDSGELQ SG PES S S S PGVHVRQTKEAPAKLE S 
. QAGQQVSRGARDRVRSMSGGHGLRVGALDVPASQFLGSATSPGPRTA 
P AAKPEKAS AGTRVPVQE KTNLAA YVPLLTQGWAE I LVRRPTGNTS W 
LMSLENPLSPFSSDINNMPLQELSNALMAAERFKEHRDTALYKSLSV 
PAASTAKPPPLPRSNTDSAVVMEEGSPGEVPVIiVEPPGLEDVEAALG 
MDRRTDAYSRS S S VS S QEEKSLHAEELVGRG I P I ER WS SEGGR PS V 
DIiSFQPSQPLSKSSSSP\EIiQTLQDIIjG\DPGDKADVGRIiSPEVKAR 
SQSGTLDGESAAWSASGEDSRGQPEGPLPSSSPRSPSGLRPRGYTIS 
DSAPSRRGKRVERDALKSRATASNAEKVPGINPSFVFLQLYHSPFFG 
DESNKPILLPNESQSFERSVQLLDQIPSY\DTHKIAVLYVGEGQSNS 
E\LAILSNEHGSYRYTEFLTGLGRL\IELK\DCQPDKV\YLG\GLGR 
WGEDGPVSPYLLGHD*HSWQAVFHIATLMPTKDVDKHRCDKK\RHL 
GNDFVS I VYNDSGEDFKLGT I KGQFNFV\ HVI RHPAG I TECNLGVPW 
QCRKI)MEGLVDTSVAKIVSDRNLPFVARQMALHANMASQVHHSRSNP 
TDIYPSKWIARLRHIKRLRQRICEEAAYSNPSLPLVHPPSHSKAPAQ 
TPAEPTPGYEVGQRKRLISSVEDFTEFV 

2678 

A 

1 

5469 

GASWSTMAKPTSKDSGLKEKFKILLGLGTPRPNPRSAEGKQTEFIIT 
AEILRELSMECGLNl^IRMIGQICEVAKTKKFEEHAVEALWKAVADIi 
LQPERTLEARHAVLALLKAI VQGQGERLGVLRALFFKVI KDYPSNED 
LHERLEVFKALTDNGRHITYLEEELADFVLQWMDVGLSSEFLLVIjVN 
LVKFNS C YLDE Y I ARMVQM I CLLCVRTAS S VD I E VS LQVLD AWC YN 
CLPAESLPLFIVTLCRTIOTKELCEPCWKLMRNLLGTHLGHSAIYNM 
CHLMEDRAYMEDAPLLRGAVFFVGMALWGAHRLYS LRUS PTSVFPSF 
YQAMACPNEVVSYEIVLSITRLIKKYRKELQVVAWDIL.LNIIERL1LQ 
QLQTLDSPELRTIVHDLLTTVEELCDQNEFHGSQERYFEIiVERCADQ 
RPESSLLNLISYRAQSIHPAKDGWIQNLQALMERFFRSESRGAVRIK 
VLDVLSFVLLINRQFYEEELINSWISQLSHIPEDKDHQVRKIaATOL 
LVDLAEGCHTHHFNSLLDIIEKVMARSLSPPPELEERDVAAYSASLE 
DVKTAVliGLLVILQTKLYTLPASHATRVYEMLVSHIQLHYKHSYTLP 
IASSIRLQAFDFLFLLRADSLHRLGLPNKDGWRFSPYCVCDYMEPE 
RGSEKKTSGPLSPPTGPPGPAPAGPAVRLGSVPYSLLFRVLLQCLKQ 
ESDWKVLKLVLGRLPESLRYKVLIFTSPCSVDQLCSAIiCSMLSGPKT 
LERLRGAPEGFSRTDLHLAWPVLTAX.ISYHNYLDKTKQREMVYCLE 
QGL I HRCARQCWAL S I CS VEMPD 1 1 1 KALPVL WKLTH I S ATASMA 
VPLLEFLSTLARLPHLYRNFAAEQYASVFAISLPYTNPSKFNQYIVC 
LAHHVIAMWFIRCRLPFRKDFVPFITKGLRSNVIjLSFDDTPEKDSFR 
ARSTSLNERPKSLRIARPPKQGLNNSPPVKEFKESSAAEAFRCRSIS 
VSEHVVRSRIQTSLTSASLGS7UDENSVAQADDSLKNLHLELTETCLD 

MMAR YVFSNFTAVPKRS pvge fllaggrtktwlvgnklvtvtts vgt 

GTRSLLGLDSGELQSGPESSSSPGYHVRQTKEAPAKLESQAGQQVSR 
GARDRVRSMSGGHGLRVGALDVPASQFLGSATSPGPRTAPA7UCPEKA 
SAGTRVPVQEKTNLAAYVPLLTQGWAEILVRRPTGNTSWLMSLENPL 
SPFSSDINNMPLQELSNALMAAERFKEHRDTALYKSLSVPAASTAKP 
P PL PR SNTVAS FS SL YQS SCQGQLHRS VS WADS AWMEEGS PGEVPV 
LVEPPGLEDVEAAIiGMDRRTDAYSRSSSVSSQEEKSLHAEELVGRGI 
P I ERWS S EGGR P S VDLS FQPSQPLS KS S S S PELQTLtjD I LGDPGDK 
ADVGRLSPEVKARSQSGTLDGESAAWSASGEDSRGQPEGPLPSSSPR 
S P S GLRPRG YT I SDS AP S RRG KR VERD ALKSRATASNAEKVPG INP S 
FVFLQLYHSPFFGDESNKPILLPNESQSFERSVQLLDQIPSYDTHKI 
AVLYVGEGQSNSELAILSNEHGSYRYTEFLTGLGRIilELKDCQPDKV 
YLGGLDVCGEDGQFTYCWHDDIMQAVFHIATLMP\TKDVDKHRCDKK 

rhlgndlwsivyn\nsgedf\klgtnqgpvnlwst*sstpl\dyec\ 
NIjVPL\QCRK\ dmeglvenqawpknrv* patlpfva\rqmp\ lhanm 
asqvhh * plotptd i yp \ s kw iarlr p i \ krlrqr i l/ ceeaallpt 
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PAIiPFV\HPPSHSKAPAQTPAEPTPGYEVGQRNAL\ISSVKDFTEFV 

O £" *~7 O 

2679 

A 

35 

239 

SPSSQPP\ PVI PH* PVCKQPTSCRSPRKRVLNLVGGGEGAASY 

2680 

A 

118 

359 

AVCKQPTS CRS PCCPSPGCQAS PC * AFRRLE I / CRAGF * A* ALGPGK 
SQGQNCPRSSPPPSP/CPLLKYGQNCG*KCPQRLLH 

2681 

A 

229 

455 

TMQGSLNKPLPSP*PRAATKIPNP/SPSPPPQPAPQTTPHSILSTKA 
MTPVTIQALFSRIVRYLCDCRRDSLTQSF 

2682 

A 

1 

373 

EKFRHNIAYFPQIVSVAARMLKDPRSYPGLALTSLYPQNTTLDLLDR 
GLQVHVWDACS SRSQVDRLVALARMRQSGAFLS TSEGL ILQLVGDA 
VHPQFKEIQKLIKEPAPDSGLLGLFQGQNS \ 

2683 

A 

72 

1031 

GCPRLILLRGRSGLEPGTFRKMAAARPSLGRVLP\DSSVLFLCDMQE 
KFRHNI AYFPQI VS VG \ SRMLKVARLLEVPVMLTEQ\ YPQGLGPTVP 
EAGD*GPSAAWQKTCFSM/CCLPLQQELDSRPQLRSVLLCGIEAQAC 
ILDPRSYPGLALTSLYPQNTT\WDLLDRGLQVHVVVDACSSRSQVDR 
LVALARMRQSGAFLSTSEGLILKLVGDAVHP\QFKEI\QKLIKEPAP 
DSGLLGLFQGQNSLPPLNSNPALREDHPPVHPGPQWEARFPPSLGFP 
RVGAIPPGSCRPLVGGGQWCCLPIGQLLPEMQMRLLETGWEMG 

2684 

B 

1 

792 

MGVLPSYFWDEEAEAQEGICLSLKQRMPVGRAGHSGHFLADATSRAL 
ARRREEWGKQEERGVKESSESTNTTSRMKTPKEDVRPGMTAFEPEAL 
GNLVEGRTS IDS ILKTVSNSFSLGACCLLSFHVLLDGTAGRGSDRDE 
QNAHSWPEQQARAHHHPESHIHLMGDESACIAYIRTRSTWTLAASH 
APPSRRDPCLAPPGWQMADRPLHRSGAPSVLPHSQDTDTEMNQERYM 
NASISFPKPPHSTSQQVLFIQPGEEREI* 

2 685 

C 

.10 

126 

MTAFRGGTPLPDLRPSLYPADPGGCAALPPEGVVHRDLK 

2686 

A 

2 

372 

HCIQQILEAVLHCHQ/MGVVHRDLK\PGEKKNQTLGRAVLLSFT*RR 
LQSLGLHQLHS LAPS AGTS W I S LL I LMVS VAGFAGT/ PGYLS PEVLR 
KDPYGKP / VDLWACGG * RFSGPLPGV ILYILLV 

2687 

A 

1 

1593 

MATI TCTRFTEE YQLFEELGKG AFS WRRCVKVLAGQEYAAKI INTK 
KLS ARDHQKLEREAR I CRLL KH PN I VRLHD SIS EEGHHYL I FDLVTG 
GELFEDIVAREYYSEADASHCIQQILEAVLHCHQMGVVHRDLKPENL 
LLASKLKGTVAVKIiADFGLAIEVEGEQQAWFGFAGT PGYLS PEVLRKD 
PYGKPVDLWACGVILYILLVGYPPFWDEDQHRLYQQIKAGAYDFPSP 
EVTOTVTPEAKDLINKMLTINPSKRITAAEALKHPWISHRSTVASCMH 
RQETVDCLKKFNARRKLKGAILTTMLATRNFSGGKSGGNKKSDGVKK 
RKSSSSVQLMESSESTNTTIEDEDTKVRKQEIIKVTEQLIEAISNGG 
FESYTKMCDPGMTAFEPEALGNLVEGLDFHRFYFENLWSRNSKPVHT 
T I LNPH IHLMGDES AC I AY I R I TQ YLD AGG I PRTAQSEETRVWHRRD 
GKWQI VHFHRSGAPSVLPHSQDTDTEMNQERYMNAS I SFPKPPHSTS 
QQVLFIQPGEERE I 

2688 

A 

3 

1607 

IPGSTISCSSVKPVLCLHSAARACKWSLGSGAEQQRLSPGPPVPSLT 
CLPSARMAT I TCTRFTEE YQLFEELGKGAFSVVRRCVKVLAGQEYAA 
KI I NT KKLS ARDHQKLEREAR I \ CRLL KHPN I VRLHD S I S EEGHHYL 
I FDLVTGGELVED I VARE YYS E ADASHC I QH I LEAVLHCHQMGWHR 
DLKPENLLLASKLKGAAVKLADFGLAIEVEGEQQACVGFAGTPGYLS 
PEVLRKDPYGKPVDLWACGVILYILLVGYPPFWDEDQHRLYQQIKAG 
AYDFPS PEWDTVTPEAKDL INKMLT INP S KR I TAAEALKHPWI SHRS 
TVASCMHRQEARGPA*KKFN\ARRK\LKGAILST\MLA\TRNFSGGK 
SGGNKKSDGVKES SESANTTI EDEATKVRNQEI I KVTEQLIEAISNG 
DFES YTKMCD PGMTAFE PEALGNLVEGLD FHR F YFENLWYRNS KPVH 
TTILNPHIHLMGDESACIAYIR TT\ OYTT5&nf5TPRTAO<3F''FTRVWHR 
RDGKWQIVHFHRSGAPSVLPH 

2689 

C 

6 

167 

MXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXWOO^ 
XXISKH* 

2690 

A 

310 

593 

LQVISEYLFWRINFFLFLF*TVSLSVAQARVQWRDPG\SLQPLLPGF 
KRFLAS ASRVARVTGAHHLHAQL/ 1 LYF * VETGFHHVGQAGLELLTS 
GK 

2691 

A 

58 

481 

LQVISEYLFWRINFFLFLF*TVSLSVAQARVQWRDPGSLQPLPPG\S 
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1 RDS *ASASRVARVTGAHHLHAQLFCIFK*RQGFHPGWSGWSSNLPDL 
QANPAASALPKVPGITGTSPLHPPWSINFKNIKIWNQBTSILVFFRL 

2692 

A 

1 

398 

DSIDARWFDLIRSINDPEHPLTLEELNWEQVRVQVSDPESTVAVA 
FTPTIPHCSMATLIGLSIKVKLLRSLPQRFKMDVHITPGTHASEHAV 
NKQLAERPLTPQPAYWYPGPSSCQGCYRCFLESLTMRN 

2693 

A 

3 

267 

AMVGGGGVGGGLLENANPLIYQRSGERPVTAGEEDEQVPDSIDAREI 
FDL IRS * AWPLTPOPAYWYPG PSS COGC YRCFL.F ^T.TMPKT 

2694 

A 

1 

942 

MVIYPLDKHFTSPCRSKLWGRRTPGPMCPLRPGSGLYVSARYWLARP 
SAAWPSQRLLLQRPSCSRVRT^ARAAVTASLAPQAENPQRGVSLARL 
RRHATPPSRSPEVRGLERKRTPARISGGGSAMVGGGGVGGGLLENAN 
PLIYQRSGERPVTAGEEDEQVPDSIDAREIFDIRRCWARAEELNWE 
QVRVQVSHFRGERWPESQKGFCAAGAGVIjYTUiEHRRVSDPESTVAV 
AFTPTIPHCSKATLIGLSIKVKLLRSLPORFKMDVHTTPGTHASEHA 
VNKQLADKERVAAALENTHLLEWNQCLSARS 

2695 

A 

3 

707 

AMVGGGGVGGGLLENANPLIYQRSGERPVTAGEEDEQVPDSIDARBI 
FD I RRCWARAGS GGLRWGEQ * YRAAGGAAS QQG VPGRG F * VGKGAES 
LFLHFPQ\LIRFLNDP\EHSLTLEELNWEQVRVQVSDPESTVAVAF 
TPT I PHCSMATL I GL S I KVKLLRS LPORFKMDVH I TPGTHAS EHAGK 
CGLVVRVLARAPPKQSEASPLHRSLWARLEMGNCVSEIARJDSSVWVI 
A 

2696 

A 

2 

604 

ARSHRISGGGSAMVGGGGVGGGLLENANPLIYQRFGE\RPVTAGEED 
EQVPDSIDAREVF\DLIRSHQMTPEHP\LTLEELNW\EQVR\VQVS 
DP\ESTVAVGFPKPTHSGTCRHGPPT,TRT»c;\ TJCVKA T.T.P'ST/P^ZXFcrp 
WDVAHLLPGD PLPQEHAVEQATLQ I KEAGWATALGRTPHIiLEWN\ Q 
CLSSPAPGTWAFVPLSLA 

2697 

A 

79 

690 

KVSKATAGCLRNYGVPTiAHT.'RTWA T PT?PP!OtfMPVVT?PT.QWMT.n r rWT.Q 
DPPGAG\TYPTLQPFQYIi\EEVHIS\HVGQAMLNFSFNSFHPDTRKS 
MH\RECGFHFASKPDT\NKVGLCQAPQNTGLGWNWKEGEVKRARSLC 
IRIPLPFARISFRQGKPHVEQIT/RGKFRL\NSEGKL*ADGSPMGNP 
RHS PMT\ QHHHDTYQEGEHRKP 

2698 

A 

1 

405 

RYHLLAHTTLTLAAVQDGFRTHDLTIxASHEENPAWLPLYGSVCCRLiA 
AQPLCMTQPTASGTLRVQETRVRAGELDQALGRPFTLSISNQYGDDE 
VTHTLQTESREALQSWMEALWQLFFDMSQWKQCCDEIMKIE 

2699 

A 

1 

1437 

MFSRI^HRSRVTVARGSALEMEFKRGRFRLSLFSDLPEDTELQRKLDH 
E I RMREG ACKLLAAC S QREQALEATKS LLVCNS R I LS YMGELQRRKB 
AQVLGKTSRRPSDSGPPAERSPCRGRVCISDLRIPLMWKDTEYFKNK 
GDLHRW AVFLiTjLiOT^GEH T ODTPM T T attyptt ,Tn t ci •poQU^/T .17a t? a npn 
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FELRLELYGACSTEEEGAxjTGGPKPwLATKIiSSSLGRSSGRRVRASLDS 
AGGSGSSPILLPTPWGGPRYHLLAHTTLTIxAAVQDGFRTHDLTLAS 
HEIWESFVQQAGEMONWAOVHGVLKGTI^FCYROPEDADTGEEPLIjT 
IAVNKVMGPLVETRVRAGELDQALGRPFTLSISNQYGDDEVTHTLQT 
ESREALQSWMEALWQLFFDMSQWKQCCDEIMKIETPAPRKPPQALAK 
QGSLYHEMVLSEPVAPGGPAIPTOQRPLQQRPTSHLAPVTSVREKGA 
GIGSAQKRK 

2700 

A 

78 

399 

FIHRPSDSGPPAERSPCRGRVCISGKKHSYPSO^PLPKHTASCPIS 
TSILTPLPLVDIjRIPLMWKIDTEYFKNKGDLiHRWAVFIjIjIjOLGEHIOD 
TEM I I VDRTLTYC I 

2701 

A 

2 

368 

WAGALGCSWLGIGAGFLSGREGACRRERGQQQTGTPLPSSLPIiLPS 
LEKRAGRTAATKARLDTLCLEPPLQLAFppGDPTYP\TSLSFSSLPG 
LPA/WPPEQQWGEENLSQQPEGV*QELQRDKGSFCPLPKSWPPL*RG 
GGARLQIJU^NRGWFPyWKGRGLQKGAGTAADRHTPAFLSPSPPFSGK 
ASWENSCHQSETGHSVPRAPSAAGLSSWGPHLSSLRFLFIiPCLGSRQ 
APGAAVGRGKFVTAARGGLTGVAEG 

2702 

A 

1 

588 

VRAGELDQALGR P FTLS I SNQ YGDDEVTHTLQTESREALQS WMEAX.W 
QLFFDMS KREGG WVEPLGRTGVCHFY * RPLFQD YVEGNVAGERAAWL 
SLLISALLPGQWKQCCDEIMKIETPAPRKPPQAXAKQGSIiYHEMAIE 
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PLDDIAAETDILTQREGARLETPPPWLAMFTDQPALPNPCSPASVAP 
APDWTHPL 

2703 

A 

46 

1825 

AAAGSETGLGSCLEGKGPAEGSGDSSRQAHPCLPLSLSSLLWKSEDT 
ELQRKLDHE I R\ LRKGPC\ KLLANLLP S EKQALEA\ TKSLLVC\NSR 
IL\SYMGE\LQR\RKRA\QLLGKTSRRPSDSGPPAER\SPCRGRVCI 
SDLRIPLMWKDTEYFKNKGDLHR\WAVF\LLLQLG\EHI\QDTEMIL 
VD\ RTLTD I S F \ QSNVLFAEAG PDF * TCGLEL YGACVEEEGALTGG P 
KRLATKL\SSSLGRSSGRRVRASLNSAGGSR/ GAIPFMLPTPEVGGP 
RYHLIJVHTTLTIxAAVQDGFRTHDLTLA SHEEN PAWL PL YGSVCCRLA 
AQPLCMTQPTASGTLRVQQAGEMQNWAQVHGVLKGTNLFCYRQPEDA 
DTGEEPLLTIAVNKVMGPLVETRVRAGELDQALGRPFTLSISNQYGD 
DEVTHTLQTESRGSTCRAW\MEAL\WQLFFDMSQWKQCCDEIMKIET 
PA\PRKPPQAL\AKQGSL\YHEM/AJLFEPLDDI\AAVTDILT\QREG 
ARLETPPPW\LAMFTDQPALPNPCSPASVAPAPDWTHPLPWGRPRTF 
SL\DAVPPDH\SPRASLRFAPLPTFSESPRTRGLLQAKGQPRTWLQS 
PV 

2704 


1 

370 

LVTRRAVLACLRQLVQREAAEVSEHAVMLAKDSREELTPDANIREVG 
LEGALLI LLDKETDERLCHD I KETLNYMLTSMAVEKLS LWLKLCKDV 
LAAS AGKYWTLS VKI I KLKFETLHKI VYP 

2705 

A 

1 

363 

TMES ITACLHALQALLDVPWPRSKIGSDQDIX3 ILLLNVLHRVILTRE 
SPSIQLASLEWRQIICAAQEHVKEKRRSAEGSISILPTILYLTIGV 
LRETAVKLPGGQLSSTVAASLQALKGI 

2706 

A 

158 

301 

I IMTLAEDLLCSAAQNSRLS AQRTQAGWLL I S ALMTLGNSL * KVYI P 

2707 

A 

182 

396 

CCCNVFNCFSLSLQTWALHSLSLIIDSAGPLYYVHVEPTLSLIIIW 
VNVPPTHAEVHQSLGRCLNALITT 

2708 

A 

1 

436 

SSYIPSFSIQIQLFPTHTFTL*HPVSWKNCRK*TRENLK/ ITAELEI 
FQEGI KVLETLVTVAEEHHRAQLVACLLP I LI S FLLDENSLGSATS I 
MRNLHDFALQNLMQIGPQYSSVFKSLVASSPALKARLEAAIKGNQES 
VKVKI 

2709 

A 

112 

501 

PKFNEKNCDQLGGLLQGMELAHSLLLNEEAYNQLGEVQKAEFIFEWL 
RYLEKLLLATSRNDVREKQKTLVEQLLSLLNS S PGPPTRKLLAKNLA 
ILYSIGDTFSVHEAIDKCNDLIRSKDDSPSYLPTKL 

2710 

A 

1298 

8400 

FPTNTTFPLLKVCYRFHYAEPAVPGHCRAGVRREGADWGTGSADKQR 
GPFFLLDYGKVWL\PCAQQNDVREKQKTLVEQLLSLLNSSPGPPTRK 
LLAKNLAILYSIGDTFSVHEAIDKCNDLIRSKDDSPSYLPTKLAAVV 
CLGSLYKKLGRILGNTFTDTVGNILKAMKSAESQGRYEIMLSLQNIL 
NGLGAAAAPCHRDVYKAARSCLTDRSMAVRCAAAKCLLELQNEAIFM 
WSTDLDSVATLCFKSFEGSNYDVRISVSKLLGIILAKAVISKHPGTA 
SRQSIRRVSLEEVLELLGTGFLRGSSGFLRASGDMLKGTSSVSRDVR 
VGVTQAYWFVSTLGGAWLEKNFAAFFSHILSIjASPSHPKATQTQID 
AVCCRRCVS F I LRTTIGGLLGEKAQLAAVKD I CQA I WKLKKVMDAVM 
SDGNLETRLGSTDVAASQHMLVCALQELGNLIHNLGTTAAPLLQDSS 
TGLLDSILSVILHPSISVRLAAAWCLHCIAVALPSYLTPLLDRCLER 
LTGHKSSPEAVTGFSFAVAALLGAVKHCPLGIPHGKGKIIMTLAEDL 
LCS AAQNSRLS AQRTQAGWLL I SALMTLASPKDLETEKSRGDSFTWQ 
VTLEGRAGALCAIKSFVSHCGDLLTEEVTQRLLPPLPCAVDLLTQEL 
AADLTAPDIQVAASTFLLPPLCHQDDLLILSPFLQETDHRFIEEQLL 
LGNG VACG S LE YDP YS I YE KD VEGD S V PKPLP P AL S V I S S AS KLFGV 
VCAHVGETQRLLI LEQLLDS I KHTKGARQQWQLHWS S VSSFLKYV 
AGSKGCLGPEEMKRFALTLVMGALESPNPLLRCAAAESWARLAQWD 
DGAFTAGLAQVSFDKLKS ARDWTRTGHS LALGSLHRYLGG I S SSQH 
LNS C IG I L YTLAQD STS PDVQTW ALH S L S L 1 1 DS AG PL Y YVHVE PTL 
SLIIMLLLNVPPTHAEVHQSLGRCLNALITTLGPELQGNSTSISTLR 
TSCLLGCAVMQDNPDCLVQAQAISCLQ\QLHMFAPRHVNLSSLVSCL 
CVNPCS P YM * LRRAVLACLRQLVQRQAAE VS EHAVMLAKD S REELTP 
VNWYQEWGTAI KI PKNLEAALELGNRQWLEWFGGLRRRQEDVAKSGT 
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SRDLLNGFDQNADJJNVDNEAQAVPAMAKRDQGTAQAIASESAS PKPW 
QliLHGVGPAGAQNSRTEVWELPPRFLRMHGNAQMSRQKSAAGVKPSW 
RTSARAVQRENEGSEPPHRVPTEALPSGASETPSQKNKKRRGSIiDPR 
WD PLTAYTVRLKKPH I LNAS CENANI REVGLEGALL I LLDKETDER 
LCHDI KETLNYMLTSMAVEKLSLWbKLCKBVLAASADFTAVTCVDTM 
QEEEGDKGDDASVLTTRRDEKSHPFTNPRWATRVFAAECVCRIINQC 
ENANSAHFDIALAQEMKKRDSRNDFLVIiHLADLIRMAFMTVATDHSDQ 
LRLSGLEMLLWIRRFATVPEPEFPGHVILEQYQANVARREFQTSVV 
SRDTDTAAKFIGAGSATVGVADSGAGIGAVFGSLIIVYARKLSLKQQ 
LLFYAILGFALSEAMGLFCLMISFFILFAIAWIASGWSDLNDLRRV 
HQLLVS SLTKIQAGKEALSHLYNESASTME ILAVIiKAWAEVY 1 1 AVQ 
RHKNHRQPLKTTTCLEDGIRNGSCSSDGLLDLVYADLGTLSRLWLAA 
LQDFAIjLTLPSEFASQLPAEGGAFYTAETSENAKLHYYNSWALILHA 
TALWIiTSTGFWADPDEGASNLSRPVTPTSMCQGSSSGATIKSPEDV 
YTDRFHLILGISVEFLCSLRSDATMESITACLHALQALLDVPWPRSK 
I GSDQDLGI ELLNVLHRVI LTRES PS I QLAS LEWRQ 1 1 CAAQEHVK 
EKRRSAEVDDGAAEKETLPEFGEGKDTGGLVPGKSLVFATLEIiCVCI 
LVRQLPELNPKLTGS PGVKATKPQ I LLEDGSRLVS AALVI LSELPAV 
C S PEGS I S I LPTI LYLT I G VLRETAVKL PGGQLS S TVAAS LQALKG I 
LSSPMARAEKSRTAWTDLLRSAIiTTILDCWDPEHFSLVDETHQELDE 
VSLIiTAITVFILSTSPEVTTIPCLQKRCIDKFKATLEIKDPWQIKT 
YQIiLHSIFQYPNPAVSYPYIYSLASCIMEKLQEIDKRKPENTAELEI 
FQEG I KVIiETL VTVAEEHHLVSSTRNTS S RS L F KHKS SLKIDFLNYG 
NAFVSVVNRSILSSAISHJtVRVKSSHSTRSSLVTCKAKEVSSCNTGKQ 
SFRPKKNLPRTLETPQGKQNTPKGMSDDFVIiNSLKGFKLLEAFSRFS 
DTTGGKEGRRFYSRHAVYALGECGLSHREGVEWLYAKKLEASCLIiTC 
DSHGMLQDL I PNI SQEE I S 

2711 

A 

2107 

2424 

TTRYLKKTSTTGQRKKRGRGNGSFPTENLVPSGTVTGSQQLGPPFR* 
N/HTEECWGPPT71DGRAGKGPRQQPGRAQRIYRWDPSDGTTWHHHRP 
CGSRGTDQPETK 

2712 

A 

1 

368 

FRREGRP AG AG AGFADS GG APS S CCG PTVR P VAATAENVLHVRG A FQ 
EFPVCQAGVQWRNHGSLRPPPP\GSSHPPISAS*VV^TTGMCHHVWL 
IF* *NFFVETGSPYVAQAXiILLMLGIKTL 

2713 

A 

1 

278 

LWSRSHPFCCSRGPLAMAGILFEDIFDVKDIDPEGKKFDRGDKFRLV 
IASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGRRG 

2714 

A 

188 

406 

HNSRKMPLLFLRLRLRAAEWQCSRITGWGLLFQLCVTCPMGGLLMRIi 
QGDANNLHGFEVDSRVYLLMKKLAF 

2715 

A 

123 

914 

PRAGGCGG S GRVTACLCACATLVWP PRFQEVXiLVLSGLVHARGCTY S 
QLWSRSHPFCCSRGPLAMAGILFEDIFDVEGY*PEGQEVWTECSRLH 
CE S E S FRMDL I LDVN I Q I Y PVDLGD KFR \ L V IAS TL YEDGTLDDG E Y 
NPTDDRPSRADQFEYVMYGK\VYRIEGD\ETSTEAATRL\SAYVS\Y 
GGLL\MRLQGDA\NNL\HGFEVGLPRVYTjLMKKLSIiLNLAIiNQPSIjP 
SHSGRGHCSSIiSGQPLFAHIiLRKGWLTVHRWRHL 

2716 

A 

1 

2415 

MGFHHVGQADLKLLTSDNAYDPDVNAKQIWIDKTVINDHICLTFTDN 
GNGMTSDKLHKMLSFGFSDKVTMNGHVPVGIiYGNGFKSGSMRLGKDA 
IVFTKNGESMSMIXJLAESKASLAAILEHSLFSTEQKLLAEIjDAIIGK 
KGTRIIIWNLRSYKNATEFDFEKDKTYDIRIPEDLDEITGKKGYKKQE 
RMDQIAPESDYSLRSKTVRITFGFNCRNKDHYGIMMYHRNRLIKAYE 
KVGCQLRANNMGVGVVGIIECNFLKPTHNKQDFDYTNEYRLTITAI.G 
EKLNDYWNEMKVKKNTE YPLNLPVED IQKRPDQTWVQCD ACLKWRKL 
PDGMDQLPEKWYCSNNPDPQFRNCEVPEEPEDEDLVHPTYEKTYKKT 
GI I SEYPPIEAYREGVTEEVIHEFYQFTCCIiKRRLSTRS S ILNAKNR 
RLSSQFENSVYKGDDDDEDVIILEENSTPKPAVDHDIDMKSEQSHVE 
QGGVQVEFVGDSEPCGQTGSTSTSSSRCDQGNTAATQTEVPSLVVKK 
EETVEDEIDVRNDAVILPSCVEAEAKIHETQETTDKSADDAGCQLQE 
LRNQLLLVTEEKENYKRQCHMFTDQI KVLQQR I LEMNDKYVKKETCH 
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QSTETDAVFLLES INGKSES PDHIWSQYQQAliEEI ERLKKQCSALQH 
VKAECSQCSNNES KSEMDEMAVQLDDVFRQLDKCSI ERDQYKSEVEL 
LEMEKSQIRSQCEELKTEVEQLKSTNQQTATDVSTSSNIEESVNHMD 
GESLKIjRSLRVITVGQLLTU^IVPDLDIjQQVNYDVDVVDEILGQVVEQM 
SEISST 

2717 

A 

2 

965 

NCEVPEJEPEDEDLVHPTYEKTYKKTLKRRLSTRSSILNAKNRRLSSQ 
FENSVYKGDDDDEDVIILEENSTPKPAVDHDIDMKSEQSHVEQGGVQ 
VEFVGDSEPCGQTGSTSTSSSRCDQGNTAATQTEVPSLWKKEETVE 
DEIDVRNDAVILPSCVEAEAKIHETQETTDKSADDAGCQLQELRNQL 
LLVTEEKENYKRQCHMFTDQIKVLQQRILEMNDKYVKKETCHQSTET 
DAVFLLESINGKSESPDHMVSQYQQALEEIERLKKQCSALQHVKAEC 
SQCSNWESKSEMDEMAVQLDDVFRQLDKCSIERDQYKSE 

2718 

A 

276 

482 

GDDFSLQQINAELLFRPTALSTPSFSSPKESVPRRHLSEGTNSYATR 
LLNNHQVPPQSGTWGANRVSG 

2719 

A 

329 

3354 

PVRQFIFPKLPIRHCLEAEVGGRGQTCAPPRKWAVPIGLHSRSATSQ 
SGCGGGRSWLCSSLKMAAQPPRGIRLSALCPKFLHTNSTSHTWGHSV 
QLLEL I G * CLW I PDVNAKQ I WGLTKQ W I \ NDH I CLT FTDNGNGMTSD 
KLHKMLSFGFSDKVTMNGHVPVGLYGNGFKSGSM\RLGKDAIVFTKN 
GESMS VGLLSQTYL\ EVIKAEHVWP I VAFNKHRQMINLAESKASLA 
AILEHSLFSTEQKLLAELDAI IGKKGTRII IWNLRS YKNATEFDFEK 
DKYDIRIPEDLDEITGKKGYKKQERMDQIAPESDYSLRAYCSILYLK 
PRMQI I LRGQKVKTQLVS KSLAYI ERDVYRPKFLSKTVR I TFGFNCR 
NKDHYGIMMYHRNRLI KAYEKVGCQLRANNMGVGWGI I ECNFDKPT 
UNKQDFD YTNE YRLT I TALGEKLNDYWNEMKVKKNTEYPLNIiPVED I 
QKRPDQTWVQCDACLKWRKLPDGMDQLPEKVfYCSNNP\DPQFRNCEV 
PEEPEDEDLVHPTYEKTYKKTNKEKFRIRQPEMIPRINAELLFRPT\ 
ALSTPSXFSSPKESVSKR/RHLSEGTNSYATRIjLNNHQVPPQSEPES 
NSLKRRLSTRSSILNAKNRRIi\SSQF\ENSVYKG\DDDDEDVIILEE 
NSTPKPAVDHDIDMKSEQSHVEQGGVQVEFVGDSEPCGQTGSTSTSS 
S RCDQGNTAATQTEVP SLVVKKEETVEDE IDVRNDAVI LPSCVEAEA 
KIHETQETTDKSADDAGCQLQELRNQLLLVTEEKENYKRQCHMFTDQ 
IKVLQQRILEJ4NDKYVKKETCHQSTETDAVFLLESINGKSESPDHMV 
SQYQQALEEIERIiKKQCSALQHVKAECSQCSNNESKSEMDEMAVQLD 
DVFRQLDKCSIERDQ\YKSEVEL\LEMEKSQIRSQCEGLKTEV\DQL 
KSTNQQTAPDVSTSSNIEESVNHMDGERLKLRSFRVNVGNLLAMIVP 
DLDLQQVNYDVD WDE I LGQWEQMS EISST 

2720 

A 

1135 

1476 

HGVSLLLPRLECTGH/VISAHCDPRLPGFE\RFSCLSLP\SSW\DY\ 
RHAPTHAQAKFCIFKWRRGFSHVGQAG*HFYFPSQVLGTNMEDFFLT 
GLILHGLFQEDSNAPYSFKWGSLR 

2721 

A 

670 

1253 

ACYALNLFPKHTDIFYFFFIYFFCRDRGPRYITQAGLKFVDFK*ILP 
PWPYQS VGI I S VSH\ PPGPH* CILNAINKNYRYRN* LLY/ IQYPMLF 
GVFLRQGFAVT/ SAAVQWYDHGSMQPQLPGLRWFSHLSLLSGWEHRR 
TPPRIjG*FCIFSRESFVMLPRLVIjNSWGSNDTPAIiASQSVVITGMSH 
HAPAFPMLF 

2722 

A 

25 

479 

RAGQEFQAHKAILAARSPVFSAMFEHEMEESKKNRVEINDVBPEVFK 
EMMCFIYTGKAPNLDKMADDLIJ^ADKYALERIiKVMCEDALCSNLSV 
ENAAE I LI LADLHS ADQLKTQAVDF INYIiGAGPRI SQRWYFGDTCLH 
LTEQNKS SGA 

2723 

A 

354 

426 

LWKSRLTFKLAMSRVPS P PPPAE 

2724 

A 

201 

1432 

LWKSRLTFKIJ^SRVPSPPPPAEMSSGP\VAESWCYTQIKVVKF\SY 
MWD/TINNFSFCREEMG\EV\IKSSTFSSGANDKLKWC\LRVNPKG\ 
LDEESKDYLVTFTCYLVSC\ PKSEVRANFKFS ILNAKGEETKAMESQ 
R\AYRFVQGKDWGFKKFIRRDFLLDEANGL\LP\DDKLTIiFCEVSW 
QDSWISGQNTMNMVKVPECRL7U)ELRGLWENS\RFTDCCIiCVAGQE 
FQAHKAILAARSPVFSAMFEHEMEESKKN\RV\EINDVEPEVFKE\M 
MCFlYTGKAPNLTKMA\NDLIJVAADKYALERLKVMCEDAIiCSNPV/S 
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VENAABILILADLHSADQLKTQAVDPINYHASDVL/EKTSGWKS\MV 
VSTPPNLVA* / AQYRLSGFQAQCPFFWGPPTQNALKAIPKDPALLLR 
IPF 

2725 

C 

95 

310 

MREYNLVVLGSGGVGKSALTVQFVQGIFVEKYDPTIEDSYRKQVEVD 
CQQCMLE I LDTAGTL S PRLTT YRT * 

2726 

A 

1 

1101 

MRE YKLWLG SGGVG KS ALTVQFVQG I FVE KYD PT I ED S YRKQVEVD 
CQQCMLE ILDTAGTEQFTAMRDLYMKNGQGFALVYSITAQSTFNDLQ 
DLREQILRVKDTEDVPMILVGNKCDLEDERWGKEQGQNLARQWCNC 
AFLES SAKS KINVNEG P 1 1 R PLVGETKGTGHTSLAAVI LALS I TNVS 
SCTGWGVQRSHQDQGNSKHAASSVWKVGFGIVRLGTGTCLSSLERGY 
MCMEKEMFDPYTPSRVLKPRVKLGPDVVAKEDKEMNPARKTSGKGSR 
KKGKNVEDRTVQ FVQG I FVEKYDPT I ED S YRKI VLHTVT III PFKWR 
S PMHWTQAQPS PTGAWSLMNLVGGSHGGKDKDS KEPHP 

2727 

A 

53 

772 

IPRSGITGSEGDYPVYKSMCLTAPSMALSRSSVFKQITSLREYKLVV 
LGS *GVGKSALTVQFVHGI FVEKYDP \ S I EDS YRKQVEVDCQQCMLE 
IL\DTAGTEQFTAMRDLYMKNGQGFALSIFYLQLQSTF\NGLYRDLE 
GNQ I LRV * GTRE / VM F PM I L VGNKCDLEDER WGKEQGQNLARQWCT 
LALLE S S A/ RRS E IQVNEVL\ YDPVRQ INRE Y/ TPVEKKKP * KNSMS 
TLLGPIVRRQL 

2728 

A 

377 

615 

VPAQCSF*DTASCLLVKRKMRLDAVAHAYNPSTLGGRGGWl/TRGQE 
FKTSIANMVKPVSTKNKKI SRVWWHNRNPSYSGG 

2729 

- 

B 

1 

1224 

M PTVG KAQARRRTRDTLLLD I TQVL I DQS PRKDTGLAQQTGVI VPS K 
G I VKYWRAQLKGDMKYAS P AFEL F Y FS VRAQS TDG I VTLLDPTTRED 
CD I SS APNPEVRTQRQAEVAG VGLLAVKVPRDEEGVLGGAPGPQKRS 
RVCGEPTLQRRSGPLPVARPGSRCPPPTSVSSPVPEDEYGRTWMIR 
LDNPGISGRTIENNPPELEKQLPGEPSENSPYLGPPQVLFLLRDPRE 
DVMLLLNQTPTTAEKQAVLQAAE I FRNEQQI S YNTSKGKKGDRECEE 
IAETPFQIGSEAVPLDNPDWNSSSSAGEWKRRHFLICILEGLERTKA 
KFLNCS KLSMVDQKPDENPAAFMERLREAL IEQTSLS PDS VEKQL IL 
KDKFVTQS ASNI RRKLQKQA I GP I S TLKNLLK * 

2730 

A 

877 

1536 

TNFLFALSPSCLAGLLPPKFKQFSCLSLLSSWDYRNFYIFW*RWGFT 
MLARLVSNS * PQ/CDPHTSASHSAGI TGMSHCAWS *TLNFFSYELFF 
S FTSALS * KDHCKGGHNMGCQFT *MHQ * L * E KP P * LLF\ ETESRS VA 
/QAGVQWRNLGSLQPPSPGFKRFSCLSLPSSWDYLRTPPRPANCCCI 
FSRDGVSPCWPGWSQSLDLVICPPWPPKVLGLQA 

2731 

A 

3 

643 

RLEEDLGRRQS LAP VG VRVET YFTWLWTI TMG I QG\ GS VLFG \ LLL 
VLAVLC\HS\GH*PCQC/YNGFLPQTADCQKQPFNCSSGFWMRVFIT 
QKLGYQVYNK\CWKFEHCNFNDRHNPA*RKMGLTILLCICKG/LRCNF 
TE\RLENGGTSL\SEKTVLLLGELHFWAAAWEPSIPKSNTRECFLTK 
LPRSLRSSVFPLGWPHSKGLDI FQMGSWLGKE 

2732 

A 

11 

431 

RHLLAAPPPAPPAAAAAAALSHCLRSSGRLAPHTSRRLPRVVKRRVN 
ALKNLQVKCAQIEAKFYEEVHDLERKYAVLYQPLFDKRFEIINAIYE 
PTEEECEWKPDEEDEI SEELKEKAKI VDEKK\ DEEKEDPKG I PEFW 

2733 

C 

257 

358 

M I LLVFL PXHQVFL ERX QE S E I LHHLNTLAD VL * 

2734 

A 

2 

387 

SRSSWSLPAEPLVTWVKTFGPFGSGTQDNLTMYMDLVDGIFLNQIML 
QIDPRPTNQRINKHVNNDVNLRIQNLTILVRNIKTYYQRWGDDYDEE 
GEEADEPGEEGGDEEHDPDYDPKKDQNPTECKQQ 

2735 

A 

64 

468 

PAAWLP ILVAARQLTVQMMQNPQI LAALQERLDGLVETPTG YI ESLP 
RVVKRRWALKNLQVKCAQIEAKFY\EEVHDLQRKYAVLYQPLFDKR 
FEIINAIYEPTEEECEWKPDEEDEISEELKEKAKIEDEKKD 

2736 

A 

1 

795 

^VTVSELLELFLQS^LVTWVKTFGPFGSGSQDNLTMYMDLVDGIFL 
NQ I MLQ I D PRPTNQR INKHVNND VNLR I QNLT I LVRN I KT Y YQDMG Y 
GQTQAWGAGILESFLGGNVCLRLVGLTGREGVSSGEERHLQSFGDIK 
QPEGWGVSCGYRFRKGELEPLEVAVFVGTGSVGAAVMVAMGAATLEA 
PEALES INQES S CGC I KPCGNS LEWRQLARALLVRWGDD YDEEGEEA 
DEEGEEGGDEENDPDYDPKKDQNPTECKQQ 
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2737 

A 

1 

982 

l^VTVSELLELFLQSPLVTWVKTFGPPGSGSQDNIiTMYMDLVDGIFIi 
NQ I MLQ I D PRPTNQR INKHVNNDVNLR I QNI»T I LVRNI KTYYQDMGY 
GQTQAWG AG I LE S FLGGNVCLRL VGLTGREG VS S GQERHXiQS FGD I K 
QPEEAQGRMVGSFERTPQICGLGYRFRKGELEPLEVAVFVGTGSVGA 
AVMVAMG AATLEAPEALE S I NQE S S CGC I KPCGNS LE WRQLARALIjV 
RWGDDYDEEGEEADEEGEEEGDEENDPDYDPKKDQNPAECKQQLKQG 
CMWPLRDNLHWSTFCFPGKGLNLHHLTSLFSKLFWCLLCFCFCS 

2738 

A 

2 

138 

SARGAASCSSRCCCRCRPESLPAQLRAAWLPILVADIWSSYNMADID 
NKEQSELDQDLDDVEEVEEEETGEETKLKARSA*LFRMM\QNPQILA 
\ALQERLDG/LWVETPT\GYIESLP\RVVKRR\VNA1»KNLQVKCAQK 
ETQ\ FYE\EVHDLERKYAVljYQPLFDKRFE I INAI YEPTE/ EKECEW 
ETR*RKDEDFGGNWKEKGQIEDEKKDE\EKEDPKG/IFPEFWLTVFK 
\NVDLLSDWQEHDEPILKHLKDIKVKFSDAGQPMSFVIiEFHFEPNE 
YFTNE\VIiTKTYRMRSEPD\DSDPFSFDGPEIMGCTRVPR*DWKKGK 
NVTLKTIK\KKQKHKGRGTVPYCDLKPVSNDSFFNFFGPLPESSIiRS 
GDLDDDAEA\ I LAAD F\ E IGHF YRER 1 1 PRSWNFT\ GEAI EDDDDD 
YDEEGR * RSG * GRLAPHTSRR YLE FLQHGRH 

2739 

A 

1 

435 

PRARAERPAPASCSSPTSTRTTPDTEDVCIVERLFSSSLVAIVSIiKA 
PRKLKVCHFKKGTEICNYSYSNTILAVKLNRQRLIVCLEESLYIHNI 
RDMKVIiHTIRETPPNPAGLCALSINNDNCYLAYPGS * SGEAGAGQLI* 
FANFNQDNT\DTEDVCIVERLFSSSLVAIVSLKAPRKLKVCHFKKGT 
EICNYSYSNTIIAVKLNRQRLIVCLEESLYIHNIRDMKATLHTIRETP 
PNPAGLCALS INNDNCYLAYPGSATIGEVQVFDTINLR 

2740 

A 

3 

438 

LLFANFNQDNTSIiAVGSKSGYKFFSIiSSVDKLEQIYECTDTEDVCIV 
ERLFS S S LVAIVSLKAPRKLKVCHFKKGTE I CNYSYSNT I LAVKLNR 
QRLIVCLEESIiYIHNIRDMKVLHTIRETPPNPAGLCALSINNGFGEG 
DRD 

2741 

A 

1 

1336 

MLLRLQR I KTLR PPGCPHPMTTCS AGTLS AVPCVS PRQVFVFERRFC 
LWHSHVEMFTHVLPFVISADTEDVCIVERDFSSSLVAIVSLKAPRKL 
KVXCHFKKGTXEIXOIYSYSNTILAVKFNRQRLIVCLEESLYIHNIR 
DMKVLHTIRETPPNPAGLCALSINNDNCYLAYPGSATIGEVQVFDTI 
NLRAANMI PAHD.SPIJ\ALAFDASGTKXJVTASEKGTVIRVFS I PEGQK 
LFEFRRGVKRC\VSICPLA^SMDG\MFLSASSNTED/TVHIFKLETV 
KEKPPEE\ PTTWTGYFGKVLMASTSYL\ PSQVT\EMFNQGRAFATV\ 
RLPFCG\HKNILLG*PTIQKIPRLLVGAA\DGYLYMY\NIiDPQEG\G 
ECALMKQHRIxDG\SLETTNEILDSASHDCPLVT\QTYGAAAGKAYTD 
DLGAVGGACLEDEASALRLDEDSEHPPMI IRTD 

2742 

A 

152 

1616 

VQPDPPSRARPPRPGPIJ^RPAMNLASQSGEAGAGQLLFANFNQDNT 
EVKGASRAAGLGRRAVWSLAVGSKS\GYKFFSLSSVDKL.EQIYECT 
DTEDVCIVERLFSSSLVAIVSLK\AP\RKLKVC\HFK\KGTEICNYS 
YS\NTIIAVKLNRQRLI\VCLEE\SLYI\HNIRDMKVLHTIRETPPN 
PAGLCALSINNDNCYL\AYPGSATIGE\VCQVFDTINLRA\ANMIPA 
HDS PL\ AALAFDASGT * L\ ATAS EKGTVI RVFS I PEGQK\ LFE FRRG 
VKRCVSICSLAFSMEK5MFLSASSNTETVHIFKIiETVKEKPPEEPTTW 
TGYFGKVIJ^STSYLPSQVTEMFNQGRAFATVRIiPFCGHKNICSIiAT 
I QKI PRLLVGAADGYL YMYNLDPQ\ EGGECALMK\ QHRL \ DGSLGKR 
PNE\ILDSA\SHDCPLVTSDITAQLAGKGTYVPSSPTETCPTRDD\L 
GAVGG AC \ LED\ EGP ALR\ LDEDS EHP PM I LRTD 

2743 

A 

122 

1162 

LGLEGDGGASQWVGGRQRLQDCRNRSLQGSPBTLIPP/SLQITVPSR 
TCFP/SLQVGLT/FPPRAVPSAAAAIjGRNLRL*RGWHRRIIPPWVAS 
ISRAASSWG/TKPPLFKVPSQPETLGETWVIAKMAGTYGGSRNTLPR 
SPGGNFRSTVPR*VQTTPRQPGAGLPKPVWATSSD*QASRSSPGRS 
RKDGPDSFPAIiPVS/RGEPESPSLQPPPPSTR*PSRLRGPRP\PSAL 
SGPLRAPEHAQSSTSPSRWTTSPRMLRAPARAPPCPSPTLVPVSTPS 
PPFP*RWSQ* SGHTPGRFFPCS PPRIRASACASLSGTTDGSTAYQIA 
HAPHHVFLTSLRAFLRLAGSCTNHG 
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2744 

A 

53 

394 

RETGLPALGTCAVPLARAAPILQRLQQSPRTSLKARPKTRITGAIiPM 
DHTEGSPAEEPPAHAPSPGKFGERPPPKRLTREAMRNYLKERGDQTV 
L I LHAKVAQKS YGNEKRFF 

2745 

A 

2 

358 

ALPRLIIRKVDKQTALLDADDPVSQLHKCAFYLKDTERMYTjCLSQER 
IIQFQLNGRWGRRQCLNLQGQNFTSKFTECGFGGCRKAGNYVTGVGG 
GVWLLCSSPDISGISEEGWEMGSR 

2746 

A 

3 

452 

I LMDKTVKLVCP VTGMALPRL I IGKVDKQTPLWDADD P VS QLHKCAF 
YLKDTERMYLCL S QER 1 1 QFQATPCPKE PNKEM INDGAS WTI I S TDK 
AEYTFYEGMGPVLAPVTPVPVVESIiQVRTPSSNWPSSSLQLLSAVTW 
HHFISWEF 

2747 

A 

1 

1814 

MRS AARG PRQS CS AFNRFRAANS S S PGFGAPCGRQCW I WES LGKERA 
KEGKDGGLQSPRTSLKARPKTRITGALPMDHTEGSPAEEPPAHAPSP 
GKFGERPPPKRLTREAMRNYLKERGDQTVLILHAKVAQKSYGNEKRF 
FCPPPCVYLMGSGW\RKKKEQMERDGCSEQ\GLNQCAFIGIGNSDQE 
MQQLKLW/ NGKNYCTAKTLYI SDSDKRNTFMLS VKMFYGPTS \ YDIG 
VFL \ S KR INVI SKP/ S PKRKQSLKNADLCI AS GKERWALFNRLLSQT 

V\ ^TRYT.HV / FK"P(*^WRT-TZ\ Q QOOWf2A T717TTJT T 1^TM^1?01CV""»I3»I?T?'T , \rt>TV"«V 
v \oii\.i un v / DivGuiiir hao q n u at r XnljJLtUlJLJh>£> EjVjHrJc*r X VKJL/vj x 

I HYGOTVKXiVCS VTGMAL PRT. T T P KVHKOT AT .T .D AnDPVQfYf ,HKP & T? 

YLKDTERMYLCLSQERI IQFQATPCPIAPS \ REM INDGAS WTI I STD 

K/AQKKTFYEGMGP/VSLPPWPCLV*ESlX?IiKGGAIX3SQCLNIiPGQ 

NFTPKFTECGLGKVEAETMYRCGESM\ LCWPDI \ SAFRER\ WRWVR 

QPSPRFPVTLVPKLIGIPFISHQPI>PFT\YTPEP\GPRATIAVPAGA 

ILRAQFKJPGCPLTESNTNSEGSl^KRQHKFNQCHIIYSHSGILTTVF 

LLGLKLT 

2748 

A 

54 

188 

NMTKLQWSKNLLLLKMLIKERETQRKGGSS * KSTQGAKGGAPR 

2749 

A 

572 

857 

GQKKGFSMGG*GEKGSIiGNPSWGTAGGQAPVPPQGLTGTRFTQGSSG 
\TAGPPASTFCLGGHGAWGTYGRQMYMYEFVFCSVGVAQKKDLCPR 
E 

2750 

A 

3 

127 

E I CQFLKGAKITRLQDQQVPYAVKGNQWVG YDDVKSMETKD 

2751 

A 

1 

359 

LI LVKAAAALDTGHLSPWFGVRCAFQNPGRRGCS / PEGKQLQSRVI * 
G * E FQSTAACEEH YMAS I RRS VS PSLS VCTS PLPS VLD I LPPTDL YF 
EQHSPIL*LKVKGSLH*RSSPLRKES 

2752 

A 

1 

1270 

MGATTMDQKSLWAGVVVLLLLQGEMGFCYVARAGLEIiLGSRSPPASA 
SQSAGITGSAYKLVCYFTNWSQDRQEPGKFTPENIDPFLCSHLIYSF 

AQTEMNKVT TKTlKQRVMT/VTYPTKIQT.Tr'ririsJDVT wtt t ctppvt vncvr* 
mcummmxw j. xai/aod vrjjj x v J- xjmqxjJvJ mm f m i jjXjij lOb xl_ir kaoiVur 

FHPMVDSSTSRLEFINSIILFLRNHNFDGLDVSWIYPDQKENTHFTV 
LIHELAEAFOKDFTKSTKERLLT.TAGVSAnROMTn>J^YnVT7K7.AKT)T. 

DFINLLSFDFHGSWEKPLITGHNSPLSKGWQDRGPSSYYNVEYAVGY 
WIHKGMPSEKYVMGIPTYGHSFTLASAETTVGAPASGPGAAGPITES 
SG FLAYYE I CQFLKGAK\ TRGSRDQQVP YAVKGNQWVG \ YDDVKSMG 
DQGSVL*RFKTWGGAMIWSIDMDDFTGKSCNQGPYPLVQAVKRSLGS 
L 

2753 

A 

385 

570 

KLGTEQDKGGRI I LLVWAKLHQLS Y * H I RVLS I GGN * GPGEG * MGMP 
GVHLLFFRTKIRIQI 

2754 

A 

1 

1022 

MTJjAPSDODLFVFRSGP PAAAGMPCT.FFPF VP PPTP VWO PTOPTV3 K* 
NLL1TOLFIACPGGLMAQGPU3PTESSVPOT 

QTFMQSAIQFLRKYNFDGLDIDWEYPGNRGSPADTQQLFTILLKEMY 
EAFEQESTRSKKPRLLMSAAVSAGKGTIETACQIPEMSSSTPSRKEP 
PRVWNAPWEVPYVYKGNQWIRYDNSKSFTLK/AVLPHSKD1W*TTEF 
FALLCLSPAHGKFTQAEPKSGGDRASGLLQSTHLYPGHQL\APIITA 
APGGGSTGGSGFCTGI^SKGLYP\VPHSKHAFYNCVDEHTYEEACQEG 
LVFDT\ S C \ S CCNW A 

2755 

B 

143 

867 

X VWLLLLQGGS AYKLVCYFTNWSQDRQE PGKFTPEN I D P FLCSHL I 
YSFASIE1WKVIIKDKSEVMLYQTINSLKTKNPKLKILLSIGGYLFG 
S KGFHPMVDS STSRLEF INS I ILFLRNHNFDGLD VS W I YPDQKENTH 
FTVLIHELA^^QKDFTKSTKERLLLTAGVSAGRQMIDNSYQVEKLA 
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KDLDFINLLSFDFHGSWEKPLITGDN6PLSKGWQDRGPSSYYNCEIC 
CGVLDT* 

2756 

A 

2 

3 94 

C S AH S TALR WRNHF Y F T I TATQVHKGS S SHPCTDRGNEMMAS S PQP 
S AFG I PGWGAR * PAPAPLPHRRGLPGS VTG * WRP I VGQ * / PSPHTHC 
KASTKAQPEGPGTPPP\GKPSWKELPESHKKSLAPSFP 

2757 

A 

499 

1592 

QPSQPFLS AGLNQTVPWGTCLHRAELSPST\ ENS SQLDF\ EDVWNSS 
YGVNDS FPDGDYDANL\ EAAAPCHSCNLLDDS ALAFF I LNQCS WG IL 
ASRTGLFMPFR\PFFRWQLCPG\WPVLGTAGCGAVAFFQHLWVPVFG 
PQGYGS\TRSSALL*LGATGVWYGSAFAQALLL\GC\HASLGHRLGA 
GQ VQG \ LTLGLTVG I WG VAALLTLP VTLA\ S GASGGLCTL I YSTE LK 
ALQATHTVACLAIFVL\LP\LGLFGA\KGLKEGIGLMGPGPLDEISIi 
WA\ WF I FWWPHGWLGLDF \ LVR \ S KRLLLS TCLAQQALDLLLNLAB 
ALAILHCVGTPLLLALFCHQATRTLLPSLPLPEGWFFHLDTLGSKS 

2758 

A 

1541 

1830 

KRFLHLHQGHTSGKGSRGFHLCCCGFSSKSP/GS*PGAGGASSSSQS 

KSPGPRPARVRRRLWVRAPPATSQEEASAGRRVRPPSSVHVNIPPPS 
VRPA 

2759 

B 

4 

1698 

MGDFNTPLSTLDRSMRQKVNKDIQELNSAIiHHMDLIDIYRTLHPKST 

EYTFFSAPQRTYSKIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLE 

LR I KKLTQNHS I TWKLNNWLLND YWVHNEMKAE I KMFFETNE I KDTT 

YQNLWDTFKATIiNQEEVESLNRPITGSEIQAMINSLGTKKLQDQTDS 

QPS S TRDDM I VYLENP I VS AQNLLKL I SNFS KVSG YKINVQKSQAFL 

YTNNRQTESQ I MSKLPFT I ASKI I KYLGVQLTRDVKDPFKENYKPLL 

NEIKEDTNKWKNIPCSWIGRINIMKMAILPKVIYRFNAIPIKLPMTF 

FTELEKSTLKFIWNQKRACIARSILSQRNKAGGITLPDFKLYYKATV 

TKTAWYWYQNRDIDQWNITEPSETMPHIYHYLIFEKPDKNKKWGKDS 

LFNKWCWENWIJVTCRKLKLDPFLTPYTKINSRWIKDI^A^PKTINTL 

EENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKE 

TTTRVNRQPTEWEKIFAIYSSDKGLISRIYKELKQIYKKKTTPSTSG 
* 

2760 

B 

1 

1044 

MGRNRSRKDENSKNQSAS SPPKDRSSS PATEQS WMENDFDELTEVGF 
RRLNQEEVESLNRPIRGSEIEAIINSLPTKKSPGPDGFTAEFYRRYK 
EELVPFLLKLFQS YEKEVILPNS F YEAS I IL I PKPGRDTTKKENFRP 
ISL1WINAKILNKILANQIPQHIKKLNHEDQVGFIPGMQALFHIQES 
INVIHHINRTKDKNHIIISIDAEKAFDKIQQRFMLKTLNKLDDMIVY 
LENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIM 
SELPFTIATKR I KYLGI ELTRDVKDLFKENYKPLLNQI KEDTNKWKN 
IPCSWIGRINIVKRATLPK* 

2761 

A 

1331 

2176 

TEPKTKTT*LSQ*MQKRPLTKFNNPSC* KLS IN/ IVLEVLARAIRQE 
KE I KG1 QLGKEEVKLSLFADDMI VYLENP 1 1 S AQNLLKL I SNFSKVS 
GYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRD 
VKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILPKVIY 
RFNAIPIKLPMTFFTELEKTTLKFIWNQKRARITKSILSQKNKAGGI 
TLPDLKLYYKAIVT\KTAWYCYQNRDIDQWNRTEP*EIMLHIYNYLI 
F 

2762 

B 

1 

2898 

MRQKVNKDTQELNSALHQEDLIDIYRTLHPKSTEYTFFSAPHHTYSK 
TDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELTIKKLTQNHSTTW 
KLNNLLI^YT^mEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCKG 
KFIALNAHKRKQERSKIDTLTAQLKELEKQEQTHSKASRRQEITKIR 
AELKETETQKTLQKINESRSWFFERINKIDRPLARLTEKKRKKNQID 
AIKNDKGDITTDPTEIQTTIREYYKHLYANKLENIiEEIWKFLDTYVL 
PRLNREEAEPLNTPIAGSEIEAIINSLPTKKSPGPDGFTAKFYQRYK 
EELVPFLLKLFQS I EKEG I LPNS FYEAS 1 1 LI PKPGRDTTKKENFRP 
ISLIWINAKILNKVLANRIQQHNKKLIHHDQVGFIPGMQGWFNIRKS 
INGIQHINRTKDKNHTIISTDAEKAFDKIQKPFMLKTLNKLGIDGTY 
LKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLE 
VLARAIRQEKEI KDDM I AYLENP 1 1 S VQNLLKL I SNFSKVSG YKINV 
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QKSQAFLYTSNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDIiFK 
ENYKPLLKE I KEDTNKWKN I PCS WVGRIN I VKMAILPKVI YRFNAI S 
IICLPMTFFTELDKTTLKFIWNQKRACIAKSILSQENKAGGIMLPDFK 
LYYKATVIKTAWHWYQNRDIDQWNRTEPSEIMPHIYNYIilFDKPDKN 
KKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVR 
PKTI KTLEENLGNTI QD I GMGKDFMS KTPKAMATKDKIDKWDL I KLK 

Of PTTWPTIiT 0'\7TvTt>r>T> r PTrTAJ"C > VT 1 !? TlTVC GT1 VD T TOO TVXTOT nUTVWV 

or ^.l/vKAj J. Hj.KVJMKyr lHWCi]\.irAl j ooJJJvKXjXoK J. xXMJb>JLiKiaX xxsJvK 
TNNP I KKW AKDMNRHC S KED I YAAKKHMKKC S S S L AI REMQ I KTTVR 
YHliTPVRMAIIKSQETTGAGEDVEK* 

2763 

A 

3 

670 

I QTT I RE YYKLLYANKLENLEEMDKFLDTYTLPRLNQEEVESiKTP I 
TGTE PR FG PTKNVQDQMD S QLNS TR VLEVLARAI RQEKE I KG I QLGK 
EEVKliSLFADDTIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQ 
AFLYTNNRQTESQIMSELPFRITTKRIKYLGIQLTRDVKDLFKENYK 
PLLNEIKEDTNKWKNIPCSWIGIINITKMVILPK 

2764 

B 

64 

1242 

MINSLGTKKLQDQTDSQPSSTRDDMIVYLENPIVSAQKTLLKLISNFS 
KV SG I K I IM V Qivb QAFIj i TNNRQTE S Q I MSKJjPFTIASKIIrvYIjGVQIj. 
TRDVKDPFKENYKPLLNEIKEDTNKWKNIPCSWIGRXNIMKMAILPK 
VIYRFNAIPIKLPMTFFTELEKSTLKFIWNQKRACIARSILSQRNKA 
GGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNITEPSETMPHIYHY 
L I FEKPDKNK3CWGKDS LFNKWCWENW^ 

DKWDLIK1.KSFCTAKETTTRWRQPTEWEKIFAIYSSDKGLISRIYK 
ELKQIYKKKTTPSTSG* 

2765 

B 

287 

898 

MNINAKILNKILANQIPQHIKKLNHEDQVGFIPGMQALFHIQESINV 
IHHINRTKDKNHI 1 1 S IDAEKAFDKIQQRFMLKTLNKLDDMI VYLEN 
PIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEL 
PFT IATKR IKYLGI ELTRDVKDLFKENYKPLLNQ I KEDTNKWKNI PC 
SWIGRINIVKRATLPK* 

2766 

B 

1 

1443 

MKLPEEASGSNICCYAIFAVLQPLIiVIPKETVSGVDLQQTPTDLQLS 
D LT PGR KTNKQKG I AS P S T KRTS T P KPHLQ ANAQR FCHHQAC VTRVP 
KGSTKHGKEQLVP ATAKTCQI VKN INAMKKLHQI/TE I QI TIRBYHKH 
L» YANKL KN JjEE MD KFJjDT YTL PRJj KQE E VE SIjNRPIMGSEIEAIINS 
LPTKKSPGPDGFTAKFYERYKEELVSFLLKLFQSIEKEGILPNSFYE 
ASIILIPKPGRDTTKKENFRPISLKNIDAKII^ILANRIRQPIEKL 
IHHDQVGF I PEMKG WFKKCKS INVIHH INQTNDKNHMI I S IDAEKAF 
DKTQQPF^KTlaNKIiGTIRQKKEIKGIQLGKEEVKLSLFTUDDMIVYI, 
ENS IVSAQNLLKIjI SNFSKVSRYKINIiQKS QAFIj YTNNRHMESQITS 
ELPFTIDTKRIKYLGIQFTRDVKDLFKENYKLLLNEIKEDTNKWKNI 

2767 

A 

1 

966 

MGRNQCKKAENSKNQNTSSPPKDNNFLPAREQNWMENEFDELTEVRF 

KIOlEKNQINTIKrTOKGDIITDPTEIQTTIREYTKHLYANKLENIjEEM 
DKFLDTYTLPTLNKEEVESIiNKPITSSEIEAVIISIiPTKKVQDQKDS 
QPNSTRVLEVLATAIRQEKEIKDTQTGNEEVKLSLFADDMIVYLENP 
IVSVQNLFKLISNFSKVSGYKINVQKSQALLYTTNNRQTESQIMSKLP 
FT IATKR I KYLG I QLTRD VKDLFKGNYKPIiIjKE I REDTNK 

2768 

A 

1 

216 

LFVRCP VGS I S LPTLS S PVKRYRFPHMAYPSIjTRCIHSGLPQYCMS I 
NCFPMH PG YRVPS RCF PVS PIFFPT 

2 769 

/"V 

3 


QADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRT 
ENVRNCLSDHS AI KLELR I KTLTQNCS ATWKLNNLLLND YWVNNEMK 
AEIKMFFETNENKYTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKI 
DTLTSQLK^LEKQEQTHSKASRRQEITKIRAELKEIEIQKTLQKINE 
SRSWFFEKIKTEIDRPI^ARIjIKKXREKNQIDTIKNDKGDITTNPTEIQ 
TTIREYYKHLYANKLENLEEMDKFLDTYILPRLNQEEVESRNRPITG ' 
AE 1 EAI INS LPTKKS PG PDGFTAEF YQR YKEEL VP 1 LLKRFQS I EKV 


1012-7 
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GIL PNS F YEAS 1 1 L I P KAGRDTTKKEN FR PIS LMN I DAKI LNKI LAN 
RIQQHI KKLIHHDRVGF I PGMQGWFNIHKS INVTQHINRTKDKNHMI 
I S I DAEKAFDKI QQPFMLKTLNKLDDM I VYLENP IDSAQNLLKLKSN 
FRKVSGQKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGI 
QLTRDVKDLFKEOTKPLLNEIK^DTNKWKNIPCSWVGRINIVKMAIL 
PKVIYRFNAIPIKLPMTFFTELEKTTLKFIRNQKRACITKSILSQKN 
KAGGITLPDFTLYYKATVTKTAWYLrYQNRD ID 

2770 

A 

1 

1632 

MHCILPTPSGSGSFPVYTSQSQPKTPGAPSGTLSVREPLSPEPDVLL 
PGSAKCAALCWFVSVLDAASPVTRWPVSRSPWECRCVPSPPVREAPV 
SPPFQCAPPPYTDCLSPYLYEYTVCPSPCARVPIPSSTHPWEYKVPP 
PHAERLCPVSEDPPCLCNWKTRAAGARESSIiVPCPYAARYPPSPPRV 
AWSSHFSYRGLSPSLGPNKPGNPRGPSESPAAAASRPRAAAASAPPP 
GAARLPASPTPRLAl^RGVSHPAJ^PCPAWPLRALTVAAASHGSATPA 
SGPPPGREGVRSASLSPACRPAAHLVALNRPHRPPSTLTGAGPKSLE 
FCSWDGTSERLNQEDVESLNRPITGSEIEAIINSLPTKNVQDQKDSQ 
PNSTRVLKVLARAVRQKKAIQGIQIGKEEVKLSLFADDMIVYLENPI 
VPAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELLF 
TI ATKRI KYLG IQLTRDVKDLFKENYKPLLKE I REDTNKWKNI PCS W 
IGRINIMKMALLPKVIYRF/NAIPIKL*LPTPSGSGSFPVYTSQSQP 
KTPGAFSGTLS VRE PLS PEPDVLLPGSAXCAALCWFVS VLDAAS PVT 
RWPVSRSPWECRCVPSPPVREAPVSPPFQCAPPPYTDCLSPYLYEYT 
VCPSPCARVPIPSSTHPWEYKVPPPHAERLCPVSEDPPCLCNWKTRA 
AGARESSLVPCPYAARYPPSPPRVAWSSHFSYRGLSPSIiGPNKPGNP 
RG P S ES PAAAAS RPRAAAAS APP PGAARL PAS PTPRLARDRGVSHPA 
RPCPAWPLRALTVAAASHGSATPASGPPPGREGVRSASLSPACRPAA 
HLVAXNRPHRPPSTLTGAGPKSLEFCSWDGTSERLNQEDVESLNRPI 
TGSEIEAIINSLPTKNVQDQKDSQPNSTRVLKVIJU^AVRQKKAIQGI 
QIGKEEVKLSLFADDMIVYLENPIVPAQNLLKLISNFSKVSGYKINV 
Q KSQAFL YTNNR QTESQ I MSELL FT I ATKR I KYLG IQLTRDVKDLFK 
ENYKPLLKEIREDTNKCKNIPCSWIGRINIMKMALLPKVIYRFMLSP 
SSYD 

2771 

A 

842 

2353 i 

TEPKTKTT*LSQ*MQKRPLTKFNNPSC*KLSIN/IVLEVLARAIRQE 
KE I KG I QLG KEE VKL S LFADDM I VYLENP 1 1 S AQNLLKXi I SNFS KVS 
GYKINVQKSQAFLYTNNRQTESQIMSELP\FTIASKRIKYLGIQL\T 
RDVKDLFKENYKL\PLIKEIREDTNTCWKNIPCSWIGRMNIMKMAILP 
KVI YRFNAI L I KLPLTS FTKLEKTTLKF I WNQKRAR I AKS I LSQKNK 
AGG I TL PDFKL YYKAI VT\ KTAWYCYQNRD IDQWNRTE P * E I MLH I Y 
NYLI FDK^EKNKQWGKDSLF NKWCWENWLTICRKLKLDPFLTSYTKI 
NSRWIKDLNVRPKTIKTPEENLGNTIQDIGMGKDFMTKTPKAMATKA 
KIDKWDLIKLKSFCTAQETTIRVNRQPTEWEKISAIYSSDKGLISRI 

WITT .VTlTWTVTMMD TlfVTJATfnMXnJUT?CVT7nTVA 7V VtJUMVVPOO OT 7\ 

IREMQIKTTMRYHLTPVRMAII/RIKSGNNRCWRGCGEI 

2772 

A 



KT-TPinvrP'KTnT.r*T.QC5P , f2QP\7TT aTCCni?VTJDOI?MT Trv^MOTTTf W r P*T ,p Pn 
A-rir ru^i\rvxi^ij^Xjo oijLaj-c* V X1j>\.1 ooiJll»jSJrlxrlr IjIm J- -LDvjJN i^JC* 1 r W X J. 1\J 

MFPQEFIICFHKHVRIERLVIQSYFDLVHTEGQLQNEEIVAPIPNYG 
STGHL 

2773 

A 

2 

294 

HERESQVKHFKMRKIDLCLSSEGSEVILATSSDEKHPPENIID\DLV 
HTEGQLQNEEIVAHDGSATYLRFIIVSAFDHFASVHSVSAEGTWSN 

2774 

A 

247 

411 

OLTSTAKS IYIDFFPIOAHDGSATYLRFI IVSAFDHFASVHSVSAEG 
TWSNLSS 

2775 

A 

136 

714 

RESQDKHFMMRKIDLCLSSEGSEVILRYQSSDEKHPP\ENIIDGNPE 
TF\ WTTTG\MFPQEF I IGF\HKHVRIERLVI \QSYFVQ\ TLKIEKTH 
RLNEP\VDF*RNWIEKDFGTHQRGSFKNEEIV/AHIDGSATYLRFII 
VS AFDHFAS VHS VS \ AEGTWSNLLL I M I TKWLLHDFLT I YLNRKLS 
LIYFIKRIFIK 

2776 

A 

3 

2077 

PAEEWTYlPLDFCHYHSEDGKQFLLPLTPTLLTHPLCIiVTGTTSLL 
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LVCVGKPBIFVAIHYYTPNTKIiVPIJ^PRNSHVPHPPERTTVTQYST 
CALLTALCLLLPVLQETAQSRRK\n?SHPEDRPALARKHGASQPAGLG 
FPRTQTVTPAFTFQTPTAAEPALLTAWLGRAPETETITDMAGAAGTH 
RRKRRLEPEKRPGASQKTTLKERPPETGPWDLPPPPTERTTVTQYST 
CALLTALCLLLPVLQETAQSRRMVTSHPEDRPAliARKHGASQPAGLG 
FPRTQTVTPAFTFQTPTAAEPALLSAWLGRAPDTENITDMARATSRS 
ITWGRMVSAKARGAAAVTEERETPERNLVQRNPDSNSSDISPESFLN 
SDLCWSQWASMPAEKSWNLCPHSLSALASSQLNLRLLGSGNDGSTSA 
ALRLLGLRDGSTSPTLRLLGSGDDGSTSAALRLLGLRDGSTSPTLRL 
LGSGDDGSTSAALRLLGLRDGSTSPTLRLLGSGGAHAWPQCSSIAD 
WEREEEVFKPHEDRGSLICTELVHTYYDLQEVMDGNVITAKRTAAVS 
AIATKCDNIiKTCHTSHGSVMAETAVINHKKRKNSPRIVQSNDLTEAA 
YSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTS 
AEASKDIRQALKSFAGKEWFYRPEKDPAMKKL 

2777 

A 

3 

1038 

TAEGTWI AVRRLRLEGTGGEMSRVPAPLSAAEVEEHLRS SSLLI PPL 
ETAIJ^FSSGPEGG\VMQPVRTVVPVTKHRGYLG\VMPAYSAAEDAL 
TTKLVTLLRGPRHHLRSSLPTRVIVVTLLSPAMGTLLAVM\DGKCS* 
I AKRTAAVS CHL PPS FLKPPS SE VLC I LGAGVQAYSHYE I FTEQFS F 
KEVR I WNRTKENAEKFADTVQGE VRVC S S VQEAVAGADV 1 1 TVTLAT 
\ EPILFGEWV\ KPGAHINAVGAS \RPDWRELDDELMKEAVLYVDSQE 
AALKESGDVLLSGAEIFA*AGEK*FKGVKPSP/YCEKTPRCSKSLGM 
AV\EDTVAAQTHPMIPWSSGK 

2778 

A 

2 

2037 

LPPRSVGAYFAASWSVAMNPDLRRERDSASFNPELLTHILDGSPEKT 
RRRRE I ENM I LND PD FQHEDLNFLTRS QR YE V AVRKS A I MVKKMRE F 
GIADPDEIMWFKKLHLVNFVEPVGLNYSMFIPTLLNQGTTAEKEKWL 
LSS KGL»Q I IGTYAQTEMGHGTHLRGLETTAT YDPETQEF ILNS PTVT 
S I KWWPGGLGKTSNHAI VLAQLI TKGKCYGLHAFI VPIREIGTHKPL 
PG I TVGD I GPKFGYDE IDNG YLKMDNHR I PRENMLMKYAQVKPDGT Y 
^^CPLSNKXTYGTM^nFVRSFLVGEAARALSKACTIAIRYSAVRHQSEM 
KPGEPEPQILDFQTQQYKLFPLLATAYAFQFVGAYMKETYHRINEGI 
GQGDLSELPELHALTAGLKAFTSWTANTGIEACRMACGGHGYSHCSG 
LPNIYVNFTPSCTFEGENTVMMLQTARFLMKSYDQVHSGKLVCGMVS 
YLNDL P SQR I QPQQVAVW PTMVD INS PE S LTEA YKLRAARL VE I AAK 
N1.QKEVIHRKSKEVAWNLTSVDLVRASEAHCHYVVVKLFSEKLLKIQ 
DKAIQAVLRSLCLLYSLYGISQNAGDFLQGSIMTEPQITQVNQRVKE 
LLTLIRSDAVALVDAFDFQDVTLGSVLGRYT>GNVYENLFEWAKNSPL 
NKAEVHESYHKHLKSLQSKL 

2779 

A 

3 

194 

TFFFQVKPDG\ TYVKPL\ SNKLTYG \ TMVFVRS FLVGEAARALS \ KA 
CT I AIR Y SAVRHQS E I KPG 

2780 

A 

1 

903 

MTGGGFGGCTVTLLEASAAPHRMRHIQEHYGGLPPSTSLKQPMEPRC 
CACEAPPGQHTLHALTAGLKAFTSWTANTGIEACRMACGGHGYSHCS 
GLPNIYVNFTPSCTFEGENTVMMLQTARVGDNSIDCWKDARRLFQDG 
DKSRAPHDPILTDQTQEWSGSATFTSDGGIRLFYTDYSGKHHGKQSL 
TTAQVNVS KSDDTLKINGVEDHKTLFDGAGNTYQHVQQF I DEGNYTS 
GDNHTLRDPHYVEDKGQKYLVFEANTGTENGYQGEESLFNKAYSGGG 
TNFFRKESQKLQQSEERQK 

2781 

A 

133 

402 

IADSVTPDDSSLKCPSLIFPRSFVHRGRPEPLDLHLGMFLPTLLHQA 
TAEQQER F FMPAWNLE I IGTYAQTEMGHGME YS VYL KEMGG PL 

2782 

A . 

2 

242 

MTEPQI TQ VNQR VKELLTL I RSD AVALVDAFD FQD VTLG S VLGR YDG 
NVYENLFEWAKNS PLNKAEVKIQCLCTFLKVFT 

2783 

A 

26 

2229 

RFRAGAADLDCHCHLGRCLLRCQLWAMNPDLRRERDSASFNPELLT 
HTLDGSPEKTRRRREIENMIL\NDPDFQHEDLNF\LTRSQRYEVAVR 
KSAIMVKKMREFGIADPDEIMWFKNFVHRGRPEPLDLHLGMFLPTLL 
HQ\ATAEQQERFFMPAW\NL\EIIGHFIAQTEMG\HG\THLSRLSET 
TATYDPETQEFILNSPTVTSIKWWPGGLGKTSNHAIVLAQLITKGKC 
YGLHAFI VP IRE I GTHKPLPG I TVGD IGPKFG YDE I DNG YLKMDNHR 
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I PRENMLMKYAQVKPDGTY\mPLSNKLTYGTMVFVRSFLVGEAARAIi 
S KACT I GI L Y S AVRHHS E I KPGEPEPL I LDFQTQQ YKLF PLLATAYA 
FQFVGAYMKETYHRINEGIGQGDLSELPELHALTAGIiKAFTSWTANT 
GIEACRl^CGGHGYSHCSGLPNIYVNFTPSCTFEGENTVMMLQTARF 
LMKSYDQVHSGKLVCGMVSYLNDLPSQRIQPQQVAVWPTMVDIN\SP 
ESLTEAYKLRAARLVEIAAK\NLQKEVIHRKSKEVAWNLTSVDLVR\ 
ASEAHCHYVWKLFSEKLLKIQ\DKAIQAVLKE/SLCLLYSLYG\IS 
Q\NAGDFLQGSIMTGP\QITPSNPAV*RELLTLIRSDAVALVDAFDF 
QDVTLGS VLGRYDG\NVY\ ENLFEWAK\ NS PLEPKAEVHGI FTRHLE 
VHCQSQALEVFTRDKFKSGFQKAPWQLQILWESFSNSNRL 

2784 

A 

61 

241 

ANLGPTAPPRSGPVLGAGEKGRGEMRRAPFFLSAGGLETPPPSAALL 
WAPGRRADE I SGL . 

2785 


60 

341 

ANLG PTAP P RSG P VLGAGEKGRGEMRRAPF FL S AGGLETPPPS AALI* 
WAPGRRADE I SGIjWLLHTTAPQVGGELRDERRGQRQEKMLI KKPEN 

2786 

A 

70 

428 

IGGPQGFSPTPSLKPRPRIFKTPPIRGNViGPRPPPGVSPQFQGFGGD 
TKFNLIEILFWLTFSV*AQSGDPAAPPRSGP/RPGGRGEREG*NEK 
S PVL PQCWGG WRHHPPS AALLWAL I W 

2787 

A 

38 

243 

TPSTIDKSLTSLIKKK\KA*INNFRDKKGNIAKDIPEM*KIIREYYE 
NIHANTFENLGDIDKNKIDQN 

2788' 

A 

2 

114 

KNGMNMS FS FSNYAQS S QFRTWQDS S VS YAAG ALTVH 

2789 

C 

50 

214 

MS 1 1 EQLDSVS FSNYLKKYHNTI CGRHP I GVLLNA I TELQKNGMNMS 
FSXLNYAQ 

2790 

A 

1 

624 

MAEKCTSQQKRTQNKHGSTGIKMSNRVVRWEASHSGNWYTASGPQIiN 
AQLEDKDEHGIEMHIiPCTAKAMEIHKDEFTI I PVLVGALSESKEQEF 
GKLFSKYLADPSNFFWSSDFCHWGQRFRYSYYDESQGETYRSIEHL 
DKMARASPTSAAPCSTAPSPIDRPRAEECERMAQDWQAAPPAALVRD 
PLGEASWAPESGGDVENLYV 

2791 

A 

2 

851 

ARGTIGGTGTKMSNRWCREASHAGSWYTASGPQLNAQLEGWLSQVQ 
STKRPARAI IAPRRIFILGPSHHVPLSRCALSSVDIYRTPLYDLRID 
QKIYGELWKTGMFERMSLQTDEDEHSIEMHLPYTAKAMERCPVDWM 
Q I LPKKPLHCRGTGDL YDLF I S QLTGHKDE FT 1 1 P VLVGALS ES KEQ 
EFGKXiFSKYIiADPSNLFVVSSDFCHWGQRFRYSYYDESQGEIYRSIE 
HLDKMAI TELQKNGMNM S FSFLNYAQSSQCRNWQDS S VS YAAGALTV 
H 

2792 

A 

1 

1116 

MQVPCDHHHWDCAG SDLKPAMHWTLSKALS LRVTRI RDALGVGTQKG 
CHTDSSLSCSALKPSVDGKAKRALFVTHAFWGSRGRGQIjIjDAAVGWIi 
VFKPLLELYTGMSADEMICYSGKSTELSGKLVLNYVTjTVMEGERSRG 

vrpqqgndcddgflslrlgpqlnaqlegwlsqvqstkrparaiiapv 

GWTSATH I QRTE YGKRKI ATLQQTNLANTYYLNQTI KMQ I LPKKPLH 
CRGTGDLYDLFISQLTGHKDEFTIIPVLVGALSESKEQEFGKLFSKY 
liADPSNLFWSSDFCHWVYGELWKTGMFERMSLQTDEDEHSIEMHLP 
YTAKAMERCPVDWSRGPQLVRGLLGTGLCNRMRTVGQRALPT 

2793 

A 

53 

1408 

HRCCLWTFQCLS I SLG ITFLI KEGKEGKLE PAREGP YLVLLTTETAV 
LLI S ITIPTTSNSNPTLERFCGLGAEVTGKDPMGFFKVLRQSLDADE 
GTAVDIQSFRPQKDFGRRRTLEECHVTGKGGTGTKMSNRVVCREMH 
AGSWYTASGPQLNAQLEGWLSQVQSTKRPARAIIAPHAGYTYCGSCA 
AHAYKQVDPSITRRIFILGPSHHVPLSRCALSSVDIYRTPLYDIiRID 
QKIYGELWKTGMFERMSLQTDEDERSIEMHLPYTAKAMESHKDEFTI 
IPLLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQ \ 
SYYDESQGEIY*SIEHLiDKMGMSIIEQLDPVSFSNYLKKYHNTICGR 
HPIG\VLLNAITELQKNG\MNMSFSFL\NYAQSSQC\RNW\QDSSVS 
YAAWEHSRATEALNPQGCHLPHS S YS VRGS PA 

2794 

A 

53 

699 

SLPSIiTPPGSRQPPAGRLIjNTEHH INEH I IM YS LDLNS SSQMPSI\K 
AQTSDGEIFEVDGEMPNHLLPIKAHVE\DIiG\MDDEGDDDPVP\LPN 
VNAAILKKV\IQ\WCT/HPHKDDPP\PPEDDEEQRKSVPDDIPVW\D 
QG I PGKFDQGTLF * NS FLGLQTYFRTSKGLLDVTCKTVCQYDQKGKL 
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P*EIRKPFPKSKMTFTEEEEAQVRK\EN\QWCEEK 

2795 

A 

162 

522 

WVRRGT I S PARQRQDGVGAAPANYAKVLE WQLAPS PRHKCQQRPGTH 
LGCDSIGPLSPVGLPGLKG*PDNSSEGCPSS/QALPVPSSPGHPGSG 
GPKGQARSLLNCRGCPINWQHRCCRKI 

2796 

A 

2 

1214 

QPFQGSGCVIAILGRKMFSSVAH\LARANPFNTPH\ LQLVHDGLGDL 
RSSSPGPTGKPRRP\RN\LAAAAVEEQYSCDYGSGRFFILCGLGGII 
S CGTTHT\ ALVPLD \ LVKCRMKV\ DPQKYKG\ I FNG FS VTLKEDGV\ 
RGLAKGLAPTF\LGYSMQGLCKFGFYEVFK\VLYSNMLGEENTYLWR 
TSLYLAASASAEFFADIALAPMEAAKVRIQTQP\GYANT*EGISFPK 
C I KE KGL \ KAF YKGV\ ALLWMRQ I TYTMN * S S PCLERTV * SHCTSLG 
VPKP\ RS E \ CS \ KPEQTGCNHLVAGY \ I ARVFCA\ I VSHPC * FVWYL 
VLNKE * S VAVS \ LLVFS VTLDFKG V\ WKGLFARI I \ M IGT\ LT\ ALQ 
WFIYYSVKGYFRT/LPRPPPPRDGQSLLNNKFGLTPWRSKQIVD 

2797 

A 

276 

728 

C FG F VF PR S W P YG PLS TGD VPKLG PPN * AMGGFRPG \ KPQ VSL PGFA 
GGHACSPASLPSGPGPAPPRACLPSPRGGRKSLGAPGGFPPGDSHGA 
PCS R WTVS PGNDLKPGHS PRTLQRGGS LPGRRWVWSCG WCPVWWP 
VLSASISGWI 

2798 

A 

451 

1030 

VPEAIiSPFEMEGPQFPTMCCCSAPPAMPFPTT\*QPSEAAAGQGVPG 
PHRL * GGLAPAGRS * APAGAFWGQGPAQRGEPAYS S KGGAAASTQRV 
L*HRRGHRGDASRGCQPGDQNIiQ*KLCSSRAEGGAVSQPGWPALPHA 

AGEPLASALPGLHAAQAPAPAWSGPTTIRSCSGNSGKIFPSSCLDS 
CFPPS j 

2799 

A 

1 

933 

MLGAIILSKLTGTENQIPHVVIHKWQLNDESTWTQSGEQQTPGPVGG 
EIDLQILSKVQAQYPGVCINNEVVEPS/^EQIAKYKELVAKTSNIiENV 
KFAWHKETS SEYQSRMLEKKELQKWDFIHM IQML YYVKD I PATLKFF 
HSLLGTNAKMLIIVVSAHWSVGPTPTYPPGDYSSLLAVLPTSALMRQ 
YSLPIGSSGWDKLWKKYGSRFPQDDLCQYITSDDLTQMLDNLGLKYE 
CYDLLSTMDISDCFIDGNENGDLLWDFLTETCNFNATAPPDLRAELG 
KDLQEPEFSAKKEGKVLFNNTLSFIVIEA 

2800 

A 

1 

1563 

MRREGFKFSRSKVFVQDEMTKKWHKSDLHWVSRIQCTVLGVLTNATE 
MRSLLPIESLFLIQCFWSGASRNEATDCSGECYSSMRETRGSKECA 
AARFI KVKGKVLQKVES YKLYHP YFLANWFS AS FS KVGKLVEMRGQE 
EETLGTFAHVLMFQAASSRALVLPPTMQLELQNFGNTCSYRKPNMAS 
SMRSLFSDHGKYVESFRRFLNHSTEHQCMQEFMDKKLPGIIGRIGDT 
KSEIKIL\SIGG\GAGEIDLQILSKVQ\AQYPG\VCIQH*SCLRPSA 
GTNWPNYKELVA\ KTSNLENVKFAWHKETSSEYQSRMLGEKGAFKSW 
DF YS I * FKML YYVKD I PAT\ LKFFHSL\ LGTNAKMLI I VCVQGSSGW 
\DKLW\KKYGSRF\PQ\DDLCQYITSDDLTQMLDNLGIiKYECYDLLS 
TMDISDCFIDG\NENGDLLWDF\LTETCNFNA\TAPPDLR\AELGKD 
L\QEPEFS/ALRKEGKVLFNNTLSF\IVIEGITIQSQKYIQKLYFEQ 
LESLIYFHIKITNSSH 

2801 

A 

1 

6750 

MPTS ES EGG SGTQACRKTQPN I HD YYWNG I G VED W I AAER W I KFRFQ 
TMDGDWENS VSWADDENEAS IGSWSGASDKAGI IRSWAVACDETS VK 
S WAG ARAENW G I GTWARAGEQAS GGLWAGGQTS EGTWAGDKASGGA 
WTGAENQASGGSWALAGNQAIGELWAAGQASDGSWPGGQASGVSWVG 
EEAIGGSWTGAENQASEGSWAGAGAGNMSSVSYWAGWDQAGGGSWA 
GTNDPFGVRLHRTRNKEVMKAVALFTIAPVNLLKKTO 
SVRCIQETHLTCRDTHRLKIKGWRKIYQANGKQKKAGVAILVSDKTD 
FKPTKIKRDKEGHYLMVKGSIQQEELTILNIYAPNSGAPRFIKQVLS ! 
DLQRDLDSHTLIMGDFNTLLSTLDRSTRQKVNKDTQELNSALHKVPL 
IDNYRTLHPKSTEYTFFSAPHHTYSKIDHILGSKALLSKCKRTEIIT 
NYLSDHSAI KLELRlKNLTQNRSTTWKLNNLIiLiDDYWVHNEMKAE I K 
M FFETS ENKGTT YQNL WDTY KAVCRGKF I ALNAHKR KQER S KI DTLT 
SQLKELEKQEQTHSKASRRQE ITKI RAELKE I ETQKTLQKINESRS W 
FFERINKIDRPLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIR 
EYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRP ITGAE I V 
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AI INSLPTKKSPGPDGFTAEFSQRYKEELVPFLLKLFQS 1EKEGILP 

NSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILSKILAKRIQQ 

HIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRAKDKWHMIISID 

AEKAFDKIQQPFMLKTLNKLGIDGTYFKI IGAI YDEPTANI ILNGQK 

LEAFPLKTGTRQGCPLSPLLLNIVLEVLARAIRQEKEIKGIQLGKEE 

VKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAF 

LYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL 

LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKGVLGRRASSAVTRGP 

HVTRSCRATQPSLQSSLASGYQLPAALRSAGWHRARGKRSRSACMSA 

AGLKI TG S KETKRRLLL I S I DWS RDLMNLC I YFRVYCQE KQEERREL 

PRIITGPPPEAAWAFEWLKTSTIiTGLHPQLPLSLPQPECALPYLVR 

AFSRGDYMGRIQEVGWVTAGLVIWAGTCYYIYKFTKGRAQSVRTLAR 

NGSTVKMETWGVQSQTLAINEAE I KTKPQVE IGAETGARSGPRAEV 

ETKATAI A I HRANSQAKAMVGAEPETQS E S KVVAGTLVMTEAVTLTE 

VKAKAREVAMKEAVTQTDAEAGK I VKKE AVTQTKAKAWAIjVAKTEAK 

REAMTQTKAETHILAEKETE INRVMVTQS ETLAVPREVAKMGATNKT 

GIVDETKTRALEETVSVAKTQSEARPGATVDARGNPNGMSREVAGVD 1 

MKSCAQS QAVTKIQGDDMPGTGVEDMGNCKTMS RAESGADTRASAQP 

QI FAKTQTEAI PGAKIDAGGNTNAMWKI PMS YLKAITGADMRAAAQP 

QAVASTHAEAMSDAKVKNRGNPNAMTKAGAKAI^IjRANS qvealpdar 

DKSRGNPm^MAKVGDGTDMLSCTQPQLVASVQADTLSDGKIKVRGNV 

NSMPKEGAGVDMKAQDDNRRVSQVASVKSTDGKTEDRTLRKPSTCGA 

LAENSAELKPRLELPSTIKACEVTTLTHKLWDNSVKAGTLITKLFLS 

SAQSMKIFCCLWNFVYKQLEDAAQGLTMGGDVEEHESRLGAGEEAGV 

ESWTLARNVGEDELSRESSPDIEEISLRSLFWAESENSNTFRSKSGK 

DASFESGAGDNTSIKDKFEAAGGVDIGSWFCAGNENTSEDKSAPKAK 

AKKSSESRGIYPYMVPGAGMGSWDGAMIWSETKFAHQSEASFPVEDE 

SRKQTRTGEKTRPWSCRCKHEANMDPRDLEKLICMIEMTEDPSVHEI 

AimAIiYNSADYSYSHEVVRWGGISVIESLLNNPYPSVRQKAIiNALN 

NISVAAENHRKVKTYLNQVCEDTVTYPLNSNVQLAGLRIiIRHLTlTS 

EYQHMVTJT/ISEFLRLLTVGSGETKIDHVLGMIiLNFSKNPSMTKDLLI 

ANAPTSLINIFSKKETKENILNALSLFENINYHFKRRAKAFTQDKFS 

KNSLYFLFQRPKACAXKLRALAAECNDPEVKERVELLISKL 

2802 

A 

3 

91 

KKNI N * E I LN * KQMEE PTL YVGNFP I QLT 

2803 

A 

3 

7501 

GAGEEGELGGDARSSRDPELQSYAAFVAWTRIWLPAPRCPALGGLA 

SGPGKAEQFSRSLYLPDHLGEGNGGLLGKSLEPYRSACMSAAGLKIT 

GSKETKRRLLLISIDWSRDLMNLCIYFRVYCQEKQEERRELPRIITG 

PPPEAAWAFEWIiKTSTIiTGLHPQLPLSLPQPECALPYLVRAFSRGD 

YMGRIQEVGWVTAGLVIWAGTCYYIYKFTKGRAQSVRTLARNGSTVK 

METWGVQSQTLAINEAE I KTKPQVE IGAETGARSGPRAEVETKATA 

I AIHRANSQAKAMVGAEPETQSES KWAGTLVMTEAVTLTEVKAKAR 

EVAMkEAVTQTDAEAGKIVKKEAVTQTKAKAWALVAKTEAKREAM 

TKAETH I LAEKETE INRVMVTQS ETIAVPREVAIQ^GATNKTG I VDET 

KTRALEETVS VAKTQS EARPGATVDARGNPNGMSREVAGVDMKS CAQ 

SQAVTKI QGDDMPGTGVEDMGNCKTMSRAESGADTRAS AQPQ I FAKT 

QTEAI PGAKI DAGGNTNAMCKVGAGADVRAC I QPQTVAKKQAEVTSG 

ARVDGRGNTOTISKAITGADMR7VAAQPQAVASTHAEAMSDAKVKNRG 

NPNAMTKAGAKANLRANSQVEALPDARDKSRGNPNVMAKVGDGTDML 

S CTQPQLVAS VQADTLSDGKI KVRGNVNTMPKEGAGVDMKAQGMAQS 

UUfcJUjFN i KGKARGKAKAKCKTGPGMDMKTCTQPQAGVKTPAEALLD 

SRVDGRGNPNATSKAGTKADQRVCGQPLWANPQGEALPGAKNKVKG 

NPHTVLKVGAGEGTTDSAQPEAWSFQGEALLGTKNKVKGNPNVVLK 

AEVGEGAMGTAQLQIMASSKGEALLDSKNKVKGNSNAVSKAGAGTDT 

TGSVQPQIVANSQGEVLPGAKNKIRGNPTTVPNSGVGPYTTDSARLQ 

AVAN SQGE VLPGAKNKVKANLNAVS KAEAGMGATGS VQPQAVANSHC 

ETLPGAKNKVRGNWNAVSKAGAGMDTRGSAQPQAVANSQGEVLPGAK 
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NKVKGNPNWSKAGAREDTVGSTQPQVIASSQRETLPGARNKVKGNS 
NWS KAGAREDTMG S AQPQWANS QRETL PGARNKVKGNSNAI S KAE 
AGAG I MG S VQVQ WAS FQGEVLPG AKNKVRGNS NAVPKAEAGADTVG 
SAQPQAVANSQSETLLGARNKVKGNTIAVPKAGTGAGTRHSAQPQIV 
AGSQGETLPGARDKSMSTSEAEATAEDEAYAKPEAEAMPTSESEGGS 
GTQACRKTQPNIHD YYWNG IGVED W I AAERVf I KFRFQTMDGDWENS V 
SWADDENEASIGSWSGASDKAGIIRSWAVACDETSVKSWAGARAENV 
VGIGTWARAGEQASGGLWAGGQTSEGTWAGDKASGGAWTGAENQASG 
GSWALAGNQAIGELWAAGQASDGSWPGGQASGVSWVGEEAIGGSWTG 
AENQASEGSWAGAGAGNMSSVSYWAGWDQAGGGSWAGTSDQSGGGS 
KPRFEDQA5GEGSWAGAGGQASGGSMLGPEDQSSGRSWADTADQASG 
GSRLGHVDQSSGGAWAGTLDQSGGGSKPRFENQTTEEGSWAGAGGQA 
GGGSKVGPEDQSSGRSWANSGDQISGGFLVGIVDQANGGSWTGAGHP 
AS VGPKP I FEDQVSGRGS WADAREQ WGDSRLGLRDQS SGDSWADTG 
DQASGWFCVCPGSQTNGGSWGGASGQDVGGSRPGPTNQSSAGSWDSP 
GSQVSGSCWTGAGAVDQAGGCSKPGFEDQAIGGGFWPGAGDQTGGGS 
RPGSEDQSSGIGSWGVAGGQVLGGARPGPADQSSGGSWAGTGNQSSG 
RSWIGPGDQAVDCSKPEFEDQACGGGSWAGAGSQASGESWAGSRPGN 
EAIGGSRMGSEDQATGGSWARSEDQASGRFQVSFEVEANEGFWFGPG 
AEAVIGSWCWTEEKADIVSRPDDKDEATTASRSGAGEEAMICSRIEA 
ENKAS SGS W I RS EEVAYMGSCVGAEAGAGAEAGAGAEAGAGAGAEAG 
AEAGAGAGAGPGTESGAGIWSWDGDATTVESRLGAGEEAGVESWTLA 
RNVGEDELSRESSPDIEEISLRSLFWAESENSNTFRSKSGKDASFES 
GAGDNTS I KDKFEAAGG VD IGS WFCAGNENTS EDKS AP KAKAKKS SB 
SRGIYPYMVPGAGMGSWDGAMIWSETKFAHQSEASFPVEDESRKQTR 
TGEKTRPWSCRCKHEANMDPRDLEKLICMIEMTEDPSVHEIANNALY 
NSADYSYSHEWRNVGGISVIESLIiNNPYPSVRQKALNALNNISVAA 
ENHRKVKTYLNQVCEDTVTYPLNSNVQIjAGLRL I RHIiT I TSE YQHMV 
TOTISEFLRLLTVGSGETKDHVIjGMLLNFSKNPSMTKDLLIANAPTS 

lini fskketkeni lnalslfeninyhfkrrakaftqdkfs knslyf 
lfqrpkacakklralaaecndpecleaaetaagnsaqgkfykgggpl 
gvdpksip 

2804 

A 

3 

3223 

pppeaawafewlktstltglhpqlplslpqpecalpylvrafsrgd 
ymgr i qevgwvtagiiv i wagtc yy i ykftkgraqs vrtlarng s tvk 
metvvgvqsqtlaineae iktkpqve igaetgarsgpraevetkata 
iaihransqakamvgaepetqseskwagtlwteavtltevkakar 
evamkeavtqtdaeagkivkkeavtqtkakawalvakteakreaotq 
tkaethi laekete inrvmvtqsetlavprevakmgatnktg ivdet 
ktraiieetvs vaktqs earpgatvdargnpngmsrevagvdmks caq 

SQAVTKIQGDDMPGTGVEDMGNCKTMSRAESGADTRASAQPQIFAKT 
QTEAI PGAKIDAGGNTNAMWKI PMS YLKAITGADMRAAAQPQAVAST 
HAEAMSDAKVKNRGNPNAMTKAGAKT^NLRANSQVEALPDARDKSRGN 
PNVMAKVGDGTDMLSCTQPQLVASVQADTLSDGKIKVRGNVNSMPKE 
GAGVDMKAQDDNRRVSQVASVKSTDGKTEDRTLRKPSTCGAIiAENSA 
ELKPRLELPSTIKACEVTTLTHKLWDNSVKAGTLITKLF1.SSAQSMK 
IFCCLWNFVYKQLEDAAQGbTMGGDVEEHESRLGAGEEAGVESWTLA 
RNVGEDELSRESSPDIEEISLRSLFWAESENSNTFRSKSGKDASFES 
GAGDNTS I KD KFEAAGG VD IGS WFCAGNENTSEDKS APKAKAKKS SE 
SRGI YPYMVPGAGMGSWDGAM I WS ETKFAHQS EAS FPVEDESRKQTR 
TGEKTRP WS CRCKHEANMD PRDLEKL I CM I EMTEDPS VHE I ANNAL Y 
NS AD YS Y SHE WRNVGG I S V I E S LLNNP YPS VRQKALNAJLNNI S VG\ 
S ENHRKVKTYIJ^Q VCEDTVTYPLNSNVQLAGLRL IRHLT I TS \ E YQH 
MGYKLLFQDFFVC*RWGSGRNLKDPV\LGMLLNFSKNPSMTKE\LIiI 
ANAPT\YLGNIFSKKETKENIL»NA2jSLFENINYHFKRRAKAF/NPRD 
KFSKNSLYFLFQRPKACAKXLRALAAECNDPEVKERVEIjIiISKL 

2805 

A 

1 

374 

NYFLSLRFDWHS S KS * GHPTCVHCF * / TKNE / CASCHCGAGLLGFLG 
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GSGEGASHPCPSCLPLEVRSLHCSLGHCGECMPGASGWGPWTGPGHC 
PNLCQDKGS KR I S WRVGRNGWGRPS FDNAPVAIi 

2806 

A 

162 

789 

AGQNGVSGSAERCLSCLLIPLALWS I IENILLYI PNGHTFYASSNKL 
TNYVWYSERI *LSGIM\I^I\TPTVLLVIjENNNNYKC\CQSENCSKKY 
VTLLSIIFSSLGIAFSGYCIi\VISALGSCP*AIC/LAPLMGWGVCF* 
-RAL.IiGRFLTDS\SIWDSVPGNLAHWEWNIILFSILITLSG\VQVII 
CLIRWMQLSKILCGSYSVIFQPGII 

2807 

A 

1 

3282 

MVWRVPPFLLPILFLASHVGAAVDLTLLANLRLTDPQRFFLTCVSGE 

AGAGRGSDAWGPPLLLEKDDRIVRTPPGPPLRLARNGSHQVTLRGFS 

KPSDLVGVFSCVGGAGARRTRVIYVHNSPGAHLLPDKVTHTVNKGDT 

AVLSARVHKEKQTPVIVJKSNGSYFYTLDWHEAQDGRFLLQLPNVQPP 

SSGIYSATYLEASPLGSAFFRLIVRACREGRFGQSCQEQCPGISGCR 

GLTFCIiPDPYGCSCGSGWRGSQCQEACAPGHFGAPCRI»QCQCQNGGT 

CDRFSGCVCPSGWHGVHCEKSDRIPQILNMASELEFNLETMPRINCA 

AAGNPFPVRGSIELRKPDGTVX.IiSTKAIVEPEKTTAEFEVPRI.VLAD 

SGFWECRVSTSGGQDSRRFKVNVKVPPVPLAAPRLLTKQSRQIiWSP 

LVSFSGDGPISTVRLHYRPQDSTMDWSTIVVDPSElfVTLMNIjRPKTG 

Y S VRVQL S RPGEGGEGAWGP PTI.MTTDCPEPLLQP WLEG WHVEGTDR 

LRVSWSLPLVPGPLVGDGFLLRLWDGTRGQERRENVSSPQARTALLT 

GLTPGTHYQLDVQLYHCTLLGPASPPAHVLLPPSGPPAPRHLHAQAL 

SDSEIQLTWKHPEALPGPISKYVVBVQVAGGAGDPLWIDVDRPEETS 

T I IRGLNASTRYLFRMRAS IQGLGDWSNTVEESTLGNGLQAEGPVQE 

SRAAEEGLDQQLIIAWGSVSATCLTIIAALLTLVCIRRSCLHRRRT 

FTYQSGSGEETILQFSSGTLTLTRRPKLQPEPIiSYPVLEWEDITFED 

L IGEGNFGQV I RAMI KKDGLKMNAAI KMLKEYASENDHRDFAGELEV 

LCKLGHHPNI INLLGACKNRGYLY IAIEYAPYGNLiIiDFLRKSRVIiET 

DPAFAREHGTASTLSSRQLLRFASDAANGMQYLSEKQFIHRDIjAARN 

VLVGEI^J^KIADFGLSRGEEVYVKKTMGRLPVRVmAIESIiNY 

TKSDVWSFGVLLWEIVSLGGTPYCGMTCAELYEKLPQADRMEQPRNC 

DDEVYELMRQCWRDRPYERPPFAQlALQLGRMLEARKAYVNMSIiFEN 

FTYAG IDATAEE A 

2808 

A 

136 

447 

PRCPHNLPLSLCPGLQAEGPVQESRAAEEGliDQQLILAWGSVSATC 
LTIIAALLTLVCIRRSCLHRRRTFTYQSGSAFL»GSGAS1»CLHTSLAD 
VLSPAVATCQ 

2809 

A 

3 

3593 

SSDPRPSSFWLVWVGLWSMVWRVPPFLLPILFLASHVGAAVDLTLLA 
NLRLTDPQRFFLTCVSGEAGAGRGSDAWGPPLLLEKDDRIVRTPPGP 
PLRLARNGSHQVTLRGFS KPSDLVGVFSCVGGAGARRTRVI YVHNS P 
GAHLLPDKVTHTVNKGDTAVLSARWKEKQTDVIWKSNGSYFYTLDW 
HEAQDGRFLLQLPNVQPPSSGIYSATYLEASPLGSAFFRLIVRGCGA 
GRWGPGCTKECPGCLHGGVCHDHDGECVCPPGFTGTRCEQACREGRF 
GQSCQEQCPGISGCRGLTFCLPDPYGCSCGSGWRGSQCQEACAPGHF 
GADCRLQCQCQNGGTCDRFSGCVCPSGWHGVHCEKSDRIPQILNMAS 
ELEFNLETMPRINCAAAGNPFPVRGSIELRKPE)GTVIjLSTKAIVEPE 

kttaefevprlvladsgfwecrvstsggqdsrrfkvinncvppvplaa 
prlltkqsrqlwsplvsfsgdgpistvrlhyrpqdstmdwstiwd 
psenvtlmnlrpktgysvrvqlsrpgeggegawgpptlmttdcpepl 
lqpwlegwhvegtdrlrvswslplvpgplvgdgfllrlwdgtrgqer 
renvsspqartalltgltpgthyqldvqlyhctllgpasppahvllp 
psgppaprhlhaqalsdse iqltwkhpealpg pis kywe vqvagg a 
gdplwidvdrpeetsti irglnastrylfrmras iqglgdwsntvee 
stlgnglqaegpvqesraaeegldqqiiilawgsvsatcl.tilaall 
tlvcirrsclhrrrtftyqsg\sgeetilqfssgtltltrrpki*qpe 
plsypvleweditfe\dligegnfgqviramikkdglkmnaaikmlk 
eyasendhrdfagel\ evl\ cklghhpni inllgacknrg ylyi aie 
yapygnlldilrksrvletdpafarehgtastt,/saprqllrf\asd 
a\angm\qlpgvrkavhstgdlglareccwsgenlglqdlqtfglfs 
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/ RGEE VYVKKTMGRLPV\ RWMAI E \ S L \NYS VYTTKSDVWS FGVLLW 
E\IVSLEGTP\YCGMTCAEALMKSLPQG\YPHGSKPRN\CDDEVYEL 
\MRQCW\RDRPYERPPF\AQIAL\QLG\RML\EAREGPM*NHVAV*K 
TFTYAGN*MPTAEEGPEAAHPSPEPWVLLAGSKTLLS 

2810 

B 

85 

283 

MEKTEL I RRSAWRVI SSI EQKTDTSDKKLQL I KD YREKVESELRS I C 
TTVLELLDKYLI ANATNPEX * 

2811 

A 

1 

932 

KAKSRS KEVGLASRPRRRGSSRGS SRCGS ALALALLALRPGPGPGPA 
PAMEKTELIQKAKLAEQAES/RYDDMATCMKAVTEQGA/QSLSNEER 
NLLS\VAYKNWGGRRSA\WRVISKHPSRKTDTSDK\iOiQLIKDYRE 
KVD\SE\LRSl\CTTVLGieLDKYF/lLANGNLIPESK\VFYL\KMK 
G * FTFR\ YLCLEVACC\ DDRK\ QT IDN\ SQGAYQEAF\DI SKK\ EMQ 
PT\HPIRLGALLLTFSVFYYEI\LNNPEACLHAWLKTAF*WRAIA*L 
DTLNE\DSYKDST\LIMQLLRDN\LPLWTSDSAGEECDAAEGAEN 

2812 

A 

199 

214 

PPS PLNS STCCLAKTDEK* CKEKGSHLLH 

2813 

A 

1 

2176 

QGPLKFMDAG I E \ FSQDE WKCLDPAQRTLYRDVMLENYRNLVSLG IC 
LPDLSVTSMIEQKRDPWTLQSEEKIANDPDGRECIKGVNTERSSKLG 
SNAGNKPCKNQLGFTFQLHLSDLQLFQAERKISGCKHFEKPVSDNSS 
VS PLEK I S S S VKSHLLNKYRNNFDHAPLL PQEQKAH I RE KAYKCNEH 
GQVFRASASLTNQVIHNADNPYKCSECGKVFSCSSKLVIHRR>3HTGE 
KPYKCHECGKLFSSNSNLSQHQRIHTGEKPYKCHECDKVFRSSSKLA 
QHQRIHTGEK^YKCHECDKVFNQIAHLVRHQKIHTGEKPYSCNKCGK 
VFSRHSYLAEHQTVHTGEKPYKCEECGKAFSVRSSIilTHQLIHTGRK 
PYKCKECDKVFGRKCFLTSHQRIHTRERPYGCSQCGKIFSQKSDL\* 
RHRKTHTDEKPYNGNKCGTAFREFSDLTAHFLIHSGEKPYECKECGK 
VFRYKSSIiTSHHRIHTG\EKPYQM/CNRCGKVFSRSSNLVCHQKIHT 
GEKPYKCNQCGKVFNQASYLTRHQIIHTGERPYRCSKCGKAFRGCSG 
LTAHLAIHTEKKSHECKECGKIFTQKSSLTNHHRIHIGEKPYKCTLC 
S*GLSVTILTLHSFRGFHHEGPYKLC/DGKTIIMSSSINQHQ*VHTK 
WKSYK*NVCDTGFIKACQITGHIfflITVEDESTQMNCVYI,GYYSRTIA 
IEHDR I YTRSNSVSGSLIL I YD IA 

2814 

A 

3 

276 

VFSCGGCWSVTVKMLRNLLALRQIGQRTISTASRRHFKNKVTEKQKX, 
F/QGTAYAIYELAVASFP/NEAGVTSVIPAIAWFSFIQLSMDQ*SDK 

2815 

A 

1 

566 

MDECRKKERKPLFGPERFKFVNAYARLGGSSGSGVAIFVGGVQLWRL ! 
LVSNSQDAAESAGRYWVDVRSGEAGEVTRDSWALRPDLGQRTD*ATA 
S PQG I FKNKVPGEAKNCS PEDDE I P \ LYLKGG V\ ADALL YRATH \M I 

LTVG\GTSICPYYELAVASF\PKKAGS*LQVIPSNSLWFQFSFQLFY 
GTK 

2816 

A 

26 

736 

EP I PVTPDHRLVTMTHIVANLLS CQQRPRLPNYEMLKEEQEVAK\ LG 
APQKPCLPECPPVIH\IRNE\TSWPDHVRLVPVQ\HPLHEHLLRLGF 
IRFA\YSVKSLRDRK\MVGEVNRGPKA*CLHRQSA*KIWGPDFG\HL 
HEPILAHQSSPPKFWVVPGPAl\DREASIi\RPGALPVTC\lPRTIi\S 
SIPRPAPRGQEVLPLTCIPLTPPSIPRPVPTAESCISPLSSHAFLQW 
HSIKCICFW 

2817 

A 

748 

1637 

DIMESGFTSKDTYLSHFNPRDYLEKYYKFGSR\HSAESQIIiKHLL\K 
NLFK\IFCL\DGVEGETCLI\DIGFWAPLILSSSSSCFVESF*GRIV 
RSLDYFRTQEPGRELGRKW\LEGKEARGLFDWPPMVTPMCLNLBRE 
TE VKGPRRRRKKLK\ QAVKARCLKM * M * TQKPSHLGAVPL\ P PG * TA 
VPQAHLCLGCPPCPRPPQPTCKGASGTLGQPYLKPRGPSLVIMDAAQ 
EQPTYM IGEQKFS \ SLP\LGREAVKA\ AVKEAGYTI EWFEVI SQSYS 
STMANN \ EGLFS LVARKLSRPL 

2818 

A 

1 

1538 

MENP I VKS LAKARERLEDS KLE AVSDNNLELVNE I LED I T PL INVDE 
NVAELVGILKEPHFQ$LLEAHDIVASKCYDSPPSSPEMNNSSINNQL 
LP VD A I R I LG I HKRAGE PLGVTFRVENNDL V I AR I LHGGM I DRQGLL 
HVGDIIKEVNGHEVGNNPKELQELLKNISGSVTLKILPSYRDTITPQ 
QVFVKCHFDYNPYNDNLIPCKEAGLKFSKGEILQIVNREDPNWWQAS 
HVKEGGSAGLIPSQFLEEKRKAFVRRDWDNSGPFCGTISSKKKKKMM 
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YLTTRNAEFDRHEIQIYEEVAKMPPFQRKTLVLIGAQGVGRRSLKNR 
FIVLNPTRFGTTVPFTSRKPREDEKDGQAYKFVSRSEMEADIKAGKY 
LEHGE YEGNLYGTK1DS I LE WQTGRTC I LDVNPQALKVLRTSEFMP 
YVVFIAAPELETLRAMHKAVVDAGITTKLLTDSDLK\ KTVDESARIQ 
RAYNHYFDLI I INDNLDKAFEKLQTAIEKLRMEPQWVPISWVY 

28X9 

A 

1 

1749 

EFVLNYEPRVCRWGCSAAPVAEGEQRRRGATGSGGSGGAEAAEVRAA 
MQQVLENLTELPSSTGAEEIDLIFLKGIMENPIVKSLAKAHERLEDS 
KLE AVS DNNLE L VNE I LED I T PL I NVDENVAELVG I LKEPHFQS LLJ2 
AHDIVASKCYBSPPSSPEMNNSSINNQLLPVDAIRILGIHKRAGEPL 
GVTFRVENNDLVI ARI LHGGM IDRQGLLHVGD 1 1 KEWGHE VGNNPK 
ELQELLKN I SG SVTLKI LPSYRDT I TPQQVFVKCHFDYNP YNDNL I P 
CKEAGLKFSKGEILQIVNREDPNWWQASHVKEGGSAGLIPSQFLEEK 
RKAFVRRDWDNS GDELDTI KFDRHE IQ I YEEVAKMP PFQRKTLVL I G 
AQGVGRRSLKNRFIVLNPTRFGTTVPFTSRKPREDEKDGQAYKFVSR 
SEMKAD X RAG KYLE \ HGE YEGNL YGTKI DS I JjEWQTGR It- X XiD VN f 
QALKVLRTSEFMPYVVFIAAPELETLRA\MH\KAVVDAGIHHPRLi*P 
NSDLKKTVDESARIQRAYNHYFDLDPS INDNPRPKPFEK\LQTA\ IE 
KPENGPPQWVPQS AWVYLM I Q 

2820 

A 

14 

230 

FATLYPLKI Fl FPKAFNPWRE VGPLCPPPK/ CKGPS PKS PGW/ CLNR 
PPIGKKLIPCKPRLTWAPPRIL*NGPP 

2821 

A 

170 

303 

RPLRGTPC*CAAWASSWV3GCCSACCSHCCLLGPCWHSGPCSA*WAFW 
CSGG 

2822 

A 

208 

547 

AAPG AMG PW PDRLCGS ALPS CRLP CY S S AS VSLRCAAENTFTQAGPC 
C WAW PHRAH QD F PLQ I P FRHLEAPRC S P P L S A W P PRHLL F AGD PDLQ 
QEPPG I RGAP * RPLRGTPCCGS \ CGVAAG VRGCGPVWL* HPEP ADSH 
TATAEPRI*AAPGAMGPWPDRLCGSALPSCRLPCYSSASVSLRCAAE 
NTFTQAGPCCWAWPHRAHQDFPLQIPFRHLEAPRCSPPLSAWPPRHL 
L PAG D PDLQ QE P PG 1 R G A PL S LRRRL LL 

2823 

C 

48 

173 


2824 

A 

183 

423 

HGLFSVEENEENGEQEADNEVDEEEEEGGEEEEEEEEGDGEEEDGDE 
Dfc.JiAiloAlolO<AAij]JJJljL>JLJ 1 i\JK.y Is. 1 L)h.lJlJ 

2825 

A 

147 

588 

TIiAFFLIPCIGSPACPTMSDAA\VDTSSEITTKX>\LKEKKEVV\EEA 
ENG / REDRPCLTGNAENE\ ENG\ EQEA* QLR *DEB / EREGRLGREGG 

KllvjK Jvblj i-» VlSc.E*U(jKlilvX CjC>AJ!joA113IS\KA1 J& \t>JJiii^l^iJ^Kx yKoKrt 

2826 

A 

1 

77 

SGRPFFFFFFFFFFF*CKGFLIGYM 

2827 

C 

175 

351 

MKLIQLSCQPFNSRHHVLKKKPLTFMILSHFQASLSSIFGFCSSLFR 
QSSFr SNXJbr r * ! 

2828 

A 

56 

380 

KYL VDQHAG S P P VKN I LAEAVQN I VQ I LH * E I QKSVNHR YML * KLDT 
XJSXi* rl X vX»y \Ciir 1 ±Lj\iXJ\VUv x JvclcE x KXJKJJurJloiJf oIM x oonyvrr 
LFLLI FNKDNMA IN 

2829 

A 

3 

129 

GDCRLDPLHLAWNQPLWLS I LPS INCLRYHN I PS YMYQNS Y 

2830 

A 

472 

756 

S SADSKCTLTI GKPAKS RVQLCHKEETFPNQTNAYRTCVD * CLMSPT 
LLGTSIF*SQMIIIRWPICMPNIFLIISC*VLCIRKPVSELPHLLG 
F 

2831 

A 

1 

170 

NSTTNCLVISNPQGLQYKGKANLHDSRDLGEFSNLTDFKEKSHVVHI 
LFVLHFYYS 

2832 

A 

1243 

5595 

IME I EEQKPKQLELLEQ I EQQKLRLETDC FRAQLEEEKRKKTQPTGV 
GCR KS HHY I NHL VG I APAS C P V X S DED S HRQM I RNYQHQ LLQQN KX»H. 
RQSVETARKQLLEYQTMLKGRCPSVSAPSLITDSVISVPSWKSERPT 
AISEHWDQGQRLKLSPNKYQPIQPIQTSKLEQDHFQVARQNHFPQRQ 
VETTETLRASDILTNQALESQEHLRQFSQTETQQRDYKLVPKDSETL 
SRALSHDRQLISQDARKISETFGATTFQSLESQQLFSENSENISYHL 
TEPSSFVPLVPQHSFSSLPVKVESGKIQEPFSAMSKSTVSTSHSIIS 
QMHDRPLLPSENITAQQGNMKALQEQLDLQKKVLQATQEAQEQLLLC 
KQKEVEQQTGLSVFLPLVTPDSSALLPSAKADLGRIQESSPTKNNIA 
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VSSDHHVISQLQDKRLSLSQPILSQQNNFKFLQEQLNIQKDSLQARR 
EAQEVIiYVHKQSELDRRVCSEQAEPSFPFQVAQHTFTSLPSADTKSG 
KIQEQHSSKSEKGLVSCQSDIPISQDGSUSFLQQFLPLHDSLKLLQB 
QLTKQRDTLQARHEAQVELLLHRQRDLGDS KS GIiVS SSSS P VWQHS 
VASQASAKAEPRRIQELYLSEKENVGPSCHLI I PTFQDKSLSFPQHS 
LAQQENLTILQEQSQIQRVILGAKEGTQEFVHTESELEKRISSEQTG 
TSSSLSQVDESERFQECISIKSDSTIPLSHPKIPRCQERLLRVSQHM 
LPIiQDNLEEHQA WLDTE KEAFH FSQKTQENTS SEQTGS SSFI PQLVQ 
LSFTSLASAESGTILEPLFTESESKIFSSHLQIPQLQDRLLRISQLI 
QPQQDNLKALQEQLATQREAIJLARQEAREELLLHQSEWEGRISPEQ 
VDTSSLPLVPQHSFASLPLNESERNQEPCSINSDNIVSSGHSEIPTL 
PDGLLGLSHLVLPQQDNL I ALEEHLHAQTDFLPS I EKTQKELVLSKP 
CKFEEKVSSEHFIQSHHGDLQALQQQLDTQKKAIRSIQEVQEELLLQ 
RL S ELBKRVS S EQVCS S S FVSQVPVADS ERTQKS FPTKSNDTLPS SH 
REIPRLQDRLLSLSKPILPQQDNMTAQLDAQREVMYSYEKPQEELSL 
NKQRKLNKSESAEHTIPSLFLPKETEHSFIPLPFAEAKPKSTCELYS 
S QNEHAAP P S NP V I PG FQDRLLS FSQ S VLTQQDNIiGXjQKQLDLQREV 
LHYSQKAQEKLLVQRQTALQQQIQKHEETLKDFFKDSQISKPTVEND 
LKTQKMGQL.RDW F PNTQDLAGNDQEN I RHADRNNS DDNHLAS EDTS A 
KQSGEHLEKDLGRRSSKPPVAKVKCGLDIiNQHELSAIQEVESPAIGR 
TSILEENLLPSREGPSPGRIHHLLIKEPLGPSSGSQWYAMGLRQIP 
SFSKVFGI 1 1 RVATLIAVKGTS ATVVNKALAHMKCTIjQREG 

2833 

B 

2177 

- 

3174 . 

MGKTTPMIQSRPSFDTWELQVPPSTCGDYNSSFIKELRASQSNVIKP 
PGPVXLKESVVENHAVLSYAVEEEHAYLGPTVKPDDKAKTLSYEPLS 
SATVSTGSLLSYENTDLSLTDPESFSEHMDDSKQESTTSKEEETNII 
SS IVPSTQDIYQRQN^SDVHKSLIjPAVDETTCGHTHFQQMIDKALYQ 
SGKPETGHGIMEEPELTLISTTDTSIAEMDFANLTLEEKSENEAKCF 
FQOSKTRLKRLKRVKHATFLPLATEASDYPAVSELSIEKPRTASTET 
PRRLTPVPGSLQEAFIKRKKSFMERSHQRQKEIRNKIHVSENSQIKT 
VKEKPSISSIL* 

2834 

A 

2268 

2547 

KYLVDQHAGSPPGKSLCCLEKSQRMMMI SQNLDWLRV* TADDAN* E 
KCVPI LANLFFQVSNLLH* \ VLVYHLLKVCVTTCS FI HCRQQRFMNV 

2835 

A 

137 

716 

TTTMSSKKAKTKTTKKRP\QRATSNVFAMFDQ\SQIQGVQRGPFNMI 
DQ\NRDGFHSDK/ EKVLHDMLA\ SLGKNPH * WHYLDAM\ MNEAPGPI 
QFS PCFL\ TLFG * GSLNGHQ I P * RCS FRKRLAFALDE \ E ATGHPFRE 
DYL/ REELLTNPWGDRVYQIEEVG * TCTG EAP I * QKGGNSNYHRS FT 
RHPNWKAPKAQR 

2836 

A 

2 

437 

LVIIESDLERAEERAELSEG\KSDSWKNN*E*WIRP*KH*WLQRISK 
CAELEEELKTVTNNLKSLEAQA^KYSQKEDRYEEEIKVLSDKIiKEAE 
TRAEFAERSVTKLEKSIDDLEDKFLCFTSPKTPSSSWMSHLSELCIC 
LFSS 

2837 

A 

1 j 

834 

GSRVRRAAAGLSHCSPPARLPSGAMAGSSSLEAARRKIRSLQEQADA 
AEERAGTLQRELDHERKLRETAEA\DVASIiN\RRIQAWFEEEFGFVP 
KERLATAF\QKL\EEAEKAADESERGMKVIES\RAQKDEEKMEIQEI 
QLKEAKH IA\ EDADRKYEEVARKL VI IESDLERAEERAELSEG\QVR 
QLEEQLRIMDSDL\KAIiM7UVEDKYSQKEDRYEEEIKVLSDKLKEAET 
RAEFAERS VTKLEKS IDDLEEKVAHAKEENLSMHQMLDQTIiLELNNM 

2838 

A 

9 

1108 

PDRATRPRPRARPADPAJjVLGPLAAVTMGAGKKKMRMLKIjDKENAIjD 
RAEQAEADKKAAEDRS KQLEDE LVS LQKKLKGTDR * TGQ I L * RLSKM 
PKEETWNWQEKKATDAEAD\VASIiNR\RIQIiVEEEI*DRAQERLATA3j 
\QKLEEAEKAADE\SERGMKVIES\RAQKDEEKMEIQEIPT*KRAKH 
IAEDADRK\ YEEVARKLVI I ESD\ LEPCRGSGAELSEGKCAEA* KEE 
FENL*PNNFEVHLEGSRLRKYSQKERPDMEEEIKGPFPDKA*RRLET 
RAWSFA\ERSVT*N1iEKSIDD\LEDRAVTLQETEVQSHQRRSWDHAIi 
NGYDFHISFFASLLPRLPRRSWMSHLSELFHLSYSPADPGSLS 

2839 

A 

3 

268 

SGRCCECRIGTVISKSGIGGKRDKLSRYQTVTCQAAQQLPSTCLSHR 
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SLHGKGKGKGKDDGRGRGPGL.VAPSPSALKLLNPVWLSLRD 

2840 

A 

500 

682 

RTSLPCCSPAQEPCRRTAWPTSMATTPTPSLRNTASGTCGSSSTSMW 
KRWTCS RVTLKRN 

2841 

A 

1617 

2086 

GLAASSRKHQMVDDADAVGGPGGPGMAGCDGLPGGFGSDIRPRLWLW 
LGPRLWSS *RGT\NI ILASVPKKLLLM/DCYAIiARDRTATLG\TFDA 
' I S KTAS NW I PDLWKETVFTKF P YQE FTDHL / VKTHTRVS KQKTQ VP A 
VATTAARQTMEGQAPVEYI 

2842 

A 

55 

665 

AMAEDHGLSNGDGPVEVTQGLELLVSAAAQDIVLLDGIQGLFLTLQF 
DNIWVWDHFLGKLPHRVFKGGGEKQHLAVPGQHPLDADALILVALSG 
YHDI SL I QNKHLDFLGINEIA DLEHQS S TVPGVPMTICSIj I FWPLSN 
LLPRTA*ASFSSG*NLPICSITFPV*SASSYVGERHRHWGYLSDGST 
RLSMAKTKAAVLPVPD 

2843 

A 

43 

548 

I/TGP ITMAQAM I FCHTRKTASWLAAELS KEGHQVALLSGEMMVEQRA 
AVIERFREGKEKVLVTTNVCARGEQRTCPTWSARIiGVPGPIRARNPC 
IQGSRMVSGRWVGLVTLFLSRRDCLDFPEWRTHTCQVRWLMPAIPR 
QENCLNPGDRGCSEPRMCYSTPSWVTE 

2844 

A 

1 

876 

MLKNEEPRRRPIMSNHTATHILNFALRSVLGEADQKGSLVAPDRLRF 
DFTAKGAMSTQQIKKAEEIANEMIEAAKVTGSWERDPERTRVIiSIiDV 
GHAAGKGRMRGDKDLASHLQTVQPRPCSEWDLLNGHSRTLEEAPLSQ 
CCWGLRGLEPGALMRPVGGASLSTLRSQLHISFPLTLEPWWQGGHS 
I>AFCAMAYPKVTSTHNVVTLGIDVEQVSVVINFDLPVDKDGNPDNET 
YliHRIGRTGRFGKRGLAVNMVDSKHSMNILNRIQEHFNKKIERIiDTD 
DLDEIEKIAN 

2845 

A 

222 

484 

GNPLSSYSFCSMCLTELDPILTEVTLMNARSELYLRFLKKRISSDFE 
VGDSMASEEVKQEHQKSLDKLLNNCLLSCTMQELIGLYVT 

2846 

A 

3 

218 

SLFQLKTTKTQSRVMGAIYDSSWFQEDRQAAERFTDRSTHLRLRNAE 
AAA* TKPAQGPGS I P I L PATPS S L 

2847 

A 

322 

452 

LCCS I * I YLN/NLL I C * QS FSVLK* DFRVD * KKKNPRMSHRVLNI 

2848 

A 

3578 

5288 

TESREHFSPWYSVTAARRPQVASKAEENLLMVLGTDMSDRRAAVIFA 
DTLTLLFEGIARIVETHQPIVETYYGPGRLYTLIKYLQVECDRQVEK 
WDKFIKQRDYHQQFRHVQNNLMRNSTTEKIEPRELDPILTEVTLMN 
ARSELYLRFLKKRISSDFEVG\DSlWS*EVKQEHQKCLDKLLN*LAF 
1»S C YHG REL I GL YVTMEE YFMRETVNKAVALDTYE KGQLTS SMVDDV 
FYIVKKCIGRAIjSSSSIDCLCAMINLATTEIjESDFRDVLCNKIjRMGF 
PATTFQDIQRGVTSAVNIMHSSLQQGKFDTKGIESTDEAKMSFIiVTL 
NNVEVCSENISTLKKTLESDCTKLFSQGIGGE\QAQAKFDSCLSDLA 
AVSNKFRDLLQEGLTELNSTAIKPQVQAWINSFFSVFHNIEEEEFND 
YEANDPWVQQFILNLEQQMAEFKASLS PV I YDS LTGLMTSLVA\ VEL 
EKWLiKST\ FNRLGGLQFDKELRSL\ IAYLTTVTTW \ TIRDKFARLS 
QM\ATIL\NLERVTEILDYWGPNSGP\LTWRIjTPAEVRQVIjALRIDF 
RSEDIKKAALVS 

2849 

A 

233 

801 

WTLTGSNLYFYCSLFIFSVIiPDQDVSLV*ICYLSLPLDIVCFCNVK 
IQFFF\liSLESYY*FLTIFPHCFYFLLKTPFKHREIiFILVFGLLKIF 
Q/liC*FFYLIIiLLRFYFINFCFLLDLFLFWIFIWYFVIFACSWFIGV 
SLKFLL VPL/ CS F FLRCRHS PLF * GAS * TDI*Y I LG C I P F P * VCLLWE 
FCGGL 

2850 

A 

170 

495 

LCSLTGQSGAGNNWAKGHYTEGAELVDSVLDWRKEAESCDCLQGFQ 
LTHS LGGGTGSGMGTLL I S KI REE YPDR I MNTFS WPS PKVSDTWE 
P YNATL S VHQLVKN 

2851 


/ SJ 

±*± / *k 

RESG 1 wgPCjKAGTQTGAKFWEVI SDEHG IDPTGTYHGDSDLQLERIN 
VYYNEATGGKYVPRAVLVDLEPGTMDSVRSGPFGQIFRPDNFVFGQS 
GAGNNWAKGHYTEGAELVDFVL\DV* RKEA\ES *DCL\ QGFQLTHSL 
GGGTGSGMGTLLISKIREEYPDRIMNTFSVVPSPKVSDTVVEPYNAI 
LSGH\QLVENTDETYCIDNEALYDICFKTLKL\TTPPF\GDLNHLVS 
ATMSGVTTCL\RFPGQLNADLRKVAVNM\VPFP\RLHFFM\GGFAPT 
DQAGASQASTRALTVPRSSPKQ\MFDA\KNMM\AACDPRAWAAYLDG 
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LPPCFRG PHVPLKBVDE \ QML\ NVQKQKQP A I FVE \ W I PQQCE KRAV 

cdi phlrglknvrpps fgkqarp i pgafskphlpkqfhgpcfrrkal 
psfitggegmdem\efhprpesnknd\lvseyq\qy\qdataee\eg 
efegrslgrkva 

2852 

A 

284 

359 

QQEANPSAPLQHVSTSNC/CVLTCCKGAEGLASCCYCHCLCNECCQG 
ACGPDTHLAHCCRCQHSSKPPGGPRI/ASTQTCYY*CPCRLPGGSRT 
*CLASCPYHWCPWTSLPGSPVPNKAPP 

2853 

A 

295 

478 

YYKRI PEAKRLTNTVFNFS I FSHSTHRTFGKQQWQQLYDTLNAWKQN 
LNKVKNSLLSLSDT 

2854 

A 

1 

1167 

MVELLGSYTEDNASQARVDAHSCASSVARPSSLFRSAWSCEWSVRCA 
RACTMS VPAF I D I S EEDQQQNEKYRVPQFDQS T I KN I ES AKGLD VWD 
SWPLQNADGTVAEYSGYHVVFALAGSPKDADDTS I YMFYQKVGDNS I 
DSWKNAGRVFKDSDKFDANDPIMKDQTQEWSGSATFTSDGKIRIiFYT 
DYSGKHYGKQSLTTAQVNVS KSDDTLKINGVEDHKT I FDGDGKTYQN 
VQQFIDEGNYTSGDNHTMRDPHYVEDKGHKYLPNRGHARSSRRRQEI 
SKIREELKEIEIQKPLQKINESRSWFFEKINRVDRLLARLMKKREKN 
QIVAI KNDKGDMTTD PTE I QTTVRE YYKHL YAYKLENLEEMGKFLDT 
YTLPRLNQEEVDL 

2855 

A 

3 

1328 

SGILGTTTPSFPVGWLRVECPLCPGPAPMSVPAFIDISEEDQAAEL 
RAYLKSKGA\EISEENSEGGLHVDLAQIIEACDVCLKEDDKDVESVM 
NS V\ VS LLL I L \ E PDKQ \ EA\ L I ES LC \ EKLG PNFREGE PPRLLRLQ 
FVKATFSTGMD* E ILPVKIHSVICSlA I KV\ AASCG\ AIQYHPQLEP 
GI KFRKWDFLNWN\ LTTGK3CSTPLLKTTFMKALVDCKKSDAASKGM\ 
VELLGSYPGGQCFPRARVDAHR\CIV\RALKDPN\AFPFD\HLI.TLK 
PRQVFWKGEL I / H I VP F * PNFVS A* MGHHM SRF Y PD * LRTFH * I PLG 
PVTMDPGI WPKMRLLFLLWGMGQ * ENR\ E I S FWTTMPARNFR IGA\D 
D\ VEAFVI DAGKELKMVYCQNWSRPQRKVVVSHSTHR\ tfgk\ qqwq 
QTV/YDTLNAWKQNLNK\VKNSLLSPSGYLKFLCL 

2856 

A 

377 

2393 

FS LECLGHENR * D * IT I YKI TS LGTGKVFNKI QH P FTI KNKK* KTLN 
KLGIEGNFLNLIKGIYEEPMVSIILNGEGLDYFPLRSETRQGCIiLWP 
FLFN/ 1 VLiDSQERAIRQENKI KVI * VGKEEV* LNR\ MGDDMWLCI V 
IQGK\I*KIQRHDLLINKFSKVTGYRI/NICRK1iVIFHI*YQIjNKSQ 
NDVKKTI * FSTTWKLGGLC/RVIPTLWESEAERLLEARRPAWETVRS 
HLYKNKQTKNNKQTKN/TLDVMTLICRLSYSEAEV*GWLEPQSSKLQ 
*AITGSLQSSLDRQS*TLTQKKTKKXRKKLTASERVKYQYN*GSRNI 
D/PHLSSN*LLENSA*GQFQWGGKIVFFIK*CLETWISTWQNNGNWD 
SYLILHIKINSELT*DU^^ANIVSIFKEN/MGWLSDFELGNDFLN 
K* / PLKTQATKEENR * I KI KT I CVS KDN I KKAKI QPTE * DNFF *NHV 
LVKGGICI*NM*RTVTIIIKRHIIQFLKIQQGVWN*QFSKGGYTKGP 
LRHMEKVSQPSLAIREMQMRSTMRYHLT\ PLE* IYNQSDNIKCW*GC 
RE IGTLMHCW W\ YPRD I KTNVYTKTCAQMFIjPAIjF I * AQK* NGP * FL 
S \ LTSALTKTWH IHT/ME YYS SQ* KKEMKVLIWPPTTMEKPLEKPYP 
SEECPRHERPTYYM I S I YMKCPRTGE YI 

2857 

A 

3 

446 

GEFADSF/ SSMGSPVNAQDFCTDLiAVSSANFIPTVTAISTSPDLQWL 
VQPALVSSVAPSQTRAPHPFGVPAPSSGAYSRAGWKTMTGGRAQSI 
GRRGKVEQETDQLEDEKS ALQTE I ANLLKEKEKLEF I LAAHRPACKN 
PDDLGFPE 

2858 

A 

295 

1622 

DEMSVAGSFLFTAACGTGNSPHLCPEHARSRRLSPSVLF*DFCTDLA 
VS SANF I PTVTAI STS PDLQWLVQPALVS S VAPS QTRAPHP FGVPAP 
JdAvjAi bKA^jV Vis.lW\ lVjv^RA\QSIWQEGARWEQVRNr *RXLiPGNVGA 
GWEAAPEMQEPSTEDEAT*WGLLAHP*LGAIiAPSPFHPNSDSEFSP* 
EVLS * FLP * VS YRMLSDWALLCSIiAEDli I LNASHT* SATAGQKWFHS 
GVPGSREAAGASSTGVGEWR*WQTliLIiNVGLFL*LF*LSPEEEEKRR 
I RRERNKMAAAKCRNRRRELTDTLQAGRYSVGCS FLKLKGKVGRLEH 
KGP*VRLCLYAFPLSLCIQ\ETDQLEDEKSAIiQTEIANIiLKEKEKLE 
FIL\AAHRPCIiARFP\DGPGAFPEEMSVASLDLTGGLPEVA/TPESE 
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EAF/TLPLLNDPEPKPSVEPVKASA 

2859 

A 

3 

1467 

phlqratekprlsrrprrseravtvllpssasqrppvsaprplarlc 
ltatmmfsgfnadyeasssrcssaspard\slsyyhsp\ad\sffsm 
gs pvnaqd fctdla avs s an f i ptvtai st\ s pdl\ qw\ lvqpalvs 
/ s lw ap s qtrap stfsespp ptag a \ y s ragwktm tggraq s i g\ r 
rgkveqls\peeeekrrirr\ern\kmaaakc\rnr\rr\eltdtlq 
a\etdqled\rksalqteianllkekeklef/lslaahr\pa\ckip 
lidlglprkje\msvgs\ldltgglpegs\tpe\seeaftl\pllndp 
eaqgpqwepvqggisrhgs*kdrgpfddflfpsihpgpsgflrtars 
v\ pdmgp i lgpfygrqirgafshsgslgmgahghrgweplctpwtc 
tpscnclhvfprfhlprg*llpqlcscppqgqqqq*afl*laqlitha 
agpvrgqgrggsrhpqvplpelvhyreekhi fprgfl 

2860 

B 

1 

196 

MAEGKWRKKPLK^ESLGKDFLTGVLDNLVEQNVI^ 
AKTEDKVRVMADSMQEKQX* 

2861 

A 

1 

988 

MGSIiEGNHRKKPLKVLESLGKDFLTGVLDNLVEQNVI^KEEEKKKY 
YPAKTEDKVRVMADSMQEKQRMAGQMLLQTFFNIDQISPNKKAHPNM 
EAGPPESGESTDALKLCPHEEFLRLCKERAEEIYPIKERKNRTRLAL 
1 1 CNTEFDHLP PRNG AD FD 1 TGMKELLEGLD Y S VDVEENIjT ARDME S 
ALRAFATRPEHKSSDSTFLVLMSHGILEGICGTVHDEKKPDVLLYBT 
IFQIFNNRNCLSLKDKPKVIIVQACRGANRGELVATRDSPASIiEVASS 
QSSENLEEDAVYKTHVEKDFIAFCSSTP\KYSWCCHLEEVFRKHCSQ 

2862 

A 

12 

914 

GS FQRCKKGQRLFPMAEGNHRKKPLKVLE SLGKDFLTGVLDNLVEQN 
VLl^KEEEKKXYY\DAKTEDKVRVMADSMQEKQRMAGQMIiLQTFFNI 
DQ I S PNKKR * * IG VTEAE I T I YVLQ YP AAHPNM \ EA\ GP PE SGR I LP 
DCLK\LCPH\EEFLRLCKERAEEIYPIKER>mRTRLALIICNTEFDH 
LPPRNG\ ADFD ITG\MKELLEG \ LDYSVDVEENLTARDMES ALRAFA 
TRPEHKSSDSTFLVLMSHGILEGICGTVHDEKKPDVLLYDTIFQIFN 
NRNCLSLKDKPKVI I VQACRGGEC 

2863 

A 

12 

1353 

gsfqrckkgqrlfpmaegnhrkkplk^eslgkdfiitgvldniiveqn 
vi^nwkjeeekkXkyydaktXedkvrvmXadsmqekqrmagqmllqtff 
\nidqispn\kkahpimeagppes\gestdal»ti>cpheeflrtic*rk 
elkeiypikgrrkqphtaglffi\citefdhlppr\ng\adfditgm 
kelleg/lwdysvgvgrrfltargm\dsal\rafatrpehkssdstf 

L \ VLMSHG \ I LEG I CGTVHDEKKPD V\ LL YDT I FQ I F \NNRNLPQV * 
RDKPQGSSIVQ\ACRG\ANPSGN*WV\RDSP\ASLGKWPSSQSSGEP 
WRKDAV* QDPHV/ ERRDF I GFPGS FKRPHNRAPWERPAQWGS I FITQ 
\ LI TCLPRNI P WCC / HTLEEVFRKGTAIHLET PR\ AKAQM\ PTI ERT 
VR* QGYF YLFPGEFENWEGHKQPS PSIiNPTFKEAPFFVTACI F 

2864 

A 

66 

392 

AQCLLLQSPGFEPPWVLAAHPRALQSLQRKHGRSDCGGTGEEASGSS 
LESDAGFGPPGPGGCAVPQVFQDCPVCLGNHHSQVLGGYCTEGPGLA 
AGSQACSPLGRVAP 

2865 

A 

131 

551 

KIRGVSCPSAPKSGEQTGQHHNINTGFHPLPLGAVTSPVPQGPGDRL 
CPPPSSPEPAQPAQ/C*PCSDSGRSPGPGPGKAWTVAPC\PEPSVFC 
LQSPGFEPPWVLAAHPPSSERRASSSGLDDDLPAQETGNGSGATRTC 
S 

2866 

A 

943 

1111 

LMVLSDFIFKLKI/WPGAVAHACNLSTLGGRGRWIT*GQQFETSPAS 
IVKPPSLLKI 

2867 

A 

112 

1078 

ESSRQALLAKPLSACAEQPARAEVGAATALPVRWASGEMAPSGSLiAG 
PLAALVLVLWGAPWTHGRR\TNVRVITDEN\WRELLEGDWMIEFYAP 
WCPACQNLQPE WES F \ AE \ WGEDLE\ VNI AKVDVTK\QPGL\ SGRF I 
I\TALPTIYH\CKDGEF*AVIQGPRTKKDFINFISDKEWKSIEPVSS 
WFGPGSVLMSSMSALFQLSMWIRTCHNYFIEDLGLPVWGSYTVFALA 
TLFSGLLLGLCM I FVADCLCPS KRRRPQP YP YPSKKLLS ESAQPLKK 
VEEEQEADEEDVSEEEAESKEGTNKDFPQNAIRQRSLGPSLATDKS 

2868 

A 

1 

448 

HRGGLPGLKLPDFQDSIFEYFNTAFLLHDLTFRTSSASEQETQAPKP 
EASPSMSVAASEQQEPKAHQFSIKSFSSPTQCSHCTSLMVGLIRQGY 
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AG EVCS FACHVS CKDGAPQ VC P I P P EQS KRPLG VD VQRG I GTAYKGH 
VKVPKPTG 

2869 

A 

1 

536 

IISPHINKDEDVQRDLEHSLQMEAYERRIRRLEQEKLELSRKLQEST 
QTVQSLHGSSRALSNSNRDKEIKKLNEEIERLKNKIADSNRLERQLE 
DTVALRQEREDSTQRIiRGLEKQHRVVRQEKEELHKQIiVEASERLKSQ 
AKSALSGPAPPSPATPLPSWAFCWMGTWGVLVFTFLM 

2870 

A 

1 

5193 

MAAAGTGTCWLLAIGSQKVWKPLSLPVAVVKMKNTERIYAMKILNKW 
EMLKRAETACFREERDVLVNGDCQWITALHYAFQDENHLYLVMDYYV 
GGDLLTLLSKFEDKLPEDMARFYIGEMVLAIDSIHQLHYVHSWTPKA 
FRVRKS VADQDPVGAPHRD I KPDNVLLDVNGHI RLADFGSCLKMNDD 
GTVQSSVAVGTPDYISPEILQAMEDGMGKYGPECDWWSLGVCMYEML 
YGETPF YAES LVETYGKIMNHEERFQFPSHVTDVSEEAKDLI QRL I C 
SRERRLGQNGIEDFKKHAFFEGLNWENIRNLEAPYIPDVSSPSDTSN 
FDVDDD VLRNTKS LKI TNLVTACATV 1 1 E I RVCLLCFSDRGS LKSIM 
QSNTLTKDEDVQRDLEHSLQMEAYERRIRRLEQEKLELSRKLQESTQ 
TVQS LHGS SRALSNSNRDKE I KKLNEE IERLKNKI ADSNRLERQLED 
TVALRQEREDSTQRLRGLEKQHRVVRQEKEELHKQLVEASERLKSQA 
KELKI^AHQQRKIiAIjQEFSEIiNERMAELRAQKQKVSRQIJUDKEEEMEV 
ATQKVDAMRQEMRRAEKLRKELEAQLDDAVAEASKERKLREHSENFC 
KQMESELEALKVKQGGRGAGATLEHQQEISKIKSELEKKVLFYEEEL 
VRREASHVLEVKNVKKE VHDS E SHQLALQKE I LMLKD KLEKS KRERH 
NEMEEAVGTIKDKYERERAMLFDENKKLTAENEKLCSFVDKLTAQNR 
QLEDELQDLAAKKESVAHWEAQIAEIIQWSDEKDARGYLQALASKM 
TEELEALRS S SLGSRTLD PLWKVRRS QKLDMS ARLELQS ALEAE IRA 
KQLVQEELRKVKDANLTLERCS P WQPDFLC PLHEVAG I ARLPVRWL 
VLLGAATILPVMGKILMGLTAERPGPCVPEEQRTRLGTRIX3SKLKDS 
EAIO^ELLEEMEILKKKMEEKFRADTGLKLPDFQDSIFEYFNTAPIiA 
HDLTFRTSSASEQETQAPKPEASPSMSVAASEQQEDMARPPQRPSAV 
PLPTTQALALAGPKPKAHQFSIKSFSSPTQCSHCTSLMVGIiIRQGYA 
CEVCSFACHVSCKDGAPQVCPIPPEQSKRPLGVDVQRGIGTAYKGHV 
KVPKPTGVKKGWQRAYAWCDCKLFLYDLPEGKSTQPGVIASQVLDL 
RSAFGQHCPCSV^PQLSIAKRTVTASLLGAPSKTSSLLILTENENE 
KRKWVG I LEGLQS I LHKNR LRNQ WHVPLEAYDS SLPLI KAILTAAI 
. VGCVLQVSEATALPAALQCPSENQMKVTGPFGFVDSFSMHVKKQTRP 
DADRIAVGLEEGLYVIEVTRDVIVRAADCKKVHQIELAPREKIVILL 
CGRNHHVHLYPWSSLDGAEGSFDIKLPETKGCQLMATATLKRNSGTC 
LFVAVKRL I LCYEI QRTKPFHRKFNE I VAPGS VQCLAVLRDRLCVGY 
PSGFCLLSIQGDGQPLNLVNPNDPSLAFLSQQSFDALCAVELESEEY 
LLCFSHMGLYVDPQGRRARAQELMWPAAPVACSCSPTHVTVYSEYGV 
DVFDVRTMEWVQTI GLRR I R PLNS EGTLNLLNCEPPRL I YFKS KFSG 
AVLNVPDTSDNSKKQMLRTRSKRRFVFKVPEEERLQQRREMLRDPEL 
RSKMISNPTNFNHVAHMGPGDGMQVLMDLPLAAMLAET^ 
>\XjL.oiJij>^Xiijonl\^ V oAiilAJ^r TSTYPVTRPCAQASGRPGQ 

2871 

A 

1 

356 

DPLWKVRR S QKLDM S ARLELQS ALEAE I PJ\KQLVQEELRKVKDANLT 

LERCSPWQPDFLCPLHEVAGIARLPVRVVLVLLGAATILPVMGKILM. 

GLTAERPGPCVPEEQRTRLGTRLG 

2872 

A 

1 

401 

RRCQTACFREERDVLVNGDCQWITALHYAFQDENHLYLVMDYYVGGD 
LLTLLSKFEDKLPEDMARFYIGE^TVLAIDSIHQLHYVHRDIKPDNVL 
LD VNGH I RL AD FG S CL KMNDDGTVG I FVGDFPFG FGFG I 

2873 

A 

151 

5459 

AGARRRGRGGEEAPTiTiPOT.A 7\7A"RPPP aDDnr»T t\ "c tj Tv^ro/"* id ■dttyt! o/~* no 
GTMSAKVRLKKLEQLLL\DGPWRNESALSVETLLDVLVCLYTECSHS 
ALRRDKYVAEFLEWAKP FTQLVKEMQLHREDFE 1 1 K\ VI GRGAFGEV 
AWKMKNTERI YAMKI LNKWEMLKRAETACF \ RENRDVLVNGDCQ\ W 
ITALHYAFQDENHLYLVMDYYVGGDLLTLLSKFEDKLPEDM7UIFYIG 
EMVLAIDP IHQLHYVHRDI KPDNVLLDVNGHI RLADFGS CLK\M\ND 
DGTVGAVPPVAVGTPDY I S P \ E ILQAMEDGMGKYGPECDWWSLGVCM 
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YEMLYGETPFYAESLVETYGKIMNHEERFQFPSHVTDVSEEAKDLIQ 
RLICSRERRLGQNGIEDFKKHAFFEGLNWENIRNLEAPYIPDVSSPS 
DTSNFDVODDVLRNTEILPPGSHTGFSGLHDPFIGFTFTTESCFSDR 
GSLKSIMQSNTLTKDEDVQRDLEHSLQMEAYEftRIRRLEQEKLELSR 
KLQESTQTVQSLHGSSRAIiSNSNRDKEIKKLNEEIERLKNKIADSNR 
LERQLEDTVALRQEREDSTQRLRGLEKQHRVVRQEKEELHKQLVEAS 
ERLKSQAKELKDAHQQRKLALQEFSELNERMAELRAQKQKVSRQLRD 
KEEEMEVATQKTOAMRQEMRRAEKLRKELEAQLDDAVAEASKERKLR 
EHS ENFCKQMES ELE ALKVKQGGRGAGATLEHQQE ISKIKSELE KKV 
L F YEEELVRRE AS HVLE VKNVKKEVHD S E SHQLALQKE I LMLKDKLE 
KS KRERHNEMEE AVGT I KDKYERERAMLFDENKKLTAENEKLCS FVD 
KLTAQNRQLEDELQDLAAKKESVAHWEAQIAEIIQWVSDEKDARGYL 
QALASKMTEELEALRSSSLGSRTLDPLWKVRRSQKLDMSARLEliQSA 
LEAE I RAKQLVQEE LRKVKDANLTLES KLKDS EAKNRELLEEME I LK 
KKMEEKFRADTGLKLPDFQDSIFEYFNTAPLAHDLTFRTSSASEQET 
QAPKPEASPSMSVAASEQQEDMARPPQRPSAVPLPTTQALALAGPKP 
KAHQFS I KSFSS PTQCSHCTSLMVGLIRQGYACEVCSFACHVSCKDG 
APQVC PIP PEQS KRPLGVDVQRG I GTAYKGHVKVPKPTG VKKGWQRA 
YAWCDCKLFLYDLPEGKSTQPGVIASQVLDLRDDEFSVSSVLASDV 
I HATRRD I PC I FRVTAS LLGAPS KTS S LL I LTENENEKRKWVG I LEG 
LQS ILHKNRLRNQWHVPLEAYDS SLPLIKAILTAAI VDADRIAVGI* 
EEGLYVIEVTRDVIVRAADCKKVHQIELAPREKIVILLCGRNHHVHL 
YPWS S LDGAEGS FD I KLPETKGCQLMATATLKRNS GTCL FVAVKRL I 
LCYEIQRTKPFHRKFNEIVAPGSVQCLAVLRDRLCVGYPSGFCLLSI 
QGDGQPLNLWP\NDP\ALAFLSHSSF\DALCAVELESEEYLLCFSH 
MGLYVDPQGRRARAQELMWPAAPVACSCS PTHVTVYSEYGVDVFDVR 
TMEWVQTIGLRRIRPLNSEGTLNLL\NCEPPRL/VF*FKSKFFGKRV 
FQTWPD \ TSDNS KKQMLRTRS KRRFVFKVPEEERLQQRREMLRDP EL 
RSKMISNPTNFNHVAHMGPGDGMQVLMDLPLSAVPPSQEERPGPAPT 
NLARQPPSRNKPYISWPSSGGSEPSV\TVPLRSMSDPDQDFDKEPDS 
DSTKHSTPSNS SNPSGPPS PNS PHRSQLPLEGLEQPACDT 

2874 

A 

1 

764 

MEKI P I LRS LRAREQQAGKDVTLQGEHQHLPEPGCQQTVPLS VGRRP 
PDTPGPETNSMEAAPGSPPGEGCPACSRCLRWEPPPGRPCE*PEESP 
AGT/PAPCPCGSPG\GPAAQSLPPLPG/PLPQPSRPSPA\SGPAPGH 
RHPEGGAGQAAEGQ\GPRGQPRS SLQTSP\ PRPLAAPVPMA\ PRGLA 
WQDVSEPPP\GCSGLNSSQKSPLSERPGPPVSGEA/GP/DGSSHSAQ 
RLQPPNRAGSGLESVSVSLEGQSRGARD 

2875 

A 

1 

1641 

MKVCS S I AEVSDTTNPPSRTNSGHMTS VLKQALKDPCRTRHYRLVLW 
TLERTEVLDLF IERFCMTGGNS P I C YLKAYHTNL YLS ADAEKVREAI 
DEGIAAATIFSPSRDVWSQSQLRVAFDGDAVLFSDESERIVKATLD 
RFFEHEKAHENKPLAQEQGSTS FPDKVEAFRS PSVPLP I TGHSANVY 
QTPTCARRRTTRPLVNWELIPRDNPPLPPRQAFRDLGQSPAFSDAVA 
TTSAVPFGGDCHLSEVRGAETRQPAFAMKRPKEPSGSDGESDGPIDV 
GQEGQLRSSKLEKAEVLQMTVDHLKMLHATGGTGFFDARALAVDFRS 
IGFRECLTEVIRTPTIWLLPGRSKAVESQSQQFTGAHSTLGEPTAKK 
\ A* SNPQVYMD I KIGNKPAGC I QTLLRSD WPMTTENFLYMCTH/ S K 
GLGFKGSSFHRIIPQFICQGSDFTNHGGTRGKSIYKKKFDDENFIFK 
HTGPGLRRHGTKCPMAHLALMEASKGQDQHPLLRRKERAQQVHDVVK 
LTRGRGRGISLKRIARSLCKDKETGLCTLTWH 

2876 

A 

1 

949 

LLLLGHASTSTGGGGKRASKMATTKRVLYVGGLAEEVDDKVLHAAF I 
PFGDITDIQIPLDYETEKHRGFAFVEFELAEDAAAAIDNMNESELFG 
RTIRVNLAKPMRIKEGSSRPVWSDDDWLKKFSGKTLEENKEEEGSEP 
PKAETQEGE P I AKKARSNPQVYMD I K I GNKPAGR I QMLLRSDWPMT 
AENFRCLCTHEKGFGFKGS S FHR 1 1 PQFMCQGGDFTNHNGTGGKS I Y 
GKJCFDDENFILKHTGPGLLSMANSGPNTNGSQFFLTCDKTDWLDGKH 
WFGEVTEGLDVLRQIEKQEESAITSQPRSWKLT 
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2877 

A 

185 

339 

LLLPPPFPGSSPWLNLHLHAALDHIjPRPAAITCQKQGESAITSQPRS 
WKLT 

2878. 

A 

673 

1085 

GE PTAKK\ A * SN PQ VYMD I KI GNKPAGC I QTLLRSDWPMTTENFL Y 
MCTH/ SKGLGFKGSSFHRI I PQF ICQGSDFTN\ HGGHQGS AQ\HGTK 
CPMAQFGPHGGLKRAGPAPSST\GKERAQQKQGESAITSQPRSWKLT 

2879 

A 

141 

1959 

PRP7VNLLKKGGQITMSATVVDAVNAAPLSGSKEMSLEEPKKMTREDW 
RKKKELEE\ QQKLGNAPAEVDEEGKDINPHI PQYI SS VPWYIDPSKR 
PTLKHQRPQPEKQKQFSSSGEWYKRGVKENSIIT\KYRKGACENCGA 
MTHKKKDCFERPRRVGAKFTGTNIAPDEHVQPQLMFdYDGKRDRWNG 
YNPEEHMK I VEE YAKVDLAK\ RTLKAQKLQEELASGKLV EQANS PKH 
QWGEEEPNSQTEKDHNSEDEDEDKYADDIDMPGQNFDSKRRITVRNLi 
R I RED I AKYLRNIoD P I S A YYDPKTRAMRENP YANAGKNPDE VS YAGD 
NFVRYTGDTI SMGQT\ QV\ FAWEA\ YDKGSE\ VHXi\QADP\TKL>EIjIj 
YKS\ FKVKK\ EDFKEQQKES ILEKYGGQEHLDAPPAELLLA\ OTEDY 
VEYSRHGTVIKGQERAVACSKYEEDVKIHNHTHIWGSYWKEGRRGN\ 
KCCHS FS KYS YCTGEAGKE I VNSEEC I INE I TGEES VKKPQTLMELH 
QEKLKEEKKKKKiaCKKXHRKSSSDSDDEEKKHEKIjKKAIiNA*EA\RL 
LHVKET\MQI \ DERKRP \ YNSMY * TSRP\ I EEEMEAYRMK\RQRPDD 
PMASFLGQ 

2880 

A 

711 

1141 

GVGVCFFPVPSSLRVLGRWAAFCALPGRSLRKSRSLWGWTGHTQS*E 
GELPWKAG/ WQRCFWRGGGRQTLPIjTCS PRTRGCRAAQEGAGCKARG 
AWWVRS PGRAGR PLRRGGSR VTLNS QMGSDATGETEW I KEMKPKKKK 
KMS 

2881 

A 

1 

649 

MAGALVRKAADYVRSKDFRDYLMRVTQQPYKVFNSGPWAFSSSSYMS 
RPS AHI S SL I VS RVGSS STS FQGGLGAGMGVEPLDPHENIi YVDFYRG 
FIHNCQNLEANKMSFGRSQSEKAAYYLVPTTQRSGKGKIMELVERSV 
VARVEHFWGP VANWGLP I AA I NDMKKS PE 1 1 SGRMTF\ VQPRNWLLF 
ACHATNE VAQL I QGGRL I KHEMTKTAS A 

2882 

A 

1 

389 

FRAAGGLAGVSLALGSGSRGRDHSGSGVGTAMAGALVRKAADYVRSK 
DFRDYLMSTVRKLFPSISGFCRLSALTELGVTGLASFAYESLSSRHG 
WQWKGAYCTLACPGEPRHELCDSHAPGAAPYSCF 

2883 

A 

27 

554 

lavgcrgvglgvigsgkrqqrqgpixsvsvsaqpcagalvrkaadyvq 
sk\dfrdyl\msth\fwgpvan\wglpiaaindmk\kspgdysvgrm 
tf\aiic\cys\ltfmeilpykvqi*plnwi>/lcfachgtnevrpsfi 
qgegrlipntrmdlkrasginqwgkgknkvfegdslcpgcc 

2884 

A 

1 

162 < 

QCAEKIASNPKIWAMAKESVNAAFKITLPEG\DDRKEGMTPFVEKR 
KANFKDQ 

2885 

A 

1 | 

196 

FPGCYS S KFLKHWDHIiTQFKKP VI AAVNG YAFGGGCELAMMCD 1 1 YA 
GEKAQFAQPERTSAACQF 

2886 

A 

148 

427 

CLLGFTHHS A^HDQPLPAGAGGTQRLTRAVGKS LAMEMVLTGDR ISA 
QDAKQAGL VSM I CAVETL * CLLGFTHHS ASHDQPLPAG AGGTQRLTR 
AVGKSLAMEMVLTGDRI SAQDAKQAGLVSMI CAVETLVGGAI QCAEK 
IASNSKIWAMAKESVNA 

2887 

A 

1 

1484 

EATLTRPRPAGGPAAGVfYCPSAKTSGQIiGAGFTFSPLERHCPLSPSH 
LAVHPAGAGPVQQANREAPAS P I FGLACCSGPVPDLNAAGTPR I FTS 
PTLRP\ LARGANFE YI I AEKRGKNNTVGL IQLNRPKALNALCDGL ID 
ELNQALKTFEEDPAVGAIVLTGGDKAFAAGADIKEMQNIjSFQDCYSS 
KFLKHWDHLTQVK\ KPVI AAVNGYAFGGGCELAMMCDI I YAGEKAQF 
AQPEILIGTIPGAGGTQ\RLTRAVGKSLAMEMVIjTGDRISAQ\DAKQ 
AGLV\SKI\CPF*DTGWKEAIQF\AEKIASNSKIWAMAKESVNAGR 

spnlseciylegvavpracrwleagyseesii»kerrllspevfgkw 
lpwermlapalepqrLppsahldllclasckerawneqsqrrswcrp 
rpalalvlsflvfilgyfslfaaplpphslssraqfaispfiqflsl 
lg avg yrs vtfgghsravgs ars plgqcl 

2888 

A 

1 

946 

GRGVQRAMAALRVLLSCARGPLRPPVRCPAWRPFASGANFEYIIAEK 
RGKNNTVGL IQLNR\ PLPVNWLCDCIj\ ID * LNQGLKI FEEDPAVGAI 
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\VLTGGDKAFAAG\ADIQGKLQNLEFSRDC\YSSKFFGRHWGPPSPR 
VKKPVIAAV\NGYAFGGG\CELCH*CVDIIL/YAGEKAQF\AQPEIL 
IGNLPRVRGGTQED3PVPVGKSLAME\ MV\ I/TGDPDPQPQDAQASRV 
LSSKICP\VBTLVEEAI\QCAEKIA\SNSKIWAMAKESVNAAFEMT 
IiTEGSKL\EKKLFYFNLWPLDDRERRG*PRFVGKEKGPTFKDQ 

2889 

A 

1 

384 

VRD YNLTEEQKAI KAKYP PVNRKYE YLDHTADV/ QW I VLHRA* I YFF 
RLHAWGDTLEEAFEQCAMAMFGYMTDTGTVEPLQTVEVETQ/GWGEE 
FSLSKHPQGTEVKAITYSAMQVYNEENPEVFVI IDI 

2890 

A 

1 

332 

RDYNLTEEQKAIKAKYPPVNRKYfi*DDLOSLLFHFLDEWLYKFSADE 
FFIPREVKVLSIDQRNFKLRSIGWGEEFSLSKHPQGTEVKAITYSAM 
QVYNEENPEVFVIIDI 

2891 

A 

3 

425 

SFMAQEEEDVRDYNLTEEQKAIKAKYPPVNRKYEYLDHTADVQLHAW 
GDTLEEAFEQCAMAM FG YMTDTGTVE PLQTVE VETQV\ HADE FF I PR 
EVKVLS IDQRNFKLRS IGWGEEFSLSKHPQGTEVKAI TYSAMQVYNE 

2892 

A 

91 

598 

LKNRRR S RP S I RQS IG S TS VS RWLTS LFTYLDHT ADVQ * V * RE F I PL 
KPRQ * ED * M FQSWLHAWGDTLEEAFEQCAMAMFG YMTDTGTVE PLQT 
VEVETQGDDLQSLLFHFLDEWLYKFSADEFFIP\GWGEEFSLSKHPQ 
GTEVKAITYSAMQVYNEENPEVFVIIDI 

2893 

A 

359 

938 

IYFFRLHAWGDTLEEAFEQCAMAMFGYMTDTGTVEPLQTVEVBTQGK 
*LiQV\ LLFHFWDEWLYKFS ADE FF I PGKLCAI VF * KRL * RPGAVAYA 
CNPSTLGSRGRWIT*GQEFETSLTNKEVKVLSIDQRNFKLRFN\GWG 
EEFSLSKHPQGTEVKAITYSAMQVYNEENPGSFCDHWTFKTTQKIKR 
LPTGKK 

2894 

A 

220 

345 

LKKEKSLMEKHFF*QTDSSVSNLEKNHEPSDIVNSLQWAMVS 

2895 

A 

1 

281 

TLKYTGIRNKSSMWIDVKMLSGFTPTMSSIEELENKGQVMKTEVKN 
DHVLFYLENSNIiVFNIQPAPAlWYDYYEKEEYALAFYNIDSSSVSE 

2896 

A 

83 

426 

NPQSSRYIiPWADLSFLLGHPIPTVLQATLKYNVIjLPKKASGFSLSLE 
IVKNYSLTVFDLTVNLKYTGIRNK5SMWIDVKMLSGFTPTMSSIEE 
LENKGQVMKTEVKNDHVLFY 

2897 

A 

1 

3600 

MPRPASARTRCAHPLTCAHCPALPSEMNPDPQNNRIFQRQNVTSFRN 

ITQLSFQLISEPMFGDYWIWKRNSRETVTHQFAVKRYVLPKFEVTV 

NAPQTVT I SDDEFQVDVCAKYNFGQPVQGETQ I RVCREYFS S SNCEK 

NENEICEQFIAQLENGCVSQIVNTKVFQLYRSGLFMTFHVAVIVTES 

GTVMQISEKTSVFITQLLGTVNFENMDTFYRRGISYFGTLKFSDPNN 

VPMWKLLQLELNDEFIGNYTTDENGEAQFSIDTSDIFDPEFNLKAT 

YVRPESCYLPSWLTPQYLDAHFLVSRFYSRTNSFLKIVPEPKQLBCN 

QQKVVTVHYSI^SEAYEDDSNVKFFYl^MMVKGAILLSGQKE 

NGNFS F P I S I S ADLAPAAVLFVYTLHP SGE I VADS VRFQVDKCFKHK 

VNI KFSNEQGL PG SNAS LCLQ AAP VLFCALRAVDRNVLLLKS EQQLS 

AESVYNMVPSIEPYGYFYHGLNLDDGKEDPCIPQRDMFYNGLYYTPV 

SNYGDGDIYNIVRl^GLK^FTNLHYRKPEVCVMERRIiPLPKPIjYLET 

ENYGPMRSVPSR IAS SG I RGENADYVEQAI I QTVRTNFPETWMWDLV 

SVDSSGSANLSFLIPDTITQWEASGFCVNGDVGFGISSTTTLEVSQP 

FFIEIASPFSWQNEQFDLIVNVFSYRNTCVEISVQVEESQNYEANI 

HTLKINGSEVIQAGGRKTNVWTI I PKKLGAKASKQGVLDIiPNDWEG 

SARGFFTVVGDILGIjALQNLVVIjQMPYGSGEQNAALLASDTYVLDYI* 

KSTEQLTEEVQSKAFFLLSNAALDSGVTNGYNHAILAYAFALAGKEK 

QVESLLQTLDQSAPKLNNVIYWERERKPKTEEFPSFIPWAPSAQTEK 

SCYVLLAVISRKIPDLTYASKIVQWLAQRMNSHGGFSSNQAPVDCKR 

PAFFVI PAAFGPLSDS KS QDQNTVTFS SEGS SEI FQVNGHNRLLVQR 

SEVTQAPGE YTVD VEGHGCTF I QATLKYNVLLPKKASGFS LSLE I VK 

NYSSTAFDLTVTLKYTGIRNKSSMVVIDVKMLSGFTPTMSSIEEGFG 

RADSFPFSVEQSNLVFNIQPAPAMVYDYYEKGLIQRCPSRWSKGIAC 

AEDRPGRPMLASLLTSMAMAQEAFRSSCSGGSAWKLGQAVRERLPEP 

SVCHPRGTGWALGTCSLPLPELKPE j 

2898 

A 

1 

197 

GFGRADSFPFSVEQSNLVFNIQPAPAMVYDYYEKGLIQRCPSRWSKG 
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IACAEDRPGRPMLASLLT 

2899 

A 

1 

963 

MGVTRARGBVGLALS SRLAATLKYNVLLPKKASGFSLiS LE I VKNYSL 

TVFDLTVNLKYTGIRNKSSMVVIDVKMLSGFTPTMSSIEEEKLKSFP 

VCVSAKLEPKX)DMEEALGGSPQVSMLLVNIREILCSRKWSRRVYKKS 

VFPDGLAGAVTKTHVMLGVETDKLFSAPCPSVREQLEQKVEREGQRT 

TSTCQTWAKSPKVDVPSYLTCLADLKYGASI1GPSITHSAX1APGSSRS ; 

S PAPATAVQGAGQDDS S S I PFRRELSEATS VS FLDPWRI FCGWSCL 

FTSGAGEGPGGESPYVRQQTVSANPQQPQAEEEEDNWKS 

2900 

A 

49 

247 

DLTRNHSPLQTVHDVRWFMIFFQV* YRRIKFVKRNASPSGI *NI I * T 
INTFVCIYK* IFRYKNMHI 

2901 

A 

1066 

1768 

NPQS SS S EGSR * DFPRLTVTNRIjLVQRSEVTQAPGQYTVD VEGHGCT 
FIQATLK\ YNVLLPKKASGFSLSLEI SKELLFDCF * p/rqvnlqytg 
IRNKSSMWIDVKMLSGFTPNPCHPIEELENKGQSDED*KSKNDPCS 
FSYLENVFG\QPDSFPFFC*AEQPLCSTFQPGPSHGLTDYYEKEEYA 
LAFYQHRTVVPVSE*DKAITRRVGEAISCNKLILLYQTWKKIMNHLT 

s 

2902 

A 

620 

1124 

ETPAPCLQAWVPQACREGPLAFRELLGGVLALSTPGERCSVSVSEFE 
TRARCTQ/PCWGPSPKFLSPPHAKPAITLRPELKEAPWPAPSU3PPG 
GLSTPPSGIPCPPQCCQGHVALCRGLRPSPW*TE*PVCWKMPRCPEK 
LPPGIFGNETYLVPIVNWPTGTSFLSTY 

2903 

A 

2 

399 

WKESQGRCEW*VARGIi*7UDLT*GFHWHLPGAPP\WGLLTGGGQGRGH 
GHRGGWALSGRPP\ RGFHWHLPGAPP \ WGLLTGGGQGRGHGHRGGWA 
LSGIPPRLQSFQVWAESPLPQLPWGCVAGISVHTGVSWLP 

2904 

A 

73 

867 

GRAEQTSDTHFFASTSACSHNLTMSCPNYCSGNSNSGSLRTSRHIPL 
TSIDLCPTSVSCGDVLYLPTSSQDHTWVTDNCQETCGEPTSCQPVHC 
ETGNLETSCGS/SHC/CTTCPGPCQGSSFLPASFFSSSCLPVSCRTT 
EVC/RPAAVRPLRAWQ * FTS P * GDCVAQCL\ RPQFCLS KSCQPQNL 
LTSGCQPSSCIAYRPQSLHWSSSLRPLGPLFSGCQPLTHVFSTVSS 
HLALEHGSPNICFLQSS ILLHRSSSRAVCSSALF 

2905 

A 

1 

478 

VEKLVKGKFQDNFEFlQWSKKFFDAlfYDGKBYNPLIiARQGQDVAPPP 
NPVPQRTSPTGPPPCILRKNPPSARNGGHETDAQILELNQQLVDLKL 
TVDGLE KERDF YFS KLRD I EL I CQEHE S ENS P VI SG 1 1 G I L YATEEG 
FAPPEDDEIEEHQQEDQD 

2906 

A 

800 

1209 

LFSSGAVPQRTSPHQAPKNM\QT\SGRLS\NVAPP\CILRKNPPSAR 
NGG\HET*CPNSLNSNQQLVDLKLTVDGLEKE\RDFYFSKLRDIEIiI 
CQEHE S ENS P VI S G 1 1 G I L YATEE \ G FAPPEDDE I EEHQQEDQDE Y 

2907 

A 

1 

3961 

MSSKHLKSEQIKDQIRVPVISLIPKIHSAQEEQLSSTAVCSCPARVF 

CAVTKVVAVFQMWMLHDLVQMVPRSLQWLPHIFTLiFSNPLILYHSHE 

LSTVPSFAEQRECKlAYMPVSSASAHHLLLQAFDFNIiSVSLSSIiARNM 

LANSASVRILIKGGKVVNDDCTHEADVYIENGIIQQVGRELMIWRGQ 

GKLVI PGG I DTS THFHQTFMNATCVDD F YHGTKVMLLFAKGGFLFG F 

F ILLLGLLQTRLMASLAKSDWFS PENHQLFTADEELCTRVFMQGGPG 

AEVTCLENLQNCYPYPCESHIQVILGLHGGSMCSCNPDHKHPFMSGA 

LRSIMKRQNKEQVPPPAVEPKKPGNKKQPKKVATPPNQNQKQGRFRS 

LEEALKALDVADLQKELDKSQS VFSGNPS IWLKDLAS YLNYKLQAPL 

SEPTLSQHTHDYPYSLVSRELRGIIRGLLAKAAGSLELFFDHCIiFTM 

LQELDKTPGESLHGYRICIQAILQDKPKIATANLGKFLELLRSHQSR 

PAKCLTIMWALGQAGFANLTEGLKVWIiG I MLP VLG I KSLS PFAITYTi 

DRLLLMHPNLTKGFGMIGPKDFFPLLDFAYMPNNSLTPSLQEQLCQL 

YPRIiKVLAFGAKPDSTLHTYFPSFLSRATPSCPPEMKKELLSSLTEC 

LTVDPLSASVWRQLYPKHLSQSRQVQKSLQETIQSLKLTNQELLRKG 

SRNNQDVVTCAMACKGLLQQVQGPRLPWTRIjLIjLLiIiVFAVGFLCHDL 

RSHSSFQASLTGRLLRSSGFLPASQQACAKLYSYSLQGYSWLGETLP 

L WG SHLLTWRPS LQLAWAHTNATVS FL S AHCASHLAW FGDS LTSLS 

QRWQLTMTIFLEK^NYLAKGQSLRTAGQENWPVFKESTSGEAKGNVE ! 

QEQPKWPIGSRAGFARRGRKAPVSAGRPLTVFRKTLLEPSGRSCPPS 
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SPNDTRSATCFPRSPIVGNSMlRCWVLKGAIFSSFLHTGTAATHGTli 
ESDRSWGJ^VNVYSTSVTSENLSRHDMIAWVNDSLHLNYTKIEQLCS 
GAAYCQFMDMIjFPGCVHLRKVKFQAKLEHEYIHNFKVLQAAFKKMGV 
DKVGACALGGPEEQCDPGEEEDPPAQATGAAGVT P I IPVEKLVKGKF 
QDNFEFIQWFKKFFDA\NFDGKDYNPLLARQGQR/DVAPPPNP\G\D 
Q I FNK/ S PKKL I GTAS / VPQRTVP PTG PKKHAR PS GRLSNVAPPC I L 
PEESLPSARNGG\HETDA\QILELN\QQLV\DLKLTSGWGWEKERDF 

LLQANFRDIELICQ\EH\ESENSPVISG\ IIGHPLMPQRKGFAPPED 
DE I EEHQQEDQDEV 

2908 

A 

48 

310 

NPRCGPGGSRFVSSWCSVFPEAGGG*RKEPGMRGAGYKQRGMGS*SK 
GPFAFIMPFPRPRPGFVLPPPPHLPDLNKGPHFSQKKKKK 

2909 

A 

3 

214 

WG I QKNRP AMNYDKLSR S LR YYYEKG I MPKVAGER YS YK I VCE PEAL 
FSLAFPELAVPAQPFGPKGGYSY 

2910 

A 

312 

387 

LLSAAQVPDSDEQFVPDFHSENRE 

2911 

A 

3 

216 

NCLLRPKNKSVRWGPGAGAALLRPSPAALGAGSRACSVPPAAPAQTP 
RPQVSAPAWGPGRAARGSGRMERRMKAGYLDQQVPYTFSSKSPGNGR 
I*AR\TLIGPLGKLMDPGSLPPLDSEDLFQDLSHFQETWLAEAQVPDS 
DEQFVPDFHSENLAFHSPTTRIKKEPQSPRTDPALSCSRKPPLPYHH 
GEQCLYSSAYDPPRQIANQVPCPWCPLDSRPYSPFPGQSNGIS*DPL 
APPSPTLAMGTSGNIAPSSSSPWDICHSFTSQGG/APG/HPSQAPYQ 
HQLSEPCPPYPQQSFKQEYHDPLYGTGGASQAVGTRGGGSMGAQ/RY 
PGAGWIKQEQTDFAYDSDVTGCASMYLHTEGFSGPSPGDGAMGYGY 
EKPLRPFPDDVCVVP\EKFEGDIKQEGVGAFREGPPYQRRGALQLWQ 
FLVALIJDDPTNAHFIAWTGRGMEFKLIEPEEVARLWGIQKNRPAMNY 
DKLS RS LR YYYEKG I MQKVAGERYVYKFVCEPEALFSLAFPDNQRPA 
LKAEFDRPVSEEDTVPLSHL\DESPA*PPYSGLAPRPSVRAPGHARY 
PPRLQPRRPGLRSRPPLGAPAVRREGAAGWSGG 

2912 

A 

119 

444 

GFGVGTTPALTCPVTPS YSPKARPS *T/ ELEPPSRCPQPCQTLGPQA 
EGNQMSSAPSLASPPSKQPPLMANRCCCVPRSRIHLCGIQNRLGFWD 
SREERGGDATWVSVSD 

2913 

A 

1 

576 

APAEFRTSGEAAGSGERREQPSVTERVPTALGVPRMCVAQVLTAHLL 

PPRQALGSLCSPWAAPRVGPLPPAPAMVRISKPKTFQAYLDDCHRRY 

SCAHCRAHLANHDDLISKSFQGSQGRAYLFNSVVNVGCGPAEERVLL 

TGLHAVADIHCENCKTTLGWKYEQAFESSQKYKEGKYIIELNHMIKD 
NGWD 

2914 

A 

2 

552 

EFRLVRISKPKTFQAYI^DCHRRYSCAHCRAHLANHDDLISKSFQGS 
QGRAYLFNSV* VSGLPS CFP * PLL * PVTPG I LLRLPEFPGAGRKL * A 
S FLGC * GQTQCLRGWAL * WQDPPWD YPMSPQ\ VNVGCGPAEBRVLLT 
GLHAVADIHCENCKTTLGWKYVNSSTPDPWLALEPDLFNTFF \ 

j. j 


X 

574 

GADGGGAFRLSGPGP*APPGPSSALMMPSSCPWRTGALGPSPAGSRA 

LGRCTSSVGPGSRWLTRTSSPGCAYQNLENDEDGAQT^SPEPDGGVGT 

RLGPGIPAE/ PS IRSFPWLQHHSAAAPSAPTTPAAAGPNTL* SRRT 

AEWCWPPSCSCCLGPGADPGRRGNWRRPPLPRCLQRQSSSCRAFLLL 
VPGV 

2916 


•ice 

J J J 

CI 1 
911 

LFLETESCFVAQAGARWAVITHCSLKL/LGPCNPPISAL*VARTTGM 
SHHTEP 

2917 

A 

1815 

2152 

INKEMSGQARWLTPVIPALWEVKVVDHLRPGVRD\HQPGQQSETPVS 
TTIQK*A^YGGMCL*S*LLRRLTQQNHLNQGGRGCSELRSRYCTPAW 
ARQS KTS S QEKKKBKEMPW 

2918 

A 

315 

480 

LFHCFHFHGGPRVGSGGPFLTS^VPTVFFT OTQPDncnn VTimaDTu 
GLMLWSK 

2919 

A 

1 

1845 

MWESLELPRDLLNGFAQNADSNMDNKVQAEWLDGDGELVGNWSKGL 
PGCKASVLTLLKVRCQATERRVLQQCTGCGLWLLQVPWTTALGSTAP 
AAGLHGWLLNFDCSIVSIFMGVQGWDLAACNKXEAKVLLHEAQFMGL 
GS SVMLQPFDYDPNEKSKHKFMVQTI FAPPNTSDMEAVRCPSQS PAK 
LHGWSHRARARRSLTPSKHCTFINTATARILGSHNTPVLNINFILNQ 
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TLTRFTPAP AKS QGHTS D I CS S FS E PKS WAFCLPALWS TTGGRDKCR 
GHGTHGGGSQTEAKYICDIiLTPPNASVSSLPVRAPPTPTTDSTSQEC 
PHRHWLLSPQEVLSFLLFPLDFESLYTAMVGSEKETGEGTPSVGKHA 
YERAHADGPPGLGEGINSPLPTCTGVRGLTQDIQDTGQESDTLSTLG 
S EDDE KTAKPQGDTS E VLPAVKNS PRTFQEDGD I PRL PLGAAKNS AQ 

KPQRLGRSLCVWLAKSHRGDLDGPTFYFLRKGGAPAAPFAPGVQPTW 

I SPFQS PCSLELSPAQPDFSHNYALSVHFTVPKQTFDSLKNDMEPSK 

AVPLNASKQDGPMPKPHSVSLNDTETRKLMEECKRLQGEMMKLSEEN 
RHLR 

2920 

A 

407 

802 

VARSLMLIPKTMGKMSPGHIRDLHSSPSHHRPRSPGGKSGFVGRAQG 
PCAMCSPGTWCSVSQLLQPWLKGANVQLGLWLHRVESPSLGSFHVVL 
S LWVHG S QELRFGNLHPD FRRACTVNIiEKPQTLNAS I* 

2921 

A 

914 

1134 

QESLLLQFTKSSSCPSETTSARTLLSILLSAFWAKPFNKSLGSSKLS 
HIFLSSSDPSK/AVPTSAYYPVPKSLPH 

2922 

A 

1 

447 

MWESLELPRDLLNGFVQNADSNIDNKVQAEWSDGDEELVGNWSKAA 

PAVAEMRQCKAQAMVSEGASLKPWQLPCGVEPENAQTSRIEVWEPLP 

RFQKMYGSAWMSRQRFTAGLGLLWRTSARAVWEGNMRSEPPHRVPTG 
APPNGAVR 

2923 

A 

218 

544 

SGFSLSLRGS ISFSL/ CFKVGMCVTYPRCYC * S \ PVP * KPL I KPGWV 

SELPKPMNWASCSRTLASFLI^AARSHPWTPMKMETMEOSKLRSQPC 
KPAAGAVDPRAWHGT 

2924 

A 

3 

260 

VLTSVSLLPGHTAGLISQSLCRLVKNHCWGASGLVWRCSGPWGSLKE 
RLQQGDMPQPFWNGPSEGSSPHFHPGPRTCASHLLQCF 

2925 

A 

1 

534 

MWKRLWNQVTDTGWNSLEGSEEDRKMWESLELPRDLLNGFAQNADSN 
MDNKVQAEVVSDGDEKLVGNWSKGLIiLLEEGMDPEAVGFQALPPDCR 
GPWTNPVGSVDNPRGLVNMKVPFVRPLHEDPNPLKIFMLSAAGNKSR 
WI PG SLCLHVTATPTTAEL PGASRDGKVPAVCRVPAV 

2926 

A 

198 

562 

LKCGKQWS DP I TS PQTES QLLGS SQQQLHQQRHLQGP * K* NRDS PLC 
FSLSSCPIPKTYTNRHILLP/SSSKSLCKFL*PA*ISPQKMGFSFLS 
QSVCKFSKLLCSASLI KLKAFNS IQVTS 

2927 

A 

72 

270 

PRHPCPPLLPLCLEHGPGGAPHRAQPSSTGGPTEPEQDDHSLPLNTA 
LLLTRPQLVFLAATIHQC 

2 92 8 

A 

325 

535 

D S EEDRKMWE S LE L PRDLLNG FDQKPD SNMDNKI QAKWS DGDEELV 
GNWSKGDSCYVLAKRTGGILPLP 

2929 

A 

492 

696 

VPSWLRQVQKMPINIERALSRKSAAQIPTLGPPMKMETMGPSKVEKP 
THASQSAGAVDPRARGPRVL 

2930 

A 

822 

17S9 

SSAEPSPPSSPAPSQQTAAAGAPPLCAVSPMASASGAMAKHEQILVL 
DPPTDLKFKGDG * VFIRPEQYYTV* KWCKRSKRHGPFRP\ FTDWTT 
NLKLRNP\SDRKVCFKVKTTVPS/RRYC^P\NSGI\IDPGSTVTVS 
VMLQPFDYDPN\ERSKHKFMVQTIFAPPNTSDMEAW/WKEAKPDEL 
MDSKLRC\VFEMPNENDKLNDMEPSKAVPLNASKQD\GPMPKPHSVS 
L\OTTETRKLMEECKRLQG\EMMKLSEENRHLRDEGLRIiRKVAHSDK 
PGSTSTASFRDNVT$P\iPSLL\WIAAIFIGFFLGKFIL 

2931 

A 

2 

417 

FVAQQLQGDSLHFMQALYGQHFP1EVRHYLSQWIESQA/WNTYDRCP 
MELVRCIRHILYNEQRLVREANNGSSPAGSLADAMSQKHLQINQTFE 
ELRL VTQDTENELKKIjQQTQE YF 1 1 Q YQE S LR I QAQFGPLAQLS PQ 

2932 

A 

1 

855 

MA I YPQ I LD S VS TPS KG I LLRNLNA I VTLKKFS SDKGR VP FAVPDKV 

LWPQLCEAIJ^KFKAEVQSNRGLTKENLVFLAQKLFNNSSSHLEDYS 

GLSVSWSQFNRENLPGRNYTFWQWFDGVMEVLKKHLKPHWNDGAILG 

x vJNJi^yyAHijJjljlNKPDGTFL»LRFSDSEIGGITIAWKFDSRGAWERGIi ! 

GSSGLLLNRNEGASAVLPESSMPSTILGTGSAHQPMGEATLLVQEGL 

ALAGSSNSHSNLAVSECPQQWEEPPRARQGDSNRGKTTVELQVGYYH 
EST 

2933 

A 

1 

810 

MLLW PLLLLLLLL PTL ALLRQQRS QD ARLS WLAGLQHRVAWG ALVWA 
ATWQRRRLEQSTLHVHQSQQQALRDSVDLDNPQENIKATQLLEGLVQ 
ELQKKAEHQVGEDGFLLKI KLGHYATQLQRI PCDDLPKNKLLPLS EH i 
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HALKLELPVNKQKMAGTGDRAKAVDGYVKPQIKQVVPEFVNASADAG 
GAAPRIiSGWMCIiDSRCESPDSVLDTDGDFDLEDTMDVARRVEELLa 
RPMDSQWIPHAQSVTftDLSIVNFFIFTRRITIiVDV 

2934 

A 

224 

2606 

TMAVWIQAQQLQGEALHQMQALYGOHFPIEVRHYLSQWIESQAWDSV 
DLDNPQENI KATQLLEGIiVQELQKKAEHQVGEDGFLLKI KLGHYATQ 
LQNTYDRCPMEL VRC I RH I L YNEQRL VREANNGS S PAG S LAD AM S Q K 
HLQINQTFEELRLVTQDTENELKKLQQTQEYFIIQYQESLRIQAQFG 
PLAQLSPQERLSRETALQQKQVSLEAWLQREAQTLQQYRVELPEKHQ 
KTLQLLRKQQT 1 1 LDDEL I QWKRRQQLAGNGG P PEGS LDVLQS WCEK 
LAEIIWQNRQQIRRAEHL.CQQLPIPGPVEEMLAEVNATITDIISALV 
TSTFIIEKQPPQVLKTQTKFAATVRLLVGGKLNVHMNPPQVKATIIS 
EQQAKSLLKNENTRNDYSGEILNNCCVMEYHQATGTLSAHFRNMSLK 
RIKRSDRRGAESVTEEKFTI\LFESQFSVGGNELVFQVKTLSLPVVV 
IVHG\SQ\DNNATATVL\WD\NAFLQEPG\RVPFAVPDKVLWPQLCE 
ALNMKFKAEVQSNRGLTKENLVFLAQKLFNNSSSHLEDYSGLSVSWS 
QFNRENLPGRNYTFWQWFDGVMEVLKKHLKPHWNDGPILGFVTKQQA 
HDLLINKPDGTFLLRFSDSEIGGITIAWKFDSQERMFWNLMPFTTRD 
FS IRSLADRU5DLNYLI YVFPDRPKDWYSKYYTPVPCESATAKS/ V 
LDG YVK\ PQ I KQ WPE FVNAS ADAGGG S ATYMDQAPS PAVC / PPQAH 
YNMYP\QNPD*VLDTDGDFDLEDTMDVARRVEEL\LGRPMDSQWIPH 
AQS 

2935 

B 

123 

219 

XPPTGMTMDKSELVQKAKLAEQAERYDDMAAA* 

2936 

A 

1 

1815 

MTSRSPTLPSPLPYAAKQEMWRGYCAGGAGFISSDKGGKADGSVGQ 
GGGVAI TGDGEAVS SGKKVS SEFTNS VS PAL SGS SHTS P FQVAGSHS 
FPINALTLTVDTWLSLMLVRAIGACLFGYYPSWRMEGCAARRSGWAS 
SGISRSRRCKETASAKLACQQWLQVAKRAGLGGGQSGRTVLRERVRI 
EIASTHIAI^VECVSVRVWKVPRLPSRQKYAKHASQLVSNQKSEEEE 
YMKTFN I S QQDLELVE VATE KI TML YEDNKHHVG AA I RTKTGE 1 1 S A 
VHIEAYIGRVTVCAEAIAIGSAVSNGQKDFDTIVAVRHPYSDEVDRS 
IRVIPTYGTDEWEQWWNAFNEENLFCSEEMPSSDDEATADSQHSTPP 
KXKKKGMTMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNE 
ERNLLSVAYKNVKTERNEKKQQMGKEYREKIEAELQDICNDVLYSPP 
FPMDVASCEQTAVIWSLLGSSCSASLPGSGLVLGWCTESCDVNHL 
WVSQPAIPAPVSVAISEGAMDSVKVLSFGGLMFFVLIGLLPGGGAFQ 
KASAVWWRGIGGGQGPRTPLSSTTRVDRKGPPGGGGARSV 

2937 

A 

1 

510 

KTERNEKKQQMGKEYREKIEAELQDIOTOVLYSPPFPMDVASCEQTA 
VIWSLLGSSCSASLPGSGLVLGWCTESCDVNHLWVSQPAIPAPVS 
VAISEGAMDSVKVLSFGGLMFFVLIGLLPGGGAFQKASAVWWRGIG 
GGQGPRTPLSSTTRVDRKGPPGGGGARSV 

2938 

A 

107 

961 

WGLGRKR S S RSGAS LGR VF VPCS P PTGMTMDKS ELVQKAKLAEQAE 
R YDDMACS S ERHVTEQG \ HERLQPKKRNLLS \ VA YKNV\ VGARRFLP 
GRVISSI EHENREGMKKKQQMGK\ EYP * EDRRQNLQDICK* CFWELL 
\DKYLI PN/AVTQPES * GVLL* KLKGGYFRYLSEVAS *DNKQTTVSN 
SQPGFTREHFEISKKEMQPTHPIRLGLALNFSVFYYEILNSPEKACS 
LAKTAFDEAI AELDTLNEES YKDS TL I MQLLRDNLTLWT SENQGDBG 
DAGEGEN 

2939 

A 

208 

1178 

EGRGHLARSFARKGLCSLQPSHGRVSRDLGRYKKCILPIRLWIEKQE 
EH^T^SQGMTMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELS 
NEERNLLSVAYKNVV\GARRSSWRVISSIEQKTERNEKKQQ\MGKEY 
JCbKX bJUL Li QU 1 C \NDVFE \ LLEQ I S LF PNATQPRKVKVFLLEK*KGD 
YF \ R YLF * KWQS * RQQTQPTCVETPQQS L PRKQ FE I S KKEMQP \ THP 
IR\LG\ LAPKFPQSFYYE I PKLPLKKAW * AWAKTALCFRHFD * SNLL 
NLDTLEWKSLIKDSTSDSWQLLR\DN\LTLWDIRKTRGDEGDAGEGE 
N 

2940 

A 

1 

461 

IEIHIKCGGIPAVLAAPAMGLELFLDLVSQPSRAVYIFAKKNGIPLE 
LRTVDLVKGQHKSKEFLQINSLGKLPTLKDGDFILTE/SANRSVYLQ 
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GGCWHITEMRFSKCKAEGPAKSRSAILIYLSCKYQTPDHWYPSDLQA 
RARVHEYLGWHAD 

2941 

A 

2 

311 

HEQTPDHW YPSDLQARARVHE YLGWHADCI RGTFG I PL WVQVLGPL I 

GVQVPEEKVERNRTAMDQALQWLEDKFLGDRPFLAGQQPVAJ^GYELF 
EGRPRLAAW 

2942 

A 

205 

454 

EEGWLGSWRWRARLEGSPSNAEKPPGPTGRWEQLGVNAWSQCPYSH 
CPFVPSSAILIYLSCKYQTPDHWYPSDLQARARVHE 

2943 

A 

5 

437 

AQESRDCGDHDRATALQPGQQSEILLLQRLPPENAVD*VALEGSQPL 
* TLACLLSCKYQTPDHWYPSDLQARARVHE YLGWH7U3C IRGTFG I PL 
WVQVRRANLPFLS FPLAPVPSLQLHVKLCTDKTLS VLGSAS ILRCS I 
HTS 

2944 

B 

307 

447 

XSAILIYLSCKYQTPDHWYPSDLQARARVHEYLGWHADCIRGTFGIP 
L* 

2945 

A 

2 

385 

TALSAFTAIPAVLAAPAMGLELFLDLVSQPSRAVYIFAKKNGIPLEL 
RTVDLVKGGPSPFPRVSTNPVHPQAPALLCPERLAARTAPSPPPAAS 
ATRECCGLSGLGGITASLNLSLPSEKEDNVTFC 

2946 

A 

1 

- 

1278 

MPGLGFREKKGGSRTVIPASRGCGLPAPILCTKWELPLSGSSRCLAA 
AAIiQGTVWTAESSSLTPAFQSRGWGLIPYFPARRDPATAAAHTALSA 
FTAI PAVLAAPAMGLELFLi\DLVSQPSRAVYI FAKKNGI PLELRTVD 
LVKGQHKSKEFLQINSLGKLPTLKDGDFILTESLAILIYLSC\KYQT 
A\DH\WYP\SDLQGFGARVHEVPWAW\HADLHPVGTFWYYPWGVQGV 
WGHSLGVQVPEEKVGTQTRTAMDQALQWLEDKFLGDRPFIAGQQVTL 
ADLMALEG S * CKP VALG YELFEGR PRL \ AAWRR \ R VEAFLG \ AELCQ 
EA/HTS IILSI LEQAA\ KKTLPTP \ S P\ EAYQAMLLRNRPGS PEGSG 
MGAKE I SNKDS FC YLLAPFYLSIjLPQSLLS KLQCEALHRQRHSS VLW 
QVLLLLRCKHT 

2947 

A 

298 

458 

NI YQLE * K* FKNGQVWLGAWRACNPSTLGGR/ WIPWGQEFGTSLAN 
MVKPRLY 

2948 

A 

59 

190 

ARS FHSDAPQGHLECHW CPGVXiEIj *GHRNPDTCPHLEQG I S SK 

2949 

A 

319 

462 

HFSYIHWFYTEKYKIKITHQDYLQKFIIY/CTEEKHAYH*PNKILFI 
T 

2950 

A 

78 

3703 

FQELGAREPFGNMYDADEDMQYDEDDDE I TPDLWQEACW I VI SS YFD 
EKGLVRQQLDSFDEFIQMSVQRIVEDAPPIDLQAEAQHASGEVEEPP 
RYLLKFEQIYLSKPTHWERDGAPSPMMPNEARLRNLTYSAPLYVDIT 
KTVIKEGEEQLQTQHQKTFIGKIPIMLRSTYCLLNGLTR\RDLCELN 
ECPSDP\GRYFII\NGSEKVLIAQRKMATNTSFCRLQKTDSKYAYT\ 
GECRSCLENSSRPTSTIWVSMLARGGQG\AKKSAIGQRIVATLPYIK 
QEVP 1 1 IVFRALGFVSDRD ILEHII YDFEDPEMMEMVKP SLDEAFV I 
QEQNVALNFIGSRGAKPGVTKEKRIKYAKEVLQKEMLPHVGVSDFCE 
TKKAYFLGYIWHRLLLAALGRRELDDRDHYGNKRIiDLAGPLLAFLFR 
GMFKNLLKEVRIYAQKFIDRGKDFNLELAIKTRIISDGLKYSLATGN 
WGDQKKAHQARAGVSQVLl^LTFASTLSHLRRLNSPIGRDGKIiAKPR 
QLHNTLWGIWCPAETPEGHAVGLVKNLALMAY I S VGSQPS PILEFLE 
EWSMENLEE I S PAAI ADATKI FVNGCWVG IHKDPEQLMNTLRKLRRQ 
MD 1 1 VS EVSM I RD I RERE I R I YTDAGRI CRPLL I VE KQKLLLKKRH I 
DQ\LKEREYNNYSWQDLVASGVVEYIDTLEEETVMLAMTPDDLQEKE 
VAYCSTYTHCE IHPSM I LGVCAS 1 1 PFPDHNQS PRNTYQ S AMGKQAM 
GVYITNFHVRMDTI^UIVLYYPQKPLVTTRSMEYLRFKELPAGINSIV 
AIASYTGYNQEDSVIMNRSAVDRGFFRSVFYRSYKEQESKKGFDQEE 

VFEKPTREPPOf3\ TVFl'KT rrnrvN r*\ t nnncnnronrkn'irTTnvfn 

VTLP\ENEDE/LWESTNRRYTKRDCSTFLRTSETGIVDQVMVTLNQE 
GYKFCKIRVRSVRIP^IGDKFASRHGQKGTCGIQYRQEDMPFTCEGI 
TPD\ 1 1 INPHAI PSRMTIGHLI ECLQGKVSGLTRGE IG \ DA\ TPF * W 
MLFNVAGRFPNLFILDYGLSSSGGKWRSLYNGFTG\RKFTSQIFIGP 
TYYQRLKHMVDDKIHSRARGPIQIRQ*TSPMEGSSP\DGGLRF*\EM 
DRD\CQIAHGA\AQFL\RERLFEGHPEPHIQVPCFAN\LCGI\MAIA ! 
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QHPGPHTYECRGCPQ*NPRFSLVIU4PTACTL\LFQBLMS\MSIAPRM 
MSV 

2951 

C 

162 

293 

MGNIiLKVLTCTDLEQGPNFFLDFENAQPTESEKEIYNQVNVAIj* 

2.952 

A 

286 

1327 

AGAKFFP * F* KVADAQPTESEKEIYNQVNWLKDAEGILEDLQSYRG 
AGHE I REA IQHP ADEKV AKKKAWG AWPL \ VGKLKKF YE FS QRLEAA 
LRGLLGALTSTPYSPTQHLEREQALAKQFAEILHFTLRFDELKMTNP 
AIQNDFSYYRRTI,SRMRINN\YPAEGENEVANELANRMSLFYAEATP 
\MLKTLSDATTKFVSENKNLPIENTTDCLSTMASVCRVMAGNT\EYR 
SQILQNEETVSFLSWRAMVGCS*YSYDHRTSSGEAFCLKIiPKFDMKG 
CIKVIiKDQPPNSVEGLLK\StiR\YTTKHLEWMRIiPPSKLKSMLPITI 
LG I KHPAVDRRQYS AMTENAVF 

2953 

A 

75 

478 

AVKMCDFT*DQTA*FKEAFQLFDRTGDGKILYSQCG\DPKSDEMNVK 
VLDFEHFLPMLQTVAKNKDQGTY *DYVEGLRVFDKEGNGTVMGAEI R 
HVLVTLGEKMTEEEVQMLVAGHEDSNGCINYEELVRMVLNG 

2954 

A 

2 

343 

DFTEDQTAEFKEAFQLFDRTEMNVKVLDFEHFLPMLQTVAKNKDQGT 
YED YVEGLRVFDKEGNGTVMGAE I RHVLVTLG E KMTE EE VEML V AGH 
EDSNGC INYEELVRMVLNG 

2955 

A 

34 

353 

AVKMCDFTEDQTAEMNVKVIJDFEHFLPMLQTVAKNKDQGTYEDYVEG 
LRVFDKEGNGTVMGAE I RHVLVTLGEKMTEEEVEMLVAGHEDSNGC I 
NYEAFVRHILSG 

2956 

A 

2 

303 

liKFRPRRRRTFLPLLCSCCLGLGGGIiTAAGRGARGMENWSDSRWLLC 
KLTPGLVAVGMFPLMLPLQSGSPLALGVPLHSPVPWLIPPFLSTELV 
RMVLNG 

2957 

A 

1 

999 

MLYCFSS GNVS KS PTANNARTGRAD YVS I TDDEALEAFKTLCLHEG I 
I PALES SHALAHALKMMRENPDKEQLLIjKERKEGAFVPFVTliGDPG I 
EQSLKIIDTLIEAGADALELGIPFSDPLADGPTIONATLRAFAAGVT 
PAQCFEMLALIRQKHPTIPIGLLMYANLVFNKGIDEFYAQCEKVGVK 
TVLVADVPVEESAPFRQAAIiRHNVAP I F ICPPNADDDLLRQIASYGR 
GYTYLMS RAGTVAKNKDQGTYED YVEGLRVFDKEGNGTVMGAE I RHV 
LVTLGEKMTEEEVEMLVAGHEDSNGC IUYEVL IHTGPAHSRDRLRS S 
VRLD 

2 958 

A 

255 

528 

ARYKIHTGKGTDSGETGKVLSHSAPCGRCSRPWARHPDRSMPHKRFI 
VDTNQLLS SCPATS I STS S S VNVLPRCD/ FPEDQTAEFKEAFQLFDR 
TGDGKILYSQ\CGDVMRAIX3QNPTHA£\VIjK\njGNP\KSDEMl!rVKVL 
D FEHFL P \ MLQTVAKNQDPG S PMG FMSKE LR VF YK\ EGNGT\ VMGAE 
IRHVL\ VTPG* \ KIiTEEEVEMIjVAGHEDSN\ G\ CINYEAFVRHTPVG 
VTGPWAGASSAWC 

2959 

A 

26 

518 

avkmcdfte\dqtaefkeafqiifd/rkpgdgkipctaqcgdvmralg 
qnplpntveXvlkapgepprsddmsvkehXdcehfXlhilqtXvaqnq 
g\qgty\edyvedfrvfdkeg\ngt\vmga\eirhvlvtlgeed*qr 
kkvemlvagp * rtanglvsnyeelvrmvlng 

2960 

A 

453 

744 

APSNS LHGH SQPY FFTGI ARQWAGAVGGWPEDAADSEHNT/VP P PHLG 
GPPLLPLLALLAGSLGSPGTVALRQVPMQAHPLHHCPQHVCQCLPAY 
CH 

2961 

A 

736 

1062 

SW I SDGTGRSAF I GIGFTDRGDAFDFNVSLQDHFKWVKQESB ISQES 
QEMDARPKLDLGFKEGQTIIO.CIGIiCSKPGTTAIQIiGPVIiNGIGRTL 
RTDLRNKNDLEGTNL 

2962 

A 

295 

782 

PLIjCNPDPGWYWWVKQES E I S KESQEMDARPKIJDIjGFKEGQTI KLC I 
GNITNKKGGASKPRTARGGGLSLLPPPP*GKVTIPPPSS/VKLPSTN 
HVTPPSIPKSNHGGSDA\DILL\DLDSPAPVTTPAPTP\VSVSN\DL 
\WGDFSTASSSVP\NQAPQPSN\WVQF 

2963 

A 

192 

326 

SKSRMSAEVIHQVEEALDTDEK^MIjLFLCRDVAIDVVPPNVRDL 

2964 

A 

1 

1044 

MAEIGEDLDKSDVSSLIFLMKDYMGRGKISKEKSFIJDLVVELEKLNL 
VAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSYRNVLQAAI 
QKSLKDPSNNFREEPVKKSIQESEAFLPQSIPEERYKMKSKPLGICL 
IIDCIGNETELLRDTFTSLGYEVQKFLHLSMHGISQILGQFACMPEH 
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RDYDSFVCVLVSRGGSQSVYGVDQTHSGLPLHHIRRMFMGDSCPYLA 
GKPKMFFIQNYWSDGQLEDSSLLEVDGPAMKNVEFKAQKRGLCTVH 
READFFWSLCTADMSLLEQSHSSPSLYLQCLSQKLRQERGTIPGSGI 
TESKDMHFSSLGCILLDVL 

2965 

A 

1 

811 

MSAEVIHQVEEALDTDEKEMLLFLCRDVAIDWPPNVRDLLDILRER 
GKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLVSDYR 
VLMAE I GEDLDKSDVS S L I FLMKD YMGRGKI S KEKS FLDLWELEKL 
mjVAPDQLDLLEKCLKNIHRIDIiKTKIQKYKQSVQGAGTSYRNVLQA 
AI QKS LKDP SNN FRS I PEERYKMKS KPLG I CL I IDC IGNETELCGVR 
GPAGGQQPLGGGWASDEECGIQGSEARAVHSSPRS 

2966 

A 

1 

750 

MSAEVIHQVEEALDTDEKEMLLFLCRDVAIDVVPPNVRDLLDILRER 
GKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLVSDYR 
VLMAEIGEDLDKSDVSSLIFLMKDYMGRGKISKEKSFLDLVVELEKL 
NLVAPDQLDLLEKCLKNIHR I DLKTKI QKYKQSGGWNGTWMTKAS FS 
LWLEIASLYILHIPDLPTPLPCIFMISAKCCQAVLNISVFKKESGIC 
TGWQSYGHSGVLHVC 

2967 

A 

493 

1946 

SKSRMSAEVIHQVEEALDTDEKEMLLFLCRDVAIDVVPPNVRDLLDI 
LRERGKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLV 
SDYRVLMAE I GEDLDKSDVS S LI FLMKDYMGRGKIS KEKSFLDLWE 
LEKLNLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSYRN 
VLQAAIQKSLKDPSNNFRLHNGRSKEQRLKEQLGAQQBPVKKSIQES 
EAFLPQSIPEERYKMKSKPLGICLIIDCIGNETELLRDTFTSLGYEV 
QKFLHLSMHGISQILGQFACMPEHRDYDSFVCVLVSRGGSQSVYGVD 
QTHSGLPLHHIRRMFMGDSCPYLAGKPKMFFIQNYVVSEGQLEDSSL 
LEVDGPAMK1JVEFKAQKRGLCTVHREADFFWSLCTADMSLLEQSHSS 
PSLYLQCLSQKLRQERKRPLLDLHIELNGYMYDWNSRVSAKEKYYVW 
LQHTLRKKLILSYT 

2968 

A 

539 

1215 

SKSRMSAEVIHQVEEALDTDEKEMLLFLCRDVAIDVVPPNVRDLLDI 
LRERGKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLV 
SDYRVLMAE I GEDLDKSDVS SLI FLMKD YMGRGKI SKEKS FLDLWE 
LEKLNLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGTSYRN 
VLQAAIQKSLKDPSNNFRMITPYAHCPDLKILGNCSM 

2969 

A 

414 

1762 

SKSRMSAEVIHQVEEALDTDEKEMLLFLCRDVAIDWPPNVRDLLDI 
LRERGKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNPHLV 
SDYRVLMAE IGEDLDKSDVSSL I FLMKD YMGRGKI SKEKS FLDL WE 
LEKLNLV APDQLDLLE KCLKN IHR I DLKTKI QKYKQS VQGAGTSYRN 
VLQAAI QKSLKDPSNNFRS I PEERYKMKS KPLGI CLI IDCIGNETEL 
LRDTFTSLG YE VQKFLHLSMHG I SQ I LGQF ACMPEHRD YDS FVCVLV 
SRGGSQSVYGVDQTHSGLPLHHIRRMFMGDSCPYLAGKPKMFFIQNY 
WSEGQLEDSSLLEVDGPAMKNVEFKAQKRGLCTVHREADFFWSLCT 
ADMSLLEQSHSSPSLYLQCLSQKLRQERKRPLLDLHIELNGYMYDWN 
SRVS AKEKYYVWLQHTLRKKLI LS YT 

2970 

A 

449 

1305 

SKSRMSAEVIHQVEEALDTDEKEMLLFLCRDVAIDV\TPPNVR\DLL 
\DNLRERGKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETHLLRNP 
HLVSDYRVLMAE IGEDLDKSDVSSL I FLMKD YMGR\GKISKEKSFLD 
LWELEKLNLVAPDQLDLLEKCLKNIHRIDLKTKIQKYKQSVQGAGT 
SYRNVLQAAIQKSLKDPSNNFRLHNGRSKEQRLKEQLGAQQEPVKKS 
IQESEAFLPQSIPEERYKMKSKPLGNLPGLIDWHVJALRTELSFEDTF 
TSLGY 

2971 

A 

473 

1164 

SKSRMSAEVIHQVEEALDTDEKEMLLFLCRDVAIDV\VPPNVRD\LL 
DILRERGKLSVGDLAELLYRVRRFDLLKRILKMDRKAVETPLFKDPP 
LVSD YGVLMAE I GEDLDKSDVS S LI FLMKD YMGRGKI S KEKS FLDLV 
VELEKLNLV/ APPDQLDL\ LEKCLKNI PQK* P *RQKS QKYKQS V\QG 
AGTSYRNVL\QAAI \QKSLKDP\ SNNFRMITP\ YAHCP\DLKI\LGN 
CSM 

2972 

A 

482 

692 

EKPL*SYLSSFFHFLEMGSHSIPRLVCSGVIVAHCSLQLLGSSDVPT 


tan 
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S/ASR* LGLTG SHTT IPS I/GLFLN 

2973 

A 

1 

358 

AQLVEFGEHSMAPPAHFRALLYHPGTATLVPHPASISQHSPSPWGNA 
/ RG * PV* RQRHI/TAPRSPPHPRFRHKPGKDPRENPSRWPEVPSLPQT 
HWPGQAA^DTVNTTVCKNRSTKPQD 

2974 

A 

3 

106 

L PSG I L VQTAARTVMMAPDAP S DG P YL P VATL PT ] 

2975 

C 

109 

294 

MDTAGMILGWKFFDHAAHLGGALFGIWYVTYGHELIWKNREPLVKIW 
HE IRTNGPKKRRWL * 

2976 

A 

3 

131 

IGRKIMASLPSGILVQTAARTVMMAPDAPSDGPYLPVATLPT 

2977 

A 

788 

1151 

QPTLENKG PLRLE SD VK I PGQ W P AAE KVLS I NRNTKEHL PANNTCQQ 
SAGPSTLGYSHPLVKSPFPESLPSFFWLNAYPNPIQPFNTHPK* /P 
PGT/ HW I S SD S YC PNS S * S KMHKPCKPNK 

2978 

A 

136 

502 

GTLPNSSLCESNNGHPRQRGRGSCPSRYTGVPSSPRLSRIQSWIR/H 
LHPPPSPTRPSYSSPQASASPVGLRGHPAQTWTTCSRSVPSPVRFSS 
GSDGEKRLGQSCPHGNDPGAACISRPPYA 

2979 

A 

242 

9991 

FKLELHQESDMEFNNTTQEDVQERIAYAEQLWELKDIIRQKDVQLQ 

QKD E ALQE ER KAADNKI KKL KLHAKAKLTS LNKY I EEMKAQGGTVL P 

TEPQSEEQLSKHDKSSTEEEMEIEKIKHKLQEKEELISTLQAQLTQA 

QAEQPAQSSTEMEEFVMMKQQLQEKEEFISTLQAQLSQTQAEQAAQL 

SSMQQWREKDARFETQVRLHEDELLQLVTQADVETEMQQKLRVIiQR 

KLEEHEESLVGRAQWDLLQQELTAAEQRNQILSQQLQQMEAEHNTL 

RNTVETEREESKILLEKMELEVAERKLSFHNLQEEMHHLLEQFEQAG 

QAQAELESRYSALEQKHKAEMEEKTSHILSLQKTGQELQSACDALKD 

QN S KLLQD KNEQAVQS AQT I QQLEDQLQQKS KE I SQFLNRLPLQQHE 

TASQTS FPDVYNEGTQAVTEENI ASLOKRWELElsTEKGALLLSS IEL 

EELKAENEKLSSQITLLEAQNRTGEADREVSEISIVDIANKRSSSAE 

ESGQDVLENTFSQKHKELSVLLLEMKEAQEEIAFLKLQLQGKRAEEA 

DHEVLDQKEMKQMEGEGIAPIKMKVFLEDTGQDFPLMPNEESSLPAV 

EKEQASTEHQSRTSEEISLNDAGVELKSTKQDGDKSLSAVPDIGQCH 

QDELERLKSQILELELNFHKAQEIYEKNLDEKAKEISNLNQLIEEFK 

KNADNNSSAFTALSEERDQLLSQVKELSMVTELRAQVKQLEMNLAEA 

ERQRRLD Y E S QTAHDNLLTEQ I HS L S I E AKS KD VK I E VLQNELDD VQ 

LQFSEQSTL I RSL.QSQLQNKES EVLEGAER VRH I S S KVEELS QALSQ 

KELE I TKMDQLLLEKKRDVETLQQTI EEKDQQVTE I S FSMTEKMVQL 

NEEKFSLGVEIKTLKEQLNLLSRAEEAKKEQVEEDNEVSSGLKQNYD 

EMSPAGQISKEELQHEFDLLKKENEQRKRKIrQAALINRKELLQRVSR 

LEEELANLKDESKKEIPLSETERGEVEEDKENKEYSEKCVTSKCQEI 

EIYLKQTISEKEVELQHIRKDLEEKIJVAEEQFQAIiVKQMNQTLQDKT 

NQIDLLQAE I SENQAI IQKL I TSNTDASDGDS VALVKETWI S PPCT 

GSSEHWKPELEEKILALEKEKEQLQKKLQEALTSRKAILKKAQEKER 

HLREELKQQKDDYNRLQEQFDEQSKENENIGDQLRQLQIQVRESIDG 

KLPSTDQQESCSSTPGLEEPLFKATEQHHTQPVLESNIiCPDWPSHSE 

DASAIjQGGTSVAQIKAQLKEIEAEKVELELKVSSTTSELTKKSEEVF 

QLQEQINKQGLEIESLKTVSHEAEVHAESLQQKLESSQLQIAGLEHI* 

RELQPKLDELQKLISKKEEDVSYLSGQLSEKEAALTKIQTEIIEQED 

L I KALHTQLEMQAKEHDER I KQLQVELCEMKQKPEE IGEESRAKQQI 

QRKLQAALISRKEALKENKSLQEELSLARGTIERLTKSIiADVESQVS 

AQNKE KDTVI^GRLALLQEERDKIil TEMDRSLLENQS LS S S CE \ S PKL 

ALEGLTEDKEKLVKEIES LKS S KI AESTEWQEKHKELQKE YE I LLQS 

YENVSNEAERIQHWEAVRQEKQELYGKLRSTEANKKETEKQLQEAE 

QEME EM KE KMRKFAKS KQQKI LELEEENDRLRAEVHPAGDTAKECME 

TLLSSNASMKEELERVKMEYETLSKKFQSLMSEKDSLSEEVQDLKHQ 

I EDNVS KQANLEATEKHDNQTNVTEEGTQS I PGETEEQD S LSMSTRP 

TCSESVPSAKSANPAVSKDFSSHDEINNYLQQIDQLKERIAGLEEEK 

QKNKEFSQTLENEKNTLLSQISTKDGELKMLQEEVTKMNLLNQQIQE 

ELSRVTKLKETAEEEKDDLEERLMNQLAELNGSIGNYCQDVTDAQIK 

NELLESEMKNLKKCVSELEEEKQQLVKEKTKVESEIRKEYLEKIQGA 
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QKEPGNKSHAKELQELLKEKQQEVKQLQKDCIRYQEKISALERTVKA 

LEFVQTESQKDLEITKENLAQA\VEHRQKGTSQNLASFK\VLLDDTQ 

SEAARVIADNIiKLKKELQSNKESVKSQMKQKDEDLERRLEQAEEKHL 

KEKKNMQEKLDALRREKVLLEETIGEIQVTLNKKDKEVQQI^ENLDS 

TVTQLAAFTKSMSSLQDDRDRVIDEAKKWERKFSDAIQSKEEEIRLK 

EDNCSVLKDQLRQMSIHMEELKINISRLEHDKQIWESKAQTEVQLQQ 

KVCDTLQGENKELLSQLEETRHLYHSSQNEIiAKLESELKSLKDQIiTD 

LSNSLEKCKEQKGNLEGIIRQQEADIQNSKFSYEQLETDLQASRELT 

SRLHEEINMKEQKIISLLSGKEEAIQVAIAELRQQHDKEIKELENLL 

SQEEEENIVLEEENKKAVDKTNQLMETLKTIKKENIQQKAQLDSFVK 

SMSSLQNDRDRIVGDYQQLEERHLSIILEKDQLIQEAAAENNKLKEE . 

IRGLRSHMDDLNSENAKX.DAELIQYREDLNQVITIKDSQQKQLLEVQ 

LQQNKELENKYAKLEEKLKESEEANEDLRRSFNALQEEKQDLSKEIB 

SLKVSISQLTRQVTALQEEGTLGLYHAQLKVKEEEVHRLSALFSSSQ 

KR I AELEEELVCVQKEAAKKVGE I EDKLKKELKHLHHDAGIMRNETB 

TAEERVAELARDLVEMEQKLLMVTKENKGLTAQIQSFGRSMSSLQNS 

RDHANEELDELKRKYDASLKEIiAQLKEQGLLNRERDALLSETAFSMN 

STEENSLSHLEKLNS\MLLSRDEQSLHLSSQLEDSYNQVQSFSKAMA 

SLQNERDHLWNELEKFRKSEEGKQRSAAQPSTSPAEVQSLKKAMSSL 

QNDRDRLLKELKKLQQQYLQINQEITELHPLKAQLQEYQDKTKAFQI 

M\QEE\LRQEKPSWQHELHQLRMEKSSWEIHERRMKEQYLMA\ISDK 

DQQLSHLHNL IRELRS S S SET\ EPLKVQYQRQAS PETS ASPDG\ SQN 

LVYETELLRTQLNGQL*RKFHQKELRIQQLNSNFSQLLEEKNTLSIQ 

LC \ DTS QS LRENQQHYGDLLNH C AVLE KQ VQELQ AVS KE KGPLNI D V 

APGAPQEKNGYHRKSDPEELREPQQSFSEAQQQLCTSRQ\EVNELR\ 

KX.LEEERDQRVAAENALSVPEEQIRRLEHSEWHSSRTPIIGLVGNQG 

AATVID\LTSNSCRRT\RSG\VGWKRVXiRSLCHSRDPECPLSSSHLT 

F*MIHVLLIMCYTGHL 

2980 

A 

3 

211 

GDLPWEINPPSSCSLLREKDPPTTSGPEIDQPKKHLTNFKSATPRGP 
GTLAQGSLTDSFPDLLGLVAED 

2981 

A 

1 

519 

MLLTQSLFGGLFTRTHMKFGAVTQIRGPPLGDKSPVLLLFALERQRR 
HVLSMDPKLRCWSRTGKAAFPWCLII71EMPDYSPTFQRCQTTQGRLP 
WSFTLSSKSRFSGEGARACYKCQKSDHQAKECPQPGIPPKPCPICAG 
PHWKSDCSTHLAATPRAPGTLAQGSLTDSFLA 

2982 

A 

291 

686 

GLGTFAYISQDVWKSLYVQAEVCCRGRVFLVCSCGSCWSKSSRCEPP 
YAVLSIQVEAAHMHEIWCRDSDRDLPWE\SSCSLLLEKDPP/SGPQT 
DQPKKHLTNFKSDKGDTFYPWTQNSGAGHGLGRQPSLGV 

2983 

A 

1 

639 

MWESLELNRYLIJ^DFDQKPDSDMDNKVQAEVVSDGDEEL,VGNRSKET 
KETRFIRGPKTPALVTDWEGSLPLVFNHCREACLGPSPbAASPAFLG 
QGQDL INIJ^KVYNNRKKLQFLAS TVRQTPATS PAHKNFQTPEPQRP 
GVPPEPPPTGACYKYQKSDHQAKECLQPRIPPKPCPICAGPHWKLDC 
S THLAATPRAPGTLAQG S LTDS F S A 

2984 

A 

2 

425 

GFLVLLTSRMKLRTLAKQRRHILSVDPKLRRRSRTGKAAJFPWCLIIA 
GTPL I IHPRFKGVRPCRD ACLGP S PLAAS PAFLGEGQELATS ARNLT 
TRPRNACSPGFLLSRVHLCGTPLEI *PPGQGMPAAQDSS *AVS ICAG 
PHWKSDCSTHLAGTPRAPGILAQGSL 

2985 

C 

80 

106 

MRMKFGLD* j 

2986 

C 

259 

531 

MSPAHKNFQTPEPQRPGIPPEPPPPGACYKCQKSGHQAKECLQPRIP 
PKPRPICAGPHWKLDCPTHLASTPRAPGTLAQGSLTASQIFLA* 


A 

3 

505 

TKETRF I RGP KTPALVTDWEG S L PLVFNHCREACLGP S P LAAS PAFL 
GQGQDLINIAFKVYNITOKKIjQFLASTVRQTPATSPAHKNFQTPEPQR 
PGVPPEPPPTGACYKYQKSDHQAKECLQPRIPPKPCPICAGPHWKLD 
CS THLAATPRAPGTLAQG S LTDS FS A 

2988 

A 

1 

1056 

MWKGPKGLDMYGKSSVSPKTSDILGRDTLLLALKVQTVVLQTACGEG 
HVAGNCGR PLETEG S LQLTATKKLRDS VLQPKS PE FCQQ FTRAWNRT 
QVPDETEAPAGTYAAQSGDLPWEINPLSSCSLLHEKDPPTASGPQTD 
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QPKEHLTNFKSGFRGVRPRRDACLGPSPLAASPAFLGKGQVPQPLLS 
VSLPLLRLSGGQETPNPFS FTLSGKS AFLEEQVPQPR I S VPQSL I ST 
PRPLISVAQSLISAPQPLISLCPNPLFPCPDLLSL.HPNPLFPRPNPF 
PAFLBGACYKCQKSGHQAKECLQPGIPPKLRPICAGPHWKSDCPTHli 
AATPRAPGTLAQGSLTASQIFLA 

2989 

A 

1 

1482 

CSEYEDSSPAPVPATDLSSTLSSSVPQPQDTGTSQQLHPLDPWHELL 
RAQELQGATNHKGYSHAEHEHAGLGVQGGNGALAFSNSGHRHAVPTI 
SSGTGRRRTPSSSAFGLLNLHQWFVSGFQAFSDRLKAAIiSASLLLRF 
GDSDWLPSSSACKCLMLGLHFVIVGNICATLKEKYSSMLHLDVTMKK 
NGEKRTRLQKRKKGMPPHPAYEDLNIAAITIiPANWLHQPSGFRTSG 
QLD PVWWS LDTDAHEI WCQDPGLG S GD FPWE I T PL S S YS LLHE KD P P 
TTSQPQTDQPKKHLTNFKSKTKETGFIHGPKTPAPVTDWEGSLPLVF 
NHCRDTSLIIHPCFKGVRPRRDACLGPSPLAASPAFLEKGQDLINLA 
FKVYNNRKKLQFLASTVRQTAATSPAHKNFQMPEPQRPGVPPEPPPT 
G AC YMCRKS GHWAKEC PQ PG I P P KPRP I C VG PHRKSDCS TYLAATPR 

n "Dr^KIT ari<lCT.TnCPDnT t r*T a 7\ r?r*» 
/\f OIM J-irtVvjoJj A i/of JrDijJjVjLiAAiiiD 

2990 

A 

399 

569 

ACYKCWKSGHRVKECLQPRIPPKPCPICAGPH*RSDCSTHLAATPRA 
PGTLAQGSL 

2991 

A 

1 

468 

MSSQRLFLTIDLPKTQCAQEKLRVIRGCRDQAMEAXiQMTKPCFFKST 
WSQNHVEKCVPHFSPPLYVALNHLMALKGSQKSPKNYNTVFTKIRNG 
ACYKCQKSDHQAKECLQPRIPPKPCPICAGPHWKSDCSAHLAATPRA 
PGTLAQGSLTDSFWA 

2 992 

A 

1 

984 

MSGAKGFTGGIPGAGGGVKEGGEPRVGILDTVAVAQDEGLHLGEKLD 
CPAELLREEERGLGKGWIDRCGFLVSIjTSRMKPRTIjAVSVTVLKDGV 
SGVCSFRCPDVSRASSFRWVliGLADFRSEAADLHKSRFSGLGSQVSR 
PSVHLQLLSVSVLCTAKVIEVVETKETCFIREPKTPAPVMDWEGSLP 
LVFNHCRDASLMIHSRFKGVRPRRDACLGPSPIxAASPAFLGQGQVPL 
NPFSFTLSAHKNFQTPEPQQPGVPPEPPPPGACYKCQKSDHQAKECL 
QPRIPPKPCPICAGPHWKLDCSTHLAATPRAPGTLAQGSLiTDSFSA 

2 993 

A 

1 

« 

900 

MLTEGFLAFLGSSLTATSLSAVSIGESSWIQCHTSLWGCKGGQERAP 
GEVEEPWWAKAKAGITFGCHLAKQEMGQTPVAQTAPKDVSPQSTDDP 
YSQSGDLPREINPLSSCYLLHEKDPPTTSNPQTNQPKEHLTNFKSVR 
LIPNLSSFPPTCPHSPNPKRCFQSSFSTDPSDLSLHPKIAPPQVYNN 
REELQLLASSLRESLATSPAHKNFKMPKPQWPGVPSGLHPSGSCFTC 
WKSGHWAKECLWPGIPPKLYPIYEGLHWKSDCPTHLATTPRAPGTIjA 
QDSLTDSFPDLLSLAAED 

2994 

A 

1 

522 

MPIKASWPHPPGPFLLLTIjLLGLTGPPMWTMLKNNI/^EKNDIHSRDS 

hasqsgmasymlhhleasafslqhlrcsssgsmkagvancypegmvp 
igscsskdsgscfkcqkfchwakeclqprippklcpicvglhwksdc 
pthqaatprapgtlaqgsltdsfpdllglvted 

2995 

A 

3 

1150 

waptpskagknagrvfkdsdkldandrilkdqtqewsgsatlpsdgk 
trlfytdysgkhygkqslttaqvnvsksddtlkingvedhktifdgd 
gktyqnvqqfidegnytsgdnhtlrdphyvedkghkylvfeantgte 
ngyqgeeslfnkayygggtnffrkesqklqqsakxrdaelangalgi 

IELimDYTLKKVMKPIiITSNTSLECDVPLPVSMCSHLGISRAARQVS 
TEEGLSLAQEYNCGFFETSAALRFCIDDAFHGLVREIRKKESMPSLM 
EKKIjKRKDSLWKKLKGSLKKKGILLEPAAAACYKCPKSGHWAKECPQ 
PG I PLKLR P I WAGPHRKSDCS THLAATPRARGTLiAQGS ltds fpdr l 
GliAAED 

2996 

A 

3 

501 

LEGASYGETKADVAEGKSOKKATEWP ^DfiDRPVnPFT^OT .KTICITRTTJ 

qlsvkrtnqlsviwtnqqdvgsyanplmaaepaqavklgwrpspkph 
igdksvsfwllwmgsVqqgqkigp/psqgnacrprnss*shvpcggg 
phwkldcsthlaatprapgtlaqgsl 

2997 

A 

3 

605 

GHSRPRNWSQWSPRFLWLEAHRRPSGSPDSAELRSQVSDCLGTSRHG 
TGALFLAIjGIYAT^SCYVARFIRVRPHRDAYSLQGRLSDHSPTFQGCQ 
TTQGRLPWSFTLSGKSRFSGEGATTS PAHKNFQTPEPQWPG I PPEPP 


I0ISH 
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PTGAC YTCRKS GHWAKEC PQAR I P PKPH P I CVG PH * KSDC S THLAAT 
PRAPGTLAQGSL 

2998 

A 

1 

669 

MDTVGRRLRDSLGRTVDLGLTGNHGVVHPSLADSANKPEENTYCFGR 
TVETLLLRFGKTIMEEQLVLKRVANILINLYGMTAVLSRASRSIRIG 
LRNHDHE EGQG YLGLG VKVSDMAQPH I LHFLP I QQGS PGQERRHDQR 
QMSHSGICPMPLSRRLDRPGACYKCQKSDHQAKECPQPRIPPKPCPI 
CAGPHWKLDCSTHLAATPRAPGTLARGSLTDSFSA 

2999 

A 

1 

762 

MENKEAFVFQAHKEEIFEREKQMEQDLTRRVRELGRDANNRLSPAAK 
ENEGTWLCWPFKNNRQRRHILSVDPKLRRQSRTGKAAFPWCLIIAGT 
PLL I IHPHFRGVRPRRDACLGPS PLAAS PAFLKERQDL INLAFKVYN 
NRKTLQFLASTVRQTPATSPAHKNFQTPEPQRPGIPPEPPSPGACYK 
CQKSGHQAKECPQPRIPPEPCPIRAGPHWKSDSSTHLAATPRASGTL 
AQGSLTDSFPDLLGLAAED 

3000 

A 

1 

519 

MLKDLPRAIRQEKETKGSGDLPWEINPPSSYTLLCEKDPPTTSGPQT 
NQPKKHLTNFKSETKETRFIRGPKTPAPVTDWEGSLPLVIiNHCRDAS 
LIIHPCFKGACYKCQKSDHQAKDCRQPRIPPKPRPICVGPHCKSDCS 
THLAATPRAPGTLAQGSLTDSFPDLLGLAAED 

3001 

A 

1202 

1315 

L I TAVS ALAATPRAPGTLAQGS LTDS FPDLLGLAAED 

3002 

A 

1 

930 

MEEKEERKRRRKRKRRREEGRDGERKRMEKQKENQDPKDPPTTSGPQ 
TOQPKKHLTNFKSETKETRFIRGPKTPAPVTDWEGSLPLVFNHSRDT 
SL 1 1 HPGFRGVRPRRDACLGPS PLAAS PTFLGKGPAAPRQTELGPNS 
SSASAPPPYNPFIASPPHTWSGLQFPSMTSPPPPAQQFTLKKVAGAK 
GIVKDLINLTFKVYNNRKKLQFLASTVRQTPATSPAHKNFQTPELQQ 
PGVPPEPPPRGACYKFQKSGHRAKECLQPRIPPKPHPICVGPHWKSD 
C PTHLAATPRAPGTLAQGS LTPSQ I FLA 

3003 

A 

132 

248 

LQNLSFAGLAATPRAPGTLAQGSLTDSFPDLLGLAAED 

3004 

A 

3 

439 

TSRSITTSPKESDSPAGPARQYYTKGNIiVRICLGAVILIILAGFLAE 
DWHSRRKRLRHRGRAVQRPLPPLPPLPQTRKSHGGACYKCQKSDHQA 
KECLQPR I P PKP C P I CAG PHWKSDCS THLAATPRAPGTLAQG S LTPS 
RLSG 

3005 

A 

1 

357 

MTSGPQTDQPKKHLTNFKSGVRPRRDACLGPS PLAAS PAFLREGQVP 
LNPFSFTLSGACYKCPKSDHQAEECLQPRIPPKPCPICAGPHWKSDC 
STHLAATPRAPGTLAQGSLTDSFSA 

3006 

A 

1 

765 

^fWSQNLVLSDVQICLEFLLPVGSWVLLTSGVlCLQTFTVSVTALKGG 
LSRWHSSWLLLIRSSQLGFRLFFIKYENPAQFMAHLVATLKRFTAL 
DLEGSEGLFSLCILLPNLLPTLNKAQKLDSGPQTPQQDLINLAFKVY 
NNREELQLLASAVRETPATSPAHKNFKTPKPQWSGLPSGPPPPGSCF 
KCQKSGHWAKECLQPR I PPKPCPI CVGPHWKWDCPTHLAATPRVPGT 
LAQGSLTDSFPNLLSLAAED 

3007 

A 

1 

1434 

SVAFVFACPAVDYLVKISWELQGDEEQDGLRNMIWQTLQKTKDYEED 
VRIQNAINIAQGASEAIGQRQSSAAKPRRSGKESVREPWARVPGALG 
VAARKYQKMKG I E I KRRERLKCGAKI ERRKRLKDREVG EES KKR PLT 
GFEIEPPKVTVMPKNQSFTGGSEVSIMCSATGYPKPKIAWTVNDMFI 
VGSHRYRMTSDGTLFIKNAAPKDAGIYGCLAKAPKLMVVQSELLVAL 
GDITVMECKTSGIPPPQVKWFKGDLELRPST/CFLLSILAP/PLQSL 
PSLNFNSFHFL\DFLAATPRAPGTLAQGSLTDSFPDLLGLA7^ED 

3008 

A 

166 

421 

RPRSERLLWGTS PLS/ CALTL*GDPPTTSGPQTNQLKEHLTNFKSGP 
HWKMDCPTHPAATPRAPGTLAQGSLTDSFPDLLGSAAED 

3009 

A 

1 

434 

MTSGPQTDQPKKHLTNFKSETKETHFIRGPKTPAPVTDWEGSLPLVF 
NHWRDASL 1 1 HPRFKGVRPRRDACLGPS PLAAS PAFLG KGQELATS A 
RNLTTKARNACSPGFLLSRIPSVRDPTGNRTVQLTWQPLPEPLELWP 
KAL 

3010 

A 

1478 

2240 

WQVPLS WGRGKYPSTPS PS PLAAS PTFLGQGQVPLNPFSFTLSGKSR 
F PGAGASTPQPLLLHPQRQVPLS WGRGKYP S TPS PS PS AAS PAFLGQ 
GQVPLNPFSFTLSSKSRFPRGQETPNRLFPHPNLLSLHPNPLFPHPD 
LLSLCPNPLFPCLNPFSAFLEGKNPPPLLRVSTLFSGLASFTMGACY 
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TCQKSGHWAKECPQPGIPPKPCPISVGPH*KSDCSTHLAATPGAPGT 

r 7\/^ / f""!T THCDnnT t/'»T ATI T?T"* 

JjAQ/ GJjTDSF PDjjajGJjAAJSD 

3011 

A 

1 

624 

MEFCISLGRKITFSNIFPYNGANWKVFLKKSQPIiLNAIiVKQDDAERL 
FRK r KAACjUIjF YGAF GE I FWETQCQEMQQIiADRVIACRQPGQSMEID 

HMLE I CHWAKECPQHG I PPKPRP I CVGPHRKSDCSSHLAATPRAPGT 

3012 

A 

.321 

512 

LRSGDIiPWEINALSSCSLLHEKDPPTTSGSQTDQPKKHLTNFKSETK 
ETRFI CGPKTPAPVTD 

3013 

A 

1005 

1251 

KGGWSQRHSQGACYTCRKSGHWAKECPQPGIPPKPRPICVGPH*KSD 
CSTHLAANPRAPGTLAQGSLTDSFPDLLGLAAED 

3014 

C 

18 

56 

MAINS IRSLQLM* 

3015 

A 

188 

365 

VAYPPKKVCPEKGPWPFVSHIHFGPNLYPQKKPWGEKGRKIiNCPVKG 
PWTTRGQRMPPA 

3016 

A 

602 

2459 

FWGEGEKATAFLICNLAGCLRCPVDMNS\QGSDSNEEDYDP\NCE\E 
KEEEEED\DP\GDIEDYY\VGVASDVEQQGADAFDP\EEYQFTCLTY 
KESEG\ALNEHMT\SLSFLSLKVSHSSWLNLYLVNF\HWQVSEILDR 
YKSNS AQLLVEARVQPNP / S QKHVPTSHP PHH \ C AGCGMQVCAKREN 
LL/ SLWACQL* FCRSCWEQHCS \ VLVKDGVGVG\ VSCMA\Q\DCPL\ 
RNTSGTFVFP\LLPNEEFEERKYRRYLFRDYVESHYQLQL\CPG\AD 
CPM\ VIRVQEPRA/ RARVQC\NRCNEVF\ CFKCRQMYHAPTD\ CATI 
\RKWLTKL\QDDSETANYI\SAHTKDCPKCNICI\EKDGG\CNH\MQ 
CSKCKHDFCWMCLGDWKTHGSEYYECSRYKENPDIVNQSQQAQA/RR 
EALKKYLFYFERWENHNKSLQLEAQTYQRIXHEKIQERVMNNLGTWI 
\DWQYLQNAAKFFGQGSFSSLLLQCRYTLQYTYPY\AYYM\ESGPRK 
KLFE YQQAQLEAE I ENLS WK\ VERADS YD \RGDLENPDAI ITEPTGG 
TNPCLKDF \ HDTLS WD VDVPGVRKI S LARS PRAAHTACLAGS AFHDP 
QATAQGPTPERHWQHLLVDFCFLLFSLFVSTQGRRPC 

3017 

A 

103 

769 

GPCCI PI IQQLILFQCFKHFSGPLCFQLCKLMMPSKSLVMEYLAHPS 
TJj \ CaliA VCj VACCjMCJjG \ WSLR V \ CFGMLPKS KTS KTHTDT \ E SEAS I 

lgrqpgspg * flwrn*l * rwgkgergc psashaavssptkqi skkk 
spemaqnngey^waspkvggskvpdeetlialXiahakmlgXltvsl 

IQDAGRTQI\APGSQTVLGIGPGPAJDLIDKVT\GHLKLY 

3018 

A 

99 

480 

GGLWPQRAVASGSGKWRQEPSLHFAMSFLIDSSIMITSQIRQYWQV 
IFSVTFAFSCTMFELIIFEILGVLNSSSRYFHWKMNLCVILLILVFM 
VPF Y IG YF I VSNI RLLHKQRLLFS CLLWLTFM 

3019 

A 

2 

87 

NKPSGFWGMIKSVTTSASGSESILCPSA 

3020 

A 

1 

921 

MAKEEPLAYICQDICWIPGVAYILNFLFHSQQHYFRATFPTSDAGRT 
RVS FNLHVP I G PHCGND I S QL SNGHLEQE YDL YMEDS VD P YHIiHRKE 
VDAiEELSRQLFLETADLYATKGTWIOTANDMTAMLIGKGGVILYT 
QRGFCPQAIFMS I SFENGKLKAARGTYFSSSQYLLMMLEVFKAVKEE 
IQFRRVEKIVESFELERATKHRVIiYVFCLRESTERELLELRTKVSKQ 
EWLLQSTAEHLKIANQQKESMEQFIVSQRSPKLLRKSSKLIKDAFHN 
ELMGDENI KNYA YKLEMSLTGNNI K 

3021 

A 

1 

2203 

MGGASARPAAPSGRWGVPLPGRPFWEVMSPSARPPPRLGGVPNSSLR 
TGHDDDGGFVEQRRGKGDSEGQLGLRIKLVRWFSWQLVMAFDKAEL 
PSSPKVLVLTTVTRATGGVGLQTLRRLQERPGIGQIiGALCLICTVRL 
CGHEEMTSTFNPRECKLSKQEGQNYGFFLRIEKDTEGHLVRWEKCS 

DSYEKAVKTRVDLKELGQSQKEQGLSDNILSPVMNGGVQTWTQPRLC 

YXVKEGGSYGFSLKTVQGKKGVYMTDITPQGVAMRAGVIJU^Dm 

NGENVEDASHEEWEKVREGPRRVGYGFDRQQMAAALQKRGLTVKNK 

QTESNNNINKKRPHKNPIQRSAASKIEELEKNYFKIQYGSKKEPVQV 

KTTVSKKNKAGG I TL PDFKL YYKVTVTKTACSRYFHWKMNLCVILLI 

b/VFPMVPFLQLAYFIVSNIRLL\HKQRLLFSCLLWLTFMYFFWKLG 

DPFPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMS 
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YFLRNV TUTU X LALERRLLQTMDM I I S KKKRLRGPG WGFLLIYSPV 
GQ I AEDLTLI QQEVDALEEL S RQLFLETADLYATKER I E YS KTFKGK 
YFNFLGYFFSIYCVWKIFMVKFWSQHISFILVGIIIVTSIRGLLITL 
TKFFYAISSSKSSNVIVLLLAQIMGMYFVSS 

3022 

A 

2 

215 

Q KMGM Y FVS S VLL I RMS M P LE YRT 1 1 TE VLGELQFNF YHR WFD V I FL 
VSALSSILFLYLAHKQAPEKQMAP 

3023 

A 

50 

1598 

GGLWPQRAVASGSGKWRQEPSLHFAMSFLIDSSIMITSQILFFGFGW 
LFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGVLNSS 
SRYFHWKMNLCVILLIL\VFM\VPFYIG\YFIV\SNIRLLLSPLSQV 
A*TTDWLFSCLLGIi\TFMYFFG\KLGDPFPIIiSPKHGILSIEQLISR 
VGVI G V\ TLMALL S G FGAVNCP YTYMS YFLRNVTDTD I LALERRI»LQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSE 
NLTLIQQEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFL 
GYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEITVNYLGIQFDV 
KFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLL 
AQIMGMYFVSSVLLIRMSMPLEYRTIITEVL/GEELQFNF/YIHRW\ 
F \ DV I FLVRRFS LG I L FPL FGLHKQAP EK\ QMAPLNLKP YLQTDKRP 
SGFQKFRI 

3024 

A 

1 

158 

AERHQFRDFRDLNKDGHLYGSEVGHWVLPPAQDQPLVEANHLLHESD 
TDKVQ 

3025 

A 

2 

1067 

AAELGTFAGTQSGAWRAERSWHNRGPMMWRPSVLLLLLLLRHGAQGK 
PSPDAGPHGTGRVHQAAPLSDAPHDDAHGNFQYDHEAFL*REVAKEF 
DQLTPQE S Q ARLGR I VDRMDRAGDGDG WVS LAELRAW I AHTHQRH I L 
DSVSAAWDTYDTDRDGRVGWEELRNATYGHYAPGEEFHDVEDAETYK 
KMLARDERRFRVADQDGDSMATREELTAFLHPEEFPHMRDIVIAETL 
EDLDRNKDGYVQVEEYIADLYSAEPGEEEPAWVQTERQQFRDFRDIiN 
KDGHLDGSEVGHWVLPPAQDQPLVEANHLLHESDTDKDGRIi\TKAEI 
LGNWNMFVGSQATNYGEDLTRHHDEL 


A 

2 

860 

RRGPCNGRYHHPVYPHPTAVQREPPAGPRACHSPCWHDAPAVRSPGQ 
STSPHDAVTWPFLHGSREGPTPRRTRTPPAFRPHTQACPSTCYCHTL 
ASRRGPCNGRYHHPVYPHPTAVQREPPAGPRACHSPCWHDAPADLRG 
ADQRPERI^SAWIWSSQRLATWKRVQRTGQNTWASESLSFEALANL 
PANMMQLPGHFSTFLVRAPHPAAEKPLLHSILTHRPVHPLATVTIjLP 
AEEAPVMADITAQS ILTPQLCSGTLLLAHVAARAHAGTTLQHVGVCG 
PHYR 

3027 

A 

2 

1681 

FGTLGSAFPESCPFQKEDGMIKFQIQGIHVQERVTFKDVAWFTKV\ 
*I^LDKAQINLYQDVMLENFMNLMLWIPLPVSFQLGSANGESQQEI 
QGRDKSEVRIFIPPGFSLEVAVEWLYSISIDQRDAHPKTDPKLRPEE 
AQRKSLSRNVKMHCFVKPRAAPQTEPTLEPTLFPSYPGVRPVPWRMR 
KEAYGLQS PES PAR S SGHSQGYPGEDLHRAEQRPERFAGS AWTWSLQ 
RLATWKRIQRTGQNTWLE/UjANLLANMTQLPGHFSTFLVSAPHPAAE 
KPLLHS I LTHR PVHLLATVTLL P AEEAPLMAD I TAQAI LTPQLCSGT 
LLLAHVAARAHADTNLQHVG I PAGDTTGDMASMTLLPG SDTVDLKDT 
APCIVAAPVPAMAKKGQGIAQAIASEGANPRPWQLPRTVGPWNGEG 
KCGVGAPRVPTEALPSGAVRRGPPSSRPQTHQQLPPCAWKRCRHSTP 
AHENS QVG YTLQSHRGRDAQGLG S S SLAS ACLDVRHGVKGDYFRTLR 
FNDCPAGFQTGMGPAAPLFCLISPFQNGGIYPKAVAPLYLGSN 

3 028 

IV 

-> -L 

569 

SLRTVPPHYLQGSPGTPIPPSSLGPASGTRRVES/CVAAPNEQEGSS 
KGGARPVTICDSQNRPCCPPPPSPALPQADPSLKTWLLLNHIRWDTD 
SM I CMEDSS * * GFK/ PNKDS I KVQVRLRGVOGVGHCMGG AALPSTSI, 
SSCLTPFPAPLNSRHTTLAPPLPQPENPAQRTTP 

3029 

A 

2 

396 

KCHDCGAILEEYDEEtLGloAIVVLSTFIHLSPDLAAPLLIiDIMQSVG 
Rl^SSTTFSNQAESMMVPGNAAGVAKQFLRCIFHQLAPNGIFPQLFQ 
ST I KDGTFLRTIiAS SIoMDFNELS S I AALSQLLEGLNN 

3030 

A 

2 

1038 

GLLEPFSKLLSFVIQNAVFTLAYLVELCGLCYRAFTKERDKFYLSRS ' 
GCS KS FCRRLKLKS PLPDTNLLLLVQF I \ LPDAGTKLD * VNNPEQAD 
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DSLCTWDVGTA\AMECVRQYINEVLDFM\ADMHTLTKLKSHMKTCSQ 
AS \DEDTFGGHLKVGLAQIAAMDISRGNHRDNKAVIRYLPWLYHPPS ! 
AMQQGPKEFIECVSHIRLLSWLLLGSLTHNAVCPNASSPCLPIPLDA 
GSHVADHLIVILIGFPEQSKTSVLHMCSLFHAFIFAQLWTVYCEQSA 
VATNLQNQNEFSFTAILTALEFW\SRVTP\AII/QLMG/HINKVMGRK 
WVLLSMW I S LMGGNLPGNAI WT 

3031 

A 

3615 

5322 

EVPPNPAC YQHQA* S VCLCLHLKS SLTS SRR * / CTDKCHDCGAI LEE 
YDEETLGLAIVVX,STFIHLSPDLAAPIiIjLDIMQSVGRLASSTTFSNQ 
AESWPPTASSRSCSKARSKVIHPLRIQAPQICIilPDTQTLNPIVDFP 
DGTFLRTLASSLMDFNELSSIAALSQLLEGLNNKKNLPAGGAMIRCIj 
ENIATFMEALPMDSPSSLWTTISNQFQTFF7UCLPCVLPLKSLLEPFS 
KLLSFVIQNAVFTLAYLVELCGLCYRAFTKERDKFYLSRSWLELLQ 
ALKLKS PLPDTNLLLLVQYENKMNEKAERLVLKLLNS AQQQFQF I CA 
DAGTKLAESTILSKQMIASVPGCGTAAMECVRQYINEVLDFMADMHT 
LTKLKTS VLHMCSLFHAF I FAQLWTVYCEQS AVATNDQNQNEFS FTA 
ILTALEFWSRVTPS ILQLMAHNKVLI PMWLPMIQSNIKHLSAGLQLR 
LQAIQNHVNHHSLRTLPGSGQSSAGIAAIjRKWLQCTQFKMAQVENPV 
LGKQPLNFILYEWTPSAPQCQHSGLAIMGLKTNNLDPQAGWGNHIGY 
LYNPLS 

3032 

A 

3 

1078 

FFSSVFPAAIEPGERASARRRVSRAGCLALTLPALLLVTHQVPAEAA 
VSRAMAEL\TALESLIEMGFPRGRAEKAIJU J TGNQGIEAAMDWL,MEH 
EDDPDVDEPLETPLGHIL\GREPTFSEQGGPLKDLVLA\AGEGKPAL. 
S * / EEERQEQTK\RMFGAWLAQKGSRB\ REEREGRE\ ALERERQRR\ 
RQGQE\LSAARQRLQED\KMRR\AAAEERRREKAEE\LAARQRVREK 
IERDKAERAKKYGGSVGSQPPQ\VAPEPGP\VPSSP\SQEPP\TKGE 
DDKCR I Q \ VRLPRWDPH * PQTFRAREQLAAVRL\ YVEL\ HRGEEL\ G 
GG ARD P VQ I AS VAFP \ RRAFLR S LTMER\ PLQELGMAAKTRNQD 

3033 

A 

2 

565 

IHFQQEHWPSQLLLRESLEDMl^HSAIXSLCLLLVTVSSNIiAIAIKKE 
KRPPQPLSRG\WGDDITGVQTYEEGSLLCSKK*GSH*WVIHHLG\DC 
QYSQALKKVFAQNEEIQEMAQNKFIMLNLMHETTDKNLSPDGQYVPR 
IMFVDPSLTVRADIAGRYSNRLYTYEPRDLPLLIENMKKALRLIQSE 
L 

3034 

A 

73 

262 

PLPARGKSTLPATFCSPPAPELASMSWPPNRSQTGWPRGVTQFGNK 
YIQQTKPLTLERTINL 

3035 

A 

29 

867 

PLEGIADNRFPHGETSTLPATFCSPSAPEIjASMSWPPNRSQTGWPR 
GVTQFGNKYIQQTKPL\TL\ERTINIiYPLTN\YTFGTKRAPLTRRTS 
SV\ AARF\ QR IEGKNFDKIGMRRTVEGVL\ I VHE \HRLTPWCYI»LQL 
GTNFLPNYLGWLNLTPGEDEV\ EGLKTA* * PEI PGS VQDGVL\QD\W 
GPLTDCHWVTWW\ RPKFLNPPS VSHI FPAH IT\ KPKE/HIRKLFPGS 
SLQEKALVLQFPKKFTSW*PATPLFELYDNA\ PGYGPIISSLPQLLS 
RFNFIYN 

3036 

A 

2 

817 

RARI PKMAAVKDSCG* GEMATGNGRR\ LHLGI PEAVFVYYFS Y*RKM 
YHYFMKQPGNETADTVTjKK\ LDEQ YQKYKFMEIjNLAQKKRRLKGQI P 
\EIKQTLE IIiKYMQKKKE\ STNFNGRPRFFAGRLTLYLQKLSVPPTR 
*RG/CLGLGANVMLEYDIDEAQ\ALLGKDL\STATKILDSRGGRT%T 
FJARDQFTTTBVNMARGLLIGDVKKEGNKD\DST 
GLVFPNMFIFKYPFIPYRFDINFGMFNQPRNFKEKINTILFIY 

3037 

A 

3 

468 

ERIMEWHRHTMTECS AQEPKSHD I YLRLLVKL YR FLARRTNS TFNQV 
VLKRLFMSRTNRPPLSLSRMIRKMKL.PGRENKTAVWGTITDDVRVQ 
EVPKIiKVCALRVTSRARSRILRAGGKILTFDQIjALD/SPYVRSKGRK 
FERARCRRARRGYKN , 

3038 

A 

1 

693 

mgvdirhnkdrkvrrk^pksqdiylrli/v^ 
lk31lfmsrtnrpplslsrmirkmklpgrenktavvvgtitddvrvqe 
vpklkhntrdvwkvpevlrtaavspgvlrvvndsvgyqgsaaprqw 
rvcalrvtsrarsrilraggki ltfdqlalds pkgcgtvllsgprkg 
revyrhfgkapgtphshtkpyvrskgrkferargrrasrgykn 
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3039 

A 

105 

756 

VDHPPYKRTEKVRAQRSPKSQYIYLSLLLKLYRFSGPEEPNSNIQTR 
VVLKRLFMSPHQTGPPIiS\LSRMIR\KMKLSWPGKQRRAVVC/VGPI 
TD\DVRGSRRYPKLKVCCTAP*PSRAPQAAILKAGGK\lLTF\DQIiA 
LGLPLKG\CGTVLLSG\PSKG\RKVYRHFRQRPQEPPHKPHQNPTFR 
SKGRKFERARG\RRAS\RGYKKLTLDPTLLYKKIFA 

3040 

A 

301 

390 

ENASMGPCLFKTTSVSHLVEWPQKLSWVI 

3041 

A 

1 

737 

MWFAYSPFTFCHDWKLPEASAEAKRMPALCFI.YSLQHPLCEDGVCAH 
EAGVWGIGGLIIAYVKGYEEVSQKFTSIRRVRGDNYCALRATLFQAM 
SQAVGLPPWLQDPELMLWFVAILLHSHLECREPLLIPILSLYMGAIi 
VRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLAIiILA 
TQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEI 
RAVYPAFDKNNPSNKLVSTSNTVTAAHIKKFTFVCMALSLTLKDSVQ 
KPD I SLTGRLVQTIiPTRMRHQRGESKDVAPLASWL SEPS TS SEAS QT 
SSKLTINSQGEGKAKQKLECGTLSIVLRSCEKNQERIKAAEKRNRSW 
TTLCGLGAWRPLLFELPVIVQTPDQTNRFQFRYPAKTQSGLCSFFHS 
FDLLTLEAFVKVWFPGCLLILLCFVMFWTPNVSEKIliIDIIGVDFAF 
AELCWPLRIFSFFPVPVTVRAHLTGWL/QGPGAHAGCFCSHFASQS 
PGMPGAPAHPDPLLVHGRTCALHHPVPGLLQEHSRHHP * QKWPGAGG 
RCNNKKDAELLVAFGSNSGHTENQSAYCQPLCFPGPWWQFCSHRGSG 
DFDSHI PCGSHAIRLVDGNPCCVSCFRQE 

3042 

A 

889 

1577 

RGKKNTVIiKKIRSQFTIRESFDLRYKEHIPLPKLEFPLDTKVFGLLG 
EDMVCTGPTGVALKGPHWKYIGIIDEGKDLVNFSYQKLVFRTSWSTW 
LPRGPRVLLRDCSLKTPKPAAQAQEPT\VHLRPRRQGFSGSMRSLID 
WVSLTHSDGDVIVPIVLSNERYHWP\VLGWVGGVDCDEFFRVVLGEA 
RLH\LDGVAHQH*GKKEPFPTCSKTTSVSHLVEWPQKLSWVI 

3043 

A 

80 

695 

SPAASGGFRAGPRNQDPSRALAELWPTKAV\SVLKGDGP\VQGIINF 
E\QKESNGPSEG\WGKHLKGLTEGL\HGFHVHEFGD\NTAG\CTSAG 
\ PHF I PLSRK\HGGPK\DEERHVEDLGQC * LLDKDGVGPMCLLKDS \ 
VI FTLQGDHLHHLGRTL VVP WKKPDDLGQRVGNE \ ES S KTG \ NAGKF 
VWACG VNLG S APNKHS P WD WX»R P P 

3044 

A 

2188 

2706 

RESIDQGFSNFFFFLRRSLDSVTQAGVQWHDLSSLQPPPPGFK*K*F 
SCLSLLSS*DYRSPATMPGLIFLYFFFLVETGGFTVIiAR\MVSIS*P 
Q/ CDPPASASQ\ SAGIQKNSFFFLETRVSAFVAQAGVQWCDLGSPQP 
PPPGV\QPFSCLSLPSSWDYRRTPPGPANFAVCN 

3045 

A 

33 

198 

CRHRQAQLLQMWHPRRIiGG WWR I F ARQEH I CC I RVP KVNKF YT.LQC 
VHIHFHG 

3046 

B 

6 

125 

MAGQFRIYLWDPLLILSQIVLMQTVYYGSLGLWLALVDGL* 

3047 

A 

1 

110 

YAQLGTRVPELASVTHFTPPLPINKCQPVSCSNIjGS 

3048 

A 

260 

798 

RPGHAQTLGSPSLALGMAGQFRSYWDPLLlLSQIVIiMQTVYYGSLG 
LWLA\ WDGLLRI SP SIJDQMFDAE ILGFS / TPFQGRLSMMS FILNTL 
NSA\LGLLYFIRRGK\QCLDFTVTVHFFHIiLGLLiGSNSSPFSPSGG* 
TLVGLGPKPWALHSLAVIGEYLCMRTELKEIPLNSAPKSNV 

3049 

B 

1 

528 

MRRQDCPNPMKKI RS SSPVQQERVS SLYFQSARGGLGVDVSALGTVT 
HVSPSSSARVAMAMSAERGCSMGHIMGYRGVGRFTRGNGDHARVGYK 
GNPPEVSISGYDDNWYIiGRTDATLSCDVRSNPEPTGYDWSTTSGTFP 
TSAVPRAPSWSSTQVTTVCS IHLRLHITNAVDGPR* 

3050 

A 

31 

206 

GSSCRSRSLVWLSIVLEGPAPAPGDPQTSPSAIiDKLKEFGNTLEDK 
ARELISRIKQSELSAKMRVPLRPRHEALPVAPGPGGGSVDRLGRPSP 
SPRGPPDVSQCLG*AEGW?KHTGGQGSGTHQPHQTE 

3051 

A 

21 

353 

SAAGTGPEPEPEPHRHLLNRPADRPPQSGPRASGRAQSPPGPSMARA 
AALLPSRSPPTPLLWPtiLLLLLLETGAQDVRVQVLPEVRGQIxGGTVE 
LPCHLLP P VPGLY I SLVTWQRPDAPANHQNVAAFH PKMGP S FPS PKP 
GSERLSFVS/UCQSTGQDTEAELQDATLAIiHGLTVEDEGNYTCEFATF 
PKGSVRGMTWLRVIAKPKNQAEAQKVTFSQDPTTVALCI S KEGRPPA 
RI SWLS S LDWEAKETQVSGTLAGTNTTVTSRFTIjVT SGRADGVTVTCK 
VEHESFEEPAL I PVTLS VR YPPE VS I SG YDDNWYLGRTDATLS CDVR 
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SNPEPTGYPWSTTSGTPPTSAVAQGSQIiV\ IHAVDSIA FNTTFVCTS 
QPNPWAI>ARAEQVIF\VRETPQALAPRCGPAGVGGRGGDTAGAAASG 
WGVLGLHPCMRVRRRRKSPGGAGGGASGDGGFYDPKASGVGKWGPRIi 
LDTSSPWSHGTRWQG*GGGGGGREGRERPHVASTPSTSRMTWSPSWT 
APSSHGG/PVYV*PGPPWPGPLPSCRRDRRRRRCCGRCCCCCSWKPE 
PRMCEFKCYPRCEASSGAPWSCRATCCHLFLDCTSPW 

3052 

A 

1 

1620 

K?U^AAALLPSRSPPTPLLWPLLLLLLLETGAQDVRVQVLPEVRGQLG 
GTVEXiPCHLLPPVPGLYISLVTWQRPDAPANHQMVAAFHPKMGPSFP 
SPKPGSERIiSFVSAKQSTGQDTEAELQDATLALHGLTVEDEGNYTCE 
FATFPKGSVRGMT\WLRVIAKPKNQAEAQKVTFSQDPTTVALCISKE 
GRPPARISWLSSIiDWEAKETQVSGTLAGTVTVTSRFTLVPSGRADGV 
TVTCKVEHESFKEPAUPVTLSVRYPPEVAISGYDDNWDLGRTDATIi 
SCDVRSNPEPTGYDWSTTSG\TFPTSAVAQGSQL»VIHAVI>SLFNTTF 
VCTVTNAVGMGRAEQVI CVRETPNTAG AGPTGG I IGG 1 1 AAI I ATAV 
AATGILICRQQRKEQTLQGAEEDEDLEGPPSYKPPTPKAKLEAQEMP 
\SHLFTLGA*EHSPLKTPYFDAGASCTEQEMPRYHELPTLEERSGPIi 
HPGATSLGSPIPVPPGPPAVEDVSLDLEDEEGEEEEEYLDT\INPIY 
DALSYS SPSDS YQGKGFVMSRAMYV \ 

3053 

A 

408 

2708 

DP S SGHWLHWS IRP PAAGWS PRSGRAAARAQS PRRREHGAPGRTGPG 
GGPRLLLIAVLLAAHPGAQDVRVQVLPEVRGQLGGTVELPCHLLPPV 
PGLY I SliVTWQRPDAPANHQNVAAFHPKMGPS FPS PKPGS ERLS FVS 
TUCQSTGQDTEAELQDATIiALHGLTVEDEGNYTCEFATFPKGSVRGMT 
WLRVIAKPKNQAEAQKVTFSQDPTTVALCISKEGRPPARISWLSSLD 
WEAKETQVSGTIjAGTVTVTSRFTIjVPSGRADGVTVTCKVEHESFEEP 
ALIPVTLSVRYP\P\EVSISGYDDN\WY\LG\RTDAT/Ii*ACDVRSN 
PEPTGY\DWSTTSGTFPTSAVAQGSQ\LVIHAVtf/SVLFNTTFV\CT 
VTNA\ VG IG AG AE \ QV I FVRETPRAS PRD VGPL VWGAVGGTLL VLLL 
LAGGSLAFILLRVRRRRKSPGGAGGGASGDGGFYDPKAQVLGNGDPV 
FWTPWPGPMEPDGKDEEEEEEEEKAEKGLMItPPPPALEDDMESQLD 
GSLISRRAVYTPPSPVPGSAGAAAVACTDPVPSSGVHGKNDHKNSSS 
MFRMKALAQLS PYKTPNTAGAGATGG IIGGI I AAI I ATAVAATGILI 
CRQQRKEQTLOGAEEDEDLEGPPSYKPPTPKAKLEAQEMPSQLFTLG 
ASEHSPLKTPYFDAGASCTEQVGAHGKTKEMPRYHELPTLEERSGPL 
HPGATSIiGSPIPVPPGPPAVEDVSLDLEDEEGEEEEEYIiDKINPIYD 
ALSYS SPSDS YQGKGFVMSRAMYV 

3054 

A 

115 

467 

VGCDLRSRRGGAQPGLGTDEDAI I SVIiAYRNTAQRQE IRTAYKSTIG 
RDLIDDLKSELSGNFEQVIVGMMMPTVLYDVQELRRAMKGAGTDEGC 
LI E I LASRTPEE IRRI S QTYQQQ 

3055 

A 

315 

433 

SHGHGTDEDA 1 1 S VLA YRNTAQRQE I RTA YKS T I GRDL I 

3056 

A 

53 

444 

PRVMAM/ ATKGGTVKAAS GFNAMEDAQTLRKAMKGLGTDEDAI I SVh 
AYRNTAQRQEIRTAYKSTIGRDLIDDLKSELSGNFEQGAGTDEGCLI 
EILASRTPEEIRRISQT\YQQQYGRSL\EDDIRSDTSF 

3057 

A 

186 

1268 

PRDMAMATIGGTVKAASGFNAMEDAHTLRKAMKGLGTDENAIISVLA 
YRNTAQRLEIRTAYKSTIGR\DLIDDLKSEIi\SGNF\EQVIVG\MKM 
PT*RDDV\QEIiRRAMKGAGT\DEG\CLIEILASRTPEEIRRISQTYQ 
QQ YGRS LEDD IRS \ DTS FM \ FKRVLV\ SLSAGGRDEGNYLDDALVRQ 
DAQDLYEAGEKKWGTDEVKFL\ TVLCS PE PKI TLLHVFDE YQKDIHR 
RUE PE VLNSETS G I FWKI ALLGL I \ VKCMRNKS AYFAE KL \ YKS \ M 
KGLGT \ DD NT \ Lt 1 RVMVo S S Q KMD \ MLD I P AHTS KRJjLi W J\i> P Jb 1 1> \ r 
IKNRVATSGD\ YRK\ VLLVLCGREMI KIKI PGRDREGS S TLLNFF 

3058 

A 

1 

4608 

MKQLLPASGLLPLSSSPLSPRQESPGVERAHEGCLSQNLQVQRATRE 
GEIDRVAKSGGQERAC^ASSETESEAGDIMDQQFEEMNNICLNSVTDPT 
GFLRMVRRNNLFNSQAEYCAQPWGLSGEQQGPQPRSLHPAPLARPDP 
IQSHPQKSKEIEGVSADSSNPDPLDEGNSRRSSRFNVEAVLCSSRGR 
PDS SMPDCCRNAALEARSCQSMTSLFSNTVS PTQDGTS S LPRRQSS F 
AKPPLRALYDLLIAPMEGGWESTEPLGTSRAGFASGTALFWLGDNG 
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KAAS PPGVSVS SLENEDDNTQCNRLEQG YREAAVPDWTQDLALCLL P 

TLQSLKARTAPHLCVPSSCYKAWYTLAVSKVSGMGLMHSSGPVGRHR 

QL I LVLEGELYLI PFALLKGS S SNE YL YERFGLLAVPS I RSLSVQSK 

SHLRKNPPTYSSSTSMAAVIGNPKLPSAVMDRWLWGPMPSAEEEAYM 

VS ELLGCQPL VGS VATKER VM S ALTQ AE CVH FATH I S WKLS ALVLTP 

SMDGNPASSKSSFGHPYTIPESLRVQDDASDGESISDCPPLQELLLT 

AAD VLDLQL P VKL WLG S S QE SNS KVTADG V I ALTRAFLAAG AQCVL 

VSLWPVPVAASKMFIHAFYSSLLNGLKASAALGEAMKVVQSSKAFSH 

PSNWAGFMLIGSDVKLNSPSSLIGQALTEILQHPERARDALRVLLHL 

VEKSLQRIQNGQRNAMYTSQQSVENKVGGIPGWQALLTAVGFRLDPP 

TSGLPAAVFFPTSDPGDRLQQCSSTLQSLLGLPNPALQALCKLITAS 

ETGEQL I SRLHQVLVQLQAGEKEQDLAS AP I QVS I S VQLWRLPGCHE 

FLAALDMGKPQAQMLAEGPCRHSMKLHKEPYSAHVQDLESSRRKQML 

IFFVPLGFDLCEVGQEEVILKTGKQANRRTVHFALQSLLSLFGPDRS 

GHSGHVGLGDFQALGTHPARLMFLRAKLATVGRANAGLEGGYGLQVE 

PTESAEDSTELPKRLSLDSSSSLESLASAQSVSNALPLGYQQPPFSP 

TGADS I ASDAI S VYS LS S I AS SMS FVS KPEGG S EGGG PGGRQDHDRS 

KNAYLQRSTLPRSQLPPQTRPAGNKDEEEYEGFSIISNEPLATYQEN 

RNTCFS PDHKQPQPGTAGGMRVS VS S KGS I STPNS P VKMTLI PSPNS 

PFQKVGKLASSDTGESDQSSTETDSTVKSOEESNPKLDPOELAOKTt* 

EETQSHLIAVERLQRSGGQVSKSNNPEDGVQAPSSTAVFRASETSAF 

SRPVLSHQKSQPSPVTVKPKPPARSSSLPKVSSGYSSPTTSEMSIKD 

SPSQHSGRPSPGCDSQTSQLDQPLFKLKYPSSPYSAHISKSPRNMSP 

SSGHQS PAG SAPS PALS YSS AGS ARSS PAD APD I DKLKMAAIDEKVQ 

AVHNLKMFWQSTPQHSTGPMKIFRGAPGTMTSKRDVLSLLNLSPRHN 

KKEEGVDKLELKELSLQQHDGAPPKAPPNGHWRTETTSLGSLPLPAG 

PPATAPARPLRLPSGNGYKFLSPGRFFPSSKC 

3059 

A 

1 

465 

S PGVERAHEGCLS QNLQVQRATREGE I DRVAKSGGQERACAS S ETE S 
EAGDIMDQQFEEMNNKiiNSVTDPTGFLRlWRRNNIjFNSQAEYCAQPW 
GLSGEQQGPQPRSLHPAPLARPDPIQSHPQKSKEIEGVSADSSNPDP 
LDEGNSRRSSRFNV 

3060 

B 

109 

192 

GS S SNE YL YERFGLLAVPS IRSLSVQS K* 

3061 

A 

1 

1542 

MGITK^IN^INVMEEAIGYFEQQLA^1LQQLSGNESVLDRGRAYGNLGDC 
YEALGDYEEAIKYYEQYLSVAQSLNRMQDQAKAYRGLGNGHRAMGSL 
QQALVCFEKRLWAHELGEAFNKAQAYGELGSLHSQLGNYEQAISCL 
ERQLNIARDMKDRALESDAACGLGGVYOOMGEYDTALOYHOLDLOIA 
EETNNPTCQGRAYGNLGLTYESLGTFERAWYQEQHLSIAAQMNDLA 
AKTVSYSSLGRTHHALQ1^SQA\^LQEGIJ^LAEQLGRREDEAKIRH 
GLGLSLWASGNLEEAQHQLYRASALFETIRHEAQLSTDYKLSLFDLQ 
TSSYQALQRVLVSLGHHDEALAVAERGRTRAFADLLVERQTGQQDSD 
PYSPVTIDQILEMVNGQRGLVLYYSLAAGYLYSWLLAPGAGIVKFHE 
HYLGENTVENSSDFQASSSVTLPTATGSALEQHIASVREALGVESHY 
SSIRNGHCRSLPCYLSLFHVSTAGHQDYKGTWVYDTCHLKSLHM 

3062 

A 

1 

4226 

EAVLCSSRGRPDSSMPDCCRNAALEARSCQSMTSLFSNTVSPTQDGT 
SSLPRRQSSFAKPPLRALYDLLIAPMEGGWESTEPLGTSRSGFASG 
TALFWLGDNGKAAS PPGVSVS SLENEDDNTQCNRLEQGYREAAVPDW 
TQDLALCLLPTLQSLKARTAPHLCVPSSCYKAWYTLAVSKVSGMVEG 
KAW 1 1 LLLLAAS THALLQR PQEGD I TPLAG S RAAAVS CQARPFS TFS 
GLMHSSGPVGRHRQLILVLEGELYLIPFALLKGSSSNEYLYERFGLL 
AVPSIRSLSVQSKSHLRKNPPTYSSSTSMAAVIGNPKLPSAVMPRWL 
WGPMPSAEEEAYMVSELLGCQPLVGSVATKERVMSALTQAECVHFAT 
H I S WKLSALVLTPSMDGNPAS SKS S FGHPYT I PES LRVQDDASDGES 
ISDCPPLQELLLTAADVLDLQLPVKLVVLGSSQESNSKVTADGVIAL 
TRAFLAAGAQCVLVSLWPVPVAASKMFIHAFYSSLLNGLKASAALGE 
AMKWQSSKAFSHPSNWAGFMLIGSDVKLNSPSSLIGQALTEILQHP 
ERARDALRVLLHLVEKSLQRIQNGQRNAMYTSQQSVENKVGGIPGWQ 
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ALLTAVGFRLDPPTSGLPAAVFFPTSDPGDRLQQCSSTLQSLLGLPN 
PALQALCKL I TAS ETGEQL I SRLHQVLVQLQAGEKEQDLAS AP I QVS 
ISVQLWRLPGCHEFLAALDMGKPQAQMLAEGPCRHSMKLHKEPYSAH 
VQDLESSRRKQMLIFFVPLGFDLCEVGQEEVILKTGKQANRRTVHFA 
LQSLLSLFGPDRSGHSGHVGLGDFQALGTHPARLMFLRAKLATVGRA 
NAGLEGGYGLQVEPTGSAEDSTELPKRLSLDSSSSLESLASAQSVSN 
ALPLGYQQPPFS PTGADS I ASDAI S VYSLSS I ASSMSFVSKPEGGSE 
GGGPGGRQDHDRSKHAYLQRSTLPRSQLPPQTRPAS\NKDEEEYEGF 
SIISNEPLATYQENRUTCFSPDHKQPQPGTAGGMRVSVSSKGSISTP 
NSPVKMTLIPSPNSPFQKVGKLASSDTGESDQSSTETDSTVKSQEES 
NPKLDPQELAQKILEETQSHLIAVERLQRSGGQVSKSNNPEDGVQAP 
SSTAVFRASETSAFSRPVLSHQKSQPSPVTVKPKPPARSSSLPKVSS 
GYSSPTTSEMSIKDSPSQHSGRPSPGCDSQTSQLDQPLFKLKYPSSP 
YSAHISKSPRNMSPSSGHQSPAGSAPSPALSYSSAGSARSSPADAPD 
IDKLKMAAIDEKVQAVHNLKMFWQSTPQHSTGPMKIFRGAPGTMTSK 
RDVLSLLNLSPRHNKKEEGVDKLELKELSLQQHDGAPPKAPPNGHWR 
TETTSLGSLPLPAGPPATAPARPLRLPSGNGYKFLSPGRFFPSSKC 

3063 

A 

2 

4195 

ALDFPGRRFRPTASFIWASVLFETIRHEAEVSTDYKLSLFDLQTSSY 
QALQRVLVSLGHHDEALAVAERGRTRAFADLLVERQTGQQDSDPYSP 
VTIDQILEMVNGQRGLVLYYSLAAGYLYSWLLAPGAGIVKFHEHYLG 
ENTVENSSDFQASSSVTLPTATGSALEQHIASVREALGVESHYSRAC 
ASSETESEAGDIMDQQFEEMNNKLNSVTDPTGFLRMVRRNNIjFNRSC 

qsmtslfsntvsptqdgtsslprrqssfakpplralydlliapmegg 
l^ssgpvgrhrqlilvlegelylipfallkgsssneylyerfglla 
vpsirslsvqskshlrknpptyssstsmaavignpklpsavmdrwlw 
gpmpsaeeeaymvsellgcqplvgsvatkervmsaiitqaecvhfath 

ISWKLSAliVLTPSMDGNPASSKSSFGHPYTIPESLRVQDDASDGESI 
SDCPPLQELLLTAADVLDLAAAL * RLW\ LGS SRESTSKFTADGVFA 
LTRAFLAAGAQCVLGSLWPVPVAAFKMFIHAFYSSLLNGLKASAALG 
EAMKVVQSSKAFSHPSNWAGFMLIGSDVKLNSPSSLIGQALTEILQH 
PERARDALRVLLHLVEKSLQRIQNGQRNAMYTSQQSVENKVGGIPGW 
QALLTAVGFRLDPPTSGLPAAVFFPTSDPGDRLQQCSSTL.QSLLGLP 
NPALQALC KL I TAS ETGEQL I SRAVKNMVGMLHQVLVQLQAGEKEQD 
LAS API QVS I S VQLWRLPGCHEFLAALGFDLCEVGQEEVI LKTGKQA 
NRRTVHFALQSLLS LFDSTELPKRLS LDS S S S LESLAS/ SLQS VSNA 
LPLGYQQPPFSPTGADR\IASDAISVYSLS SI ASSMSFVSKPEGGSE 
GGGPGGRQDHDRSKNAYLQRSTLPRSQLPPQTRPAGNKDEEEYEGFS 
1 1 SNE PLATYQENRNTCFS PDHKQPQPGTAGGMRVS VS SKGSI STPN 
SPVKMTLIPSPNSPFQKVGKLASSDTGESDQSSTETDSTVKSQEESN 
PiCLDPQELAQKILEETQSHLIAVERLQRSGGQVSKSNNPEDGVQAPS 
STAVFRASETSAFSRPVLSHQKSQPSPVTVKPKPPARSSSLPKVSSG 
YSSPTTSEMSIKDSPSQHSGRPSPGCDSQTSQLDQPLFKLKYPSSPY 
SAHISKSPRNMSPSSGHQSPAGSAPSPALSYSSAGSARSSPADAPDI 
DKLKMAAIDEKVQAVHNLKMFWQSTPQHSTGPMKIFRGAPGTMTSKR 
DVLSLLNLSPRHNKKEEGVDKLELKELSLQQHDGAPPKAPPNGHW\R 
TETTSLGSLPLPAGPPATAPARPLRLPSG\NGYKFLSPGKIFSLLSK 
CLKHLLLPIDFEQACRWGPGVCFSPSLSAFPSQPPAPS 

3064 

A 

8 

665 

SAQMAVTTADPRVRPRVRTQLCSLATSNRHAWCILSPEEKSAVTALW 
GK\VNVDE\VGGKALGRLL\WLPLVPDRFFE\SFGDLSTP\DAVNG 
QTPKVKVSWSRKCLGA\FSDGPGFTWDNLQGSPLPTLK*ACTC*QAC 
TWDP*RTFRLPGGNVAGFWAWAHSLLGKEFQPHQCKLA*SRKLVGL 
VIiAKCPWPHKYHLKLAFLAVQFLFKGFLWSLKTNY 

3065 

A 

8 

531 

SAQMAVTTADPRVRPRVRTQLCSLASLIQTRMVHLTPVE\KSAVTAL 
WGKVNVDE \ VGG KALGRLL\ WYPW \ TQRFFE \ S FGDLS \ TP * SLLR 
ANP * GERLHGKKS AS VPFSDG\ LAHL \ DN\ LKGTFAHTE VSLHCDK\ 
LHRGSLKNFRLLGQRAWSWAGPIHFWEKNFNPTSCRLA 
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3066 

A 

91 

951 

TPHFHQGTGKRAAWPTWFNPAGPGI FLRR*GPGKAQRRRS IAP\RPA 
VGFPSGPIVRCPHGFGYPHERLRAGPRLPALRKLRVAGIHKEGWPGP 
IG I \ S VDPRKPDQVPRS PLQ\ ANVQR\ LKKYASNFI LFPQRS PRAPK 
KGDRS \ AE E LKLGH PS LTG P \ VM P VRNVYKK\ EKARV I TEEEKN FKA 
FASLRMARANAPALRHTGQKEPREAAEQGC*KAHNKALPGDLESVGQ 
SCWVSTWCVSWEQLGLGWGFTAVTSSCQGIWGFLERQSKPWIMLNLIj 
VLKPCWLFG 

3067 

A 

1 

1603 

MKLLKSILLTILLDFSLHTCASIiVPALQWILLLLITVLWEGIAVAQ 
KTQDGQN I G I KH I PATQCG I WVRTSNGGHFAS PNYPD S YP PNKEC I V 
ILEAAPRQR I ELTFDEHYY I E PS FECRFDHLE VRDGPFGFS PLIDRY 
CGVKSPPLIRSTGRFMVJIKFSSDEELEGLGISEQIIPFIPDPDFTYIi 
GRYFKFPS/HDCQFE/HLRELDGDSAALVR*NKRRKQNQGQAFD\CI 
WDL * / M P FQKLKI YLRFLDYQMEH SNECKRNFVAVYDGS SSI ENIiKA 
KFCSTVANDVMLKTGIGVIRMWADEGSRLSRFRMLFTSFGGASPAQA 
ALSFCHSNMCINNSLVCNGVQNCAYPWDENHCKEKKKAGVFEQITKT 
HGTI IGITSGIVLVLLI ISILVQVKQPRKKVMACKTAFNKTGFQEVF 
DPPHYELFSLRDKEISADLADLSEELDNYQKMRRSSTASRCIHDHHC 
GSQASSVKQSRTNLSSMELPFRNDFAQPQPMKTFNSTFKKSSYTFKQ 
GHECPEQGPGKDRVMGGRFPC 

3068 

A 

163 

392 

HQELLLFLLLSSRRLSRSPSEGSRPTM/ALGEKKIGFPSLLWCFPPS 
AAGASSQIKWIQPCGSGAPPLPGCSGKRRA 

3069 

A 

190 

352 

HQELLLFLLLSSRRLSRSPSEGSRPTMVLEKRK*GFHPHSCLRAVmL 
KVCEGKS 

3070 

A 

1 

794 

MSRERPPGTD I PRNLSF I AAL/ DGARLLP * P AAQPRGGAGGGARRGR 
DAVRGP I PR SRTESGP/ SG PDLHQPEAAGPGRP / C SRS PDTRKRVRF 
ADALGLELAWRRFRPGELPRVPRHVQIQLQRDALRHFAPCQPRARG 
LQEARAALEPASEPGFAARLLTQR I CLERAEAGPLGVAGSARWDLA 
YEKRAPAAYAGPAPPPPRADRFAFRLPAPPIGGALLFAIiRYRVTGHE 
FWDNNGGRDYALRGPEHPGSGGAPEPQGWIHFI 

3071 

A 

133 

1079 

ARR I LQN F S L Y PS G EDLKG Y KV I QAQT AF SAN P ANPA I LTEA\ SAP I 
PHDGNLYPRLYPE\ LSQYMGLSLNEEEI \ RANVAWSGAPLQGQLV\ 
ARPSSINYMGGLLLTG*LMIiGIRRSR/ 1 1 *GKGFREVILCKDQDGK\ 
IGL\RLKSIDNG\IFVQLVQANSSSLIGLV*EFGDQVLQINGENCAG 
W\SSDKAPRWSNRPF*EKITMTIRDRPFERTITMHKDSTGHVGFIFK 
NGKITSIVKDSSAARNGLL\TGPNLCGNSMGQNVIGLKDSQIADILS 
T\SGTVGTITIMPAFIFEHIIKRMAPSIM\KSLM\DHTIPEV 

3072 

A 

242 

512 

AWVPCLLRHCDREPAPRGRTSPSLWPQEGT/P/GQDRGGGPWIiGQPS 
LQ * QRSHGGHLPAPQGPLGGPHDNSSFRPGRGGAGRGQG I EQGSK 

3073 

A 

1 

354 

GTRWRPGSMAGLELLSDQGYRVDGRRAGELRKIQARMGVFAQADGSA 
YIEQGNTKALAWYGPHEASGRAGWGIVWPWBLRGSRAERWLGDLRG 
NGRRAS VYTADAQHR I S LGVNAS P 

3074 

A 

3 

756 

VDPRVRVGRERTWRPGSMAGLELLSDQGYRVDGRRAGELRKIQARMG 
VFAQADGSAYIEQGNTKALAWYGPHEIRGSR/LSKPCRTGALVNCQ 
YSSATF\STGERKRRPHGDRKS\VEMGLQLRQTFEAAILTQL\HPRS 
QI * YLLLQVAYRADGGT * CS LCECS PRWAVLDAGI PMRDFVCACSAG 
C\VDGTALA\DFSHVEEAAGGPQLGPGPAAKPSGQIALLEMDGPAAR 
GPPGAGVGGCCPRLPEMCTPS 

3075 

A 

5 

792 

EDTGGIHMEEAPETYIPSRAVSLFFNWKQEFRTLEVTLRDFSKLNKQ 
D I RYLG KIFS SATS LRLQ I KRCAG VAGS LS L VLSTCKNI YS LMVEAS 
PLTIEDERHITSVTNLKTLSIHDLQNQRLPGGIiTDSLGNLKNLTKLI 
I^NIKIWEEDAIKLAEGLKMLKKMCLFHLTHLSDIGEGMDYIVKSLS 
SEPCDLEE I QLVS CCLST^NAVKI LAQNLHNLVKLS I LDLS ENYLEKD 
GNEALHELI DRMNVLKQLNALEI.PLGL 

3076 

A 

293 

1303 

GAAQLPPEAVLGARVFPERKGWGTSGRLAVRKAWEEVRALiASGFRPIi 
PGCPCTPWRKAARAAPSELSERLVGTSLADIFFTCSGFSSRIGPSAA 
SFAKAVRSLPEYPSVIL/QNSLIKLLWPISCTHYKYSVISSRLYTIK 
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FY * KFS F * AS \ IASCS PS LL SHKRLSTS S VI TLRS SS SNCHPTS LAS 

CRKVNLDNTWLSFLTNAGSGRNSLVLKSKNTNCLRFSNTPMKASHPS 

LLTRFPAKFNCWKFFRGFFPKNAPKILISVSVSLQFFNPSLTSCGTS 

SKCFNRLLRLPCTSQPQGSISAVSCSSTFILSTGRNKEQFRDWMVSK 
IDTNMRTEI 

3077 

A 

1 

3015 

ME VAR S G A I S LS L S RAHTRGG ARERRG PN S E S FL W WMAC WRAD VCCV 
LFRQHSPFNVLWTSSCLYVLPPMCSSRRPAALNFIKDNSRALIQRMG 
M TV I KQ I TDDL FVWNVLNRE E VN I ICCEKVEQDAARGI IHMILKKGS 
ESCNLFLKSLKEWNYPLFQDLNGQSLFHQTSEGDLDDLAQDLKDLYH 
TPS FLNF YPLGED IDI I FNLKSTFTEPVLWRKDQHHHRVEQLTLNGL 
LQALQSPCIIEGESGKGKSTLLQRIAMLWGSGKCKALTKFKFVFFLR 
LSRAQGGLFETLCDQLLDIPGTIRKQTFMAMLLKLRQRVLFIiLDGYN 
EFKPQNCPE IEALI KENHRFKNMVI VTTTTECLRHIRQFGALTAEVG 
DMTEDSAQALIREVLIKELAEGLLLQIQKSRCLRNLMKTPLFWITC 
AIQMGESEFHSHTQTTLFHTFYDLLIQKNKHKHKGVAASDFIRSLDH 
CGDLALEGVFSHKFDFELQDVSSVNEDVLIiTTGLLCKYTAQRFKPKY 
KFFHKSFQEYTAGRRLSSLLTSHEPEEVTKGNGYLQKMVSISDITST 
YSSLLRYTCGSSVEATRAVMKHLAAVYQHGCLLGLSIAKRPLWRQES 
LQSVKNTTEQEILKAININSFVECGIHLYQESTSKSALSQEFEAFFQ 
GKSLYINSGNIPDYLFDFFEHLPNCASAXiDFIKLDFYGGAMASWEKA 
AEDTGGIHMEEAPETYIPSRAVSLFFNWKQEFRTLEVTLRDFSKLNK 
QDIRYLGKIFSSATSLRLQIKRCAGVAGSLSLVLSTCKNIYSLMVEA 
SPLTIEDERHITSVTNLKTLSIHDLQNQRLPDLIIYSLSPQGPEVLN 
KRVKLEVQVLESNCWSSKSRLFLPQPQKPAVTETQRSASGHGRTAAP 
CVPGTVKAQLGTAAKGLRAYRTEWLVLRDGCMHCDADGLHSLSYHPA 
LNAILAVTSRGTIKVIDGTSGATLQASALSASRKLGFEGVSESHHWR 
RALLGGVNLGPGTGLSDA 

3078 

A 

1 

3220 

MNKKVSGLQELEASLKRKANTKKLYFKNMSWSPKKRAIGIjLSQRPLQ 
ADTQAAGRREEHIGGRTYKQLDVQRTLKGECWRKSTQQTSARQQAIH 
QRlTOSEFGLEWFIKDNSRAIilQRNGMTVIKQITDDLFVWNVLNREE 
VNIICCEKVEQDAARGIIHMILKKGSESCNLFLKSLKEWNYPLFQDL 
NGQSLFHQTS EGDLDDLAQDLKDL YHTPS FLNFYPLGED IDI I FNLK 
STFTEPVLWRKDQHHHRVEQLTLNGLLQALQSPCIIEGESGKGKSTL 
LQRIAMLWGSGKCK7^LTKFKFVFFLRLSRAQGGIi\LKTFGDQLIiDIP 
GT I RKQTFLAH\ VLKHAARGGLGLLDG YDE FKPQNC PE I EAL I KENH 
RFKNIWIVTTTTECLRHIRQFGAIjTAEVGDMTEDSAQ/UjIREVIiIKE 
LAEGLVLQIQKSRCLRNLMKTPLFVVITCAIQMGEREVP\SYTQTTL 
FPLPFNE\RLVQKNKHKHKGVAASDFIRSPDHCGDLA\LEGVFSPQF 
D FQH/ DRMCPA * MSHVLLTTGLLCKYTAQRFKPQY I FFLKS FLC\ FT 
AGRRLSSLIiTSHEPEEVTKGNGYLQKMVSISDlTSTYSSLLRYTCGS 
S V/ VSHQGC YETTSQQVDQHGCLLGLS IA* RPLWRQESLQS VKNTTE 
QEILKAININSFVECGIHLYQESTSKSSLSPRNLKLSFQGKAYIST* 
GTIPRLLI * LIj*T/ CCPI VQVPWTSLNWTFMGE/ PMASWEKAAEDTG 
G IHMEEAPETY I PSRAVSL FFNWKQE FRTLEVTLRDF S KLNKQDIR Y 
LGKI FS S ATSLRLQI KRCAGVAGSLS LVLSTCKNI YSLMVEAS PLTI 
EDERHITSVTNLKTLSIHDLQNQRLPGGLTDSLGNIiKNLTKLIMDNI 
KMNEEDAIKLGQISVLIFVSIFETIQSLNCSLFLPVDRMNVLEQLTA 
LMLPWGCDVQGSLSSLLKHLEEVPQLVKLGLKNWRLTDTEIRILGAF 
FGKNPLKNFQQLNLAGNRVSSDGWLAFMGVFENLKQLVFFDFSTKEF 

JjFUF^UjVKKljUPKCYPKIiTFLQEARLVGWQFDDDDLSVITGAFKLVT 
A 

3079 

A 

184 

562 

GQRGRGLRKGSGASGAGMMEHTHTEAHPHTWCTSTHTCTHTLGDTD 
YMHLHVCVSIHMYV/CPHT/MSLALGMSAHTLHMFSHTKLiLSR*HAG 
RHVPMPKQVTCTYVPTPSPTHVLGHACGCRPHAS 

3080 

A 

63 

428 

GLLFPQHCPAHRPRRCLRPRSSAGRGSAPALSRPSALCHLCLPRSSL 
SAPRSVS PAATPHH*ALQGLVEPWGPLHSAPLiHR*APASGPAGNVRA 
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WPACRGWCLSRPPLPGTVGEGAASRAF 

3081 

A 

1 

588 

HYGRLDCKCITLPQNNTTNRYYKEFPLSEILTVESAQNFSLVPPGTN 
PHCFEIVTANATYFVGEMPGGTPGGPSGQGAEAARGWETAIRQALMP 
VILQDAPSAPGHAPHRQASLSISVSNSQIQENVDIATVYQIFPDEVL 
GLGCPRTGISVGPVHHRTTTCRGWRSASVFSEVLCPRPAAALHSGSS 
QDPSNELF . 

3082 

A 

81 

289 

DPVPPSPDAPAPGSELAALLLGRDRLLQVPELERAASEAVSRADWIA 
AE WAL VTVRS AKS PRIiTS PPR 

3083 

A 

3 

387 

EGGKAHSFLGHIPLMRVEQSVRHTTRKSSTTLREGRWHYSNKDTLK 
KRHYWRLDCKS I TLFQNNTTNRH YKE I PLAE I LTAEPAQNFSLVPPG 
TNPHC FE I GTANAT Y FVGEMPGGTPGG P S GHG A 

3084 

A 

2 

290 

RPPGPSRSPPAPPPSSRRPL\PKMACCACCARVTAGPVPGLGWGVGG 
PGQRSPVTRRARPPSGLLGFEARPLVPWTSRSPFRCQCTACGHLVEF 
VL 

3085 

A 

2099 

5383 

VIQTSLSPSPATNDRGHPFSLDNWTGKPVHSMVAHLDAVTCLAVDPN 
GAFLMSGSHDCSLRLWSLDNKTCVQEITAHRKKHEEAIHAVACHPSK 
ALIASAGADALAKAHKEQGGKRLHLPVEDDGFGAQAPPCWEEVPQGT 
NAQFLGCKGLCPSPPAGPSEHTRKDGGRCLDTQDMAAFCWDSHLPFR 
KKGAELQSPPPLLPQIPAPGSGVSFHIQIGLTREFVLLPAASELAHV 
KQLACSIVDQKRPRVQVPRPPPPLSQEPRKVLLPARPTPVLGLEPVP 
ECGFYGLYDKILLFKHDPTSANLLQLVRSSGDIQ\EGDLVEWLSAS 
ATFEDLQIRPHAIiTVHSYRAPAFCDHCGEMLFGLVRQGLKCDGCGLN 
YHKRCAFSIPNNCSGARKRRLSSTSLASGHSVRLGTSESLPCTAEEL 
SRSTTELLPRRPPSSSSSSSASSYTGRPIELDKMLLSKVKVPHTFLI 
HSYTRPTVCQACKKLLKGLFRQGLQCKDCKFNCHKRCATRVPNDCLG 
EALINGDVPMEEATDFS\EADKSALMDESEDSGVIPGSHSENALHAS 
EEEEGEGGKAQS S LGY I PI»MR WQS VRHTTR \ KS S TTI*REG WWHYS 
NKDTLRKRHYWRLD C KC I TLFQNNTTNR Y YKE I P L S E I LTVES AQNF 
SL. VPPGTNPH\CFEIVT\ANATYFVGEMPGGTPGGPSGQCG*GRP/V 
AWETAI RHAL I P VI LQDAPS AP \ GHAPPKQASLG I S VSNSQ IQKNVD 
IATVYQIFPDEVLGSRQFGVXVYGGKHRKTXSRDVAVKAIDKXLRFP 
TKQESQLRNEVAILQSLRTPGIVNLECMFETPEKVFWMEKIiHGDML 
EMILS\SEKGRLPERLTKFLITQILV\AFRQLHFKNI\VHCDLKPEI 
VLLiASA\DPFPQVKXiCDFGFARIIGE\KSFRRSVGGARRPYIiGNPRL 
WL»\NQGYNRSLDIWSVGVIMYVSLSGTFPFNEDEDINDQIQNAAFMY 
PWPAPWEATSSAGSHLTFIQQPCCRLKMR\KRYSVDKSLSHPW\LQE 
YQTWLDLREL\EGKMGERYI\THESDE\TRWEQVC/EQSIRCPGSWA 
APRTGISVGACPTTGPRTCKGLAERISVXt 

3086 

A 

2 

426 

EDGSLATFLGRRQFAFAEMRHSKRTHCPDWDSRESWGHESYRGSHKR 
KRRSHSSTQENRHCRPHHQFKESD/CCSMCIPLEASHSVEEDTHPSH 
YLE ARSLNERDYRDRR YVDE YRND YCEG YVPRH YHRD I E S G YR I HCS 
RS 

3087 

A 

39 

1514 

GDGAAAI LWGVCRSGRGGKTAVWRHFSAEGPFAFAEMRHS KRTHCPD 
WGS I ES \ WGHES YRGSHKRTRI SHS RS \ QENRHCKPHHQFKESDCHY 
LEARSLNERDYRJDRRYVDEYRNDYCEGYVPRHYHRDIESGYRIHCSK 
SSVRSRRSSPKRKRNRHCSSHQSRSMKSVDTLGEGAFGKWECIDHG 
MDGMHVAVXIVWSTVGRYREAARSEIQVLEHLNSTDPNSVFRCVQMLE 
WFDHHGHVCIVFELLGLITYDFIKENSFLPFQIDHIRQMAYQICQSI 
NFIiHHNKLTHTDIjKPEN I LiFVKSD YW KYNS KMKRDERTLKNTD I KV 
VDFGSATYDDEHHSTLVSTRHYRAPEVILALGWSQPCDVWSIGCILI 
ELYPWGSQFFRLQ**KEHLAMMERNIRTPYPQHMI\QKTRKRKYFHP 
♦PSLDWDEHSSC/AGRYVRRRCKPLKEFMLCHDEEHEKLFDLVRRML 
EYDPTQRITLDEALQHPFFDLLKKK 

3088 

A 

2 

175 

TSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPG 
SIALKKVKCV 

3089 

A 

1 

534 

MLFPGSIDLKKVKFQAKLEHECIQNFKILQAGFKKLGVDKIIPVDKS 
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VKGKFQDNFBFVQWFKKFFDANYDGKIYDPVAARQVCCLEMEVKHAG 
DHAASFLDLWQREWSMKAIVTLGSARLGCFRGVGLVVFQEDEERPEI 
AHPAPLPCEAMCSIRTLQKVPTSKEAIiTRCSPSTLDF 

3090 

A 

1 

325 

LEKERDF Y/ FGKLR / N I EL I CQENEGENAP VLQR I VD I L YATDEGFV 
IPDEGGPQEEQEEYYSLGPAEQHGYSSLQIMCLTVKYS1.LLSLEDSL 
VSFHKQKVPLLKVPVF 

3091 

A 

3 

930 

SSRGRAGGVWRFERDEDQTGAGCGQWTRFCREPKMAVNVYSTSVTSD 
NLSRHDMLA\WINESLQLN\LTKIEQLCSGAAYCQFMDM\LFPGSIA 
LKKVKFQAKLEH \ E Y IQNFKI LQAG FK\ RMGVDKI I S VDKLLKGK\ F 
QDNFEFVQWFKKFFDANYDGKDYDPVA\AR\QGQETAVAPSLVAPAL 
1JIKPK\ KPLTSS S AAPQR\ P I \ STQRTA\ AAPKGWALG WRKNPGVG\ 
NGDDEAAELMQQGQRI*NLLFEDLGGKERDFYFGKLRNIELICQENE 
GENDPVLQRIVDILYATDEGFVILDEGGPQEEQEEY 

3092 

A 

457 

912 

DSSNLVPFSGFTSNSSSLAVSTTSTVTSYNE1LKLSVIKVRINFFQT 
PVNVAILISFHETSVFLLASRMLNSFQKIFSLPRFWIWAKRPGILPN 
I CNAQESS S PTKD * QRKWFLEMRSTPGEDAANNVEMTTKDLNYDRNL 
VDKAWGFERTDSNFERSCTVG\NSIA*YREIVHGKKSTWMQQTSLL 
S YFKKLPQPLHPS ATTTLVY * KPS I SRQDS PPTK/ RL * LTDGSDDH 

3093 

A 

337 

903 

CNRNTSTLTRTCTNVCINTPRPAPPLLA/ PQATLHFQFP * / PPLPL* 
♦QPQALSATRLSL/VCHGSGLRLLPLLQAAPGWLPSLGPLSLFSCRG 
RVLLC* PGWSCYLGLKLSSHLGLPKCWDDRPKPPSPAS FEWAPTRPN 
PLPQHGTQSQCLFSVGEAPRPHSGSAYYTNQCLQCSRPCSERTRRLG 
VLLPK 

3094 

C 

821 

1051 

MAMELINNKHWPSVRALCWNTVLAYVIGEGVNSGRSCLAQTL 
KGSAFRDWVLGPAPGGNGALRLFRKILFC* 

3095 

A 

1 

594 

S S PLRVAWCS SNQNRSMEAHN I LS KRGF S VRS FGTGTHVKLPG PAP 
DKPNVYDFKTTYDQMYNDLLRKDKELYPSGLPFKNPPCGPPWEGVTC 
GIjGHRRRVTLCS ANTGS S VFCES LVS I SGARR I VHGS PGNLRLEKSL 
RISEELSGHVLVHPVWGTASTWLCSRRGLRRSRPGKYLPRSARTESS 
FSSLIRKDLV 

3096 

A 

3 

761 

TSRGRVGTQAGEPRDLRPPPCPSSPLRV\AW\CLEQPERGAWEAHN 
IPQPNGDSAVRSFG\TGTHVKLPGPAPD\NPNVY\DFKTTYDQMYND 
LLRKDK\ELFTQNGILHIAG\RNKRIKP\GPERFQNCKDLFDLIIjT\ 
CEERVYDRVGWKI * ISR\EQGDLPSPVHWN\LDIQDNHEEATLG/A 
RFLICE\VCQCIQHTEDM\ENEIDELLQEFEEKSGRTFL\HTV\CFI 
LFSFLDICF 

3097 

C 

129 j 

314 

MSEVIGYEAFFLIFASLFIGXXXXXXXXXXXXFFAYSFLFFCSFWTL 
CVS FLRRPRTSGS YF 

3098 

A 

1 

1434 ! 

MGYQASMGQEYGNVLNVCLIRARSLVSLIRGQWTTDGTPLVGVNVS 
FVKYPKYGYTITRQDGTLIQSTLIEEHVKSWLVMFGFQLSNIIPGFP 
RAKMYFVPPPYELSESQASENGQGFSNPSHSPTPTGILHKGTGENQF 
LTQQPAIITSIMGNGRRRSISCPSCNGLAEGNKLLAPVALAVGIPGS 
LYVGDFNYIRRIFPSRNVTSILELRPSHRNHLGILGSFWVCMKRIKY 
LGIQLTRDVKDLFKI^KPLLKEIKEDTNKWKNLPCSWVGRINIVKM 
AILPKVTVKCLKEELLHGLASIKEVPANCDTAGRVVPHKGKDKDAMT 
GAAGM SLGALE KL PGGGVS CVPKPSTTS FAMKPPVP I SGYLLD I SVS 
RQAPQTTNRKLNPVLVSNNFASGLFCHYRVSCVPKPSTTSSAMKPQV 
S VSRY PLD 1 1 SVS RQARQTTTNRKLNP I LVSNNFAS CLFCHYHERVP 

3099 

A 

1 

831 

MPST I KGS RGD KKES VQVKG AERQRKG YRVGVTNGRF P3jR SVS PSD S 
SPCNNPAHKYYLAVD^VSGSLYVSDTNSRRIYRVKSLSGTKDLAGNS 
EWAGTGEQCLPFDEARCGDGGKAIDATLMSPRGIAVDKNGLMYFVD 
ATM I RKVDQNG 1 1 STLLGSNDLTAVRPLSCDSSMDVAQDHAYALPHS 
DVGLVLDKAVLCQLCTASPSQACLLHSRTRTAQAARPVSGDKERAWA 
SRLQFGFVQSHVTGAESALANFGLRGAEKHITLKKRPSRRQK 

3100 

A 

1 

1251 

MGF S MLAR I LMQN AS LRTSDS GM YRTG I PNCLGQFGLLQQKYHRLGG 
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LNNKHLPLAVLEAGKSKIKAPADSVSDGbVDCLDPDCCIiQSACQNSL 
LCRGSRDPLDI IQQGQTDWPAVKSFYDRI KLLAGKDSTHI I PGBNPF 
NSSEHLSPGHASHVLHRSLSRHAWVHGRNLLSIDYDRNIRTEKIYDD 
HRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMS 
ERTDIDKQGRIVSRMFADGKVWSYSYLDKDGCGKKSTGCLRKSRLCY 
QRQKSD S GHI AF I Q YE AAS PGEQQLYVFNADG I HQYTV SLVTGE YL Y 
IFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITIiTVG 
TNGGLKWSTQNLELGLMTYDGNTGLLATKSDETGWTTFYE 

3101 

A 

1 

5355 

MAGWLRAWLDGWLTGWI^GCMSEWMMDGWMDGWIDGWMDGWNPIVPE 
TQVLHEEIELPGSNVKLRYLSSRTAGYKSLLKITMTQSTVPIiNLIRV 
HLMVAVEGHLFQKS FQAS PNLAYTF I WDKTDAYGQRVYGLSDAVVS V 
GFEYETCPSLILWEKRTALLQGFELDPSNLGGWSLDKHHILNVKSGI 
LHKGTGENQFLTQQPAI I TS IMGNGRRRS ISCPS CNGLAEGNKLLAP 
VALAVG IDGSLYVGDFNY IRRI FPSRNVTS ILELRNKEFKHSNNPAH 
KYYLAVDPVSGSLYVSDTNSRRIYRVKSLSGTKDLAGNSEWAGTGE 
QCLPFDEARCGDGGKAIDATLMSPRGIAVPKNGLMYFVDATMIRKVD 
QNGIISTLLGSNDLTAVRPLSCDSSMDVAQVRLEWPTDIiAVNPMDNS 
LYVLENNVILRITENHQVSIIAGRPMHCQVPGIDYSLSKLAIHSALE 
S AS A I A I S HTGVIjY I TETDEKKINRLRQ VTTNGE I CLLAG AASDCDC 
KNDVNCNC YSGDDAYATDAI LNS P S SLAVAPDGT I Y I ADLGN I RIRA 
VS KNKPVLNAFNQYEAAS PGEQEL YVFNADG IHQYTVS LiVTG E YL YN 
FTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGT 
NGGLKVVSTQNLELGLMTYDGNTGLLATKSDETGWTTFYDYDHEGRL 
TNWRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITOLSSVEASY 
TWQDQVRNSYQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIG 
RCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLLSIDY 
DRNIRTEKIYDDHRKFTLRIIYDQVGRPFIiWLPSSGLAAVNVSYFFN 
GRLAGLQRGAMS ERTD I D KQGR I VS RM FADGKVWS YS YLDKS MVLLL 
QSQRQYIFEYDSSDRIiIAVTMPSVARHSMSTHTSIGYIRNIYNPPES 
NASVIFDYSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVT 
FGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRFSEEGMVN 
ARFD YTYHDNS FR IAS I KPVI SETPL P VDL YR YDE I SGKVEHFGKFG 
VI YYD INQ I ITTAVMTLS KHFDTHGR I KEVLA / YEMFRS LMYWMTVQ 
YDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVACNDRPTWRYS 
YELNGMLHLLNPGNSVRiMPLRYDLRDRITRLGDVQYKIDDDGYLCQ 
RGSDIFEYNSKGLIiTRAYNKASGWSVQY\RYDGVGRRASNKTNLGHH 
LQYFYSDLHNPTRITHVYNHSNSEITSLYYDLQGHLFAMESSSGEEY 
YVASDNTGTPLAVFSINGLMIKQLQYTAYGEIYYDSNPDFQMVIGFH 
GGLYDPL\TKLVHFT\ *RDYDVLA\GRWT\ S PDYTMWK\ NVGKGAGP 
PFKPYMFK\SNNPLSSELDLKKTYVTGEDSATKDVKSWLVMFGFQLSN 
IIPGFPRAKKYFVP\PPYELSESQASENGQLITGVQQTTERHNQAFM 
AliEGQV I TKKLHAS I REKAGHWF VTTTP 1 1 G KG I MLA I KEGRVTTG V 
SNIASEDSRKVASVLNNAYYLDKMHYSIEGKDTHYFVKIGSADGDLV 
TIX3TTIGRKVLESGVNVDI*/VSKPTLLVNG\RTRRFT\NMEFRTSRW 
WFSIRYGPNPRTLGRREKAAVLDQ/ARRQRALGT\AWAKEQQKARDG 
REGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQF 
LRQNEMGKR 

3102 

A 

205 

393 

RCSMRVHQSQPLSSVESTGAHHEKMSYQSTHDGCGDFCLPEFKNKVJL 
SWSLQTFLGCMGDRTA 

3103 

A 

2 

158 

GRTCSEEARPGAVACRAGDMDPLFQQTHKQVHEIQSCMGRLETADKQ 
SVHSE 

3104 

A 

1 

1923 

MRPPPALALAGLCLLALPAAAASYFGRQSRSLMPSRDPHAGALSAAI 
LASGEPSLPRTPSPPSAFALRSGDREGWVDGWDGLFPVSGQRIVRVQ 
GE WGKPRTS F AFDYNTKCFG I L I RE YAN I RRAGNI YVFG PLMNDCPQ 
QMAS SLQAKTWAAFAHQCS AAKGQLHTELLHEDALKKWTS PAMGWER 
SRSHDKPRRLSRPLVPPRPF PRAPC AG S SRVRRGLADQKGQQFPTQR 
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SLLPTGSASFTPDRGCAESWCLRPRAIilGCSLTSSNPAAPRWAREGG 
GCGWRCASDKPESHFQSQVDFVPTIGGVAPPLHGRGQTSSSAPLL.ME 
PHLLGLLLGLLLGGTRVXAG YP I WWCW AQGCVAVN I FAD I YL WVRAI 
LILHLYIHVILYAGKWSQWAMGAGEKVLRSEAETRFFSVRQPVPREH 
SGYPRQVHEIQSCMGRL\DTADKQSGHIVENEIQASIDQIF\SRLER 
L E I L S S KE P PNKRQNGR / L FR VNS L KYD VQH L QT \ ALRN F Q \ HRRHA 
REQQ \ ERQRERAFC/ RS KPFTT * TTLDTTI TNWDR I HLQF * LPSLPR 
Kr HWGriG ± LHj 1 IjLHj \ tiN X \ IxUGJ-iR rQRIj \ TL KGTQKKI LD I ANMI*G L» 
SNTVMRL I E KRAFQDKYFM I GGMLLTCWM FL WQYLT 


A 

1 

C CL C 

boo 

r oOKFXKFKr'OAVAi-KAolJMJJFJjr yQlHKQVHEIQoCMGRIjETADKQ 
S VH I VENE I QAS I DQI FSRLERLE XhSS KE PPNKRQNAKLRVDQLKY 
DVQHLQTALRNFQHRRHARE\QRRRQR\EELLSRTFTTNGTQKKDPb 
TLPNMLGLS\NTVMR\LIEKR/AFPQDKYFMIGGHPA*PLWALFLWF 
QYRT 




xz> X, 

t?Q Qr3T.T.'DTH , Pr>r , T.2iGVP , tJW'K'TT WflTCMA^A/r'l/ 
roouljJjrlnr yv^xxrt.£> x E*vt I raiYxljiV J\>c*iNiA v Vo V 

3107 

A 

432 

649 

GSHPFAQAGVQWCVCSTLQP/LNS*SSNLSLHSCWDYKFK1»VCICKG 
DGKEFLHMHI LKKDVTLTEKLI PRYV 

3108 

A 

2 

678 

PPSFQPQSDGRGDASGRNAAMAAQGEPQLVLVGDGGTGKTTFVKRHL 
TGEFEKKYvATLGVEVHPLVFHTNRRP I K\ FNVWDTARQEKFGGLRD 
G Y Y I QAQCAI IM FD VTS RVTYKNVPNWHRDLVRVCEN I P I VLCGNKV 
DIKDRKVKAKSIVFHRKKNLQYYHICAKSNYNFEKPFLWI^RKLIGD 
PNLEF/VAMPALAPPEWMDPAFASTYEHDLEVAQTTAIj 

3109 

A 

1 

549 

MAAQGQPQVQFKLVLVGDGX5TGKTTFVKHHLTGEFEKKYVATLGVEV 
HPLVFHTNRGPVKFITVWDTAGLEKFSGLRBGYYIQAQSTIIVFDVTS 
RVTYKNVPNWHRDLKNLQYYD I S AKSNYNFEKP FLWLARKL IGDPNIi 
EFVAMPALAPPEVVMDPAIiAAQYEHDLEVAQTTALPDEDDDL 

3110 

A 

2 

822 

APALAPASAAS PAFE FTL P PS FQPQSDGRGDAS GRNAAMAAQGE PQV 

QFKLVLVGDGGTGKTTFVKRHLTGEFEKKYVATLGVEVHPL>VFHTN\ 

RGPIKFNVWDTAWPWRNSGGL\RDGLYIPSPRVPIIMFD\VTSR\VT 

YK\NVPNW ALGDIiVTS V/ CEN I PHLCCVANKV\ D I KD \ R KVKAKS IV 

F \ HRKENF \ Q Y Y \ D I S AQKVPTNLKKP FL WALLG S LMGD P * LWN1»I>P 

MPxjJ->£>PFPR/ iiVVMfc.PQjbWPATSMKHDIj\EVASDNCLPPDED 

K 

3111 

A 

116 

599 

PQVISLPKCCDCRCEPLHLAAEKSLPTVRETTKVPGEAKSEFHKNKM 
U GRGG KPMGG U -bO P U AM X V KbXSSDG\HE\FIVlCRDHGIjT\SGTE*K 
PMLSGPG\QFAGGTKTH\EVNFR\EIPSH\IAIRKVCNVFFTYKV/R 
ATPNS SHPRFPEFPNLAPGNLHWELA 

3112 

A 

84 

350 

KYTIKTKTVLKKSGIFSTCPKPKSCYDFVPLNAGPIPPKSILGTNGP 
THTLEMLD * RLNKPTP I S PQRS PLTQVGHR * TRPTRRRPRT 

3113 

A 

122 

518 

KSLEFFPPALSPSPALILCHLMQAPYHLKVSWEPTpPPT/ALEMLD* 
RLPLPHA*HKGETHTCGPLSDHTFKAPPDPLQICLIiSHQGPASPSQQ 
THPYQPSEVTADSGGTQVDKANQEEATNTKHHEQEEEQD 

3114 

A 

1 

1452 

MSAYCLQLGGLQAXilJUjTQTPGHRDSLPDAQPAAVLVCGWPWLASFS 
ERT I VGHKEWTI KDKLKNKFKDE FD I VS LPCTLG SHPQPQDPMLKLP 
WS ITYGRCLAPGDHTVSLVI KLVES ASLSLAVSNMLSDM IRWMPAGC 
GRCLLGLQSESRAVCGQCVGMEQVCRVSGPHIRWIPQPQLHSCLFFIj 
YPSPYPLEPSSNTSFICPHSRTYNGSLLSCIPNSaaAAAPPTPQATS 
FPSCPSPKQRIiPTGRWEGSDHRVHEARLPGNHSRSRGVDSAPPTLA 

HFRTRGQVYDASSAPVTEHTATELPAGNQGSQPGLQEGGLEGTVVTL 
VS WKVT/MVSRRC * CP.YGQRPH/HHLKVLGNQRX PTHTLEMLD * RLP 
LPQRLTPKGKHTP WPPLLI \ QPFKAPPDPLQI CLLSHQGPAS PSQQ 
THPYQPSEVTADSRWDTR 

3115 

A 

1 

414 

MSRQWRSSKFRm^FGQPAKADQCYEDVRVSQTTWDSGFCAVNPKFV 
AxjICEASGGGAFLVLPLGKVINTLSDHHHRGTDFGGSPWIjHVIIAFP 
TSYKVVITLWIVYLWEQAFESSQKYKEGKYIIElxNITO 
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3116 

A 

112 

1655 

QEHLQRASPRLLQLLPPLPPPPPPPAFGGSLSSI/3SRMSRQV\VRSS 
KFRHVFGQPAKADQ\ CYEDVR VS QTT\ WDSGFWC/ SVNP * VLWALIC 
EASGGGAFLVLPLGKTGRVDKNAPTVRR*HS / APVLDIAWCPHNDNV 
I A\ SGSED \ CTVMVWE I PDGGLM\ LPLREPWTLEGHTKRVGI VAWH 
TTAQNVLLSAGCDNVIMVWDVGP\GAAMIiTLGPEGPP\DANYRGDWE 
P\DGGLICTSCRDKRVRIIEPRKGTWAEKDRPHEGTRPVRAVFVSE 
GKILTTGFSRMSERQVALWDTKHLEESPV/SLRKLHTRNGVPVAFFG 
P\DTNIVYLRGKG\SGPIRVFE\ITSE\APFLHYL\SMFS\SKESQR 
GHGLHGPNRGLEVNK\CEIAGFYKLHDGR/CVSPMPKPVPRKSDL\F 
QE \ DLGPTQP QG PD P CLHGLKE WLGGRD AG P L P S S P \ LKDG \ YVPPK 
SREL\RVNRGLDP\GRRR\AAPEASGTPSSDAVSRLEEEMRKLQATV 
QELQKRLDRLEETVQAKL 

3117 

A 

1 

2964 

MRRDPTPPRGLPRQRPTPSPRTRSRGHQSQRPKAKSEPQPAPAATPG 
GRRAWEPRAQAGCPPPPRAASPRRLVQGERPESGNRSRQAAYPRRAS 
RARTCTTRIiPPTLRQRALTQRPCSVPRTNRKLRRGPGPITLRRPESC 
SRPTlFATEAJ^AQALPRPPGVIARQSAMHRALAAIiKNKPLNHVHKW 
KERLCPWLCSRQPEPFGVKFHHAHCKKFHSKNGNDLHPLGGPVFSQV 
SDCDRLEQNVKNEESQMF YRRLSNLTS SEEVLS FI STMETLPDTMAA 
GALQRICEVEKKDGDQGLPKEILENSIFQALCFQFEKEPSQLSNTSL 
VTALQALILLHVDPQSSLLLNLVAECQNRIiRKGGMEVRNLCILGESL 
I TLHS S GCVTLEIi I INQLQGEKLE TF TP ED I VAL YR I LQACTE KVDE 
HQTFLNKINNFSLS I VSNLSPKLI SQMLTALWIjDQSQAFPLI IKLG 
KYVVRHVPHFTNEELRRVLEAFIYFGHHDTFFTKALEHRVAAVCLTL 
DPEWCRVKEYCSRELILSKPILNAVAETFVCQTEKFSPRQISALME 
PFGKLOTLPPNASALFRKLENVL.FTHFNYFPPKSLLKLLHSCSLNEC 
HPVNFIJUCIFKPLFLQRLQDVEIKLDEEGFVLPSTANEDIHKRIALC 
IDGPKRFCSNSKHLLGKEAIKQRHLQLLGYQWQPLLILSHLALPYP 
LHVRIQQTVLITLGTLNAPALGPNRKEPLPLRGLKQGSSDTKEARHS 
RGDCBALGQWSYKFRSQQLKERASTEQTGASDETCLHVDPLGQFFKA 
SPGPSERFMTIHQLPSAVTWAGALDGGGMEDDEEETTASTLRGKPRP 
PPVSAQSAFSYIPPRRLDPKEHSYYYRPARVKKVFASPLLSAMITGI 
ISLYDCIFKRRLDYDQKLHRDDREHAKSLGLHVNEEEQERPVGVLTS 
SVYGKRINQPIEPLNRDFGRANHVQADFYRKNDIPSLKEPGFGHIAP 
S 

3118 

A 

5 

622 

GETPRRPGASRGRGRRRAQGHAPGDTKAKGQRPSPSPSPRPRRPPEA 
EGRGN \ PG ASRLPTPAPGRLS QE I S AGRAAG I RE P * PTGS LPTPRQP 
GTNL\ PTRLPPTLRQRALTQRPCSNfflRALAALKN^ 
LCPWLCSRQPEP\LGSNSIMPIVKSFIRKMEMTFIHSVDQCSLKYLT 
ATGLNKMLKMRRVRCFTGD 

3119 

A 

1425 

2181 

GMSHISLNEELRRVLEAFIYFGHHDTFFTKALEHRVAAVCLTLDPEV 
VCRVMEYCSRELILSKPILNAVAETFVCQTEKFSPRQISALMEPFGK 
LNYLPPNASALFRKLENVLFTHFN\ IVPPKSLLKLLHSCSLNECHPV 
NFLAKlFKPLFLQRLQGKESHLDTLSRAQLTQLFLASVIiECPFYKPK 
NLAPC I P ATP VMTKRGQGTAWAVAS EGVS PKS WQLPHG VE PVGAQKS 
RIKVWEPLPSFQRMYENA 

3120 

A 

408 

1669 

HI F YKSPRAS CSCGCASRWDPEWCRVMG VIjQ * EN* FFSKP I LNAGA 
ETFVCQTEKFSPRQISALMEPFGKLNYLPPNASALFRKLENVLFTHF 
NYFPPKSLLKLLHSCSLNECHPVNFIAKI FKPLFLQRLQGKESHLDT 
LSRAQLTQLFLASVLECPFYKGPKLLPKYQVKSF/PYPMLFPGDPCG 
FSAL*ICED\GLTNLIjGARIiYFAPKVIiTPYCYTIGNCSLPEGKDSSE 
AFRARRPDVEIKLDEEGFVLPSTANEDIHKRIAXiCIDGPKRFCSNSK 
HLLGKEAIKQRHLQLLGYQWQPLLILSHLALPYPLHVRIQQTVLIT 
LGTLNAPALGPNRKEPLPLRGLKQGIiTSHSTLISGGPFRGAMTHLMK 
EASHLQYMPLS FAVS PRAPRTPRRQDTPGETVRPWGSGRTS SGVNN 

3121 

A 

28 

2034 

ESDGMAJLI TLRKNLYRLSDFQMHRAIiAAllKNKPIJ^^TraKVVKERLCP 
WLCSRQPEPFGVRFHHAHCKKFHSKNGNDLHPLGGPVFSQVSDCDRL 
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EQNVKNEESQMFYRRLSNLTS S EEVLS F I STMETL PDTMAAGALQR I 
CEVEKKDGDQGIiPKQIIiENSIFQAIiCFQFEKEPSQLSNTSLVTALQA 
L I LLHVDPQS SLLLNLVAECQNRLRKGGMEVRNLC ILGESL ITLHS S 
GCVTLELIINQLQGEKLETFTPEDIVALYRILQACTEKVDEHQTFLN 
KINNFSLSIVSNLSPKLISQMLTALVVLDQSQAFPLIIKLGKYVVRH 
VPHFTNEELRRVLEAFIYFGHHDTFFTKALEHRVAAVCLTLDPEWC 
RVMEYCSRELILSKPILNAVAETFVCQTEKFSPRQISALMEPFGKLN 
YLPPNASALFRKLENVLFTHFNYFPPKSLLKLLHSCSLNECHPVNFL 
AKIFKPLFLQRLQGKESHLDTLSRAQLTQLFIiASVLECPFYKGPKLL 
PKYQVKSF/ PYPMLFPGDPCGFSAL* ICED\GLTNLLGARLYFAPKV 
LTPYCYTIDVEIKLDEEGFVLPSTANEDIHKRIAL\VLMVQKRFCSN 
SKHLLGKEAIKQRHLQLLGYQWQIPYHEIGMLKSRREIiVEYL/RQR 
KP VS PQNTC P WGGQE 

3122 

A 

200 

470 

RGGGAGPLTGHHVGLLGIAHAAADQFDRLAAGICGHSHPHPGLPGPIi 
HCCAPLWSGPQQN * PTADVAESLQLVTVHQS ETEDGE FTECNN 

3123 

A 

1 

2520 

MAAYLQWRRFVFFDKEIiVKEPLSNDGAAPGATPASGSAASKFLCLPP 
GITVCDSGRGSLVFGDILRLELSFPSRDPEEIERLLDYPAHMEGQIW 
FliPRSLQLTGFQAYKLRVTHLYQLKQHNILASVGEDEEGINPLVKIW 
NliEKRDGFTDGSVTLNKGDITRDRHSKTQILHKGNYPVTGLAFRQAG 
KTTHLFVVTTENVQHPVQSKTTFFHPLPKYPKQTLLQAPPNTLLIiRH 
SWPNGDCRDDHVFSLYFMIPFFFTTLGLLYHNWYPSRVFVGDTFCY 
FAGMTFAWGILGHFSKTMLLFFMPQVFNFLYSLPQLLHIIPCPRHR 
IPRLNIKTGKLEMSYSKFKTKSLSFLGTFILKVAESLQLVTVHQSET 
EDGEFTECNNMTLINLLLKVLGPIHERNLTLLLLLLQVESVFETIiVB 
DS AEEESTLTKLGNT I S S LFGGGTTPDAKENGTDTVQEEEES PAEG S 
KDEPGEQVELKEEAEAPVEDGSQPPPPEPKGDATPEGEKATEKENGD 
KSEAQKPSEKAEAGPEGVAPAPEGEKKQKPARKRRMVEEIGVELWL 
DLPDLPEDKLAQS VQKLQDLTLRDLEKQEREKAANSLEAF I FETQDK 
LYQPE YQEVSTEEQREE I SGKLS AASTWLEDEGVGATTVMLKEKLAE 
LRKLCQGLFFRVEERKKWPERLSALDNIjLNHSSMFLKGARLIPEMDQ 
IFTE^EMTTLEKVINETWAWKNATLAEQAKLPATEKPVX,LSKDIEAK 
MMALDREVQYLLNKAKFTKPRPRPKDKNGTRAEPPLNASASDQGEKV 
IPPAGQTEDAEPISEPEKVETEPEQKEQSTGQKRPLKNDEL 

3124 

A 

1 

1950 

MKKASSQEKDFRMPRGKKEPSYIVSGKDYPRVELDTHGCGLRCSALS 
DPSQDLQFIVAGDECVYIiYQPDERGPCFAFEGHKLIAHWFRGYLIIV 
SRDRKVSPKSEFTSRDSQSSDKQILNIYDLCNKFIAYSTVFEDWDV 
IjAEWGSLYVLTRDGRMLFKKNLFEMAINLAKSQHLDSDGLAQI fmqy 
GDHLYS KGNHDG AVQQY I RTLGPLG AS PGLM I VQKAAI KGAWEAGWR 
REQKKPNS ADLGTVRAAS RNCGQRLCVlxAA(^ YG YYRTV I FSAMFGG 
YSLYYFNRKTFSFVMPSLVEEIPLDKDDLGFITSSQSAAYAISKFVS 
GVLSDQM S ARWLF S SGLLLVGLVNI FFAWS STVP VFAAXj WFLNGLAQ 
GLGWPPCGKVLRKWFEPSQFGTWWAILSTSMNIiAGGLGPILATIIiAQ 
S YS WRS TLAJLS GALCVWS FLCLLL IHNEP ADVGLRNLDPMPS EGKK 
GSLKEESTLQELLLSPYIiWVIiSTGSSYMSALEVGGIiVGSIAAGYLSD 
RAMAKAGLSNYGNPRHGLIiLFMMAGMTVSMYLFRVTVTSDSPKLWII* 
VLGAVFGFSSYGPIALFGVIANESAPPNLCGTSHAIVGLMAKCTIAK 
HYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSQEG 

3125 

A 

1249 

2166 

MGAVYQAAALSKAFKVKPFVVRDAVVYPILVEFTREVEEEPGIHSLK 
HNKRVLFSRMGPYPQRKVITFNRYSHDFNFHINYGDLGFLGPEDLRV 
FG S QNLTTVKLXGVGDS FKKYPD YES KG I KAHFNI*DESGVLS LDR I F 
QKTKCS I CNSALELPSVHFLCGHSFHQHCFES YSESDAPCPTCLPEN 
RKVMDMIRAQEQKRDLHDQFQHQLKCSNDSFSVIADYFGRAASPIIF 
MFVESVTQRTTYSQSNTL.RCVNGSGPPEPGADPQSAEEQQLWRQPLS 
RRQLCPLTPGPGKALGPAATMAVE 

3126 

A 

1 

2424 

WAIHSAIiESSLLFQEQSQLLYLVIjTVHENIjNFMAIGFTDGSVTLNIC 
GDITRDRHSKTQILHKGNYPVTGLAFRQAGKTTHIiFVVTTENVQDLQ 
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FIVAGDECVYLYQPDERDCRDDHVFSLYFMIPFFFTTL/VIALPQLV 
P I TG VCGR YLLL JjCW \ QTFAWG I LGHFS KTMLLF FMPQVFNFL YS L 
PQLLHIIPCPRHRIPRLNIKTGKLEMSYSKFKTKSIiSFLGTFILKVA 
ESLQLVTVHQSETEDGEFTECNNMTLINLLL/QSPWAHT*EKPHIAP 
AAAAAFS FLCDHWHPGR FLPS AD Y * LDFTYG FLQLATLS I»S I PS LQP 
\ 1 GT I G KLEPS YV I RKFLD AQR I HNLTAYLQTLHRQS LANADHTTLL 
LNCYTKLKDS SKLEEF I KKKSES E VHFDVETA I KVLRQAGYYSHALY 
LAENHAHHEWYLKIQLEDIKlTyQEAIjRYlGKLPFEQTVESNMKRYGKI 

NNPRELKAFLEHMSEVQPDSPQGIYDTLLELRLQNWAHEKDPQVKEK 
LHAEAISLLKSGRFCDVFDKAWFQQIMHYHMQHEQYRQVISVCERHG 
EQD PS LWEQALS YFARiKEEDCKE YVAA VLKH I ENKNLMPPLLWQTL 
AHNSTATLSVIRDYLVQKLQKQSQQIAQDELRVRRYREETTRIRQEI 
QELKASPKIFQKTKCSICNSALELPSVHFLCGHSFHQHCFESYSESD 
ADCPTCLPENRKV14DMIRAQEQKRDLHDQFQHQLKCSNDSFSVIADY 
FGRAS PNF I FYW 

3127 

A 

1 

411 

MAAYLQWRRFVFFDKELVKEPLSNDGAAPGATPASGSAASKFLCLPP 
G I TVCDSGRGSLVFGD ILRtiELS FPSRDPEEIERLIjDYPAHMEGQI W 
FLPRSLQLTGFQAYKLRVTHL»YQLKQHNIIiASVGEDEEGINPI» 

3128 

A 

2 

243 

VVHIxAQD<^LVKDVAEATTLQ^ 
ITRSLGTKPVNSFPQSIPSSQRTIEPRGKVLLV 

3129 

A 

1 

1176 

MAWAGSEPRRVT^RiGSTASEGEKQGVXEGPLPAAAAAAAAAAAAAAG 

EAAGRGRTGQRRSPGHRLRAARVPTAPGKPRPGFCARLPRPYAGAGR 
GQAGAGSAEPLGLRPLLEKRGKERRGVGDPKIFQKTKCSICNSALEL 
PSVHFLCGHSFHQHCFESYSESDADCPTCLPENRKVMDMIRAQEQKR 
DT^HDOFOHOEEN S PKMRVT RVGTR KS nLAR T OTD S WATLKAS YPGL 

XyxXlXX/\{X *-»XjAH *— ' J vl IXv V J. XV V V7 X i\iw JL 1 AV1>- -L ^ X V V -A* X XJlVrtiJ X * VJU 

QFEIIAMSTTGDKIIiDTAIiSKGFTQALSVRQIGEKSLFTKELEHALE 
KNEVDLVVHSLKDLPTVLPPGFTIGAICKRENPHDAVVFHPKFVGKT 
LETLPEKR VG P PGTI S 

3130 

A 

2 

496 

H S P SGG SRLGFGAELWAATS S G WNIiWGCWG S E I RHP VAG VTRTKGF S 
SWYSSPAGALQWRRFVFFDKELVKEPLSNDGAAPGATPASGSAASKF 
LCLPPGITVCDSGRGSLVFGDILRLELSFPSPGSRRDRVRMKSVCRR 
GPCRAPLWFSWS SRWLWS VS VW 

3131 

A 

2211 

2568 

GGFJvAEETWSRQTVPITGVCGRYLLLLCWHDL^ 

ILHAPGVQLPLLTASAPAYHPLPSPPHTQTQYQDRQTGDELFQVQDQ 
EPLFLGHLYFKGGREPPAGDSTPE 

3132 

A 

100 

2726 

AMMGPLPGPHLLLDPLLPSSFCLPPGITVCDSCRGSIiVFGDMEGQIW 
FLPRSLQLTGFQAYKLRVTHLYQLKQHNILASVGEDEEGINPLVKIW 
NliEKRDGGNPLCTRIFPAI PGTEPTVVSCLTVHENIjNFMAIGFTDGS 
VTLNKGD I TRDRHSKTQILHKGbTx'PVTGLAFRQAGxCITHL 
VQSYIVSGKDYPRV^LDTHGCGLRCSALSDPSQDLQFIVAGDECVYI* 
YQPDERGPCFAFEGHKLIAHWFRGYLIIVSRDRKVSPKSBFTSRDSQ 
SSDKQII^IYDLCNKFIAYST^FEDVATO^ 
LQEKDTQTKLE^FKKNLFEMAINL/^ 

YSKGNHDGAVQQY IRT I GKLEPS YVI RKFLDAQRIHNLTAYIiQTLHR 
QSL. ANADHTTLLLNC YTKLKDS S KLE\ E F I KKKSES EVHFDVETAI K 
VLRQAGYY\ SHAIxDIjAENHAHHEW YLKI QLED I KNYQEAXiR Y I GKLP 
FEQAESNMKRYGKI LMHHI PEQTTQLLKGLCT * L S AQPRRPQR *GGP 
RIxAGPxXRSFIPIFANK/HPPJ)LKAFLEHMSX^QPDSPQGIYBTLLE 
LRLQNWAHEKDPQVKEKLHAEAI SLLKSGRFCDVFDKALDLCQMHDF 
QDGGLYLYEQGKLFQQIMHYHMQHEQYPAGQSA/CCERHGEQDPSL\ 
WEQALSYFARK\KEDCKEYVAAVXKHIENGNLMPPLLWQTIxAHNST 
ATLSVIRDYLVQKLQKQSQQIAQDELRVRRYREETTRIRQEIQELKA 
SPKIFQKTKCSICNSALELPSVHFLCGHSFHQHCFESYSESDADCPT 
CLPENRKVMDMIRAQGTEPRFSMINSQHQLQVLQ 
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3133 

A 

11 

707 

AKMGAYKYIQB\LWRKKQSDVMRFLLRV\RCWQYRQL\SALHRGSRP 
HR/RPDKVHRLGYKQ\AKQGYVI yrirvrryi *dscsrdrrggrkrp 
\ VS KGATFGKP\ VHHGVNQAKVWFYP FTS VAE \ ERPGRH\ CGALRVL 
NSYWVGEDSHLPNFLGVIPHGSHSHKAIR\RNP*PPQW\ITQTQSHK 
H\REMRGLTSARPERAVGL\GKGHKF\HHTIG\GSRRAAWRRRNTLQ 
APPLPLI 

3134 

A 

1 

333 

DPMS LL YNMNDC YS KLKEL VPS I PQNKKVS KME I LQHVI D YI LDLQ I 
AliDSHPTIVSLHHQRPGQNQASRTPLTTLNTDISILSLQVRPAPGSP 
PRRRTLPRS SGLSLGDP 

3135 

A 

1 

1530 

MAVGKTGYNHSTCSDKYRPSSFIKSSLLDNEKSNRSFQLTLGKGCRE 
KKS S FQGLP FFCVYMSFRGTWTS KLG SRPAHPAG IGCERGRTKPTP 
RARAHQWKRPLVLIDVPPSANGDEGKLQRVAPASADKSRPAGLGLHS 
EPSPVPSAASSAGGSGGLSFRAASSLPVSPSLAVSMKAFSPVRSVRK 
NSLSDHSLGISRSKTPVDDPMSLLYNMNDCYSKLKELVPSIPQNKKV 
SKMEILQHVIDYILDLQIALDSHPTIVSLHHQRPGQNQASRTPLTTL 
NTDIS ILSLQRGAWWLI I IPGGSPEGRRGVFPVAGYPRTWPPGPRH 
LS PAAAPADLCAAGREARSGPNPTPAS PPGEAQTPAARS PPSHGDPR 
WRSDGPGERPRTEAVRAGLWGRPGPFRAPAGHRWRRWPRDPWSRSR 
GAAAPLGGDFPRRQPRCPTPAVEPGRHLLADTRTAALARGLVGLGPG 
HWEPEEQSVGWRRSREKTPGSEDTEDAWEKELAGGPCEGV 

3136 

A 

1 

554 

I LSQVRCQAQLAQQAAGGGLS FRAAS SLPVS P SLAVSMKAFS PVRS I 
RKN/SAIiSDHSLGl\SRSK\TPVD\DPDGALLYNMKDCYS\KLKEL\ 
VP S I P PE QRR * AK WE I L \ QHV I RLT S WD LADS P WDLAFPLiIVQPGIH 
QRTRGRNQ\RSRTPLGPPSTRDISILSFAGLLEFPSELM\SNDSK\A 
LCG 

3137 

A 

193 

561 

GDSGGS PWPDEKPKEG VKTENNDHIN3jKVAG\Q\ DGS WQ\ FKI KRH 
TPLSKLMKAYCE\RQGLSMRQIRFPI*PGNPINETDTPATVGKWKDE 
\DTID\VFQQARPGGVYLKKGTCFLYSQNSVSF 

3138 

A 

3 

251 

NNGGGGGFGVGSGSNFGGGGSNNDFGNYNNQSSNFGPMKGGNFGGRS 
SGPYGGGGQYFAKPRNQGGYGGSSSSSSYGSGRRF 

3139 

A 

1 

1674 

MRDPNTKRSRGFGFNTTYATVEEVDAAMNTTPHKVDGRVVEPKRAVSR 
EDSQRPG AHLTVKKI FVGG I KEDTEEHHLRD YFEQYGKI EVI E IMTD 
RGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVKGHNCEVRKALPKQEM 
ASASS SQRGRRGSGNFGGGRGDG FGGNDNFGRGGNFSGRGGFGGSCG 
GGGYGGSGDGYNGFGNDGSNFGGGGSYSDFGNYNNQSSNFGPMKGGN 
FGGRSSGPYGGGGQYFAKPQNQAARCVAARWLFRTEARLVFLQKFPP 
WAVVEVVTVIVAAPAAATATTRDGGGCSRNCNI PEVFPEIitiGCPJJRR 
G PPGGVREKQQQTNS KS TRRQE I TKTI AELKE I ETRJCTLQK INE S RS 
W F YEKINKVDRLLDRL I KKKRE KS QI DA I KND I GD I VTD P AE I QTT I 
KEYYKRLYANELENLEEMDKFLATYSLHSLNQEEVESLNKPVTSSEV 
EAVTNSLPTKKSPGPDGFTVLLEVIiAPAIRQEKEIKHIPIGREEVKL 
SLFADDVIVYLENPIVSAQNLLKLISNFSKLSGYKINVQKS 

3140 

A 

1 

650 

MSLRKLTIMVEGKGGPNIPHACGCRQRSSIKVSLLPAVTSKSESPKE 
PEQLRKLF IGGLS FETTDESLRSHFEQRRTLTDC AVMRDPNTKCSKG 
FGFVTYATMEEKYHTVNGHSCEARKALSKQEVASASSSQR/GRSGSG 
NFGGGHGGGFGGNDNFGHGENFRGHSSFGGSHGGSGYGGSGDDYSGF 
GNDGSNFGGGGSYNDFGNYSNQSSNFGPM ! 

3141 

A 

579 

2102 

SPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVRFGRDKA 
VKyFIbliAxLiCjAVr oECJj*K*LIAIi*LELCWQRNVIjIi* F* KLiTS* I* 
G * WETGRTF YKRLV * S FLLP YS KLK* QKLLRS DFVLHKLTLFSG\ MR 
DPNTKRSRGFGFVTYATVEEVDAAIWARPHKVDGRVVEPK^VSREV 
SGFFFFFFU^LLGYVLL*T*DSGVF*TYQNFLFEYRLC*SKPMVFLIj 
L \DSQRPGA\ HLTV\ KKI YVGG I KEDTEEHHIi\ RDYFEQYGKI \ EW 
EIH*LDR\GQWPRKRGFAFVTFDDHDSVDKIVSKYQIVAFSKGSTIC 
MAF * TL I PCC I YVFFL VQKYHTVNGHNCE VRKALS KQEMAS AS SSQR 
/ G Y ACCL IKP*R*Ii* VTP V * M I * CLNFM S * GR S G S GNFGGGRGGG FG 
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GNDNFGRGGNFSGR / G YVWF I YM * F * LI/T I FAMKI LQYGNC I QNVTI> 
SPSHT*NLKLFLTGGFGGSRGGGGYGGSGDGYNGFGNDGKFFRNK 

3142 

c 

42 

152 

MIXXXXXXXXXXXXXXXXXXVSPSAFAPXXXXKSLG* 

3143 

A 

3164 

3256 

LLTNCS ISCDPPASASQSAGITGVSHSARP 

3144 

A 

12 

336 

SPVQL* F*LFLVFC*LWSWSAVVYLGPLiGTPSADAHT/ AGLSKTPPH 
WT^ARARLDDVFSLRTFSSHSLNMELVQDLTASAPMYSSTSRDPP/CIi 
GLPKCWDYKREPPRPAH 

3145 

A 

602 

745 

TPASLKIPVE**NTLLAKMVSIS*PRDLPASASQSAGITGVSHRARC 

3146 

A 

269 

484 

DKTS RGTI RQQHS R FTKI RCS AATAADTQANRVW SGPLAN SNRPAAE 
GPRD P PAS AS QS AG I TG VS HRAWP 

3147 

A J 

2 

ODD 

y o o 

AS AS RVAGTTDVHHHAQL I / 1</ YVFLVETG FHRVSQDGTjNTjLTRLHTj 
PKCWDYR/ R* ATTPGLCFVF * KKRNAREGGRQ P PDHKS * FL/VFFFL 
FFLKWSL/DSVSKAGVQWHGLSSLQAPPRGFTPFSCLSLPSSWDYKR 
PLPRPANFFVFL* * \DGGFTVI1ARMVT1I S * PRDLPASASQRAG\ ITG 

T.QHr , 7xWQFPF /r , PPKTn<?MQVT r H7iP\/OP'HTvT\ T.T">QQT.P7i*TPf?T.N*R < ? 

RLSLPSSWDYRSTPPSPANF\CSFSRDGVSPC*PGWSRSSDLVIRPP 
RPPKVPGLQA 

1 1 A ft 

TV 

z 

fen n 

DUO 

PFJQPPP<?T.nc;f?APPnaPP'RCipp7ippp 7iU7A ATWPT.V'PPP'WP phqt.tap 
LPVHWAGTTEPXiPS PRPATGTRERPPLHFPATPRSHS / WRLARPCRQ 
PGPS*DRLFSAGAGPGWSCLDREEKNAGHWLSSAFAI,VWLWI J TFJIW 
IjLIVVGFIGYVKAGSVPYPGCPJXCFFWAVIA^ 
VFPRLHLVPWLALWE 
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VPR T A T KMGP T FT ,DH T GG TP T . F P A MP D T T T /TTsTR FT i T S TR FTG AT 
GRAFL FNKWNLO YS EVODR VMLTGRHMVRDVSC KNCNSKLGWIYEF 
ATEDSQRYKEGRV 

3150 

A 

403 

787 

MGRIFLDHIGGTRLFSCAN\CDTILTNRSELISTRFTGATGRAFIjFN 
im^LQYSEVQDRVMLTGRHMVRDVS \CKNCN\ SKLG\ W I YGVWPLK 
DSQRF*GKGRR*SGER\VLVSEEELRALEEH\VPPDNS 

1 1 R i 

_> -L Z> X 

/A. 

171 

J O -<S 

KNS YKCEECGKI FNE YSHL TAR K/RI YT * EK1.YOYKDCEKAINVCSH 
LTQHQRV/H * KKYYKFNYCQKVFQKI 
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7V 
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X J J 

VHTjIjNPOHMPLiPS THKORRTTT . * KGHAEHIiOYY TjNNOKLiS 

3153 

A 

2 

.634 

WVWE S UmEDLWPCFLRRI LGGS S YSTAF^SGQTLQRAHLREAAS I 
SxxNSQGVWRKSARFSHQLERGTEEQQLQIPDLGQHPSSKAPVFPQVQ 
AHPRFQTGPHGHKLQRNQDPGQLQ* IPMLGQTPWTQATGPTI *TQPS 
GRSPCSHYSGMSTOSGFIvlIxAHSLTQ^ 
PR I SG WTDRQ VFG S VDRDL ISP 

3154 

A 

275 

464 

GKFFSLCETLFPSGVGSFFSSCTMDSSRSxjGVGSKNSHFSLA*lTrDIi 
RLS LC PFPG WKED I KF 

3155 

A 

5904 

7919 

FMFIIWSVCRLITTKYLLLLFLHESL^ 

QTSICIGILLRSAEIxALLLHPVDQANTLKSPVSESVSPVVPDYLPTE 

NGDFLSSKRKQISPJDIimiRSVTVNHMSDlTOSMSVDLSHIPLKDP 

FKSA\ SDTOlxAKRAFPFMD YLSDKHLGKI \ SEDES SGLVYKSG\ SGE 

IGSETSDKKDSFYTOSSSIlxlTx^GFRIYLSFG*VMGNQNILS\STL 

TSKGl^T\lESIFKABDLLPEAASLSENIi\DISKEETPPVR\TLK\S 

QSSLSGESLRERLPHPRLGF\PLVFSYKNMKRSSSQMSF/EATISLD 

CMT\T PPOT.T.PC*nr2CriCWMT7T T? 'WflvrD VX1 C? TT'KTVT? r~"TTi.X?<2.\ FNI 2V. C2 Zi "NT ,C\ 

1^GETSPDAISTOSEGAQENHDD\LMSVVVFKITGVNGEXDIRGEDT 

EICLQVNQVTPDQLGNISLPJIYTjCNRPVGSDQKAVIHSKSSPEISLR 

FESGPGAVIHSLLAEK\NGFLQCHI\F^FSTGFLTSSlaMNIQHFLED 

ETVATVMPMKI QVSN^KINLKDDS PRSSTVSLEPAPVTVHI \ DHLW 

ERSDDGSF\HIRDS\HMxxNTGlTOL\KElWKSDSVLLTSGKTO 

SV\TQATQTSPGVPWPSQSANFP\EFSFGLHLGEQLMEE1^ESLKQEL 

\ AKAKI^1ALAEAHL\ EKDAJLlxHHIKKMTVE 

3156 

A 

149 

382 

ARHCGSCP*SQHF\GRSR*AEHLRTGV*DQPAQRGETPSLWKNTKIR 
RAQWWCPPVI PATSGGLRQENYLNPGRQRLQ 
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3157 

A 

162 

342 

TYSECEDGTYSPEISWHHRKGTKGSEDSPPKHAGNNESHSSRRKNRH 
SKSKVTNGVGKK 

3158 

A 

1 

1995 

FRLVTPGVPAFSGCGRRHRGRTGRRAMASCVGSRTLSKDDVNYKMHF 
RMINEQQVEDITIDFFYRPHTI\TLLSFT\IDSLMYFAFTTDDSVP* 
DNIWRGILSGILLFLIIKV*AFPNGPLTRPHPALWRMVFGLSELYFIi 
FLVFLLFPEFEQAKSLMDWLDPNLRYATREANVMEYAVNCHVITWER 
1 1 S H FD I FAFGH FWG W AMKALL I RR YGLCWT I S I TWELTEL FFMHLL 
PNFAECWVTOQVILDILLCNGGGIWLGMVACRFLERRTDHWASFKDIH 
TTTGKIKRAVLQFTPASWTYVRWFDPKSSFQRVAGVYL\FMIIW\QIi 
T \ ELNTF FLKH I FVFQ ASHPL S WGR I LF \ I GG \ INS FPQLRQ YY AYL 
TD xQCKK/ Li * CjTyCW VFGVIGFI4A \ AX VCIKFGQ\DliF\SKTQXl/lV 
VLWLLCVAFTTFL \ CL YGM I WYAEHYGH PRKRPTPECEDGT \ YS PE I 
SW\DHRKGTKGSEDSPPKHAGNNESHSSR\RRNR\HSKSKVTNGRWE 
JK-Nr»JvJrW vi>Joj\J y lr JrHi-Xilj 1 £»£»V» JvWX^XjxIXA? 1 FKOoCjoKAilKWi\Jrw»KVjt»lJ 
TLGVIILRSLSLCFPTDLGRVQADHRLGGPFANVGSLLTSALDMRSP 
VAARCVERETVAISFHSCQEQLRAMLDRGCSVNIFPSDEALTPWLMD 
WSPVFNFIFYESTFPID 

3159 

C 

410 

538 

MKMDTRIQEQVSSKSSLYSCFLSQPHTNRENTLCLCLLWMALC 

3160 

A 

4 

334 

FFPKALIFCREVGPISPPPKIRFP/SPKFPGNRFSPPRVVKPPPGP/ 
SPFKRPPIRKKILPCQPP*TLAPPRIL*KGPP/PSSSSG*NPYYIiFI 
QQKTE YLLWS RP CPWHCEQR 

3161 

A 

1 

1130 

PLLPTSLSVEDAAILLQQVMRAFSKQASTWFSDTVWSEKF\INDC 
TELFRELMHQKAEKEMKNNPVHLI TEEDLKQI S TLESVS TS KKDKKD 
ERRRKATEGSGSMRGGGGGNAREYKIKKVKKKGRKDDDSDDeSQSSH 
TENEESGYDKKDENILEDVKWGKTSQFLEIFQDIEKARDKTLEADSE 
L.ESVFMSSTTSASGTGRKRTIKDLQEEVSNLYNNIRLFEKGMKFFAD 
DTQAALTKHLLKS VCTD I TNL I FNFLASDLMMAVDDPAA I TSEIRKK 
I L S KL SEETKVALTKLHNSLNEKDQHAIjL VXYQGLVVKQL VSQS KKT 
GQGDYPLNNELDKEQEDVASTTRKELQELSSSIKDLVLKSRKSSVTB 
E 

3162 

A 

20 

2478 

RVCSSSASTASQAVKADAWEEIRRLAADFQRAQFAEATQRLSERNCI 
EXVNRLIAQKQLEWHTLDGKEYITPAQISKEMRDELHVRGGRVNIV 
DLQQ VINVDL I H I ENR I GD 1 1 KS EKHVQL VLGQL \ I DENYL \DR VGQ 
EV1TOK1QESGQVTISELCKTYDLPGNFLTQALTQRLAI 1 1 IGHIDLD 
NRG V I FTEA FVARHRAR IRGL F S AI TR PTAVNSL I S KYG FQEQLL YS 
VLDELWSGRLRGTVVGGRQDKAVFVPDIYSRTQSTWVDSFFRQNGY 
LEFDALSRLGIPDAVSYIKKRYKTTQLLFIiKAACVGQGLVDQVEASV 
EEA I S SGTWVD I APLLPTSLS VEDAAI LLQQVMRAFS KQASTWFSD 
TVWSEKFINDCTELFRELMHQKAEKEMKNNPVHLITEEDLKQISTL 
ESVSTSKKDKia)ERRRKATEGSGSMRGGGGGNAREYKIKKVKKKGRK 
DDDSDDESQSSHTGKKKPEISFMFQDEIEDFLRKHIQDAPEEFISEL 
AE YLI KPLNKTYLEVVRS VFMSS TTS ASGTGRKRTI KDL»QE\ E VSNL 
YNNIRLFGKRGWKFF\ ADDTQGWLLPKHLLEVQCVIilSLNLI \ FNF\ 

X>or oXXjMIA^V w XJJJlroKJrr Xo \r.LiKJMc#X \IjoiSXioJ&JbiX JwAJjXXVXjJINoXj 

NEKSIEDFIFC\LDSAAEACDIMVKRGDKKRERQILFQHRQALAEQL 
KVTEDPALILHLTSVLLFQFSTHSMLHAP\GRCVPQIIAFLNSKIP\ 

"PFin^T ft T .T .VTTV Cid T A A/TTOT A7 Q D Q K" VTnrin T"YV "D \ T.KTKTFT riWRnPDVZi CT 
EtU\£Zln U Li V JV X yvjjj V V J_i V 0 r^Ol\J\ 1 Vjr^kjU X if \ XuNJV j3ti_iiyJ\JCi^dX^ VMO A. 

T\RKELQ\ELSSSIK\DLV\LKSR\KSSVTEE 

3163 

A 

1 

390 

IPSEISRSVAFSL/ljPLIiSVSGLEATORTPKNOVYSRHPPENGKSKF 
IiNCYVSGFHPSDIVGDLLMNGERIE*A*HSDL»AFRKDWAFYLljDYTE 
FTPT/EKDEYACRVNHVTLSQPKICKWDRDKLSASMEA 

3164 

A 

2 

409 

RPRVRPRWSIRAEM^RSVAIAVLALLSLSGLEAIQRTPKIQVYSRH 
PAENGKSNFLNC YVS G FHPSD I E V\ DLLKNGER I EKVEH SDLS FS KD 
WAFYLLYYTEFTP\TEK\DEYACR\VNH\VTIiSQPKI\VKWDRDM 

3165 

A 

26 

299 

WE FTTA W S G TVKQ S G EVXi VNVKEH S RQ I ND I QL S RDM TM FVTAS K 
DNTAKLFDS TTLEHQKTFRTERPVNS AAL S PNYDHVRE PHLPS C 
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3166 

A 

11 

487 

RVTAGMKP ILLQGHERS ITQI KYNREGDLLFTVAKDP IVNVWYSVNG 
ERLGTYMGHTGAVWCVDADWDTKHVLTGSADNSCRLWDCETDNNEP^ 
MKIPCNDSKITSAVWGPLGECIIAGHESGELNQYSAKSGEVLVNVKE 
HSRQINDIQLSRDMTMF 

3167 

B 

198 

282 

XPSLEDLSNS IQKLHLAENAEPEEQSAA* 

3168 

A 

80 

827 

AQQAVLARSKLESLCRELHGHNRSLKEEGVQRAREEEEKRKEVTSHF 
QVTLND I QLQMEQHNERNSKLRQENMELAERLKKIi I EQYELREEH ID 
KVFKHKDLQQQLVDAKLQQAQEMLKEAEERHQREKDFIiLKEAVESQR 
MCELMKQQETHLKQQLALYTEKFEEFQNTLSKSSEVFTTFKQEMEKM 
TKKIKJ<XEKETTMYRSRWESSNKALLEMAEEKTVRD/ERTGGPAGKN 
PTAGEAVPGTADRAQ 

3169 

A 

- 

1 

2511 

MEPELLVRKVSALQALGRGFLVRRQFQSVRAEYEAIVRKVEGDLGTL 
QWTEGRIPRPRFLPEKAKSHQTWKAGDRVANPEQGLWNHFPCEESEG 
EATWEEMVLKKSGESSANQGSLCRDHSSWLQMKQNRKPSQEKTRjDTT 
RMENPDRGFVPGKECSFEQLEHVREMQEKLARLHFSLDVCGEEEDDE 
EEEDGVTEGLPEEQKKTMADRNLDQLLSNVGSCLGALVPGGl^RGGEG 
EGTKPVQDGRPGWEEGLVQSATILDGHSQKLHLAENAEPEEQSAAWP 
QTLSRTVPSQDPPVDAKKPIL1.QGHERSITQIKYNREGDLLFTVAKD 
PSGPLVIEKVRNYSLPLWSVLSVGSTQEEALSRCPQEQCPWLPTSAK 
RPNPRKPLLPGDGEGLMPSQRPMDDKKKKRSPKPCIAQPAQAPGTLR 
RVPVPTSHSGSLALGLPHIiPSPKQRAKFKRASPAGELHAPGGGLWGIj 
G AKHGOLAE S AGRLPLTVTCKAGTEHHL PGS PGHGEIi FDS TTLEHOK 
TFRTERPWSAALSPNYDHVVLGGGQEAMDVTTTSTRIGKFEARRKA 
LQDAVNAIiCATSPPGSQRCPPPITRNGKQLALLKTNSAVRTCGFDFG 
GNI IMFSTDKQMGYQCFVS FFDLRDPSQIDNNEPYMKIPCNDSKITS 
AVWGPLGECI I AGHESGELNQ YS AKAVGEEDEEDEEGES LDSVKALT 
AKLQLQTRRPSYLEWTAQVQSQAWRRAQAKPGPGGPGDICGFDSMDS 
ALEWLRRELREMQAQDRQLAGQLLRLRAQLHRIjKMDQACHLHQELtiD 
EAELELELEPGAGLALAPLLRHLGLTRMNISARRFTLC 

3170 

A 

93 

517 

GRGHLGG/VWC*RSQERAQQIKEASAEIT/CSWLQMKQNRKPSQEKT 
RDTTRMENPDRGFVPGKECSFEQLEHVREMQEKLARLHFSLDVCGEE 
EDDEEEEDGVTEGLPEEQKKTMADRNLDQLLSNVGSCLGALVPGGMR 
GGE 

3171 

A 

3 

348 

FFAFFLGLIPFNAPLPHSTWGAAEHPPESVKSHSLKRMRGAVW/CLR 
GTPTQR/HCSSGSAFSARCSFWIELLRSSRLLSSWSRLRSAIVFFCS 
SGKPLSDPNPLPLHRPPPPHTHPG 

3172 

A 

2618 

5946 

PGCNFSLDVCREEEDDEEEEDGVTEGLPEEQKKTMADRNIjDQLLSNA 
RGRAWQLAPMEPELLVRKVSALQACVRGFLVRRQFQSLRAEYEAIVR 
EVEGDLGTLQWTEGRIPRPRFLPEKAKSHQTWKAGDRVANPEQGLWN 
HFPCEESEGEATWEEMVIiKXSGESSANQGSLCRDHSSWLQMKQNRKP 
SQEiCTRDTTRMENPEATDQRLPHSQPQIiQELQYHRSHXJVMEIiLWIiQQ 
AINSRKEYLLLKQTLRSPEAGPIREEPRVFLEHGEQACERDQSQPSA 
PLEDQSYKRQDHWRARTGGMTPVTGVT<5PHRFPGSFGHYTKKHLLGA 
KFRE PC YS KSG P P S S I PSNSQALGDRLTKGPDDGRQTFGGTCLLQMK 
ILEDQTPRGLKPRNHCPRKSRTQLSALYEDSNIKEMSPRKLDHKEPD 
CRTVRTQELGIjSEDHI I wdggs PAAKRVWYRNGDPFFPGSQLWTQ 
RRFPTMEAFLCEVTSAVQAPLAVRALYTPCHGHPVTNLADLKNRGQY 
VAAGFERFHKLHVFRNGDLVSPPFSLKLSQAMQDWETVIiKLLTEKV 
KLQSGAVCKLCTLEGLPLSAGKELVTGHYYVAVGEDEFKDLPYLELli 
VPSPSLPRGCWYAQGHRAQVTQPSPKEPDRIKPSAFYARPQQTIQPR 
SKLPTLSFPSGVIGVYGAPHRRKETAGALEVADDEDTQTEEPLDQTV 
QGKGTLMATRKQNSCriGKGRMGKGRVLTHSGEMGSGKELTQVASNTK 
QNSSKPGPTCRCQGEVIEKGSTLVEEKGHQKPILLQGHERSITQIKY 
IsmEGDLLFTVAKDPRTRARVNFVIFLNNSSGDTKHVLTGSADNSC\R 
LWDCETGK\QLAbLK\TNSAVRTCGFGLWGATFIMFSNGQSRWGYQC 
FVELFWTWRD S EARFDNNE ALHGRFPCNDS K\ I TS A \ VWG PLGE \ C I 
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IAGPWRVGEIiNQYSAKS\GEVL,VNVKEHS\RQl\NDIPVYSRI)M\TM 
FVTASKDNTAKLFDSTTLEHQKTPRT\ERP\VNSA\ALSPN**QRWF 
WAGGQEAMD/V*PTTLHPGLGKFEARFLSIWPF\EEEFG\RV\KGHF 
GP\INSVAF\HPIX5KS\YSSGGEDG/YTSPIHYFRPHQYFEFEV 

3173 

A 

3 

1131 

ENSPLPKAVRGVRPRTSLIGRTSPIjVTLFCLSWQRIKAAAAALGTAIf 
PRLCYPWLGGPAKQLLRAAPGRLAAKEASVLCRLQGGDQKAESTAGD 
AGAS AVHQRS TDTLS S S VMAS 1 1 ARVGNSRRLNAPLPPWAHSMLRSL. 
GRSLGPIMASMADRNMKLFSGRWPAQGEETFENWLTQVNGVLPDWN 
MSEEEKLKRLMKTLRGPAREVMRVL\ QATNPNLS V\ EIFLRAMKLGV 
FGGV * KQF * L S PWVNFFNTLTKLKGEKAS LYV IR\ LEVQL.QNAIQAG 
1 1 A\ RKMQTGTRLQQLLLGGELSRDLRLRLKDFLRMYANEQERLPNF 
LELIRMVREEEDWDDAFIKRKRPKRSESMVERAVSPVAFQGSPPIVI 
GSA 

3174 

A 

16 

319 

LGTRPAHRSSQ*GKGSPLPPGAPLISRGAGKEGGPGDRAAHADFLPA 
EEGAHGM/RGSHGSPLIAPLPTYPHPPSGLFPLGALPLARPRHQGAA 
LGSVHEIQ 

3175 

A 

2 

458 

NWAGTGHSLQQVTGHDHYNADLKPIDGFNGRFGYRRNTPALRQSTSV 
FGERPDR1WSVLNRKLSAEELERKRQEMMENAKWREEERLNILKRHA 
KDEEREQRLEKLDSRDGKFIHRMKIiESASTSSLEDRVKRNIYSLQRT 
SVALEKNFMKR 

3176 

A 

91 

521 

SLISQKKMANSSPVLSKVPGYGLQVRNSDRNQGLQGPLTAEQKRGHG 
MKNHSRSRSSSHSPPRHASKKSTREAGSRDRRSRSLGRRSRSPRPSK 
LHNSKVI^UIETGQTRSPSPKJCEVYQRRHAPGYTRKLSAEELERKRQE 
MM 

3177 

A 

3 

1636 

R PSEATWGMAYLS ECRLRLEKG F I LDGVAVS TAARAYGRS RPKLW S 
AIPPYNAQEDYHARSYFQSHWPPLLRKTDQSLRTLVDCVFSMGNAS 
DYAWIiASHTTPRGMGKPRGHSLQQVTGHDHYNADLKPIDGFNGRESG 
TSGNVATRSSQNETSTAKASTAEPGPLQNLQVTPTSQSATEPCTSSF 
RDADAMCERLLEENLAIVFIDILPSHACIjGYGRJ^KKSWHPQTLRNVE 
KVWKAEQKHEAERKKI EELQRELR\ KERAREEMQR YAEDVGAVKTRK 
KEEKLDLDVPGSWVTOG*TVNEYLLGRPIDKYVFEKMEEKEAGCSSET 
GLLPGS I FAPSGANSLLDMASKIREDPLFI IRKKEEEKKREVLNNPV 
KMKKIKELLQMSLEKKEKKXKKEKKKKHKKHKHRSSSSDRSSSEIDEH 
SAGRHNSKV\KGERQAKLGAHHLKKRSTKGDMLPDTPENSLQRN*SE 
NGQEMMENAKWREEERLNILKRHAKDEEREQRLEKLDSRDGKFIHRM 
KLGECILLSSLGGSGLKRISYSFYRELR 

3178 

A 

11 

301 

NVDLLIPNSSITLLSDRPASILLRTFIFSSKFSWYPPLLFILNLTL 
NHQQEKTKGKALMNWQKKKPTFFKQNLAES PKRHGHCIGG * WTEGAN 
HD 

3179 

A 

915 

2482 

S LEGLVPAHRVAVAVSCS P WGKRRTATS VXiRGCRAAFG PGS LAGVKF 
SMNKRGKYTTLNLEEKMKVLSR I EAGRSL KS VMDE FGIS KSTFYD IK 
KNKKVI LDFVLEPDMPL I GAEKRKRTTGAKYGDVDDAVYMWYQQKRS 
AGVPVRGVELQAAA*RFARCDRRTDLIASTGWLFRLPNRHAIGNRIG 
CGEQVLSSVSENVEPFRQKLSMIIKEEKLCLAQLYSGDETDLFWKSM 
PENSQASRKD I CLPGKKI NKERLS AFLCANADGTHKLKS 1 1 IGKSKL 
PKRVKEDTSTLPVIYKPSKDVWFTRELFSEWFFQNFVPEVRHFQLNG 
LRIHDEDVRALLLLDSCPAHPSSESLTSEDGRIKCMFFPHNTSTIilQ 
PD\DQGVIL,SCKRLYRWKQLEESLVIFEESDDEQEKGDKGVSKIKIY 
N I KSAI FNWAKS WEEVKQ I T I AHAWENLL YKKE PE YDFQGLEHGE YR 
EIS*KTGGSWNLSWMKIGCWLNGDEEKGCLLKTKGGITKGSCSKRRR 
S*EADC 

3180 

A 

417 

736 

YGLGAVAHACNPSTL<5GQAGQITRIiAVQDQPDQHCETPSIiIKMQKN/ 
LPGHGGVCLQSQLLRGLRQGNCLNQGGGSCSEPRSHQCTPAWAIE*E 
SVSKKKKKKKLW 

3181 

A 

1 

2061 

MGAVRDEXiHSLWPPLMGRGAQTPMAAAALRDPAQGCVTFEDVTIYF 
S QEEWVLLDEAQRLL YCDVMLENFAL IAS LGL I S FRSH I VS QLEMGK 
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EPWVPDSVDMTSAMARGAYGRPGSDFCHGTEGKDLPSEHNVSVEGVA 
QDRS PEATLC PQKTCPCD I CGLRLKD I LHLAEHQTTHPRQKPFVCEA 
YVKGSEFSANLPRKQVQQNVHNPIRTEEGQASPVKTCRDHTSDQLST 
CREGGKDFVATAGFLQCEVTPSDGEPHEATEGWDFHIALRHNKCCE 
SGDAFNNKSTLVQHQRIHSRERPYECSKCGIFFTYAADLTQHQKVHN 
\KENPYECFEGGKFFKPAL\SLVKHRRVHTGESPHVCGDCGKFFSRS 
SNIilQHKRVHTGEKPYECSDCGKFFSQRSNLIHHKRVHTGRSAHECS 
ECGKSFNCNSSLIKHWRVHTGERPYKCNECGKFFSHIASLIQHQIVH 
TGERPHGCGECGKAFSRSSDLMKHQRVHTGERPYECNECGKL.FSQSS 
SLNSHRRLHTGERPYQCSECGKFFNQSSSLNWHRRLHTGERPYECSE 
CGKTFRQRSNLRQHLKVHKPDRPYECSECGKAFNQRPTLIRHQKIHI 
RERSMENVLLPCSQHTPEIS\LRTDt»IRALSSTMLNLVHPSTHTGEV 
P*EC*LCWKJOSGDKLHSLTVEFISVFSLLGIi 

3182 

A 

441 

1074 

QRVQRSLKK*CWG/FNWGHSGGKI*KQIGPYLNTG/TRLKSKWIIVP 
QNCIISRRQQGKRV\HLGLGKDFLNPTPKAQSM/RGKIDKLYFIKIE 
NFS LKGT I RRTKRQAVE WQK/ 1 YLQNI SDK* VASRLHKEVS KLNKVK 
PIVDLMRCF/RPELAASGPCSLSFSICFQH*IQNVLLEIQVRSHQPS 
ELSNRRPWHSKVHNSLQCWAKHEAPLFSIiL 

3183 

C 

1 

900 

MGHVSSDDGHSDEDELRKSLSELVDDKFGTGTKKVTRILDGGNPFLD 
VPQALSATTYKHGVLTRKTHADMDGKRTPRGRRGWKKFYAVLKGTIL 
YLQKVRDCPRDLTHPGASDYSKKSNVLKLKTADWRVFLFQAPSKEEM 
LSWILRINLVAAIFSAPAFPAAVSSMKKFCRPLLPSCTTRLCQEEQL 
RSHENKLRQLTAELAEHRCHPVERGIKSKEAEEYRLKEHYLTFEKSR 
YETY IHLLAMKI KVGSDDLER I EARLATLEGDDPSLRKTHS S PALSQ 
GHVTGS KTTKDATGPDT* 

3184 

A 

178 

490 

VSSTVFGNTVRNKTEDDSGGNWEPPRTILQASGKLKNPKARIiASECL 
S PFDSTGKAIHLCQVCGRASRSHYSLGHH/MRHSHVGE * S * KCPGCG 
KGFSTKLDLRW 

3185 

A 

7 

310 

KKR SHCAQAGLELLSGQLG * TS S VQKI RKLAGCGGVHP WS QL.LERLR 
WEDS\liPQGGQGCSEP*SHHCTPAWATE*DPVSNNNKQKQKPKESNL 
SRATSRP 

3186 

A 

65 

203 

KGDFQGHQGPWDAPP*PAFWWQREARGEGEAMSQSPCRGRGNAEMYR 
PS AQPETGPASG * *TEAARGSSGWAEPWPRG * KPPEFPMRLGWPFPT 
PGAFSLPILCDEERP*GQRSSGCESQQELRSPRETELQVTSPTCFSA 
GPRRPGPAPGGPQAARAYCCHLKTSSEPTFHQAPTQAEVSWEFSPA/ 
PPADKEMTSMATPEKCPTSQ\APAIRKAALPHPERGNYFLSAAWPQG 
APFPLWGSASPFVALPWSPRAGAAPLCPGKAPPSKTRLGRAVTFP*A 
EGG TG RGRGHVAVPL P WERQRGNV 

3187 

A 

210 

951 

ARSCRAALSEASAGRPLPPFPGRAKETCRAAAMEAIWLYQFRLIVIG 
DSTVGKSCLIRRFTEGRFAQVSDPTVGV\DFFSRLVEIEPGK\RIKL 
QIW\DTAGQERFRSITRAYYRNSVGGLLLSDITNRSSFQNVHASWQE 
PKLHVQP YQ I VLVLVGH\ KCDLDTQRQVTRHEAEKLAA\ D TAW KYI E 
TSARDAINV/EESPSQT*PRDIYELVKRGEITIQEGWEGVKSGFVPN 
WHSSEEWKSERRCLC 

3188 

A 

1 

2559 

MPLRAFNPVS S FRWARGMTI VAALMTVFF IMQLVGQVPAALWVI FGE 
DRFRWS ATM I GLS LAVFG I LHALAQAFVTG PATKRFGEKQA 1 1 AGMA 
AD ALG YVLLAF ATRG WMAFP I M I LLAS GG I GMP ALQAMLS RQPNFDH 
WTADRHGDLCRLGEHMERVGMDCRRRPIPCLPPRVASRCMEPGHLDL 
NGSRRHIiANGFTTPRIGANQFLRRTVNAQTNPWQNI S I AS AI S SSRT 
itn ±Z)\jz>vvjz>yi fKi>Vs£HAa v £> AJLjHH YVPDIjHRRMLLATLWNT Y I C INEA 
LALTLSDF SLVP PHP YRQLFTLTTFHNRACS S SVTR I VS I LSRF IG I 
ITPMNRNPPYTEASV^KQEKTALNMARFIRSQTLTLLEKLNELDADE 
QAD I CESLHDHADELYRSCLARFGDDGG ATQAFAKENNQKAYKETYG 
VSHITRHDMLQI PKQQQNEKYQVPQFDQSTI KNIESAKGLDVWDSWP 
LQNADGTVAEYNG YHWFALAGS PKDADDTS I YM FYQKVGDNS IDSW 
KNAGRVFKDSDKFDANDPILKDQTQEWSGSATFTSDGKIRLFYTDYS 
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GKHYGKQSLTTAQVNVS KSDDTLKINGVEDHKTI FDGDGKTYQNVQQ 
FIDEGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQGBBSL 
FNKAYYGGGTOFFRKESQKLQQSAKKRDAELANGAIiGIIELNNDYTI, 
KKVWKPLITSNTVTDEIERANVFKMNGKWYlfFTDSRGSKMTIDGINS 
NDIYMLGYVSNSLTGPYKPLNKTGLVLQMGLDPNDVTFTYSHFAVPQ 
AKGNNWITSYMTNRGFFEDKKATFAPSFLMNIKGNKTSWKNSILE 
QGQLTVN 

3189 

A 

259 

3222 

NFRLVGQTSVDRLLQLSQAQAVKGNQLLPVSLVKRKTTLAPNTQTAS 
PRALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLL 
QLS QGQAVKGNQLL P VS LVKRKTTLAPNTQTAS PRAJLAD SLMQt»ARQ 
VSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLL 
PVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQDFADFG 
TTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPN 
TQTASPRALADSLMQLARQVSRLESGQSCSTPPPNTDKQSKEKCPMK 
FTFQQVETNMTGDAVGMDDI PQEARQYRHNQAYAYS IQGDGAEDDDB 
RIVRFHTRVTVDSDTLASDAARLTCRHGLGNQILNVMAGESGCEENC 
GNGGGGNKGNRNKRKQIRKRRKRAEPRKGPTKRGTPPTPTEARRARV 
ARSEESTETPERRHQQRRTGDPQAPGARPCTAMRTTRNETQEKTLRA 
PTKPAKHKGGTESGKKHLAQTPRATAWTAYKRKTRKTSFSNLGLEGQ 
TRFKRTKDDQHQRTPRVYAPQRAKRRARTGDTPKHLKRAYRVIPFW 
TPTRDSADMIGPPKDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAV 
KGNQLLPVSLVKRKTTLAPNTQTASPRALADSIiMQLARQVSRLESGQ 
WKLRLLKAVKEKKAQFYPALIHICARLMKTraRRFRMAIKDGFIENMF 
FVSANPWVS FTSFDLNVANMDNFFAPVFTMGKYYTCK3DKVLMPLAIQ 
VHHAVCDGFHVGRMLNELQQYCDEWQGGADFADFGTTIKQDFRLLGQ 
TSVDRLLQLSQGQAVKGNQbLPVSLVKRKTTLAPNTQTASPRALADS 
LMQLARQVSRliESGQDFADFGTTIKQDFRLLGQTSVDRIiLQLSQGQA 
VKGNQLLPV SLVKRKTTLAPNTQTAS PRAIiADSLMQLARQVSRLES G 
Q 

3190 

B 

114 

2876 

MDKFLDTYTLPRLNQEEVES LNRP ITGSE I EAI INSLPTKKS PGPDG 
FTSEFYQRYKETWCRLSPPSFNGGLPLGSDLVFYAPGPQKIIWVMV 
HVGALALKDGFS PLDVGVHGFCRFRNHHQTGFS PAGANQRDPLAATL 
SGPGGEGQSAVARLTGEKKNHPGAQYANRLSPRVGRFINAAGTTGFP 
TGKRAVSATQLMVKDDVTYAIKLSCWPGLDIIPSCLALHRIETELMG 
KFDKGRLPTHPHLMLRLAIETVAHDYDVIVIDSAPl^GIGTINrWCA 
AD VL I VPTP AELLD YTS ALQF FDMLRDLLKNVDLKGFE PD VR I LLTK 
YSNSNGSQSPWMEEQIRQAWGSMVLKNVEFCRFRTPHQPGFS PAGAN 
QRGPLAATLSGPAGEGQSAVARLTGEKKNHPGAQYANRDSPRVGRFI 
NAAGTTGFPTGKRAGIREISDIiMIIGETVITRSPTGTRSPVGQFLAV 
WGGKLPSPAFGSGSRDF/UDFGTTIKQDFRIiLGQTSVDRLLQLSQGQA 
VKGNQLLPVSLVKRKTTIiAPNTQTASPRALADSLMQLARQVSRLESG 
QAWSLCTPIMWFYPRAFSRLRQLTRYSSSGGVAGCHSGKPSRKLSTL 
GFASLVQHSTLSRATLLIYKGFCRFRNPHQTGFSPAGANQRGPLAAT 
LSGPGGEGQSAVARLTGEKKNHPGAQYANRLSPRVGRFINAAGTTGF 
PTGKRAVSATQLMDFADFGTTIKQDFRLLGQTSVDRIiLQLSQGQAVK 
GNQLLPVSLVKRKTTliAPNTQTASPRALADSLMQLARQVSRLESGQD 
FADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKT 
TLAPNTQTASPRALADSLMQLARQVSRLESGQLAGFPRQALNRGLPb 
G FRFSALRHLDPKKLDLGDG SRSG P S P * 

3191 

B 

1 

2640 

ME IRGRVEQRVGYT I EQINHMRDVFGTRLRRAEDVFPPVTGVAAHKG 
GVYKTSVSVHLAQDLALKGLRVLLVEGNDPQGTASMYHGWVPDLHIH 
AEDTLL P F YLGEKDD VTYA I KPTCWPGLDI I PSCLALHR I BTELMGK 
FDEGKLPTDPHLMLRLAIETVAHDYDVIVIDSAPNLGIGTINWCAA 
DVLIVPTPAELFDYPSALQFFHMLCDLVKNVDLKGFEPGELAPLYGC 
WGLRTGIAVAC^PHHASLNIJIGIJ^QIRNVKKQSWLMNLRKSGTLGH 
PGSVDETTYERLAEESLDSLAEFFEDLADKPYTFEDYDVSFGSGVLT 
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GKLGRDLATYVINKQTPNKQIWLSSPSSGLKRYDWTEKNWVYSHDCV 
SLHDVXiAAELTKALKTKLDLSSIAYSGKEI^AQPGYYMLHAQYMLRP 
PLSSPMQHRAPETGPTSRNSVLNAVTPPAPTTPPSPLFDSRGSPRRK 
TYQNVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTENG 
YQGEESLFNKAYYGGGTNFFRKESQKLQQSAKK3^AELANGALGIIE 
LNNDYTLKKVMKPL ITSNTVTDE I ERANVFKMNGKWYLFTDSRGSKM 
TIDXGSPXKIRLFYTDYSGKHYGKQSLTTAQVWVSKSDDTLKINGVE 
DHKTIFDGDGKTYQNVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYL 
VFEANTGTENGYQGEESLFNKAYYGGGTNFFRKESQKLQQSAKKRDA 
ELANGALGIIELNNDYTLKKVMKPLITSNTAAANQESSHVQQQALAIj 
EQQFLERTQALE AQ I VALERMRAADQTTAKQGM CTQVHI/TNTS RHAG 
RPSKCQFCSTRRQRHRLSDPKQDLGNRRFSPPP* 

3192 

A 

1 

454 

MERRNRRTGRTEKARIWEVTDRTVRTWIGEAVAAAAADGVTFSVPVT 
PHTFRHS YAMHMLYAGI PLKVLQSLMGHKS I S S TE VYTKVFALD VAA 
RHRAIR\VPRQQGDYRTRIWKFEDGLSm^LVIQLNKLIICVMCLVRD 
CDVLKTYFHL 

3193 

A 

2758 

2946 

DGQQ1»I ALHRLAIiRELQQA VHAGL PQQAKI h FDGGS E IG KI PG VP PT 
PAKANNRRGYNPPFGL 

3194 

A 

2372 

3570 

EALLPGDQDSQSGKGVAAREVWFLPSSFAPVLLRLVGNHHVGDNSID 
SWKNAGR/VFKDSDKFDANDPILKDQTQEWSGSATFTSDGKIRIiFYT 
DYSGKHYGKQSLTTAQVNVSKSDDTLKINGVEDHKTIFDGDGKTYQN 
VQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQGE 
ESLFNKAYYGGGTNFFRKESQKLQQSAKKRBAELANGALGIIELNND 
YTLKKVMKPL ITSNTVTDE I ERANVFKMNGKWDFAD/ SRNHHQTGFS 
PAGANQRGPLAATL SG PGGEGQS AVARLTGEKKNHPGAQYANRIiS PR 
VGR F INAAGTTG FPTGKRAVS ATQLMDFADFGTT I KQDFRLLGQTS V 
DRLLQLSQGQAVKGNQLLPVSLVTCRKTTLAPNTQTASPRALADSLMQ 
LARQVSRLESGHNYSKGQSKVSFKTRQVPSAYEPVKSKAGGCFRVAP 
NRAFKSRRDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLL 
PVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLiESGQDFADFG 
TTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPN 
TQTASPRALADSLMQLARQVSRLESGQIVRSATGQLIHNWTQRTQGF 
S PQP AGNPKRQ VFSDAS LEVAKVRY P LST I GGP P SRGPGLPKWEGRG 
CKGSLVSPVFFCTCTFALGWKSSRRRQLNRQLEKRGPFFKDSDKFDA 
NDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVN 
VSKSDDTLKINGVEDHKTIFDGDGKTYQNVQQFIDEGNYTSGDNHTL 
RDPHYVEDKGHKYLVFEANTGTENGYQGEESLFNKAYYGGGTNFFRK 
ESQKLQQS AKKRDAELANGALG 1 1 ELNND YTLKJO/MKPL ITSNTVTD 
EIERANVFKMNGKWDFADFGTTIKQDFRLLGQTSVDRLIiQLSQGQAV 
KGNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQ 
DFADFGTTIKQDFRLLGQTSVDRLLQLSQGQANHLMTQFSEPVIillE 
QRTTDEE 

3195 

B 

1 

2232 

MKLMETLNQCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAAD 
L VG VS S QAI RD AEKAGRLPHPDME I RGRVEQRVG YT I EQ INHMRD VF 
GTRLRRT^EDVFPPVIGVAAHKGGVYKTSVSVHLAQDItALKGLRVLLV 
EGNDPQGTASMYHGWVPDLHIHAEDTLIiPFYLGEKDDVTYAIKPTCW 
PGLD HPS CLALHR I ETEFMGKFDEDNNAGMVR VLQPFTSDEKKS V I 
STLTFPGKHLRRDKSLTTAQGCDKINNKTSPEPTSPDIiALKGLRVLL 
VEGND PQG T AS M YHG WVPDLH I HAEDTLLP F YLG EKDD VTYAI KPTC 
WPGLD HPS CLALHR I ETELMG KFDEG KL PTD PHLMLRLAI ETVAHD 
YDVIVIDSAPNLGIGTINWCAADVLIVPTPAELFDYTSTOiQFFDML 
RDLLKNVDLKGFEPDVRILLTKYSNSNGSQSPWMEEQIRDAWGSMVL 
KNVEPENGEVGKXGSPQLINQEKKMFRIRNRSYIDRDSEYLLQENEP 
DGTLDQKLLEDLQKKKNDLRYIEMQPFPLVSSSRWVVKRGELTAYVE 
DTVLFSRRTSKQQVYFFLFNDVLIITKKKRRSDSSLREGDNQKAYKE 
TYGVSHI TRHDMLQ I P KQQQNEKYQVPQFDQST I KN I E S AXGLDVSD 






AGMVPVLQPLHLTEKSGYSTLTIPWLRQTSLTQRKVMCQNLVTHLK 
S TEWEKHKTI VDGERKTYQKRS AVFRMKGI LHS ATTIR * 

3196 

A 

1260 

1340 

LPMALAVGFCRFRNHHQTGFSPAGANQRGPLAATLSGPGGEGQSAVA 
RLTGEKKNHPGAQYANRLSPRVGRFINAAGTTGFPTGKRAGFCRFRN 
HHQTGFSPAGANQRGPLAATLSGPGGEGQSAVARLTGEKKNHPGAQY 
ANRLS PRVGRF INAAGTTGFPTGKRAGFCRFRNHHQTGFS PAGANQR 
GPLAATLSGPGGEGQSAVARLTGEKKNHPGAQYANRLSPRVGRFINA 
AGTTGFPTGKRAQAQAK*WHYREPLLLRAQIjTNPPSGQQLKSVDQWP 
VQKQFSSGSDLYQLHPFHVQHFLEPCFPRHPEFAPPSTGTESHYYCW 
IW 

3197 

B 

200 

2602 

MQLARQVSRLESGQVKHEGQHTKMSQVLWRSQEENPATLGNNIQLFD 
WI PQNDLLGH PKTKAF I THGGTNG I YEAI YHGVPMVG VPMFADQPDN 
IAHMKAKGAAVEVNLNTMTRPGGEGQSAVARLTGEKKNHPGAQYANR 
LSPRVGRFINAAGTTGFPTGKRAVSATQLMDFADFGTTIKQDFRLLG 
QTSVDRLLQLSQGQAVKGNOLLPVSLVKRKTTLAPNTQTASPRALAD 
SLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQ 
A\nCGNQLDPVSLVKRKTTLAPOTQTASPRALADSLMQLARQVSRLES 
GQILPDFGPHHQTEFSPAGANQRGPLAATLSGPGGEGQSAVARLTGE 
KKNHPGAQYANRLSPRVGRF INAAGTTGFPTGKRAVSATQIiI PKVDK 
TTKI GKKQS RKTGNS KNQS ASP PPKES S S SHATEQNWTENBFDELRE 
EGFKRSNCSELQEEVQTHGKEVKNLEKKLDEWLTRITNEEKPLKDLM 
ELKTKAQELRDKCTSLSSRFNQLEERVSVMEDQMNEMKREEKFREKR 
IKRNEQSIiQEIWDYVKRPNLRIiIGVPESDGENGTKLENTLQDIIQDF 
ADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTT 
LAPNTQTASPRALADSLMQLARQVSRIiESGHQAPCMKSNNAIilVILG 
TVTLDAVGIGLVMPVLPGLLRDIVHSDSIASHYGVLLALYALMQFLC 
APVIJUU^SDRFGRRTVLLASLLGATIDYAIMATTPVLWIYPLVNSPS 
C* 

3198 

B 

1 

3198 

MKLMETLNQCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAAD 
LVGVSSQAIRDAEKAGRLPHPDMEIRGRVEQRVGYTIEQINHMRDVF 
GTRLRRAEDVFPPVIGVAAHKXSPVIGVAAHKGGVYKTSVSVHLAQD 
LALKGLRVLLVEGNDPQGTASMYHGWVPDLHIHAEDTLLPFYLGEKD 
DVTY A I KPTCW PGLD 1 1 PS CLALHR I ETELMGKFDEGKL PTD PHIjML 
Rl^IETVAHDYDVIVIDSAPNLGIGTINVVCAADVIilVPTPAELFDY 
TSALQFFDMLRDLLKNVDLKGFEPDVRILLTKYSNSNGSQSPWMEEQ 
IRDAWGSMVLKNVVRETDEVGKAVINHKKRKNSPRIVQSNDLTEAAY 
SLSRDQKRMLYLFVDQIRKSDGTLQEHDG ICE I HVAKYAE I FGLTSA 
EAKI)PRGVYKTSVSVHLAQDLALKGLRVLLVEGiaDPQGTASMYHGWV 
PDLH I HAEDTLLPFYLGEKDD VTYA I KPTCW PGLD HPS CLALHRI E 
TEIiMGKFDEGKLPTDPHLMLRLAIETVJ^HDYDVIVIDSAPNLGIGTI 
NWCAADVLIVPTPAELFDYTSALQFFDMLRDLLKNVDLKGFEPDCI 
NAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAADLVGVSSQAIRD 
AEKAGRLPHPDME IRGRVEQRVGYTI EQINHMRDVFGTRLRRAEDVF 
PPVIGVAAHKGGWKTSVSVHLAQDLALKGLRVLLVEGFIENMFFVS 
ANPWSFTSFDLNVANMDNFFAPVFTMGKYYTQGDKVLMPIiAIQVHH 
AVCDG FHVGRMLNELQQYCDEWQGGAD YRVLVGELDDEQMAALS RDG 
NDYRPTSAYERGQRYASRLQNEFAGNISALADAENISRKIITRCINT 
AKLPKSWALFSHPGELSARSGDALQKAFTDKEELLKQQASNLHEQK 
KAGVFIRDKLMERRl^RTGRTEKARIWEVTDRTVRTWIGEAVAAAAA 
DGVTFS VPVTPHTFRHS YAMHMLYAGI PLKVLQSLMGHKS I SSTEVY 
TKVFALDVAARHRVQFAMPESDAVAMLKQLS * 

3199 

A 

1 

2259 

MKLMETLNQCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAAD 
LVGVSSQAIRDAEKAGRLPHPDMEIRGRVEQRVGYTIEQINHMRDVF 
GTRLRRAEDVFPPVIGVAAHKGGVYKTSVSVHLAQDLALKGYVFCSW 
KCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAADL.VGVSSQA 
IRDAEKAGRLPHPDMEIRGRVEQRVGYTIEQINHMRDVFGTRLRRAE 
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DVFPPVIGVAAHKGGVYKTSVSVHl^QDLALKGLRVLLVEEPPSAPN 
LQIGTINWCAADVIilVPTPAELFDyTSALQFPDMLRDLLKNVDLKG 
PEPDVRILLTKYSNSNGSQSPWMEEQIRDAWGSMVLKNWRETDEVG 
KAAPMWSLIARVGVMARGNAITLPVCGRPVKFTLEVLRGDSVEKTS 
RCDNLKTCHTSHGSVMAETAVINHKKRKNSPRIVQSNDLTEAAYSLS 
RDQKRML YL F VDQ I RKS DGTLQEHDG I C E I HVAKYAE I FGLTS AEAS 
KDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGL 
YSVHINPYLI PFF IGLQNRFTQFRLSETKE ITNPYAMRLYESLCQYH 
PLTIEKVMKPLITSNTVTDEIERANVFKMNGKWYLFTDSRGSKMTID 
GINSNDIYMLGYVSNSLTGPYKPLNKTGLVLQMGLDPNDVTFTYSHF 
AVPQAKGNNWITSYMTNRGFFEDKJCATFAPSFLMNIKGNKTSDKLP 
H 

3200 

A 

1 

2418 

MAETAVINHKJO^KNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIR 
KS DGTLQEHDG ICE IHVAKY AE I FGLTS AEASKD I RQALKS FAGKEV 
VFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIG 
LQNRFTQFRLNFVQKSRLVDLALKGLRVLLVEGNDPQGTASMYHGWV 
PDLH IHAEDTLLP F YLGEKDDVTYAI KPTCWPGLDI I PSCIiALHR I E 
TELMGKFDEAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMLYLF 
VDQ I RKSDGTLQEHDG ICE I HVAKYAE I FGLTS AEAI ABYNGYHWF 
ALAG S PKDADDTS I YM FYQKVGDNS IDS WKNAGR VF KD S D KFD AND P 
ILKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSK 
SDDTLKINGVEDHKTIFDGDGKTYQNVQQFIDEGNYTSGDNHTLRDP 
HYVEDKGHKYL VFEANTGTENG YQGE E S L FNKAYYGGG TNF FR KE S Q 
KLQQSAiaaU3AELANGALDLTTAQVNVSKSDDTLKINGVEDHKTIFD 
GDGKTYQNVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTG 
TENGYQGEESLFNKAYYGGGTNFFRKSSQKLQQSAKKRDAELANGAL 
GIIELNNDYTLKKVMKPLITSOT^TITDEIERAl^FKI^GKWYLFTDSR 
GSKMTIDGINSNDIYMLGYVSNSLTGPYKPLNKTGLVLQMGLDPNDV 
TFTYSHFAVPQAKGNNWITSACGAESTFYVEDPPGWSQTPGLKRSS 
CLGLPKC 

3201 

A 

1907 

5097 

TSKKI VKQAPVLTFTTA/ LLAGGAI QAFAKENNHKAYKETYGVSH I T 
RHDMLQ I PKQQQNEKYQVPQFDQSTI KNI ESAKGLDVWDS WPLQNAD 
GTVAE YNGYHWFALAG S PKDADDTS I YMF YQKVGDNS I D SWKNAGR 
VFKDSDKFDANDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYG 
KQSLTTAQVNVSKSDDTLKINGVEDHKTIFDGDGKTYQNVQQFIDEG 
NEGILPISEPPIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLV 
KRKTTIiAPNTQT AS PRALAD S LMQLARQVSRLES GQD FADFGTT I KQ 
DFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTAS 
PRALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLL 
QLSQGQAVKGNQLLPVSLKPYACQIPGCTKRYTDPSSLRKHVKAHSS 
KEQQARKKDHNS S PAREHNWTANEFDELTEVG FRED FAD FGTT I KQD 
FRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASP 
RALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQ 
LSQGQAVKGNQLLPVSLVKRKTTLAPNTQTAS PRALADS LMQLARQV 
SRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLP 
VSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQDFADFGT 
TIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNT 
QTASPRALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSV 
DRLLQL S QGQA VKGNQLL P VS L VKRKTTLAPNTQTAS PRALAD S LMQ 
LARQVS RLES GQD FAD FGTT I KQDFRLLGQTS VDRLLQL S QGQAVKG 
NQLLPVSLVKRKTTLAP1TOQTASPRALADSLMQLARQVSRLESGQDF 
ADFGTT I KQD FRLLGQTS VDRLLQLS QGQAVKGNQLLPVS LVKRKTT 
LAPNTQTASPRALADSLMQLARQVSRLESGQ 

3202 

A 

891 

2136 

GATQAFAKENNQKAYKETY\ESAKGLDVWDSWPLQN7U)GTVAEYNGY 
HWFALAGSPKDADDTS I YMFYQKVGDNS IDS WKNAGRVFKDSDKFD 
ANDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQV 
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NVSKSDDTLKINGVEDHKTI FDGDGKTYQNVQQF IDEGNYTSGDNHT 
LRDPHYVEDKGHKYLVFEANTGTENGYQGEESLFNKAYYGGGTNFFR 
KESQKLQQSAKKRDAELANGALGI I E LN1TO YTLKKVMKPL I TSNTVT 
DEIERANVFKMNGKWYLFTDSRGSKMTIDGINSNDIYMLGYVSNSLT 
GPYKPLNKTGLVLQMGLDPNDVTFTYSHFAVPQAKGNNVVITSYMTN 
RGFFEDKKATFAPSFLMNIKGNKTSVVKNSILEQGQLTVN 

3203 

A 

1 

3987 

MILKPKDGNIEEVNSLRKQRAGSSSSKMNSLGFCRFRNHHQTGFSPA 

GANQRGPL»AATLSGPGGEGQSAVARLTGEKKNHPGAQYANRLSPRVG 

RFINAAGTTGFPTGKRAVSATQLMDFADFGTTIKQDFRLLGQTSVDR 

LLQLSQGQAWKIPKT I FDGDGKTYQNVQQF I DEGNYTSGDTHTLRDP 

HYVEDKGHKYLVFEANTGTENGYQGEESLFNKAYYGGGTNFFRKESQ 

KLQQS AKKRDAELANGALG 1 1 ELNND YTLKKVMKPL I TS NTVTDE I E 

RANVFKMNGKWYLFTDSRGSKMTIDDTSLSTPAAPMVDSLIARVGVM 

ARGNAITLPVCGRDVKFTLBVLRGDSVEKTSRVWSGNERDQELLTED 

ALDDLIPSFLLTGQQTPAFGRRVSGVIEIADGSRRRKAAALTESDYR 

VLVRKLFFLVHCSELSTILVGNIPAMVPNLLILPLHCYDIVHPKRSE 

VIS PS YR I L YHHEKED VGCL I DVN I LDLNGLQP VLQS QS I S LVQQS E 

FSLEVTQ YQEREPVLVRAS PS CS VDVTERAGS SLQS PLLS ADLTTAQ 

TPSEFLALTEDSVSEITOAFSKSGSMASHCDNFGHNDSTSQMSLNSAA 

VTKTTTELTGTVPCAIMSPGKYRLEGQCSTESNSLPETSAANQKEVL 

LQIAELPATSVSHSESNLPADSEREENIGLQNAACSWGRGDTSTPW 

TPTGTMLGQTRSQNSTGSFPRPAIATTYLYVCSRSRALQSAELEKTT 

LKF I WS QKRAC I VKS I LS QKNKAGG ITLPDFKL YYKATVTKTAWDFA 

D FGTT I KQD FRLLGQTS VDRLLQLS QGQAVKGNQLLP VS L VKRKTTL 

APNTQTASPRALADSLMQLARQVSRLESGQAIKNHSLPRKTLQCLPY 

I LTGFRRGQSE YFS I LSNKALADTVMFLDFADFGTTI KQDFRLLGQT 

S VDRLLQL S QGQAVKGNQLLPVS LVKRKTTLAPNTQTAS PRALADS L 

MQLARQVSRLESGQSPQSGLKHAFSQLLCFDYYLLSTKSLFTTWPQL 

SALTALSTVGHNPFSEQSALLDQYGDQTQEWSGSATFTSDGKIRLFY 

TDYSGKHYGKQSLTTAQVNVSKSDDTLKINGVEDHKTIFDGDGKTYQ 

NVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQG 

EESLFNKAYYGGGTNFFRKESQKLQQSAKKRD7VELANGALGIIELNN 

D YTLKKVMKPL I TSNTVTDE I ERANVFKMNGKWYLFTDSRGS KMT ID 

GINSNDIYMLGYVSNSLTCYPLPSPGILHAVLFFIIHWDPTNGILS 
SNKRNKVLKQATT 

3204 

A 

2 

247 

QGAGFSSSLRMASIFSIHSVGTRDLSRLSTILSPLYNTVAPGAN*CR 
ELKLVLDADDVLSTEVKRVITSSASNITPAFFCS 

3205 

A 

1099 

2224 

DGQQL I ALHRLALRELQQAVHAGLPQQAKI LFDGG S E / TRQNPLQQL 
VHMGLPRPLDKKNFQEP 

3206 

A 

1366 

2673 

CDNLKTCHTSHGSVMAETAVINHKKRKNSPRIVQSNDLTEAAYSLSR 
DQKRML YLFVDQIRKSDGTLQEHDG I CE IHVAKYAE I FGLTSAEASK 
D I RQALKS FAG KEWFYR PEED AGDE KG YE S FPW F I KRAHS P S RGLY 
SVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMQSPYTDYSGKH 
YGKQSLTTAQVNVSKSDDTLKINGVEDHKTI FDGDGKTYQNVQQF ID 
EGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQGEESLFNK 
A YYGGGTNFFRKES QKLQQS AKKRDAELANGALG 1 1 ELNND YTLKKV 
MKPLITSNTVTDEIERANVFKMNGKWYLFTDSRGSKMTIDAGISHKK 

RKNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDG 
I CE IHVAKYAE I F 

3207 

A 

1 

7035 

wjjXWiU? w± faJJj? Qi WDAQPMFQKEQVDPLQMKLQQVNGLGQGLIQSAG 

KDCDVQGLEHDMEEINARWNTLNKKVAQRIAQLQEALLHCGKFQDAL 

EPLLSWLADTEELIANQKPPSA^YKWKAQIQEQKLLQRLLDDRKAT 

VDMLQAEGGRIAQSAELADREKITGQLESLESRWTELLSKAAARQKQ 

LEDILVLAKQFHETAEPISDFLSVTEKKLANSEPVGTQTAKIQQQII i 

RHKALEEDIENHATDVHQAVKIGQSLSSLTSPAEQGVLSEKIDSLQA 

RYSEIQDRCCRKTUU^LDQALSNARLFGEDEVEVLNWLAEVEDKLSSV 
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FVKDFKQDVLHRQHADHLALNEE I VNR KKNVDQAI KNGQALLKQTTG 
EEVLLIQEKLDGIKTRYADITVTSSKALRTLEQARQLATKFQSTYEE 
LTGWLREVEEELATSGGQSPTGEQIPQFQQRQKELKXETVWEHRLVLD 
TVNEVSRALLELVPWRAREGLDKLVSDANEQYKLVSDTIGQRVDEID 
AA I QRS QQYEQAADAELAWVAETKR KLMALGP IRLEQDQTTAQLQVQ 
KAFS IDI IRHKDSMDELFSHRSE I FGTCGEEQKTVLQEKTESLIQQY 
EAISLLNSERYARLERAQVLVNQFWETYEELSPWlEETRALIAQIiPS 
PAIDHEQLRQQQEEMRQLRESIAEHKPHIDKLLKIGPQLKBLNPEEG 
EMVEEKYQKAENMYAQIK^EVRQRALALDEAVSQSTQFHDKIEPMLE 
TLENLS SRLRMPPLI PAEVDKIREC I SDNKS ATVELEKLQPS FE ALK 
RRGEEL I GR S QG AD KDLAAKE IQDKLDQMVF FWED I KARAEERE I KF 
LDVLELAEKFWYDMAALLTTIKDTQDI VHDLESPGIDPS I IKQQVEA 
AET I KEETDG LHEELE F I R I LGADL I FACGETEKPEVRKS IDEMNNA 
WENLNKTWIOERLEKLEDAMQAAVQYQDTLQAMFDWLDNTVIKLCTMP 
PVGTDLNTVKDQLNEMKEFKV^VYQQQIEMEKIJraQGELMLKKATDE 
TDRD 1 1 REPLTELKHLWENLGEKI AHRQHKLEGALLALGQFQHALEE 
LMS WLTHTEELLDAQRP I SGDPKVI EVELAKHHVLKttDVLAHQATVE 
TVNKAGNELLESSAGDDASSLRSRLEAMNQCWESVLQKTEEREQQLQ 
STLQQAQGFHSEIEDFLLELTRMESQLSASKPTGGLPETAREQLDTH 
MELYSQLKAKEETYNQLLDKGRLMLLSRDDSGSGSKTEQSVALLEQK 
WHWSSKMEERKSKLEEAI^LATEFQNSLQEFINWLTIAEQSLNIAS 
PPSLILNTVLSQIEEHKVFANEVNAHRDQIIELDQTGNQLKFLSQKQ 
DVVIilKNLLVSVQSRWEKWQRSIERGRSLDDARKRAKQFHEAWKKL 
IDWLEDAESHLDSELEISNDPDKIKLQLSKHKEFQKTLGGKQPVYDT 
TIRTGRALKEKTLLPEDSQKLDNFLGEVRDKWDTVCGKSVERQHKLE 
EALLFSGQFMDAIiQALVDWLYKVEPQLAEDQPVHGDLDLVlWLMDAH 
KVFQKELGKRTGTVQVLKRSGRELIENSRDDTTWVKGQLQELSTRWD 
TVCKLSVSKQSRLEQALKQAEVFRDTVHMLLEWLSEAEQTLRFRGAI, 
PDDTEALQSLIDTHKEFMKKVEEKRVDVNSAVAMGEVILAVCHPDCI 
TTIKHWITIIRARFEEVLTWAKQHQQRLETALSELVANAELLEELLA 
WIQWAETTLIQRDQEPIPQNIDRVKALIAEHQTFMEEMTRKQPDVDR 
VTKTYKRKNIEPTHAPFIEKSRSGGRKSLSQPTPPPMPILSQSEAKN 
PRINQLSARWQQVWLLALERQRKLNDALDPS I VS PKCRRPGGRQAYP 
LVNSGGSKRQIAKNRGATQAFAKENNQKAYKETYGVSHITRHDMLQI 
PKQQQNEKYQVPQFDQSTIKNIESAKGLDVWDSWPLQNADGTVAEYN 

ulnv V r MJ_iM\j f j\UJ\ULf loll rir X yivvbUWa XUo YiJSSiMxDK. vf ]SJJ£>U]\. 

FDANDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTA 
QVNVSKSDDTLKINGVEDHKTIFDGDGKTYQNVQQFIDEGNYTSGDNT 
HTLRDPHYVEDKGHKYLVFEANTGTENGYQGEESLFNKAYYGGGTNF 
FRKESQKLQQSAO^AEI^GALGIIELNNDYTLKKVMKPLITSOT 
VTDEIERANVFKI^GKWYLFTDSRGSKMTIDGINSNDIYMI^YVSHS 
LTGPYKPLNKTGLVI^MGI/DPNDVTFTYSHFAVPQAKGNNVVITSYM 
TNRGFFEDKKATFAPSFLMNIKGNKTSVVKNSILEQGQLTVN 

3208 

A 

1 

2328 

MKLMETLNQC INAGHEMTKA I A I AQFNDDS PE ARKI TRR WRI GE AAD 
LVGV S SQAI RDAEKAGRXiPHPDME IRGRVEQRVGYTI EQINHMRDVF 
GTRLRRAEDVFPPVIGVAAHKGGVYKTSVSVHLAQDLALKGLRVLLV 
EGNDPQGTASMYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCW 
PGLDIIPSCLALHRIETEIjMGKFDEGKIjPTDPHLMLRIAIETVAHDY 
DVIVIDSAPNIiGIGTINWCAADVLIVPTPAELFDYTSAIiQFFDMLR 
DLLKNVDLKGSQKTVHTEGDMN>INIKKIVKQATVLTFTTAXiIAGGAT 
QAFAKENNQKAYKET YGVSHI TRHDMLQI PKQQQNEKYQVPQFDQST 
IKNIESAKGLDVWDSWPLQNADGTVAEYNGYHWFALAGSPKDADDT 
SIYMFYQKVGDNSIDSWKNAGRVFKDSDKFDANDPILKDQTQEWSGS 
ATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKINGVED 
HKTIFDGDGKTYQNVQOFIDEGNYTSGDNHTLRDPHYVEDKGHKYLV 
FEAOTGTENGYQGEESLFNKAYYGGGTNFFRKESQKLQQSAKKRDAE 
LiANGALGI IELNNDYTLKKA^KPLITSNT\TTDEIERA1^FK1WGKWY 
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LFTD5RGSKMTIDGINSNDIYMLGYVSNSLTGPYKPLNKTGLVLQMG 
LDPNDVTFTYSHFAVPQAKGNNVVITSYMTJ^GFFBDKKATFAPSFIj 
MNIKGNKTSWKNSILEQGQLTVN 

3209 

A 

1 

2328 

MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRN 
LPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIV 
LVDWSDIREQKRLlWLRASVAliHGRSVTIiYEKAFPLSEQCSKKAHDQ 
FIJUDLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQ 
YADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 
SKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNX.PVEIRTPKQLV 
NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 
LTCWLAGVHAQKQGWDKHFQANTVRNRNIPKQQQNEKYQVPQFDQST 
IKNIESAKGLDVWDSWPLQNADGTVAEYNGYHWFALAGSPKDADDT 
SI YMFYQKVGDNS IDSWKNAGRVFKDSDKFDANDPILKDQTQEWSGS 
ATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKINGVED 
HKTIFDGDGKTYQNVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYLV 
FEANTGTENGYQGEESLFNKAYYGGGTNFFRKESQKLQQSAKKRDAE 
LANG ALG 1 1 ELNND YTLKKVMKPL I TSNTVTDEI ERANVFKMNGKWY 
LFTDSRGSKMTIDGINSNDIYMLGYVSNSLTGPYKPLNKTGLVLQMG 
LDPNDVTFTYSHFAVPQAKGNNW I TS YMTNRGF FEDKKATF AP S FL 
MNIKGNKTSWKNSILEQGQLTVN 

3210 

B 

1 

2541 

MPRQVGSGPGPLLGVAGQAAVGTGGKKSLCQYRKPDGSGIVSLKIDW 
IIERYQLPQSYQHRKAGECLLHEYEDLVPIRDTLRLFPGGRYLPRAK 
HVAPSEPDPEQDEQKLRDFADFGTTIKQDFRLLGQTSVDRLLQLSQG 
QAVKGNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLE 
SGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLV 
KRKTTLAPNTQTASPRA1»ADSLMQLARQVSRLESGQDFADFGTTIKQ 
DFRIjLGQTSVDRIjLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTAS 
PRALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLL 
QL S QGQAVKGNQLL P VS L VKRKTTLAPNTQTASPRALAD S LMQLARQ 
VSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLL 
PVSLVTCRKTTIAPNTQTASPRALADSLMQLARQVSRLESGQDFADFG 
TT I KQD FRLLG QTS VDRLL QLS QGQ AVKGNQLLP VS L VKRKTTLAPN 
TQTAS PRALADSLMQLARQVSRLESGQDFADFGTT IKQDFRLLGQTS 
VDRLLQLS QGQAVKGNQLLPVS LDDVTYA I KPTCWPGLA 1 1 PSCLAL 
HRI ETELMGKFDEGKLPTDPHLMLRLAI ETVAHD YDVI V IDS APNLG 
IGTINWCAADVLIVPTPAELFDYTSALQFFDMLRDLLKNVDLKGFE 
PDVRILLTKYSNSNGSQSPWMEEQIRDAWGSMVLKNVVRETDEVGKG 
QIRMRTVFEQAIDQRSSTGAWRNALSIWEPVCNEIFDRLIKPRWEIR 
* 

3211 

A 

1 

3849 

MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRN 
LPTKARTKHN I KR I DRLLGNRHLHKERLAVYRWHAS F I C S GNTMP I V 
LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQ 
FLADLAS I LPSNTTPLIVSDAGFKVPWYKS VEKLGWYWLSRVRGKVQ 
YADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSR 
SKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLV 
NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 
LTCWLAGVHAQKQGWDKHFQANTVRNRNLKI YSHMVTLWGNYEGI SQ 
YQVPQFDQSTIKNIESAKGLDVWDSWPLQNADGTVAEYNGYHWFAL 
AGSPKDADDTS I YMFYQKVGDNS I DS WKNAGRVFKDSDKFDANDP IL 
KDQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSD 
DTLK I NG VEDHKT I FDGDGKTYQNVQQF I DEGNYTSGDNHTLRD PHY 
VEDKGHKYLVFEANTGTENGYQGEESLFNKAYYGGGTNFFRKESQKL 
QQS AKKRD AELANG ALG 1 1 ELNND YTLKKVMKPL I TSNTVTDE I ERA 
NVFKMNGKWYLFTDSRGSKMTIDGINSNDIYMLGYVSNSLTGPYKPL 
NKTGLVLQMGLDPNDVTFTYSHFAVPQAKGNNWITSYMTNRGFFED 
KKATFAPS FLMNI KGNRGTEGKQGIQYFPHKPLGVGSPH IRFDYLLV 
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S I LPS IRNPTGKFWKDIJUaiKNYAPVAMLQSAhn'RTHNHRSCEEPV 
KDMLLAFPNDRAATIRSKDHHKHFRDFADFGTTIKQDFRLLGQTSVD 
RLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRALADSIJ4QL 
ARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAHAIV 
TGDAVGMDD I PQEARQ YRHNQAYAYS IQGDGAEDDDER I VRFHTRVT 
VDSDTIASDAARLTCRHGLGNQGTAPMYHGWVPDLHIHAEDTLLPFY 
LGEKDDVTYAIKPTCWPGLDIIPSCLALHRIETELMGKFDEGKLPTD 
PHLMLRLAIETVAHDYDVIVIDSAPNLGIGTINWCAADVLIVPTPA 
ELFDYTSALQFFDMLRDLLKNVDLKGFEPDVRILLTKYSNSNGSQSP 

WMEEQIRDAWGSMVLKNVVRETDEVGKGQIRMRTVFEQAIDQRSSTG 
AWSNALSILGTCLQ 

3212 

B 

1 

6724 

MDNF F AP VFTMG KYYTQGDKVLM PLA I QVHHAVCDGFHVGRMLNELQ 

QYCDEWQGGADFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQ 

LLPVSLRKQNTSRPPSMHVDDFVAAESKEVVPQDGIPPPKRPLKVSQ 

KI S SRVENLNTLNLPKELMLCHHQEEGAGEDLDRVYDLM I FFVRENR 

TQVDHHLCMWMTLLLLKVKKWFLKMEYLHQNGHSKYHRRFLPVVGFQ 

AIEEDGVL STVR IG FSHHLLQKALQNFQR YTG I QHVHR IGMAERMWC 

DRNRERHTVS S S GGNRL PNPG PGGEGQS AVARLTGEKKTHPG AQ YAN 

RLS PRVGRFINAAGTTGFPTGKRAVSATQLMDFADFGTTI KQDFRLL 

GQTSVBRLLQLSQGQAVKGNQLLPVSLVXRKTTIiAPNTQTASPRALA 

D S LMQLARQ VSRLE SGQDFADFGTT I KQDFRLLGQTS VDRLLQLS QG 

QAVKGNQLLPVS LVKRKTTLAPNTQTAS PRALAD SLMQLARQVS RLE 

S GQDFADFGTT I KQD FRLLGQTS VDRLLQLS QGQAVKGNQLLPVSLV 

KRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQDFADFGTTIKQ 

D FRLLGQTS VDRLLQLS QGQAVKGNQLLPVSLAIRDAEKAGRLPHPD 

MEIRGRVEQRVGYTIEQINHMRDVFGTRLRRAEDVFPPVIGVAAHKG 

GVYKTSVSVHI^QDLALKGLRVLLVEGNDPQGTASMYHGWVPDLHIH 

AEDTLLPF YLGEKDDVTYAI KPTCWPGLDI I PSCLALHRIETELMGK 

FDEGKLPTDPHLMLRLAIETVAHDYDVIVIDSAPNLGIGTINWCAA 

DVLIVPTPAELFDYTSALQFFDMLRDLLKNDFRLLGQTS VDRLLQLS 

QGQAVKGNQLLPVSLVKRKTTLAPNTQTASPI^ALADSLMQLARQVSR 

LESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVS 

LVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQDFADFGTTI 

KQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQT 

AS PRALADSLMQLARQVSRLESGQDFAD FGTTI KQDFRLLGQTS VDR 

LLQLSQGQAVKGNQLLPVSLACKTTFQACSPYLKLKEEYSFQSEEDQ 

RNTKLYQQLVRNSYDHDYEFELGTHIWNFFAPVFTMGKYYTQGDKVL 

MPLAIQVHHAVCDGFHVGRMLNELQQYCDEWQGGADFADFGTTIKQD 

FRLLGQTSVDRLLQLSQGQAVKGNQLLPVS LVKRKTTLAPNTQTAS P 

RALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQ 

LS QGQAVKGNQLLPVS LVKRKTTLAPNTQTAS PRALAD S LMQLARQV 

SRLE SGQDFADFGTT I KQD FRLLGQTS VDRLLQLSQGQAVKGNQLLP 

VS LVKRKTTLAPNTQTAS PRALADSLMQLARQVS RLE SGQDFADFGT 

TIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLGFCRFRNHPIK 

QDFRLLGQTS VDRLLQLSQGQAVKGNQLLP VSLGFCRFRNPP I KQDF 

RLLGQTSVDRLLQLFSGPGGEGQSAVARLTGEKKNHPGAQYANRLSP 

RVGRFINAAGTTGFPTGKRAVSATQLMD FAD FGTTI KQD FRLLGQTS 

VDRLLQLSQGQAVKGNQLL PVS LVKRKTTLAPNTQTAS PRALADSLM 

QLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVK 

GNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQD 

mur\j± J- xi\yur iVUi-Hay J. ^ VlJKljljyij£>Q(jQAVKGNQLLPvSLEHDGI 

CEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEWFYRPEEDAGDE 
KGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSE 
TKE ITNP YAMRLYES LCQYRKPDGSG I VS LKI DW 1 1 ERYQLPQS YQR 
MPDFRRRFLQVCVNE INSRTPMRLSYIEKKKGRQTTHICKLVEANNP 
QAISQQDLVHMAIQIACGMSYLARTEVIHKDLAARNCITDDTLQVKI 
TDNA VS WT I TVWKTMKTGQ FVG WLLKVWL I TS FLAL VMCGAFG VMLW 
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ELMTLGQTP YMDTDPFEMAAYLKDG YR I AQP INCPDEPFAVMACCWA 1 
LDPGER PKFQQL * 

3213 

A 

532 

1232 

SDNKMCELDILHDSLYQFCPELHLKRLNSLTIiACHALLDCKTLTLTB 
LGRNL PTKARTKHN I KR I DRLLGNRHLHKERLAVYR WHAS F I CSGNT 
MPIVLVDWSD1REQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKK 
AHDQFLADLAS I LPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVR 
GKVQ YADLG AENWKP I SNLHDMS S SHS KTLG YKRLTKSNP I SCQI LL 
YKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWILATNLPVEIRTP 
KQLVN I YS KRMQ I EETFRDLKS PAYGLGLRHSRTS S SERFDIMLL I A 
LMLQLTCWLAG VHAQKQGWDKHFQANTVRNRNLKI YSHMVTLWGNYE 
GI SQTQAFAKENNQKAYKETYGVSHITRHDMLQI PKQQQNEKYQVPQ 
*■ ■»-' v-* x v wxjovv^jjywAiJo x V AEYNGYliVvFALAGS PK I 
DADDTSIYMFYQKVGDNSIDSWKNAGRVFKDSDKFDANDPILKDQTQ 
EWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQHTVNARGLQSKD 
MKSYNQPPYVLWSSSWSHTGTENGYQGEESLFNKAYYGGGTNFFRKE 
SQKLQQSAKKRDAELANGALGIIELN1TOYTLKKVMKPLITSNTVTDE 
IERANVFKMNGKWYLFTDSRGSKMTIDGINSNDIYMLGYVSNSLTGP 
YKPLNKTRLVLQMGLDPNDVTFTYSHFAVPQAKGNNG 

3214 

A 

8749 

11698 

CSWHDRFPDWKAGRILPISEPPSNRIFACWGKPAWTGLLQLSQGQAV 
KGNQLLP VS LG FRG P P ASNPGG APGS AS VAAAAAVAAKAAAAAAP P V 
DNAAPHSAAKLEERRKPDPRRDFADFGTTIKQDFRLLGQTSVDRLLQ 
LS QGQAVKGNQLLPVSLVKRKTTLAPNTQTAS PRALAD SLMQLARQV 
SRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLP 
VSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQDFADFGT 
TIKQDFRLLGQTSTORLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNT 
QTASPRALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSV 
DRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRAIiADSLMQ 
I^QVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKG 
NQLLPVSLVKRKTTLAPNTQTAS PRALADSLMQLARQVSRLESGQDF 
ADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTT 
LA PNTQTAS PRALADS LMQLARQVS RLESGQDFAD FGTT I KQDFRLL 

GQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRALA | 

DS LMQLARQVS RLESGQD FAD FGTT I KQDFRLLGQTSVDRLLQLSQG 

QAVKGNQLLPVSLDLVLKGLRVLLVEGNDPQGTASMYHGWVPDLHIH 

AEDTLLPFYLGEKDDVTYAIKPTCWPGLDIIPSCLALHRIETELMGK 

FDEGKLPTDPHLMLRLATETVAHDYDVIVIDSAPNLGIGTINVVCAA 

DVLIVPTPAELFDYTSALQFFDMLRDLLKNVDLKGFEPDVRILLTKY 

SNSNGSQSPWMEEQIRDAWGSMVLKimrcETDEVGKGQIRMRTVFEQ 

AIDQRFSTGAWRNAPSIWEPGCMEIFERLNKPKLSKLFHVRPVTQGD 

VYRAETEE IPKIFQILWPI KHKKGAGGNEPSEQFS KTLFGES ASG YL 

E I FKDFTGNRNI FRQYRGNEGQMRELQDQLEAEQYFSQQNEKYQVPQ 

FDQSTIKNIESAKGLDVWDSWPLQNADGTVAEYNGYHWFALAGSPK 

DADDTS I YMFYQKVGDNS IDSWKNAGRVFKDSDKFDANDPILKDQTQ 

EWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKI 

NGVEDHKT I FDGDGKT YQNVQQF IDEGNYTSGDNHTLRD PHYVEDKG 

HKYLVFEANTGTENGYQGEESLFNKAYYGG\GTNFF\RKESQKLQQS 

AKKRDAELANGALG 1 1 ELNND YTLKKVMKPL I TSNTVTDE I ERANVF 

KMNGKWYLFTDSRGSKMTIDGINSNDIYMLGYVSNSLTGPYKPLNKT 

GLVLQMGLDPNDVTFTYSHFAVPQAKGNNWITSYMTNRGFFEDKKA 

TLGDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLV 

KRKTTLAPNTQTAS PRALAD S LMQLARQVS RLE S GQD F ADFGTT I KQ 

DFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTAS 

PRALADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLL 

QLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTAS PRALAD SLMQLARQ 

VSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLL 

PVSLVKRKTTLAPNTQTAS PRALADSLMQLARQVSRLESGQDFADLG | 


/0I2Q, 
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TTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPN 
TQTASPRALADSIiMQLARQCLEPRPLSPVPFVPIjMFEFQGS 

3215 

A 

3 

452 

QTQREPTMVLSPADKTNVKAAWGKVGAHAGEYG^EALERMFLSFPTT 

KTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDIi 

HAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 
TVLTSKYR 

3216 

A 

24 

609 

G I PQTQREPTMVLSPADKTNV\ KAAWGKVGAHAG\ EYGAEALERMFL 

SFPTTKTYFPHFDLSHGFCPGLRATGKKVAJDALTNAVAHVDDMPNAL 

SAL\SDLHAHICLRVGPGSTFKLLSHLPCW*TLGRPPSPAEFQPLAVA 

RLPWEKFPGLFVEAPLLDLQITFKVGSFGWPLFFCPFGPPPSPSSPF 
LHPYPRGL 

3217 

A 

3 

583 

EPRLRKAGGLLSRSRQNDQEGREGAQGWPAYGPARGWPQPWVQTGT 

WQGNTGQRVPQLPPHPPPIHI.VSRHRGKLRHGFLRPMPEPRGLESGK 

T/ GQCSGSCSMYVTMRAKRPWEGGHDTLTQQGGCRSSVCGRRAHEAL 

RPRVWCGEGPQWTW\CAVCP\NRSAPGAGLAD\RQHPGESRAWGETR 
LGEAGGAE 

3218 

A 

3 

398 

QTPEKRHKSCPGGLHFLSRSINLHFPKLSISGTYKLKRVLRNLGITK 
IFSNEADLSGVSQEAPLKLSKAVHVAVLTIDEKGTEATGPPHLEEKA 
WSKYQTVMFNRPFLVIIKDDITNFPLFIGKWNPTQK 

3219 

A 

1766 

3896 

NTKLDQAQQAPEDHYPISLLLPSHMAIGIiliMAQEGHCKDSSAMGEEA 
HHPLTPATPPFPPLSPDWGHMQPDFFVPVAVPAVFRGPPQLQCHGRR 
LFLNSPCAQKSSSGLWEPGLSRTLLEMVKLTSMRGQFLEAQIPTGI 
SLTLQYQLYQKQTNKNMSYSFVLFLKWVALGQGRRAGYPSLEDADSR 
RFNGSRSFLITVIGITLTVEIVTSGMMKGTRVRWSGAGNEGMMGLEE 
GRNERSVKEAPPRRAVEAQPKDRTWDVGKGQGTEGEGRGLEVEGQQH 
QGSEPGTIPFSVSWGVLLLAGLCCLVPSSLVEDPQEDAAQKTDTSHH 
DQGDWEDIiACQKI S YNVTDLAFPL YK\ SWL I YH\NQ\ HVLVTPTSVA 
MAFAMLSLGTKADTRTEIIjEGLNVNLiTETPEAKIHECFQQVIiOALSR 
PDTRLQLTTGSSLFVNKSMKLVDTFLEDTKKLYHSEAS S INFRDTEE 
AKEQINNYVEKRTGRKVVBLVKHL 

AERIMVEGFHVDDKTIIRVPMINHLGRFDIHRDRELSSWVLAQHYVG 
NATAFF I LPDPKKMWQLEEKLTYSHIiEN I QRAFD IRS I NLHFPKLS I 
SGTYKLKRVPRNIiG I TKI FSNEADLSGVS QEAPLKLS KA VHVAVLT I 

DEKGTEATGAPHLEEKAWSKYQTVMFNRPFLVIIKEYITNFPLFIGK 
WNPTQK 

3220 

A 

3 

1226 

MRRNKPRQRGVREARPAGGAGPGWRGARCSGAGEGGGGERRGSPPAA 
LALAPASGPRRNFPPDARCLIYQQIHQGAFLLAGSASLSAVTGVPEG 
EARFTEDYCPEEKMFGFHKPKMYRSIEGC\CISGAKSSSSRFTDSKR 
YEK\DFQ\ SCFGLHETRSGDI \CNACVLLVKRWKKL\ PAG\ SKKNWN 
HWDARAGPS\LKTTFEPK\KVKTLSG\NRIK\STQISKLQKEFK\R 
&w £>ua±1£> 1 x o^ASJt'AQSPCYSNQSDDGSDTEMASGSNRTPVFSFLDL 
TYWKRQKICCGIIYKGRFGEVLIDTHLFKPCCSNKKAAAEKPEEQGP 
nttrxjtr 10 i v-c>»* v VbWRPPLiSSIjPAGCSIjHGIjAADWTTDMI 

* " *• •*■ «->*v^»-*vJ->jr*J vwvJj'JJf \£l\~M\.r<nT r JnjCxj j^JS.im.M.c. V \J V LiXjXJ 

3221 

A 

1 

269 

GGGGPLGSGRNGGGSMNAPPAFESFLLFEGEKQLLKDPQVLFAGYKV 
PHPLEHKI I IRVQTTPDYS PQEAFTNAI TDL I SELSLLEERF 

3222 

A 

240 

377 

SNTHRVPTFTHZXW'RV'FMV'T'CT CT AOT?T T T nuTum v~r A*r\rit rrt->r>, 

jHinnvr ± r x utwn c> v r,i v i v i oLoLybr IjJjIjPH xHCJJLiKXjQ* DPJLjHS P 

3223 

A 

47 

263 

LTVLQAVQEAWLGRPQETDNHGRRRRGSRHVWPCWSRRDRESEVGGL 
HAFKQPDPTSAHS ISREQQGGSRPP 

3224 

A 

416 

498 ! 

KRLSLQSFLLLPHIHCLKXiQ*DPLHSP 

3225 

A 

1 

305 

RTRGRTRGGDGGGHLGSGRNGGGSMNAPPAFES FLPLRG / QRRITIN 

KDTKVPNACLFTM\NKEDHTLGNIIKSRACFPFAFCRDCQFPRASPA 
TLPVQPAEL 

3226 

A 

74 

562 

GSGGGGGGHLGSGRNGGGSMNAPPAFESFLLFEGEKK/ 1 SDAGCGGP 
RITINKDTKVPNACLFTINKEDHTLGNIIKSQRDLKDPQALFAGYKV 
PHPL\EHKI I IRVQTTPDYS PQ\RAFTNRQSPDL\ ISELSLLGGSAF 
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RGGHKKTSQERNLSRGARGGLCSGL 

3227 

A 

3 

146 

IiKNKKNIYFCKLSBCEU^VIiYNPNTVQISSPKRNEQGKTEKELYQMCS 

3228 

A 

45 

592 

PSPPPPARAPSARTRRAPIPAAGQGRARPPRTVSRCGSAAMATKIDK 
EACRAAYNLVRODGSAVIWVTIKYDGSTIVPGEQGAEYQHFIQQCTD 
DVRLFAFVRFTTGDAMSKRSKFALlTWIGENVSEIiQRAKTGTDETLV 
KE WQNFANE YV I SDRKELEEE F I KS \ ELKKAVGANYDAQTE 

3229 

A- 

1 

2661 

MRQKVNKDIQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHLTYSK 

I DHLVS S KALL S KCKRTE 1 1 RNCLSDHS ATKLELR I KKLTQNR S TTW 

KLNNLLIJsfDYWVHNEMKAE I KMFFETNENKNTTYQNLWDTFKAVCRG 

KFIALNTHNRKQERSKIDTLTSQLKEPEKQDQTHSKASRRQEITKIR 

AELKEIETQKTLQKINESRSWFFEKINKIDRPLARLIKKEGEKNQIH 

TIKNDKGDLTTNPIEIQTTIREYYIHFYANTLENLEEMDKFLNTYTL 

LRLNQEEVESLNRRITGSEIEAIINSLPTKKSPGPDAEKAFDKSQQR t 

FMLKTLMKLGIDGTYLKIIRAIYDKPIANIILNGQKLEAFf>LKTGTR ' 

QGCPLSPLLFNIVLEVLARAIRQEKEIKGIQSGKEELKLSLFADDMI 

VYLENP I LS AQNLLKL IGNFS KVS G YKINVQKS QAFLYTKNRQTESQ 

IMSELPFTIASKRIKDLGIQLTRDVKDLFKENYKPLLEEIKEDTNKW 

KNIPCSWIGGINIVKIAILPKPCGKLKLDPFIjTPYTKINSRWIKDLH 

VRPKNVTCTLEENLGNTIQD IGMGKDFMSKTPKAMATKAKIDKWDLIK 

LKSFCTVKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYNELKQIYK 

KKTNNP I KKWAKDMNRHFS KED I YAAKRHMKKCS ASLAI REMQI KTT 

MRYHLTPVRMATIKKSGNNRGYMTMLEQESFFLSLNIAYDSVIiLPVI 

LPVPPFGVPDFPQQVPGSSHWDWLDSECSPWRCSIiTGRHIiPVGANRH 

L I EVGAPLGR 5 FQRKDQAAI FAVLQPLLV I PRQTASGVDLRQPPTDL 

QEHNSSPAREENWTENEFDELTEVGFRRLVITKFSELKEDV 

3230 

A 

1 

1454 

MI I S IDAEKAFDKIQQPFMLKTLNELVLEVLARAIRQEKE I KVIQLG 
KEEVKLSLFADDMIVYLENPIVSAQNLLKXISNFSKVSGYKINVQKS 
QAVLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENY 
KPLLNE I KEDTNKWKKI LCS WAGR IN I VKMATLS KVI YRFNAI PIKL 
PMTFFTELEKTTLKF I WNQKRAC I AKS I LSQKNKAGG ITLPDFKLYY 
KATVTKTAWYWYQKRDIDQWNRTEPSEITPRIYNYLIFDKPEKNKQW 
GKDSLFNKWCWENWLAICRKLKLDRFLIPYTKINSRWIKDLNVRPKT 
IKTLEENLGITIQDIGMGKDFMSKTPKAVATKAKIDKWNIilKLKSFC 
TAKETT I RVNRQPTKWEKI FATYS SDKGL I SG I YKELKQ I YKKKTNN 
PIKCTAKDMNRPFSKEDIYAAKRHMKKCSPSLAIREMQIKTTMRYHL 
TPVRMAI IQKSGNN 

3231 

A 

1 

2364 

ETLIDWCDEKELNLILTTGGTGFAPRDVTPENSAVNPSGPGLFLiVVL 
EVLARAIRQEKEIKGIQLGKEEVKLSVSADDMIVYLENPIVSAQNLIi 
KLISNFS KVSGY KINVQKSQAFLYTNNRQRAKS I KYLG I QLTRD VKD 
LFKENYKPLLKEIKEDTNK^JKNIPCSWVGRINIVKMAILPKVIYRFN 
AI P I KLPMPFFTELEKTTLKF I WNQKRAR I AKS I LSQKNKAGG ITLP 
DFKL Y YKAT I TKTAWYWYQNRD I DQWNRTE PSEITPHI YNY L I FDKP 
EKNKQWGKDSLFNKWCWENWLAVCRKLKLDPFLTPYTKINSRWIKDIi 
NVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWPLI 
KLKSFCTAKETTIRVNRQPTKWEKI FATYS SD KALI SRI YNELKQIY 
KKKTNNP I KKRAKDMNRHFSKED I YAARKHM KKCS PS LA I REMQIKT 
TMRYHLTPVRl^IIKXSGNNSHTPAATGCl^WRENLCIWERNSTVIV 
RRSIGHSAAIiSQQKATPGLTQLNTAHRGTIWTSPTQRGTTNPCDWKE 

ADLQQRGLTVRRKTNEQRGI ASTSTKRI STQKPHPKVTN I KEQSEYY 
KYLYANKLENLEEIVKFLDTYTPPRLNQEEVKSLTRPITSSEVEAVI 
NSIPTKKSPEPDRFTAEFYQKYKQELVPFVLKLFQTTEKEGILLNLL 
YEAS I ILIPKPERDTTKKISGQYPRGTWMHKSSIKY 

3232 

A 

1 

1368 

MQNQEQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDV 
KDLFKENYKPLLKE I KEDTKKWKN I PCS WVGR IN I VKMAI LPKVIYR 
FNA I P I KLPMTFFTELEKTTliKF IGNQKRAH I AKS I LS QKNKAGGI T 
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LPDFKZ»VYKATVTKAP W YWYQNRD I DQ WNRME PS E I TPHMYNYL I FD 
KPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTSYTKINSRWIK 
DLNVRP KT I KTLEENLG I T I QD I GMG KD FMS KTPKAMATKAKI DKWD 
LIKLKSFCTAKETTIRVNRQPTKWEKIFETYSSDKGLISRIYNELKQ 
I YKKKTNNP I KKWAKDMNRH FS KEP 1 YAAKKHMKKC S P S IjAI REMQ I 
KTTMRYHLTPVRMAI I KKSGNNRSEAADLRGECYSSEGGTSGWHS S 
RWARALAGSGVKLQTFWSVTAHKGSVDPKSEQ 

3233 

A 

114 

1474 

VKGDRFGALRFNDCPAGIKLPMTFFTELEKTTLKFIWNQKRARIAKS 
ILSQKNKAGGSTLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTGPSE 

T p PT5'LJ r nVTvTVT T PhVOO VKTVY*\Mr* , MT"\ C*T T^XT ITT. Ti'-'TVITr* KTTVT T n T PDVT VT TM">t7T 

X TPH r XN i 1j 1 r JJKP J&Wy Ay WuWlJ S 

TPYTRINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPK 
AMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDK 
GLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKK 
CS SSLAI REMQIKTTMR YHLTPVRMAI I KKSGNNRCWRGCGE IGTLL 
HCWWDWKLVQPLWKSVWRFLRDLELEIPFDPAIPLLGIYPNDYKSCC 
YKDTCTRMFIVAIiFTIAKTWNQPKCPTMIDWIKKMWHIYTMEYYAAI 
KNDEFM/ S FVGTWMKLE 1 1 1LS KLLQEQKTK 

3234 

B 

1 

1890 

MDKFIiDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDG 
FTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIIIilPKPGR 
DTTKIENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFTPG 
MQGWFNICKS INI I QHINRTKDKNHM IIS IDAEKAFDKIQQHFMLKT 
LNKLGIDGTYLKMIRTIYDKPTANIILNGQKLEAFPLKTGTRQGCPIi 
SPLLFNIVMELLARAIRQEKEIKGIQLGKEEVKLSLFAJDDMIRIKYL 
G 1 QLi I l^MKDJbFKEN YKPIjIjNE I KEDTNKWKI^IPCSWVGKIWl VKMA. 
ILPKVIYRFNAIPIKLPMTFFTELEKTTLNFIWNQKRARTAKSILSQ 
KNKARG IMLPD FKL YYKATVTKTAWC W YQNRD I DQWNRTEPS E I TPH 
IYNYLIFDKPDKNKQWGKDSLFNKWCWENWIiAICRKLKLHPFLTPYT 
TINSRWiroLimiPKTIKTLEENIX3NTIQDIGMGKDFMSKTPKAMAT 
KAKIDKWDLIKLKSFCTAKETTISVNRQPTKWEKIFATYSSDTGLIS 
RIYNELKQIYKKKTNNPINKWAKDMNRHFSKEDIYAAQKHMKNAHHH 
WPSEKCKSKHNEI PSHTS * 

3235 

A 

1 

2677 

MKAE I KM FFEINENKDTTYQNL WDAFKAVCRGKF I ALNAHNRKQERP 
KIDTLTSQLKELEKQEQTHS KAS RRQEMTKI RAELKE I E IQKTLQKI 
NESRSWFFERI1^IDRPL7U^LIKKKREKNQIDAIKNDKGDITTDPTE 
I QTT I RE Y Y KHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLNRP I 
TGPEIVAIINSLPTKRSPGPDGFTAEFYQRYKEELVPFLLKLFQSIE 
KEGILPNSFYEAS I ILIPKPGRDTTKKENFRPISLMNIDAKILNKIL 
ANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSI1TWQHINRTKHKNH 
MI ISIDAEKAFDKIQQPFMLKTLNKLS IDGTYXKI I RAT YDKPTAN I 
ILNGQNLEAFPLKTGTRQGHPLSPLLFNIVLEVLARAIRQEKEIKAQ 
NI*LKLISNFRKVSVYKINVQKSQAFLYTNNRQTESQIMRELPFTIAS 
KR I KYLG I QLTRD VKDLFKENYKPLLNE I KEDTNKWKN I PCS W I GR I 

KTILSQKNKAGGIMLPDFIOjYYKATVTKTAWYWYQKRDIDQWNRIEL 
SEIIPHIYNHLIFDKPDKNKKWGKDSVFNKRCWENWIAICRKLKLDT 
FtiTPYTKINSRW I KDLHVRPKAI KTLEENLGITI QDIGMGKDFTSKT 
PKAMATKAKIDKWDLIKLKSFCTAJCETTIRVNRQPTKWEKIFAIYSS 
DKGLISRIYKELKQIYKKKT1WPIKKWAKDMNRHFSKEDIYAANRHM 
KKCS S S LAIREMQ I KTTMRYHI*TPVRKAI I KKSGNNRCWRGCGE IGT 
LLHCWWDCKIjVQP\I>WKTVWQFLRDLBLEIPFYPAIPIjLGIYPKDY 

3236 

A 

1 

3045 

MDKFLNTYTLPRLKQEEVESLNRPITGSDIEAIINSLPTKKXSPGPD 
GFTAEFCQRYKEE/LEKEGILPNSFYEASIILIPKPASDTTKKENFR 
PIS L»MN INAKI LNK I LAKQIRQH I KKLIHHDQVG F I PGMHGJjFNI CK 
SWIIQHINRTNDKNHMIISIDAEKPFDKIQQHFMLKTLNKIjAQNLIj 

klignfskvsgykinvqksqaflytnnrqtesqimnefpftiaskri 
kylg iqltrdvkdlfkenykallne ikedtnkwknipcs \ wekttlk 
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F I W/NQKRAHI AKS 1 1 SQKNKAGGITLPDFKLYCKATVTKTAWYWYQ 
NRD IDQWNRTES S E IMPH I YNHLI FDKPDKKKKWGKDS LFtfKWCWEN 
WIAICRKLKLDPFLTPDTKINSRRIKDpiVRPEMIKTLEENLGNTIQ 
DIGMGKDFMSKTPKAMATKAKIDKWDLIKIiKS FCTAKETTIRVNRQP 
TEWEKIFAIYSSDKGLISRIYNELKQIYKKKTNNPIEKWAKDMNRHF 
SKEDIYAAKKHMKKCSSSLVIREIQIKTTMRYHLTPVRMAIIKKSGN 
NRCWRGCGE IGTLLHCWWDCKLVQPLWKSVWRFLRDLEIiEIPFDPAI 
PDLAAPSLPSGLRSPSKSSPSPPSRCTLVIILLHVFWDIVFFDGCEK 
KRWYILLIVLLTRLLVSACTFTEGYTVGFSTFEALRLGLSRYWLPCS 
SACRRPIVGLQLVMINSGNFQVIAMEGTVASECCHGNGKIjTWHRPVX, 
SVCSFSRCTVQAAGGSAILEDGDPLLTAPLGSTPQAAVCRGPRGREIi 
RAAPADSHLFQ]RDLWPFNKVIVHGEKGSNQTSQGLLNTGSEMTIVLE 
NPKYH S G P P VR VS PDGGQ V 1 1 E VLADPS YTG PTALNNVF F AFQCNF Y 
FDHIPENCGFSDPSDPQNLQKGEGCPSLVRASTAPPQEKATEQPLLC 
KTTESPFGMTVGPCTDETLDHGAPSKHVPGTAHNELALLDLRVIKSA 
GSAAVHHKLKVLHWRSSLSNNKGTGRLYEQVA 

3237 

A 

1 

2142 

MLKTLNKLRIDGTYLKIIRAIYDKSTANIMLNGQKLEAFPLKTGTRQ 
GCPLSTLIjFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIV 
YLENPIVSAQNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTESQI 
MSELPFTIASKRIKYLGIHLIRDVKDLFKENYKPLLNEIKEDTNKWK 
NIPCSWIGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKF 
IWNQKRARIAKSVIjSQKNKAGGITLPDFKXYYKATVTKTARYWYQNR 
D I DQWNRTE PS E IMPH I YKYI» I FDKLEKNKKWGKDS h FNQ WCWENWL 
AICRKLKLDPFLTPYTTINSRWIKDIiNVRSKTIKTLEENLRIAIQpT 
GMGKDFMSKTPKAMATKAKIDK^LIKLKSFCTAKETTIRVNRQPTK 
WEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSK 
EDIYAAKKHMKKCSPSIJVIREMQIKTTMRYHIjTPVRMAIIKKSGNNS 
TFDFGQHEFELHRSTYTRIVFSKITPILEVLARAIRQEKEIKGIQLG 
KEEVKLSLFADDMIVYLENPIVSAQNLLKLIGNFSKVSGYKINVQKS 
QAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENY 
KPLl^EIKEDTNKWK^IPCSWVGRINIVKMAILPKVIYRFNAIPIKL 
PMTFFTELE 

3238 

A 

1 

1552 

MRFKEKIHIiHNIKEPSEAASADGGAVASYPDLAKIVDEGRYKAEVMQ 
LRCG WRAPASDC VHSVAVVGVDS VLEVLARAI RQEKE I KG I QLGKEE 
VKLSLFADDMIVYLENPTVSAQNLLKLISNFSRVSGYKINVQKSQAF 
LYTNNKQTESQlMSELPFTIASKTIKYIiGIQLTRDVKDLFKENYKPL 
LNKIKEDTNKWKNIPCSWIGRINIVKMAILPKVIYRFNAIHIKLPMT 
FFTELEKTTLKFI WNQKGVH I AKS I LS KKNKAGG IMLPHFKLYYKAT 
VTKTAWYWYQNRYTDQCNRTEPSE 1 1 PHI YNHLMFDKPDKNKKWGKD 
SLFNKWCWENWLAI CRKLKLDPFLTP YTKINS RW I KDLNVRPKP I KT 
IiEENLGITIQDIGMGKDFMSKTPKAIATKAKIDKWDLIKIiKSFCTAK 
ETTIGVNRQPTELEKIFAIYSSDKGLISRIYKEIiKQIYKKK\NNPIK 
KWVKDMNRH FS KED I YAVNRHMKKCS S S LV I REMQI KTTMRYYLTPV 

3239 

A 

1 

1459 

GLSGDLLGAHQLPDVLCGVQPLPDLLLPPQNLLALQSLQDDLLWALD 

PAAAAPWAMDRGAATQWAVGPVVSDPWVMEAVASLPSAMPLDSAAQP 

TWLLGAAS LLVTDQPMDQPS ADQLAEFPDLLS KVSQSLR I KYLGIKL 

TRWKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPK 

VIYRFNAIPIKLPMTFFTELENTTLKFIWNQKRACIAKSILSQKNKA 

GSIMLPDFKLYHKATVTKTAWYQYQNRDIDQWNGTEPSEIMSHIYNY 

LIFDKPEKNKQRGKDSLFSKWCWENWLAICRKLKLDPFLTPYTKINS 

RWIKDIiNVRPKIIKTLEENIiVNTIQDIGMGKDFMSKTPKAMATKAKI 

DKWDLIKQKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYK 

ELQQIY\RKKTNNPIKKWAKDMNRHFSKEDIYAANRHMKKCS 

REMQIKTTMRYHLTPV 

3240 

A 

1 

2028 

MIISIDAEKAFDKIQQPFMLKTLNKLGIDGRYLKIIRAIYDKPTANI 
ILNGQICLEAFPLKTGTRQGCPLSPLLFHIVLEVLARAIRQEKEIKGI 
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QLGKEKVKLSLFPDDMVVYLENPIISAQNLLKLISNFSKVSGYKINV 
QTSQAFLYTNNRQTESQIMSELPFTTASK^IKYIiGIQLT3U>VKDLFK 
DNYKPLLKE I KEDTNKWKNI PCLWVGR IN I VKMAI LPKVI YRFNAI P 
I KLPMTFFTELEKTTLKFI WNQKRAR I AKS I LSQKNKAGGITLPDFK 
LYYKSTVTKTAWYWYQNRDIDQWNRTEPSEIMLHISNYLIFDKPDKN 

nqwgkdsl™kwcwenriaicrki,kldpfltpytkinsrwikdlnvr 

PKTINTLQENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLK 
SFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYNELKQIYKKK 
TNN P I KKWVKDMNRHFS KED I YAAKKHMKKC S S SLATREMQ I KTTMR 
YHLTPVRMAIIKKSGNNRPLWPTMPGDSQRGDKLRDGGTPGKDHFPT 
PFAFQAPIHPTESHLHHSPSLPAFTILQVHGFECTAAKQTSHTPVAS 
PTGESGNFLASVTSRLSGGNVKIILDWKMRIMNKEDYGTISVIRRSY 
NRTLKQDGGS KAEENKQR 

3241 

A 

2 

1547 

SNISLYSWSLRPLPADYVPSIVMLPGTCQTQDVEVSEETVEPSDEIi 
IEYDSPEQLNEQLVTLSLLPESRWKNLIiNIiDVIKRALELRYNINVQK 
SQAFLYTNNRQTESQIMSELPFTMASKRIKYLGIQLTRDVIPLFKEN 
YKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFSAIPIK 
LPMTFFTELEKTTLKFIWNQKRARIVKSILSQKNKAGGTTLPDFKLY 
YKATVTKI AWYWYQNRD IDQWNRTEPS E IMPHI YNYLI FDKPEKNKQ 
WGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPK 
TIKTLEENLGITIQDIGMGKDFMSKTPKAMATKPKIDKWDLMKLKSF 
CTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTN 
NP I KKWAKDMNRHFSKEDI YAAKKHMKKCSPSLAIREMQ I KTTMRYH 
LP P VRMAI I KKSGNNRCWRGCGE IGTLLHCWRDCKLVQPLRKTVW 

3242 

A 

3 

1945 

NYPDQHFPIFHGIKEKSPRTGSWLIELTSERPFKTKSQCQDPVYPSC 
HYSQSSLFMDPGYNEPPEIAPVNEAVGIQIRGPGAASCARFCEQLAT 
ACRMLEFlJU^AIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIIiS 
AQNLLKIiISNFSKVSGYKINVQNSQAFLYTONRQTDSQIISELPFAI 
ASKRIKYLGIQLTRDVKNLFKENYKPQLSEIKEDTKKWKNIPCLWVE 
RIYIVKTAIQAKVIYRFNAIPIKIPMTFFTELEKTTLNFIRNQKRAR 
I TKS I L S QKKKAGG I TL PDFKL Y Y KATVT KTAWYW YQNRD I DQ WNRT 
EPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKCYWENWLAICRKLKL 
DPFLTPYTKINSRWIKELNVRPKTIKTLEENLGITFQDIGMGKDFMS 
KTTKAMATKAKIDKWDLI KLKS FCTAKETTIRVNRQPTKWEKI FATY 
SSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKK 
HMKKCSPSLAIREMQTKTTMRYRLTPVRMAIIKKSGNNSLLITIREY 
YKMLYANKLENLEEMDKFLDTYTLPKLSEEEV^SLDMLSFYTTGSKD 
CSPNFDTKNALLMYTPVKTRTKIQLQIRTLHKALTV 

3243 

A 

1 

2124 

MAQELRDECTSLSSRFDQLEERVSEMEDHMNEMKREEKFRKKGIKRN 
EQSLQEIWDYVKRPNLRLIGVPETQNLLKLISNFSKVSGYKINVQKS 
QAFLYTNNRQTESQIISELPFTIASKRIRYLGIQLTRDVKDLFKENY 
KPLLNEIKEDTHKWKNIPCSWIGRINIVK^ILPKVIYRFNAMPIKI* 
PLTFFTELEKTTLKFIWNQKRARIAKSILSQKNKTGGITIjPDFKLYY 
KATVTKAAWYWYQNRDIDQWNRIEPSEI I PHI YNYLI FDKPEKNKKW 
GKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLHARSKT 
IKTLEENLGNTIQDIGIGKDFMSKTPKAMATKAKIDKWDIiIKLKSFC 
TAKETTIRVNRQPKEWEKIFAIYSSDKRLISRIYEEIKQIYKKKTNN 
PINKWAKDMNRHFSKEDIYAANRHMKKCSSSLVIREMQI KTTMR YHL 
TPVRMVI IKKSGNNRSLFHVT PKTCELGSLPRLAHEGHIiQVTRGVQK 
WGHQLEFGPRVWIPKQESCSMALFCYAAEGLYFGSKGSQGKDKLCGC 
GTFARFNILQEDDLSLRVVRIjEDKERGSGENGIHTYNTCELKQIiIFI 
MKKSTKDGAAWPLPHAGSCRCKGRTTANGTEKVPACIIQEPEDRPAP 
PTTATASACGHCLGPEDKPVSPIIKATCTQACHPGICLLQPAIiLHTI 
GEA 

3244 

A 

1 

2250 

MGKFLDTHTLARLNQEEIESLNRPITGSEIVAIINTLPTKKSPGPDG 
FTAKFYQSRAETQPKKNFRPISLMNIDAKILNKILANQIQQHIKKLI 
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HHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFD 
KIQQPFMLKTLNKLGTDGTYLKI IRAI YDKPTANI ILNGQKLEAFPL 
KTGTRQGCPLS PLLFNIMLEVLARAI SQEKE I KG I QLGKEEVKLSLF 
ADDMIVYLDNPVVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNR 
QTESQIMRELPFTIASKRIKYLGIQLTRDVKDLFKENYEPLLNEIKE 
DTNKWKNVPCS WVGR INI VKMAI VPKVI YRFNVI P I LLPMTFFTELE 
KTTLKF I WNQKRAR I AKS I LSQKN KAGG I TLPDF KLYY KATVTKTAW 
YWYQNRDIYQWNRTQPSEITPHIYTYLIFDKPEKNKQWGKDSLFNKW 
CWENWLAICRKLKIiDPFLTPYTKINSRWIKDLNVIPKTlKTIiEENLG 
ITIQDIGMGQDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRV 
NRQPTKWEKIFATYSSDKGLISRMYNELKQIYKKKTNNSIKKWAKDM 
NRHFSKEDIYAAKKHMKKCSPSIjAIREMQIKTTMRYHLTPVRMVIIK 
KS GNNS LGDKARPCLKKRKEKKMFLTVGTKS LEE KAEG I RGTS S S LV 
KEYLKVS KI LPALEYWTPS S S VLELGLALLAPQPADGRLWDLVI L 

3245 

A 

2 

2483 

GKYYKLSSGTAPTCVSLGWGLARGDSAAPAIiGSRTSACAPCSHGTWK 
LSLEPSDRLSPCDRSSEEAHTHAPHRLIALVASLPWSRLPLLAPQSH 
SEAEATSQPTGVENHHQKTRYVKAGGPVICRSLPESRGFLWASEGRK 
CMLIGSWAAMGRLRKSTISSRFGPQTLAGTGRPQAIPVLIOCHSDAVL 
LGVCFLKLLHQHHQELGENADSQTLPQTHWEFILSEDYNKMTPVKNY 
QVLEVKARAMRQEKQIKSIQLGKEEVKLSVFADDMIVYIiENPIVSAQ 
NLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIISELPFTIPS 
KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRI 
NIMKmILPRVIYIFNAISIKLP^f^FFTELEKTTLKFIWNQKRARIA 
KT I LSQKNKAGG I TLPDFKLY YKATVTKTAW Y WYQNRGVDQ WNR I EP 
SEIIPHIHNHLIFDKPDKNKKWGKDSLFTKWCWENWLAICRKLKIjDP 
FLTPYTKINSTWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKT 
P KAMATKAKIDKWDLI KLKS FCTAKETT I RVNRQPTEWEKI FT I YPS 
DKGLIPRIYKELKQ\IYKKKS1WPIKKWAKDINRHFSKEDIYAANRH 
MKKCS S S L V I REMQNKI T I R / YHLTP VRMA 1 1 KKSGNNRDMDEAGNH 
HSEQTIARTENQAPYLLTHRWEIoNNENTWTQVEEHHTLGPIVGVICR 
KVFPGNSGPSKPSGLHFSQPLPQVTSWAKITIVPWEMKLIAMGVQD 
ELNIAFHKNHLLMNDTTIHMTPYIQPAPKS 

3246 

A 

1 

2622 

MDKFLDTYTLPRIjNQEEVECIjNRPITGAEIVAIINSIjPTKKSPGPDG 
FTAEFYQRYKEEL VPFLLKLFQS I EKEG ILPNS FYEAS 1 1 LI PKPGR 
DTTKXENFRP I S LMN I DAKI LNTCI LADR I QQH I KKL I HHDQ VGF I PG 
MQGLFNIRKS INVIQHINRAKDKNHMNI S IDAEKAFDKI QQHFMLKT 
LNKLADKHFSKVSGYKINVQKSQAFLYPNNRQTESQIMSELPFTIAS 
KRI KYLGIQLTRDVKDLF KENYKPLLKE IKEDTNKWKNI PCSWVGRI 
NIVKMAILPKVIYRFNAILIKLPMPFFTELEKTTLKFIWNQKRARIT 
KSILS QKNKAGG I TLPD FKL YYKATVTKTAWYWYQNRD I DQWNRTEP 
SEIMPHI YNYL I FDKPEKNKQWGKDSLFNKWCWENWLAI CRKLKLNP 
FLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKT 
PKAMATKDKIDKWDL I KLKS FCTAKETT I RVNRQPTKWEKIFATYSS 
DKGLMSRIYI^LKQIYKKKTNNPIKKWAKDMl^FSKEPIYAAKKHM 
KKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNSKLjPFEPSRNK 
RVMPEPCAHTILSSLARPASKMHSPPLSMEPSAGPGGSQNFPNEEAB 
DRPRTLNFEGSETWALTGPDARYFNYTREARGENSLVSGGCRPLAAG 
RARASTSGWAAKRVGTLQGGPLTWAMPAPISPPPMTVTCLTIIFLAE 
AEAVDEEDTERMNCLVTKAMWQRRRQRGDPGREDARGGQPRKARLW 
GRGPTRPCSPRAPGVPVTPQGGGGASSPALPIGPGCNVPALSGYTRA 
GSQLSAAGARVSAGLRPAEQRDLAAGVL 

3247 

A 

853 

2831 

YPESTMNSNKFTRKKQTTPSKSG*RI*TDTSQKKTFMQPKDT*KNAQ 
HHW SLEKCKS KPQ * DT I S HQLE WRS LKS QDRKD * QS TLL AI h I KKKG 
QKNQ I DT/ 1 KNDKEG I TTDPRE IQTT I REY YKHLYTNKVENLEEMDK 
FLDTYTLPTLKQKKEVKTLNRPITGSEIEAIINSLPT/KKSPGPDRF 
TAEFY\R/DIRSSGQGNQARERNKGYSIRKRGSQIVPVCR*HHCIFR 


\o\ 4 \t 


.1 0 Hill S M-6 Ml- 3! ,., O 2 2 6. CI S 






KPHHLSPKSS*ADKQLQQSLRIQNQSAKITSIPIHQ*QTNREPNHE* 
TPIHNCFKENKMPRNPTYKGCEGSLQGELQTTAQ*NKRGHKRMEEHS 
MLMDRKNQYRENGHTAQGNL*IQCHPHQATOTFFHRIGKNYFKVHME 
PKKSPHRQVNPKPKEQSWRHHTT*LQTILQGYSNQNSMVLVPKQRHR 
PMEQNRAPGNNTIHLQLSDL*QI *QKQEMGKGFPI * *MVLGKLANHM 
* KAETGSLPHI LYKN * FKMD * RLKC * T * NHKNPRRKPRQ YHSGHRHG 
QGLHD*NTKSNGNKSQN*QMGSN*TKELLHSTRNYHQSEQATYRMGE 
NFCNLLI*QRANIQNLQRT*TNLQDKNKQPHQKVGKGYEQTLLKRRH 

LCSQKTHEKMLI ITGHQRNANQNHNEIPSHTS *NGDH/SNQVRKQQV 
LERMWRN 

3248 

A 

1 

4954 

MVFSIDAQKAFDKIQHRFMLKTLNKLGIDGTYLKIIRAIYNKPTGNI " 

ILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGI 

QLGKEEVKLSLFADDIIVYLENPIVSAQNLLKLIGNFSKVSGYKINV 

QKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKNLFK 

ENYKPLLNEIKEDTDKWKNIPCSWIGRIHIVKMATLPKVIYRLHAIH 

IKLPMTFFTELEKTTLKFIWNKKRARIAKSILSQKNKGGGITPPDFK 

LYYKATVTKTARYWYQNRD IDQWKTRE PS E 1 1 PH I YNHL I FDKPDKN 

KKWGKDSLFNKWCWENWLAICRKLKLNPFLTPYTKINSRWIKDLNIR ! 

PKTIKTLEENLGNTIQDKGVGKDFMSQTPKAMATKAKIDKWDLIKLK 

SFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYKELKQIDKKK 

ANNPINKWAKDMNRHFSKEDIYAANRHMKKSSSSLAIREMQIKTTMR 

YHLTPVRMVIIKKSGNNSEGLNPGYKGFPTIIWAPLPVAQSKDSGLA 

SLNSDPDI PSMLECSLKAPQL YRSKNVGQVF IIS S ASQAFTKKAR I Y 

ARLRVS QALKTLCKS S CHDG WS FERLAR I QE VS L P I S PDI» I LCS EA Y 

hygtkpqwlvaatgtaqtflelnqksqqyqkqeqthskasrmqeitk' 

IRAELKEIETRKTLQKIDESRSWFFERINKTDRPLARLTKQKREKNQ 

IDAIKNGKGDITTDPTGIQITIREYYKHLYAKKLENLEEMDKFLDTY 

TLPRLNQEEVDSLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQR 

HKEELVP FLLKLFQS I EKEG I LPNS FYEAS 1 1 L I PKPGRDTTKKENIt 

RPISLMNIDAKILSKILANRIQQHIKKLIHHDQVCFIPGMQGWFNIR 

KS INVI QHINRAKDKNHM 1 1 S IDAEKAFDKI QQTFMLKTLNKLGI DG 

TYFKI IRAI YEKPTANI I LNGQKLEAFPLKTGTRQGCPLSPLLFNI V 

LEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNL 

LKLISNFSKVSGYKIYKIDVQKSQAFLYTNNTDKQESQIMSELPFTT 

ASKRIKYLGIQLTRDVKDLFK\ENHKPLLNEIKEDTNKWKNIFIPCL ' 

WVGRINIVKMAILPKGIYRFNAIPIKLPMTFFTELEK\TTLKFIWNQ 

KRARITKSILSQKNKAGGITLPDFKLYYKATLTKTAWYWYQHRDINQ 

WNRTEPSEIIPHIYNHLIFDKPDKNKKWGKHSLFNKWCWESWLDICR 

KLKLDPYTKFTPYTKINSRWIKGLNVRPKTIKTLEDKPIQVFNTIQD 

IGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTGKETTIRVNRQPT 

EWEKIFAIYSSDKELISRIYKELKQIYKKK\TNNPIKKWAKDMNRHF 

SKEDI YAANKHMKK\ CSSSLAIREMQIQTTMRYHIjTPVRMAI \ IKKS 

GNNRCWRGCGEVGTLLHCWWDCNLVQPLWKSVWRFLRDLELEIPFDP 

AIPIiLGIYPKDYKSCCYKDTCTRMFIVALFTIAKTWNQPKCPTMMDW 

IKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLEIIILSKLSQEQKTKH 
RIFSLIGGN 

3249 

A | 

25 1 

527 

EFHRLRENPPMVAVSCPTKTNVKA\AWG\ KVGAHAVRSMCAEALERM 
FLSFPT\TKTYFPHFDL\SHG\SAQVKGATGKKVADALTNAVANV\D 
DMPNVAVRPEATLHAHKLRVDPV\NF\KLLKPLACLVDPGPAHLPRP 
SFTPGGATSSLGQSFLGFLLKHRCLNLPNYR 

3250 

A 

97 

1360 

SACAWRLPSPGPSAMWPLWRLVSLLALSQALPFEQRGFWDFTLDDGP 
FMMNDEEASGADTSGVLHPDSVTPT\YS7U4CPFGCHCHLRVVQ\CSD 
LGLKS VP KE I S PDTTLLDLQNND I S ELRKDD FKG LQHL YALVIiV\ NN 
KISKIHEKAFSPLRKLQKLYI SKNHLVEI PPNLP\ SSLV\ ELRIHDN 
RIRKVPKGVFSGLRNMNCIE\MGGNPLENSGF\EPGAFDGLKLNYLR 
I SEAKLTGI PKDLPETLNELHLDHNKI OA I ELEDLiLRYS KLYRLGLG 
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HN\QIRMIENG\SLSFLPTLREV\HIi\DNNKLARVPSGFP\DLKLLQ 
W/YILHSNNIHQSGVFNEFLFPWGFGGEAGPYYNGH/ISLFNNPVP 
YWEVQ\ PATFR\ VVTDRLAIQFGQLQKVEAAAATLVSQWGSLGNRAR 
HPDGEAEPGS 

3251 

A 

1 

2535 

MAERGQCTAQAVASEGGRSKPWQLLRGIEPAGAQKSRIEVWEPLPRF 
QKMDAQAIVLLQGQGTHGEPLAASYPSSGQIQRRYQGLALRTLGIFL 
SFYSTAAELTPKPKDKLLLILLSRFLKQRNLSPWPPLPQACSKYCKA 
VADIHSRSKGSLVSLWSMLIGLGISLQGNGLPSGPEIKRVKLISNKG 
TETDNDPSCVHPIIKRRQCRPEIRMWQTREKAKFSDGEKCRREAFRR 
LGNGVSDDLSSEEDGEARTQMILLRRS VEGAS SDNGCEVKNRKSILS 
RHLNSQVKKTTTRWCHIVRDSDSLAESEFESAAFSQGSRSGVSGGSR 
SLNMSRRDSESTRHDSETEDMLWDDLLHGPECRSSVTSDSEGAHVNT 
LHSGTKRDPKED VFQQNHLFWLQNS S PS SDRVS AI I WEGNECKKMDM 
SVLEISGIIMSRRFLFAKLFSHITSARKARKYEIPHFRLKKVENIKI 
WLSLRSYLKSMCKNMLTEAPGSSYSVDSWLGVRQIIAAADRLAGQQ 
VCjCHIj S AKbVGVGHEDQ VPGGDL PLLCASRTR FKAFVTIEDYNGYVG 
LGVSAPRRGVGTVLAPVPRNLLLMVVTHHCCTLTGDCAATLANFAKG 
TFDAISKTYSYFTPNLCCGKLNHKLSKCLDSPLAGIKTYLSRKVMTA 
ETSEIPTCLFTCTGALSSAKANEWCFRRSKSFSNWVGDQIVEINGEP 
TQGITHTRAIELIQAGGNKVLLLLRPGTGLIPDHVIAIVLYLMMPIV 
NISEVIjGPLCLMLLMGTVHCQIVSTQITRPSGNNGNRRRRKLRKTVN 
GDGSRENGNNSSDKVRGIETLESVPI IGGFWETIFGNS ICYRLSFA 

3252 

A 

5 

2333 

NPILWLETQMASNERDAISWYQKKIGAYDQQIWEKSIEQTQIKGLKN 
KPKKMGHIKPDLIDVDLIRGSTFAK7UCPEIPWTSLTRKGLVRWFFP 
LFSNWWIQVTSLRIFVWLLLLYFMQVIAIVIiYLMMPIVNISEVLGPL 
CliMLLMGTVHCQIVSTQITRPSGNNGNRRRRKLRKTVNGDGSRENGN 
NSSDKVRGIETLESVPIIGGFWETIFGNRIKRVKLISNKGTETDNDP 
SCVHPIIKRRQCRPEIRMCQTREKPKFSDGEKCRREAFRRLGNGVSD 
DLS S EEDGEARTQM I LLRRS VEGAS SDNGCEVKNRKS I L SRHLNSQV 
KKTTTRWCHIVRDSDSLAESEFESAAFSQGSRSGVSGGSRSLNMSRR 
DS ESTRHDS ETEDMLWDDLLHG PECRS S VTSD SEG AHVNTLHSGTKR 
DPKEDVFQQNHLFWLQNSSPSSDRVSAIIWEGNECKKMDMSVLEISG 
1 1 MS R VNAYQQGVG YQMLGNWT I GLAFFPFLHRLFREKS I*D \ QLKS 
ISAEEILTLFCGAPPVTPIIVLSIINFFERLCLTWMFFFMMCVXAER 
TYK\ QRFLFAKLFSHI YFCQGKLGKYE I PHFRLKKVENI KI WLSLRS 
x .uivK.Kiji'y Kt> vij v v \ v £> b VFJjIjTLiS I AF I CCAQVLQG \HKT\ SWNDA 

KMEKKPNKKEQLTLVNNVLKLSTKLLKKLDTPFRL\YGLTMNPLIYN 
I TRWI LSAVSGVTSDT.L.nFN T R T .Wlf T Tf C 

3253 

B 

73 

358 

XVPGSRGPETKLWDDFSMSQATKRKHVVKEVLGEHIVPSDQQQIVRV 
LRTPGNNLHEVETAQGQRFLVSMPSKYRKNIWIKRGDFLIVDPIEEG 
EK* 

3254 

A ! 

80 

201 

TSEPMFLTKTPCHPSVKGPSPSSEQSWVLSLPSKELRRRS 

3255 

A 

498 

748 

FLPRRGDNDS YPQ/ WTKACCRRRRTCW* T * TWWIRSGQRKMWS CGRR 
S Li TTCPMPRTR T A A P T .M A r*n Q n cup dohdc r*n r>-o t c r> 

3256 

A 

1 

1764 

MTTSQKHRDFVA^^MGEKPVGSLAGIGEVI^KKLEERGFDKAYVVLG 
QFLVLKKDEDLFREWLKDTCGANAKQSRDCFGCLRE/WCAKSRPAAE 
VSELKADSKEGPQAQGPEQERTGIj 

3257 

A 

842 

1186 

FLPRRGDNDSYPQ/ WTKACCRRRRTCW *T* TWWIRSGQRKMWS CGRR 
SLTTCPMPRTRAWTTWRSKNIjALSCPSMTKSLKGSGHIPSAWSRAAR 
LMACGSGSWRRSGPSCGCRLSP 

3258 

A 

526 

1158 

S CGLSL IKMTTSQKHRDFVAJBPMGEKPVGSLAG IGEVLGKKLEERGF 
DKGL/YWLGQFI^IEIQ^KTSFREWLKDTCGANAK\QSRDCFRMPF 
EKWCEALLVDAIiLGKFSIPPAPQSRSLASRSRDSSPCPSYEGKD\CY 
C\RTHL\RRYFRGSFGEFSPLNHFQLFFGILRSWHAFPRPFFPWPVS 
WVTVYQLFLEWDFPGP I PHPHPHFQS V 
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3259 

A 

35 

407 

TFMFVFLSSAGNMPVTCWCWEAPRCNQKCTDPAARRPDPQTCASQDR 
LRCAPCTCHQPLASRYTPPPACPPSHVSLHTLATSLQHFLNKNYKNY 
I AKMCKLS I LSLS FLKEGAGDKNS SEPNLG 

3260 

A 

3859 

4052 

ASCPNPKEFAPPRAGPPWFWGPPLARVNPG\LGGPFSGTRPTPPQPL 
PEQRVTLTSTNPWLAASV 

3261 

A 

493 

712 

AHGRQLRKYSQQQGCSWRLPFCGCHAVSGGGCGKCC\SPANGPCGAA 
\ CLGFQPHNFPDPCETC / CGCHAVSGGGCGKCCI SSKWSLWCS PAWV 
S SHTTSRI PVKLAGLPS PS FLPG 

3262 

A 

51 

541 

MGSLSLNLPFVVTVLLMCLSRSRKGDAASQFLVFCQKHRDEVVAEHP 
DASGEEIEELLRSQWSLLSEKQRARYNTKFALVAPVQAEEDSGNVNG 
KKRNHTKRIQDPTEDAEAEDTPRKRLRTDKHSLRKRDTITDKTARTS 
SYKAMEAASSLKSQAATKNLSD 

3263 

A 

742 

4858 

ERKHLGWMEF S I KQS PLS VQS WKCI KMKQAPE I LGS ANGKTPSCEV 
NRECSVFLSKAQLSSSLQEGVMQKFNGHDALPFIPADKLKDLTSRVF 
NGEPGAHDAKLRFESQEMKGIGTPPNTTPIKNGSPEIKLKITKTYMN 
GKPLFESSICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLEQ 
GLVEAALVSKI S S PSDKKI PAKKESCPNTGRDKDHLLKYI^VGDLVWS 
KVSGYPWWPCMVSADPLLHSYTKLKGQKKSARQYHVQFFGDAPERAW 
I FEKSLVAFEGEGQFEKLCQE S AKQAPTKAEKI KLLKP I SGKLRAQW 
EMGIVQAEEAASMSVEERKAKFTFLYVGDQLHLNPQVAKEAGIAAES 
LGEMAES S GVS EE AAENPKS VREECI PMKRRRRAKLCS S AETLESHP 
DIGKSTPQKTAEADPRRGVGSPPGRKKTTVSMPRSRKGD7UVSQFLVF 
CQKHRDEWAEHPDASGEEIEELLRSQWSLLSEKQRARYNTKFALVA 
PVQAEEDSGNVNGKKRNHTKRIQDPTEDAEAEDTPRKRLRTDKHSLR 
KRDTITDKTARTSSYKAMEAASSLKSQAATKNLSDACKPLKKRNRAS 
TAASSALGFSKSSSPSASLTENEVSDSPGDEPSESPYESADETQTEV 
S VSS KKS ERGVTAKKE YVCQLCEKPGS LLLCEG PCCGAFHLACLGLS 
RRPEGRFTCSECASGIHSCFVCKESKTDVIOICVVTQCGKFYHEACVK 
KYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAY 
HSGDACIxAAGCSVIASNSIICTAHFTARKGKRHHAHVNVSWCFVCSK 
GGSLLCCESCPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDIIWV 
KLGNYRWWPAEVCHPKNVPPNIQKMKHEIGEFPVFFFGSKDYYWTHQ 
ARVFPYMEGDRGSRYQGVRGIGRVFKNAIjQEAEARFREIKIjQREARE 
TQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKPTDENPCG 
FDSECLMRMLMFECH POVCPAGEFPONOPFTJCR OVPETK T T KTDRKG 
VJGLVAKRDIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYl^T 
IDKDR 1 1 DAGPKGNYSR FMNHS CQ PNCETLKWTVNGDTRVGLFAVCD 
IPAGTELTFNYKLDCLGNEKTVCRCGASNCSGFLGDRPKTSTTLSSE 
EKGKKTKKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCTKA 
YHLSCLGIiGKRPFGKWECPWHHCX>VCGKPSTSFCHL»CPNSFCKEHQD 
GTAFSCTPDGRSYCCEHDLGAASVRSTKTEKPPPEPGKPKGKRRRRR 
GWRRVTEGK 

3264 

A 

2 

658 

AFLRMLFPESWMETRRGAPAPPAPRSETSHDLVATLMRCTPHYIR*S 
NPTRPRGPETGRRTA*TRSPQVPHLQHLPPTPRLVTPHPEGTQPTLP 
TRSSTRWNTWA*RRTSGCESRLRLPPPVRQIPAEYAILTPETWPRWR 
GDEPRAS9TCFGRSTWSPTRPDGEHQGLCQEPRVAFPPGGGAKRKFD 
GFARTIQKAWRRTWLSGSTRRCGRKLPTSC 

3265 

A 

1 

4095 

MEFS I KQS PLS VQS WKC I KMKQAPE I LG SANGKTPSCEVNRECS VF 
LSKAQLSSSLQEGVMQKFNGHDAXPFIPADKLKDLTSRVFNGEPGAH 
DAKLR FESQEMKGIGTPPNTTP I KNGS PE I KLKI'TKTYMNGKPLFES 
SICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLEQGLVEAAL 
VSKISSPSDKKIPAKKESCPNTGRDKDHLLKYOTGDLVWSKVSGYPW 
WPCMVSADPLLHSYTKLKGQKKSARQYHVQFFGDAPERAWIFEKSLV 
AFEGEGQFEKLCQE S AKQAPTKAE K I KLLKP I S G KLRAQWEMG I VQA 
EEAASMSVEERKAKFTFLYVGDQLHLNPQVAKEAGIAAESLGEMAES 
SGVSEEAAENPKSVREEC I PMKRRRRAKLCS SAETLESHPDIGKSTP 
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QKTAEADPRRGVGSPPGRKKTTVSMPRSRKGDAASQFLVFCQKHRDE 
WAEHPDASG EE IBEltliRS QW S LLSEKQRARYNTKFALVAPVQAEED 
SGNVNGKKRNHTKRIQDPTEDAEAEDTPRKRLRTDKHSLRKRDTITD 
KTARTSSYKAMEAASSLKSQAATKNIiSDACKPLKKRNRASTAASSAL 
GFSKS S S PS ASLTENEVSDS PGDEPSESPYESADETQTE VS VSS KKS 
ERGVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLACLGLSRRPEGRF 
TCSECASGIHSCFVCKESKTDVKRCWTQCGKFYHEACVKKYPLTVF 
ESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYHSGDACL 
AAGCSVIASNSIICTAHFTARKGKRHHAHVNVSWCFVCSKGGSLLCC 
ESCPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDIIWVKLGNYRW 
WPAEVCHPKNVPPNIQKMKHEIGEFPVFFFGSKDYYWTHQARVFPYM 
EGDRGSRYQGVRGIGRVFKNALQEAEARFREIKLQREARETQESERK 
PPP YKHI KVNKPYGKVQI YTAD I S E I PKCNCKPTDENPCGFDSECLN 
RMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAKR 
DIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRII 
DAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 
TFNYNIJDCLGNEKTVCRCGASNCSGFLGDRPKTSTTLSSEEKGKKTK 
KKTRRRRAKG EG KRQS EDEC FRCGDGGQLVLCDRKFCTKA YHLS CLG 
LGKRPFGKWECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFSCT 
PDGRSYCCEHDLGAASVRSTKTEKPPPEPGKPKGKRRRRRGWRRVTE 
GK 

3266 

A 

26 

173 

RDTITDKTARTS S YKAMEAAS S LKSQAGNWSALS FLAS GCPGWAGW 

VG - 

3267 

A 

387 

1443 

PHRKQAEPPRHHERLGRRVRHHARHGRGSRPDTAAEAAGGCGDPRAF . 
QQLERRLRHP PLRWQGLLRRQRLbREEPRRSLLRPHRADAVQ * PGLP 
PDALPREAGWRQEGAPQHRRLQD YETCQLTWWPH* •* GAHPTTS AALN 
PTRPRGPETGRRTESSTRWNTWA*RRTSGCAEPASPTA/SQFAKFLQ 
RYAILTPETWPRWRGDERHGVQHLLRAVNMEPDQYQMGSTKVFVKNP 
/WAFPPGGGARAICVRWLCPNHPEGLAAPRGCPEVRGDA/RRKLPTS 
C*TRRSGGATASIGTSSGTTWGWRSGPSCVSSWARRSGWTSPIRSPS 
TTAASSPSSGT* S * RPSVCM* LGERR 

3268 

A 

1 

4496 

MEGNVNAHRASGAQIRTCRAPPGATGCTALFKANSPEFTGLKSAQRG 
RGPWGAPRKRFSGLRGHREAERRTLGWGSOAPPPAAQAHTPAAGDS 
TLLLRSAVPGTCLLYAPHPQDFPPAGFLREQWWPRLLTLQVFLHLRT 
GGRVGPVDWVLLGGGRLIjGGHRMPGPC^LAEAVIjLREARWPWGEAGG 
S EEGE PS GG S VGGNCQRARPL PS AH P EAQQGVGLEVGRGMVAPAAS Y 
FADEGRPNGDHPARQPRLGPLGAHGRRHLKSFPDTGSLNIiELEEQNP 
NVQDILGELREKVGECEASAMLPLECQYXNKNAIiTTIAGPIiTPPVKH 
FQLKRKPKSATLRAELLQKWPCRSGVHANVHISTAAQRVMTKHLQRA 
DEHPHFC VEEWHSGL I S AFY YCEKHLAENH I LDR 1 1 DAGP KGNYS RF 
MNHSCQPNCETLKWTVNGDTRVGLFAVCTIPAGTELTFimrnDCLGN 
EKTVCRCGASNCSGFLGDRPKTSTTLSSEEKGKKTKKKTRRRRAKGE 
GKRQSEDECFRCGDGGQliVLCDRKFCTKAYHLSCLGLGKRPSPDHIP 
LKNGFVASCLLFCGDCGLQTPAAPYCCTSGVLHLSTVDISGRIAIiliR 
GHPVRCRRCS S IHDRNPSES WKC I KMKQAPE I LGS ANGKTPSCEVN 
RECSVFLSKAQLSSSLQEGVMQKFNGHDALPFIPADKLKDLTSRVFN 
GEPGAHDAKLRFESQEMKGIGTPPNTTPIKNGSPEIKLKITKTYMNG 
KPLFESSICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLEQG 
LVEAALVSKI SSPSDKJCIPAKKESCPNTGRDKDHLLKYNVGDLVWSK 
VSGYPWWPCMVSADPLLHSYTKLKGQKKSARQYHV\RFFVYAPERAW 
IFEKSLVAFEGEGQFEKLCQESAKQAPTKAEKIKLLKPISGKLRAQW 
EMGIVQAEEAASMSVEERKAKFTFLYVGDQLHIiNPQVAKEAGIAAES 
LGEMAESS * VSEEAAENPKSVREECI PMKRRRRAKLCSSAETLESHP 
DIGKSTPQKTAEADPRRGVGSPPGRKKTTVSMPRSRKGDAASQFLVF 
CQKHRDESGGRDSAFTCRS PQNYPVEEPASELVGAEPVSRLS KGLW 
AEH PDAS GEE I EELLR S QWSLLS EKQRARYNTKF ALVAP VQAEED S A 
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PLKGIPKQAPFRSPTAPSVFSPTGNRTPIPPSRTLLRKERGVKAGLC 
LS WQLLDI SELD MVGAGREAKRRRKTLDAE WEKPAKEETWBNATP 
DYAAGLVSTQKLGSLNNEPALPSTSYLPSTPSWPASSYIPSSETPP 
APS SREASRP PEEPS APSPTLPAQFKQRAPMYNSGLSPATPTPAAPT 
SPLTPTTPPAVAPTTQTPPVAMVAPQTQAPAQQQPKKNLSLTREQMF 
AAQEMFKTANKVTRPEKALILGFMAGSRENPCQEQGDVIQIKLSEHT 
EDLPKADGQGSTTMLVDTVFEMNYATGQWTRFKKYKPMTNVS 

3269 

A 

911 

2827 

ERKHLGWMEFS I KQS PLS VQS WKCIKMKQAPE IL\GS ANGKTPSCE 
VNRECS VFLS KAQL S S S LQEG VMQKFNGHDALPFI PADKLKDLTSR V 
FNGEPGAHDAKLRFESQEMKGIGTPPNTTPIKNGSPEIKLKITKTYM 
NGKPLFES S I CGDS AADVS QSEENGQKPENKARRNRKRS I KYDSLLE 
QGLVEAALVS KISSPSDKKI PAKKE S CPNTGRDKDHLLKYNVGDLVW 
SKVSGYPWWPCMVSADPLLHSYTKLKGQKKSARQYHVQFFGDAPERA 

W J. r lilVolj VAr iluJluyc JiAJLit-yiioAA.vAr' X JSAJiJVJLi\XiIjiS^i&v»ISJjrCA\J 
WFMO T VO AEE AA S M S VEPR KA KPTFT.YVnnnT .HT.NPOVAKRAG I AAE 
SLGEMAESSGVSEEAAENPKSVREECIPMKRRRRAKLCSSAETLESH 
PDIGKSTPOKTAEADP /RRRGVGS PP\ GRKKTT\ VSMPRSRKGDAAS 
QFLVFCQKHRDEWAEHPDASGEEIEELLRSQWSLLSEKQRARYNTK 
FALVAPVQAEEDSGNVNGKKRNHTKRI QDPSENA\DLEDTPRKGLRT 
DKH S LR KRDT I TDKTARTS S YKAME AAS S LKS QAATKNL SDACKPLK 
KRNRASTAASSALGFSKSSSPSASLTENEVK 

3270 

A 

3 

114 

SCSGDHAGVKALVALSFSGAIGLTFLMLGCALEDYG 

ion 

.5 Z / -L 

A 

A 


A "7 T 
*i / -L 

fi FNP p T.VFO T KGG A P A WT .AfiN A V T F T .T T Of? F FT . T FG R C3 DD FS WE O 

w 

3272 

A 

1 

190 

GTSRSGLGRLPGPWQEAGSSRGPSSGDMAGVKALVALSFSGAIGLTF 
LMLGCALEDYGCTSLK 

3273 

A 

2 

479 

SGLGRLPGPWQEAGSSRGPSSGDMAGVKALVALSFSGAIGLTF/LHM 

T fZr^AT 'PriVf3\7VWDT.17\7T \ FITTTTA T CDT Dtf TTT 7A VDITTVnCri AT*\ QQAPW 
JjIjCAXjU. JJ X vjr V I WJrJur VXj \ x* rrlAJLor'XJrJrir aAxVKVX 1 L/oUHl \ uortLK 

ELAYFFTTG I VVS CLW I S PVI LARVAL I K\ WGAC \ AFVL \ AGNAVI F 
PYNSRGFSLYLGRGDDFSW\EQW 

3274 

A 

3 

586 

ARAMGISRDNLHKRRKTGGKRKPYHKKRKYELGRPAANT\KIGPRRI 
HTVRVRGGNKKYRALRLDVGNFSWGSECCTRKTRIIDIVYNASNNEL 
VRTKTLVKNCIVLIDSTPYRQ/WTPEEEEILNKKRSKKIQKKYDERK 
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NTKI LAPRR I QHS P VCRGG * QVNTVPLRFD \ VGNFS WG/ SKECCTRK 
TR\ I IDWYNAI * * PSWVRYPRPLVERICI \VLIDEQHPY\RQWVRS 
PTY AL\ PLGPQRKGAKLDS LRKEE I FKPKTDLK* IQKKYD * / ERKKN 
CQNSASLPGRSSFQQGKASLRCIAFK/RPGQC\GRA\DGYMTRGHRV 
RSYYLRKIKARQSKINT 

3276 

B 

48 

209 

XKNQCETRTMQENGYSSHAVDGTGPAGGAGRPAGSTGAQVSVQPNFQ 
QDKFLGRX* 

2 z / / 

A 
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TRTMLLQPAGSLGSYSYRSPREWGLHRPPGPSl^ATLAGTTLGQPPA 
AEIHGVGGDGCPTSVRGKGQAWA 
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SGQHRRPRSPCQPNFQQDKFLGRWFKRG\LASNSSWLREKKAALSMC 
KS WAPAT\DGG\ FNLTSTFLQEKTSVETRTML\ LQPRGVPSASLQL 
TGVPHWGQA/ HYS VS WETDYDQYALLY\ TRASKGPGEDFRMATLYS 
RTQTP\RAELKEKFTAFCKAQGFTEDTIVFLPQTDKCMTEQ 

3279 

A 

154 

522 

HVACGLLWI YVS P S AHLNLDGTI TTK/ ENLGTVNEI LLGSNPTEAEL 
QDMINEVMSDGNGTIDFP\EFLTMMAJRKMK\DTDSEGRKL\EEAF\R 
VFGLRVGNGLYLVACRNFRHVDGQTLGGGSLPD 

32B0 

A 

1 

1446 

MELEQLMTL I S AARE YE I E F I Y A I S PGLD I TFSNPKE VS TLKRKLDQ 
TLTSSHLNIPEKRMSVKGGGDFKQGDQEDLVDNFGSEHTPSGPSLGN 
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SGTGCS PWASAES EKEVRGMATGTARNQKWtiGVRDCAS PLMGGAGTG 
KobFAJLiCV&vjrAisr ££QS YSriAAGCELSGHTRSF TF KVkEEDDAJEHVL 
ALTMLCLTEGAKDECWVEWARNHDHQBIAVPVANLKLSCQPMLSL 
DDFQLQPPVTFRLKSGSGPVRITGRHQIGHPPGPTKKALKQRPLKL.I, 
PCCGPQALPSVSESKCLSCASGGGARCVHSVDDEFELSTVCHRPEGL 
EQLQEQTKFTRKELQVLYRGFKNECPSGIVNEENFKQIYSQFFLKED 
FVAGLSVILRGTVDDRLNWAFNLYDLNKDGCITKEEMLDIMKSIYDM 
MGKYTYPALREEAPREHVESFFQKMDRNKDGWTIEEFIESCQKDEN 
I MRSMQL FDNV I 

3281 

A 

774 

3564 . 

VLSKGNERSQPRSPLRLLAPQLKAEAAADKGLAPVPPPFSSGHSGPC 
\ EREGEGQRGRGRSRRGAHLELKPS PGLRAGAPTDRGRGGPAEVAAA 
GGRRMVQKESQATLEERESELSSNPAASAGASLEPPAAPAPGEDNPA 
GAGGAAVAGAAGGARRFLCG WEEQLMTL I SAAREYE IEFIYAISPG 
LDITFSNPKEVSTLKRKLDQVSQFGCRSFALLFDDIDHNMCAADKEV 
FSSFAHAQVSITNEIYQYLGEPETFLFCPTEYCGTFCYPNVSQSPYL 
RTVGEKLLPGIEVLWTGPKWSKEIPVESIEEVSKIIKRAPVIWDNI 
HANDYDQKRLFLGPYKGRSTELIPRLKGVLTOPNCEFEANYVAIHTL 
ATWYKSNMNGVRKDWMTDSEDSTVSIQIKLENEGSDEDIETDVLYS 
PQMALKLALTEWLQEFGVPHQYSSRQVAHSGAKASVVDGTPLVAAPS 
I^ATTVVTTVYQEPIMSQGAALSGEPTTLTKEEEKKQPDEEPMDMVV 
EKQEETDHKNDNQILSEIVEAKMAEELKPMDTDKESIAESKSPEMSM 

LFYLPYEHGPKGAQMLREFQWLRANSSWSVNCKGKDSEKIEEWRSR 
AAKFEEMCGLVMGMFTRLSNCANRT I LYDMYSYVWD I KS IMSMVKS F 
VQWLEDEDGICGYALGTVDVTPFIKKCKISWIPFMQEKYTKPNGDKE 
LS EAEDCAS PLMGGAGTGRGS PALCVSGAEFEEGS YSMAAGCELSGH 

IAVPVANLKLSCQPMLSLDDFQLQPPVTFRLKSGSGPVRITGRHQIV 
TMSNDVSEEESEEEEEDSDEEEVELCPILPAKKQGGRP 

3282 

A 

51 

672 

VLSKGNERSQPRSPLRLLAPQLKAEAAADKGLAPVPPPFSSGHSGPC 
\ EREGEGQRGRGRSRRGAHLELKPS PGLRAGAPTDRGRGGPAEVAAA 
GGRRMVQKESQATLEERESELSSNPAASAGASLEPPAAPAPGEDNPA 
GAGGAAVAGAAGGARRFLCG WEEQLMTL I SAAREYE I EF I YAI SPG 
LDITFSNPKEVSTLKRKLDQ 

3283 

B 

506 

2416 

MNPSAPSYPMASLYVGDLHPDVTEAMLYEKFSPAGPILSIRVCRDMI 
TRRSLGYAYVNFQQPADAERALDTMNFDVIKGKPVRIMWSQRDPSLR 
KSGV GN I F I KNLD KS I DNKAL YD TF S AFGN I LS CKWCDENG SKG YG 
FVHFETQEAAERAIEKMNGMLLNDRKVFVGRFKSRKEREAELGARAK 
EFTNVYIKNFGEDMDDERLKDLFGKFGPALSVKVMTDESGKSKGFGF 
VSFERHEDAQKAVDEMNGKELNGKQIYVGRAQKKVERQTELKRKFEQ 

GRSKGFG FVCFS S PEEATKAVTEMNGR I VATKPLYVALAQRKEERQA 
HLTNQYMQRMAS VRAVPN P V I NP YQPAPPS G YFMAAI PQTQNRAAYY 
PPSQIAQLRPSPRWTAQGARPHPFQNMPGAIRPAAPRPPFSTMRPAS 
SQVPRVMSTQRVANTSTQTMGPRPAAAAAAATPAVRTVPQYKYAAGV 
RNPQQHI^AQPQVTMQQPAVHVQGQEPLTASMLASAPPQEQKQMLGE 
RLFPLIQAMHPTLAGKITGMLLEIDNSELLHMLESPESLRSKVDEAV 
AVLQAHQ AKEAAQKAVNS ATGVPTV * 

3284 

A 

183 

3080 

PRRCSTGNSGRPKIIRRAAEAENEDRTLGRLLPGNERSQPRSPLRLL 
APQLKAEAAVDKGLARVPPPFS SGHS GPC\ EREGEGQRGRGRSRRGA 
HLELKPFSGL\RAGAP\TDR\GRGGPAEVAAAGGRRMVQKESQATLE 
ERES ELS SNP AAS AGASLEP PAAPAPGEDNPAGAGGAAVAGAAGGAR 
RFLCGVVEGFYGRPWVMEQRKELFRRLQKWELNTYLYAPKDDYKHRM 
FWREMYSVEEAEQLMTLISAAREYEIEFIYAISPGLDITFSNPKEVS 
TLKRKLDQVSQ FGCRS F ALLFDDI DHNMCAADKEVFS S FAHAQV\ S I 
TNEIYQYLGEPETFLFCPTEYCGTFCYPNVSQSPYLRTVGEKLLPGI 
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EVLWTGPKWSKEIPVES IEEVSKI IKRAPVIWDNIHANDYDQKRLF 
LGPYKGRSTE \ VI PRLKGVLTNPNCEFEANYVAIHTLATWYKSNMNG 
\ VRKD WMTDS EDS TVS I Q I KLENEG SDED I ETD VL YS PQMALKLAI* 
TEWLQEFGVPHQ YS S RQVAHSGAKAS VVDGTPLVAAPS LNATTVVTT 
VYQEPIMSQGAAIiSGEPTTLTKEEEKKQPDEEPMDMVVEKQEETDHK 
KDNQI LSEI VEAKMAEELKPMDTDKES I AESKSPEMSMQEDCI SDI A 
PMQTDEQTNKEQFVPGPNEKPLYTAEPVTLEDLQLLADLFYLPYEHG 
PKGAQMLREFQWLRANSSWSVNCKGKDSEKIEEWRSRAAKFEEMCG 
LVMGMFTRLSNCA\ GRTI LYDM YS YVWD I KS IMSMVKS FVQWLGCRS 
HS SAQFL I GDQEPWAFRGGLAGEFQRLLP IDGANDLFFQPPPLTPTS 
KVYTIRPYFPKDEASVYKICREMYDDGVGLPFQSQPDLIGDKLVGGL 
LSLSLDYCFVLEDEDGI CGYALGTVDVTPF I KKCKI S WI PFMQEKYT 
KPNGDKELSEAEKIMLSFHEEQEVLPETFPC 

3285 

A 

3 

452 

QTQREPTMVLSP7VDKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHFDIiSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDL 
HAHKLRVDPWFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 
TVLTSKYR 

3286 

A 

26 

526 

NSTDSERTHPWLLS PADKTTVK/ APAWGKVGAHAGEYG\ SEALERMF 
LSFPTTKTYFPHFPLSHGFCPGLRATGKKVADALTKRRGAPLDDMPN 
ALVRPL\SDLHAHKL\RVGPGSTFKLLKPLACL\LTLGPAHLPRPSF 
TPGGCKASLGQS FLGFLLKHRCLNLPNYR 

3287 

C 

410 

484 

MRTRGSAXXFXSXCXSXAGXQFXXS 

3288 

A 

3 

170 

IRGSWLSLNFLLNTFFTP*RSFISTSVMF*KPFIFSFIiMLLL*VFI 
FSLKILSY 

3289 

A 

831 

1559 

I AWAFKINWLP I I FL I FS VLFYPI FGF I FF YLL YFSNTCLSLFFHFL 
SETLDNIFIFYLVSIFQFSSKFVHFAISFMFPSLSFFFCFILSRFKF 
I FFS SKKYVF I FLI SLSF\ I FF 

3290 

A 

253 

414 

VRSLVISMTCKTAVSFLTSRVCSNRHGLIRKYGLNMCRQCFRQYAKD 
IGFIKLD 

3291 

A 

3 

225 

CTAESKMGHQQLYWSHPRKFGQGSRSCRVCSNRHGLIRKYGLNMCRQ 
CFRQYAKDIGFIKKDLSCLPWHCLWR 

3292 

A 

218 

602 

S FLIjLPRCTAEKQRWGHQ\ QL YWSHP\RKFGQG\ SRS \ CRVCSNNRH 
GL\IRKYGLNMCRQCFRQYAKDIRFHLSWTKCSSFRGLSGASTSMKK 
PWINSFVYI I IHFEKATEKKRKKNE IKRGPFCFKGPL 

3293 

A 

33 

494 

QPQTDT^^raLTPEEKTAVNAI J WGKVNVDAVGGEALGRLLV^n^PWTQR 

FFESFGDLSSPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFSQ 

LSELHCDKLHTOPENFRLLGNVLVCVLA^ 

AGVANALAHKYH 

3294 

A 

1 

609 

PLKRSDGCNDGRPTRPPTRPDTTVFTSNLKQTRMVHLTPEEKSAVTA 
LWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKV 
KAHS\KKGLRGAFSDGLiAHIi\DNLKGTFATL\SEL\HCDKAAPWIPE 
ELQAPWATCLVCVAWP I TFGKR I STP PVAGLPNQENWIiAWCWLNALG 
PTSNHLSLAFLAGPIStf 

3295 

A 

8 

519 

SAQMAVTTADPRVRPRVRTQLCSLASLIQTLLVHLT\PEEKSAVTAL 
WGKVNVDE \ VGG KAI/GRLLWYPWTQRFL\ ES FGDLSTPDAVMGNPK 
VKAHS\ KKVLG\AFSGGPGCTWDNLKGTFAHTEVSLHCDK\ LHRGSL 
KNFRLLGQRAWSWAGPIHFWQKNFNPTSCRLA 

3296 

A 

3 

333 

GRGGPAAEPCSGEAPPPCFPWPSRGRARTRPSRAGSPPLDSCDPHRT 
PTRARRMMKLKSNQTRTYDGDGYKKQASYFETLRQGYSANNGTPWA 
TTYSVSAQSSMSGIR 

3297 

A 

35 

717 

RRSSPSLLPLAERGGARARGRPERAPHPSTPATRTAPPPWARRMMKL 
KSNQTRTYDGDG YKKRAACLCFRS ES EE \ EVL.LVS \ S S R \ HPDRW IV 
PWKEGMEARRKEAKCGKQVREVCEGRLGVKGT\ LGRLVG I F \ ENQER 
KH\RTYV\YVLIVTE\VLEDWEDSVNIGRKREWFKIEDAIKVLQYHK 
PVQASYFETLRQGYSANNGTP\W\ATTYSVSGFRASMFRAFRWT 

3298 

A 

81 

426 

PSFHYAVLPLHNCLGFFPSIiRHRWLHSMTDDPPTTKPLTARKFIWTN 
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HKFNVTG\PQNNMYLILPLERRFRSGSHLQHLTSKDNEEQLKHAKYG 
AFHVITLLLFTIHYNSQLKLCD 

3299 

A 

180 

484 

RQQAIFWHRWLHSMTDDPPTTKPLTARKFIWTNHKFNVTG\PQNNMY 
Ii I LPLERRFRSGSHLQHliTS KDNEEQLKHAKYGAFHVT TLLLFTIHY 
NSQLKLCD 

3300 

A 

591 

942 

MAKTHFWDVDGSMVPPEWHRWLH\ SMTDDPPTTKPLTA\RKF\ IWDG 
T I KFQRGLGH PRNNM \ YL I LPLE \ RRFRSG SHLQHLTS KDNEEQLKH 
AKYGAFHVI TLLL FT IH YNSQLKLCD 

3301 

A 

1 

508 

NFALEAKN SARA I S S YVQTPMGH FTRGG PRLT I TS L WGK\ VNVE \ DA 
GGETPGKGSLWYP\WT\QRFFDSFGNLSSAFCPSWPTPKVKAHGK\ 
K\HLiT\SLGDAHKSTWDDLKGHLLPKPEVNCTVDKPAMWDPENFKAPG 
EMLLVTRFGQSLFRQKNFTPEGCRASLGKKDG 

3302 

A 

3 

452 

QTQREPTMVLSPABKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHFDLSHGSAQVKGHGKKVADAIiTNAVAHVDDMPNALSALSDL 
HAH KLRVD P VNF KLLSHCLL VTLAAHL P AE FTPAVHAS LDKFLAS VS 
TVLTSKYR 

3303 

A 

1 

627 

TLLVPQDSERTHPWLLSPADK\TNVKA\AWGKVGAHAVRSMCAEALE 
RMFLSFPTTKTYFPHF\DLSHG\SAQVKGATGKKVADALTKRRGAPL 
DDMPNAL/SSA1jEATLHAHKL\RVGPGSTSKLLKPLACLVDPGPAHL 
PRPSSTPGGCNVFPGTKFPGLFVEAPLLEPSKLPLKLGSLRLAIVFL 
PLWGFPPAPPPLS CTRTPWFE I KS 

3304 

A 

131 

406 

EAMGILKLQVFLIVLSVALNHLKATPIESHQVEKRKCNTATCATQRL 
ANFL\VHSSNNLGGILSSTNVGSNTYGKRNAVETVLKREPIjNYLPL 

3305 

A 

154 

1702 

I GHRDPARGRS CRCS G YYS RMVCEKLAPQSEMAS AG \ VSLRATI LCL 
IJVWAGLAAGDRVYIHPFHLVIHNESTCEQLAEANAGKPKDPTFIPAP 
IQAKTSPVDEKALQDQLVLVAAKLDTEDKIJ^AAMV\GMLANF\LGFP 
YYMGMHS E \ 1»WG V \ VHG \ ATVLS PTAVFGTLAS L YLGALDHTADRLQ 
AI LGVPWKDKNCTSRLDAHKVLS AL \ QAVTGLLVAPGRADKQA\ QLL 
\LSTWGVFTAPGLHLKQPFVQGL7UiYTPWLPRSLDFTELDVAAET 
ID\RLMQAVTGWKTGCSLTGAKADSTLAFNTYVHFQGKMKGFSLLAE 
PQ\EFWVDNSTSVSVPMLSG\MGTFQHWSDIQ\DNFSVTQVPFTD\S 

afllliqphyasdl\dkvegltfqqn\sf1wmrklf\prtihltmpq 
lvlqgsyxdiiqdllrpgsscppflhtelnlgrisgnxdrirvgevian 
siff\eleadereptestqqlnkp\evle\vpltrpf\lfavy\dqg 
atalalswgrvgkpa 

3306 

C 

223 

417 

MHVFLKLS VGVTQMVL I S EAE KLTGXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXQAGRHAGKQG 

3307 

A 

3 

742 

HEASCRSERRRQMADFDTYDDRAYSSFGGGRGSRGS\AGGHGSRSQK 
ELPTEPPYTAYVGNLPF\NTVQGDIDAIFKDLSIRSVRLVRDKDTDK 
FKGFC YVEFDEVDS LK\ EALTYD \ GALLGDRSL\ RVDI AEGRKQDKG 
GF\GF\RKGGPDDR\GFRDDFLGGRG/GLSRPGDRRTGPPMGSR\FR 
DGPPLRG\SNMDFREPTEEERAQRTTTPAIjKPRTVATPIjNQVANPNS 
S\ IF\GGARPREE wqkeqe 

3308 

A 

1 

861 

MVPSEGLWRGLREMLDQSNQWGGTALWPAFEIRRARRMPMNKNELV 
QLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVYFQAR 
LAGAGNS TAR I LS TNHTGPEDAFNFNNWVTLLSQLPLLLFTLLNS FL 
YQCVPETMAKLRPREERILSEAPQWLLGHGEGSLPKARTVPEASSSH 
HVPTGEGAGADEFLPLLSLVLAHCDLPELLLEAEYMSELLEPSLLTG 
EGGYYLTSLSASLALLSGLGQAHTLPLSPVQELRRSLSLWEQRRLPA 

3309 

A 

1 

2076 

MVKAQLRTAI^SGGVIjDASGDYRVYRGLLKTTMDPNDVI LlATHASVD 
l^LHLSGLLERWEGPLSVSVFAATKEEAQI^TVLAYALSSHCPDMRA 
RVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGIN 
YALGTNVSYPNNLLRNI^REGANYALVIDVDMVPSEGLWRGLREMLD 
QSNQWGGTALWPAFEIRRARRMPMNKNEIiVQLYQVGEVRPFYYGLC 
TPCQAPTNYSRWVNLPEESLLRPAYWPWQDPWEPFYVAGGKVPTFD 
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ERFRQYGFNRISQACBLHVAGFDFEVLNEGFLVHKGFKEALKFHPQK 
EAENQHNKI L YRQFKQELKANWLR VS PELGVGGSGPAS PLARRGEMA 
ADEGHRLQSTEPSGPSLQAEGLRECRPVTMETGPRYPSMPRDQTVQL 
VQMPVQPPSKDTEEMEAEGDSAAEMNGEEEESEEERSGSQTESEEES 
SEMDDEDYERRRSECVSEMLDLEKQFSELKEKLFRERLSQLRLRLEE 
VGAERAPEYTEPLGGLQRSLKIRIQVAGIYKGFCLDVIRNKYECELQ 
GAKQHLESEKLLLYDTLQGELQERIQRLEEDRQSLDLSSGPYIVYML 
QEIDILEDWTAIKKPGGPSEQLALHPGFSSSSCRQADPQAPQGLPSQ 
APWLLGPPTPSGTGLDSSSALLEACTAVAVELT 

3310 

A 

34 

657 

SDGRCGRCC APRAPRAMHMS FAIS/ RCAFYQLLLAALMLVAMLQLLY 
LSLLSGLHGQEEQDQYFEFFPPSPRSVDQ\VKAHVRTALASGGVLDV 
SGD\ YRV\ YRGLLKTTM \DPNDV I L\ ATHAS VEQTCLHL \ SGLL\ ER 
W \ EGPAVPCS VFAA\ TKEE\ AQLATVIiAYALS SHC PDMP AR\ VAMHL 
VCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTN 
VSYPNNLLRNLAREGAOTALVIDVD1WP\TEGLWRGLREMLDQSNQW 
GGTALWPAFE IRRARRMPMNKNE\ LVQLYQVGEVRPFY YGLCTPCQ 
APTNYSRWFNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPNFRRSGF 
RQYGFNRISQACELHVGGFDFEVL\NEGFLVHKGFKEALKFHPQKEA 
ENQHNKI L YRQFKQB \ LKAKYPNS \ PRRRAQLATVLAYALS SHCPDM 
PARGRHAPRVPLALRGSRARPPGAGGVCPAAVLPGGL 

3311 

A 

160 

441 

PPGRTAILSRMKIFLPVLLAALLGVERASSLMCFSCLNQKSNLYCLK 
PTI CSDQDNYCVTVSAS AG I VNLVTFGHSLSKTCFPGCP I PESGNVG 

3312 

A 

107 

691 

RTAILSRMKIFLPVLLAALIX3VERASSL\MCFSCLNQKSNLY\CLKP 
TICSDQ\DNY\CVTVSAS\AGIG\NLVTFG\HSL\SKTCFPCLAPFP 
EGRSMLGV\ASMGHS AFCQS FLVAI FSCGPMAGLRGKRS PLLGARAC 
CLSLAGRALL\RFGPLDRPEPCSPDPPAQEGKPSPFWIPQCMGAPDS 
SRALI CALGPRSG 

3 313 

A 

388 

525 

PSLLQVAKQEKXK^TGRAKRRMQYNRRFVim^PTFGKXKGPNANS 

3314 

A 

194 

1131 

PQHGGHFPRKIKSCSWQARPLEDEATLGQCGVEALTTLEVTRPACLE 
VKSMVPWPVLEKVRGQTPKVAKHGEKKKKJCTGRAKRRMQYimRFVNV 
VPTFGKKKGTTFTKI FVGGLPYHTTDASLRKYFEGFGDI EEAW ITD 
RQTGKSRGYGFVTMADRAAAERACKDPNPIIDGRKANVNLAYLGAKP 
WCLQTGFAIGVQQLHPTLIQRTYGLTPNYMYPPAIVQATVVIPAAPV 
PSLSS PY I EYTPASPAYAQYPPATYDQYPY \ AASPAT\ VRSFVGYSY 
PAAVPQALSAAAPAGTTFLQYQAPHVQPDRMH 

3315 

A 

1 

407 

DAERQEALG I VRR IGTOTEAATE PAGATVPAAAAAAR I GTVGPQPPA 
MPRRKRNAGSSSDGTEDSDFSTDLEHTDSSESDGTSRRSARVTRSSA 
RLSQSSSRISRSCSKSGSLLALRSLLTLPEE*PWSSSLPQ 

3316 

B 

78 

316 

XSYLLGQWPRDADGAFTCCTNDKATQTPLSWQELEGERASSCAHKRS 
ASWGSTDHRKEISKLKQQLQRTKLSRSGKEKERX* 

3317 

A 

1 

2319 j 

GHPAFLEDGSPSPVLAFAASPRPNHSYIFKREPPEGCEKVRVFEEAT 
TRG KYGEGAKQETFTFALTL VF I QCVINAVFAKI L I QF FDTARVDRT 
RS WLYAACS I S YLG AMVS SNS ALQFVNY PTQVLGKSCKP I PVMLLGV 
TLLKKKYPLAKYLCVLL I VAGVALFMYKPKKVVG I EEHTVG YGELLL 
LL S liTUD GliTGVS QDHMRAHYQTG SNHMMLN I NLWS TLLLGMAVS CP 
DQGPELVPRCPFVQALEKPSWKNLHQQDLFATCYWHNEGESCVSCHG 
KTLSSKTQGGILFTGELWEFLSFAERYPAIIYNILLFGLTSALGQSF 
I FMTWYFGPLTCS 1 1 TTTRKFFT I LAS VI LFANP I S PMQWVGTVLV 
FLGSTQI ILDNLKILNTSAKTLLYKTPI.SWQELEGERAS SCAHKRS A 
SWGSTDHRKEISKLKQQLQRTKLSRSGKEKERGSPLLGDH/VSAGST 
EGVPS / RASPQGPLS CDS APACT/ VS LEGLNQELEEVFVKEQGEEEL 
LR I LD I PDGHRAP AP PQSGS CDH PLLLLEPGNLAS S P SM SLAS PQPC 
GLASHEEHRGAAEELAS TPNDKAS S PGHPAFLEDG S PS P VLAFAAS P 
RPNHSYIFKREPPEGCEKVRVFEEATTRGKYGEGAKQETFTFALTL.V 
F I QCVINAVFAKI LI QFFDTAR VDRTRS WLYAACS I S YLG AMVS SNS 
ALQFVNYPTQVLGKSCKPIPVMLLGVTLLKKKYPLAKYLCVLLIVAG 
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VALFMYKP KKWG I EEHTVG YGELLLLLS LTLDGLTG VS QDHMRAHY 

QTGSNHMMIjN inlwstlllgmavs cpdqg pelvprcp fvqalekp s w 
knlhqqdl f atc ywhnege s cvs chgktls s ktqggi lftgelwe fi# 
sfaerypaiiynillfgltsalgqsfifmtvvyfgpltcsiitttrk 

FFTILASVILFANPISPMQWGTVLVFLGSTQIILDNLKIIiNTSAKT 
LL YKTPLS WQELEGERAS S CAHKRS AS WG STDHRKE I SKLKQQLQRT 
KLSRSGKEKERGSPLLGDHAVRGALRASPPSFPSGSPVLRLSPCLHR 
SLEGLNQELEEVFVKEQGEEELLRILDIPDGHRAPAPPQSGSCDHPL 
LLLEPGNLASSPSMSLASPQPCGLASHEEHRGAAEELASTPNDKASS 
PGHPAFLEDGSPSPVLAFAASPRPNHSYIFKREPPEGCEKVRVFEEA 
THQRRDS FD I RE KVAELRKKRNSGLS KEQKE KYMEHRQT YGNTRE PL, 
LENLTSEYDLDLFRRAQARASEDLBKLRLQGQITEGSNMIKTIAFGR 
YELDTWYHSPYPEEYARLGRLYMCBFCLKYMKSQTILRRHMDGSGSL 

3318 

A 

845 

1343 

LSLGDSAQCLLPHASWCQVAGHPAFLEDGSPSPVLAFAASPRPNHSY 
IFKxfepPEGCEKVRVFEEATS/RRVLTGPFLTSCPDK\NKVHFQP\T 
GS AFCP VNTLMKPLFPGMGF I FRNCPSNPG I S S S PGQPQATTSEGSGS 
LQGLPTAIRAMAVAPHLQMSLCFSRAP 

3319 

A 

44 

499 

GGILEELERRRKEEEERRRAEEEKRRVEREQECRWREMEEHRQAERL 
QRQLQQEQAYLLSLQHDHRRPHPQHSQQPPPPQQERSKPSFHAPEPK 
AHYEPADRAREVPVRTTSRSPVLSRRDSPliQGSGQQNSQAGQRNSTS 
IEPRLLWERV 

3320 

A 

1 

164 

SGERFRVRSSSKSEGSPSQRLENAVKKPEDKKEVFRPLKPAVRIVQD 
QFYLFQT 

3321 

A 

2 

439 

ARAARVFLHRSSLNLSNGETESVKTMIVHDDVESEPAMTPSKEGTLI 
VRQTQSASSTLQKHKSSSSFTPFIDPRLLQISPSSGTTVTSWGFSC 
DGMRPE AI RQDPTRKGS WNVNPTNI RPQSDPPE IRKSGKG * L»* ILG 
AGLW 

3322 

A 

3 

174 

RAVEDVRPSHKVTDYSSSSEESGTTDEEDDDVEQEGAPESTSGPEDT 
RAASVPGLF 

3323 

A 

2 

4978 

RRTSAVS RGLLFM S FVQPLLQPP VHLLTARVRVPG YLQ I DGVKI KEH 
NGKLMNCFKTKMTYYSFMKCVGLPAQLTIGFSGPSRETGSDPRAARR 
GEDPQSKDRGDGLSEQTEAQRSGVHCPRSQLVGELAGQSPRTIPAAL 
RVCLYLLCITSYNHTSKDESSKDEEEEIKLEINMLKKYSHHRNIATY 
YGAFIKKSPPGHDDQLWLVMEFCGAGSITDLVKNTKGNTLKEDWIAY 
ISREILRGLAHLHIHHVIHRDIKGQNVLLTENAEVKLDHHHHPIGWD 
AEVTEKSAEVPSALGSLKALKVVVHSKGPLLVTVGEAGEAARSRPHR 
S FQTVDFGVS AQLDRTVGRRNTFIGTPYWMAPEV IACDENPDATYD Y 
RSDLWSCGITAIEMAEGAPPLCDMHPMRALFLIPRNPPPRIiKSKKWS 
KKFFSFIEGCLVKNYMQRPSTEQLLKHPFIRDQPNERQVRIQLKDHI 
DRTRKKRGEKDETEYEYSGSEEEEEEVPEQEGEPSSIVNVPGESTLR 
RDFLRLQQENKERSEALRRQQLLQEQQLREQEEYKRQLLAERQKRIE 
QQKEQRRRLEEKLLEDVFQQSKKEDDKDSREVCGQKYRMTGKQSVSK 
NKERIPDFFNCVERSVMQQRREREARRQQEREQRRREQEEKRRLEEL 
ERRRKEEEERRRAEEEKRRVEREQEYIRRQLEEEQRHLEVLQQQLIiQ 
EQAMLLECRWREMEEHRQAERLQRQLQQEQAYLLSLQHDHRRPHPQH 
SQQPPPPQQERSKPSFHAPEPKAHYEPADRAREVEDRFRKTNHSSPE 
AQSKQTGRVLEPPVPSRSESFSNGNSESVHPALQRPAEPQDPCPPSR 
SEVLSQSSDSKSEAPDPTQKAWSRSDSDEVPPRVPVRTTSRSPVLSR 
RDSPLQGSGQQNSQAGQRNSTSSIEPRIiLWERVEKLVPRPGSGSSSG 
SSNSGSQPGSHPGSQSGSGERFRVRSSSKSEGSPSQRLENAVKKPED 
KKEVFRPLKPAGEVDLTALAKELRAVEDVRPPHKVTDYSSSSEESGT 
T\DEEDDDVEQEGADESTSGPEDTRAASSLNIjSNGETESVKTMIVHD 
DVESEPAMTPSKEGTLITOQSTVDQKRASHHESNGFAGRIHLLPDLL 
QQSHSSSTSSTSSSPSSSQPTPTMSPQTPQDKLTANETQSASSTLQK 
HKS S S S FTPF IDPRLLQ I S PS SGTTVTS WGFS CDGMRPEAIRQDPT 
RKGS WNVNPTNTRPQSDTPE I RKYKKRFNSE I LCAALWGVNLLVGT 
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ESGLMLLDRSGQGKTOPLINRRRFQQMDVL^^ 
VYYLSWLRNKILHNDPEVEKKQGWTT^ 

VI AIiKS S VEVYAWAPKP YHKFMAFKS FGELVHKPLL VDLTVEEGQRL 
KVIYGSCAGFHAVDVDSGSVYDIYLPTHIQCSIKPHAIIILPNTDGM 
ELLVCY EDEGVYVNTYGRITKDVVLQWGEMPTSVAY I RSNQTMGWGE 
KAI E I RS VETGHLDG VFMHKRAQRLKFLCERNDKVF FAS VRSGGS SQ 
LQNSLSLNDSVLWVSDSGDDEHAVGFIGSGHKEAFHKMKHATVNTLV 
YISREEGKAQVENSKRPGRHDRRRANL.GPCSQCWHLALVSRAARAAS 
SSIRHINKPLRVSS 

3324 

A 

197 

332 

DPVSKKXEKKRKKKKGQAQWLTPVIPTLWEAEVGGSPEVRSSRPA 

3325 

A 

1 

1788 

MPVIPATQEAEAEESLDPGRRRLRTIGKLEPSYVIRKFLDAQRIHNL 
TAYLQTLHRQS LANADHTTLLLNC YTKLKDS S KLEEF I KKKS ESEVH 
FDVETA I KVLRQAG YYSHALY LAENHAHHEW YLKI QLED I KNYQEAL 
RYIGKLPFEQAESNMKRYGKILMHHIPEQTTQLLKGLCTDYRPSLEG 
RSDREAPGCRANSEEFIPIFANNPRELKAFLEHMSEVQPDSPQGIYD 
TLLELRLQNWAHEKDPQVKEKLHAEAISLLKSGRFCDVFDKAIjVLCQ 
MHDFQDGVLYLYE<^KLFQQIMHYHMQHEQYRQVISVCERHGEQDPS 
LWEQALSYFARKEEDCKEYVAAVIiKHIENKNLMPPLLGTWEDRWDEL . 
RVRR YREETTR I RQE I QELKAS PKI FQKTKCS I CNS ALELPS VHFLC 
GHSFHQHCFESYSESDADCPTCLPENRKVMDMIRAQEQKRDLHDQFQ 
HQPPKDDKKKKDAGKSAKKDKDPVNKSGGKAKKK^ 
LVljFDKATYDKLCKEVPNYKLITPAVVSERLKIRGSIiARAALQELLS 
KGLI KLVS KHRAQVI YTRNTKGGDAPAAGEDA 

3326 

A 

3 

111 

HAYEGLIKLVSKHRAQVIYTRDTKGGDAPAAGEDA 

3327 

A 

220 

425 

HEELKSGPYYLLTFRDCFLHFWAIiVSKR/LALNFM*TSAPT*KALSK 
RNICLVNKNRNI KI PYPKKKKK 

3328 

A 

273 

501 

RSGVRDQPGQHGKITSLLKIQKLARRGGACL*SQLLRRLRQENRLNP 
GGGGCS E PRS CHCTPAWETEQDS I S KI K 

3329 

A 

1 

1176 

MAWAGSEPRRVRRLGSTASEGEKQGVKEGPLPAAAAAAAAAAAAAAG 
YPGGGRSGRGEGSGRRRECAGRAPSERASPGRAGRVPEADRLTGKLG 
EAAGRGRTGQRRS PGHRLRAAR VPTAPGKPRPG FCARLPRP YAGAGR 
GQAG AG SAEPLGLRPLLEKRGKERRGVGDPKI FQKTKCS I CNSALEL 
PSVHFLCGHSFHQHCFESYSESDADCPTCLPENRKVMDMIRAQEQKR 
DLHDQFQHQEENSPKMRVIRVGTRKSQLARIQTDSWATLKASYPGL 
QFEIIAMSTTGDKILDTAIiSKGFTQALSVRQIGEKSLFTKELEHALE 
KNEVDLVVHSLKDLPTVLPPGFTIGAICKRENPHDAWFHPKFVGKT 
LETLPEKRVGPPGTI S 

3330 

A 

245 

394 

GPGCIPAALLQPPKDDKKXKDAGKSAKKDK35PVNKSGGKAKKKVEIR 
P L j 

3331 

A 

458 

701 

GPAPTRRGPAPHGAHTR* * PAGTARAACGSA* S AGTAS PAHKGKGHH 
PGSRASGTGPGPCQRRRRSDHSSAGKWPLREASL 

3332 

A 

365 

814 

AALRSSENSSRHRSLVKMSDKKAKDPVN\KSGGQGPKRKNWSKGKSS 
GTSF^\LVLFDKATYDKLCKEVPNY\NIiITPAVGSERL\KIRGSLG 
QGKPFQELIiS\KGFIPNWFSKHRASSYFTPGITKGG\DAPSLLGEDC 
MNRSNPPVHLEK 

3333 

A 

41 

565 

APSPRRPWGHFTEED\KATIK\NLWGKGEMWKDAGGKNPWERLPWIiS 
YPMGPQRFFDQLLANLSLCPLPIMGNPPKVKGTWPRKVIjTSLGXSAH 
KSTWDDLKGHIiLPKPEVNLHC\DKPAMWDPENFKAPGEMIjLVTRFGQ 

shfrqknftpggcraswgrkmgdlelasalvpsryh 

3334 

A 

399 

593 

KCIHFPGPTPKr VCGVIVVTAVP/ TFGM*TVYV * * LPF& XLiAUN Vtj w 
RDDCDPGWKLNLFYAKKKK 

3335 

A 

28 

312 

HITNIDPSAINKSGQETWPHGVGLGTAIiLGRGPSAMLWGALGPDLSR 
LWAAVSELRAQPAHTGAHRTATSSQKLQKSGPRRGPQEQVPPADRET 
Q 

3336 

A 

26 

417 

QEQDQVFHTRDQAKTRLPLPLLGPHCPQKAAAVIMGNIFGNLLKSIiI 
GKKEMR I LMVGLDAAGKTT I LYKL.KLGEI VTT I PT I G FNVE TWAVL 
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IASESFPCPHKPFYSLFAFPLPPSPCFTWKLHGPLC 

3337 

A 

16 

3567 

EPSSQSKPGKEQGEGATVFPSPCQKRLSSSRLLHQPDGGRGGEAEGG 
RPGDSTLEERTFRIIX3LPAPEVNLRDAPTQPGSPEHQALGPAASGAQ 
GRAEGSEVAWQRRAGRGWARTPGPYAGALREAVSGIRRHTAPDSDT 
DEAEELS VHSG S SDGSDTE APG AS WRNERTLPEVGNS S PEEDGKTAE 
LSDS VGE I LD VI S QTEEVL FGDKS S LVGLPPSKLSGP I LGPGS FPSD 
DRLSRPPPPATPSSMDVNSRQLVGGSQAFYQRAPYPGSLPLQQQQQQ 
LWQQQQATAATSMRFAMSARFPSTPGPELGRQALGSPLAGISTRLPG 
PGCNCAESCCFFCSSFFSCSNRSSMSRGNSLATSELSILSSSSPDAA 
GPPGKVKAPLSSSNFSSREDPPGPLSGLGVSSFCEDVAVLWAPVCAG 
CALSSETAAHNLLRKPGNDSKMLVCETCDKGYHTFCLKPPMEELPAH 
SWKCKACRVCRACGAGSAELNPNSEWFENYSLCHRCHKAQGGQTIRS 
VAEQHTPPRMLRIiFLQSRSPGECMQAVEPLHRGRLEQAADEGFDCVS 
CQP YWKPVAP VAP P EL VPMKVKE P AF S E S LLHAKLVG YTELNQLNH 
LQQLMVQQG I PQADAWQYDVEL I AL I RANYWLKLVKG I LPLKEKMA 
KTPLSDEKQLLLFQQKLLAEEEMAKKKERLLSQFLKNNLSNPPTPPS 
SLPPTPPPSVQQKMVNGVTPSEELGEHPKDAASARDSERALRETRRF 
STAGTGFRLLFHNAHANSRAWLLPAPLDPRPILCFAYTWRAVYQPPR 
VTMETGAS AS I PEL I CEAMRR I WS S S PALDGCELLCCGRGHRTRTQR 
VTERCNCTFHWCCHVSCRNCTHTRVLHECLGI VNVAS STNLLTDSKS 
LQLVLEPSLQLLSRKQRRLIRQNPGILHSVSGGLQSAVRECKWQFRN 
RRWNCPTAPGPHSSARSSTEPQHCGTYYLPPSHSLFPAVSSLIRNKI 
HRLPVIDPESGNTLYILTHKRILKFLKLFKAAAVIMGNIFGNLLKSL 
IGKKEMRILMVGLDAAGKTTILYKLKLGEIVTTIPTIDEENKERRQA 
I LQE TCNW PHTLNS TSMS KA YQS TFTGETNTP YS KQFVHS KSSQYRR 
LRTEWK3^NVYIJ^SRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVAN 
RREKIYEEQ 

3338 

A 

6385 

6599 

HMPLIGHHSHGWSQALLEGMSWWLGGHQWLGQTWLSGTCKTQGKLLL 
GPSQTGRRALMWQRHGLGRAWCRQ 

3339 

A 

2 

758 

ERGGFFSPEPGEPDSPWTGSGGTTPSTPTTPTTEGEGDGLSYNQRSL 
QRWEKDEELGQLSTISPVLYANINFPNLKQDYPDWSSRCKQIMKLWR 
KVP AADKAP YL PG S PEHQALGP AAS GAQGRAEGS E VA WQRRAGRG W 
ARTPGP YAGALREAVS R I RRHTAPDS DTDEAEELS VHSG S SDG SDTE 
APGAS WRNERTLPEVGNS S PEEDGKTAELSDS VGE ILDVI S QTEEVL 
FGPPPIRGTQQGNRKRQ 

3340 

A 

116 

627 

RLFPVIIVAAPLLDSPWTSRPATGLAGSRTARVGRARDPHSRRPAGP 
GPPGCCRRRPTGGRGPWPRSELGSKRKGKRRRRSGRPRRPSEKGARL 
VGPDGAPRVDPARS PATPLGWPS WQGS LLPCGRSAWRLWGERPVHPR 
SRRATQGRRGRGPGEVGG PG PQATRLTCC 

3341 

A 

1830 

2259 

GDWQEKVPRGKGAGRVLQEVGPRAVAHPDWRHCRYTCAEWVALGRGS 
PVSGGSRCLVASSGYSQPTIaHQTASANQGQCLGSCLPDPFPSHQDPQ 
LPPSLAQCLPGRGSLGTAPHRSWGSSCSAAPPLPPSRHSGSCLQQQR 
L 

3342 

A 

1 

619 

FPCRPTRPPKARKPSRAPLAMETVI SSDS S PAVENEHPQETPESNNS 
VYTS FMKSHRC YDL I PTS SKLWFDTS LQ VKKAFFAL VTNGVRAAPL 
TOSKKQSFVGMLTITDFINILHRYYKSALVQIYELEEHKIETWREVY 
LQDSFKPLVCISPNASLFDAVSSLNSGTRSTSLAVIDPRIQAILCTI 
LNPQAHSGSSSKLFNTGG 

"1 O A 1 

TV 

A 

1830 

1 *i C D 

2258 

PVSGGSRCLVASSGYSQPTLHQTASANQGQCLGSCLPDPFPSHQDPQ 
LPPSLAQCLPGRGSLGTAPHRSWGSSCSAAPPLPPSRHSGSCLQQQR 
L 

3344 

A 

290 

1377 

RSFRAIGEVGSIRGWGGGYAAQLQSPSRLPVLGGGKKGGSRGVGQGS 
SRAEVGLGSQRERSGTCGAEAAAAAAQLIAGTGSSLPYQGPGQNQTA 
IAPTCAPLSPCKAAAVIMGNIFGNLLKSLIGKKEMRILMVGLDAAGK 
TTILYKIiKIiGEIVTTIPTIGFljn^TVEYKNISFTVWGCGVARDK\IR 
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PL\WRH\YFQN\TQGFDILWSDSN\DR\ERVNEA\REEL\MKNAGAE 
TSLRDAVLLVFGQTNRICPNAMNAAG\ IQDKLG\LHSLRHRNWYI\Q 
ATCATRGN\GLYEGLHWLANSVPKQEVESRTAIiTKHSTSPLTYLLSP 
CPQFLPLLSMQGLARAWGIMSTLPSRTLPPLTSLPFSCPPI 

3345 

A 

292 

574 

VTTENSGGQTQCGKQDRIQVPRHPARRGPPGSQY/PVTPPVPPGPLH 
ATQLPPGPIPLGKRKAGQERRLDPRPTTCQAAAPTRSSPFCPPQKVW 
R 

3346 

A 

3 

142 

TISSIIRKARKP\QDLLNNYTPRSSPLRSSGRTRMYKGKNVRPGNSS 

3347 

A 

1 

1418 

MAAALFVLLGFAIiLGTHGASGAGEERARKQPCGKSRFQGHSEGAIiAT 
RFSAGNTGFVQAPLSQQRWVGGSVELHCEAVGSPVPEIQWWFEGQGP 
NDTC SQL WDG ARLDR VH I HATYHQHAAS T I S I DTL VE EDTGTYECRA 
SNDPDRKHLTRAPRVKWVRAQAVVLVLEPGTVFTTVEDLGSKILLTC 
SLNDSATE\TTGHRWLKGGVVLKEDAI,PGQKTEFKVDSDDQWGEYSCV 
FLPEPMGTANIQLHGPPK\VKAVKSSEHINEGETAMLVCKSESVPPV 
TDWAWYKI TDSEDKAIjMNGSESRFFVS S SQGRSELHI ENLNMEADPG 
QYRCNGTSSKGSDQAI ITLRVRSHLAALWPFLGI VAEVLVLVTI I FI 
YEKRRKPEDVIiDALRRCTWPGLMSCPEPPLGVTEGRAGSTDAPARGL 
SGASADRLSSRPLFHGGGPSSDDDAGSAPLKSSGQHQNDKGKNVRQR 
NSS 

3348 

A 

3 

896 

S CRGRRRQRLE VVRGFRRG IG IMAAAL FVLLG \ FALLGTHGAS GAAG 
TVFTTVEDLGSKJLLT\CSLNDSATEVTGHR\W\LKGGVVI*KEDALP 
GQKT\EFKVDSD\DQWGEYSCVFLPEPMGTANIQLHGPPRV\KAVK\ 
SSEHINEGETAMLVCK\SEFVPP\VTDWAWYKITDSEDKAL\MNGSE 
\SRFFVNSSQG\RSELHIENLNMEA\DPGQE\RCNGTSSKGIiRPRPL 
QFLRVRSHLAALWPFLGIVG\EVLVLVTIIFIYBKRRKPEDV1,DDDD 
AGSAPLKSSGQHQNDKGKNVRQRNSS 

3349 

A 

870 

997 

KWASNTYV I S SNEESM IS* LSGRQ * NLRVWRLRKTWWLNWTL 

3350 

A 

2 

225 

GILKGL YYPLATNS FPATCWRL * VS PAHS KDPRNAATVHLS FFS PSG 
GPRYLLGHTTWQQGLGKRGRRTEQHSL 

3351 

A 

1 

336 

FLLS S RWGWPSMAS TAGCPQGQVCGMTGNAGVHNVGAAL WHSLGWVK 
GEQNTKGPFGDQNRPVQAIWASPFRTSWVKFPRPWRQMPAILGPWGT 
YCLSLAREGERRPAAQSG 

3352 

A 

1193 

2524 

SKPPAASFAPRRPPATSRTLSAACAFRSRCRLAACSFVRPLPHTDTN 
MNGQLNGFHEAFIEEGTFLFTSESVGEGHPDKI \ CDQIS \DAVLDAH 
LQQDPDAKVACETVAKTGMILLAGEITSRAAVDYQKVVRRAVKHIGY 
DDS $ KGFDYKTCNVLVALEQQS PD I AQGVHLDRNEED IGAGDQGLMF 
GYATDETEECMPLTIVLAHKLNAKLiAEIjRiU^GTLPWLRPDSKTQVTV 
Q YMQDRGAVL P I RVHTIV I S VQHDEEVCLDEMRDALKEKVI KAWPA 
KYLDDVTIYHLQPSGRFVIGGPQGDAGLTGRKIIVDTYGGWGAHGGG 
AF S GKD YTKVDR S AA YAARWVAKS LVKGGLCRR VL VQ VS YAI GVSHP 
LS I S I FHYGTS \ RKSERELLEI VKKNFDLRPGVI VRDLDLKKPI YQR 
TAAYGHFGRDS FPWEVPKKLKY 

3353 

A 

346 

461 j 

TLHKKTFYYLEQLILKYGMHQNTLRI KE IHGELKLKS I 

3354 

A 

3 

1632 

MTEQLEP I LMQAF I KSTAGLEARKLKAYRTME YMAKS TDRS PGH I LC 
CECGVPISPNPANICVACLRSKEDISQGIPKQVS I S FCKQCQRYFQP 
PGTWIQCALESRGTFLLWCLEKNPKPLWSKVRLVDAG\FVWTEPHF\ 
KRLKVKLTIQKEVMNGAILQQVFVVDYWQSQMCGIX3HESEKLKDFW 
K\AVIQVRQKTLHKKTFYYLEQLILKYGMHQNTLRIKEIHDGLDFYY 
S S KQHAQKMVE FLQCTVPCRYKASQRL I SQDIHSNTYNYKSTFSVE I 
V V x L JfdJN V V CLi b PKXiAQSLGNMNQ I CVC I RVTS AIHLIDPNTLQVAD 
IDGSTFWSHPFNSLCHPKQLEEFIVMECS IVQDIKRAAGAG\MISKK 
HTLG\ EVWGTEGHLEIWYRIKQYFCRT\HIiGTSCLNPGD\ RVLGFDL 
AN \ CNLNDEHVNKMNSDR VPDVVL I KKS YDRTKRQRRRNWKLKELAR 
ERENMDTDDERQ YQDFLEDLEEDEAIRKNVNI YRDS AI PVESDTDDE 
GAPRI SLAEMLEDLHFSPGCPLVKEGASMLT 

3355 

A 

35 

378 

PRSVPSRFSGGRTHEGREGAQA*GEASNPGNPRRG\QGSGGGT*GDS 
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S KGER\ KQDRNKEKKEALSKVVIRRLPPTLTKEQLQEHLQPMPEHDY 
FELFSNDTRLYAHMDARSLHQIi 

3356 

A 

11 

1379 

PRSGQAASAGYVSMKEEKEHRPKEKRVTLLTPAGATGSGGGTSGDSS 
KGEDKQDRNKEKKESLSKVPGRPLARJERAGRVRTAGRMAGNGAAPFIi 
SQVLLLSLRLNGSRIILVHPFTRNGESPSRPNPGWGLYPHMYARAYI 
NFKNQEDI I LFRDRFDG YVFLDNKAKKTTPLLS FLKNKQRMREEK/ K 
RRKEEERNRKKKTKRRREEEME\EEEKRKRKDIEKLKKIDRIPERDK 
LKDEPKIKVHRVSVTSCESEKS/RSRSQKKGDEKELDKREKAKKLDK 
ENLSDERASGQSCTLPKRSDSELKDEKPKRPEDESGRDYREREREYE 
RDQERILRERERLKRQEEERRRQKERYEKEKTFKRKEEEMKKEKDTL 
RDKGKKAESTESIGSSEKTEKKEEWKRDRIRNKDRPAMQLYQPGAR 
SRNRLCPPDDSTKSGDSAAERKQESGISHIKEGGEE 

3357 

A 

3 

1015 

I RHEE I IKVTPLQQMLASCTGAI LTS V I VTPLD WKI RLQAQNNPLP 
KGKCFVYSNGLMDHLCVCEEGGNKLWYKKPGNFQGTLD\AFFKIIRN 
EGIKSLWSGLPPTLWAVPATVIYFTCYG\QLSAIiLRSKLGENETCI 
PIVAGIVARFGAVTVISPLELIRTKMQSKKFSYVELHRFVSKKVSED 
GWI SLWRGWAPTVLRDEPF\ S AMYWYNYG ILKKWLCEKSGLYEPTF\ 
MINF\TSGALSGSFAAVATLPFDWKTQKQTQLWTYESHKISMPLHM 
STWIIMKNIVAKNGFSGLFPG\LIPRL/ILKLAPACAIMISTYEFGK 
AFF \ QKQNVRRQQ Y 

3358 

A 

231 

877 

PKPGGYSQCRTTSSRATERNRIDYVSSAVPYLTTPPDLPRPEV\VFI 
G\RSNV\GKSSLIKALFSLAPEVEVRVSKKPGHTKKMNFFKVGKHFT 
VVDMPGYGFRAPEDFVDMVETYLKERRNXjKRTFLIJVDSVVGIQKTDN 
. IAIEMCEEFALPYVIVLTKIDKSSKGHLLKQVLHIQKFVIJMKTQGCF 
PQLFPVSAVTFSGIHLLRCFIASVTGSLD 

3359 

A 

3 

234 

GSVLKKTTGLVGIAVCOTPHEEPDVKKLEDQLQGGQLEEVILQAEHE 
LNIiARKMREWKLWEPLVEEPPADQWKWPI 

3360 

A 

25 

546 

QSGDLrGGRLSRSENCSQRWRHRWGAVMAGVLKKTTG\ LVGLA\VC 
NTPH\ ERLRI LY\TKI \ LDVLEE IPK\ NAAYRKYTEQI TNEKLAMVK 
AEPDVKK\LEDQL\QGGQL\EEVIL\QA\EHELNLAKKK*GEWKL\W 
EPL\VE\EPPA\DQ\WKWANIIIKLTLWFMGKLDVIKIFCYI 

3361 

A 

2 

368 

YAGFSLSAQKCPGAMAE * S YAKSTKLVLKGTKTKSKKKKSKEKKRKR 
EEDEETQFD/IVGTIAIEMDEGTYIHALDNGLFTLGAPHKEG\KMAL 
LASNGCF I RCNEAGD I EAKS KTAGEEEM I T 1 

3362 

A 

268 

393 

DGRRKEKWHKVERRHRPYLLSSLSQHRW*TVTNFGEISGTIAIEVDE 
GTYIHALNNGLFTLGAPHK/ERIALKPGYGKYIiSINSDELW/GRSD 
AIGPREQWEPVFQNQEVRNGGPAEMGEEKRNGTKWREDTDHTSFPLF 
PSTGGQPKAHSNWRKVCH 

3363 

A 

6 

223 

TVTN FGE I SGTI A I EMDEGTY I HALDNGLFTLG APHK/ ER I ALKSG Y 
GKYLG INSDELVGHSDAIGPREQWEH 

3364 

A 

329 

877 

RPWKQDRRAGDPWPRTHEFRSSEASLQASACKKKKSKEKKRKREEDE 
ETQFDMFGIW*TVTNFDEISGTIAIEMDEGTYIHALDNGLFTLGSST 
QRRIALKSGYGKYLGINSDELW/GRSDAIGPREQWEPVFQNGACAA 
VFTVIGSEKQSECSLLRESRAKYHGCTHGQISSSLKQHPRWMY 

3365 

A 

97 

704 

AGFS LS AQKCPGAMAE * S YAKS TKLVLKGTKTKS KKKKS KEKKRKRE 
EDEETQFD/ 1 VG I W * TVTNFDEI SGTI AI EMDEGTY IHAUDDGLFTL 
GAPHKEG\KMALLASNGCFIRCNEAGDIEAKSKTAGEEEMIKIRSCA 
ERETKKKDDFLEEDKGNVKQCE INYVKKFQS FQEHKLKI SKEDS KIL 
KKAQKDGFLHETLLDR 

jjbb 

A 

24 

981 

AIQRLGEDGGGFYRDVGCQGVFISRPLLFLRAPVLPRLTYSSRTGLS 
LSAQKLPGAMAEYSYVKSTKLVLKGTKTKS KKKKS KDKKRKREEDEE 
TQLD I VG I WWTVTNFGE I S GT I AI EM \ DKGT Y I HALDNG \ L FTLGAP 
HKE\SDEGPSPPEQFTA \VKLSDSRIAIi\ KSGYGKYLGI IRDGLWG 
\RSDA\ IGPREQWEPVFK\MGKMALSTSNSCFIRI \HEAEDIEAKSK 
TAGEEEM I KIRSCAERETKKKDD I PEEDKGN^/NKCE INYVKKFQS FQ 
DHKLKI S KEDSKI LKKARKDG FLHETL \ LDRRA\ KXEAD \ RYCK 
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3367 

A 

1 

555 

PLKRSDGCNDGRPTRPPTRPDTTVFTSNLKQTRMVHLT\PEEKSAVT 
ALWGKVNVDB\VGGKALGRLL\WYPW\TQRFFE\SFGDLSTP\DAV 
\MGNPKVK\AHGKKVLG\AFSGGPGCTWDNIjKGTFAHTEVSLHCDK\ 
LHRG S LKNFRLLG \ NVLGLCCLAHS LLGKE FQPHQLQACL I KKI GWL 
GVG 

3368 

A 

1 

390 

WEE I QELNEVARHRPRS TLVMG I QQENRQ I RELQQENKELRTSLEEH 
QSALELIMSKYREQMFRLLMASKKDDPGIIMKLKEQHSKELQAHVDQ 
ITEMAAVMRK\ PLKLTSNRVARNKNEYFNLNKKTKA 

3369 

A 

769 

912 

FPTP I PLFQELQAHVDQITEMAAVMRK\ PLKLTSNRVARNKNEYFNL 
K 

3370 

A 

1 

463 

MKKNIAFLLASMFVFSIATNAYADIQMTQSPSSLSASVGDRVTITCR 
SSQSLVHGIGNTYLHWYQQKPGKAPKLLIYKVSNRFSGVPSRFSGSG 
SGTDFTLTISGLQPEDFATYYCQHYDSLPYTFGQGTKLE\IKRTVAA 
PSVFIFPAHLMSS 

3371 

A 

2 

166 

GDSPLNLSPFQYSNAFDGFAAYGGLNEKSFVDGLNFSLNNMQYSNQQ 
FQPVMAN 

3372 

A 

39 

1200 

FCGVAAMQLEIQ VALNF 1 1 S YLYNKLPRRRVN I FGEELERLLKKKY\ 
EGHWYPEKPYKGSGFR\CIHIGEKVDPVIEQASKESGLDIDDVRGNIi 
P\QDLESFRIDPFBVSYQIGEKGPVKVLYVDDN\NENGCE\LDKEIK 
NS FNPEA\ QVFMP I SDPAS S VS S S PS PPFGHSA\ AVS PTFMPRS TQP 
LTFTT\ATF\AATKF\GSTK\MKNSG/RGSSRFARTSPINLRLGMWN 
DLLEGRKAISSS\MHSLYGLGLG\SQQQPQQQQQPAQPPPPPPPPQQ 
QQQQK\ TS ALS PNA\ KEFI FPNMQGQGS S T\ NGMFP \GDS PLNLS PL 
Q YSNAFD VFA\ TYGGLNEKS FVDGLNF\ S LNNMQYSNQQFQPCYGLT 
KKKKMYRTKLKCTGPRGIFFFSPP 

3373 

A 

1 

1197 

MEYYATIKKNKIISFAAAWMQLDAITLSKLTQKRKTKYRMFLFVSEG 
R P P WAVGRE S G WR P P VP FS S VLLLRD PGTG VLKD LNNQEVDGEHEDG 
ALQANHHLLPDEFGALESVKAASELYSPLSGEVTEINE^AENPGLV 
NKSCYEDALTFYFDKYYKHRGSNPSNSLANFHFKAIiTIASKHHQIVK 
MDLRSRYLDSLKAIVFKQYNQIATVLRLLRVMEHVQTTLLLGKLPGA 
LTACCH PLHLC VHKQAKCVTI AYS I QYSNMLYR F VAQEQRLYHTAKV 
KVTVLEDNDPFAQYHYLVTVYTGHRRGAATSSKVTVT^ 
HHLADPDTPVFERGAVDAFLLSTLFPLGELRSSGCGMTTQGTAHRAS 
TCDVLKVTQLASMEQGLCI LEAK 

3374 

A 

66 

632 

RRDPRTPANMALRWRSVRALLCTLRAVPLPA\APCPPRPWQLGVGA 
VRTLRTGP\ALLSVRKFTEKQESGNNQEKSMNQLGSGNFAQ\EALGD 
WYCSLPEVGTKLNKQ\DE\FGALESVKAASELYSPL\SGEVTEIN\ 
EAL\AENPGTCKTNFCY\EDGWLIKMTLSNP\SEL\DELMSEEAYEK 
YIK\SIEE 

3375 

A 

162 

552 

VASEHSPKIGASQGLX)YEPLLVVAKVWYLTRPTGTKAGSVFSQYLPF 
LEPGILGPASLPWLRQTLTGKEIEIDIEP\TDKVERIKERVEEKEG\ 
I PP \ QQQRLHLQVAKQM \ NDEKTAS W I TKI LRWVQS FQT 

3376 

A 

77 

427 

GRDKEGEYPPAPSSGWGKSVGIMLTELEKALNSIIDVYHKYSLIKGN 
FHAVYRDDLKKLLETECPQYIRKKGADVWFKELDINTDGAVNFQEFL 
I LV I KMG\ WQPTKKAMKKATKS S 

3377 

A 

1 

143 

GKSVGIMLTELEKALNSIIDVYHKYSLIKGNFHQLCSNIYFKKFYVF 

3378 

A 

200 

899 

GKSTGPVCVHMSLCEWTLSLATRVSLSSHPSHQSHSHLLVWLFGESR 
PGQGLRLGHESSAYCPGQMQIPCHGIPQKVLFFRWGKSVGIMLTELE 
KALNS 1 1 D VYH KYS L I KGNFHAVYRDDLKKLLETECPQ Y I R VRRGLG 
WRG JjS AW S WGCPGPAVIjPATLHRWLCLG S L * DL * TLAS S S S I LT \ KK 
G AD VW FKELD I NTDG AVNFQEFL I L VI KMG \ WQPTKKAM KKATKS S 

3379 

A 

99 

498 

RESGGKRLNIYHQTVYH\KYSL\IKGNF\HAVYR\DDLKKLLET\EC 
PQYIRKKGADVWFKELDINTDGAVNFQ\EFLILVIKMG\WQPTKK\A 
MKESHKGVSLSLLGPKGWALGGHVTCRIIKVINTSKKKKKVDAA 

3380 

A 

1 

282 

PLKRSDGCNDGRPTRPPTRPDTTVFTSNLKQTRMVHL\T PEEKS AVT 
ALWGKVNVDE\VGGKALG\RLL\VVLPLGPPKGPLKSFGGICPNSLN 
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3381 

A 

24 

452 

AG 

APSPDAMG/HSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLS 

SASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKL 

HVDPENFKLLGNVLVTVIjAIHFGKEFTPEVQASWQKMVTGVASALSS 
RYH 

3382 

A 

3 

549 

FCPRGQEFGEGNKLLVPDA\MGHFTEED\KATI\TSLWGK\WVE\D 

AGGET\ LGRL\ L WYP\ WT\ QRFFD\ S FGNLSSASAI \MGNP\ KVKA 

HG \ KKVLTFLGRCQQS TLDDLKGHLLPKPEVNCTVDKPAMWDPENFK 

APGEMLLVTRFGQSHFRQKNSPPGGCKASWAERWVT\GV\ASALVPS 
RYH 

3383 

A 

347 

716 

REPCVSRSSTCCPLPWMEPRAMCPPVGAMRGSRRDPASSSLSPDHLA 
QLQPGAGFLFCSAPALSGSFPPRPSPQGHLATAGSWL/CPPVQAQPQ 
APALCGPRRG VAWVG I AWEDSGKS VGEVAS 

3384 

A 

1 

1362 

MEKSLLFPSLTTSAGNDRDSRSRSGGGESSGETGSESGPRAPGGPMT 
PPSPARGSLLSPALTCLPFPTPRRIQVQDVEKPMLGQQRQGLPAGRS 
PTVAAAILELETSGGDEEEPGRDHLRGVGTVLSEGDGGHACTPRLSG 
AKFYSEIIAIMWLLQIVRDHWVHVLVPMGFVIGCYLDRKSDERLTA 
FRNKSMLFKRDGSSGSREFQLARQDACQWAYSWVGGSLRGKRTEQHP 
QWQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGV 
TS ED YRTFLQW FNLNS LLTG PEL I SDT YLAL FLAQLQQEG YS I FWK 

GDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERVL 
EANDGSGMLDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGS 
SRNI SQDMTQTSGTNLTSEELRKRREAYFEK 

3385 

A 

284 

540 

KLFQKKNKKSSCISDNFFFLLAIhTWLLQIVRDHW\VH\VLVPMGFV 
IGCYLDRKSDERLTAFRNKSMLFKSFPHRELQPSEEVTWK 

3386 

A 

194 

353 

KVPATTADMIREHLPAPGDDVLVLLCGPPPMVQLACHPNLDKLGYSQ 
KMRFTY 

3387 

A 

57 

880 

DPHSVSSTGGPLPIWFWPLSSGPGAVAADPAPQGSGPTVDAGVPIPH 
I RARDS ARL PLC PWHAS ALELG I ATPGTVTTQAPG S Y PAWRS RCLLY 
PQSPVLLASLGVGLVTLLGLAVGSYLVRRSRRPQVTLLDPSEKYLLR 
LLDKTVSWGRKGQGEGGDQSAPGM*GTAPTWGMSKGRGSARRAW*N 
TCVLHPP\TVSHNTKRFRFALPTAHHTLGLPVGKESVEWAPFPGYPQ 
KVTPLLGSHETRCFNRFAPHTISPSPTWLCILQAVRCHWA " 1 

3388 

A 

430 

483 

NQPCLTLPTRKSECTDTVISWSLSFGFLFTRWVTGMAVLLRLSLRQA 
EYTHSQLLLPLDT*RLGNWKFRGP\SGLLSYTGKGHFNIQPNKKSP 
PE PRVAKKLGM I AGGTG I TPMLQL I RA I LKVPED PTQCFLLFATQTE 

KDIILREDLEELQARYPNRFKLWFTLDHPPKGILPISGHPTIPSSSK 
SKPCPFVNSGFIEINLASHCQLGSLSAQTQ 

3389 

A 

3 

1007 

HACAHAS AHASGRL VRWGRKRRS VMG I QTS PVLLASLGVGLVTLLGL 
AAGSYLVRRSRRPQVTLLDPNEKYLLRLLNKTTVSHNTKRFRFALPT 
AHHTLGLP VGKH I YL S TR I DGR VR \ I KP YTPVTSDEDQG YVDLVI KV 

YLKGV\QPKFSEG\GKMSQYLDSLKVGDV\VEF\RGPSGLLTYTGKG 
HFNIQPNKKSPPEP\RVAKKLGMIAGGTGITPMLQLIRAILKVPEDP 
TQCFLLFANQTEKDI I FAGGLKRNLQARYPNRFKLWFTLDHPPKDWA 

YSKGFVTADMIREHLPAPGDDVLVLLCGPPPMVQLACH\PNLGQTGV 
TSQKMAISPY 

3390 

A 

2568 

2783 

RSPGRPPHDPPNAPV/RRPASCHS/AAESQHGGWKKSKISASRKLQL 
KTLLLQI AKQE/ LEREAEERRGEKGRAL 

3391 

A 

2 

410 

S S S PEETDGPWDPEGHPGG PGTEPPNAPVLALPPAI PGLRLSMADGS 
au*\v K&rnr+u? i ^ J-KKK^bNYRAYATERRGEKGRALSTRCQPLELAG 
LGFAELQDLCRQLHARVDKVDEERYDIEAKVTKNITEIADLT 

3392 
3393 

A 
A 

3 

2023 

492 

2613 | 

SSPEETVGPWDPEGHPGGPLTDPPNAPVLALPPAIPGLSLIMADGSS 
DAVRAAG * GVAGTPRARAREPRPAPAP I RRRS SNYRAYATE PHAKXK 

SKISASRKLQLKTLLLQIAKQELEREAEERRGEKGRALSTRCQPLEL 
AGLG FAELQDLCRQLHARVDK 

TLQTPLSSSLPPAI PGLSLSMADGS SDAAREPRPAPAPIRRRS SNYR 
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AYATEPHAKKKSKISASRKLQLKTLLLQIAKQELEREAEERRGEKGR 

ALSTRCQPLELAGLGFAELQPDHTWFIDGSSTRPNRHTPAKAGYAIV 

SSTSIIEDTALPPSTTSQQAELIALTQAIiTLAKGLRVNIYTDSKYAF 
HILPHHAVI 

3394 

A 

150 

848 

VSAWRNGSSDAAREPRPAPAPIRRRSSNYRAYATEPHAKKKSKIS\A 
SRKLQLKTLLLQIAKQE\LEREAEERRGREGARSLAPRC\QP\LELA 
G\ LGFADLPGLCCRQFHARV\ DKVDEER YD I EAKVSKNITE IAN \ LT 
\ QKI FDLRSKFKRPTIA RRVR I S ADAMMQALLGARAKES LDLRAHLK 
Q VKKED PEKENRE VGDWRKN I DAL S GM EGRMKKWRE LS L PAYC P C PE 
EGH 

3395 

A 

1 

2880 

MSKIKTLLQYSPSSTNSQPWHFIVGSKAFVESVRAEHPDARHHCVAW 

VAGAPDDSQQLGFSDDGEPAGTAGKPMLAQLMGSGVGEITAVWRYY 

GGILLGTGGLVKAYGGGVNQALRQLTTQRKTPLTEYTLQCEYHQLTG 

IEALLGQCDGKIINSDYQAFVLLRVALPAAKVAEFSAKIADFSRGSL 

HAPARLASISAGCRFPTQDRSKSMMNCSYGAMVPAITKNPNERASLA 

AWRQGGATLGLLLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLF 

MWICYSGVKERYVETQPANPAQKPGLLQSFRAIAGNRPLFILCIANIi 

CTLGAFNLAIQVYYTQYVLNDPILLSYMGFFSMGCICIGVFLMPASV 

RRFGKKKVYIGGLLIWVLGDLLNYFFGGGSVSFVAFSCLAFFGSAFV 

NSLNWAIiVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQALAGFFPG 

WMLTQIG YVPNVAQADHTI EGLRQL I F I YPS ALAWTI VAMGCFYSL 

NEKMYVRIVEEIEARKRTAFLPAPLAFNVNIYAEFASGFFQIFRSHI 

GVSNTRRTGRYADDSWRVGKRGRRLRASLVIQRREQLIRRRRRLYTR 

PNLIIRKARGGSCACAGSCKCKKCKCTSCKKSECGAISRNLGLWLRL 

LDSIETEKGKRSLFYIFRVTPVSCEQKSRYALAIRANSREETMDNDK 

IDQHSDEIEVESEEKERGKKIEIDEDRLPSRAMAIHEHIRQDGEKEL 

DARIALGVENIIESCTWSRQGKDADIMIIGGIPDKLKDDKQIDLLV 

QATESWVKTPMRQTPFPGIVPDESDRAAETRSTLTSSGAMAAVIGFQ 

SPYNDQRSVIALLADSPRGYEMLNDAVNDSGKRATMFGSVAVIRESG 

INSLRVGD\nTYVGHLPWFERVWYALANHPILI^VIiAAIS 

WRLLR 1 1 SRRRLNPDNE I YE 

3396 

B 

61 

5070 

MGPTSGPSLLLLLLTHLPIJVLGSPMYSIITPNILRLESEETMVLEAH 

daqgdvpvtvtvhdfpgkklvlssektvlt^ 

efksekgrnkfvtvqatfgtqwekwlvslqsgylfiqtdktiytp 

gstvlyriftvnhkllpvgrtv1wnienpegipvkqdslssqnqlgv 

lplswdipelvnmgqwkirayyenspqqvfstefevkeyvlpsfevi 

veptekfyyiynekglevtitarflygkkvegtafvifgiqdgeqri 

slpeslkripiedgsgevvlsrkvlldgvqnlraedlvgkslyvsat 

vilhsgsd^qaersgipivtspyqihftktpkyfkpgmpfdlmvfv 

tnpdgspayrvpvavqgedtvqsltqgdgvaklsinthpsqkplsit ! 

vrtkkqelseaeqatrtmqalpystvgnsnnylhlsvlrtelrpget 

lnvnfllrmdraheakiryytyiiimnkgrllkagrqvrepgqdlvvii 

plsittdfipsfrlvayytligasgqrewadsvwvdvkdscvgslv 

vksgqsedrqpvpgqqmtlkiegdhgarvvlvavdkgvfvlnkknkl 

tqskiwdvvekadigctpgsgkdyagvfsdagltftsssgqqtaqra 

ELQCPQPAARRRRSVQLTEKRMDKVGKYPKELRKCCEDGMRENPMRF 
SCQRRTRFISLGEACKKVFLDCCNYITELRRQHARASHLiGLARSNLD 
EDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSI 
TTWEILAVSMSDKKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVE 
IRAVLYNYRQNQELKVRVELLHNPAFCSLATTKRRHQQTVTI PPKSS 
LS VP YVI VPLKTGLQEVEVKAAVYHHF I SDG VRKSLKWPEG I RMNK 
TVAVRTLDPERLGREGVQKED I PPADLSDQVPDTESETR ILLQGTPV 
AQMTEDAVDAERLKHLIVTPSGCGEQNMIGMTPTVIAVHYLDETEQW 
EKFGLEKRQGALELIKKGYTQQLAFRQPSSAFAAFVKRAPSTWLTAY 
WKVFSLAVNLIAIDSQVLCGAVKWLILEKQKPDGVFQEDAPVIHQE 
MIGGLRmmEKDMALTAFVLISLQEAKDICEEQVNSLPGSITKAGDF 
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LEANYMNLQRS YTVA I AGYALAQMGRLKG PLLNKFLTTAKDKNRWED 
PGKQLYNVEATSYALLALLQLKDFDFVPPWRWLNEQRYYGGGYGST 
QATFMVFQALAQYQKDAPDHQELNIiDVSLQLPSRSSKITHRIHWBSA 
SLLRSEETKENEGFTVTAEGKGQGTLSVVTMYHAKAKDQLTCNKFDL 
KAAGPMVLTDTSPSMSLDKAFSDRNTLIIYLDKVSHSEDDCLAFKVH 
QYFNVELIQPGAVKVYAYYNLEESCTRFYHPEKEDGKLNKLCRDELC 
RCAEENCFIQKSDDKVTLEERLDKACEPGVDYVYKNRLVKVQLSNDF 
DEYIMAIEQTIKSGSDEVQVGQQRTFISPIKCREALKLEEKKHYLMW 
GLSSDFWGEKPNLSYIIGKDTWVEHWPEEDECQDEENQKQCQDLGAF 
TESMWFGCPN* 

3397 

A 

3 

560 

EGKGQGTLSVVTMYHAKAKDQDTCNKFDLCLPGSRQKVEVFLGPCLQ 
PGTFLWSRNRRVLGFPSMNGEDMGLLFLCSEWERSSEGWLCNREGGS 
GHSIEPHCCTFLHLTHRSLAFSLLAGVSCTCASSCKCKE\CKCTSCK 
KSECGAISRNLGLWLKVGREPKAVPEVRASGEPAFLCPCRLSLPV 

3398 

A 

3 

1584 

SNQPLRREAFPETPKGLRQGIPGEPENGDRSHFFSFDVTAVMILYKE 
SQALTGNLPGPQLPRRLRAAGRGGHPSVPAPSRRGALSRFPASGRSA 
VPTIGWLCRSHYPHSIRGANGWDRQELVRASHTMSATAQTNHRAEKA 
GRELSQANNDRTGRLAHPAPPPSQARQVPRLEHLASPRCRWPAVISR 
P FR S WHC I LE I RFRKGKLGGRAS PENEG AGE I RVKVP KDRDWI*TAKL 
QGDIiLWSQTAKLEDIWLQTAKSTQDFLTPTARSYSPrTKLKGKVNT 
KS PTLLTPTARGRHYLLGAP I AS WRCHTS T I R YHTVWMSMTLTESE 
HQAGSVDMSANDYAHYSGASGDNEMCWIACPTPLFLSPRAWKPCDA 
GS KVKNWG I ENGNLVHTVLS STQHGDPEHVHAP FGMEAE YAHPLETL 
ILGTGFFIGIVLLCDHVILLWAWVTIRLLETIDVHSGYDIPLNPLNL 
IPFYAGSRHHDFHHM\NFIG\NYGFQHFTRW\DWNFLGTDSQYNAYN 
EKRKKFEKKTE 

3399 

A 

2 

873 

GRVGEMSQGRGKYDFYIGLGLAMSSSIFIGGSFILKKKGLLRLARKG 
SMRAGQGGHAYLKEWLWRAGLLSMGAGEVANFAAYAVAPATLVTPLG 
ALSVLVRAILSSYFLNERLNLHGKIGCLLSILGS\IVMVIHAPKEEE 
IETIiNEMSHKLGDPGFWFATLWIVALILIFWGPRHGQTNILVYI 
TICSVIGAFSVSCVKGLGIAIKELFAGKPVL\GIPWAWIIjLLSLIVC 
VSTQINYI^\PWDIFNTSIVTPIYYVFFTTSVLTCSAILFKGVGKD 
MPVGRCPLVIj 

3400 

A 

90 

458 

HFSRGYLEAFSEISNIRFVPPHSVTVVVVFGACFLCILGIWPWACLP 
GPGGEGSGGFGEGRGSEAGRLGSVELTP\ATLPIiQAPEAYPVFEPVP 
PVPEAAQGDTEDSEGAPPLKRICPNAPDP 

3401 

A 

3 

787 

PGS TI S WRPGLARSLS PPDGRRPRRGLGPGPS PASMAGRTVRAETRS 

rakddikkvmatiekvrrwekr.wvtvaatpfrilnvata\vvdpqeee 
r\rragggaersrg\rerrgrgaspdggg/plfilldl\ndensnqr 
fplrkgslqrgte\pspggtpk\pnrpcvtlpdppeggpctraqppt 
/rlgqeerspggitv\gstdepp\mltkeepvpelleaeapea\ypv 

FEPVPPVPEAAQGDTEDFGSAPP\LKRICPNAPDP 

3402 

A 

1 

172 

DCVRKQLKKERPIILDPADPTLNVAEGYWLGLGIYGIQDSDTLILSK 
KKGEALFPAS 

3403 

B 

1 

5501 | 

MVSKLSQLQTELLAALLESGLSKEALIQAIiGEPGPYLLAGEGPLDKG 
ESCGGGRGELAELPNGLGETRGSEDETDDDGEDFTPPIIiKELENLSP 
EEAAHQKAWETLLQEDPWRVAKMVKSYLQQHNIPQREWDTTGLNQ 
SHLSQHLNKGTPMKTQKRAALYTWYVRKQREVAQQFTHAGQGGLIEE 
PTGDELPTKKGRRNR FKWGP AS QQ I L FQAYERQKNPS KEEREAQGLG 
SNLVTEVR\TYNWFANRi^KEEAFRHKLAMDTYSGPPPGPGPGPALPAH 
S S PGLPPPALS PS KVHGVR YGQ PATSETAEVPS S SGGPLVTVS TPLH 
QVSPTGLEPSHSLLSTEAKLVSAAGGPLPPVSTLTALHSLEQTSPGL 
NQQPQNLIMASLPGVMTIGPGEPASLGPTFTNTGASTLVIGLASTQA 
QSVPVINSMGSSLTTLQPVQFSQPLHPSYQQPLMPPVQSHVTQNPFM 
ATMAQLQSPHALYSHKPEVAQYTHTGLLPQTMLITDTTNLSAIASLT 
PTKQVFTSDTEASSESGLHTPASQATTLHVPSQDPAGIQHLQPAHRL 
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SASPTVSSSSLVLYQSSDSSNGQSHLLPSNHSVIETFISTQMASSSQ 

3404 

A 

158 

755 

RGGRPAWPCGSRAMVSKLSQLQTELLAALLESGLSKEAIjIQALGEPQ 

PYI/LAGEGPLDKGESCGGGRGELAELPNGLGETRGSEDETDDDGEDF 

TPPILKELENLSPEEAAHQKAVVETLLQEDPWRVAKMVKSYLQQHNI 

PQREWDTTGLNQSHLSQH/HQQGHSHEDAEAGRPVHLVRPQAARGG 
AAVHPCRAGRAD 

3405 

B 

133 

2369 

MV S KLS QLQTELLAALLE S GL S KEAL I QALGE PG P YLLAGEGPLDKG 
ESCGGGRGELAELPNGLGETRGSEDETDDDGEDFTPPILKELENLSP 
EEAAHQKAWETLLQEDPWRV7VKMVKSYLQQHNIPQREWDTTGLNQ 
SHLS QHLNKGTPMKTQKRAAL YTWYVRKQREVAQQFTHAGQGGL I EE 
PTGDELPTKKGRRNRFKWGPASQQILFQAYERQKNPSKEERETLVEE 
CNRAEC I QRGVS PSQAQGLGSNLVTEVRVYNWFANRRKEEAFRHKLA 
MDTKVHGVRYGQPATS ETAEVPS SSGGPLVTVSTPLHQVS PTGLEPS 
HSLLSTEAKLVSAAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMA 
S LPGVMTI GPGEPASLGPTFTNTGAS TLV I VPTLDQSLC Y ISDTWVN 
QTDQNLSNSSREAGTKHNTSILWYLR^PGLHAGTECAGHQQHGQQPD 
HLQPVQFSQPLHPSYQQPLMPPVQSHVTQSPFMATMAQLQSPHX* 

3406 

A 

1 

2112 

^LKGKAHCSILDFGFLDLQCSTNSAAEFIRSWEDPWRVAKMVKSYL 
QQHNIPQREWDTTGLNQSHLSQHLNKGTPMKSQKRAALYTWYVRKQ 
REVAQPHIPYPIPENSTGTEWLIELTWIiDPTHAKNDQEHRTDSPVNY 
GKKKGKENKNLKPPPEVYVSLI KACGGPGNFCPS FSELQRNFVKHRP 
TKLKSLLRLVKHWYQQWLQPHREWKEEVLDAVRTVEEFLRQEHFQGK 
RGIiDQDVRVLKVVKVGSFGNGTVLRSTREVEIiVAFLSCFHSF\QEAA 
KHHKDVLRLIWKTIWQSQDLLD1A3LEDLRMEQRVPDALVFTIQTRGT 
AEP\ITVTIVPAYRALGPSLPNSQPPPEVYVSLIKACGGPGNFCPSF 
SELQRNFVKHRPTKLKSLLRLVKHWYQQYVKARSPRANIiPPLYALEIi 
LTIYAWEMGTEEDENFMLDEGFTTVMDLLLEYEVICIYWTKYYTLHN 
AI IEDCVRKQLKKERPI I LDPADPTLNVAEGYRWDIVAQRASQCLKQ 
DCCYDmENPISSWNVKRARDIHLTVEQRG\YPDFNLIVNPYEPIRK 
VKEKIRRTRGYSGLQRLSFQVPGSERQ\LLSSR\CSLAKYGIFSHTH 
IYLLETIPSE\IQVFVKNPDGG\SYAYAI\NP\NSFILGLKQQIED\ 

HQGLPKK\QQQLEF\QGQ\VLQGLVGVWGFYGIQD\SDTL\ILS\KK 
KGEALFPAS 

3407 

A 

108 

442 

AI>LSWEMS7VACWEEPWGLPGGFAKRVLVTGGAGF\IR*WRR*PSSGS 
PETPTFSRSSACPSASHMIVSLVEDYPNYMIINLDKLDYCASLKNLE 
TISNKQNYKFIQGDICDS 

3408 

A 

115 

1196 

ALLSWEMSAACWEEP\WGLPGGFAKRVI,VTGGAGF\ISASHMIVSLV 
EDYPNYMIINLDKLDYCASLKNLETISNKQNYKFIQGDICDSHFVKL 
\LFETEKIDIVLHFAAQTHVDLSFRTCPWSFTHVNV/YMGTHVLVSA 
AHEARVEKFIYVSTDEVYGGSLDKEFDESSPKQPTNPYASSKAAAE\ 
CFVQSYWEQYKFPWITRSSNVYGPHQYPEKVIPKFISLLQHNRKCC 
IHGSGLQTRNFLYATDVVEAFIjTVLKKGKPGEIYNIGTNFEMSVVQL 
AKEL I QL I KETNSE S EMENWVD YVNDR PTNDMRYPMKS EKIHGLRWR 
PKVPWKEGIKKTIEWYRENFHNWKNVEKALEPFPV 

3409 

A 

359 

532 

IVMCHCLELVIGEPOTEICGSVCVC^AAVC\TLOTCnmTCICFGAC^ 
/ CVCVCRRVCPVCLCTYLYLLWGLCWCYQRVNFWK 

3410 

A 

27 

1389 

GQPRSQQGAGSWGKRRQESYRAREGDGGSGSDTGLLRTAAGWLLPAR 
ASPRERATRHPVRSPQRRARRSLQRGPERPRRDPPQTAWAARMCTKM 
EQPFYHDDSYTATGYGRAPGGLSIjHDYKXLKPSIiAVNIiADPYRSIjKA 
PGARGPGP\EGGGGGSYFSCQGSDTGPSLKLASSELE\RLIVPNSKT 
A\ITTTPTPP\GQYF^PRGGGSGGGAGGAWGGVTEEQEGFADGFVKA 
LDDLHKMNHVTP PNVS LG ATGG P PHG PG / G I R S AS PE P P \ PVYTNL S 
SYSPASASSGGAGAAVGTGSSY\PTTTISYLPHAPPFAGGHPAQLGIi 
GRGASTFKEEPQTVPEARSR\DATPPVSPINMERPRAPSKLEPSGLR 
NRLAGHQS AGKRK\ LKRFARLEDKV\ KTLKA\ ENAGP VE VPPGLPPG 
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SRLAQAQTRKVIDPRVSNGLSSLLFGV\RGHAFLNVPCPL 

3411 

A 

1 

3589 

- 

IRRPLDQRGGIPLRKKPLEDLVCKLADISINYVNERKSEQHLRFLST 
LIJDSFSSSRVFKMLLGDEKQSIVQAKPLEIAKLVQKNPAVQFLYQKL 
I GWLNEDQ\ RKD FGFLVD I L YS ALft CCDNDMERKKVLDDLTKVDLKW 
NSLLKIIEKACPSSDKHALVTPWLKGDILGEKLVNLADCLCNEDLES 
RVSSESHFSERWTLLSLVLSQHVKNDYLIGDVYVERIIVRLHETLFK 
TKKLS EAE S SDS S VS F I CDVAYN YFS S AKGCLLM PS S EDLLLTLFQL 
CAQSKEKTHLPDFLICKLKNTWLSGVNLLVHQTDSSYKESTFLHLSA 
LWLKNQ VQAS S LD INS LQVLLS AVDDLLNTLLE S ED S YLMG VY I G S V 
MPNDSEWEKMRQSLPMQWIjHRPLLEGRLSLNYECFKTDFKEQDIKTL 
PSHLCTSALLSKMVLIALRKETVLENNELEKIIAELLYSLQWCEELD 
NPPIFLIGFCEILQKMNITYDNLRVLGNTSGLLQLLFNRSREHGTLW 
SLI IAKLILSRS ISSDEVKPHYKRKESFFPLTEGNLHTIQSLCPFLS 
KEEKKEFSAQC I PALLGWTKKDLCSTNGGFGHLiAI FNSCLQTKSIDD 
GELLHGILKIIISWKKEHEDIFIiFSCNLSEASPEVLGVNIEIIRFLS 
LFLKYCSSPLAESEWDFIMCSMLAWLETTSENQALYSIPLVQLFACV 
SCDIiACDLSAFFDSTTLDTIGNLPVNLISEWKEFFSQGIHSLLLPIL 
VTVTGENKDVS ETS FQNAMLKPMCETLTY I S KEQLLSHKLPARIjVAD 
QKTl^PEYLQTLLNTLAPLLLFRARPVQIAVYHMLYKLMPELPQYDQ 
DNLKSYGDEEEEPALSPPAALMSLLSIQEDLLENVLGCIPVGQIVTI 
KPLSEDFCYVLGYLLTWKLILTFFKAASSQLRALYSMYLRKTKSIjNK 
LLYHLFRLMPENPTYAETAVEVPNKDPKTFFTEELQLSIRETTMLPY 
HIPHLACSVYHMTLKDLPAMVRLWWNSSEKRVFNIVDRFTSKYVSSV 
LSFQEISSVQTSTQLFNGMTVKARATTREVMATYTIEDIVIELIIQIi 
PSNYPLGS I I VESGKRVGVAVQQWRNWMLQLSTYLTHQNGS IMEGLA 
LWKNN\VDKRFEGVEDCMICFPVIHGFNYSLPKKAC\RTCQKKF\HS 
A\ CLYKWFTSSNKSTCPLCRETFF 

3412 

A 

112 

828 

PVFKTPLPSHRRAHAAALDLIiGASSSDPHADSGTGNWAEVDPGSAQH 
LPRPSSQLPHFLLSGGGPGQCRRGQRGELLPSSGGTATPCAPRGLGW 
GLRSCALRAAAAPPTAPQTLLRRAPRPARPRAGPGLAPQPS I SRDFL 
GQAACASGTMLRWLRDFVIiPTAACQDAEQPTRYETLFQALDRNGDGV 
VD I GELQEGLRNLG I PLGQDAEE VGRRRG AA * A * GGLRALGTLRG PR 
RAAA 

3413 

A 

1 

1413 

RDARRYSGTTPHPS I SRDLLGQAVCASGTMLRWLRDFVLiPTAACQDA 
EQPTRYETLFQALDRNGDGWDIGELQEGLKNLGIPLGQDAEEKIFT 
TGDVNKDGKLDFEEFMKHLKDHEKKMKKAFKSLDKN^roGKIEASEIV 
QPLQTLGLTI SEQQAELILQSIDVDGTMTVDWNEWRDYFLFNPVTDI 
EE 1 1 RFWKHS TG I D I GD SLT I PDEFTEDEKKSGQWWRQLLAGG I AGA 
VSRTS TAPLDRLK I MMQVHGS KSDKMN I FGG FRQMVKEGG IRS L WRG 
NGTNV I KI APETAVKFWAYEQVMKTRLAVG KTGQYSG I YDCAKKI LK 
HEGLGAFYKGYVPNLLGIIPYAGIDUVVYELLKSYWLDNFAKDSVNP 
GV^r^LGCGALSSTCGQLASYPLALVRTRMQAQA\MLEGSPQLNMVG 
LFRRI I S KEG I PGLYRG I TPNFMKGIiPAVG I S YWYENMKQTLGVTQ 
K 

3414 

A 

1 

222 

AKRLLFLLLT I VFCQ I LMPQEGVPAPLP PEDAPNAASLAPTPVS PVL 
EPFNLTSEPSDYALDLSTFLQQHPAAF 

3415 

A 

2 

528 

srvdprvrhsarltmchsrxschptmtilqaptpapstipgprrgsg 
\peiftfdplpepaadp\agrpsasrghrkrsrrvlyprvvrrqlp\ 
veepnpXtucrllfllXltivfcqilmaeegvpaplppXedapXnaas 

L\APTP\VSPVLEPFNLTSEPL\DYASWNIiSTFLQQH\PAAF 

3416 

A 

1623 

2346 

TFPSGAKPPLAASHHHRHLRDASRLWRGRSGGPSVTRRPTFLPSYPP 
APAHAPASAPGPS\DSDCSPPFSRRRSPTRTDCFVCRPVCPFPFIVP 
PGRAPSRMAEVLLLLLLLSLPSAAAKIQGYPSCPGPQLSRDILSSGD 
GWWAGLSGHHRVCGG PGVQE PQTTWHNATDG I KYI HHRI> I HLTPAD 
YDDFVNAlRSARSAFCLTPMGMMQFNDILQNIiKRSKQTKELWQRVSL 
EMATFSP 
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3417 

A 

256 

658 

KGLSLSKKAGENHYFYLPHSFPEKAPPGAIGHLSPRGTIEEAJLDLFS 
ARIVAEFIVLPASHHPRHLRDASRLWRGRSGGPSVTRRPTFLPSYPP 
APAllAPASAPGPS\DSDCSPPFSRRRSPTRLARMRGLLGF 

3418 

A 

2 94 

575 

SHLLGLHEPSRHNRAS PGSLFCTHRRGIHS AS S KPS S PTS P * CKS PR 
PSPGAAPRRG*LE*AGLLGFSTGFRGFPNRARRPRRLGVAIAHQAP 

3419 

A 

1 

931 

MTLEELVACDNAAQKMQTVTAAVEELLVAAQRQDRLTVGVYESAKLM 
NVDPDSWLCLLAIDEEEEDDIALQIHFTLIQSFCCDNDINIVRVSG 
MQRLAQLLGEPAETQGTTEARDLHCLLVTNPHTDAWKSHGLVEVASY 
CEESRGNNHTTPPKQPNPRGPSGAESLETEEEEEEEKGSERPPPGRR 
SRSWRPPIRWRRGDPGQQETGPPKLRPWDGAEAGVAGHAAAFPITEG 
PDCPLGGGQLRDRF I TKPGRRATTVTGCAG ANPKLWAHMEAFQREGV 
LAAAPDRLGGHRSPRS GS VKGAPGTTQR 

3420 

A 

640 

757 

TPLPPSTTPPKQPNPRGPSGAESL.ETEEEEEEEKGSERT 

3421 

A 

865 

1314 

TPLPPSTTPPKQPNPRGPSGAESLETEEEEEEEKGSERPPPGRRSR\ 
S LRP P \ I R WRRGD P \ GQ \ QETGPP \ KLR P \ WDG AE A\ G VA\ GHAAAF 
PH/ ITEGPRLVPLGG\GVRLTPSP \ TIA LRTGAGLLHYERLWIiNLVG 
P\CLHP\RQDHTLGLGSWG 

3422 

A 

2 

310 

ARDVHPLPLPFSLtGNSSLALGMQMRPQLLLSRCRCLPATCASSCKCK 
ECKCTS/TARKKSCCSLNPCGLVPKCAPRACICKRGHRRSASIiLRLM 
TGTALLPTYK 

3423 

A 

41 

317 

LLLKFFFETGSHSFIQVGVQWCDHDSLDLPGSGDPSS/AS/RVAGTA 
GMHDHTWL I F \ VFLVRRWS LTVLLRLVFNS WAKV I LHLGLPKGWGLQ 
V 

3424 

A 

1 

3560 

MLHHPS QGFRFGTVRE S S AED YVRQS FPEMHEYMRR YNVPATPDGVE 
YLKI^PEKLDAFIMDKAIiLDYEVSIDADCKLIiTVGKPFAIEAFFFLL 
SQERGADEARFRTFVDSWPYTTAHDMDSLEFSDPSHHLDCPLGKTLN 
LYPCGFSWNSCKLQGLPLHGMKCEVHLGPVNSFQDSQGVPLPGTRNT 
CFPGVDHLTS FTGKRLS S SAKMSG I GtfKRAAGEPGTSMP PEKKAAVE 
DS GTTVET I KLGGVS S TEELD I RTLQTKNRKLAEMLDQRQA I EDELR 
EHIEKLERRQATDDASI»LIVNRYWSQFDENIRIILKRYDIi\SRAWET 
YSQNEKPL/ CVPEPEPDSDSNQERKDDRERGKWEGQEPAFSFIiATLA 
SSSSEEMESQLQERVESSRRAVSQIVTVYDKLQEKVELLSRKLNSGD 
NLI\^EAVQELNSFLAQENMRIiQELTDLLQEKHRTMSQEFSKLQSKV 
ETAESRVSVLESMIDDLQWDIDKIRKREQRLNRHLAEVLERVNSKGY 
KVYGAGSSLYGGTITINARKFEEMNAELEENKELAQNRIjCELEKIjRQ 
DFEEVTTQNEKLKVELRSAVEQVVKETPEYRCMQSQFSVIjYNESLQL 

kahldeartllhgtrgthqhqvelierdevslhkklrteviqledtl 
aqvrkeyemlriefeqtlaaneqagpinremrhlisslqnhnhqlkg 
evlrykrklreaqsdlnktrlrsgsallqsqsstedpkdepaelkpd 
s \ gtypps pqpqrhlrrmpmks s LNGMKKNENEKGGRRRGNEKE / TR 
EKEKEREREKQKLKESEKERDSAKDKEKGKHDDGRKKEAEI IKQLKI 
ELKKAQESQKEMKLLLDMYRSAPKEQRDKV\QLMAAEKKSKAELEDIi 
RQRLKDLEDKEKKENKJO^ADEDALRKIRAVEEQIEYIjQKKLAMAKQE 

eeallsemdvtgqafedmqeqnirlmqqlrekddanfklmseriksn 

QIHKLLKEEKEELADQVIjTLKTQVDAQLQVVRKLEEKEHLLQSNIGT 
GEKEIjGLRTQAIiEMNKRKAMEAAQIiADDLKAQLEIjAQKKLHDFQDEI 
VENS VTKEKDMFNFKRAQED I S RLRRKLETTKKPDNVPKCDE I LMEE 
IKDYKARLTCPCCNMRKKDAVLTKCFHVFCFECVKTRYDTRQRKCPK 
CNAAFGANDFHRIYIG 

3425 

A 

94 

203 

PLNFRPIFPVIFGISCSRIFKQVFRACVRPSHSDFL 

3426 

A 

3 

343 

RSAAGIRHEVSSAPGLGFGGAGMRSLLSVGLPPIPIGRGCHFTLITA 
FL YCTRVGRKGTCVAPPDLPVS AS TFGTS CLRQEAAAEGRP WRCGW 
WW P ARL I S S KEQGFP AL I L 

3427 

A 

2 

256 

FVGKSGEGIVSIIKSWAVGGTNAPTTEAGRKLSGGARAQRRPELPK 
QRGFGDGGFVRRNTKSRAWWGIFDILRSRTTGAHETD 

3428 

A 

245 

691 

CGGGRGVAGASVCRGGRVPVRVSSAPGLGFGGAGMRSLLSVGLPPIP 
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IGRECHFTLITVFLYCTRVGRKGTCVAPPDLPVSASTFGTSCLRQEA 
AAEGRPWRFDKIQSCQVKWT\VRF\REMEGSWDLIDDALAEKTSCIi 
MRMSAIPLN 

3429 

A 

781 

949 

LLCLGLAYRRHFEMSLWLGMVAHVCNPSAWGGRGGQIT/R/GQEFET 
SLVNMVKTCLY 

3430 

A 

3 

1029 

• 

RAFRFDDTMAGGGCRAGPGATCSDGDAGSDQRQNEEIEAMAAIYGEE 
WCVIDDCAKI FC I R I SDD I DDPTWTLCLHVMLPNE YPCTAPPI YQLN 
APWLKGQERADLSNSLEEIYIQNIAESILYLWVEKIRDVIiIQKSQMT 
EPGPDVKKKTEEEDVECEDDLILACQPESSLKALDF\EVSEARTEVE 
VEELPPIDHGIPITDRRSTFQAHLAPWCPKQVKMGLSKLYENKKIA 
SATHNIYAYRIYCEDKQTFLQDCEDDGETVAGGRLLHIiMEILSVKDV 
MGVRARWYVGMGLGPDRFKHINNCV\RNILVEKNYTSSIiEESSKALG 
KNKKVRKDKKRNEH 

3431 

C 

153 

224 

MVAHACNPSTLGDRGGRFTRSGD* 

3432 

c 

20 

58 

MTQLTKRKETIL* 

3433 

A 

837 

1125 

LDVTTS KNQFVLRTWLKI PGNC YRRHFFFFETES CS VAQAGVQWHNL 
S SLQAPPPG\ SRHS PTS AS \ QAAGTYRYPAHHAWPNFVFVFLVETGF 
HPC 

3434 

A 

3 

474 

DGS IMASMQKRLQKELLALQNDPPPGMTLNEKSVQNS IT\QWI VYME 
GAPCTLYEGEKFQLLFKCRRRYPFDSSQGMFTGENITDHPHVYSNGH 
MCLS ILTKNWFPALSVQS VCL\ S 1 1 SMLS SCKE \ KTTDHPDNS FYVR 
T\ CNKMPKKTKWWYHDDTC 

3435 

A 

1 

993 

MAATPPHRAGPNPCRHSRHLSEGREETRTHPHSPHTHGGGVEGPAPP 
ASRLQRDKVTHPGARQRAGCALTRARRPRASDAGSSSSSRRRRRRHL 
RPSTSADGGRERVAALKRVLYLVWAFIPESWIiNSLGLTYWPQKYWAV 
ALPVYLLIAIVIGYVLLFGINMMSTSPLDSIHTITGHERNTNVLQTM 
GSSVFEGHDQFSVTVTGDPMARAGRSQLEPHSDSDATLQLHCSQPTC 
VIKPVYTKPSILEAAVSQPASAAAAAPRGPSLPVILWFLRQPLLNVIi 
SECTMGGHILGKVQSLTPHHHSVKDLRTWQELKCSVYLIADGARFSS 
AA 

3436 

A 

157 

501 

SPRKNGGKFTVAIARKSDLWLCSFLKLPIWLQKSEGISCLLSVPCAS 
TSLGGFSLYKYLYSTRRDGKVDKAWPAIiKEVVVLWRRQTVLYLVWG 
\ FIPESWA* TL*GLTYWPPKY 

3437 

A 

48 

544 

GGENGSMVSRSTSLTLIVFLFHRLSKAPGKMVENSPSPLPERAIYGF 
VLFLKLPl\GFIIiYLVWALIPESWLNSDGLTYWPQKY\WAVALPVYL 
LIAWIGYVL\FFGINMMSTSP\PTSIHTITDNYAKNQOQKKYQEEA 
I PALRD I S ISEVNQMFFLAAKELYTKN 

3438 

A 

56 

291 

SAFGLLLAWFRFLAAKQAKNVALDCGGVSRLVIFVSWRNPQVAPTSA 
HQNRPS RNPVSR PPNTQRVARRKHYALADG Y 

3439 

A 

466 

993 

VLLACFWPGSDFWPRSRRKTYGTAPQSCYSFYVALDCGGVSRLVIFV 
SWRNPQVAPTSAHQNRPSRNPVSRPPNTQRVARRKHYALAI>GYTERR 
WTNAPCRAES S FPPNCPS AAP I PDS YDK* PSRATLFTSHLTVEAFPV 
WSYLYPGETPRLHPRAHTRTVPAAIRYPDRQTPNGLRGANTMHSQMD 
TLNGDGRTHLiAGQRARSPRTARPQRQYRIHTTSDLPGLVRKYASGCC 
LRIWYCCAWIERALSLQVGGGVFGDPLSEGQAECDHGST 

3440 

A 

1 

3114 

MI.QQDSNDDTKDVSLFDAEEETTNRPRKVKIRHPVASFFHLFFRVSA 
IIVCLLCELLSSSFITCMSKKWLAWIVGWliQGANLYGYLRCKGQE 
WETFAEPSLQATQMKLKRARLADDLNEKIAQRPGPMEL.VEKNILPV 
DSS VKEAI IGKTLKI YYLGAPAEAATKEDERTTSGPGHHATNYHFLL 
KFDFYLSWLHFVHKDAILSGHPLVRLLSTRVLRGPNDVFHGVSSVDS 
VLAIFVLAEPMGSLS^SLENQLEAFLDGTLPSANEIPPLQSSSEDRE 
PFSLIEDLQNDLLSHSGMLDHSHSPMETSETQFAAGFRliLRSQNRKP 
GQKQPKGTLEQTSEYPGNRGIPIIRQSIiVFRVSVTGTTVNFCDASTN 
HNRVFKVLQALPLDSLHNVLVFLSMPVVAFSVINIVLTLSSFLiWLNR 
PLACLF I LARLHQ YF I MTYG IV I DRS M I AN 1 1 DTTPAE S YALMTP PM 
SSTLGFSGVLAALIACWIQIKPATSRLRSVLFRGANIIiVSVLLILLV 
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AALFYKDYASLFRNNKELVKSLSPSNS IVASWSWYSHQRLANLPLVR 
IGEDAHRNPIJ^QNEQRKNLTIIiMVGATSRAENFSIjNGYPRETNPRLA 
KDNWYFPNTASCGTATAVSVPCMFSDMPREHYKEELAQHQEGVARI 
SFSERASTVLWNDNDGGCKGACDRVGCLNIjVYIGEACNDHLAKKPPK 
GQREKHLPGATTEYAGVSYQTVSRWNQASHVSAKTREKVEAAMAEL 
NYIPNRVAQQLAGKQSLLIGVATSSLALHAPSQIVAAIKSRADQLGA 
SWVSMVERSGVEACKAAVHNLLAQRVSGLIINYPLDDQDAIAVEAA 
CTNVPALFLDVSDQTP INS 1 I FSHEDGTRLGVEHLVALGHQQI ALLA 
GPLS S VS ARLRLAGWHKYIjTRNQ I QP I AEREGDWS AMSGFQQTMQML 
NEGIVPTAMLVAHFRLRVCAWTSPSLSPSLAPVPSVAAAVLLVSLS 
PVKTKPWEHKYAGGQCLVHIQALMNEDTEKHQVHYSTPKCGRHAQW 
SDAY 

3441 

A 

3 

401 

TSSLSLSGKSGRYIVFLRRSVGIQSPSAVATVRLLLAGSDRRFAAGS 
AGCAVIjSRAERS * EPGCCYRIRRSAFRLRAGS IGNRDKPVAWPDTGS 
GDNTEVYFRAHQAG I LPDTVLAG PAQRVAVSGGTPVDW 

3442 

A 

2874 

3062 

GNRAGALPGATLLILAGFLPSAHQNRPSRNPVSRPPNTQRV7VRRKHY 
ALADGYTERRWTNAP 

3443 

A 

2766 

3323 

MPDLVEGREKSDKGAQPQPRTQSRSARTNGSGVRILHSLWRIIGIPL 
LLGYSLVCSRVLLACFWPGSDFWPRSRRKTYGTAPQSCYSFYVALDC 
GGVSRLVIFVSWRNPQVAPTSAHQNRPSRNPVSRPPNTQRVARRKHY 
AIiADGYTERRWTDAPCRAESSFPPNCPIi*RQYRIHIVDASVCGP 

3444 

A 

968 

1441 

PESFSCIAAERFLARRRKKKSEVTSRPKQARAWRPVLKLiASDGKARIi 
AVCCRRWSRDLALSPGRPPAPVPGIRSPRMSESFFLGLFGHSWPVAT 
ERTFFPARVSPPAGDSGLLSRDAGTPVACEASFIFIAPSSPCRIIPG 
SGRNEGLLLLASTGTS 

3445 

A 

3 

640 

LSWRSRIHMGVGVGVGCPVRYRNQDDHELQITHGNKILCGIVCDKG 
AQPQPRTQSRSARTNGSGVRILHSLWRIIGIPLLLGYSLVCSRVLLA 
CFWPGSDFWPRSRRKTYGTVPQSCYSFYVAJjDCGGVSRLiVlFVSWRN 
PQVAPTSAHQNRPRRNPVSRPPNTQRVARRKHYAP\ADGYTERRWTI 
APCRAESSFPPNCPSAAPIPDSYDK 

3446 

A 

1 

3140 

MVYKMRYRSQHPYSIKEKQMKSEVLSVKEKIGYGMGDAASHIIFDNV 
MliYMMFFYTDIFGIPAGFVGTMFLVARALDAISDPCMGLLADRTRSR 
WGKFRPWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLIjTLL 
YTVVNIPYCAIiGGVITNDPTQRISLQSV^FVLATAGGMLSTVLMMPI* 
VNLIGGDNKPLGFQGDLSPMFSTPEEIARPGPYENDVHWGASSLAA 
GHKTL I PELVRS AEQHMGTRRYLRGRRVAVI PPTGLRLPARARQWRR 
FMVQHTRPGHDAGGKASPRHQALSTQLLDAIMPYCGNTLRLGVTGTP 
GAGKSTFLEAFGMLLIREGLKVAVIAVDPSSPVTGGSILGDKTRMND 
IJUlAEAAFIRPVPSSGHIiGGASQRARELMLLCEAAGYDVVIVETVGV 
GQSETEVARMVDC F I S LQ I AGGGDDLQG I KKGLMEVADL I VINKDDG 
DNHTNVAIARHMSRVPCIFCDDRLCIVFLETLRSTCFLEPLWPICCP 
RTAMATLGPRNRYGFDPLAVSSGTRSCSLGVPRAHRFALAGSRDPGY 
THKRVIiAIFQAFNAFPEPPANTPEDPAGLSLLPSGHTSAREPGLCLP 
INSLCPIVQSLWPSSLRSTQPLTPLPSCPSQAHHPGSVAPCGAAWYP 
AGHTHRKLPGVFSQRPRSHGGGTRWHSSTSVGTDQAAGTGADSPGRS 
DARRQGSAGKRAAGPSLGQSQDGD PARTS RQQGHSAAAGNRGRAGFG 
QRHRAERQQLWGPSSREGGVGAREEQEQFGDRGSGVAGLESGSRSGD 
WLPQFAGQWPRLREAVSQRRAHAAHGRTAAACGYCTHFGGLSEYLCY 
WGIRWFVIiECFWPAFGLVPIFGRRSRRKTYGTVPQSC*IiFLRRT*\R 
GGVSRLVIFVSWRNPQVAPTSAHQNRPSRNPVSRPPNTQRVARRKHY 
ALADGYTERRWTNAPCRAESSFPPNCPSGAPIPDSYDK*PSRSGVLI 
KACLRARAPTRYPVYRT j 

3447 

A 

750 

1340 

ILHAPAPPFSASASHEQPEWSDKGAQPQPRTQSRSARTNGSGVRILH 
SLWRIIGIPLLLGYSLVCSRVLLACFWPGSDFWPRSRRKTYGTAPQS 
CYSFYVALDCGGVSRLVIFVSWRNPQVAPTSAHQNRPSRNPVSRPPN 
TQRVARRKHYALADGYTERRWTNAPCRAESSFPPNCPFAAPIPDSYD 
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K*PFRIVS 

3448 

A 

93 

699 

RASVQQKKLSTDDKGAQPQPRTQSRSARTNGSGVRILHSLWRIIGIP 
LLLGYSLVCSRVLLACFWPGSDFWPRSRRKTYGTAPQSCYSFyVALD 
CGGVS R \ RS YL YPG ETPRVAPT S AHQNRP S RKPG \ TRP PNTQRVARR 
KHYALADGYTER\SATNAPCRAESSFPPELPVPQVPNTGFIRQVTFR 
DCFVNTPQAAASGFW 

3449 

A 

1000 

1530 

QRYSDNPPNDKGAQPQPRTQSRSARTNGSGVRILHSLWRIIGIPLLI* 
GYSLVCSRVLLACFWPGSDFWPRSRRKTYGTAPQSCYSFYVALDCGG 
VSRLVIFVSWRNPQVAPTSAHQNRPSRNPVSRPPNTQRVARRKHYAL. 
ADG YTERRWTNA PCRA/ VE PF PPNC P WEAQ YR 1 1 RK 

3450 

A 

106 

375 

LGPGKSLLCGYLVAMTDVETTYADFIASGRTGRRNAIHDILVSSASG 
NSNELALKLAGLDINKTEGEEDAQRSSTEQSGEAQGEAAKSES 

3451 

A 

232 

376 

KKGVLSCVRRHSMET*NEKPALLKMSGI KI PS PVHHFVFHKQSEKGA 
V 

3452 

A 

481 

781 

IKDKKVSNQSLLCGYLVAMTDVETTYADFIASG\RTGRRNAIHDILV 
S SASGNSNELALKLAGLD INKT \ EGEEDAQRS \ STEQSGEAQGEAAK 
SESLTPHF 

3453 

A 

15 

369 

RVGEYDGYEAGPAPCPPRASLCQ/PGPDQGPPSCTQAIPAPRKPHPC 
PWPQGPKKLGELPGLCLQGSPTAGHPPCPPQETGAQGGAPASPWFSQ 
NNAACGLICFCPPQPRIFLCINSNS 

3454 

A 

201 

711 

KKMARTKQTARKSTGGKAPRQTAG / LATKAARKS AP STGGVKKPHRY 
RPG \ TVALREIRR \ YQKSTE\ LLI \ RKLPFQR\LVRE I AAGFS KTAL 
R\FQSARIRCACRRLAKRYLVGLFED\NNLC\AIHAKRVTIHAPKTI 
Q\LARRDTGEKRALSEGSFYGVLLVKFLLKYFGLIC 

3455 

A 

758 

1188 

GKLSGRGTGSLLRGDALPRLQKTGCPKAGGNRPPAVSQSRGGAVRDB 
GLLAAQTAEGGPRVEEARPPQLPQARGSIiPETPPPPAPDA\TPPPPS 
GSHISPSSAEGPSRPPPVGAVDSGGAPQPQDPG\PAPAPALLRHRHQ 
TPRP 

3456 

A 

2 

297 

ACVPKTCPSSYFHCDNGNCIHRAWLCDRDNDCGDMSDEKDCPTQPFR 
CPSWQWQCLGHNICVNLSWCDGIFDCPNGTDESPIiCSKFPDHSLLA 
INSF 

3457 

A 

1 

1128 

MP I ENVSGTE I G WRAQRGT I PLQCGGRDDS VNEGEWPADTWTRGIER 
MPRLCCHLAEPLPVAGYSGYSFCQDIiTAVLTRFPVPNLCKQICSHLC 
LLR PGG YS CAC PQG S S F I EG STTECDAAI ELP INLPP PCRCMHGGNC 
YFpETDLPKCKCPSGYTGKYCEMAFSKGISPGTTAVAVLLTILLIW 
IGALAIAGFFHYRRTGSLLPALPKLPSLSSLVKPSENGNGVTFRSGA 
DliNl^IGVSGFGPETAIDRSMAMSEDFVMEMGKQPIIFENPMYSARD 
SAVKVVQPIQGRKHSEWSMLQLPYLQEQDQPKDIRPENNQNPVLLHS 
GGGIAAMPGCWPRTFHFEWCQSKRGLSPFRGLGNTVPEAPQLLYKYIi 

3458 

B 

58 

1282 

MRTLLKNTVIVLIiPYKGSGKLYGESSTELNVEVLNCTASQFKCASGD 
KCIGVTNRCDGVFDCSDNSDEAGCPTRPPGMCHSDEFQCQEDGICIP 
NFWECDGHPDCLYGSDEHNACVPKTCPSSYFHCDNGNCIHRAW2L»CDR 
DNDCGDMSDEKDCPTQPFRCPSWQWQCLGHNICVNLSVVCDGIFDCP 
NGQMSPHFALVAAAVDWLSQDSKWLQDSHIVPDVKLCVVASAYRQSY 
QSLGNDISLGYRCQPKFSRSIDPTGKAVQTADIRIiSARATLWLGGSI 
EESPVLCSTLRLLLRRLPPPLTWTSPNRPTQPCTAQTQTNQSVGI71A 
PSAIRVIYPESWLNAVIYLPGDPEVSGLPRAFKRRFSVEVRLDCGT 
FKLLLVYCTHPGDIKVNTCKTGALVAFRCFIjPX* 

3459 

A 

1 

1870 

MGKVPMRSEKPAHEVRASTISTTPDLSS IES FFSRGPESTNHTS IKG 
RPLTAELKNKTNRAEEKIKKEELAISGVDEDDGGKGIKDTGDLVEMV 
DLAFRCDRHNDCGDYSPERGCLYQTCQQNQFTCQNGRC I SKTFVCDE 
DNDCGDGSDELMHLCHTPEPTCPPHEFKCDNGRCIEMMKLiCNHLDDC 
LDNSDEKGCGEFIFRASSPGAILILIVAVSTIHIAITDPRLGNALPV 
RRIVQPQYSWTPLTIiKLWASRVSNCQGLLSAAKTHHTVQFSILMALP 
KGHLFSARLTGKCVAQVIVEGFFAGRDKFVCVLNVAFVPYQFVLSLV 
RTGTASKSTVGFYRNNISVITGFLIGFLCQSSALDLETFTWSHFYF 
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GINECHDPSISGCDHNCTDTLTSFYCSCRPGYKLMSDKRTCVDIDEC 
TEMPFVCSQKCENVIGSYICKCAPGYLREPDGKTCRQNSNIEPYLIF 
SNR YYLRNLT IDG YF YS L I LEGLDNWAUl) FDRVEKRLiYW I DTQRQV 
IERMFLNKTNKETIINHRLPAAESLAVDWVSSALKGGFVHP\LNGGS 
QHFTLLVD*VIKWLYLISNSVPC*KRRWGLPVFWFGGKREMSWEMMG 
H * LVEGLCGCCH 

3460 

B 

1 

2973 

MEKQSINQQLPYPDQLPTQCSPLTGLNEYPLSGVSIPDESWDRKSKH 
SAAMTLRQVHQRHQNN1.FLDHNTNVEEYSAQMRIGTHCCCTSSESLL 
LLVASQNKI I ADS VTSQVHNI YSLVENGS YI VAVDFDS I SGRIFWSD 
ATQGKTWS AFQNGTDRR WFDS S 1 1 LTETIAI DWVGRNL YWTDYALE 
TIEVSKIDGSHRTVLISKNLTNPRGLALDPRMKIjLYFMDSYLDYMDF 
CDYNGHHRRQVIASDLGFEVGPKLLALPPQLVLTESSTCRPGDWPAS 
PLEPPPTSMHTACDPENHLTSATAITYAMSAVQGRNAliHVDVDVSSG 
FIYWCDFSSSVASDNAIRRIKPDGSSLMNIVTHGIGENGVRGIAVDW 
VAETKIERSFLDCTNRTVLVSEGIVTPRGIiAVDRSDGYVYWVDDSLD 
IIARIRINGENSEVIRYGSRYPTPYGITVFENSIIWVDRNLKKIFQA 
S KEPENTEPPTVI RDN INWLRDVT I FDKQVQPRS PAEVNNNPCLENN 
GGCSHLCFALPGLHTPKCDCAFGTLQSDGKNCAISTENFLIFALSNS 
LRSLHLDPENHSPPFQTINVERTVMSLDYDSVSDRIYFTQNliASGVG 
QISYATLSSGIHTPTVIASELQMKLIiNLHSAGPRQRIERSTLTGVDR 
E V I VNAAVHAFGLTL YGQYI YWTDL YTQR I YRANKYDG S GQ I AMTTN 
LLSQPRGINTVVKNQKQQCNNPCEQFNGGCSISVHQKWKNFLSVRFS 
KHGWFISVSKDLMKITOQPFLITVRQHIIFGISLNPEVKSNDAMVPIA 
GIQNGLDVEFDDAEQYIYWVENPGEIHRVKTDGT1TOTVFASISMVGP 
SIWLALDWISRNLYSTNPRTQSIEVLTLHGDIRYRKTLIANDGTALG 
VGFP IGI TVDPARG IC I S TVTTNS CKAETNFETTEAVMEGEGTQHVG 
GNGNAGCVGVGGGDAASQTKSTNTYIVILDSGYCLNNPLGLLSRKKY 
LLS* 

3461 

A 

257 

3561 

RHKKPFKKSMNPGAG/MLYWSDQGTDSGVPAKIASANMDGTSVKTLF 
TGNLEHLECVTLDIEEQKLYWAVTGRGVIERGNVDGTDRMILVHQLS 
HPWG I AVHDS FLYYTDEQ YEV I ERVDKATGANKI VLRDNVPNLRGLQ 
VYHRRITEP/ CPN*PKTIKIGHYIKDKPTCSSE/AKWYLPHPGQM*I 
A/ DKPEAQRGSRRLTDLEGGNNBIDWS I SCDCI YPSSQVFDLSAKIE 
GKVDKRTCCPS S FPKGYGP I AKNS QVYESS SANTDSVAWGGVCFAJL 
WSSRVAGLRRVQREETQLEEPFYTNPCKGTNQDSRER/NWP*RRNKY 
S YP* KWLSAPCSAKVHSAAARE*GSCHTLS *WSYCQATMFECKNHVC 
IPPYWKCDGDDDCGDGSDEELHLCLDVPCNSPNRFRCDNNRCIYSHB 
VCNGVDDCGDGTDETEEH\HCRNTKGSYECVCADGFTSMSDRPGKRC 
AAEGSSPLLLLPDNVRIRKYNLSSERFSEYLQDEEYIQAVDYDWDPK 
DIGI^SNNNKRSQl/CLSSF*VFC*GSLLVTPNIiYNREEQKSECCSC 
PCLRPWP IHSNNRS / SMLRS / CEDKASTCKRELS IGDDDS YI*RP IDG 
QAIGPSKSSVYYRQECDGQNDCGDNSDEENCAPRECTESEFRCVNQQ 
CIPSRWICDHYNDCGDNSDERDCEMRTCHPEYFQCTSGHCVHSELKC 
DGSADCLDASDEADCRNPYTFTWYRLSALGPGVCPAGFSDSTFPYTK 
NASRNSKLKVRFSSTHLQIMLQGIKITHYKVTKW/ ISLQTKI *TNYQ 
L* KSNLCT/ DWRTSKRKVLLRHHLHHLRSLLSLSLLREETQLQP/NS 
ATEDTFKDTT^NLVKEDSEGHERAHACHFLEFSVDFFSPDNCCPWCFA 
KEVIiY/ASLCRCIPIWWKCDGQKDCSDGSDEIiALCPQRFCRLGQFQC 
SDGNCTSPQTLCNAHQNCPDGSDEDRLLC\CANKRCIPESWQCDTFN 
DCEDNSDEDS SHCASRTCRPGQFRCANGRCI PQAWKCDVDNDCGDHS 
DEPIEECMS S AHLCDNFTEFS CKTNYRC I PKWAVCNGVDDCRDNSDE 
QGCGAKGSGSADTTWTLLAVEAAVQKALEGLRFAGVFC 

3462 

A 

1 

10725 

MAGAPPPASLPPCSLISDCCASNQRDSVGVGPSEPGVGYSLWRRFL 
SRSEKRNIRVGVTRFSRELPDPLRFPEHLFFTDWRLGAIIRVRKADG 
GEMTVIRSGIAYILHLKSYDVNIQTGSNACNQPTHPNGDCSHFCFPV 
PNFQRVCGCP YGMRLASNHLTCEGDPTNEPPTEQCGLFS FPCKNGRC 
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VPNYYLCDGVDDCHDNSDEQLCGTLNNTCSSSAFTCGHGBCIPAHWR 

CDKRNDCVDGSDEHNCPNPTQPASCLDTQYTCDNHQC I SKNWVCDTD 

NDCGDGSDEKNCILNCTASQFKCASGDKCIGVTNRCDGVFDCSDNSD 

EAGCPTRPPGMCHSDEFQCQEDGICIPNFWECDGHPDCLYGSDEHNA 

CVPKTCPSSYFHCDNGNCIHRAWLCDRDNDCGDMSDEKDCPTQPFRC 

PSWQWQCLGHNICVNLSWCDGIFDCPNGTDESPLCNGNSCSDFNGG 

CTHECVQEPFGAKCLCPLGFLLANDSKTCEDIDECDILGSCSQHCYN 

MRGSFRCSCDTGYMLESDGRTCKVTASESLLLLVASQNKII7UDSVTS 

QVHN I Y S LVENG S Y I VAVDFD S I SGR I FW SDATQGKTWS AFQNGTDR 

RWFDSSIILTETIAIDWVGRNLYWTDYALETIEVSKIDGSHRTVLI 

SKNLTNPRGUUiDPRmEHLLFWSDWGHHPRIERAS^GSMRTVIVQ 

DKIFWPCGLTIDYPNRLLYFMDSYLDYMDFCDYNGHHRRQVIASDLI 

IRHPYALTLFEDSVYWTDRATRRVMRANKWHGGNQSVVMYNIQWPLG 

I VAVH P S KQPNYDQP FL I TVRQH 1 1 FG I S LNPEVKSNDAMVP I AG I Q 

NGLDVEFDDAEQYIYWVENPGEIHRVKTDGTNRTVFASISMVGPSMN 

LALDW I SRNL YS TNPRTQS I EVLTLHGD IRYRKTL I ANDGTALGVGF 

PIGITVDPARGKLYWSDQGTDSGVPAKIASANMDGTSVKTLFTGNLE 

HLECVTLD I EEQKIiYWAVTGRGVI ERGNVDGTDRM I LVHQLSHPWGI 

AVHDS FLYYTDEQYEVI ERVDKATGANKI VLRDNVPNIiRGLQVYHRR 

NAAESSNGCSNNMNACQQlCLPVPGGIiFSCACATGFKLNPDNRSCSP 

YNSFIWSMLSAIRGFSLELSDHSETMVPVAGQGRNALHVDVDVSSG 

F I YWCDFS S S VAS DNA I RR I KPDG S S LMN I VTHG I GENGVRG I AVDW 

VAGNLYFTNAFVSETLIEVLRINTTYRRVLLKVTVDMPRHIVVDPKN 

R YLFWAD YGQRPKI ERS FLDCTNRTVLVS EG I VTPRGLAVDRSDG YV 

YWVDDSLDIIARIRINGENSEVIRYGSRYPTPYGITVFENSIIWVDR 

NLKKIFQASKEPENTEPPTVIRDNINWLRDVTIFDKQVQPRSPAEVN 

NNPCLENNGGCSHLCFALPGLHTPKCDCAFGTLQSDGKNCAISTENF 

LIFAIiSNSLRSLHLDPENHSPPFQTINVERTVMSLDYDSVSDRIYFT 

QNL7VSGVGQISYATLSSGIHTPTVIASGIGTADGIAFDWITRRIYYS 

D YLNQM INS MAEDG SNRTV I ARVP KPRA I VLD PCQG YLYWADWDTHA 

KIERATLGGNFRVPIVNSSLVMPSGLTLDYEEDLLYWVDASLQRIER 

S TLTG VDRE V I VNAAVHAFGLTL YGQ Y I YWTDL YTQR I YRANKYDGS 

GQIAMTTNLLSQPRGINTWKNQKQQCNNPCEQFNGGCSHICAPGPN 

GAECQCPHEGNWYLANNRKHCIVDNGERCGASSFTCSNGRCISEEWK 

CDNDNDCGDGSDEMESVCALHTCSPTAFTCANGRCVQYSYRCDYYND 

CGDGSDEAGCLFRDCNATTEFMCNNRRCIPREFICNGVDNCHDNNTS 

DEKNCPTHTCSSSEFQCASGRCIPQHWYCDQETDCFDASDEPASCGH 

SERTCLADEFKCDGGRCIPSEWICDGDNDCGDMSDEDKRHQCQNQNC 

SDSEFLCVNDRPPDRRCIPQSWVCDGDVDCTDGYDENQNCTRRTCSE 

NEFTCGYGLCIPKIFRCDRHNDCGDYSDERGCLYQTCQQNQFTCQNG 

RCISKTFVCDEDNDCGDGSDELMHLCHTPEPTCPPHEFKCDNGRCIE 

MMKLCNHLDDCLDNSDE KGCG INECHDPS I SGCDHNCTDTLTS FYCS 

CRPGYKLMSDKRTCVDIDECTEMPFVCSQKCENVIGSYICKCTVPGYIi 

REPDGKTCRQNSNIEPYLIFSNRYYLRNLTIDGYFYSLILEGLDNW 

ALDFDRVEKRLYWIDTQRQVIERMFLNKTNKETI INHRLPAAESLAV 

DWVSRYLYWADWGHRAY I GRVGMDGTNKS VI I STKLEWPNGITIDYT 

NDLLYWADAHLGYIEYSDLEGHHRHTVYDGALPHPFAITIFEDTIYW 

TDWNTRTVEKGNKYDGSNRQTLVNTTHRPFDIHVYHPYRQPIVSNPC 

GTNNGGCSHLCLIKPGGKGFTCECPDDFRTLQLSGSTYCMPMCSSTQ 

FLCANN/EKCLSSPWTDIDLQGSKGWEFSRPRESSCMSFLGIQC*LL 

LSR*LLPLVLC*RSAVCSLCRCIPIWWKCDGQKDCSDGSDELAIiCPQ 

RFCRLGQFQCSDGNCTSPQTLCNAHQNCPDGSDEDRLLC\CANKRCI 

PESWQCDTFNDCEDNSDEDSSHCASRTCRPGQFRCANGRCIPQAWKC 

DVDNDCGDHSDEPIEECMSSAHLCDNFTEFSCKTNYRCIPKWAVCNG 

VDDCRDNSDEQGCEERTCHPVGDFRCKNHHCIPLRWQCDGQNDCGDN 

SDEENCAPRECTESEFRCVNQQCIPSRWICDHYNDCGDNSDERDCEM 

RTCHPEYFQCTSGHCVH5ELKCDGSADCLDASDEADCPTRFPDGAYC 
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QATMFECKNHVCIPPYWKCDGDDDCGDGSDEELHLCLDVPCNSPNRF 
RCDNNRC I YSHE VCNGVDDCGDGTDETEEHCRKPTPKPCTEYBYKCG 
NGHCIPHDNVCDDADDCGDWSDELGCNINECEQFGTCPQHCKNTKGS 
YECVCADGFTSMSDRPGKRCAAEGSSPLLLLPDNVRIRKYNLSSERF 
SEYLQDEEY I QAVD YDWDPKD I GLS WYYTVRGEGS RFGAI KRAY I P 

NFESGRNNLVQEVDLKLKYVMQPDGIAVDWGRHIYWSDVKNKRIEV 
AKLDGRYRKWLISTDLDQPAAIAVNPKLGLMFWTDWGKEPKIESAWM 
NGEDRN I L VFEDLG W PTGL S I D YLNNDR I YW S D FKED V I ET I KYDGT 
DRRVI AJCEVPNLCKQ ICSH LCLLR PGGYS CACPQGS SFIEG STTECD 
AAIELPINLPPPCRCMHGGNCYFDETDLPKCKCPSGYTGKYCEMAFS 
KG I SPGTTAVAVLLT ILL I VVI GALA I AG FFHYRRTGSLLPALPKLP 
SLSSLVKPSENGNGVTFRSGADLNMDIGVSGFGPETAIDRSMAMSED 
FVMEMG KQ P 1 1 FENPMYS ARD S A VKWQP I QVTVS ENVDNKNYGS P I 
NPSEIVPETNPTSPAADGTQVTKWNLFKRKSKQTTNFENPIYAQMEN 

EQKESVAATPPPSPSLPAKPKPPSRRDPTPTYSATEDTFKDTANLVK 
EDSEV 

3463 

A 

3 

14105 

S WRRCRPKGAFAS RGAAGAVCYARP PPGEGTARPGT I AEMDRGPAAV 

ACTLLIJUjVACLAPASGQECDSAHFRCGSGHCIPADWRCDGTKDCSD 

DADE IGCAWTCQQG YFKCQS EGQC I PS S WVCDQDQDCDDGSDERQD 

CSQSTCSSHQITCSNGQCIPSEYRCDHVRDCPDGADENDCQYPTCEQ 

LTCDNG AC YNTS QKCDWKVDCRDS SDE INCTE I CLHNE FS CGNGEC I 

PRAYVCDHDNDCQDGSDEHACNYPTCGGYQFTCPSGRCIYQNWVCDG 

EDDCKDNGDEDGCESGPHDVHKCS PRE WS CPESGRC I S I YKVCDGI L 

DCPGREDENNTSTGKYCSMTLCSALNCQYQCHETPYGGACFCPPGYI 

INHNDSRTCVEFDDCQIWGICDQKCESRPGRHLCHCEEGYILERGQY 

CKANDS FGEAS 1 1 FSNGRDLL I GDIHGRS FRI L VESQNRGVAVGVAF 

HYHLQRVFWTDTVQNKVFSVDINGLNI QEVLJWS VETPENIjAVDWVN 

NKI YLVETKVNR I DM VNLDG S YRVTL I TENLGHPRG I AVDPTVG YLF 

FSDWESLSGEPKLERAFMDGSNRKDLVKTKLGWPAGVTLDMISKRVY 

WVDSRFDYIETVTYDGXQRKTVVHGGSLIPHPFGVSLFEGQVFFTDW 

TKI^VLKANKFTETNPQVYYQASLRPYGVTVYHSLRQPYATNPCKDN 

NGGCEQVCVLSHRTDNDGLGFRCKCTFGFQLDTDERHCIAVQNFLIF 

SSQVAIRGIPFTLSTQEDVMVPVSGNPSFFVGIDFDAQDSTIFFSDM 

SK™iFKQKIDGTGREIIJUU^VENVESI^FDWISKm,YWTDSHYKS 

ISVMRLADKTRRTVVQYLNNPRSVVVHPFAGYLFFTDWFRPAKIMRA 

WSDGSHLLPVINTTLGWPNGLAIDWAASRLYWVDAYFDKIEHSTFDG 

LDRRRLGH I EQMTHPFGLAI FGEHLFFTDWRLGAI IRVRKADGGEMT 

VIRSG I AY I LHLKS YDVNI QTGSNACNQPTHPNGDCSHFCFPVPNFQ 

RVCGCPYGMRLASNHLTCEGDPTNEPPTEQCGLFSFPCKNGRCVPNY 

YLCDGVDDCHDNSDEQLCGTLNNTCSSSAFTCGHGECIPAHWRCDKR 

NDCVDG SDEHNCPTHAPAS CLDTQ YTCDNHQC I S KNWVCDTDNDCGD 

GSDEKNCNSTETCQPSQFNCPNHRCIDLSFVCDGDKDCVDGSDEVGC 

VLNCTASQFKCASGDKCIGVTNRCDGVFDCSDNSDEAGCPTRPPGMC 

HSDEFQCQEDGICIPNFWECDGHPDCLYGSDEHNACVPKTCPSSYFH 

CDNGNCIHRAWLCDRDNDCGDMSDEKDCPTQPFRCPSWQWQCLGHNI 

CVNLSVVCDGIFDCPNGTDESPLCNGNSCSDFNGGCTHECVQEPFGA 

KCLCPLGFLLANDS KTCED I DECD ILGS CS QHC YNMRGS FRCS CDTG 

YMLESDGRTCKVTASESLLLLVASQNKIIADSVTSQVHNIYSLVENG 

SYIVAVDFDSISGRIFWSDATQGKTWSAFQNGTDRRWFDSSIILTE 

TIAIDWVGRNLYWTDYALETIEVSKIDGSHRTVLISKNLTNPRGLAL 

DPRMNEHLLFWSDWf?T-THPP TFPaCMnPOMB'nrTTrATMrTPMnnriT mT rk 

YPNRLLYFMDSYLDYMDFCDYNGHHRRQVIASDLIIRHPYALTLFED 
SVYWTDRATRRVMRANKWHGGNQSVVMYNIQWPLGIVAVHPSKQPNS 
VNPCAFSRCSHLCLLSSQGPHFYSCVCPSGWSLSPDLLNCLRDDQPF 1 
LITVRQHIIFGISLNPEVKSNDAMVPIAGIQNGLDVEFDDAEQYIYW 
VENPGE IHRVKTDGTNRTVFAS I SMVGPSMNLALDW I SRNLYSTNPR 
TQS I E VLTLHGD I R YRKTL I ANDGTALG VGF P IG I TVDPARGKLYWS 
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DQGTDSGVPAKIASAN1TOGTSVKTLFTGNLEHLECVTLDIEEQKLYW 

AVTGRGVIERGNVDGTDRMILVHQLSHPWGIAVHDSFLYYTDEQYEV 

IERVDKATGANKIVLRDWPNLRGLQVYHJU^NAAESSNGCSNNMNAC 

QQICLPVPGGLFSCACATGFKLNPDNRSCSPYNSFIWSMLSAIRGF 

SLELSDHSETMVPVAGQGRNAI,HVDVDVSSGFiyWCDFSSSVASDNA 

I RR I KPDG S S LMN I VTHG I GENG VRG I AVD WVAGNL Y FTNAFVS ETL 

I E VLR INTT YRR VLLKVTVDM PRH I WD P KNR YLFWAD YGQR PKI ER 

SFLDCTNRTVLVSEGIVTPRGLAVDRSDGYVYWVDDSLDIIARIRIN 

GENSEVIRYGSRYPTPYGITVFENSIIWVDRNLKKIFQASKEPENTE 

PPTV I RDN INWLRDVTI FDKQVQPRS PAEVNNNPCLENNGGCSHLCF 

AliPGLHTPKCDCAFGTLQSDGKNCAISTENFLIFAIiSNSLRSLHLDP 

ENHSPPFQTINVERTVMSLDYDSVSDRIYFTQNLASGVGQISYATLS 

SGIHTPTVIASGIGTADGIAFDWITRRIYYSDYLNQMINSMAEDGSN 

RTVIARVPKPRAIVLDPCQGYLYWADWDTHAKIERATLGGNFRVPIV 

NSSLVMPSGLTLDYEEDLLYWVDASLQRIERSTLTGVDREVIVNAAV 

HAFGLTLYGQYIYWTDLYTQRIYRANKYDGSGQIAMTTNLLSQPRGI 

NTVVKNQKQQCNNPCEQFNGGCSHICAPGPNGAECQCPHEGNWYLAN 

NRKHCIVDNGERCGASSFTCSNGRCISEEWKCDNDNDCGDGSDEMES 

VCALHTCSPTAFTCANGRCVQYSYRCDYYNDCGDGSDEAGCLFRDCN 

ATTEFMCNNRRC I PRE F I CNG VDNCHDNNTS DEKNCPDRTCQSGYTK 

CHNSNI C I PR VYLCDGDNBCGDNS DENPTYCTTHTCS S S E FQCASGR 

CIPQTFGICDQETDCFDASDEPSSCGHSERTCLSDEFKCDGGRCIPN 

EWICDGDNDCGDMSDEDKRHQCQNQNCSDSEFPCVNDRPPDRRCIPQ 

SWP PDGDVDCTDGHDE IQNCTRRTC S ENE FTCGYGLC I PKI FRCDRH 

NDCGDYSDERGCLYQTCQQNQFTCQNGRCISKTFVCDEDNDCGDGSD 

ELMHLCHTPE PTCP PHE F KCDNGR C I EMM KLCNHLDDCLDNS DEKGC 

G INECHDPS I SGCDHNCTDTLTS F YCS CRPG YKLMSDKRTCVDI DEC 

TEMP FVCS QKCENV I GS Y I CKCAPG YLR E PDG KTCRQNS N I E P YL I F 

SNR YYLRNLTIDG YFYSL I LEGLDNWALDFDRVEKRL YW IDTQRQV 

IERMFLNKTNKETIINHRLPAAESLAVDWVSRKLYWLDARLDGLFVS 

DLNGGHRRMLAQHCVDANNTFCFDNPRGLALHPQYGYLYWADWGHRA 

YIGRVGMDGTNKSVIISTKLEWPNGITIDYTNDLLYWADAHLGYIEY 

SDLEGHHRHTVYDGAIiPHPFAITIFEDTIYWTDWNTRTVEKGNKYDG 

SNRQTLVNTTHRPFDIHVYHPYRQPIVSNPCGTNNGGCSHLCLIKPG 

GKGFTCECPDDFRTLQLSGSTYCMPMCSSTQFLCANNEKCIPIWWKC 

DGQKDCSDGSDELAIiCPQRFCRLGQFQCSDGNCTSPQTLCNAHQNCP 

DGSDEDRLLCENHHCDSNEWQCANKRCIPESWQCDSFNDCEDNSDED 

SSHCASRTCRPGQFRCANGRCIPQAWKCDVDNDCGDHSDEPIEECMS 

S7VHLCDNFTEFSCKTNYRCIPKWAVCNGVDDCRDNSDEQGCEERTCH 

PVGDFRCKNHHC I PLRWKCDGQNDCGDNSDEENCAPRECTESEFRCV 

NQQCIPSRWICDHYNDIiWGDNSDERDCEMRTCHPEYFQCTSGHRVHS 

ELKCDGSADCLDASDEADCPTRFPDGAYCQATMFECKNHVCIPPYWK 

CDGDDDCGDGSDEELHLCLDVPCNSPNRFRCDNNRCIYSHEVCNGVD 

DCGDGTDETEEHCRKPTPKPCTEYEYKCGNGHCIPHDNVCDDADDCG 

DWSDELGCNKGKERTCAENICEQNCTQLNEGGFICSCTAGFETNVFD 

RTSCLDINECEQFGTCPQHCRNTKGSYECVCADGFTSMSDRPGKRCA 

AEGSSPLLLLPDNVRIRKYNLSSERFSEYLQDEEYIQAVDYDWDPED 

IGLSVVYYTVRGEGSRFGAIKRAYIPNFESGRIWLVQEVDLKLKYVM 

QPDGI AVDWVGRH I YWSD VKNKR I E VAKLDGR YRKWL I S TDLDQPAA 

I AVNPKLGIiMF WTD WG KE PKLE S AWMNG ED PN I LVFEDLG WPTGLS I 

DYLNNDRI YWSDFKEDVI ET I KYDGTDRRVIAKEAMNP YS LD I FEDQ 

LYWISKEKGEVWKQNKFGQGKKEKTLWNPWLTQVRIFHQIiRYNKSV 

PNLCKQICSHLCLLRPGGYSCACPQGSSFIEGSTTECDAAIELPINL 

PPPCRCMHGGNCYFDETDLPKCKCPSGYTGKYCEMAFSKGISPGTTA 

VAVLLTILLIWIGALAIAGFFHYRRTGSLLPAIiPKLPSLSSLVKPS 

ENGNGVTFRSGADLNMDI GVSGFG PETAIDRSMAMSEDFVMEMGKQP 

IIFENPMYSARDSAVKWQPIQVTVSENVDNKNYGSPINPSB\IVPE 


I02Z0 


„i o o e a-s-is mi- 3; ,„ o a e: & o s: 






TNPTSPAADGPQVPKW\NLFKRKSKQT\TNF\ENPFY\AQMGERVKR 
ESVA\ ATPP\ PS \ PSLPAKP\ KPPSRRDP\TPTYSATEDTFKDTANL 
VKEDSEV 

3464 


4 ft 


LWPLGPEPRAGC I CRGNLS S TRGTS TRPGATAD FGLHHGGLPAGLS S 
CSAGVPTSTCCAGK*NL * FGLPRGCSCPGFLPSKGTI ILDCVNKDLK 
NGKATPKDSKYVEKLKALF 

3465 

A 

3 

450 

L PKRND I FDS LQKAKFD VSGLTTEQMLRKDQKT I YRQG VKVAI S AI Y 
MDLE I CE VTjER SHS P PLKLTPAS S THPNLHAYLQGNTQVSRKKLLPL 

LQEALSAYFDSMKIPSGQPETADVSREQVDKELDRASNSL I SGLSQD 
EEDPPPK 

3466 

A 

x 

AO Q 

*x 4 2? 

YD Y 1 FNY IiPHSIiPQREAVFPSLTHLAVLAFLVLGPWPTLFDP * SDIC 
FLGLTTEQMLRKDQKTI YRQGVKVAI S AI YMDLEAFLQRSNLLADLH 
AFCQAHS \ YDVLVAMTI FFNTHNEPVRQLAI FCPHVALQTTI CEVLE 
RS 

3467 

A 

283 

587 

FWTPQVCCSCRWRVLPMQRSDT\ALEEAV\7VEVLDHRPIEPKHCPPC 
HVSVELVGSCATLVTERILQGAPEILDRQTAALL/HGTIILDCVNMD 
L*IGMATPKD 

3468 

A 

2 

1460 

GRAAFVGETSPRPGAPLLDQGRRRTLGFIMEDYLQGCRAALQESRPIi 
HVVLGNEACDLDSTVSALAIAFYTAKTTEAEEVFVPVLNIKRSEDPIi 
RGDIVFFLQKVHIPESILIFRDEIDLHALYQ\AGQLTLILVDHHILS 
KSDTAL\EE\AVAEVL\DHRPIBPKHCPP\CHVSVELVG\SCATLVT 
ERI \ LQG\ APE ILDRQTAALLHGTI ILDCVNMDLKIGKATPKDSKYV 
EKLEALFPDLPKRND I FDSL\ QKAKFDVSGLTTEQMLRKDQKTI YRQ 
GVKVAISAIY74DLEAFLQRSNLLADLHAFCQAHSYDVLVAMTIFFNT 
HNEPVRQLAIFCPHVALQTTICEVLERSHSPPLKIiTPASSTHPNIiHA 
YLQGNTQVSRKKLLPLLQEALSAYFDSMKI PSGQPET\ ADVS \ REQV 

DKELDRASNSLISGLSQDEEDPPLPPTPMNSLVDECPLDQGLPKLSA 
EAVFEKC S Q I S LS QS TTAS L S KK 

3469 

A 

2 

230 

WQDFYCQ*HIFGAAFFTHPDGPSNWAFSNFVMFCHCLPKCWDYRGES 
FKPAHrKI FVDQCTWKKKKI RKI KNKFL 

3470 

A 

134 

929 

RTAARGCNGIPGAAAWEAALPRRRPRRHPSVNPRSRAAGSPRTRGRR 
TEERPSGSRLGDRGRGRALPGGRLGGRGRGRAPERVGGRGRGRGTAA 
PRAAPAARGSRPGPAGTMAAGSITTLPALPEDGGSGAFPPGHFKDPK 
RLYCKNGGFFLRIHPDGRVDGVREKSDPHI KLQLQAEERGWS IKGV 
VC/ SNRYLAMK\EDGRLIASKCVTDECFFFERLESNNYNTYRSRKYT 
SWYVALKRTGQYKLGSKTGPGQKAILFLPMSG 

3471 

A 

1 

2604 

MATF I S VQLKKTS EVDLAKPLVKF IQQTYPSGGEEQAQYCRAAEELS 
KLRRAAVGRPLDKHEGALETLLR YYDQI CS I EPKFPFSENQI CLTFT 
WKDAFDKGSLFGGSVKIJUASLGYEKSCVLFNCAALASQIAAEQNLD 
NDEGLKIAAKHYQFASGAFLHIKETVLSALSREPTVDISPDTVGTLS 
LIMLAQAQEVFFLKATRDKMKDAI IAKLANQAADYFGDAFKQCQYKD 
TLPKEVFPVLAAKHCIMQANAEYHQS ILAKQQKKFGEEIARLQHAAE 
LIKTVASRYX)EYW^KDFSDKINRALTAAKKDNDFIYHDRVPDLKDL 
DPIGKATLVXSTPVNVPISQK^TDLFEKWVPVSVQQSLAAYNQRKAD 
LVNRSIAQMREATTLANGVLASI^LPTUVIEDVSGDTVPQSILTKSRS 
VIEQGGIQTVDQLIKELPELLQRNREILDESLRLLDEEEATDNDLRA 
KFKERWQRTPSNELYKPLRAEGTNFRTVLDKAVQADGQVKECYQSHR 
DTIVLLCKPEPELNAAIPSANPAKTMQGSEVVNVLKSLLSNLDEVKK 
EREGLENDLKSVNFDMTSKFLTALAQDGVINEEALSVTELDRVYGGIi 
TTK^QESLKKQEGLLKNIQVSHQEFSKMKOSNNEAl^REEVLKNLAT 
AYDNFVELVANLKEGTKFYNELTEILVRFQNKCSDIVFARKTERDEL 
LKDLQQSIAREPSAPSIPTPAYQSLPAGGHAPTPPTPAPRTMPPTKP 
QPPARPPPPVLPANRAPSATAPSPVGAGTAAPAPSQTPGSAPPPQAQ 

GPPYPTYPGYPGYCQMPMPMGYNPYAYGQYNMPYPPVYHQSPGQAPY 
PGPQQPSYPFPQPPQQS YYPQQ 

3472 

A 

478 

697 

YLIYQSRFFLFSSIRYCQMPMPMG/YCNPYAYGQYNMPYPPVYHQSP 
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GQAPYPGTQQPS Y PFP\ QPPQQS YYPQQ 

3473 

A 

21 

194 

HVSSRLREFLFKHPKRTATLSMRNTSVMKKGGIFSAEFLKVFLPSLL 
LSHLLAIGLG 

3474 

A 

2 

767 

APLS PGAQLGRGAPTSAFPPPAAEAHPAARRGLRS PQLPSGAMSQNG 
APGMQEESLQGSWVELHFSNNGNGGSVPASVSIYNGDMEKILLDAQH 
ESGRS S SKS\ SHCDSPP\ S SQTPQDTNRAF\ ETDTH\ S IGREKQLTV 
LREDDIERRKEVESILRKNSDWIWDWSSSAGKIFPPKEFLFKTPGSR 
TATLSMR\NTSVLKKGGIFSAEF\LKVFLPFSAALSIAWPIGLGIYI 
GKASDQPSTSTFGWKEPGSLDLVR . 

3475 

A 

1 

4220 

MLMIVQDSALLEAIGCQMEMGGGENNLKSHSRTNSGISSASGGSTEP 
TTPDS ER PAQALLRD YALNTOS AAGLL I RS IHL VTQRLNS Q WRQDMS 
ISIAALELLSGIJUCVKVMVDSGDRKRAISSVCTYIVYQCSRPAPLHS 
RDLHSMIVAAFQCLCVWLTEHPDMLDEKDCLKEVXiE I VELG I SGSKS 
KNNEQEVKYKGDKEPNPASMRVKDAAEATLTCLVNETTL I KYS RLPT 
INKHSFRYFVXiDNSVILAMLEQPLGNEQIISHLTICDYHCDPNYPQV 
HFFFLMVIGISQYVKEACSQTLIAGKGKESEGYRICGYKLFVPEPRP 
VPKNDVG FKYS VKHRPFPEEVDKI PFVKADLS I PDLHE I VTEELEER 
HEKLRSGMAQQIAYElHLEQQSEEEIiQKRSFPDPVTDCKPPPPAQEF 
QTARLFLSHFGFLSLEALKEPANSRLPPHIilALDSTIPGFFDDIGYIi 
DLLPCRPFDTVFI F YMKPGQKTNQEI LKNVES S RTVQPHFLEFLLSIi 
GWSVDVGRHPGWTGHVSTSWS INCCDDG\ EGSQQEEVI SSED IGAS I 
FNGQKKVLYYADALTEIAFWPSPVESLTDSLESNISDQDSDSNMDIi 
MPGILKQPSLTLELFPNHTDNIiNSSQRNEHAFLSISRKQWKCLPCES 
SWMAKPPAQAHTCILVRTKAGPGPGEAAVLVDSRQWEAEDRERSSQR 
LLRFTYGPSFPAFKVPDEDASLIPPEMDNECVAQTWFRFLHMLRNAF 
PVVDAAIiVLLPVTAARLMIFLLGFCHILWGLEVALLLSVPSTDCNPV 
DLSNPAIISSTPKFQEQFLNVSGMPQEIiNQYPCLKHLPQIFFRAMRG 
ISCLVDAFLGISRPRSDSAPPTPVNRLSMPQSAAVSTTPPHNRRHRA 
VTVNKATMKTS TVRVY I TI VML YKVS TAHAS KVQHQTS S TS PLSS PN 
QTSSEPRPLPAPRRPKVNSILNLFGSWLFDAAFVHCKLHNGINRDSS 
MTAITTQT^SMEFRRKGSQMSTDTMVSNPMFDASEFPDNYEAGRAEAC 
GTLCR I FCS KKTGEE I L PAYLS RFYMLL I QGLQ INDYVCHP VLASVI 
LNSPPLFCCDLKGIDVWPYFISALETILPDRELSKFKSYVNPTELR 
RSSINILLSLLPLPHHFGTVKSEWLEGKFSNDDSSSYDKPITFLSL 
KLRLVNILIGALQTETDPNNTQMILENFPLSELMTEISTGVETTANS 
STSLRSTTLEKEVPVIFIHPLNTGLFRIKIQGATGKFNMVIPLVDGM 
I VS RRALGFL VRQTVIN I CRRKRLESDS YS PPHVRRKQKI TD I VNKY 
RNKQLEPEFYTSLFQEGILEPDECLQAWITHSGVNGPLETIMRTIVA 
MPLPRAADVCPLNCAWELPTLHCVFCLLCPRTTGPPAGPSGQRR 

3476 

A 

3 

1676 

HAS DS FR YFVLDNS VI LAMLEQPLGNEQNDFFPS VTVLVRGMSGRLA 
WAQQLCLLPROAKANQKIiFVPEPRPVPKNDVGFKYSVTCHRPFPEEVD 
KIPFVKADLSIPDLHEIVTEELEERHRK\IiR\SGMAQQIAYEIHLEQ 
QSEEELQKRSFPDPVTDCKPPPPAQEFQTARLFLSHFGFLSLEAIiKE 
PANSRLPPHLIALDSTIPGFFDDIGYLDLLPCRPFDTVFIFYMKPGQ 
KTNQE I LKNVE S SRTVQPHFliEFLLSLGWSVDVGRHPGWTGHVS TS W 
SINCCDDGEGSQQEEVISSEDIGASIFNGQKKVLYYADALTEIAFW 
PSPVESLTDSLESNISDQDSDSNMDLMPGILKQPSLTLELFPNHTDN 
LNS S QRLS PS SRMRKLPQGRPVPPLGPETR\ VS \ WWVERYDD I ENF 
PLSELMTEISTGVETTANSSTSLRSTTLEKEVLVIFIHPLNTGLFRI 
KI QGATGKFNMVI PLVDGMIVSRRALGFLVRQTVINI CRR KRLESDS 
YSPPHVRRKQKITDIVNKYRNKQLEPEFYTSIiFQEVGLKNCSS 

3477 

A 

288 

589 

WCSRRRGWYLLLGFHNYWRSSTFLVRCTPSCPGGCCPRYGIYPVRSC 
PRLPGGVSRYGSIHSG/RWCSWSPSWSPWLTSVTPRIiYVALM*AWC 
PWGKQP 

3478 

A 

1250 

1909 

GAG PDMVWDTELELALKI SKGLQRP I KAHREERED I GKHE SRCV I YF 
GTAKKWILKDKNGRSRVDVISHRLKVSSGLCKTHEIGFDPLALKCPL 
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1 RSRTAPWWPLDRVS FDLHHLVI GNFFVGNRKI PLDYLVYGFAHNNRW 

1 KLLVOSWSDGC7VWRTFV5T.VV Cl?CVliQT7PTDT»Pvr\T»v<-ipTTnT ■»» 

■ ivuuvvijnQuvjv- v nrt i r vjJj v ivo r oJ*Aor LXFITKQRjKSSED1jAX»K\QI 

CANTARVILKLKHFHFVSYMCTFLFTCENGHL 

3479 

A 

3 

246 

AAAMSALSLLILGLLTAVPPASCQQ/ GEPAHMILTVGNKADGVLVGT " 
1 DGRYSSMAASFRSSEHENAYFTJVPPPPr:TrwpcT»r>M 

3480 

A 

162 

530 

ELLQPMS ALSLL I LGjLLTAVPPAS CQQGLGNLQPWMQGL I AAAAFLV 
L VA I AFAVNH F WG AKEE P / E S P AHM I LTVGNK\ ADG VL VGT\ DG \ R Y 
1 SSMAASFRSSEHENAY\ ENVPPFPnTrvrpQnnoM 

3481 

A 

4 

411 

NAATSLTANPDATTVN I EDPGETPKHQPGS PRGSGREEDDELLGNDD 

1 SDKT/EGTVSGlTOSSEVNDTWPT,PWWnvnnTsrDT>T T CT T nr«nvvr.rnn 

WTFEYYQTFFDVDTYQVFDRIKGSLLPIPGKNFVRLYIRSIV 

3482 

A 

160 

518 

1 MTCNLKLLAGQKKS SPFWTFE YYQTFFDVDTYQVFDR I KGSLLP I PG 
1 KNPVRLYIRSNPDLYGPFWTPATT.VPnTaTcr«>vTT cmdt tut r»T?vrrvtt 
YVPEFRKVS I AATI I YAYAWLVPLA 

3483 

A 

12 

197 

SIIFLPTLSSTFSIKIRLKPQDLGNWWQQTALVLGVISLNIHEYPFIi 
QCSSFLSGWGRALS 

3484 

A 

2 

321 

ARGVYGYSLFIYIPTAILWIIPHKAVRW\ILVQ\IAIX3ISGSLLAMT 
FWPAVREDNPJ^VALATIVTIVLLHMLLSVGCLACFFDAPEMDHLPTT 
TGTPNQTVAAAKSS 

3485 

A 

187 

1280 

PLARLNCLCSSLSPSPLSNLSLSFLIGTSLGCTCSLKHSHKPCQIFF 
Jjj. J-^kkijXUKMAAVUD1jQFEEFGNAATSLTANPDATTVNI\EDPGET 
PKHQPGSPRGSGREEDDELLGNDDSDKTELLAGQKKSSPFWTFEYYQ 
TFFDVDTYQVFDRIKGSLLPIPGKNFVRLYIRSNPDLYGPFWICATL 
VFAIAI SGNLSNFLIHLGEKTYH YVPEFRKVS IAATI IYAYAWLVPL 
| -u* irviviMojv vi'jimx vo x or Lih, JL VL V Y va x oIjFIYIPTAIIjWI IPQK 
AVR W I Ij VM I ALG I S G S L LAMTF WPA VREDNRR VALAIj \ IVTIVLLHM 
L»LSVGCLAYFFDAPEMnTTT i P r r r r r rn , TDKrr\ r mrA 7\ nvc o 

3486 

A 

112 

563 

LDSSHCCSCSTALFRTQTTAAAVPRMVIRVYIASS\SGSTAI/RGKR 
QQD VLGFLEANK I G FGR KD I AANGENRKWMRENVPENS R PGTG \ YPL 

PPOIFNESOYRG\ DY\ naPPr;DDT7TriSTM\ 7\\rvr»T T nnr tit*, nnr,,.,,^-,-. 

QEGAKPKAGQALTL 

3487 

A 

105 

335 

GRLF PKVLS YHS VG YLPL I LFCHFLLANC I LCCLMHFL * FFQS YRF * 
G * KFGFTQHHCHY I FHKQWPLLWKNFPEH 

3488 

A 

993 

1338 

ONOTiKPTTPPPPP'CTPTT'CT T T DVDDA Xji/^"-t»t r-»rrrrT/~rvT-r ^-.y----.— —^- — --— _ 

r r r r c iKVis XjijjjFKKKAMGT I STHCNTjCLPGFQGNS PAS 
ASQGRL/ AGLHGMGPPCR/ RGTFVFLIGDRGFLHVGSSWSWNSQPQV 
IRP\PRPSPKCWGLQGMEATVPSP 

3489 

A 

2 

454 

x-Aijjr-r'AiJMir'OV^iXrtJ_iA±'£>i> X JL VAJ.JKAJbEPPSPYHGQjLKMQSIPG\PPQ 

PRSFFLSSPLDVDSPQAPRHCTGLPAPSLSSPPWSCPTLFSWFFEAS 
G VCHHWTS VI AASNSSP T FPPT.PHPTTPT.P t q bujt TroDT*jr>nijer\-r»T->T r> 

RGA1* SPVLNS 

3490 

A | 

1109 

1448 

PRLISKCFLLLLFFRQSHFVAQAGVQWCDLRSVYHLPPGjLKRFSCLS 
LPSSWDYRHLPPCPANFCNFSRDRVSPCCPGWFRTPD\PGDPPASAS 
PKWWE YTL * PPGPDHPKAF 

3491 

A 

14 

250 

VLSVGLPTGDTGIGLSRKTSPAJPVAXIHSHSLKQAHP*LTI*GSRTGN 
IHAI YYRGVLKYCNFI CLGFMLR YQLVS PS F 

3492 

A 

1 

1065 

MGSVPLIKRPKRASLSLPPHEVSNAJNTODDPLESEKPNPNKLT^ 
no lAJjVjwoir'XiAXjFljKFFXijFAVKCTRIVVVMCISPM^ 
TGHLCIFFGEMSIQVERIVDKRKNKJCGKTEYIiVRWKGYDSEDDTWEP 
EQHLVNCEEYIHDFITRRHTEKQKESTLTRTNRTSPN^ 
SNFSKTSPKALVIGKDHESKNSQLFAASQ 

AKSG I KI LVPKS PVKSfcTAVDG FQSE S PEKLDPVEQGQEDTVAPEVA 
AEKP VGALLGPGAERARMG SR PR I HPLVPQVPGPVTAAMATGLAVNG 
KGECLGSCSGSWIPEGSGPSLEIRP 

3493 

A 

90 

1937 

AGGNQRTQSPRKNFMAFQASHRPAWGKSRKKiWQYEGPTQKLFLKRj^ 
NVSAPDGPSDPS I SASSEQSGAQQPPGLQVERIVDKRKNKKGKTEYIj 
VRWKGYDSEDDTWEPEQHLVNCEEYIHjOFNRRHTEKQKESTLTRTNR 
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TSPNNARKQISRSTNSNPSKTSPKALVIGKDHESKNSQLPAASQKFR 
KNTAPSLSSRKNMDLAKSGIKIIiVPKSPVKSRTAVDGFQSESPEKLD 
PVEQGQEDTVAPEVAAEKPVGALLGPGAERARMGSRPRIHPLVPQVP 
G P VTAAMATGLAVNG KG T S P FMD ALTANGTTN I QTS VTG VTA S KR KF 

IDDRRDQPFDKRLRFSVRQTESAYRYRDIWRKQDGFTHILLSTKSS 

ennslnpXevlrevqsalstaaandskvXvlpravgsvfXccglnfn 

YFKPR\LPNARKRESIKMAEAIRNFVNTFIQFKKPIIVAVNGPAIG\ 
LGASILPLCDWWANEKAWFQTPYTTFG\QSQEGCSPVMFPKIMGGS 
I C I LD AVLGDRNLTAQ \ EACGKGL VS R V F WPGTS PRKVMVR I KELAS 

CNPWLEESKALVRCNMKMELEQANERECEVLKKIWG\SAQGDGTRM 
LKVLCRGKIE 


3494 


572 


WCLQHD/ LGHAS I FKKS WWNHVAQKFVMGQ/ LKGFSAHWWNFRHFQH 

HAKPNIFHKDPDVTVAPVFLLGESSVEYGKKKRRYLPYNQQHLYFFIi 

IGPPLLTLVNFEVENLAYMLVCMQWAVSGVAQDPGHTAAVAGGGASG 

DSTCP*RQTGCT\DLLWAASFYARFFLSYLPFYGVPGVLLFFVAVRY 
GREWR 


3495 


25 


430 


YKNNS FPP I LALGKCAMLTFPLPLSHQAQSQGHRAAE YTCEGRS PFP 
VGSLPWGQAAMRAGGWVGRTAECSFSGMESRPG*RVSGRGAGGQPRP 
SPGMSLGGPVPPAPSSPLLPPAGRCNPRTERTWNLPTRSWA 


3496 


3497 


3498 


3499 


3500 


821 


32 


77 


18 


28 


1412 


616 


403 


753 


342 


GSARASPQGPGKPLVRVDHTDEPHPQGDRPREAPGLGQLSAGSHLQR 
GALTF\HQLVQRAPQLPDRAPPLPQDAETQ\LQPGCPAC\QVAVCQ\ 
AR PQLR \ S EAL P \ HRAGGHRQVPEE S I» VTS C WTPTS I S EGNTQAGRE 

GLRAPATKPSPRRDRIPPTPPLTSLGVPCLPSWYCCLPLGPLTCVFS 
SPMALALGLMGQG 


VLQCSHGCFSQPSSGG\LTDEAASSCCSDADPSTKDFLLQQTM\LRV 
KDPKKSLDFYT\RVLGMT\LIQKCDFPIM\KFSLYFLAYEDKNDIPK 
\EKDEK\ IAWAV\ SRKATLELTQQFGALKDDATQS YHNGNSDPRGFG 
H I G I A VP \ D VY S A CK\ R FE ELG VKFVKKPDDG KMKG L \ AF I QD PDG Y 
W I E I LN PNKMATLM 


CSIRCSFQCPACRWPSLSHAHPLPQLTLCVSSFPQVNIDHETRELTR 
MNLQTATATCFDAAQGKTRTLMERDSYPRFLKSPAYRDLAAQASAAS 


ATLSSCSLDEPSHT 


TQPQLTSTCYRAFASWRTRSLLEPATILPTTCCPAPAAMCRTLAAFP 
TTCLERAKEFKTLLGIFPHKS\ELGFDTGSTGKFEWGSKHSKGDLKT 
SQKDVLGWRESF\DLLLSSKNG\VAAFH7VFLKTEFSEENIiE\FWL\A 
CEEFKKI \ RSATKLASRAH\ QI FEEF ICSEAPKEVNIDHBTRELTRM 
NLQTATATCFDAAQGKTRTLMEKD / S PYPR FL KS PAYRDL\ AAQASA 
ASATLS S CS \ LDEPSHT 


PGSTHAS ADAWVHPKNI / S S VN ALS PGPHCAQTEV / 1 VS PAPRCLCH 

RRGPRPSCCPNPVPSLTSCLTRFPSLCRATLKNGRKACLNPASPIVK 
KIIEKMLNSDKSN 


3501 


A | 1 

a i r 


274 


MARATLSAAPSNPRLLRVALLLLLLVAASRRAAGASWTELRCQCLQ 
TL<y3IHIiKNIQSVN\ATLKNGKKACLNPASPMVQKIIEKILNNP 


3502 


3503 


5504 


893 


MSSREVTPPGKDVARSLRGSKGSDPRRTALGSRSASSQAVISVSLRA 
AGSRSRSRDSGQKENIPQLAGVTQDSQTRTSLVSAPTPLHPRGGAIA 
FLPNSGSIWSSGNFPGPGLRAFQPQPCIKGVRRSRRATEPGPQAAPC 
QLSSSHSRSNRLLSPMARATLSAAPSNPRLLRVALLLLLLVAASRRA 
AGAPLATELRCQCLQTLQGIHLKNIQSVKVKSPGPHCAQTEVM*VSP 
IAAAATAGVPDS PAAPNPVLS PTS CLTR FP S LCRATLKNGQKACLNP 
ASPMVKKIIEKMLKK 


407 


406 


ATEPGPQAPPRQLFRSSHSRQTRLLSPMARAALSAAPSNPRLLRVAL 
LLLLLVAAGRRAAGASVATELRCQCLQTLQGIHPKNIQSVNVKSPGP 
HCAQNEVHKHTQEMGGKACLNP \ AS P 1 1 KKI I EKMLNSDKSN 


GPDAGWGDRIPLEILVQIFGLLVAADGPMPFLGRAARVCRRWQEAAS 
QPALWHTVTLSSPLVGRPAKGSVKAEKKLLASLEWLMPNRFSQLQRL 
Tli IHWKS QL VGECCPRLTFLKL SGCHG VTADAliVMLiAKAC 
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3505 

A 

2 

347 

AARVCRR WQBAAS QP AL WHTVTLS S PLVGRPAKGGVKAEKKLLAS LB 
WLMP1TOCWVGRTRVGTLVQLCLRCRPRFS QLQRLTL I HWKSQVHPVL 
KVRAPGCDCTSPATLWDWSVL 

3506 

A 

2 

1333 

VYKPRPFRSLGLPRAPSVVMAAPASRQVlO^ARAAPRPRSAEDWWWD 
RLAPRGSGYHLItQSDSMLIiVLSEPGPARPRAQRRASRRTPRQPPRGP 
SAAAKPKAGLRSEAAAAPAPAPAPTPTPEEGPDAGWGDRIPLEILVQ 
J. r (j>LL V AADGPMPFLGRAARVCRRWQEAASQPAL WHTVTLS SPLVGR 
PAKGG\VKAEKKLLASLEWLMPNRFSQLQRLTLIHWKSQVHPVLKLV 
GECCPRLTFLKLSGCHGVT7VDALVMLAKACCQLHSLDLQHSMVESTA 
WSFLEEAGSRMRKLWLTYSSQTTAILGDLLGSCCPQLQVLEVSTGI 
NRNS I PLQLPVEALQKGCPQLQPGPCPQVLRLLNLMWLPKPPGRGVA 
PGPGFPSLEELCLASSTCNFVSNEVLGRLLHGSPNLRLLDLRGCARI 
TPAGLQDLPCRGQQSWWAGR j 

3507 

A 

47 

O J± 

ivyx-'x* tL v wt'J\\jKjt!,KU&\3AKV JUvEARLLR WCPMS FKRNRSD \RFYS TRC 
CG\CCHVRTGTIILGTWYMVVNLLMAILLTVEVTHPNSMP\AVNIQY 
E\ VI \GNYYS \ SERMADNAC/ VFFFAVS VLKVYNPVPMLVYGSNFLY 
royvvjifliirf r \uiKJjrur vaj£> \l_ljVAloSLTYLPRIKEYL\DQLPDF 
\PDKDDLRAL\DS\SCLLFIVLVFFAIiFIIFKAYLINCVWNCYKYIN 

NRN/VCPEIAVYPAFEAPPn\ YVT.T>T^T?Mf2T?17TrMDT7irC»D , CJ , D , DT T T»T*Jr> 

EGNSGLWTINP 

3508 

A 

246 

559 

MPVAVMAES/AFSFKKLLDQCENQELEAPGGIATPPVYGQLLALYLL 
HN/GQRIWQRDFPGIYTTINAHQWSETVQPIMEALRDATRRRAFALV 
SQAYTS I I ADD FA 

3509 

A 

2 

250 

1 1 ADDFAAFVGLPVEEAVKGI LEQGWQADSTTRMVLPRKPVAGALDV 
S FNKF I PLSEPAPVPP I PNEQQLARLTD YVAFLEN 

3510 

A 

2 

119 

xjaxmo va.1 xjjj«i\\^icv±\ix*laK/W J. I-Lio x-NKQLQI FRGLiSHRJE \ 

3511 

A 

197 

921 

GTVWGLAVRTVQRRGPAAKMPVAVMAESAF\ S FKKLLDQCENQELEA 
P \ GG I A\ TPKVYG QLLAL YLLHNDMN \ NAR YLWKR I PPA I KS ANS EL 
<jvj \ j. w t> v L?y k x wy \ kxjffg I ytt IN \ AHQWSET\VQPIYGKHFRDAT 
R / RDRAFAPGLLQAVY FQS IRPM I FAAFVG \ L PV\ EE AVKG I L \ EQG 
™ \W rt \i-JZ> x a rtiNVjov-±r^»iVf V^KvjJf WJJ vr FrNKx ISLYSEP \ APV \ PPI 
PNEQQLARLTDYVAFLEN 

3512 

A 

3 

356 

HEVYSGGDDGLLRGWDTRVPGKFLFTSKSYDEHILLWDTRNMKQPLA 
LTSHTLPDSLVYGADWSWLLFRS 

3513 

A 

1 

179 

VSSSVPRSSRGP 

3514 

A 

63 

386 

LRTLISCHLWLMLLFLHTGGDDGLLRGWDTRVPGKFLFTSKR*PPK 
GQHSVTTPAPAFPGSPALPFPGWCVLDTGSEAASACRHTMGVCSIQ 
S S PHREH I LATGR 

3515 

A 

1 

147 

GI^TPGQGGVWRIKGHPFHHHLLI^a^HSGFKILNCQKAMGCGSL 
RA 

3516 

A 

450 

1154 

ivon vA-in±r boo JjAxjiiiiyuijAljiijiJWo I GKTGRAGDQPLKI I SSDSTGQ 
LHLLMVNETRPRLQKVASWQAHQFEAW/TCCFQLPWHPEIVYSGGDD 
GLLRGWDTRVPGTFLFTS / 1 KTHHGCVAS IQS KPS SGAHPWPRES YD 
ETHPTV\DTRNMKOPLAnTPVru^^\7WPTirLJTiDi?xjTJXJT t t t\ nr-MUCPu 

KILNCQKAMEERQEATVLTSHTLPDSLVYGADWSWLLFRSLQRAPLV 
VLS 

3517 

A 

73 

168 

P I QAGLS LLKTAGKKGPEGS PPVFYS PLNPGA 1 

3518 

A 

322 

673 

RLWASPAAPGKIOCEMGNSMKSTPAPAERPLPNPEGLDSDFIiAVLSDY 
PSPDINPPIFRRGEKLRVISDERGWWKA\ISLSTGRESYIPAICVAR 
SYHGWLLRGPG\KNMAEELLQLPD 

3519 

A 

624 

1580 

KAATSENKIICCEWRTSQAALMLHRLWASPAAPGKKKEMGNSMKSTP 
APAERPLPNPEGLDSDFLAVLSDYPSPDISPPIFRRGEKLRVISDEG 
GWWKAISL\ STGRESYIPG\ ICVARV\ YHGL/ WLFEGLGRDKAEELL 
QLPDTKVGSFMI\RESETKKGFYSLSVRHRQVKT\ x^IFRLPNNWYY | 


.1 0 O S <+ 6 '4- 3 , O S £ 6. 0 R 






I S PRLTFQCIA EDL\ VNHYSEVADGLCCVLTTPCIi\TQSTA\ APAV\ 
RACSSPVTLRQKTTOWRRVSRI^EDPEGTENPLGVVBSLFSYGLRES 
IASYIiSLTSEDISSFDRKKKSISLMYGGSKRKSSFFSSPPYFED 

3520 

A 

24 

688 

VDHPPYKRTEKVRAQRSPKSQDIYLG\LLVTCLYRFSGPEEPNSNIQT 
RVVLK\RLFMSRT\NRP\PLS\LSRMIREDEAFLAGKKKNPAVVVGP 
QKLIMCGVQKGTPKLKVCALGGTQAGPHKRI \LRARGQRS ITFDQLA 
LD / S PLRGCGTVLlA SGPSKG \REVYRHFGKAPGNP\HSHTKPYVRS 
KGRKFE \ RARG \ RRASRG YKICLTLDPTLLYKKI FADS EKKKK 

3521 

A 

109 

446 

EDEKVERVPSPDHRRRSYRDLDKPRRSPTLRYRRSRSRSPIRRSRSP 
KRRS PFPRRERHRS KS \ SRRHRSRSRDRRHRSRS KS PGHHRSHRHRS 
HSKSPERSKKSHKISRSRE 

3522 

A 

195 

391 

PGVAGARHPLLGAAPKGRL\ EVIiAGVFHGS AANQVWVS KSHPQF I PT 
DQE IGGDRALKGTCRLPLS 

3523 

A 

112 

492 

AHSRTPARPENRAAASAPRKPRRAMSSPPEGKLETKAGHPPAVKAGG 
MRIVQKHPHTGDTKEEKDKDDQEWESPSPPKPTVFISGVIARGDKDF 
PPAAAQV\AHQKPHASMDKHPSPRTQHIQQPRK 

3524 

A 

3 

982 

GTRVGVAWRSVRLLLGPGAGLRGGVVLKVSPSPCRGGRRVPVRAEGA 
RGRGPRNKVPGAS SGAAVGAARLTAPLLiAKAMAS KLLRAVTlA GPPG 
LGKGAPVCQRIAQNFGLQ\HLS \ SGHFI»RENIKA\ STEVGEMANQVY 
REKVFL VPDHV I TRLMMS ELENRRGQHWLlA DGF PRTLGQ AEALDKI 
CEVDLVISLNIPFETLKDGLN\RRWIHP\PSG\RVYNL\DFNPPHVH 
GIDDVTGEPLVQQEDDKP\EAVAA\RLRQYKDVAKPVIEli\YKS\RG 
VLHQFSGTET\NKMWP\YVY\TLFSNKITPIQSKESILTIiAQWEEPW 
NDVGHSFQ 

3525 

A 

19 

445 

LYFRRRKFQLC^LCTQMLEKSPYDQAAWILKARALTEMVYIDEIDV 
DQEGIAEMMLDENAIAQVPRPGTSLKLPGTNQTGGPSQAVRPITQAG 
RPITGFLRPSTQSGRPGTMEQAIRTPRTAYTARPITSSSGRFVRLGT 
A 

3526 

A 

1 

623 

OAAWILKARALTEMVYIDEIDVDQEGIAEMMU5ENAIAQVPRPGTSL 
KLPGTNQTGGPSQAVRP\TSHSEKPITGFLRPSTQSGRPGTMEQAIR 
TPRTAYTARPITSSSGRFVRLGTASMLTSPDGPFIN*SRL»NI»TKYSQ 
KPKMAKALPEYIFHHENDVKTALELAAIiSTEHSQYKPWWWKVQIEKR 
YYRLGMYREAEKQLI SAMKQ 

3527 

A 

1 

1509 

MKAEICKSRTVKDWHNHQKLGEGNKTDSPPQPSEGTNTANTLILDFW 

PLEMIACCWETFPQRLTAELFMNPTHRWHGYKNQSVGALRAPLGQGP 

SRRGIiPGRVGRRQLFTPRPPLSWSAGPSLAAPAAMSSEMEPLIiLAWS 

YFRRRKFQLCADLCTQMLEKSPYDQEPDPELPVHQAAWILKARALTE 

MVYIDEIDVDQEGIAEMMLDENAIA/PSSTPITQAGRPITGFLRPST 

QSGRPGTMEQAIRTPRTAYTARPITSSSGRFVRLGT\LGMYREAEKQ 

FKSALKQQEIWDTFLYIJVKVYVSLDQPVTALNLFKQGLDKFPGEVTL 

LCG I AR I YEEMNNMSSAAE YYKEVLKQDNTHVEAI AC IG SNHFYSDQ 

PEIALRFYRRLLQMGIYNGQLFNNLGLCCFYAQQYDMTLTSFERALS 

lAENEEEAADVVmn^HVAVGIGDTNLAHQ 

NLAVIjGDAEG / RHVEQARALLQTAS S I S \ PHMYEPHF 

3528 

C 

179 

289 

MHRRI J VASPXSVYVRQSVHMLVCFP^4ASDKXLNKYIJ* 

3529 

A 

3 

206 

SWSGSRDVTGALLVAAAVASEAVGSLRVAEGGPNTLLLQVCRGSKGT 
SATCPFRPFPPLWSLLGTGR 

3530 

A 

1 

857 

mpraepratlgeqekaglplgawrlyllrhfrkqtelrrsgsrdvtg 
allvaaavas eavg s lrvaegg pntlllq vlr \ s w \ p wcnkelktme 
erkvkrrs pks fsahctqwnvqkkms i pvs k\ st\ g fsnpasqvnf 
tsrpkvkkevmk\ekt:kpqgleegqrhssqlpiqhsflt\dvsrcsr 
kmergllsl\lndfhswktfkhfgn\ecs i \ eqmehvsgelqeklar 
lnlelygeleelpedkektasds\nldrllsdlee3anssiqklhla\ 
daqdvpntfr 

3531 

A 

1 

174 

MDAVAVYHGKISRETGEKLLLATGLDGSYLLRDSESVPGVYCLCVLQ 
HLGYIKDISGK 
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3532 

A 

1 

190 

IWAVAVYHGKISRETGEKLLLATGLDGSYLUUDSESVPGVYCLCVLX 
FQKPDQGI VI PLQYPV 

3533 

A 

3 

210 

CPRVHQAMDAVAVYHGKISRETGEKLLIiATGIiDGSYLLRDSESVPGV 
YCLCVL*V*YVCCEYDTVDPCLWYIiVMAA 

3534 

A 

1 

386 

GISLAQFSSSACPRVHQAMDAVAVYHGKISRETGEKLLLATGLDGSY 
LLRDSESVPGVYCLCVLYHGYIYTYRVSQTETGSWSA/EAFQKPDQG 
IVIPLQYPVEKKSSARSTQGTTGIREDPDVCLKAP 

3535 

A 

224 

751 

LVPGCSSSGISLAQPSSSACPRVHQAMDAVA\VYHGKISRETGEKLIi 
LATGLDGS YLLRDSESVPGVYCLCVL YHG YI \ YTYRVSQTETGSWSA 
ETAPG \VHKRYFRKMKKSHFQHFQKQDQ\ GI V I PLQ\ YP VEKKS S7VR 
STQGTTGIREDPE\VCLNAPLIKISPLRPLFSITFNIC 

3536 

A 

3 

199 

SIRAEMSRSV\ALAVLALLSLSGLEAIQRTPKIQVYITSSSREWKVK 
FPELLCVWVSSTPTLKLTY 

3537 

A 

134 

405 

HLYPASRAGKGDLVSWQQGEPGLGWPQLSLQVRSPGPCSSCDLHGHG 
VLGKREVELVPLVEDAGDAHSEPRAPEDHRLQEAPHACQEVHS 

3538 

A 

415 

533 

CSQEKSGDLRGCDVPPGPRHDLEHVCCGLDGAEGRSAGP 

3539 

A 

1 

3349 

MDQPEAPCSSTGPRLAVARELLLAALEELSQEQLKRFRHKLRDVGPD 
GRS I PWGRLERADAVDLAEQLAQF YGPEPALEVARKTLKRADARDVA 
AQLQERRLQRLGLGSGTLLSVSEYKKKYREHVLQLHARVKERNARSV 
KITKRFTKLLIAPESAAPEEALGPAEEPEPGRARRSDTHTFNRLFRR 
DEEGRRPLTVVLQGPAGIGKTMAAKKILYDWAAGKLYQGQVDFAFFM 
PCGELLERPGTRSLADIilLDQCPDRGAPVPQMLAQPQRLLFILDGAD 
ELPALGGPEAAPCTDPFEAASGARVLGGLLSKALLPTALljLVTTRAA 
APGRLQGRLCSPQCAEVRGFSDKDKKKYFYKFFRDERRAERAYRFVK 
ENETLFALCFVPFVCWIVCTVLRQQLELGRDLSRTS KTTTSVYLLF I 
TSVLSSAPVADGPRLQGDLRNLCRLAREGVLGRRAQFAEKELEQLEL 
RGSKVQTLFLSKKELPGVLETEVTYQFIDQSFQEFLAAIiSYXLEDGG 
VPRTAAGGVGTLLRGDAQPHSHLVLTTRFLFGLLSAERMRDIERHFG 
CMVSERVKQEALRWVQGQGQGCPGVAPEVTEGAKGLEDTEEPEEEEE 
GEEPNYPLELLYCLYETQEDAFVROALCRFPELALQRVRFCRMDVAV 
LSYCVRCCPAGQALRLISCRLVAAQEKKKKSLGKRLQAR\LGGGSWIi 
GTQLAPEVPFRPPCCDICPTPPPDPRLLQGKAFARVPLNIAPIQPLP 
RGLASVERMNVTVLAGAG PGD PKTH AMTT1PT .r*WT . cj qt .tt . CHPKT .Pna 

VCRDLSEAIiRAAPALTELGLL^mRLSEAGLRMLSEGLAWPQCRVQTV 
R VQLPD PQRGLQYLVGMLRQS PAIiTTLDLSGCQLPAPMVTYIiCAVLQ 
HQGCGLQTLSLSLPSDPTPSSFSGRCREPGRRLGLESRWPRSAPEPS 
GRQRGEDPGGGGRGRGRREEAREGTPGPRAPPTAAPGRSSGSRLELC 
SLRALRAGNARPPDATHAAAASGDRGEPGPRPRVHVPPPGPAQRPPP 
PPRDRPRLPATARAIiGAGTADLPGGAAAGRLLLPPGPGVEQRDTGSH 
AGARRPGGAAAAQAQQLHGGRRRGPHHVCCPLSAQ 

3540 

A 

3 

261 

AIHLYPEILPLGNEKSTTGDGRPELDNLSDEYLSCLRKLQHCREELN 
QSQQLPPRRQCGRWLPVLMVVIAAALAVFLANKDNliMI 

3541 

A 

3 

431 

DAWADAWADAWALRIQQREQLQRARLQQFFRRRNLELEEKGKAQHPQ 
AREQGPSRRPGQVTVLKEPLSa\RRISSPRJEQVTGTSSEVFPAQHPP 
PSGICRDLSDHLSSQAGGLPPQDTPIKKPPKHHRGTQTKAEGPTIKN 
D 

3542 

A 

1 

859 

ALGKQKCEEWRSKY/EALK/ ED/ WRNLGAQHRE \ LE SQLHVLQS KLQ 
GADSRDLQMNQALRSLQNEHQQLQAKIECLQADRDLCSLYTQDLQDQ 
LKRSE AEKLTLVTRVQQLQGLLQNQS LQLQEQEKLLTK/ KRS ANL»LP 
QILRYCSFSYLPRGSGMDYMTSWSPSQFWEFC*VRDVWEIiFKD*SLA 
LRTLQLVSLLS *DQALPVWSPKS FPNEVEPEGTGKEKDWDLRDQLQK 
KTLQLQAKEKECRELHSELDNLSDEYliSCLRKLQHCREELNQSQQLP 
PRRQCGRWLP 

3543 

A 

392 

2016 | 

AKRNRCLEVIMISPDPRPSPGLARGAESYEAKCERRQEIRESRRCRP 
NVTTCRQVGKTLRIQQREQLQRARLQQFFRIiRNLELEEKGKAQHPQA 
REQGPSRRPGQVTGTSSEVFPAQHPPPSGICRDLSDHLSSQAGGLPP 
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QDTPIKKPPKHHRGTQTKAEGPTIKNDASQQTNYGVAVLDKEIIQLS 
DYL KEALQREL VLKQKMVILQDLLS TL IQASDSS WKGQLNEDKLKGK 
LRSLEWQLyTCTQKYSPWGMKKVLLEMEDQKNSYEQKAKESLQKVLB 
EKMNAEQQLQSTQRSLALAEQKCEEWRSQYEALKEDWRTLGTQHREL 
ESQLHVIiQSKLQGADSRDLQMNQALRFLENEHQQLQAKIECLQGDRD 
LCSLDTQDLQDQLKRSEAEKLTLVTRVQQLQGLLQNQSLQLQEQBKL 
LTKKDQALPVWSPKSFPNEVEPEGTGKEKDWDLRDQLQKKTLQLQAK 
EKECRELHSELDNLSDEYLSCLRKLQHCREELNQSQQLPPRR\QCG\ 
KGLPVLMWIAPALAVFLANKNNLMI 

3544 

A 

35 

219 

DTEVKPQSALPQASALTIWVSYQQGVLSATILYEILLGKATLYAVLV 
SALVLMAMVKRKDF 

3545 

A 

293 

479 

GRPEPPVHRCLHALFLSTES YQAGVL S ATI LYE I LLGKATL YAALVS 
ALVLMAMVKRKDSRG 

3546 

A 

1 

968 

VKLPSCPDPAMGTSLLCWMALCLLGADHADTGVSQNPRHNITKRGQN 
VTFRCDPISEHNRLYWYRQTLGQGPEFLTYFQNEAQLEKSRLLSDRF 
SAERPKGSFSTLEIQRTEQGDSAMYLCASSLAGLNQPQHFGDGTRLS 
ILEDLNKVFPPEVAVFEPSEAE ISHTQKATLVCLATG\ FFPDHVELS 
WVTVNGKEVHSGVSTDPQPLKEQPALNDSRYCLSSRLRVSATFW\QNP 
RN\HFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGRADCGFTSV 
\ S YQQGVL S \ AT I LYE I LLGKATL YAVLVSALVLMAMVKRKDF 

3547 

A 

278 

1283 

GGIGEIKQRPSCLGRCLDPSLSVLMNISLELGSVFSAVISQKPSRDI 
CHRGTSLTIQGQVDSQVTMMFWYRQQPGQSLTLIATANQGSEATYES 
GFVIDKFPISRPNLTFSTLTVSNMSPEDSSIYLCSVEEQGFVGGAET 
QYFG\ PGTRLLVLEDLKNVFPPEVA/ VLFEPS EAE I SHTQKATLVCL 
AICFYLDHWLSWWVNGKEAHSGVSTDPQPLKEQPALNDSRYCLSSR 
LRVSA\TFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEA 
WGRADCGFTSGK\SYQQGVLSATILYEILLGKATLYAVLVSALVLMA 
MVKRKDSRG 

3548 

A 

151 

286 

FLKDLSSTPMASNiaTASIAQARKLVEQLKMEANIDRIKVSK/LAAD 

3549 

A 

28 

177 

GVFLKDLSSTPMASiraTASIAQARKXVEQLKMEANIDRIKVSKAAAD 
LMA 

3550 

A 

422 

671 

RIYPALRMPSINTAS IAQARKLVEQLFMF\ANIDRIKVSKAAADLMA 
YCEAHAKE\DP\LLTPVP\ASENPFKE\KKFFCAHPLSL 

3551 

B 

62 

299 

M F SLKKWNAVAMW S WDVE CDTCA I CRVQ VMDEG I G VRNWS EALNL I Y 
ASEMGFDLDMGSQTSLSPLAHWLLNNLGWMNLX * 

3552 

A 

1 

602 

ADVEDGEETCAliASHSGSSGSKSGGDKMFSLKKWNAVAMWSWDVECD 
TCAICRVQVmEGIGVRNWSEALNLINASEMGFDCRSGSTALAVPSV 
SLASHQPCLDDHR*QPIGSIQSSLF*RNSQVS*GRK*KVASYELLKE 
GFCVNACLRCQAENKQEDCVVVWGEC^SFHNCCMSLWVKQNNRCPL 
CQQDWWQR I GK 

3553 

A 

2 

428 

SGGSTEPLGLSAFPKPTSSAVGSVGAAMADVEDGEETCALASHSGSS 
GSKSGGDKMFSLKKVmAVAMWSWDVECDTCAICRVQVMDACFRCQAE 
NKQEDFFWLGECWH\ S FHNCCHVPCGSKQNNRCPLCQ\ QDWW\QR 
IGK 

3554 

A 

114 

1139 

EETSGHCVWGPAAKRFSAGCFPRFPRPVPRALLIAPGPSVSARLTV 

GCDDMSNKEGSGGFRKRKHDNFPHNQRREGKDVNSSSPVMLAFKSFQ 

QELDARHDKYERLVKLSRDITVESKRTIFLL\HRITSAP\DMEDILT 

ESEIKLDGVRQKIFQVAQELSGEDMHQFHRAITTGLQEYVEAVSFQH 

FI KTRSLI SMDEINKQLI FTTEDNGKENKTPS SDAQDKQFGTWRLRV 

TPVDYLLGVADLTGELMRMCINSVGNGDIDTPFEVSQFLRQVYDGFS 

FlGOTGPYEVSKKLYT£KQSLAK\reNACYALKVRGSEI^ 

SVKTEMIDQEEGIS 

3555 

A 

3 

378 

QAQPSLEMQNDAGEFVDLYVPRKCSASNRI IGAKDHAS I QMNVABVD 
KVTTGRFNGQFKTYAICGAIRRMVSVSLGFAHHFGTSWTLPCTUjECVM 
VPEQICWQSSLSQLDWAGRLPLLEGGRGVL 

3556 

A 

2 

299 

LLNSRPRRRQQAQPSLEMQNDAGEFVDLYVPRKCSASNRIIGAKDHA 
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SIQMNVAEVDKVTGRFNGQFKTYAICGAI\RRMGESHDSFLRMAKAD 
GIVSK 

3557 

A 

1 

392 

RRYLCRVTS CFLSLSRAVWW PARRS PAFEMQNGRRRSSSDLYRARGK 
LSASNRI IG\AKDHAS IQMN \VAEVDKVTGRFNGQFKTYAICGAI \R 
RMGE SDD S I L \ RLAKADG I VFKEL FD WRE I TD VG I FCHK 

3558 

A 

1 

417 

FRDIVNENGEGEIQDEEEDGYDDD\EMTGTGMKES*NSPWVMAG/HG 
GSNPQANRQTSDSSSAKMSTPADKVLRKFENKINLDKIjgVTDSVIYK 
ATEKF/DTKGIKYVLDPTTRMILFKMLTRGIITDIDGCISTGREANV 

3559 

A 

139 

1914 

AVPFHRRAAASGGALQSWTTGGLLMSRWPGQFDDADSSDSENRDLK 
TVKEKDD I LFEDLQDNVNENGEGE I EDEE / WKEGYDDD \ EMTGTGMK 
ELENSPRVMSGIGRKQPTGKSTD\PDSSSAKMSTPADKVIiRKFENKI 
NLDKLNVTDSVINKVTEKSRQKEADMYRIKDKADRATVEQVLDPRTR 
MILFKMVTRGIITEINGCISTGKEANGYHASTANGESRAIKIYKTSI 
LVFKDRDKYVSGEFRFRHGYCRGNPRKMVKTWAEKEMRNIilRLNHSR 

QVIQYMRRMYQDARLVHADLSEFN\ IMYHGGGV\ YI ID\ VSQSVEHD 

HPHALEFLRKDCA1WNDFFMRHSVAVMTVRELFEFVTDPIPLHHEN\ 

LDAYLSKAMEI ASQRTKERTV/ SSQDHVDEEVFKRS \ YI PRNI*\DEV 
KNYERDMDI IMKLKEEDMAMNAnOF)'NrTT.Yn r rn/ T rr , T itktit cr«\/r\xnrt3n 

LLENQVEERTCSDSEDIGSSECSDTDSEEQGDHARPKKHTTDPDIDK 
KERKKX EVKEAOREKRKNKI PKHVKKR KPTCT ATrrKTrnir 

3560 

A 

129 

567 

S YKSRLARKS \ TGGKAP \ RK\ QLAYKKPAS QECGPLLGGLKKPHRYR 
\PGTVALREIRR\ YOKSTE\LMTRK\T.PPnR\ T.vpPTnnnT?\ vrr>T © 

FQERSNQVALQ\EA\RRAYLVWPFLKDTNLVCLSMPKRVPIMPKRHS 
SS ARRHHVENVL 

3561 

A 

3 

256 

MDHEFVAKGHLFRKGIMKI^1VYKIFRILVPGNTDSTEALSLSYI>VEL 
SWAPAGQDMVSDDMKNFAEQLKPLVHLEKIDPKRLM | 

3562 

A 

3 

205 

KI^f\^fKIFRILVPGNTDSTEALSLSYLVELSVVAPAGQDMVSDDMKN 
FAEOLKPLVHIjE K T npiTR T .M 

3563 

A 

66 

852 

PECWYLIVAQKLLSARVDIIPGFSRRHYSLKEVYPVPYLTGGSESSC 
VVFNLDTMEAPPVTMMPVTT!fiTTKrMMT?VT t ot'c^tt nuoT t?ot Ttro\ t 

RGLCDNMEPETFLDHEMVFLLKGQQASPFVLRARRSMDRAGAPWHLR 
YLGQPEMGDKNRHALVRNCVDIATSEI^TO 

HLFRKGIMKIWVTYKIFRILVPGNTDSTEAIjSljSYIjVELSVVAPaGnTj 
MVSDDMKNFAEQLKPLVHLEKIDPKRLM 

3564 

A 

194 

338 

GKVPGDDCPLVWGQCSHCFHMHCILKWLHAQQVQQHCPMCRQEWKFK 
E 

3565 

A 

197 

522 

GSA7^KVKIKCWNGVAH\WLWVANDENCGICRMAFNGCCPGCECSPS 

MLSERPRLQGAPATDCPAGCGGPVLPTCFQHAIiHPQVGLHAQQVQQA 
LAPMLPPRIGKFKE 

3566 

A 

1 

6521 

MLRKRK>/LGVVAHACNPSTLGGQGHPDKQPQVRRIiLSAPGSRIAGQW 
VLD S LKS PHLLL KGENDKN S L S QKAF L YVAHVLDE S ERG VLVTMTTE 
TGPDSEVKKAQEEAPQQPEAAAAVTTPVTPAGHGHPEANSNEKHPSQ 
QDTRPAEQSLDMEEKDYSEADGLSERTTPSKAQKSPQKIAKKYKSAI 
CRVTLLDASEYECEVEKHGRGQVLFDLVCEHLNLLEKDYFGLTFCDA 
DSQKNWLDPSKEIKKQIRSEWLVVFGEVGSPWNFAFTVKFYPPDPAQ 
LTED ITR YYLCLQLRAD 1 1 TGRLPCS FVTHALLGS YAVQAELGDYDA 
EEHVGNYVSELRFAPNQTRELEERXMELHKTYRGMTPGEAEIHFLEN 
AKKLSMYGVDLHHAKDSEGIDIMLGVCANGLLIYRDRLRINRFAWPK 
ILKISYKRSNFYIKIRPGEYEQFESTIGFKLPNHRSAKRIiWKVCIEH 
HTFFRLVSPEPPPKGFLVMGSKFRYSGRTQAQTRQASAIiIDRPAPFF 
ERSSSKRYTMSRSLDG^FSRPASVSEmflDAGPDGDKRDEDGESGGQ 
RS EAEEGEVRTPTKI KELKFLDKPEDVLLKHQAS INELKRTLKEPNS 
KL IHRDRD WERERRLPSS PAS PS PKGTPEKANESQRTQD I SQRDLVP 
EPGAAAGLEVFTQKSLAASPEGSEHVTinPIEREYTRPEELGLLKVTTM 
QQEERQAGLAG I LANGRLS KVDVLVDKFKVE VATEEMVGNRRANTQQ 
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QGKMIASPEDFESVGEEGPWIRESPGGAAIiASGRTLAEKLLEGSELR 

ADTREATIRNRCMSDGQPEGQTELRKGLEEPHTCGRPTAPGTRPAEV 

DVLS PASDKGGLQS FLLDPAHAEARAELSNETDTS FAERS FYLNYEE 

KDSEDQVLPPPLEERKGRLDAPPGGEPRPTLNSIiDIiRVSAAASSRSK 

DEAHMTS PKEGAGTPKNHGGPGDLKGS PAGQTFAEGWEDAQWGVEGE 

FPHLTASAAREEGTPVSGDLLGKAEESPTEELKKHPPHRGQGVHPDP 

QACALPRAIPLNVRKPVKPDRGNFPPKERGWPTQKGGAELKDREAS 

AFLHME V 1 1 PL PAS PGHS EDL AALEEAS P S PTSHGS GE PS ELREPFL 

RHVHLSKASPEPKDQVGFWSPATGGERRPPPITSRKPRWPEEAEG 

RIPLGFGFPSGKRREMTSFQAGDQEGSLEDISKTSVANKIRIFETHG 

AETRRMSEGEARSLPlTOVSSEAPVGQAEQQRSTLSDLGFAQIiQPPGD 

FAS PKATHSTVI PLATRHFREDTSAS YQEAHTELEPVSPNSGCETTI* 

AEATGTGVTGRNKSGDAVREEKRSTNLAANTPGKGGRLRFASPSGPQ 

RAGLREGSEEKVKPPRPRAPESDTGDEDQDQERDTVFLKDNHLAIER 

KCSSITVSSTSSIiEAEVDFTVIGDYHGSAFEDFSRSLPELDRDKSDS 

DTEGLLFSRDLNKGAPSQDDESGGIEDSPDRGACSTPDMPQFEPVKT 

ETMTVSSLAIRKKIEPEAVLQTRVSAMDNTQVDGSASVGREFIATTP 

SITTETISTTMENSLKSGKGAAAMIPGPQTVATEIRSLSPIIGKDVL 

TSTYGAT71ETLSTSTTTHVTKTVKGGFSETRIEKRI I ITGDEDVDQD 

QALuAIiAIKEAKLQHPDMLVTKAVVYRETDPSPEERDKXPQKLKRETK 

NWKRQPCIRRNLQARAILPATQGSAGPLPPSLQSDAKAGELPVRFRC 

TGTWGEALFIKEKGFFWSPTTGPMAAVQITOPELAIOILFFEGATVVIL 

NMPKGTEFGIDYNSWEVGPKFRGVKMIPPGIHFLHYSSVDiqVNPKEV 

GPRMGFFL\SLHQRGLTVLRWSTLREE/VLDLSPAPESEVEAMRANIi 

QkJLiDQr JuCjP x P YArL»KKWT \ SLiTNFI SEATvKKLQPENRQICAFSDV 

LPVLSMKHTKDRVGQNLPRCGIECKSYQEGLARLPEMKPRAGTEIRF 

SELPTQMFPEGATPAEITKHSMDLSYALETVLNKQFPSSPQDVIjGEIi 

QFAFVCFLLGNVYEAFEHWKRIjLI^LCRSEAAMMKHH 

YHQLGE I PAD FF VD I VS QDNFLT S TLQKGFL ID I S ED F I E VLFLRGT 

ERFSKANLLLDESLIAVDATLRKKAEKFQAHLTKKFRWDFAAEPEDC 

APVWELPEGIEMG j 

3567 

A 

1 

1232 

GCTGTWGEALFIKEKGFFWSPATGPMAAVQMDPELAKRLFFEGATW 
ILNMPKGTEFGIDYNSWEVGPKLRGVKMIPPGIHFLHYSSVDKANPK 
Jfc#VVjFKM«jr x* JbbijHyKLilj 1 V1jRWSTLREEVDL»SPAPESE VEAMRAN1»Q 

eldqflgp yp yatlkkw i sltnf i seatvkklqpenrqi cafsdvlp 

lptqmfpegatpae i tkhsmdls yaletvhnkqf ps s pqdvlgelqf 
afvcfllgnvyeafehwkrllnllcrseaamkk/hatrlwinlisi 
hhqlge i padffvd i vsqhnflts tlqvffs s acs i avd atlrkkae 
kfqahltkkfrwdfaeepedcap weelpeg i emg 

3568 

A 

2 

166 ! 

FKLYKLFIKFLPVLENGGVVQVGELLPCKICGRTFFPVAIiVSRIilLY 
LYGFTDL 

3569 

A 

56 

1072 

argggamegleenggwqvgellpckicgrtf\fpvalkkhgpicqk 

iatiraakgldqalkeg\gkiAppppp\ps\ydpgfiiqw/cpy\cq 
rkipmenaadrhinfckeqaarisnkrrfstdtkgkptsrtqvykpp 
alkksnspgtassgssrvpqpsgag\ktwgvp/sqgkvsssssslg 

/ r« X rvJ_»V -L»»-3 ±r orirvvjXrt^l^rl/^lj/\iM V iVirrCW 1 Jrlror A \KiNlr \iiroVblW 

K\RKTYTESYIARPDGDCASSLNGGNIKGIEGHSPGNFPKF\CHECG 
TKYP VEWAKFCCECG I RRM I L 

3570 

A 

2 

413 

PR VRG ILPSCS WVFSLLLLLLLRLGEPS WGRMVC\ EKCEKKLGTVI 
TPDTW\KDGARNT\TEN\GGRKLNENKALTSKK\ARFDPYGKNKFST 
CRICK\SSV\HQPGSH\YCQ\GCAYKKGICAMCGKK\VLDTKNYKQT 
SV 

3571 

A 

32 

469 

PTCLHLLPYATVPNFPTFSCLFLSPPNCSNLCLLPSSKVTSTFSGIF 
S AMPHTTLLQFP I SFS FPSETSLAWPSFS I SLSAFWSQLFNNCL.GSA 
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KLSCLLLSPPNCSNLCLLSSSKASSTFSSITTRLESLLRGLSQDVSG 
KYKT 

3572 

A 

659 

1149 

EVTACRQPSVAFSTSLAIiALTLALLEEPFSLLLHCESPLLGWASSCO 
GCAGSPSSAGPAGIVLK/SIiAGPQLPPCRVGLGTCSLPYLSLPPAVG 
SFMARASLMSAAPCSMARVPSTTQGLLSRTPQCGQMAQDWQAAPPAS 
PVQDPLEEASWAPESSGDLENFYV 

3573 

C 

361 

501 

MLFLRYAPLVGGSFCIWSHPLQFFLTSIKVNFVETLFCFQSFKDIF* 

3574 

A 

124 

579 

EPLPGLAKAGAGSLSCS VRPCAAGAS PTSATPC q TAPQPTnMDPnTry 

CGRMVQDWQAAPPAPRCGIHWKPAGLLSLLMS1WIRINMIVKLLVF 
V I LLVT * FTVKVCLY I FQGLCCGLLFCKI LHGS PSHHRPKDLWHHCD 
LDVRHEVKGDH 

3575 

A 

2765 

3139 

YSLKSGSVMPPALFFWLRIAIAMQALFWFHMNFKVVFSNSLKKVIGS 
FMGMALNL*ITLGSMAIFTILILPTHEHGMFFHLFVSSFISLSSGL* 
FSLKRSFTSLVSWIPRYFILFEAIVNG3SL 

3576 

A 

265 

665 

GRASG WR S PR WVCDLAG FRGE AADLHG VC? Y <5 R T .R p nn PTOW T nwn 
GSRWSCLPVPRRAPALL\ALGWSMG\WAPPQPLAWVLSPSLPGAGSA 
DWLLQVRGPPSPRPPGTPAGPPAPCAALVPARVSPSTLPS 

3577 

A 

82 

361 

LRSPAGFTQW IRTGAAGGAACQFRAVRPALLS PWWDGTGRRGAGGG 
AYRGG S GRTGAHGTGGGS GMAGCRSRAVPRRKAAKTRPCEVQKRQHN 
V1T^LFPVVETKAQTVK>JFVLA*ASPCSVGSCAARASPISAAPCSTAP 
S P IDHPRAEECWAHGTELAG S STCS PGADPLGEAS WAPE 

3578 

A 

806 

934 

KILSSKFPFSFLPSYVCTSSFP*THPSIISTSSSSSHSVPTN j 

3579 

A 

1034 

1393 

AVTLTGKVCS FTPEASETTNPPGGTNNSRRAS LRAATTiTVKVRSFTP 
RGSAASFFKSVRPRTHQFRTQYSPRMSKIRDCSRSCSFISFLASWRI 
ICLL*VLISLCAW*RYRLFWPLPSS 

3580 

A 

529 

750 

REFRVGVASADPAFGAAGRPCPQGQ * GA* HTGQWLRRVYWVPQQCQP 
TGVALDFS PGLSCLPAGQGSGPAARHA 

3581 

A 

2 

343 

HAVFFISIRSFMFFFKLVILVSNSSKLFSTFLVSLHWVRTCSFSSEE 
FVITHFLKPTS VNS SNS FS VQFCSLAGEEL * S FGGEEAFWFLEFS AF 
LHWFFL I FMDLSTFGLCCC 

3582 

A 

289 

422 

WIPHRGCRWSCLPVPVPCARTPQPLGGGWDWARTHQTHCKLDLS 

3583 

A 

189 

309 

WLGQMEIQNHVPKIQMELCGQHSAAAVSLLKAGGRALQE 

3584 

A 

587 

802 

LKTFHTAVFVRFGPWL PDTG FCSSLSFQFSFLIS LDTLP YRKGVI Ch 
LEALDDLLFFLVETWYHQEAVSAL 

3585 

A 

76 

1740 

SLLTAASSFQPQLKQSSHLSHPSGWDYRLRMTQNKLKLCSKANVYTE 
VPDGGWGWAVAVSFFFVEVFTYGIIKTFGVFFNDLMDSFNESNSRIS 
WIISICVTFVLTFSAPI^TVLSNR\LGHRLVV\MLGGLLVSTGMVAAS 
FSQEVSHMYVAIGI ISGLGYCFSFLPTVTILSQYFGTRRS IVTAVA\ 
ST\GECFAVFAFAPAIMAIiKERIGWRYSLLFVG\LLQIiNIVIFGALL 

RPIFIRGPASPKIVIOENRKKAOYMT.PKrRTfTPTQTnCTriQniTTJT TTC 

PKNVPTHTNLELEPKAD\MQQVLVKTSPRPSEKKAPLLDFSILKEKS 
FICYALFGLFATLGFFAPSLYIIPLGISLGIDQDRAAFIiLSTMAIAE 
VFGR IG AG FVliNRE P I RKI Y I EL I CVI LLTVS LFAFTFATE FWGLMS 
CS I FFGFMVGT IGGLTFHCLLKMMS WALQKMS SAAGVY IFIQS I AGL 
AGPPLAGLLVDQSKIYSRAFYSCAAGMAIiAAVCLALVRPCKMGLCQR 
HHSGETKWSHRGKTLQDIPEDFLEMDLAKNEHRVHVQMEPV 

3586 

A 

1 

2472 

GFVFLLLLAPLSRLEFHKSRGLKNVFEGLEAEECFYKVHLDGGEEEL 
EWVRPDAETTTHQGRVSQEKHQALGCKWPKLCGDALCVILSAPSPID 
HPKAEECRRRARDWQAAPPAAPVRDPLGEASWAPESGQQMKEAAEIiG 
VS CMGPDLE KLTL YEVXLRLQGCKT^AQRPLGCTLLAI QGTLYQR I FS 
PLTQPELVNGKGWHLTOESLSQNGSLEFLTSEPHSPNPNEGSSRRQS 
LHTNANNMAFASEQFPl^PSGSGSRGFPGRARVLHLDCQNCQRXTAG 
PAQLQGSRAANRRKALVSPSSSVAREDGFAEEMVFTYGI I KTFGVFF 
NDLlTOSFNESNSRISWIISICVFVLTFSAPLATVIiSNRFGHRLVVML 
GGLLVSTGIWAAS FSQEVSHMYVAIGI I SAIMALKERIGWRYSLLFV 
GLLQLNIVIFGAIiLRPIFIRGPASPKIVIQENRKEAQYMLENEKTRT 


|02}V 
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S IDS IDSGVELTTSPK3WPTHTNLELEPKADMQQVLVKTSPRPSEKK 
jjijur £> xxj*\J5J\fc>ir j.u x AJjFGDFATLGFFAPSIVYI I PLGI SLGIDQD 

RAAFLLSTMAIAEVFGRIGAGFVLNREPIRKIYIELIC7ILLTVSLF 
AFTFATEFWGLMS CS I FFGFMVGTIGG\LTFHCLLRMMSWAL/QKMS 
SAAGVYIFIQSIAGLAGPPLAGLLVDQSKIYSRAFYSCAAGMALAAV 
CLALVRPCKMGLCQHHHSGETKWSHRGKTLQDIPEDFLEMDLAKNE 
HRVHVQMEPWRRRFAAAGPAGLGAAGDSDAFPAREGPERRAGYSGP 
AAACFDFSTAAPKREQRGPLSLVGLEM 

3587 

A 

323 

1303 

u luiurrti iw x V^il Ir Li JLfJj/i±iol oAWbal LtAAT 'PEPFSxjPLHCGSPF 

LGWSRPELAPSACREVVn^EKPEWELGLRTVLAGQCEFRVGVGSGDST 
RSSWPPLLAPGNEGLSTRASSCGGCTGSPSRAGPPVLCSISCQALAA 
FPRGRARGLQPAMPEPPPRLRGLLCSPEPPRLVPPPAPGHPVPLTTQ 
GLRSTGTKHRDWQAAPPAAPVRDPLGEASWAPESANLVGKWRTFVSS 

sgtanapistlskqttglyqsagcgwgqnlgakyra/lcrpvksd*h 
grqtkdtgvrlsgkglsnnp*lfrvgtvgshrphfptymdglasgv 

3588 

A 

1 

3552 

MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRN 

LPTKARTKHN I KR I DRLLGNRHLHKERLAVYRWHAS FI CSGNTMP I V 

LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQ 

FI^LASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQ 

YADLG AENWKP I SNLHDMS S S HS KTLG YKRLTKSNP I S CQ I LL YKS R 

SKGRKNQRSTRTHCHHPSPKIYSASAKEPWVIiATNLPVEIRTPKQLV 

NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 

LTCWI^GVHAQKQGWDKHFQANTVRNRNKEVE VNDLSGL I SDWMSP 

IPLTSLLCGNPDEIRSFPSLCPECTEGPIDLVFVIDGSKSLGEENFE 

WKQFVTG IIDSLTIS PKAARVGLLQYSTQVHTE FTLRNFNS AKDMK 

KAVAHMKi'MGKGSMTGIJU^KHMFERSFTQGEGARPLSTRVPRAAIVF 

TDGRAQDDVSEWASKAKANGI TMYAVGVGKAIEEELQE IASEPTNKH 

LFYAEDFSTMDEISEKLKKGICEGINYCALNKPGCEHECVNMEESYY 

CRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEG 

FLINEDLKTCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVAAAM 

GRRVQSVLGEEDAPEKRFALLSDPGDLTAHELDSCALGDHGCEHSCV 

SSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSY 

TCECLEGFR1AEDGKRCRKSSCENKRADLVFIIDSSRSVNTHDYAKV 

KEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERA 

v iviu jiuijjo ivj a wxviJUAxy * AxjNIAFSEAEGARPLRENVPRVTMIVTD 

GRPQDS VAEVAAKARDTGILI FA I GVGQVDFNTLKS IGSEPHEDHVF 

LVANFSOT ETTjTR VFOK'VT.r'T'TT 7\ Cr , T r K70KTT>^T ncnDitnm t ▼ »t/-iti 

1J¥ni,r ^vj-oiiJi o v r yi\_r\jjv_ ± lijii&b 1 WbWKyLPSAFAPPLLIiAICP 
RFRSRSSGSARRALHCGKSFLSRS PAILAAAVGRKMVAAKKTKKSLE 1 
S I NS RLQL VMKS GKYVLG YKQTL KM I RQG KAKL V I LANNC P ALRKS E 

IEYYAMLAXTGVHHYSGNNIELGTACGKYYRVCTLAIIDPGDSDIIR 
SMPEQTGEK 

3589 

A 

375 

646 

KKSLLSINSRLOIjVVMKSGKYVT,n\ YirnTT.TnviTT3nr , Tf7\TrT utt t\ yx\i/~* 
PALSGNNIELGTACGKYYRVCTLAI IDPGDSDI IR\ SMPEQTGEK | 

3590 

A 

75 

335 

LRILKFLSCSLSLLGWGPSPSLRQEDGGRKRRRKKSLESINLRLQLV 
M\KSGKIiLPGGTSQTLKMl\RQGKAK\LVILANN/CGPALRKSE\lB 
YYCYVWLKTGC/HHPLTVGNNI \ ELGATA\ CGKITYRVLHTGLFIDP 
G * LI/TS FRKACPEQTC * KMVAAKEDE KS RWS R S TLGSNS LWKS GKL1» 
PGGTSQTLKMIQTRQSEMWS FSLTTGPS FEE I 

3591 

A 

1 

2580 

MGSHHLVVVLIESPQTESQIVSDSHASKRIKYLGIHLTRDVKDLFKE 
NYKPLl^IKEDTl^KNIPCSWVGRIN^ 

KLPMTFFTELEKTTLKFI WNQKTAHI AKS ILRQKNKAGGIMLPDFKL " 
YYKATVTKTAWYWYQNRD IDQWNRTEPS E IMPH I YNYL I FDKPEKNK 
QWGKDSLxTOKWCWElWLAICRKXiKL^ 
KTIKTLEENLGITIQHIG1X3KDFMSKTPKAMAT 

FCTAKETTIRVNRQPTKWEKI FATYS SDKGL I S R I YNELQQI YKKKT 
NNP I KKWVKDMNRHFS KED I YAAKKHMKKC S P S LA I REMQ I KTTMR Y 
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HLTPVRMAIIKJCSGNNSKLPFEPSRNKRVMPEPCAHTILSSLARPAS 

KMHSPPLSMEPSAGPGGSQNFPNEEAEDRPRTLNFEGSETWALTGPD 

ARYFNYTREARGENSLVSGGCRPLAAGRARASTSGWAAKRVGTLQGG 

PLTWAMPAP IS PPPMTVTCLTI IFLAEAEAVDEEDTERMNCLVTKAM 
VWQRRRORGDPGREDARGGOPRKAPTiWRPnPTP dpcdd n -Dn\7T>\wnr\ 

GGGGASS PALP I GPGCNVPALSGYTRAGSQLS7VAGARVSAGI»RPAEQ 
RDLAAGVLWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYKRG 
I L S CHRL YHEHTDE CDVEVQAGGQQRAFS RELAP PVAAWLPATCTL 
KMGSQSVSPYQKRLCNFLVMGELFPLPSPLQFLDQVIKTPMTSQKTF 
ES LVDFS KALGKHP VS CKVAAATGHTWLVDQTEDI LAKS KKGIEES 
LR KVAKKKF AENP K 

3592 

A 

1 

1386 

APVTENNGSPFGSQNELFPEPCFLLDKTEILTPNLGMALGLNDCQGA 
VTCEIRGSLQIFGLRNLDCDDDVPHIRSSNKVPWQPSHAVHPLTPL 
ITYSDEHFSPGSHPSHIPSDVNSKQGMSRHPPAPDIPTFYPLSPGGV 
GO I T P P LG W F FNE OL I F FKNEFRR I F YT , P T .T . V Q no\n? ncz n pw d t th 

GFRQPYPSSLSVDTSMSRCQRESRVGALRCAPAGPATPSGDGLSLLG 
KRCLQTNS W P PD CGMGTVRKAD I TRRVCL I QNMPG VG FS HUM I PGP P 
GPHTTGIPHPAIVTPQVKQEHPHTDSDLMHVKPQHEQRKEQEPKRPH 
IKKPI^AFMLYMKEMRANVVAECTLKESAAINQILGRRPJ^QYLSLCL 
PPRHI IHMS 1 1 VIEMWHVSGTPVFSAPRLHTVTQLLSGQFCSAGGRL 
I KALGKVRHS WSGDTQPAARAALTE PCTAEAVAQMERLF 

3593 

A 

2 

1419 

RPPGIRAPRQLHPAAGRRPDASARPRFRPTVLLHDPFQLSFPPPPLS 

YPSVFPAVARVLPQRSGDYRAAGMPQLSGGGGGGGGDPELCATDEMI 

PFKDEGDPQ\REKIFAEIVNPEEEGDLADIKSSLVNESEIIPASNGH 

EVARQAQTSQEPYHDKAREHPDDGKHPDGGLYNKGPSYSSYSGYIMM 
PNMNNDPYMSNGSIj55PPTPPTC3MlTl7P\nrnocuj\TiTjr»T tot ttvphpu 

FS PGSHP SH I PSDVNS KQGMSRHPPAPD I PTFYPLS PGGVGQ ITPPL 
GWQGQPVYPITGGFRQPYPSSLSGDTSMSRFSHP\MIPGPPGP\HTI ! 
GIP\HPAIVT\PQV\K0DHPHH\ SSDIjMTTVTCPOPP\ OPfTPnRPTTPPP 

TKX\PL•NAFMLYMKE^1PJ\1WVAECTLKESAAINQIL 
QAKYYELARKERQLHMQLYPGWSARDNYGKKXKRx^ 
PRMTAAYI 

3594 

A 

59 

435 

STCPAPQSARVQWCNLSSLQPPPPTPTPRSSNSPASASQIAG\ITGV 
RHHTQLIFV\FLVGDGGFHCVGSGWVSNCWPSGWRPLWPHLFLGFHK 
DLEWG FLVFAS I S FRS WRWDLKNS S VLTHFFK 

3595 

A 

54 

442 

NHFQHTLLWP KG F SLRTSNIiTTTNLGFSH I S GHWS WGAHGACS PCPH 
I S S Q PG PCP AS I YRPDS PHLPPCC PELN TDIJrnnVTW VTWPPYR c*t 
I PRPTVAHGAKTSKQPPSWKGNAKALGVMEKTFYK 

3596 

A 

360 

1023 

GQGRLPERG YLAEAGVYR I CGRQSRHEDGTHRGAQ VI EQRRKWLPAA 
LGFQPIILQLCGKRVP\APSRASTGOPOASPJ\WSGPTa/PHPnrRc;nT 
RLRVRAEYCEHGPALEQGVASRRPQAIiARQIiDVFGQATAVLRSRDLG 
SWCDIKFSELSYLDAFWGDYLSGALLQALRGVFLTEALREAVGREA 
VRLLVSVDEADYEAGRRRLLLMEEEGGRRPTEAS 

3597 

A 

3 

370 

QMYCVFNRNEDACRY/ GS/AIGVLAFLASAFFLVVDAYFPQISNATD 
RKYLVIGDLLFSG\Vx^SI^YQRYKAGVDDFIQNYVDPTPDPNTAYA 
SYPGASVDNYQQPPFTQNAETTEGYQPPPVY 

3598 

A 

1 

1356 

MGRGRGDLQPSGLGRPRTPAAPPJVGREPGAARVRIDRGADAAAAARP 
RIjLCEPVPGPGCREI^PAPLARAYWKFJIP 

GRARCSGAPGCRALLGHTGRVPPPGTRPVLRAPRASQRRSPQAVVKG 

MKLGFAQGGGHQASSGSSGSSNEIiGGHQPYQVLPSRQVFAljIVFSCI 

YGEGYSNAHESKQMYCTFmi^DACRYGSAIGVLAFI^ 

YFPQISNATDRKYLVIGDLLFSALWTFLWFEGFCFLNNQWAVNKPKD 

VLVGADSGSEPPKRGSKVNPGARRLSSGCPSDRPELIRPRCRIRAEA 

RNSTSRKHIQRNIHERRRTHDTPEQQKKNTATTGTAQRFNLPQFNCP 

LIGPPLT^RGSPLHEIGIRTSKGFFSSLLNGNHFEYLSTLTGELPWS 

RNCIGYGRNPLYHYMPTGRLQRRIEQRST 
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3599 

A 

3 

706 

GGGS GGDGDMESGAYGAAKAGG S FDLRR FLTQPQVVARAVVL \ VFAL 

IVFSCI YGEGYSNAHFSKO^^^^T5^^JT?^T^7T^an^^vr , Qa t\ r>\rr act ivojv 
a w *. uv,4, AwtiUA IV, vr MICH fiJL/Al_K.XV9 ± \uv ' »Ar IxAjS A 

F\FLVVDAYFPQISNATDRKYLVIGDLLFSALWTFLWFVGFCFLTNQ 
WAVTK\ PKTVLVGADSVRAAITFS FFS I FS WGVLASLG\ YQRYKAGV 
DDF I QNYVD PTPD PNTA YAS Y PG AS VDNYQQP PF \ TQNAETTEG YQP 
PPV 

3600 

A 

1 

166 

VI PQ IQKVCE WDGF I YVANAEAH K\ VQDTEAETLTGFLNG IEWILE 
EVESKRAR 

3601 

A 

1 

1193 

DSWEFQAMAGSEPRSTRTSPPPP\FSD\WGRLEAAILSGWKTFWQA 
LS TERVARTTS LEE VDEAANTLTRL P I DVQL Y I LS FLS PHDLCQLG \ 

x x niMD x vivlvo \ J. J-irV \KIc xjoKJJijiro WAb V xJWKSLPDLEISKKF 

ISDVTDGALFDYIAHYRMCCPYTRRASESSRPMYGADTSFLHSLIIQ 
NE PR YALFG PGLEELNTS LVLS L.M <3 <5 F FT .PPTfifiT .onp n t nr t n c mr 

NFQLNNQHKFNILILYSTTRKERDRAREEHTSAVNKMFSRHNEGDDQ 
QGSRYS VI PQIQKVCEWDGFI YVANAEAH KRHEWQDEFSH IMAMTD 
PAFGSSGRPLLVLSCISQGDVKRMPCFYIJ^ELHLNIiIJraPWI,VQ\D 
TEAETLTGFLNGIEWILEEVESKRAR 

3602 

A 

1 

171 

RILMAINGKVFDVTKGRKFYGPVKYHHVGKLLTAGEEPA\VYSDEEE 
PKDESARKND 

3603 

A 

3 

253 

LSLQEFGTSFAGRDASRGLATFCLDKEAIiKDEYDDIiSDLTAAQQETL 
SDWESQFTFK/LLKEGEEPT\ VYSDFPPPKnPQnpvTjn 

3604 

A 

3 

701 

PARHSLAQREEKVASSGSLPSAAQPLLSEIMAAEDVVATGADPSD\L 

ESGGLLHEIFTSPLNLLLLGL\CIFLLYKIVRGDQPAA\SGDSDDDE 

PPPLPRLKRRDFTP\AE\LRRFDGVQDPAPYSWAINGKVFDVT\KG\ 

RKFLTGPEGPYGVFAGRKCIQGALPTFLLDKEA\LKDE\YDD\LSDL 

T\AAQQE\TLSDWESQFTFKYHHVGKI,LKEGEEPTVYSDEEEPKDES 
ARKND 

3605 

A 

3 

678 

KRLPKMAE VQVLVL\ DGRAHS S LG \ RLA\ AI VAK\ QVLLGRK\ WW 
\RCEGI\NISGNF\ YR\NKT.TTV r T.riTrT5DT>cr2MisJ r nKiT5c?'D\ pdvupcV 7\r» 

SRILLAGTVRGM\LPHKTKAEAKAA\ LDRL\ KVFDGIP\PPYGQEKS j 
G^TVVP\AALKVVR\LKPTRKFCLIWGRTA APFVfi\ WKrnziVTnTT in? 

\KRKREKPRFHYRKKENSIMRL\RKOAREETWRKXIDKYTE\n J KT , Hn 
LLV 

3606 

A 

114 

353 

ALSSRGVMAEYGTLLQDLTNNITLEDLEQLKSACKEDIPSEKSEEIT 
TGSAWFSFLESHNra^DKDl\TLFHSWPTQTVAIi 

3607 

A 

104 

342 

ALSSRGDNLSYIEHIFEISRRPDLLTMVVDYRTRVLKISEEDELDTK 
LTRI PSAKKYKDI IRQPSEEEI I KIxAPPPKKA 

3608 

A 

211 

241 

NFNGRGGLAAGRRKKPDWLDSTIxNS * PSH 

3609 

A 

37 1 

116 

RPTRPPTRPDLLRKPFFTRYPG PLGL 

3610 

A 

104 

516 

ALSSRGVMAEYGTLLQDLTNNITLEDLEQLKSAFKE\DIPSEKSEEI 
TTGSAWFS FLESHNKLDKDNLSY I EH I FE I SRRPDLLTMWDYRTRV 
LKISEEDELDTKXTRIPSAJOCYKDIIRLPSEEEIIKL\APPPKKA 

3611 

A 

569 

838 

PTVLWEPPSFPPPPPQGIPIHSG/PDTLTGRRRASPSGLCYSENGGL 
TEEGSTRPPAH/SPGATVSSPPPPPPPPSSSLPCCLGNRHFDDRV 

3612 

A 

268 

356 

EFFILTFKNNIFIYRYNRISREWTQKYAM 

3613 

A 

489 

605 

QLQVCYCT*RPYSGTRCGVSlWOLLLCSLI J SHl J CrSAS 1 

3614 

A 

391 

561 

AYTERSEEAJ^QAHTMALK^INKELSDlxARDPPAQCSAGPVGDDMFH 
WQATIMGPND 

3615 

A 

367 

855 

GARRQAHTMAiK\RINKELSDLAllDPPAQCSA\GPVG\DDMra\WKA 
TINGDLMDKPIFKGGVFFLTIHFXPTDYPFia'PXKVAFXTTRIYHPN 
INSNG\SICLDILRSQWSPALTISKVLLSI\CSLL\CDPNP\D\DPL 
VPE\ IARV\ YKTDRDKYNRISREWTQKYAM 

3616 

A 

1002 

1480 

MLLLKTTER FE VS VCMACT YV \ SNIjGKKQRS VS FLASGLMRVS TG PE 
LRLHHSFVLTGDVGRRICRLLVGLFTKGDTSSKRV\HPFSPGP\CFL 
LCDLAR\VGSSPKINRVPHFTRTQTSTQRSCTVFVWQRCSLVGPFQV 


;§. O O S "-ll- €•» «•!!■■ 3; . O 2 H £» O 2! 






TVFTM\ YFHHSLRS ISRFS SG 

3617 

A 

223 

415 

MEPAAPSQKPSRGSGPLPRIPLLAATPLHTRRQPGPRHRRTAEEIVG 
ALFGTANI/TANGAVGGE 

3618 

A 

1 

858 

MQNIiKLDSLLLKATPLIREQPHLNPAVPHLQQKERLPGLSDKAAAPL 
AVRFAVPKSCKKTGLETGIQLQIHPLHAKGPGRASTAPPQPEVTHRA 
DY FFRRPPVARPRLS P AGS IHAKSVGSGS CRAKGGCRTRS SRTWNNR 
GRLRDPGASLGPRMYSHGIELACQKQKEFVKSSVACKWNLAEAQQKL 
GSLALHNSESLDQEHAKAQTAVSELRQREEEWRQKEEALVQREKMCli 
WSTDAISKDVFNKSFINQDKRKDTEDEDKSESFMQKYEQKIRHFGFG 
YDAL 

3619 

A 

144 

1339 

DPEQPDMEQPWPPPGPWSLPRAEGEAEEESDFDVFPSSPRCPQLPGG 
GAQMYSHGIEI^CQKQKEFVKSSVACKWNIAEAQQKIX5SLALHNSES 
LDQEHAKAQTSSIRTEGNGKKNWRQKEEAIi\RQREKMCLWSTDAISK 
DVFNKSFINQDKRKDTEDEDKSESFMQKYEQKIRHFGMLSRWDDSQR 
FLSDHPYLVCEETAKYtiltiWCFHLEAEKKGALMEQIAHQAWMQFIM 
EMAKNOJVDPRGCFRLFFQKAKAEEEGYFEAFKNELEAFKSRVRLYS 
QSQSFQPMTVQNHVPHSGVGSIGLLESLPQNPDYLQYSISTALCSLN 
SVVHKEDDEPKMMGHCIIWVKTAEAKCYFVTGKGGTWAIFLDTFMGV 
LALYFCSVFDGRERVLKCFVNFF 

3620 

A 

2 

486 

ESGQVKXRPVSSASRGLRPAKPRRTTMLKAKILFVGPCESGKTVLTN 
FLTESSDITEYSPTQGVRFEPCWPALMKDAHGVAIVFDADIPSHRKE 
LEMWDS CFCQQPSLQDTQCMLI AHHKPS SGDDKGS LSLS PPLNKLKL 
VHSNLEDDPEEIRMEFIKYL 

3621 

A 

1 

1080 

MRSHFSALGWSMGLSALEQGWLVGEARAAQEPMEWVGGSGXAGCRS 
RALPRGKAAKARFESCWPALMKDAHGWIVFNADIPSHRKEMEMWYS 
CFVQQPSLQDTQCMLIAHHKPGSGDDKGSLSLSGGGGNRRQGRSCPR 
GWTLGVAGAAAAAAQG KEARGLHVGS TWAPATPLGNLS PAAP YSGDL 
DSNHCPEEVVEASPVLMSPEGPSKDLYLHSFSQSGKTVXANFLTESS 
D I TE YS PTQGVRILEFEN PHVTSNNKGTGCEFEL WDCGGDAKYVS FK 
ESHFIKWFKNQLPKPGAVAHTCNSSTLGGQGGAMATPGTLRAAAPEP 
AFEFPQKT I S MQVKVAKRL VC I TRLNLAFL I 

3622 

A 

4 8 

653 

RAASRGLRPAKPRRTTMLKAKILFVGPCESGKTVLANFLTESSDITE 
YSPTQGVRIIiEFENPHVTSNNKGTGCEFELWDCGGDAKFESCWPALM 
KDAHG WI VFNA\ E S QATGRKWRWG I P AFVQQPS LQDTQC \ I L I AHH 
KPGSGDDKGSLSLS PPLNKLKLVHSNLEDDPEEIRMEFI KYLKS I IN 
SMSESRDREEMS IMT 

3623 

A 

44 

421 

SICLPPVAPWLRPVHRQLGEANEEFALRVQQLVAKELGQTGTRIiTPA 
DKAEHMKRQRHPRLRPQSAQSSFPPSPGPSPDVQLATIjAQRVKEVLP 
HVPLGVIQRDLAKTGCVDLTITNXjLEGAVAF 

3624 

A 

20 

422 

lgihvflvscalpdsvlrrfwrtmcavlglvarqedsglrdhsvrv 
lisnhvtpfdhnivnllttcstpllkrfcastrlpptplllfpeeea 
tngregllrfsswpfsiqdwqpltlqvqrplvsvtvsda 

3625 

A 

1 

346 

LRPQS AQS S FPPS PGPS PDVQLATLAQRVKE VL»PHVPI>G VI QRDLAK 
TGCVDLTITNLLEGAVAFMPEDITKGTQSIiPTASASKF/PRQESLQE 
RKQALYEYARRRFTERRAQEAY 

3626 

A 

162 

461 

LVAKELGQTGTRLTPADKAEHMKRQRHPRLRPQSAQSS FPPS PGPS P 
DVQLATLAQRVKEVLPHVPLGVIQRDLGMGKGGPTLGDGHREEKWVV 
KEESK 

3627 

A 

39 

260 

DCKPVARPSTKAKTDQSGPFLSLLPHRYLSLSLTLRSSSPESMTCWM 
EGREKEKS ASQ * VTTTFRASMS S YTA 

3628 

A 

1 

2160 

FRPGAPEAAVMELSSW.PGPERLFDSHRLPGDCFLLLVIiLLYAPVGFC 
LLVLRLFLG IHVFLVS CALPDS VLRR FVVRTMCAVLGIiVARQEDSGL 
RDHSVRVLISNHVTPFDHNIVNLLTTCSTVSESEADSATVRFPGAQL 
KAPLS PLAFPMEDTEALPLTP I L YPTCQFFFF I FLN I FLLAFS S PGS 
QPLLNKS PQAFV\ CWSRGFMEMNGRGE \ L»VE \ ShKRFCAS TRLPPTP 
LLLFPEKKATNGREGLLRFSSWPFSIQDWQPLTLQVQRPLVS/VGE 
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3629 A 


789 


3630 A 1706 


2082 


CVLNRESLGLEGEISHPWPWLRPHLMPSPQTVSDASWVSELLWSLFV 
PFTVYQLVAKELGQTGTRLTPA\DKAEHMKRQRHPRLRPQSAQSSFP 
PS PGP S PD VQ/ 1 WATLAQRVKE VL PH/ VGPLGVT QRDLGMGKGGPTL 

GDGHREEKWWKEETKTGCVDLTITNLLEGAVAFMPEDITKGTQSLP 
TAS AS KFPSSGP VTPQPTALTFAKS S WARQESLQERKQALYE YARRT 
FPQVTRSHIISTALEDPCLSKISTTCQRGNSLRSCQRATMTSQPLRL 
AEEYGPSPGESELAVNPFDGLPFSSRYYELLKQRQALPIWAARFTFL 
EQLESNPTGVVLVSGEPGSGKSTQIPQWCAEFALARGFQKGQVTVTQ 
P YPLAARSLALRVADEMDLTLGHEVGYS I PQEDCTGPNTLLRFCWDR 
LLLQEVASTRGPGKLGRAGTR 


MGFRHVGQAGLKLLTSSDLPALASQVQCKLWPASAERHISPLQRGL 
IAFSLRPDPGLAGDRASLGVPLCGYRIGNPRRSPKRTRHCARTAAVS 
CSAYSKCLYQFRPNVGHSASTPKAMCQTTAAGEGAHAADEGYTQRW 
GARRGTPQRYTGGAETKNRAFRSYPKMETSWPRGGGSTPPVALDAQV 
ERECALGSGAQGKHSCLAVRWLIFCINEHYPQTLTYRLPFVLPLRS 
GGTPTSWRALFFPSSSDSNCKLCGGTS S 


ILLFFSFFLRWSFALVTQAGVQWRDLGSLQPPPPGFKRFSCLSPPSS 
WDYRCPLPRLANF\LVETGFHHVGQAGLKLLTSGDPPTLASQSVGIT 
GRSHRARPNLILTGYCRIISVLSLMILNPII 


3631 C 44 


340 


MKCSQPXRCHFQSDFQKCAPCPRAQTHWLEPPGRVQTISSMRNAQKG 

FADSIRLWRLPASGVGVATVSPEGAGDPSHLLDPPGHSAPYSPAPROL 
SRVP* 


3632 I C | 2037 I 2426 

3633 | A | 184 19818 


MKCSQPPRCHFQSDFQKCAPCPRAQTHWLEPPGRVQTISSMRNAQKG 
FADS IRLWRLPASG VG WWS PP I QTQE VAPEGM YLVGS S SGTLGGCR 
ALTQVFLS LS S LGCVCACACACLCF S L WAHODVS S * 


IEEPRDTRLQVCSGVHIWCLDKFKMRKHRHLPLVAVFCLFLSGFPTT 
HAQQQQADVKNGAAADIIFLVDSSWTIGEEHFQLVREFLYDWKSLA 
VGENDFHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQ 
TGKGLEYIM\QSHLTKAAGSRAGDGVPQVIWLTDGHSKDGLALPSA 
ELKSADVNVFAIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIV 
GNLVS CVHSS VS PERAGDTETIiKD I TAQDS AD 1 1 FL IDGSNNTGS VN 

FAVILDFLVNLLEKLPIGTQQIRVGWQFSDEPRTMFSLDTYSTKAQ 

VLGAVKALG FAGGELAN IGLALDF WENHFTRAGG SRVE EGVPQVLV 

LISAGPSSDEIRYGWALKQASVFSFGLGAQAASRAELQHIATDDNL 

VFTVPEFRSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRD 

IVFLVDGSSAIiGLANFNAIRDFIAKVIQRLEIGQDLIQVAVAQYADT 

VRPEFYFNTHPTKREVITAVRKMKPLDGSALYTGSALDFVRNNLFTS 

SAGYRAAEGIPKLLVLITGGKSLDEISQPAQELKRSSIMAFAIGNKG 

ADQAELEEIAFDSSLVFIPT^EFRAAPLQGMLPGLLAPLRTLSGTPEV 

HSNKRDI I FLLDGS ANVGKTNFPYVRDFVMNLVNSLDIGNDNIRVGL 

VQFSDTPVTEFSLNTYQTKSDILGHIiRQLQLQGGSGLNTGSALSYVY 

ANHFTEAGGSRIREHVPQLLLLLTAGQSEDSYLQAANALTRAGILTF 

CVGASQANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTTYVS 

GGVEEVPLAQPESKRDILFLFDGSANLVGQFPWRDFLYKIIDELNV 

KPEGTR I AVAQ YSDDVKVESRFDEHQSKPE I LNLVKRMKIKTGKALN 

LGYALD YAQRY I FVKSAGS R IEDGVLQFLVLLVAGRS SDJR VDGPASN 

LKQSGWPFIFQAKNADPAELEQIVLSPAFILAAESLPKIGDLHPQI 

VNLLKSVHNGAPAPVSGEKDWFLLDGSEGVRSGFPLLKEFVQRWE 

SLDVGQDRVRVAWQYSDRTRPEFYLNSYMNKQDWNAVRQLTLLGG 

PTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLTADRSGDDVRN 

PS VWKRGGAVP IG IG IGNADITEMQTI SFI PDFAVAI PTFRQLGTV 

QQVISERVTQLTREELSRLQPVLQPLPSPGVGGKRDWFLIDGSQSA 

GPEFQYVRTLIERLVDYLDVGFDTTRVAVIQFSDDPKAEFLLNAHSS 

KDEVQNAVQRLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQ 

FLVLISSGKSDDEVWPAVELKQFGVAPFTIARNADQEELVKISLSP 

EYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAVR.g 
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DAADIVFLIDSSEGVRPDGFAHIRDFVSRIVRRIiNIGPSKVRVGWQ 

F SND VF PE F YL.KT YRS Q AP VUD AI RR1»RI»RGG S PLNTG KAL E FVARN 

LFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSRFAQVIRSSGIVSLGV' 

GDRN I DRTELQT I TND PRL VFTVRE FREL PN I EER I MNS FG PS AATP 

APPGVDTPPPSRPEKKKADIVFLLDGSINFRRDSFQEVLRFVSEIVD 

TVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAINKWYKGG 

RHANTKVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGKSVEDAQD 

VSLALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSEL 

SEQVLETLHDAMHETIiCPGVTDAAKACNLDVILGFDGSRDQNVFVAQ 

KGFESKVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPFG\PEEAF 

DFDEYQPEMLEKFRNMR\ S QHPY\ VLTEDTLKVYLNKFRQS S PDS VK - 

VV IHFTDGADGDLADLHRAS ENLRQEGVRAL I LVGLER WNLERLMH 

LEFGRGFMYDRPLRLNL\LALG\YX)QADQLDNIAEKACCGVPCKCSG 

QRGDRGPIGSIGPKGIPGEDGYRGYPGDEGGP\GERGPPGVNGTQGF 

QGCPGQRGVKGSRGFPGEKGEVGEIGLDGLDGEDGDKGLPGSSGEKG 

NPGRRGDKGPRGENGERGDWILGGPGLSY/GQDSQERGPGGETGDL 

GPMGVPGRDGVPGGPGETGKNGGFGRRGPPGAKGNKGGPGQPGFEGE 

QGTRGAQGP AGPAGPPGLI GEQG I S GPRGSGGAAGAPGERGRTGPLG 

RKGEP\GEPGPKGGIGNRGPRGETGDDGRDGVGSEGRRGKK\GERGF 

PGYPGPKGNPGEPGLNGTTGPKGIRGRRGNSGPPGIVGQKGRP\GYP 

GPAGPKGNRGDS IDQCALIQS IKDKCPCCYGPLECPVFPTELAFALD 

TSEGVNQDTFGRMRDWLS IVNDLTI AESNCPRGARVAVVTYNNEVT 

TE IRFADSKRKSVLLDKIKNLQVAIiTS KQQSLETAMS FVARNTFKRV 

RNGFLMRKVAVFFSNTPTRASPQLREAV3jKLSDAGITPIjFLTRQEDR 

QLINTO.QINNTAVGHALVLPAGRDLTDFLENVLTCHVCLDICNIDPS 

CGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTIiFQFNEMKKYIA 

YL VRQLDMS PD P KG \ S QH F ARVAWQHAP S E S VDN ASMP PVKVEF S L 

TDYGSKEKLVDFLSRGMTQLQGTRALGSAIEYTIENVFESAPNPRDL 

KI VVLMLTGEVPEQQLEEAQRVI LQAKCKGY FFWLG I GRKVN IKEV 

YTFASEPNDVFFKLV\DKSTELNEEPLMRFGRLLPSFVSSENAFYLS 

PDTRKOCDW\ FOGDOPTICWTA/KFOHKO\nWP\TJTJVT 

TK\ PVTTTKPVTTTTKPVTTTTKPVTI INH\ PFVKPAA/ AKSPAPAK 

PVA\ AKP VATKTATVRP PVAVKP ATAAKP VA\ AKPAAVR PPAAAAAK 

P VATKPEVPRPQAAKPAATKPATTKPMVKM SREVQVFE I TEN S AKLH 

WERPEPPGPYFYDLTVTSAHDQSLVLKQNLTVTDRVIGGLLAGQTYH 

VAVVCYLRSQVKAIYHGSFSTKKSQPPPPQP\ARSAS\SSTINIiMV\ 

S TEP \ LALTETDI CKLPKDEGT / CARDF I LKWY YDPNT\ KSCA\RFW 

YGGCGGNENKFGSQKECEKVCAPVLAKPGVI S VMGT 

3634 

A 

65 

244 

SRRRGGVSAPTSFYGRDRRMFPAQEEADRTVFVGNLEARVREEILYE 
LFLQCCS FLTMK 

3635 

A 

65 

262 

SRRRGGVSAPTSFYGRDRRMFPAQEEADRTVFVGNLEARVPJBEILYE 
LFLQVLCPREMGILS ISP 

3636 

A 

2 

649 

GKSRRMFPAQEEADRTVFVGNLEARVREE I LYELFL»QFL I \ AGPLTK 
VTICKDREGKPKSFGFVCFKHPESVSYAIAIiLNGIRLYGRPINVQYR 
FGS SRS S EPANQS FES CVKINSHNYRNEEMWGRS S FPMQYFPINNT 
SLPQEYFLFQKMQRHVYNPVLQLPYHEMTAPLPNSASVSSSLNHVPD 
LEAGPSSYKWTHQQPSDSDLLSDDSSTS 

3637 

A 

312 

805 

G PQAGS S PE I LLPEDNS L I RWAGAGCHTQDS ATTVTAS WAPLCAGAG 
REEDGTPCVCLGRSARPKLVFGRPLRTSAALRPGACIPHGLLCYPRE 
PVDFRPGSGRAWPPRSPSP/CQDPRPSRSAPGSEPGW/CPDPPAPPR 
LVTLGRASPHLTLPRRAAPPPCRWRP 

3638 

A 

2 

1581 

HGDVSPRCLFKEGSAAQLQTLELAGSLHLSAPSGSAKATKVESLQSR 
IYIHLLVVLMAVTACLEQLLRGCQLQYGRRGQSCPLLRDSGRQEHAR 
APSPTTSARRETHVPGHSCSRPAAALHPAIPALLGAQETPCPCRLKV 
PAPEVAVHSMSASAPAAEGEGTPTQPASEK\DPEMPGPREESEEEED 
ED\DEEEEEEEJCEKSLIVEGKREKKKVERLTMQVSSLQREPFTIAQG 
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KGQKLCEIERIHFFLSKKKTDELRNLHKLIiyiJRPGTVSSLK\KNVGO 
FSGFPFEKGSVQYKKKEEMLKKFRNAMIiKSICEVLDLERSGVNSEIiV 
KRILNFLMHPKRSGKPLPKFKKSCSKGSKKDRNSSGMASKAKRTKCP 
EILSDESSSDEDEKKNKEESSDDEDKESEEEPSKKTVKRE\KPKQRA 
TSKSKKSVKSAmnCKADSSTTKKNQNSSKKESESEDSSDDEPLIKKL 
KKPPTDEELKETIKKLLASANLEEVTMKQICKKVYENYPTYDLT\ER 
KDFIKTTVKELIS 

3639 

B 

199 

286 

XSELESALQMEPAAFQALYSAEKPKLEDEHLVFFCQMGKRGLQAHAA 
GPESWIHWGSQLRWSL* 

3640 

A 

754 

1082 

GMV 1 1 CCLGAPRTQP FQ AQLPNLS AKLLAFPS TL S T P P VS ELE S ALQ 
M\DQPAFQALYSAEKPKLEDEHLVFFCQMGKRGLQATQLARSLGYTG 
ARNYAGAYREWLEKES 

3641 

A 

113 

737 

AGPDSFCNFSVCPSSTSTAWILVLAQFLLSPTLLPSPRMLTVALLAL 
LCASASGNAIQARSSSYSGEYGSGGGKRFSHSGNQLDGPITALRVRV 
NTYYIVGLQVRYGKVWSDYVGGRNGDLEEIFLHPGESVIQVSGKYKW 
YLKEAGYLVPTKGP\FLSFGKSKGP\SFNAVPLHPNTWRFISGRSG 
SLIDAIGLHWGCFTHTSCSRC 

3642 

A 

1724 

1819 

AFVSLGLMCFGALIGLCACICRSLYPTIATG 

3643 

A 

1 

894 

RGESRVL WS ELEGEAGGAGGWAS SLNARMDNRFATAFVIACVLSliI S 
TI YMEG\ S IGTDFWYE YRS PGQENS SDLNXS I WDEFI SDEADDATYN 
\DALF\RYNGTVGFLRRVYSPYPKTLHWVLAHHERTESFDVVTK\CV 
SFTL\TEQFMEKF\VDPGK/HTNSRGLDLLRTYL\WRCQFLLPF\VS 
LGFDVLWGALIGLCA\CICRSLYPSIATGILHLLAGLCTLGSVSCYV 
AGIELLHQKLELPDNVSGEFG\WSFCLACVSAPLQFM\ASALFIWAA 
HTNRKE\ YTLMK\ AYRVGMS KKPAC F 

3644 

A 

323 

898 

IWHLVPTTQRSGKGKIMELVERSWARVEVGHRGFLERGLPIAAIN 
DMKKS PE 1 1 SGRMTFGKNADLDNWALS AS PGGS LQCTRDQEQI LQVI 
RAGTGPDKNQGNSIjVQWTFLVLPRRE YGE I EAS KVI PAGQN1ANRTH 
PNETSVWKCHKEAGSLWNIPFAGLPAASVMEKRWFFPTVAEVQAIiTGG 
QQQ 

3645 

A 

816 

1171 

LFKHFWGPVANVJGLPIAAINDMKKSPGDYSVGRMTF\Al»C\CYS\liT 
FMEILPYKVQPSGTWASCFACHGTNEVRPSFIQGEGRIilPNTRMDLK 
RASGINQWGKGKNKVFEGDSLCPGCC 

3646 

A 

205 

369 

SSQPPTLLPAGIDVEQVSWINFDLPVDKDGNPDNETYLHRXGRTGR 
FGKRGLAV 

3647 

A 

3 

452 

CSGAGAEPAANPRSPRSLVPSLESTSTSVPPAPGTMATDSWAIAVDE 
QEAAAE / S G A WKTNANAE KTDE EEKEDRAAQ S LLNKL IRS NL VDNT 
NQVEVLQRDPNSPLYSGKSFEELRLKPQLLQGVYAMGFNQPSKIQEN 
ALPLMLAEPY 

3648 

A 

1 

186 

WF PVP I S AE PAANLWS PRFLVPSLECTSTS VP PAPGTMATDS WALAV 
DEQEAAAESVSWLQP 

3649 

A 

2 

261 

S S ANN I S I LGAFRS VTKWL S * THPL* FFCQCLCLSPTYELALQTGKV 
IEQMGKFYPELKLAYAVRGNKCEYGRRQLNSEQGRGLGV 

3650 

A 

1 

2613 

MPSQTRKTASWLAAELSKEGHQVALLSGEMMVEQRAAVIBRFREGKE 

KVX.VTTNVCARGVKLQTFTTOVTALKVVRRELLVPU5GL 

EAVDLCAYVSSGSYSYMTSSASEFLLSLTLPGAPGAQIVHSQVEEOQ 

LUVAGSSWSCLLEACPLGPGNVLQHCHbLVKPERVGTLGLGMATLP 

YI»D SGALDGTHRVKLVPG YAGLQRLS KERLSGKSCACRRCMGDAGAS 

DWFDSHFDGNISPVWTRENMDLPPDVQRWWMGFFFVPRMKPKGSSWS 

TAPSISWAKVLFLLQLLKKLGYRI^QAPTVGFEQDVGSRTTHHFMYP 

ESAKNLPANVSFVLVP^KVLDLLWIASALSTGQIRFSCIQGLSGSVF 

KKRSCPS I CGHKASREWNEHI AVRTLFWROGAGTYIiNVPWSEFFKRT 

GVRAIINYPTQGPSLKEDTGVLAANTNSRGYHVIKSMSKSVLALTLH 

TKMLCRQGDRMLQPQFKSHNTNEVLEKLFQIVPGENPYRFRDPPPCR 

RCAWGNSGNLRGSGYGQDVDGHNFIMRLPFCLQGTLVEYGISPALP 

QNLIAOSQSGTGKTAAFVLAMIiSRVEPSDRYPQCLCLiSPTYEIiALQT 
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GKVIEQMGKPYPELKLAYAVRGNKSGYWGPMFTLPSRILPLPSTIiGP 
VSTRSVGBEALTRECQSQKCVSAARTSRMBLTAVSTKSYNIRISDYH 
PIFHISRIRTIRRFTLSRYSPFLEIPQFPNLHLHTIPMYAVLRIHRV 

T I WVS A I TKYR Y PMYT /NTPMT? W TDT TTtTT 1 CUT dt dxttt VTvmmT«TTrt« 

SPFVSHFLKPSTVHIRFADPNSLSPLSFTIPFRHSNTFRIPLPNNKQ 
FQVTISVPVSESGAMTSLHLERNFQ 

3651 

A 

3 

550 

LCSSR/DEKFKALCNLPG/AITMAQAMIFCHTRKTA\SWLAAELSKE 
GHQVALLSGEMMVEQRAAVIERFREGKEKVLVTTNVCARGEQRTCPT 
WSARLGVPGPIRARNPCIQGSRMVSGRWVGLVTLFLSRRDCLDFPEV 
VRTHTCQ VR WLM P AI PRQENCLNPGDRGCS E pRMC YS TPS WVTE 

3652 

A 

1 

1571 

CRGAG AE PAANPRS PRS L VPSLE YTS TS VPPAPGTMATDS WA\ LAVD 
EQEAAAESLSNLHLKEEKIKPDTNGA\VVKTNANAEKTDEEEKEDRA 
AQS LLNKL I RSNLVDNTNQ VE VLQRD PNS PL YS VKS FEELWL \ KPQL 
SQGVYAMGFNRPSKIQENALPLMIxAEPPQNLIAQSQSGTGKTSLPLV 
la\ mlsr\ VEP I RPDTPQCLCLFPT\ YEAGGFKQGKVI EQMGKFYPE 
xjxvxxM x v ^\vjix Ju»t»Kby a. j. o r,y J. V X G rPGT/ VGLNW \ CFQLKFIDPKK 
I KVFVLDEADVM I ATQGHQDQS IRIQRMLPRNCQMLLFSATFEDS VW 
KFAQKWPDPNVI KLKREEETIxDTI KQYYVLCS S RDEKFQALCNL YG 

n*. x -inyru j j.x- \ v-XT 1 xvxVX/^ r¥xj/\MJ}±j \ O xVJboixy V VMIjoGEMM VEQRAA 

VIERFREGKEKVLVT/TPNLCSRGIDVDQVSWINFDLPVDKD\GNP 

DNET\YLH\RIGRTARFGK\RGLA\ VWMVTlQK\ UQMMTT \XTOTA\ TTtJ 
F \NKKIER\ LDTDDLDE IEEN 

3653 

A 

3 

421 

QNTIDx^SLLGPKVIjLFLAAFIITSDWIPLGVNSQRGDDVTQATPET 
FTEDPNLVNDPATDETVLiAVXiADIAPSTDDLECWDEKPTCTRLYSVH 
RPAJCQCIHQLCFTGLRRMYIVNKEICSCLGWKEHEAMKEELCRSK 

3654 

A 

209 

436 

xvxjxjv- j-x: x: xjxjx uor x' i xro XxjxTJjv~v*r X XjVLtljXr Lt I WAoSECWDEKFTCT 

RLYSVHRPVKQCIHQLCFTSLRRMYIVN 

3655 

A 

126 

775 

nxnAviMrtvciURUir ooyKiKWAiiyw I XxJNMAJbljGPKGLIjFIxAAFII 
T\SDWIPI^V\NSQ\RGDDVTQ\ATPETl^EDP\x^VxTOP\ATDETV 
LAVIiADIAPSTDDL\ASLSEKNTTA\ ecw\dekft\ctrlyscasgp 
VNQMHFIQLCFHPVLR\RMYI VQPRE I \ CSSVXiVCKGTPKPMKDELC 
\RQMAGLPPLGRL\RRPQLLPDFPPCENVDLQRPNGL 

3656 

A 

3 

220 

SRSAAHCCHGGTKMAAIxAYNLGKREIj^ 
HIAFIGDSRIRQLFYSFVKIINPQF 

3657 

A 

525 

1625 

GAPRCWPRWPCCCSQRATSPPAATI^IPCEYLLSSGRFIXSEKVWQP 

now «irixvxx\xor*/vi\xMvwijVUJ\JlJ. \Ax* lCi\DSRIRQL»FYSFvKl INPQF 

KEEGNKDFLWHPEVNGSMKQCI KVWTEDS IAKPHVI VAGAATWS I KI \ 

HNGSSEALSQYKMNITSIAPLLEKIjAKTSDVYWVIjQDPVYEDLLSEN 

RKMITNEKIDAYNEAAVSILNSSTRNSKSNVKMFSVSI^ 

SLDGLHLPESSRETKLAACFFTLS I IGYLI FYI IHRNAHRKNKPCTD 

LESGEExOCNIINTPVSSLEILLQSFCK^ 

NKF YTHS S F F I P 1 1 Y T T. VT ,RVP YNTPTJT \cw VTrvT>ovTrirtir» 

3658 

A 

190 

892 

SLPRRLE S PGRLAMAARGV I APVGE S / LCR YAE VPCS PS AKR PDADV 
DQQRLVRS L I AVGQGVAALAFAG\RYAL \ R I WKPLE \ QV I TETAKGR 
FQTPSFSSYYK\GGFDQK\MS\RREA\GLILGVSP\SAGKAK\IRT\ 
AHR \ RVM I LES PQ I KGG S P \ YVAAKINEAKD \ IjLETTPPTLMIjKDHT 
EGKKNEGTSKKKKS PAJCYS KTWS FL I F YMGFDHSL.I FRPLSCITNKM 
LIVLLFIIF 

3659 

A 

32 

763 

ARGNIxNHHPGAAWAPCVRLVPCSLLVI^ 

KNSCQHLEEDVD\QTKWRTDTPS^KILMEEVKLEEQLKEAVEEDKQ 
AxxMDTEGSEQSSQKLVEEGxTOYS IQGFCKDSL 

EEIKI)NNPHLKNLSLWDHLVAGKEGQDSQSFYIMSMEKLSSTKLVPA 

PPKIGDRCTQxxREKNKKLKSLPLSPQPRCLTPLSLMQQEAPGERFSP 
ILYFCHVLITVW 

3660 

C 

162 

425 

MVGPSLHAGXXXVYIPRFLYIRSWLPCIFFSGGVTVGNIGRQLAMGV 
PEKPIVIESSKPXILESXGRFLEENIxXLVDYXKGLSFFLK* 
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3661 

A 

89 . 

308 

1 PMPRSGLGLGITIAFLATLITQFLVYNGVYQYTSPDFLYIRSWLPCI 
FFSGGVTVGNIGRQLAMGVPEKPHSD 

3662 

A 

2 

454 

GRVGGRVGLS LTLAALS LGLWWTFDRSRSGLGLGI T I AFLATLI TQF 
LG YNGVYQYTSPDFL Y IRS WLPC I FFSGGVTVGNIGRQLAMLI PFCE 
ELNLKTTWLFHKTRSNYRVFLKS P I VI ES S KPPILRARKI LEENLTV 
DYDQDYLFS 


TV 

A 

570 

1418 

PMPRLHDHFWSCSCAHSARRRGPPRAIAAGLAAKVGEMI IVFVSGPS 
LMAVLSASDADPAPRGRSAVKSGPYPGSPYPNTWHHSLMQKSLVLFS 
VGEVLALVLNLLQIQRNVTLFPEEVIATI FS SAWWVPP\ CCGTAPAD 
VGLLYPCIDSHLGEPHKFKERMGQVSMRC I AVFVG INHAS AKLDFAN 
NVQLS LTLAALS LGL WWTFDR SRSGLGLG \ ITIAFLATL\ ITQFLVY 
NGVYQYTSPDFLYIRSWLPC\ IFFSGSVT \VGNIG\RQLGYGVFLEK 
PHSD 

3664 

A 

195 

462 

RI FSMTSGRLRWRCTWRPATALWSASLRLGTSSMHP3 PRS ISLP/ PV 

1 TT1 TTTfT1tT\t A/^lt^mt /—% t 0% % — ■ 0+ — — — _ 

HVHTHNSGKEVLGLQVQRSRSGTGPACSQAGSGAVQGGNWCIF 

3665 

A 

12 102 

± Z J Z / 

1 RIFSMTSGRLRWRCTWRPATALWSASLRLGTSSMHPSPRSISLPLSM 
MLS PLPSNTRGLS PTALFRPHS LS PQM 

3666 

A 

535 

992 

RI FSMTSGRLRWRCTWRPATALWSA\ SLRLGTSSMHP\ S PRS I \ SLP 
LS \MMLS\ PLPS \NTRGLSPTAIjVRSP\NSEHATSCP\RLHLWRCRA 
PLRS PS PLGRLPGATPGVPSHVHT\HNSGKEVLGLQVQRSRSGTGPA 
CSQAGSGAVQGGNWCIF 

3667 

A 

164 

280 

CGQVHMANKWQTEEVSPPSALLWAGPMPLLPRTPLSS 

3668 

A 

3 

2142 

GRMLDGPLFSEGPDSPRELQDEESGSCLWVQKSKLLVIEVKTISCHY 
RRRAPSRQPMDFQASHWARGFQNRTCGPRPGSPQPPPRRPWAYRVLQ 
EATNWRAGPLAEVRAREQEKRKAASQEREAKETERKRRKAGGARRSP 
| PGRPRPEPRNAPRVAQLAGLPAPLRPERLAPVGRAPRPSAQPQSDPG 
SAWAGPWGGRRPGPPSYEAHLLLRGSAGTAPRRRWDRPPPYVAPPSY 
EGPHRTLGTKRGPGNSQVPTSSAPAATPARTDGGRTKKRLDPRIYRD 
VLGAWGLRQGQGLLGGSPGCGAARARPEPGKGWEKSLGLAAADLNS 
GSDSHPQAKATGSAGTEIAPAGSATAAPCAPHPAPRSRHHLKGSREG 
KEGEQIWFPKCWIPSPKKQPPRHSQTLPRPWAPGGTGWRESLGLGEG 
AGPETLEGWKATRRAHTLPRSSQGLSRGE\GVFVIDATCWIRSQYV 
PTPRTQQVQLLPSGVTRWGDS PS QS KPGKEEGEGATVFPS PCQKRL 
SSSRLLHQPGGGRGGEAEGGRPGDSTLEERTFRILGLPAPEVNLRDA 
PTQPGS PEHQALGPAASGAQGRAEGS EVAWQRRAGRGWARTPGPYA 
GALREAVSR IRRHTAPDSDTDEAEELS VHSGS SDGSDTEAPGAS WRN 
ERTLPEVGN S S PEEDGKTAELSDS VGE ILDVI SQTEEVLFGVRDIRG 
TQQGNRKRQ 

3669 

A 

76 

419 f 

AFI PAMAEL IQKKLQGEVEKYQQLQKDLSKSMSGRQKLEAQLTENNI 

VKEVRDWDLWGEEGPVLAIWLITYVPSLHQELALLDGSNVVFKLLGP 
VLVKQELGEARATVGKRLDY 

3670 

A 

71 

350 . 

AFIPAMAELIQKKLQGEVEKYQQLQKDLSKSMSGRQKLEAQLTENNI 
VKE \ KRYESQLRDLERQSEQQRETLAQLQQEFQRAQAAKAGAPGKA 

3671 

TV 


1191 1 

GLLPHLGPRVQRLPRLSLSTLACGLTRGPHPFLLPQIHIHLTRIVGI 
GGTFDVSKLPFLSSPDLSKSMSGRQKLEAQLTEYNIAKEVRDWDLWG 
EEG P VLAMVL I TYVPS LHQELALLDG SNWFKLLGP VL VKQELGEAR 
ATVGKRLDYITAEM*VFIPPPCAAPCDASEPLE*RC*TIAEQLSIVA 
PSPPVPPTLSFPF*PPFFSLPWISRFSTYLFLAFSTLHSESY*FLPF 

CLSPLSLLSKRYESQLRDLERQSEQQRETLAQLQQEFQRAQAAKAGA 
PGKA 

3672 

A 

210 

617 

AFI PAMAELIQ \ KKLQGEVEKYQQLQKDLSKSMSGRQKLEAQLTENN 
IVKEELALL\DGSNV\VFKL\LGPVLVKQELGEARATVGKRLDYITA 
E IKRYESQLRDLERQSEQQRETLAQLQQEFQRAQAAKAGAPGKA 

3673 
3674 

A 
A 

3 

31 

266 S 
500 j 

GARGEGKTYGAGVALEAGDSDSGSRGGLSHHRAKLSSLQQLPTPPPL 

HELNYNAANSQSSREEPPEVVESATQKSLLDHRGTVTSGS 

ARVTAAVSAAAVAKRVWRDGFDCSGYYSLTTILLVQPTKRP\EGRTY 
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ADYES\VNECMEG\VCKMYEBHLK\ RMNP\NGPSI /THNTTSOL.FDF 
I D \ DLADLS CLVY RA\ DTQTY \ QP YN \ KD W I KDEDPTVLLS SGKAQQ 
ARE 1 1 WGSHWGGWGGAWDTGV 

3675 

A 

1 

653 

MLQQDSNDDTKDVSLFDAEEETTNRPRKVKIRHPVASFFHLFPRVSA 
1 1 VCLLCELLS S S F I TCMVTI ILLS CDFWAVKN ITGRLMVGLRWWNH 
VDEDGKSHWVFESRKESSQENETVSEAESGIFWLGLVACPVLWVILA 
FSARFSFR/VKKWLAWIVGWLQGANLYGYLRCKVPSVRIESLDTQ 
LGIQNWRIGWYEGKAHTSGVRSVGNRNRSE 

3676 

A 

399 

659 

VQPVDCIPERGNAAGPDGGEKCFLWDLQCSHSL*HVGSKPSHPLFRG 
RSDSCVI*SKDPSSGHHTSESDTR*IPRPCWECPGPLPV . 

3677 

A 

291 

948 

RGSWKQGWRRAAMLQQDSNDDTEDVSLFDAEEETTNRPRKAKFSIPV 
\ASFFHLFFRVSAIIVCLLCELLSSSFITCMVTIILLL\SC\DFWAV 
KKVP\GYLL\VGLRWWNHIDEDGKSHWVFESRKESS\QENKTVSRGL 
NSRIF\WLGLIACPVLWVIFAFSALFSF\RVKWVGGWVIMGVVLQ\G 
ANLY\GYIRCKVRSRKHLTSMATFILLESSFLRQNTWR 

3678 

A 

1 

134 

KNKDWKFKVRCSRYLYTLVITDKEKAEKLKQSLPPGLAVKELK 

3679 

A 

162 

437 

RRGLFPAGPGPRQEPSRASS PMPSERLRNI KD \ FLLT\ ARRKDAKS \ 
VKIKiCNKDNV\KFKV\RCSRYLLHPWSSPDKEKAEKL\KQSLPPRFG 
K 

3680 

A 

60 

216 

ISLTGEELWHLLRSTSLGIASPPMHEVPFLEVDGDRTKTQVFPLTNL 
RRAC 

3681 

A 

505 

795 

NWE CLFKDAL PQL S ADRFLAS S I LLGDLG VEEV\ H F S VSNLTGEEL 
WHLLRSTS LGI AS PPIHEVPFLEVDGDRTKTQVFPLTNLHVRAHASA 
HAS 

3682 

A 

685 

1484 

LRLTFPE FSHTKVRTMS L FPS L PLLLLSMVAAS YS ETVTCED AQKTC 
PAVIACSSPGINGFPGKDGRDGTKGEKGEPGOGLRGLOGPPGKLGPP 
GNTPG PSG S PG PKGQKGDPGKS PDGDS S \ QAASERKALQTEMARI KKW 
LTFSLGKQVGNKFFLTNGEIMTFEKVKALCVKFQASVATPRN\AAEN 
GAIQNLIKEEAFLGMPDEKTEGQFVDLTGNRLT\YTNWNEGEP\NNA 
GS \DEDCVL\ LLKNG\ QWNDSPC\ STSHLAVCEFP I 

3683 

A 

427 

751 

CLSPSNPPCWGFLEDYAFWRGLLDLYEASQESAWLEWALRLQDTQD 
RLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSPQPAQV 
PITDPCELRKLLHP 

3684 

A 

31 

560 

EFVPHSEKMLYDQAQLAVAYLYAFQLSGDEFYSDVAKGILQYVARSL 
SHRSGGFYSAEDADSPPERGQRPKEGAYYVWTVTCEVQQLLPEPVLGA 
TEPLTSGQLLMKHYCLTEAGNI S PSQDPKGETCRARMLLDRPVLRWE 
LTAARFGLDV/ EKAVRTLLNTGLEKLFQARKHRPKP 

3685 

A 

3 

399 

KCV CLLTAFS ERMRRVPVALPEMVRALSAQQQTLKQ I VI / VWRPSGQ 
RTPKALVQCVHSVYIPNKVLILADGDPSSFLSRQLPFLSTLRRLEDQ 
ATA YVCENQAC S VPI TDP CELRKLLHPLTCPYPLGWGRR 

3686 

A 

2 

2238 

NVAAMLGARAWLVRVXiLLPRAGAGLAASRRGSS SRDKDRS ATVSSSV 
PMPAGGKGSHPSSTPQRVPNRLIHEKSPYLLQHAYNPVDWYPWGQEA 
FDKARKENKPIFLSVGYSTC^CTMMEEESFQNEEIGRLLSEDFVSV 
KVDREERPDVDKVYMTFVQATSSGGGWPMNVWLTPin^\QPCVGGTYF 
PPEDGLTRVGFRTVLLRIREQWKQIWNTLLENSQRVTTAIiLARSEIS 
VGDRQLPPSAATVNNRCFQQLDEGYDEEYGGFAEAPKFPTPVILSFL 
FSYWLSHRLTQIX5SRAQQMALHTLKMMANGGIRDHVGQGFHRYSTDR 
QWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARS 
LSHRSGGFYSAEDADSPPERGQRPKEGAYYVWTVKEVQQDLPEPVLG 
ATEPLTSGQLLMKHYGLTEAGNI SPSQACELADKGHS PQD PKGELQG 
QNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQARKHRPKPHL 
DSKMLAAWNGLMVSGYAVTGAVLGPRQXLINYATNGAKFLKRHMFDV 
ASGRLDADLLHR/HLGGLWSTATPPCWGFLEDYAFWRGIiUDLYEAS 
QESAWLEWALRLQDTQDRLFWDSQGGGYFCSEA\DWGAGLPLRLKDA 
QDEAEPSANSVSAHNLLRLHGFTGHKDWMDKCVCLLTAFSERMRRVP 
VALPEMVRALSAQQQTLKQIVICGDRQAKDTKALGAVRPLCLHS 
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3687 

A 

5 

188 

NTFYWBRSWFFQAHILPKCKWLFLQSGSRIVSFSWMVGSRSILPWIP 

tt oljliJDJjQMCiVVTIj 

3688 


AO A 


tilTl'Si? J-SKXibCLiSFCLIiLET/CPVTCWCWEAPRCNQKCTDPAARRPD 
PQTCASQDRLRCAPCTCHQPLASR 

3689 

n 

*» 

•5 ft A 

MCFQRKS YFH IQTL YCPL I C I S ALRYS TGPAPAMWKCHTAPVQKDWL 
TPLPITLISSPNFLTLINL* 

3690 

c 

57 

9 n q 

z U J 

Mli V AAAAAAA XXJCXXXXXXXXXXXXX 

GPL* 

3691 

A 

363 

424 

KEPDLEPPWILIGELPSVWI 

3692 

A 

2 

264 

KFSLI KLRKENFPTARI CRLVYPLYPLKI FI FPKALNFCREVGPICP 
PPKK\GSFPKIPKVAFNRPPLRKKIIPSQPALNWGPPRVP 

3693 

A 

1 

237 

SVIFVFYSMFQDACELDVIEISLFVNRCFSVQLIHFLICKLIPHGCH 
QLSQVIFLNGACCFFIKASRS ISDDI FRICAI 

3694 

A 

230 

946 

RERDGGGGELSRCG I PHAVNS FLQFPADLQKLTSNLRT I DLSNNKI E 

SLPPLLIGKFTLLKSLSL1WNKLTVT,PDEICNLKKL\ETLSLNNNHL 

RELPSTFGQLSALKTLSLSGNQLGALPPQLCSLRNLDVKDLSKN\QI 

\EGIPDSVGELQVIDLNLNQSQISQISGEFTNSCWPRI^KILCI*\EKN 

CLELSMLPQSILSDSQICLLAVEGNLFEIKKLRELEGYDKYMERFTA 
TKKKFA 

3695 

A 

2 

201 

GRVGSDSCQRQQQGDTKRRRL S S YPWRRRQRWRRRR \ TS S RRKPQFL 
Ky Ky K PRQARRR WRATR FA 

3696 

A 

31 

445 

EKAVWSAASSLWTVYTAQPAPRSFHPFCSTWSAKEPNTTAGEGGS 
TNECQCLSA/CPGANCPGLGTLKSPQNCPSSSLPRPPLPACRGSYES 
WARLLQSvTmiGQRRLRESWSPKLYWVHPPQEVVGAFRTFLFLV 

3697 

A 

73 

404 

AYSRGTS SLSTMNQTAI L I CCL I FLTLSG IQG VPLSRTVRCTCI S I S 
NQPVNPRSLEKLEIIPASQF\CPR\VEIIATMKKKGEKRCLNPESKA 
I KNLLKAVSKERS KRS P 

3698 

A 

507 

1212 

GHRPTWLAFPLLNGVCCSCPLRALFPGIIiDFRVGQQAGKMLSSIKCV 
L\VGDSAVGKTFLV\VRFTSRTLREGYSPPVYENTGVDVFMD\GIQI 
S LG \ LWDTAANDAFRS I R PLS YQQAD WLMC YS VANHNS FLNLK\ NK 
WIGEIRSNLPCTPVLWAT\QTDQR\EMGPHKASCV^TALEG\KKLAQ 

DVR\ARGYLECSALS\NRGVQQVFECAV/RETAVNQARRRNRRRLF\ 
SI\NECKIF 


c* 

"5 *^ n 

4 95 

MSGIALSRLAQERKAWRKDHPFGFVAVPTKNPDGTMNLMNWECAIPG 
KKGTPWEGGLFKXRMLFKDDYPSSXPKCKFEPPLF 

3700 

A 

226 

854 

RVPGTLNMSGIALSRIjAQERKAWRKDHPFGFVAVPTKI^DGTMNIJW 
WECAIPGKKGTPWEGGLFKLRMLFKDEYPSSPPKCKFGPPLFPPNCV 
TSGASCL\SFLEEDKDWRPAITIKQILLGIQELLNEPNIQDPAQAEA 
YTIYCQNRVEYEKRVRAQAKKFAPSCRSSLEPSRTWAVLLLSLRPQV 
S I S VLRHFLCVYMAFCLCCCLE 


A 

813 

1637 

TCQGIALSRLAQEY\KAW\RKDHPFG\YNAVPTKNPDGTMNLM\NWE 

CAI PGKKGTPW\ EGGLFKLRMLFKDDYPS \ SPPK\ CKFE P \ PLFSPR 

IVYPSGAKCACPSLEE\DQRTWRPSPFTIQTRFLLRNYRNFLKWNPN 

I\QDPSFQAEA\ YTI Y\ CQNRV/ ESYEKRVPSTKPRKFCAPHKQRPC 

GNRQKERDWFGQELVYNIFCKSKVCSHTMTSHLGGVGRAPSSHCPPR 

GVRSRIRLNCPVSIQGSSFLRCFCIFGLLCKTRIiFILILMSSISTAV 
KL 

3702 

r> i 

JO 

i JO 

362 

KQE PQ I PCQGGKD AAVLQGS Q P S LAG I RTTRLRD PS KKW PKGX * 

3703 

A 

1 

808 

TPPFIJU4APALLLlPAAIxAJ5FTl^FOTnVT?T?\7PT? , rc;T utdt t npTDrc 
GGPDARQGW\ LAALQDRS I LAPLAWDLGLLLLFVGQHSLMAAERVKA 
WTSRYFGvTjQRSLYVACTALALQLVT^RYWEPIPKGPVXWRARAEPWA 
TWVPLLCFVLHVI SWLL I FS \ I FLVFDYAELMGLKQVYYHVLGLGEP 
LALKSPRALRLFSHLRHPVC^LLTVLWVVPTLGTDRLLLAFLLTLY 
LGLAHGLDQQDLRYLR\AQLQRKLHLLSRPQDGEAE 

3704 

A 

221 

1728 

GQAGCSDTMGSCCSCLNRDSVPDNHPTKPKVTNVDDEGVELGSGVME 
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IiTQSELVLHLHRREAVRWPYLCLRRYGYDSNLFSFESGRRCQTGHGI 
FAFKCSRAEBIFNLLQDLMQCNSINVMEEPVIITRNSHPAELDLPRA 
PQPPN\ALGYNV\SSFSNGCPGEGPRFSAPQRLSTSSLRHPSLGEES 
THALIAPDEQSHTYVNTPASEDDHRRGRHCLQPLPEGQAPFLPQARG 
PDQRDPQVFLQPGQVKFVLGPTPARRHMVKCQGLCPSLHDPPHHNNN 
NEAPSECPAQPKCTYENVTGGLWRGAGWRLSPEEPGWNGLAHRRAAL 
LHYENLPPLPPWESQAQQLGGEAGDDGDSRDGLTPSSNGFPDGEED 
ETPLQKPTSTRAAIRSHGSFPVPLTRRRGSPRVFNFDFRRPGPEPPR 
QLNYIQ\VK1»KGWGGDRPKGP\QNPSSPQAPMPTTHPARSSDSYAVI 
DLKKTVAMSNLQRALPRDDGTARKTRHNSTDLPL 

3705 

A 

1143 

1849 

PAI L S PNLDDLNFS VRAS VPTL AP SAPS AS S S PAR I PAATRQGRRS P 
R I PPAPSNEAS PAPGRPRVPGAA\ P PRFPKPT I QASGADRAVDCG I L 
KLQKS\PARLARPRWPRRPSKRFWSAAGSVEEQPKPPKAP/PAKSPE 
QSPQLSVLPLATITPQAYGASFEKMKNLKTRRGGRSRWHFPFNVIPQ 
APSLKWGFQTSDASSAEVFHVDRSLASHKMGPGRRPRSPLWAPPFWN 
PRH 

3706 

A 

130 

360 

SLPKKKNPDLHGFVGKF YQTFKEKI PAS S S SNKF / PQDI YEAS ITL/ 
I PKPDQD FRKKENC S S VS L 1 1 FPE I IN KI LEN 

3707 

A 

46 

292 

DQ71SSGNGVTMIARRNPDTI.RFLPDEARSLPPPKLTDPRLLYIGFLG 
YCSGLIDNV\ IRRRPIATADKKTYGEIFEKFHPIR 

3708 

A 

269 

476 

STCLGLPRCWNYRREQPRLVYM* FVHLYYF 

3709 

A 

448 

894 

DQASSGNGVTMIARRNPEPLRFLPDEARSLPPPKLTDPRLLYIGFLG 
YCSGL I DNVI RRRP I \ ATAG \ LH\ RQLLY I NGLFFLLG YYLVK\ RED 
YLY AVRDREMFG I YGNLHPEDFFLKEDK\ KTYGE I FLKNSHPHTLKV 
LQKCLAPSFH 

3710 

A 

629 

790 

SLRGEQLRPWAPGDLSVMSFRDLRSKAAPRPCDLRVFQRFRAPARRV 
PCVGDP 

3711 

A 

2 

323 

KSGSNDDSDIDIQEDDESDSELEERRLPKPQTAMEMLMQGRPGKRIV 
GTMQGGDSDDNEDSEESEIDMEDDDDEDDDLEDESISLSPTKPNRRV 
RKSEPLDESDNDF 

3712 

A 

3628 

3908 

AS S S LHKS ACQS HPRC P S LHTVAAAP AEDRXiL PFLSGTD S PDPAAS S 
PFPALCRCLAAPAVPSGQSSSRLAQRAVLGVGGPLVPVPALPAPGG 

3713 

A 

70 

1296 

GGEQLRPWAPGDLSVMSFRDLRNFTEMMRALGYPRHISMENFRTPNF 
GLVSEVLLWLVRRYEPQTDIPPDVDTEQDRVFFIKAIAQFMATKAHI 
KI^TKKLYQADGYAVKELLKITSVLYNAMKTKGMEGSEIVKEDVNK\ 
FKFDLGSK\IAD\LKAARQLAS\EITSKGASLY/DLSLGMEVE\LRE 
MRTEAI ARPLEINETEKVKR I AI KE I LTQVQKTKDLLNNVASDEANI. 
EAKI EKRKLELERNRKRLETLQS VRPCFMDE YEKTEEEIiQK\ QYDTY 
LEKFQNLTYLEQQLEDHHRMBQERFEEAKNTL/ CAVI QNKLKEEEKR 
LLKSGSN\DDSDIDIQED\DES\DSELEERRLPK\P\QTAMEM\LMQ 
GRPGKRIVGTMPGXGDSDDNVSPPLPSWLCTLGIAVPIDGSG 

3714 

A 

1 

948 

MDKKDQVYLNSQVVSAAI GPKRl^SLS KS VTLTFQHVKMTPSTKKV 

C\T^KSTGQGSQWSRDGCFLIHWKSHTMCNCSHLSSFAVI^1A^ 

EEDPVLTVITYVGIiSVSLIiCLLIiAALTFLLCKAIRNTSTSIiHLQLSL 

CLFLAHLLFLVG IDRTEPKYLCS I IAGALHYLYLAAFTWMLLEGVHL 

FLTARNLTVVNYSSINRLMKW1MFPVGYGVPAVTVAISAASWPHLYG 

TADRMLAFKATAQLFILGCTWCLGLLQVGPAAQVMAYLFTIINSLQG 

FFI FLVYCLLSQQAKHKFTPMEVPRQNHRCLQQN 

3715 

A 

644 

2036 

AI ETQAI TDNCSEERKTFN1*NVQMNSMDI RCSDI I QGDTQGPS VI AF 
ISYSSLGNIINATFFEEI^KJCDQWLNSQWSAAIGPKRNVSLSKSV 
TLTFQHVKMTPSTKKVFCVYWKSTGAG/ CSQWSRDGCFL IHVNKSHT 
MCNCSHIiSSFAVLMALTSQEEDPVLTVITYVGLSVSLLCLL\LAALT 
FLLCKAIQNTSTSIiHLQLSLCLFLAHLLFLVAIDQTGHKVLCS 1 1 AG 
TLHYLYIATLTWMLLEALYLFLTARNIiTVVNYSS INRFMKKLMFPVG 
YGVPAVTVAISAASRPHLYGTADRCWLHLDQGFMWSFLGPVCAIFSA 
NLVLFILVFWILKRKLSSLNSEVSTIQNTRMLAFKATAQLFILGCTW 
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CI^GLLQVGPAAQVMAYLFTI INSLQGF\ FI FL\ VYCL/liSAQQVQKQ 
YQKWFREIVKSKSESEITTLSSKMGPDSKPSEGDVFPRTSEEKIL 

3716 

A 

1 

451 

FFFFFFVRQDLALSPRLECNGIIFAHCNLCLPDSSDPP\TSAFQVAR 
TTGRNYYAQLI \ LYFFVETEFAMLTRAGLNLLSSRGPLASASQSVGD 
YR\NSHHAQLIFCIFGRDR\VSLW\CPEWSQTPEFKRSACLPKCWDY 
RGEPPHPVP5AL 

3717 

A 

" 1 

5898 

MAFVATQGATWDQTTLMKKYIiQ F V AALTD VNTQH 1 1 PRFLTFLQDG 
EVQFLQEKPAQQLRKLVLE I IHR I PTNEHLRPHTKNVLS VMFRFLET 
ENEENVLICLRIIIELHKQFRPP1TQENRYFENPQVIPENTVPPPEM 
VGM I TT I AVKVN PERED S ETRTL YKLN I HNWAE F VPL I MNT I AI Q V 
SAQARQHKLYNKELYADFIAAQIKTLSFLAYIIRIYQELVTKYSQQM 
VKGMLQLLSNCPAETAHLRKELLIAAKHILTTELRNQFIPCMDKLFD 
ES I L IGSG YTARETLRPIAY STLADLVHHVRQHLPLSDIiS LAVQLFA 
KNIDDESLPSSIQTMSCKLLLNLVDCIRSKSEQESGNGRDVLMRMLE 
VFVLKFHTIARYQLSAI FKKCKPQSELGAVEAALPGVPTAPAAPGPA 
PSPAPVPAPPPPPPPPPPATPVTPAPVPPFEKQGEKDKEDKQTFQVT 
DCRSLVKTLVCGVKTITWGITSCKAPGEAOFI PNKQLQPKETQIYI K 
LVKYAMQALDIYQVQIAGNGQTYIRVANCQTVRMKEEKEVLEHFAGV 
FTMMNPLTFKEIFQTTVPYMVERISKNYALQIVANSFLANPTTSALF 
ATILVEYLLDRLPEMGSNVELSNLYLKLFKLVFGSVSLFAAENBQML 
KPHLHKIVNSSMELAQTAKEPYNYFLLLRALFRSIGGGLNMLQSGLH 
KQHMKDLFVELCLTVPVRLSSLLPYLPMLMDPLVSALNGSQTLVSQG 
LRTLELCVDNLQPDFLYDHI QPVRAELMQALWRTLRNPADS ISHVAY 
RVLGKFGGSNRKMLKESQKLHYVVTEVQGPSITVEFSDCKASLQLPM 
EKAIETALDCLKSANTEPYYRRQAWEVIKCFLVAMMSLEDNKHALYQ 
LLAHPNFTEKTI PNVI ISHRYKAQDTPARKTFEQAIiTGAFMSAVI KD 
LRPSALPFVASLIRHYTMVAVAQQCGPFLLPCYQVGSQPSTAMFHSB 
ENGS KGMDPLVIi IDAI AI CMAYEEKELCKIGE VALAVI FDVAS 1 1 LG 
SKERACQLPLFS YI VERLCACC YEQAWYAKLGG WS I KFLMERLPLT 
WVLQNQQTFLK7VLLFVMMDLTGEVSNGAVAMAKTTLEQI>LMRCATPL 
. KDEERAEEIVAAQEKSFHHVTHDLVREVTSPNSTVRKQAMHSLQVLA 
QVTGKSVTVIMEPHKEVLQDMVPPKXHLLRHQPANAQIGLMEGNTFC 
TTL.QPRLFTMDLNWEHKVFYTESREKI IAALFKALNSTNSELQEAG 
EACMRKFLEGATIEVDQIHTHMRPLLI^LGDYRSLTUWVNRLTSVT 
RLFPNSFNDKFCDQMMQHLRKWMEVWITHKGGQRSDGNEMKICSAI 
INLFHL I PAAPQTLVKPLLEWMKTERAMIilEAGSPFREPLI KFLTR 
HPSQTVELFMMEATLNDPQWSRMFMS FLKHKDARPLRDVIiAANPNRF 
ITLLL.PGGAQTAVRPGSPSTSTMRLDLQFQAIKI IS I IVKNDDSWLA 
SQHSLVSQLRRVWVSENFQERHRKENMAATNWKEPKLLAYCLLNYCK 
RNYGDIELLFQLLRAFTGRFLCNMTFLKEYMEEEIPKNYSIAQKRAL 
FFRFVD FND PNFGDELKAKVLQH I LNPAFLYS FEKGEGEQLIK3PPNP 
EGDNPESITSVFITKVLDPEKQADMLDSLRIYLLQYATLLVEHAPHH 
IHDlWK^n^SKLRRLMTFAWPCLLSKAC^PACKYSGHLLIiAHIIAK 
FAI HKK I VLQVFHS LLKAHAMEARAI VRQ AMAI LTPAVPARMEDGHQ 
MLTHWTRKI I VEEGHTVPQLVHI LHLIVQHFKVYYPVRHHIjVQHMVS 
AMQRLG FTPS VT I EQRRLAVDL SE WI KWELQR I KDQQKS VS KTKRL 
CVFKPDSDMDPNSSGEGVNSVSSSIKRGLSVDSAQEVKRFRTATGAI 
S AVFGRS QS L PGAD S LLAKP IDKQHTDTWNFL I RVACQ 

1 "7 1 Q 

i / lo 

TV 

1 

333 

\7VVD\7TDUTTT \7f~\TlM\TC AMHD T rPTDCWTUADDT 7AA7T\T O TTXrM'T .O 

RIKDMKSVSKTKRLCVFKPDSDMDPNSSGEGVNSVSSSIKRGLSVD 
S AQE VKRFRTATGAI S A 

3719 

A 

1 

5699 

VITONTNTAGSPGE^SRRCVNLLKTALRPDMWPKSELKLQWFDKLLM 
TVEQPNQVNYGNI CTGLEVLS FLLTVLQS PAI LS S FKPLQRG I AACM 
TCGNTKVLRAVHSLLSRLMSIFPTEPSTSSVASKYEELECIiYAAVGK 
VIYEGLTNYEKATNANPSQLFGDQGSWVTLAPGTLMILKSACSNNPS 
YIDRLISVFMRSLQKMVREHLNPQAASGSTEATSGTSELVMLSLELV 
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KTRLAVMSMEMRKNF I QA I LTS L I EKS PDAKI LRAWKI VEEWVKNN 
SPMAANQTPTLREI^ILLVKMMTyiEKRFPEDLELNAQFLDLVNyVY 
RDETLSGSELTAKLEPAFLSGLRCAQPLIRAKFFEVFDNSMKRRVYE 
RLLYVTCSQNWEAMGNHFWIKQCIEVTSAVQWLVHLPGVNETRSPR 
TMS PETAGTEGGLVLFDEKKGLLLAVCEKSTPIGTSCQGAMLPS ITN 
VINUVDSHDRAAFAMVTHVKQEPRERENSESKEEDVEIDIELAPGDQ 
TS T PKTKEL S E K\ D I GNQLHMLTNRHDKFLDTLRE VKTGALLS AFVQ 
LCH I STTLAEKTWVQLFPRLWKI LSDRQQHALAGE I SPFLCSGSHQV 
QRDCQPS7VLNCFVEAMSQCVPPIPIRPCVLKYLGKTHNLWFRSTLML 
EHQAFEKGLS LQ I KP KQTTEF YEQE S I TPPQQE I LDS LAEL YSLLQE 

EDMWAGLWQKRCKYSETATAIAYEQHGFFEQAQESYEKAMDKAKKEH 

ERSNAS PAI FPE YQLWEDHW IRCS KELNQWEALTE YGQS KGHINPYL 

VLECAWRVSl^TAMKEALVQVEVSCPKEimWKVNMYRGYLAICHPEE 

QQLSFIERLVEMASSLAIREWRRLPHWSHVHTPLLQSLLCWSHQAA 

QQIIELQEAAQINAGLQPTNLGRNNSLHDMKT\ATKTWRNRLPIVSDD 

LSHWSSIFMWRQHHYQAIVTAYENSSQHDPSSNNAMLGVHASASAII 

QYGKIARKQGLVNVALDILSRIHTIPTVPIVDCFQKIRQQVKCYLQL 

AGVMGKl^CMQVRTGHLKNEARSEEANKAFSAAVQMHDVLVKAWAMW 

GDYLENIFVKERQIJILGVSAITCYLHACRHQNESKSRKYIAKVLWLL 

SFDDDKNTLADAVDKYCIGVPPIQWIAWIPQLLTCI^VGSEGKIiLIjNIj 

ISQVGRVYPQAVYFPIRTLYLTLKIEQRERYKSALRALASVGFALTL 

ALCVLLSDSGQQQPSSVGNQSHSASDPGPIRATAPMWRCSRIMHMQR 

ELHPTLLSSLEGIVDQMVWFRENWHEEVLRQLQQGLAKCYSVAFEKS 

GAVSDAKITPHTLNFVKKLVSTFGVGLENVSNVSTMFSSAASESLAR 

RAQATAQDPVFQKLKGQFTTGHTLRTPALGPALVWVQHATAAVSSTR 

QPASLGQLTNKWGRPQPSSLPQRWDQRALCGPGPCFAAVLVGFVLTS 

SRPDFDFSVPGSMKIJWLISKLKKWIKILEAKTKQLPKFFLIEEKCR 

FLSNFSAQTAEVGNSWGSFCMPKPT\HYYIKIARFMPRVEIVQKHNT 

AARRLYIRGHNGKIYPYLVMNDACL/TQSSRREERVLQLLRSAGTPV 

WEKRKETTKRHLFFTVPRWAVSPQMRLVEDNPSSLSLVEIYKQRCA 

KKGIEHDNPISRYYDRLATVQARGTQASHQVLRDILKEVQSNMVPRS 

MLKEWALHTFPNATDYWTFRKMFTIQLAIilGFAEFVLHLiNRtiNPEMli 

QIAQDTGKLNVAYFRFDINDATGDLDANRPVPFRLTPNISEFLTTIG 

VSGPLTASMIAVARCFAQPNFKVDGILKTVLRDEIIAWHKKTQEDTS 

SPLSAAGQPENMDSQQLVSLVQKAVTAIMTRLHNLAQFEGGESKVNT 

LVAAANS LDNLCRMDPAWHP WL 

3720 

A 

109 

11749 

ARRGERRLVELMDPILFSGEANQPKRKMAFVATQGATEVDQTTLMKK 
YLQFVAALTDVNTPDETKLKMMHEVSENFENGTS S PQYS TFLEH IIP 
RVLTFLQDGEVQFLQEKPAQQLRKLVLEI IHRI PTNEHLRPHTKim, 
S VMFRFLETENEENVL I CLRI 1 1 ELHKQFRPP ITQE IHHFLDFVKQI 
YKELPKVVNR YFENPQVI PENTVPPPEMVGM I TTI AVKVNPEREDS E 
TRTHSIIPRGSLSLKVLAELPIIVVL^QL^^ 

NT IA I QVS AQARQHKL YNKEL YAD F I AAQ I KTLS FLAY 1 1 R I YQEL V 
TKYSQQMVKGMLQLLSNCPAETAHLRKELLIAAKHILTTELRNQFIP 
CMDKLFDESILIGSGYTARETLRPLAYSTLADLVHHVRQHLPLSDLS 
LAVQLFAKNIDDESLPSSIQTMSCKLLLNLVDCIRSKSEQESGNGRD 
VLMRMLEVFVLKFHTIAR YQLSAI FKKCKPQSELGAVEAALPGVPTA 
PAAPGPAPSPAPVPAPPPPPPPPPPATPVTPAPVPPFEKQGEKDKED 
KQTFQVTDCRSLVKTLVCGVKTITWGITSCKAPGEAQFIPNKQLQPK 
ETQ I Y I KL VKYAMQALDI YQVQ I AGNGQTY I RVANCQTVRMKEEKE V 
iicnr rtv* vri i*u*iw fu I to ls_b X FQTTVP YMVER I S KNYALQI VANS FLAN 
PTTSALFATILVKYLLbRLPEMGSNVELSNLYLKLFKLVFGSVSLFA 
AENEQMLKPHLHKIVNSSMELAQTAKEPYNYFLLLRALFRSIGGGSH 
DLLYQEFLPLLPNLLQGLNMLQSGLHKQHMKDLFVELCLTVPVRLSS 
LLPYLPMLMDPLVSALNGSQTLVSQGLRTLELCVDNLQPDFLYDHIQ 
PVRAELMQALWRTLRNPADS I SHVAYRVLGKFGGSNRKMLKESQKLH 
YWTEVQGPSITVEFSDCKASLQLPMEKAIETALDCLKSANTEPYYR 
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RQAWEVIKCFLVAMMSLEDNKHALYQLLAHPNFTEKTI PNVI I SHRY 
KAQDTP ARKTFEQALTGAFMS AVI KDLRP S ALPFVAS L I RHYTMVAV 
AQQCGPFLLPCYQVGSQPSTAMFHSEENGSKGMDPLVLIDAIAICMA 
YEEKELCKIGEVALAVI FDVAS I ILGSKERACQLPLFS YI VERLCAC 
CYEQAVnfAKLGGWSIKFLMERLPLTPm,ONQQTFLKALLFVMMDLT 
GEVSNGAVAl^KTTLEQLLMRCATPLKPEERAEEIVAAQEKSFHHVT 
HDLVREVTSPNSTVTIKQAMHS LQVLAQVTGKSVTV I ME PHKEVLQDM 
VPPKKHLLRHQPANAQIGLMEGNTFCTTLQPRLFTMDLNWEHKVFY 
TELLNLCEAEDSALTKLPCYKSLPSLVPLRIAAIiNALAACNYLPQSR 
EKI I AAL FKALNS TNS BLQEAGEACMRKFLEG AT I E VDQ I HTHMRP L 
LMMLGDYRSLTLNVVNRLTSVTRLFPNSFOTKFCDQMMQHLRKWMEV 
WI THKGGQRSDGNEMKI CS AI INL FHL I PAAPQTLVKPLLEWMKT 
ERAMLIEAGSPFREPLIKFLTRHPSQTVELFMMEATLNDPQWSRMFM 
S FLKHKDARPLRD VLAANPNRF I TLLL PGGAQTAVRPGS PS TSTMRL 
DLQFQAI KI I S 1 1 VKNDDS WLASQHSLVSQLRRVWVSENFQERHRKE 
NMAATMWKEPKLLAYCLIjNYCK31NYGDIELLFQLLRAFTGRFLC1WT 
FLKEYMEEEIPKNYSIAQKRALFFRFVDFNDPNFGDELKAKVLQHIL 
NPAFLYSFEKGEGEQLLGPPNPEGDNPESITSVFITKVLDPEKQADM 
LDS LR I YLLQ YATLLVEHAPHH IHDNNKNRNS KLRRLMTFAWPCLLS 
KACVDPACKYSGHLLLAHI IAKFAIHKKIVLQVFHSLLKAHAMEARA 
IVRQAMAILTPAVPARMEDGHQMLTHWTRKI I VEEGHTVPQLVHILH 
LIVQHFKVYYPVRHHLVQHMVSAMQRLGFTPSVTIEQRRIAVDLSEV 
VIKWELQRIKDQQPDSDMDPNSSGEGVNSVSSSIKRGLSVDSAQEVK 
RFRTATGAISAVFGRSQSLPGADSLLAKPIDKQHTDTVVNFLIRVAC 
QVNDNTNTAGS PGEVLSRRCVNLLKTALRPDMWPKS ELKLQWFDKLL 
MTVEQPNQVNYGNICTGLEVLSFL.LTVIjQSPAI ls sfkpl»qrgiaac 
MTCGNTKVLRAVHSLLSRIiMSIFPTEPSTSSVASKYEELECLYAAVG 
KVIYEGLTNYEKATNANPSQLFGTLMILKSACSNNPSYIDRliISVFM 
RSLQK^^VREHLNPQAASGSTEATSGTSELVMLSLELV1CTRI*AVMSME 
MRKNFIQAILTSLIEKSPDAKILRAVVKIVEEVAn^NNSPMAANQTPT 
LREKS I LLVKMMTYI EKRFPEDLELNAQFLDLVNYVYRDETLSGSEL 
TAKLEPAFLSGLRCAQPLIRAKFFEVFDNSMKRRVYERLLYVTCSQN 
WEAMGNHFWI KQCI ELLLAVCE KS TP IGTSCQGAMLPS ITNVINLAD 
SHDRAAFAMVTHVKQEPRERENSESKEEDVEIDIELAPGDQTSTPKT 
KELSEKDIGNQLHMLTNRHDKFLDTLREVKTGALLSAFVQLCHISTT 
LAEKTWVQLFPRLWKILSDRQQHALAGEISPFLCSGSHQVQRDCQPS 
ALNCFVEAMSQCVPPIPIRPCVLK^LGKTHNLWFRSTLMLEHQAFEK 
GLSLQIKPKQTTEFYEQESITPPQQEILDSIiAELYSLLQEEDMWAGI* 
WQKRCKYSETATAIAYEQHGFFEQAQESYEKAMDKAKKEHERSNASP 
AI FPE YQLWEDHW I RCSKELNQWEAI/TE YGQS KGH INP YLVLECAWR 
VSNWTAMKEALVQVEVSCPKEMAWKVNMYRGYLAICHPEEQQLSFIE 
RLVEMASSLAIREWRRIiPHWSHVHTPLLQAAQQIIELQEAAQINAG 
LQPTNLGRNNSLHDMKTVVKTWRNRLPI VSDDLSHWSS I FMWRQHHY 
QAI VTAYENS SQHDPS SNNAMLGVHAS ASAI I QYGKI ARKQGLVNVA 
LD I LSRIHTI PTVP IVDCFQKI RQQVKC YLQLAGVMGKNECMQGLEV 
I E S TNLKYFTKEMTAEF YALKGMFLAQINKS EEANKAFS AAVQMHDV 
LVKAWAMWGDYLENIFVKERQLHLGVSAITCYLHACRHQNESKSRKY 
IJVKVLWLLSFDDDKNTLADAVDKYCIGVPPIQWIiAWIPQIiLTCLVGS 
EGKLLLNLISQVGRVYPQAVYFPIRTLYLTLKIEQRERYKSDPGPIR 
ATAPMWRCSR I MHMQRELHPTLLS SLEG I VDQMVWFRENWHEEVLRQ 
LQQGLAKCYSVAFEKSGAVSDAKITPHTIiNFVKKLVSTFGVGLENVS 
1WSTMFSSAASESLARRAQATAQDPVFQKLKGQFTTDFDFSVPGSMK 
LHNLISKLKKWIKILEAKTKQLPLFFLMEEYCRFLSNFSSQTAEVD 
IPGEFLMP\KPTHYYIKIARFMPRVEIVQKH\NSAARRIi\YIRGHNG 
\ KI YP YLRS WNDALPSQS SRARRERWWQLLRLLYPFLEKIKETTKR 
HLFFTVPRWAVS PQMRLVEDNPS SLS LVE I YKQRCAKKGI EHDNP I 
SRYYDRLATVQARGTQASHQVLRDILKEVQSNMVPRSMLKEWALHTF 
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PNATDYWTFRKMFTIQIJUjIGFT^FVLHIJ^RI^PEMLQIAQDTGKIjN 
VAYFRFDINDATGDLDANRPVPFRIiTPNISEFIiTTIGVSGPLTASMI 
AVARCFAQPNFKVDG I LKTVLRDE 1 1 AWHKKTQEDTS S PLSAAGQPE 

NI^SQQLVSLVQKAVTAIMTRLHOTiAQFEGGESKVNTLVAAANSL\D 
NLCRMDPAWHPWACDCGRHGTRECEGRSGL 

3721 

A 

106 

11708 

EANQPKRKMAFVATQGATVVDQTTLMKJCYLQFVAALTDVNTPDETKL 
KMMQEVSENFENVTSSPQYSTFLEHIIPRFLTFLQDGEVQFLQEKPA 
QQLRKLVLE I IHRI PTNEHLRPHTKNVLS VMFRFLETENEENVLICL 
RIIIELHKQFRPPITQEIHHFLDFVKQIVKELPKWNRYFENPQVIP 
ENTVPPPEMVGMITTI AVKVNPEREDSETRTHS 1 1 PRGSLSLKVLAE 
LPIIWIiMYQLYKLNIHNWAEFVPLIMNTIAIQVSAQARQHKLYNK 
ELYADFIAAQIKTLSFLAYIIRIYQELVTKYSQQMVKGMLQLLSNCP 
AETAHLRKELLIAAKHILTTELRNQFIPCMDKLFDESILIGSGYTAR 
ETLRPLAYSTLADLVHHVRQHLPLSDLSLAVQLFAKNIDDESLPSSI 
QTMSCKIiLLNLVDCIRSKSEQESGNGRDVLMRMLEVFVLKFHTIARY 
QLSAIFKKCKPQSELGAVEAALPGVPTAPAAPGPAPSPAPVPAPPPP 
PPPPPPATPVTPAPVPPFEKQGEKDKEDKQTFQVTDCRSLVKTLVCG 
VKT I TWG I TS CKAPGEAQF I PNKQLQPKETQ I Y I KLVKYAMQALD I Y 
QVQIAGNGQTYIRVANCQTVRMKEEKEVLEHFAGVFTMMNPLTFKEI 
FQTTVP YMVDR I S KNYALQ I VANS FLANPTTSALFATILVEYLLDRL 
PEMGSNVELSNLYLKLFKLVFGSVSLFAAENEQMLKPHLHKIVNSSM 
ELAQTAKEPYNYFLLLRALFRSIGGGSHDLLYQEFLPLLPNLLQGLN 

mlqsglhkqhmkdlfvt:lcltvpvrlssllpylpmlmdplvsalngs 

QTLVSQGLRTLELCVDNIiQPDFLYDHIQPVRAELMQALWRTLRNPAD 
S I SHVAYRVLGKFGGSNRKMLKESQKLHYWTE VQGPS I TVE FSDCK 
ASLQLPMEKAIETALDCLKSANTEPYYRRQAWEVIKCFLVAMMSLED 
NKHAIiYQLLAHPNFTEKTI PNVII SHRYKAQDTPARKTFEQALTGAF 
MSAVIKDLRPSALPFVASLIRHYTMVAVAQQCGPFLLPCYQVGSQPS 
TAMFHSEENGSKGMDPLVLIDAIAICMAYEEKELCKIGEVALAVIFD 
VAS I ILGSKERACQLPLFS YIVERLCACCYEQAWYAXLGGWS IKFL 
MERLPLTWVLQNQQTFLKALLFVMMDLTGEVSNGAVAMAKTTLEQLL 
MRCATPLKDEERAEE I VAAQEKS FHHVTHDL VREVTS PNSTVRKQAM 
HSLQVIAQVTGKSVTVIMEPHKEVLQDMVPPKKHLLRHQPANAQIGL 
MEGNTFCTTLQPRLFTMDLNVVEHKVFYTELLNLCEAEDSALTKLPC 
YKSLPSLVPLRIATUjNALAACNYIjPQSREKIIAALFKALNSTNSEIjQ 
EAGEACMRKFLEGATIEVDQIHTHI^PLLMMLGDYRSLTIjNVVNRIjT 

svtrlfpnsfndkfcdqmmqhlrkwmevvvithkggqrsdgnesise 
cgrcplspfcqfeemkicsaiinlfhlipaapqtlvkpllewmkte 
ramlieagspfreplikfltrhpsqtvelfmmeatlndpqwsrmfms 

FLKHKDARPLRDVLAANPNRF I TLLLPGGAQTAVRPGS PSTSTMRLD 

lqfqaikiisiivknddswlasqhslvsqlrrvwvsenfqerhrken 
maatowkepkllaycllnyckrnygdiellfqllraftgrflo^ 
lke ymeeei pknys I aqkralffrfvdfndpnfgdelkakvlqhiln 
paflysfekgegeqllgppnpegdnpesitsvfitkvldpekqadml 
dslriyllqyatllvehaphhihdnnknrnsklrrlmtfawpcllsk 

ACVDPACKYSGHLLLAHI IAKFAIHKKIVLQVFHSLLKAHAMEARAI 
VRQAMAI LTPAVPARMEDGHQMLTHWTRKI I VEEGHTVPQLVHI LHL 
IVQHFKVYYPVRHHLVQHMVSAMQRLGFTPSVTIEQRRLAVDLSEVV 
I KWELQR I KDQQPDSDMDPNS SGEG VNS VS SSI KRGLS VDS AQEVKR 
FRTATGAISAVFGRSQSLPGADSLLAKPIDKQHTDTWNFLIRVACQ 
vxvum xim AAVjo±r^c,Vij&KK^VNijljKTAIjRPDMWPKSELKXiQWF 
TVEQPNQVNYGNICTGLEVLSFLLTVLQSPAILSSFKPLQRGIAACM 
TCGNTKVIiRAVHSLLSRIiMSIFPTEPSTSSVASKYEELECLYAAVGK 
VI YEGLTNYEKATNAN PS QL FGTLM I LKS ACSNNPS Y I DRLiI S VFMR 

SLQKMVREHLNPQAASGSTEATSGTSELVMLSLELVKTRLAVMSMEM 
RKNFIQAILTSLIEKSPDAKILRAWKIVEEV7VKNNSPMAANQTPTL 
REKSILLVKMMTYIEKRFPEDLELNAQFLDLVNYVYRDETLSGSELT 
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AKLEPAFLSGLRCAQPLIRAKFFEVFDNSMKRRVYERLLYVTCSQNW 
EAMGNHFWIKQCIELLLAVCEKSTPIGTSCQGAMLPSITNVINLADS 
HDRAAFAIWniVKQEPRERENSESKEEDVEIDIELAPGDQTSTPKTK 
ELSEKDIGNQLHMLTNRHDKFLDTLREVKTGALLSAFVQLCHISTTL 
AEKTWVQL FPRLWKI LSDRQQHALAGE I S PFLCSGSHQVQRDCQPS A 
LNCFVEAWSQCVPPIPIRPCVLKYLGKTHNIjWFRSTLMLEHQAFEKG 
LSLQIKPKQTTEFYEQESITPPQQEILDSLAELYSLLQEEDMWAGLW 
QKRCKYSETATAIAYEQHGFFEQAQESYEKAMDKAKKEHERSNASPA 
IFPEYQLWEDHWIRCSKELNQWEALTEYGQSKGHINPYLVLECAWRV 
SNWTAMKEALVQVEVSCPKEMAWKVNMYRGYLAICHPEEQQLSFIER 
LVEMAS S LAI RE WRRLPHWS HVHTPLLQAAQQ 1 1 ELQEAAQ I NAGL 
QPTNLGRNNSLHDHKTWKTWRNRLPIVSDDLSHWSSIFMWRQHHYQ 
GKPTWSGMHSSSIVTAYENSSQHDPSSNNAMLGVHASASAIIQYGKI 
ARKQGLVNVALDILSRIHTIPTVPIVDCFQKIRQQVKCYLQLAGVMO 
KNECMQGLEVIESTNLKYFTKEMTAEFYAIiKGMFLAQINKSEEANKA 
FSAAVQMHDVLVKAWAMWGDYLENIFVKERQLHLGVSAITCYLHACR 
HQNES KS RKYLAKVLWLLS FDDDKNTLADAVDKYC IG VPPIQWLAW I 
PQLLTCLVGSEGKLLLNLISQVGRVYPQAVYFPIRTLYLTLKIEQRE 
RYKSDPGPIRATAPMWRCSRIMHMQRELHPTLLSSLEGIVI>QMVWFR 
ENWHEEVIjRQLQQGLAKCYSVAFEKSGAVSDAKITPHTLNFVKKLVS 
TFGVGLENVSNVSTMFS S AAS ESLARRAQATAQDPVFQKLKGQFTTD 
FDFSVPGSMKLHNLISKLKKWIKILEAKTKQLPKFFLIEEKCRFLSN 
FSAQTAEVEIPGEFLMPKPTHYYIKIARFMPRVEIVQKHNTAARRLY 
IRGHNGKIYPYLVMNDACLTESRREERVLQLLRLLNPCLEKRKETTK 
RHLF FTVPR WAVS PQMRL VEDNPS S LS L VE I YKQRCAKKG I EHDNP 
I S R YYDRLATVQARGTQASHQVLRD I LKEVQSNMVPRSMLKEWALHT 
FPNATDYWTFRKMFTIQIALIGFAEFVLHLNRLNPEMLQIAQDTGKIi 
NVAYFRFDINDATGDLDANRPVPFRLTPNISEFLTTIGVSGPLTASM 
I AVARCF AQPNFKVDG I LKTVLRDE 1 1 AWHKKTQEDTS S PLS AAGQP 
ENMDSQQLVSLVQKAVTAIMTRLHNLAQFEGGESKVNTIjVAAANSLD 
NLCRMDPAWHPWL 

3722 

A 

2 

581 

RRFERRPMRPHEVYTPPLVLQPLSLVSPWKSSQHIF\RVLNT\NID\ 
GRRKNS PL P I TAHLR VWARRYAHWLRKAD I DLTK\ RAGELT\ EDB\ 
VERVIT/ 1 SMQNPRQ YKDPKTGFFEQDKKDVKDWKI TARVLAQWVWT 
TSSRERPGSRLEERFRAHKKGL\RHFLGGLSCSEGQAHPRPTWPAVG 
RHRGASAGARR 

3723 

A 

6 

297 

GIRPRKREQAASEDADKGPKEPLEYGKAKLSFQSCECGKAFQRHDHL 
ARHRSPIL\HLKDKARPFQCRYC\nCSFTQNYDLLRHERLHMKRRSKQA 
LNSY 

3724 

A 

201 

405 

TANVRCLQAQPSAMACPLDQAIGLLVAIFHKYSGREGDKHTLSKKEL 
KEL IQKELT I G S KLQDAE I A 

3725 

C 

78 

422 

MASCPHAGGPAIPCPVQPLDPDWGHLWRSPAGGIVRSSSAVPAPSPA 
LSHGMPPGSGHWLLVAIFHKYSGREGDKHTLSKKELKELIQKELTIG 
S KLQDAE I ARXMEDLDRNKDQ 

3726 

A 

591 

966 

EVLVTMALTRPSAASSKPQPSSPWQCPLDSSHWPPSVAHLPTSYSGR 
E \ GDKHTLS K\ RELKEL I QKG AP PLAS KLQDA\ E I ARL \ MEDL \ DRN 
KDQ\EVNFPRKYVTFPGGAFAFESTIEGLKRVEK ! 

3727 

A 

3 

1274 

GGS S VI FYGL I CMYTLWWMLRA\ SLKKYS FE S I REES S YSD I PDVKN 
DFAFMLHLIDQYDPLYSKRFAVFLSEVSENKLRQLNLNNEWTLDKLR 
UK Jj X jum Ayu IvJj £i IjHXj r M Li SG I PDTVFDL VEL E VLKL E L I PDVT I P P S 
I AQLTGLKELWLYHTAAK I EAPALAFLRENLRALH I KFTD I KE I PLW 
IYSLKTLEELHLTGNLSAENNRYIVIDGLRELKRLKVLRLKSNLSKL 
PQVVTDVGVHLQKLSINNEGTKLILWYNHIAYIPIQIGNL.TNLERLY 
LNRNKI EKI PTQLFYCRKLRYLDLSHNNLTFLPADIGLLQNLQNLAI 
TANRIETLPPELFQCRKLRALHLGNNVLQSLPSRVGELTNLTQIELR 
GNRLECLPVELGECPLLKRSGLVVEEDLFNTLPPEVKERLWRADKEQ 


,i. ooa a + 3 . o 2 e & o s 






A 

3728 

A 

165 

828 

KEKARGRPKPLLLLPITSATTAMGLTISSIjFSRLFGKKQMRILMVGI* 
D\AAGKA\TILYKL\KLGEIVTTIP\TIGFGV\KTIGYRNICFHSMG 
MLGGPDRIRPLWKHYFQ\NT\QGLIFW\DSNDRERIQEVADELHKM 
LLVDELRDP\ VLLLFANKQD\ LPNAM\ AI SEMTDKLWLQSLRNRTWY 
VQATCATQGTGLYEGLDWLSNELSKQLNEIGYLTQGPWFQ 

3729 

A 

191 

1574 

GVSRQDLTRL FG F S S PQPLKKKLDAWLSEDMNYARF I TAAS ARRNPS 
PIRTMTDILSRGPKSMISLAGGLPNPNMFPFKTAVITVENGKTIQFG 
EEMMKRAL\ QYS PSAGI PELLS W\ LKQLQ I KLHNPPTIHYPPSQGQM 
DLCVTSGSQQGLCKVFEMIINPGDNVLLDEPAYSGTLQSLHPLGCNI 
INVASDESGIVPDSLRDILSRW\KPEDAKNPQKNTPKFLYYCFQMGN 
NPTGNS LTSERKKE I YELARKYDFLI I EDDP YYFLQFNKFRVPTFLS 
MDVDGRVIRADSFSKI I S S GLR 1 G F LTGPKPL I ER VI LH I Q VS TLHP 
STFNQLMISQLLHEWGEEGFMAHVDRVIDFYSNQKDAIL7VAADKWLT 
GLAEWHVPAAGMFLW I KVKG I ND VK\ ELI ERKRPLKMGVL\MLP\GN 
AFLRRIAQLPSPYLRASFSSASP\EQMDVAF\QVLAQLIKESL 

3730 

A 

2 

454 

RSFFFFFCEVGSWVGSMRWMARLLSEGEQCIPTACAAFAQQPGGRP 
RRGLAGVGEGGPQCSWVNYRCTLEFLVSLLGTDLARGRGNSATGP\T 
APADSKQLSCKTFIAVLSLSKEAGFCNWQGWVSTSWGSSSPSVPQF 
FPKLLEFTGQ 

3731 

A 

379 

640 

DGVSLCYPGWNTVAQSQLTVGS I SWAQVIL\ QTQPPRVAVAPGRTTV 
YRDRVSLCCPGWSGLIVLGLKPPRVLGITGMGSPQPRPGI 

3732 

A 

373 

598 

YPCPPE/RCAQRHGQACLLPSLERKCVISLPPPSPHQTFAGPKGLGH 
GDALSLGIWPQTPSCEPTQTPKQSNQLVN 

3733 

A 

2 

445 

GRVGGKPTTSSSEACRFCGSRSGTELSAVGSVCSDADCQEYAKIACS 
KTHPCGHPCGGVKNEEHCLPCLHGCDKSATSLKQDADDMCMICFTEA 
LSALDCSHIFHLQCCRRVLENRWLGPRITFGFISCPICKMKINHIVL 
KDLLDP 

3734 

A 

151 

487 

S PRGH YQLLL SGRALADR YRR I YTAALNDRDQGGGS AGH PAS / RSEN 
LENTVI I PDIKLHSNPSAFNI YO^VRHCVLEWQKKEISLAAASKNSV 
QSGESDSDEEEESKEPPIN 

3735 

A 

1 

176 

NPSAFNIYCmnraCVLEWQKKEISLAAASKNSVQSGESDSDEEEESK 
EPPIKLPKVSH 

3736 

A 

1 

6570 

VCAH I VQAI RMEATRVREEWEHA I SS KENANSQPNDEDAS SDAYCFE 
LLSrn^LALSGSNVGRQYLAQQLTLLQDLFSLLHTASPRVQRQVTSLL 
RRVLPEVTPSRLASIIGVKSLPPADISDIIHSTEKGDWNKLGILDMF 
LGC I AKALTVQLKAKGTT ITGTAGTTV GKGVTTVTLPMI FNSSYLRR 
GESHWWMKGSTPTQISEIIIKLIKDMAAGHLSEAWSRVTKNAIAETI 
IALTKMEEEFRSPLWLALASLCVLDQDHVDRLSSGRWMGKDGQQKQM 
PMCDNHDDGETAA 1 1 LCNVCGNLCTDCDRFLHLHRRTKTHQRQVFKE 
EEE A I KVDLHEGCGRTKL FWLMALAD S KTMKAMVEFREHTGKPTTS S 
SEACRFCGSRSGTELSAVGSVCSDADCQEYAKIACSKTHPCGHPCGG 
VKNEEHCLPCLHGCDKSATSLKQDADDMCMICFTEALSAAPAIQNKI 
NHIVL KDLLDP I KEL YEDVRRKALMRLE YEGLHKSEAI TTPGVRFYN 
DPAGYAMNRYAYYVCYKCRKAYFGGEARCDAEAGRGDDYDPRELICG 
ACSD VSRAQVWE YLHYQLS I PNLKS KMLQAALTEQWRLEAFKFAKPB 
KNLKRHQTKGPERRELQEKEPEPQRWSRVGALRGTSASDAGGARGAA 
AAGAR S GG PG APHVRAAG P AAAA VPG ARGG VWGKP P PAAAKI E S G W 
VHADTLEAEVEGLLDLRR S RLQMKRGGRDSDRNS S EEGTAEKS KKLR 
TTNEHSQTCDWGNLLQDI ILQVFKYLPLLDRAHASQVCRNWNQVFHM 
PDLWRCFEFELNQPATSYLKATHPELIKQIIKRHSNHLQYVSFKVDS 
SKESAEAACDILSQLVNCSLKTLGLIST7^RPSFMDLPKSHFISALTV 
VFVNSKSLSSLKIDDTPVDDPSLKVLVANNSDTLKLLKMSSCPHVSP 
AGILCVADQCHGLRELALNYHLLSDELLI^SSEKHVRLEHLRIDW 
SENPGQTHFHTIQKSS WDAF IRKS PKVNLVMYFFLYEEEFDPFFRYE 
IPATHLYFGRSVSKDVLGRVGMTCPRLVELWCANGLRPLDEELIRI 
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AERCKNLSAIGLGECEVSCSAFVEFVKMCGGRLSQLSIMEEVLIPDQ 
KYSLEQIHWBVSKHLASTSEKSELSSKHSRSLKPDGRMSRTTADQKK 
PRGTES LS AS E S L I LKSD AAKLRSD S HSRS LS PNHNTLQTLKSDGRM 
PSSSRAESPGPGSRLSSPKPKTLPANRSSPSGASSPRSSSPHDKNLP 
QKSTAPVKTKLDPPRERSKSDSYTLDPDTLRKKKMPLTEPLRGRSTS 
PKPKSVPKDSTDSPGSENRAPSPHVVQENLHSEVVBVCTSSTLKTNS 
LTDSTCDDSSGFKGVGWKVSNKVHFI IGKAPLKDEQEMRAS PKI SRK 
CANRLTRPTLEQSSFLFKGDGSKPLEPAKQAMSPSVAECARAVFASF 
LWHEGIVHD/VMACSSFLKFHPELSKEHAPIRSSLNSQQPTEEKDTKIi 
KNRHSLEISSAL^^FNIAPHGPDISKMGSINKNKVIiSMLiKEPPLHEK 
FLKKXECEKENKKSKKEKKKKEKAEVRPRGNIiFGEMAQLAVGGPEKD 
TICELCGESHPYPVTYHMRQAHPGCGRYAGGQGYNSIGHFCGGWAGN 
CGDGGIGGSTWYLVCDRCREKYLREKQAAAREKAAEGRESAPVAPPG 
PCSPDLAPQPHIPQHLRSCVQSRSLPPPAAPARPLAADSALAHSPTG 
ALAR WR WGPRKR VVMVMAV I S KEHVKQSTDTTRRMS S CSNH I EMRQG 
GWAGNHSGVSGDFLDVLAERAGVKRGQCS SGGQPSVI 1 1 FHPPSWG 
1 JLtjb AAAAbfLt JUGJjSS rKCREALAIAVSHAGAPMANAATQTKPLHIG 
NAAKHG IEAAFLAMLGLQGNKQVLDLEAGFGAFYANYS PKVliPS IAS 
YS WLLDQQDVAFKRF PAHLS THWVADAAAS VRKHLVAERALLPTDY I 
KRIVLRIPNVQYVNRPFPVSEHEARHSFQYVACAMLLDGGITVPSFH 
ECQINRPQDGATFTDRSDTFYGHWRKPLSQEDLEEKFRANASKMLSW 
DTVESLIKIVKNLEDLEDCSVLTTLLKGPSPPEGLAVAQGQRDILED 
HQLQMDESLFWGMPNSHGKSLTTLRPTSYAEAQDCHREWSHGCMASL 
QLFSHPSPGTRHESEDTWDIQPSQAFR 

3737 

A 

114 

14222 

AAPVAAPGALFMPVPDGSVAAAGLGLGLPAADSPGHYQLLLSGRALA 
DR YRR I YTAALNDRDQGGGS AGHPASRNKKI LNKKKLKRKQKSKS KV 
KTRS KSENLENTVI I PD I KLHSNPS AFNI YCNVRHCVLE WQKKE ISL 
AAASKNSVQSGESDSDEEEESKEPPIKLPKIIEVGLCEVFELIKETR 
FSHPSLCLRSLQALLNVLQGQQPEVLQSEPPEVLESLFQLLLEITVR 
STGMNDSTGQSLTALSCACLFSLVASWGETGRTLQAISAILTNNGSH 
ACQTIQVPTILNSLQRSVQAVIiVGKIQIQDWFSNGIKKAALMHKWPL 
KEISVDEDDQCLLQNDGFFLYLLCKDGIiYKIGSGYSGTVRGHIYNST 
SRIR^KEKKSVn^YAQGYLLYRDVNNHSMTAIRISPETLEQDGTVM 
LPDCHTEGQN I L FTDGE YINQ I AAS RDDGFWR I FATSTEPVLQQEL 
QL KLARKCLHACR ISL FDLE KDLH IIS TG FDE E S A I LG AGRE FALM K 
TANGKI Y YTGKYQSLG I KQGGPSAGKWVELPITKS PKI VHFS VGHDG 
SHALLVAEDGS I FFTGS AS KGEDGES I KSRRQS KPYKPKKI IKMEGK 
I \ WYTACNNGS SS VI SKDGEL YMFGKDAI YSDS S S WTDLKGHFVT 
QVAMGKAHACVLMKNGEVWTFGVNNKGQCERDTGAMNQGGKGFGVES 
MA.SAMDEALEEELDEKDEKYMMCPPGMHKWKLEQCMVCTVCGDCTGY 
GASCVSSGRPDRVPGGICGCGSGESGCAVCGCCKACARELDGQEARQ 
RGILDAVKEMIPLDLLLAVPVPGVNIEEHLQLRQEEKRQRVIRRHRL 
EEGRGPLVFAGPIFMNHREQALARLRSHPAHVKHKRDKHKDGSGERG 
EKDASKITTYPPGSVRFDCELRAVQVSCGFHHSWLMBNGDVYTFGY 
GQHGQLGHGDVNSRGCPTLVQALPGPSTQVTAGSNHTAVLLMDGQVF 
TFGS FS KGQLGRP I LD VP YWNAKPAPM PN I GS KYGRKATW IG ASGDQ 
TFLRIDEALINSHVLATSEIFASKHIIGLVPASISEPPPFKCLLINK 
VDGSCKTFNDSEQEDLQGFGVCLDPVYPVIWRFRPNTRELWCYNAVV 
ADARLPSAADMQSRCSILSPELALPTGSRALTTRSHAALHILGCLDT 
LAAMODLKMGVASTEEETOAVMKVYS KEDYS WNP PF 9HGGGWGYS A 
HSVEAIRFSADTDILLGGLGLFGGRGEYTAKIKLFELGPDGGDHETD 
GDLLAETDVLAYDCAAREKTAMMFDEPVLLQAGWWYVAWARVSGPSS 
DCG SHGQAS I TTDDG WFQ FKS S KKS NNGTD VNAGQ I P QLL YRLPTS 
DGSASKGKQQTSEPVHILKRSFARTVSVECFESLLSILHWSWTTLVL 
GVEELRGLKGFQFTATLLDLERLRFVGTCCLRLLRVYTCEIYPVSAT 
GKAV^ETSKXAECIGKTRTLLRKILSEPLDHCMVKLDNBPQGYLSQ 
PLSLLEAVLQECHNTFTACFHSFYPTPALQWACLCDLLNCUDQDIQE 
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ANFKTSS SRLLAAVMS ALCHTS VKLTS I FP I AYDGEVL.LRS I VKQVS 
TENDSTLVHRFPLLVAHMEKLSQSEBNISGMTSFREVLEKMLVIVVL 
PVRNSLRRENEIiFSSHLVSNTCGLLASIVSEIiTASALGSBVDGLNSL 
HSVKASANRFTKTSQGRSWNTGNGSPDAICFSVDKPGIVWGFSVYG 
GGGIHEYELEVLVDDSEHAGDSTHSHRWTSLELVKGTYTTDDSPSDI 
AEIRLDKWPLKENVKYAVRLRNYGSRTANGDGGMTTVQCPDGVTFT 
FSTCS LS SNGTNQTRGQ I PQ I LY YRSEFDGDLQS QLLSKANEEDKNC 
SRALSVVSTVVRASKDLLHRALAVDADDIPELLSSSSLFSMLLPLII 
AYIGPVAAAIPKVAVEVFGLVQQLLPSVAILNQKYAPPAFNPNQSTO 
STTGNQPEQGLSACTTSSHYAVIESEHPYKPACVMHYKVTFPECVRW 
MTIEFDPQCGTAQSEDVLRLLIPVRTVQNSGYGPKLTSVHENLNSWI 
ELKKFSGSSGWPTMVLVLPGNEALFSLETASDYVKDDKASFYGFMCF 
AIGYEFSPGPDEGVIQLEKEIANLGGVCAAALMKKDLAIiPIGNELEE 
DLEILEEAALQVCKTHSGILGKGLALSHSPTILEALEGNLPLQIQSN 
EQSFLDDFIACVPGSSGGRLARWLQPDSYADPQKTSLILNKDDIRCG 
WPTTITVQTKDQYGDVVHVPNMKVEVKAVPVSQKKMSLQQDQAKKPQ 
RI PGS PAVTAAS SNTDMTYGGLASPKLDVSYEPMIVKEARYIAITMM 
KVYENYSFEELR\FASPTPKKPSENMLIRVNNDGTYCANWTPGAIGIi 
YTLHVTIDGIEIDAGLEVKVKDPPKGMIPPGTQLVKPKSEPQPNKVR 
KFVAKDSAGLRI\RSHPSLQ\SEQIGTSEKSMETITFIDEIHNDDGC 
V/ WLRLNDETIKKYVPNMNG YTEAWCLS FNQHLGKSLLVPVDESKTN 
TDDFFKDINSCCPQEATMQEQDMPFLRGGPGMYKWKTGPSGHNIRS 
CPNLRGIPIGMLVLGNKVKAVGEVTNSEGTWVQLDQNSMVEFCESDE 
GEAWSLARDRGGNQYLRHEDEQALLDQNSQTPPPSPFSVQAFNKGAS 
CSAQGFDYGLGNSKGDRGNISTSSKPASTSGKSELSSKHSRSLKPDG 
RMSRTTADQKKPRGTESLSASESLILKSDAAKLRSDSHSRSLSPNHN 
TLQTLKSDGRMPSSSRAESPGPGSRLSSPKPKTIiPANRSSPSGASSP 
RS S SPHDKNLPQKSTAPVKTKLDPPRERS KSDS YTLDPDTLRKKKMP 
LTEPLRGRSTSPKPKSVPKDSTDSPGSENRAPSPHWQENLHSEWE 
VCTSSTLKTNSLTDSTCDDSSEFKSVDEGSNKVHFSIGKAPLKDEQE 
MRASPKISRKCANRHTRPKKEKSSFLFKGDGSKPLEPAKQAMSPSVA 
EC ARAVF AS FLWHEG I VHDAMACS S FLKFHPELS KEHAP I RS S LNSQ 
QPTEEKETKLKNRHSLE IS YS VQMI FLMNVFNI / AHPHGPRYILRWG 
SINKNKVLSYALRNPPLHEKCEDGKTET\TFEMS\MHNTMKS\KSPL 
PLTLQHLVAFWED I S LAT I KAASQNMI FP S PGS CAVLKKKECEKGRN 
KKSKKEKKKKEKAEVRPRGNLFGEMAQLAVGGPEKDTICELCGESHP 
YPVTYHMRQAHPGCGRYAGGQGYNSIGHFCGGWAGNCGDGGIGGSTW 
YLVCDRCREKYLREKQAAAREKVKQSRRKPMQVKTPRALPTMEAHQV 
I K\ AQCQS FLL SL S S AAE PS I LC YHP AKP FQS QLP S VKEG I SEDLPV 
KMPCLYLQTLARHHHENFVGYQDDNLFQDEMRYLRSTSVPAPYISVT 
PDASPNVFEEPESNMKSMPPSLETSPITDTDLAKRTVFQRSYSWAS 
EYDKQHSILPARVKAIPRRRVNSGDTEVGSSLLRHPSPELSRLISAH 
SSLSKGERNFQWPVLAFVIQHHDLEGLEIAMKQALRKSACRVFAMEA 
FNWLLC^JVIQTTSLHDILWHFVASIiTPAPVEPEEEEDEENKTSKENS 
EQEKDTR VCEHPLSD I VIAGERAHPLPHTFHRLLQTI SDLMMSLPSG 
SSLQQMALRCWSLKFKQSDHQFLHQSNVFHHINNILSKSDEKSDSEES 
FSISIQSGFEAMSQELCIVMCLKDLTSIVDIKTSSRPAMIGSLTDGS 
TETFWE S GDED KNKTKN I T INCVKG / 1 S LPR YVS C FTVD I S \ RDLG \ 
NKVTSMTFLTGKAVQD/LCPRIKQ\VDLDSRHIGVArrSEIiPGGDNHI 
IKIELKGPENTLRVRQVKVLGW\ KDGESTKIAGQI S\ AS VAPARGNC 
EAGDSCRVFRIilTSQVFGKLISGDAEPTPEQEEKAIiLSSPEGEEKVY 
NATSDADLKEHMVG I TFSRSKLTNLQKQVCAHI VQAIRMEATRVREE 
WEHAISSKENANSQPNDEDASSDAYCFELLSMVLALSGSNVGRQYIiA 
QQLTLLQDLFSLLHTASPRVQRQVTSLLRRVLPEVTPSRIxASIIGVK 
SLPPADISDIIHSTEKGDWNKLGILDMFLGCIAKALTVQLKAKGTTI 
TGTAGTTVGKGVTTVTLPMI FNSS YLRRGESHWWMKGSTPTQI SEI I 
I KL I KDMAAGHL S EAWS RVTKNA IAETI I ALTKMEEEFRS P VRC I AT 
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TRLWjLALAS LCVI>DQDHVDRIjS SGRVfMGKDGQQKQMPMCDNHDDGET 
AAI I LCNVCGNLCTDCDRFLHLHRRTKTHQRQVFKEEEEAI KVDLHE 

GCGR\ TKLF WIiMGTGQ I lktmegqwvgfprntqgkptts sskacrfc 

GSRSGTELSAVGSVCSDAHCQEYAKIACSKTHPCGHPCGGVKNEKHC 
LPCLHGCDKSATSLKQDAHDMCMICFTEALSAAPAIQLDCSHIFHLQ 
CCRRVliENRWLG PR I TFGF I SCP I CKNKINH I VLKDLLDP I KELYED 
VRRK \ AJjMRJjB YEGIiHKS E A I TTP / GWKr YND P AG YAMNR YAYYV C 
YKCRKAYFGGEARCDAEAGRGDDYDPRELICGACSDVSRAQMCPKHG 
\TDFL\EYKCRYCCSVAVFFCFGTTHF\ CNACHDDFQRMTS IPKEEL 
PHCP\AGPKAJCQL\EGTECPLHWHPPT\GKE\FAIiGCGVCRNAHTF 
KTTQ I LCLQRE KLPS S PTEDGGEV 

3 738 

A 

373 

662 

URKNSCiQAQNrNLRIPrS I rjNJUXNljFIir *NDSQKST*/ SDSHLiARb 
SQFCSJLN*NY*I*TAKSHDWCTRQHFPSLSESYIWHVKEKKYNPTA 

AAI 

3739 

A 

29 

658 

LSVASLSFLPNASAEDTMSRLSRSLLWAATCLGVLCVLSADKNTTQH 
PNVTTLAPISNVTSAPVTSLPLVTTPAPETCEGRNSCVSCFNVSVVN 
TT\CFW\IDCKDESYCSHNST\VSD\CQVGNTTDFCSVST\ATPVPT 
ANS\TAKTHSSSPPLLQLPRQLLHQGTTNNTVNS\TSQPVRKSTFDA 
AS F I GG I VLVliE I RCHTRNY I PDLKK 

3740 

A 

1 

183 

KKGLSPEQDIKFSGHVSVfVGKTSMEVKMQMFQ\PAFVNPIiIPESPEE 
EELFRQGECTLICL 

3741 

A 

185 

1548 

LNQGPQNPKKQGNFHIHETSSSIHANHVRDKLREIV\KASTNVmi)HV 
KAMEERKLLHSFIJUCSQDGLPPRRMKDSYIEVLLPLGSEPELREKYL 
TVQNTVRFGRI LEDLDSLGRFLFCYMHNKIHS AKMSPLS IVTAIjVDK 
IDMCKKS LS PEQDI KFSGHVS WVGKTSMEVKMQMFQLHGDEFCPVLD 
ATFVMVARDSENKGPAFVNPLIPESPEEEELFRQGELNKGRRIAFSS 
TSLL\KMAPSAEERTTIHEMFLSTL\DPKTISFRSRVLPSNGSVGWE 
NSKLKEFWEI/CAHPSGSRNIFNRILVGFLMRKAYELAWATACS\FG 
\GSRPFVGAVDD\IMF\QNPVEVG\SLLF\LSSQVCFTQNNYI\QVR 
VH\SEVALPLQEKPAIQPHPMSFHFHRFHVGKKESAPL\VFPPKTYG 
ESPCCTLDG\QRAFPTPMSGP\ATLEKGPTLVEPLRTPTIC 

3742 

A 

352 

1692 

RRG VS GARVRVAFQGNERG S KWAARGGARDAGGRGRSGLWKGPKLMR 

RAPAAERLLELGFPPRCGRQEPPFPIiGVTRGWGRWPIQKRREGARPV 

PFSERSQEDGRGPAARSSGTLWRIRTRLSLCRDPEPPPPLCLIiRVSL 

LCALRAGGRGSRWGEDGARliLLLPPARAAGNGEAEPSGGPSYAGRML 

ESSGCKALKEGVLEKRSDGLLQLWKKKCCILTEEGLLLIPPKQ\LQH 

QQQQwQQQwQQwQQPGQGPA\EPSQPSGPA\VASIjEPPVKI^ 

SNMKTVX>CVERKGKY\MYFTVVMAEGKEIDFRCPQDQGWNAEITLQM 

VQYKNRQAILAVKSTRQKQQHLVQQQPPSQPQPQPQLQPQPQPQPQP 

HQIPHPHPQPHSQPHGHRLLRST\SNSA 

3743 

A 

2 

286 

TALQRH S E P S KLT F VG E L AHGR F S AK/ MG TLAL.G VYHGL PAS HMELA 
RP 

3744 

A 

1 

287 

RHEDLPBISSRMTQRHIQRGPPHLQIRPPSQDTNHDSRIiLKPEQVSV 
GRPEGGFEWPGTGVTEWTAQGAALPGLVKPQVSPIQGSGGAAVTPE 
A 

3745 

A 

1 

339 

RPLLAFRLEEEQKNRPEIAGLKPANPPVLPAPQKADTDPENLPGXMG 
* PSGOPRVOLCPGS * SLiRCPOSKGLEGLP * LOR PFAPGLALiVFTS WT 
QGSSWPPRRGLGGLDGKSV 

3746 

A 

2 

991 

LEFRELSRIiTGDKKFQEAVEKVTQHIHGLSGKKDGLVPMFlNTHSGL 
FTHLGVFTLGARADS^EYXiLKQWIQGGKQETQLLEDYVEAIEGVRT 
HLLRHS EPS KLTFVGELAHGRF S AKMDHLVCFLPGTLAIiGVYHGIiPA 
SHMELAQELMETCYQMNRQMETGLSPEIVHFNLYPQPGRRDVEVKPA 
DRHNLLRPETVESLFYLYRVTGDRKYQDWGWEILQSFSRFTRVSTCP 
RPAWS RP PGHRHG WAVGLRLAPLLVVAVTWVREGRACRS LiGVATLQL 
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GG PG I P I PTELHGCGTQS TWRG WAPL I F I S QGLSREGLCCGPS I PP V 

3747 

A 

1 

2296 

FRVDGAAMAACEGRRSGALGSSQSDFLTPPVGGAPWAVATTWMYPP 
PPPPPHRDFISVTLSFGESYDNSKSWRRRSCWKKWKQLSRLQRNMIIi 
F\LLAFLLFCGLLFYINIiADHWKALAFRLEEEQKMRP\EFAGLKPAN 
PPVLPAPQKADTDPENLPEISSQKTQRHIQRGPPHLQIRPPSQ\DLK 
DGTQEEATKRQEAPVDPRPEGDP\QRTVISWRGAVIEPEQGTELPSR 
RAEVPTKPPLPPARTQGTPVH\LNYRQKGVIDVLLHAWKGYR\KFAW 
GHDDLKPVSR S FS EWFGLGLTL I DALDTMW I LGLRKEFEEARKWVS K 
KLHFEKDVDVNL,FESTIRILGGLLSAYHLSGDSLFIiRKAEDFGNRLM 
PAFRTPSKIPYSDVNIGTGVAHPPRWTSDSTVAEVTSIQLEFRELSR 
IiTGDKKFQEAVEKVTQHIHGLSGKKDG\LVPLFHQLPTVGLFTHIiG\ 
VFTLGARADSYYEYLLKQWIQGGKQETQIiLEDYVEAIEGVRTHLIiRH 
S E P S KLTFVGELAHGRF S AKMDHLVC FLPGTLAIiGVYHGLPASHMEL 
AQELMETC YQMNRQMETGLS PE I VHFNLYPQPGRRDVEVKPADRHNL 
LRPETVESLFYLYRVTGDRKYQDWGWEILQSFSRFTRVPSGGYSSIN 
NVQD PQKPE PRDKMES F FLGETLKYLFLL FS DD PNIiLS LDAYVFNTE 
AHPLPIWTPGLGWMAAGVGTSGGQRHLAGSVAFSKGPRRHRQPPSGP 
GSELGLGFLLVSALIRTP 

3748 

A 

2 

183 

KMTEPGASPEDLWVKVEYAYSDNSLDPDPAGDCDWRPAPIEKPALLT 
GSWKPGEQPTRGP 

3749 

A 

1 

3816 

MARTVVLITGCSSGIGLHIiAVRLASDPSQSFKVYATLRDLKTQGRLW 
EAARALACPPGSLETLQLDVRDS KS VAAARERVTEGRVDVIiVCNAGIi 
GLLGPLEALGEDAVASVLDVNWGTVRMLQAFLPDMKRRGSGRVIiVT 
GSVGGLMGLPFNDVYCASKFALEGLCESLAVLLI>PFGVHSLSbIECG 
PVHTAFMEKVLGSPEEVLDRTDIHTFHRFYQYLAHSKQVFREAAQNP 
EEVAEVFLTALRAPKPTLRYFTTERFLPLLRMRLDDPSGSNYVTAMH 
REVFGDVPAKAEAGAEAGGGAGPGABDEAGRSARPRKQAHQRIWWR 
EPPTAARYTQGLSFS I CEMGTWTDLS S SLPCENPRTERRGSDRHQGF 
THSQSLRDQPHPISRTQPMPLEISRESRAEAKSRRLTFLRFRSSELR 
FLRLGSMAVFRSGLIiVT.TTPIASLAPRIJVSILTSAARI.VNHTLYVHIi 
QPGMS LEG P AQPQY SPVQAT FEVLDF I THLYAGADVHRHLDVR I LLT 
NIRTKSTFLP PLPTS VQNIxAHPPEWliTDFQTIiDGSQ YNPVKQQIiVR 
YATSCYSCCPRIiASVLLYSDYGlGEVPVEPLDVPLPSTIRPASPVAG 
SPKQPVRGYYRGAVGGTFDRLHNAHKVLLSVACILAQEQLWGVADK 
DLLKSKLLPELLQPYTERVEHLSEFLVDIKPSLTFDVIPLLDPYGPA 
GSDPSLEFLWS EETYRGGMAINRFRLENDIiEELALYQI QLLKDLRH 
TENEEDKVSSS S FRQRMLGNLLRPPYERPELPTCLYVIGLTGI SGSQ 
KS S I AQRLKGLGAF VI DSDHLGHRAYAPGGPAYQ P WEAFGTD I LHK 
DGI INRKVLGSRVFGNKKQLKILTDIMWPI I AKLAREEMDRAVAEGK 
RVCVIDAAVLLEAGWQNliVHEVWTAVI PE TE AVRR I VERDGLS EAAA 
QS RLQS QMSGQQLVEQSHWLSTLWEPH I TQRQSLQPGG VGRCGWGIi 
LCPVPDTPASSAGPVEYAYSDNSLDPGIiFVESTRKGSWSRANSIGS 
TS AS S VPNTDDEDSDYHQEAYKES YKDRRRRAHTQAEQKRRDAI KRG 
YDDLQTIVPTCQQQDFSIGSQKLSKAIVLQKSMARESTGGDGAKRGC 
EEASASYPPMARPWCGDCHGIVGVWAIDYIQFLHKEKKKQEEEVSTL 
RKDVTALKIMKVNYEQIVKAHRDNPHEGEDQVSDQVKFNVFQGIMDS 
LFQSFNAS I S VASFQELS ACVFSW IEEHCKPQTLiREI VI GVLHQLKN 
QLY 
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3751 

A 

16 

689 

PSPGACHEETWVKVEYAYRDHSLDPGLFVESTRKGSVVSRANSIGST 
SASSVPNTDDEDSDYHQEAYKESYKDRRRRAHTQAEQKRRDAIKRG\ 
YDDLQT I VPTCQEQDFS IG S QKLNKA I LLQK/ TYG * GKHWRGWS PEG 
Ii* KGQSLrLPTHGSALVR*LP *DSWGLG * GEQSQP YLLG * EYFCTCPF 
FLAPTLAIDYIQFLHKEKKKQEEEVSTLRKDVTRLKIM 

3752 

A 

2 

981 

IEALEPPVDPVDPRVRKMTEPGASPEDPWVKVEYAYSDNSLDPGIiFV 
ESTRKGSWSRANSIGSTSASSVPNTDDEDSDYHQEAYKESYKDRRR 
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RAHTQAEQKRRDAIKRGYDDLQTIVPTCQQQDFSIGSHKLSKAIVLQ 
KTIDYIQFI»HKEKKKQEEEVSTLRKDVTALKIMKVNYEQIVK7\HQGQ 
PHKKGRTEFSDQSRFQRVSKGIMDSIiF\QSFNAS\lSVA\SFQELSA 
CVFSWIEEHCKPS/QTLREIVIGVLHQLKNQLYLTGSWKPGRTANKR 
PLNLYVATELLGPGDWTTTPHTGQLDIYLVFGFSQPHFIFSGAAVFV 
L 

3753 

C 

150 

584 

MHPSCCPCICKPFSTLVKSHVTASASSVPDSTCLRRGQQSCEWIiWQ 
DRVSFQARELKAVINSAVQTSSCFIRHLPRSVSSAIAEGVTSVAIIS 
SSSPSAGGALLWSAPFFETQQTAVAPPASLETLGFHLQPHIiLHTAGS 
FG** 

3754 

A 

292 

483 

LGSLLSLSFLFCTPVELGPLVFCSGEDEAADENPESQEMLEEQLVRM 
LTREVMDLITVCCVSKK 

3755 

A 

84 

316 

ggklini kylfyts y* flsfpwhfllaasraqqgevmrlacrl.dpkt 
sfqmagewlkyqlstfld\agsvnwkffsll 

3756 

A 

423 

1186 

LPKDPLNILEEENHIKDALSRIWEMIKREWPQHWPDMIjIELDTLSK 

qgktdtsqeskaqancrvgvaalntlagyidvatsmshitaencklle 
ilclllneqelqlgaaeclliavsrktcqeaegavrpsltkgawngg 

GDGTSEPGAEGWAYRSPLHSPNIiRAAALSPDSGCVWQALCSLLTCSW 
MLQNQKMKLS FVLGSWS I QMERGNALWLRS LLW PGLTFYHAPRTKN 
YGYVYVGTGEKNMDLPFML 

3757 

A 

222 

435 

PCFLALAVCCVS KKGADHS S APPADGDDEEMMATEVTPSAMAELTDL 
GKCLMKHEDVCTALL ITAFH\ SLAW 

3758 

A 

1 

936 

MAEPFTKAIiDMLDAEKSAILGLPQPLLELNDSPVFKTVIiERMQRFFS 
TLYEN/CECPWVGLFI*CTEHYEAIiVSPILGPLFTYLH\CGEDEAAD 
ENPESQEMLEEQLVRMLTREVMDLITVCCVSKKGADHSSAPPADGDD 
EEMMATEVTPSAMAELTDLGKCLMKHEVLSGTLLADAVTWLFTSVLK 
GLQMHGQHDGCMAS LVHLAFQ I YEALRPR YLE IRAVMEQ I PE I QKDS 
LDQFDCKLLNPSLQKVADKRRKDQFKRLIAGCIGKPLGEQFRKEVHI 
KNLPSLFKKTKPMLETEVLDNDGGGLATIFEP 

3759 

A 

30 

3498 

VAIVRHFGLQILEHVVKFRWNGMSRLEKVYLKNSVMELIANGTLNIL 
EEENHIKDALSRIWEMIKREWPQHWPDMLIELDTLSKQGETQTELV 
M F I LLRLAED WTFQTLP PQRRRD I QQTLTQNMER I FS FLLNTLQEN 
VNKYQQVKTDTSQESKAQANCRVGVAALNTLAGYIDWVSMSHITAEN 
CKLLEILCLLLNEQELQLGAAECLLIAVSRKGKIiEDRKPLMVLFGDV 
AMHYILSAAQTADGGGLVEKHYVFLKRLCQ\n^CALGNQLCALLGADS 
DVETPSNFGKYLESFLAFTTHPSQFLRSSTQMTWGALFRHEILSRDP 
LLLAIIPKYIjRASMTI^VKMGFPSKTDSPSCEYSRFDFDSDEDFNAF 
FNSSRAQQGEVMRLACRLDPKTSFQMAGEWLKYQLSTFLDAGSVNSC 
S AVGTGEG S LCSVFSPS FVQWEAMTLFLE S VI TQMFRTLNREE I P VN 
DGIELLQMVLNFDTKDPLILSCV\LTt^SALFPF\VTYRPEFLPQVF 
S KLFS S VTFETVEES KAPRTRAVRNVRRHAC S S 1 1 KMCRD YPQLVLP 
NFDMLYNHVKQLIiSNELLLTQMEKCAliMEALVLISTQFKNYERQKVF 
LEELMAPVASIWLSQDMHRV\LSDVNAFVAYVGTDQKSCDPGIiEDQC 
GLNRARMSFCVYSILGVVKRTCWPTDLQEAKAGGFAVGYTSSGNPIF 
RNPCTEQILKLIjDNLLALIRTHNTLYAPEMIiAKMAEPFTK7Uj\DMLD 
AEKSAILGLPQPLL\ELNDSPVFKTVL\ERNAAFLSSYPLMENCFHI 
LGEGQGPSMQQDFYTVEDllATQLLSSAF\VNllNNIPDYRLiRP^fI>RVF 
VKPLVLFCPPEHYEALVSPILGPLFTYLHMRLSQKWQVINQRSLLCG 
EDEAADENPESQEMLEEQLVRMLTREVMDL I TVCCVS KKGADHS SAP 
PADGDDEEMMATEVTPSAMAEItTDLGKCLMKHEDVCTAIjLITAFNSL 
A\WKD\TLSCQRTT/SQKFCLPLLQTKCLSGTLLADAVTWLFTSVLK 
GLQMHGAARTGCMAS^>GPIWPFQIYEALRPR\Y\LEIRAVMEQIP\E 
IQKDSLDQFDCKLLNPSLQKVADKRRKDQFKRLIAGCIGKPLGEQFR 
KEVHIKNLPSLFKKTKPMLETEVI,D\NDGGG\LATIFEP 

3760 

A 

1 

168 

PLKRSIX3CNDGRPTRPPTRPDTTVFTSNLKQTRMVH1.TPVERVCRYC 
PVG\QGKPG 
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3761 

B 

54 

1731 

XKLSRECE I KYTGFRDRPHEERQARFQNACRDGRSE I AFVATGTNLS 
LQFFPASWQGEQRQTPSREYVDLEREAGKVYLKAPMILNGVCVIWKG 
WIDLQRLDGMGCLEFDEERAQQEDALAQQAFEEARRRTREFEDRDRS 
HREEMEVHBLEKSKRALETQMEEMKTQLEELEDELQASEDAKLRLEV 
NMQALKGQFERDLQARDEQNEEKRRQLQRQLHEYETELEDERNERAL 
AAAAKKKLEGDLKDLELQADS AI KGREEA I KQLRKLQAQMKDFQREL 
EDARASRDEIFATAKENEKKAKSLEADLMQLQEDLAAAERARKQADIj 
EKEELAEELASSLSGRNALQDEKRRLEARIAQLEEELEEEQGNMEAM 
SDRVRKATQQAEQLSNELATERSTAQKNESARQQLERQNKELRSKLH 
EMEGAVKSKFKSTIAALEAKIAQLEEQVEQEAREKQAATKSLKQKDK 
KLKEILLQVEDERKMAEQYKEQ^KGNARVKQLKRQLEEAEEESQRI 
NANRRKLQRELDEATESNEAMGREVNALKSKLRGPPPQETSQ* 

3762 

A 

1 

2448 

PRWPDQRSKFENEEFFRKLSRECEIKYTGFRDRPHEERQARFQNAC 
RDGRS EI AFVATGTNLS LQFFPASWQGEQRQTPSREYVDLEREAGKV 
YLKAPM I LNGVCVI WKGW IDLQRLDGMGCLE FDE ERAQQEDALAQQA 
FEEARRRTREFEDRDRSHREEMENEVESVTGMLNEAEGKAIKLAKDV 
ASLSSQLQDTQELLQEETRQKLNVSTKLRQLEEERNSLQDQLDEEME 
AKQNLERHI STLNI QLSDSKKKLQDFAS TVEALEEGKKRFQKE IENL 
TQQYEEKAAAYDKLEKTKNRLQQELDDLVVDLDNQRQLVSNLEKKQR 
KFDQLLAEEKNISSKYADERDRAEAEAREKETKALSLARALEEALEA 
KEELERTNKMLKAEMEDLVSSKDDVGKNVHELEKSKRALETQMEEMK 
TQLEELEDELQASEDAKLRLEVNMQALKGQFERDLQARDEQNEEKRR 
QLQRQLHEYETELEDERNERALAAAAKKKLEGDIjKDLELQADSA I KG 
REEAIKQLRKLQAQMKDFQRELEDARASRDE I FATAKENEKKAKSLE 
ADLMQLQEDLAAAERARKQADLEKEELAEELASSLSGRNALQDEKRR 
LEARIAQLEEELEEEQGNMEAMSDRVRKATQQAEQLSNELATERSTA 
QKNESARQQLERQNKELRSKLHEMEGAVKSKFKSTIAALEAKIAQLE 
EQVEQEAREKQAATKSLKQKDKKLKEILLQVEDERKMAEQYKEQAEK 
GNARVKQLKRQLEEAEEESQRINANRRKLQRELDEATESNEAMGREV 
NALKSKLRGPPPQETSQ 

3763 

A 

1 

2655 

PRWPDQRSKFENEEFFRKLSRECEIKYTGFRDRPHEERQARFQNAC 
RDGRSEIAFVATGTNLSLQFFPASWQGEQRQTPSREYVDLEREAGKV 
YLKAPMILNGVCVIWKGWIDLQRLDGMGCLEFDEERAQQEDALAQQA 
FEEARRRTREFEDRDRSHREEMEAKANLDKNKQTLEKENADLAGELR 
VT.GQAXQEVEHKKXKLEAQVQELQSKCSDGERARAELNDKVHKLQNE 
VESVTGMI^NEAEGKAIKLAKDVASLSSQLQDTQELLQEETRQKLNVS 
TKLRQLEEERNSLQDQLDEEMEAKQNLERHISTLNIQLSDSKKKLQD 
F ASTVEALEEGKKRFQKE I ENLTQQYEEKAAAYDKLEKTKNRLQQEL 
DDLVVDLDNQRQLVSNLEKKQRKFDQLLAEEKNISSKYADERDRAEA 
EAREKETKALSLARALEEALEAKEELERTNKMLKAEMEDLVSSKDDV 
GKNVHELEKSKRAIiETQMEEMKTQLEELEDELQASEDAKXiRLEVNMQ 
ALKGQFERDLQARDBQNEEKRRQLQRQLHEYETELEDERNERALAAA 
AKKKLEGDLKDLELQADSAIKGREEAIKQLRKLQAQMKDFQRELEDA 
RASRDE I FATAKENEKKAKSLEADLMQLQEDLAAAERARKQADLEKE 
ELAEELASSLSGRNALQDEKRRLEARIAQLEEELEEEQGNMEAMSDR 
VRKATQQAEQLSNELATERSTAQKNESARQQLERQNKELRSKLHEMB 
GAVKSKFKSTIAALEAKIAQLEEQVEQEAREKQAATKSLKQKDKKLK 
EILLQVEDERKMAEQYKEQAEKGNARVKQLKRQLEEAEEESQRINAN 
RRKLQRELDEATE SNE AMGRE VNALKS KLRG P P PQETSQ 

3764 


1 

J ^ O 

MRLHRLHARPSAVACGLLLLLLQGHYEAPAEGLCCRPVITVIASGSS 
WLSMCGVLGQGGALSPPKGPQTSMMRKPACFDGSKLGKFGKESKNMA 
GYQGVISWDTKGEKGGKMPRWPDQRSKFENEEFFRKLSRECEIKYT 
GFRDRPHEERQARFQNACRDGRSEIAFVATGTNLSLQFFPASWQG\D 
QRQTPIREYVDLEREAGTVYLKAPMILNGVCAIWKGWIDLQRLHGMG 
CLEFDEERAQQEDALTQQAFEEARRMTREFEDRDRSHRQEMEARVSQ 
LLAVTGKK\QLDPRPGSNLGGGDDLKLR 
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3765 

A 

1 

234 

METSSLRITPDLLNARATERES PKVGP I IGVLPRPPGTLAS ICGGLV 
MLLPB I TG I ALPETVDDVE KLGRHCPKFNAP 

3766 

A 

1 

1697 

LRAAGGGRTMPSFDEALQRVGEFGRFQRRVFLLLCIiTGVTFAPLFVG 
WFLiGTQPDHYWCRGPSAAALAERCGWSPEEEWNRTAPASRGPEPPE 
RRGRCQRYLLEAANDSASATSALSCADPLAAFPNRSAPLVPCRGGWR 
YAQAHS T I VS E FDLVCVNAWMLDLTQAI LNLG FLTGAFTLG YAADR Y 
GR I V I YLL S CLG VG VTG VWAF APNF P VF VI FR FLQG VF F \ SLTWMT 
CYVIVTEIVGSKQRRIVGIVIQMFFTLGIIILPGIAYFIPNWQGIQL 
AITLPSFLFLLYYWEVPESPRWLITRKKGDKALQILRRIAKCNGKYL 
S SNYS E I TVTDEEASNPS FLDLVRTPQMRKCTL I LMFAWFTS AWYQ 
GLVMRLGIIGGNLYIDFFISGWELPGAIiLILLTIERLGRRLPFAAS 
N I VAGVACLVTAFLPEG I AWLRTTVATLGRLG I TMAFE I VYLVNSEL 
YPTTLRNFGVSLCSSLCDFGGIIAPFLLFRIiAAVWIiELPLIIFGILA 

S ICGGLVMLLPETKGI ALPETVDDVEKLGS PHSCKCGRNKKTPVSRS 
HL 

3767 

A 

103 

370 

IGVPGAI S SSASGRLAVALGTILS FSFS I SEFFRFWLRWGQVDLSG 
DWYCTYPLF I FLS ALSLED YS PETKVFK* LKTKHDSHMTGN 

3768 

A 

97 

1639 

CPLQW I CA INN I S RQI YLTDNPEAVAI KLNQTALQAGTP I TS FGKKQ 
ESSCPSQNLKNSEMENENDKIVPKATASLPEAEELIAPGTPIQFDIV 
LPATEFLDQNRGSRRTNPFGETEDESFPEAEDQQQQEVNEILSGKCC 
LPDSLLQQMFIVRFLGSMAVKTDSTTEVIYET^MRQVIAARAIHNIFR 
MTES HLMVTS Q S LRAPGGRGGCGRS FS RLKCS CL PALKRAAGL P AQR 
LS S AKGQTAS SSES LTLWPDGETPPSRNRQTPHTGE FQLASGHQHQ 
RPKVDKS TKMRKKQHRKAENS KNQNAS PPPKDHNSLPAREQNWTENE 
FEEFTERGFRSDRENGTKLENTLQDIIQENFHNLARQANIQIQEIQR 
TPQRYS SRRATPRHI I IRFTKVEMKEKMLRAAREKGQWTHKGKSIR 
LTADIiSAETLQVRREWGPIFNNLKEKNFQPRISYPAKLSFISKGEIK 
F\FTDKQMLKDWTTRPAL\KELLKEALNMERKNQYQPLQKHTKM ' 

3769 

A 

159 

277 

WDNEWinjVLTDSTPMLSQASLCVYRYNRIAREWTQKYAM 

3770 

A 

1 

474 

HPPPTMAIiKRIHKELNDLARDPPAQCSAGPVGDDMFHW\QA\TIMGP 
NDGPL/YQGGVFFLTIPFPQQITPFKP\PKVAFTTRIYHP\NINHNG 
\SICLDILRSQWSPALTISKVLLS\ICSLLCDPNPDDPLVPEIARIY 
KT\ DRE \ KYNR I AR\ EWTQKYAM 

3771 

A 

153 

291 

LKQLGGLFTFSNVQTYFPLS I FS FTS FCL S I PHPCLFLKP I DGTH 

3772 

A 

1 

336 

MGRAWWLMPVIPALWEAEAGGSPESPSFLCAQQPLSRNRQPLSPSSS 
VFVPPQSPSFLCAQPPTQNIMLVSHIGGIMLLESGELEVANISERRD 
QPTLTQDLATLLKCPGIQ 

3773 

- - 

A 

1 

2336 

MVRSWLTATSASQVQAILLPQPPKTQLNFTVAIDFTASNGETRMSEK 
VGGNPLQPTSLHYMSPYQLSAYAMALKAVGEIIQDYDSDKLFPAYGF 
GAKLPPEGRISHQFPLNNNDEDPNCAGIEGVLESYFQSLRTVQLYGP 
TYFAPVINQVASNSWSSVTLGTDSEPAVEVPQYVGIRLLVEGFTIKK 
PMAMCHRRMGVRPAVPLLTQRGSGEGKDSGTPTHSLHTKAQLPSPHV 
LRHQGQVLRRQHS KL VGTKALS TTGKALRTLPTAKVF I S LP * GPWHR 
R*AGRDSCSAGR*AAEDSLRLFHSSVSHHWKP*SQWLL*SSAVSWEA 
EAGELLEPGRRRLQ 

3774 

A 

1015 

1292 

AGRAWWRMPYNPQHFGRPRRVDHLRSGVQDQPGQ\HGETP\SLLKGP 
KIGWAWWRTPVIP\ATWEAEP\GELLEPRRQRLQVSQDRTTALQPGQ 
E 

3775 

A 

183 

511 

SQHFGRLRWADCLRSGIQDQPGQHGETLSLLKIQ\KLARSGGTCLQS 
\£uijj\KijK\i&iHrir oJj^yijiivjUi>E.PRSGHCTPAWAR V/ GHSVLiKTNEQKN 
SSTLHTKSHWIFQFPGH 

3776 

A 

238 

362 

LKRQANTFPILS*TKNTHNVAFLLGKTCFLYFT*ECNITLP 

3777 

A 

276 

503 

RCS PCSEENKPVFLPFT* KRS IKTTVLS *GCGEIRTHI WEWNAKVKL 
WLTRIFGQVQWLTPVIPTPWEAKVGRIT 

3778 

A 

131 

518 

CSMVPSSASGQVRSHYVDVmMWRDVKRRKMAYEYADERLRINSLRKN 
TILPKILQDVADEEIAA\LPR\DSCPWIRNRCVMTSRPRG\VKRRW 
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RLSR I VFRHLADHGQLSG IQRATCLNS IQLFLYFFFK 

3779 

A 

60 

363 

RAEQVCEAKCLRGAPRGENPEEKETAR I G PEKQPTS S PL IMANI HQE 
NEEMEQPMQNGEEDRPIX3GGEGHQPAGNRRGQARRLAPNFRWAIPNR 
QIHDGM 

3780 

A 

2 

616 

ERTEKSDRAVRSGAPHADGSREAVFCACIWRWRVGGRGRSCCTGRPS 
WGASSAVTRGPHVTRSCRATQPSLQSSLASGYQLPAALSSARSAFGP 
GKSGAGSATAKC\SSGAPRGENPEEKETARIGPGKTKKMEQPYARME 
EEDR / HLWGGGEGPPS PARKI ERG \ KARPTCPLNFRMGP YPNRPESM 
DGDGWKMEMDMGKYSMGGR 

3781 

A 

2 

886 

RQRQRPRAG PAGRTGGQ KAPGAR VP PG P AMEGAS FG AGRAG AALD P V 
SFARRPQTLLRVASWVFSIAVFGPIVNEGYVNTDSGPELRCVFNGNA 
GACRFGVALGLGAFLACAAFLLLDVRFQQISSVRDRRRAVLLDLGFS 
GLWSFLWFVGFCFLTNQWQRTAPGPF\TTHAGDAARAAIAFNFFSIL 
S WVALTVKALQR FRLGTDMS LFATEQLS TGASQAYPGYPVGSG VEGT 
ETYQSPPFTETLDTSPKGYQVPAYYRLAGTDQGSKATPPTQAPGSPG 
PPLGPSSSVPRTE 

3782 

A 

1 

1944 

MADRTKELESIPDSRNRGSIMGNSRDFADFGTTIKQDFRLLGQTSVD 

RLLQLS QGQAVKGNQLLPVS LVKRKTTLAPNTQTAS PRALADSLMQL 

ARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGN 

QLLPVSLVKRKTTIJ^NTQTASPRALADSLMQLARQVSRLESGQWTL 

VPNVn^TTQPLSGDILLIYKGFCRFRNHHQTGFSLAGANQRGPLAATL 

SGPGGEGQSAVARLTGEKKNHPGAQYANRLSPRVGRFINAAGTTGFP 

TGKRAGYLSEGLVTKWYRSPRLLLSPNNYTKAIDMWAAGCILAEMLT 

GRMLFAERDRAS AS S LLTI ACLLLTKQPS S VAYQMAPLLLYLVLPCQ 

HLISSPIIPHTEAVGTYVMASLSTIGPTSFWLDFADFGTTIKQDFRL 

LGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRAL 

ADSLMQLARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQ 

GQAVKGNQLLPVSLVKRKTTLAPNTQTASPRAIJ\DSLMQLARQVSRL 

ESGQNFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSL 

VKRKTTLAPNTQTAS PRALADS LMQLARQVS R/ HGKRAG YLSEGLVT 

KWYRS PRLLLS PNNYTKA I DMWAAGC I LAEM LTGRMLFA / GAG * S LC 

ILLAHHCLPAINKAALLCGLPDGPSSSLSCPPV/RSTSSHLPSSPTL 

RLLAPTSWPPSPQLVPHHSGWILPISEPPS / QQDFRLLGQTSVDRLL 

QLS QGQAVKGNQLLPVS LVKRKTTLAPNTQTAS PRALAD S LMQLARQ 

VSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLL 

P VS LVKRKTTLAPNTQTAS P RALAD SLMQ LARQVS RLES GQD FAD FG 

TT I KQDFRLLGQTSVDRLLQLS QGQAVKGNQLLPVS LVKRKTTLAPN 

TQTASPRALADSLMQLARQVSRLESGQWTLVPNWNTTQPLSGDILLI 

YKGFCRFRNHHQTGFSLAGANQRGPLAATLSGPGGEGQSAVARLTGE 

KKNHPGAQYANRLSPRVGRFINAAGTTGFPTGKRAGYLSEGLVTKWY 

RS PRLLLS PNNYTKAIDMWAAGCILAEMLTGRMLFAERDRAS AS SLL ' 

TIACLLLTKQPSSVAYQMAPLLLYLVLPCQHLISSPIIPHTEAVGTY 

VMASLSTIGPTSFWLD FAD FGTTI KQDFRLLGQTSVDRLLQLS QGQA 

VKGNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESG 

QDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKR 

KTTLAPNTQTAS PRALAD S LMQLARQVS RLES GQNFADFGTT I KQDF 

RLLGQTSVDRLLQLSQGQAVKGNQLLPVS LVKRKTTLAPNTQTAS PR 

ALADSLMQLARQVSRLESGQ 

3783 

A 

1 

578 

MGVEGRYPVSLSSDLEWRPDRCQDASEVQRDPRAGSAPLAEDVQVDP 
RKDSHSSSERFLEQSHSSMERAFEADYGRSCDYKVGSPSYLDKLLWR 
DNKPHHYS EPKL I LDLSHWKQAAGAPPTATGLADTGAREDEPASLFL 
ENPQGVKSTQG\AQOTPARPPTTPSAAWLPRPP\PPGPTLADSCSVT 
KWGQV 

3784 

A 

861 

2636 

GARGRPIjAETWPFLTAPVLPGQLQITEPTMAEKGDCIASVYGYDLGG 
RFVDFQPLGFGVNGLVLSAVDSRACRKVAVKKIALSDARSMKHALRE 
IKIIRRLDHDNIVKVYEVLGPKGTDLQGELFKFSVAYIVQEYMETDL 
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ARLLEQGTIAEEHAKLFMYQLLRGLKYIHSANVLHRDLKPANIFIST 
EDLVXiKIGDF\ GLAKESLNQ\ HY S \ HKG YLSEGLVTKWYRS PRLLLS 
PNNYTKAIDMWAAGC I LAEMLTGRML FAGAHELEQMQL I LET I P VIR 
EEDKDELLRVMPSFVSSTWEVKRPLRKLLPEVNSEAIDFLEKILTFN 
PMDRLTAEMGLQHPYMSPYSCPEDEPTSQHPFRIEDEIDDIV\LMAA 
NQSQLSNWDTCSSRYPVSLSSDLEWRPDRCQDASEVQRDPRAGSAPL 
AENVQKDPRKDSHSSSERFLEQSHSSMERAFEADYGRSCDYKVGSPS 
YLDK1.LWRDNKPHHY SE PKL I LDLS HWKQAAGAP PTATG\ LADTGAR 
EDEPASLFLE\IAQWVKSTQG\AQSTP\TRPPTTPSAACLPRPP\PP 
GPGGRR\RQPPVRPGRVHLPRPEALHQARGPAGQ 

3785 

A 

3 

495 

GG I PRVFLYELLPES PFFLECNS FTSPDPHKGLVLLPKTECDVREVE 
LMRCLRLRQSSLEPVAFRLPRVRKEFFQDDVFPDTAVIWEPVLSAEA 
WLQGANGQPWLLSLQPPDMSPGAPRDWVWGGSGMGVGERDRRPRLKF 
HS CWHAES SGLWEGG VAVS I S L 

3786 

A 

3 

1356 

GYKLLPCLSRTSAATGHKWIHPRLVAKDCRSHKLHLSSVPVRI*SRV 
LQAEAQK*CQDTQDGGLLPRVPASKNHGPGPFHGP/GPSPGLGQAPS 
ALKHLLQPPNPLAAKVSASCSYMKLRPWPADPWQCWA/SDVAPSTLL 
PS YT)PDTGLVLLTGKGDTRVFLYELLPES PFFLECNS FTS PDPHKGIi 
VLLPKTECDVREVELMRCLRLRQSSLEPVAFRLPRVRKEFFQDDVFP 
DTAVI WEPVLS AEAWLQGANGQPWLLSLQPPDMS PVS Q APREAPARR 
APSSAQYLEEKSDQQKKEELLNAMVAKLGNREDPLPQDSFEGVDEDE 
WAKYLAQI I VMGVQWGRAFARALRQEFAASRAAADARGRAGHRS AA 
ASNLS GLS LQEAQQ I LNVS KLS PEEVQKNYEHLFKVND KSVGGS FYL 
QSKWRAKERLDEELKIQAQEDREKGQMPHT 

3787 

A 

1 

2016 

MLLAGSAGAAGLGCLVPLLDPDSGLLVLAGKGFSQQTGLSSDSFQGE 
RQLYCYEWPQQPALSPVTQCVLESVLRGAALVPRQALAVMSCEVLR 
VLQL S DTA I VP I G YHVPRKAVE FHEDLFPDTAGCVPATD PHS WWAGD 
NQQVQKVSLNPACRPHPSFTSCLVPPAEPLPDTAQPAVMETPVGDAD 
ASPGALTLQHQWHRDQPQFEVAAEPAGSKFRHAQGTVLHRDSHITNL 
KGLNLTTPGESDGFCANKLRVAVPLLSSGGQVAVLELRKPGRLPDTA 
LPTLQNGAAVTDLAWDPFDPHRLAVAGEDAR I RLWRVPAEGLEEVLT 
TPETVLTGHTEKICSLRFHPLASHGL\ASPSYDLTVRIWDLQAGADR 
LKLQGHQDQIFSLAWSPDGQQLATVCKDGRVRVYRPRSGPEPLQEGP 
GPKGGRG A \ AL S G YVMVAVC WCLALTAKVS AS CS YMKLR PWPAD P WQ 
CWA/ SDVAPS TLLPS YDPDTGLVLLTGKGDTRVFL YELL PES PFFLE 
CNSFTSPDPHKGLVLLPKTECDVREVELMRCLRLRQSSLEPVAFRLP 
R I \ RKE FFQDDVFPDTAVI WEPVLSAEAWLQGANGQPWLLSLQPPDM 
S P VSQAPREAPARRAPS S AQYLEEKSDQQKKEELLNAMVAKLGNRE \ 
DPLPQDS\ FEGVDENEWD 

3788 

A 

1 

167 

ILTQCGGFQPTLDKFSSRASEIALAKAKQVINTPRTSSSFNATTVLS 
AFLLGEKS 

3789 

A 

1 

1685 

NTIHDTIELYLSRCWVCCAVSAIARARAALPNIIVGEKGGAGSARSS 
PRRAAAECAQIPQPRRSPAVQAEPARRSRFPPFTLQGAFSSAIIGAG 
IGGTSAAYYLRQKFGKDVKIDLFEREEVGGRLATMMVQGQEYEAGGS 
VIHPLNLHMKRFVKDLGLSAVQASG\GLLGIYNGETL\VFEESNWFI 
INVIKIiWRYGFQSLRMHMWVEDVVDKFMRIYRYQSHDYAFSSVEKL 
LHALGGDD FLGMLNRTLLETL \ QKPGC S E KLLNEM I GPVMRVNYGQS 
TDINAFVGAGSLSCSDSGLWAVEGGNKLVCSGLLQASKSNLISGSVM 
YIEEKTKTKYTGNPTKMYEWYQIGTETRSDFY\DIVLVATPLNRKM 

SNITFLNF\DPPl\EEF\HOYYOHTVTTT.\rKT2T?T MTQTPQCT>DTnVT? 

GLNTVLTTDNSDLFINSIGIVPSVREKEDPEPSTDGTYVWKIFSQET 
LTKAQILKLFLS YDYAVKKPWLAYPHYK/ PPPEKCPLLS FSHDRLYY 
\ LNG I ECAAS A\ MEMS AI AA\ HNAALLAYHRWNGHTDM I DQ\ DGLYE 
KLKTEL 

3790 

A 

52 

264 

QSFAPGPPPFIPPPWGGRAGGFPGPGMG/PPPGPPGGTSSSSKIPKF 
GRPRGGPPFPPLPKRVRPGDFFYP 
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3791 

A 

627 


SHGENCKCPSTGDYRKLIYHLKGCQGLRNKGSSPFTSIVSAAKVCGA 1 
ASESPSVKSLCLLVADQDFSFKAGQWVDFFIPGVSWGGFSICSSPR 
LLEQERVIELAVKYTNHPPALWVHNTCTLDCEVAVRVGGEFFFDPQP 
ADAS RNL VL I AGGVG I NPLL S I LRHAA \ DLLREQ ANKRNG YE IGT I K 

TIIiQCKKYPAELLFKKNILDLVNEFPEKIACSLHVTKTDYTNPMRNS 
KP YI TEGR I TEKE I RDH I S KETLF YI CGPPPMTDFFS KQLENNHVPK 
EHICFEEWVGGRQRQKK 

3792 

A 

331 

758 

' JO 

WKKbl.KiljWJjIjKbHHsrit'SSJSLiTPSLSPSAEAEEDGDLQCLCVKTTS 
QVRPRHITSLEVIKAGPHCPTAQL/ I VSPRTASVSAPAPCLLCPSLP 
aw I isSH VS S L F PARATLKNGR KI CLDLQALL YKKI I KEHIi 

ES 

3793 

A 

3 

402 

HEETQPEFPIALSTEILLEALPQHELRSRVLR/AQRPGLLFLGLLLL 
PLWAFR\RAEAEEDGDLQCLCVKTTSQVRPRHIHQPGR*SKAGP\H 
CPTCPT* * PR* KNGRKI WLGTWQAPLLQGK* FKETFWESLAN | 

3794 

A 

2 

146 

QTTANTiLLKIQCHI IAPPEALTFPLPTWPLEYTQNRKKKGKVGGKGT 
A J 

3795 

A 

3 

1034 

SEFPARTPRRCPRPSPGPAMGLTVSALFSRIFGKKQMPILMVGLDAA 
GK/ TSHILYKLKLGEIVTTIPTIGFNVETVEYKNICFTVWDVGGQ\D 
KI RPLWRHYFQNTQGLI FWDSND\ RERVQESA\ D \ ELQKMLQEDEL 
RDAVLLVFS \NKQDMPNAMP\ VSELT\DKLG\ LQHLRS \ RTWYV\ QA 
rtLiLiF I U^TGFVTMVLDWL, \ SHELFKALNQPWGRPLDARKSRCVQS PT 
MSQNSPTPRGKCPFPPTFSLPHSPQASASAPCSCLHVLSWGAVEPL 
LSGHRGVPLFLPAGTYGRGFAGPRPPSSRGGAGIWVSFFFFCFGCTL 
GGQVGRGKVRAFGWCYNVALDLE | 

3796 

A 

1536 

1838 

GLPQQTLKRIQQVLIKCCIAFYLFLFFFFLRWSLALLPILE\CSGVI 

SAHCNLRLPGLGDSLASASRVAGM\TTGTCHHAQLIFCI FSRDGVSV 
LARTVWIS I 

3797 
3798 

A 
C 

2 

204 

740 
272 

GRVGQSTSKVPKEGLLSLHLLCSTAHYQKTAEMKSIYFVAGLFVMLV 

QGSWQRSLQDTE\EKSRSFSASQ\ADPLSDPDQMNEDKRHSQGTFTS 

DYSKYL\DSRRAQDF\LQW\LMNTKRNRNNIAKRHDEFERHAEGTFT 

&JJVi5b i JjfaCjQAAKEFIAWLVKGRGRRDFPGEGRPI,LKNIiAGRHA\DG 1 

SFSDEMNTILDNLP\AR\DFINWL\IRTKI\TDREITYITYSEIIFT 
NITLLAHVGCLEMFKSW 

3799 

A 

1 

350 

GVLSPPVKSMDLNKPIDKRISKGAQPTCHDFNQFTAATETISLLVGF 

SAGQVQYIiDPlKKDTSKLFNEEVIAAPRAPNFSAFSPTHYMPLGCSC 
LRNSVGLRGAGPGRDGEWGDSS 1 

3800 

A 

1 

397 

QRPI*PTKPIGQRVSKGHQP\TRH\DFTQF\LAAHETISRLVGFSAG 
QV\QYLDLIKK\DPSKLFNEERLIDKTKVTYLKWLPESESLFLASHA 
SGHLYLYNVSHPCASAPPQYSLLKQGEGFSVYAAKYNDLFL 

3801 
3802 

A 
A 

1 
1 

2066 
1155 

MAAGGAEGGSGPGAAMGDCAEIKSQFRTREGFYKLLPGDGAARRSGP 
ASAQTPVPPQPPQPPPQPASASGPGAAGPASSPPPAGPGPGPALPAV 
RLSLVRLGEPDSAGAGEPPATPAGLGSGGDRVCFNUSRELYFYPGCC 
RRGSQRWHTPLTPFLPPLKSIDLNKPIDKRIYKGTQPTCHDFNQFTA 
ATETISLLVGFSAGQVQYLDLIKKDTSKXFNEERLIDKTKVTYLKWL 
PESESLFLASHASGHLYLYNVSHPCASAPPQYSLLKQXAWGFSFYAA 
KSKAPRNPIJUCWAVGEGPLNEFAFSPDGRHLACVSQDGCLRVFHFDS 
MliLRGLMKSYFGGLLCVCWSPDGRYWTGGEDDLVTVWSFTEGRWA 
RGHGHKSWVNAVAFDSLYTTRAEEAATAAGADGERSGEEEEEEPEAA 
GTGSAGGAPLSPLPKAGSITYRFGSAGQDTQFCLWDIiTEDVLYPHPP 
LARTRTLPGTPGTTPPAASSSRGGEPGPGPLPRSLSRSNSLPHPAGG 
GKAGGPGVAAEPGTPFSIGRFATLTLQERRDRGAEKEHKRYHSLGNI 
S RGG S GGSGS GGE KP S G P VPRS RLD PAKVLGTALC PR I HE VPLLEPL 
VCKKI AQERLTVLLFLEDC 1 1 TACQEGL ICTWPRPGKAFTDEETEAQ 
TGEGSWPRSPSKSWEGISSQPGNSPSGTW 

MGGTTLAWSMARDSAGLVAGNLDLSEKHDPRPPPLLHPPGPTAVIAG | 
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DGSFRNPALGKPLTKSQATWMTTLSYDAARRAGGGGDGPGGPEPGWV 
DPRTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCG 
PQVGVGLVPQGGLETSQPEGEAGVGVESNSDGASPEPCTVTPGAVKL 
EKEKLEQNPEEARKVFSQTTICRFEALQLSFKNMCKLRPLLQKWVEE 
ADNNENLQE I CKAETLVQARKRKRTS I ENRVRGNLENLFLQCPKPTL 
QQ I SHI AQQLGLEKDVVRVWFCNRRQKGKRS S SD YAQREDFEAAGSP 
FSGGPVSFPLAPGPHFGTPGYGSPHFTALYSSVPFPEGEAFPPVSVT 
TLGSPMHSN 

3803 

A 

285 

1444 

LLISPGPRLGAPSFPMAGHLASDFAFSPPPGGGGDGPGGPEPGWVDP 
RTWLS F \ QRPPGGPGIGPGVGPGSEVWGI PPCPP \ PYEFCGGMAYCG 
PQVGVGLVPQGGLETSQPEGEAGVGVESNS\DGASPEP\CTVTPGA\ 
VKLEKEKLEQKTPEK\ S QD IKALQKELEQFAKL\ LKQKRITLGYTQAD 
VGLTLG VLFGKVFSQTTI CRFEALQLS FKNMCKLRPLLQKWVEEADN 
NENLQE I CKAETLVQARKRKRTS I ENRVRGNLENLFLQCPETPHVfQQ 
I \ SHI AQQLGLKKHWRWRVWFCN\RRQKGK\RSS SDFA\QREDFE 
AAGSPF\SGGP\VSLFP\LPPGPHFGTPGYGSPHFTA\LYSSVPFP\ 
EGEAFPPVSVTILG\ S PMHSN 

3804 

A 

1 

824 

MVPDER YI TQEGHKLETGAPRP PATVTNAVS WRS EG I KYRKNEVFLD 
VIESVNLLQLEVTFFAGEKRFETETPTVSLTLSCGNPRLIVARATPP 
P FLEQLS QS KRS EAA I RAE I VFANRKCGTPATEALE S VMEAYEQVQK 
GPLKL\KGFAELGVTKRK\ KKK\ KDKDKAKXLEAMGT\ S KKN\ EEEK 
RRGLDKRTPAQAAFEKMQEKRVL I CRNYRGDVDMS EVEHFMP ILMEK 
EEEGMLS P ILAHGG VRFMW I KHNNLYR I PLLGVL PGDS CVGV 

3805 

A 

3 

475 

AASTESGMVAYYQVQKGPLKLKGVAJCLGVTK/RVRPEGPRDSVFIPL 
KPFIX5TPGPDPPTRRLLMFFSRKKKKKDKDKAK\ LLEAMGTSKKNEE 
EKRRGIi\DKRTPAQADFEKMQEKRQMERILMKA\SKPHRQRVEDFNR 
HL \DTLTEHY\ D I PKV\ TWTK 

3806 

A 

3 

225 

GQTSQHHVALRTTQHHWALGQHRVALHTVEYYIALKRKEMDTGC\ KW 
MELEYIVLSGISQSQEDMHCMIPFVSGP 

3807 

A 

874 

1444 

CQARPMPLRAFNPVSSFRWARGMTIVAALMTVFFIMQLVGQVPAALW 
VIFGEDRFRWSATMIGLSLALFGILHALAQAFVTGPATKRFGEKQAI 
IAGMGPNALGYVLIAFATRG\VMAFPIMILLASGGIGMPALQAMWLA 
E * ITDTRANVKRLLLQNVCDLSNNMNGLRFPCFVKSCGTPTSVLTKR 
WH 

3808 

B 

1 

1917 

M I GLS LAVFG I LHALAQAFVTGPATKRFGEKQ AI I AHGG RRAGLR1A 
GVPTRGWMAFPIMILLASGASGCRVAGHAVQAALRAQGLLKEAEHVE 
S QS AETVLTPDECQLLG YLDKGKRKRKE KAG S LQ WAYMA I ARLGG FM 
DSKRTGIASWGAIjCAGAAVLFVKTDLSGAIJ^LQDEAARLSWI*ATTG 
VPCAAVLDVVTEAGRDWLLLGEVPGQDLIiSSHLAPAEKVS I MAD AMR 
RLHTLDPATCPFDHQAKHRI ERARTRMEAGLKDLMRRDQDLI KRQDE 
DRSHDEQDDARRS PAAWWRAYS A I AATTDNSGRP VLFVK^TDLSGALN 
ELQDEAARLSWIATTGVPCAAVLDVVTEAGRDWLLIiGEVPGQDLLSS 
HLAPAEKVS IMADAMRRIjHTLD PATCPFDHQAKHR I ERARTRMEAGL 
VDQDDLDEEHQGLAPAELFARLKARMPDGEDLVVTHGDAACRISWWK 
MAAFIJDSSTVAGWDIALATRDIAEELGGEWADRFLVLYVIjLSGTLGF 
EMTDQATPNLPSRDFDS TAAFYBRLGFG I VFRDAG WMI LQRGDLMLE 
FFAHPGLDPLASWFSCCLRLDDLAEFYRQCKSVGIQETSSGYPRIHA 
PELQE WGGTMAALVR I F VKE P Y FCGVT * 

3809 

A 

1057 

1905 

ALEVLHVLQQPQDLPVLGGQRGLQPRARAQSEVHL I PVS S FRW ARGM 
T IVAALMTVFF IMQL VGQVPAAL WVI FGEDRFRWS ATM IGLS LAVFG 
ILHALAQAFVTGPATKRFGEKQAI I AGMAADALG YVLLAFATRGWMA 
FPIMILLASGGIGMPALQAMQPLAEHIHRVRHLQQPHAAHLGQRWVL 
ATGAHDRAP WEDP WS RMNHR YAS EREATAAAKRLRPE QQHE WS S V 
SWRKVLLEKLNELDADEQADICESLHDHADELYRSCLARFGDDGEN 
L 

3810 

A 

1 

2928 

MTMWVLHCAFTYFVSAWSKHGYAHAVTAKILYAGRI VAG I TGATGAV 


1 011*0 
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AGAY I AD I TDGEDRARHFGLMS ACFGVGMVAGPVAGGLLGAI SLHAP 
FLAAAVLNGLNLLLGCFLMQESHKGERRPMPLRAFNPVSSFRWARGM 
T I VAALMTVFF I MQL VGQVPAAL WV I FGEDR FRWS ATM I GLSLAVFG 
I LHALAQAFVTGPATKRFGEKQAI I AGMAADALGYVLLAFATRGWMA 
FPIMI LLASGG IGMPALQAMLSRQVDDDHQGQLQGSLAALTSLTS 1 1 
GPL I VTA I YAAS AS TWNGLAW I AFAKS FTLPQALRAQGLLKEAEHVE 
SQSAETVLTPDECQLLGYLDKGKRKRKEKAGSPAAWVERLFGYDWAQ 
QTIGCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATT 
GVPCAAVLDVVTEAGRDWLLLGE VPGQDLLS SHLAPAEKVS I MADAM 
RRLHTLD PATCPFDHQAKHR I ERARTRMEAGLVDQDDLDEEHQGLAP 
AELFARLKARMPDGEDLWTHGDACLPNIMVENGRFSGFIDCGRLGV 
ADRYQDIALATRDIAEELGGEWADRFLVLYVLLSGTLGFEMTDQATP 
JNLFoKJJr,Ub ±Af\J? X bRLGrGX VFRDAGWMILQRGDLMLEFFAHFGLD 
PLASWFSCCLRLDDLAEFYRQCKSVGIQETSSGYPRIHAPELQEWGG 
TMAALVR I FVKE P YFCETL VKVKDAEDQLGAR VG Y I ELDLNSGKI LE 
SFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEY 

QPVT'T^'KT-TT ■T r nr?MT , \7T?'PT.P , Q H Zi TTMCTIMTfl 7\>JT T T-TTTPPDVPT T»n T?T . 
Orv i li-rVrlij 1 Ubrl J. Vi\E»±j^£>/4AA in&UPi i A/UV XjXjLi J. 1 X uVjlr JVtS Jj XAf U 

HNMGDHVTRLDR WE PELNEA I PNDERDTTMPAAMATTLRKLLTGELL 
TLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGGRSLWKNA 
SNAAFLRFLAFCWPFAHMFFPALSPDSVDNRITAFE 

3811 

A 

630 

1159 

G APWGQ VPAALWVI FG\ EDRFRWS ATM IGLS LA VFG I LHALAQAF VT 
GPATKRFGEKQAI I AGMAADALG YVLLAFATRGWMAFP I MI LLASGG 
IGMPALQAMLSRQVDDDHQGQLQGSLAALTSLTS I IGPLI VTAI YAA 
SASTWNGLAWIVGAALYLVCLPA\LRRGAWSRATST 

3812 

A 

1 

3525 

MNMN I KKI VKQATVLTFTTALLAGGATQAFAKENNQKA YKETYGVSH 
ITRHDMLQIPKQQQNEKYQVPQFDQSTIKNIESAKGLDVWDSWPLQN 
ADGTVAEYNGYHWFALAGS PKDADDTS I YMFYQKVNHTFIDYRTGR 
LQPAIPQSLPEALRLAADLAEQKQRLEQKMLMDAPKVEFAERVATAR 
KVTi I GNY AKVLGLGQNYL FTWLRDNG I L IATGERRNVPKQEYISRGY 
FTLKETV I DTSNG SR I S FTTR I TG KGQQW LM KRL LDAD WRTTS I VM 
LAK\nTFLSCITMSDFTFSGYEI^CFVTHSGLSRSAGHILSQCANLAA 
TTSEYFIHKPHRLIAAETGYSQSTVVRAFREAVNKGILSVEIVIGDH 
RERRANLYRFTPS FLAFAQQAKNAL I ES KLK I S S AATKVKAVLAKTL 
ALFNFLSTPPCQNDTPSPCQDDVAIKNKKSQVKKTKRSVSGGAGTTS 
LKKLTSWIAKAKAKADl^RLSKKRTQKHEFKQKVEAAARKYAYLKNK 
RS PDIGGI SNFDNLPHC^TVNEALNAVLAKNKDNEQWGAVAGAYI AD 
ITDGEDRARHFGLMSACFGVGMVAGPVAGGLLGAISLHAPFLAAAVL 
NGLNLLLGCFLMQESHKGERRGEEEDMSFFSTLKTALSLKEKLAATG 
VLVLICALVGAGF AWERHQLKQAI EKI GSLDQAVKERDKS IMDLNQT 
IETMNKAEQHFHSQEVKNESEQAKYADRQMERKAEVQKQLVAAGNVR 
QRIPADTQRLLRESISEFNADADKGIFMSDKVTVKQTINKATSIYKI 
c.y ± L V^ivfc'VjoJ&y x KKAr liljADyLGLIGiPDCIEHVFPTYAlJEQCT^ 
TEEDFFSTEEREGVDRCIGVICSSVSDELFPNVPEYGGIGYQFLYEG 
DELKCYEHDIESKAKELTN^SNOT^QPVALMRLGVFVPKPSKSKGES 
KE I DATKAFSQLE IAKAEGYDDI KITGPRLDMDTDFKTW I GVI YAFS 
KYGLS SNTI QLS FQEFAKACGFPSKRLDAKLRLTIHESLGRLRNKGI 
AFKRG KD AKGG YQTGLLKVGRFDADLDL I ELEAD S KLWELFQLD YRV 
LLQHHALRALPKKEAAQAIYTFIESLPQNPLPLSFARIRERLALQSA 
VGEQNRI I KKAIEQLKTIGYLDCS I EKKGRESFVI VHSRNPKLKLPE 

3813 

A 

1 

1606 

MLMKAHKGVT.GEQEKVPRRLMKVLKGVPGEQEKVLWEAREEGVSSRW 

TGSPMGCEQRVWMGDEQAMEPLGLADGKLWRTFMGAADAERKEGRGM 

KVAPAPAWKXQEAKKVVNPLFEK3^PKNFGVGQDIQPKRDLTRFVKW 

PRYIKLQHQRAFLYKRLKVPPAINQFTQALDRQTATQLLELAHKYRP 

ETKQEKKQRLLAQAEEKTAGKGDI STKRLPVLQAGVNWEDKSALAKL 

VEGIRTNYNERYNEIHHHWGGlfVI^PESVARIAKL 

VCKKAWIVSHPVSQRKLPQQKRQNRKKRQEVQAWGSGPCLSLWGSE 
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PATGGS SCSETSGFGKSCKVKETNPQPMAATPTIiBQGEAVAAHAE I P 
PACSHGQVPAALWVIFGEDRFRWSATMIGLSLAVFGILHALAQAFVT 
GPATKRFGEKQAI I AGMAADALG YVLLAFATRGWMAF P I MI LLASGG 
IGMPALOAMLSRO\ LTS I IGPIiIVTAI YAASASTWNGIiAWI VGAALY 
LVCLPALRRGAWSRATST 

3814 

A 

1701 

1904 

VPGLLRDIVHSDSIASHYGVLLALYALMQFLCAPVLGALSDRFGRRP 
VLLASLLGAMSAVLAPGCCF 

3815 

A 

1 

427 

MAXMKNQIIPSVGEKLEQQKPSYDARAFVTGPATKRFGEKQAIIAGM 
ATDALGYVLLAFATRGWMAFPIMILLASGGIGMPALQAMLSRQ\LTS 
IIGPLIVTAIYAASASTWNGLAWIVGAALYLVCLPALRRGAWSRATS 
T 

3816 

A 

1 

1629 

MKSNNALI VI LGTVTLDAVG IGLVMPVLPGLLRD I VHSDS I ASHYGV 
LLALY ALMQFLC APVLG ALSDRFGRRP VLLAS LLG AT I D YAI MATTP 
VLW ILYAGR I VAG ITGATGAVAGAYI ADI TDGEDRARHFGLMS ACFG 
VGMVAGPVAGGLLGAISLHAPFLAAAVLNGLNLLLGCFLMQESHKGE 

PP PMPT .P JiT?MPV<5 ^ PP W ZiPHMTT V J\ AT .*MT r \7T? PTMHT .VfinVP ZV 1T.WVT 

FGEDRFRWSATMIGLSLAVFGILHALAQAFVTGPATKRFGEKQAIIA 
GMAADALGYVLLAFATRGWMAFPIMILIiASGGIGMPALQAMLSRQVD 
DDHQGQLQGSLAALTSLTS 1 I GPLIVTAI YAASAS TWNGLAW I VGAA 
LYLVCLPALRRERSVAAGRFLLVRIAATCRANRAMSSSTTPHSRTAS 
SIENFGIQSLFHLLICDLSTPTASAVFSVPSIAILLSNALSIRRASL 
RLF ART I C S YMQ I ETKEMV I FLRDDG I SLQY PDGQCAG YG IDGNML I 
FMGQSCELFPTKI ILHDQRTTNFTHK 

ion 

TV 

X 

JLU ±J 

GNCYLGRTQPGTRMQWS PMKLS VELRAGQRS LLRE S KGKE I PEPRCA 
AVGGKEPEAKPPKVPRPMPLRAFNPVSSFRWARGMTIVAALMTVFFI 
MQLVGQVPAALWVI FGEDRFRWS ATMI GLS LAVFG I LHAJLAQAFVTG 
PATKRFGEKQAI IAGMAADALGYVLLAFATRGWMAFPIMILI»ASGGI 
GMPALQAMLSRQVDDDHQGQLQG\ SLAALTSLTS I IGPL I VTAI \ YA 
ASASTWNGLAWI * PGMFI I SNPYREHPLSFHRYHYPHEQKSPLHGGI 
SDQTGKNRP 

3818 

A 

1 

2634 

MPLRAFNPVSSFRWARGMTIVAALMTVFF I MQLVGQVPAALWVI FGE 
DRFRWSATMIGLSLAVFGILHALAQAFVTGPATKRFGEKQAIIAGMA 
ADALG YVLLAFATRGWMAF P IM ILLASGG IGMPALQAML SRQVDDDH 
QGQLQGSLAALTSLTSIIGPLIVTAIYAASASTWNGMLLATLWNTYI 
CINEALALTLSDFSLVPPHPYRQLFTLTTFHNRACSSSVTRIVSILS 
RFIGIITPMNRNPPYTEASVTKQEKTAIjNMARFIRSQTLTLLEKIjNE 

ldadeqadicesijfldhadelyrsclarfgddgnsydhpyaklaclqv 
egrgggtfahellaveyagwispafrlkvnqtfidyrtgrlqpaipq 
slpealrijuudlaeqkqrleqkmlmdapkvefaervatasgvligny 
akn^glgqnylftwlrdngiliatgerriwpkqeyisrgyftlketv 
idtsngsrisfttoitgkgqqvnlmkrlldarnrdrykqwkqdffhds 

reavnkg i ls vei vigdhrerranl yrftps flafaqqaknali e s k 

lkissaatkddvaikwkksqvkktkrsvsggagttslkkltswiaka 

kakadl^rlskkrtqkhefkqkveaaarkyay^ 

dnl phcmt ilyagr i vag i tgatgavagayi ad i tdgedrarhfglm 

sacfgvgmvagpvaggllgaislhapflaaavlnglnlllgcflmqe 

SHKGERRGELLKIVVLPGDIWGQEITAEAIIWLKAISDVT^SIWKFDF 
ENHLIGGAAIDLQGGGGKPRCVSKSLMLHCTR 

3819 

A 

1328 

2536 

RSQAPOvlKSKNALIVILGTVTLDAVGIGLVMPVIiPGLLRPIVHSDSI 

ASHYGVLLALYALMQ£LCAPVLGALSDRFGRRP\^1^SLL^ 

I MATTP VLWI L YAGR I VAG I TGATGAVAGAY IAD I TDGEDRARHFGL 

MSACFGVGMVAGPVAGGLLGAISLHAPFLAAAVT^NGLNLLLGCFLMQ 

ESHKGERR PM PLRAFNPVS S FRWARGMT I VAALMTVF F I MQLVGQVP 

AALWVI FGEDRFRWS ATMIGLSLAVFGMLHALAQAFVTGPATKRFGE 
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KQAI I AGMAADALG YVLLAFATRGWMAFP IM I LLASGG IGMPALQAM 
LSRQVDDDHQGQLQGSLAALTSLTSITGPLIWAIYAASASTWNGLA 
W I VG AAL YL VCLPALRRG AWS RATST 

3820 

A 

1 

1594 

MSAAPCSTVPSPIDHPRAEECGHTARDWQVAPPAAQSHDDDEEEEED 
EKEEEEEEEELLLAGSPYSPTVPCFPLECPCLENGSRQKQDKHTPGF 
KHKFQHAFFFSFPLAT I THS FWVAEHSLRLEGKQGAQI SPHLQVGPG 
LVWQGYLWQNREAGSSRDSHLNMPCGMAAGHSHSGTLFSDQGLWEQC 
QLLKSTSVFARCHPLVDPEPFVALCEKTLCECAGGLECACPALLEYA 
RTCAQEGMVLYGWTDHSACSPVCPAGMEYRQCVSPCARTCQSLHINE 
MCQERCVDGCSCPEGQLLDEGLCVESTECPCVHSGKRYPPGTSLSRD 
CNTC I CRNS QW I CSNEE C PDREGHNSRQREVALS AQ VGCNHVT P ALV 
SGQVPAALWVIFGEDRFRWSATMIGLSLAVFGILHALAQAFVTGPAT 
KRFGEKQAIIAGMAADALGYVLLAFATRGWMAFPIMILLASGGIGMP 
ALQAMLSRQ\ LTS 1 1 GPL I VTA I YAAS AS TWNGLAW I VG AALYLVCL 
PALRRG AWS RATS T 

3821 

A 

3 

995 

TS PTATG PALVC YRAS L VALREEDTRSQEYVP FRGNRDFAHVKKMGG 
LGLLAMDVPEELGGAGLDYLAYAIAMEEISRGCASTGVIMSVNNSLY 
LGPILKFGSKEQKQAWVTPFTSGDKIGCFALSGTRAFNPVSSFRWAR 
GMTIVAALMTVFFIMQLVGQVPAALWVIFGEDRFRWSATMIGLSLAV 
FG I LHALAQAFVTGPATKRFGE KQAI IAGMAADALGYVLLAFATRGW 
MAFPIMILLASGGIGMPALQAMLSRQVDDDHQGQLQGSLAALTSLTS 
I IGPL I VTA I YAAS AS TWNGLAW I VGAAL YLVCLPA\ LRRG AWS RAT 
ST 

3822 

A 

1 

619 

MPLRAFNP VS S FRWARGMT I VAALMTVFFI MQL VGQ VP AALWV IFGE 
DR FR WS ATM I GL S LAVFG I LHALAQAFVTG P ATKRFGEKQ A I IAGMA 
AD ALG YVLLAFATRGWMAFP I M I LIiASGG I GMPALQAML S RQVDDDH 
QGQLQGSLAALTSLTS 1 I GPL IVTAI YAAS AS TWNGLAW I VGAAL YL 
VCLPA\ LRRGAWSRATS T 

3823 

A 

1 

565 

MPLRAFNPVSSFRWARGMTIVAALMTVFFIMQL VGQ VPAALWVIFGE 
DR FRWS ATM I GLSLAVFG I LHALAQAFVTG PATKR FGE KQAI I AGMA 
AD ALG YVLLAFATRGWMAFP I M I LLASGG I GM PALQ AML S RQ \ LTS I - 
IGPL I VTAI YAAS AS TWNGLAW I VGAAL YL VCLPALRRG AWSRATST 

3824 

A 

1 

934 

MTMWVLHCAFTYFVSAWSKHGYAHAVTAKILYAGRIVAGITGATGAV 
AGAYI ADITDGEDRARHFGLMSACFGVGMVAGPVAGGLLGAI SLHAP 
FLAAAVLNGLNLLLGCFLMQESHKGERRPMPLRAFNPVSSFRWARGM 
TIVAALMTVFFIMQLVGQVPAALWVIFGEDRFRWSATMIGLSLAVFG 
I LHALAQAFVTG PATKR FGE KQAI IAGMAADALGYVLLAFATRGWMA 
FPIMI LLASGG IGMPALQAMLSRQ\ LTS I IGPL I VTAI YAASASTWN 
GLAW I VG AAL YL VCLPALRRG AWS RATS T 

3825 

A 

657 

7035 

LRTCWHSDCQNTAADHSAQGSGRRHRKPCNRCAPGSQNWAVPLLPDI 
SCRTGCAVRKDRLIAGPIQ*N/LKTAWGRQPYLTKLCLMLGCLGRM 
RWRCSTGWLRLRTSSVQCS*RISNRMSCTGSMLTTWL*MKLQQVNGL 
GQGL IQS AGKDCDVQGLEHDMEE INARWNTLNKKVAQR I AQLQEALL 
HCGKFQDALEPLLSWLADTEELIANQKPPSAEYKWKAQIQEQKLLQ 
RLIJDDRKATVDMLQAEGGRIAQSAELADREKITGQLESLESRWTELL 
SKAAARQKQLEDILVL7VKQFHETAEPISDFLSVTEKKLANSEPVGTQ 
TAKI QQQII RHKALEED I ENHATDVHQAVKIGQSLS SLTS PAEQGVL 
SEKIDSLQARYSEIQDRCCRKAALLDQALSNARLFGEDEVEVLNWLA 
EVEDKLS S VFVKDFKQD VLHRQHADHLALNEE I VNRKKNVDQAIKNvj 
QALLKQTTGEE VLL I QE KLDG I KTR YAD I TVTS S KALRTLEQ ARQLA 
TKFQSTYEELTGWLREVEEELATSGGQS PTGEQI PQFQQRQKELKKE 
VMEHRLVLDTVNEVSRALLELVPWRAREGLDKLVSDANEQYKLVSDT 
IGQRVDEIDAAIQRSQQYEQAADAELAWVAETKRKLMALGPIRLEQD 
QTTAQLQVQKAFSIDIIRHKDSMDELFSHRSEIFGTCGEEQKTVLQE 
KTESLIQQYEAISLLNSERYARLERAQVLVNQFWETYEELSPWIEET 
RALIAQLPSPAIDHEQLRQQQEEMRQLRESIAEHKPHIDKLLKIGPQ 
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LKELNPEEGEMVEEKYQKAENMYAQIKEEVRQRAIjALDEAVSQSTQF 

HDKIEPMLETLENLSSRLRMPPLIPAEVDKIRECISDNKSATVELEK 

LQPS PEALKRRGEELIGRSQGADKDLAAKE IQDKLDQMVFFWEDI KA 

RAEERE I KFLDVLELAEKFWYDMAALLTTIKDTQD I VHDLES PGIDP 

SIIKQQVEAAETIKEETDGLHEELEFIRILGADLIFACGETEKPEVR 

KSIDEMNNAWENI^KTWKERLEKLEDAMQAAVQYQDTLQAMFDWLDN 

TVIKLCTMPPVGTDLNTVKDQLNEMKEFK^VTQQQIEMEKLNHQGE 

LMLKJCATDETDRDIIREPLTELKHLWENLGEKIAHRQHKLEGALLAL 

GQFQHALEELMS WLTHTEELLDAQRP I SGDPKVI EVELAKHHVLKND 

VLAHQATVETVNKAGNELLESSAGDDASSLRSRLEAMNQCWESVLQK 

TEEREQQLQSTLQQAQGFHSEIEDFLLELTRMESQLSASKPTGGLPE 

TAREQLDTHMELYSQLKAKEETYNQLLDKGRLMLLSRDDSGSGSKTE 

QSVALLEQKWHWSSKMEERKSKIiEEALNIiATEFQNSLQEFINWLTIi 

AEQSLNIASPPSLILNTVLSQIEEHKVFANEVNAHRDQIIELDQTGN 

QLKFLSQKQDWIjI KNLLVS VQSRWEKWQRS IERGRSLDDARKRAK 

QFHEAWKKLIDWLEDAESHLDSELEISNDPDKIKLQLSKHKEFQKTL 

GGKQPVYDTTIRTGRALKEKTLLPEDSQKLDNFLGEVRDKWDTVCGK 

SVERQHKLEEALLFSGQFMDALQALVDWLYKVEPQLAEDQPVHGDLD 

LVMNLMDAHKVFQKSLGKRTGTVQVLKRSGRELIENSRDDTTW 

LQELSTRWDTVCKLSVSKQSRLEQALKQAEVFRDTVHMLLEWLSEAE 

QTLRFRGALPDDTEALQSIiIDTHKEFMKKVEEKRVDVNSAVAMGEVI 

LAVCHPDC I TT I KHWI T 1 1 RARFEEVLTWAKQHQQRLETALS EL VAN 

AELLEELLAW I Q WAETTL I QRDQE P I PQN I DRVKAL I AEHQTFME EM 

TRKQPDVDRVTKTYKRKNIEPTHAPFIEKSRSGGRKSLSQPTPPPMP 

ILSQSEAKNPRINQLSARWQQWLLALERQRKLNDALDPSIVSPKCR 

RPGGRQAYPLVNSGGSKRQIAKNRGATQAFAKENNQKAYKETYGVSH 

I TRHDMLQI PKQQQNEKYQVPQFDQSTI KNI ES AKGLDVWDS WPLQN 

ADGTVAEYNGYHWFALAGSPKDADDTSIYMFYQKVGDNSIDSWKNA 

GRVFKDSDKFDANDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKH 

YGKQS LTTAQVNVS KSDDTLKINGVEDHKTI FDGDGKTYQNVQQFID 

EGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTENGYQGEESLFNK 

AYYGGGTNFFRKESQKLQQSAKKRDAELANGALGI IELNNDYTLKKV 

MKPLITSNTVTDEIERANVFKMNGKWYLFTDSRGSKMTIDGINSNDI 

YMLGYVSHSLTGPYKPLNKTGLVLQMGLDPNDVTFTYSHFAVPQAKG 

NNW I TS YMTNRG FFEDKKATFAPS FLMN I KGNKTS WKNS I LEQGQ 

LTVN j 

3826 

A 

140 

2298 

LFPRLLSFLTTPPHCSFSICFVICSRTLILKGSSLITG/CTQFRLSE 
TKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQR 
MPDFLPPLPAGALSLISKVTDRANDSMEQGAETSQGALGTLANVVTS 
LANLSESLNNGDTSEIQPEDQSASEITRAFDTLAJCALNTTDSSSSPS 
LADGIDTSGGGSIHVISRDQSTPI IEVEGPLLSDTHVTFKS IREDRN 
GR S QKTVHTEGDMNMN I KKI VKQATVL.TFTTALLAGG ATQAFAKENN 
QKAYKETYGVSHITRHDMLQIPKQQQNEKYQVPQFDQSTIKNIESAK 
GLDVWDS WPLQNADGTVAE YNG YHVVFALAGS PKDADDTS I YMFYQK 
VGDNSIDSWKNAGRVFKDSDKFDANDPILKDQTQEWSGSATFTSDGK 
IRLFYTDYSGKHYGKQS LTTAQVNVS KSDDTLKINGVEDHKTI FDGD 
GKTYQNVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKYLVFEANTGTE 
NG YQGEE S LFNKAYYGGGTNF FRKE SQKLQQS AKKRDAELANGALG I 
I ELNND YTLKKVMKPL I TSNTVTD E I ERANVFKMNGKWYLFTDS RG S 
KMT IDG I NSND I YMLG YVSNS LTG P YKPLNKTGLVLOMGLDPNDVTF 

TYSHFAVPQAKGNNWITSYMTNRGFFEDKKATFAPSFLMNIKGNKT 
S WKNS I LEQGQLTVN 

3827 

A 

1 

2604 

MPLRAFNPVSSFRWARGMTIVAALMTVFFIMQLVGQVPAALWVIFGE 
DRFRWS ATMIGL S LAVFG I LHALAQAFVTGPATKRFGE KQAI I AGMA 
ADALG YVLLAFATRGWMAFP I M I LLASGG IGMPALQAML S RQVDDDH 
QGQLQGSLAALTSLTS I IGPLI VTAI YAASASTWNGNRACS S S VTRI 
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VSILSRFIGIITPMNRNPPYTEASVTKQEKTALNMARFIRSQTLTLL 
EKLNELDADEQADICESIiHDHADELYRSCLARFGDDGNSYDHDYAKL 
ACLQ VEGRGGGTFAHELLAVE YAGW I S PAFRLKVNQTF ID YRTGRLQ 
PAIPQSLPEALRLAADLAEQKQRLEQKMLMDAPKVEFAERVATASGV 
LIGNYAKVLGLGQNYLFTWLRDNGILIATGERRNVPKQEYISRGYFT 
LKETVIDTSNGSRISFTTRITGKGQQWLMKRLLDARNRDRYKQWKQD 
FFHDSYNRQSAGHILSQCANLAATTSEYFIHKPHRLIAAETGYSQST 
WRAFR E AVNKG I L S VE I V I GDHRERRANL YRFTP S FLAF AQQAKNA 
LIESKLKISSAATKVKAVLAKTLALFNFLSTPPCQNDTPSPCQDDVA 
IKNKKSQVKKTKRSVSGGAGTTSLKKLTSWIAKAKAKADNLRLSKKR 
TQKHEFKQKVEAAARKYAYLKNKRSPDIGGISNFDNLPHCMTVNEAL 
NAVLAKNKDNEQWGAVAGAY IAD I TDG EDRARH FG LMS AC FGVGMVA 
GPVAGGLLGAISLHAPFLAAAVLNGLNLLLGCFLMQESHKGERRGEL 
LKIVVIiPGDHVGQEITAEAIKVLKAISDVRSNVKFDFENHLIGGAAI 
DLQGGGGKPRCVSKSLMLHCTR 

3828 

C 

1 

2076 

MNMN I KKI VKQATVLTFTTALLAGGATQAFAKENNQKAYKETYGVS H 

I TRHDMLQ I PKQQQNEKYQVPQFDQS TI KNIES AKGLDVWDS WPLQN 

ADGTVAE YNG YHVVFALAGS PKDADDTS I YMF YQKVGDNS IDSWKNA 

GRVFKDSDKFDANDPILKDQTQEWSGSATFTSDGKIRLFYTDYSGKH 

YGKQSLTTAQSIREDRNGRSQKTVHTERDMIMNIKKIVKQATVLTFT 

TALLAGGATHAF AKENNQKA YKETYGVSH I TRHDMLQ I PKQQQNEKY 

QVPQFDQS TI KN IESAKGLDVWDS WPLQNADGTVAE YNG YHVVFALA 

GS PKDADDTS I YMFYQKVGDNS IDSWKNAGRVFKDSDKFDANDPILK 

DQTQEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDD 

TLKINGVEDHKTIFDGDGKTYQNVQQFIDEGNYTSGDNHTLRDPHYV 

EDKGHKYLVTEANTGTENGYQGEESLFNKAYYGGGTNFFRKESQKLQ 

QSAKXRDAELANGALGIIELNNDYTLKKVMKPLITSNTVTDEIERAN 

VFKMNGKWYLFTDSRGSKMTIDGINSNDIY>a»GYVSNSLTGPYKPLN 

KTGLVLQMGLDPNDVTFTYSHFAVPQAKGNNWITSYMTNRGFFEDK - 

KATFAPSFLMNIKGNKTSWKNSILEQGQLTVN* 

3829 

A 

1 

3603 

MKLMETLNQCINAGHEMTKAIAIAQFNDDSPEARKITRRWRIGEAAD 
L VG VS S QA I RD AE KAGRL PH PDME I RGR VEQR VG YT I EQ INHMRD VF 
GTRLRRAEDVFPPVIGVAAHKGGVYKTSVSVHLAQDLALKGLRVLLV 
EGNDPQGTASMYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCW 
PGLDIIPSCIiALHRIETELMGKFDEGKLPTDPHLMLRLAIETVAHDY 
DVIVIDSAPNLGIGTINVVCAADVLIVPTPAELFDYTSALQFFDMLR 
DLLKNVDLKGFEPDAEEIAKFLGLDKRNFKATRGSATLHHWQQLAQP 
HLGGILDPRPGWTKGFRTLDVDLDEVYCLNDFEEDDTATSTPVQHP 
ETSGERSQARVTVSGSRSYPSRPQASPEEMQEPPAATEEEEEEEEEE 
GSVLCAERVGQMTKTYNDIDAVTRLLEEVASRFWVIJVLAI^AAGEAGV 
HFLCPVGKALQDIPASLLPGSVWNDSHICLGPGPLPNELICPWSPKL 
IRGFSYNLGCGPQMGKGQRLPHRLNAGIPSNSHGLENLHEVLVLCVW 
QAGVAQEETWHSASQLLLFQLKLVLKGAKECACTASEHLYWGPLGSV 
NGSPQGQQRTPISRNISWQALDTHLEILLLTICAAGTLMSFLFSDLW 
VLMEAMESRRRPKEGRQVSKVLKPE I WPFQNTTRTPGPVGSPHVTTD 
CQDDAAPPFLETQQRDSESPKTGI IRGSSCLLQRSNGLI PKALGAGG 
ATQAFAKENNQKAYKETYGVSH I TRHDMLQ I PKQQQNEKYQVPQFDQ 
ST I KNIES AKGLDVWDS WPLQNADGTVAE YNG YHVVFALAGS PKDAD 
DTS I YMFYQKVGDNS IDS WKNAGRVFKDSDKFDANDPILKDQTQEWS 
yj^ri j. r x oL/oi\AK.ur jl xljx oVjAliXLjAyolj J. J.AQ VNVSKSDJDTLKINGV 
EDHKT I FDGDGKTYQNVQQF IDEGNYTSGDNHTLRDPHYVEDKGHKY 
LVFEANTGTENGYQGBfeSLFNKAYYGGGTNFFRKESQKLQQSAKKRD 
AELANGALG 1 1 ELNND YTLKKVMKPL ITSNTVTDE I ERANVFKMNGK 
WYLFTDSRGSKMTIDGINSNDIYMLGYVSNSLTGPYKPLNKTGLVLQ 
MGLDPNDVTFTYSHFAVPQAKGNNWITSYMTNRGFFEDKKATFAPS 
FLMNI KGNKTS WKNS I LEQGQLTVN | 
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3830 

A 

2455 

3151 

DRRCPIFCLLPFGIRFILLFGSSLQRGPFIILFFFFLIFFFPFIFLL 
SILGSFFFFLLSPVLLLLHHFFLSLFPACLLLHLLLLLLQVLGAQLG 
RCLHA/SSPTWCRGRGR/VLAGSRGPGAETEGPWTAEGGRGRNLARW 
RRRRRQRQRRPLGKLGLWKRRQRRAGWGGGGSGAETRRAAAVRRAGG 
RAVQAVAGALGRGS S I PGS PPPHQQSAADQPCRPREGNPLIRRRLP 

3831 

A 

1 

2376 

MKLMETLNQC INAGHEMTKAI A I AQFNDD S PEARKI TRRWR IGEAAD 
L VG VS S QA I RD AE KAGRL PHPDME I RGR VEQR VG YT IE Q I NHMRD VF 
GTRLRRAEDVFPPVIGVAAHKGGVYKTSVSVHLAQDL7LLKGLRVLLV 
EGNDPQGTASMYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCW 
PGLDIIPSCLALHRIETELMGKFDEGKLPTDPHLMLRLAIETVAHDY 
D* FVQLYNQHVAQPVL/ G I S ISGCGSRPAFS ASLIA* EETPTKSAAS 
PILQRRVIFLASGLSSLNCAMAIAFVIS 

3832 

A 

1289 

3109 

ETPYVSIiYAFWLFSFAKA*VAPPSGFSPTWSSDSWLGSLVPYTLIA 
RT/ PEL I FWRQQVLD VERGAAACG * FLPRLVG * VPDLDAVGL YSTA 
P 1 1 * G A I PG QGQ * G \ LRMS E AQG VP AGG S GLS TTVRLTWACLL PS LF 
STS S V * QPQCS WRAARMVSLLPFSLFSTFMCSLS WICFRNFIQVQVM 
G* VPVTLPNSSTDS PARTFWSSGNWM I * GG I AAFGGVLG AG FFFTVA 
S / PLS S * HGRKRALGPPGFHLWGLGVLTGWS I IHGDLAGALRLAS VI 
LGYTLV*ASVLRQCLLDGDG\QSEPSWERMMNLVVLMVFPFSVQMMV 
AGGSEDTWHWSSSFLPSAT*TSCSFCLKAGAVPWLSDLFSLSWPAW 
PLLQFTSFLTSFLSSSLLADFRFSSGLALPMGFRVSAGLALPIGFRV 
SAGLALPTGFKGPEGCT^LRGLELSTALAFRVSALLLPFSAGNFFLP 
PSTERKSGVAGFGGGTFSGTGVLEVPFLARTERKSTCWGLCTFLSSS 
DTVFGFTCRCRLARKSICSAGISFRSSSDRVFVLTFFPRRSRKSSCS 
TSCWRIASSVCCLVSTRLFSTPRTSSPSSSSSQPCPLAGQPGLRDP 

3833 

A 

1 

2968 

MTIVAALMTVFFIMQLVGQVPAALWVIFGEDRFRWSATMIGLSLAVF 

GILHALAQAFVTGPATKRFGEKQAI IAGMAADALGYVLLAFATRGWM 

AFPIMILLASGGIGMPALQAMLSRQPNFDHWTADRHGDLCRLGEHME 

RVGMDCRRRPIPCLPPRVASRCMEPGHLDMNGSRRHLANGFTTPRIG 

ANQFLRRTVNAQTNPWQNISIASAISSSRTRRISGSVGSWPRSGNAE 

VSALHHYVPDLHRRMLLATLWNTYICINEALALTLSDFSLVPPHPYR 

QLFTLTTFHNRACSSSVTRI VS ILSRFIGI ITPMNRNPPYTEASVTK 

QEKTTUjNMARFIRSQTLTLLEKIjNELDADEQADICESLHDHADELYR 

SCLARFGDDGENSDTCSSRRRSQLVCKRMPGADKPLNQTFIDYRTGR 

LQPAIPQSLPEALRIJUU)LAEQKQRLEQKMLMDAPKVEFAERVATAS 

GVLIGNYAJCVLGLGQNYLFTWLRDNGILIATGERRNVPKQEYISRGY 

FTLKETVIDTSNGSRISFTTRITGKGQQWLMKRLLDAGVLVPVAATL 

TDVVRTTSIVMLAKVTFLSCITMSDFTFSGYELACFVTHSGLSRSAG 

HILSQCANLAATTSEYFIHKPHRLIAAETGYSQSTVVRAFREAVNKG 

ILSVEIVIGDHRERGANLDRVTPSFLALAQQAKSALTESKLKISSAA 

TKI LYAGR I VAG I TGATGAVAGAYI AD ITDGEDRARHFGLMS ACFGV 

GMVAGPVAGGLLGAISLHAPFLAAAVLNGLNLLLGCFLMQESHKGER 

RPMPLRAFNPVSSFRWARGMTIVAALMTVFFIMQLVGQVPAALWVIF 

GEDRFRWSATMIGLSLAVFGILHALAQAFVTGPATKRFGEKQAIIAG 

MAADALGYVLLAFATRGWMAFPIMILLASGGIGMPALQAMLSRQ\LT 

S I IGPLI VTAI YAASASTWNGLAWIVGAALYLVCLPALRRGAWSRAT 
ST 

3834 

A 

1 

563 

MPLRAFNPVSSFRWARGMTIVAALMTVFFIMQLVGQVPAALWVIFGE 
DR FRWS ATM I GLS LAVFG I LHALAQAFVTG PATKR FGEKQA 1 1 AGMA 

ADALGYVXiLAFATRGWMAFPTMTT.T aCrPTPMDTiT r\7vMT r>nM t mn -r 

* xjx^m. ** • xvvj iff i Lt\r c xrJXijixfiooLjJLijMi^AijQAMLiSRQxDTSI 
IGPLIVTAIYAASASTWNGLAWIVGAALYLVCLPALRRGAWSRATST 

3835 

A 

850 

1280 

PGCIRCKCVAVDGDPCMKSNNALIVILGTVTLDAVGIGLVMPVLPGL 

LRD I VHSDS I ASHYGVLLALYALMQFLCAPVLGAIiSDRFGRRPVLLA 

SLLGATIDVSCPCSRMLFLTTDVLLPLMFIKKLGANVAFLSSKKPLF 
VM 

3836 

A 

3 

3971 

VCTGSSTRHIVTFDGQNFKLTGSCSYVLFQNKEQDLEVILHNGACSP 
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GARQGCMKS IEVKHSALS VELHSDMEAFRFLVHGHPYS I TTACFLVK 
LCKVKLCFLVKIiFFILEELPTNIKVTVNGRLVSVPyVGGNMBVNVYG 
AIMHEVRFNHLGHIFTFTPQNNEFQLQLSPKTFASKTYGLCGICDEN 
GANDFMLRDGTVTTDWKTLVQEWTVQRPGQTCQPILEEQCLVPDSSH 
CQVLLLPLFAECHKVLAPATFYAICQQDSCHQEQVCEVIASYAHLCR 
TNGVCVDWRTPDFCAMSCPPSLVYimCEHGCPRHCDGNVSSCGDHPS 
EGCFCPPDKVMLEGSCVPEEACTQCIGEDGVQHQFIiEAWVPDHQPCQ 
ICTCLSGRKVNCTTQPCPTAKAPTCGLCEVARLRQNADQCCPEYECV 
CDPVSCDLPPVPHCERGLQPTLTNPGECRPNFTCACRKEECKRVSPP 
SCPPHRLPTLRKTQCCDEYECACNCVNSTVSCPLGYLlASTATNDCGC 
TTTTCLPDKVCVHRSTIYPVGQFWEEGCDVCTCTDMEDAVMGLRVAQ 
CSQKPCEDSCRSNTVPYKLLGKCQVTSWGSPNLHVGEQYWDRVFVLY 
GRLALREWRLDPSHPECASRASLTFCMKASAVEGACHLPVGSQWASP 
ENPCLINECVRVKEEVFIQQRNVSCPQLEVPVCPSGFQLSCKTSACC 
PSCRCERMEACMLNGTVIGPGKTVMIDVCTTCRCMVQVGVISGFKLE 
CRKTTCNPCPLGYKEENNTGECCGRCLPTACTIQLRGGQIMTLKRDE 
TLQDGCDTHFCKVNERGEYFWEKRVTGCPPFDEHKCLAEGGKIMKIP 
GTCCDTCEE PE CND I TAR LQ YVKVG S C KS E VEVD IH Y CQGKCAS KAM 
YS IDINDVQDQCSCCS PTRTEPMQVALHCTNGS VVYHEVX*NAMECKC 
SPRKCSKGDTGIFLKWSCIPAEKGLQTSGAWVEAADIPLLPGSPRRL 
SPQAGSRGGQGPKHGQQCLKMPGPRAPGLQGGSNRDPGQPCGGESTR 
SSSVINNYLDANEPVSLEARLSRMHFHDSQRKVDYVLAYHYRKRGVH 
LAQGFPGHSLAIVSNGETGKEPHAGGPGDIELGPLDALEEERKEQRE 
EFEHNLMEAGLELEKDLEWQSLDVWHLNLNCSTESKGDILGQVPAA 
LWVIFGEDRFRWSATMIGLSLAVFGILHALAQAFVTGPATKRFGEKQ 
AIIAGMAADALGYVLLAFATRGWMAFPIMILLASGGIGMPALQAMLS 
RQ\ LTS I IGPLI VTAI YAAS ASTWNGLAWI VGAALYLVCLPALRRGA 
WSRATST 

3837 

A 

1838 

4062 

KRHKKPFKKSTNPGAGFFEKINKIDRLLARLIKKKREKNQIDAIKND 
KGDITTDPTEMQTTVREYYKHLYVNKPENLEEMDKFLDTYTLPRLNQ 
EELESLNKPITGSEIEAIINSLPTKKS PGPDGFTAEFYQRYKEELLP 
FLLKLFQS IEKEGILPNSFYEAS I ILI PKPGRDTTKKENFRPISLMN 
IDANILNNILANQIQQHIKKLIHHDEVGFIPGMQGWFNIRKSINLIP 
HVNRTKDKNHMVISVDAEKAFDKIQQHFMLKTLNKIiGIDGTYLKIIR 
PIFDKPTADIILNRQKLEALPLKTGTRQRCSLSPLLFNIVLEVLARA 
IRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLIGNF 
SKVSEYTINVQKSQSFLYTNNRQTESQIMSELPFTIASKRIKYLGIQ 
LTREVKDFFKENYKPLLDEIKENTNKWKNIPWSWVGRIDIVKMAILP 
KQMPDVNVSWDGEGPKQLPFIDISVAVATDKGLLTPIIKDAAAKGIQ 
E I ADS VRSNI VLQCQARPMPLRAFNPVS S FRWARGMTI VAALMTVFF 
IMQLVGQVPAALWVIFGEDRFRWSATMIGLSLAVFGILHALAQAFVT 
GPATKRFGEKQAIIAGMAADALGYVLLAFATRGWMAPPIMILLASGG 
IGMPALQAMLSRQVDDDHQGQLQGSLAALTSLTS I IGPI* IVTAIYAA 
SASTWNGLAWIVGAALYLVCLPA\LRRGAWSRATST 

3838 

A 

346 

777 

GELYFQVCGQFWNRCGVVMKKMYILLFLSGEFCRCL*YSLMVVCISV 
GSWISPLSFFIVSI*FFSLFFFISLASSLSILLILSKNQLLDSLIF 

3839 

A 

1 

564 

MPLRAFNPVS S FRWARGMTI VAALMTVFF I MQLVGQVPAALWVI FGE 
DRFRWSATMIGLSLAVFGILHALAQAFVTGPATKRFGEKQAIIAGMA 
ADALGYVLLAFATRGWMAFPIMILLASGGIGMPALQAMLSRQPNFDH 
WTADRHGDLCRLGEHMERVGMDCRRRPIPCLPPRVASRCMEPGHLDD 
NGSRRHLANGFTTPRIGANQFLRRTVNAQTNPWQNISIASAISSSRT 
RRISGSVGSWPRSGNAEVSALHHYVPDUTRRMLLATLWNTYICINEA 
LALTLSDFSLVPPHPYRQLFTLTTFHNRACSSSVTRIVSILSRFIGI 
ITPMNRNPPYTEASVTKQEKTALNMARFIRSQTLTLLEKXiNELDADE 
QADICESLHDHADELYRSCLARFGDDGLGGGQSCRTVLRERVRIEIA 
STHI ALQHAI VTGDAVGMDD I PQEARQYRHNQAYAYS IQGDGAEDDD 
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ERIFRTXKEYLSHEQDMLRLRIMIWRTLATDTFDIALPVNQSFDVWA 
TIIRGKFQTVYRDIIERVKSSDSCDVSGSEVLKFHVIRTPSTSQAPC 
MKSNNALIVILGTVTLDAVGIGLVMPVLPGLLRDIVHSDSIASHYGV 
LLALYALMQFLCAPVLGAIjSDRFGRRPVIjLASLLGATIDYAIMATTP 
VLWICLAGLPQFFNIiPAQLFVLNFSIPFGIPFYDGGRVIKHLITLAT 
ASQNGHSLFLPVtiNESLAIKDWHRLGYAGTAGPLAGYRPFRQHRQSL 
WRAVALYALMQFLCAPVLGALFDRFAAAKSCNFATWSHYRLRDHATT 
PVLWICLAGLPQFFNLPAQLFVLNFSIRFGIPFYDGGRVIKHIilTLA 
TASQNGHSLFLPVLNGSKALEYSNGIFDCQSPTSPFMGSLRALHLVE 
DLRGLLEMMETDEKEGLRCQ1 PDSTAETLVEWLQS QMTNRR I LNS SR 
TKKINIMFIPYFKIHCLHSALGVAETEKETAEHLDLAGASSRPKDSQ 
RNSPFQIPPPSPDSKKKSRGIMKLFGNIjGPYGGHRRSNQRAYTSRPF 
LFISSNIGSSMSMRQPGSPGCRMCSEAAKRSYWRSKTRYMFLIALWP 
CLYRQCYFSVNRALWPRRTFSSVFQRVFDQRFNVGINTRFNFELIGT 
DGYLVLRWLITLDTK7U>WDVGAVAGAYIADIPDGEDRARHFGLMSA 
CFGVGMVAGPVAGGLLGAISLHAPFLAAAVLNGLNLLLGCFLMQESH 
KGERRPMPLRAFNPVSSFRWARGMTIVAALMTVFFIMQLVGQVPAAL 
WVI FGEDRFRWS ATM IGLSLAVFG I LHALAQAFVTGPATKRFGEKQA 
1 1 AGMAADALG YVLLAFATRGWMAFP I M I LLASGG I GMPALQAMLS R 
Q\ LTS I IGPLI VTAI YAASASTWNGLAWI VGAALYIiVCLPALRRGAW 
SRATST 

3840 

A 

3 

1049 

GSLQGCAASCARAFLRTCCARSESGLAVRGTKGVAFGENPRAAGTCS 
QSPFFPVAPPPGVRLWVLCPDRFTGAFLLRTTLALNVRAGDEGPRRL 
LRAPRVPSRPSAPLAPAPASTPARRLPPVPMQAPGRGPLGLRLMMPG 
RRGALREPGGCGSCLGVALALLLLLLPACCPVRAQNDTEPIVLEGKC 
LWCDSSPSADGAVTSSLGISVRSGSAKVAFSATRSTNHEPSEMSNR 
TMTIYIDQWSNIGHHFDLASSIFVAPRKGIYSFSFHWKVYNRQTI 
QVSLIA LNGYPVI SAWAG YQDVTGEAASNGVLLLMEREDKVHLKLER 
GNLMGGWKYSTFSGFLVFPL 

3841 

A 

1 j 

401 

KGASSPVQVPGPVAASTEAiLQEAQCSGLSWWALPQVKQEKADTQD 
EWTPGTAVLTSPVLVPGCPSKAVDPGLPSVKQEPPDPEEDKEENKDD 
SAS KLAPEEE ITE I FSLGG PRFRDTAV* LPRS KDLKKPR 

3842 

C 

108 

2 93 

MNMACLGCFWALIILPFSNPAPPPLKSNFLYSGSWEEKMHQLSTLS 
LPFPLTSSRSSKYS* 

3843 

A 

1 

1490 

MIREQKIYHKYToAQRREEEKAQEKJEFDRILEEDKAKKLAEKDKELRL 
EKEARRQLVDEVMCTRKLQVQEKSQYVKLLLAAHPTGDRIRSKVELT 
RYIiGPACDLTLFDFKQGILCYPAPKAHPVAVASKKRKKPSRPAiCTRK 
RQ VGPQSGE VRKEAPRDETKADTDTAPAS F PAPGCCENCG I S FSGDG 
TQRQRLKTLCKDCRAQRIAFNREQRMFKQPYTNRRQNRKCGACAACL 
RRMDCGRCDFCCDKPKFGGSNQKRQKCRWRQCLQFAMKRLLPSVWSE 
SEDGAGSPPPYRRRKRPSSARRHHLGPTLKPTLATRTAQPDHTQAPT 
KQEAGGGFVLPPPGTDLVFLREGAS S PVQVPGPVAASTEALLQEAQC 
SGLSWWALPQVKQEKADTQDEWTPGTAVLTSPVLVPGCPSKAVDPG 
LPSVKQEPPDPEEDKEENKDDSASKLAPEEEAGGAGTPV/VTSVDSI 
YPERQL* VSSWSYLHSCGTERYPCGS I 

3844 

A 

1 

1758 

MAEDWLDCPALGPGWKRREVFRKSGATCGRSDTYYQSPTGDRIRSKV 
ELTRYLGPACDLTLFDFKQGILCYPAPKAHPVAVASKKRKKPSRPAK 
TRKRQVGPQSGEVRKEAPRDETKADTDTAPASF PAPGCCENCG I SFS 
GIX5TQRQRLKTLCKDCRAQRIAFNREQRMFKRVGCGECAACQVTEDC 
GACSTCIiLQLPHDVASGLFCKCERRRCLRIVERSRGCGVCRGCQTQE 
DCGHCPICLRPPRPGLRRQWKCVQRRCLRGKHARRKGGCDSKMAARR 
RPGAQPLPPPPPSQSPEPTEPQPYT/lPPAEPQVRGLCSLPTAEWMW 
PLR\FCCDKPKFGGSNQKRQKCRWRQCLQFAMKRLLPSVWSESEDGA 
GSPPPYRRRKRPSSARRHHLGPTLKPTLATRTAQPDHTQAPTKQEAG 
GGFVLPPPGTDLVFLREGASSPVQVPGPVAASTEALLQAVDPGLPSV 
KQEPPDPEEDKEENKDDS AS KLAPEEE AGGAGTPV I TE I FS LGGTRF 
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RDTAVWLPSLQGRHSGREDGCKVWETEDTVEPTSTSWNPRGWPGTHV 
SkSPPPASMMWVSCRRSWCPSSQS 

3845 

A 

1 

1647 

MAEDWLDCPALGPGWKRREVFRKS GATCGRSDTYYQS PTGDR IRSKV 
EliTRYLGPACDLTLFDFKQGILCYPAPKAHPVAVASKKRKKPSRPAK 
TRKRQVGPQSGEVRKEAPRDETKADTDTAPASFPAPGCCENCGISFS 
GDGTQRQRLKTLCKDCRAQRIAFNREQRMFKRVGCGECAACQVTEDC 
GACSTCLLQLPHDVASGLFCKCERRRCLRIVERSRGCGVCRGCQTQE 
DCGHCPICLRPPRPGLRRQWKCVQRRCLRGKHARRKGGCDSKMAARR 
RPGAQPLPPPPPSQSPEPTEPHPRALAPSPPAEFIYYCVDEDELQRL 
LPSVWSESEDGAGSPPPYRRRKRPSSARRHHLGPTLKPTLATRTAQP 
DHTQAPTKQEAGGGFVLPPPGTDLVFLREGASSPVQVPGPVAASTEA 
LLQEAQCSGLSWWALPQVKQEKADTQDEWTPGTAVLTSPVLVPGCP 
SKAVDPGLPSVKQEPPDPEEDKEENKDDSASKLAPEEEAGGAGTPVI 
TE I FS LGGTRFRDTAVWLPRS KDLKKPG ARKQ 

3846 

A 

1 

1821 

MAEDWLDCPALGPGWKRREVFRKSGATCGRSDTYYQS PTGDR IRSKV 
ELTRYLGPACDLTLFDFKQGILCYPAPKAHPVAVASKKRKKPSRPAK 
TRKRQVGPQSGEVRKEAPRD\ETKADTDTAPASFPAPGCCENCGISF 
.SGDGTQRQRLKTIiCKDCRAQRIAFNREQRMFKRVGCGECAACQVTED 
CGACSTCLLQLPHDVASGLFCKCERRRCLRIV\ERSRGCG\VCRGFQ 
\TQED/CLGHCPICLRPPRPGLRRQWKCVQRRCLRGKHARRKGGCDS 
KMAARRRPGAQPLPPPPPSQSPEPTEPHPRALAPSPPAEFIYYCVDE 
DELQPYTNRRQNRKCGACAACLRRNGCGRCDFCCDKPKFGGSNQKRQ 
KCRWRQCLQFAMKRLLPSVWSESEDGAGSPPPYRRRKRPSSARRHHL 
GPTLKPTLATRTAQPDHTQAPTKQEAGGGFVLPPPGTDIiVFLREGAS 
SPVQVPGPVAASTEALLQEAQCSGLSWWALPQVKQEKADTQDEWTP 
GTAVLTSPVLVPGCPSKAVDPGLPSVKQEPPDPEEDKEENKDDSASK 
LAPEEEAGGAGTPVITEIFSLGGTRFRDTAVWLPRSKDLKKPGARKQ 

3847 

A 

1 

1723 

MAEDWLDCPALGPGWKRREVFRKSGATCGRSDTYYQSPTGDRIRSKV 
ELTRYLGPACDIiTLFDFKQGILCYPAPKAHPVAVASKKRKKPSRPAK 
TRKRQVGPQSGEVRKEAPRDETKADTDTAPASFPAPGCCENCGISFS 
GDGTQRQRLKTLCKDCRAQRIAFNREQRMFKRVGCGECAACQVTEDC 
GACSTCLLQLPHDVASGLFCKCERRRCLRIVERSRGCGVCRGCQTQE 
DCGHCPICLRPPRPGLRRQWKCVQRRCLRHLAHRLRRRHQRCQRRTP 
LAVAPPTGKHARRKGGCDSKMAARRRPGAQPLPPPPPSQSPEPTEPH 
PRALAPSPPAEFIYYCVDEDELKRLLPSVWSESEDGAGSPPPYRRRK 
RPSSARRHHLGPTLKPTLATRTAQPDHTQAPTKQEAGGGFVLPPPGT 
DIiVFLREGAS S PVQVPGPVAASTEALLQEAQCSGLS WVVALPQVKQE 
KADTQDEWTPGTAVLTSPVLVPGCPSKAVDPGLPSVKQEPPDPEEDK 
EENKDDSASKLAPEEEAGGAGTPVITEIFSLGGTRFRDTAVWLPRSK 
DliKKPGARKQ 

3848 

A 

1 

1950 

MAEDWLDCPALGPGWKRREVFRKSGATCGRSDTYYQSPTGDRIRSKV 
EIiTRYLGPACDLTLFDFKQG I LCY PAP KAHPVAVAS KKRKKPS RPAK 
TRKRQVGPQSGEVRKEAPRDETKADTDTAPAS FPAPGCCENCG I S FS 
GDGTQRQRLKTLCKDCRAQRIAFNREQRMFKRVGCGECAACQVTEDC 
GACSTCLLQLPHDVASGLFCKCERRRCLRIVERSRGCGVCRGCQTQE 
DCGHCP I CLR PPRPGLRRQWKCVQRRCIjRHLAHRLRRRH QRCQRRTP 
LAVAPPTGKHARRKGGCDSKMAARRRPGAQPLPPPPPSQSPEPTEPH 
PRALAPSPPAEFIYYCVDEDELQPYTNRRQNRKCGACAACLRRMDCG 
RCD FCCD KP KFGG SNQKRQKCRWRQCLQFAMKRLLP S VW S E S EDGAG 
SPPPYRRRKRPSSARRHHLGPTLKPTLATRTAQPDHTQAPTKQEAGG 
GFVLPPPGH*TLCFLREG\ASSPVQVPGPVAASTEALLQEAQCSGLS 
WVVALPQVKQEKADTQDEWTPGTAVLTSPVLVPGCPSKAVDPGLPSV 
KQEPPDPEEDKEENKDDSASKLAPEEEAGGAGTPVITEIFSLGGTRF 
RDTAVWLPSLQGRHSGREDGCKVWETEDTVEPTSTSWNP 

3849 

A 

2 

1682 

LCDTPPGRGGGGRGGGRGGRLQEGPSFMGPRGLEWRKIPAWLPNWLL 
LLPVASMAED\WLYCPALGPGWKRREFFRKSRATCGRLNTYYQSPTG 
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DRILSKVELTRYLGP\ACDLTLFDFKQGIL\CYPSPQRPIPLGGLPG 
KKRKKP\SRPAKT\RKRQVGPQSGEVRKEAPRDETKADTDTAPASFP 
APGCCBNCG I S FSGDGTQRQRLKTLCKDCRAORI AFNREOPMFKT? vn 

CGECAACQVTEDCGACSTCLLQLPHDVASGLFCKCERRRCLRIVERS 
RGCGVCRGCQTQEDCGHCPICLRPPRPGLRRQWKCVQRRCLRGKHAR 
RKGGCDSKMAARRRPGAQPLPPPPPSQSPEPTEPHPRALAPSPPAEF 
IYYCVDEDELKRLLPSVWSESEDGAGSPPPYRRRKRPSSARRHHLGP 
TLKPTLATRTAQPDHTQAPTKQEAGGGFVLPPPGTDLVFLREGASSP 
VQVPGPVAASTEALLQAVDPGLPSVKQEPPDPEEDJCEENKDDSASKL 
APEEEAGGAGTPVITEIFSLGGTRFRDTAVWLPRSKDLKKPGARKQ 

3850 

A 

1 

393 

SPLNKVQLINELNEREVQLGVANKVSWHSEYKDSAWIFLGGLPYDLT 
\KGDIICVFSQ\QRSTIVAVDNFNGIKIKGRTIRVDHVSNYRAPKDS 
EE IDDVTRQLQEKGCGARTPSPS LSES S EDEKPTKKP 

3851 

A 

1 

135 

ARTPSPSLSESSEDEKPTKKHKKGKALRLKRRFWVVLMSALPCIH 

3852 

A 

18 

238 

SPPPPFLSAEREPEEMNPLTKVKLINELNEREVQLGVADKVSWHSEY 
KDSAWIFLGGLPYELTEGDIICVFSQ 

3853 

A 

254 

1424 

SPLTRVKLINELNEREVQLGVADKVFWHSEYKDSAWIFLGGLPYELT 
EGDIICVFSQYGEIVNINLVRDKKTGKSKGFCFLCYEDQRSTIIiAVD 
NFNGIKINGRTIRVDHVSNYRAPKDSEEIDDVTRQLQEKGCGARTPS 
PSLSESSEDEKPTKKHKKAPQNATVFGDKVFEEVTQDGNYLVQPLDI 
SKVTDIQVGFSEALSWRPRQWLLHAAAKVVLLMHIPHFLAYKKEKKK 
KKKEKEKADRE VQAEQPS S S S PRRKTVKE KDDTG PKKH S S KNS ERAQ 
KSEPREGQKLPKSRTAYSGGAEDLERELKKEKPKHEHKSSSRREARE 

EKTRIRDRGRSSDAHSSWYNGRSEGRSYRSRSRSRDKSHRHKRARRS 
RERESSNPSDRWRH 

3854 

A 

8 

1015 

FLS AERE P EEMNPLTKVKL I NELNEREVOLG VADKVS WH <5 F YKD q a \ 
WISLG\GLPYELTEG\DIICVFSQYGEIVNINLVRD\KKTGKS\KGF 
C FLC YEDQR S T I LAVDNFNG I K I KGRT I R VDHVS \ NYRA PKDS ED I D 
DVTRQLQEKGSGARPPSPTLSESSEDEKPTKKHKKDKK\EKKKKKKE 
KEKADREVQAEQPSSSSPRRKTVKEKDDTGPKKHSSKNSERAQKSEP 
REGQKLPKSRTAYSGGAEDLERELKKEKPKHEHKSSSRREAREEKTR 
IRDRGRSSDAHSSWYYWRS\EGRSYRSRSRSRDKSHRHKRARRSRER 
E\SSNPSDRWRH 

3855 

A 

299 

374 

PRANKHFFLLFFSMLQEKEREFQEV 

3856 

A 

1 

993 

MRDLFGTRLRRAEDVFPPVIGVAAHKGGVTKTSVSVHLAQDIiALKGL 
RVLL VEGNDPQGTAS MYHG WVPDLH I HAEDTLL P FYLGE KDD VTYA I 
KPTCWPGLDI I PSCLALHRIETELMGKFDEGKLPTDPHLMLRLAIET 
VAHDYDVIVIDSAPNLGIGTINTWCAADVLIVPTPAELFDYTSALQF 
FDMLRDLLKNVDLKGFEPDDLKKSFKSPEPRLFTPEEFFRIFNRSID 
AFKDFWASETSDCWSSTLSPEKVLRASWKRDSDNSLKSLSPTQIR 

LGEVLTPVNSAFWEAEVWNSGDSDDMALDFDCTSSEVDAESTNRKVIi 
RP 

3857 

A 

3009 

3583 

LFCLFFH I LQPSHCW I SEMWQLS \ DSLTDLLDKFSNI S EGLSNYS I 
IDKLVNIVDDLVECVKENSSKDLKKSFKSPEPRLFTPEEFFRIFNRS 
IDAFKD FWAS ETS DCWS S TLS PE KGKAKNPPGDS S LH WAAMALP A 

LFSLIIGFAFGALYWKKRQPSLTRAVENIQINEEDNEISMLQEKERE 
FQEV 

3858 

A 

2 

989 

TPAPAEAEQGKSQCPSGSTAQSWSAMDILVPLLQLLVLLIiTLPLHLM 

ALI^CWQPLCKSYFPYL1^VLTPKSNRKMESKKRELFSQIKGLTG\P 

PGKVALLELGCGTGGNFQFYPPGCRVTCLDPNPHFEKFLTKSMAENR 

HLQYERFWAPGEDMRQLADGSMDVVVCTLVLCSVQSPRKVLQEVRR 

VLRPGGVLFFWEHVAfiPYGSWAFMWQQVFEPTWKHIGEGCCIiTQKTW 

KDLEK\AQFSENQRERQAPSLKGPTLWAPPSLERGCKKFFPRDGEQE 

TGALQPDKGAYRSLRESGPTGAGLRNRSQLSVLPTGLQGHLPRPKSP 
L 

3859 

A 

880 

1196 

SPTTRRGTPGTCLKPTACAGAA*PGCSARRTCASAAPLSTCPPSSCA 
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APPPAPAAVPSTPRPTSPSPWAAPASPSRRRTQTASTPASTNRAPSS 
CWAPTTR PLAP 

3860 

A 

305 

631 

I EKI SLEPKNRPS PQI PNLVGP/RLFIjVPGDFQSQVPNFLPFKPSSG 
PHPAAGKGQPFSFLPAWRSTGPERATPGPGRPAACSALEAPSLRGPS 
G YS AQEP PRRP I S AR I 

3861 

A 

2 

205 

QLPDPAIEDQGGEYVQPMLSKYAAVCVRCPGYGTRGPAKKPSAHSQS 
QASSSPEWIPELLPCPDAAA 

3862 

A 

1 

627 

DRVDEIRAANRDEFYSRPSKLADFWGNNNEILSGLDMEEGKEGGTWIi 
GISTRGKLAALTNYLQPQLDWQARGR/ GTAKGDVICYYGNRGEPDPI 
VLTPGTYGLSNALLETPWRKLCFGKQLFLEAVERSQAIiPKDVIiIASL 
PDVLNNEEA*VGRSCWG\DPSVPPPGQRERQALLPREAHGSGQAGSP 
AAGKLMDILSSLPVPLSQLRFPT 

3863 

A 

2 

336 

PRHSPEATGAAVPANASVLLGGPQSVPTCSPCTSQAGPAPRRWGLDL 
DAG S RG VPEGRW I RKRS DARAAR S FGAAPGAGRQE VAAS ERQRAQPT 
GHRHAARRPAPRCAAQW 

3864 

A 

68 

354 

TRLRAGHRGTNGHHFSPSCRQLPDPAIEDQGGEYVQPMLSK\YAAV\ 

WCAC/AGYGTRNNTIILVDADGHVTFTERSMMDKDLSHWETRTYEFT 
LQS 

3865 

A 

1 

2364 

MTCLRTGTEGAQGGVGRGTQAGGPGGAPSLTPEAAGAARREDTSGPS 

VPLSFPAARCPRKVPGLPPLGLLESFGGCRALAARRQASRGAGTAAG 

SGRARLECSVQDARTGPARNPRPRASPPALTRPVRKADRAGDANLRP 

APSDSKFSHWSRDPARHGNRJIRLGGCAEMTACHWASGMSVPADDRPP 

RGGAAAPHLPEPLRRRWQGTRDRSLPADLREHPPTIQPRAGTGRAGA 

PS PDSGRVAS ATTPS SQVS FFS ACY SLRAARRSLLS TSNCI LHASRE 

REFCCYPHFTVGKQPQRGSHLLEHSRLTLGPGGSLILTPLGPRSGET 

MEGAAESPKAAGSGSAQTPAQPEALAQPRGRGARRWQPPSDSAPSQR 

DAGMRRAAPPLAARPNGERAGFGLGGTGLAAVARGRGTRKSAAVAEA 

TSRPVSAEPPCTTMCI I FFKFDPRPVSKNAYRVLGAQPVLMPPKLLC 

AGRLI LAA\ NRDE \ FYS \ R PS KLAD \ WWGNNNE I LSGLGH / EEGRAR 

KEGTW AG AFS TRGQ AGQHFTNYL \ Q PQLD WQ ARGRG E L VTHFLTTDV 

DSLSYLKKVSMEGHLYNGFNLIAADLSTGKG\DVICYYGNRGEPDPI 

VLTPGTYGLSNALLET\PWWKLCFGKQLFLEAVERSQTOiPKDVLIAS 

LLDVLNWEEAQLPDPAIEDQGGEYVQPMLSKYAAVCVRCPGYGTRYC | 

STVGAPPPIPCPTSHARGPAKKPSAHSQSQASSSPEWIPELLPCPDA 

AAGTNTI ILVDADGHVTFTERSMMDKDLSHWETRTYEFTLQS 

3866 

A 

136 

230 

FITSIiGHRCLIYKISCQISKIA*NTFVSHLM j 

3867 

A 

109 

1114 

LEFFKMSGWPTAPEQPAGEMENQTKPPDPRPDAPPEYNSHFLPRPP 
GTAVPPPTGYPGSLPMGYYSPQQPSTFPLYQPVGGIHPVRYQPGKYP 
MPNQSVPITWMPGPTPMANCPPGLEYLVQLDQXIHVLQHFEPLEMMT 
CFETNl^YDIKlWSDQMVYVVTEDTDDFTRNAYRTLRPFVIiRVTDCM 
GREIMTMQRPFRCTCCCFCCPSARQELEVQCPPGVTIGFVAEHWNLC 
RAVYSIQNEKKENVMRVRGPCSTYGCGSDSVFEVKSLDGISNIGSII , 
RKWNGMLS AMADADHFDIHFPLDLDVKMKAMI FGACFL I DFMYFERS 
PPQRSR 

3868 

A 

980 

1498 

VSLINLQVLRQSLEQVTETFTSYFKPSCQ/PPTSPIRAMKWTELRFL 
SCLPAGRQHYLLPATPPCSVRGVHGKPSTVPPAAPSPTSAQSHQPAS 
HPLSSPTFLFEPIKDAGTHASLFSRDRHLPFPTCHIRRPPQPCLSPP 
HCS QG SRTGRAVTAALRMAAAAAMTGNQS PTR I 

3869 

A 

94 

484 

AAAAIiAAASLLLRREDPGPGAGPSMAETEALSKLREDFRMQNKSVFI j 
LGASGETGR\VLLKEILE\QGLFSKVTLIG\RRKIiTFDEEAYKNVNQ 
EVGGLLKSLDDYASAFQGHD\ LGFC\ CLGYHQEGKAGGG 

3870 

A 

302 

1187 

TALVPRDKDWQS PDTLKTG I CRC YFP SMAETEALS KLRED FRMQNKS 
\VFILGATVETGRVLLKEILEQGLFSKVTLIGRRKLTFDEEAYKNVN 
QEWDFEKLDDYASAF\QGHDVGFCCLG\TTRGKAGAEG\FV\RVDR 
DYV\ LKSAELE KAGGC KHFNLL S S KGADKS SNFL YLQVKGEVEAKVE 
ELKF\DRYSVFRPGVLLCDRQESRPGEWLVRKFFGSLP\DSWA\RGH 
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SVPVVTVVRAMIjN\NVVRPRDKQMELLEN\KAIHGPGGKRHGSSQGH 
DHTGRNGFYL.FNLNTHDQIG 

3871 

A 

71 

335 

PTRSTSSGSYMKRRSGSCSP/MISDTSWLSMDNSRSSRAWVKKIK 
TRDGKLVSESSDVLPK*TAAAAPPSLPLLRCPRAWEGGRYAW 

3872 

A 

35 

450 

D I EECE S MH * DD I RKHTEMENE FAL I KKDMDEAYKNKVELESHLEGL 
TEEISFLTQLYEEEIRELHVPDLGHICGAVH*QQSLPGHEQHHR*GQ 
GTVRGDRQPQPG * G * EHE I KYEELQTLAWKHGMTRGAQRLRS PR 

3873 

A 

2121 

2371 

LENQSPRHVYSLSLLKS KNDVMLL/ DTAI FC PEATS EALDWTS WAS 
EAGTEELAFGFGLVSLSYCKLSTSLSVASSFVGGDWL 

3874 

A 

29 

3977 

AEVAHARWERCFFSPTPFLWGCFRDRLLPGRRACFRRAGRLKRGSYN 
CSQLWLCVE KALAMDRSGFGE I S S P V I REAE VTRTARKQS AQKRVLL 
QASQDENFGNTTPRNQVIP\RTPSSFRQPFTP\KSRSLL.RQPDISCI 
LGTGGKS PRLTQS SGFFGNLSMVTNLDDSNWAAAFS S QRS \ GLFTNT 
EPHSITEDVTISAVMIiREDDPGEAASMSMFSDFIiQSFLKHSSSTVFD 
LVEEYENICGSQVNILSKIVSRATPGLQKFSKTASMLWLLQQEMVTW 
RLLASLYRVTQKSFKVSNSGPRAFSSHSYMCGPSACISSSSFSRMGS 
S SFRGS LGGDFGRASGRGG I TPVTVNQSLLS PLNLEVDPNI QAVCTQ 
VKEQIKTLNNKPASFIDKLYTLGQEKLKLEVELDNMQGLVEDFKNKY 
EDD I KKHTEMENEFVL I KKDMDEAYKNKVELES HLEGLTEE I S FLRQ 
LYEEE IRELQSQISDTS WLS IDNSRSLDMNS I IAETE ISKMNRNI S 
RLQDEIEGLKGQRASLEAAIADAEQHGELAVKDANTKLSKLEAVNAS 
EKTWEALFQRDSLVRQSQLWDWLESIAKDEIGEFSDNIEFYAKSV 
YWENTLHTLKQRQLTSYVGSVRPLVTELDPDAPIRQKMPLDDLDRBD 
EVRLLKYLFTLIRAGMTEEKSSVLVTIVTRQITSQLSGVKRPLYYAH 
GFRGGLSPREVHIRKHLAHGTPVLAAQCRGSKSAYLGREEARTSFLV 
LKNAGKPVGLVPDLLSLSIIVDFLLNISFPTSIQRATWRENQFLKRC 
PEKSWSQTVSAVGVPVPTVKPSDFVEGSVAQIRSALSGFLRYVIiNER 
TADRRDFVS PLRSLHS PVSLKRTYRETGSDSASLHPGE INSLVAHTK 
PVWWSLHTDVRERYHVGSESKKQEDLFPWGLKEENCAGNRTWSLQWE 
EYEKGSYSVKLDGPGLQRITLEMAEEPSASSEGLPAFAGSHLRAWRQ 

brum vju x \j^\jrxs^ri\^\^rxc»^\tr I\J\jnJ. r>lv-2iv30VJc' x by V XAo o VUU J. x.\«lj 

PIRLEKKFSVTSEKYPFSFLKASGNGHLSGSLQGHILAIRVYLSPLV 
YKEENRGTRGAHAIGPTANKTCNSSHKPKPLGKEILYSHSKETGMEK 
FLQTP VP W AWGR VEEAE VWDQFQ EQRG I VS R I KQPTRKP AS LERDG Y 
I VH I DRN I S S LHKD S W AH S MTWEOO K T T KTPD R nrvKT . c: czj . p n p tt.KT? 
TFPDHP I GRRLAAL I D VH I L I PETCE YVTLHGKRGIiAG I ERAEDLNV 
LFLEKGDGCPDGKDNPCPFMYRYGFQSAYRAVliDDLRNPPLKFLHRQ 
KL 

3875 

A 

597 

1088 

GCQRQ WPS WRP PLQRAKQDMAWQLLE YQELMN I KLALD I E I ATYRK 
LLEGEESQLDSGMQNMSIHTKTTGGYAGGLSSAYGDLTDPGLSYSLG 
SSFGSGAGSSSFSRTSSSRAVVVKKIETRDGKI J VSESSDVxjPK*TAA 
AAPPSLPLLRCPRAWEGGRYAG 

3876 

A 

1537 

1827 

PTTLSDWIaAGYVFIjSSOLCGOCHTPSWTTTAIjACTFOrr'PrORRnoc! 

HGSSRTSPELSEECPGEMVKMTKSiCTFQAYIjPNCHRTYSCIHCRAHIi 
ANH ! 

3877 

A 

109 

696 

PGPGPDGRCGPGRGPRSGRYVFLSSQLCGQCHTPSWTTTALACIFQC 
CFCQRGQQSHGSSRTSPELSEECPGEMVKMTKSKTFQAYTiPNCHRT\ 
YS\CIHCR\AHWPNHDELISKSFQGSQGRAYLFNSWNVGCGPAED\ 
RVLL\TGLHAVADIYC\ENCK\TTLGWKYEHAFESSQKYKEGKFI\I 
ELAHM I KDNGWE 

3878 

A 

1 

2474 

MPGYSALSNKPMLETTQMSITRGMGKLLWRCGTEKAGPAVPGGMEG 
PR S S THVPLVL PLL VLLLLAP ARQAAAQRCPQAC I CDNS RRHVACR Y 
QNLTE VPDA I PELTQRLDLQGNLLKVI PAAAFQGVPHLTHLDLRHCE 
VELVAEGAFRGLGRLLLx^LASNHLRELPQEALDGLGSLRRLELEGN 
ALEELRPGTFGALGALATLNLAHNALVYLPAiyiAFQGLLRVRWLRLSH 
NALSVLAPEALAGLPALRRLSIJlHx^LQALPGPVLSQARGL^ 
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HNPLTYAGEEDGLALPGLRELLLDGGALQALGPRAFAHCPRLHTLDL 
RGNQLDTLPPLQGPGQLRRLRLQGNPLWCGCQARPLLEWLARARVRS 
DGACQGPRRLRGEALDALRPWDLRCPGDAAQEEEELEERAVAGPRAP 
PRGPPRGPGEERAVAPCPRACVCVPESRHSSCEGCGLQAVPRGFPSD 
TQLLDLRRNH F P S VPRAAF PGLGHL VS LHLQHCG I AELE AGALAGLG 
RLIYLYLSDNQLAGLSAAALEGAPRLGYLYLERNRFLQVPG7UVLRAL 
PSLFS LHLQDNAVDRLAPGDLGRTRALRWGL / HLSGNR I TEVSLGAL 
GPARELEKLHLDRNQLREVPTGALEGLPALLELQLSGNPLRALRDGA 
FQPVGRSLQHLFLNSSGLEQVGTGHLAGLVQEAAQGHRQRAFTQQAF 
ASPLVPGLGPGLQSLHLQKNQLRALPALPSLSQLELIDLSSNPFHCD 
CQLLPLHRWLTGLNLRVGATCATPPNARGQRVKAAAAVFEDCPGWAA 
RKAKRTPASRPSARRTPIQRKTVWTR 

3879 

A 

1 

531 

ETPSSEPMEEEEDDDLELFGGYDSFRSYNSSVGSESSSYLEESSEAE 
NEDREAGEL/ HDLPAAFAQPWDS SLLGWQWF * ASCL/ LRCWSWVQG 
KPSSPRPRGSAASPAPGATPPTPRKPVSWLGYRENHRPKKPKSCTRL 
PGLPKLEPSSTLKGQDSWQMGHQQDKTLWSWASTGGSS 

3880 

A 

143 

606 

NKYTDYCQLLKKSKGVDKKGKTASSLPEGSQS I SREHLPAQSWLPWA 
CLLLWLPALL/HDEMQMRKQLPGLRSRAHSARQVRW/PCREAAAGCR 
MRQASHLKSPEGKFHGHFLNTRVGRAGGNPHRPGKKWLLPSSVYVCN 
TEHCLWGAWCPKPNG 

3881 

A 

253 

514 

ECTLEGIMPNVTISLSLPTNGSPLQDILVHPCVTSLDSAIIiTSSSID 
AMDDSAFSGPYKFPFTPPLESFNLCFYTSQVPVPPILGFY 

3882 

A 

950 

1612 

1 1 P I LAS VTQPQKQ PAWKTGT YKGKPQVS I S I TEKVKSMQYDKQG I A 
DTW\QWGTVTCKCDLEGIMPNVTISLSLPTNGSPLQDIIiVHPCVTS 
LDSAILTSSS IDAMDDSAFSGPYKFPFTPPLES FNXiCFYTSQSVIDD 
QPSKLLLNDGIGELMGFNWSNRSCGQVGSNQYAYCMPYAADQRTWRR 
GAWTSKYRNKNALYTFVKRFSHLRKQREKNPYP 

3883 

C 

110 

199 

MDSSIRGSLARLCKGLQACCYMFNWAPQV* 

3884 

A 

3161 

4765 

SAFLRNENWSGNRFEPLESKEKGKSDSDSQIGRMEERLGRLLPADWG 

VGGTAEGGGGAACRRCCLRCWAPGQLPGTLRPPWTSRRVTASSSATT 

TRAPRCKVLWALRSAAAAATGAAWYASLPDVQCLENCCFIYFVEFFG 

YLRRGELQNEAVFMWTAELEN IATLCFKALENSNYGVRVAVS KLLGT 

VMATALMPKQATVMRQNVKRATFDEVLELMATGFLRGGSGFLKSGGE 

MLKVGGSVNREVRVGVTQAYWFVTTLGGQWLERSFATFJLSHVLDLV 

SHPRATQTHVEAVYSRRCVS F I LRATVGSLLGEKAQ I AAAKE I CQAI 

GKQMKAVEAVVNDTSGENKSGAADIAASQHVMVCALQEIiGSLVQSLN 

ATAS PLIQEAS IGL\ LE I VTS VLLHP\ TMAARIiAAAWCLRCVAVALP 

FQLTPFLDRCAERLNNLKTSPEAVSGYSFAMAAIiLGGVHQCPLGIPH 

AKGKMWSIAEDLLRTAAQNSRLSLQRTQAGWLLLGALMTLGPSWR 

YHLPKMLLLWRNVFPRSLKELEAEKARGDS FTWQVTLEGRAGALCAM 

RSFVAHCPELLTEDVIRKLMTPIEGAMTMMSHIPSVMKAHGAHLKAS 

AAMVRIiRLYDILALLPPKTYEGSFNALLREIjVAEFTLTDNSANTTTS 

LLRSLCHYDDSVLLGSWLQETDHKSIEDQLQPNSASGSGALBHDPSS 

IYLRIPAGEAVPGPLPLGVSVIDASVALFGWFPHVSYKHRLQMLDH 

FAECWQAKGVRQQAVQLNIFTAVLSALKGLAENKSTLGPEEVRKSA 

LTLVMGPLDNPNPILRCAAGEALGRMAQWGEATFIARMAQYSFDKIi 

KSARDWSRTGHSLALGCLHRYVGGIGSGQHLKTSVSILLALAQDGT 

SPEVQTWSLHSLALIVDSSGPMYRGYATEPTLSLVLTLLLTVPPSHTE 

VHQCLGRCLGA 1 1 TTVGPELQGNGATTS T I R S S CL VG CA I TQDHSDS 

LVQA/^AISCLQQLHMFAPRHVNLSSLVPSLCVHIiCSSHLLLRRAAVA 

CLRQLAQREAAEVCEYAMSLAKNTGDKESSSANVSPFAPGVSSRTDI 

HCRHQGVNITETGLEGLLFGMLDRETDRKLCSDIHDTLGHMLSSLAV 

EKLSHWLMLCKDVIAASSDMSTATLLSSGKDEEAEKKDEMDDDTMFT 

TLGEEDKSKPFVAPRWATRVFAADCLCRIINLCENADQAHFDLALAR 

SAKLRNPTNDLLVLHLSDLIRMAFMAATDHSNQLRMAGLQALEDIIK 

KFASVPEPEFPGHVILEQYQANVGAALRPAFSQDTPSDI IAKACQVC 







STW IGSG WSDLNDLRRVHNLLVS SLDKVQAGKGS S SQL YRES ATTM 
E KliAVL KA W AEVYVVAMN I KKE AE S K P KRA I KNTDDDDDD CGT I DEL 
PPDSLITLVQPELPTLSRLWLAALKDYALLTLPAEFSSQLPPDARSD 
EEVLSVSRGNNRRWRGIAEEERNRGKKQSSLWIEREDDRVLGATGN 
JNEfiVNoVTTGPEEPY 

3885 

A 

1 

687 

ARGDASEFSKNPHIGSFVGNQVTIRKADGSLVHISITPYPAILHEYV 
S S S KWED AVRLCRFVKEQTMWACLAAMAAANRDMTTAE I AYAAIGE I 
DKVQYINSIKNLPSKESKMAHILLLSGNIQEAEIVLLQAGLCYQAIQ 
aim XViU \ x nvviiK^UjCiijuR vyxliVXxi VJJx VLAx RQKF LEXFGLQETNKRYL 
HYAKGLPIYWEKSKATIEMEITQEREHFLRSQSSKSIGLKP 

3886 

B 

146 

326 

MDESAFPLCRTLGRVDSDDPGLIPGCASGPLAVPESEQKVHERKSLH 
WDLGTLVSLVRAQX * 

3887 

A 

575 

828 

CYPKVGRKP I KGEGEAPGAREKKNFAAEKRGLI PHGRKCLP I S PCTG 
TRGPWCP\CSSSGCASADTCTPRGALGAQKPIRESNF 

3888 

A 

3 

267 

AMVGGGGVGGGLLENANPLIYQRSGERPVTAGEEDEQVPDSIDAREI 
FDLIRS*AWPLTPQPAYWYPGPSSCQGCYRCFLESLTMRN 

3889 

A 

1 

942 

MVIYPLDKHFTSPCRSKLWGRRTPGPMCPLRPGSGLYVSARYWLARP 
SAAWPSQRLLLQRPSCSRVRAARAAVTASLAPQAENPQRGVSLARL 
RRHATPPSRSPEVRGLERKRTPARISGGGSAMVGGGGVGGGLLENAN 
PL I YQRSGERPVTAGEEDEQVPDS IDARE I FDIRRCWARAEELNWE 
QVRVQVSHFRGERWPESQKGFCAAGAGVLYAHEHRRVSDPESTVAV 
AFTPTIPHCSMATLIGLSIKVXLLRSLPQRFKMDVHITPGTHASEHA 
VNKQLADKERVAAALENTHLLEWNQCLSARS 

3890 

A 

3 

471 

A24VGGGGVGGGLLENAN PL I YQRSGERPVTAGEEDEQVPDS I DARE I | 
FD I RRCWARAGSGGLRWGEQ* YRAAGGAASQQGVPGRGF * VGKGAES 
LFLHFPQ \ L I RFLND P\ EHS LTLEELNWEQVRVQVSDPE STVAVAF 
TPTISHWSMGTLIGVC 

3891 

A 

2 

604 

ARSHRISGGGSAMVGGGGVGGGLLENANPLIYQRFGE\RPVTAGEED 
EQVPDS IDAREVF\DL I RSHQMTPEHP\ LTLEELNW\ EQVR\ VQVS 
DP\ESTVAVGFPKPTHSGTCRHGPPLIGLS\IKVK\ LLRSLPSAFSR 
WD VAHLL PGDPL PQEHAVEQATLQ I KEAGWATALGRTHHLLEWN \Q 
CLS S PAPGTWAFVPLSLA 

"3 O Q ~* 

J 892 

A 

73 

204 

D I C YFQLTRLEQHDTTNVADQHDVLCRWLGTEEARALRRNKHRR 

3893 

A 

1 

846 

QGANMPRYAQL\VMGPAGIVKSTYCATMVQHCEALNRSVQVVNLDPA 
AQHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFD 
WLENCLGHVEDDYILF\DCPGQIGWTLTWPV\MKQLVPA\LEQWEF 
RVCGVFLVDSQFMV\DHSRFISGHL\AALSA\MISLEIPQVNIMTK\ 
MJJ L1L1 rUUUUvb X E Kr LD PDM YS LLED S T S DLR KAKN S RKLD L KAIMW 

DLIGWTYSLVRIF/ ipydqsdeesmnivlqhi \gfaiqyg\ edlefk 

RNPKEREE 

3894 

A 

225 

405 

eflhmapgevtitvrlirsfehrnfkpvvyhgwldqtvkefivflk 
qdalkiihqahk 

3895 

A 

1 

403 

REQEFLHMAPGEVTITVR\ LIRSLEHR\NFKPVVYHGVNIiDQTVKEF ! 
IVFLKQDIP1^TNLPPPFR1WKYDALKIIHQAHKSKTNELVLSLEI)D 
ERLLLKEDSTLKAAGIASETEIGIF\REEDYRNYKANPISSW | 

3896 

A 

1 

1319 

MDYGLLCGQQLVPQVEPVIQRGHESLVHHILLYQCSNNFNDSVLESG 
HECYHPNMPDAFLTCETVIFAWAIGGEGFSYPPHVGLSLGTPLDPHY 
VLLEVHYDNPTYEEVLDAEKYKYKVl^ 

CTLECLEEALEAEKPSGIHVFAVLLHAHIJVGRGIRLRHFRKGKEMKL 
LAYDDDFDFNFQEFQYliKEEQTILPGDNLITECRYNTKDRAEMTWGG 
LSTRSEMCLSYLLYYPRINLTRCASIPDIMEQLQFIGVKEIYRPVTT 
WP F 1 1 KS PKQYKNLS FMDAM\ NKFKWTKKEGLS FNKLVL\ S LPVNVR 
CSKTDNAEWSIQGMTALPPDIERPYKAEPLVCGTSSSSSLHRDFSIN 
LLVCLLLLSCTL\ STKSL 

3897 

A 

1 

394 

GEEPRGRASSVAGWVGAFRAPFIEAAVAGLGAGSGKRRRGWKMPVHS 
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RGDKKETNHHDEME VD YAENEGS S SEDEDTES SS VSEDGDS SEMDDE 
DCERRRMECLDEMSNLEKQFTDLKDQAKSCCYMIQSR 

3898 

A 

1 

150 

SEKLLLYDTVQSELEEKIRRLEEDRHSIDITSVyMLQDLDILEDWTT 
IRK 

3899 

A 

708 

911 

SLPNLDNAAICLYPAPPPEHDNHFEMDDEDCERRRMECLDEMSNLEK 
QFTDLKDHAGLFSSLPGCHH 

3900 

A 

1 

1349 

MAAGQRRSSLSRLGSQFINCDPECQLSCLVGLEVQKEEVIRAAEGWK 
RAAPAGLRPTNYNSRQASEGRWIAGNQSYQSRASPRPLARWKASAN 
GRAPIAEPAAAVAGLGAGSGKRRRGWKMPVHSRGDKKETNHHDEMEV 
DYAENEGSSSEDEDTESSSVSEDGDSSEMDDEDCERRRMECLDEMSN 
LEKQFTDLKDQLYKERLSQVDAKLQEVI V\ ERAPEH\ LEPAGN\LQE 
NMQIRTKVAGIYRELCLESVKNKYECEIQASRQHCESEKLLLYDTVQ 
SELEEKIRRLEEDRHSIDITSELWNDELQSRKKRKDPFWPDKKKPGV 
VSGP YI VYMLQDLD ILEDWTTI KNAMATLGPHRVKTDPPVKLEKHLH 
SAISEEGRLYYDGEWYIRGQTICIDKKDECPTSAVITTINHDEVWFK 
RPDGSKSKLYISQLQKGKYSIITFIIMI 

3901 

A 

38 

384 

QRGRVISPLTRTGGAGSKTGRLEEQLLASMQSRERHGMSKEFLQEHF 
NPQRRYNPSSHVLSGACAGAVAAAATTPLDVCKTLLNTQESIiAIiNSH 
I TGH I TGMAS AFRTVYQVGGV 

3902 

A 

252 

528 

I W I LS I TG WLL S P F PS SLMALL WKQRMQMYNS P YHRVTDCVRAVWQ 
NEGAGAFYRSYTTQLTMNVPFQAIHFMTYEFL.QEHFNPQRRYNP 

3903 

A 

777 

1719 

LLTLTFPRCSVAQPLPFTGGTSVSWSALLALHVAWFLFAGAAGCVA 
TLLHDAAMNPAE WKQRMQMYNS P YHRVTDCVRAVWQNEGAGAF YRS 
YPTQLDHDVPF\ QAIHFMDLWNSCRSTLTPKRS YNPDS \ HFFSGLCA 
GAVA7^AATTPIi\DVCKTLLNTPGVLGFELTHYRTY\QGMASAFRTVY 
QVGGVTAYFRGVQA\RVIYQIP\STAIAWSVYEFFKYLIHLKGQEEW 
RAGKVKLALNESPGVQDDTDTVSWSHSLSPGMIjPPQVELKGKEKGLSP 
TDFGVLTNTSSCQPLLPPPFLPGPKHVQQSTPQHIi 

3904 

A i 

1 

1788 

MLQQGAILEAETKPSPDTKYTSALIFNFPASTADLGKTKRPAVEEKE 
TPNCTRAQSLFLLAKERRQDPFRPGLSALPLVTOOAAIGGAAHSEGRR 
LPCRPRPFTRSPPRPSRLDALQVQRDQSDGAALFGQPLQGRAGLSER 
PCQTERGKKPQQRQLSNRLQTAFRRGPLGSGFQLRWSCRGCGVNAAR 
SPPGSAQRSRSREPLLGPRWPGSRGWIQLCESLKKIAKGWVSQDPSP 
RQLKECGQSASTSPRTPYSQDLLQHPKLEVT^ADPRTRGTREPAALV 
ERLEMLSGAGSVVDAVPLLVVRRRRRRRRRHSAQRALAAAAAAAASE 
PLD CQMRLC AS LG VF S TAP S S S ACRAD I TD C VS G LAGG EMVPRRVLA 
DQLTEEQIAEFKEAFSLFDKDG\DGTITTKELi\GTV?4RSIjGQTQTEG 
\EL\QDMINEVYADGNGTIDFPVFLTMMARKMKDTDSEEEI\RBSFR 
V\FAKDGNGYISAAELRHVMTNLGEKTT\DLE\VDEMIREA\DIDGD 
GQVNYEEFVQMMTGKMEDLTFQLPFSPPLEESNWNLLLTLI^KKEKK 
KKVHLFHSCFLYSKTECQKYLLVHTHQNLHVLVGGPVP 

3905 

A 

1 

351 

GPPRPPTPPLLKPIRGATARPPRSRF*PRSPIPPLSRVIiVARPPRSP 
PLGKPLWIMGRPRPPIAPLARAPMGRPNRAWMLLLPRRCFKLFEGNR 
RQCSSRGCHHGSGRRAGDRPNLS 

3906 

A 

3 

345 

SCRCRRKAGKGRPGEQTDRPTRQPTSNNELSSHTTAYASNQPPI*LT 
RAP PP STQVRPVPGPPAGAWAVPGGALAS VS FD SRDS KTAAQS S SE 
VGALP PHQDVT I CAVRAAT 

*J OAT 

TV 

rt 

3 

388 

RPLQTKGRKGQAGFGRSPARLPEPWWQPREEH/WASVSFDSRDSKMA 
AQSAPKVVLKSTTKMSLNERFTNMLKNKQPTPVNIRASMQQQQQLAS 
ARNRRLAQQMENRPS VOAALKLKOKSIjKOR T^K<5TJ 

3908 

A 

565 

719 

GRDVTPRSKPAPR WTSRSS PNGLKKRWCSRS WRS WERGPRA* S S ALG 

KSNIQARL/WPTHRGPGQGSNRRTRPTPYSREALPREGLRGGRATEN 

PT*GRDVTPRSKPAPRWTSRSSPNGLKKRWCSRSWRSWERGPRAWSY 
GSWRNRW 

3909 

A 

299 

557 

ADGSADLDSQRSSNELSSHTTAYASNQP/LHIiANPSPSTSQLRFRPV 
PAVAVPGAVGGSPREEHLASVSFDSRDSKMAAQVSSEKLC 
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3910 

A 

1 

420 

KLCTMVAPKLRSWMYAVYGALAVMGTMGPWYLLLLLGHCVGLYVASIi 
LGQPWLCLGLGIiASIASFKMDPI*TSWnvc;nvT?'D\rPT?T7r , 'K'T mutda^ 

GLSWAIMAVDIFFHFFYILTIPSDLKFANRLPDSALAGLAYSNLV 

3911 

A 

2 

438 

HPSCLARGILEDPGLTKYPASVCYHAGR1X3TSRSVTSIX3HTI»VECAIj 
TRPSLPSPHRTSPKVEESNSKTjLESERKT.nFPPTTP'Tnnrr -pour uvth 

LBPGKASSSQRAAPMTKTAPLLDVCCVRGLGCDGHNf5PT.VPn a a auc 
LCGP 

3912 

A 

10 

250 

PAQVQLLPAMNFWAIIMYNLVSLNSLKFTELVARRLLLTGFPQTTLS 
ILFVTYCGVQLVKERERTLALEEEQKQDKEKPE 

3913 

A 

2824 

3156 

SNEGVKGLSRQKAWTADYNFCLGGNCHFSGFYFDSDSCYSSYRRCRT 
CCDFCNSGRNCSGFWFGLCCTILCSDF*SSFLISRSTWTPGLAIIIP 
TRRLLIT*TLLISWSTV 

3914 

. A 

1 

70 

LVKERERTLALEEEQKQDKEKPE 

3915 

A 

42 

140 

TACFLSPLFRTFNFWPPLMFPPSASLMFKFTF 

3916 

A 

4594 

4848 

HI YCE * SLYLFRE *DLDDGDLDLECEL,ART.Rr>T.wnT.T7 

3917 

B 

565 

1161 

MYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCWPGLDIIPSCL 
ALHRIETELMGKFDEGKLPTDPHLMLRLAIETVAHDYDVIVIDSAPN 
LG IGT INWCAADVL I VPTPAJELFD YTS ALQFFDMLRDLLKNVDLKG 

FEPDVRILLTKYSNSNGSQSPW1TOEQIRDAWGS^^^^JKNVVRETDEVG 
KDLHALLVRL* 

3918 

A 

3 

226 

LSVLFCCFGVESNFNFYVYSCCVFMATSQTDFICMHTYSLIEMIVIH 
YNLEFF* ISKMATIAVLGFLRLDI YHV 

3919 

A 

1469 

2806 

CGSPPFATSSFFALSGHVLFAKLCTMVAPKLRSWMYAVYGALAVMGT 
MGPWYLLLLLGHCVGLYVASLLGQPWIiCl^LGIJVSLASFKMDPLISW 
v^ur v x^iruijyiiVijtH(jObbFTVXvRCTSFALESCAHPDRHY 
LKYNFYLPFFFFGPIMTFDRFHAQVSQVEPVRREGELWHIRAQAGLS 
v v r ^- Li *" VL,ir r nr r x xx» x x ir oXJXjivr AJ>fKJ-iir , iJSAIjAC7XfA X »SNT i VYDWV 
KAAVLFGWNTVACLDHLD P PQP PKCI TAL YVFAETHFDRG INDWL \ 
CKYVYNHIGGEHS AVI PELAATVATFA I \ TTLWLGPCD I VYLWS FLN 
CFGPQTLKL\WVPKTGQKWGPPKHEIEGLLCQVQMSRRV\RALFGAH 
ELSWPIIMYNXLVSLNSLXKFTEXLVARRIiLLTGFPQTTLSILFVTY 
CGVQLVKERERTLALEEEQK\ qdkkkpe 

3920 

B 

54 

316 

TIWDGEEx^CFKERSPAALKACYRGITOYPTPDEKRPJ^ATLTGLSLT 
QVSNWFKNRRQRDRTGAGGGAPCKRKLPPGQPCVWQPHRDG* 

3921 

A 

1 

2219 

MATLPAEPSAGPAAGGEAVAAAJUVraEEEEEARQLLQTLQAAEGEAA 

AAAGAGAGAAAAGAEGPGSPGVPGSPPEAASEPPTGLRFSPEQVACV 

CEA\LLQAGHAGRLSRFx^ALPPAERLRGSDPVLRARALVAFQRGEY 

AELYRLLESRPFPAAHHAFLQDLYLRARYHEAERAJ^GRAIiGAVDKYR 

LRKKFPLPKTIWIXSEETVYCFKERSRAAIiKACYRGNRYPTPDEKRRL 

ATLTGLSLTQVSNWFKNRRQR0RTGAGGGAPCKSESDGNPTTEDESS 

RS PEDLERGAAPVSAEAAAQGS I FLAGTGPPAPCPAS S S ILVNGSFL 

AASGSPAVLLNGGPVIINGLAXGEASSLGPLLLTGGGGAPPPQPSPQ 

UrtOCi1 *^ A ox# v xjux-'w * vkxj JhutAy & KAPETKGAQVAAPGP AIjGEBVL 

GPIiAQWPGPPTAATFPLPPGPVPAVAAPQWPLSPPPGYPTGLSPT 

SPLLNLPQWPTSQVVTLPQAVGPLQLLAAGPGSPVKVAAAAGPAI^V 
HLINSGVGVTAIjOTjP^ AT'ADrJMTTT.T aXTt> - Xfor*0'D"nmv« , i77v t rwvvTTY 

TATFPTSMLVSQVLPPAPGlxALPLKPETAISVPEGGLPVAPSPALPE 
AHAXiGTLSAQQPPPAAATTSSTSLPFSPDSPGLLPNFPAPPPEGIjML 
S PAAVPVWSAGLELSAGTEGLLEAEKGLGTQAPHTVLRL PDPDPEGL 
LLGATAGGEVDEGLEAEAKVxjTQxjQSVPVEEPLEL 

3922 

A 

185 

479 

VSDTLWAiAGLAAGRWAGGCQTLGLQGVVQGLGLLINGPRAGAAPLS 
LGNLHPPSAARWHPS*AASHTGTGQSLVGIVEPHNPTSSGG*RPGTA 
GRSQ 

3923 

A 

74 

593 

RGWVMWVDEFQPALVLSPYERQLNLDASVQHLEDGDGKRKRSSSSPR 
S INKKAKALDNSLQPKSLAASS PPPCSQPSQCPEEKPQGVKPHEGPR 
GPLSASGVPGPQPWPGQGG/WWGAAGRVAEPRTESREPVSGAGTSSR 
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S PSPT\GFRQPPHPSARPS PRAAPGQRGWAGDRR 

3924 

A 

1 

284 

^w*-c»*vt>jv4j v v_ jvivajj^ i x x vo "Vvw*jvy/it.w Ji l v I ivDFRR VENAYHMEA 
EKLLRRGGV*L*SQLPGIUiKQKKRSNLGGRAYSEPRSHHCTSAWVTK 

3925 

A 

2 

795 

AKIAMSLQKTPPTRVFVELVPWADRSRENNLASGRETLPGLRHPLSS 
TQAQTATREVQVSGTSEVSAGPDRAQVWRVS S TKEAAAEAKKS VCR 
RLD Y I TQSLQQQG VQAENI TVTKDFRRVENAYHMEAEVCI TFTEFGK 
MQNICNFLVEKLDS \ LLS S AHPS S I /NTPGS VENLRRQACLVAVENA 
WRKAQEVCNLVGQTLGKPLLIKEEETKEWEGQIDDHQSSRLSSSLTV 
QQKIKSATIHAASKVFITFEVKGKEKRKKHL 

3926 

A 

1 

1810 

EPIDLHYAKLGTRATGGGGRGPSGHNVFLDEGGTPLLAESLVYQIFLS 

LGLADLLAAGLVCRQWQAVSRDEFLWREQFYRYYQVARDVPRHPAAM 

SWYEEFQRLYDTVPCVEVQTLREHTDQVLHLSFSHSGYQFASCSKDC 

TVNIWSNDLTISLLHSADMRPYNWSYTQFSQFNKDDSLIiLASGVFLG 

PHNSSSGEI AVI SLDS FALLSRVRNKP YDVFGCWLTETSLI SGNLHR 

IGDITSCSVIjWLNNAFQDVESENVNVVKRLFKIQNI^ASx^ 

uv - OAr uor i/xjujjiiM.\j»jj j^/v i fLK -L r DJjGSDNEE \ GWLCS PPAHAKEG 

LRHFLDRVLEGRAQPQLSERMLETKVAELLAQGHTKPPERSATGAKS 

KYIiIFTTGCLTYSPHQIGIKQILPHQMTTAGPVLG\EGRGSD\AFFD 

RRLDH\ VID IHG\HI IG\MGI#\ SPDNRYLYVNSRAWPNG \ S WA\DP 

MQP \ PP I AEE IDL\ LVFDL\ KTMREVRRALRAHRAYTPNDECFFI FL 

\DVS\RDF\VASGAE\DRHG\YI\WDRHYNI\CLARLRHEDVVNSVV 

FS PQG ARRCCFTAKGHAT\ I KS W \ RS PRTMGVLQ \ APRPRPRTFFS W 

LASQRR 

3927 

A 

1 

2936 

MALMFTGHLLFLALLMFAFSTFEESVSNYSEWAVFTDDIDQFKTQKV 
QDFRPNQKLKKSMLHPSLYFDAGEIQAMRQKSRASHLHLFRAIRSAV 
TVMLSNPTYYLPPPKHADFAAKWNEIYGNNLPPLALYCIjIjCPEDKVA 
FEFVLEY^IDRMVGYKDWLVENAPGDEVPIGHSLTGFATAFDFLYNIiL 
DNHRRQKYLEK I WV I TE EM YE YS KVR S WGKQLLHNHQATNM I ALLTG 
ALVTGVDKG S KAN I WKQAVVDVMEKTMFLLNH I VDG S LDEGVAYGS Y 
TAKSVTQYVFI^QRHFNIigNLDN^LKMHFWFYYATLLPGFQRTVGI 
ADSNYOTFYGPESQLVFLDKFILKNGAGNWLAQQIRKHRPKDGPMVP 
STAQRWSTLHTEYIWYDPQLTPQPPADYGTAKIHTFPNWGWTYGAG 
LPNTQTNTFVSFKSGKLGGRAVYDIVHFQPYSWIDGWRSFNPGHEHP 
DQNSFTFAPNGQVFVSEALYGPKL\THLNNVLVFAPSPSSQCNKPWE 
GQLGECAQWLKWTGEEVGDAAGE I ITASQHGEMVFVSGEAVSAYSSA 
MRLKS VYRAIiLLLNS QTLL WDH I ERQEDS P INS VS AFFHNUD IDFK 

YIPYKFMNRYNGAMMT^VWn2VTTVVT^lTLJI?T^lXlJ/^vrc?T>MTv nTAOh t-i/-\-»v tvtiy-» 

4VA 1 ^xvj-mu™ ii uj v wi^Arix ivmr Wr JJH±i(jNSPMASIQEAEQAAEF 
KKR WTQFVNVTFQMES T I TR I AYVF YG P Y INVS S CRF IDS SNPGLQ I 
SLNVNNTEH\A^SI VTDYHNLKTRFNYLGFGGFAS VADQGQITRFGLG 
TQAIVKPVRHDRIIFPFGFKFNIAVGLILCISLVILTFQWRFYLSFR 

EGHHMDLPDWITSLPGSGAEILKQLFFNSSDFLYIRVPTAYIDIPE 
TELEIDSFVDACEWKVSDIRSGHFRLLRGWLQSLVQDTKLHLQNIHL 
HE PNRGKLAQ YFAMNKDKKRKFKRRESLPEOR OMTTR p T . 

3928 

A 

2 

513 

SSWEFSRPRGGTSQSDISTLSTFPSSPDLEVSETAMEVDTPAEQFL 
QPSTSSTMS AQAHSTS S PTES PHSTPLLS S PDSEQRQS VEASGHHTH 
HQSDNNNEKLSPKPGTGEPVLSLHYSTEGTTTSTIKLNFTDEWSSIA 
SSSRGIGSHCKSEGQEESFRPTEFSAPTR 

3929 

A 

1 

399 

EGETAMEVDTPAEQFLQPSTSSTMSAQAHSTSSPTESPHSTPLLSSP 
DSEQRQSVEASGHHTHHQSDSPSSVVNKQLGSMSLDEQQDNNNEKLS 
PKPGTGEPVLSLHYSTEGTTTSTIKLNFTDEWSSIASSF 

3930 

A 

1 

1812 

NRRTRPGCGWTCYRREFIQRLKLEAALNVHDGCVNTICWNDTGEYI 
LSGSDDTKLVISNPYSRKVLTTIRSGHRANIFSAKFLPCTNDKQIVS 
CSGDGVI FYTNVE \ QDA\ ETNRQC \ QFTCHYGTTYE IMTVPNDPYTF 
LS CGEDGTVRWFDTR I KTSCTKEDCKDD IL INCRRAATS VAI CPP I P 
YYLAVG\CSDSSVR\IYDR\RMLGHKELQGNY\AGRG\TTGIGLPRF 
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ISLPHLNNKSCRVTSLCYSEDGQEILVSYSSDYIYLFDPKDDTAREIi 
KTPSAE\ERREELRQPPVKRLRLRGDWSDTGPRARPE\SERERDGEP 
ES PMCPLM\ QRM S \ DMLSR WFEEAS BVAQSNRGRGRSRPRGGTS QSD 
I S TLPTVPS S PDLEVSETAM E VDTPAEQ FLQP S TS S TMS AQAHS T \ S 
SPKESPHSYSFCYFLPDS\DQRQSVEASGHHTHHQSEFLRGPEIALIj 
RKRLQQLRLKKAEQQRQQELAAHTQQQPSTSDQSSHEGSSQDPHASD 
SPSSWNKQLGSMSLDEQQDNNNEKLSPKPGTGEPVLSLHYSTEGTT 

TSTIKLNFTDEWSSIASSSRGIGSHCKSEGQEESFVPQSSVQPPEGD 
S 

3931 

A 

1 

1878 

MVSQAHCATLHAPTPAAATLTGGRRSLGTAGRAPPGLGTRDHAPAPG 
RRVSGGEGSRKAAAAALAAVAAAPGPVRRCSSQSCFSSSGSSHYSAR 
TS PVRVRPRRSLS SRSAAGNRAEATESAMEKTLETVPLERKKREKEQ 
FRKLFIGGLSFETTEESLRNYYEQWGKLTDCWMRDPASKRSRGFGF 
VTFSSMAEVDAAMAARPHSIDGRWEPKRAVAREESGKPGAHVTVKK 
LFVGGIKEDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTF 
DDHDPVDKIVLQKYHTINGHNAEVRKALSRQEMQEVQSSRSGRGDGY 
GSGRGFGDGYNGYGGGPGGGNFGGSPGYGGGRGGYGGGGPGYGNQGG 
G YGGG YDNYGGGNYG S GNYND FGNYNQQP SNYGPMKSGNFGG SRNMG 
^Pit^AiiAbQVIINKAFLiGSFSGVGSKMQKM^ 

EEFVVEKVLDRR\A7NGKVEYFLKWKGFTDADNTWEPEENLDCPELIE 
AFLNSQKAGKEKDGTKRKSLSDSESDDSKSKKKRDAADKPRGFARGI* 
uiri&K J. Xv3/\.iUi3^»Ljiiiji v ir liWKWi^SDEADLVLiAKjEANMKCPQIVIAFY 
EERLTWHSCPEDEAQ 

3932 

A 

2 

202 

RNYGGGNYGSGNYNDFGNYNQQPSNYGPMKSGNFGGSRNMGGPYGGG 
NYGPGGSGGSGGYGGRSRY 

3933 

A 

9 

111 

SRQAWHEASNYGPMKSGNFGG I \ RNMGGP YGGGNL 

3934 

A 

297 

613 

QWGLRFMNGVLSGGLSVQTASPPSASRRHFPSLGVGDGELSHGGRCP 
AGVKVGRGGGNS YSLS KARCFGGHG VS S AS VS AGSRS S * ARLRS FPS 
PS LTGTG FGGR 

3935 

A 

14 

1276 

KSGGTALAAVAAAPGPVRRCSSQSCFSSSGSSHYSARTSPVRVRPRR 
SLSSRSAAGNRAEATESAMEKTLETVPLERKKREKEQFRKXFIGGLS 
FETTEESLRNYYEQWGKLTDCV\VMRDIiAS\KRSRGFCFVTFSIHGL 
EVDAAI AS \ RPH\ S INGE WEQKRAVAREESGKPG AHVTVKKLFVGG 
IKEDTEEHHLRDYFEEYGKIDTIEIITDRQSGKKRGFGFVTFDDHDP 
VDKIVLQKYHTINGHNAEVRKALSRQEMQEVQSSRSGRGGNFGFGDS 
RGGGGNFGPGPGSNFRGGSDGYGSGRGFGDGYNGYGGGPGGGNFGGS 
PGYGGGRGGYGGGGPGYGNQGGGYGGGYDNYGGGNYGSGNYNDFGNY 
NQQPSNYGPMKSGNFGGSRNMGGPYGGGNYGPGGSGGSGGYGGRSRY 

3936 

A 

827 

2064 

QQRRVPCGGAPRR WSRAAAVLTTAPGRVRFVFVRGDLSHLARLRE I 
GLKRLSPRWREKKKQFRKLFIGGLSFETTEESLRNYYEQWGKIjTDCV 
VMRDPASKRSRGFGFVTLSSMAEVDAAMAARPHSIDGRVVEPKRAVA 
REESGKPGAHVTVKKLLVGGIKEDTEEHHLRDYFEEYGKIDT\IEII 
TDRQSGKKRGFGFVTFDDHDPVD\KIjDLRKYHTINGHNAEVRKALSR 
QEMQEVQSSRSGRGGNFGFGDSRGGGGNFGPGPGSNFRGGSDGYGSG 
RGFGDGYNGYGGGPGGGNFGGSPGYGGGRGGYGGGGPGYGNQGGGYG 
GGYDNYGGGNYGSGNYNDFGNYNQQGSS\YGPMKSGNFGCWR\NMGG 
P\YGGGNYGP\GGSGGSGG\YGGRSPILKLLPICHGASLV 

3937 

A 

36 

320 

GPGRWVVVDEVTMWLELEQYPVSFLKQPFEIGRHLDLPKGTYGICVN 
MCPGNKSC/PRGKRCOSNGCGHVCKVVPRT^K^nPT.Rr^nnKMTRW*? 
SL 

3938 

A 

3 

715 

LCGGVIDVNDHAPEVTMS AFTSP I PENAPETWALFSVSDLDSGENG 
KI SCS IQEDLPFLLKS SVGNFYTLLTETPLDRESRAEYNVTITVTDL 
GTPRLTTHLNMTVLVSDNNSPALHIGSVSATDRDSGTNAQVTYSLLP 
PQDPHLPLTSLVSINADNGHIiFALRSLDYEALQGFQFRVGASDHGSP 
AL S S E AL VR VWLDANDD S P F VL Y P LQNG S APCTE L VPRAAE PG YLV 
TK 
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3939 

A 

1 

519 

LLKATELGLFGVWAHNGEDNGEPPRSATATLHVLLVDGFSQPYLPLP 
EAAPAQAQAEADLLTVYLVVAL.ASVSSLFLLSVLLFVAVRLCRRSRA 
ASVGRCSVPEGPFPGHLVDVRGAETLSQSYQYEVCLTGGPGTSEFKF 
LKPVISDIQAQGPGRKGEENSTFRiJSFGFNIQ 

3940 

A 

1 

967 

MGHHLM I DKLVALGGL YYA I QRHYATKC S VLKNDQ I LV I GLFM I QNV 
IYRKHFANPLSALFLQQGIELFAAIAEIHITTVAERNHAITQIRL.EA 
QT I FDSLKEWHNAI R KS LD YEALQAFE FRVG ATDRGFPALS S EALVR 
VLVLDANDNSPFVLYPLQNGSAPCTELVPRAAEPGYLVTKVVAVDGD 
SGQNAWLS YQLLKATEPGLFG VWAHNGEVRTARLL S ERDVAKHRLW 
LVKDNGEPPRSATATLQVLLVDGFSQPYLPLPRAAPAQAQADSLTVY 
LWALAS VS SLFLFS VLLFVAVR\ LCRRSRAAPVGRCSVL 

3941 

A 

1 

2785 

RILSVDDFVELITVSSLVAVAEGIGYRDLDSNMKKLGRIHPNRQVIiAF 
ILMVFLSQVRLEPIRYSVI,EETESGSFVAHLAKDLGLGIGEIiASRSA 
RVLSDDDKQRLQLDRQTGDLLLREKLDREELCGPIEPCVLHLQWLE 
MPVQFFQGELLIQDINDHSPIFPEREVLLKILENSQPGTLFPLLIAE 
DLDVGSNGLQKYTI SPNSHFHI LTRNHSEGKKYPDLVQDKPLDREEQ 
PEFSLTLVALDGGSPPRSGTVT4VRILIMDINDNAPEFVHTPYGVQVL 
ENS PLDS P I VRVLARD IDAGNFGS VS YGLFQASDE I KQTFS INEVTG 
EILLKKKLDFEKIKSYHVEIEATDGGGLSGKGTVVIEVVDVNDNPPE 
LIISSLTSSIPENAPETWSIFRIRDRDSGENGKMICSIPDNIjPFIL 
KPTLKNFYTLVTERPLDRE\TSAEYNITIAVTDLGTPRLKTQQNITV 
QVSDVNDNAPAFTQTSYTLFVRBNNSPTVLHIGSVSATDRDSGTNAQV 
TYSLLPPQDPHLPLASL\ VS INTDNGHLF\ AL\ RSLDY\ EALQAFRV 
S AliGATDRGS PALS SEAL VRVLVLDTNDNL \ PFVLYPLQNGSAPCTE 
Ii\lPGRAAEPGYLVTKVVAVDGDSGQNAWLSYQLLKATEPGLFGVWA 
HNGEVRTARLLSERDAAKHRLVVLVKDNGEPPRSATATLHVLLVDGF 
SQPYLPLPEAAPAQAQAEADLLTVYLVVAIoASVSSLFLLSVIiLFVAV 
RLCRRSRAASVGRCSVPEGPFPGHLVDWSGTGTLSQSYQYEVCLTG 
DSGTGEFKFLKPIIPNFLPQSTDLCGGVIDVNDHAPEVTMSAFTSPI 
PENAPETWALFS VSDLDSGENGKI SCS IQEDLPFLLKS SVGNFYTL 
LTETPLDRESRAEYNWITVTDLGTPRIjTTHLNMTVLVSD 

3942 

A 

383 

703 

NRITPTSARINLSILVPSAINNGSMTEDINRTVISGXPRQNSIKMTD 
KIRTIGIWERRPGVVVLTTCLIMV*HIjTSAIANRLNIjIjRKLGRMVIF 
SHRLKVSSMKSA 

3943 

A 

1887 

3072 

NFVMC\HGEPWLAGRTWPVSTLSGPELALQKLGKTPLGRYLFTSST 
LTRDFIEIGRDAGLWGRRSRLRLSGKPIiLLTELFYRRHRCTKRKKIC 
DC I S ANVWAVLKTVEAAGSDRPVQCIiTITAVQRERDTQSIjSVI TAEL 
E P VS S PH F I REAGQ YGTGRS S KEREVQP WQRS FSNW I RNNCGD ICE 
LS PTSGDHPPPSGPGLYSARKAIMQHYQARGMRDVTVED I YIGNGVS 
ELIVQAMQALLNSGDEMLVPAPDYPLWTAAVSLSSGKAVHYLCDESS 
DWFPDLDD I RAKI TPRTRG I VI INPNNPTGAVYS KELLME I VE I ARQ 
HNLI I FADEI YDKILYDDAEHHSIAPLAPDLLTITFNGLSKTYRVAG 
FRQGWMVHRFFTPRLTVQAI 

3944 

A 

623 

2587 

SPRKGGGKGKKANRVLLTGAPLDPIDLSQSAVSHSVKEIi/ENHTGVR 
LLDRTTREVVL/TDAGQQLALRLERIiLD/ELNSTLRDTGRMGQQLSG 
K\WVAASQTISAHIiIPQCIAESHRRYPDIQFVLHDRPQQWVMESIR 
QGDVDFGIVIDPGPVGDLQCEAILSEPFFLLCHRDSALAVEDYVPWQ 
ALQGAKLVLQDYASGSRPLIDAAIiARNGIQANIVQEIGHPATLFPMV 
AAGIGISILPALALPLPEGSPLVVKRITPVVERQLMLVRRKKRSLST 
AAEAIjWDVCNCAIQPFFTAITRCIGSVTVKLVLVPVGKSEFLAAQHK 
WTCNPAGDIAWASVIGIjICITLSKERFEPPTAAGEIVADTVQEHI 
WEIHAIRRTVGAIRMQVEIKSGVTQAYLQIiGINEKLGLSGRPDRPI 
GCLGTSKIYRIIjGKTVVCYPIIFDLSDFYMSQDVFLLIDDIKNALQF 
IKQYWKMHGRPLFLVIilREDNIRGSRFNPILDMLTU^LKKGIIGGVKV 
HVDRLQGPLENDLWHVALIAESQRLQVFLNTYGIQTQTPQQVEPIQ 
IWPQQELVKVTLGAFGHEEEVISNPLSPRVIQNIIYYKCNTHDEREA 
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VIQQELVIHIGWIISNNPELFSGMLKIRIGIiQNECSLVIQVSTIIPA 
T 

3945 

A 

102 

2392 

RKQMNYSLKQLKVFVTVAQEKSFSRAGERIGLSQSAV\SHSVKELEN 
HTGVRLLDRTTRAVVLTDAGQPLAIjRLERLIjDEIJ^STIiRDTGRMGQQ 
L S G KVRVAAS QT I S AHL I PQC I AESHRR YPN I QFVLHDR PQQWVMES 
IRQGDVDFGIVIDPGPVGDLQCEAILSEPFFLLCHRDSALAVEDYVP 
WQALQGAKL VLQD Y ASG S R PL I D AALARNG I QAN I VQE I GH P ATL FP 
MVAAGIG I SILPALALPLPEGS PLWKR I TP WERQLMLRVDLRFAQ 
KFQIGDRKILRSGNNAVNVLPAATCGFPFPVCCWRFAYGCCSALLRI 
PPRLYSVFI I ISLLYGFSRAHISSPGACWVFIHERFGQFMYGLYPHD 
QRWRINLALLIGLVS IAPMFWKILPHRGRYIAAWAVIYPLIVWWLMY 
GG FFALERVETRQWGGLTLTLI I AS VG I AGALPWG ILLALGRRSHMP 
I VRILSVI FIEFWRGVPLITVLFMSSVMLPLFMAEGTS idkliralv 
GVILFQSAYVAEVVRGGIjQALPKGQYEAAESLALGYWKTQGLVILPQ 
ALKLETVRQPQTLAMEEINCHHNYVQKEQHFGEEIYVTRKGAVSARA 
x \3 X X x^JdMLtM-KlO r X V KL>ljoWBr»Sr CSCSHGAGRVMSRTKAIQCIjFS 

\7T7FjOT'DZi r T 1 ZVTJ\rC , r % I>'KT^RT?\rTT^*C , T , DMA V'V"P»TT"v7\XTIVJIT\ Tv ApnT tn?iTTVPT 
v CiU^iKH x .mjt. v i<. isjjj\e, V x,L/c»X £rI v lA x iLUXIJA VPlAALjolJXl V II* V X X I JL» 

RQVHSCRAGETGAVAPASDRGKGGDGNLRSNRGRRGAGVAASALPAR 
HRARGRLPLCYR 

3946 

A 

2 

331 

LRDRDFHRIVQISSFIELHAENLKNEDDVDTGLLGFWTLIillSLTAG 
FSCCSFSWTVTYFDSFEPGMFPPTPLSPARFKKLTGHSFHMGYSMAI 
LNG I VAALTVAWCLM 1 

3947 

A 

3 

116 

CSLLFAILLAFLLAIAYLIVKAFSGLQMAWHLKPYNA 
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CS xjJUFAX IjIiAFLiLiAI AYIj I VKELHAENLKNEDDVDTGIjIjG I IC I SRK 
FSLML 

1 OA Q 

TV 

1 Q1 

804 

RPLPR IGCCGPGAGQVGEARS PRVSLCFAPCRLLRAGGGRALRRRGP 
GTPGPVARPSYSSFTQGDSWGEGEVDEEEGCDQVARD\LRAEFIGMG 

qwseprklsvlppdrngspvlpdkXrngifpsgrgqqepslrrwpfq 
vlisilcsllfaillafll^iaylivkelhaenlkneddvdtgli^™ 
tl l 1 1 s ltag i lllqlflgq 

3950 

A 

1 

149 

KQMKLRVMNSQEQCLPSGAEDIMSGGPTLEAAEFDDLIGKCLPDRPI 
AE 

3951 

A 

1 

1896 

MPFAPHLLSAIiEWAAGEETLLFGHHDKGPEARSTGADGLSEPEGISL 
KRVAWEDFFDI I YSMHVESSAEPGKAPKHAGQKKTYRAIAETYAFI* 
PREAVTRFLMSCTECQKRMHFNSNGLEPKGGFGTLRRPSTFGWYNYN 
AALRNGRQPVSWPTVPRHKAANVQVAQQRHKVCLYPGQWEMLGGFTG 
GSVARLTHEAK7UCVAPADSSASSGTQVQGQGLHSEPVAHTVCQPRGL 
S I CYAGFQS VAQPLGG PGDDTS FGAP S VS P YVKGY FRLE KB I CVPE I 
MFLEEDGEQKALPASSPLGSYLSPVXiCLPCEPSSSQHPSGAVFLVWQ 
MELTLPAPVEPSAGWESPRERAQTWGFSFIHLFSKHIWDTYYVSGIV 
LDVGDTAVNRTELCPLGMYIPVGLELPVAQSGPPHCQVAGIPAHLGP 
o o HLo ISKy V 1 W AJU>o Kby V 5>GME I KKEQLPPGTV I AS PQELGI VGDL 
PHTPPSMPLLKDPSDTDRLKTQGLFLQKSGHLSKLQVFVCKMEPFRK 
RRAATS PKGKPS VLAPEENEP PS PLVSGI IDYNMPLTS TYLKQMKLR 

vimoycyo i»u£i v x rcuxj X XXwlYHXsJJoJrlJ!t.Koo X Jr W ILL VL.V_XiU.NXj JL/uUl 

GLIPLSAPFTGEKTBPLTELS 

3952 

A 

3 

277 

vtEitr xrojri-i v OvjX llJ xnillrXjX oil X»l\Linj\XjK VPlxNoLi-bVio X JWsrti V xlCCiXiX 

IEKADDSHERSSIFWTLCVCCLQNLTA7VHGLIPLSAPFTGEKTE 

3 953 

A 

2 

396 

LSPAPAPLTRGSRPPISASLAAQDETSVSSEDFDMSDSTWMSADPHIj 
ASSLSPSQDERMRSPQNLHSQEDDDSSSESGSGNGSSTLNPSTSSST 
QGDPAFPEMNGNGAVAPMDFTTAAEDQPINLCDKLPP 

3954 

A 

1 

209 

SRPPISASLPAQDETSVSSEDFDMSDSTWMSADPHLASS1.SPSQDER 
MRS PQNLHSQEDGGVCLQGPTA 

3955 

A 

466 

3315 

EMVPRWAI YQGFQDCLVHHAS SRASGLWKLLNPCRGLRC PDTRLPTS 
RW PNRDTRS AS TQLLGKQS LPPAPE ADPVFLPQEENEP P S PLVSG 1 1 
DYNMPLTSTYLKQMKLRV^SQEQAGSSPTPIQTERHTAQLLLTADQ 
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TLVIiPATQHYWLKKFPRPGAEDIMSGGPTLEAAEFDDLIGKCLPDR 
P I AOS VRLGDR I WNPRG S PPRRCRLHGQC VPG SGLBRRPRD Y P I TLQ 
LWLARRQDPARCGLAEHNDRRRGGGGHAGGGRWGSRRCLGRTVTRRG 
GAS PS PRAAAAAAAAAAAG AVGL S P PP APLTRG S RP P I S AS L P AQDE 
TSVSSEDFDMSDSTWMSADPHLASSIiSPSQDERMRSPQNLHSQEDGW 
AVGGLLSKSRVEPWLGQVMPTLPWAHPWVTAALSHQEAQNSGLLLPP 
PGPNTHGQTSALEQRAGAQAFQGLAVGHTGSPGSQESVHHQLGRRRH 
RQCWAVLGPGPPRESHGGGTASSSHTSGHGRQSARRACWPPVAVYLA 
YGPRDEPWPTAGATMEIGADDSSSESGSGNGSSTLNPSTSSSTQGDP 
AFPEMNGNGA/DGPHGLH\QA7^EDQPINLCDKLPPATALGTASYPSD 
GCGADGLRSRVKYGVKTTPESPPYSSGSYDSIKTEVSGCPEDLTVGR 
APTADDDDDDHDDHEDNDKMNDSEGMDPERLKAFNVSTGRKQMFVRL 
FVDENLDRMVPISKQPKEKIQAIIESCSRQFPEFQERARKRIRTYLK 
SCRRMKKNGMEMTRPTPPHLTSA24AENILAAACESETRKAAKRMRLE 
IYQSSQDEPIALDKQHSRDSAAITHSTYSLPASSYSQDPVYANGGLN 
YSYRGYGALSSNLQPPASLQTGNHSNGPTDL\SMKGGAFTTSTTPTP 
TPSSTSTSRPVPTAQLSPTEISAVRQLIAGYRESAAFLLRSADELEN 
LILQQNRPRRHL 

3956 

A 

11 

755 

ETHAPPSGVWGLERPSHPADQEIAGGQGQTAPAWEKPHSHPTPKSCG 
DQTKTTLHTERREDKPSFTRAGWGWENLKGPPVTEGAGSPGGSSHGA 
GLNRDHPLQSQPGRRAJPWPQRAPTHMLFSMTRGQSPGAAPGAQPEI 
SKLAGYKKQKKQRSCLRKKCHP / QEAGHRQRVAHSAGPGLQGEAAGE 
GRGEGAFWLKARNSKYSSLGINIPPNKGAQGSCLGKVGCRRHDPRHS 
GLSAPFPRPMSQAL 

3957 

A 

175 

559 

VKRQLRRLRRS PWPDEKPKEGVKTE1WDHINLKVAGQ\ DGS VEQFKI 
KRHTPLS KLMKAYCE\ RQGLSMKQI KI PDFDGQP\ INETDTPATVGK 
WKDERYQLMVFQQARPGGVYLKRELCFLYSQNSVSF 

3958 

A 

140 

380 

HQKQVRMTTLTHRARRTEISKNSEKKMESEEDSNWEKSPDNEDSGDS 
KD I RXiTLMEEVLLLGLKpKEVMQLGLLG YLKDY 

3959 

A 

98 

426 

LLTPSSSAALKGESVQWHLREAGQSLPRRLTCAGWNKRFLHWIDQLS 
G * TQTCLLL WFVW I ATE * LGESLPL * AP I GGTQPW * FL IGAMKKHQ 
ETGENDHFNS PGPSH 

3960 

A 

234 

1050 

ARTLKRRWKVRKTVIGRKVQTMKIIiETLRISALLLWEEVLLLGLKDK 
EG YTS FWNDC I S SGLRGG I L I ELAMRGR I YLEPRTMRKKRLLDRRVIj 
LKSDSPTG\DVYWDETLKHIKATEPTETVQTWIELLTGETWNPFKLQ 
YQLRNVRER IAKNLVEKG I LTTEKQNFLLFDMTTHPVTNTTEKQRLV 
KKLQDSVLERWVNDPQRITOKRTLAIjLVIJVHSSDVLENVFSSLTDD^ 
DVAMNRAKDLVELDPEVEGTKPSATEMIWAVLAAFNKS 

3961 

A 

1 

1074 

MEPIGARLSLEAPGPAPFREAPPAEELPAPWPCVQGGGDGGGASET 
PSPDAQLGDRPLSPKEE/ CRPPGAGGAAGMPPPLPRALLFLARRPHIi 
/VRRPSSCSSSSNRRWHWAA/ SGAEDAQLGPGGCCAKCKKRVQFADT 
LGLSI^SVKHFSEAEEPQVPPAVLSRLRSFPMRAEDLEQLGGLLAAA 
AVAAPLSAPPSRLRPLFQLPGPSAAAERIiQRQRVCLERVQCSTASGA 
EVKG SGRVLS CPGPRAVTVRYTFTEWRS FLDVP AELQPE PLEPQQPE 
APSGASEPGSGDAKKEPGAECFHFSLCLPPGLQPEDEEDADERGVAV 
HFAVCYRCAQGEYWDNNAGANYTLRYARPADAL 

3962 

A 

42 

426 

NGFCVQPPYFTGEITVLKMLFLSQDFLCVSSRGDLLYGGGSPEPWPR I 
AVCHIHTVSGHCLLFLPFSAVNKDRHFSCEDCNGNVSGGFDASTSQI 
VTjCQNN I UNQAHMNRWTHEL IHAFDHCRAHVD 

3963 

A 

3 

899 

PPNVFSPASLPFFPAEGGPGRSRWREGGYLSQSRSGRLSQEEAASRS 
AGGMAGAPDERRRGPAAGEQLQQQHVSCQVFPERLAQGNPQQGFFSS 
FFTSNQKCQLRLLKTLETNPYVKLLLDAMKHSGCAVNKDRHFSCEDC 
NGNVSGGFDASTSQIVLCQNNIHNQAHMNRVVTHEIiIHAFDHCRA\H 
VDWFTNIRHLACSEVRAANLSGDCS\LV\NEIFRV\QFGLKHPP\QT ! 
\ CVRDRATL S I LAVRNI S KE VAKKAVDE VFES C FNDHE P FGR I P \ HN 
KTYARYAHRDFENRGR\ YYSNI 
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3964 

C 

165 

347 

MTGQISNKKCIELENXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXX 

3965 

A 

871 

1194 

EFVFFFFETESFSVT\QLECSGHNLGLHCNLHLPGSSNAPASASRVA 
a vj/vnjxjxf-vrtij ±r v r iiVDlubf -Hy -LvjWAMIj V L»i>Jj±rlJj[QV ICPPWPPQS 
AGIYRHEPQHPGRK 

3966 

C 

54 

197 

MVLSSFNFQKAIKXALLKYFKISFQLFPFYVHKIGCRHSHFAMNVS* 
* 

3967 

A 

263 

744 

SRSHDHLYTVYVPSPWNFFRALRI SNSKNEMFSMS FIiFREMKKKKLP 
SDSGDLEALEGKDKEKESTVHIETHQNTSKNVAAVQPMKR\GQKSKM 
KKMKE KY*DQDEEDREIiIMKLI»G SAG SNKEE KG KKGKKGKTKDEPVK 
KQPQKPRGGQRVSDNIKKE 

3968 

A 

189 

1393 

SHTDFPYDNSTDGGDTSSDEDKEEHETPVEVELMTQVDQEDITLQSG 
icu n, j_»jm is & jj X y k. b, b b JbD EGE YE E V / RE KDQD S VG EMKDEG E ETLNY PD 

T\TIDLSHL\QPQRSIQKLASKEESSNSSDSKSQSRRHLSAKERREM 

KKKKLPSDSGDLEALEGKDKEKESTVHIETHQNTSKNVAAVQPMKRG 

QKSKMKKMKEKYKDQDEEDRELIMKLLGSAGSNKEEKGKKGKKGKTK 

DEPVKKQPQKPRGGQRVSDNIKKETPFLEVITHELQDFC/AVDDP/H 

LIDQGKEPDLDQQGNEENLFXDSFDQGQPHXPEDVL/LGFAIPICAP 

YTTMTNYKYK\^TPGVQKKGKAAKTALNSFMHS 

VKDTDLSRNIPGKVKSVCTQSSERKKEIAEMKF 

3969 

A 

192 

1510 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAEIiATFPLDLTKTRL 
QMQGEAALARLGDGARES AP YRGMVRTALG 1 1 EEEGFLKLWQGVTPA 
1 Y RH WY SGGRMVTYEHLRE WFGKS EDEHYPLWKS VI GGMMAGVIG 
QFLANPTDLVKVQMQMEGKKKIiEGKPLRFRGVHHAFAKILAEGGIRG 
LWAGVTVPNIQRAALVNMGVTAFIHNRSHPSRNAHRLRLGEPVRQKPL 
vjjijKJJLiHbAVAODDNAIjLFAAALHRAAGHAIPARAAPFQPASGRRKN 
LRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIKMRSLEKLRSVTKV 
QQLFDQADLGIQAFTYCAASLWHCVWNLWVLGLTDFKNEATAPRGMK 
LQTFAVSVTALKATRLDLFVPPGGLWSLASGVKLQTFSIPPPPPRD 
LNSSGLNSSDYIMSSR 

3970 

A 

68 

382 

MSVPDEE\ERLLRIiTQRWPRASKFLLSGCAATVAELGTRLPTPGPPG 
PVARAALGEETAG AASRPGSAQS KRS PPSHARLAEVAVEARPRGGRR 
RDLEVSTAFSF 

3971 

A 

3 

1451 

GAVRTWGRGFQTEKCQASLLNFWNPPTTAQVTIEAEPTKVSKGKDVL 
LLVHNLPQNLAGYI WYKGQMKDLYHYITS YWDGQI 1 1 YGPAYSGRE 
TVYSNASLL I QNVTREDAG S YTLH I VKRGDGTRGETGHFTFTL YRHS 
AU£>ij.ts v i obby^lj w DTWENCPTLWLHCLMTELTSGLDSVLPCVI 
SAEVPSPRSGFPVSKGFKDNWKFHHPSLGCPWQGKLQRKHTSGGKVR 
LKLRRFQHCMLQVRTTS\ETPKPSISSSNLYPREDMEAVSLTCDPET 
PDASYLWWMNGQSLPMTHSLQLSKNKRTLFLFGVTKYTAGPYECEIR 
NPVSASRSDPVTIJSrLLPKLPK^YITINNLNPRENKDVIiAFTCEPKSE 
NYTYIWWLNGQSLPVSPRVKRPIENRILILPSVTRNETGPYQCEIQD 
R YGG I RS YPVTLNVL Y\ TTKHSGLYACS VRNSATGMES S KSMTVKVS 
APSGTGHLPGLNPL 

3972 

A 

1 

1008 

rirt i Aiiijt3K I A vIjTAVLRKFLDPRL I STEENTQAAETMGPLS A 

PPCTEHIKWKGLLVTASLLNFWNLPTTAQVTIEAQPPKVSEGKDVLL 

LVHNLPQNLTGY I WYKGQIRDLYHYITS YWDGQI 1 1 YGPAYSGRET 

AY SNAS LL I QNVTREDAG S YTLH 1 1 KRGDGTRGVTG YFT FTL YLETP 

KPSISS SNLNPREAMETVI LTCDPETPDTS YQWWMNGQS LPMTHRFQ 

LSETNRTLFLFGVTKYTAGPYECEIRNSGSASRSDPVTI*NIjLREYLL 

FLCGPDCHPKYTWPEARPLSPSQVQVQRPLPLGIKAGHDFLPQANQN 
RPRLDPQ 

3973 

A 

3 

1374 

TAQLTAVLREFLDPRLISTEENTQAAETMGTLSAPPCTQRIKWKGLL : 
LTASLLNFWNLPTTAQVTIEAEPTKVSEGKDVLLLVHNIiPQNLTGYI 
WYKGQMRDLYHYITS YWDGEI 1 1 YG PA YS GRETAYSN AS LL I QNVT 
REDAGS YTLHI I KGDDGTRGVTGRFTFTLHLETPKPS I S SSNLNPRE 
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TMEAVSLTCDPETPDASYLWWMNGQSLPMTHSLKLSETNRTLFLLGV 
TKYTAGPYECEIRNPVSASRSDPVTLNLLPKLPKPYITINNLNPREN 
KDVLNFTCEPKSENYTYIWWI^NGQSLPVSPRVKRPIENRILILPSVT 
RNETGPYQCEIRDRYGGIRSDPVTLNVLYGPDLPRIYPSFTYYRSGE 
VLYLSCSADSNPPAQYSWTINEKFQLPGQKLFIRHITTKHSGLYVCS 
VRNSATGKESSKSMTVEVSAYSSSINYTSGNRN 

3974 

A 

1 

1854 

MATEKRLSAEGGRTAQLTAVLRKFLDPRLISTEENTQAAETMGPLSA 
PPCTEHIKWKGLLVTASLLNFWNLPTTAQVTIEAQPPKVSEGKDVLL 
LVHNLPQNLTGYIWYKGQIRDIiYHYITSYWDGQIIIYGPAYSGRET 
AY SNAS LL I QNVTRED AGS YTLH 1 1 KRGDGTRG VTGY FTFTL YREAM 
ETVILTCDPETPDTSYQWWMNGQSLPMTHRFQLSETNRTLFLFGVTK 
YTAGPYECEIRNSGSASRSDPVTLNLLQHSWLIDGKFQQSAQVFFIP 

RKLITKPETWENKDVLAFTCEPKSESYTYIWWLNGQSLPVSPRVKRP 
IENRILILPSVTRNETGPYECEIRDRYGGIHSDPVTLNVLCIFCSSV 
GQATSLNPNDQGPGLSVSLRSKYRQLYFWTSKLAMTPCPGKTWVGTA 
LTKNIRGVDGLWGDLGPTSCDGPDLPRIHPSYTNYRSGDNLYLSCF 
ANSNPPAQYSWTINGKFQQSGQNLFIPQITTKHSGLYVCSVRNSATG 
EESSTSLTVKVSGCREMRREGASGQTPKPGPGSEVSGCMKIiWAQYVG 
HSMEDCD 

3975 

A 

1 

1319 

VLREFLDPRLISTEENTQAAETMGTLSAPPCTQRIKWKGLLLTASLI, 
NFWNLPTTAQVTIEAQPPKVSEGKDVLLLVHNLPQNLTGYIWYKGQM 

x\±j±j ini iiai v vjJODi x x iurni ouKc X /\ X oIM/\oxjJj X \Jrt V 1 K rr 1 ?Hi\tcy 

YTLHI I KGDDGTRGVTGRFTFTLHLETPKPS I SS SNLNPRETMEAVS 

LTCDPETPDASYLVWMNGQSLPMTHSLKLSETNRTLFLLGVTKx^ 

PYECEIRNPVSASRSDPVTLNLLPKLPKPYITINNLNPRENKDVIiNF 

TCEPKSENYTYIWWLNGQSLPVSPRVKRPIENRILILPSVTRNETGP 

YQCEIRDRYGGVRSDPVTLNVLYGPDLPRIYPSFTYYRSGEVLYLSC 

SADSNPPAQYSWTINEKFQLPGQKLFIRHITTKHSGLYVCSVRNSAT 

GKESSKSMTVEVSEAL 

3976 

A 

1 

2247 

^ILPKVIYRFNAIPIKLPMTFFTELERTTLKFIWNQKRACIAKSIL 
SQKNKAGG ITLPDFK1YYKATVTKTAWYW YQNRD I DQWNRTDPSEI T 
PHIYNYTjIFDKPEKNKQWGKDSLFNKWCWEN^ 

YTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAM 
ATKAKI DKWDLI KLKS FCTAKETTIRVNRQPTKWEKI FTTYS SDKGL 
ISRIYNELKQIYkKKTNNPIKKLISTEENTQTAETMGLLSAPPCT 
ITWKGIiLLTAS LLNFWNPPTTAQVTI EAQPTKVS EGKDVX»LL VHNLP 
QNLTGYIWYKGQIRDLYHYITSYVVDGQI 1 1 YGPAYSGRETI YSNAS 
LLIONVTOEDAnQYTT.HT TMRnnTTMPnVTfiHPTPTT vt T7TDVDCTOO 

S KLNPREAMEAVS LTCD PETPD AS YLWWMNGQS L PMSHRLQLS ETNR 
TLFLLGWKYTAGPYECEIRNPVSASRSDPFTLNLLQHSWLIDGKFQ 
QSAQVFFIPQITKTYRGVYVCFIHNSATAGTNLIIKRIIVPAKLPKP 
YITINNLKPRENKDVLNFTCEPKSENYTYIWWLNGQSLPVSPRVKRP 
IENRILILPSVTRNETGPYQCEIRDQYGGIRSYPVTIiNVI*YGPDIiPR 
IYPSFTYYRSGEVLYLSCSADSNPPAQYSWTINGKFQLSGQKLFIPQ 
ITTKHSGLYACSVRNSATCKESSKSMTVKVSGGIALADTSFSCS 

3977 

A 

1 

1993 

MGPLSAPPCTQRITWKGLLLTALLI^FWNLPTTAQVTIEAEPTKVSK 
GKDVLLLVHNLPQNLAGYIWYKGQMKDLYHYITS YWDGQX 1 1 YGPA 
YSGRETVYSNASLLIQNVTREDAGSYTLHIVKRGDGTRGETGHFTFT 
LYLETPKPS I S S SNL YPREDMEAVS LTCD PETPDAS YLWWMNGQS LP 
MTHSLQLSK^nCRTLFLFGVTKYTAGPYECEIRNPVSASRSDPVTLNL 
LQHSWLIDGKFQQSAQVFFIPQITKTYRGVYVCFIHNSATAGTNLII " 
KRIIVPDTLRKRGNHEKVESSDDIENSNQPFSHPKAFKNIQVQHGQY 
GIHRKLITKLEKWLDSGLPCVISAETSSPRPGCSVSSGFQGHGKSHY 
YS SLECPWKWKLQRNH I YGGKVGWNLEG AHQLH I P AKLP KP Y I T I NN 
LNPRENKDVLAFTCEPKSENYTYIWWLNGQSLPVSPRVKRPIENRIL 
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ILPSVTRNETGPYQCEIQDRYGGIRSYPVTliNVLCIFCSSVGQDTSL 
NPNDQGPGLSVSLRSKYRHIiYFWTSEVAMTPCPGKTWVDTALTKNIR 
RGDALVMGDLGPTACDG PDLPR I F PS VTS YYSGENLDLS CFANSNPP 
AOYSWTINGKFQLSGQKLFIPQITPKHNGLYACSARNSATGEESSTS 
LTIRVI 

3 978 


i 

x. 

inns 

i-> iZiiM x yn/VD x nor us>/\r xr\+ x nxj jl x v» i\vj v xjXj i Ljj rlV f WW .fir X X V 1 X 

E AQP PKVS EGKD VLLLVHNI*PQNI*AG Y I W YKGQMT YVYHY I TS YWD 
GQR 1 1 YG PAYSGRERVYSNASLL IQNVTQEDAGS YTLHI I KRRDGTG 
GVTGHFTFTLHLETPKPSISSSNLNPREAMEAVILTCDPATPAASYQ 
WWMNGQSLPMTHRLQLSKTNRTLFIFGVTKYIAGPYECEIRNPVSAS 
RSDPVTLI^LHGPDLPSIYPSFTYYRSGENLYLSCFAESNPRAQYSW 
TINGKFQLSGQKLS I PQITTKHSGLYACSVRNSATGKESSKSITVKV 
SDWILP 

3 979 


3 


n^Jui Xnv Xjrvrvr XtXJ tr xvxj ± o X c»£tlM X ^/-i-rt_C* X I t Ivj> cJjOrVcr ^, X £»2xJL IvnAvUil V 

TASLLNFWl^POTAQVTIEAQPPKVSEGKDVLLLVHNLPQNLTGYIW 
YKGQIRDL YHYITSYWDGQI 1 1 YGPAYSGRETAYSNASLLIQNVTR 
EDAGS YTLH 1 1 KRGDGTRGVTG YFTFTLYLETPKPS I S SSNLNPREA 
METVILTCDPETPDTSYQWWMNGQSLPMTHRFQLSETNRTLFLFGVT 
KYTAGPYECEIWE\SGSASRSDPVTIiESP/RMGPDLPRIFPFSSPLN 
YFREPJJLGFVPGLGDF*TPP\AQYSWTINGKFQLSGQKLFIPQITPK 
HNGLYACSARNSATGEESSTSLTIRVIAPPGLGTFCFQ 

3980 

A 

3 

1442 

IRADPAHELENCSCPGKRLSTERGRTAQLTAVLREFLDPRLISTEEN 
TQAAETMGTLSAPPCTQRIKWKGLLLTASLLNF^WLPTTAQVTIEAQ 
PPKVSEGKDV^LLVHl^PQINnJTO^ 

TTVnDaVCr!DT?THVCWaCT T TfYKT\7 r P'DI?'ri7Vf^O , V r PT CT Tirr , nr/"Pt>r , \nr 
J. X I urrt.1 ov>KJtS oin/^oxjIjXS^IN V IKbUAbbi 1 Xoii X x KLaXJXA? 1 Kb V X 

GRFTFTLHLETPKPSISSSNLNPRETMEAVSLTCDPETPDASYLWWM 
NGQSLPMTHSLKLSETNRTLFLLGVTKYYLQDPIECEIRNPVSASRS 
DPVHPGISCPK\LPKPYITI*mLNPREKKDVl^ 

WIjNGQS L PVS PRVKR P I ENR I L I L PS VTRNETG P YQCE I RDR YGG VR 
SDPVTLNVLYGPDLPRIYPSFTYYRSGEVLYLSCSADSNPPAQYSWT 
INEKFQLPGQKLF1PJ1ITTKHSGLYVCSVRNSATGKESSKSMT\VEV 
SGKVDPKHRWQ 

3981 

A 

79 

1565 

EAS LLEDDDGMVYLRPLTSHQ YRTAQLRAMLRKFLDPRLS STEENTQ 
AAETOGPLSAPPCTQRITWKGLLLTASLl^FWNPPTTAQVTIEAEPT 
KVS KG KD VLLL VHNL PQN LAG Y I WYKGQMKDLYHY ITS YWDGQI 1 1 

I KjfM. I obKHi V I o1M>VoXjxjX. V XKCil/Abo X X XjxI X V IvKLjXXj IKbL X L»rl 

FTFTLYLETPKPSISSSNLYPREDMEAVSLTCDPETPDASYLWWMNG 
QSLPMTHSLQLSKNKRTLFTjFGVTKYTAGPYECEIRNPVSASRSDPV 
TLNLLPIGliPKPYITIlWLNPPJENKDVLAFT 

GQSLPGSVPRViaiPl\ENRVIiIFTPCSRGIETG\PY\QCEl\Rr)RLW 
VGLRSLPSSPWNVPLMGPDLPRI\YP\YFTYLPFQGGKPSDLSWLSR 
ES * PHRAGVFFWGQFNGGSFSGIQGQKLFIPQITTKHSGLYACS VRN 
S ATGMESS KSMTVKVS APSGTGHLPGLNPIj 

3982 

A 

1 

2045 

^EAAVHAPGNSDVTRTRCATQVLKPLVSYGQAETGFSVPQLKYPPW 
DGLLHRDIPPGTLPVEHSPCSlxAALQFLNATSTHGECSAF'ARRKLES 
SFSQHSLQEGHRHGSRLSPSGAPVKDSGSRTAPGSRGGQQHPSQLLL 
CGPJDDLFQASSGHGPFLLLFPELPSPRDALQLxjREASKFPTDKRQGR 
SWSVAITiSTTAATSLEWVPFHAFAIPRSTIiVSQQRLSTERGRTAQLTA 
VIjREFLDPRL I STEENTQAAETMGTLSAPPCTQRI KWKGL.LI1TASLL 
NFWl^LPTTAQVTIEAEPTKVSEGKDVIiLLVHNLPQl^TGYIWYKGQM 
RDLYHYITSYWDGEIIIYGPAYSGRETAYSNASLLIQN\VTQEDAG 
\SYTL\HI I KRRDGTGGVTGHFTFTLHLETPKPS I S S SNLNPREAME 
AVI LTCDPATPAAG Y\ QWWMNGQR/ LS PMTHRLQLS KTNRTLF I FGV 
TKY I AGPYECE I RNP\ VS A\ SRSDPVTLNLLPKLPKPYITINNLNPR 
ENKDVLNFTCEPKSENYTYIWWLNGQSLPVSPR\VKRPIENRILILP 
SVTPJ^TGPYQCEIRDRYGGIRSDPVTIiNVLYGPDLPRIYPSFTYYR 
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SGEVLYLSCF\ADSNPPAQYSW\TINGKVFSL\QGQKLSIPQITTKH 
SGL YACS VRNS ATGKES SKSI TVKVS DW I LP 

3983 

C 

302 

403 

MGSMWGVWSAFLFTPFFVFTGCGVAAMMGXVA 

3984 

A 

2 

2728 

VPGEECDGITSMSAESGPGTRLRNIiPVMGDGLETSQMSTTQAQAQPQ 

PANAASTNPPPPETSNPNKPKRQTNQLQYLLRWLKTLWKHQFAWPF 

QQPVDAVKLNLPPSPPGQFHELCQNLTYHLLMPVGAVAPVDLPVEQP 

GDYYKIIKTPMDMGTIKKRLENNYYWNAQECIQDFNTMFTNCYIYNK 

PGDDIVLMAEALEKLFLQKINELPTEETEIMIVQAKGRGRGRKETGT 

AKPG\VSTVPNTTQASTPPQTQTPQPNPPPVQATPHPFP\AVHPGPL 

FVQTPVH* QWVPPQPLQTPPPV\ PPQPQPPPVSGS * PVQSHPPI IAA 

TPQPVKTKKGVKRKADTTTPTTIDPIHEPPSLPPEPKTTKLGQRRES 

SRPVKPPKKDVPDSQQHPAPEKSSKVSEQLKCCSGILKEMFAKKHAA 

YAWPFYKPVDVEALGLHDYCDIIKHPMDMSTIKVSGMCLREGTSLRK 

SKTSPIiALGAQCPRYLMTVDAPWELGLGGLASLFVVEKALLSPLCW 

RHASKLEAREYRDAQEFG^VRLMFSNCYKYNPPDHEWAMARKLQD 

VFEMRFAKMPDEPEEPWAVSSPAVPPPTKWAPPSSSDSSSDSSSD 

SDSSTDDSEEERAQRLAELQEQLKAVHEQLAALSQPQQNKPKKKEKD 

KKEKKKEKHKRKEEVEENKXSKAKEPPPKKTKKNNSSNSNVSKKEPA 

PMKSKPPPTYESEEEDKCKPMSYEEKRQLSLDINKLP\GEKLGRWH 

IIQSREPSLKNSNPDEIEIDFETLKPSTLRELERYVTSCIiRKKRKPQ 

AEKVDVIAGSSKMKGFSSSESESSSESSSSDSEDSETAFCTSGDFVS 

PG P S P YHSHVQCGRFREMLRW FL VD VEQTAAGQPHRQS AAG PAI TWA 

PAIAYPSPECARCCVGCS 

3985 

A 

190 

2406 

VPGEECDGITSMSAESGPGTRLRNLPVMGDGLETSQMSTTQA\QAQP 
QPANAASTNPPPPETSNPNKPKllQTNQLQYIiLRVVLKTLWKHQFAWP 
FQQPVDAVKLNLPDYYKI I KTPMDMGTIKKRLENNYYWNAQE\CIQ\ 
DFNT\ MFTNCYI YNKPGDD I VL\ MAEALEKX»FLQKINELPTEET\ BI 
MIVQAK\GRGRGRKETGTAKPGVSTVPNTTQASTPPQTQTPQPNPPP 
VQATPHPFPAVTPDLIVQTPVMTWPPQPLQTPPPVPPQPQPPPAPA 
PQPVQSHPPIIAATPQPVKTKKGVKRKADTTTPTTIDPIHEPPSLPP 
EPKTTKLGQRRESSRPVKPPKKDVPDSQQHPAPEKSSKVSEQLKCCS 
G I LKEMFAKKHAA YAW \ PF YKPVDVRPLGLHD YCD 1 1 KHPMDMS T I K 
S K\ LEAREY \ RDAQ\ EFGADVRLMFSNCYKYNPPDHEWAMARKLQD 
VFEMRFAK\MPDEPEEPWAVSSPAVPPPTKWAPPSSSDSSSDSSS 
DSDSSTDDSEEERAQRIiAELQEQLKAVHEQLAALSQPQQNKPKKKEK 
DKKEKKKEKHKRKEEVEENKKSKAKEPPPKKTKKNNSSNSNVSKKEP 
APMKSKPPPTYESEEEDKCKPMSYEEKRQLSLDINKLPGEKLGRWH 
I IQSREPSLKNSNPDE I E I DFETLKPS TLRELERYVTSCIjRKKRKPQ 
/ ASEKVDVIAGSSKMKGFSSSESESSSESSSSDSEDSETGPA i 

3986 

A 

212 

461 

TEEHLYSPHWKGCGRKVKRLFQVRRGRGWIQTRQPGSMVKEPPLSEA 
GAESHVSC/RQGDCSRPPSFPLSPERVTPWLPDSHTW 

3987 

A 

1 

2353 

MRPPRAP WPPDLGG PRLC PVPAAGGARS PSS P YS VETPYGFHLDLD 
FLKY I E ELERG PAARRAPGPPTSRRPRAPRPGLAGARS PGAWTS SES 
LASDDGGAPGILSQGAPSGLLMQPLSPRAPVRNPRVEHTLRETSRRL 
ELAQTHERAPSPGRGLVREQMAAALRRLRELEDQARTLPELQEQVRA 
LRAEKARLLAGRAQPEPDGEAETRPDKLAQLRRJjTERLATSERGGRA 
RAS PRADS PDGLAAG RS EGALQ VLDG E VG S LDGTPQTRE VAAEAVPE 
TREAGAQAVPETREAGVEAAPETVEADAWVTEALLGLPAAAERELEL 
LRASLEHQRGVSELLRGRLRELEEAREAAEEAAAGAPAOT.PPATTDT 
PWSCAEKAAQTESPAEAPSLTQESSPGSMDGDRAVAPAGILKSIMKK 
RDGTPGAQPSSGPKSLQFVGVLNGEYESSSSEDT\NDSDGDSENGGA 
EPPGSSSGSGDDSGGGSDSGTPGPPSGGDIRDPEPEAEAEPQQVAQG 
RCELSPRLREACVALQRQLSRPRGVASDGGAVRLVAQEWFRVSSQRR 
SQAE P VARMLEG VRR LG PELIJVHVVNLADGNGNTALH YS VSHGNXA I 
ASLLLDTGACEVNRQNRAGYSALMIAALTSVRQEEEDMAVVQRLFCM 
GDVNAKAS QTGQTALMLA I SHGRQDMV\ ATLLACGADVNAQDADGAT 


\01tS 
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ALMCASEYGRLDT\VRLLLTQPG\CDP\SILDNEGPSALA\IALE\A 
ETGMKVAALL\HAHLSSGHPDTQSGVTPLAPQTANTW 

3988 

A 

71 

401 

RRCKVRPSARGVLRYAVCVCVCVCVY^ 

AQWRFTVGVRDVHAAFASS PKKGGALSRLWLCTVPPSSGGTGLS * * S 
SLWPGSDGISATTPCC 

3989 

A 

205 

2190 

QIQVSHITETPLPEEADELDGEIDLRSCTDVTEYAVQRNYGFQIHTK 
DAVYTLSAMTSGIRRNWIEALRKTVRPTSAPDVTKLSDSNKENALHS 
YSTQKGPLKAGEQRAGSEVISRGGPRKADGQRQALDYVELSPLTQAS 
PQRARTPARTPDRLAKHEELERDLAQRSEERRKWFEATDSRTPEVPA 
GEGPRRGLGAPLTEDQHNRLSEEIEKKWQELEKLPLRENKRVPLTAL 
LNQSRGERRGPPSDGHEALEKEVQALRAQLEAWRLQGEAPQSALRSQ 
EDGH I PPG Y I SQETCERSLAEMES SHQQVMEELQRHHERELQRLQQE 
KEWLLAEETAATASAIEAMKKAYORFT.cjppt.cjvtpct nnrDnrr r>vr\ 

HQSDVEALKRELQVLSEQYSQKCLEIGALMRQAEEREHTLRRCQQEG 
QELLRHNQELHGPPVRGDRPAARLHCLAGHGQWLRAQQRAEFLRARG 
AASRKRKRTPVPKEGGAVPPGRAPDDAEGOAliHL.GKVPRP\ T.nfiZXPP 

\HQDHGSEREIPJVAEGAPASOTGRPSRRRSRCANSLAEVEVPPSCRP 
S KAGGPAALLPS LDG S KKPS FLPTLCWATRAIJVPTTHTHTHTHTHTD 

TQTHTHTRAHMYTRIHTHTHTHTHCISDARPSHWGLTLHLLQDFICE 
EIFI 

3990 

A 

53 

364 

RSKVEAGIiAPEPGPLGARVGS/ PGTRARHGRPG\ PA\GAS IGPGCGP 
NEQVDEQAGPAQLQGPQGPGAPRAIGFPHLFJLGGSCKIKGDGPGVRR 
PGSDRLSSSSPT 

3991 

C 

408 

620 

MKAWKKIKDHPTTSTLTXSNPJDDVTGGRXDPNHSEXSTTLLGRL.YT J c ; 
XXTHEGKQXLXXSDSAXXGPFXXL 

3992 

C 

64 

300 

MDSSHSTTLXPTANPNTGLVEDLDRTGPLSMTTQQSNSQSFSTSHEG 
LEEDKDHPTTS TLTS SNRNDGHRWKKRPKS F * 

3993 

A 

1 

530 

DT YC FNAS APPEEDCTS VTG PL SMTTOO <?Tfl ^D^FQtq WTrnT .prnvnu 
PTTPTLTSSNRNDVTGGRRDPNHSEGSTTLLEGYTSHYPHTKESRTF 
IPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTFHPSGGSHTTHG 
SESDGHSHGSQEGGANTTSGPIRTPQIPEWLIILA 

3 994 

A 

1 

436 

PFAPDTMDKFVWHAAWGLCLV^ 

YSISRTEAADLCKAFNSTLPT^QMEKALSIGFETCSLHCSQQSKKV 

WAEEKASDQQWQWSCGGQKAKWTQRRGQQVSGNGAFGEQGWRNSRP 
VYDS 

3995 

A 

3 

1009 

EKVRPTSEAGSLPSGPOPLSGFGRHQ'RQVT ppdt DuanrccccpDUD 

TPFAPDTMDKIWWYAAWGLCLVPLSLAQIGECPPQPGQQDGCGVLSA 

DPAAAPPAESALGDWSQPQSREKSLDTRATSGTLELGLQYLIQTGSV 

LCGGYGGEGLRSLVCFYQQHENELMHLVTCQLLDQGKGCQAGTFKPA 

EAVCCLLFRYAIiPPEVESREAVGLVELWS I SDCCAS SBQGS VGMGLT 

KPGTGRDLLVCWI^PJLGKAQYLGESVPLLQIHLLTGNWAEKCLIiTDL 

ENFNASKAGEHSKTMLLFIYFSFHGKRKIKGNCFESNNSKRSNISGK 
QANLKE 

3996 

A 

1 

2226 

MDKFWWHAAWGLCLVPLSLAQIDLWITCRFAGVFHVEKNGRYS ISRT 
EAADLCKAFNSTLPTMAQMEKALS IGFETCRYGFIEGHWI PRIHPN 
S I CAANNTGVYILTYNTSQYDTYCFNAS APPEEDCTS VTDLPNAFDG 
P ITI TI VNRDGTRYVQKGE YRTNPED I YPSNPTDDD VS S GS S SERS S 
TSGG YI FYTF STVHP I PDEDSP WI TDSTDRI PATTLMS TS ATATETA 
TKRQEAWDWFSWLFLPSESKNHLHTTTQMAGTSSNTISAGWEPNEEN 
EDERDRHLSFSGSGIDDDEDFISSTISTTPRAFDHTKQNQDWTQWNP 
SHSNPEVLLQTTTRMTDVDRNGTTAYEGNWNPEAHPPLIHHEHHEEE 
ETPHSTSTIQATPSStTEETATQKEQWFGNRWHEGYRQTPREDSHST 
TGTAAASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRR 
MDMDSSHSTTLQPTANPNTGLVENLDRTGPLSMTTQQSNSQSFSTSH 
EGLEEDKDHPTTSTLTSSNRNDVTGGRRDPNHSEGSTTLLEGYTSHY 
PHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTFHP 
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SGGSHTTHGSESDGHSHGSQEGGANTTSGPIRTPQIPEWIillLASLL 
AIiAL I LAVC I AVNS RRRCGQKKKL V I NS GNGAVEDRKP S GLNGEAS K 
SQEhTVHLVNKESSETPDQFMTADETRNLQNVDMKIGV 

3997 

A 

2 

1599 

QPLPGSVRHPRPVLRRPLPRAQGSSSSFRPRPPFAPDTMDKFWWHAA 
WGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSISRTEAADLCKAF 
NSTLPTMAQMEKALS IGFETCRYGFIEGHWIPRIHPNS ICAANNTG 
VY I LTSNTS Q YDT YCFNAS AP PE EDCTS VTDL PNAFDG P I T I T I VNR 
DGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSER\SSTSGGYIFY 
TFSTVHPIPDEDSPWITDSTDRIPATNMDSSHSTTLQPTANPNTGLV 
EDLDRTGPLSMTTQQSNSQSFSTSHEGLEEDKDHPTTSTLTSSNRND 
VTGGRRDPNHSEGSTTLLEGYTSHYPHTKESRTFIPVTSAKTGSFGV 
TAVTVGDSNSNVNRSLSGDQDTFHPSGGSHTTHGSESDGHSHGSQEG 
GANTTSGPIRTPQIPEWLIILASLLALALILAVCIAVNSRRRCGQKK 
KLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTA 
DETRNLQNVDMKIGV 

3998 

A 

245 

1148 

FPGSTPTPFAPDTMDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVF 
HVEKNGRYS I SRTEAADLCKAFNSTLPTMA\ QMEKALS IGFETCRYG 
FIEGHVVIPRIHPNSICAANNTGVYILTYNTSQYDTYCFNASAPPEE 
DCTSVTDIiPNAFDGPITITIVNRDGTRYVQKGEYRTNPEDIYPS\NP 
T\DDDVSSGSSSE\REST\SGGYIFYTFST\VHPIPDEDSPWITDST 
DRIPATRDQDTFPPQWGFPLPLHGSEFRWDTHMGSQGRWEQTQPLGP 
IRDTPNSQNGLIHLGHPLIiALGL 

3999 

A 

3 

327 

AGPGEVAGDGRDSGGPAGGRRCWLNWSGSGKS LRKGNPS VS I YCFCL 
PNHYLFDRN/SCLRSSAGTKSGSYPPQISSLMLILVPPCVPQQTHPG 
LDLKTQKLSLLSALI 

4000 

A 

6 

280 

SAQMAVTTADPRVRPRVRTTVFTS I AHTDTMVHLTP VE \ KSAVTALW 
AK\VNVNEVGGEAPGQGCWWLPLGPPKGPLKPFGGICPNSLNAG 

4001 

A 

405 

647 

RHVLFLLKNGAl^VKHHRWFRSVDWKAVPQRKLSRPEAKCLIiHGLRT 
FPVFSVPQIWHCSNLLCMRSQGPAVAGVSALKK 

4002 

A 

405 

608 

RHVLFLLKNGANDVKHHRWFRSVDWKAVPQRKLKPPIVPKIAGDGDT 
SN FETYPENDWDTAAPVPQK 

4003 

A 

1 

849 

MAEGTGTFGRVHLVKEKTAKHFFALKVMS I PDVIRLKQEQHVHNEKS 
VLKEVSHPFLIRLFWTWHDERFLYMLMEYVPGGELFSYIjRNRGRFSS 
TTGLFYS AEI I CA I E YLH S KE I VYRDLKPENT LLDRDGH I KLTDFGF 
AKKX»VDRTWTLCGTPE YLAPEVIQSKGHGRAVDWWALG ILI FEKLSG 
LNQSIDCDLCQPIRIQQASTSQNSVSVNQLEPNKCQTVRAHRGSTHQ 
NSTSVDSSEPKCQTHQNSTSAHQSEPNKCQPIRTQQVSAYQNSMSIS 
P 

4004 

A 

2 

1173 

PQQSPECDVPLPVSVCSHWSIPTSHRLLRVLVTVECELYVRWPRCNA 
DLTCTKAVQNGWDLLKVPTCLHLLLAVPGVGLF I DEGVKQGFS VETF 
RDTRGVLVHGCIMSCLCNl^VRPYXiGKKQKSKKTTDVGVEVEESGRG 
CAAGAEEVFQCMQQGSHSAAEGQESSSKVESGRHSYNIjYIjWVISPRT 
PASLQNLQILKCIRFWTWHBERFIiYMIiMEYVPGGELFSYTjRNRGHFS 
STTGLFYSAE I ICAIEYLiHSKEI VYRDLKPENILLDRDGHIKLTDFG 
FAKKL\ AYRLMWKSLLSEKRWVWRCX3GEAKER* Q * QQFG WVPASRKV 
RS * SQDTCPHHRKQGRLRTWTLCGTPEYLAPEVIQSKGHGRAVDWWA 
LGILIFEMLSGWVL 

4005 

A 

149 

1358 

APPSHCLCVSVSRAEPRTQRRGRAWGASVDAAAAAAPPSEAASRGVR 
LPER SGLARRPG PECL PMEAPGPAQAAAAE SNSREVTED AADWAPAL 
CPSPEARSPEAPAYRLQDCTALVTMGTGTFGRVHLVKEkTAKHFFAL 
KVMS I PDVIRRKQEQHVHNEKS VLKEVSHPFLIRLFWTWHEERFLYM 
LMEYVPGGELFSYLRNRGHFS STTGLFYSAE I ICAI E YLHSKE I VYR 
DLKPENI LLDRDGHI KLTDFGFAKKLVDRTWTLCGTPE YIaAPEVI QS 
KGHGRAVDWWALG I L I FEMLSGFPPFFDDNPFG I YQKI LiAGKLYFPR 
HLDFHVK/DGANDVNHHRWFRSVDWKAVPQRKLKPPIVPTIAGNGDT 
SNFET YPEHDWDTAAPVPQKDLE I FKNF 
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4006 

A 

1 

683 

MAAEVLTVMAEGTGTFGRVHLVKEKTAKHFFALKVMSIPDVIRRKQB 
QHVHNEKSVLKBVSHPFLIRLWPRCNADLTCTKAVQNGWDLLKVPTC 
LHLLLAVPGVGL F IDEGVKOG FS VELTAYP'L.Pr Pfif3AT . c; vrrDHD irw 

TWHEBRFLYMLMEYVPGGELFSYLRNRGHFSSTTGLFYSAEIICAIE 
YLHS KE I VYRDLKPEN I LLDRDGH I KLTDFGFAKKLVDR 

4007 

A 

521 

1163 

S QDTCPHHRKQGRLRTWTLCGTPE YLAPEVI Q S KGHGRAVDWWALG I 
LIFEML/SGAFPKHVSSNASPGLQNWGRFPPFFDDNPFGIYQKILAG 
KLYFPRHLDFHVKCQTPKEKQKTKNKVIFLHARHQKQNKTNNNNNSN 

NKMKFIKNGANDVKHHRWFRSVDWKAVPQRKLKPPIVPTIAGNGDTS 
NFETYPEHDWDTAAPVPQKDLE I FKNF 

4008 

• 

A 

181 

1448 

APPSHCPRRSVSRAGPRTRRRERAWGASVDAAAAPPSEAASREARLP 
ERSGLARCPGPECVPMEAPGLAQAAAAESDSRKVAEETPDGAPALCP 
SPEALSPEPPVYSLODFDTIJ\TVGTGTFnRVKT.VTa?TrrairwT?i?nT vrr 

MSIPDVIRLKQEQHVHNEKSVLKEVSHPFLIRLFWTWHDERFLYMLM 
E YVPGGELFS YLRNRGRFS STTGLF YS AE HCAI KYT ,H«?KP T VYPDT . 

KPENILLDRDGHIKLTDFGFAKJCLVDRTWTLCGTPEYIxAPEVIQSKG 
HGRAVDWWALG\ILIFEMLSGFPP\FFDDNP\FGIYQKILAGK\IDF 
PRHLD F\ HVKD\ L I KKXLVVDRTRRLGNMKNGA1TOVKHHRWFRSVDW 

EAVPQRKLKPPIVPKIAGDGDTSNFETY\PENDWDTAAPVPQKDLKI 
FKNF 

4009 

C 

3 

167 

MPMXAQIEEPQPIQVKTRTNLRLTQLREQDQTQKCLCPLYNQQAPRK 
KFLLLWA* 

4010 

A 

1 

2195 

MDPGTLQSASQGPTAINPCDYVLKRRNIQTSWQRLTPIIKWHVYILM 
I GPGEKEAGRNLG I FGKWTPFPKI PAKRLRESNCPVDAQE I WLPQAF 
REYLGRRGNFGPGRRTCEFWEVESICSVESWELWLRQADSGDSGKCS 
PDACGI IDTSGLRAGHCYLTRLWHVSGRI PPS FKLHHPGVCKFPKVG 
GKMTTFKEAVTFKDVAWFTBEELGLLDPAQRKLYRDVMLENFRNLL 
SVGHQPFHQDTCHFLREEKFWMMGTATQREGNSGGKIQTELESVPEA 
GAHEEWSCQQI WEQIAKDLTRSQDS I INNSQFFENGDVPSQVEAGLP 
TIHTGQKP S QGGKCKQS FSDVP I FDLPQQLY SEEKS YTCDECGKS I C 
YISAIiHI \HORVHMGEKPYKTn\7rnVT?PcriCCUT An^innTrurrinV rvn 

FKCG \ Q \ CGKGFSRRS ALNVHHKLHTGEKP Y I CEACGKAFIHDSQLK 
EHKRIHTGEKPFKCDICGKTFYFRSRLKSHSMVHTGEKPFRCDTCDK 
SFHQRS71LNRHCMVHTGEKP YR\ CEQCGKGF IGRLDFYKHQWHTGE 
KPYNCKECGKSFRWSSCIxLNHQRVHSGEKSFKCEECGKGFYTNSQLS 
SHQRSHSGEKPYKCEECGKGYVTKFNL\DLHQRVHTGERPYNCKECG 
\ KNFSRAS \ S I LNHKRLH/ S PEKNPFKCEDCGKRLVHRTYRKDQPRD 
YSGENPSKCEDCGRRYKRRLNLDILLSLFLNDT 

4011 

A 

1 

4559 

MDGAKAFFSAVAAGFVIILTCQLQWQWQHGGVHAHQLWQLASGYRDA 
CFCVSTYSSGPSEFPGPTLGTLNEI^IDLEERKHRRKAKVWKGEGGG 
LRS PRQ S LLAFRAVLAS RGTRS CHVS P YLG VS GALS S PNPHPSTTGA 
GPIPPVQRWGLPQINQKNREIAIPLTLGMSTHLRRGCKNMSRFSPSL 
HCCTTPISTNFTDPGSHCKGAHGDICLLIQNLKGALPMSINRSYFNA 
HPLLQISEAVTFKDVAVVFTEEELGLLDPAQRKLYRDVMLENFRI^L 
SVGNQPIPTQDTF\HFLGKEKFWKMKTTS\QREGNSGGKIQIEMETV 
PEAGPHEEWSCQQIWEQIASDLTRSQNSIRNSSQFFKEGDVPCQIEA 
RLSISHVQQKPYRCNECKQSFSDVSVFDLHQQSHSGEKSHTCGECGK 
SFCYSPALHIHQRVHMGEKCYKCDVCGKEFNQSSHLQTHQRVHTGEK 
PFKCGQCGKGFHSRSALNVHCKLHTGEKPYNCEECGKAFIHDSQLQE 
HQRIHTGEKPFKCDICGKSFRVRSRLNRHSMVHTGEKAFRCDTCGKN 
FRQRSALNSHSMVHIEEKPYKCEQCGKGFICRRDFCKHQMVHTGEKP 
raCKECGKTFRWSSCLLmiQQVHSGQKSFKCEECGKGFYTNSRRSSH 
QRSHNGEKPYNCEECGKDYKRRLDLEFHQRVHTGERPYNCKECGKSF 
GWASCLLKHQRLHSGEKPFKCEECGKRFTQSSQLHSHQTCHTGEKLY 
KCEQCEKGYNSKFNLDMHQRVHGGERPYNCKECGKSFGWASCLLKHQ 
RLHSGEKPLKSGVWEEI YSEFTAS FTSESSGQKLVS I PLGQSFQKKB 
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KAAI FAVLQLSLVTPPGTGKTEATRVS SRPQQTVAALQLIiTVSGLTV 
KNRQKTTRTSTKRPHKNPIQGSATSRLKSKWTPFPKIPAKRLRESNC 
PVDAQE I WLPQAFRELWHVSGRI PPSFKLHHPGVCKFPKVGGKMTTF 
KEAVTFKDVAWFTEEELGLLDPAQRKLYRDVMLENFRNLLSVGHQP 
FHQDTCHFLREEKFWMMGTATQREGNSGGKIQTELESVPEAGAHEEW 
SCQQIWEQIAKDLTRSQDSIINNSQFFENGDVPSQVEAGLPTIHTGQ 
KPSQGGKCKQS I SDVPI FDLPQQLY SEEKS YTCDECGKS ICYISALH 
vxiyKVrl vt?Jc#JS_L»r n^jj vtAaJvfcjr bUooHJLiQTHQRVHTGEKPFKCEQCGK 

PFKCDICGKTFYFRSRLKSHSMVHTGEKPFRCDTCDKSFHQRSALNR 
HCMVHTGEKPYRCEQCGKGFIGRLDFYKHQWHTGEKPYNCKECGKS 
FRWSSCLLNHQRVHSGEKSFKCEECGKGFYTNSQLSSHQRSHSGEKP 
YKCEECGKGYVTKFNLDLHQRVHTGERPYNCKECGKNFSRASSILNH 
KRLHCOKKPFKCEDCGKRLVHRTYRKDOPRDY<5C5FKTPc;KPFDrf;RPY 
KRRLNLDILLSLFLND 

4012 

A 

125 

226 

INPSTSHLPHSGAPEPWTMIHPFELFLVX.KCW 

4013 

A 

1546 

2204 

RICVNILYRALYQINSHMWLLITSSVIPSIJ\TAQWEWSLIWMLPPSM 
FRPRTVAMVCGLLVSLDWLGDLGQYLQSCCFCFICKNYDPHTPAAAQ 
GTVIFYTCVDVFVHSSWFITPITLCYNVGTLQASENGISVCDLLLPH 
FTCS TPDFNLTVAHKTL I PERLL\ PHTRK\ EERCNREAQEAFWTD\ R 
PCLGLNPNAFLSQFHLNTAAMLQPMPYPIDGLP 

4014 

A 

1 

1387 

FRAASGRETRAALGASQCLTPHPLPRVGGGGASGGGLRGAGQAWSAM 
SSPDAGYASDDQSQTQSALPAVMAGLGPCPWAESLSPIGDMKVKGEA 
FAJNIooAirAUAAOKAJviaJboKl KK PMN AFMVWAKX>ERKRIjAQQN PDLHN 
AELSKMLGKSWNA\ LTLAEKRPLVEEAERLRVQHMQDHPNYKYRPRR 

PAGPPLLPPHMGGHYRDCQSLGAPPLDGYPLPTPDTSPLDGVDPDPA 
PFAAPMPRnPPAAr»TVQVZiOVQriVariI5"DT?'DD7if2T3TvruDOT r , Di?'D?vr»t)0 

I PGLLAPPS ALHVYYGAMGS PGAGGGRGFQMQPQHQHQHQHQHHPPG 
PGQPSPPPEALPCPJDGTDPS\QPAELLGEVDRTEF\EQYIjHFACKPE 
MG\LPYQGHDSGVNIiPDSHGAISSEVSDASSAVYYCNYPDV 

4015 

A 

277 

459 

DADSMCELPSDLQQSCQHAVSDSDTQAWCPRRPGSLCGTGRRAPGS* 
GGSRVIDRGRPLAI 

4016 

A 

175 

370 

YYIVCKTFLKYVLLIA5FLGRSSVDLEESSTK5LEPKAARIKim,KE 
RKVLEKK* P * AKKKKKVD 

4017 

A 

1 

2143 

MDGTPJ^YCVISVVLGDNDAPPGTSASQEGDGSVNIDGTEGESAVTSTG 

ITEDGEGPASCTGSEDSSEGFAISSESEENGESAMDSTVAKEGTNVP 

LVAAGPCDDEGIVTSTGAKEEDEEGEDWTSTGRGNEIGHASTCTGL 

GEESEGVLICESAJSGDSQIGTVVEHVEAJ^GAAIMN^ 

TEKGSKDTD I CS S AKGI VES S VTS AVSGKDEASDDQTTPS YRRKFKP 

MAKKLKALKKKLDEWITRITNAEKSLIODLMELCT 

SQCDQLEERVSLMEEEMNEMKQEEEFREKRIKIU^QSLQEIWDYVKR 
Pl^RLIGVPESDGENGTKLENl^ 

HFyKl-lo bKKA 1 FKJi x J. VRFTKVEMKEI^LRAAREKGR V TIjKGKP IR1» 
TAEGVGDNSIDSWKNAGRVFKDSDKFDANDPILKDQTQEWSGSATFT 
SDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKINGVEDHKTI 
FDGDGKTYQlSTVQQFIDEGI^TSGDlfflTLRDPHYVEDKGHKYLVFEAN 

tgtengyqgeesl™kayygggtnffrkesqklqqsakk3^dael^ 

ALVL I LL I VTMRRRKKE PL I FDEERD I REN I VRYDDEGGGEEDTEAF 
HMAALRNLNV I RJDTKTRRDVTPE I QFLS RPAFKS I PDNVIFREFICF 
RIiKAAAVDP 

4018 

A 

4423 

9194 

SCLLEVLLTDVIVENENITNEGGLTO^ 
VENGMKDGIAVDHVVCLNTEKYA£ 

NENSEVDTSAGSGSAPSVXiHQRNGQTEDVATGPRRAEKTSVATSTEG 
KDKDVTLSPVlCAGPATTTSSETRQSEVAiPCTSIEADEGLIIGTHSR 
ITOPLHVGAEASECTVFAAAEEGGAVVTEGFAESETFLTSTKEGESGE 
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CAVAESEDRAADLLAVHAVKIEANVNSWTEEKDDAVTSAGSEEKCD 
GSLSRDSEIVEGTITFISEVESDGAVTSAGTEIRAGSISSEEVDGSQ 
GNI^RMGPKKETEGTVTCTGAEGRSDNFVICSVTGAGPREERMVTGA 
GWLGDNDAPPGTSASQEGDGSVNDGTEGESAVTSTGITEDGEGPAS 
CTGSEDS SEGFAISSES EENGE S AMDS TVAKEGTNVPLVAAGPCDDB 
GIVTSTGAKEEDEEGEDWTSTGRGNEIGHASTCTGLGEESEGVLIC 
ESAEGDSQIGTVVEHVEAEAGAAIMNANENNVDSMSGTEKGSKDTDI 
CSSAKGIVESSVTSAVSGKDEVTPVPGGCEGPMTSAASDQSDSQLEK 
VEDTTISTGLVGGSYDVLVSGEVPECEVAHTSPSEKEDEDIITSVEN 
EECDGLMATTASGD I TNQNS LAGGKNQGKVL 1 1 STSTTNDYTPQVS A 
ITDVEGGLSDALRTEENMEGTRVTTEEFEAPMPSAVSGDDSQLTASR 
SEEKDECAMISTSIGEEFELPISSATTIKCAESLQPVAAAVEERATG 
PVLISTADFEGPMPSAPPEAESPLASTSKEEKDECALISTSIAEECE 
ASVSGWVESENERAGTVMEEKDGSGIISTSSVEDCEGPVSSAVPQE 
EGDPSVTPAEEMGDTAMISTSTSEGCEAVMIGAVLQDEDRLTITRVE 
DLSDAA 1 1 STSTAECMP I S AS I DRHEENQLTADNPEGNGDLS ATEVS 
KHKVPMPSLIAENNCRCPGPVRGGKEPGPVLAVSTEEGHNGPSVHKP 
SAGQGHPSAVCAEKEEKHGKECPEIGPFAGRGQKESTLHIjINAEEKN 
VLLNSLQKEDKS PETGTAGGS STAS YSAGRGLEGNANS \ LAKLRGPE 
QTSGQTAKDSS VSS I RYLAAVNTGAI KADDMP PVQGTVAEHS FLPAE 
QQGSEDNLKTSTTKCITGQESKIAPSHTMIPPATYSVALLAPKCEQD 
LTIK^YSGKWTDQASAEKTGDDNSTRKSFPEEGDIMVTVSSEENAY 
VPSEEEKNGEILAPPESLCGGKPSGIEISSGRKDNAEAISGHSVEAD 
PKEEENSRDLEELPKTSSETNSTTSRVMEEKDEYSSSETTGEKPEQN 
DDDT I KS QEDDGEEKI VTS VRRRGRKP KR S LTVS DD AE S S EPERKRQ 
KSVSDPVEDKKEQESDEEEEEEEEDEPSGATTRSTTR\ SEAQRKQH\ 
SKPFAR\ATSKLGSPDTVSPRNRQKLAKEKLPTSEKVSNSPPLGRSK 
TQLSPSIKRKREVSPPGA\RTOGQQRVEEAPGARDKRPGEGGGSPCE 
KSEAI ILTTAALGLWRNTVERKETCLGGHRLLFNQEKDMHVL 

4019 

A 

77 

407 

ESDGGKRKKKSRGDPSQLEEKISKGCPQRRKKPRLYRPSNYNRAFRR 
MRR/ ARLPTRTSPETRTPQPH\ SPHS \ PERTPTPPPGRPPFLGAGQL 
QRLFMGIPQVGDCPPPSS 

4020 

A 

20 

464 

WWNSLEVTVYAFSIENFKRSKSEVIXSLMDIiARQKFSRLMEEKEKLQK 
HGVCIRVLGDLHLLPLDLQEL I AQAVQATKNYNND I SESLLDKCLYT 
NRSPHPDILIRTSGEVRLSDFLLWQTSHSCLVFQPVLWPEYTFWNLF 
EAILQFQ 

4021 

A 

572 

- 

1594 

SSDWKRTMSWI KEGELS LWERFCANI I KAGPMPKHIAFIMDGNRRYA 
KKCQVERQEGH FTGFKQ \ LAETLRWC LNLG I LEVTVYAF S I ENFKRS 
KSEVDGLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQE 
L I AQAVQATKNYNKC FLNVCFA YTSRHE I SNAVREMAWG VEQGLLDP 
SDISESLLDKCLYTNRSPHPDILIRTSGEVRLSDFLLWQTSHSCLVF 
QPVLWPEYTFWNLFEAILQFQMNHSVLQKARDMYAEERKRQQIiERDQ 
ATVTEQLLREGLQASGDAQIjRRTRIjHKLSARREERVQGFLQALELKR 
ADWLARLGTASA 

4022 

A 

2 

366 

EDLLDLDEELRYSLATSRAKMGI^RAQQESAQAENHIjNGKNSSLiTLTG 

etssajclprcrqggwagdsvkasngtqtgkqqldlnacy^kthhrdl 

GLASLEEADI PI I PDLEEVQEEDFVLQ 

4023 

A 

1 

466 

RRWRICSTWTRSFATAWLPPAPSANFPRTDRSQGDRGAPAGFALAPI 
LEFFLWDTSSQKFLQKPHCFMHGRERAKMGRRAQQESAQAENHLNGK 
NSSLTLTGETSSAKLPRCR/QGGWAGDSVKASKFRRKASEEIEDFRL 
RPQSLNGSDYGGDIP s 

4024 

A 

254 

662 

NPQKVENTLSPPASRTVGNFRAFLRFLEPRVRTSPEQIDHRRAKMGR 
RAQQESAQAENHLNGKNSSLTLTGETSSAKLPRCRQGGWAGDSVKAS 
KFRRKASEEIEEYVSSILILMVSYVDIiGQQGRLGGHDLFHLC 

4025 

A 

3 

392 

MKYS AIQTLDGE I DL KLLTKVLAPEHEVR * VQWWQQFPGLSTLAI P I 
NPAELPGLALCLP*RPWNGREKVSL»QLREK7UliSLSYPAKPFL\EDDV 
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G WDWDHLFTEVS SBVLTE \ WDPLQTBKEDPAGQARHT 

4026 

A 

63 

779 

GPIRGPWGPGFGGWGGATPPAPTSLRRLRAAGLAPRRRAKMGRRAQQ 
ESAQAENHLNGKNS SLTLTGBTS SAKLPRCRQGGWAGDSVKASKFRR 
KASEEIE\DFR\ LRPQSLNGS \DYGGDI PI \ IPDLEEVQEEDFVLQV 
\ AAPPS I \QIKRVMTYRDL\DNDLM\ KY\ SAIQTLDGED/ IDLKLL/ 
TPKVLAPEHEVRERNPSWQDDVGWD\WTHLFTEVSSE/VSFTEWDPIi 
QTEKEDPGGQ\ARHT 

4027 

A 

2 

311 

AAAKGAAAMSAHLQWMVVRNCSSFLIKRNKQTYSTEPNNLKARNSFR 
YNKNARATLSSIRHMIRKNKYRPDLRi47^AIRRASAILRSQKPVMVKR 
KRTRPTKSS 

4028 

A 

1 

136 

VAAAKGAAAMS AHLQWMWRNCS S FLI KRNKQPHQELLS PLPPEQ 

4029 

A 

2 

170 

HEGQRKPATSYVRTTINKNARATLSSIRHMIRKNKYRPDLRMDMLAS 
TGSGLCCSV 

4030 

C 

64 

276 

MCKALGTPYSPQQQRLGTEASSVHRTCWHLLGQGSAARXXXXXXXXX 
XXXXXXXXXXXXXVSIDPTTQG* * 

4031 

A 

3 

295 

QVAAAKG AAAMS AHXiQ WMWRNC S SFLI KRNKQTYS TE PNNLKARNS 

FRYNGLIHRKTVGVEPAADGKGWWIKRRSGEFCIiVWARERPLSRV 
WEL 

4032 

A 

3 

279 

CTPLTPGRPTRPQVAAAKGAAAMS AHLQWMVERNCS SFLI KRNKQT Y 
STEPNNLKARNSFRYNGLIHRKTVGVEPAADGKGVEVVIKRRSE 

4033 

A 

3 

3 92 

RIRHEAAAKGAAAMSAHLQWMWRNCSSFLIKRNKQTYSTEPNNLKA 
RNSFRYNGLISPARLWGVEPA\ADGKGW\WI\KRRSGQR\KPA\T 
S YLRTTINKNARATLS S I RHMI RK\NKYRPDLAHGHAG I 

4034 

A 

2 

564 

VAAAKGSRRQCLRHLQWM\WRNCSSFLIKRNKQDPTAI,SPNNLKA\ 

RNSFR\YNG\LIHRKDCGTWSRSADGK\GVW\VIK\RRSG\QRKLA 

TSYVAGPPSTRNARSHASGRHQ/VHIDPARNKLPPPTCRMGSPFRRA 

QRPFLAQAGRLVDWLKEGRRDPAPPRSSLKPLGPQKQLKSAGVFSKK 
KKKSR 

4035 

A 

1 

139 

DRRAGDPWPRTHEFGSPEASLQASAWRDGKKVSRSELKPLPLRTLS 

4036 

A 

1 

299 

VG I W * P VTNFG VI S GTKA I EMD * G / TP Y I HALTNGLLTVEAPHK / ER 
IALKPGYGKYLSINSDELW/ GRSDAIGPREQWEPVFQNDGHPVMRM 
NSLQLKATI 

4037 

A 

796 

911 

AMA*YSYVKSSKLVLKGT\SKKKKSTDKKRKREDEETQLD/IVGIW* 
TVT\NFGEISGT\lAIWNGRKGTYIHALDNGLFTIiGAPHK/ ERIALK 
SGYGKYLGINSDGLWGRSDAIGPREQCEPVFQNGKMAIiLASNSCFI 
RQ^EAGDIEAKSKTAGEEEMIKIRSCAERETKKXDDIPEEDKGNVKQ 
CE INYVKKFQS FQDHKLKI SKEDS KILKKARKDGFLHETL\ LDRRA\ 
KLE AD\ RYCK 

4038 

A 

193 

754 

EPSRGVWPHEDARINGSKKKKSKEKKRKREEDEETQFD/IVGIW*TV 
TNFDEISGTIAIEMDEGTYIHALDNGLFTLG7VPHKEG\KMALLASNG 
CFIRCNEAGDIEAKSKTAGEEEMIKIRSCAERETKKKDDIPEEDIGN 
VKQCE I NYVKKFQS FQEHKLKI SKEDS KI LKKAQKDGFLHETLLDRG 
P 

4039 

A 

213 

92 0 

YVQSLKQI LS / GC I * ES I AI KKKKNKDKKRKREEDEETQLD I VGI WW 
TVTNFGEISGTIAIEMDKGTYIHAIiDNGLFTLGAPHKEVDEGPSPPE 
QFTA\VKLSDSRIALKSGYGKYLGINSDGLWGRSDAIGPREQWEPV 
FK\MGKMAIiSTSNSCFIRl\HEAEDIEAKSKTAGEEEMIQGSP*SSV 
QP FHLLWTFAF I LTQEHRSLLLKI RS CAERETKKKDD I PEEDKGNI K 
QCEI 

4040 

A 

802 

1463 

AEATALENPLEESRPHEDARINGSKKKKSKDKKRKREEDEETQFD / 1 
VGIWWTVINFGGISGTIAIEMDEGTYIHALDNGLFTLGAPHK/ERIA 
LKPGYGKYLS INSDELW/ GRSKAIGPREQWEPVFQNGACAAGFTVI 
GSEKQSECSLLRESRAKYHGCTHGQISSSLKQHPRWMYSHQEDLKVW 
SLVEKKVTFEHI YLLKRNNSHHS I AEAMYGRLSNYI 

4041 

A 

879 

1051 j 

GGARYHFFFFFFFFDGACSVTQAGVQWPILGSLQPPPPRFKQFSCLS 
LPSSW\DYRRD 
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4042 

A 

125 

1366 

RDSSSSTLSHKSAKCACGFFPSTHKSGTFRFTSVRCLKYKSAKSFLH 
IGYREAAWCCQRLVPCTRRTQESSLDFATNLQHSAWSRLCRRGASR 
TSSAARSRSRSPAVEGCNRSPGAPQPAPRARRRPSRGAPGRAMVKVA 
\ FNSALAQKEAKKDEPKSGEEAliI I PPDAVAV\ DCKDPDDWPVGQR 
RAWCW(^CFGLAF\MLAGVIL\GGAYLYKYFALQPDDVYYCGIKYIK 
\DDVILN/ESPSADAPAA\LYQTIEENIKIFEERRSLNFISVPVPEF 
\ADSDPAKIVQDFNQETYRPYLDFNL\DKCYVIP\3UNTSMCYATPKT 
LLELLIN\lKAGNLFALSPYLDSMRHMGYIiLDRl\ENIDHPGF\FlY 
RLCHDKETYKL\QRRETIKGIQKREASNCFAIRHF\ENKFAVET\LI 
CS 

4043 

A 

42 

557 

QAGKLRPGGLDSRSRS SS AAGLDRDQGLHRVAFLPKRRTGGTRMDRS 
ATAATAPPAAPAGEGGP PAPP PNLTSNRRLQQTQAQVDD WD IMRVN 
VDKVLERDQKQSELDDRC/ADALQAGASQF\ETSAAKLKRKY\WWKN 
LK\MMI I LGVICAI ILI I I I VYFQHLNPRGVCPA 

4044 

A 

178 

731 

SSERCHQQVFIPMPALPPGF\SQAGSCVPTGSSLVli\CLLAASLLIiF 
VPTLAIiLTGATTCWCLHNKRLAV/RRPLAWAGAFGGWSTRLIHGRT 
\SFYFNSLP\LQTNSSTC\QN\HSWDSGGRGDLPWPIAAPRRVGLDIi 
CSCQAHTCQQGADWIHALLCLMGVTGVPTALPAKHLEQNTSPFWDA 
E 

4045 

A 

3 

155 

GAISAHCmRLPGSSDSPDSASNLIQMSLLRVLLLEMVLTI^TTRKP 
SEI 

4046 

A 

100 

448 

WWEVKGMSHGQKNKQEGGAIHFRACVCVMESHSVT\RLECSGAISAH 
CNLRLQG S SNS PDS ASR VAGTTS TRHEAQL I FVFLAETG FHHVGQAA 
LELLTSSDPPTSASQSADMLYK 

4047 

A 

151 

461 

RLTAAATASLCPLPRP\TTRLPLSRGSKMKEGMSNNSTT\ S I SKPRKA 
\TEQLKMEACMDRVKVSQAAAD\LLAYCEAHVREDPLIIPVPASENPF 
REKKFFCTIL 

4048 

A 

1 

447 

FPVCWNSEAAAAEDPSPDPGLRSQNHEPGHQHRSPANRWVGGPCAAS 
P S S LKEALLVPVLQVSRGQKVTS LTACLVDQS LRLDCRHENTS S S P I 
Q Y E FS LTRETKKHVL FGTVG VPEHTYRS RTN FTS KYNMKVL YLS AFT 
SKDEGTYT 

4049 

A 

553 

1938 

L YQTAS CPE YL VD YLENYGRKGS ASQVPSQAPPSRVPE IIS PTYRP I 
GRYTLWETGKGQAPGPSRS S S PGRDGMQLLACVSLGERS VLPLALVS 
SSEKRADVCSACIAGLGFSICTLEMITVPVLEDGTWLC7VPTALTIRQ 
IHVSVS IDDILPRGKVFREDVI 1 1 YKAKHRGEVSSPYHYLQLSPILK 
AASVHSAQNVVDTLLSELQTGTDPRILPPGSPTGGTYLPLLGGWTSR 
TGHKHGAAVHKNTLHTKLLLLGDGGMASAIQDCFSYLFSTSFSDMKL 
NPDPGLRSQNHEPGHQHRSPANSLKEALLVPVLQVSRGQKVTSLTAC 
LVDQS LRLDCRHENTS S SP I QYEFS LTRETKKHVL FGTVGVPEHTYR 
SRTNFTSKY1WKVLYLSAFTSKDEGTYTCALHHSGHSPP ISSQNVTV 
LRDKLVKCEGISLLAQNTSWLLLLLLSLSLLQATDFMSL 

4050 

A 

303 

478 

DCTSRPSLCPLSARCDVHGLSPGPELVR*GLIVIEDALPKSHSGGRG 
EVRQPWDFCGA 

4051 

A 

458 

964 

DPRTMNLAI S I ALLLTVLQVSRGQKVTS LTACLVDQS LRLDCRHENT 
/ S AVS P I QYE F SLTRETKKHVLFGTVGVPEHTYRSRTNFTSKYNMKV 
LYLSAFT\SKDEGTYTCALHHSGHSPPI\SS\QNVPVFRDHGVK\CE 
GISLLA\QNASWLL\LLLLSLSLLQ\ATDFMSL 

4052 

A 

411 

557 

YS TDEL PNI KI STSGRLQWLMS 1 1 PALWEA/ EDQAGRS F EFRNLRLV 
WTT 

4053 

A 

508 

1222 

PLPPAATPCP KKQLG S QAWACRAARGAEQGVWGWRKHP AP SRGRVDK 
GSTKQKDPSPAHTTSTLNYQMHSCS/HKRRHTHTHTHTHTHTHTHTH 
TRQHNIPGGPSPDQEEAGQWEKKAANPSHNPTLQDRRVSSILSCCK 
RKLEAPS PEQI SMVGPRGSNLWFPHWQRRQEPAKCS FGL KEQWGGDF 
SRMNRIJ^GPGVNPSTSFPWSRKLLHGKDVRCWLRHRAKCLSARCPRI 
WRVW 

4054 

A 

1609 

1808 

DRWJLLPRLEYSGMVSAHCNTHLLGSSQS\PTSASQAAGTTGAHHYV 
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PQLVYIYIYILGFLVETRFR 

4055 

C 

169 

297 

MILEFHXXXXXXXGGGRFKEPLGGPNLPGAGKVLFFSLWGAD* 

4056 

A 

270 

614 

PFKFILITQVIRDPLFSSEPYLGEKANHPPGFHHVKGQQASHVCKTH 
PAPSCPP/PDCCLPCPSLGPAPHFRGCCEMEFPGKNFRCLDPFYLDN 
IFRFFCSLVTYPGGKEIARTFF 

4057 

A 

637 

890 

TKWAGSIVYLSFSFLHMFMSFLLLV/STFLALCMLILWLFLLKLPP 
KLFSELPLLKLTFLPCELLHLHLLFQDLAYFFFNGLRK 

4058 

A 

3 

297 

ATAARPCPGLGRRAGESRGPGHIGVRP\GSTLCQIIATCHMSVNDGG 
C\KYVLCRWEKRLWPAK^ARTATSTKMKRRKEYFLAVQILSLEEKL 
SWLF 

4059 

A 

2 

1942 

CPGLGRRAGESRGPGHIGVRPGSTLCQI IATCHMS VNDGGC\ KYVLC 
RWEKRLWPAKVLARTATSTKNKRRKEYFIjAVQILSLEEK\IKVKSTE 
VEILEKSQIEAIASSLASQNEVPAA/PPLEELAYRRSLRVALDVLSE 
GSIWSQESSAGTGRADRSLRGKPMEHVSSPCDSNSSSLPRGDVLGSS 
RPHRRRPCVQQSLSSSFTCEKDPECKVDHKKGLRKSENPRGPLVLPA 
GGGAQDESGSRIHHKNWTIiASKRGRNSAQKASLCLNGSSLSEDDTER 
DMG S KGGS WAAP S L P SGVREDDPCANAEGHDPGLPLGS LTAPPAPE P 
SACSEPGECPAKKRPRLDGSQRPPAVQLEPMAAGAAPSPGPGPGPRE 
SVTPRSTARLGPPPSHASADATRCLPCPDSQKLEKECQSSEESMGSN 
SMRSILEEDEEDEEPPRVLLYHEPRSFEVGMLVWHKHKKYPFWPAW 
RSVRQRDKKASVLYIEGHMNPKMKGFTVSLKSLKHFDCKEKQTLLNQ 
AREDFNQDIGWCVSLITDYRVRLGTWGAGEGWSWAAGVQPLKTPWTG 
WAGAAWRRGI FPWAWG I PCGWAGVLAWPDYFliNRDAGPAPGTHFLGL 
LLGLHSALSPPPPAGCGSFAGSFLEYYAADISKSTGCI 

4060 

A 

1 

2722 

MERPLRGSRHPRLFPVPPGVLGPSQGHIGVRPGSTLCQIIATCHMSV 
NDGGC\ KYVLCRWEKRLWPAKVTAI I / RVTSGLRNDFRVTKTS VTGP 
SILKYLVFEPVTLNQRLTFKRQITDDQAAHCSAPIKVKSTEVEILEK 
SQIEAIASSLASQNEVPAAPLEELAYRRSLRVAIiDVLSEGSIWSQES 
SAGTGRDN\RSLRGKPMEHVSSPCDSNSSSLPRGDVLGSSRPHRRRP 
CVQQSLSSSFTCEKDPECKVDHKKGLRKSENPRGPLVLPAGGGAQDE 
SGSRIHHKNWTLASKRGGNSAQKASLCLNGSSLSEDDTERDMGSKGG 
SWAAPSLPSGVREDDPCANAEGHDPGLPLGSLTAPPAPEPSACSEPG 
ECPAKKRPRLDGSQRPPAVQLEPMAAGAAPSPGPGPGPRESVTPRST 
ARLGPPPSHASADATRCLPCPDSQKLEKECQSSEESMGSNSMRSILE 
EDEEDEEPPRVLIjYHEPRSFEVGMLVWHKHKKYPFWPAVSWDESTRG 
WISRIiKAAFRTIALHIPKSNAQNRCHIiTSPSLRVPGAHLRSDQPKPS 
GPRCPPAFRPAQAIGSPVPTCVQISPSHRVPGAHLRSDOPKPSGPRr 
PPFEQLRAQTCAWCVSHLPEGPCARRLPEPPLSGRNQAREDFNQDIG 
WCVS L I TD YRVRLGCGS FAGS FLEYYAAD I TS RACPGAL I PWATS S V 
SLLWLVAAHIRCCEECRPGASFAGYPVRKSIQQDVLGTKLPQLSKGS 
PEEPVVGCPLGQRQPCRKMLPDRSRAARDRANQKLVEYIVKAKGAES 
HLRAIIiKSRKPSRWLQTFLSSSQYVTCVETYLEDEGQLDLWKYLQG 
VYQEVGAKVLQRTNGDR I R F I LDVLLPEA I 1 CA I S AVDE VDYKTAEE 
KYIKGPSLKLPGKRNI 

4061 

A 

57 

235 

PR VRQKTVGLF I L E I PP FVLDCKL S F FLFRPFTTRTENVCKS S VTVQ 
KENNFLCIASKR 

4062 

A 

754 

1767 

WTSWWMSSVLTILliFSLQGNKMLNYSAPSAGGCLLDRKAVGTPAGGG 
FPRRHSVTLPSSKFHQNQLLSSLKGEPAPALSSRDSRFRDRSFSEGG 
ERLLPTQKQPGGGQVNSSRYKT\ELCRPFEENGACKYGDKCQFAHGI 
HELRSLTRHPKYKTELCRTFHTIGFCPYGPRCHFIHNAEERRALAGA 
RDLSADRPRLQHSFSFAGFPSAAATAAATGLLDSPTSITPPPILSAD 
DLLGSPTLPDGTNNPFAFSSQELASLFAPSMGLPGGGSPTTFLFRPM 
SESPHMFDSPPSPQDSLSDQEGYLSSSSSSHSGSDSPTLDNSRRLPI 
FSRLSISDD 

4063 

A 

568 

715 

NFISTSPALKISMLGKAKLNILAMVFVTMLVSGMVPCPPPPSPPTPL 
KI 
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4064 

A 

65 

347 

GLPAPLPPP/PPRSLPFPAPGLRSQRFSTSAPPRHARPPPVARARAA 
P PHPQASGRKS QVS APLVERAPLRAPGLTATAVVTVLGDPACAFPLB 
M 

4065 

A 

164 6 

1974 

KVPFCFYRIWCICTCVC^CVC/TLQTL/CYSIANMLTSSQCIiQSCG 
SQSWCQMHIKSSKAIMTIPCKFISRKPWEGDCSSLEPHGVSAFDIWV 
PQLCI KKVLNHFSPRKN 

4066 

A 

12 

1662 

FLRGGCCFCYMTCQRPSLRPTALLPERSVPPLPPAAPPRPVRHPPVA 
APPARRCPAPAPPPPFPPDDWVMLDMGDRKEVKMI PKSSFS INSLVP 
EGLQNDNHHASHGHHNSHHPQHHHHHHHHHHHPPPPAPQPPPPRAAQ 
QQQPPPPPI*APQAGGAAQSNDEKGPQLLLtiPPTDHHRPPSGAKAGGC 
CRPGELGPVGPDEKEKGAGAGGEEKKGAGEGGKDGEGGKEGEKKNGK 
YEKPPPFSYNAIilMMAIRQSPEKRLTLNGIYEFIMKNFPYYRENKQG 
WHNSIRDNLSLNKCFVKVPRHYDDPGKGNYWMLDPSSDDVFIGGTTG 
KLRRRS \ TTSRAKLAFKRGAA\ LTSTGLTFMDRAGSLYWPMS PFLSL 
HHPRASSTLSYNGTTSAYPSHPMPYSSV^TQNSLGNNHSFSTANGLE 
AWDRLV\NGEIP/YTATHHLT\AAALAA\SVPCGLSVPCSGTYSLNP 
CSVNLLAGQTSYFFPHVPHPSMTSQSSTFMSARAASSSTSPQAPSTL 
PCESLRPSLPSFTTGLSGGLSDYFTHQNQGSSSNPIilH 

4067 

A 

3 

260 

STEQQRRSARRAGAHGPFPTASASAGPTPRGRGRPLFPGLRAEVDEF 
ESCGPLNVAARGRRAPTYSGPGKVALRLWNFCPALIX5E 

4068 

A 

1 

1386 

MNESKPGDSQNIACVFCRKHDDCPNKYGEKKTKEKWNIjTVHYYCLLM 
SSGIWQRGKEEEGVYGFLIEDIRKEVNRTISKIjVQAINAGVFFFRCTI 
CNNSDIFQKEMLRMGIHIPEKDASWELEENAYQELLQHYERCDVRRC 
RCKEGRDYNAPDSKWEIKRCQCCGSSGTHLACSSLRSWEQNWECLEC 
RGIIYNSAIiKENLYYEAGKMLAISLVHGGPSPGFFSKTLFNCLVYGP 
ENTQP I LDDVSD FD VAQ 1 1 1 R I NTATTVADLKS I INECYNYLELIGC 
LRIilTTLSDKYMLVKDILGYHVIQRVHTPFESFKQGLKTLGVLEKIQ 
AYPEAFCSILCHKPESLSAKILSELFTVHTLPDVKALGFWNSYIiQAV 
EDGKSTTTMEDILIFATGCSSIPPAGFKPTPSIECLHVDFPVGNKCN 
NCLiAIPITNTYKEFQENMDFTIRNTLRIiEKEESSJTYIGH 

4069 

A 

137 

611 

MKVNLGDSQNIACVFCRKHDDCPNKYGEKiCTKEKWNLTVHYYCLLMS 
SG I WQRGKEEEG VYG FL I ED I RKEVNRAS KLKCCVCKKNGAF I GC VA 
PRCKRSY\HFPCGLQRECIFQFTGNFASFCWDHRPVQIITSNNYRES 
LPWHHLLGNLFEPIPSY 

4070 

A 

1 

507 

MSAYQVFCKEYRVTIVADHPDFGELSKKLAEVWKQLPEKDKLIWKQK 
AQ YLQHKQNKAEATTVKRKAS S SEGSMKVKGSDHI PPLLFS KACEFC 
FKAiO<X»GKKLAVLEVDLDSQGNW 
TQLPELNGCPKQVLSNTLDNIAYIMPGL 

4071 

A 

259 

1409 

FLLWRYFVFGIVTEEKEKVQPTVYLIQLLDLLKAIT/SPHLEAGSKP 
SKKT\GEKSSGSSSHSESKKEHHRKKVSGSSGELPLEDGVSHKSKKM 
KPLYVNT\ETLTLREPDGLKMKLILSPKEKGSSSVDEESFQYP\SQQ 
ATVKKS S KKS ARDEQG ALLLGHEIjQS FLKTARKKHKS S S DAHS S \ PG 
PLKGLLGLDA\SQFRKSPHSAN\LDLSGLEP\ ILVESDSSSGGELEA 
GELVIDDSS\REIKKKKKSKK5KKKKDKEKHKEKRHSKSKRSLGLSA 
VPVGEVTVTSGPPPSIPYAGAAAPPLPLPGLHTDGHSEKKKKKEEKD 
KERERGEKMGPSSWKKISSGLPLILHDGPWKNCFPRNRYPVPKRLRT 
ADLVGLYLGSQKHK 

4072 

A 

436 

2063 

RRREVEGWYFGGFSHRRTVCICDTPDISSLESSQKKKKKSSPQSTDT 
A1^LLKAITSPLAAGSKPSKKTG\EKSSGSFKAISGELKREHHRKKV 
SGSSGEIiPl»EDGGSHKSKTfMTfPT.WM T rT? r TT tt oironr'T vtvivt tt cdv 

EKGSSSVDEESFQYPSQQATVKKSSKKSARDEQGALLLGHELQSFLK 
TARKKHKSSSDAHSS^GPEGCGSDASQFAESHSANLDLSGLEPILVE 
SD S S SGGELEAGELVIDDS YRE I KKKKKS KKS KKKKDKE KHKEKRHS 
KSKRSLGLSAVPVGEVTVTSGPPPSIPYAGAAAPPLPLPGLHTDGHS 
EKKKKKEEKDKERERGEKPKKK1WSAYQVFCKEYRVTIVADHPGIDF 
GELSKKLAEVWKQLPEKDKLIWKQKAQYIjQHKQNKAEATTVT<RKAS^ 
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QLLGESLSLIGHRLQETEGMVAVSGSLS VLLDS I ICALGPLACLTTQ 
LPELNGCPKQVLSNTLDNIAYIMPGL 

4073 

A 

323 

539 

PFMICLVHFCAPGN\PPSARPRPSPSPRCHPCAPQEGKMTHPQFRVM 
PAPSC YNWXjLENVPFI IONSGVP T S 

4074 

A 

27 

425 

GGGSGPRAPSATIiLDTGES VAAVASHRDKH T a a q a a a a avpa r* c r»c t> 
DPQSSTIWPVYSPVQPGAPYGNPKNMAYTGYPTAYPAAAPAYNPSLY 
PTNS PS YAPATLVM\ KQAWPQNS SSCGTEGTFHLPVDTG 

4075 

A 

1 

434 

WTAHEFPGRRFRGGSGPRAPSATLLDTGESVAAVASGEDKGIAASAA 
AAAVFACS CS PDPQS STMNPVYS PVQPGAPYGNPKNMAYTGYPTAYP 
AAAPAYNPSLYPTNSPSYAIiFLHGLQRRGLKTGQRLHGGGWQEQKLQ 
MKK 

4076 

C 

117 

251 

MVPGPSLQNSWQASFNIPWPALPKPDLGPSPSVSPVDIFCGACC* 

4077 

A 

58 

313 

SSKQNKNPKESKTTKQNQIYTSYTQARPWGFP!FT\PGLCFENTYKF 
FDQTTWKTLVKSSSSSSSSSSSPILFLFKM*VPIKFKN 

4078 

A 

1 

279 

MDAAGGYPTAYPAAAPAYNPSLYPTNSPSYAPGLGSPQVAQQQFLKH 

4079 

A 

54 

946 

GGGSGPRAPSATLLDTGESVAAVASGEDKGIAASAAAAAVFACSCSP 
DPQSSTMNPVYSPVQPGAPYGNPKNMAYTGYPTAYPAAAPAYNPSLY 
PTNS PS YAPEFQFLHS AYATLLMKQAWPQNS S \ S CGTEGTFHLPVDT 

*^ A »-*A'X\. 4. a vrtoortrt.x' ivx X.AV7 XxrXIV.Virr'X ^olv X J\jrir xr X isir orWr x y X AM 

YP I RS AYPQQNL YAQGAYYTQPVYAAQPHVIHHTTWQPNS I PS AI Y 
PAPVAAPRx^GVAMGMVAGTTMAMSAGTLLTTPQHTAIGAHPVSMPT 
YRAQGTPAYSYVPPHW 

4080 

A 

1 

502 

RHEGGMWRAGSMSAELGVGCALRAVNERVQQA\VARRPR\DLPSPSQ 
PRLVAV/ S AKTKPADMVIEAYGHGQRTFG\ ENYVQELLEKASNPKI L 
SLCPEIKWHFIGHLQKQNVNKLMGKSEFLIKRENIMPRTELNSPFSG 
V I YTWETYPTSMF I I I WWS FRKI I KNCS P 

4081 

A 

171 

1184 

xx»xmi\jjjj^ulilv^ovjiJx'Px LiHLir AKSKHSTIjPFSLGIjGVGPRGMWR 
AGSMSAELGVGCALRAVNERVQQAVARRP/RGDLPAIQPRLV\AVSK 
T\KPADM\VIEAYGHGQRTFGENYVQEL\LEKAS\NPKILSLCPEIK 

WTTT**T\ r5HT.OVOKTVTvFVT.Ma , \7'DKIT T?MT \ t lr m/nT?Pr»PT nr\u^rxic»PM\ /-\T"»tr 

-i: \vjnj_»vxvyi>i vm ajupi/vv rTJJjr riij \£» x Vbr Lc»LLADKVNSoW\QRK 
\GSP\ERLKVMGPGFNTSREEIYLFSVSLLEGKHG\LPPSETIAIVE 
HINAKC\PNL\EFVGL\MTLG\SFGHDLSQGPNPDFQLLIiSLPEE/l 
CGKKI^IPAEQVELIMGMSADFQHAVEVGSTNVRIGSTI FGERDY\ S 
KKPTPDKCAADVKAPLEVAQEH 

4082 

A 

1 

418 

YATLGTRQQIEVARQLIDEKVGGTNLGAPGAFGQSPFSQPPAPPHQN 
TFPPRSSGCFPNMAAKVNGNPHSTPVSGPPAFLTQGWGSTYQAWQQP 
TQQVPSHAAS AAPQAS S P PDYTMAWAE YYRQQVAF YGQTLGQAQA 

4083 

A 

2 

2447 

KF FMVAL P FVS VAFIiAETE I S EDHRP FD I L YQTTQ WTRTHGG AYVRV 
PHDYSCCGAVLNNASKEGKSSHLFLKEKEGSLSIFTLSLDPEQEGVG 
NQLGALVHQRQAKRLIiGQIVDRCRNGPGFHNDIDSNSTIQEILIPAS 
KVGLVIGRGGET I KQLQERTGVKMVM I QDGPLPTGADKPliRITGDAF 
KVQQAREMVLEIIREKDQADFRGVRGDFNSRMGGGSIEVSVPRFAVG 
I VIGRNGEM I KKIQNDAGVRIQFKPDDG I S PERAAQVMG PPDRCQHA 
AHIISELILTAQERDGFGGLAAARGRGRGRGDWSVGAPGGVQEITYT 
VPADKCGLVIGKDl^GLCSCKMQEWLSFRI^SELVWVRVHLPAGV 
RAGGENIKSINQQSGAHVELQRNPPPNSDPNLRRFTIRGVPQQIEVA 
RQLIDEKVG\TFPPRSSGCFPNMAAKVNGNPHSTPV/QGWGSTYQAW 
QQPTQQVPSQQSQPQSSQPNYSKAWEDYYKKQRCFLLVGRDRVEGYQ 
RSGPPGVGI I ARFMAIiLGVGGHAAS AAPQAS S PPDYTMAWAEYYRQQ 
VAFYGQTLGQAQAHSQALGKQVGSHTLDAGVSLHLCLLYPCQSTGSL 
LSSRDTYVDGIQVAiPVSNWYRV^IAAKIDSIPHLNNSTPLVDPSVY 
GYGVQKRPLDDGEEGLVGPDETNEIPNANFRQQIRKIilKDGLILRHR 
KPVx^SRAQCWKSTLARRKGRHLGIESKKIDRHI^miSLYLKLKGW 
FKHKRILTEHSHKLKADKARKKPL^ 


.1. 01 0 8 «4 6 4» 3 . O E! iS 6 O 5 






TKKEEIIKTLSQEEKAKK 

4084 

A 

1 

271 

ALGKQVGSHTLDAGVSLHLCLLYPCQSTGSLLSSRDTYVDGIQVALP 
VSNWYRVLI AAKIDS I PHLNNS TPLVDPSVYGYGVQKRPLDDG 

4085 

A 

1 

1814 

GPQAEREAGPG SRA/ RGVGGVGGGGDGGGGGNGGAGAGAERS CGDEG 
RG\ FVDAIiHRVRQI AAKIDS I PHliNNSTP\ LVDPSVYGYGVQKRPLD 
DG VGNQLG ALDHQRTV I TEEFKVPDKMVGF 1 1 GRGG E QI SRIQAESG 
CKIQIASESSGIPERPCVIiTGTPESIEQAKRLLGOIVDRCRNGPGFH 
NDIDSNSTIQEILlPASKVGLVIV\RGGETIKQLQERTGVKMVMIQD. 
GPLPTGADKPLRITGDAFKVQQAREMVIiEIIREKDQADFRGVRGDFN 
SRMGGGSIEVSVPRFAVGIVIGRNGEMIKXIQNDAGVTTHYRPEDAV 
\ S PQRAAQVMGPPDRCQHAAH\ I I SELILTAQERDGFGGLAAARGRG 
RGRGDWSVGAPGGVQEITYTVPADKCGLVIGKGGENIKSINQQSGAH 
VELQRNPPPN SDPNLRRFT I RG VPQQ I E VARQL IDE KVGGTNLG APG 
AFGQSPFSQPPAPPHQNTFPPRSSGCFPNMAAKVNGNPHSTPVSGPP 
AFLTQGWGSTYQAWQQPTQQVPSQQSQPQSSQPNYSKAWEDYYKKQS 
HAASAAPQAS S P PD YTMV\ WF\ E YYRQQVAF YGQTLGQAQAHSQEQ 

4086 

A 

1 

265 

AGNRLRFQLELEFVQCLANPNYLN/WYPQCLHMLEIiLQYEHFRKELV 
NAQCAKFIDEQQILHWQHYSRKRMRIjQQAIiXAKQQQQNNTSGK 

4087 

A 

1 

987 

MIGPPGSWLAGWQLEAFLSWLRAIiAPWWPLGSSISSGRFFVLAGGRI» 
GRGVGLSAQCRDVSTLRDEALTWREWSVRCEEDSTCREDASRMRGS 
RFPEGEREGPSLRRWEVSTRRDDVSTRQKRGKLRYCHSRERHNTMTW 
KHHGPSKAKTLDSCPSGAPTARAPPPRYFGASAGGAIPGLVGDGRSR 
PAPYPTVPRDRKREPWFPELRPRLWAFALSGWWRFGVFVCYGRCCRY 
GDRGYFKDKAFVNYLKYLLYWKDPEYAKYLKYPQCLHMLELLQYEHF 
RKELWAQCAKFIDEQQILHWQHYSRKRMRLQQALAEQQQQNNTSGK 

4088 

A 

1 

491 

RG FRNFARVS GL1iIjC!QAGGVIjVS S FVMAAAVAMETDDAGNRLRFQLE 
LEFVQCYANHNTLIFLPQR\GYF\KDKAFVNYLKYI,LYW\KDPEYAK 
YLKYPQCLHMLGAAPNMEHFRKEAGWNAQC\AKFIGWNRQVSTU3KH 
YS \ RKRM \ RLQQALAEQQQQNNTS G K 

4089 

A 

55 

858 

EKGRARGRERRRKMQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPE 
RDDHEGQPRPRVPRKRGHISPKSRPMANSTLLGLLAPPGEAWGILGQ 
PPNRPNHSPPPSAKGKJCIFGWGDFYSNIKTVALNLLVTGKIVDHGNG 
TFSVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNC 
\RMEWEKVE\RGRRTSLFTHDPAKICSRDHAQSSS\TWS\CSQ\PFK 
WCVYIAFYSTDYRIiVQKVCPDYNYHSDTPYYPSG 

4090 

A 

1 

368 

MRARNLMTKLTVISRKEELQQEAAAHREVQSLPPGEMEERSSSKTCP 
QVCDGRILSCMIKVPP/VGREKH*RPRGRSH*HEAGCGPFTPPKVDH 
HEHLPPRRVEAGSRQQKSTTEPFPQAAGW 

4091 

A 

253 

536 

YLASLEIPQSKVPESRPSHQVPSSQSLDSETPPSN/RHRQPPVDRKC 
VCRL\PLPPPPTFRPPKVDHHEHLPPRRVEAGSRQQKSTTEPFPQAA 
GC 

4092 

A 

52 

310 

SLGSPVLPGSGAAGVSTVAHSSAH*VEANWDSCQGGGLGSARAARRA 
PAVHWQQLHSGDWVPQIRTRALVHHHFRRPLLRPAHSQQ 

4093 

A 

247 

363 

DANDGSTVSATVETPAAPDPGRTGEPRDHAAFEKSSRIS 

4094 

A 

1 

608 

ME I AI TKAKVDFS S WCLPPS V I AVNGLDGGGAGENDDE PVLVSLS A 
APSPQSEAVANELQELSLQPELTLGLHPGRNPNLPPLSERKNDLKLF 
LHGYLRLTFFRLFKLYIiSSDSLCHSELLLSFSAYLLNFVTLQHSALE 
ERGCFNYDNCNKPCCMGEVKGSGPQDFPELQMPIDSGGSSGYPQLLS 
ELATNQRFPQAS PQVGS FARTTH* VEANWDSCQGGGLGS ARAARRAP 
AVHWQQLHSGDWVPQIRTRALVHHHFRRPLLRPAHSQQ 

4095 

A 

144 

3332 

srsgasrgtrpgaarqviggygwhgiefhlsvgivskiaitkakvdf 
sswclppsviavngldgggagenddepvlvslsaapspqseavane 
lqelslqpeCtlglhpgrnpnlpplserknvlqlklqqrrtreelvs 
qgimpplkspaafheqrrslerartedylkrkirsrperselvrmhi 
leetsaepslqakqlklkrarladdlnekiaqrpgpmelveknilpv 
esslkeaiivgqvnypkvadsssfdedssdalspeqpashesqgsvp 
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SPLEARVSEPLLSATSASPTQWSQLPMGRDSREMLFLAEQPPLPPP 
PLLPPSLTNGTTIPTAKSTPTLIKQSQPKSASEKSQRSKKAKELKPK 
VKKLKYHQ Y I P PDQKQDRGAP PMDS S YAKI LQQQQL FLQLQI LNQQQ 
QQHHNYQA I LPAPPKS AGEALGS SGTPPVRSLSTTNSS S S SGAPGPC 
GLARQNSTSLTGKPGALPANLDDMKVAELKQELKIiRSLPVSGTKTEL 
IERLRAYQDQISPVPGAPKAPAATSILHKAGEWVAFPAARLSTGPA 
LVAAGLAPAEVWATVASSGVVKFGSTGSTPPVSPTPSERSLLSTGD 
ENSTPGDTFGEMVTS PLTQLTLQAS PLQILVKEEGPRAGS CCLS PGG 
RAELEGRDKDQMLQEKDKQ I EALTRMLRQKQQLVERLKLQLEQEKRA 
QQPAPAP APLGTP VKQENS FS S CQL S QQPLGP AH P FNP S LAAPATNH 
IDPCAVAPGPPSWVKQEALQPEPEPVPAPQLLLGPQGPSLIKGVAP 
PTL I TDSTGTHLVLTVTNKNAD S PGLS SGS PQQMDLEHPLQPLFGTP 
TSLLKKEPPGYEEAMSQQPKQQENGSSSQQMDDLFDILIQSGEISAD 
FKEPPSLPGKEKPSPKTVCGSPL\AAQPSPSAELPQAAPPPPGSPSL 
^JjKijiiiJr Lbbb XvjJLii'JjJbI&(jHDGPEPIjSLIDDIjHSQMLSSTAILiDHP 
PSPMDTSELHFVPEPSSTHGPGTWANGHL\DSMDWL\ELSSGGPVLS 
LAPLSTTAPSLFSTDFLDGHDLQLH\WDSCL 

4096 

A 

318 

543 

KSSWSRWAFGTVRSPCGRQHRDIGATDNSGRSGPAEQRPQSS/ASSA 
MSAAGGSRLTFHPRNLPPRTPTQDSAALT 

4097 

A 

42 

342 

ENE I LE WQLGSRVCVFHF YHLHCCRHHCPENVHAG / C FGDCIX3/HG 
RCS IQNCTFER* I *VG*NNVAATAVMFGLVFPLVLSLKYQVYFFKVL 
ALMNFCSSS 

4098 

A 

1 

1289 

MYERVYVSTDIAIKLCCKLLNRQFDRLYKKHSTGSASGEGLKALPLI 
VEGEGEPVCAEITRQEREEGAVHLFGLTWQLQPVEGELYANGVSKGN 
RGTESMDTTYSPIGGKVSDKSEKKVFQKGRAIDTGEVDIGAQVMQTI 
PPGLFWRFQITIHHPIYIiKFNISLAKDSLLGIYGRRNIPPTHTQFDF 
VKLMDGKQLVKQDSKGSDDTQHSPRNLILTSLQETGFIEYMDQGPWY 
LAFYNDGKKMEQVFVLTTAIEIMDDCSTNCNGNGECISGHCHCFPGF 
LGPDCARDSCPVLCGGNGEYEKGHCVCRHGWKGPECDVPEEQCIDPT 
CFGHGTC IMG VC I CVPG YKGE I CEEEDCLDPMCSNHG I CVKGECHCS 
TGWGGVNCETPLP/EQCSGHGTFLLDAGVCSCDPKWTGSDCSTEHKG 
GSSRLFL 

4099 

A 

1 

4509 

VRLE W PTDLiAVNPMDNS L YVLDNN I VLQ I S ENRR VR 1 1 AGRP IHCQV 
PGIDHFLVSKVAIHSTLESARAISVSHSGLLFIAETDERKVNRIQQV 
TTNGEIYIIAGAPTDCDCKIDPNCDCFSGDGGYAKDAKMKAPSSLAV 
SPDGTDYVADLGNVRIRTISRNQAHLNDMNIYEIASPADQELYQFTV 
NGTHLHTLNL I TRD YVYNFTYNS EGDLGA I TS SNGNS VH I RRDAGGM 
PLWLWPGGQVYWLTISSNGVLKRVSAQGYNLALMTYPGNTGLLATK 
SNENGWTTVYEYDPEGHLTNATFPTGEVSSFHSDLEKLTKVELDTSN 
RENVIJ^STNLTATSTIYILKQGGDTRHSKSTLTFVSGILWSTATVMV 
G PGENTQ S TYR VNPDGS LR VTF AS GME I GLS S EPH I LAG AVNPTIjGK 
CNI SLPGEHNAKTLI EWRQRKEQNKGNVS AFERRLRAHNRNLLS IDFD 
HITRTGKIYDDHRKFTLRILYDQTGRPILWSPVSRYNEVNITYSPSG 
LVTFIQRGTWNEKMEYDQSGKIISRTWADGKIWSYTYLEKSVMLLLH 
SQRRYIFEYDQPDCLLSVTMPSMVRHSLQTMLSVGYYRNIYTPPDSS 
TS F I QDYS RDGRLLQTLHLGTGRRVLYKYTKQARLSEVL.YDTTQVTL 
TYEESSGVIKTIHLMHDGFICTIRYRQTGPIilGRQIFRFSEEGLVNA 
RFDYSYNNFRVTSMQAVINETPLPIDLYRYVDVSGRTEQFGKFSVIN 
YDLNOV I TTTVM KHTKI F S ANHOV T RVnvp tt.ttzi t zv ywm tt rwnTJWi 

/ RHGNM\ cirvgvdanitryfyeydadgqlqtvsvndktqwrysydl 

NGNINLLSHGKSARLTPLRYDLRDRITRLGEIQYKMDEDGFLRQRGN 
DIFEYNSNGLLQKAYWKASGWTVQYYYDGLGRRVASKSSLGQHLQFF 
YADLTNPIRVTHLYNHTSSEITSLYYDLQGHLIAMELSSGEEYYVAC 
DNTGTPLAVFS S RGQ V I KE I L YT P YGD I YHDT Y PD FQVI IGFHGGLY 
DFLTKLVHLGQRDYDWAGRWTTPNHHIWKQLNIiLPKPFNLYSFENN 
YPVGKIQDVAKYTTDIRSWLELFGFQLHNVLPGFPKPELENLELTYE 
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LLRIiQTKTQEWDPGKTILGIQCELQKQLRNFISLDQLPMTPRYNDGR 
CLEGGKQPRFAAVPSVFGKGI KFAIKDGIVTADI IGVANEDSRRLAA 
ILNNAHYLENLHFTIEGRDTHYFIKLGSLEEDLVLIGNTGGRRILEN 
GVNVTVSQMTSVLNGRTRRFADIQLQHGAIiCFNIRYGTTVEEEKimV 
LEIARQRAVAQAWTKEQRRLQEGEBGIRAWTEGEKQQLLSTGRVQGY 
DGYFVLSVEQYLELSDSANNIHFMRQSEIGRRYLLQQYSSSQYQNLY 
WSVQAAITRYHRQGGLNNRNLFLTILETGKSKIKVLQIAAIQPSGKQ 
ELLAGF PAD FAM I HSHCVHGLVD S WAGRGG E LCKGDAAATAARHKS T 
T 

4100 

A 

338 

8522 

SEMEQTDCKPYQPLPKVKHEMDLAYTSSSDESEDGRKPRQSYNSRET 
LHEYNQELRMNYNSQSRKRKEVEKSTQEMEFCETSHTLCSGYQTDMH 
S VSRHG YQLEMGSDVDTETEGAAS PDHALRMW I RGM KSEHSS CLS S R 
ANSALSLTDTDHERKSDGENGFKFSPVCCDMEAQAGSTQDVQSSPHN 
QFTFRPLPPPPPPPHACTCARKPPPAADSLQRRSMTTRSQPSPAAPA 
PPTSTQDSVHLHNSWVLNSNIPLETRHSLFKHGSGSSAIFSAASQNY 
PLTSNTVYSPPPRPLPRSTFSRPAFTFNKPYRCCNWKCTALSATAIT 
VTLALLLAYVIAVHLFGLTWQLQPVEGELYANGVSKGNRGTESMDTT 
YSPIGGKVSDKSEKKVFQKGRAIDTGEVDIGAQVMQTIPPGLFWRFQ 
ITIHHPIYLKFNISIJUCDSLLGIYGRRNIPPTHTQFDFVKLMDGKQI* 
VKQDS KGSDDTQHS PRNL I LTS LQETGF I E YMDQGP WYLAFYNDGKK 
MEQVFVLTTAIEIMDDCSTNCNGNGECISGHCHCFPGFLGPDCARDS 
CPVLCGGNGEYEKGHCVCRHGWKGPECDVPEEQCIDPTCFGHGTCIM 
GVCICVPGYKGEICEEEDCLDPMCSNHGICVKGECHCSTGWGGVNCE 
TPLPVCQEQCSGHGTFLLDAGVCSCDPKWTGSDCSTELCTMECGSHG 
VCSRGICQCEEGWVGPTCEERSCHSHCTEHGQCKDGKCECSPGWEGD 
HCTIAHYLDAVRDGCPGLCFGNGRCTLDQNGWHCVCQVGWSGTGCNV 
VMEMLCGDNLDNDGDGLTDCVDPDCCQQSNCYISPLCQGSPDPLDLI 
QQSQTLFSQHTSRIiFYDRIKFltlGKDSTHVIPPEVSFDSRRACVIRG 
QWAIDGTPLVGVNVSFLHHSDYGFTISRQDGSFDLVAIGGISVILI 
FDRS PFLPEKRTLWLPWNQF I WEKVTMQR WS DPPS CDISNFISPN 
PIVLPSPLTSFGGSCPERGTIVPELQWQEEIPIPSSFWLSYTiSSR 
TPGYKTLLRILLTHSTIPVGMIKVHLTVAVEGRLTQKWFP7UVINLVY 
TFAWNKTDIYGQKWGIAEALVSVGYEYETCPDFILWEQRTVVLQGF 
EMDASNIiGDWSLNKHHILNPQSGIIHKGNGENMFISQQPPVISTIMG 
NGHQRSVACTNCNGPAHNNKLFAPVALASGPDGSVYVGDFNFVRRIF 
PSGNSVSILELSTSPAHKYYLAMDPVSESLYLSDTNTRKVYKLKSLV 
ETKDLSKNFEWAGTGDQCLPFDQSHCGDGGRASEASLNSPRGITVD 
RHGFIYFVDGTMIRKIDENAVITTVIGSNGIiTSTQPLSCDSGMDITQ 
WLEWPTDLAVNPMDNSLYVLDNNIVLQISENRRVRIIAGRPIHCQV 
PGIDHFLVSKVAIHSTLESARAISVSHSGLLFIAETDERKVNRIQQV 
TTNGEIYIIAGAPTDCDCKIDPNCDCFSGDGGYAKDAKMKAPSSLAV 
S PDGTL YVADLGNVR I RT I S RNQAHLNDMN I YE I AS PADQEL YQFTV 
NGTHLHTLNL I TRD YVYNFT YNSEGDLG AI TS SNGNS VH I RRDAGGM 
PLWLWPGGQVYWLTISSNGVLKRVSAQGYNPALMTYPGNTGLLATK 
SNENGWTTVYE YDPEGHLTNATFPTGEVS S FHSDLEKLTKVEIiDTSN 
RENVLMSTNLTATS T I Y I LKQENTQSTYR VNPDG S LR VTFAS GME I G 
LSSEPHILAGAVNPTLGKCNISLPGEHNANLIEWRQRKEQNKGNVSA 
FERRLRAHNRNLLS IDFDHI TRTGKI YDDHRKFTLR IL YDQTGRP I L 
WSPVSRYNEVNITYSPSGLVTFIQRGTWNEKMEYDQSGKIISRTWAD 
GKIWSYTYLEKSVMLLLHSQRRYIFEYDQSDCLLSVTMPSMVRHSLQ 
TMLSVGYYRNIYTPPDSSTSFIQDYSRDGRLLQTLHLGTGRRVLYKY 
TKQARLSEVLYDTTQVTLTYEESSGVIKTIHLMHDGFICTIRYRQTG 
PLIGRQIFRFSEEGLVNARFDYSYNNFRVTSMQAVINETPLPIDLYR 
YVDVSGRTEQFGKFSVINYDIJSrQVITTTVMKHTKIFSANGQVIEVQY 
EILKAIAYWMTIQYDNVGRHGNM\CIRVGVD7^NITRYFYEYDADGQL 
QTVS VNDKTQWR YS YDLNGN INLLSHGKS ARLTPLRYDLRDR I TRLG 
E IQYKMDEDG FLRQRGND I FE YNSNGLLQKAYNKASGWTVQ YYYDGL 
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GRRVASKSSLGQHLQFFYADLTNPIRVTHLYNHTSSBITSLYYDLQG 
HLI AMELS SGEE YYVACDNTGTPLAVFS SRGQVI KE IL YTPYGD I YH 
DTYPD FQV I IG FHGGLYD FLTKLVHLGQRDYD WAGRWTTPNHH I WK 

LPGFPKPELENIiELTYELLRLQTKTQEWDPGKTILGIQCELQKQLRN 
FISLDQL PMTPR YNDGRCLEGGKQPRFAAVPS VFGKGI KFAI KDGIV 

Lr\XJ X X\J V t\lv Z\r\.Xit\r\.X XjvUXfXTl 1 XjCjIS XiriC X X Ej\Jt\XJ Ail X C X l\JL>VJO Li tit 

EDLVL I GNTGGRR I LENG VNVTVS QMTS VLNGRTRRFADI QLQHGAIj 
CFNIRYGTTVEEEKNHVLElARQRAVAQAWTKEQRRLQEGEEGIRAW 
TEGEKQQLLSTGRVQGYDGYFVLSVEQYTiELSDSANNIHFMRQSEIG 
RR 

4 1 Ol 

A 

139 

52 0 

L S L PLS PRLECS GT I LARCNLiHP S PPRS KR FS CLS L. PS S WD YRRA PP 
NLAI FLLLLFLVEMGFHHVGQTGLELLASNYLPTSA/ FPKCWDYRHE 
PPCPADKTNFLSLWFIGYNIFLESEVNFFLPWDA 

x \j 

a. 

7 

4 97 

VFTDM SGLRHE PKALRREL PEE AAAEEERPK T EVPS E T EVPREALEP 
SVPLMVSLEISLEAAEEEKSRISLIPPEERWAWPEVEAPEAPALPW 
PELPEyPMEMPLVLPPELELLSLEAVHSPRRMAARVTYLLLVLSAQQ 
ILHVKQEKPYGRLLIQPGPRFH 

4103 

B 

93 

182 

PPPKALRRELPEEAAAEEERRKIEVPSEIE* 

4104 

A 

51 

394 

WGGPGLKPLLMVLDP YRAVALELQANRE PDFS SLVS PLS PRRMAARV 
FYLLL\ VS VCMCVCVCGAGTQRPEARTGTPPTCP]LLAS * PVLS AQQI 
LHVKQEKPYGRLLIQPGPRFH 

4105 

A 

427 

499 

KTRGTMF Y YPNVLQRHTGCFAT I W 

4106 

A 

796 

2482 

KTRGNMFYYPNVLQRH\NGCFATIWLAATRGSRL 

CEEIIiNYVLVRVQPPQPGLPRPRFSLYLSAQLQIGVIRVySQQCQYL 
VEDIQHIL\YRIiHRAQLQMRIDMETELPSLLLPNHLAMMETLEDAPD 
PFFGMMSVDPRLPSPFDIPQIRHLLEAAIPERVEEIPPEVPTEPREP 
ERIPVTVLPPEAITIL\EAEPLRMIjEIEGEREI,PEVSRREL\DLL\I 
Afc. fc» fc. A 1 L» Jj hi x P HLi P P P A P AE V EG X GE ALiG PE E IjRxjTGWx^PCjAJjxjMK 
VTPPEELRLPAPPSPERRPPVPPLLRRRRRRRLLFWDKETQISPEEF 
PRNNLQTRAH\CWECPMV\QPPERTIRGPAELF\RTPTLSGWLPPEL 

T.HT.WTWV r , Ar>PPPK"AT.PPPT.PPPAA APPPPPTTTP\7PQPTPVPPPAT.P 

PSVPLIWSLEISLEAAEEEKSRISLIPPEERWAWPEVEAPEAPALPV 
VPE\lPEVP\MEMPLVLPP\ELELLlxNW^ 

D FS S \ L» VS P \ LS PRRMA\ ARVFYLL \ LVLSAOOILH\ VKOEKP \ YGR 
LLI \ QPG\ PRFH 

4107 

A 

136 

424 ; 

SPEHPQLPGSLLRPPGAQIPSEWQVAEATAIiVHTLDGWSVVQTMVVS 
TKTPDRKL I FGKGNFEHL\ KKELEAAWGVEVFDRFTVVLH I LRCNAR 

TK 1 

4108 

A 

1 

212 

RPPRRWRGRHQPHGEPAVRSGAPVRVPGEDDAAQPSAAERVGPRAV 
PGAG FTVHQHHRGGLHRRRRLRQ * RHQPHGEPAVRSG APVRVPGEDD 
AAQPSAAERVGPRAVpGAGFTVHQHHRGGIxHRRRRIjRQ 

4109 

A 

1 

1746 

AGRRTPGAAALPRGPRPxXSSAGSRAVLPRARASRCRPQEPRESGGAV 

GRREGPAGGRRTKTERKTTRRSRKMRTRTPRRSCCGESLCCRRGPKR 

VCLVHPDVKWGPGKSQMTRAEWQVAEATAIjVHTUDGWSVVH 

KTPDRKLIFGKGNFEHLTEKI^GSPDITC^FlxWERMAAPTKKEL 

AWGVEGCLTAFTV\VXQHLPACNARTK/ERARLQVA1xAGDARLHRSN 

IjKPDVAHTiYRRVr?/ ^PAYTMf?cinpc:piJ /MOT^nPT.T.PPTCPAKTRKAXi 

DRLRKKRHLL\RRQRxTmEFPVISWGYTN\CGKTTLIKAliTG\DAA 
I\QPPJDQLFATLDVT\AHAGTLPSRMTVLYVDTIGFLSQL,PHGLIES 
F\ S ATLEDVAHS \DLIIiHVKDV\ SHPEAELQKCS VLSTLRGLRLPAP 
LLDShTTOVHNKVDLVPGYSPTEPlW\VPVSALRGHG\LQELKVEIxDA 
AVLKATGRQILTIxRWlxAGAQL*^ 

IIS\NFPYGK\FPJCLFPRMNGRPQKACGVGASPALGELRRYPLCWGQ 
LGVRCSRVxjLWFCTRLASQPFAGMYRACR 

4110 

B 

50 

382 

XGEQLVRQDLDAGVSEHSGDWLDQDSVSDQFSVEFEVESIiDSEDYSL 
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SEEGQELSDEDDEVYQVTVYQAGESDTDSFEEDPEISLADYWKCTSC 
NEMNPPLPSHCNRCWAHX* 

4111 
4112 

A 
A 

1665 
1431 

1789 
1555 

F FVLL VETG FHR VS QDGLDLLTS * S AHLGLPKCWDYRHEPP 
FFVLLVETGFHRVS QDGLDLLTS * S AHT.fiT.T3K- rwnvi>ui?Dn 

4113 
4114 

A 
A 

1740 
1 

1864 
752 

FFVLLVETGFHRVSQDGIiDLLTS * SAHLGLPKCWD YRHEPP 
DLDAGVSEHSGDWLDQDSVSDQFSVEFEVESLDSEDYSPSEGGQELS 
*« v * v A vrtVDoi/iuor "^^"^i-oijAlJYWKCTSCNEMNPPLP 

SHCNRCWALRENWLPEDTGKDKGEISEKAKLENSTQAEEGFDVPDCK 
KTIVl^SRESCrVEEJ^DKITOASOSOESEDYCinPQTQcicTTVce^pr* 
VKE FEREETQDKEE S VES S L PLNAI EPCVI CO/GST* KWLHrPwnup 
TSYGLLYMCKEAKEKE 

4115 

A 

426 

815 

SSRR FVWRAKLLCERAQSGTVYEI * QCAHRHPRHRHPGrrPHRT fiva 
GTAGPLAGYRPFRQHRQSLWPJUVSAICVDAISMRTRSRSTVRPLWPP 
PSPARFATWSHYRLRDHGDHTRPVDLPTSQFTILL * 

4116 

A 

1174 

1354 

FFVLLVETGFHRVSQDGLDLLTS * S AHLGLPKCWDYRHE PLP P APT P 
KHSPGLYSQPILT 

4117 

A 

2251 

2373 

FFVLLVETGFHRVS QDGLDLLTS * S AHLGLPKCWDYRHEPP 

4118 

A 

1027 

1193 

FFFFFFFFFGFL VETG FHR VSQDSLDLLTS * SS RLGLPKCWDYRHEP 
PRPAEEGI' 

4119 

A 

983 

1386 

QEVRYRKVETLRCLLFSSCLVPVCAASPVSRPGCRFLRSSLHWPTGR 

LVFRQRGETFLVPEKTVLRGVASAPAQKAAGRTPVPGRPRDARLRAD 
ARS*SC* RAARPRRGASGAVGARCGPR Pfi FP FT .P g nrz t py 

4120 
4121 

A 

A J 

6133 
333 

7646 
430 

YMLFLFLSTKGWIVIQ>TRQDGSVDFGRKWDPYKQGFGNVATNTE)GKN 

YCGLPGNEQACKI KSF YLKWDFF * LKNIHCWKPVLGS *EEFPDKNVE - 

AKDKGRKAVFSFPKFYFW*EILFCFSFR\EYWLGNDKISQLTRMGPT 

ELL I EMEDWKGDKVKAH YGGFTVQNEANKYQ I S VN\ KYRGT\ AGNAL 

MDGAS \ HLMG\ ENRDHDPFH1JGHGS FQPP YD\ RD\ NDGVTYVWHSLLL 

L*KSH*YHYSESLTIFLIATTSWALTVSHCPKLFMHHSKAFQL*GRH 

SYSHFTDEI*RDYVICPMSHNYPEIKXEFEHSYFLNNEHLDKYLYLY 

ILKCV*KLSFSFPGFSDTKGCKSYYSSIK*QTQSLDGLPQRPSYLSF 

LL * GTGGLWC I S VTLC I APKGKTTVHTS VAVF YG * S AKRNLTTWLF 

LITPNTFSFRLTSDPRKQCSKEDGGGWWYNRCHAANPNGRYYWGGQY 

TWD^IAKHGTDDGVVWMiWKGSWYSMRKMSMKIRPFFPQQ 

4122 

A 

2 

1679 

GRGDKPYPSPGDSWFVQD*ILVQL*ICPQEPS 

QRGHWE EAVGGRRAGAFG VGAQGRAGX3RGVEAGRMRLS KTLVDMDMA 
DYSAALDPAYTTLEFENVQVLTMGNGPSSPHCLTVALLGAWHSDMMI 
LLPLRLARLRHPLRHHWS I SGGVDS S PQGDTS PS EGTNLN APNS LGV 
SALCAI CGDRATGKHYGAS SCDGCKGFFRRS VRKNHMYS CRFSRQCV 
vMiwxu^Myv^n x v-n.-uruM^.r *<AoMKJ^AVQNERDRlSTRRSSYEDSSLP 
S I NALLQ AE VLS RQ I TS PVSG INGD I RAKKI AS I ADVCESMKEQLL V 
LVEWAKYIPAFCELPLDDQVALLRAHAGEHLLLGATKRSMVFKDVLL 
LGNDYIVPRHCPELAEMSRVSIRILDELVLPFQELQIDDNEYAYLKA 
IIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELL 
LLLPTLQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHA 
HilPLHPHLMQEHMGTlWIVANTMPTHLSNGQMCEWPRPRGQAATPET 
PQPSPPGGSGSEPYKLLPGAVATIVKPLSAIPQPTITKQEVI 

4123 

A 

42 

1465 

GRWPPRGGRENALSKTL\VDMDMADYSAALDPAYTTLEFENVQVLTM 

GNDTSPSEGTNLNAPNSLGVSTUIjCJVI CGDRATGKHYGAS SCDGCKGF 

FRRSVRKNHMYSCRFSRQCVVDKDKFJtfQCRYCRLKKCFR 

QNERDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIR ! 

AKKIASIADVCESMKEQLLVLVEWAKYIPAFCELPLDDQVALLRAHA 

GEHLLLGATKRSIWFKDVLLLGNDYIVPRHCPELAFJ^SRVS^ 

LVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVS 

LEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAK 

IDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEHMGT1WIVANTMPTHL 

SNGQMSTPETPQPSPPGGSGSESYKLLPGAVATIVKPLSAIPQPTIT 


/ 
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KQEVI 

4124 

A 

3 

1840 

TSPSEGTNLNAPNSLGVSALCAICGDRATGKHYGASSCDGCKGFFRR 
SVRKNHMYSCRFSRQCVVDKDKRNQCRYCRLKKCFRAGMKKEAVQNB 
RDRISTRRSSYEDSSLPSINALLQAEVLSRQITSPVSGINGDIRAKK 
IAS IADVCESMKEQLLVLVEWAKY I PAFCELPLDDQVS S FMGS S FMM 
PISQFRQVALLRAHAGEHLLLGATKRS1WFKDVLLLGNDYIVPRHCP 
ELAEMSRVS IRILDELVLPFQELQIDDNEYAYLKAI IFFDPDAKGLS 
DPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQ 
MIEQIQFIKLFGMAKIDNLLQEMLLGGSPSDAPHAHHPLHPHLMQEH 
MGTNVIVANTMPTHLSNGQMCEWPRPRGQAGKTIPVTHPIAAGDSQA 
WALVWEKQSKSAYQINDGGDGCPCHGVQTEPCEAGSNCGTWGTLLLL 
LLLPPPLLPPSAVTLLSHPVFSNTTSPEAKEALETIPPVILGTCWTW 
VGTWVRIPEGLQPEKTNPGWRLQQELGVERKSIRKRQTIHQAFEKGR 
RGATVSV3ETNEDTSALDQIMDYVEKQWKQRRPQVAWAVVQVKDDGSM 
D 

4125 

A 

6233 

6874 

VLIGCIPQVPPSISYSPLFWFSAIX3NVAEIFFCLCLGNHNSCWRTLV 
VEVPESLCPHC*RLLV*HKSGDIWVLTPQMLQLDPAALPCPPSSIVD 
AKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQ 
SITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGPCQAQEGRGWSGDSP 
GDRPHTVS S PLS S LAS PLCRFGQVA 

4126 

A 

1 

1481 

GFGVGAQGRAGGRRVEAGRMRLSKTL\VDMDMADYSAALDPAYTTLE 
FENVQ VLTMGNDTS PS EGTNLNAPNSLG VS ALCAI CGDRATGKHYGA 
SSCDGCKGFFRRSVRKNHMYSCRFSRQCWDKDKRNQCRYCRLKKCF 
RAGMKKEAVQNERDRISTRRSSYEDSSLPSINALLQAEVIiSRQITSP 
VSGINGDIRAKKIASIADVCESMKEQLLVLVEWAKYIPAFCELPLDD 
QVALLRAHAGEHLLLGATKRS1WFKDVLLLGNDYIVPRHCPEIAEMS 
RVSIRILDELVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKIK 
RLRSQVQVSLEDYINDRQYDSRGRFGELLLLLPTLQSITWQMIEQIQ 
F I KL FGMAK IDNLLQEMLLGG S PSDAPHAHHPLHPHLMQEHMGTNV I 
VANTMPTHLSNGQMCEWPRPRGQAATPETPQPSPPGGSGSESYKLLP 
GAVATIVKPLSAIPQPTITKQEVI 

4127 

A 

602 

800 

ENPKQQENLWSQYFTL ISTI * KKKSWLC I VAHACDSSTLGD* GGHIA 
*AQEFETSVGNIVKPPSLY 

4128 

A 

646 

1905 

EPPFSTTFALHAFTFFSDQQNQVSDDDLDPSFTVSTSKASGPHGAFN 
GNCEAKLSWPKVSGLERSQEQPPGPDPLLVPFPPKEPPPPPVPRPP 
VSPPAPLPATPSLPPPPQPQLQLRVSPFGLRTSPYGSSLDLSTGSCL 
GRWGRLHGQEPRPGRAGRPLLVRFGTGRVAAALPGLGAWLYNGAESE 
NEEESPRQESSGEEIIMGDPAQSPESKDSTEMSLERSSQDPSVPQNP 
PTPLGHSNPLDHQI PLD P PAPE WPTP SD WTKACEAS WQWG ALTTWN 
SPPWPANEPSLRELVQGRPAGAEKPYICNECGKSFSQWSKLLRHQR 
IHTGERPNTCSECGKSFTQSSHLVQHQRTHTGEKPYKCPDCGKCFSW 
SSNLVQHQRTHTGEEPYKCTECEIAFTQSTNLIKHQRSHTGEKP 

4129 

A 

6717 

7569 

AERSSNRQKSKWELGRKDLDQAKEEWRLPQEEEEGPGAGDESSCGT 
GGGTHRRSKKAKAPERPGTRVSERLRGARAETQGANHTP VI SAHQTR 
STTTPPRCSPARERVPRPAPRPRPTPASAPAAIPALVPVPVSAPVPI 
SAPNPITILPVHILPSPPPPSQIPPCSSPACTPPPACTPPPAHTPPP 
AQTCLVTPSSPLLLGPPSVPISASVTNI»PIiGLGGLREELCAQGLGIF 
QSPWSWLLWPVQKPPHFLLCPLKICCQLLWRSCLCQRRTFLSPLLHP 
A 

4130 

A 

2 

93 

DFTMSHHYHSAHGTLVLAPRALGNPLSLYPSNYTFPFQHSPSDPCTQ 
SYSLPPEYATHGSSCFDTTDGGLNASRALAGSCAPVSGSLPCCPTLG 
IPSPPLPHMHKSLPTYPPTPTISMKPP*PCPGSQGPWQSAFTLPI 

4131 

A 

743 

955 

ESDFTEYFTHNS CLFC YNK I Y I N I Y I KLS LNCN YGNY FLY I VAAHTQ 
TYIYI *NIFCFFCHLLLTFCLGF 

4132 

A 

49 

313 

PLRIWPC*ALFLKLSLGFSLF*FYSSLFLLSPPLC*SLPRVQYIYMY 
ILYICICN*YTYIQVCICNGYMYSCSWCSNVWYGYTENMRT 
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4133 

A 

230 

364 

S E R YL SNP TPPVP P * S AVYL S N PT PHVP P * <5 n P Y T . <z MP <3 P pvp o * qkt 

RYLSNPTTPVPR*SDRYMYNNTPPVPP*SERYLSNPTPPVPP*YDRY 
LSNRTPHVS P * YDRYLSNPTPPVS P 

4134 

A 

508 

780 

VSFSSFPPSFPHLSLFFFSTj^PQPPPQT.PPQQPPT , pprrKTtrr t *tc?/iki 
PNLMYQWG I FKYKY YQC I FWLLSDF * LKYNENDMS VAFGLRTS L 

4135 

A 

34 

329 

DRLKRKNKOSLS YLWEYNKKSNTCVTOSLFRPPnnp aPTT^/\7T? * tmao 

NFPYLSRDVNLKIQDS *RIL1WIKLKKSISKYI IDLLETKDI IRNLE 
SSQR 

4136 

A 

582 

869 

MCQRVQTPKSPSNPYVHSNICTQLFIAPLSIIAKRWKQPKCPSAEE* 
INEM^YIHSMEYDSAIK^FAnjIHVTTWKNFENIRLMKDASHKGLYV 
A 

4137 

A 

285 

346 

AVLFYVFH I P YLLVMTLLLYFLP * LPNOCRFS SLHS KTTAWPSTTVT 
LF 

4138 

A 

97 

298 

DNLTLLPRLECGGMIMAHCSLNLTFSSDPAASAS*VAGTTGT*HHAW 
I FVFLCNWLRNRGVTVKKRS 

4139 

A 

147 

502 

IFHSKMPISEWKLV*TLWQFFKELKIELPFDPAIPLLGI*PKLAVVA 
PQHS 

4140 

A 

435 

660 

NLNQHEKI,SMLE*ILKSYPKILDNINDHEVSHCSKGNNFKNRSKEWW 
CGTQDSWRIiGNLKVGEEEFNDLTRNTTL 

4141 

A 

293 

563 

NVWPSSSVRGRl^*GREGGRPJ?TOOAAnT,npapc;nT.wwT wt qwpt mr 
NM S QGGENL PAGPG s CCGREAQEGAGTS ledap ffeaqllqlh 

4142 

A 

1329 

1563 

WPRNQPTRRPWLKDKLPGVPGGLTGPNPCGQGPPGPPPPLRPSALWS 
SHSGPPPNGEPLPPRGE*YLHPSPRRTSYL 

4143 

A 

304 

589 

RTWWTAAS YFR S Vf?f? ^ a CJMT? DPQ T DPDVPmT r)r>r»uor>t^r^T"T«\ t> ct-> n 
*>•»■«« x-rwo x r rto v uvaortow itr r o I r or I lir y±j±rK\jHoKrvGTT\RSRA 

RGHSRKGTTEEQSQGAIYTVEYACSAVKNLVDSSVYFRSVEGLLKQA 
I S I RDHMNAS AQGHR 

4144 

A 

131 

690 

VSPAVTTSSSLSWARSDLAAEGHARCPLCSGWAQHLSNEEADALSPH 
TRPIPTTCI * SRTSSLRASTS WPTSHPWPCTfirR c?mpvp QT.PPT.anH 

KADNAALGRSRAREPSTLWSTPAAP*RTWWTAASTSAAWRVCSNRPS 
ASGTILNASCPGPQVAPGDRPPPLPFPSLQGPSLSCAGEPLLLAY 

4145 

C 

332 

421 

MEEKI FSQPGMVAPTCNPSTLGGQGRWIT* 

4146 

A 

685 

929 

KVGGAKPSLGYLRLGLRLPVFCLHIS * AGDFPRGV* LLPRRGSG VVA 
HTCNPS TLGGQGRQ I T * GQEFKTGPVNMVKPHL Y 

4147 

C 

710 

892 

MCVCAHTCPQTS I YIHVFTHG YDCIOVTAOKCLHMCFHrHMSTHACV 
DTYICIYIHVKLI* 

4148 

A 

98 

317 

QLTSHLSGGRVNSAWRVXPGPJCDSMCVTPVSVSVGSSLSFHFARAWL 
RIFYGSACEDKPGPPSSHTGNPQRAC 

4149 

A 

656 

1037 

FVQVQAHlUiACKHVQGAYFVFKEQQPEGNVFRCQGVSTS*EHGASSE 
PWHGPAGAGVTTS FS S PFSGGDRHKFE VSGRRLPTEGLPSNS S FES F 
PSSIPWPFLFPQVQLSTIWGLKFQFFGDLKILP 

4150 

A 

1365 

1503 

LKTLQRAGCSAHACNPSTLGGQGRWI I * AQEFNTSLGNVAKPCLY 

4151 

A 

397 

682 

YSKSCTIOVYRMKCQLITVGLSCKC^ ! 
AHACNPNTLGGQRGRI A * GQKFKNS LNNI ARPHLFIOCRKKISLMRWT, 
K 

4152 

A 

477 

686 

LYLCSALKKNQ*EPGMWHAWOT 
RACLYKI Y I FKI S QAWWHAFVI 

4153 

A 

108 

419 

SHTLGGQHGRIMRSGVTTOQPGQHGETPSLLKIQK^ 

SQLLPJlLRLENRLlttGGGGGSETEIEPLPPQPG*TQ*RLPSQKQNKN 

LETVSKKKS 

4154 

A 

175 

190 

DKVRVS QVRLTSRHL 1 TLR * IRKGRGKKRGCRERKRENRGK* KYV 

4155 

A 

301 

962 

IMGDAKNFLYAWCGKRXMTPSYEIPJ\VGSPNGHRFMPEGDVHGLGKC 

GMGNSTNKKDALSNAARDFV1^SGRI1^IYM*EQGSAIGELSHMVIS 
FIRPE 

4156 

A 

20 

430 

RPQTYTMVTIIjTPEEKCAvTALWGKAIWDEVGGEAIjGRLLVV^ 
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VFETFGDVSTPDAVMG I PNGNADGMNVLWAS IDGLAHLDNLRGTFAT 
LTEHD * D * LHVDPE ILMLLGNVLI CVLAHHCVGEFS PSWQAA 

4157 

A 

31 

492 

QRILQSRPFEGRVKQGVTWLLQLVGLVSESHIGPELHLPCSEEECHL 
LVSFGVNSPEDSSPLLTEWLWTSHLPFLLHCLFGASGRGLQTVPPFV 
STALQNVCFPQKGIKAPFLLLSCATGGCSAGLRVYKLRSAPSCD*AG 
RTRGSSGESAPML 

4158 

A 

165 

435 

GSGQADHPCVGWRQNSVNPRQVFLQATG * L I S AS SGS INTNMAQGTA 
WCFGPVYPPKAKGRARSQVTPCLTLPSKGRLCKILCYLFKL 

4159 

A 

132 

257 

EFLIRKGDVTGENGFQTMPTTRQAQEQGQERVGCDWCEAM 

4160 

A 

2 

111 

QLLKDSARSTLVLYPCWARVSQKSGCCPLKVLSQRPGKKLGLLVATEA 
WECPGAVARCLCHPGPLCPASGTFTVPGTEPHS * LLLRNELPSPLF* 
LLGQSEPEKWMLPPQGAISETR 

4161 

A 

3 

116 

GWRAPTSQPGGPAQVPCS S F * KI LPGQHWS STPAGPE 

4162 

A 

273 

793 

DSLLL,IQVDTMPFTLHLRSRLPSAIRSLILQKKPDIRNTFSMAGGAP 
TSQPGGPAQVPCSSF*KILPGQHWSSTPAGPE*ARKVDAAPSRCYLR 
DQPKGPFKVPGTEPHS*LLLRNELPSPLF*IiLGQSEPEKWMLPPQGA 
ISETSLRDLSKCIAQNLTHDCCSGTSCPPHCSDSFPYHLAPVLABIF 
EIMEKVQPWRNPTGSAICPLHRPSQKPACTGLRRK 

4163 

A 

391 

868 

NFQAQTGRRALLRCFQTFYISIVDLCENGGKRPTTNSSASFTKEQAD 
AIRRIRNSKDSWDMLGVKPGASRDEVNKAYRKLAVLLHPDKCVAPGS 
EDAFKAWNARTALLKN I K* KVQKKATCGTQMQTDFP * R * NNQRGGF 
LPKNLTALF IHVLAFVYQ 

4164 

A 

212 

494 

FSEAQFLANVAIKAILSEEISEGKGDRRI4LADHTLILTERGSEGPCC 
* PQEVRPTVIiSEWGCKCAQANCECSP * PWPHKSRAGHFPPLSLGKLL 

4165 

A 

281 

587 

E I F FNPKRE YLS LHWMDEHQC PNF IHHVGQGGLiGTI WQD VLQE YNQK 
NFLYNISLKDKFIiGWMW*LTPVIPALWEAEVGGSLEPRSSKTSLGNM 
ARSHPYKK 

4166 

A 

113 

564 

IDRLDSAKIFGHLSIFDRCEKRRPDLVSGTGTCSGSFRRVTFQPAVP 
GRTDS P I PRTHGPNFVP * TLVEKKPPAAW * TPACDLTS KPS IGPTIL 
VPTPKVE PGS LPTPA* DPLEP IQS PEKK* S I VPQTGS FQTASQLTEA 
VPELVLVAH 

4167 

A 

1173 

1432 

PIPVDLNTLMFCHYFIYVYHPCLEMFSFTCGLLSL*AQRPSDGYFFE 
AFI YI ILFCTMFFLNVQILYSSEKNTVFVDNHS YYTVLR 

4168 

A 

75 

270 

QYISWDTYLLMPRLATAI ILWITN*VFEKRVAHFGYHWSLMVRDRR 
ISGVDRYYVSKGLENID 

4169 

A 

271 

825 

SPPACLVDGQQALPSGGGLYPNSLRQGSVGQAGRGCLSWFCMVHFAN 
PEDQFLGFQASGFLLHVLKPPQGAATPSGSSQGGPAMFSGLPSEPPT 
PPATPQNPI LHS STKLLCEAAAPTRGKPCTPS S I FQPWHRTDGKGRA 
GFSASGTRGRPHLTPPSPVHQTG*EASQKGPALMPPGFQPPKGK 

4170 

A 

1 

211 

TH P PP AE APG AEG PQ * G * AKH P SAP * APLL / PRRRARAS S RP VTS VT 
PSSGGSFRPGTPAQGVITAVTVSA 

4171 

A 

1 

424 

NSRVDDFVRPGGSSAAGRQMRGGGQQVPNPRPGFRGQQP*RNGHPVP 
ASPWALAGDAGGMLWNMMRSWPVSGRPTLNPFNKQSGPRQHEQGPGE 
EVPDVTPEEALPELPPGEPEFRCPERVMDIiGLSEDHFSRPVGLFPGL 

4172 

A 

1091 

1421 

RGI ITEINKHPDHIHSPTHTCRHTFTNMHAHTHTPTYHHTHTGMFW* 
K*N*GSLNSRSGFK 

4173 

A 

1407 

1609 

LSDI^GNFHSKVMTKttVNIKGimjKl^ 
TA*AQELETSLGNWRLCLY 

4174 

A 

1 

294 

FAQPALWR/ CVPFGS * HRLFPFSKKI I PSP * KPGPSTLQQHSDLDPA 
FISG 

4175 

A 

560 

824 

FWKVGCLPWELAHPAYLLPFPLGITSP*ALGFPRTGGPWPTPFPPLP 
RVSRPSQRRLGTIPT*SQPHACPLPLPHSGSRSCFHFWIiVK 

4176 

A 

3 

325 

FFHVSRNLFYFFRYIKHCYFIFSLIILVSGSRVNLLLLLGLIVYCFL 
WFMNF * TW I SHFLS FFFFFCRDGVSHCVAQAALFTPGLKCWDSRRE 
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PPRPAEMFLFLGT 

4177 

A 

292 

543 

SPGTYAPKRIGKGAGNWKPGVPFRSSMWKETQPLSSRVSKVKLSVSP 

S PP T PVPVPTiT.P AT. PTC* RHQ PT.VP S PPT.WP Q QfVTP^ o 
xr xt x xt v xr v xtxjxjzt J-\-i-tsr i\ on Or Jj V xroxr Jr xjrTx\. O O vvjD o x^ 

4178 

A 

3 

367 

IQI^CVLLFPAQAQISAPPLIMKTTPTISWGLBHICPSYPWNHSPSL 
YHR YAMG FGM F FO S G PCS S W A W KL.S R TIj P Q T . VV A T » Q a WP T .TP n * P T . 

DLGPPAPSSYVGFSSQASPSPSPAAQC 

4179 

A 

67 

283 

MCAYIKIDKGNRMYCTNMIYLVEGIGVTFSLTLIWKYTKLIIM 
KRLTLISCAFYVNLWINLYYAMFI *MCAYIKIDKGiNn?MYPTTJMTYT.V 
EG IG VTFSLTLMYKYTKL 1 1 MLS YVKRLTL I SCAFYVNLW INLY YAM 
FIK 

4180 

A 

268 

469 

GNRRALLRL PALG P PR S S P / DG FT PGTTAP P S T T , K"R PPM PT PT,P PD P 
S AAAPGTS ELWTS PD S S S S S R 

4181 

A 

880 

1200 

GNPTWLFLPS YOGNSGPAE I OS * GFTPGTTAR PS T TjTTR <5 PM PTPI.PP 

WAV J. X '14Ji iJi w X )^Wli VJ£ ■»» 1 * JL \?1 X XT VJ X X .fVXVXr X XJXVTvO *T 1 IXt X XT XJ XT XT 

DPSAAAPGTSELWTSPD*QQQQQDGKEAPQSGRHDSSSALRPAAPEA 
LPVSRPPPPPLAA 

4182 

A 

343 

721 

SCLLTVISFHQIKlOflYVHTQLTKAGQK^ 

LAMAPOTSPASRQDIVLG\GPQPPSGPAPTAPQQPA*ARWQLGPPSP 
GPSILPPREGRPSAPSGAGWGGDLAGPSQVHF 

4183 

A 

789 

1015 

MSLMVPLMPYGPG*YLPPSPPAYPIWELPLPQRDSADVSRRLGGRGV 
AVERDS PTEKSN I S KFGLYQVACAVTHL 

4184 

A 

319 

471 

IMWY I H I ME Y I * P O KG RKI L THAT I * M KLED T MT. S E T S O S OKNKYC T 

J ■» x x i i -x * li-j x x x yivUlviVX XJ X ILn X x> X 1X> 1 IxjJ / X I XXJ»J Xj X J\^fiJ VJxvxil XV X w JL 

IS VI 

4185 

A 

68 

422 

GSPSRCPAFLSSPVAARPPLTHFPTGAGLLRASGEPACLQTGSVSQ* 
PPRAPSSGLPLKHSPHQCSAQHSGKEGLGGGTNTTREN*PSPRPLCM 
GAQAGRMLLLPAAVlvNHVLESGSV 

4186 

A 

135 

500 

PPWVTPRWERGWGVIRDTGDRVRSQRQQGLPPKLQSGWEVESWLGPR 
PRVF\PGSVPGTPR\SPAAGGSQPRCPDARPAHKGPRAPGARCGPPP 
APPPGLPPAY*PGRPR/ PR PARR * TPLTAA 

4. 1 ft 7 
*± x o / 

j± 

£ c; 

J D 3 

1 A d *> 

xOFxWFxivjv-Hr K.bvjxibKvj^ 

GRKI PCLGLRGAI ILGPQI AVRLGSRIMARPAAPGLSQARSLARRGC 

PR ZxOf^QO'DPPD'rUM? DlUWdDDJi T>fl RDPPDDT3A "D DDPT T>"D7V"V* "Di^'D DD 

RGLHGGRHRSRRPPRRGPAAPGSPPAEVAGASARGRAPGPPPCPRPP 
PPIPAAPRVEAGPSARPSVPAPAPAPSRSPPLPOPPaAPWnRPA«?nP 

x *^ <*» * av w Ktfiw* •wjxaxvxt *-» v a /li rVi r\r iji\D jr XvJ-iXr \^lrlrxvllr inr Vjv3xVr\0 VJx 

KDPGTAPPPPTRTQSARNPSRSPPPAPPSPASAGASADDPGTPWHRT 
GPPCQSQraKHTGP*AARDQRQSRRKDDSPTGLCHCLRPGAPKCLKG 
LWFAFSVHPPWSSTPWLQSGRVNLPLAQDIFLL 

4188 

A 

200 

390 

GPHAx<5AFFPP*LSEQLGQG*QPGQTPVGRLlxALSVAFPQGVGFPKA 
GGGTGEAFGGLACPLP 

4189 

A 

805 

1030 i 

LVLCPLPHSQLTKGTSHTCLTLQGPL*KLTRLALFRLGAVAHACNPS 
TLGGPRGP IT * CQEFKTRLANMVKPLL Y 

4190 

A 

90 

417 

XJrtXJXi XUO if 1 xjxj X XJxr V rvnir VJXT Wo V VjHXjOxjIxXXjIM X X V X XJX7 OO x* xYOxJivIM 

FSALVSIFFVFCFSSFSACFSLNSKDFF*NPDIEK^ 
QYSKDHPQRENQFR 

4191 

A 

2 

672 

GRVGFTPGSWEAFMVKNIVPKLGMCLGELVINPHQQHMDAFYWVIDW 
EGMISVSSLVGLLEKHFFPKWLQVLCSWLSNSP1TYEEITKWYLGWKS 
MFSDQVLAHPSVKDKLNEALDVMlSrRAVGSYVG\AYMQPG\ARE\NIA 
YLTHTERRKD FQ YD AMHERRAG PQ IGLLRS LG KNLKI CAKKE PLE I Y 
LWANLKEYYLAKTGQGPPPFFLTRAPEGLITAWALWG 

4192 

A 

1642 

2785 

RMTSSCPMADRTCPLPFSlxRLIWEVWMPFVRNIVTQWQPRNCDPMVD 
FLD S WVHI I P VW I LDN ILDQLI F PKLQKEVENWNPLTDTVP IHS W IH 
PWL PLMQARLEPLYS PIRSKLS S ALQKWHPSDS S AKL I LQPWKDVFT 
PGSWEAFMVKNIVPKLGMCLGELVINPHQQHMDAFYWVIDWEGMISV 
SSLVGLLEKHFFPKWLQVLCSWLSNSPOTEEI/TPKWYLGWKSMFSD 
QVL\ AHS I WSRDKIT^E^D IMNRAVS STVGAYKQPGARENIAYLTHT 
ERRKDFQYEAMQERRRPENT4AQRGIG\VAASSVPMNFKDIiIETKAEE 
HN I VFMP VIGKRHEGKQL YTFGR I VI Y IDRGWFVQGEKTWVPTSLQ 
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SI»IDMAK 

4193 

A 

264 

o o o 

t^oKxKXi\AJ\UPlAl£?il,XljPbFV^ 

x i rvW i ±jvj w is^3 1*1 r oJuy vjjAri±/b VJ^xCrNEAliDIMNRAVSSNVGK*GFP 
FWMSWPQVSDFTHVNDTESEFSGSCRQPKP 

4194 

A 

2413 

2707 

EMHVY/WTERSGSRL*SEHFGRPRWVDH*/RQEFETQPGQYGDTPSL 

LHPLLSTSCPSFLSLFSFLVTFFFSSSFLSLFLIYLLSRHDFLACDC 
FPLFLD 

4195 

A 

835 

1050 

GSR*HNLQIIGRPRRADHLRSGV*DQPGQHGETPVSTRNTRISQAWW 
WTPVI PATQEAEAGESLEPGRWRLW 

4196 

A 

2 

112 

SHI PAASRR I I PQVGHPPLPLPPPPQSHAGVAAGDE 

4197 

A 

36 

172 

GQQICPENKSHIPAASRRILPQVGHPPLPLPPPPQNHAGVAAGDE 

4198 

C 

1 

1482 

MVTHACNPTTIGKIiEPSYVIRKFLDAQRIHNIiTAYLQTLHRQSLANA 
DHTTLLLNCYTKLKDSSKLEEFIKKKSESEVHFDVETAIKVLRQAGY 
YSHAIiYIJ^NHAHHEWYLKIQLEDIKNYQEAl,RYldKLPFEQAESNM 
KRYGKILMHHIPEQTTQLLKGLCTDYRPSLEGRSDREAPGCRANSEE 
F I P I FANNPRELKAFLEHMS E VQPDS PQG I YDTLLELRLQNWAHEKD 
PQVTCEKLHAEAISLLKSGRFCDVFDKALVLCQMHDFQDGVLYLYEQG 
i\Xj1* y y xrwi i iiroyHx^y YKQVI S VCERHGEQD PSLWEQALS YFARKEED 
CKEYVAAVLKHI ENKl^MPPLLVVQTLAHNSTATLSVI RD YLVQKLQ 
KQSQQIAQDELRVRRYREETTRIRQEIQELKARVKAMPPPGKVPRKE 
NLWLQC E WGS C S FVCS TMEKFFEHVTQHLQQHLHG SGEEE EEEEEDD 
PLENSVNKDIRNKSAILPSRSNC* 

4199 

n 
*\ 


"1 O Q Q 

ELG I VG KHAPTGGAGYH FLLGMAAEVLS KFTAVLEGAGTAFPATQHH 
RVVLADCFVPTETDCVLPHADKTTQDCRRNSVPGATHSAPTDNRKWS 
GI SGMTFRLWKSRMQGPRSAWLC* ACSPHCFSLVW 

4200 

A 

1 

587 

GAEETWEPTDGSGSAAQDGVQWCDLGSLQPLPPGLKPSSHLNLPSSG 
I AG * RPCJUjPGKVPRKENIjWLQCEWGSCS FVCSTMEKFFEHVTQHIjQ 
QHLHGSGEEEEEEEEDDPIiEEEFSCLWAGMWLLFSGQFCLTSSAMSY 

r i n x i\xii\.yw^ijy/Ujy^yA^ljGPCI PGISRAGTSSLISLTTSCVWG 
TTGEILR 

4201 

A 

262 

409 

RSGGRNIS SRKS I TAHRS I S APGS I S AQGS I PALKS I LAHGS I PS QG 
*DTPLGWDAPMG*DAIjEGWDAPLG*DASRG*DTPMGRI)AXiSG*YVPS 
• J '- > »»«««xjr- i^xi.r jjrynr ±r XilxM vMJfc' ±NirXoAJiJL^L.GRGAGWQEWMC 
PGLLQAPVPNHIHSCSPTASRGLRGQLSSQLPETAKAAW 

4202 

A 

729 

922 

WFETIEATQLYFNELPDTFHCVLCFLLCHFYINVKAFCCCCCCSCC* 

NTCSLI FRPTVYYAT.MT. 
a viJijir i\-jr x v x x " > 11*1 1 1 

4203 

B 

3 06 

392 

LTSGTYPKSGKAVEEAQPNS PPAEDVRDR * 

4204 

A 

965 

12 98 

oimjjv x>ir yjjv jjuxi*jJJic>ljxKi>XL.r IjLHHQKLHISVPQSPRWGWTSLG 
IiAGTPRCSPLPTPPRLLLPQPSRPKGATGKIPRPDRRTPVRPIiGHKE j 
PRGT * GKS WPAVAQP PG 

4205 

A 

1 

324 

AGS TTCAIiAVAVAQPTDVVKVRFQAQARAGGGRRYQS TVNAYKT I AR 
EEGFRGLWKGTSPNVARNAI VS VLVELTFDS VGGLiVPAR I PSVPSLG 

4206 

A 

310 

513 

VKERKEDWKTRYMNSALGQ YS S AGHCALTMLQKEGPRAFYKGFMPS 

runijuQni\i v vrii: v x x xi»yxil\JKA 

4207 

A 

65 

191 

QE I S IMVGFKATDVPPTATVKFLGAGTAAC I ADL I TFPLDTAKVRLQ 
VRG*SLGVLMVSTLFPPQRHRPLKGQCLEHRDDWRWEGQHAYPCSYP 
VLAljQIQGESQGPVRATASAQYRGVMGTILTiWRTEGPRSIiYNGLVA 
GLQRQMSFASVRIGLYDSVKQFYTKGSEREYGARV*AGNQHHGWVQG 
HRCAPYCHCEVSWGWHSCLHRRSHHLSSGYC 

4208 

A 

12 

237 

NFKYILFKILHI^EKIYNNVYTCFNNLFHCTTYILYNSTINQP^ 
S *HLNKNLPCKKHSMGHYKWFKIFFK 

4209 

A 

361 

444 

QRLLKNNCVI TCS * QRPRVFS S S PQGLAACH YRGLRS S P QSRGRCL S 
HFSCQGVHPRSCQRGSAWSPRTTGQRGPTVGTGRTKRRLNTAYLLGA 
ASADILRPIPSTSR 


low 
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4210 

A 

89 

198 

AP P * THKWKI PHPTS CDRSLSTCWHTTVnciTQT .PPK- 

4211 

A 

119 

699 

PTLMLHYIPWGRGRLGGWGQCGQSHSARCAGTGTCLPMGTLMHMHEA 
SRQVWGGCILPLPFPYPLPPQLPS*LGPSNIWVAKDGWLRDLLDPP* 
YEWTLGVNWPS PG I LAYWTVS APS LG PS G P T TD A AT ,ppd vktt jnn o c o 

HPRRGSRFQKTGLSLTEGNLPFPPPIPPTPYLPHFLHIRVWPANVEQ 
MRCRK 

4212 

A 

728 

916 

FFFPI*SV*KVSAOG*SLWPGVAAHTYHP<?TT.nnnr^f^PT'f*r*rki?i?v''T» " 
SLGNIVRPHLYKSGG 

4213 

A 

1 

319 

NTLP P W F * L P S FS AAAS VQRP PA P P ALAS PS TC P AAS APAP AG CARG 
RRAAAACEARPALL P P PAP PARRRRRPGS PR A T .r c. a p n p c nr»p r> a 
GTGGGLCGMWGP 

4214 

A 

1 

416 

FFFTEEELFIVGGWNCRKVSSAVGRDPSQ/PGS*GWAGRHPPRFPS | 
PSAHPWGAAITRQIPHOT*GPGEPR*SPPWP*T.PYTfQr:w-PT.*TW'r>PLJ 

v «-*" i **>** vJt OJETJtrrVjr XJtr X JvOO nfJciXJ X X\-rjtvlnF 

GELAAAAKLQEGCGCGEPQALLPAAKPPGSTSSKREHLGASSSSC 

4215 

A 

582 

913 

GRAPRLSG PAPAPTTPPVGG S PARAQPAAGG CAAPASGS GRLS RATP 

APRVPELRWPPQQQALQGP*APPRi5R* S *LRSPGQAPEQALPLRPTP 
PTGRKVHGPPTLRPAS 

4216 

A 

162 

530 

LLPYGLJDRGPYGGVWCGI^PYP / c:f3T p \rQ , Trioprk'D r P7\ o-oT nvncc /r^r»/-* 
*vwa, a wvj v »» v^vjj_ixt i jt / ooi v o X oir ir yiX. l/^oirljlT JS\/o££ / GJrQ 

PPEFAACTGTDAI * DARDAGDAGEGGS S/GAEPASSPAGAAGGGCGL 

AARSQVPSEIRSGWLKGSLPQLASSGALVICP 

4217 

A 

160 

835 

GGCGRRTVLVKDYQFFAEGRxjIQFRWENAAARFSALAPDMQRQNFRP 
PTPPYPGPGGGGWGTGSNFRGTPGGGGPRTALPSRRVRGVRTTRRRT 

GPRS * ARNGSKSLSAKRRr)T.QDnaT>rPCDCDnr»vnnc'vnrinT»Rr«n/^ 

"^^u^iwjjonj\i\nyjuoKuj^r vjojet ojtCjCjx PGSYSRSPAGSQQ 

QFGLLPRAAADPPPRVLQGHLHHLDQGVLEIOCECLMSWKINFKAFK^ 
LKI PWG WPRTOG I CSG YKPNNTAT T .KH Q nzv V"K-prvrpr» 

4218 

A 

2 

668 

TVCQHTQYKESRESGAFCRRAWGLSGGDKGKQKPMSRSPGLSPQASP 
QKSLLWGEQPCIGHRCTKLTVNPKPPQTSQAACPHPEPSGLPCMESH 

SQKSLFPESAIVARGHVFOORL.G c ?f?PT.r , PT.PTT qdwcdddt t»ot tvtu 
* * " VVJ1 * * *■ wv^xi^ooxtxi^JctJLjit l Xjo ±t IvJojKjet xtXj toJ uft | fH 

LGVPPSPRYGPR*EGLGMGEAQNAGEMRPGEPWGKPSQGPPTELAXG 
FVSPTPFPSPLPPTHSGGPHQDSWAQCGSPTCLF 

4219 

A 

198 

394 

AQPAVRPRFSASCSQPPFTPPLSPHLRARARPRRERQKYTPSLDGFF 
WGETPAGGCGNHIHPWQG 

4220 

A 

318 

718 

YHHLQGRDIPGNLRVEGSVPHPAVRPRFSASCSQPPFPPPLSPHLRA 
RARPRV^RQKFTPSLDGSFWGETPA^OnciHTUPMor'PT? it ct>t>tj cr\ 

* PRR*AHMEGRVGRRSPLLHPPWEKTPS*LSGLTDRKQA 

4221 

A 

1 

563 

ETFSSHFERPYIjQFKAEPPLTSRGFVEK<?TAT.HPTQi\PWPPCMMCir\7 
HERALQQHPQAQGGLLETREKMHADNNFTKTQDRLHASSDAFSGLRF 
QSGTSRGYQRQMVPREEIKSTSAKSSWSIxADTAPNFSRLLSNVTVM 
EGS PVTLEVEVTGFPEPTLTWWVAYNDKP * METNS ITEKNHS VSTN 

4222 

A 

366 

622 

LSTI^HQHLLSVLLKLSQSLYVVYSFFLIQSTLQSGTRVTFLKCRLQ 
AGAVAQACNAS TLEG * GGW I ARAOEFETNIiGNR AR Pn. 

4223 

A 

43 

245 

NGWVGDVDDYSLHGGCLVTRGTKWIANNWINVDPSRARQAL 
RLAREGGTDSQPEWALDRAY 

4224 

A 

634 

1316 

GSGPHSPGKWMVDDSREHSRDVRSGSRLTL1WTE 

D FHKYMR S HKAE S S ELVRNSHHTCLYQGEGAHH I MRAI RQRVLRLTR 
LSPEIVELIEPLQVVRYGBGGHYHAHVDSGPVYP*TICSHTKLVANE 
S VPFETSCR YMTVLF YLl^NVTGGGETVFPVADN I T YDEMSL IQDDVD 
LRDTRRHCDKGNLRVKPQQGTS SLLVPTTCLMGKVGWVT 

4225 

A 

506 

709 j 

FLMRSEFIi*KVFPHP*HS*GFPCVCSLMSCVG*LLGEGFSTFMTFIR 
FFPWMCSPDDSGFEIFPLIS 

4226 

A 

1 

1662 

MIISIDAEKAFDEIQQCFMLKTlANKiGIDGTYLKIIPJVIYDKPTANI 
ILNGQKLEAFPLKISTRQGCPLSPLLFNIVLEVLARAIRQEKEIKCI 
QLGK^EVKLSLFADDMIVYPENPIVAAQNLKLISNFSKVSGYKINVQ 
KSQAFLYINWRQTESQIMSELPFTIASKRTKYLGIQLTRDVKDLFKE 
NYKPLLSEIKQDTNKWKNIPCSWGRINIMKMAILPKGIASFTGYCI 
PKPEVILKLETGKEPWILEEKFRSQSHLEEPSEYNNNGNSFWIiNEDL 
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IWHQKIKNWEQPFEYNECGKAFPENSLFLVHKRAYTGQKTCKYTEHG 
KTCYMSFFITHQQTHPRENHYECNECGESIFEESILFEHQNVYPFSQ 
NIiNPTLIQRTHSISNIIEYNECGTFFS*DRVSLCLPG*STVARSQLS 
AASISQ\VKRSSH/CLCLLSTSDYRCMPPC1ANI,CIFCRDNCAMLPR 
LVSNS *AQNIfKL I SNFSKVSGYKINVQKSQAFIjYINNRQTESQIMSE 
LPFT IAS KRTKYLG I QLTRD VKDLFKENYKPLLSE I KQDTNKWKNI P 

CSWVGRINIMKMATT.PTC"fiT 21 Q T?Tf2VP T DTf DTJUT T vt pmnvx»DMTT c»r* 

KFRSQSHLEEPSEYNNNGNSFWLNEDLIWHQKIKNWEQPFEYNECGK 
AFP ENS LFLVHKRAYTGOKl'CKYTEHGICTr'YM QPPT TUOOTWtiD ttktv 

YECNECGES I FEES ILFEHQNVYPFSQNLNPTLIQRTHS ISNI IEYN 
ECGTFFSEKLALHLQQRTHPGEKPYECHECGKTFTQNFTEHLRRHTG 
EKPFGC^CGKTFHQKIJUjIVHQRTHIROKPYGCNECGKSFCVKSKIj 
IAHHRTYTGEKPYECNSGGVRPALWDERSCGYVSAGTKRAEGEVWKG 
QGEEMGS I VERLVTLSK 

4227 

C 

472 

681 

MNHKIYVYGHTQQFLYCSYVFKRYEFYSNPTLQGFLCSPHSYFYKYL 
LLTGRNLAII ILNIFAQWYQSL* 

4228 

A 

2 

277 

RAEI PTAGEVWDGAGPMGAELKPWARQRKVEMDHVGPLLSPPPGRS * 
SPGPGRGRSRWTMLGPFSPRPQAAVRGPRPGVLGSPREALEPPGARH 
PRAVTAAPRELQGCGSTVRAVAR 

4229 

A 

1 

240 

ALLGWFVNLYKEITTTTPSFRRRSYILGFVKLYVSEHFHFFFWVLYY 
FL*MHCEKFYFRRCNAGRRSQIMYIVFLKSLSS 

4230 

A 

1508 

1686 

I FFQPKECLYK1^*GLGAVAHACNPSTIjGG *GGWIS * VQEFETSLGN 
MAKPRLYKKYKN 

4231 

A 

596 

729 

YFPNKHGILCRVQWLSL*YQPPLWESEVGVIiLELKSWRTAWATW | 

4232 

A 

1212 

1344 

ATCLSLEKIQKbLIQNPGRR*ASCDSSLLLGCSPDLSSHFTVIi 1 

4233 

A 

2 

114 

NRPCSQKAQWLI PMLSRWFLAWELLS FAAPFPKLCSS 

4234 

A 

1 

326 

SWTAPEEALFSSRKLDGGSQKWLIGRGQASFQGSAVPSWFREGRAWL 

SIxALSLSPCLSITTFPPEEKNYLPCKARFYTDFTNCAKNRPCSQETC 
LE ILGWTI EAQKDT 

4235 

A 

37 

199 

VLQLFKMSSLPLMITDTQQAVSSGLWIGIQIQSLTQRGRGCAPRMPR 
KAYISLT 

4236 

A 

3 

263 

TTHAAHALRIGSRIRDFPGRRFRCYHHRTSLILTDRDKYPFTPVILT 
SLHSFCFLLPSFPRQSPPAVYSNYVGNYLFYLLNKRSDG 

4237 

A 

164 

298 

S S S KRNR YKFKKKRTS YKVNTAE LSS LEANWR VCAG S CRDTTRL 

4238 

A 

60 

152 

RAGWIFS ICLVRKS I FLLKKSLASSAVFST 

4239 

C 

512 

805 

MS S LPLM I TDTQQAVS SGLW I G I PNYRVFT PTGTWVRX PRATES LH K 

FNLNRTWEMGLQRRAVXTXGTNVKALESAQQPWNQAGRGDNFLLWRK 
GGD* 

4240 

A 

1 

361 

E PRML YTRYVSD PEFVI S RVDDFWLOL FRMS S LPLMI TDTOOAVS S 
GLWIGIQIQSLTQPGSTIS*YCSFFKSVIICHLMIT*YTTSSFFRPC 
GLASQI PES YPAGTWVRAPHAMEKLT 

4241 

A 

545 

865 

PIWKHPYKSVTILSLCAVLFSLYIFNSGI^ 

LF YQTFVLRTKFL * 1 1 * Q I KAEPL I VEHK* KI I YKKP YLFNTQVIxAL 
KLKR*GFLYIIS 

4242 

C 

47 

319 

MVFHKARTRMKERTDTAMQAESASETNKRAHPALPPVLPGXGRWARR 
GQEAPRPHLLSPQSGQGGQAPPWRERGVTRLKLGEGCRGRLAP* 

4243 

A 

211 

472 

STLGGSLCAQSGCSFEPLPLRGPHPAMPVKCAQPLFS\SPCPPCLPR 
AAREGEQGEHACWFQMHFLLALPYLCVPSSWSWLYGTPCF 

4244 

B 

1 

1023 

MCWAPS TRGG S PALHLPLLLLPHPLPARCTGFRAPAVPS S PAG I FGH 
IAVIGSPFPTAGGNTSDSFSNGCSQDVGSPRLCGCGLLGVGIWLSVS 
QGNFATFSPSFPSLSXANLVIAIGTIVMVTGFLGCLGAIKENKCLLL 
SFFIS PVAEL I LL I LFFVYMDKVNENAKKDLKEGLLLYHTENNVGLK 
NAWNI I QAEMRCCGVTDYTDWLPSAGGEHGSRPLLHGRTPRAAGATA 
PRLCGERAAMKRSWGI^FSMTLFQHIHRTGGiraVHLGPKIFASFPGA 
TDISSGLGGDYIICDPKEKKYRKPKREPHLQ* 
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4245 

A 

630 

841 

DTGPCWRERGVTRLKLGEGCRGRLAP*HLLPPPKSSSQPCAPLTPTA 
GWLPEEKAEVCSGCPPCSPSYKP 

4246 

A 

29 

360 

HS LI S QRE KES ERERARE S ERE * * RAS ERERERGFJDKI PTRKGGEEV 
QFLQTIHFFFLIFLTAFFEQYFKERGSYYRYRGGKTUCKGGRGEEKNfS 
RSRNPSRSFTGRKKRV 

4247 

A 

253 

451 

NWSAGAALLSAAPAHTPS PHQARPTPGS PPS AALWGS PS PSRS PPHF 
SPTPDL*MRPERHPSSAPF 

4248 

A 

355 

500 

ARCGGMHLYSQIFGRLRWEHCLR*KVIjSQVWWHAPVFADIWEVEVGA 
LLEARSLRL* CAMLLPVNSHGPPTWAMQ 

4249 

C 

175 

387 

MLKAQHLQGPAWGPPMPGLFQLRSAHGSHGAAAKAMVQRQMPPQAVT 
GTXXXQGGHCPQPPRAXYAQDAPG 

4250 

A 

342 

608 

AGSGTIGMLRPADERGGQSRCRGSAHP I SQAAPCPAFLNRPFTGPS / 
CGPSVTSLRNTAGFCPE\GPGCPPSLAEASCPLPSCRAGSHWA 

4251 

A 

56 

277 

TVS VQRHQAVQS PVWFTEH I PQDAWCS PSCAPGPN* PPLCPTNKPC 
ALQQGMKPLALKKSLGGTVCCGSKRD 

4252 

A 

235 

-721 

DLADCSLQLQSRSSWKKCSTMLQKVliVPPEAATSLSHPTPRPIiPHPP 
ALGSPLQPPSNTFGP*CCLSQDAPVFPSPVKLHLVTVPASSLANESR 
KGPSPRVFSRTPAPSSSSITSISDSGIQNATWSGVRLSVSASSFGL 
LTSPFSASGVSLSVSPLASTL 

4253 

A 

1866 

2283 

IRILYRSPHLQLPCLTRFLVKKSYSVTGNWMVSLKD*SLRIiDCWHGP 
LNLVKLDEVWONXiGTSTKS LL *NNFKTJTT.GKTNTFK^T.FnnnMKT.TP 
CKVSFS I YLMVNLESHDSTWHLQANQTPHFDVS PVLVLEFPFKF 

4254 

A 

2 

767 

ELLFDKSTMRRPFGQKASSLPPIVGSRKSKGSGNDRHSKSTVGSSDN 
SSPQPLKRKGKKEDVNSEKLTKLKQNVKLKNSQETIPNSDEGIFKAG 
AERSETRGAAEVQEDEDTQVEVPVDQRPAEIVDEEEDGEKANKDAEQ 
KEDFSGMNGDL* EEGGREATDAPEQVEE I LDHSEQQARPARVNGGTD 
EENGEELQQVNNELQLVLDKERKSQGAGSGQDEADVDPQRPPRPEVK 
I TS P EENENNQQNKD Y AAVA 

4255 

A 

79 

928 

TWQRCNLNPCLH I PVLCS LS STMWDVLLPIiDRGDLHLSC IDS SNWAL 
FRCQSKVVLYHCVEADSKGLRSVLDELTLNKTNLELKMESQTBKLNY 
LRNNHEEAGTTQNE FKVS VCQ YL S R YGGGG YH YKRQS FHKTQS KELM 
GRHRIjLLT I ANRHE I AELR WTLOTLR T.F.T .O AOWPT .VP MT .DO * P P * PR 
LTC*R*RENQEPPVQVVEEQPRANTKETRKKSSFSQPMSASTKEESQ 
DGRRKGK*LKGRAREGKKLHQKSLGFKEYFEGRGAGPTPSGPTVTQL 
K 

4256 

A 

235 

674 

SGKQNNPQATAKAEACTETRSQMGATETS STFDYLGTASLKS PPHRK 
WFFCFTIETSLHSNSL^ILLPLHSISNIQQKTQTSNIiNIllLQPATA 
GPPSQAPPTALSELPGGRSAPEPSHAAPFPARVPPPSSGSPRPNARW 
AARAA 

4257 

A 

60 

473 

CIiGSLQPPPTRVKQFSYLCLPISFFFFLFFFGLFLRFSLCHRF*AV* 
L*CSLVLFSSCFLCLGFAMVFGSVGIi*FSSSLECFQALFIiQTFLLSF 
PLFFMNSN*PCIRLTEWPRLSDALFIFKNSFSCCADAWADAW 

4258 

A 

392 

517 

MTL KDWAHACNPNTSGGRGGW IT* GQFKI S LANMAKPRIiY | 

4259 

A 

446 

767 

IWYKSTHSSVIFKCEFFLQFLSQSLALSPRHECSGAIMAHCSPSLIX3 
S SDLPAS FLNFS WR * CLHYVAQPGLELLGLK* S I PPWASESAGITGL 
SHHAWPKYELLLF 

4260 

A 

119 

342 

ISICIKININIIYACYVYIIILCIRIYLSKYILLENCI^CTIIiFYVI 
FYVYI YLHI I YILLRLCPFFIYSWSPV 

4261 

A 

374 

471 

GKK* KCPTHLSHLY INPGEKSYLFETSGLNF 

4262 

A 

255 

496 

LYKFQTSVPLT*FIIKAIiYITFYYNSR*RGYYSKFKT*S*RPGMVAH 
ACNPSTLGGRGGRIT^SQEFETSLiANVAKPHVY 

4263 

A 

231 

417 

CELPVTFJiGAGC^CSSDTVIjP*PCPFSPVSQSPSVCIiSPL,FSDPCEE 
PPYPVLESLPDCSF 

4264 

A 

1204 

1398 

ASFLAISMSWRVGVPFYHLFFLRQSLILSPRLVGVR*CDRG*LQPL 
PPGYK*FSCLGLPSSWDF [ 


1. 0 Q 8 4-ft 4-3 . 01 H H ft O H 


4265 

A 

581 

705 

LLAECGAHAC IPS TLGDQGRQI T * GQEFETSLANWKPCLY 

4266 

A 

3840 

4074 

AQEYRGQLRQYSETMSLQKKFKNRSAGCGATCL*SQLLRRLGWEDHL 
SPGGRGCSEP*SHHCSPAWVTEQDLSPCHP 

4267 

A 

173 

421 

PRPIaAHRPRKQQYRLLLRSSYQVSLAGWVVLRQKGWMGRKSHPKSPG 
GGQAQWLTPVI PALWETKVGGS PEVRS LRPAWPTW 

426B 

A 

423 

595 

SCLS INFHLGQAQWLMPVI PALWEAKVGRS P * VRMVS IS* FRDLPAS 
ASQSAFNISH 

4269 

B 

1 

1224 

MPTVGKAQARRRTRDTLLLDITQVLIDQSPRKDTGLAQQTGVIVPSK 
GIVKYWRAQLKGDMKYASPAFELFYFSVRAQSTDGIVTLLDPTrRED 
CD I S S APN PE VRTQRQAEVAGVGLLAVKVPRDEEG VLGGAPG PQKRS 
RVCGE PTLQRRS G PL P VARPG SRCP PPTS VS S PVPEDE YGRTWM IR 
LDNPGISGRTIENNPPELEKQLPGEPSENSPYLGPPQVLFLLRDPRE 
D VMLLLNQT P TTAE KQ AVLQAAE I FRNEQQI S YNTS KGKKGDRECEE 
IAETPFQIGSEAVPLDNPDWNSSSSAGEWKRRHFLICIIiEGLERTKA 
KFLNCSKLSMVDQKPDENPAAFMERLREALIEQTSLSPDSVEKQLIL 
KDKFVTQSASNIRRKLQKQAIGPISTLKNLLK* 

4270 

A 

1016 

1393 

IFQLGEYIFLFLFSVFEMESHSVTQAGVQWCNI*SSLQPLPPRFK*FS 
CLSLPSSWDYRQMPPCLANFCIFSRDGVSPCWPGWSQTPDLKRSAHL 
GLPKCWDYRRETTAPSMFLQNLLTUjVPMQQR 

4271 

A 

825 

1246 

RAEGFCPNWCCQSNILIPCHDPGSRSAQSDWACHLGWQGSGSGQNHG 
VPAFGDALQLPGNTSRVGAPQGGSGRGRKDGPGNSH*AHGTGWRECY 
GDRSRKRNRQFWHLAPPRSEIFLIILACTLRSNGSASCPRSCFDSSF 
LRVPRTGRFWGSTWGVTPGI 

4272 

C 

442 

522 

MRCASPSTSKEFSCALLLQCWLQDLG* 

4273 

A 

322 

1025 

PLLCVLKAPPEGAVGDICKKEDAGNMPSTSEGSIYLEMAHFLRNKIA 
GSSVRKPDSGFLWEGALRAWSNMAAPSGGVNCEEFAEFQELLKVMRP 
IDDRIVHELNTTVPTPSFAGKIDASQTCKQLYESLMAAHASRDRVQK 
NCI AQTSAWKNLREEREKNLDDLTLLKPT* EKSRQS * KWVQSELNV 
VEWNDRSWKVFNERCPNSLQASKE*IKRDSFFFFFFLRTGSSHKS 

4274 

A 

22 

54 

TSRLFHFSFPR 

4275 

A 

506 

623 

TRDSAHAYNPSTLGG*GGRIA*VQEFKTTRAHSYTLIPT 

4276 

A 

296 

495 

KHVRPYITHSLGGKKDFLILEKIWLGIVAHACNPSTFRGQGGRTA*A 
*EFKTSLGNIVRPCLYLKT 

4277 

A 

270 

558 

FW I FLRS F YV I RKHL I GHFHP WPRVLKC * KHGG WKVHGGPGQ * RQKG 
TGKEGRKGLRGRRELTEEGARPLFSSGPVESTQIVPLPQPPYVGAAA 
S 

4278 

A 

51 

113 

I SHHPRPHEHKCAVS S AIGR 

4279 

A 

179 

387 

I LVGFCCLFPPPSNK* * THTHTHTHTHPHTHPVVTNR I S LQG ICRKG 
KQKTSQGPFTATSLS PLSDS AV 

4280 

A 

464 

596 

EFSPGMGAYTRSPSTLEGHSGQIT*AQELETSLANRVRPCSYMW 

4281 

A 

468 

564 

YAFYRTQVPGGIRLV*CRLRVLWSNILKKQS 

4282 

A 

780 

839 

SQHFGRPRRVDHEVGRWRP 

4283 

A 

993 

1052 

SQHFGRPRRADHEARRSRP 

4284 

A 

175 

387 

HFGEAEAGWITRSGDRDHPG*NDETPSLLKIQNISRAWWQAPWSAT 
PEAEAGEWHEPQEARACSEAEFEP 

4285 

A 

161 

409 

PPGVIYYLFVKVRLIWGNVMTV*VYMTLIiSFPYPQICIYTYIKYVYI 
LPI*NIYIYTHICIYIPLFLCLLILAIKEGAAFNV 

4286 

A 

1910 

2120 

SVSGFWLEAMLPSIiSEGQPCRLQPFIGNKSL.TGWVAHACNPSTLGG* 
GKW IT * AHEFQTS LANMMKPCLY 

4287 

A 

1156 

1320 

KNSS*FFYFLSYIjKNI9^RPGAVAHTFNPNTLGGGWIT*YQEFETSIjA 
NIGKPCLY 

4288 

A 

1497 

1767 

PSAS PPQVP I PALPQWTGPGR QNPIi * CLH IGLCS PQPSDGPWRKACS 
LFQVPSHQCGR IRKKINKVH I CVFTASRVRWKPKPAAANDQL 

4289 

C 

1434 

1655 

MKKFSHILKXKPFCTLVPIIMLKIiMENIiKSQCICTKEGKYCTIjAENLF 
HHKIHCFPSSDHRSHFSLELAWKKL* 
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4290 

A 

275 

485 

TTC>7KLSLCEKLYSTHFCTDHKAVL 

4291 

A 

889 

1199 

SLQGHV 

4292 

A 

502 

768 

SPCDQTPVEEGMLPKFRKSLQGFSLNFHDILILPSQK*FCIRGSKTM 

4293 

A 

3 

309 

IFGRDGFHHVGQAGLD*TPDLK*STHLGLPKCWDYRHEPPCLARRYF 
KNNNKKDNLLSHHTHKTTVNI LPI FS S CTDVFLHNHDVECNVFS LDL 
VNIFQSC 

4294 

A 

204 

317 

NSLGRSRHCESQADWQGTAKTPPFPAPAGKSSALCSP 

4295 

A 

1905 

2032 

VKHLDGTQK*GCNAFNVGQSGPGPDAAPRWASRTPGGVNAGI 

4296 

A 

2 

225 

^"v v * vvj_ii>r Lo i rl v lolr't»i>WboCS/ PPCSAQPYPAPVSCDGPPHRFPE 
PLLD*GRLKTYFPPPCVNWGCWGQCVISQ 

4297 

A 

299 

663 

IAHITEIRNLILHSLNNPLWLEPTRIEKFSPRGKVTSLHPRVKKMSP 
SSPKNSHPPEEVGSASSSFLREPSPGGSQHSPYPAPSIDAMVPPYRV 

*r X ir A^ VJJL IvC*L^XjI\. A IS_fc*J_i.C Hoi JL IT Jj.l-i.lrl_ V 

4298 

A 

1 

406 

MDWEGSLPLVFNYCRDASLIIHPRFKGVRPRRDACLGPSPL71ASPAF 
LGKGQTITDAELRVTLTVEDSARLHPGEINSHVAHTKPVWWSLHTDA 
RETTHLQRTHFQQSWLV/CIRP/MNPAKSTVSSVRTQT*APLP 

4299 

A 

236 

768 

VPCRES PRALALPRKEMMRA* NQNKTKHDSRHCRI S PVGNMDI AENN 
YNSFKAELGPNS S SASTPPP YNLFIAS PPHTCSGLQFHS TITDAELR 
VTLTVEGNRMSVVRSSFLGLNQLKLAKSMMAAYSIFEEPLASLYSNF 

il 1 IS. 

4300 

A 

302 

897 

SVAKEGVEVEVSHTCGLKWGDIRGGCEGGFGLGEKA7VMRCGCSPGIV 
ke»aijisij_» v jvv b JKFij VLil KJvGTDLPSTVRVTRRS AS VMVLTMPLAPATF 
LRGNCWAGGGGLVTE*NCKPDQV*GGEMIKGL*GGG/VRGCGRIET* 
LGLARRSLGRRGEVRLVWRSGTLLPKYCAPPTVFTTARPGDPLPCLA 

4301 

A 

111 

275 

SRNETESRTRCEEGR* * KDYRVEEQRLRKNWDLARPGKEQLPAFQSG 
NLSROLH 

4302 

A 

3 

220 

SLQPLSREDLGRSQSESLGPEFQGLWEWLPGELNSPICSGVPHRWDV 
A*EESWAAGSPWPGGQISGTCSKLL 

4303 

A 

2 

347 

LTMPLAPAT FLRGNCW AGGGGLVTE * NCKPDQV * GGEM I KGL * GGG / 
VRGCGRIET*LGLARRSLGRRGEVRLWRSGTLLPI<^CAFPT\^TTA 
RPGDPLPCLAQWPCPWSLAHC 

4304 

A 

2 

265 

DFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRK 
1 1 LiAFlM I y IA^PRAIjADSLMQLiARQVSRLESGQLLLARPS 

4305 

A 

3 

865 

LTQQHILTADTNLLCMDYGAGNGKIIn^IGFWHACIRATQEDA^ 
ST I RPL ICCFQTTLGAGKQNKHVKTLLKi^ 

b P 1 bPGNRGSCNGVWGICrQICrGFTSLEYGRSLSVIRSFLGQVTATPS 

stetganalqssaafqvknifkavngvsgyfegkptvepsvcggrle l < 

& ^"v A av? v i> 1 vy i%Jj1\\£> Jj Jvr D VRGQEES VWEKDEACLHPGB INSHVA 

HTKPIWWSLHRDAHIOCKHRGH^ 

AQLKP 

4306 

A 

288 

597 

A - L ' r ° v ^^-Xvov_rc ibrf JjKJjJ.r ^UKJ. VCjVELiIAV/ C; * RHfiPR p.SSCRTf-S 

CRRPFDKTCRLMCHQRLSGFPQEQTQRDSRCIQLLFRQRFAAASCPT 
HQVLLQTQYF 

4307 

A 

3 

188 

KPRRSRSESLGPEFQGLWECIiPGELNSPISSGVPQRWDTA*EESWAA 
GIPWPGGQIDFWHL 

4308 

A 

498 

803 

HLGPEFQGLWEWIjPGELNSP ISSGVRHRWDVA*EESWAAGI PWPGGQ 

ISGTCSKLLEEEVLEEGLAAAVQAFGSSCVLEMWLGFVSQWRQGIAT 
FFYYCTP 

4309 

A 

3 

338 

KPRRSGKESESLGPEFQGLWEWLPGELDSPISSGVPHRWDVA*EESW 
AAG I PWPGGQ I SGTCS KLLGEE VLEEHLAAAVQAFG S S C VLEMWLG F 
VSQWRQGTATFFYYCTP 


10310 
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4310 

C 

321 

734 

M I KHQGKAAFPVRDRRQS FVSTDKTCLIiCLSQKMKGI E I KRRERLKS 
GKEKVWLRDRERLEKRTILRASVPKnEKPAanPPnKVWQr'nc: tjjxtt r\ 

EKGSRPQISVLCILNHRKYSPRFSVWSLKPPSMKLEVRRKCQR* 

4311 

A 

867 

1111 

RHCRYGTR/ ARCRSANAGAMI LTMKOT .ATVMPVYriHO\n? d q mt> or-iaxr 
GARSGMS AVGAYYS S C IRVMTALQS CTGDDEAVLL 

4312 

A 

1 

560 

RQRRHILiSVDPKLWRRSWTGKAAFPWCFI I AGTPLI IHPCFRGVRPC 
RDTWLG PS PLAASPAFLGEGQR I KYLG I QLMRDVKDLFKENYKPLLN 
EIKVDKHKWKNIPCSWIGRINIMKMATVPKRSFSMPSTISPLNNPPA 
YLCLRTFLIMKTMMAAKKWMMEV 

4313 

A 

2 

268 

WMKLETIILSKLLOGOKPNTACSHLOR^PQMPc;'r T QT> T mmddrvt r»T 
RTFLIMKTMMAAKMMMMEVMMMMTVRPLRPSQTQPACTQVK 

4314 

A 

11 

783 

NLFLRPQKDQTWYHRIRLGOTKHIjLICETjKvytvt T.vziTTCP*nv*i5U 

C/RSAIIAEAKETKETHFIRGPKTLAPVTDWEGSLPLVFNQCRDASL 
IIHPRFKGFRPRRDACLGPSPLAASPAFLGKGORIjKTDTARLPWKPP 

DHHRRRASGNSHSGRVQPPSRLPSWAHVECLRLFQEHGASCQWIYRS 
G I WRTVVEMCVR VKR PPNRLCVSNM^ 
ALRMIHVESLACGVRESSTNDNYY 

4315 

A 

177 

434 

LCLWLQPPF*E* IAGQVGEG*SRNETVSRTQCEEGR* *KDYRVEEQR 
LRKNVTOLARPGEEQLVPS SEKRDLPLRVKDQGRCPCVV 

4316 

A 

1332 

1602 

GPARNRAQHWWAGTAGGPSTPSAAAGPGAKSLIARGQQGWPTAPSVG 
PAKPTPTRNSSWPASAVCSPGSCSRLSLHTSLPLPQKSGTCR 

4317 

A 

797 

955 

LNTKGRIjPSOSVtGARVLiGPR TKrVQ TJ*\7Q Q^rDTcncocMuncorrnn 
VTAFH 

4318 

A 

1 

817 

MWESLELPGNI FNGFDQNADNDMDDE IQAEVI SDGDEEFVGNWNKPR 
DLVPC I LATLALTKRGODTAOTMAS EGAPPKTrwnT.Pf5f2W3 pvrs a nvc 
R I EVWE PLP I FRRM YGKACMSRQKFAAGAG FS WYVP VAVVGAKVHDV 
NLHMLSFPSKWKLHTCMKFGAVTQIVTSLGRSSCSLLLEKDPPMVLR 
PTSPRNISPISNLTKETRFIRGPKTPAPVTDWEGSLPLVFNHCRDAS 
LDYSPTFQRCQTTQGRLPWSFTLSSKSCFSEGKARFN 

4319 

A 

3 

93 

LSPS*AIISPVPCTYTPRWPEATEDPQKK I 

4320 

A 

874 

1114 

ESLGAQAEDYCRSGALTESLY*SSSKRKCGVGVPKQSLLYSTV*WSC 
GRGS LS FRPQNDRASS FQPQYEKSTEAKLPKGL 

4321 

A 

264 

1259 

GDRWEGKKEDLGRVALGTETRKGLMCKRMPGH/AGTSDHLSILGQEL 
FRSCRMEKLK/RAI/CPAIWNYCRVCIGVKRHCRRK*DA*1LQDLNN 
SCPKIDKWSEVPDVQAFFYTSVPS 

4322 

A 

607 

769 

SQVLRPTQPKEHLANFKSEDRSVRPLSPS*AIKSPVTCTCTSR*PEA 
TEDPQKK 

4323 

A 

495 

658 

GLTGTTWAHAFTHWLWLSHSTELSGCERLCGPAELGTFTLWLF*RK 
PFADHWP 

4324 

C 

218 

352 

MVGSDFQLCWDHWLITSXXXXXXXXXXXXXXXXXXXXXXXXXXXT 

4325 

A 

444 

996 

PMTHPFFFFPAEVHEPNQROAFLVPVSOARWTA q qi\Fq q WVq A QTq P 
QLEELQGSTQTWHFSATCSQQPSRC^^TVDPSSLLTQQGLAFSAPTG 
RQAQRCSARMAWNPGDPLPPPSVHPCTPHCP*GPPQQPPSASTPWGS 
CCSRWGHQAILGCRDRERCRGTCSSRPWLPLCLLPRFPDKWS 

4326 

C 

296 

625 

MVSEGHTSPLAKSSASELHRTSWSTDNGMDFWRECQFRAKEIQGEi^ 
ALNKSSKPJVQKLVSDL1^ERQVRRKYIVNAGSLI*HFI 
KTAFCVLLQGNYLTL * 

4327 

A 

221 | 

344 

KVLSSDFCSVPFCVCVCVCVCVCVCVCVLTLL*NSSIYYGH 

4328 

A 

441 

656 

AHNY I YFQR YI H I QTYTDTGRGMRIG^1PJDQKKRPGTVAHTCNPSTMG 
G * CRKTS QGQE F KTRL SNTVR PNL 

4329 

A 

409 

749 

RRGRERKPJiTHIHTHTHTANLFVSVFLYFLSFLLHCAIiCLS*WM*RC 
SANPILPTGPYASQTFMILPSCTTVISLPCHWLLFPQYSQTLYSLFT 
ELNGQITVTSAKITYTSKI 

4330 

A 

1682 

2410 

LNKGGQTDIASSQTCLVGHVESDGAXHTTMIGPITOPA^LSPEIWNCP 
I WQRER WT 1 1 QKKX^1NG^1HTHRRNEAET I AG I LTVQEL F PRTNS LTV 
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IDVHSHS VLFCTTIQVL I SQVNHL YLPLPQAFKFPVI F III TSTGLL 

G OPGEGRAS R E L C TF YT FVLCTALIj I LH P T M T T. T, R P T C KTCT .T ,P Y T? T r» 
VYVNNTT 

4331 

A 

2 

57 

NKTQKKLPNACKCVLYSC 

4332 

A 

19 

192 

WKCSKIRLWLWLNKSVNLPKCWTLKTAECYDLYLVKLFLKYGSTTA 
RDIDIKDAEFE 

4333 

C 

218 

364 

MTRVSSCFSDLATDRSFTHKGNNIVSSNSKIISLTWMGFEILKTQHK 
I* 

4334 

A 

172 

304 

PSPVFLAPGGVRAGNSLYPRGPGFH*PRFGPFPPPRGTKREPPF 

4335 

A 

258 

509 

RGHS KQQRD I PRVLDHLM INFS FKI VTVCRSLYS F I * KQAQVKXiFLH 
LHPIPSSHTHTHTHTHTKPTLLLTKPSDDSWVFRWC 

4336 

A 

1 

- 

2411 

MSFLGILCKCPLQ1STESQEESYNAYPLPAVKVSMDWLRLRPRVFQEAV 

VDERQY I WPWL I SLLNS FHPHEEDLS S I S ATPLPEEFELQGFI-ALR P 

SFRNLDFSKGHQGITGDKEGQQRRIRQQRLISIGKWIADNQPRLIQC 

ENEVGlOiLFITEIPELILEDPSEAKENLILQETSVIESLAADGSPGL 

KSVLSTSRI^SITOCDTGEK^^ 

RDYS KG I TVTKIHDGKKDNNI^ 

QTELRKTPVSEARKTPVTQTPTQASNSQF I P IHHPGAFPPLPSRPGF 

PPPTYVIPPPVAFSMGSGYTFPAGVSVPGTFLQPTAHSPAGNQVQAG 

KQSHIPYSQQRPSGPGPMNQGPQQSQPPSQQPLTSLPAQPTAQSTSQ 

LQVQ ALTQQQQS PTKAVPALGKS PPHHSGFQQ YQQADAS KQLWNTPPQ 

VC_GPLGKIMPVKQPYYLQTQDPIKLFEPSLQPPVMQQQPLEKK^fKPF 

PMEPYNiTOPSEVKVPEFYWSSYS*^ 

A. r K¥iL\jU& V FKMJrr fci\^±jJLjEKPSEIjM 

NSMFNEVYGKNLTSSSKAELSPS^1APQETSLYSLFEGTPWSPSLPAS 
\ RTADRWK\ TDKPAMGG FG I DYLSATS FS ES SWHQAS T\ PSGTWTGH 

fJP^MRnQQZVVT .IS/TP CT.TfC TMCCCMMUDP'D07\t ~c>/~\t t iui<^#^_r/^iv^rsor«r4/^ 

TMNPPH 

4337 

A 

17 

300 

TFKPYYRN1»NFL I LKKS YMYATFF IIYLYI YAS FLLVLTE I F YF 1 1 1 
SKELIjPVFIYOVYFPPRPT»nY*PTYT.^QTrMiOPV>JT ptt t df visit? "fxttj 

4338 

A 

257 

476 

KAQMSLLDTLTHP VWLLCLKLCGAV* VKL* E YF * S I CNI CTAI CVCG 
VCLC I YAHTMCVCVRAI ERKIjKFFPT , 

4339 

A 

283 

494 

GQRCEPPPLPPSFG*DSG*EPPFPPAFQKAGGPASPSPGPPGPH*DP 
TG P A S E W KMATMGNR P P EMVS Q A 

4340 

A 

766 

1083 

FKHFGPFHKHHLl^lVLFK^ 

RADCLMSSW*SQPGQCGETLSLKKNTKISQA*WHVPVVQATW 
GGSLEPRRLRIjO 

4341 

A 

2 

163 

FVFFVETGFHYVGQTGLELLTS * SARLGLPKCWDYRHEPPRPAYVEY 
FYSALL 

4342 

A 

192 

440 i 

ATWEEPPCSPPPIPRARFWGPRPPECSSRGQKPKPPAYMSPPDAWR 
PAWLPSSTPTYD*T 

4343 

A 

1875 

2210 

PLSQVITGKHQKRKRJiTJEEGKERPEKEQSra 

■■■"•V^^ -3_a__» _V V XV V O O I\J_I XVTj J\ J\i\i\I\o XT VI JT Jr K. IyJVM Vox vj x\JviVKJ>i Xj Jj 

KKGQKRKC FGMSLFLG F 

4344 

C 

217 

396 

MSEETAKEWtRAYLQQLRQ 
QFMNKSLSXPGQ* 

4345 

A 

1 

546 

MPAYHS SLMDPDNKL IGNMAFLS I KSQFKGPAPRETKIDRDIVDEAIY 
YFKA1WFFKNYE I KNEADRTLI Y I PLY I S ECLKKLQKCNSRSQGTS P 
VAGVIVX^CEWGSSSSHGTYVLISDL^ 

EKIGFKQFAQLFGK3SIGSFLPNIQNKLLQS1jYPRL 1 

4346 

A 

216 

324 

VFFFSL*NEADRTLIYITLYISECLI<XLQKVIKLG 

4347 

A 

71 

650 

NPGAAKMPAYHS\ SLMDPDTKLIGNMALLPIRSQFKGPAPRETKDTD 
IvT}EAIYYFK\ANVF\FQNYEII^ 
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AKSGNSKKPKVEERKCFTLGIH*FFPFPG\ PPRPFPT *T?r«?T?i\trD\ n 

NKQEDEVMRAYLQQLRQETGLRLCEKVFDPQNDKPSKWWTCFVKRQF 
MNKSLSGPGQ 

4348 

A 

552 

874 

SG PNCCEE ADAQLAEE SGNASHTS FTV * VRPRRACHACNP S TLGG * G 
GRIA*SQEFETNLGNTVRPCRNSGEVLNRKTVSIRRAT.PVT?Qt>T utzv 
RLHPAAAARAARA 

4349 

A 

199 

375 

FSLGGIPPFHRRGNWVSREWPGTVAHTFNPSTLEGRGRWIT*GQEFE 
TSLTNMVKTCLY 

4350 

A 

368 

546 

PSMVCSNASCWQLPPVRRFCPDLFVPEVSVCGRHLGPFPPFSAPPGL 
GGPGGPHSWSLG 

4351 

A 

2 

211 

YVYIYKGIESRVSRAICTPSMAALVDKRWKKPKCPLMYEWINKM*YT 
HTMEYYTE * KRKYCSTLSHATI 

4352 

A 

1 

676 

PK*WGFLPIiIYKKGKDGGGTWGGDVLPSPSLGQGGGKKGWPKREGGR 
DLGAERKAGKCREKWKQSKMKGRSRKTGGERVKKPEKHQDTGLFPFQ 
IGGGGLRLSRVCVPQPQVGRGSVSLAQFLSCTSSD 

4353 

A 

1029 

1252 

CKFVYSSLLFIYSSFFICLFIFLPFAPVLIYPI J AH*LCKPCSSYMSW 
VNI QYS I YFRHKCFRNI VQKILYS IKM 

4354 

A 

299 

562 

FPS WQRDAQAS 1»TS PAPL I Y PKATPFRKRPFRROL. WKTKfJT . T D «5 YT .to 
RLYTVAHTCNPSTLGG* GRQMTWGQKFKTSLANMVKPRLY 

4355 

A 

17 

195 

FMDAQVTFKLYIYIIYTYVCTYMYIYTYV*MCEYLFIYIYARIYMYI 
*MHYYTYISLTP 

4356 

A 

3 

565 

NKIPRNPTYKGCEGPLQGELQTTAQRSKRGHKQMEENSMLMGRKNEE 
LENGHTAOEENVGNTI ODTCMftKn pmwvtdv?\m7\tvti vwMnimT -r xtt 

KSFCTAKETTIRVNKQSTEWEKIFAIYPSDNVLITRISKELKRIYKK 
ISNNPIKXWAKDMNRHFSKEDIYAAKRHMKNAJ^ 

4357 

A 

1 

597 

MS ELP FT I TTKR I KYLG I QLTRD VKDLFKENYKS LLNE I RGHKQTEE 
HSMLMDWKNQYHENGHTAQAICRKLKLDPFLTPYTKINSRWIKDIJW 
ftrtv * -*-* vx xjx^tix* ijuo x xyx^xoi ¥ iiji\JLJr rl 1 IV 1 ir jVAJlA rKAKx DKWDTiT KT » 
KSFCTAEETITRVNRQPKEWEKI FAI YPSDKGklSS I YKELKQI YKK 
KTTPSKSGPRI 

4358 

A 

1 

567 

MGKDFMSKSPKA^TKAKIEK^^ 

EK I FA I Y S S DKGIj I S K I YNET.KO T Y KTCKTKTM ta T vttlj avtimcduvcvu 
DTYAAlOGlMKKCSSSxxAIREMQI KTTLRYHLTPVRMAI I KKSGNNRN 

TTTREYYGHFYAYQPENLEEIDKFLDKYILPRQTSGRIGIPEHTSDE 
L 

4359 

A 

1596 

12368 

ENWIaAICTKLKLDPFLTPYTKINSRWIKDLlWRPKTIKTLEENLGNT 
IQD I G I S KN FMNKTPKAMATKAK IDKWDLI KLKS FCTGKETTIRVNR 
HPTEWEKI FAI YPSDKGLI SRI YKBLKQMYKKKNPKTPI KKWAKDMN 
RHFSKEDIWANKHMKKMHHHHWPSEKCKSKPQ 

4360 

A 

48 

436 

RTGSSSGLSISATCLISQRRYSEDCWNVYSLVRSG*ERGS*LLSTYY 
V1^GSLPDVFTYINSFQ1\TYLK*MGPGMVVHAWNPSTL»GG*GRRTA*G 
RELETSLSNI VT^PCIxNl^lSWlS^lWSlTOEWG IRKEG 

4361 

A 

2 

2552 

GQRD VXTTKRDDQAHKPNGLGD ARETRWKD PRS LNDL VE PNS LRCSM 

AVT*PAQSPHKRPGKT$IFKNQYHPDLRSSKATGKBNFI^ 

CTFEEERI,LYCRFLDWI<x,LLSRGKTYMCKEPGWFRLIFADELPRLKL 

GSPFQVTSYl^LETRPDLKELLGSCHCGPGICEENTLGIiDSDRESPAP 

DIERAPTLWTNPAWPPSPLSGSNAQTSRMAGSDISQPAARNQRRGRC 

VFCDGCGARTLTPAIjGSELYGEESSQSSALPPPLMRLHFSPDAGDGS 

LTDIAGVCKJDGGESTARYPHSMRPWNPASGAATAHEEPKRPSTGRRT 

SRERRLQVSRQRKOTRRQGIRDGRAEAFAKCTEGGLNGFLLCKVVDP 

x^IxAELEEAVlxALGGRPSRVGFSPPGNALRRLTRGPHTTLGEVGFIT 

AIFNKEKLKTRKARDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAV 

KGNQLLPVSLVKRKTTIxAPOTQTASPR^ 

ECESCLCPTGDLPIRGSDISVFYQSSFQDYNAYQKDKYHKAKNTLGK 
KVRGLVLINPQNPLGDIYSPDSLMKYLEFAKRYEFYPLRLEQCSKKA 
HDQFKADIxASILPSNTTPLIVSDAGFl^ 
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KVQ YADLGAENWKP I SNLHDMS S SH S KTLG YKRLTKSN P I SCQILL Y 
KSRS KGRKNQRS TRTHCHHPS PKIYSASAKEPWVLATNLPVE IRTPK 
QL VN I YS KRMQ I EETFRDLKS PAYGLGLRHSRTS S SERFDIMLLI AL 
MLQLTCGLRGGS SRVNWREENGDYTYVERVKI PLDHGTLLIMEGATQ 
ADWQ 

4362 

A 

322 

1288 

RKNWGLRKI PWS SLKQEQNLPES FGSKKAFFGS TWRKKDGEPEFQGS 
LGCPLLKARPLLSQLPLLRAISTRKPGQTWKNKEHHLSDREFVFKEP 
QQWRRAPEPRVIDREGVYEISLSPTGVSRVCLYPGFVDVKEADWIL 
EQLCQDVPWKQRTGIREDITYHQPRLTAWYGELPYTYSRITMEPNPH 
WHPVLGTL*NRIEENTGHTFNSLLCmYRNEKDSVI)WHSDDEPSIiGR 
CPI IASLSFGATRTFEMRKKPPPEENGDYTYVERVKI PLDHGTLLIM 
EGATQADWQHRSAQKNTTLREPRVNLTFRTVYPDPRGAPW 

4363 

A 

249 

906 

RIHFPRVSGPSQSNPKFAVASRGFFSLSLSSAQPDPLPPPLGEALAL 
SLHPVPRRSTETVAGDSSELQLGLRSPQQPLAGLAFLARLFLL.FPPP 
* RCKS KPN * NDRRRS S VDS QI HLVGRES AHPLAGLRVCVS LPLLARC 
FGQVLQG VPGLW IPSPGGS / AGVSGRRREERHMG WVMRVRVEARVS 
S *ESKI / SRALR* STHLGLPKCWDYRREPPCPAH 

A 1 dZ A 

4 364 

A 

843 

1186 

QHSTKNSGGWAQTPYGSSQISVCVYIHTYIHIYIFFFFFLVETESHS 
VTQAGVQWHNLSSL*PPPPGFKRFFCFSLPSSWDYRRTPPHPANFCI 
FSRDGVS P YWPGWSRTPDLR j 

A 1 (Z C 

A 

127 

601 

NLI LNVTWPTQLAKQERLHLRLKTSVVWVVRKEKPTRFPRIiGAVSHC 
PQLLILQLVLSPSTSKTLP*EPPDSESP*PKRKGKPSWGRTGAPNLH 
SPPLSPEGPPWAAWNPLKLPPPQHSSGAVPSSACSPWAGSVPAAPPS 
VCYLIYWNLHSQALAHR 

4366 

A 

30 

436 

RRRCGGCFPALPGSPHKPHEQI1PPI1CPRVPVA* PS SS AEHPDPRAGK 
GGGRGL*AGSSPG\AVPHPPSIPGRKNRMGSLEG/RGAPAPRRGGSG 
AGAAE/ASIiTWPGSTGPAPGSHGCSAGVGPSRWAWSLPQRCGRC 

4367 

A 

497 

924 

KRGSRGWFFSTFHHLWGRLQAHLLGPTPAIiQPCDPGAGARTARPGQT 
ASAAPAPLP PRRGAGAPPLRDLARVPAPPCCAQRAPPSS SQPGRNQ * 
*aASGPWRPPRQAEKEaSAGKRRDRPVGARKTAAERDRRGQASGGQT 

K 

4368 


22 6 


MCbJvgFQNTTSFQGSYPKYTRGSLPELRQLLKVELFKKGNWEHLQPR 
SMLLVRSGLSQE* 

4369 


•a 

<£ / D 

1 bMi VLiF VAJ^JjijHFbAEEKrmKHTMKRIjAQSPNSYFMEAKSP * CYK 
ITTAFTHAQTVGWWVGCSTVLCQPTGGEARLTERCAFTRNQRF 

4370 

A 

266 

455 

AQWHTS IVLAAWEAAAAGLLEPRS * SRLKCTMI I PINSHCTPAWQHM 
RPCLLKKNFSNSFISK 

4371 

C 

409 

507 

MNTRFLTCVPGVA^KNDNYSIVCIDFPLWIAI* 

4372 

A 

870 

1027 

HliCMCVRVCVCACMCTLLHLFSNGFI I * CFCTST* AIHVMG* VAI KM 
YINDP 

A 1 "7 1 

A 

1 

690 

FHHLED SIIEI FDAFLTLRKKKRKKKHNGNMP I FS KLFHMVNPMKI S 
TSLFF*HIC5fMIVPISPQSHQYSSNGIjSVLFLP'\rYRHSIjFFMILSTLS 
FRCLANI LSHSLF I LLGLL INRSLSM * S VLLI Y* Y * S YI LIjQYLLTV 
LEFFSFHI*VFSLLRILFFFKIFTFSIWMPLSQSYLLRHSVFTDVPY 
LISYILICVDLFLNTLFCSIGLFILGLTQHCLNSSSFIIIVI*QSIS 
SHLVQ* DCLCLHLCI FCALSCATKTKNKDVEI FIGLTM 

4374 

A 

393 

491 

GLYLHRP I FPHDKLL VQL S CPFHR * THRGTES 

4375 

A 

435 | 

526 

GSTRHTHTHTHTHTHTHTQKERERERERER 

4376 

A 

293 

1228 

NADNSLENFCRKSASCDLEQRRFIIRFIKLRRGSFSERKQQHSVTRK 
YSFIONWVLEKl^NlCATjNTRVTYTraTH^FTN^ T rev 

HVLDRHCCKGSTCIDPFFLMTNYWCNYHAHFIDEHTEAQRARNGKQG 
DECEVGGIESRRRSGNSGARYVHDSSPPTSGIHSNARPSVRLFLATL 
SKIAFLLTVSLLYEMTVHIYGEPLPRRQRITQLLEGVGTLNGSVASV 
DPVFRNQQLHIiATGSLMASSSFCFSVLREI IGQSLIMKTLP* ITHTH 
THTHSHTHTHTERERERERETEREERVKF 

4377 

A 

953 

1317 

VLTLGVSPLYVSKGFWFFLLLFS*TMFSHSPTWAILESWSSIARFSG 
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MSGEPQMTPIGLQYLA*G*RPRNPTLVTFTEIS*VYFFILPLMSQ*M 
CLMHLFPLPQTFCQILQIiYFFMAHIST 

4378 

A 

9 

360 

HKPMLTLR VTWREMLR \ PRDLK* E VEGGCRRCS S S PR / VD P PGG \ RR 
EFRGGVRVRGRG\ SAPPGGPGLSAML 

4379 

A 

223 

458 

CLRR INPCS PRCRRPAWRRRRPG PRTRRRTDS * IRPKSGAASHPPGG 
AEGIPRGCQSTRSRTRRRPEAPASRPCCLRI 

4380 

A 

887 

1173 

FFPESELAOPEKI?F* VPC5Kf3Pf5T.QPPPT T DDDrDDDnwDDDOnnnnrp 

RRRTDS*IRPKSGAASHPPGGAEGIPRGCQGTRSRTRRRPEAPASRP 
C 

4381 

C 

22 

255 

MPSSSAOLLiLPHEVSPVLi < ?PnT.r , FT.TrTTP\7r*cvT urt\7Tt r»T nvrTJc 
LIPCLVSYFLLSLSLPCISWFRTQILLHY* 

4382 

A 

23 

361 

LTTLVFAGTPGKKRRGGNQWR*SPHYSCSGGTGGRPRPAFSLPGAPG 
QSGQ I SQSDWRRGPCS * S SGWCNSC/HPGWPGLVRGGMTS * WS YS * A 
TGP /GPG YKAI PGAS * R «3Tc;T. 

4383 

A 

2 

310 

PLLPAHPPPWSLPVTS*PRCGSPQRTSSWNRHSHRPA*SLPCLGFAG 
T PGKKRRGGNOWR * S PHY P nnTnm? DaD&rcT dp n opnopntcoo 

DWRRGPCS 

4384 

A 

1021 

1565 

PMXiSPAGTPLiTjPAHPPPW^T.PVT'Q * dp pp c dhdtc o TaTkttd xj o ur t> n a * o 
LPCLGFAGTPGKKRRGGNQWR*SPHYSCSGGTGGRPAPAFSLPGAPG 

OSGOISVPPCRVPPLLOSRAAGPSP t 5<5F c ;KlJF , T.Yr , QWWTT.PPT T VAU 

CQESLSNTQQRLEASLSSLNASEPTTAKCPHGTAACLLGD 

4385 

C 

361 

612 

■ * * ™wui\vi\ui\ruj v vj x vj vr vdv XxrVX VjIwUxV^Xj V XjXxXM xX_U^\v3 IVjAoixnJj 

GATAIxAQDAAS FQALARAS PCLWGASAKQRFPLEL YC 

4386 

A 

448 

683 

Vl^ERKKDljTESSLiPAVPTKVPriMPMT<'T.G* r rT ttpot tv t'd p^tt cot r» 
SPPTNQRLSQQEGVGVGEKGELRIYLAMWIK 

4387 

A 

1977 

2162 

VNMERKKDLiTES55IjPAVPTTTVPr;MPMVT.c *tt vtct AnsrwrT odt r« 

"j"-i^wu x o^i^jjrn V XT X XX. V X^Vj1*XX\X*1xVLjo X XjIVX NliH 1 Kt 1 1 iSh 1 it 

S PPTNQRLS QQEGV 

4388 

A 

145 

370 

GRVLLVLFCFFETESRSVA* AGVnwPWPrj^T.OP* aqvqvtjt.fa A\/r*v 
GGTTAIiQPGRQSRALYLKTKTKNFLNLS 

4389 

A 

248 

452 

GWVFTMLARiWLQ*iWx^PGGGGCSELRSraCTPAWATE*DSVSI 
El^KKMRNETKTKWTCS PF IT 

4390 

A 

141 

354 

ARWADWLPLGIANCPRAHLAYLETWELEA^ 
VCLVALRL FCS SGMHHS FNRALQ 

4391 

A 

354 

592 

I RCTQAFALTS WSGPGTVAHASNPS TLGG * GGQ IT* PREFKTSLGK 
^KPCLYKl^GRHGDLCLQARLLQLAEVEGFT 

4392 

A 

63 

353 

SIFlVIDYKCLSPCPFKQFLSSPPQSTEPSPACELGLINVNIiSSRPSLP 
SR 

4393 

A 

357 

561 

L I LETG S VCVFQAGVK* LCTS VI IAHYS LDLLGS SNPPTSDSRVPGC 
GDVHLYS OVDAAANLVVV AA 

4394 

A 

1048 

1599 

CRSKWGLAS ILWLKS FLLGHHLGGDGFKA* FKFHS KFTVLASSKCLL 
LVGF* SHNYG* VRIHLFI Y * I F IHF * KDFKSMFPRLVLSSWPQAIIiP 
PWPPIWLGLQVLAI^CIQPMFHFNWLKSSKWLRI 

4395 

A 

24 

452 

APSPDAMC3 /HSTiWr?TT\7 F NIVP''nAri/^'P p rT pot t ^7^rv , T3i*j'r»^T5t?Ti»'r\ot?rnk.TT o 
•stv/n Jw/ xjoxiimox\.v« v CiUMuuCi X xjvjKxjXjV V IlrW Xyiv*? A? IJo x* LjINIjO 

SASAIMGNPKVKAHGKKVLTSLGDAIK^ 

HVDPENFKLLGNVLVTVXAIH^ 

RYH 

4396 

A 

38 

553 

APSPDAMGHFTEEDKATIT\SLWGxWiWEDAGGETLGRI,LVVYPWTQ 
RFFDSFGtfLSSASAIMGNPKVKAHGKlWL^ 
TFAQA*SELALVDKL\HVGS*RTFKLPGEM 
PEGARLSWAERWVTWSWPVALVLPRLPLKLNCP 

4397 

A 

74 

432 

TTGTLESLCSNNACFGALL PHPPAPGE * RPWS *HFTRYIiGPLNRCAW 
EGGRG * TG PGLPS LGKALVEG IPSES PGPTVSHPCCS PRPDPDQSAA 
SLHQVPPPTVHCSHIjTLPMGWPGPQ 


± O O 8 H-6 4-25; ,„ O H £ 6 O Ef 


4398 

A 

1 

176 

HTHTHTHTHTHTHTHTHTHTHTHTHTH^ 

THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 

HTHTHTHTHTHTHTHTHTHTHTHT 

4399 

A 

1 

188 

HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHra 

THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHra 

HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHT 

4400 

A 

1 

118 

HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 
THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 

4401 

A 

1 

227 

HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 
THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHT 
HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 
THTHTHTHTHTH 

4402 

A 

1 

157 

HTHTHTHTHTHTHTHTHTHTHTHTHAHTHTHTHTHTHTHTHTHTHTH 
THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHAHTHTHTHTHTHTHT 
HTHTHTHTHTHT 

4403 

A 

1 

133 

THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 
HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 

4404 

A 

1 

169 

HTHTHTHTHTHTHTHTHTHTH THTHTHTHTHTH 

THTHTH THTHTHTHTHTH THTHTHTHTHTH THTHTHTHTHTH TO 

HTHTHTHTHTHTHTHTHTHT 

4405 

A 

1 

144 

THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTO 
HTHTHTHTHTHTHTHTP 

4406 

A 

1 

89 

HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 
THTHTHTH 

4407 

A 

1 

166 

HTH THTHTHTHTHTH THTHTHTHTHTH THTHTHTHTHTH 
THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH^ 
HTHTHTH THTHTHTHTHTH 

4408 

A 

1 

190 

HTHTHTHTHTHTHTHTHTHTOTHTHTHTHTHTHTHTHTH THTHTHTH 
TOTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTO 
HTHTHTHTHTHTHTHTHTHTHTH THTHTHTHTHTH 

4409 

C 

21 

218 

MXC^TVCXVCVEC^CXACGVCGXCGXWCWXXWSXWXVCVCVCVCVC 

vcvcvcvcvcvcvcvcvcv 

4410 

A 

1 

206 

TAHTNTAHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHra 
HTH THTHTHTHTHTH THTHTHTH TH THTHTHTH 
THTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 

4411 

A 

2 

185 

TPHPALHTLHTHTH THTHTHTHTHTH THTHTHTH 
HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTH 

4412 

C 

247 

393 

MELFCNG YNEEVS KNTHTHTHTHTHTHTHTHTHTXTHTHTHTXTHXW 
XL 

4413 

A 

1229 

1784 

RCRREMGSLSSHSVMVDSGFLWPCWRGPGSRTSNTVENRNPPPPPRH 
PLGQLFILFQSLSLHLSLS/CLSLSTSFFQSLFLNLSVSLCQSLPSP 
S Ii/ C FS I PLSNTHTHTHTHTHTHTHTHTHTHTE S GRC * S VIjF FPPCP 
YLYHYR*G*GVGSAALSLPTPHYEMTVFKGNLLYLPAVSIVSRVNL 

4414 

A 

76 

297 

EIRTLSRSSQSLWRRDVRWISGSCLKGHASDHQLCQH*CTHTHTHTR 
THTHTHTH THTHTHTHTHTHTHTPKMA 

4415 

A 

48 

327 

EELEALRR\QRLAELQAKHGDPGDA\ AQQEAKHREAEMRNS I LAQVL 
DQSARAR\VSEQGLIEILKKVSQQTEKTTTVKFNRRKVMDSDEDDDY 

4416 

A 

786 

905 

VSEQGL IE ILKK* CQLTEKTSTVKFNRRKVMDSDEDDDY 

4417 

C 

58 

237 

MAAEELEALRRPEAGRAAGPNTGDPGDAAPTGSQSTGQAEMRNSILA 
QVSGSSPAPGPG* 

4418 

A 

115 

432 

LSK*CNKYTIHTHTHTHTHTHTLLNQPLK 

4419 

A 

10 

398 

ENTVFIGLLKQQTRNI ILPLKI FCLRLKIPPLTMPSPPPSPFRKSPD 
LVPAHTPKEGRLWPWPGPPGGPSKAAHARRPLSTCQD*PPQREILPG 
PQPTSPTHCRMSREGRSGGTGARACQLPLLRPLAS 

4420 

A 

551 

792 

INFFFF* I IDRFSLCHPGWTAVAQSRLTATIjLPSRFKRFIiCLSLPSS 
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WEYRPLPPYPANFCSKLLICLSTFYYKDCGNSA 

4421 

A 

56 

193 

HATRHIRFHRIMAPSYPSRCFWYSRLPFMLVSLKVYLKGFKYSFT 

4422 

A 

1 

3552 

MCELDILHDSLYQFCPELHLKRI^SIiTIiACHAL,LDCKTLTIiTELGRN 
LPTKARTKHNIKKIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIV 
LVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQ 
FIxADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQ 
YADLGAENWKP I SNLHDMS S SHS KTLG YKRLTKSNPT SCQ I LLYKSR 
SKGRK^QRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLV 
NIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQ 
LTCWLAGVHAQKQG WDKHFQANTVRNRNKEVE WDLSGL I SDVVMS P 
IPLTSLLCGNPDEIRSFPSLCPECTEGPIDLVFVIDGSKSLGEENFE 
WKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMK 
KAVAHMKYMGKGSMTGLAIiKHMFERSFTQGEGARPLSTRVPRAAIVF 
TDGRAQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEPTNKH 
LFYAEDFSTMDEISEKLKKGICEGINYCTUjNKPGCEHECVNMEESYY 
CRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEG 
FLINEDLKTCSRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVAAAM 
GRRVQSVLGEEDAPEKRFALLSDPGDLTAHELDSCALGDHGCEHSCV 
SSEDS FVCQCFEGY I LREDGKTCRRKDVCQAIDHGCEHI CVNSDDS Y 
TCECLEGFRLAEDGKRCRKSSCENKRADIjVFIIDSSRSVNTHDYAKV 
isjsr x vuijjyi? LiJjj.gpdv TKVGIjLQYGSTVKNEFSLKTFKRKSEVERA 

vkrmrhlstgtmtglaiqyalniafseaegarplrenvprvimivtd 

on. ^ o v «ji v/\>u\>\±<_u iblLlrAlbVG Q VD FNTL KS I G S E PHEDHVF 
LVANFSQIETLTSVFQKKLCTILASGTNSNRQLPSAFAPPLLLAICF 
RFRSRSSGSARRALHCGI^FLSRSPAILAAAVGRKMVAAKKTKKSIiE 
SINSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVIIiANNCPALRKSE 
x *.*-*rvxvx vj van jt o kjis im x.c>Xjicr l x xKVL. 1 IjAI IOPGDSDI IR 
SMPEQTGEK 

4423 

A 

373 

593 

LR*SKRLNVLRLPRIHRERIYRSVNL 

4424 

A 

118 

206 

VGGRPRSLRQGRWARKEDEKSRWESINS\RL\QLVMKSGKYCPWGT 

^ ^ ■* AJ1N - 1 ' AA x \ *JAruvu v xxjrtXVi>iv^Jt*rtXjKi\jOiLX.E»x I .Aj v 1JLiIj1v\ XvjVHHYSG 

NNIELGTA\CGKYY\RVCTIi\AIIDSGGL*PFI*KACPEQTW*KEDE 
KSRWESINS 

4425 

A 

1134 

1287 

HGOINOMEVNIi PMDR KV * r TM r rw r T T H' r rTJ r PU ^pri^ptrn^ti corners nvmi imui/ 
VKS 

4426 

A 

102 

443 

SPIIPLPITTINILVYYLLAFLCIYIHT*IYLYIIFFFTNIVSYYTY 

CFITCVFHLIYIVNIIVYSLKNFTDTTCSLW 

SHLPEINAFTLWQVSFQTL 

4427 

A 

263 

615 

LNVSEGNIWKLCHELQHGPLNSSPFLILLSHSEKINRASIMLKRKY 
KL INNYI LS AFNPPPGKIHTHTHIHTHTHTHTHTESQKVKST * E I T * 
I FPQQYTNLCQREEHCYFLSHSE 

4428 

A 

839 

964 

CTOGLVLGQAQLLRPVIPAL*EAEAGRLLEPRSSRSAWANM 

4429 

A 

526 

710 

MG VWI»N I IiTPTFGDOS AKLGTT.OFK'P AP WT?P c; n Qr3P t c e» o t m d c -do t 
PDVA*ENPNGSLDP 

4430 

A 

2 

539 

RLDLPSNIjPGLILTSEHCNEGVOTjPT.<;qt Qr>nQKT \tvtc\t t ut>mtmt 
HQEPREPLx^SWRNFlSTSEKKSSLLSEEQQETSTLVETIRQSIQHNlW 
LKPINLLSQQMKPGMKRQRSLYREILFLSLVSLGRENIDIEAFDNEY 
GIAYNSLSSEILERLQKIDAPPSASVEWCRKCFGAPLI 

4431 

A 

920 

1110 

TAVVFxjNKLQLHF * KFLSEAFDNEYGIAYNSLSSEILERIjQKIDAPP 
S AS VEWCRKCFG APL I- 

4432 

A 

38 

571 

GSHRRCSFITNYVHLYIFTLYIYIYIYIYIYIYIYTQMHSPPAHSVT 
* L YGNTSLSATLERPVTPAWAVGFRVRS EKQL I VYEGARPRG 

4433 

A 

145 

675 

NVAVASLSGAHTWKKQSQVCFVSAGLETSMEAAIGDLSVFPGSLTQL 
EGIQGLVEPVQA*ASSWSSR*AAHSQGPGHPSYHRSHLPKPTQRNPE 
RSQMEATSPKPLDQPVPEKRHRGASA^RRRKPISFGFLAEMLI^^ 
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4434 

A 

328 

333 

RE I ELTNCPHPE PNLATGKMTWLRS S SGFCESNRVT 

AFE FKR I D I LTFKS S FAG S L * E * LHTHTHTHTHTHTNTCFLTVMKEIj 

SNHPGNK 

4. A "i K. 

1 *\ 

14 61 

1710 

SGVLGRAALLKPPSRARSSGGSSLSSIIiRLSSPGPLSSPSLPCPTSQ 
DRG I PFQTFHLNVALT * AAHEMDWREPCVKTRTGS 

4436 

A 

267 

499 

LLLYCGTNLGIPNSQEYFLLFGRQVGLQEQTDLLHFLQRQQRVADRV 
LKQHVQQHHGAI CRVRGRVGRPGVPAQG PQ 

4437 

A 

5 

418 

SSWDALWNFFFFLELLLPAASSGPLLTSSSLENFLVFLEPLLWPDSS 
VSLTVSSLGEDFFLLGRLVLPAVSSRSLLINSSLEKDFFFLGLEKEI 
DGSSIPFS**ISWGFCFFFFLGFSIVSSHALSPSPFCPKLTPL 

4438 

A 

1 

3297 

MGFHHVGHAGLEHLTSGDRLGRRSSSKRALKAEGTPGRRGAQRSQKE 

RAGGSPSPGSPRRKQTGRRRHREELGEQERGEAERTCEGRRKRDERA 

SFQERTAAPKREKEIPRREEKSKRQKKPRSSSLASSASGGESLSEEE 

IAQILEQVEEKKKLIATMRSKPWPMAKKLTELREAQEFVEKYEGALG 

KGKGKQL YAYKMLMAKKWVKFKRDFDNFKTQC I PWEMKI KD IESHFG 

S S VAS YFI FLRWMYGVNLVXjFGLIFGLVI I PEDVYVI PEEPS VMLQE 

LAGKAPLDDKFMYFSSNTGSYGLGFDQGYNYLEAELKKIRFQAHSHG 

YWQISEDTISSLLTWLLAGLRKSASNLTYVAVPKVQGYIKYSALFYG 

YYNNQRTIGWLRYRLPMAYFMVGVSVFGYSLIIVIRSMASNTQGSTG 

EGE SDNFTFS FKMFTS WDYTi I GNS ETADNKYAS I TTS FKES I VDEQE ! 

SNKEENIHLTRFLRVLANFLIICCLCGSGYLIYFVVKRSQQFSKMQN 

VSWYERNEVEIVMSLLGMFCPPLFETIAALENYHPRTGLKWQLGRIF 

ALFLGNLYTFLLALMDDVHLKIiANEETIKNITHWTLFNYYNSSGWNE 

SVPRPPLHPADVPRGSCWETAVGIEFMRLTVSDMLVTYITILLGDFL 

RACFVRFMNYCWCWDLEAGFPSYAEFDISGNVLGLIFNQGMIWMGSF 

YAPGLVGINVLRLLTSMYFQCWAVMSSNVPHERVFKASRSNNFYMGL 

LLLVLFLSDLPVAYTIMSLPPSFDCGPFSGKNRMYDVLQETIENDFP 

TFLGKI FAFLANPGL 1 1 PAI LLMFLAI YYLNS VSKSLSRANAQLRKK 

I QVLRE VEKSHKS VKGKATARDS EDTPKS S S KNATQLQLTKEETTPP 

SASQSQAMDKKAQGPGTSNSASRTTLPASGHLPISRPPGIGPDSGHA 

PSQTHPWRQSLGLGLGLGLGLGLRLRLPWERVPADAEVALTLAPRLP 

FQVLLHVLFEHAVG YALLALKEVEE I SLLQPAACTARSHVAGRPVEB 

SVLNLGQIPQHRSSGGLLSLCLIPGCLGKMPTPCLKGWHEDLRLLL 

EDPPAVQKEEKYSWELGD 

4439 

A 

3 

451 

QTQREPTMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTT ~ 

KTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNAJLSALSDL 

HAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 
TVLTSKYR 

A A a n I 

^» U 1 


26 

529 j 

NSTDSERTHPWLLSPADK\TNVKA\AWG\KVGAHAGEYGAEALERMF 
LSFPTTK\ TYFPHFD \ LSHGF\ AQVKG \HGK\ KVA\DAI»TKAVAHV\ 
DDMPKRRCPP * SDLHAHKL\RVGPGSTFKLL\ SHLPCLGEPWAAHLP 
RPSFNPWRLQRLPWGQSFLGFLLKHRCLNLPNTR 

4441 

A 

2 94 

600 

ISCFLCSETLSNFALFLKESNGPVKVWGSIK/AMTENLHGFHVHEFG 

DNTAGRVLCFFLECIWEL*FII*LFFLLLINR\CTSAGPHFNPLSRK 
HGGPKDEER 

444 "> 1 


517 

758 

CFFIIRHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTIiWH 
EKADDLGKGGNEES TKTGNAGSRLACGVTGIAQ 

4 44^ 1 

1 

o rv 

725 

SPA7VSGGFPLQSLEPRTLGRGLSDFGDRSPVCVLKGRRPKCRGIINF 

\EQKCRESNGP\VKVWGSH/IKGLT*KAJLHGF\HVH\EFG\DNTARL 
YQVQGPHF\NPLSOKTTVGPK\DFP<5Wr*nT nnrnpi Tin^inunnvpT 

\ ED FVNLTLQGD \ HC I \ I G \ RTLWPWKKPDDLGPKVGNE \ ES S KT\ 
GNAGKSVWPCGV\lGiARINIPIxGCSLRPPKLinVfir 

4444 

A 

415 

635 

QQTLVYSAACVICFPHFHLLSPNPILFFFFFAFLFFFFSFYLLYLVC 
LFLVASQVRKLFTERFPLCICTKMNV 

4445 

A 

2 

478 

^GKQAVSASGKWLMGIRKWYYNAAEFNKI^ 
IRRLPENLYNDRMFRIKRALDI^ra^ 
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ALCFLLR*KDQPIECPSRSQEELL*SKLSPL*TAFET*AKENFYLEP 
YLK\EVIRERKEREEWAKK 

4446 

A 

323 

400 

ENFYLEPYLKEVIRERKEREEWAKK 

4447 

A 

2 

396 

RPPTTTKFAAARQMAGKQAV*STQAKGLNG/IFKKWYY\NA^IQNK 
LGLM\RDDTIY\EDEDVKRSP*EDFPENLYNDRMFRH*EGHWTLNLK 
\ HQI LPKEQWTNF\ EEKNFY\ LEP YLKE/ VLFRERKERE \ EWAKK 

4448 

A 

3 

316 

ATVLLD YHLNYLKEVDQLRLERLQI DEQLRQ IGAS SRPPPNRTDKEK 
SYVTDDGQGMGRGSRPYRNRGHGRRGPGYTSESDHRDELSDWSLAPT 
EEERESFLRR 

44 49 

A 

5 

340 

GSGTSAKAFRSIWGPLPPVHRHGSPRSSVQR/DGPGLGTGEPRVYIR 
NKVANTGVPGAPGPS IGGVTAPATDYCHRIAPILAARRRRRRRRRRR 
RRRRRGGGGGVAGGGGGGG 

4450 

A 

1 

117 

LSRMERLRSFNNNNFFFLFFFCFLKCGGLVLFFESFFFK 

4451 

A 

1 

1973 

DGGARARGRAAARRRRRPRRRRRRRRRRRRRRRRRRRRRRRRLGLER 
PQPTSRGRAPGASRAEEKMEELWEVRGSNGAFYKAFVKDVHEDSIT 
VAFENNWQPDRQIPFHDVRFPPPVGYNKDINESDEVEVYSRANEKEP 
CCWWLAKVRM I KGE F YVI E YAACDATYNE I VT I ERLRS VNPNKPATK 
DTFHKI KLDVPEDLRQMCAKEAAHKDFKKAVGAFS VTYDPENYQLVI 
LSINEVTSKRAHMLIDMHFRSLRTKLiSLIMRNEEASKQLESSRQLAS 
RFHEQFXVREDLMGLAIGTHGANIQQARKVPGVTAIDLDEDTCTFHI 
YGEDQDAVKKARSFLEFAEDVIQVPRNLVVIGKNGKLIQEIVDKSGV 
VRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAPEEKKHLDIKE 
NSTHFSQPNSTKVQRGKVPFVFVGTKDSIANATVLLDYHLNYLKEVD 
QLRLERLQIDEQLRQIGASSRPPPNRTDKEKSYVTDDGQGMGRGSRP 
YRNRGHGRRGPGYTSGTNSEASNASETESDHRDELSDWSLAPTEEER 
E S FLRRGDGRRRGGGGKG \ QGGRGRGGG FKGNDDH S RTDNRPRNPRE 
AKGRTTDGSLQNTSSEGSRLRTGKDRNQKKEKPDSVDGQQPLVNGVP 

4452 

A 

1 

2115 

TASAGGGGDGGAAARGRAAARRRRRRRRRRRRRRRRRRRRRRRRRRR 
RRRLGLERPQPTSRGRAPGASRAEEKMEELWEVRGSNGAFYKAFVK 
DVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDINESDEVEVY 
SRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYl^IVTIERLRSV 
NPNKPATKDTFHKI KLDVPEDLRQMCAKEAAHKDFKKAVGAFS VTYD 
PENYQLVILSINEVTSKRAHMLIDMHFRSLRTKLSLIMRNEEASKQL 
ESSRQLASRFHEQFIVREDLMGLAIGTHGANIQQARKVPGVTAIDLD 
EDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGKVIGKNGKL 
IQEIVDKSGWRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAP 
EEKKHLDIKENSTHFSQPNSTKVQRVLVAS SWAGE SQKPELKAWQG 
MVPFVFVGTKDSIANATVLLDYHLNYLKEVDQLRLERLQIDEQLRQI 
GASSRPPPNRTDKEKSYVTDDGQGMGRGSRPYRNRGHGRRGPGYTSG 
TNS E ASNASETESDHRDELSDWSLAPTEEERES FLRRGDGRRRGGGG 
RGQGGRGRGGGFKGNDDHSRTDNRPRNPREAKGRTTDGSLQIRVDCN 
NERSVHTKTLQNTSSEGSRLRTGKDRNQKKEKPDSVDGQQPLVNGVP 

4453 

A 

2 

424 

LFWS PQTQREPTMVLS PADKTNVKAAWGKVGAHAGEYGAEALERHFD 
LSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVD 
PVNFKLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR 

4454 

A 

1 

456 

RPRRPQREPTMVLS PADKTNVKAAWGKVGAHAGE YGAEALERCFCLS 
HHQDLL S \ HFDL SHGS S QVKGHG KKVADALTNAVGHVDDMPNALS AL 

SDLHAHKLRVDPVNFKLLSHCLLVTIJ\AHLPAEFTPAVHAFLDKFIiA 
SVSTVLTSKYR 

4455 

A 

25 

640 

EFHRLRENPPWCLSPA\DKTKAQRPPRLKLGAHA\GEYGGEPLERMF 
LSFPTTKDLLPALRP*ATVSAQG*RAHGK\KVA\DALT\NAVA\HVD 
\DMPQTALSAPERTCNGAQAFGWDPVQLSSS*SHLPCLGEPWAAHLP 
RPSSTPGGCKASLGTKFPGLFVEAPLLEPSKLPLKLGSLRVGHCFFA 
PLGLPPSPSSPSCTRTPWFE 

4456 

A 

906 

1046 

PDHHNWSQ * TTTGAQRQT* KRTVKEV* SAHNEAMCFGTCASDCLYR 

4457 

A 

1 

383 

MVLMPVVSAIPEAEAKSSLIENDVCLSAKENHKSKRAITRTAERFPT 
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E/SPHD/YPRKTARTGLRHAQQASL\PNWSTPLAARIAPVPPAANPQ 
GVLLKSAGPHLLLLARTTTCNR* VP I ASPQTTHRMB 

4458 

A 

1 

1509 

MG S KLS QPL S PGVAWQLHV I LRRTDD WMDGRA WTLM I RMCF S VYTM 
YRRAANVWIiSSGMGAASRQAELDGMPAINAKRVTGSCARMRCCLSEN 
LLSAIETGTYRQSGRERKHQRWCSDGFEFCCDNGEDCVSRSRWTAVI 
VRHCMGGHYRRLQQRVNGSKELAEAFEHYNEWHPHSARVIARHGNIC 
GSGLVMGEVPENAMHDANIRVAIAGAGGLWAARLIQAALALEGVHWA 
LRWSVKDLLYWAATRRYRAKQLDAVKDDFDVFIDFTRPKRWVNVMLK 
LLEKAAKVMGATPISKLLKHIIDIKLIAVRHRTGNGRGDRPRPDKDL 
KDCAVYSREGHTVWCLAPLVLPPCVQVVSMATTAGFTTDTLPAGRS 
FCRYCFYVQPYRRLRRVNGRWPEGDPIIiliLFKQGNLTNRQYPANHAM 
FQDNGFQQLCRVGRYGGNCGMI I STSG S TAFQQAATI P VE I QLLMNM 
PH S KYLTCGNFAS TVS I R PR Y ADCVDRAKLHPQ 

4459 

A 

579 

919 

PGSM/IPLSVSHSTGDDGRSLPKRRSTAPSMTSVYCFNCERCLVSCS 
LLLLSSLGPPCIAVLI^FFAPFF*VFIiLCVVVAGIiPALIiLLLVSSIi 
LFLVLLS FCAAS ACFGFLS V 

4460 

A 

1477 

1691 

YCNVSFG ANLERA* TSFS ALF IDLQPPGYRTTTS KHKVS S SLI KGHV 
LLDHSFHDLNTQLWISTNAFRFGN 

4461 

A 

1076 

1354 

QQKCAVNLHSFLPITKLCRLFASCRAARCCSHLTSQPVTA*LAGKVN 
PLPLGNLWECDTKAGSQAMRTHLS ITWRAVMQLPVINTTYQRGKR 

4462 

A 

880 

1242 

RTDAVLFDQWSQRVEVRLFRHSLVERGVKYSNVFVFQLWERFQSFF 
DTOQVSR WQRCKRS C I FDTLNNRL IDHYG AG IL»F AAVYDTVTDS S Q 
LRRQFWFLCQNS INDKVERFAVSGACT 

4463 

A 

832 

1386 

TIIRQPLHWRRRKIVAEA71FHRSQHDVLYCFTVEAAGSDRPVQCLTI 
TAVQRERDTQSLSVITAELEPVRAPSLIAFFHGHSACMCPISTARYS 
RFSLKQQR ILAHDPVNTFG IDRRHT I KFCLS AKQRPYPT ITIRW.QLS 
DNMVYTEKHIRI I S VTTAAAIHPVIGS SENDVQLRTRHPETTAD 

44 64 

A 

46 

492 

HPVTLGDNTYRTH I KQL I KGEVLFLHFTPDAVDVFRTT/ MTL PL * RL 
QLS SQRADDQRTLRCSVRGRDAVHAAVWQFVCIRLD\ EVTEAVI FQF 
PFQLSDSETVSQRCVDVGTLFCRQYTFIFRRIFYFAQMSNTLRLFDD 
YAEEIIDHRQ 

4465 

A 

7259 

7441 

IAS PGWS * RQECSSRSCAQCCCQSTTTCRHAKQRSLAIiAHKPLAKLA 
GNGITPRLRQICR 

4466 

A 

538 

1056 

TIIRQPLHWRRRKIVAEAAFHRSQHDVLYCFTVEAAGSDRPVQCLTI 
TAVQRERDTQSLSVITAELEPVRAPSLIAFFHGHSACMCPFRVJRYSR 
FSLKQQRILAHDPVNTFGIDRRHTIKFCLSAKQRPYPTITIRWQLSD 
NMVYTEKHIRIISVTTAAAIHPVIGSFPIPGQ 

4467 

A 

1 

1134 

MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQI*F 
LWRKQ YQEG S LTAVAAGEQ WPAS ELAAAMKQI KEIjQRLIjGKKTMEN 
ELLKEAVEYGRAKKWIAHAPLLPGDGDRHTDDTDVLLRIHHVIGELP 
TYGYRRVWALLPRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPP 
SKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREAL 
HWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRA 
NETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPD 
GLTAAKNLAEAFEHYNEWHPHSALGYRS PRE YLRQRACNGLSDNRCL 
EI 

4468 

A 

745 

1263 

TIIRQPLHWRRRKIVAEAAFHRSQHDVLYCFTVEAAGSDRPVQCLTI 
TAVQRERDTQSLSVITAELEPVRAPSLIAFFHGHSACMCPFRWRYSR 
FSLKQQRIIxAHDPVWTFGIDRRHTIKFCLSAKQRPYPTITIRWQLSD 
NMVYTEKHIRIISVTTAAAIHPVIGSFPIPGQ 

4469 

A 

3506 

3826 

EVWRPARRSRDVSCSRSHAYYWPGTADWLWPS *RPSTTGVCG* PVT 
WRWLEGQWTPHLGRVLQEDSDSPTFPSALGIQETRPEPLKSPREVPT 
SGR TGVFHYPVA 

4470 

A 

1 

751 

MRDFAGCPRSSVSETGNRRPNAGHNQEWNGGKDETHDTNQSRTRHYG 
VRNFSTIPNYFEGNPKWGPEKRRRRTTQEKIAIVQQSFEPGMTVSLV 
ARQHGVAASQLFLWRKQYQEGSLTAVAAGEQWPASELAAAMKQIKE 
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LQRLLGKKTMENELLKRAVEYGRAKKWITOiAPLLPGDGERTDDWMDG 
RRSRHTDDTDVLLRIHHVIGELPTYGYRRWALLRRQAELDGMPAIN 
AKRVYR IMRQNAP VA 

4471 

A 

462 

1200 

VFSGPFSGVGFFPFSGLCPCELSATVLEVAPPLLPGNRMSSRITMLS 
ESCQRIWIVGYGRCFADRQNLMVCLRSMPNVFTGSCARMRCCLSENIj 
/PVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCC 
DREALH WAVTTGGFNS ETVQDVMLGAVERRFGNDLPS S PVEWREGVF 
VRNIDEEFGNESAQTGMLHHQLDPSVEGELHQNTGSTACCRADSTHI 
SHWRFM 

4472 

A 

1271 

1681 

TVITTGIGRPFSMESVFALNALQNSMMFTPC*PRAGPTGGLGCTSFS 
DEPILDGWPPQSSQLMIRMCFSVYTMLSESCQGMVIVGYGRCFADRQ 
NLMVCLRSMPNVFTGSCARMRCCLSENLLYRHRNGHIQAEWP 

4473 

A 

739 

1257 

TIIRQPLHWRRRKIVAEAAFHRSQHDVLYCFTVEAAGSDRPVQCLTI 
TAVQRERDTQSLSVITAELEPVRAPSLIAFFHGHSACMCPFRWRYSR 
FSLKQQRILAHDPVNTFGIDRRHTIKFCLSAKQRPYPTITIRWQLSD 
NMVYTEKHIRIISVTTAAAIHPVIGSFPIPGQ 

4474 

A 

1 

2661 

EGRTKEHERKKERKTTTTKDNERKNRTEQKRKKAQNRKKERKQRHEQ 
TEDRKKERTTERTKEGRKDGRKEGTEGTEGRKERKNTTGNLQTINQK 
VQTPLAFVTAGEDQDNYLEPVWRKRTTNHRHGGGDS AFE IKPVLLRA 
FRTTKVIFMTQTFIPGKDAAIiEDSIARFQQKLSDLGFQIEEASWLNP 
VPNVWSVHIRDKECTUjCFTNGKGATKKAAIiASAIiGEYFERLSTNYFF 
ADFWLGETIANGPFVHYPNEKWFPLTENDDVPEGLLDDRIiRAFYDPE 
NELTGSMLIDLQSGNEDRGICGLPFTRQSDNQTVYIPMNIIGNI.YVS 
NGMS AGNTRNEARVQGLSEVFERYVKNRI I AES I SLPE I PADVLARY 
PAWEAI ETLEAEGFP I FAYDGS LGGQ YQPLHWRRRKI VAEAAFHRS 
QHDVL YCFTVEAAG SDRPVQCLT I TAVQRERDTQS L SV ITAELE PVR 
APSLIAFFHGHSACMCPFRWRYSRFSLKQQRILAHDPVNTFGIDRRH 
TIKFCLSAKQRPYPTIPYVGSSPIHGVYGEAHPYHQCDDCGGIHPVI 
GSSENDVQLRTRHRRQRLTKLTPHPRAIRARKTLGGDSCTMTNCGSE 
EAVATNALQPMPKSQTHLGELQSLLRTYNFDVIDGVNYQIDVTQPAR 
YDGECQMINANAERIKNLTFNGKPIDPNAMFLVATNNYRAYGGKFAG 
TGDSHIAFASPDENRSVIiAAWIADESKRAGEIHPAADNNWRIiAPIAG 
DKKLDIRFETSPSDKAAAPIKEKGQYPENKlWNTTVSLVLIiKYPMVP 
CEFAVAPGAFENQVGKQSREAQLKTVANIESSFRSAPVSETREARFR 
QTIWGKESSYCLCGKTGQQKYLSPVPVDAWVGNTTLHPIDR 

4475 

A 

1 

2370 

MSKSANR I KPKKNTQINNTQSKATRTYNTNTQYKPGKCRTQGWIPVC 
RAYRQYDRGRDQRSSEYSRRRLRPSAPTSAKSHRRRHRWSVYQQSGH 
SDAGFS SRERFLNLAS PWHCHI VEENGHREGGNLPFGDS WANTVDK 
ETNFF I AQRMTI TFFTNNFLCEKQAAGALTVAVTQRFQQACQQAGSA 
KAWTR I C P VSTEQF I CNGE I FLR I FRS THAARRFETNGFF YAH I TD 
HFHHDARAFRRRVHSHFTGRGFDEVRTRFDGDFRRFTDQRFVFQFAG 
FDDH FQQ Y I RCRAS LFTG FHQVKTNLL VTRHQRAVREHNVNF I S TVG 
DCRTGFCQRDYDVWTVWEVGYRSDTDFGVPCSFRVCEHSDNAMIDT 
DRRVCRAEFGLMTQAITSRRCSSKQWPASELAAAMKQIKELQRLLG 
KKTMENELLKEAVEYGRAKKWIAHAPLLPGDGERTDDWMIX5RRSRHT 
DDTDVLLR IHHVIGELPTYG YRRVWALLRRQAELDGMPAINAKRVYR 
IMRQNAIiLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNG 
ERLRVTFAIjD ccdrealhwavttgg fns ETVQDVMLGAVERRFGNDL 
PSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAES 
FVKT I KRDY I S IMPKPDGLTAAKNOiAEAFEH YNE WHPH I R YRNQDDH 
ELQITHGTRFYVELCGESLPAEGTGLTPLRTHARFQVDHNALVSRRV 
VRSCTCCLYQVPKLRS PDTI AQTALSLAI S YTHTASGH 

4476 

A 

421 

972 

TI IRQPLHWRRRKI VAEAAFHRS QHDVL YCFTVEAAGSDRPVQCLTI 
TAVQRERDTQS LSVITAELEPVRAPSL I AFFHGHSACMCPFRWRYSR 
FSLKQQRILAHDPVNTFGIDRRHTIKFCLSAKQRPYPTITIRWQLSD 
NMVYTEKH I R 1 1 S VTTAAAIHPV IGSS END VQLRTRH PETTGD 
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4477 

A 

766 

1073 

HRQLVIVFQQPLVSGFHEAELSLDDAXWVLHPGPDAGFHVFDVDGRP 
VLA\ * CCFKVLTLPGRSAI SQST\ PPRPAPRAVAPLGSRHRLRQIAP 
LHAADYPAD 

4478 

A 

1662 

2045 

PTALAAFSRRIVRITFLPVGDPGFHQSGSLRLS/NPDALQIQPACRI 
FSSALITSVFACSVHEGE*LPVDPGIIRALLHR*R*KAESACAVPCI 
*AWTDREYGCHNRRKSGVILRRTDDWMDGRRSRHT 

4479 

A 

1 

1803 

MTRDNPVIPRFIHRREWFIGQENGGIQNARFIAALFFQHGVNLCQR 
VGGLLKGIGVEVFRYTSWERVWDYYIRPAGFSIDTNDRGSVTDDF 
APDGQLAKAI PGFKPREPQRQMAVAVTQAI EKGQPLWEAGTGTGKT 
Y AYLAPALRAKKKV IIS TG S KALQDQL Y S RDLPTVS KALKYTGNVAL 
LKGRSNYIiCLERLEQQAIiAGGDLPVQILSDVILLRSWSNQTVDGDIS 
TCVSVAEDSQAWPLVTSTNDNCLGSDCPMYKDCFVVKARKKAMDADV 
VVVNHHLFIJAD^T^A^KESGFGELI PEADVMI FDEAHQLPDIASQYFGQ 
S L S S RQLLDLAKD I T I AYRTELKDTQQLQKCADRLAQS AQDFRLQLG 
EPGYRGNLRELLANPQIQRAFLLLDDTLELCYDVAKLSLGRSAIiLDA 
AFERATLYRTRLKRLKEINQPGYSYWYECTSRHFTLALTPLSVADKF 
KELMAQKPGSWIFTSATLSVNDDLHHFTSRLGIEQADSCSIGNSYVF 
RLVIGEQMIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHG 
VAASQLFLWRKQYQEGSFICCRPPENRLFLPIiNIiLPP 

4480 

A 

230 

368 

LGPLPQGTFS PEG WRQP * PCGWPPCRAFMES S S LQKRMRSVLDPLB 

4481 

A 

3 

334 

QGYEYDNLYVHFFVELPTAHWSSPAFQQLSGVTQTCTTKSLAMDKVA 
HFSYPFTFEAFFLHEDESSDALPEWPVLYCEVLSLDFWQRYRVEGYG 
AWLPATPATMTWPWH 

4482 

A 

47 

2 96 

QKTKKQTNKKTQ*KIPPS*QKGLSTAFSLHIiLRNNSYLNLSKPQAVE 
♦GFPGKRIiRLTGWGFKTLSPSTFLPSLVLEDERLHF 

4483 

A 

2 

1317 

YTDSDRYTNLEEHCQRMTTAASEVPSFLVERMANVRRRRQDRRGMEG 
GILKSRIVTWEPSEEFVRNNHVINTPLQTMHIMADLGPYKKLGYKKY 
EHVLCTLKVDSNGVITVKPDFTGLKGPYRIETEGEKQELWKYTIDNV 
S PHAQPEEEERERRVFKDL YGRHKE YLS S LVGTDFEMTVPGALRLFV 
NGE WSAQGYEYDNLYVHFFVELPTARELTI FGLWLSDWSS PAFQQL 
SGVTQTCTTKSLAMDKVAHFSYPFTFEAFFLHEDESSDAIiPEWPVIiY 
CEVLSLDFWQRYRVEGYGAWLPATPGSHTLTVSTWRPVELGTVAEL 
RRFFIGGSLELEDLSYVRIPGSFKGERLSRFGLRTETTGTVTFRLHC 
LQQS RAFME S S SLQKRMRS VLDRLEG FS QQS S IHNVLEAFRRARRRM 
QEARESLPQELMGPF 

4484 

A 

875 

1075 

TS LTFFSD P FADAS KGDDLL PAGTED Y IH I R I QQRNGRKTLTTVQG I 
ADDYDKKKLVKAFKKVGLQ 

4485 

A 

2097 

2179 

DQEIKDCVCVCVCVCVFKCLNLVYLSL 

4486 

A 

4274 

5255 

HTLFGDKLCTVATLRETYGEMADCCAKQEPERNECFLQHKDDNPNLP 
RL VRPEVD VMCTAFHDNGETFLKK* V IRCL * FKI KKHGVTP * ANTL * 
KLP*QKYFQH*DLEVLL* * FFKEWFDTTKFYTAKNMIKDILKFIET 
GYNItSQKFKIDKFFNVFRRYVYMWI IDFVLVSNI ILPKFNHLCTHT 
HTHTHLTLFSTYLKNDRDKTIMCKLSLIG* L\ ESLEFGGSGENVDYN 
YFCNIVCYRK/ADCFSFLKFRYLYEIARRHPYFYAPELIiFFAKRYKA 
AFTECCQAADKAACLLPKVLCTRIEKKSLLSNIilLSILWLDLGTLSV 

4487 

C 

102 

230 

MFLSCRSVXXXXXXXXXXXXXXXSPLFLSPLLFLLFHTYMRA* 

4488 

A 

2 

567 

VKVRLMREYKVVVIiGSGGVGKSALTVQFVTGTFIEKYDPTIEDFYRK 
E I EVDS SPSVLEI LDTAGTEQFASMRDL Y I KNGQGFI LVYS LVNQQS 
FQDIKPMRDQIVRVKRYEKVPLILVGNKVDLEPEREVMSSEGRALAQ 
EWGCPFMETSAKSKSMVDELFAEIVRQMNYSSLPEKQDQCCTTCWQ 

4489 

A 

125 

522 

NTPQQGRRSYTTTWMNPKNLTLKTLHNLVLTFISSIVTSFPLITPWS 
IL ILEDCQLSLIRS IKVLTYFPFFYVKSDLLS FLNILYFLI PKPFPI 
SFLLPRMPLLLSYAYPKPHLSRTSSKPTLTHQNALGLS 

4490 

A 

195 

473 

TS EAQS TKVFG SGCWGWSNQLNS NF PL I L I FS L I PLS AVAR PLCAE P 
CVSIGSCPFHFSVQSGTISHLLCIFPKKCWTSLGYILFFLILCNS 

4491 

A 

1 I 

966 

MSVGNLEEIIDDNHAIVSTSVGSEHYDSIISFVEKDLLEPGCSILLR 
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HKVHAVIGVIJ^DTGPLVTMMKVEKAPQETYVNTGGLDNQIQEIKES 
MELPLPHPBYYEEMGTKPPKGVILCGPPGTATFLRWGSELIQKYIiG 
DGPKLVRQVFQVAEEHAPSIMFTDEIEAIGTKRYDSNSGGEREIQQT 
MLELELLNQLGGFDSREDVKVIMATKQVETLDPVLIRPGRIDKKIEF 
HLPDEKTKKHI FQIHTSRMTLANDVTLDDLIMAKDDFSGADI KAICT 
EAGLMALREHRMKATNEDFKKSIESVLYKKHEGIPEGLYL 

4492 

A 

1 

960 

MWESLELPRDLEGSEDFLNGFNQNADVDNEVQAKWSDGDEELVRNW 
SKGKSGETLYLRLRKAGNKFLPGGILHNIVIKHGLSLAEEADGNKSN 
RGEELSESEEKLFYTSIHEANQLNKTVKSAQISESDFNFPSVSSESH 
PTDSGSILCECTVQRPAVQGPGASRATLLLQPLVENPPLLPPSSGCC 
HSLWLHRPNLCFHFHMAFSSVYPSNQVLAMTTGLSCSDSSKEVLEET 
S KS GERAVPHS ADFLKKR I FQ I HTS RMTLADDVTLDDL I MAKDDL S G 
ADIKAICTEAGLMALRERRMKVTNEDYIDFSKIEIREG 

4493 

A 

1743 

1935 

L PNL F V P AGR I DRK I E F PL PDEKTKKR I FQ I HTSRMTLADD VTLDDL 
IMAKDDLSGADI KVRA 

4494 

A 

1 

1540 

NSGGSGGGTSGSGSSSGQGKMGQSQSGGHGPGGGKKDDEDKKKKYEP 
PVPTRVGKKKKKTKGPDAAS KLPLVTPHTQCRLKLLKLER I KD YLLM 
EEEFIRNQEQMKPLEEKQEEERSKVDDLRGTPMSVGTLEEIIDDNHA 
I VSTS VGSEHYVS ILS FVDKDLLEPGCSVLLNHKVHAVIGVLMDDTD 
PLVTVMKVEKAPQETYADIGGLDNQIQEIKESVELPLTHPEYYEEMG 
IKPPKGV\ILLVPPG\TGKTFLPKAVANQT\SATFLRWGSELIQKY 
IX3DGPKT/LSRELFR\VSEELAPSLRFIDRILPPLGHKRYD\SNS\G 
VRRELQRTMLELELLNPVGMGFDS * GRL * KLS WATNRI \ ETL\DPAIi 
IRPRPAFDSKIE\FPLH\DEKTKKRIFQIHTSRMTLADDVP\liDD\L 
IHGLKIDLS\GA*HSRAICTEAGL\MGL*GERRMKVTNEDFKKSKEN 
VLYKKQEGTPEGWSLMNHGLSSGKWIiGDFSIPERDEVGGSCPEESL 
FPLIFY 

4495 

A 

125 

320 

KDATGLFL I FETGSRS VTPAGAQWCNY S S LQPELSRFQGS S YLSIiPS 
GLGTIGVRHPHPAVTGLV 

4496 

A 

1376 

1509 

ERKMGFPFSDHGPLES I PWPLPS IQYKPE I RTYVLTCVCECVCV 

4497 

A 

2120 

2313 

NSLRFFFFFLQFSIFIFLSSSTMVSVSVFYMWPKKILPMRSREAKIL 
DTPCLGEHCFVYLVSSS 

4498 

A 

2 

266 

KGSTEAFISGTAGWGTGLLPSSAGLPGGWGPAGGWAGTDRRGPRARP 
I PQKS P P WP W SGD AAKGQS G FLP VAAWAGQGRL PGGG 1 1 VH 

4499 

A 

918 

1475 

PEPTTVFEWHGVGPAWPWESVPRWGRAGLPERDWVGGCPRITRDGGG 
GVSGSWLRSLRGPLWRCPLATELPPQLRKPRARSQSGRQMEPDNRRG 
RQRDRSGWQGGPWTAKAKPGVEKLFPRPKTQALCPNPTQTVPKTRLC 
LDPDRVRRQPTAYSQVRPSSQRSARAVGRPRRGRSEGLTKPSNR 

4500 

A 

1 

822 

FFFFKKGSTEAFISGTAGWGTGLLPSSAGLPGGWGPAGGWAGTDRRG 
PRAQAHPTEKPSLAMVGRCSQRTVRLPPGGGVGLGKEGFPEEASLST 
EKHRGAWP L PVCLLGAGHGWALQTLLQPWEDSRVRGVAGWGAWWGRE 
GAGGTTGGTCRGPLGHWYWLGEDSVLQRLCDMSSYTTHAAEFLSRHE 
HS FF I CVLASTHATPRPS PRRKE I TARAAGRAEAAGG SGRAEAVCGS 
GRAEPVGGAGRAEALGGASEHLCHPWDCSSVTRTRVPEP 

4501 

A 

158 

668 

MEVYHPLWAAFPSNPTSGKTRAPARRGPLPASHRPRAGPRSEGLGPP 
TSGAGERVFPYATCPAHRRGAGIRALGSSLFTRRLLRESWLVSFPPIi ! 
TNMLKFSGSPRLLVPNSARTVMPLDVRGCTRATLTGSACAYPTPAGA 
GNPLNPIRDGDRGLQLFPHERGIPSKCGS 

4502 

B 

151 

336 

ANSDSMVGYVLGPFFLITLVGVWAVGGPAAHHIiLPMYSYDPAEELH 
h> Lj u fc> IjMoDP kv * 

4503 

A 

748 

949 

HQQPEGWWSMGSWRFJGRAPPLGSLPPLQPRQERPRLAPGPRKSRQA 
RDQRTRGNGPGLSTRTQPIR 

4504 

A 

1139 

1241 

WYEFRKMKMSSLGKEYSRKMEHVQRLSVQTVWKA 

4505 

A 

334 

466 

SSGRGGGPLGSLSSGWRCFWRLGAGHSVAPLTRLPYREGNLFAK 

4506 

A 

1235 

1446 

NYLQPPLSRDNISVGILFCQARWLTPLIPVLWEAEAGGSLEPRNSRL 
HHCTPAWETEQDPNPPPAKKKKK 
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4507 

A 

41 

489 

RLQLLNSNYVKPPSLSGGFPRGIQDPPAGCQAAGISAGEEGSPIPTA 
MRCPGDWRSSGRLGSQSPAYSSSRECRALEQDQAASADTESLQGEVR 
SLFLSYFSLRLPSQQAAIEHSAIARQSPSGLQNALRQAHRENQSRLL 
APVGKMAL 

4508 

C 

187 

342 

MICKNKNKVILLXXXXXXXXXXXXXGGAXXXXXTTPPRRRGKKQFFL 
PPKQ* 1 

4509 

A 

1304 

1434 

KKLPEMEGPDETLKHTNTHTHTHTHTHTHTHTRIjHESKIiLHEV 

4510 

A 

1 

1383 

MPGRAPLRTVPGALGAWLLGGLWAWTLCGLCSLGAVGAPRPCQAPQQ 
WEGRQVM YQQS SGRNSRALLS YDGLNQRVRVLDE RKAL I PCKRE I FR 
EEGLLGIAWSGTGRGAAPTEGGLLAGDSVTPSEAVAECIPKMWTQNG 
AMWSKSGSENTILEI\AA^QVTDQASVAIiDATQGHAGPFPAQLCWQIi 
LPGSLHGALESS ILTEMIELSS IKSCKPKMKESLLEQRKLIVTLLFE 
YILLYKDGVMFQIDQATKQCSKMTLTQPWDPLDIPQNSTFEDQYSIG 
GPQEQIWQEWSDRKSARSYETWIGIYTVKDCYPVQETFTINYSVIIi 
STRFFDIQLGIKDPSVFTPPSTCQMAQLEKMSEDCSWTLIHYLGCEL 
LGYGYLQKEVKKKEQEGEEEDTKKKRKKKKKKKRKRKKNKRRRRRKR 
KRRKRKRKREKEKEKRKRRGRREEEREGRGMEEEKRRM 

4511 

A 

1 

674 

MPGRAPLRTVPGALGAWLIiGGLWAWTLCGLCSLGAVGAPRPCQAPQQ 
WEGRQVMYQQSSGRNSRALLSYDGLNQRVRVLDERKALIPCKRIiFEY 
ILLYKDGVMFQIDQATKQCSKMTLTQPWDPLDIPQNSTFEDQYSIGG 
PQEQITVQEWSDRKSARSYETWIGIYTVKDCYPVQETFTINYSVILS 
TRFFDIQLGIKDPSVFTPPSTCQMAQLEKMSEDCSW 

4512 

A 

19 

249 

KFLWLLTTQVCTHVAARQRSPTRLPVSCTQTCGQIVASGCLRHFPLC 
IGFSCISNYSCNRHPPHCAHWFCLCPQELI 

4513 

A 

1000 

1174 

GNRKMGRWPMGNQVRARQTPNPMQQHSCPRSLPGPSSSHSHNLSSGT 
RPSSPDTPPER 

4514 

B 

60 

378 

NAVLEADFAKRG YKIiPKVRKTGTTI AGVVYKDG I VLGADTRATEGMV 
VADKNC SKI HF I SPN I YCCGAGTAADTDMTTQL I S S LAAMAVFEDKF 
RPDMEEEEAKNLX* 

4515 

A 

1 

437 

DPRATEGMWADKTCQKSTGRLPELVTAIRMLKQMLFRYQGYIGA/Ui 
VLGGVDVTGP/HLYSIYPHGSTDIAAGIFNDLGSGSNIDLCVISKNK 
LDFIiRPYTVPNKJCGTRLGRYRCEKGTTAVLTEKITPLEIEVLEETVQ 
TMDTS 

4516 

A 

1 

1059 

MRKKFLGSTILDLHYSGCRALGYSCGDHYVADVDLFVPFTSAAHLSC 
SGKWNCTLESRDTLCDAEEEAKNIjVSEAITVAGIFNDLGSGSNIDLCV 
I S KNKLDFLRP YTVTNKKGTS PEEVSTTFCRLAS I SHCTHNSTSGVI 
HKEGMERSEPAGIHCGLHSTLRKI,SGLLMGWGSTGPENAGTGVGECW 
WLAWWL PANQS VLEMELNQMTSNE PTKSLG PLAGRQATF QAQS CVGE 
PGFAELQPSSTRSCSPLARKAEALNACRKNRALSPQCPCVQGRGWVF 
REHKAMAQPAAFWQFIiKDS KFDRD ITPAGARAEEQLGRYRCEKGTTA 
VLTEKITPLEIEVLEETVQTMDTS 

4517 

A 

1115 

1307 

FGRGHYCRRSVSQEEEAKNLVSEAIAAGIFNDLGSGSNIDLCVISKN 
KLDFLRPYTVPNKKGTR 

4518 

A 

3 

866 

FI^KMAAVSVYAPPVGGFSFDNCRRNAVLEADFAKRGYKL\PRPRKT 
GTTIAGVVYKIX3IVLGADTRATEGMVV7VDKNCSKIHFISPNIYCCGA 
G\ TAADTAMT\ TQL I S S \ NLKLHSL \ S TGR\ LPRV\ VTA\ NRMLKQM 
LFRYQGYIGAALVLGGVDVTGPHLY\SIYP\HGSTEKVP\YVTHGFL 
APLA\AMALFER*V*APDMEEEEA\KNLVSEDSPPQFPPPSWRIFN\ 
DLGSRKPNIDPLPSSARNKLGFSPPHTQLPNKKGTRLGWP/RYRCEK 
G\ TTAVLTEKI PLLWST 

4519 

A 

4 

181 

SCVHHWEWREINDWMVAANI SFKI EHS KTKGCLKPVPCPSDLHKHSS 
PGKVEPPSLSPE 

4520 

B 

55 

372 

MERFLMDGFQPQQLSTYALTLYKHTATVDGKTIIjVADINVTQKSFNF 
AKKFSLPLYFVSAADGTNVVKLFNDAIRLAVSYKQNSQDFMDEIFQE 
LENFSLEQEEEN* 

4521 

A 

1 

270 

VFDVQRKVTYRNLSTWYTELREFRPEIPCIWANKIDDRPMSYLLST 
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ADINVTQKSFNFAKKFSLPLYFVSAADGTNWKVWLTAEVASK 

*X _J ^ ^ 

n 

•> 

-3 

347 

F WDTAGQERFQS MHAS YYHKAHAC I MDAVLTAPPGDPG WKQS LQEK 

NAAEERAEKGKRSSVYLCPVEVPTREFPGWVFDVQRKVTYRNLSTWY 
TELREFRPEIPCIWANKID 

4 523 

J± 


C. Ck £Z 
DO O 

TQACCFLLSSSRPTQWSAET* * *HSGQACFSWGCSSEDIWAASGVAV 
SWAA 

4524 

A 

2 

274 

LDQGKYDADDNVKIICLGDSAVGKSKYVGRLGNKYQPQRGSRSFKKS 
SRGRGAWLLGAGEGAVLAGAQMFMGERGESHDPLCVSGGGGRS 

4525 

A 

26 

410 

RERQKAKS S FMAPCLKTFFLLFRDRVSLCRPGWS AVARS RLTAASNS 
WTQAILPPRHPWTIGVSHRLRAKTCLSVSSPSKWVCYLSINVSGHQP 
CPGIARLPVPPPTTASRSHLSVLPPTRTPFRPPR 

4526 

A 

791 

1407 

PECSQPQQLSTYALTLYKHTATVDGKTILVDFWDTAGQERFQSMHAS 
YYHKAHAC I M/LDINVTQKS FN FAKXFSLPLYFVSAADGTNVVKLFN 
DAIRIJWFYKQNSQDFMDEIFQELEVGQVHISGGMEETAPLQG*GLQ 
PSRVTLA*VCPTKCIRAAVEQMQGQASPATLFT\NFSLEQEEEDVPD 
QEQSSSIETPSEEVASPHS 

4527 

A 

203 

92 8 

EPMAEDKTKPSELDQGKYDADDNVKIICLGDSAVGKSKLMERFLMDG 
FQ PQQL S T Y ALTL YKHTATVDGRT I LVD FWDTAGQERFQ SMHAS YYH 
KAHACIMVFDVQRKVTYRNLSTWYTELREFRP * I PCI VVANKIDADI 
NVTQKS FNFAKKFSLPLYFVS AA\ DGTOVVKVWLTAEVAS KLFNDAI 

RLAVSYKQNSQDFMDEIFQELENFSLEQEEEDVPDQEQSSSIETPSE 
EAASPHS 

4528 

A 

674 

1910 

SLSTTPTARSPSPALGRMRTPCGPPARRRWAASDRRGPSRSAGQPPS 

GSLRGCVAAARTRGEPEVESLRRELGRRAWFESAQSPDWRQGPKGPT- 

RSVPGLSSPHSEPHEPMAEDKTKPSELDQGKYDADDNVKIICLGDSA 

VGKSKLMERFLMDGLYPSRFEVLLVPVGLPTLMYQCPTAHPFVPAAQ 

EQEGGLDFDGQDTLSSPPTPHPSMELVPVCSQPQQLSTYALTLYKHT 

ATVDGKTI LVDFWDTAGQERFQSMHAS YYHKAHAC IMVFD I QRKVTY 

RNLSTWYTELREFRPEIPCIWANKIDAJDINVTQKSFNFAKKFSLPL 

YFVS AADGTNWKVWLTAEVAS KLFNDAI RLAVS YKQNSQDFMDE I F 

QELENFSLEQEEEDVPDQEQSSSIETPSEEVASPHS 

4529 

c 

58 


MLRYVETLLTHMGVNPLEAEELIRDEDSPDFEFIVYDSWKITGRTAB 
NTFNKTHTFHFLLGSIYGECPVPKPRVDRPRKVPVIQEERAMPAHLI 
P PM S F FD FVGD PHS F PRTVEN I FHVS F 1 1 RDGXAE * 

4530 

A 

2 

329 

PKPRVDRPRKVPVIQEERAMPAQLRRMEESHQEATEKEVERILGLLQ 

TYFREDPDTPMSFFDFWDPHSFPRTVENIFHVSFIIRDGFARIRLD 
QDRLPVIDTENMASV 

4531 


207 

J _L o 

WSVPSKRSSPGCPLSCFGFRFGQRESKAAALRPELL 

4532 

A 

1 

178 

IFSRRSDTQWSFFDFWDPHSFPRTVENIFHVSFIIRDGFARIRLDQ 
DRLPVIGDCEDL 

4 533 

TV 

X 

44 0 

SGRGPEGRGRGRDPHRDRTRSRSRSRSPLSPRSRRGSARERREAPER 
PSLEDTEPSDSGDEMMDPASLEAEADQGLCRQIRHQYRALINSVQRK 
AAPPGGAGPDGPLSPPCAGRGGRAPEAGAGRRPGAALVETRRVPAPR 

4534 

A 

23 

370 

TMNSKSGLS S S RM S S S AS SG FTP I CV S R VS TYLNMS KELR S ERS CFA 
FSLPKSEAKTRKWASRTASLARDTSLNRVLASSRTSVNLSPAFSMSS 
Kb Jb X b KNTNFGKFVQHSLISS 

4535 

A 

600 

997 

QGRTAENTFNKTHTFHFLLGS I YGECP VPKPRVDRPRKVP VI QEERA 
MPAQLRKNGKNLIKKSTEKEVERILGLLQTYFREDPDTPMSFFDFW 
DPHS FPRTVENI FHVSFI I RDGFAR IRLDODRLP VT T)X 

4536 

A 

195 

489 

NLVYISKVRIKKLPCEFRWFMKYQHVDPEEAVRIHTDVQTKKSMAIH 

WGTFALANEHYLEPPVKLNEALERYGLNAEDFFVLKHGESRYLNNND 
ENF 

4537 

A 

535 

1631 

EDVKIQHGIPEQVILLGFLGKASNWDYRLEEGVTKSKKGKDGRFVNP 
WPTWKNPSIPNVLRWLIMEKDHSSVPSSKEELDKELPVLKPYFITNP 
EEAGWEAGLRVTWLGHATVMVEMDELIFLTDPIFSSRASPSQYMGP 


lows' 
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KRFRRSPCTISELPPIDAVLISHNHYDHLDYNSVIALNERFGNELRW 
FVPLGLLDWMQKWGCENVIELDWWBENCVPGHDKVTFVFTPSQHWCK 
RTLMDDNKVLWGSWSVliGPWNRFFFAGDTGYCPAFEEIGKRFGPFDL 
AAIPIGAYEPRWFMKYQHVDPEEAVRIHTDVQPKKSMAIHWGTFALA 
NEHYLEPPVKIiNEALERYGLNAEDFFVIiKHGESRIP 

4538 

A 

1511 

1687 

GESVPLEDRVLRCFHSKQDSDNSTALLPSLAPFLSHLLGLFASCKLC 
GWTLRDPE FR I 

4539 

A 

425 

619 

EKKTENTCQVFL FS AVLCAPAPGAVQGMG S LL S WGNNR CLVPGCTP P 
TS SAGS VP I TPTRS CCP 

4540 

A 

122 

621 

TFARWMPERSWLNSQMEGEKPHLHCPSPTGQTPLPPSTPPSSRKSCP 
HTGFHRAPLSSGHLEAGWELTHVCRLKKDSAGSAQPCRGFCLCPDVC 
FPVAFSLGEENRETLVKCFCFLPCCVPPPPGAVQGMGSLLSWGNNRC 
LVPGCTPPTSSAGSVPITPTRSCCP 


7k 
A 

2 3 15 

2790 

AATTPALRPLRRSSFLMATTITKASTGTWSSSPSLPTPPPTSTLRKA 
PTTLI QKWRPGAQQPS CPKPRS * PS S STPRTGP I PGTSTSE PMTTQW 
P*STPSMR*LPPAX.THPRSCVPSRTAAWVLGWYATHIERWLSAYSRQ 
PDSGLGWQTCFAQNLPK 

4 542 


<r/ c 

*1 c c 

PS YI AYMC I CMCVCVYVYINICVC I YTHVHTP YNPVI SKN 

4543 

A 

294 

504 

FFYFPQWFFIPPQGLELVLVKKNKKKSTLDREESRQENKLNSGGGG 
<_ \3Ct FKIjCHCPPS WATRTKLCLK 

4544 

A 

969 

1160 

GSISLTESWATVSCGPHPQTLEDSHHKPWKGGCHCCRAQGCDSGVPH 
CRGAGAGSASTCKMGI 


A 

147 

4 02 

GRGIQLFHEIQLQLSKTLCEPAVPVQRLRGRTAARWPGSRESPLRPV 
FDVEYCGVLI FI ILFI * WRHFHSLCWSGDDI ILACLL 

y| C A C 

4546 

A 

240 

433 

SWSAPSRPGHVCGSFSFFLFFFFFKKKKGSAEISVASSQRISSIREK 
LCVCVCVCAHVCACARV 

4547 

A 

3 

310 

GGQPPHPPAARPGGRWGCQPPARQPLRPGGSVSTGTGFPPGRWQPET 
SWVSCRHCGPHLSAVWGLPR/LSPLPAGPNPRQPQLSEFPRPKEFLT 
PRTQKTKPN 

4548 

A 

3 

195 

WPPGPLQPRGTEHRPPTPDTTPRAARSTRSHSPSEPRTRTWPQQR*L 
PRPSKPSHGPDVIEAA 

4549 

A 

1 

1439 

MATEIGSPPRFFHMPRFQHQAPRQLFYKRPDFAQQQAMQQLTFDGKR 
MRKAVNRKTIDYNPSVIKYLENRIWQRDQRDMRAIQPDAGYYNDWTP 
EGRRLVTG AS SGEFTLWNGLTFNFETI LQAHDS PVRAMTWSHNDMWM 
LTADHGG YVKYWQSNMNNVKMFQAHKEAI REAS FS PTDNKFATCSDD 
GTVRIWDFLRCHEERILRGHGADVKCVDWHPTKGIiVVSGSKDSQQPI 
KFWDPKTGQSLATLHAHKNTVMEVKLNLNGNWIjLTASRDHLCKLFDI 
RNLKEELQVFRGHKKEATAVAWHPVHEGIiFASGGSDGSLLFWHVGKF 
WTRNRPGDKMRDRYNLNLLPGMSEDGVEYDDIiEPNSIiAVIPGMGIPB 
QLKLAMEQEQMGKDESNE I EMTI PGLDWGMEEVMQKDQKKVPQKKVP 
YAKP I PAQFQQAWMQNKVP I PAPNEVLNDRKED I KLEE KKKTQAE IV 

VO'HPVTTTV 

4550 

A 

2 92 

R "7 ** 

vrrtfUr r Jvi 1 x lCCaVx IHICIYIYIYTYTYIYTHTHI YIYIYIYIY 
xx xxxx ininiHiHihiix XI YIYIYIYIYIAASWAANAISWRQVSC j 

4551 

A 

376 

456 


4552 

A 

1663 


f \iz> x Jvi>fiijKr i JvF^^£.L.lli , PJjSSPAAPNYIFKQTSRILALMLVLDIGH 
SCYQNRDTGWRVSTS PQHR I KL I SGAEM I AKDLSHVNPKATPS PLWN 
QMGNRPTNCFILI 

4553 

A 

1735 

1868 

LDHGGFT I VAKWWE VAYLKMYVCVCCI FSHFVNKAHLLKILNFL 

4554 

A 

138 

279 

EKPQISLQIDFVCRSQFRI^THTHTHRDVFIFPNWILVSEIESWYK 

4555 

A 

72 

192 

ILFADLSSECTHTHTHTOraTHTHTEMYLSFQIGFWFQR 

4556 

A 

571 

690 

CQQGFSFLQAYGPAQHAIS\MRKFKAKYPDYEVTWANDGY 

4557 

A 

130 

94 3 

TGEAEHLGLARADTWLGEEGPRDGAFPGGSLEGKAWCEQPRGPPNCP 
VPRVPPDPAGEEIiLRWLISSPPIADSTFPLSGAGGAARRQGTGLRIiV 
STCLGPRYNWLRRWALGSIPRRDWSSLSQSPPPAVQAAPGWGWEGGD 
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SGRRNMAVADLVLI PDVDIDSDCVFKYVLIRVHSAPRSGAPAAESKE 
IVRGYKWABYHADIYDKVSGDMQKQGCDCECLGGGRISHQSQDKKIH 
VYGYSMVSRSPVPPCRRPQYQLRGPPEPAALTRGPS 

4558 

A 

45 

405 

G I PGRRNMAVADLDLI PDV\ D IDSDGVFKYVL I PS PLGI PAPG IRPA 
ESKE IVRGY KWA\ GHHAD I YDKS VGATCRKQGLRTVS I LGGGR IS/H 
TKSPGQERFTVY\GYSMG1*WSCPRTPIST 

4559 

A 

613 

866 

NSVLTIJUjLFSLHVHLPPPPSGVTDTAGALPTGGPPSPTSVTGRGPA 
HSHASQLPPAPGEFAPLNESCRGWAGEAFLEKVSLLI 

4560 

A 

69 

432 

GGGQRAIERSLSSIPEDCAMGLSDGEWQLVLNWGKVEADIPGHGQE 
VL IRLFKGHPETLE I LSMLFLTWIG S LCPLFMTDRVI VLLF I S FLY 
HHYCYDSYHLFLTYNILYRSCLSMLP 

4561 

A 

144 

294 

ELLIWTSLWWELCAYTCLCVCKCAGSFVCMCTCGIILCLSYFSSFHKG 
FF 

4562 

A 

13 

759 

PGDPRVRGRVAWRSNNSRKSSADTEFSDECTTAERVLMKSPSPALHP 
PQKYKDRGILHPKRGTEDRSYQPSLKSTDSRSYPSRGARPSPRGSGK 
GSKSPSPRPNMPDRYFIMKSSNLRNLEISQQKGIWTTTTSNERKLNR 
AFWES S I VYLVS S VQG S GHLQG FS RM S S E I GREKS QDWGSAGLGG VF 
KVEWIRKESLPFQFAHHLLNPWNDNKKVQISRDGQELEPLVGEQLLQ 
LWERLPLGEKNTTD 

4563 

A 

177 

328 

INAHLKFK\nfFLSQAPDKCTWKTSAARENHV*LTFLMFYGTLGTGVC 
FCL 

4564 

A 

218 

343 

LHVY I KTLTHTHTHTHTHTQFCMS VNN I AKLLCNF YLG I KQ 

4565 

A 

708 

916 

CSRYSVPRGMFPRHTLLSTALGSSRTTSQIITQIFKIVETCPNIGLC 
HREKGLLLRLTGTC KSHLR S PQ 

4566 

A 

829 

1002 

YTQMEEQNHFHLQENSKQSDPFSSKICGLASELLENKLISSSTLEEF 
CWLGAGS I ARK 

4567 

A 

51 

209 

L PGAR PGDCG S AG EGTLDLLS S LAHKAP P I KRG S LCHTHTHTHTHTH 
THTHTQ 

4568 

A 

8 

218 

LPGARPGDCGS AGEGTLDLLS SLAHKAP PI KRGS LCHTHTHTHTHTH 
THTHTM IGGLYDPALASWCAKHA 

4569 

A 

3 

359 

LLKYPINHFWGVSSIYRMILQQDFTSIRFPALEHCYTGGEWLPKDQ 
EEWKRRTGLLLYENYGQSETGLICATYWGMKIKPGFMGKATPPYDVQ 
VIDDKGS ILPPNTEGNIGIRIKPV 

4570 

A 

2 

217 

ECGDVCDTGDRASVDEEGYIWFLGRSHHTINAS/GWKALIVLTPQF 
LSHDKDQLIKELQQHVKSVTAPCIGC 

4571 

A 

1 

846 

MYANVHS S FTLNS QNLKTT I QMAFNRLYVNAEL I AGDPEKTAKVECG 
DFYNTGDRGKMDEEGYICFLGRSDDIINASGYRIGPAEVESALVEHP 
AVAESAWGSPDPIRGESKGWRQNGISAMWLPNIHALKTGILLRDSR 
NYRGSPDASVLVSRSSLSRGNLGSDS IGQLAAS IQRQS PVAVAVRIW 
VDQTQHKAVEATKSGGVKGIRKDKLRVHKVGPGDQHWYGGCEDPKLW 
E PTL F I GDQTKKQVTLLKYP I NH FWGVS S I YRM I LQQD FTR YGRSGL 

4572 

A 

361 

916 

FTGLLLQVIDDKGSILPPNTEGNIGIRIKPVRPVKLFMCYEGDPEKT 
AKVECGDFYNTGDRGKMDEEGYICFLGRSDDIINASGYRIGPAEVES 
ALVEHPAVAESAWGSPDPIRGEWKAFIVLTPQFLSHDKDQLTKEL 
QQHVKSVTAPYKYPRKVEFVSELPKTITGKIERKELRKKETGQM 

4573- 

A 

128 

416 

RQVLLFWGRGKYHNPFS PCL YPFS AFLGEGQVPLNPFS FTLSGKSRF 
SRGQEP/LNPLFPHPDLLSVCPNPLFPHPDPFPAFLEGKNPRTPRTR 
GST 

4574 

A 

127 

135 

NKIVLKCWGGENFWGWWRENGRCFSGLLQAGLGAAWEPRVG/IVRR 
NICCIE*LVMAWIRFCMN*KTK 

4575 

A 

35 

386 

QLKTDTAQSPRSPLDHHGHRASGNSHSGSIHPDGLK*LKNHKRSEKP 
CPTLTDDIPPQKKCKWPVLALSADITL*KSFSWLILAQKAPPLSTL* 
PPLLPAREQMS FDCNFPLPTQ I L 

4576 

A 

148 

336 

IRTPLKIGLFNSPGSHSQSPWNSGPRLSD/WTPSQIFSA*QLKTDAA 
GSPRKPPRPSRTPSFR 
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4577 

A 

4418 

5132 

DTISYQLEWRSLKSQETTDQAPMSSL*NFIiRSNPWLS*RLGA*/WNA 
DEOGVPLIOEMIjESWASICKVPKYPNAPA\n?PPPnTPHT.WPr!T?TKiow 

w -*- \x*-»* -i—i ' rir\Lj _u v.i\¥ r XX X i l»rtX<ri V XVxr XTf X Xr li X-tXi XT VJ H X, IN O Xi 

VAJiTKLVWWSLHTDTlTCIWCRDSDRGTSLGRSIPCPPALCSVRKIHL 

RPQVLRPTSPRNISPISNPHTRT/SQTPEPQRPGVPPEPPSPGACYK 

CQKSDHQAraCLQPRIPPKPCPICAGPHWKLDCSAHLAATLRAPGTI, 
AQGSL 

4578 

A 

522 

923 

PSAAGLLEFAGGPLOTLFAWVSAAEAAEOPVT.T.MQn/MT.T.DnTJCcnc 

FVS\GSIWPCEVSVCPSSRCWRPRRKPLEAHLVQPQPPTLLAPVLVL 
LAV^CYGWSPRPLAilTLFAASRIxAHHGVDRQNNLVD 

4579 

A 

1 

569 

MGKKQITOKTGNSKKQSTSPPPKERSSSPATEKSWMENDFDELREEGF 
RRSl^SELREDIQTKGKEVENFEP^LEECITRITNTEKCLKELMELK 
TKARELCEECRSLRS/LMRSTGRLLIKKTPF^c;n* crt* FNTJ* AQM«? 

LY*NINSNKSIiP*PLKGPILSKPSASNIPCISLQWVQAFQKSESLPA 
PR 

4580 

A 

23 

660 

FIDSMPSPSRYQ*LSSQ1WKKLL*SSYGTKEEPASPSQS*AKRTKLE 
ASH YLTSNYTTRLQ * L FQKHTML S PRENLTQPACTQVK * TAMLLTQ/ 
RPVWWSLHTDAHEIWCCDSDRGTSLGRSIPRPPALCSVRKIHLRPQV 
LRPTS PMN I S P I LNOLS S LRLTLPDRLRS PLDHHRRT? A ^APT.HPfTF T 
NSHVAHTKPVWCSLHTDTHENDKNKQ 

4581 

A 

1 

764 

M I QEHADKQVQRLQG VLLGS I PTAAS RARLHPGE INNHVAHTEPVWW 
SLHMDAYEIWCRGSDRRTSLGRS IPLPPALCSLRRTHT.RPnVT.P QTQ 
PRNISPI SNPGHLSDYTPTFQGCQTMQGRIiPWS FTLSGKSRFSGEGA 
TQRQYPIPQQALKGLMPAITRIi/STAWPFKAYKLSLPFPHFTCPKTR 
QALQVS SGS AP YQPNCFAYP PHGAKP I YS S I LNTCL YNPL FCSR$ QT 
SFL YYS FAPF I PASLR IHLD 

4582 

A 

1 

400 

MWNAVTLWQQRESCIEEESEIGTLETKETHFIRGPKTLAPVTDWEG/ 
SPSU3V*SMQGRLSDYTPTFQGFQTTQGRLPWSFTLSSKSCFS/AGR 
GKLLLARPS * VP ILPOPLLLHPTHiLS PPLT.TPnnAY qptp 

4583 

B 

1 

933 

MEEKEERKPvRRKjRKRRFJEEGRDGERKRM^ 

TDQPKKEITjTNFKSETKETRFIRGPKTPAPVTDWEGSLPIjVFITOSR 
SLIIHPGFRGVRPRRDACLGPSPLAASPTFLGKGPAAPRQTELGPNS 
SSASAPPPYNPFIASPPHTWSGLQFPSMTSPPPPAQQFTLKKVAGAK 
GI VKDL Il^TFKVYNlTOKlKXOFl^STVROTPATc; P ahktsipotppt. nn 

PGVPPEPPPRGACYKFQKSGHRAKECLQPRIPPKPHPICVGPHWKSD 
CPTHXAATPRAPGTLAQGSLTPSQI FLA* 

4584 

A 

1 

582 

MATAVHLPASQPGTHQKI ID^lAJNlKAIJDAGQQPAXiWALASTDFTSLKK 
LRPQSVTSRNSRAVTSSSERKGGTPDYSPTFQRCQTTRSLYTCRKSG 
HQAKECPQPGFLLSRVPSVRDPx^lTOLFNSPGQH^ 

DSFPDLLGLSA^TDTARSPRKPRRPSRTRASVSQFIQNHIQAITIIIi 
YDKCFF 

4585 

A 

1 

622 

riAKSFSGSGKSVCKSPKGlKMmre^ 

LFGFNIIEGDFCEDSNFQTGESVSDVYFENVSALQRESTGSGIAQER 
1TOEGLESGRAVLCIEECLSA5LA5TSQSSIEQPLPTSS\PTKTKQNM 
TPDI AEYPQWAI WTSLKTTTI VLRPS * VLILPQPPLLHP 1 1 FLSPPL 
LTPAPAYSFIPPSQTPSFG ! 

4586 

B 

102 

1151 

MEEHFMLMGS KNO YRENGHTAOEKLL. Y APHN T Q T .KO qcot.ot.c otjp "D 
lWENxTOGSKDDDDGGDDDDDCQASEAJCPNSACLHPGEINSHVAHTKP 
VWWSLHTDTHE1TONGDVGDD1WDDDVDDDGGDDGKDDGDDDDGDTVM 
VA7MM IMMMVLMVVM I VVWVAAATVVASRKAFLFLINGEESELESHLT 
GFAN I PNHHLTVQH I YTPTLP SGETKETR F I CGPKTP AL VTD WEGS L 
PLVFNPCRjDASLIIQPRFKAASPAILGEGQVPLNPFSFTLSGKSRFS 
GEGAJIAYYKCRx^DHQAJCECPQPRIPPKLCPI^GPHWxn^DCSTHj^ 

aaprtpgtlaqgsltdsfla* 

4587 

A 

258 

479 

taaarrssrtsstoslliwpeiwplgqgmpaardss*aashlcrtpp 

kiglfnlpgshsqspwksgprlsd*llprssrlss*rax,gqisrgsg 
sgcqvs 
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4588 

C 

259 

531 

MS PAHKNFQTPEPQRPG I PPEPPPPGAC YKCQKSGHQAKECLQPRI P 
PKPRPICAGPHWKLDCPTHLASTPRAPGTLAQGSLTASQIFLA* 

4589 

A 

1 

446 

MNLNRNALiVLiTLFSD S ARLHPGE I NS HVAHTKP VWWS LHMDTHE I WC 
RDSDRGTSLGRSIPCTPVIi/VLSRFKKIKACYHSPATSWPFKTYKLS 
LQLPHFTCPKTRQALQVSSGAVPYQPNCFAYPPCGAQPVPSFVLNTF 
LHNSLCRA 

4590 

C 

228 

260 

MFLGLVGLRT* 

4591 

A 

268 

390 

KQHS S FKN I QS RPFS AM S KLRP WR F WRKENCKASNYLLKKD 

4592 

A 

1 

499 

MAPTNCKGSRKCGRPHEQMPHPYLPLLTLFSDSARLHPGEINSRVPH 
TKPVWWSLHTDAYEIWYHDSDWGDLPWE\TIPCPPALCSMRKIHLRP 
QVLRPTS PRN ISPI SNPEK*NGLKVF * KHTS PS S ATNLKRPGQYF YH 
FPFSEFRPVLGMIjQVTSAFQKFTTjL 

4593 

A 

1 

355 

MEVYTQKAYWRVPLGSTVGDQGGHGRFIHPFAIKIEQGSTNRCPSGS 

PATKYVLiLiCLHGKKVDTiTiPAnnoc; TPQUnPCTAOUT TCTTrm mktdtj 
■*■ «*■ v ««v«n\joi\ v uuu it fu/yyo X r & ruKo X s^vjj nljlo x. x \j v JLiPlIN Kn 

STYMDKEKVKMVTALWTDEELSKLTWTRMKFDAMTQLGDLPWEINPIj 
s s cs llr e kd p p tts g p otd op keitltn f ttww a p t ,t t pt ,\c ptjw <5 
YPAECQYPVPQHALKGLKS I ITRLLQHGLLKPINCPYNSPILPVLKP 
DKAYRLLQDLRLINQIVLS IHPVLPKPNTLLSS I PPSTTHYS VLDLK 
HAFFTIPLHLSSQPLFAFTOTDPDTHQAQQITWAVLPQGFTDSPHYF 
NQAQ I S S S S VT YLG 1 1 LMKTHVLS LI> I VSG * ES I WFWQHGVDRQNNL 
VDKAQILK 

4594 

A 

260 

487 

AQACMYTSRWPEATEEPQKK*KWLVPALTDDITL*HSFSGTVSLRSS 
PLSTL*PPPLPAREQPPLTVIFHHLPKS 

4595 

A 

1 

508 

MNMN I KXI VKQATVxjTFTTAxjLAGGATQAFAKEITOQKA YKETYGVSH 
ITRHDMLQIPKQQQNEKYQVPQFDQSTIKNIESAKGLDVWDSWPLQN 
ADGx^AEYNGYHWFALAGSPKDADDTSIYMFYQKTA/PSPGQA/TV 
PIliPQPPLLHPIILLSPPLLTPSPAYSFVP 

4596 

a ; 

1 

1246 

MGKGRRSLWLLPADGGSSGNGNELQGTSPHDYFSIAIKSTHIGLEYP 
x on^own v o r^DivKiKyr -uVjlji J t>vjt> x 1 h.KtjPPNivRQKGYTRQQFCHE 
STPNKGDRVIYl^TCLLYCCVRFPIxAGTGPHILYLSRLASNLELFKR 
DKGRGEQRKEEVTCFHKLLAEKENNPPFSFIYPNPIKRPHPYLPSLT 
LFSDSAHLHPGEINSFMAHTKPVWWSLHTDAHEIWCRDSDRGTSLGR 
SILRP PALC SMRKIHLRPQVLRL I S PRNI S P I SNPHALKGLKPVITR 
LLOHGLIjKPINSPYNSPILPVTjKPnKPYKT,vnnT.nT.TMOT\n" dthdm 

MPNPYTLLSSIPP/CHNPLFCSGSQTCFLYYSFVPVIPASLRFHLD* 
P * HS S / WFS KL PGL YCRKAS QTAP I TS VKPKFHPHLLP I S A 

4597 

A 

182 

750 

KWP YCPS S AGQQTQLCS S S TNSNNCLP YRGTSLGRS I PC PPALCSMR 
K/MPPTTSGPOx^OPKKHLTISNPVAEAKOT\ncvtaappciT.c:n\ ppjtt 
S *RLGSFSSNHPYLGQRQYPI PQQAiKGLKPVITCLLQHVLLKPVNS 

PYNSPILPVQKLDMPYRLVQDLCLINQIVLPIHAMVPNPETFIPLTV 
LNP 

4598 

A 

3 

504 

KRPHPYLPLLTLFSDSAHLHPGEINNHVAHTRPVWWSLHTO 
RDSDQGTFPWE INPPS S CS LLREKDPPTTSGPQTDQTKKHLTNFKSG 
KKPLFT\PSPTSLTIPQPLSPFNLGATLQSLPSLNFNSFHFLVFTKE 
TCFIHEPKTPAPVTDWEGSIiPLVFNHC 

4599 

A 

1 

1032 

1 Jr * * r W r Mi v,lr X XxMJlXjOXXXy VVLjJbyj^iuJujHWW 

DVLAXRACDG KGGTDS APLHPGE I KS F I AHTKPVWWSLHTDAHE I WC 
RDSDRGDLPWEINPLSSCSLLHEK15PPTTSGPQTOQPKELLTNFKSV 
PTPSVAESFQSSFSTDPSDPSSPPQAAPRQAKPGPNSSSASAPRPYN 
PF I TS PPHTWSGLQFHS ATS PPPPAQQFPLKKVAQVKGI VKHALKRL 
KPVITRLLQHGLLKPINSPYNSPILPVLK^GKPYKLVQDIJUJINQIV 
LPIHPVVPNPYTLLSSIPASTTOYSVLDLKHAFFTIPLHPSSQPFFA 
FTWTD PDTHQAQQ I T 

4600 

A 

72 

724 

FQIAGKWHFQFFHPTRSK*FLS*N*QMV*GA*CPGILLHISPFLVSV 
PSATRPKSSFFPSHLSPQYQPQVSLSLSNLPFLQTHVTSAFLPRLLL 
ARPS*VPS/RSSASA/LSTLQSLYHLPSSHLVRLTVSFRDYPSPTCP 
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AI YS * KGGWSQRHSQG * CS F/ SLYPKSDS V* ALFHQI * KSSPVHGS F 
GSNPKTLYSPSP*KVKRPSYSQYTFYYPICSRH 

4601 

A 

2009 

3197 

NL WS I RHMEETNS S S KDMYQS PGEAEGWRWKKGSDRGQNLFRKHS K 
CL ILLI RERFES PHS YI CS PVTFQFF YAQS VKHVNVTIDCLPEGAAT 
RGTARTSKPTTKSQKTLPSTSPGHWTQSTPWASALRSSPWTBTAAPS 
ETEETLNTGRPPELPARATATWFSASHTLPALATRRVARTQWLT7UDR 
QT WAS I S S VP WAQT I S EKKPGG S L WETR S S P PTTAGTEE AMNTTS LL 
AP AAE I MAT PG S P S Q AS PTSG AFTHGTQT P S PTKATAPRYPQTGDL S 
AEWPFTAGEEPVLVPRPHQVSRCPQPLFKVGAMAAAPLTL»AIQRI*NP 
CLMELCQFFQQCLCMSQRSPRTEDMRYCLEYYSWFLKNATYICQRVK 
RVSHSHTLKQKCLENICKSV 

4602 

A 

3948 

5007 

VLQTFAVSVTAHKGGTPRVVCSSQWVRGLADFRNEAADPHDQGMDEN 
PTDFLERLRG ALVKHMS LS PDLVEGQL ILKDKFI TQAAPD I ERKLQK 
QNGPSLRDIFFSPQNGYNWRMASFYLPVANKWKILKSLYQIbHLGPD 
WEGPYTVFLSTPMAVEVTGIDSWIHYTQVKAWKADGATSVNPEEHPK 
YQGEINSHVAHRKPVWWSLHTDVNEIWCRDSDRGTSLGKS/HPRSSC 
VLCSVRKIHLRPQVLRPTSPRN ISP ILNQELATRA/ VKSGH* AKECli 
QPGIP±KPHIjICVGPH*KSDCS/TSPGSHSQSPWNSGPR 
QX v o A w KJbJv I JL) 1 AKb PQKFPRJr' b RMPS r R 

4603 

A 

567 

678 

RNKTCWAWQYTPWPAAWEAKAGGSLQPRSLSPAMG 

4604 

A 

1 

305 

SDIETLDIPEPPPNSGYECQLRLRLSTGKDLKLWRSTDTVFHMKRR 
LHAAEGVEPGSQRWFFSGRPLTDKMKFEELKIPKDYWQVIVSQPVQ 
NPTPVEN ! 

4605 

A 

3474 

3657 

QICTCVCIYICICLYIYICICLRKWFFKMYVYSTHMDTLQSSDFPCA 
TSCVSLLMRICNG 

4606 

A 

796 

937 

HRSLLRLPASVSPTAARAPLCCSSGPHSPAFGSWTFGIIiPSTLVIiSE 

4607 

A 

1 

2722 

MAWRRIiSRRRVAGNDRTWPAPQRHWRYTTLFEMKRSKELPVRWCDFS 
AFTPMMRTHEGNRPGDNWQFDGDAETIAHFARMTTVFTTLKPYLKEA 
VALN AKSGLPQIiVD IREEMPEQTAKRMjL S LLG I LKYKPS GNATDMGK 
EHRTLPANFAFNEFVARHLRHRRTVNQLQPGENHIiGGGVAVASSFGT 
SI IATPIAVTPPINLPTIIGKIMAFMLAAAKPKWSPRAKAAETAKAQ 
LMTLPRNSVILFASRLSSHAPAGLQGHINFTISTKILDSNEPCGGQK 
GFVPQKGRSDTLIKPVLVKDPMVSQGLRCLKNSATSTTOTiYTQALLA 
YIFSLAGEMDIRNILLKQbDQQAIISGFILVTVIKSSNADDTSIYMF 
YQKVGDNSIDSWKNAGRVFKDSDKFDANDPILICDQTQEWSGCDPQIiG 
CDFWSEARAFASDHRQAEWSSKDDFMRLGKPILFFGVTAGNMDSMI 
NRYTADRRLRHDDAYTPDNVAGKRPDRATLVYTQRCKEAWKDVPVIL 
GG / VGMMCS / LS EQGKQLA/ 1 QVSN I / LGMDVCG / IDLLMKD/DG/ S 
FCV/CEANANVG\YPAIiGSTMGCSNSG^n^ITPIKWLSMISVQCI ^ ANG 
TFIiSPSLTKCPKGIRASVPLSGGPRSLKARTFSAFSGQPSSOPSSQP 
TNYPSTICGHNSGIQPKFQLNG 

4608 

A 

17 

1507 

VCGSVSVLEWGEGEADGTRGCEKLGECARTPGQPAGAAVGGGERR.RL 
PVSRPPPPPLLPPLPAARRPALSRTPQPGSVGRERGRSPPTQVSGRR 
YI QEE I IQARKHKLI KMCS S VAAKLW FLTDRRIREDYPQKE I LRALK 
m AKCCEEELDFRAWMDEWLTIEQGNLGLR INGELI TAYPQVWVRV 
PTPWVQSDSDITVLRHLEKWGCRLMNRPQAILNCVNKFWTFQELAGH 
GVPLPDTFSYGGHENFAKMIDEAEVLEFP^WVKOTllGHRGKAVFIlAR 
DKHHIJUDIjSHLIRHEAPYIjFQKYVKESHGRDVRVIWGGRVVGTMLR 

MKDDGSFCVCEANANVGFIAFDKACNLDVAGI I adyaasllpsgrlt 
RRMSLLSWSTASETSEPELGPPASTAVDNMSASSSSVDSDPESTER 
ELLTKLPGGLFNMNQiLANE I KLLVD 

4609 

A 

664 

1059 

EGTLGSGIPCI.SRQCFVSFPIiFFLLTGIITIEKEVPLSMYVSASRMS 
SEREMTQCLTQAPGSLATSFFYPVFMTEKGKPCSDNRHFRQPCWLST 
PAWPPPPSTLPLCPPRPPTSTKQGGVGDICGIPTFNLW 

4610 

A 

775 

944 

IFTWINELFAQILLKLYLVPIKLMDTVYMSIYLSLELCFVISPFIiLT 
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GIYLNLYIY 

4611 

A 

279 

416 

ALSKTIDVAI^ERRHTHAHTHTHTHTHTHTHTHSFFLPCDEKGL 

4612 

A 

36 

4704 

TGAQSLLTCQVFILSCFPFFIiNBVKMPIRpDLQQLEKCIDDALRKND 
FKPLKTLLQIDICEDVKIKCSKQFFHKVDNLICRELNKEDIHNVSAI 
LVS VGRCGKN I S VLGQAGLLTM I KQGLI QKMVAWFEKS KDI I QS QGN 
S KDEAVLNMI EDLVDLLLVIHD VS DEGKKQWES FVPR ICSLVIDSR 
VNICIQQEIIKKMNAMLDKMPQDARKILSNQEMLILMSSMGERILDA 
GDYDLQVAIVEALCRMTTEKQRQELAHQWFSMDFIAKAFKRIKDSEF 
ETDCRIFLNLVNGMLGDKRRVFTFPCLSAFLDKYELQIPSDEKLEEF 
WIDFNLGSQTLSFYIAGDNDDHQWEAVTVPEEKVQIYSIEVRESKKL 
LTIILKNTVKISKREGKELLLYFDASLEITNVTQKIFGATKHRESIR 
KQG I S VAKTS LH I LFDASGSQ I LVPE S Q I S P VGEEL VS I/KEKS KS PK 
EFAKPSKYIKNSDKGNRNNSQLEKTTPSKRKMSEASMIVSGADRYTM 
RSPVLFSNTSIPPRRRRIKPPLQMTSSAEKPSVSQTSENRVBNAASL 
KS RS S EGRHRRDN IDKH I KTAKCVENTENKNVEFPNQN FS ELQDVI P 
DSQTU^KRDHTILPGVLDNICGNKIHSKWACWTPVTNIELCNNQRAS 
TSSGDTLNQDIVINKia^TKQKSSSSISDHNSEGTGKVKYKKEQTDHI 
KIDKAEVEVCRKHNQQQNHPKYSGQKNTENAKQSDWPVESETTFKSV 
LLNKTIEESLIYRKKYILSKDVNTATCDKNPSASKWQSHRKAEKEL 
TSEIiNSWDSKQKKMREKSKGKEFTNVAESLISQINKRYKTKDDIKST 
RKLKESLINSGFSNKPWQLSKEKVQKKSYRKLKTTFVWTSECPVN 
D VYNFNLNGADDP 1 1 KLG I QEFQATAKEACADRS I RLVGPRNHDELK 
S S VKTKDKKI ITNHQKKNLFSDTETEYRCDDS KTD I SWLREPKSKPQ 
LIDYSRNKNVIOraKSGKSRSSLEKGQPSSKMTPSKNITKlCMDKTIPE 
GRIRLPRKATKTKKNYKDLSNSESECEQEFSHSFKENIPVKEENIHS 
RMKTVKLPKKQQKVFCAETEKELSKQWKNSSLLKDAIRDNCLDLSPR 
SLSGSPSSIEVTRCIEKITEKDFTQDYDCITKSISPYPKTSSLESLN 
SNSGVGGTIKSPKNNEKNFLCASESCSPTPRPLFLARHTPTKSNTIV 
NRKKISSLVLTQETQNSNSYSDVSRYSSEERFMEIESPHINENYIQS 
KREESHLASSLSKSSEGREKTWFDMPCDATHVSGPTQfTLSRKRIYIB 
DNLSNSNEVEMEEKGERRANLLPKKLCKIEDADHHIHKMSESVSSLS 
Tl^FSIPWETWQNEFAGIEMTYETYERIiNSEFKRRimiRHKMI,SYFT 
TQS WKTAQQHLRTMNHQSQDSR I KKLDKFQF I 1 1 EELENFEKDSQSL 
KDLEKEFVDFWEKIFQKFSAYQKSEQQRIiHLLKTSLAKSVFCNTDSE 
ETVFTSEMCLMKEDMKVLQDRLLKDMLEEELLNVRRELMSVFMSHER 
NANV 

4613 

A 

124 

672 

WS CSNKKYPLPPGNLMQSIiLPEEAEPCHRE I S S FYVS EPQQGDQDRV 
SSEEKQHEQLWQQRHTSAAAPPSAAAPHVI,CNPKSVI*KRPVTSEELtIj 
TPG AP YARKTFT I VGDAVGWGFWRGS KPCHI QAVDPSGPAAAAGMK 
VCQFWSVNGLNVIjHVDYRTVNNLILTGPRTIVMEVMEEIjEC 

4614 

A 

688 

940 

NSLGEIGIKPFLLKVPFCFGLPPPKLLLLFVAFSSRLWTuliPSCFGIjT 
PHFPGFPSHFLTRISSCICLPLFLPSFLSCLFFFEYN 

4615 

A 

561 

663 

WESHI S PPFNG I Y IHTHTHTHTHFIiLKVYLQI C 

4616 

A 

217 

378 

RETDKTLAS LKKRVKTQINKI INEIRGVTTDTTEVQRI IRDTLNNPV 
GRSTGRE 

4617 

A 

552 

973 

NWAGAQRLTMTHGSCLPQLSDKVAPAKHWRPKPRSITPIVTPPISSA 
SAVKGKREKGSEALVPPHLVGRYASSRSHPSQGGEGRPQAGCGHSWL 
TLSSATDRLSHSWAGIJ^PRCPPAATWPPCSTFSPFASSGCRQPSSF 

a a i d 
tOXo 

n 

M. 

1 OP 

A *7 

A 7VTYV7T TfZOCfYWT OHDrWDDPCV T>Qr , 2i'D"k r \7r , r»'DnT t>T r>r>"DT IVVP ZkT.TJV 

LT 

4619 

A 

201 

426 

FS PWS WVCC YGHWTS SAQVGSRPLLPS AS VS PVSTWPLCCLLQGGLL 
AYTI FHTQRLCGPFP ILLP I PSLHSST 

4620 

A 

3 

299 

HASVLPPRSCRVFWIHSGTTMSKVS FKITLTADPRLPYKVLSVAEST 
PFTAVLNACMQNTVQSLLGRCNHYQPGNTDLALRILESIjPLSFWPTE 
NEVI 

4621 

A 

305 

440 

LKINERGRAQWLRPILPAPWEAESGGSFGPRSSRTAWGISGEASS 
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4622 

A 

936 

1380 

VI^INSSI^SVQGTSVLSPFLHIGSVITLAAMIYKKSAVQLPEKHP 
CLYILTFGFVSAKITNKLWAHMTKSEMHIiHDTAFIGPALLFLDQYF 
NSFIDEYIVLWIALDFSFFDLIRYCVSVCNQIASHLHIHVFRIKVST 
AHSNHH 

4623 

A 

9 

1230 

DWEGDPPEGPATSPLTNSGHPGSGGSSSSDAAATAQSREAAPPAGRP 
ERAAPG PL PGH PAFMQRLVAWDPACLPLPP P P PAFKSMEVANFY YEA 
DCLAAAYGGKAAPAAPPAARPGPRPPAGELGSIGDHERAIDFSPYLE 
PLGAPQAPAPATATDTFEAAPPAPAPAPASSGQHHDFLSDLFSDDYG 
GKNCKKPAEYGYVSLGRLGAAKGALHPGCFAPLHPPPPPPPPPAELK 
AEPGFEPADCKRKEEAGAPGGGAGMAAGFPYALRAYLGYQAVPSGSS 
GSLSTSSSSSPPGTPSPADAKAPPTACYAGAAPAPSQVKSKAKKTVD 
KH SDEYKI RRERNNI AVRKSRDKAKMRNLETQHKVLELTAENERLQK 
KVEQLSRELSTLRNLFKQLPEPLFASSGHC 

4624 

A 

1 

932 

RTRGRETKPHTKERGMMAKKPPKPGPRRIFQERIjKITAIiPLYFEGFL 
LI KRSGYREYEHYWTELRGTTLFFYTDKKS II YVDKLDI VDLTCLTE 
QNSTEKNCAKFTLVLPKEEVQLKTENTESGEEWRGFILTVTELSVPQ 
NVSLL PGQVI KLHEVLEREKKRR I ETEQSTSVEKEKEPTEDYVDVLN 
PMPACFYTVSRKEATEMLQKNPSLGNMILRPGSDSRNYSITIRQEID 
IPRIKHYKVMSVGQNYTIELEKPVTLPNLFSVIDYFVKETRGNIiRPF 
ICSTDENTGQEPSMEGRSEKLKKNPHIA 

4625 

A 

942 

1293 

MGNFFFFEPGTCYVAQAGLELLNSSDPLTSASQIAETTGTHHCTWLK 
TIFLKNKSTALHLYLLVSLQFKHTINDYNILFKAGRSGSWLQLEQFI 
TSGYLRARKIQKKKKLGVCNTV 

4626 

A 

40 

154 

RCWLCPHALFLWLASHLCHFYTFECVQAEC * VMPPHPL 

4627 

A 

4 

436 

EGGGASLNIQPVHTQRCRSDKDDSTNRTSHDWPARGRGHEDTASSSY 
WVVMTQTCRDTPVGSRHWMGLRAEAGALAGMALEATCLALQGSLLEA 
SMKRS FPWATHCLAPQRRTTQPALPR I FLVLALNVHSLRNPS VSGKW 
DVW 

4628 

A 

465 

748 

VLFLVPIYIHKLVYIPYLQKHASNHIQKEQKVMGNFLFYNKALLSFW 
TYFTSLERLSAHTSTIIFFLLCQLFSAHVLHQKWHLDDRSSKSHICL 

4629 

A 

92 

394 

WPRSGPLRLLRTVCCCVLPLLSHSKLHKI SVS APGP * * QI PPPFQNR 
SFSCDTERG *GGETPLVPGKFSS PHQTLL *GTS SGRTGSAGQLPLLY 
LSYKTR 

4630 

A 

1 

1614 

MSLHGKRKE I YKYEAPWTVYAMNWS KGKVKEAAQRYQYALKKFPREG 
FGEDLKTFRELKVSLLLNLSRCRRKMNNNSSSCVHILSSRQFAAALE 
DLNEAIKLCPNNREIQRLLLRVEEECRQMQQPQQPPPPPQPQQQLPB 
EAEPEPQHEDIYSVQDIFEEDPPPVGGQGKEYPSPPPSPLRRGPQYR 
ASPPAESMSVYRSQSGSPVRYQQETSRPYQMPQLPVAVPQQGLRLQP 
AKAQ I VR SNQ PS PAVHS S TVI PTGAYGQVAHSMAS KYQS S QGD I GVS 
QSRLVYQGSIGGIVGDGRPVQHVQASLSAGAICQHGGLTKEDLPQRP 
SSAYRGGVRYSQTPQIGRSQSASYYPVCHSKLDLERSSSQLGSPDVS 
HLIRRPISVNPNEIKPHPPTPRPLLHSQSVGLRFSPSSNSISSTSNL 
TPTFRPSSSIQQMEIPLKPAYESNIAFTNKTNNAQNGHIiLEDDYYSP 
HGMLANG SRGDLLBRVSQAS S YPDVKVARTLPVAQAYQDNLYRQLSR 
DSR0GQTSPIKPKRPFVESNV 

4631 

A 

1 

410 

KGKVKEAAQRYQYALKKFPREGFGEDLKTFRELKVSLLIiNLSRCRRK 
MIWNSSSC^HILSSRQFAAALEDLNEAIKLCPNNREIQRLLLRVEEE 
CRQMQQPQQPPPPPQPQQQLPEEAEPEP*RT*TRPSSCVPTTVRSRD 
FC*EWKKSVD/TMQQPQQPPPPPQPQQQLPEEAEPEPQHEDIYSVQD 
IFEE 

4632 

A 

1 

972 

RPYQMPQLPVAVPQQGLRLQPAKAQIVRSNQPS PAVHS S TVI PTGAY 
GQVAHSMASKYQSSQGDIGVSQSRLVYQGSIGGIVGDGRPVQHVQAS 
LSAGAICQHGGLTKEDLPQRPSSAYRGGVRYSQTPQIGRSQSASYYP 
VCHS KLDLERS S SQhGS PDVSHL IRRP I SVNPNE I KPHP PTPRPLLH 
SQSVGLRFS PSSNS I S STSNLTPTFRPS S S IQQME I PLKPAYESNI A 
FYNKTNNAQNGHLLEDDYYSPHGMI*ANGSRGDLLERVSQASSYPDVK 
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VARTLPVAQA YQDNL YRQLS RDS RQGQTS P I KPKRPFVBSNV 

4633 

A 

196 

390 

HGALERTRTTYPQSLFSALS PNSCSRKWQTGGVPFALGTRGGSGPAK 
PPRSSATIQHSAPGPLCR 

4634 

A 

2 

4417 

DEDFIILirKSLNEAEFLKPDYGDTIVLFLSKMMGKFPSWLKIIVTVR 
TSLQEITKLLPFHRIFLDRLEENEAIDQDLQAYILHRIHSSSEIQNN 
ISLNGK^NTTFGKLSSHLKTLSQGSYLYLKLTFDLIEKGYLVLKSS 
SYKVVPVSLSEVYLIXJCNMKFPTQSSFDRVMPLLNVAVASLHPLTDE 
HI FQAINAGS I EGTLEWEDFQQRMENLSMFL I KRRDMTRMFVHPS FR 
EWL I WREEGBKTKFLCDPRSGHTLLAFW FSRQEGKLNRQQTI ELGHH 
ILKAHI FKGLS KKVGVSSS I LQGLWI S YSTEGLSMALASLRNLYTPN 
IKVSRI/LIIjGGANINYRTEVIiNNAPIIXZVQSHIX3YTEMVAI*LLEFGA 
NVDASSESGLTPLGYAAAAGYLSIVVLLCKKPAKVDHIiDKNGQCALV 
HAALRGHLE WKFL I QCDWTMAGQQQGVFKKSHA I QQAL I AAASMG Y 
TEIVSYLLDLPEKDBEEVERAQINSFDSLWGETALTAAAGRGKLEVC 
RLLLEQGAAVAQPNRRGAVPLFSTVRQGHWQIVDLLLTHGADVNMAD 
KQGRTPLMMAASEGHLGTVDFLLAQGAS IALMDKEGLTALSWACLKG 
HLSWRS LVDNGAATDHADKNGRTPLDLAAF YGDAEWQFIiVDHGAM 
IEHVDYSGMRPIJ3RAVGCRNTSVVVTLLKKGAKIGPATWAMATSKPD 
IMI ILLSKLMEEGDMFYKKGKVKEAAQRYQYALKKFPREGFGEDLKT 
FRELKVSLLLNLSRCRRKhlNGESQNSPWTFIVSLDFGMAEEFATKAI, 
ELKPKS YEAYYARARAKRS SRQFAAALEDLNEAI KLCPNNRE IQRLL 
LRVEEECRQMQQPQQPPPPPQPQQqlpeeaePEPQHEDIYSVQDIFE 
EE YLEQD VENVS IGLQTEARPS QGLP VI QS P PS S PPHRDS AY I S SS P 
LGSHQVFDFRS SSS VGSPTRQTYQSTS PALS PTHQNSHYRPS PPHTS 
PAHQGGSYRFSPPPVGGQGKEYPSPPPSPLRRGPQYRASPPAESMSV 
YRS QSGS PVR YQQETSVSQLPGRPKSPLSKMAQRPYQMPQIiPVAVPQ 
QGLRLQPAKAQ IVRSNQ PS PAVHS STVI PTGAYGQVAHSMASKYQSS 
QGD IGVSQS RLVYQGS I GG I VGDGRPVQHVQASLS AGAI CQHGGLTK 
EDLPQRPSSAYRGGVRYSQTPQIGRSQSASYYPVCHSKLDLERSSSQ 
LGSPDVSHLIRRPISVNPNEIKPHPPTPRPLLHSQSVGLRFSPSSNS 
I S STSNLTPTFR PS S S I QQMEI PLKPAYERSCDELS PVS PTQGG YPS 
EPTRSRTTPFMGIIDKTARTQQYPHLHQQNRTWAVSSVDTVLSPTSP 
GNLPQPESFSPPSSISNIAFYNKTNNAQNGHLLEDDYYSPHGMLANG 
& K^UiauttKVSQAS S YPDVKVARTIiPVAQA YQDNL YRQLSRDSRQGQT 
S PI KPKRPFVBSNV 

4635 

A 

158 

358 

utL^ju^^KJLK TAQQCPPS PPPHPSHPLVFS PPEGPPRVNGFPCPRC 
GGQRKGEADPGAGRGWPGS 

4636 

A 

4 

152 

r UVJV^UiiU/u^luaifc,FVCVIVlraQQ^ 

SS 

4637 

A | 

382 

458 

SSS FKTSGSPCPGPEEEEDEEDEED 

4638 

A 

46 

314 

SSSFKTSGSPCPGPGEPFPWGYPAPRLRRCFPVALGCWSSSLLSEEB 
EDEEDEEDEEEEEEEEEDLFLSYCSWEKSPAFPSCLWKSVSP 

4639 

A 

46 

271 

PRRVPPPPRS I SHRVEAHPSKRQAPHAPGPBEEEDEEDEEDEEEEEE 
EEEDLFLSYCSWEKSPAFPSCLWKSVSP 

4640 

A 

1 

1815 

MTLVA YERYFNAEDDYTRESALQEH I AS VRNH I RLLGRKD YQQESAA 
YLDRVSAHIRDPRYPWELRIiMISAPGSGNDTVILKLTWAFTVSVGTA 
LTWNWNAQCANFQLSGY I PRHGPCNNHFPGEPEDWA I FTS PQKGHTG 
REKRG IKIMRG I S KVAQRCPVFKSDAARTRQRLS ARKGRTLPPGRGK 
FPHSTTESFRNTFWLKKIMEHCFNMVDQQTTTAQTNANFLQIRFTTM 
SKKIAVI GECM I ELS EKGAD VKRGFGGDTT.NT c; w T a v> n\mo» n t t\t 
HYVTALGTD S FS QQMLDAWHGENVDTS LTQRMENRLPGL YY I ETD S T 
GERTFYYWRNEAAAKFWLEMSITLRRDTRSGGSSSEGIIQFFFEETQ 
DENG FN I Y I LDESNR VEVYHHCEGS KEELVRD VS RF YS S S HDRFT YG 

SSFINFNLPQFYQIVKVDGREQQIPEGFAAAITNPLIAFQVKMVAAL 
FGCQP YLVQRLLAVDNDFAAI LKGNGQ YAAVDFAVD IAVA I PWQAF 
FNGQPBLISQAMKFPWHCCILFLSDCGSLAAWGPGSKLCLWGGFAG 
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ISGRIARFQERIKWVYELFPRLHERRIQRAGTMSGGEQQRN 

4641 

A 

1 

1284 

MDKKQVTDLRSELLDSRFGAKSISTIAESKRFPLHEMRDDVAFQIIN' 
DELYLDGNT^QNLATFCQTWDDEimiKLMDLSINKNWIDKEEYPQSA 
AIDLRCVNMVADLVmAPAPKNGQAVGTNTIGSSEACMLGGMAMKWRW 
RKRMEAAGKPTDKPNLVCG PVQ ICWHKFARYWDVELRE I PMRPVRLQ 
QSVPHVLWEFQTMQENHFLQRYGFFLSAVIRKTQYTLGHQLQQHHO 
GMSQLSCNPLSSPRGITRWCLGERLSRSSTRVEGILRGRISPDGFQE 
RIKWVYELFPRLHERRIQRAGTMSGGEQQMLAIGRALMSNPRJjLLLD 
iBPSLGLAP 1 1 IQQI FDTIEQLREQGMTI FLVEQNANQALKLADRGYV 
LENGHWLSDTGDALLANEAAYWVSGNMLNLHDLVGLIRRLRRIRHE 
TMSNL 

4642 

A 

1 

1650 

MFILWYSASSTFGKDSDIVMGVRAQQKEKTRRSLVEAAFSQLSAERS 
FASLSLREVAREAGIAPTSFYRHFRDVDELDPNFGEKGNMAPIiDCFG 
RTRAVAS R VKLLG I EE V I ELN P PLFDLFLKHD AMHDPMVNE S Y CETF 
GWVSKENLARMKELTYKANDVLKKLFDDAGLILVDFKLEFGLYKGEV 
VLGDEFSPDGSRLWDKETLEKMDKDRFRQSLGGLIEAYEAVARRLGV 
RHKMEKVML S FDKVS AHYG K I Q ALH E VS LH INQG E I VTL I GANGAGK 
TTLLGTLCGDPRATSGRIVFDDKDITDWQTAKIMREAVAIVPEGRRV 
FS RMTVEENLAMGGFFAERDQFQER I KWVYEL FPRI*HERR I QRAGTM 
SGGEQQMLAI GRALMSNPRLLLLDEPSLGLAP 1 1 1 QQI FDTIEQLRE 
QGMT I FLVEQNANQALKI^RG YVLENGHVVL SDTGDALLANEAS ER 
KLWLEMLSPPKDDSKDPQAKLFITAATEGRGMDALREHLLQLPEREH 
ASQHS FRLAIDRAFTVKGAGLWTGTALSGEVK 

4643 

A 

1 

1605 

PJWLFLCPPUUJVQCALKDPRSKYSLGGRTTLIITLCK5SGKIQJNIPH 

PSSLSERVMTAKDGFVSRCHLLMQPKQQKWSLMYPMEGEVLENGCWP 

TLQDSLLCTALVDKLLVFLGRCFCTAVEWMLVTCRTAAAVSAFLIV 

GRVSSPVCRAVSVQPWTLTADHTPGRYCLKLVCRQLCLCPSSTPLTE 

VFCSKEAFFIILDCSNIiPHALLPVDSPKGLSKCSNPREKARRKLQGH 

YHVASEVSFVPVRRFPKGEIGANQPGTHRKFYHLTHYRQNLKQPDVP 

HGRIVFDDKDITDWQTAKIMREAVAIVPEGRRVFSRMTVEENLAMGG 

FFAERDQFQERIKWVYELFPRLHERRIQRAGTMSGGEQQMLAIGRAL 

MSNPRLLLLDEP S LGLAP 1 1 IQQI FDTI EQLREQGMTI FLVEQNANQ 

ALKLADRGYVLENGHVVLSDTGDALIjAN^ 

ELIRSIjPCGASYGGIjSI^PWSRGHIPQSHQSSESVRVM 

t TCQCaTMDfDT "DTTUT r*D 
i X O O i>t\ X n r Ij Jr JLi trJSJrl Lt\jtr 

4644 

A 

303 

410 

FNVGGKGNSGLRCIPLLCLLFRLKRKRRKMRQRSK 

4645 

A 

92 

395 

HHHSKLTVTLSPPFPCKXPLPPMKAKLVEERK31NARG*SAKRRKMRQ 
RSQVNPPSFVAPVEATQEHETWNAQTLGMLGTSCGTACYCLELVSMG 
SRTSSAL 

4646 

A 

362 

498 

DVim.IYTVILKEALWLMSVISALCDAEAGELLEAKSLRPAWVTK 

4647 

A 

609 

701 

LRGWJGGRG W AGGVP AL.G PCRGCG PQGRRG 

4648 

A 

1 

104 

TYV* PSVFKVN *HPT * SS S *HRKDQTDILNAINQ 

4649 

A 

1 

1398 

MTKKRRl^GRAKKGRGHVQPIRCTNCARCVPKDNAIKKFVIRNIVEA 
AAVRDISEASVFDAYVLPK1,YVK1^CVSCAIHSKVVRNRSREARKD 
RTPPPRFRPAGAAPRPPPKPMSGAGRGRSHIiDEGREGAGDBVGELSIj 
QQE I DKD PLGG ALG AG S RGTKTTGMQ AG I GGGEQVLVFWD S S LPAW 
DSCFHPVPS PDTS AALS WCLLS FSRTPQQPS PGNFRPEHLtPLLFIiST 
VLPQSVAQTQASGKYLPSRWSQDRCGESREAPLPSKTLEGETPEGVG 
AELRCVRLPG PE WNS VLKPGFS LQLDS S SARMALVFVYGTLKRGQPN 
HRVLRDGAHGSAAFRARGRTLEPYPLVIAGEHNIPWLLHL.PGSGRLV 
EGEVYAVDERMLRFLDDFESCPALYQRTVLRVQLLEDRAPGAEEPPA 
PTAVQCFVYSRATFPPEWAQLPHHDSYDSEGPHGLRYNPRENR 

4650 

A 

1 

1149 

MTKKRRNNGRAKKGRGHVQPIRCTNCMC^PKANAIKKFVIRNIVEA 
AAVRDISEASVFDAYVLPKLYVKLHYCVSCAIHSKVVRNRSREARKD 
RTPPLDLANEKKVDQPPEAKKPKIKVVNVELPIEANLVWQLGKDLLN 
MYIETEGKMIMQDKLEKERNDAKNAVEEYVYEFRDKLCGPYEKFICE 
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QDHQNFLRLLTETEDWLYEEGEDQAKQAYVDKIiEELMKIGTPVKVRF 
QEAEERPKMFEELGQRLQHYAKIAADFRNKDEKYNHIDESEMKKVEK 
SVNEVMEWMNNVMNAQAKKSLDQDPVVRAQE I KTKI KELtiNTCEPW 

TQPKPKIESPKLERTPNGPNIDKKEEDLEDKNNFGAEPPTSGMVNVT 
PNEENFC 

4651 

A 

3 

242 

RASKMTKIRRNNGRAKKGRGHVQPIRCTNCARCVPKDKAIKKFVIRN 
I VEAAAVRD I S E AS VFDGKWVTG ANCVR I P VS 

4652 

A 

213 

658 

EG PARHRLLS PVRASKMTKKRRNNGRA\ KKGRAPRQGTAPLKI FPSG 
PLPPNCARCVP/RRDKAIKKFRHSET*WEAAAV\RDLSEASVFDAYL 
\LPKL\YVKLH\YCVSCAIHKQK*SGNRSS*SPAKTRTPPPRF\RPA 
GCWPHGPPTKSP 

4653 

A 

161 

295 

DLLGEIIYVYIFIQLMYTYIYVYTQTHTHTTNNHHYTTHAYNRH 

4654 

A 

2 

3 94 

QLMGADLCR I KMKKHLTTFLV I LWLY F YRGNGR YQPEL S PQPL IILE 
G KNCS LRC I YTAS P FTS LWR YNHDTGRG P VS LT I MTFS ENTKPNGR Y 
TATLDADTKQSSLHITAYQLIDSASYICWSALCST 

4655 

A 

355 

476 

MPTHTHTHTHTHTHKHML I KL IGWGEANCFQGRWH I LEKT 

4656 

A 

662 

979 

EVLYLKNLSPRVTERDLVSLFARFQEKKGPPIQFRMMTGRMRGQAFI 
TFPNKE I AWQALHLVNG YKLHGKI LV I EFGKNKKQRSNLQATS LISC 
ATGSTTEISGS 

4657 

A 

960 

1387 

NTDKQWSWRARIASRTQKLRNSKRHFVGHLLAQKPQKQSPGRTHAN 
LNS FSLQQKAQEAKPQAEPCLTLTAQPAQKAQKQS PRQNHANLTAQP 

AEKHRSKAQAEPMPNLNSLSLCPFKKRERQTDTWGLRLLTQAGLKLT 
G 

4658 

A 

696 

998 

SLPTTSQPVRltQATDTPIP/VSKRAQLSPEGAGTGQAEEVAQREPPTG 

RGGGAFRPPTRAGDPRRRDSRSGGRNVPRAPGSRKSGDCPSLLPPQP 
SAPSPH 

4659 

A 

240 

463 

GLVFSRLSPEYYDLARAHLRDEEKSCPCLAQEGPQGDLLTKTQELGR 
DYRTCLTIVQKLKKMVDKPTQRSVSNA 

4660 

A 

2939 

3295 

PVCTVSGPMAPGASSSHPVLPLIiGLLEPAGIJ\ELGRTHPITMSTVVIj 
LLQRSVSNAATRVCRTGRSRWRDVCRNFMRRYQSRVIQGLVAGETAQ 
Q I CEDLRLC I P S TGECRGD SRDTS 

4661 

A 

44 

319 

liPRSQGFPERTCAAAGTGVLSFRRSEGLSSCRGSRPPFSAPSLIAEP 
TEQGTGMAKAGE I G IDLGTTYS CVGE SHPGNEKI IRFYQ INLS F 

4662 

A 

2 

896 

FVIAGRRRPLRSRKMSRSGAAAEKADSRQRPQMKVNEYKENQNIAYV 
SLRPAQTTVLIKTAKVYIAPFSLSNYQLDQLMCPKSLSEKNSNNEVA 
CKJCTKIKKTCRRIIPPKMKNTSSKAESTLQNSSSAVHTGKNKLQPKR 
TADAlWIiSVDVESSQDGDSDEDTTPSIiNPSSTPPSPPNPGEISMEKV 
KVKRYVSGKRPDYAPMESSDEEDEEFQFIKKAKEQEAEPEEQEEDSS 
SDPRLRRLQNRISEDVEESRNGVHHVGGNQRELARQKNMKKQSDSVK 
GKRRDDGLSAAARKQR 

4663 

A 

68 

439 

GARPSRPSCFPQLPDPLPSSVFLLWVFVTSSRCLLAGHSSPDPQLCA 
VPQRTQGSPAQPDLGRRLISPSLLLPHSPLPQGPLPQGPLPQSPCCF 
KPWEFMAKHRPHLPRADVAQTRGGEKEEA 

4664 

C 

208 

345 

MSPDRNIECKATGFTETTIYDLLFCKSVDSFYQLLISSSGSKSIT* | 

4665 

A 

1 

320 

RPVPAKLNPRSWPRTAGALPLRPPPLTMAVFHDEVEIEDFQYDEDSE 
TYFYPCPCGDNFSITKEDLENGEDVATCPSCSLIIKVIYDKDQFVCG 
RNSPSPFQPTKN 

4666 

A 

54 

230 

RWVLLFFLS FVNAVMLCAQVG S HHPAL P WAS P PE CCRDH ICPH PVG P 
PGPP\P*PFQPG 

4667 

A 

1 

2303 

MARGSLGVSPRAPLAAPRFSQCGMWRVCAFQTRSRLECCVCLRSEPV 
GYTGSDGRGHRPHSPAVALriVQSRVWQWRVPLVTKNVLRLGTGVRGR 
PCTRFGEIKSPAAAINTLQFEYALCLGRVQRPGHDFCVPAERGVTED 
VRGCFRPEGQDMLALLLVGRLSCPQPGLSAPVTVS PRDCPTI PWWRG 
TGLLDLFTAVPPALSPVPGWAEEQERLRLEQEREQEQKKANSLARLA 
HTLPVEEPRMEAPPLPLSPPAPPPAPPPPLATPAPLTVIPIPWTNS 
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PQPLPPPPPLPAAAQPLPIxAPRQPALVGAPGLS I KEPAPLPSRPQVP 
TPAPLLPDSKATIPPNGSPKPLQPLPTPVLTIAPHPGVQPQLAPQQP 
PPPTLGTLKLAPAEBVKSSEQKKRPGGIGTREVHNKLEKimRAHLKE 
C F ETIjKRN T PNVDDKKT SNIi S VT.R T ATjR Y T OSIjKRKEKEYEHEMEWTi 
AREKIATQQRLAELKHELSQWMDVLEIDRVLRQTGQPEDDQASTSTA 
SEGEDNIDEDMEEDRAG1X5PPKLSHRPQPELLKSTLPPPSTTPAPLP 
PHPHPHPHSVALPPAHLPVQQQQPQQKTPLPAPPPPP7UVPAQTLVPA 
PAHLVATAGGG S TV I AHTATTHAS V I QTVNHVLQG PGG KH I AH I AP S 
APSPAVQLAPATPPIGHITVHPATLNHVAHLGSQLPLYPQPVAVSHI 
AHTLSHQQWGTAGLGPPATVMAKPAVGAQVVHHPQLVGQTVI^P\n: 
MVTMPSFPVSTLKLA 

4668 

6 

1 

651 

MQKEDCVIVLVKLAVEGMAPTQQIHLFVAITFSGWPRQCTGSSFKAI 
DLFLDPDSVMQQIYSFHFGIMCNFDNHAFQNLMAVIDNIKQDRDMDE 
AGSHHSQQTNTGTGNQTPHVLTHKWELSNENTWTQGGEHHTPGPWVL 
GIADFKNEVVDPHGVKPQTFTVSVTALKDDVSGIHSFLLVGSWSANF 
RNEAAE PSALFT IAKIW INLS I HQQMNG * 

4669 

A 

1 

112 

ETHQYTSKDS IDQDLGFIWMNLFFPLRQDKKERKYWE 

4670 

A 

474 

616 

HTKGl^TSVFSQHRIPILPECFIVKLDCVCVCVCVCVFYINIQMYVF 

4671 

A 

197 

334 

IJ^QVFLFCLLVF^LIRCSPIISLEVNTPCVPLRIFSFFKLGHLiI 

4672 

B 

82 

402 

MGLEKLHPFDAGKWGKVINFLKEEKLLSDSMLVEAREASEEDLLVVH 
TRRYLNELKKKVLRPLRTQTGGTIWAGKLAVERGWAINVl^IXRKVE 
LEWGTEDDEYLDK* 

4673 

A 

1 

1524 

MGEKIjAPGNMFEGLDQAVIiEVHPCTLGSGTSESEAPDETSLGWRGHW 

KVKATWGDIPSAATVERVGLGLSIHPSAFTPTLLGAPTTRPIFQIEK 

LRIGDKSL^IMQAPGAQAQAMLNSHEKATYYTIAFMRNVQNRHIYRSK 

MW TNG FOWM PGGRKWVMT AKP T iHTTOL YOHVP ETR W P T VY PR YN T T 

FMGLEKLHPFDAGKWGKVINFLKEEKLLSDSMLVEAREASEEDLLVV 

HTRRYTjNELKWS FAVAT ITE I PPV I FLPN FLVQRKVIjRPIjRTQTGGT 

IMAGKIxAVERGWAINVGGGFHHCSSDRGGGFCAYADITIxAIKFLFER 

VEGISRATIIDLDAHQGNGHERDFMDDKRVYIMDVYNRHIYPGDRFA 

KQAIRRKVELEWGTEDDEYLDKVERNIKKSLQEHLPDVVVYNAGTO 

LEGDRLGGLS I S PAG I VKRDELVFRMVRGRRVP I LMVTSGGYQKRTA 

RI IADS ILNIjFGIiGLIGPESPS VS AQNSDTPLLPPAVP 

4674 

A 

39 

786 

GQGGQEAAQPSLGEAKGCLLSESACSLCPRVMTTPNKGNKALKVKRE 
PGENGTSLTDEELVTMSVREI^QHLRGLFKEEFVQLKQRRRTLKNRG 
YAAS YR VKR VTQKGELE KQKAELQQE VE KLAS KNAS I KLELDALRS K 
YEAMQTFARTVARS SVAP ARGPIxAAGLGPLVPGKVAAT S VIT IVKS K 
TPRPIVGTRVCPGRILCGATRHMANxxAALSLCFLLFSFLPIjFPTIiLS 
SLQSTTCTPGAPG 

4675 

A 

434 

1333 

GASLCQWLNAHCLARHPPAPGWRSPSSLWTGSLASTTYCRLCPSSTG 
FFSlT^^SAEGHQLFLCNVERSVSHFDAKLLSKx^PVANRYEGTEDD 
YGDDPSTNS FEKEKQDYVYCLES SLQTYNPDYVLMVEDDAVPEEQI F 
PVLEHLLRARFSEPHLRDALYLKLYHPERLQHYINPEPMRILEWVGV 
GMLLGPLLTWIYMRFASRPGVSWPGMLFFSLYSMGLAEIiVGRHYFLE 
LGRVS PS LYS WPAPQCCTPAMLF PAPAARRTLTYLSQVYCH KGFGK 
DMALYSLVEGPRERGPM 

4676 

A 

772 

1107 

GLSGRERGNQAALTSEEWAIxAQDFNRPEAHRDQSTSKFSCPGPRVEP 
TGVPGCGSTHMPHCQPRQEPSWPGSGSACSPARPSPSPDASWHCTLT 
LHKHLYADGLRDPASLVC 

4677 

A 

282 

377 

TGEPLLGQCGKKRWGWRLHTES PLAHCLVEL 

4678 

A 

233 

459 

TOKAPESSPNSFPPPRGTLEERWGRPPQEWVDRETOMDGTDVGGEGR 
EKGSPGTWGRGIGEKGWGCLPHCPHQRF 

4679 

A 

761 

962 

EILRNTOFPSSLGWFNGSE^IDEHTHSRAHTHHTHTQTRSHTLPS 
VNIFLPKGNGSSFLRTFLID 

4680 

A 

3 

190 

FFFFCHLSPRLDCSGAIMSYCSLKLLGSSGPPASASQVAGTTGMCHH 
PQPKHPYFLLEWFTL 
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4681 

A 

150 

338 

AAGLDGKHWIGGSCACAGSCKCKKCKCTSCKKSCCSCCPLGCAKCAQ 
GCI CKGASEKCS CCA 

4682 

A 

168 

375 

ATS PRLAMDPNCS CE AGGSCAC AG S CKC KKC KCTS CKKS CC <5 rr PT JCl 
CAKCAQGCICKGASEKCSCCA 

4683 

C 

55 

444 

MLAKRVQGC YKKKKKKN PRFNI GCL YLHLKS TAVPG T OAOOT mt crm/ 
PLQMXAPGQSYCTAHRAAGAAARLCQDGAICDSIAICOEPAOAPGAN 
RRAGAVGSPFRGQGEARSSRSKRKTRSGTDEKRPEL 

4684 

A 

64 

322 

DLGWGFFVKGSTKKMGPLKKTGYQKKFWSFPGIKGPKKPKAFEAKNL 
GPRGPAHYSHPFGGLNQRIGLTRGVPGPPGPHNEALFFF 

4685 

A 

2 

38 

LALPRPQHFQKV ~ 

4686 

A 

185 

451 

SQHFGRPRQVDHLRLGVGDQLDQHGETPSLLKIQNWPGWAHACNPS 
YSGG*GRRI/RLTLGGGGCGELRSCHCTPALATRAKIjRLKKKK 

4687 

A 

344 

468 

MGYNFETGCLMDGSGFLSVGNKRFGEHGIPRSRMGPVTDSK 

4688 

A 

204 

286 

LLKLRRSLFGFVKVLRKYMIYIYTKTI 

4689 

A 

206 

395 

TFCLVFFQFSyNFQLVPVFNTAI>VC*RH*LIAFLNLRII*F*YKYIS 
YIFVKL*QNQISKPWF 

4690 

A 

2112 

2221 

KPYQWFTSYCNVFISFFFLSLNLAKSYCKEIFGCHK 

4691 

A 

116 

382 

GHVSQELYLWELICTILPFFFFFFETELCSVTRLECSGAISAHYNLC 
LPGSSDSPASASQTSGITGVTTAPSHHFISLKSGPSIKPIL 

4692 

A 

11 

666 

SSAVEFGVALGRNQPLKKEKPKWKSDYPMTDGQLRSKRDEFWDTAPA 
FEGRKEIWDALKAAAliAFESlNrDHElxAOAIT^ 

ELGNRYQLPVYCIiAPPINMIEEKSDIETLDlPEPPPNSGYECQIiRLR 
LS TGKDLKLWRS TDTVFHMKRRIiHAAEGVEPGS OR W FFSfiP PT-TDK" 
MKFEELKIPKDYWQVIVSQPVQNPTPVEN 

4693 

A 

47 

241 

RPDSHCLFSPPQLTHASPSLPTEDRGPPQPETFPEDMGNTEAETAHP 
PRSCPPHCREPAPRAAR 

4694 

A 

246 

485 

CQRSKNSGEAASVnDQRRASSRQPGDRTLTHASPSLPTEDRGPPQPET 
FPEDMGNTEAETAHP PRSCPPHCREPAPRAAR 

4695 

A 

2428 

2657 

HPVKHIIRPPKHTSGKKPNKRVTRKNSIFMDEV*PRKIEVEFQEEW 
IRGMMPWSTSCSLVLLCCSLSWSTQIVNF 

4696 

A 

1137 

1491 

AVPKRRL TLPVAS GWNTMDHTGAIDTEDE VG PIxAHLAPS PQ S EAVAH 
EFQELSLQSS01^PPlxNERKlTV1^0LPLOnpp T rPT?OT unnfTMDDVTv 
TI TAI PNVTEHDGRVQEEPAS CS 

4697 

A 

3 

728 

YNSARPRPPAPPAPQQNSAQRSRAPGSERPAAQSGGETDRSADRDSG 
RDSGETRVCERAGPSGPRSGSAAORAPSANMS VDPT, q q kat.ttt ttp wt 

SEOTPHLSDEALMGLSVRELNRHLRGLSAEEVTRLKQRRRTLKNRGY 

AASCRVKRVCQKEELQKQKSELEREVDKlxARENAAMRLELDAIiRGKC 

BALQGFARSVAAARGPATLVAPASVITIVKSTPGSGSGPAHGPDPAH 
GPASCS 

4698 

A 

825 

941 

KVHWCRAWWHVPVFPASGKAEAGRFLEPRSLRQAWTTQ ] 

4699 

A 

236 

449 

GRMAQDL S E KDLLKME VEQLKKE VKNTR I P I S KAGKE I KE YVE AQAG 
NDP FLKGI PEDKNPFKEKGGCL I S 

4700 

A 

1 

299 

MALRKNEVFEGWDGRKGVSHGEGCFSKGSNIGAYLGKSQEQFEGNIR 
GlTOQIIVNx^SKLKISEEDLTPRMRQRSNTLPKSFGSQLEKEDEKKQ 
ELVDKAIKPSVl^TLESIQRXLQEKPAESSRPEDIKDMTKDQIANEK 
VALQKALLYYESIHGRPVTKNERQVMKPLYDRYRLVKQILSRANTIP 
IIVSRIPSRGWYNTFP*CPQGGCFPLASPGASSE 

4701 

A 

1 

238 

AVSHKPFGVLKWPlWHQSIxNEPVSDPPLVPPSPCRLLPPPLTPPTSP 
QILFSSSAAATVTAVSAPVFLLQSFPPQYLQI 

4702 

A 

1082 

1436 

VPPAGGRGVPPCVRGQHSVQPDPHLQGPWGLLLQHPGREYHQQDTSV 
PQDPGVALQNPAGCLCFPMCYTYHCDVGLHHVHDPAECHSAKGHGGE 
PGATL WGQVLL PD VLWAFLAGDS I 

4703 

A 

126 

417 

PCAQRQAKYSYRSKYRWPDNSTALDPVFFPIDGPVQVLRLLPLPHPT 

PPHVRIIQGPLWPSGPIPPQLGRSRLSWGEKGTMGLGRWSGPCREWS 
GP 


± I ;ii l!l !R 4- '£■» ■+ 3 ». O 2 E» 6. til E? 


4704 

A 

867 

963 

RIVICWAQWLTAVIPPLWEAKAGGLPELRSS 

4705 

A 

3 

242 

GSQGSRRCSGRPGMHLWLSVPTDVTASASLPQSPPAAKPGPHLGVIS 
CliTSSRHLGFFGLISGSLALTLALTQLIPASP 

4706 

A 

3 

3082 

ASQAFLRVPALPMDVLPLliWEDPPTGAARGPGPDPTTHQPPEVAIF 
AASRVEGGLRPEQENRRDAGRPQIRTPKQGRAEQTRHPARVSSGGGG 
SQE PTQ VNTFMS F I FPMWI SVLNT 1 1 ANKLTVMVRQAAEQGQVCTV 
GGEHSTFSMAIEPGRVQALRHGVRVLRAWIAFWCWLPYHVRRLMF 
CYISDEQWTPFLYDFYHYFYMVTNALFYVSSTINPILYNLVS7VNFRH 
IFIiATLACLCPAKAQGLCQDTTWEGAQASASRSSAVASRARQKGRRI 
RGLVSSTCPSGRGQDGDPGSTEHMTSQAPGLRRSPKPKRRQSQEKLA 
GTSQELGATGSKSRDTCPRTTDTPTTPDLVSLACGQVMMWPRKLAIiR 
AMSASVMESGAPEPAPGDGTAIiTAQtPTPTPTICRPVSLLAQSGTRA 
LDCCPPRAPCUTLSTPRFVPRVGHSVMFKGSLGSVPRRKELEQRGNK 
ADGRAHVRTTREPVPDQFPRPQLSPRRSEGHRRSLHPNRRNTDIDDN 
IIGMEIQPWNSSSVMEREIWGHWPGSLPQAPLNSSPIiKPEPGPTNQV 
TSRATGDTPSQAQTPANYPGPGARPCQLNPHHLTWFKAHGHREPVCS 
WAKGQTGPGEVDVSCTYTSSRQEDARRTDGETGWERECDLPVIPEVW 
LPELELKPEPFTKSSSQDLSWPWMNPVRCLSCLYRGIRALPASHYWT 
LVKELGDRVAAEGTEPAVSRVQRRTTRQSAKGSVRRSSMGRAPCHFQ 
QGAHREGEPPVKHPSPKSPAGIGCSSLCVETSHSGVFFTQGIiHLGVH 
WGGLSLSRVLHPTHLSLGSHGEQEHRGQQAEGSARAKGTFALASCSP 
GTRSRTSADPKGVHSGYGPSGVPLKAGRGQDFILFLPPLRAVTNCSP 
DEVNSMSDGWPSLPDGWPSLSDGMVPGLSDGWPSLSDGWPGLS 
DGWPGLSDGMVPGLSDGWPSLSDGWPGLSDGLVPGIjFDGWPSL 
SEGWPSLSDGWPGLSDGLVPGLFDGWPSLSEDGERS 

4707 

A 

3 

347 

GGLRPEQENRRDAGRPQIRTPKQGRAEQTRHPARVSSGGGGSQEPTQ 
VNTFMSFIFPIWVISVLNTIIANKIjTVMVRQAAECKSQVCTVGGEHST 
FSMAIEPGRVQALRHGVRVL 

4708 

A 

280 

417 

YEIRCRGCVCGAWTRVPTFASCGSQLLAGPGQLLLGLSSIiRLRRPS 

4709 

A 

1 

1626 

SSSPRAARAGLGAVLGGLGNRAVWRSEAPGTRGKRRAGRQPEEPRVL 
ELGAGSWESGGERSPEPGARGGASGSGASRLDGAPAGLRHAPSPGLA 
FIGPRLRRAHSCPDFQPRRRRTEPRTPAPTMRLNSSAPGTPGTPAAD 
PFQRAQAGLEEALLAPGFGNASGNASERVLAAPSSELDVNTDIYSKV 
LVTAVYIALFWGTVGNTVTAFTIJ^ 

DLLTXjLLAMPVELYNFIWVHHPWAFGDAGCRGYYFLRDACTYATALN 
VASLSVERYIJVICHPFKAKTIjMSRSRTKKFISAIWLASAIiLTVPMLF 
TMGEQNRSADGQHAGGLVCTPTIHTATVKWIQVNTFMSFIFPMVVI 
S VLNT 1 1 ANKLTVMVRQ AAEQGQVCTVGGEHSTF smai e pgrvqalr 
HGVRVLRAWIAFWCWLPYHVRRLMFCYISDEQWTPFLYDFYHYFY 
MVTNAL FYVS S TI N P I L YNL VS ANFRH I FLATLACLC P VWRRRRKRP 
AF S RKADS VS SNHTLS SNATRETLY 

4710 

A 

244 

318 

RKLLHIYIYIYIYIYDLIYWQFYLM 

4711 

A 

1 

1605 

MGIVEPGCGDMLTGTEPMPGSDEGRAPGADPQHRYFYPEPGAQDADB 
RRGGG SLGS PYPGGALVPAPPSRFLGAYAYPPRPQAAGFPGAGES FP 
PPADAEGYQPGEGYAAPDPRAGLYPGPREDYALPAGLEVSGKLRVAL 
NNHLLWSKFNQHQTEMIITKCfGRRMFPFLSFTVAGLEPTSHYRMFVD 
VVLVDQHHWRYQSGKWVQCGKAEGSMPGNRIjYVHPDS PNTGAHWMRQ 
EVSFGKLKIiTNNKGASNNVTQMIVLQSI^KYQPRIJIIVEVNDGEP^ 
ACNASNTHI FTFQETQFI AVTAYQNAE 1 TQLKIDNNPFAKGFRENFE 
SMYTSVDTSIPSPPGPNCQFLGGDHYSPLLPNQYPVPSRFYPDLPGQ 
AKDWPQAYWLGAPRDHSYEAEFRAVSMKPAFLPSAPGPTMSYYRGQ 
EVLAPGAGWPVAPQYPPKMGPASWFRPMRTLPMEPGPGGSEGRGPED 
QGPPLVWTEIAPIRPESSDSGLGEGDSKRRRVSPYPSSGDSSSPAGA 
PS PFDKEAEGQFYNYFPN 

4712 

A 

1 

816 

VYIAAFAVSAYSSTYHRAGCKPFNPVLGETYECERPDRGFRFISEQA 
VTAMNWVSWKVKICVLEALQAASGSPSFTSTICSRGCMYTSVDTSIP 
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SPPGPNCQFLGGDHYSPLLPNQYPVPSRFYPDLPGQAKDWPQAYWIi 
GAPRDHSYEAEFRAVSMKPAFLPSAPGPTMSYYRGQEVLAPGAGWPV 
APOYPPKMGPASWFRPMRTLiPMEPGPGGSEGRGPEDOGPPLVWTETA 
PIRPESSDSGLGEGDSKRRRVSPYPSRGDSSSPAGAD 

4713 

A 

395 

1374 

KVTLMRWLYKG*NSGPMSV/TEEI^LAIEASEMIVliQSLHKYQPRLH 
IVEVNDGEPEAACNASNTHIFTFQETQFIAGTAYLNAEITQLKIDNN 
PFAKGFRENFESMYTSVDTSIPSPPGPNCQFLGGDHYSPLLPNQYPV 
PSRFYPDLPGQAKDVGGSSRVNREVSKDFADFGTTIKQDFRLLGQTS 
VDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRALADSLM 
QI.ARQVSRLESGQDFADFGTTIKQDFRLLGQTSVDRLLQLSQGQAVK 
GNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQ 

4714 

A 

1 

2109 

MQEIHGALDSPGDTDSDMKMQRQTQTRYRDKA5SEAHSPRDPLARGG 
AGPRRIiREGASPPRMGlVEPGCGDMLTGTEPMPGSDEGRAPGADPQH 
RYFYPEPGAQDADERRGGGSLGSPYPGGALVPAPPSRFLGAYAYPPR 
PQAAGFPGAGESFPPPADAEGYQPGEGYAAPDPRAGLYPGPREDYAIi 
PAGLEVSGKLRVALNNHLLWSKFNQHQTEMI I TKQGR S AI>WWWV 
AVVVKRKAKPLELVMELVLVEVVVVVTVIiLV^^ 

G AGGDRG YRDGGGG S CS DGGCDGGDGWRM PPFLS FTV AGL EPTSHYR 
MFVDVVLVDQHHVniYQSGKWVQCGKAEGSMPGKALHPPGFIjSPGIjEC 
KRGELSTGTRLYVHPDS PNTGAHWMRQEVS FGKLKLTNNKGASNNVT 
QMIVljQSLHKYQPRIiHIVEVl^GEPEAACNASNTHIFTFQETQFIAV 
TAYQNAE I TQLKIDNNPFAKGFRENFESMYTS VDTS I PS PPGPNCQF 
LGGDHYSPLLPNQYPVPSRFYPDLPGQAKDWPQAYWLGAPRDHSYE 
AEFRAVSMKPAFLPSAPGPTMSYYRGQEVLAPGAGWPVAPQYPPKMG 
PASWFRPMRTLPMEPGPGGSEGRGPEDQGPPLVWTEIAPIRPESSDS 
GLGEGDSKRRRVSPYPS SGDS S S PAGAPS PFDKEAEGQFYNYFPN 

4715 

A 

251 

623 

RKSAGPPGRTEGSRQVPGRRCPAALQAPVIPLVPALAMPYLPSEPEP 
IFSYPHSMQGAPSPRNTORQTHTHGHSPQLPGFPMCSGKGPAKPEQP 
PLNLDAHLSS FQLPSEDQGAQIRRKGEGLR 

4716 

A 

36 

69 

PFPMHQSHFP 

4717 

A 

328 

441 

LKFWDETFLYFKTTSPCKHHTHMACMYAHAHTHTHTPQ 

471S 

A 

2 

786 

GRVGEAQHYLSKTPESSAQPSETILWDVQTDSNPNQEKDSSSNEKQT 
ISI^PVSTSKBRKESTEPKTCIESMEKKTDSLVQNGNERSDDTVSNIS 
IilDQKMPDIEPNSENNCSSSDIVNGHSEIEQTPLVSSDPAIiKIDTNR 
IRTENGSILPSWPQEHNTLPVSQAPSKPNLTSEHTSYGLILTKPYV 
RPLPPSYLDERYLSMPIGIRKFLTDRVDACSDQDNVYKKSVKRLRCGK 
CLTTYCNAEAIiEAHLAQKKCQTLFG I 

4719 

A 

560 

688 

ARYNPPKKKMLQRKCLFPRLVLNSWPQAILIiPWPSKVLGLQA 

4720 

A 

245 

382 

CCWCCWCCWCWCWCCCCCCHRRRCLIHLEFVVAGIjAKPAQENKNK 

4721 

A 

235 

425 

PVLCRGNSGSLSRKFPPKPQKPAPKDHPRTCVYLENRSPGKSESKKI 
PAYAETNPPSQLSPWS 

4722 

A 

253 

363 

CQQFLLLCQGSQAEPTEHGPSRVFPEPPPPYWNPSSV 

4723 

A 

235 

320 

QKNQEPAGHQPGP I PPCVLGLGNEL IRS 

4724 

A 

1 

1326 

MVG PRS QPDTANRAPTGLTQLI ALQS PS AHRPDTAHHAP PTDIVTQPM 
ALHPQPDT AMALCPQ APKRRTHP VLNECTVNDRVNDP PN I QVPRPPG 
LL S AP P P P P I LHLTIiIjQAS PDMRL YGL VFLFLS GGTRQR CS PELL VR 
AAGMCAPSTCTRVLFTQRDRGQSPGPLQGPCHPYLAPHMTSCRLFWF 
FAGRQTTAGVPVLKOVGCKLSLVFLQYCIMANFFWLLVEGLYLHTLL 
VAMLP PRRCFLAYIjLIGWGLPTVCIGAWTAARLYTjEDTGCWDTNDHS 
VPVWVIRIPILISIIVNFVLFISIIRIIiLQKLTSPDVGGNDQSQYKR 
LAKSTLLLIPLFGVHYMVFAVFPISISSKYQILFELCbGSFQGLVVA 
VLYCFLNSEVQCELKRKWRSRCPTPSASRDYRVCGSSFSRNGSEGAIi 
QFHRGSRAQSFLQTETSVI 

4725 

A 

2 

1502 

C I ERAPAAGTEGAAPAIiG ALGYS CGARGLRAIiAPG PC WRRRNRGDLG 
RRRRALGG PRHAELGMRTLL P PALLTCWLLAPVNS I HPECRFHLE IQ 
EEETKCAELLRSQTEKHKACSGVWDNITCWRPAOTGETVTVPCPKVF 
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KA I YTLGYS VS LMS LATGS 1 1 LCL FRKLHCTRNY IHLNLFL S F ILRA 
ISVLVKDDVLYSSSGTLHCPDQPSSWGCKLSLVFLQYCIMANFFWL 
LVEGL YLHTLLVAMLPPRRCFLAYLL I GWGLPTVC IGAWTAARL YLE 
DTGC WDTNDHS VPWWV IRIPILISI I VNFVLFI S I IRILLQKLTSPD 
VGGNDQS Q YKRLAKS TLLL I PLFG VH YMVF AVFP I S I S S KYQ I LFEI* 

CLGSFQGLWAVLYCFLNSEVQCELKRKWRSRCPTPSASRDYRVCGS 
S FSRNGS EGALQFHRGSRAQSFLQTETS VI 

4726 

B 

144 

2704 

XHGSVVVV?DLREDSRLHYSVTLSDGFWTFRTATFSTDGILTSVNHRS 
PLQAVEPISTSVHKKQSFVLSPFSTQEEMSGLSFHIASLDESGVLNV 
WVVVELPKADIAGSISDLGLMPGGRVKLWDSSTDSHAVTGLQWSPTR 
PAVFLVQDDTSNIYIWDLLQSDLGPVAKQQVSPNRIiVAMAAVGEPEK 
AGGSFI^VLARASGSIDIQHLKRRWAAPEVHSCVERKWSGHTASRV 
HTHSPSLLLFCNPLDDWSRSFETEAPKHWVFFEPYRNVKFVKKETSK 
KANDLPLSLTPREELARCLWAPPSPSVEGPMTAEAVPWNESSHSSKG 
RREGPEPHRRRS S AEQE VQAARLQGKERKARGEHGRS I KGRENGAKQ 
DKPRQAPPTVSVFROPSPPSRESAHS c 5nTPFnT.rir , QtrrTPT thput rc 

LCDYLLQTPRPGVPEHRASPGSKCTRHSHSHTLGRGEGWCTCPSAT 
DTPQWERESNPLDEVRRGHAGEHRLLSQSLSTVGAESISSKSTGWA 
PLNAPVRTFLGESLGSAFSSAS I ALRTS VKRRHKWARPHTNGARQPP 
GGKGTPPGGKEHPYEIKAPPPCVSRWLSSPPPGTPSCTTVRRSWLPS 
ALKHCCVLRREMAVCHRLSKCPANAI VKHPCEGKSPPYRALWHRSGV 
WTVAGTQRS PN I RAHRDQTLRNNYT I L I PNGNWRHG AY AGVS S FS RX 
* i 

4727 

A 

8 

3281 

S KINE YDMY IRNFGKKNTKQAYVQCNEDNVERDIQTED I ETREVWTQ 

HPGESTWSGGSEQRDTSDAWMPKIDTPRLCSFLRAACQVMAVLLE 

EDRlWVEPSWNLRAQDRAIiYFSDSSSQLNTSLPFLQNRKVSSLHTSR 

VQRQMWSVHDLPEKSFVPLx^SKYVLCVWDIWQPSGPQKVIilCESQ 

VTCCCLSPLKAFLLFAGTAHGSVWWDLREDSRLHYSVTLSDGFWTF 

RTATFSTEMSGLSFHIASLDESGVLNVWEEDGLPRAACFLRKRTASR 

ALPASSGRGRPPARCRLPQEEDGLPRAAGFLRKRTASRAJjPASSGRG 

RPPARCRLPQEEDGLPRAAGFLRKRTASRALPAFSGRGRPPARCRLP 

QEEDGLPRAACFLRKRTASRLLPVSSGRGRPPARCRFPQEEDGLPRA 

ACFLRKRTASRALPVSSGRGRPPARCRLPQEEDGLPRAAGFHRKRTA 

S RALPAS SGRGRPPARCRLPQEEDGLPRAAGFLRKRTAS RALLALSG 

LLRFS FECHHTHTCDS FPPQRETSEWTS S LS VRAKCTQFCS VTTSTT 

GQCEVFELVLTENCQLPKDLrGALLPTPGASIRVRSVVTLiQSAHFSYY 

MRDSSRTCSERSFTMSDTLYSLRQETFSLDTISAQHYYSPAPDEVCQ 

RPRGGYTAQIVDRVAMQPAYLSHKGNEFWGTTQTI^VKFLPSDPNHF 

IIGTDMWDSSTDSHAVTGLQWSPTRPAVFLVQDDTSNIYIWDLLQSD 

LGPVAKQQVSPiraOTEKONKKXRMFnRKT^FPYTC^^ nor t 

MTYPxAHV\^KQVGCKLSLVFLQYCIMANFFWLLVEGLYLHTIiLVAl^ 
PPRRCFLAYLL I GWGLPTVC I GAWTAARL YLPHTfja pt .<? wwuvrrii 
PKTVLDARNAGHliLFVF I SRDTPGHLKFS FY I QVERPGHJjYDPAPRV 
HTVRGHAASALKHCS S ATRDAVCHRLS KC PANA I VKHPCE VKS P PYR 
ALWHRSGVWTVAGTQRSPNISAHRDQVCGOSOARSGAAAAGPPRSGR 

WVPFQGDLTGAHCSSCFSPRELSVPLGTQHSLCTVLHMDLCGHHLLA 
FIPSSSSLPC 

4728 

A 

162 

347 

SQHFGRPRWADDLRSGVLPASASQSVGITGMSHCTQPLFLMLITYFY 
LLMHQLGQSVQHEQ 

4729 

A 

201 

522 

AETKPKQl^LCPRSSWLEGARARPH^ 

PTWIMGAGNXERWSTPKGRLKMPRLGLETSLPTKRRWLSFPRALNS 
QQQGFTPQSLYA 

4730 

A 

205 

424 

GTYRLGRLAH PLPGS S AP VPCPS YRS S PPP PHPE I ARDTRRPPAS PG 
AGRGTPGVEEGPIYMLFSLPTRAGLA 

4731 

A 

343 

500 

GSL WAMLS FFLFFLS FS FFLS FFAFFS FFL.FFFFFANNLQAYVKYS 
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KNTGS 

4732 

A 

103 

361 

FWLRGLMACGRFAQRPANHQARPRRI/GSAPKPSPSAWPAMPCTLAES 
GVAQPGVGFGPARWSALiPRS PGGWGRFGEWPVAKTKGIW 

4733 

A 

1097 

1298 

FIVESQLFLVFWFETLSWSIRAGVFTFLWSRWISYVCLFAYFVYEF 
CNIjNWQVSYI YVFTINS IKR 

4734 

A 

874 

1184 

PWLGLLIFSLPLPFLSLSSRSHLPGLKYFVGIAYYIILADEPQDNVY 
THTHT YTHTKSQLLKSGLGI RLLCPVKNS CTGSHCDLS S YRL I PVEI# 
DSNFLFDSN 

4735 

A 

451 

641 

YLRERREKERGREREKERKKERKKERKKERKKKKRRKKERKRQEEEB 
RDRKNKKTAILRMRIN 

4736 

A 

1 

1215 

MTLKEHAAFKHLFlTKAHLAPPLIHLTLSGHSTCFREHWVGDIVHRDIi 

KLEN I MVKS S L I DDNNE TNLN T KVTD FfiT , AVTCKH Q I? «3 T7 Zi MT n JX TPHT 

PIYMDRGWNSLEGSEQDRKMWDSLELPRDLLNVFEQNADNDMDDEIQ 
AEVLSDGDEELLGNWSKGDSCYVLLKQLAAFCPCLRDLWNFKLERDD 
LGYLVEE I SKOOS 10EATWVLLKAFS FI REAEHKNT.RML.OPDNA I EK 
KIPFSEEKFKLAAEICISNQEANVNPQDNEKNVSRACQRSSQQPLPL 
PRVLGGKS G FVG WAQDSHAE I YTFGR I LGKG S FG IV I E ATDKETETK 
WAIKKVNKEKSTAYEKQFPATSKDNFDMCSSSFTSSKLLPAEIKGEM 
EKTPVTPSQGTATKYPAKSGALSRTKKKL 

4737 

A 

1 

417 

PRS PTMPADLSGTWTLLS SDNFEGYMLALG IDFATRKI AKLLKPQKV 
IEQNGDSFTIHTNSSLRNYFVKFKVGEKFDEDimGLDNRKCKSLGIW 
DNDRLTC I QKGEKKNRG WTHW I EGDKLHLEM FCEGQVCKQTFQRA 

4738 

A 

19 

345 

AATLGHCPSSIQCWRSAGSCPRASSSGPQASPMAW*STDLR*MSWA 
TSI^WCPQWPPTPSCSQSSWAA\LHHRCGWVCHCHLTGEDLRPEAR 
LPGGQQPGAGGPGPQ 

4739 

A 

40 

209 

DGRCNSQWSRRLQGPRWVHTEGPGPASARLPQPHPGPGCPLLCGHCV 
PQEPEEYFP*LPQPHPGPGCPIiLCGHCVPQEPEEYFP 

4740 

A 

2 

4097 

PCP PHADCRDLWQTFS CTCQPG YYGPGCVDACLLNPCQNQGS CRHLP 

GAPHGYTCDCVGGYFGHHCEHRMDQQCPRGWWGSPTCGPCNCDVHKG 

FDPNCNKTNGQCHCKEFHYRPRGSDSCLPCDCYPVGSTSRSCAPHSG 

QCPCRPGALGRQCNSCDSPFAEVTASGCRVLYDACPKSLRSGVWWPQ* 

TKFGVx^TVPCPRGALGLRGAGAAVRLCDEAQGWLEPDLFNCTSPAF 

RELSLLLDGLELNKTALDTMEAKKLAQRLREVTGHTDHYFSQDVRVT 

ARLIAHLLAFESHQQGFGLiTATQDAHFNENLLWAGSALIjAPETGDLW 

AALGQRAPGGS PGS AGLVRHLEE YAATLARNMELTYLNPMGLVTPNI 

MLSIDRMEHPSSPRGARRYPRYHSNIjFRGQDAWDPHTHVIjLPSQSPR 

PSPSEVLPTSSSIENSTTSSWPPPAPPEPEPGISIIILLVYRTU3G 

LLPAQFQAERRGARLPQNPVMNS PWS VAVFHGRNFLRGILESP I SL 

EFRLLQTANRSKAICVQWDPPGLAEQHGVWTARDCELVHRNGSHARC 

RCSRTGTFGVLMDASPRERLEGDLELLAVFTHVVVAVSVAAI,VLTAA 

ILLSLRSLKSNVRGIHANVAAALGVAELLFLLGIHRTHNQLVCTAVA 

ILLHYFFLSTFAWLFVQGLHLYRMQVEPRNVDRGAMRFYHALGWGVP 

AVLLGLAVGLDPEGYGNPDFCWI S VHE PL I WS FAG PWI> VI VMNGTM 

FLIJU^TSCSTGQREAKKTSALTLRSSFLLLLLVSASWLFGLLAVNH 

S ILAFHYLHAGLCGLQGLAVLLLFC\njNADARAAWMPACL.GRKA 

EARPAPGIiGPGAYNNTALFEESGIiIRITIiGASTVSSVSSARSGRTQD 

QDSQRGRSYLRDNVLVRHGSAADHTDHSLQAHAGPTDLDVAMFHRDA 

GADSDSDSDLSLEEERSLSIPSSESEDNGRTRGRFQRPLCRAAQSER 

LLTHPKDVTONDLLSYWPAIiGECEAAPCALQTWGSERRLGLDTSKDA 

ANNNQPDPALTSGDETSLGRAQRQRKGILKNRLQYPLVPQTRGAPEL 

SWCRAATLGHRAVPAASYGRIYAGGGTGSLSQPASRYSSREQLDLIiL 

RRQLSRERLEEAPAPVLRPLSRPGSQECMDAAPGRLEPKDRGSTLPR 

RQPPRDYPGAMAGRFGSRDALDLGA\PREWLSTLPPPRRTRDLDPQP 

PPLPLSPQRQLSRDPLLPSRPLDSLSRSSNSREQLDQVPSRHPSREA 

LGPLPQLLRAREDSVSGPSHGPSTEQLDILSSILASFNSSALSSVQS 

SSTPLGPHTTATPSATASVLGPSTPRSATSHSISELSPDSEVPRSEG 
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HS 

4741 

A 

2 

10321 

ISHSTPSQSYENAHGHQNFRPKSGTRFRTSGQEGGTRDLEGQKKAGR 
RSWDSALEGGSQBELGGGGHQGWDPGLAATTGPRAHIGGGAIiALCPB 
SSGVREDGGPGLGVREPIFVGLRGRRQSARNSRGPPEQPNEELGIEH 
GVQPLGSRERETGQGPGSVLYWRPEVSSCGRTGPLQRGSLSPGALSS 
GVPGSGNSSPLPSDFLIRHHGPKPVSSQRNAGTGSRKRVGTARCCGE 
LWATGSKGQGERATTSGAERTAPRRNCLPGASGSGPELDSAPRTART 
APASGSAPRESRTAPEPAPKRMRSRGLFRCRFLPQRPGPRPPGLPAR 
PEARKVTSANRARFRRAANRHPQFPQYNYQTLVPENEAAGTAVIiRVV 
AQDPDAGEAGRLVYSLAALMNSRSLELFSIDPQSGLIRTAAALDRES 
MERHYLRVTAQDHGS PRLSATTMVAVTVADRNDHS PVFEQAQYRETL 
RENVEEGYPILQLRATDGDAPPNANLRYRFVGPPAARAAAAAAFEID 
PRSGLISTSGR\TOREHMESYELVVEASDQGQEPGPRSATVRVHITVL 
DENDNAPQFSEKRYVAQVREDVRPHTWLRVTATDRDKDANGLVHYN 
IISGNSRGHFAIDSLTGEIQWAPLDFEAEREYALRIRAQDAGRPPL 
SNNTGLAS IQVVDINDHI PI FVSTPFQVSVLENAPLGHS VIHIQAVD 
ADHGENARLEYSLTGVAPDTPFVINSATGWVSVSGPLDRESVEHYFF 
GVEARDHGS PPLS AS AS VTVTVLDVNDNRPEFTMKE YHLRLNEDAAV 
GTSWSVTAVDRDANSAISYQITGGNTRNRFAISTQGGVGLVTLALP 
LDYKQERYFKLVLTASDRALHDHCYVHINITDANTHRPVFQSAHYSV 
SVNEDRPMGSTIWISASDDDVGENARITYLLEDNLPQFRIDADSGA 
ITLQAPIJ5YEDQVTYTLAITARDNGI PQKADTTYVEVMVNDVNDNAP 
QFV7VSHYTGLVSEDAPPFTSVLQISATDRDAHANGRVQYTFQNGEDG 
DGDFT I E PTSG I VRTVRRLDREAVS VYELTAYAVDRGVP PLRTPVS I 
QVMVQD VNDNAPVFPAEE FE VRVKENS I VGS WAQ I TAVDPDEGPNA 
HIMYQI VEGNI PELFQMDI FSGELTALIDLDYEARQEYVI WQATSA 
PLVSRATVHVRLVDQNDNSPVLNNFQILFNNYVSNRSDTFPSGIIGR 
I PAYDPDVSDHLF YS FERGNELQLLWNQTSGELRLSRKLDNNRPLV 
ASMLVTVTDGLHS VTAQCVLRWI I TEELLANS LTVRLENMWQERFL 
SPLLGRFLEGVAAVIiATPAEDVFIFNIQNDTDVGGTVLNVSFSALAP 
RGAGAGAAGPWFSSEELQEQLYVRRAAIiAARSIjLDVLPFDDNVCLRE 
PCENYMKCV S VLRFDS S APFLAS ASTLFRP IQP I AGLRCRCPPGFTG 
DFCETELDLCYSNPCRNGGACARREGGYTCVCRPRFTGEDCEIiDTEA 
GRCVPG VCRNGGTCTD APNGGFRCQCPAGGAFEGPRCEVAAR S FP PS 
SFVMFRGLRQRFHLTLSLSFATVQQSGLLFYNGRLNEKHBFLALELV 
AGQVRLTYSTGESNTVVSPIVPGGLSIXjQWHTVHI»RYYNKPRTDAIjG 
GAQGPS KDKVAVLS VDDCDVAVALQFGAE I GNYSCAAAG VQTS S KKS 
LDLTGPLLLGGVPNLPENFPVSHKDFIGCMRDLHIDGRRVDMAAFVA 
NNGTMAGCQAKLHFCDSGPCKNSGFCSERWGSFSCDCPVGFGGKDCQ 
LTMAHPHHFRGNGTLSWNFGSDMAVSVPWYLGLAFRTRATQGVLMQV 
QAGPHSTLLCQLDRGLLSVTVTRGSGRASHLLLDQVTVSDGRWHDLR 
LELQEEPGGRRGHHVLMVSLDFSLFQDTMAVGSELQGLKVKQLHVGG 
LPPGSAEEAPQGLVGCI.QGVWLGSTPSGSPALLPPSHRVNAEPGCW 
TNACASGPCPPHADCRDLWQTFSCTCQPGYYGPGCVDACLIiNPCQNQ 
GSCRHLPGAPHGYTCDCVGGYFGHHCEHRMDQQCPRGWWGSPTCGPC 
NCDVHKGFDPNCNKTNGQCHCKEFHYRPRGSDSCLPCDCYPVGSTSR 
SCAPHSGQCPCRPGALGRQCNSCDSPFAEVTASGCRVLYDACPKSLR 
SGVWWPQTKFGVLATVPCPRGALGAAVRLCDEAQGWLEPDLFNCTSP 
AFRELSLLLDGLELNKTALDTMEAKKLAQRLREVTGHTDHYFSQDVR 
VTARLLAHLIiAFESHQQGFGIiTATQDAHFNENLLWAGSALLAPETGD 
LWAALGQRAPGGSPGSAGbVRHLEEYAATLARNMELTYLNPMGLVTP 
NIMLSIDRMEHPSSPRGARRYPRYHSNLFRGQDAWDPHTHVLLPSQS 
PRPSPSEVLPTSSSIENSTTSSWPPPAPPEPEPGISIIILLVYRTL 
GG LLPAQ FQAERRGARL PQNPVMNS P WS VAVFHGRNFLRG I LES PI 
SLEFRLLQTANRSKAICVQWDPPGLAEQHGVWTARDCELVHRNGSHA 
RCRCSRTGTFGVLhIDASPRERLEGDLELIjAVFTHVVVAVSVAALVliT 
AAILLSLRSLKSNVRGIHT^NVAAALGVAELLFLLGIHRTHNQVQDQG 
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QGTCVLMTLLAQEAWGQNSGSELVCTAVAILLHYFFLSTFAWLPVQG 
LHLYRMQVEPRNVDRGAMRFYHALGWGVPAVLLGLAVGLDPEGYGNP 
DFCWI S VHEPLI WS FAGPWLVI VMNGTMFLLAARTSCSTGQREAKK 
TSALRTLRSSFLLLLLVSASWLFGLLAVNHSILAFHYLHAGLCGLQG 
LAVLLLFCV^NADARAAWMPACLGRKAAPEEARPAPGLGPGAYNNTA 
LFEESGLIRITLGASTVSSVSSARSGRTQDQDSQRGRSYLRDNVIiVR 
HGSAADHTDHSLQAHAGPTDLDVAMFHRDAGADSDSDSDLSLEEERS 
LSIPSSESEDNGRTRGRFQRPLCRAAQSERLLTHPKDVDGNDLLSYW 
PAIiGECEAAPCALQTWGSERRLGLDTSKDAANNNQPDPAIiTSGDETS 
LGRAQRQRKGILKNRLQYPLVPQTRGAPELSWCRAATLGHRAVPAAS 
YGRIYAGGGTGSLSQPASRYSSREQLDLLLRRQLSRERLEEAPAPVL 
RPLSRPGSQECMDAAPGRLEPKDRGSTLPRRQPPRDYPGAMAGRFGS 
RDALDLGAPREWLSTLPPPRRTRDLDPQPPPLPLSPQRQLSRDPIiLP 
SRPLDSLSRSSNSREQLDQVPSRHPSREALGPLPQLLRAREDSVSGP 
SHGPSTEQLDI LS S I LAS FNS SALSSVQS S S TPLGPHTTATPS ATAS 
VLGPSTPRSATSHSISELSPDSEPRDTQAIiLSATQAMDLRRRDYHMB 
RPLLNQEHLEELGRWGSAPRTHQWRTWLQCSRARAYALLLQHLPVLV 
WLPGYPWA 

4742 

A 

157 

273 

NFHS FFLLFPKHP I LLLGRDLLTKFGAT I S LNQDR I EVF 

4743 

A 

469 

676 

LECPRFSGTRSWGCLAEAPHLPPRVLSSPEVPKPLQIKLSPPNPPAB 
PLGNTLPTKRYPVWNPGPYFRD 

4744 

A 

237 

497 

GGLSQRKAGRSSVSRPSPLEPEAPPAQASKVQEASHIiETSADLTTHP 
RASKGPTPEPTSDPTVPRDRSQPCPSLLDPRVPCGHYYC 

4745 

B 

40 

285 

MPKKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSK 
QNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSK* 

4746 

A 

3 

292 

SSSQAGMIiRLRSGLRHLRATPNTRGSARLLRAEMPKKAGATTKGKSQ 
SKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTGIPGHWGGI 
QV 

4747 

A 

3 

163 

GDMIFNLQSRKTAFNK*SKDLGSS/SPACMVQS*VSLSTSGYRKSKA 
PQQNERC 

4748 

A 

3 

307 

SARGCAPQRAEALRATPNTRASARLLCAEMPKKAGATTKGKSQSKEP 
ERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTALLFHSSTIGGDS 
GGRFRYS 

4749 

A 

2 

478 

PSQAGMLRLRSGLRHLRATPNTRGSARLLCAEMPKKAGATTNGKSQS 
KEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 
IAAPPSKGEANAELCRYLSKVLELRKSDVVIJDKKMRDLLKIVPCKLA 
TRTQIIiGLLAECSFSRL 

4750 

A 

303 

1978 

RIYKVQQTPQSQLFQDHPESLVKIFGSKAMQTKAKAVIDNFVKKLEE 
NYNSECGIDTAFQPSVGKDGSTDNNWAGDRPLIDWDQIREEGLKWQ 
KTKWADLPPIKKNFYKESTATSAMSKVEADSWRKENFNITWDDLKDG 
EKRPIPNPTCTFDDAFQCYPEVMENIKKAGFQKPTPIQSQAWPIVLQ 
GIDLIGVAQTGTGKTLCYLMPGFIHLVLQPSLKGQRNRPGMLVLTPT 
RELALQVEGECCKYS YKGLRS VCVYGGGNRBEQIEELKKGVDI I IAT 
PGRLNDLQMSNFVNLKNITYIiVLDEADKMIiDMGFEPQIMKILLDVRP 
DRQTVMTSATWPHSVHRLAQS YLKEPMI VYVGTLDLVAVSSVKQNI I 
VTTEEEKWSHMQTFLQSMSSTDKVIVFVSRKAVADHLSSDLILGNIS 
VESIiHGDREQRDREKALENFKTGKVR I L I ATDLASRGLD VHDVTHVY 
NFDFRRNIEEYVHRIGRTGRAGRTGVSITTLTRNDWRVASELINILE 
RANQS I PEELVSMAERFEAHQRKREMERKMERPQGRPKKFH 

4751 


142 

4 n r 

*± yj o 

ijjvr iJiv^is^^ijFoFAlr'KFljr r PLiGQRDSKTRHYPSTSRIjPGIjSSTYPG 
HWGAPLISVPGPEAPPTPGIGPAPVSQDPASSCGGGKAASYS 

4752 

A 

500 

665 

TKEQMSGPLFPVCSCPQLPFMLSPCHMHHHPGHVALSQTVSPASLLT 
QGLGLPQH j 

4753 

A 

49 

163 

QIIFKFISLFKYKAKPQTGIIKTLCLTFTLPSDPLPIC j 

4754 

A 

665 

733 j 

AQEP/LTWCLLPTLVPGKCEWRQV j 

4755 

A 

334 

520 

GAKGFHLNCEGRRNRLTASSRPFHRTDSHTLVPSVQGAGAQEPMTWC 
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LLPTLVPGKCEWRQA 

4756 


3 


PRDKVS LCHRLECNGR I TAHCS LKLLNS S DL PAS AS QVAGTTGVCHH 
AWPLLFFLMKEEMYRIDIKTKCRKN 

4757 

A 

133 

205 

DLPQAQSNLQWPRKTS *LKPHERL 

4758 

A 

250 

532 

AYEVGYNTVATSEGGNRARRVGELPEVTELVSGPDWNPNTSQPMPGS 
PAPAPRRLPGCQRHPHPFKCCPVPAPQSLPFNTRPHLPPTPQGQASR 

4759 

A 

524 

678 

ARCEWLMPVIPALWEAEACGSLEPRSSRPAWATWLMNTKHSFDVFHI I 
KDFL 

4760 

A 

110 

1520 

HLSQAC P FAGHQGS WGRS S FHLQPRM I GDGLS EDENDTKQNAQS SDE 
SFEPYPERKVSGKKSESKEAKKSEEPftlRKKPGPKPGWKKKLRCERE 
ELPTIYKCPYQGCTAVYRGADGMKKHIKEHHEEVRERPCPHPGCNKV 
FMIDRYLQRHVKLIHTEVRl^ICDECGQTFKQRKHLLVHQMRHSGAK 
PLQCEVCGFQCRQRASLKYHMTKHKAETELDFACGQCGRRFEKAHNL 
NVHMSMVHPLTQTQDKAIiPLEAEPPPGPPSPSVTTEGPGRCEVCGFQ 
CRQRASLKYHMTKHKAETELDFACDQCGRRFEKAHNLNVHMSMVHPLf 
TQTQDKALPLEAEPPPGPPSPSVTTEGQAPPRPPDRAAPEAAAVLRV 
QRPVSRREQSSRASPRLLPPDPQPRKPTAAALAEPPLHAVPARAQRS 
PAAPGQSPHAPTPTRPRGVSLTPGGHRIWIARQGQLVRGPGAVPGDV 

4761 

A 

1 

635 

YALLGTRAAS SRSCGAATRATTTPWI PAPAARPSCWTVRWQSSGHGT 
KKRASRLPPPRRVKPWGFPLSPSKLEGQGPQGGLRPKTLPSRNVAKL 
LSDSNALGPRSGFPPQPSLPLCRAPGQLGEKQLPFSTSDDRVKDBFS 
DLSEGDVLSEDENDKKQNAQSSDESFEPYPERKVSGKKSESKEAKKS 
EEPRIRKKPGPKPGWKXKLRCER [ 

4762 

A 

2 

4398 - 

AAAGAVGAKAMSDSWVPNSASGQDPGGRRRAWAELLAGRVKREKYNP 

ERAQKLKESAWLLRSHQDLNALLLEVEGPLCKKLSLSKVIDCDSSB 

AYANHSSS FIGSALQDQASRLGVPVGILSAGMVAS S VGQI CTAPAET 

SHPVLLTVEQRKKLSSLLEFAQYLLAHSMFSRLSFCQELWKIQSSLIi 

LEAVWHLHVQGIVSLQELLESHPDMHAVGSWLFRNLCCLCEQMEASC 

QHADVARAMLSDFVQMFVIiRGFQKNSDLRRTVEPEKWPQVTVDVLQR 

MLIFALDALAAGVQEESSTHKIVRCWFGVFSGHTLGSVISTDPLKRF 

FSHTLTQILTHSPVLKASDAVQMQREWSFARTHPLLTSLYRRLFVML 

SAEELVGHLQEVLETQEVHWQRVLSFVSALWCFPEAQQLLEDWVAR 

LMAQAFESCQLDSMVTAFLWRQAALEGPSAFLSYADWFKASFGSTR 

G YHGCS KKALVFLFTFIiS EL VP FE S PR YLQVH I LHP PL VP S KYRS LL 

TDYISLAKTRIADLKVSIENMGLYEDLSSAGDITEPHSQALQDVEKA 

IMVFEHTGNI PVTVMEAS I FRRPYYVSHFLPALLTPRVLPKVPDSRV 

AFIESLKRADKIPPSLYSTYCQACSAAEEKPEDAALGVRAEPNSAEB 

PLGQLTAALGELRASMTDPSQRDVISAQVAVISERLRAVLGHNEDDS 

S VE I S KI QLS INTPRLE PREH IAVDLLLTS FCQNLMAAS S VAPPERQ 

GPWAALFVRTMCGRVLPAVLTRLCQLLRHQGPSLSAPHVLGIAALAV 

HLGESRSALPEVDVGPPAPGAGLPVPALFDSLLTCRTRDSLFFCLKF 

CTAAI S YSLCKFSSQSRDTLCS CLS PGL I KKFQFLMFRLFSEARQPL 

SEEDVASLSWRPLHLPSADWQRAALSLWTHRTFREVLKEEDVHLTYQ 

DWLHLBLEIQPEADALSDTERQDFHQWAIHEHFLPESSASGGCDGDL 

QAACTILVNALMDFHQS SRS YDHSENSDLVFGGRTGNED 1 1 SRLQEM 

VADLELQQDLIVPLGHTPSQEHFLFEIFRRRLQALTSGWSVAASLQR 

QRELLMYKRILLRLPSSVLCGSSFQAEQPITARCEQFFHLVNSEMRN 

FCSHGGALTQDITAHFFRGLLNACLRSRDPSLMVDFILAKCQTKCPL 

ILTSALVWWPSLEPVLLCRWRRHCQSPLPRELQKLQEGRQFASDFLS 

PEAASPAPNPDWLSAAALHFAIOOVREENTPKnT.K'Trr.nr'T^T?'PP'T t \tv 

lff fslmglls shltsns ttdlpkafhvcaai lecle krki s wlalf 
qltesdlrlgrlllrVapdqhtrllpfafysllsyfhedaaireeaf 
lhvavdmylklvqlfvagdtstvsppagrslelkgqgnpvelitkar 

LFLLQLIPRCPKKSFSHVAELLADRGDCDPEVSAALQSRQQAAPDAD 
LSQEPHLF 

4763 

A 

2016 

2227 

EKSTLNNPMFCIWEIVFKNKKERKYIKLLFIPAFPYISIALSVYPFW 
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YILPLHIYICICNSELYIYINCK 

4764 

A 

1388 

1520 

SITFWLKGCMCIGVCVCVCVCVFGIiVWRMKKGFHLGICKYSMAS 

4765 

A 

2 

110 

GGG VGGGGGGHLG * D S KGR * G WG P WLAGGDC I KENT 

4766 

A 

120 

233 

L PAATNRLKRGKG S S TGS S S GNHGG S GGGNGHKTREL 

4767 

A 

3 

796 

GLQGFPPPACGGQQLRFSKSLRSSEYPRPHLAFSMLGITSCSDQQAK 
EGEGLEGSSTGSSSGNHGGSGGGNGHKPGCEKPGNEARGSGESGIQG 
FRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVN 
TVNSETSPGMFNFDTFWKNFKSKLGFINWDAINNNQVPPPSTRALLY 
FIRLWEDFLTEHSFSSTWKQLFEGSGRVITAETCRQSRFRTTITTSM 
RIPLPMVGKYSVKTPCKRGGSLTFFLGLP 

4768 

A 

1 

1263 

MTRQCQASNQQLGPQPLATTPPDLFLGFQQLFGTFASWGREPHCGGP 
DFPQPIQTVEDALRKPSGLQLINYIHTYLPLPAKVLDFPPKPPQPGM 
DVQGFVLYLMSPPASQGHPVPKYQIHRTFRPAPPGGVAGVLVGHPFD 
TVKVRLQVQSVEKPQYRGTLHCFKSIIKQESVLGLYKGLGSPLMGLT 
F I N AL VFG VQGNTLRALGHDS PLNQFLAGAAAG A I QCV I CCPMELAK 
TRLQLQDAGPARTYKGSLDCLAQIYGHEGLRGVNRGMVSTLLRETPS 
FGVYFLTYDALTRALGCEPGDRLLVPKLLLAGGTSGIVSWLSTYPVD 
WKSRLQADGLRGAPRYRGILDCVHQSYRAEGWRVFTRGLASTLLRA 
FPVNAATFATVTWLTYARGEEAGPEGEAVPAAPAGPALAQPSSL 

4769 

A 

1 

1285 

MTRQCQASNQQLGPQPLATTPPDLFLGFQQLFGTFASWGREPHCGGP 
DFPQPIQTVEDALRKPSGLQLINYIHTYLPLPAKYQIHRTFRPAPPG 
GVAGVLVGH P FDTVKVRLQVQS VEKPQYRGTLHCFKS 1 1 KQESVLGL 
YKGLGS PLMGLTF INALVFGVQGNTLRALGHDS PLNQFLAGAAAGAI 
QCVICCPMELAKTRLQLQDAGPARTYKGSLDCLAQIYGHEGLRGVNR 
GiWSTLLRETPSFGVYFLTYDALTRALGCEPGDRLLVPKLLLAGGTS 
GIVSWLSTYPVDVVKSRLQADGLRGAPRYRGILDCVHQSYRAEGWRV 
FTRGI^STLLRAFPVNAATFATVTVVLTYARGEEAGPEGEAVPDRLR 

GLPWRSPPACDAHPALLPQGSFSETWDINWPLSRLPCFLLGCCELWS 
LSDVG 

ATI C\ 

t\ 

bzU 

904 

GMVLNGS AG PHAS PWSNHG S QRDG S LT AS LGHL PRCCGHM PLTEGQK 
CSWGQQREQPPQSSEECRHRSHRAAPHHPCALHPSTIELGKVLIVLV 

4771 

C 

230 

487 

MLNLAS PLDKLTLQNTTQNGDS S TH I S SNS I HKHTS S S S S VLFPDGT 
LEALFVSSPKVEASPLEAXVPCSASHSCNTSSKLNRRL* 

4772 

A 

1 

350 

QASNGDAS TLGEETKS ASNVPSGKSTDEEEEAQTPQAPRTLGPS PPA 
PSSTPTPTAPIATLNQPPPLLRPTLPAAPALHRQPPPLQQQARFIQP 
RPTLNQPPPPLIRPANSMATPS 

4773 

A 

76 

1941 

RRRFSAGGRSVRRVGESCVRDCNSQHTFLSDQAPSLKSSAIALAMLQ 
DWCRRMGVNAERSLLILDI PDDCEEHEFQEAVRAALS PLGRYRVLIK 
VFRKELGARAALVEFAEGUsFQSLI PRQI AGKGGPWKVI SLPQALDAE 
FQDIPSFPAQPQGQAVARGAGEAGAAGEAGSVGEAGGVNEERSAGED 
EAGG I G EAGG VGEAGAAGE1AGAAGEAGAAGEAGG AGEAGG AGEAGGA 
GEEGGTGEEGGAGEAGGAGEEGGEDEAGAAGEAVGAGWEAWTQSWR 
QTLRPLVKTMAYRELRPFSGREQPGCVEESFESWLEDAKDMLQLWCH 
ASERERRRRLLDSLDGLALDIVSGLLEEDPDFSAQDCLTALGQVFRS 
RDTWMTSRMKFIiTCTQGPQEGLFAFWRLEGLLQKAVEKGAVHPAMA 
NHLRLRQVLSRARPSE^QDTLRRMQLERRPPDFLRLLRLIRDMEAW 
AAS LARS QQGVAWAAAPVE S EDPAAAQAS PAQGDAS EAD PGAEDADE 
AASTTKEAARVAPATGEDENAPAGLEGLGQGR S PDAPGGL PARMGS A 

VDMAPGGPSWEPEGLVQVGGQEAEEPPQEGLKPILEESENEDEDGAG 
EAGKPKSPPGK 

4774 

A 

619 

2097 

WLAG S RNR I LRTRTi^WVGRFSE YCPS CLENDQRWIiCNEN YKWEQV 
PTKKNFS ILARNLNTIQVEEMSACNI S IQGPS I YNKEPKN I INPHEK 
VQMKS I CAN S P I KAQQDQLQVKNN I KAS LHNVKS S LPL FNTKS S TS V 
GQLQS PTLNS P I YMQKQGKNEHLAFNTKSKASTVGS EL VL VSTTVPT 
VHHVSDLEMSSTLDCLPVLADWEDWLLPASQPEENVDCTVPISDSD 
LE I S FNSGERLMVLKELEMS SHENFGD I EETPQKS ETS KS I VYKS PH 
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TTIYNVKEAKDPGSDISAFKLPEHKSSTFNRVWANMSHPLVLGKHPIi 
LSGGTKRNPCSPQAFPPAKKQPFTIHEEKPTSSDCSPVRSSSWRRLP 
SILTSTVNLQEPWKSGKMTPPLCKCGRRSKRIiWSNNGPNHGKVPYC 
CPIGKYQENRKCCGYFKWEQTLQKERANSMVPSHSTGGLTFSSPETS 
HICDRNLS ISTKNSLRLRPSMRN 

4775 

A 

97 

185 

FMFFNQICKCICCCIVIVLQNKIACHLTA 

4776 

A 

845 

942 

MEECTDCGDPSPTHTHTHTHTPYSNTASRGII 

4777 

A 

510 

636 

AHAYLYGKVRKSTMTKPCGRTKRKRRRRTSSSSQDGLALPP 

4778 

A 

22 

333 

ERGLLSPDRLSLVPPWHWERRGAGPRTPVGRGLYLLAPKAYKEVSGP 
VHGQMGPGKTLRVGIQWTRRSQMCCFYGCPLPPPVHPLCPGNKEPLR 
TQESPHPPSC 

4779 

A 

310 

422 

GVGTATTGQWYFGQLAVCGVCGFLGLHSIKTSGWLSA 

4780 

A 

1 

140 

KIRCEGLSRLDKGE/RR/RGKGHDG/LYQGLSTATKDTYDAFHMQAIj 
PPR 

4781 

A 

17 

283 

PLESGGIPKLVQPRLLPVGPGGLFLGLWRLGGWSQGCAAETPVGLWV 
GSCPHRLLTHSLFSDLLSRQGLSTATKDTYDALHMQALPPR 

4782 

A 

2 

673 

GECEFLGLSAGAFSKAPQSSTSWGGSCRIKPAETPFLLTVPATAASA 
SASQPLSEGKDKMKWKALFTAAILQAQLPITEAQSFGLIjDPKIjCYLL 
DG I L»F I YG V IIiTALFLRVKFS RS AD APAYQQGQNQL YNE LNLGRREE 
YDVLDKRRGRDPEMGGKPHRTKNPHEGLYNELQKDKMARAYSEIGMK 
GERRRGKGHDGLYQGLSTATKDTYDALHMQALPPR 

4783 

A 

105 

1378 

ELIPALTSPLFLEESEKSRAEGAPDTGRGRARGAWRVFPGLSGWSAP 
PASWPRTPLQMSRRAGVTKMSDPFLKLVFIKDTTFRPKRKLEPGTQR 
liELHKKAQASLNAGLDLRLAVQLPPGEDI^DWVAVHVVDFFNRVNLI 
YGTISDGCTEQSCPVMSGGPKYEYRWQDEHKFRKPTALSAPRYMDLL 
MDWIEAQINNEDLFPTNVGTPFPKNFLQTVRKILSRLFRVFVHVYIH 
HFDRIAQMGSEAHVNTCYKHFYYFVKEFGIilDTKELEPLVRGLGAEG 
VRLRWRPRLELGPGDPADGTSGEVAGRDPAFSHGTNAGDRSENADSL 
ASWGSLLPLVSERNDRPDVPLRRPAGLPVPEPPLGPRRLGGGRSGTI 

IPLLSGLSISGRPGAEPLSRCWSSGAPTRGLQMPSSVSSLSLPSGTK 
T 

4784 

A 

131 

524 

DCMTHCLQPFSWCMPFNATLLFQHCKTMSANQAPPYRHVVFKHKTTK 
KEKEKRERRSLPQNPFSMCLDSKCTGMQLLPSVNSFKPLEMVALCAQ 
YPGHQGIHRAGPAPGCWLSKQKDKCPSLFVKAWSVL 

4785 

A 

197 

582 

ALISNNYHDMCYLCAYCCTAFFGFLTRATAEQKHFSLVWHLMNLINV 
CYGKS I PLYNFMGEAMCKS E FKKKEKKKPCAELLPTQY 1 1 YMPEMAK 
AAI CQGLQKHEEAGME INTL WCNVFKVGVFNDT 

4786 

A 

235 

416 

GQGNRKKGFLGLERGHAGAGGREPFS PAGKLQGGRKAFAPGI IWGVP 
NPQLNIGVGMPWP 

4787 

A 

280 

569 

KSPSHPQPENRRRRCPHSCHAPTTCVSRKSGFSFKAKTLGTVLEQGR 

SGGIYSHGHCVAGSLTQTAGETGPWEGQHVGLQTSDSSSLPLQFLRM 
GH 

4788 

A 

208 

487 

VQRHCLAGLIHHPFQKWNPKDQILDPIWRQGRRSTSPRAPGRRGPGA 
RGPAPSPAPRG\RRRWRAGFRQRTPHTGPPAPTKAKTLI*AGREPPR 

4789 

A 

190 

594 

S FVVDNEKKEVQHVRDDADDAEIiLQHKVQNVGQVQG PHARHDGGRHE 
DQSRHSACCHAWRQGRRSTSPRAPGRRGPGARGPAPSPAPRGRRRWR 
AGFRQRTLHTGPPAPTAAPTLLAGREPPSSRGRVGGRVGRP 

4790 

A 

523 

722 

SQHFGRSRWVDCLSSRVRDQPGQCHETPSLQKITKISWAWWRMPIVP 
ATWEVEVGGLLEPRRQRLQ 

4791 

A 

171 

320 

QDLF S IS SRQLEAFSNTCKRTQRTCRTNRMYR * RYSCGGTE *GS YPTN 
LE 

4792 

A 

243 

445 

LRRQLQTVTCGLGHLFVMCFTFPCFPVNWLPWFCRWSRSCGCSRAG 
DACPSGSNPPPCPQCLTPGF 

4793 

B 

320 

434 

XQCPLNRMAGHISSPYRSEAWSSPGAPVRTGPSWEPGK* 

4794 

A 

146 

609 

ALS S WPARELALLSERHPSLLLLQRQCPLNRMAGHI SS P YRSEAWS | 
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SPGSTSEDRPQLGARARWESSVLTTLGARVSERPRLEKVRNPCVLLP 

TPARLARPHSSPSLAQSPGPGGRGRGWLRSRAGPSHSATHIAPSPLS 
S QLWS LR WCHDRL 

4795 

A 

874 

1126 

YHDG INI KYQTHKVIKMENLQF YLF I S S APQCVS TKGCSG YLRP VGL 
VQVTICPLPLQIKVQIEKYLGFRTHQRYFCCIFHLKN 

4796 

A 

1 

957 

METSSPRPPRPSSNPGLSLDARLGVDTHLWAKVLFTALYALIWALGA 
AGNALSVHVVLKARAGRAGRLRHHVIiSLALAGLLLLLVGVPVELYSF 
WFHYPWVFRDLGCRGYYFVHELCAYATVLSVAGLSAERCLAVCQPL 
RARSLLTPRRTRWLVALSWAASLGIJUJPMAVIMGQKHELETADGEPE 
PASRVCTVLVSRTALQVFIQEAIWMYVICWLPYHARRLMYCYVPDD 
AWTDPLYNFYHYFYMVTNTLFYVSSAVTPLLYNAVSSSFRKLFLEAV 
SSLCGEHHPMKRLPPKPQSPTLMDTASGFGDPPETRT 

4797 

C 

209 

406 

MTRCGSLGPSPLQGVYMATVRRGDSVWWGTECGRDSPAVRVSLFLLC 
SCWGWWSRGLTPWRPCGP* 

fl / 17 O 

(~* 

133 

237 

MFMDXXXXXXGGRFKGSLGGPKFTRACKVKFFSL* 

4799 

A 

250 

422 

ASAMVTSTSTPGS ILME VICLTI SEGLCKSMSRL * ILI WKRSHVLEP 
SPQGVFLWI 

4800 

C 

33 

158 

MRMQQPSVKPGFKDLQKCKTMPLLLMKYIVLENIGLKIFLR* 

4801 

A 

3 

344 

AQDRLHWGQTQNQQRGGHALVPAE V * WDH PS WQ I jMS RF YPKEQ * KVF 
STS S S S S SGGAVLKNPWGGQSLRGLARKNLFP YRGPNKNLPGNFGKE 
TLLWGGDKLGQPPSRNLRIi 

4802 

A 

7 

1268 

EGGSAGRGSGMETSSPRPPRPSSNPGLSLDARLGVDTRLWAKVLFTA 
LYALIWALGAAGNALSVWVLKARAGRAGRLRHHVLSLALAGLLLLL 
VGVPVELYSFVWFHYPWVFGDLGCRGYYFVHELCAYATVLSVAGLSA 
ERCLAVCQPLRARSLLTPRRTRWLVALSWAASLGLALPMAVIMGQKH 
ELETADGEPEPASRVCTVLVSRTALQVFIQVNVLVSFVLPLALTAFL 
NGVTVSHLLALCSQVPSTSTPGSSTPSRLELLSEEGLLSFIVWKKTF 
IQGGQVSLVRHKDVRRIRSLQRSVQVLRAIVVMYVICWLPYRMIRLM 
YCYVPDDAWTOPLYNFYHYFYMVTNTLFYVSSAVTPLLYNAVSSSFR 
KLFLEAVSSLCGEHHPMKRLPPKPQSPTLMDTASGFGDPPETRT 

4803 

A 

65 

376 

IKPREPNFPAARLCRLVYPFYPLKFFIFPKAFNFCREVGPICSPPKG 

RVLSQNSQVADFPVPAFLKHGPGLGFNSAPHKEKNFTLQARVKFGPP 
QGSFKTGPP 

4804 

A 

2 

903 

CCPCSPRSWPPPARCRPSAAGPRTRPASSGCPPMLQSTRPPRTSPSP 
RGCWPRRPPGPPS ARARRRRRRRRRLCNI S VQRQMLS SLLVRWGRPR 
GFQCDLLLFSTNAHGRAFFAAAFHRVGPPLLIEHLGLAAGGAQQDLR 
LCVGCGWVRGRRTGRLRPAAAPSAAAATAGAPTALPAYPAAEPPGPL 
WLQGEPLHFCCLDFSLEELQGEPGWRLNRKPIESTLVACFMTLVIW 
WSVAALIWPVP I IAGFLPNGMEQRRTTASTTAATPPQVPAGTTAAAA 
AAAAAAAAAAVTSGVATK 

4805 

A 

64 

1355 

PPPRRVPADMAESWLRLSGAGPAEEAGPEGGLEEPDALDDSLTSLQW 
LQEFS I LNAKAPALrPPGATDPHG YHQVPG SAAPGS PLAAJDP ACLGQP 
HTPGKPTSSCTSRSAPPGLQAPPPDDVDYATNPHVKPPYSYATLICM 
AMQASKATKITLSAIYKWITDNFCYFRHADPTWQNSIRHNLSLNKCF 
IKVPREKDEPGKGGFWRIDPQYAERLLSGAFKKRRLPPVHIHPAFAR 
QAAQE P S AVPRAGPLTVNTEAQQLLRE FEEATGEAG WG AGEGRLGHK 
RKQPLPKRVAKVPRPPSTLLPTPEEQGELEPLKGNFDWEAIFDAGTL 
GGELGALEALELSPPLSPASHVDVDLTIHGRHIDCPATWGPSVEQAA 

DSLDFDETFLATSFLQHPWDESGSGCLPPEPLFEAGDATLASDLQDW 
ASVGAFL 

4806 

A 

2 

451 

AQKESRSAVQRSWKLKPLIVLSPLQTGLTVQSTWSTCPVSRLVTLSQ 

LLGVHSRSLSSRCRDEGFSDTDPEATEPQGNPNP\PSC*GAGTAWGR 

MEPPAPQHPGDFPTPL*MSWC*DCVNPKTTHGPVLWPPFEAKTKQPD 
TLCSGAGLY 

4807 

A 

239 

580 

ASTREVSLSGRTWPMIMWPPTSAAWGMSLLSCTPLCFLPCRLSDR* 
PCSKASFSSDPRSSRRQVEWHLVKALPSPPSEDAPPPPARNSPAEAS 
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4808 


4809 
4810 
4811 


180 


4812 


4813 
4814 


4816 


4817 
4818 


4819 


4820 
4821 


1018 
704 


72 


4822 


4823 
4824 


4825 


4826 


63 


137 


265 
1817 


569 


1868 
1003 


810 


133 
76 


1284 
978 


813 


131 


NNCLWVSCSAVGSERRNVLPCHTHALPCERARERERGRARECASP^" 
SGTS AAARQRQQLRRR PTQQREARGGCLEEAAS RMKCKPNQTRTYDP 

^ B ^ EEAGVKGKLGRLLGV FEQN0DRKHRTYVYVLTVTE L LE?W 


448 


553 


766 


PDKTHHILILLEHIAWKWMPiT. 


^ IFLSFFgVFFIFLLFNVIIF TVFSSHFF MLFFLLFCLFFIHHp - 
FFDQPLHHAGLDLDLSHARIGHGPSRRRRRRGGE 

GFYLXNAUSFRRLLFTYFMRIYTVKQNHQLFLPFLVKKEQNVGNLWN 


262 


RSKYAASKPALAI WDHHPGNLVRI S AVH * RHTWFKLLFNVI I FTVFS 
SHFFNLFFLLFCLFFIHHFFFDOPGHCHLCVTPATCVRHLGIMRKSC 


394 
2233 


GRALLKYVSPPSSLWNSVPPDPMSPQPCPLKKKNPnfiT.faT'p 


IPELLVSSDPPDSASRVAGTTGTCYCTSLQRDFFYFTnVTMfif^ 


950 


^^^SARPKTARQVCRLPAAAAKSPLARSRSACP 
RCLQKVTEERECGRITGSP*TSSFTKTLRTITPFTSHKRRVGIIOSN 
QPPEPDSNDRTLRPEPECGCWSESWAPPHFASSNRGIiPGHRRMP! 


1912 


PLESRRLARSSGGWAGITGTPMNIFTGPDPGPSERSAQPRVWDSTCC 

p™r R ^ TTPPMSS ^ QIPRW ^ AW ^^SR RD ASTPMK 
ETMMMTTPRRMRMMAGARKALSRVT.VPf PQT-oA 


TFVKN «DRS KHI TY 


1094 


970 


261 


RERLREREREREREREREMFRPO T.T.T.PFPp" 


^fr QLKFFFKHNFWGQVQWLT SV:[PALWEA ^EGLLECQEFKTS 


475 


1647 


DYCNHSEKRR WQLGPVGSSRWTRKFKVTPVT PILI.T.PRKrr CT 

RTWQGGHEGDKMGREGAESQAQVQLI.TASPPACPAASFSSSPSSASA 
VGSPASPGSTPQAKATSTDNCSVLSDT.qAMnnQgMor'Afi 


1459 


r^^ SDLRYLPGRFIFPGTLP FPRRRRGK MTKFQEAVTFKDVAV 
AFTEEELGLLDS AQRKL YRDVMLENFRNL VS VGHQS FKPDM I S QLER 

P^^^t^d^^ RG ^^ G ^ RN ^^^^ HKAG ^' R 
^^^ SQDSTINIEGKSSQFPKHHDSp CQVGAGESIQASVDDN 

S^Sp™^^™^^ 

^" FAPYVDIFSCISHHHD DNIVHKRDKVHSNSDCGKDTLKVSP 
^ R ^^ Q ^ QGOTCE ^ F ^SSSLEI < HKQVHLGKKSPACSTHE 
raTSYSSGIPVQQSVRTGKKRYWCHECGKGFSQSSNLQTHQRVHlS 

IHCRVHTGEKPYKCEVCGKGFTQRSHLQAHERIHTGEKPYKCGDCGK 

rfscssnlhthqrvhteekpykcdecgkcfslsSshSSSS 

PYKCEECGKGFSSASSFQSHOSARGOTFK"gTp iQRVHTGEK 


MHSSVYSSPAAAGLRFPGIRVSITEEGVFKTMSSEPLLNNGSRDSED 
n d^^ G ^ R ^ VF ^kk|»EVTGAE FPGG S PAGAAAQRGQNP SSDPCHA 

ALRPYRSHFPHGQGQrFMNRKALIXJRDVKHAQLDNGHEQQKlSVAAA 
EAAASEAAGETSEQRRPJUlPSRLPGPRLI.QRP^RppsSpsspSFL 

tfp LS1JTOL llppggi.ta T l LL ra S phrplfSasSppKr?S 
ppgsarlqtfeq OTS lgkrvpqllli. L plvpaasaqtpJpdgS 

VTPAQELEEAIjCPPSYTiAaT.T.aar^yirtj^»j.T.««^^^^py P ^ P j?^^ A ^ R ^ 
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SSVYSSPAAAGLRFPGIRPEEEAYGEDGNPLPDFGGSEPPGAGSPAS 
APRAAAAWikPAGRRDVAHG I L»NEA YRKVX»DQIjS AGKHIjQS LiVARGV 
GG SLGGGAGDDAEPLSKRHSDG I FTDSYSRYRKQMAVKKYLAAVLGK 
"PvvnpvTnjTfrs'PP t 7\vt. 

JK.X JVvrv V IVOrvix iHILi 


A 
Jt\ 


tar 

P FT . P FT .Pi P P VTA P T T DKTK R TP TT 7i m*Y\7"D P P Pf2 DM T . Q P JV WTP P Pfi CO 7i P 

PYAGPLCPGRRCRDSPSGRPWGQPVLWTLPCFAYV 

** O O 

n 


/ -7 J 

FM ZV TV W P P WD Q TVT .P <3 P ClCl \CD F V"p PTW 7i P F T .CI 71 P. P 71 Pf2P P 71 ZkVnPTAnn 
RRG VPOWARRMS WSRLWRP AAYRGELVER I R ECTAARFArOEOPPfir 
ERRESS FLPKRGE VFDAGHGGESMVWVRHGR 

4829 

A 

553 

734 

NTHTHTHTHTHTHTHTHTQC I VCFQQ YNKHCKI FMGVKTMPHYLKTG 
SFSLMSRCSSVRLL 

4830 

A 

246 

346 

AFEEmiLRVCKKL YS VF\ TQMPKMNFANVF I GAN 

4831 

A 

3 

241 

SCERARFWAAAVAGVAATRVRRRLAITGLTMERKVLALHA^ 
KKDKAQRK* / RLVPQHP ILW I FLRHLLC SGFTS 

4832 

A 

195 

266 

GVWRPLSGCNASSSA* SQEPCTT 

4833 

A 

1 

1278 

MGFRQSFEKKKKKKRPGESSLPGQFCSDDRSLRVGLCPSASPGSSPS 
LSAQRTQGPEPQRGRITGEGPSAPGRQGDPQGSGPSPGACCRAATLR 
EGKAQGAS QNRAPAPP PTS RSG PECCTR WTAYLRHEAWHQ IGPRSR I 
AKPRREKEPWYTGLPNTKGIPRLRYEEGDPGHNGSGTAAGIRVRSAA 
SQARAVWDPSARTPAPAKEPGMAQLERSAISGFSSKSRRNSFAYDVK 
REVYNEETFC^EHKi^KASSSGNMNINI TTFRJiHVQCRCS WHRFLRCV 
LT I FPFLE WMCM YRLKD WLLGDLLAG I S VGL VQ VPQGLTLS LLARQL 
IPPLNIAYAAFCSSVIYVIFGSCHQMSIGSFFLVSALLI1TVLKVSPF 
1TOGQLVMGSFVKNEFSAPSYLMGYNKSLSVVATTTFLTGIIQKSSTT 
DLG 

4834 

A 

2854 

3176 

PVISSSVCRASPKSRIFSSQSSFTARLLGFRSL*IMSAECIYFRPRR 
IW*IRN*TWSSVSF*HFHNWQICPHEMGHQIHIIEFLQRPCRCKNV 
QQTNH I FMFHMFK 

4835 

A 

1 

2991 

VIGLLDVFTPARSLEEFNDVHCSRYFRYYREQKITLSSTNIVQADNN 
KQQYLVTHLMGADLNNIVKC^ 

NFSNGKFKHIK^QDLKPSNIxAVNEDCELKIl^FGLARHTDDEMTGYV 
ATRWYRAPEIML1TOMHYNQTGMAQLERSAISGFSSKSRRNSFAYDVK 
REVYNEETFQQEHKRKASSSGI^lNINITTFRHHVQCRCSWHRFIjRCM 
LTI FPFLEWMCMYRLKDWLLGDLLAG I S VGLVQVPQGLTLSLLARQI, 
IPPl^IAYAAFCSSVIYVIFGSCHQMSIGSFFLVSALLIlTVIiKVSPF 
IWGQLVMGSFVI^EFSAPSYLMGYNKSLSWATTTFLTGIIQLIMGV 
LGLGF I ATYL PE S AMS A YLAAVALH I ML S QLTF I FG I M I S FHAG PIS 
FFYDIIlTi'C^ALPKANSTSILVFLTVVVALRINKCIRISFNQYPIEF 
PMELFLCLRSRPERKGPRPRRTKPKGTTWPRGSSEGRSPATMWKRAA 
VT^RQAWVLAAPGLLERAVGGGPEPARFLLPVTPDFSLLPKIILQAF 
SLSLVSSFLLIFLGIOCIASLHlsrx'SWSNQDLIAIGLC^WSSFFRSC 
VFTGAIARTIIQDKSGGRQQIKLQQANLRGVPSQIiCRVTKCQRAIGI 
ETGVARGMS QHSD7U<ETVAEVKDKX»LLQKHS KRKI DMLLTKQKAS VA 

t^T3T^i j^y-VX7TJT~V/~' "CO DT^ C17C CM T C/~»'OT^T>^T*vTV T\ TtT\f~^V?T\t m *fV % T TT»T UTJVVPtJT 

TAGAHS SLEHKTGGQKGETARNKRWRI ECDERRTQERQRNMKGDNFW 

QHTDPyT PPTkPTiVTT T f PflT P"Kf*T v KTT VP O TXTT^VPl?'\7T^M"t7V"tJ r PT KTTPPT 
P <3 T. FN Q <5 DTMT .OR CJ TC T C R P PPMPDn T . P P T , P P T T , VTP P FFTvTTCl.D PF 7i Q 
SINLIHCSHFES^TSQTASEDQV^YTVSSVSQ2<NQGQQYEF^EVW 
LPNNSSRNSFTRDCLDVGGKQGGGRSLIPYSDASLIiAQCPHHHPGFL 
HGTLRG FHGG 

4836 

A 

472 

575 

APK^CKQPKCLSTNEQWKTWPVHTTEYYSSKKN 

4837 

A 

1 

248 

SISKKGLMEEKGSLYEEMGPSSAHSLVSLLAPGSGFCGR1.TSQKRGV 
GQSSPPRSKSKQKQARARREDKSLFFLFFTYANHL 

4838 

A 

676 

788 

LCTKHTYTHTHTHTHTHTHTHTHTVTPLSAQRCHSLL 

4839 

A 

212 

677 

SRTFPAXiCLS P WS SL YS WQHQLTAP S S PLG AEDLLSDS S E PPGLNQ 
VS S EVTSQLYAS LRLS RQAEATARAQ1»YL P STS P PHEGLDGFAQELS 
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RSLS VGLEKNLKKKDGSKH I FEMESVRGQIjQTMLQTSRDTAYREGSS 
HSWRWLRETGRGLL 

4840 

A 

455 

2705 

PKIIKRKACLLMIIiTSPKKREGRAQIWPVTFRDVTVIFTEAEWKRL 
SPEQRNLYKEVMLENYRNLLSLENFHSSAEPKPEIYTCSSCLLAFSC 
QQ FL S QHVLQ I FLGLCAENHFHPGNS S PGHWKQQGQQY S HVS CW FEN 
AEGQERGGGS KPWS ARTEERETSRAFPS PLQRQSAS PRKGNM WETE 
P S S AQRPNPVQIiDKGLKELETLRFGAINCRE YEPDHNLE SNF I TNPR 
TLLGKKPYICSDCGRSFKDRSTLIRHHRIHSMEKPYVCSECGRGFSQ 
KSNLSRHQRTHSEEKPYLCRECGQSFRSKSILNRHQWTHSEEKPYVC 
SECGRGFSEKSSFIRHQRTHSGEKPYVCLECGRSFCDKSTLRKHQRI 
HSGEKPYVCRECGRGFSQNSDLIKHQRTHLDEKPYVCRECGRGFCDK 
STLIIHERTHSGEKPYVCGECGRGFSRKSLLLVHQRTHSGEKHYVCR 
ECRRGFS QKSNL I RHQRTHS NEKP Y I CRECGRGFCDKS TL I VHERTH 
SGEKPYVCRCMGKAFVITHHRENLHVGTLWSSAQMTFSGVTPQLLER 
TVLLLAEMHSRDALRSGTHSQPQGAACTTGAMHLCRGTFWPQPLTQR 
GQLQKVIPDPEIPIELKDHWVADTLQVPPINLIIMAIEFQHEFWRRH 
SHH S INY AVAPQSGLMREKD VHVWECG P PPFARTAPI TPTSDQNWR P 
LREKKGHYQDLSTPSQKETELKRTRVSLRRKRDANSPLQDYGFGA 

4841 

A 

92 

316 

YKQGKYQTLIKPSDLVKLTHYLKNSMGETALMIQLPPPGPALGTWGL 
WDLQFKMFTASVYMCLPVFTCIiPSTKH 

4842 

A 

1 

578 

MGETALMIQLPPPGPALGTWGLWDLQFKTNTTSTDTDPRSHLQETGD 
NILTLFTMHPPLESEWTICNFRQIWLLSSWSTLETRAQPLHSYFRKL 
KGRGTAIAGIVFGIVFIMGVIAGIAICICMCMKNHRATRVGILRTTH 
INTVSSYPGPPPYGHDHEMEYCADIiPPPYSPTPQGPAQRSPPPPYPG 
NARK 

4843 

A 

99 

413 

TAGFIWLFKHHRFLKKTKQKLTVCPIINGEDHLRLLNFQHNFITRIQ 
NISNLQKLISIiDLYDNQIEEISGLSTLRCLRVLLVGKNRIQKNLHLE 
NLKKPKRPWI 

4844 

A 

2 

639 

LEGRSNREPGLSPIMIPGKYRSVSGRAANNVNCGLHLVIQTSSLPEK 
NKVEFKLNKDTSSFPGRLLQHDLERNYSCRQGDHINLVSSSLPSFPI 
LQRSSEEKILYSDRLSLERQKLTVCPI INGEDHLRIiLNFQHNF ITRI 
QNISNLQKLISLDLYDNQIEEISGLSTLRCLRVLLL.GKNRIKKISNL 
GESKKLRMSWDLHGKSGLPKIGKY 

4845 

A 

788 

1174 

PSCTRLLRRRQNQRSLKTNISPRK31ATVLKXPSQMLGLVPGVDGRSP 
RGGRGGLGWRSCFLSDGEWILRTGSVGSGLVGSRGSAGGPRLEMDPN 
CSCTTGGSCTCAGSCKCKECKCTSCKKSCCSCCP 

4846 

A 

35 

406 

SGTPASPCLEMDPNCSCSPVGSCACAGSCKCKECKCTSCKKSECRAF 
P ANLGDGP S * S REPRALQAG AGQ * PAS PNPS FNT * FR I RPQ I ALKMG 
CCSCCPVGL/CAKCAQGCICKGTSDKCSCCA 

4847 

A 

299 

981 

ALVPAGAAPPTRVAAMEVLRRSSVFAAEIMDAFDRSPTDKELVAQAK 

ALGREYVHARLLRAGLSWSAPERAAPVPGRLAEVCAVLLRLGDELEM 

IRPSVYRNVARQLHISLQSEPWTDAFLAVAGHIFSAGITWGKWSL 

YAVAAGIiAVDCVRQAQPAMVHALVDCLGEFVRKTI^ 

VLKCWSTDPGLRSHWLVAALCSFGRFLKAAFFVLIiPER 

4848 

A 

152 

797 

DPPNLYLDFDGDVTKLETFGVTTTKASKPRSPASTSTVTNVTDAPTA 
PKAGTTPVAPSAPDISANSRSLSQILIEQSQKEKQLVTGMDGGPKEC 
KNKDDOGFESCGKKVSNTDKSSRQDSDLKTSDALQLENSQEIETANK 
YDMTIDMVHVDAERPNVLENLDNSKEKTVTSEAAKTEDTVLCSSDTD 
KECLI IDTECKNNSDGKTAVMGSNIjNS 

4849 

A 

1 

124 

GTSLLLPYHIHHGRIPCTFPPKSLDTTTQQKVRSFIiKLKVH 

4850 

A 

2 

719 

PNAEKLVSRYHPQIALTSQSLFTLLNNHGPTYKEQWEIPVCIQVIPV 
AGSKPVKVIYINSPLPQKK^fn^RERNQIFHEVPLKFMMSKl^SVPVS 
AVFMDKPEEFISEMDMSCEVNECRKIESLENLYLDFDDDVTEIiETFG 
VTTTKVS KS PS P ASTSTVPNMTDAPTAPKAGTTTVAPSAPDIS ANSR 
SLSQILMEQLQKEKQLVTGMDGGPEECKNKDDQGFESCEKVSNSEQA 
FDTR 
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4851 

A 

98 

263 

LVNTPSTCALLSLHYRLRSPAIiPRLEMDPNCSCTTGGSCTCAGSSCQ 1 
VTLFHLQE 

4852 

A 

741 

965 

AHLPEFLLSPCPSLGGPRPRSHCRW*HCPRLRAAPPPA* SALCXiGGC 
CECVAAPGRWYCWVPGDRACS PHTCAR 

4853 

A 

105 

284 

LVNTPSTCALLSLHYRLRSPALPRLEMDPNCSCTTGGSCTCAGSTHS 
RADGSQQAMLRH 

4854 

A 

4 07 

1815 

TRATRWKRTKRYTLRKTERWSPRVIMDSASR/HAPGNGLYVNYGIiKA 
LNIHGGQKLTLAEHGGAYGATADMSAKIGGEGDLAINTVRQVSLSNG 
QNDYOGATYVQMGTLRTDADGALGNTRELNISNAAIVDLNGSTQTVE 
TFTGQMG S TVLFKEG AX»TVNKGG I S QG ELTGGGNLNVTGGTLA I EGIj 
NARYNALTS I SPNAEVSLDNTQGEFS SKRRQTRKEI FLSRMEQILPW 
QNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHVTifNLSDGAMEDAIj 
YE I AS MRL F ARLS LD S ALPDRTT IMN FRHliLEQHQLARQLFKTINRW 
IiAEAG VMM TQGTIj VDAT 1 1 E APS S T KNKEQQRD P EMHQTKKGxlQW PL* 
WHEGPHCLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREEL 
AEVDVDWL I AERPGKVPARHRLS AHGCVL YTALS RR I VALPL PVS AP 
P 

4855 

A 

1020 

1352 

RAEARAPGPWGKEQDDEQVQMSYPPVRIiTCWLRVLSPQHQPHGFPLH 
GLMCGNHWLTSSRLPCSVPPRTRYAPPLPPSVQNGAVPPDGHHPPSW 
HRENHSAYRVENLHTR 

4856 

A 

491 

628 

KTITNWRCYSRWPTCFGRTK*YDWHRQMTPPRQRAVLWTKSPKTV 

4857 

A 

1 

1872 

mvivpigyaaqelfdvsqvrggtpygattiaggdgsrqpsqeelsia 

ryqgeyvagvryrstfypgis/alsvlshilvlyctaglhigaXaiv 

ptryayrasefsskrrqirkeiflsrmeqilpwqnmveviepfypka 

gngrrpypletmlrihcmqhwynlsdgamedalyeiasmrlfarlsi, 

dsalpdrttimnfrhlleqhqiarqlfktinrwlaeagvmotqgtl 

hqtkkgnqwhfgmkahigvdaksglthslvttaanehdlnqlgnllh 

geeqfvsadagyqgapqreelaevdvdwliaerpgkrsasciqklnt 

tmiyvthdqteamtmatrivimkdgivoqaeerndwhftigamyeie 

NVEGYGEDLDGIiADPSVYFNAANGPCRIALGYYHEGPENITRAPEEM 
RS FLLDAGAE YNGYAADLTRTWS AKMTNDYAQLVKDVNDEQLAJj IAT 
MKAGVSYVDYHIQFHQRIAKLLRKHQIITDMSEEAMVENDLTGPFMP 
xlLrJLvjtXlirljVjLitJ VriD VAbf Myiil^t^xHIjAAPAiVx PlJ-iKU IK J-JjyJrtaMVlj 
TIEPGIYFIESLLAPWREGQFSKHFNWQKIEALKPFGGIRIEDNWI 
HENNVENMTRDLKLA 

A Q C Q 

TV 

A 

"1 O T T 

12 3 1 

1407 

fTPXTD FP/**7V VY*MJ , T I 7V fO P'PL'TV POD T DtJCPTACTJDDCmTTnTKnyTVMT rp 

QV IvxriKr KCaAXyti xAt-br x KDtjFnKJjRr SrIASRKJISTVRVNVAWLi 
RFNAC I TQS I CH 

4859 

A 

491 

553 

KTITNWRCYSRWPTCFGRTK 

4 860 

A 

3 

278 

KL S C PD P QS EE KLQGKREG Q I RETG EEGG AAKEAA VP PQ SSSDSAPG 

A 1 yxjlCK^Ky U.t- o o o DbyVAvKy 1 xi VxA?Xjc<IjXjO ry PxAj W A W Ir\N vjXj 

4861 

A 

2455 

2871 

RWLIVTLFIGEDLNTQITVYIREQGQFKVINRTGCQGMGIiRLNIRNI 
QLVIKYIiDIEYRTKQRFITCRSPAVTDNIjFGIVTLMTTHFFQLCCQS 

OV-V" Xvvivr JjrvrtiM VIM X VIlJlV30wO*\\JI%rvr\>i^O X xlI\X-/I\.OOV3 v vn 

4862 

A 

656 

949 

LLHRFCRWPALAPGAILRQIPKKQQRVGLHYSPMSYVSWRYRVCHWT 
IWFCLSKlT^TSYARPASISVKOTKWRCFPLPKVKxTm*NIRICLP 
PPR 

4863 

A 

448 

582 

GPSTAISPSRVLLVAAHLVPRCLHFLGGGSAIiVADPAANFSFWCS 

4864 

A 

495 

920 

WRRRDRCASHHFAVSVTTIGSSRDPDAGSLPVHSWRRKERRSALYLH 

KboKonKKl X PxOCUKivrCAr Wi» VyPKC-W JjAAXJjoorl* look, L W W XxCosl 

QLGTSQLTLRC^QNHRCRRraSSDRPRLRVLSWHARKYQTPGRCSER 

4865 

A 

1 

2799 

MTQI FTGTLVEADPFHAVTLVAmiDTQPLQALEAPVEPWFKPLAYAL 
ILLRENGVPSVFYPDLYGAHYEDVGGAGQTYPIDMPIIEQLDELILA 
RQRFAHGVQTLFFDHPNCIAFSRSGTDEFPGCVVVMSNGDDGEKTIH 
LGENYGNKTWRDFLGimQERVVTOENGEATFFCNGGSVSVWVIEEAA 
NNRQ YD YGAGHKI PVINYTDVHLRI ERS CRFRADPRQQHELQLS S KL 
AVHD VLTN I YNRR Y F FN S VE S LL S R PWKD F C VML VD I N Q FKR I N AQ 
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WGHRVGDKVLVS I VDI I QQS IRPDD I LARLEGBVFGLLFTBLNSAQA 
KI I AERMRKNVELLTGFSNRYDVPEQMTI S IGTVFSTGDTRNI SLVM 
TEADKALREAKSEGGNKHWYNLSDGAMEDALYEIASMRLFARLSLDS 
ALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDA 
TIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTH 
SLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVD 
WLIAERPGKVRTLKQHPRKNKTAINIEYMKAS IRARVEHPFRI IKRQ 
FGFVKARYKGLLKNDNQIiAMIjFTtiANLFRADQMIHSQYSALRRHQDG 
SVSFKPETHTCDAHRPALKPANDYVRTFFRGVDISQVFSAKDIARRT 
PNGL I R KTPG FGPRS ALKLLQDEDRE YG YV I ERGNKFVG AVS I DSL K 
TAAVPPVQLI/FGWCVTAKAPPD I S AI S At/TAVKHGNCS S LTPLLNPP 
GSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTM 
ATLGRLMS LL S P FD VVI WMTDGWPLYESRIiKGKLHV I SKRYTQRI ER 
HNLNLRQHLARLGRKSLS FSKS VELHDKVIGHYLN I KH YO ; 

4866 

A 

3675 

4064 

VSSDQNRRFCNRCYGRLQPGGCFSLWCQFSASKSYLSPSDL* *W*/Q 
LSPS ASRIiPLAS LT I S S S LSCAGS AS FHS \ RH I PGRCTCR YLRVFG V 
AFWAVW* *HVDFLKSQQGKSAKKRQIAEKGRNYTQNP 

4867 

B 

1 

2325 

MNLMEQPAKDQYLSTQGLPVLVNSLARYPQEALPITNYFAASELAPA 
VARAFKKLKTLRENARSWLLKYPEHALTGLLPAALGKAGEAQDNARA 
ALRMLTENGHQPLLQElARRYNQPEVTDAVNALLAIiDPLDNHPTKIP 
TLPAFYQPSLWTRPVLKANAQSLPDSALLHLGEMLRFPQEEALYPGL 
LQVKDVCS ADS LAGFAWDLFTAWQTAGAPS KE S WAFTAEFS S KRRQT 
RKE I FLSRMEQI LPWQNMVEVI EP FYPKAGNGRRPYPLETMLRIHCM 
QHWYNLSDGAMEDALYE I ASMRLFARLSLDS ALPDRTTI MNFRHLLE 
QHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQR 
DPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDIiNQLG 
l^I^GEKQFVSAMPATKEPQREELAEVDVDLLIAERPGKVKTLKQNP 
RKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKNDNQ 
LAMLFTLANLFRVDQMIQLPTYELSSGIGAVRRQAELDGTPAINSKR 
VYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCC 
DNGERLRVTFAIJDCCDREAIJIWAVTTGGFNSETVQDVMLGAVERRFG 
NDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGI 
AE S FVKTI KRD Y I S IMPKPDGLTAAKNIjAEAFEHYNEWHPHSALG YR 
S PRE YLRQRACNGLSDNRCLE I * 

4868 

A 

82 

199 

GLLLGTGPERKPEQKMCLLVHWLLTSGGLPGLPSGTPRC 

4869 

A 

882 

1091 

VCHAAPPILRTTATSVPAVPGAASDSGYAAQFAAYIAC*IiRAAFPSG 
G I HTAASHPS SIS PRQR VTAG S 

4870 

A 

3 

274 

LSCPDPQSEEKLQGKREGQIRETGEEGGAAKEAAVPPQSSSDSAPGA 
TQLRRQRQCCS S SEGQQAGRQTEVLGLEGLS PQPLG WAW PNGI* 

4871 

A 

2888 

3076 

KTITlWRCYSRWPTCFGRTK*YHVmOLLSMVRQLMVLSWKKISSWFM 
GPPIIRKSILAENSI 

4872 

A 

1 

2043 

MVARAAALTATPQVDKVVLSRL ID ITTDAAIDSGVLIiERIi I AQNPVS 
YNFHVPLADGGVtiLGASPEIjLLRKDGERFSS I PLAGSARRQPDEVLD 
REAGNRLLASEKDRHEHELVTQAMKEVLRERSSELHVPSSPQLITTP 
TLWHLATPFEGKANSQENALTLACLLHPTPALSGFPHOAATQVIAEL 
EPFDRELFGGIVGWCDSEGNGEWWTIRCAKLRENQVRLFAGAGIVP 
ASSPLGARMS I PFTRWPEEFARRYREKG YWQDLPLTD I LTRHAASDS 
IAVIDGERQLSYRELNQAADNLACSLRRQGIKPGETALVQLGNVAEL 
Y I T F FALLKLGVAPVLAIiFSHQR S ELNA YAS Q I E PALL I ADRQHAL F 
SGDDFLNTFVTEHSSIRWQLLNDSGEHNLQDAINHPAEDFTATPSP 
ADEVAYFQLSGGTTGTPKLIPRTHNDYYYSVRRSVEICQFTQQTRYL 
CAI P AAHNYAM S S PGS LGVFIiAGGTVVIiAAD PS ATLCF P Ij I EKHQ VN 
VTALVPPAVSLWLQAL I EGESRAQLASLKIjLQVGGARLS ATLAARI P 
AEIGCQLQQVFGMAEGLVNYTHESSTYLRRHEFSSKRRQTRKEIFLS 
RMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLS 
DGAMEDALYE I ACHAS VCPVI PG 
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4873 

A 

222 

513 

PEGAPRTPGPLPRAPGRTSEGAGRRGGPLGPVVTMCLPRRQIiSYPDV 
G*AEGGLPALDPWPGHARGSEGPRRQGGAVLAVRLGGRVYVHLHTPG 
GQ 

4874 

A 

20 

161 

RALPIDEREHQPEGP^nAmCSDCILHPPLASGRCVTSRASLRPKKLV 

4875 

A 

1 

2251 

MAGVGAGPLRAMGRQAIiLLLALSHRRQGLYFHIGETEKRCFIEEIPD 
ETWIGNYRTQI^DKQKEVFLPSTPGLGhmVEVKDPDGKVVLSRQYG 
SEGRFTFTSHTPGDHQICLHSNSTRMALFAGGKLYREERFRLTSEST 
NQR VL WW S I AQTV I L I LTG I WQMRHLKS F FE AKKLR VRAALLS SAME 
DSEALGFEHMGLDPRLLQAVTDLGWSRPTLIQEKAIPLALEGKDLLA 
RARTGSGKTAAYAI PMLQLLLHRKATGPWEQAVRGLVLVPTKELAR 
QAQSMIQQLATYCARDVRVANVSAAEDSVSQRAVLMEKPDVWGTPS 
RILSHLQQDSLKLRDSLELLWDEADLLFSFGFEEELKSLLCHLPRI 
YQAFLMSATFNEDVQALKELILHNPVTLKLQESQLPGPDQLQQFQW 
CETEEDKFLLLYALLKLSLIRGKSLLFVNTLERSYRLRLFLEQFSIP 
TCTVLNGELPLRSRCHIISQFNQGFYDCVIATDAEVLGAPVKGKR\GA 
EGPKGDKASDPEAGVARGIDFHHVSAVLNFDLPPTPEAYIHRAGR** 
CDGPGICM\TARANNPGIVLTFVLPTEQFHLGKIEELLSGENRGPIL 
LPYQFRMEEIEGFRYRCRDAMRSVTKQAIREARLKEIKEELLHSEKL 
KTYFEDNPRDLQLLRHDLPLHPAVVKPHLGHVPDYLESKVPEPTAQL 
QAQRKE I QTHSQALLRLLGLSGAEHI VEHRLTPFVDRRGSGAYCTA 

4876 

A 

2 

2318 

FVSGLGLCGRWGGRLLLLIiALCATGAQGLYFHIGETEKRCFIEEIPD 
ETOVIGNYRTQ^DKQKKVFLPSTPGLGMHVEVKDPDGKMIiQVVLSR 
QYGSEGRFTFTSHTPGDHQICLHSNSTRMALFAGGKLRVHLDIQVGE 
HANNYPEIAAKDKLTELQLRARQLLDQYREERFRXiTSESTNQRVLWW 
SIAQTVILILTGIWQMRHLKSFFEAKKLRVRAALLSSAMEDSEAIiGF 
EHMGLDPRLLQAVTDIXSWSRPTLlQEKAIPIiALEGKDLLARARTGSG 
KTAAYAIPMLQLLLHRKATGPWEQAVRG1.VLVPTKELARQAQSMIQ 
QLATYCARDVRVANVSAAEDSVSQRAVLMEKPDAAA7GTPSRILSHLQ 
QDSLKLRDSI,,ELI*WDEADLLFSFGFEEELKSLLCHLPRIYQAFLMS 
ATFNEDVQALKELILHNPVTLKLQESQLPGPDQLQQFQWCETEEDK 
FLLLYALLKLSLIRGKSLLFVNTLERS YRLRIiFLEQFS I PTCVIiNGE 
LPLRSRCHIISQFNQGFYDCVIATDAEVLGAPVKGKRRGRGPKGTSV 
ARGIDFHHVSAVIaNFDLPPTPEAYIHRAGRTARANNPGIVIiTFVLPT 
EQFHLGKI EELLSGENRGP ILLPYQFRMEEI EGFRYRCRDAMRSVTK 
QAIREARLKE I KEELLHSEKLKTYFEDNPRDLQLLRHDLPLHPAWK 
PHLGHVPDYLESKVPEPTAQLQAQRKEIQTHSQALLRLLGLSGAEHI 
VEHRLTPFVDRRGSGAYCTA 

4877 

A 

966 

1150 

QPPQEGTAEAGKPSLCVFISTFPGIWGGLLFLRFQVQHCIRTEAVTE 
DSVPRKGASCVSLGFGAIiLKKPTTPITPYCSPSTPPHSSMKMCEPPV 

4878 

A 

417 

872 

SFKLSKVEQFQEGGPSKRMPWVIHVFPWVHFQLCIPALQLCSQPRAN 
TLHPNLRWGGKGQSPIWDPSIEGQPGLLPAGCI.CYTCRMCTMFSLDN 
TPSFLPHQPQTSRPFPPFLPVAEPPTIAHTVLVQLLTVPGIiRSVMLiA 
FPQQERKPAG 

4879 

A 

3 

284 

AASRGDGGALAGARVAEAAAVGRGECPFPGRRQSGPEPAAQGGELRA 
EMAADAAHPHGGPRLPTGGPRAADEAGRASTGCGISKPLPSVFSHL 

4880 

A 

2 

437 

AAASGGEWRGSGWGSCVAEAAAVGRGECRFPGRRQSGPEPAAEGGEIi 
RAEMAADAAH PHGG PRL PTGG PRAADEAGRAR VRAG APPGAQRG PH S 
RSHPPISQESLCLWRWPKEEEEPVTLWKNALPVLWCPLAPADTAKC 
PIPK 

4881 

A 

336 

442 

EIGWVQWLMPVIPAVWEAEVGGLLEPRSLRAARAI 

4882 

A 

523 

630 

VKQKPRIVAHVLAYSVYTHTQTHKHTHTQCHTGSDP 

4883 

A 

1 

1004 

MAPWPHENSSIJVPWPDLPTIAPNTANTSGLPGVPWEAALAGAIjLALA 
VIATVGGNLLVIVAIAWTPRLQTMTNVFVTSLAAADLVMGLLVVPPA 
ATLALTGHWPLGATGCELWTSVDVLCVTASIETLCALAVDRYIJVVTO 
PLRYGALVTKRCARTAWLVWWSAAVSFAPIMSQWWRVGADAEAQR 
CHSNPRCCAFASNMPYVIiLSSSVSFYriPLIiVMLFVYARVFWATRQIi 
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RLLRGELGRFPPEESPPAPSRSLAP/APGYANSAFNPLIYCRSPDFR 
SAFRRLLCRCGRRLPPEPCAAARPALFPSGVPAARSSPAQPRLCQRL 
DGERHL 

4884 

A 

144 

1423 

FGRPPPSFFPYRPTRDPGMAPWPHENSSLAPWPDLPTIiAPNTANTSG 
LPGVPWEAAIiAGALIALAVLATVGGNIjLVIVAIAWTPRLQTMTNVFV 

a «-> j-i.i-vj-wu»/jj v i i\juxj v v r tr J-\J-\ X XaMXj J. LioA 1 vJV^ .EjXj VY X O VU V JuU VTA 

S I ETLCALAVDRYLAVTNPIiRYGALVTKRCARTAVVIjVWVVSAAVSF 
AP I MS QWWRVGADAEAQRCH SNPRCCAFASNMP YVLLS S S VS FYL PL 
LVMLFVYARVFVVATROLRLLRGET.OPFPPFTTQPPZiPCDCT a-Datmrj 

TCAPPEGVPACGRRPARLLPLREHRALCTLGLIMGTFTLCWLPFFLA 
NVLRALGGPSLVPG PAFLALNWLGYANS AFNPL I YCRS PDFRS AFRR 

LLCRCGRRLPPEPCAAARPALFPSGVPAARSSPAQPRLCQRLDGASW 
GVS 

4885 

A 

1309 

1702 

I GQVRWLTLV I LALWEVE VGGSPKVTS S R POT, T TT-TT.nT .PKrwnY Q t ,t? 
PLCPTCLFFEIQSCSITQVECSGTITAHCSLQLLGSSDSPASASQVA 
GTTGALHAWLI FKF I VEKGSHFAAQACQTFS YKKLHA 

4886 

A 

1 

648 

^^v^^EME^iVTWMVME^r™vMv^x^ 

VMEfWmmVMEMVMV^ 

MVMV^^VMWRCDGYGDGDDLPL^1ALPPCHALCQFYVVNSELSCQLYQ 
RSGDMGLGVPFNIASYALLTYMIAHITGLKLQREPRPFPKLRILRKV 
EKIDDFKAEDFQIEGYNPHPTIKMEMAV 

4887 

A 

1 

1049 

GG^?GT*T r WPA t 5VPPRT-TT.2Vr ,, T.PP2iZi'DT "D t ddcdtydtj M/iDUAnc?t?r noon 

LPPAAQERDAEPRPPHGELQYLGQIQHILRCGVRKDDRTGTGTLSVF 
GMQARYSLRDEFPLLTTKRVFWKGVIiEELLWFIKGSTNAKELSSKGV 

SGQGVDQLQRVIDTIKTNPDDRRIIMCAWNPRDLPLMALPPCHALCQ 
FYWNSELSCQLYQRSGDMGLGVPFNIASYALLTYMIAHITGLKPGD 
FIHTLGDAHIYLNHIEPLKIQLQREPRPFPK\LRILRKS*RKLIDFK 
AE0FQI EGYNPHPTI KMEMAV 

4888 

A 

437 

851 

NGHLQETLHRW I VS LS S VPSLQRLLWTSLQPSGSLS WS PP YEMAS VS | 

LLFYFAFSCGLSmiHPSLSSSGGSSDLPVMALPPCHALCQFYVVNSE 

LSCQLYHRSGDMDLGVPFNIASYAI.LTYMIAHIHGPEGGLSRER 

4889 

A 

1 

1063 

* ^■m-xtuj x. r x. j vn^uociArnor rJXjXrCJiV^iM 11*1 V 1 x^oxjJKo.fciP'lV* J. x JKJLirLA. 

ATPSSNSDRHVSMPLLLYLBQVTFLQEHAGTGRRWWSIVWRRRTFPA 
AG AMEM P L P PDDOELRNV I D KLAOFVARNGPF FP KMTMI? KYiTCnw P K" F 
S FLFGGEF YS Y YKCKLAIiEQQQRHS RG S PHLPGLGLRARGNGGGAAA 
TRAHWDDGRATYVYQWPWTLLLLAVSGFCNFAQNVIAFS I LNLVSPL 
S YS VANATKR I MV I TVS LI MLRN PVTS TJWLGMMTA I LG VFL YNKTK 
YDANQQARKHLLPVTTADLS S KERHRS PLEKPHNGLLFP PARGLSVR 
PQQHLNRPLPIQPAELPKLVQFEPL 

4890 

A 

496 

701 

FTKGPPPSKKRFISSINIFDKGFQFGFHRGVPGHDFGNLGPLPPGIK 
RFLSLSLSNSWECKPASLERP 

4891 

A 

875 

1156 

LHWC^CVC^O^CMCVCVCLKGYSVFTLKIQRSHSFSQNYKGEVKK 
NS KAPS VYLVYLKE IRVCI CSHRAYINAHTOTHTHTHTHPKY I S AH 

4892 

A 

76 

382 

FS HW I FFPAVRGDLG PVEGHCLNLS GVCRRDVCKWEDQ I GAFWKFIj 
KSPQVSRGDCSCTESHCLDlWPGWAGVGLNPGVKILELGVFIiYLFW 
DSAPLANI 

4893 

A 

71 

228 

FSHW I FFPAVRGDLG PVEGHCLNLSGVCRRDVCKVVEDQ I GACRLFF 
LFSEC 

4894 

A 

69 

242 

FSHWIFFPAVRGDLGPVEGHCLNLSGVCRRDVCKVVEDQIGAWGSCR 
SRPKSRCEDP 

4895 

A 

146 

431 | 

LPHKLIHSiWGKWSRVPGQSAISTGTETPPVQGSSRLDLGLHLQLPQ 
ASYPGSDFLYSYSHLWTPAETLKISRRSGIFGDLSYGYVACKGESIS 
F 

4896 

A 

3 

262 

SKRWFGSCGRSLSQFWCLQNRCLQHSRRSNWCLCRQCSHEKGLQKF 
LTFWKFLKSPQVSRGDCSCTESHCLDNWPAGCFSYFSEC 
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4897 

B 

185 

338 

MS TTFMGRS QATGGCVQTDTPG PPTLLQDTQQLS WHPD I QLEVGFP I 
GGGGX* | 

4898 

A 

69 

303 

r onwir r i'/vvKUlJljUPVJEGHCLNLSGVCRRDVCKWEDQIGACRRRM 1 

KdrRTWWTT.MDTDTDT TMCnv^cnr t^TJTtrr «■ 1 

4899 

A 

260 

881 

MTVTHITIYNKAQVSILKCRGRVKGCGSAERRSHPNQHVAARTPSDA 
oi\x x x^wru^^fKjKjJKfKbt^ l XUSVVSNSVDSSSDVCRSMSVESQSIVAY 1 
FLHSQVQAAGYLNPLHLAVKLCQAPGTSNNSDCLRPRRRRCHRQHTP 
SDHFTCWPSNVQQRI KHPS ARTLES APWPAKS PAI FADGPTGGAACI 
LQALFRAGSTAKAAMVASR 

4900 

A 

373 

491 

ASDHFELVRWLTPGIPALWEAQEGGLFGPGSSRPTWATF 

4901 

A 

1 

232 

^nijv^ 1 v lAAKJ.ijKtjyKKDKLGPEIPIiAI^RFPYVALSKTYNVDKHV 1 
PDSGATATAYLCGVKGNFQTPRPADLCFSP 

4902 

A 

3 

703 

LGVPTVTATRILKGQKNGKLGPETPIJVMDRFPYIJU.SKTYNVDRQVP 
DSAATATAYLCGVKANFQTIGLSAAARFNQCNTTRGNEVISVMNRAK 
QAGKSVGVVTTTRVQHASPAGTYAHTVNRNWYSDADMPASARQEGCQ 
DIATQLISNMDIDVRPPGQGLGLGRGEGGTGSDPGNQKPDLGQQVLE 
V I LGGGRKYMF PMGTPDPE Y PADAS QNG I RUDGKNL VQE WLAKHQ | 

4903 

A 

3 

1622 

CPPDMLGPCMLLLLLLLGLRLQLSLGI ILVEEENPDFWNREAAEALG 
AAKKLQPAQTAAKNLI I FLGDGVGVS TVTAARILKGQKKDKLGPEI P 
LAMDR F P YVAL S KTYNVDKHVPD SGATATAYLCGVKGNFQT I GLS AA 
ARFNQCNTTRGNEVISVMNRAKKAGKSVGVVTTTRVQHASPAGTYAH 
TVNRNWYS D AD VPAS ARQEGCQD I ATQL I SNMD I DV I LGGGRKYM FR 

MGTPDPEYPDDYSQGGTRLDGKNLVQEWXJUOIQGARYVWNRTELMRA 
to l,u Fi> VAHLMGL FE PGDMKYE I HRDSTLDPS LMEMTEAAIiRLLS RNP 

RGFFLFVEGGRIDHGHHESRAYRAIiTETIMFDDAIERAGQLTSEEDT 

ajouv A-^^xlc>rlv^^^l^l^^FiJK^^SIFGLAPGKARDRKAYTVLLYGNGP I 

GYVLKDGARPDVTESESGSPEYRQQSAVPLDEETHAGEDVAVFARGP 

QAHLVHGVQEQTFIAHVMAFAACLEPYTACDLAPPAGTTDAAHPGRS 
WPALLPLLiAGTTjT.T.T.PTnTaTS f 

4904 

A 

26 

168 

TCLTHTHTHTHTHTHTHTGLLSLVQVCQVCRQNMSESGFVTTDHKPT | 

4905 

A 

258 

486 

GHTGPLGSPWSSVWVCLAGRQVPGPQHPHRPPGCSWGCRPPAGTGPR 
LPSASAPRCCPPRMRLEPRASRRSGTSG j 

4906 

A 

1121 

1518 

uutr^Q x iniv>\x ijW^JJ^UAbHTIMQGTGALTPGPAPPARGPWPLLRWPP 
QPTAARASEIWHRSGPQHVALIRRESCGSQGAPRGRRPPPRPAVPAD 
xhj vj v_ v. .uft^> x K^Ij to PCjAHQ S AS AS ERGG S SMA | 

4907 

A 

19 

541 

YARTLCLTCHIFFFFLETDPCSITQARVWWCEHGSLQPRPSGSSDPH 
ASDPQS AG I TGESHHAGlxNLQGI LRLKCHLLNHPCFKS VGGHPVSAA 
DLAWTLGCEREGWAGLEEPEALVMSQAHLAGHPLRCTCPRWPPQQRR 
ATGGWRASTGRSSGGEGRAQAPRRLRRPEPGVR 

4 908 

A 

288 

561 

ISLHYKEENPPNETDMNIVTGLHTHTHTHTHTHTHTQIYFNPVTTRN 
LETTYQQRGTERLPFSKMSKKRRVGWEVIiVYLDEQGSEMARHE 

4909 
4910 

A 
A 

1 

81 

1857 
416 

MLESKEEDVEVEKWSPLSLHLHTLKGYGQVQEARYSPKVIIEIWKMG 
VDWSTGLVLKAEQHSFAFGVMETGIRIRJCKKTSNESKSGGPSPSQMG 
GSGNLIGRGRRKGGVKIWCKEPI^RTDPAKKVRQVKWCSWCVLEGES 
VE I VSEVRVE VGELN 1 1 KDWGRE S VE KGGAV I S MEAERVKGQAM IAT 
GG VI TGLAALKRQDS ARSQQHVNLS PS PATQEKKP IRRRPRADWW 
RGKIRLYSPSGFFLILGVLISIIGIAMAVLGYWPQKEHFIDAETTLS 
TNETQVIRNEGGVWRFFEQHLHSDKMKMLGPFTMGIGIFIFICANA 
II^ENRDKETKIIHMRDIYSTVIDIHTLRIKEQRQMNGMYTGLMGET 
EVKQNGS SCASRLAANTI AS FSGFRS S FRMDS SVEEDELMLNEGKS S 
GHLMPPLLSDSSVSVreLYPPPSKTTDDKTSGSKKCETKSIVSSSIS 
AFTLPVI lOoNNCSaDEPS IDNI TEDAD^ 

TSESFQPVSTVLPRNNSIGESLSSQYKSSMALGPGAGQLLSSGAARR 

QFGSNTSLHLLSSHSKSLDLDRGPSTLTVSGRTTETSKLGLGUDREQ 
QAKGIYGN | 





PSLHPKASPVSPIKGKEAVSRDEQNIfflQGSHFLLPPKIPSWRDPPET | 
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LEEPQNAPRERPEGPAAAKKPPRHCELWTLGCPEIHGDIiRPWDRKR 
QPRSIjRGSHJjGGQRLHG 

4911 

A 

1 

507 

ASPKPVLCHQSKERKPSAEMNRITTKEATSSCPPKSPLGETRQKLWR 
SLKMLPERGQRVRQQLKXPPRHCELVVTLGCPEIHGDLRPWDRKRQP 
R SLRG SHXjGGQRLHG SLiCGH I S QKPLTAPGTKRQKGPHQEGREE S KP 
EHQGLAEPENDQLPEEYQQPPPFAPGYC 

4912 

A 

519 

858 

GLSEILAIGFCPHQVQRGPLGKVLQS\LAAQLPGPFTRLVHRRQSLH 
AVPKC IHAVFCb r I liJr HLiFyJbCaLiQKPtur LiAPWGIjHLjI^TPVWPFaNP 
RCCS ILGAGGYFVQLTLGMS 

4913 

A 

67 

467 

QRPGSCPYRYPHGWTGLCTPRWASWPKTGSPSGSMVQSQERMLRTCW 
SHSHWDPFSSGSVTAMWATHSPTSKAWAGIQGRGRWALGFPWRGQGV 
LHVIASQKAAVTEAVDLRTGAASQAMSEAFSRSGMFYLR 

4914 

A 

2 

307 

PPETLEEPQNAPRERPEGPAAAKKPPRHCELWTIiGCPEIHGDLRPW 
DRKRQPRSLRGSHLGGQRIiHGSLCGHISQKPIjTAPGTKRQKGPHQEG 
REVGQLH 

4915 

A 

1 

2112 

mepveangkdpkegiqnpqaesgglqqtpghthgalrhsqsqldeva 
acpqygaasrvrkrphwgwltclrkqelprgsqprpidvqtlnrnpg 
scrhwhsyseglghprvsovcdprasaleetersravllwdwdlgmq 
gsasrdhqgastgqkprlgdppgqaterglpkaspvspikgkeavsr 
deqnnhqgshfllppkipswrdppetleepqnaprerpegpaaakka 
tsppvniwtlgcpeihgdlrpvfdrkrqprslrgshlggqrlhgsiic 
ghisqkpltapgtkrqkgphqegreeamteagklplplpprldwfvh 

TQMGQLAQDGVPEWFHGAISREDAENLLESQPLGSFLIRVSHSHVGY 
TL.S YKAQS SCCH FMVKLLDDGTFM I PGEKVAHTS LDALVTFHQQKP I 
EPRRELLTQPCRQAEGGDHKKDPANVDYEDLFLYSNAVAEEAACPVS 
APEEL FPPHIRL FYR YQDLAPQLVPLDYTTCPDVKVPYEIjIGSMPEIi 
KDNPFRQRIAQVFSEDGDGHMTLDNFLDMFSVMSEMAPRDLKAYYAF 
KIYGLQMETPGCKEFRSLQTFLWKMEQCETAWNSMDAFGNCVEALSP 
LPQLPDGGPQEQSSNCTSSFSSHlrPYLYISRGDNMDKPGKWPGQLGS 

43316 

A 

*7 O C 

T fit D 

O O C^OD AA7ITCTaTMI> ^T'DTrr* VITT 7\ tDUTT*JI5 7ATfr , D TV T5/^vT^D AUJA OH*/"* A COCPn 
KobbyrAVf onrlKy 1 F&ljJviiXiAJr VM WJr A VL-KAFyXJir AWAir 1? yAbbbrVi 

PMLS PAGPREATCNYAAFRLPPQPPGH 
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4918 

A 

2 

134 

SGLGRLPGPWQEAGSSRGPSSGDMAGVKALVALSFSGAIGLTF/liHM 

Ty2r*2iT T7T^»V/^!T ri'DT r T*iC\7VTJ13T.I? r irT T TTU/IT O DT "DWCT TV VTJTPTVncnTlTQ 

IjOujHXiii-JJ x \jL>\j¥ 1 iJo V i WJrLif VX»Xr rivjXoFX Frir XA1\-K.V X iDoUAlb 
SACRELAYFFTTGIWSCLWISPVILaRVaIiIKXWGACGLCV/VAGN 
AVIFLTIQG\FFPIFGRGDDFSWE\QWGYWTDFS 

4 y is 

TV 
A 
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L.F i X»^ V WVjrKvjoyijr-c'i'oF X X AFrlr AI F rf Jr 0 Fi? 1 XALL 

4 y*iu 

A 

one 
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X V I AV X Ik, VrHSUrJr VJjV^v«r X»lNriXW XijixoL. X oyPiX V XX»v^WXi»^JI\J v loy 

GVEIYI 

4921 

A 

504 

690 

GKIYTYTCVCVCLCVCVCIYIYFHAIiPRLTKQCGGSRSSGKCMTIiGY 
WYDLGNS TVQNYLP 

4922 

A 

3 

33 6 

TKFKSKS X»TS GXt r AGG VG AEPDW SGQRHAGAVPRAQPRRGRGRGWRA 
ARPSSPKHAGRVAPSCbVGSLITTWPVNHPKEAGNESGAFSAPGLSP 
SFLGFQGHSLEARWRV 

4923 

A 

22 

129 

ASFQGTYSSDHREKLTSVI FFCLTYTHTHTHTHTHS 

4924 

A 

852 

1366 

RDTVSIAPYLEKCLFGVRGRGGAWPKVKaEGRNAWPLHPVPSIIPGS 
HPMPAELKHKTREVPARKARSGDAEHPGVSaDHPLLIiGPGPGPTLHH 

X r xr lr *Vijv_ FN ri V ft i-» X nVwCtlrHwV-XiXiXiv-oFXjJNlFAJNI Vvjoxlrir JXXiJMX 

FKKHFFFWRQSLAVS PTLLTPGSLS S PRAT 

4925 

A 

1 • 

897 

MCLSLDPCKLLSPVKIVAYIDRCVCAGRSSSQSPVIARKGKCLVSPK 
GFSVVKVAAliSRRLLPVLPTARKSEKGSETAPiJRIjpGCHAAlAGRJCT 
GCGILSPGPVSALPLPSHPFFSSPSVPHEAASESFINMSDPVIQGSL 
SPGVTVQGQLGTWPHKQEEYTLVVYIIDGVDFVTSNSSRRVVIETYF 
KELARGIVGPGNLKSAEKAaAEIQTIi^ 

NFTDRKTAGEEKRVS PFELMSTHTS 1 1 HRS AHGPHLDPYNRHTYLYN 
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RPYATNYPSNITPQDHS 

4926 

B 

449 

2578 

MLLECEEEAVCVIMCASVKYNIRGPALIPRMKTKHRIYYITLPSIVL 
LGLIATGMFQFWPHSIESSNDWNVEKRSIRDVPVVRLPADSPIPERG 
DLSCRl^TCFDVYRCGFNPKNKIK^IYAIiKKYVDDFGVSVSNTISR 
EYNELLMAISDSDYYTDDINRACLFVPSIDVLNQNTLRIKETAQAMA 
QLSRWDRGTNHLIiFNMLPGGPPDYNTALDVPRDRALLAGGGFSTWTY 
RQGYDVSIPVYSPLSAEVDLPEKGPGPRQYFLLSSQVGLHPEYREDL 
EALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKHQVFDYPQVLQEAT 
FCWLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 
WPEEKMSDVYSILQSIPQRQIEEMQRQARWFWEAYFQSIKAIALAT 
LQIINDRIYPYAAISYEEWNDPPAVKWGSVSNPLFLPLIPPQSQGFT 
AIVLTYDRVESLFRVITEVSKVPSLSKLLWWNNQNKNPPEDSLWPK 
IRVPLKVVRTAENKLSNRFFPYDEIETEAVliAIDDDIIMLTSDELQF 
GYEVWREFPDRLVGYPGRLHLWDHE1WKWKYESEWTNEVSWVLTGAA 
F YHKYFNYL YTYKM PGD I KNWVDAHMNCED I AMNFLVANVTGKAVI K 

VTPRKKFKCPECTAIDGLSLDQTHMVERSECINKFASVFGTMPLKVW 
EHRA* 

4927 

A 

533 

804 

HWREEAGLTAQRPDTHCFLYIQENEIYTGNTGCTFGGGVSYKANVAH 
HQPPARGRKVTEPPPRWERDARRPTEPSGLPWPEHTQRACGWS 

4928 

A 

965 

1109 

PVWATRDCNLQEISTVCQDLCPPLPPPNPHCSAQGGGEGDPLFRLTT 

L i 

4929 

A 

1 

1337 

MAS SKLRE P VDE VFDLDLAVP ETARLD S S LHKARAQLLAKGRRHR P S 
RSRLRDSAS S AEDGEGSDGPGGKVTDGCGS PLHRLRS PLHSGPGSPA 
GGSFCLDPPGLRRSLDEDEPPPSPLTRYRPLHNAASHEGLAAASCSP 
PRSAPSSDSSPSFVRRHPRAEPHSEDDSRDASPPEPASPTIGLDKKT 
RRKFLDLGVTLRRASTGKSRKEKGSNRLSMGSRESVEGSGRSGGSPF 
LPFSWFTDSGKGSASSGSTTSPTCSPKHEGFSPKKSASQAQLLAHAQ 
EALSNSWCLVQDFGSTQIGMESTLSDDSTPPSSSPKIPSGPWQEAKC 
SYPYHTLSQSSDEFLDEPLPPVHHWTSQQVGQWLQSLNI#EQYAAEFA 
ARQVDGPQLLQLDGSKLKSLGLSNSHDRALVKRKLKEMAAAAEKERK 
AQEKAARQREKLRRREQEAKKS 

4930 

A 

507 

981 

CWRSTATCYQWVI KS ANRCTLQVGTRRRTMDNGDERHSR I CPGM * EA 
ARSLFIPT/ CQRTRH*RPVPQRLRSNTAPIRT/ RPRPSVHRPTGRES 
MPRTRPPEPEPLQTPLAATKRRSTSENASYSDALLRGLGKLSGKDRE 
ARRGPRS RPG SRVQTAGAAIj 

4931 

A 

1121 

1272 

YDDTNEYFAIKTYYFKIYIYIYLYICIFLQNYYKIjCLMMVSYKKQGG 
KGP 

^ ^ J <6 

TV 
/"V 

CA 

O A C 

KSESGSFADASRCDPAHSQVWWLRGETTSA*EEQREE*RGLRLELRY 
QLSHSGKEHQAS SLGP 

AQ"J1 

A 

J. U :> 

1176 

KliFHEKSQCPMINLCICFLKGEDGEKimKLIiADIRGHKMSSSFVQMT 
FSC 

4934 

c 

178 

360 

MGQELRHFGTVTWPMHRKMERRXFPREQGPGXQLLPQWRGNEKTLGS 
AGRQGEDHRLRAE * 

4935 

A 

409 

516 

YPPPGPLQINNTHSIPPPPTPYVGFQGQLKNRWDKN 

4936 

A 

288 

573 

GLIiSS SATHEAHGCH IDLYQQMLRRSHSDFAMGRGRRAWGEGGREGT 
RWSPWMGSLGGLAGLGKNPGNNGTWRSSSVRMPCQGRKASGKIPPVT 

4937 

B 

1 

1152 

MGSEAGCLVRAERFQLIVERDVRSSFPSWKELSMPGFXQKQQARVYV 
QFFLADALILPVPRPCLHSATPSTPQTDPTGPEGPHLGQSRLFLLCH 
KEALMKRNFCVPPGASPEVPKPALSFYVLGSWLGGTQRKEGTGWGLP 
cxr\^jviLJw*Lj^r^ v njji r r Vjo o vKW r ii r Jjriir'QjQ Vx RLi I APG PATPMXjFE 
KDGSSCISRRPLELAGCASCLTVQDNWTLELESSQDIQDVLDANKSL 
PESSLTDLLSDKSHNVYCCFRSSTYVQVLSFPPETTISIPLPHIYLA 
ELLQGGQSPFQATASCHIVSVFSLQLFWVCAYCTSICRQSSARVDEP 
MTMFLWTLCTS PSVLRP I VLS FELERKPS KI VPLGTPSRS S S FQPFA 
ARAKGGK* 

4938 

A 

409 | 

701 

KEKVKVNWLLNPLPKAQE FGYPS PS LPATLHLKLG SRPCKLQDH / CC 


f©?*7 
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CHMVIHTGHHGFGS PI LHOEPMG «5R 1? KYQP T . 

4939 

A 

301 

573 

KKKDSVRPGPPLTLRPFCPSRRPPTPPAAMQTi\i\wt pdpcvppdt no 
AQPRPYLGFWWLHLIKVVCNFGTKWPCPACCWPSTLNPSVTHG 

4940 

A 

754 

999 

MTLGITSFISFYFCFIIYIFOYIiTTfiTOOOT TT?Q\rr ttitpttvvtot tt 
ISKEEKNTFKISHWIWKI I IMELWFIiMS THIS FRY 

4941 

A 

339 

494 

CVHSQETRGAASTSPGSRTTSQPSTHSTTATTTPPPNRAPPPSRPPP 
RPSV 

4942 

A 

408 

548 

CLEVCAG YHHHHHRHHHHHQTSIiERS KPKCTLGHMALWHS EKYS SC 

4943 

A 

1421 

1581 

VRAGPLTLLKEHSQLFKLSFQKEMENFEAPFHCLAKRFHQLYREKVE 
VFRALA 

4944 

A 

1 

1113 

MHQEDLRAWYIiDLGLPSHQNAQPTAWKCQRAPS P YTHQDMAL I PS PT 

ARWTiSP'FTC"'FPK'Of"2'P'\7f2ir VCT T oniiTT T>t H'i\nn ▼ ________ ___ __... _ _ 

n jjo f r. v <j sl ivt> jjJbi'JJ P I LiPLiTD PRLTGSTEQAHAEGLAA 
LMS ALR VS HLQGRGG WTL VD S QLG V I AVS S TQ FNKG P S YRLLAD VQ 
NRLLPKYDSQKEAELRS WI KGFTGLS I RPDFQKGLKDG 1 1 LCTLVNK 
LQPGSVPKIN\ASV*NWH*LENLSNFLKAMVSYGM/NPVDLFEANNL 
FE SGNNMQ VR VS LLALAG KAKTKGLRS G VD I RDKYS E KQN FDDTTMK 

ASO("7VTRTiO TTWKT'Zl CnQPMTBVPTDOUT VrtnmTnTr ««»/m\trtmt-n» 

rt °V L v J-Rijvx iiMi\.^M.i?yi>^i v j 1 AxoTRKHIjYDPKNRILPPMDNSTISL 
QMGTNKCASQVGMTAPGTQRHIYDTKLGIDKCENSSMSLKMGYT 

4945 

A 

272 

902 

LOHRTjTjS PAT iT* A PTT .Pi DPiT .PlDT OOPT Dr'B'KTrvf r-»Tr-r»T\ -a rtr%nn« _______ 

j-i^xa-vjjijox' njj a n_* x i^v "W^w^^-K^^IjKoKI^DS SKPAADPS PACTPPA 
WHCRWSSALSSSHPHGLLTAQLSASLPSLLYRWSSQQPTYSWGSAI 
PVSPVQSLPLSPPKAALSAAPAPPGERSAGESNREKRTGRGWLTNTN 
LQLDRSNEHTYDTKLG IDKCDNS S TS LKMG YTQG AHQSGQAFGMGLQ 
IYDPRYYWGGPVAHGAPSGSGP 

4946 

A 

281 

471 

TGSWSQCTPSC*TTMCVYTFMTYTWAWGCPPNHVQPHA*PVSPTVLL 
NVGFRAKAVFSGHGYL 

4947 

A 

1192 

2248 

AP S KKLRS ATLRR PG S S KDTARG P VS VG WC P S QEPLGAR CATS CP WL 
CRIjPSSVFSPRALAIRLGGGSGSLSAGCAARLQGSVPCWPI^ 

RA r PPT 1 ^T?T.POPIP_PP"DDP»T"\TiP»OT5T nrrtTTiv nnnik nm> _ _ _ _ _ 

ort. x iruo ^"KWVi^A^Flr'KUJJPQabRJjSVPVAPCPAPRAPQPRRLPSPGP 
RFCRCPSLSVCRSLG/PVSPPPQPFLPPCLPRWl^SLAL.GRGETAS 
QANPRNIX.KXLAMKFKKFFDFGAIFEWSQS 

PVPASLiPERPSPRHPA^C! cnpi PriDPT TT PCI puvt DTnTnvxinrmn 
"•* J * r * -f^-Ti moo ovj r\.i_j^uf Vj J_» J. 1jL»o lj\_ri x Lit* IRTD YAW TR 

AGNGGECVWLOKFPVKIjASLMMSFnT.VTr:T4T?T PUDDnwDnuADOTDc 
FFTPKSVMEPSAVPGTAERVLHPG 

4948 

A 

654 

730 

VCDPSVMCCPPKVPMGSPRPCSMEE 

4 94 9 

A 

1574 

1881 

KHLRWPLRTGDSEPPVPRGSSEAEGPIiPSPTTAGGHGQGSCRRPGKV 
SIRPPGIHHRSYFLKCNEPOGLSGMRAT.r'TPT zit nwr r irDO\rDT t>r*oo 
GHSPAMQV 

4950 

A 

207 i 

315 

SCCRGTACGPESGTKKVPRAHVSREAVAQSGPKPL 

4951 

B 

199 

420 

MSLPIGIYRRAVSYDDTLEDPAPMTPPPSDMGSVPWKPVIPERKYQH 
LAKVEEGEASLPSPAMTLSSAIDSVDK* 

4952 

A 

336 

594 

HLS VRLVI TS FHCALEAS APLGKMS LP IG I YRRAVS YDDTLEDPAPM 
TPPPSDMGSVPWEPVISERKYQHLAKVEEREASLPSPA 

4953 

A 

401 

1301 

Vft] ^ & i^AKWtj FAHKIJ x TAGAVSYDDTLEDPAPMTPPPSDMGSVPWKPV 
IPERKYQHLAKCGGRRGQSTLPCHDPVIKPFDSVDKAPKVAHCAKAra 
VIMNSLITKQTQESIQHFERQAGLRDAGYTPHKGLTTRGDQVPFELA 

EAJjHKXj KXiOS fi RVTtf TCP PP PS QT.T-IP> DC DDPAT5T DTnt nv/-rt 

wmj ' * ■* VJJ luJ vf v rt r\_rtxu*.\j ^ i-»riy Jr o lr Jr yAPL»KTPLiPKQRPRGWFTSG 
SSTGLPGPNPSIMDSGSWGIKDKKLCSDl^WSLFGPRSLQKYDSGSFA 

TOAYRGAOKPS PTiKTiTR APlAKJPMZi TT'non 7\ T vnnvmrn Tninurcn w/\n t-i 
* * i\r j r ^"^^-^^wrtJNfvrTrtXixJJrAAx^ 

RAHNLKPRDLNVLTPTGF 

4954 

A 

388 

681 

PFESICSPIGANESSCCETGIKCVCSRRCRCGCTYKANSIilQGEGEM 

EELTDLKVGLCCLKLGFCTHMPCMCVYNCl^EPSGSI^^ 
VIRV 

4955 

A 

272 

452 

LHWPGSlxAAAQEFLKRGRSPPVRAGLLTMWGLYYVGGLGFSFTEMLS 
HQLCPTCPNSSPL 

4956 

A 

256 

381 

EREFCF ATQVGGQGGDFGS LNPLTPRLKFFS CLTP P I S W I TG | 
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4957 

A 

58 

301 

VMRLKPSSTGASCSNSLCVNMPSWSVTRFWLAAGSRPPPRSHRNAGS 

T.PPfJPP QPQDPT.CHDHPT.T?D Q CTKTUT>VO C T>r\Tf2 n V. 

4958 

A 

- 

562 

1221 

RDVAVRCFEVTGLHDVDQGWMRAVRKHAKGLHIVPRLLFEDWTYDDF 

HMLTHFGRGSAPGPAAGPPGSSRLPSPPGPTSWACSRTRSFEQLAPV 
LGWFSAFIDLTDYFYSAFSLALIAPLSWVRACVPGPGPESPKWRSKN 
PSWGFNFIjWYGTTRPPRMPVSLVVGPQVHPRH 

4959 

A 

152 

358 

CGLFKISNTPMLVSSIDTTSFKWIPRCNCSYCLCLSCSSHCLNTYHC 
LNGNYFAIVKKHYQLGLIVGS 

4960 

A 

184 

361 

GGPFKKTLGGPKFNGGWKGKIFSLKGGVLKTTWGFLEKNLFFGGGKY 
GPNPPPKFKGFR 

4961 

A 

2017 

24Q6 

GSQAVGVSQHHTIPKVPGDLLWLIGGSSGVGCCPAVEEEQQSDMSPP 

FGDOASAlVWWGPGTRHTC^HH^PMnWTTC 

IPILKNTISGMLLHQRRFFEIAHLPELSKYRSFHFL 

4962 

A 

198 

629 

IQSVFVRSSFGLLAVTPACCLLRLISSVAmLSRSLAGLSTLPDQKLS 
WHAQRNSTHMSRWLRPSRSRIRRISLAHPAKFGF\RLMVR*SSPCQT 
STLRRSGLLLS PTCCR S ELORPV 1 ^ POKflR FT »7VnOP TRON^T .TNNT ,PP 
FLL 

4963 

A 

132 

326 

YVHCKDLWMDCAHVSS PRKNS * PFHRRHPLQGDKKPLPS PRNSSMPA 
VYPYWIiPQRS IRI FQHI 

4964 

A 

3 

396 

GLGPGRQPRPPWRRGPPSAALQVRSSGR*LGGAVLQVRPSKCGPRGG 
NFAGQERIWADaoDDGYARLSaNTMLL 

YG WNKPMDR ITDSQLSE I TKLPVKRCNEAKLELVRMN I IKQQGGMFG 
PNKNISEWG 

4965 

A 

1 

2463 

MKLCRELSDLWYTNS VAAQD I VDDGTTGNVLS FS ETRAHQWQQKS 

EQFMIYNQKQLTRIYPSAYRIDSSNFNPLPYWNAGCQLVALNYQSEG 

RMMQLNRAKFKANGNCGYVLKPQQMCKGTFNPFSGDPLPANPK^ 

LKVISGQQLPKPPDSMFGDRGEIIDPFVEVEIIGIiPVDCCKDQTRW 

DDNGFNPWEETLTFTVHMPEIALTOFLVVTOHDPIGRDFVGQRTVTF 

SSLVPGYRHVYLEGLTEASIFVHITINEIYGKNRQLQGIjKGLFNKNP 

RHSSSE1WSHYVRKRSIGDRILRRTASAPAKGRKKSKMGFQEMVEIK 

DSVSEATRDQDGVLRRTTRSLQARPVSMPVDRNLLGALSLPVSETAK 

DIEGKENSIaAEDKDGRRKGKASIKDPHFLNFNKXLSSSSSALIxHKD 

SQGDTIVSTAHMSVTGEQLGMSSPRGGRTTSNATSNCQENPCPSKSIi 

bFivyHJ-iAFUPV VNP 1 QDlxHijVKIKEKGNPEDFVEGKS ILiSGSVIjSHS 

NLEIKNIiEGNRGKGRAATSFSLSDVSMLCSDIPDLHSTAILQESVIS 

HIiIDNVTLTNENEPGSS I SAI^IGQFDETIWQALTWSHLHNTSVMSG 

HCPLPSLGLKMPIKHGFCKGKSKSSFLCSSPELIALSSSETTKHATN 

TVYETTCTPISKTKPDDDLSSKAKTAALESNLPGSPNTSRGWLPKSP 

X IvVJDL/nD x AJJVOUDIrriOOrL'AJ x ljEtU V 1/iUr 1 bLr IMoVjJioo JL* V Aj J. aJoJCjO 

El^SLTTCEYTUlEGTSQlxASPLKILKYNQGMGLPDSWTVVIVIACLGI» 
ATQWGYQAPAWYSGI SAKS PVK 

4966 

A 

1 

607 

MTNTAKI IjNFGRGl^AGOERlWADLDDGYARLSNMtiljEAY SGADLTK 
RQ FKVLLAI LRKTYGWNKPMDR I TD S QLS E I TKL P VKRCNEAKLELV 
RMNI I KQQGGMFGPNKNI SEWCI PQl^GKSPKTRDKTSLKLGDCYPS 
KQGDTKDTITKEKPvKBYSSENSGESSDQPElTOLSVVKPDAAIQSGSK 
WGTAEDLTAARVDV 

4967 

A 

1 

3105 

MTKLVALWREEAENDLS WKPDAAI QSGSKWGTAEDLTAAEWMFDMV 
KTI AP S ARKPNFAGWANDI RLMRERDGRNHRD I LDLRGG SMKNIAAQ 
MVNFDREQMRRIANNMPEQYDEKPQVQQVAQI INAS YWQDSRRALAN 
RDQltfEVNE IRRQGVLAFRENG I NQDGTGNARKPDRRNDHFLPSTRAD 
LVHGGREENN PLPDRNGQNRGGRLDALF7VAS LF SGTAPAS RGFARLF 
RS PSPTLTNS PTYELRYPTRTRATYPAVGLS I PNKKHTQPS PHMNNV 
TTDYCLD 1 1 KKFEVS EENKVKNVLG I EGFTNFMRS PACD I FNPLHHE 
VYQDMDQPLCNYY IAS SHNTYIjTGDQIjLS QS KVDMYARVLQEGCRCV 
EVDCWDGPDGEPVVHHGYTLTSKILFRDVVETINKHAFVKNEFPVIIi 
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S IENHCS IQQQRKI AQYLKGI FGDKLDLSSVDTGECKQLPSPQSLKG 

KILVKGKKLPYHLGDDAEEGEVSDEDSADEIEDECKFKLHYSNGTTE 

HQVESFIRKKLESLIiKESQIRDKEDPDSFTVRAliLKATHEGLNAHLK 

QSPDVKESGKKSHGRSLMTNFGKHKKTTKSRSKSYSTDDEEDTQQST 

GKEGGQLYRLGRRRKTMKLCRELSDLWYtNSVAAQDIVDDGTTGNV 

LSFSETRAHQWQQKSEQFMIYNQKQLTRIYPSAYRIDSSNFNPLPY 

WNAGCQLVAIJTYQSEGRMMQLNRAKPKANGNCGYV^ 

PFSGDPLPANPKKQLILKVISGQQIiPKPPDSMFGDRGEIIDPFVEVE 

1 1 GLPVDCCKDQTRVVDDNGFNP VWEETLTFTVHMPE I ALVRFLVWD 

•XODTOPT^P\700PT\JTWQCT A7POV"DW\7VT T?r*T T'T? HOT TTATU TTT MT? T VfJ 
rii->.lr XuKUr VbyK 1 Vlr obliVrblKHV x XjXU\3±j x clAo X r vriX X xJNiix xtj 

TCW C P L> T T ,N P O YT T T .H V T .O AT TOJP OT .OO T . KYTT . FTtf TTKI PP W Q Q C I^iSjTJQWY 

VRKRSIGDRILRRTASAPAKGRKXSKMGFQEIWEIKDSVSEATRDQD 
GVTjRRTTR SLQARPVSM PVDRNLLGAL S LPVS ETAKDI EGKENS I>VQ 
I 

4968 


1 

396 

NKKKKRDGDDRWFRLVLS FDEEKRREKAAT EE T KORLKEEORKLREE 
RHQEYLKMLAEREFJUJSEADEL^ I 
S DLDLS GARLLGLTP PEGGAGDR S EEEAS S TEKPTKAL 

4969 

A 

67 

180 

WNDLVTGEGRRNWSQTSQQKQQRQHAVGTPGEQQAIiW 


a 

2 

A1 C 
** J- ~> 

T .OfiFKARfrT .TAMOPWKK'KTCRnOTynPP PP T.VT.QT?DP , T?KT?PI?YT.Tf3 pWlf 

RKVERKKAAI EE I KQRLKEEQRKLREERHQE xTIjKMLAEREEALEEAD 
ELDRLVTAKTBSVOYDHPNHTVTVTTISDLDIiSGAP^ 

4971 

A 

1090 

1203 

S VFCQNC S KAPVSN S VGVFMGKNKTKQKRWL I R PRRCC 

4972 

A 

3 

356 

VQNRTLHFSLLYEPKRKSKPESYLYCYKIFQDEPIiAP*AFL* IPASL 
SQIQ*FTSRKGKASQNLIYIVIKYSKMSLWPPEPSCKYIjPPCLKSNS 

DyiiO WXjXjW^ Vxj V WL» VonM L.yijWLrtjX^X^AJttXjVoxjWvj^x o 

QHTSLRGVPKGVIFIIYI 

4973 

B 

148 

305 

MDNSASVEQLQETLLPJ\LRALVLK1^PLDTSRFTKLLLKLPDTO 
SSLGHN* 

4974 

A 

484 

927 

GREGGLTGWEVTSPAEPTLPAFRHLLLI^QRPGRRYGTPSPSPSAFW 
VjAbOmjU «LA(_y VLiAJJljKr JbxjKGARC? VKxjTAVVFP TDRSGMENSASVE 
QLQETLLRALRALVxjKira 
SFRVDAQ 

^ if / 5 

A 

o n £ ^ 

it X.D 

KL»*-Jrr Ij^jKAAx-'rvjijXj.fri VJvUv ■*» VirlrJLjoKI x x oxjyiiljvjiAMoMuxJxjA-ioA 
MF \DFS EKLNS LALTEEELGLF\ TA\ VLLVS ADR \ SGME \ NSASVEQ 

T.OP\ TT.T.PA\ TiP AT.VT 1 K^S^RPT.^^TOPPTT.T.T.T.TO.PT^T.PT , T.^T^IMT4Q'PKT, 

LVLSGWTAPVTPPP^GLSAGCPLCTESE^ 

FILL 

4976 

A 

173 

520 

VLMVRFASLFlWKDQTVl^FLSRTx^SLQELGAM 
KLNSIxALTEEELGLFTAVVLVSAGRSGSRLTRGRROTOFNSTi^ 
TYSMPG P APGTG VYRHD SDTV 

4977 

A 

44 

213 

RVTCPRLKNFPLPPGKKRFLEKGK3CGDFWPREKGKPPPFLISPPQSL 
NPVLFWGFF 

4978 

A 

840 

980 

KLQ\njLPLHWSVXCSPWPTC^FLTKXILGF 

4979 

A 

72 

464 

PCP IALPTFKFN I VFKQGFS F I FPLTKSNPREGATEMGICnSTTGILK 

LQRGTVPPl^TLKKWIWPSSRPl^CPQNSCPHPMKTO 

GLPGPMGQRQGSPRTATOPSSPATIASLGPPPLLSL 

4980 

A 

1 

132 

LPRDASREGCRLWDILNQCCSRDLSIDVECSLCCQTGQPLATCG 

4981 

A 

99 

212 

CARTMGGTPSGVmCCTPAEPSCSPRLCGTHYHHACHH 

4982 

A 

84 

196 

CaRTRGGTPSGWHCCTPAEPSCSPRLCGTHYHHACHH 

4983 

A 

2 

186 

ECLTSLSSSLQRWHAPPALPAQWKPHAACFKWQLKASQCWAVLTPL 
WSSWRGTGSLWGKG •* 

4984 

A 

87 

347 

SLNFHIGQ*AQPHAGFGLRGS*GPCPPCCQANAC*APEPNDSMEDQA 
PGVRHPPLSNVKIQGSB 

4985 

A 

176 

1554 

VTLASLPTISCl^NAWESYSVLGNCRPSDPYSmSWSYH 
SLTSVNGFHSKYALPSFSYYGFPSSNPVFPSQFLGPGAWGHSGSSGS 
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FEKKPDLHALHNSLS PAYGGAE FAELPS QAVPTDAHHPTPHHQQP AY 
PGPKEYLLPKAPLLHSVSRDPSPFAQSSNCYWRSIKQEPVDPLTQAB 
PVPRDAGKMGKTPLSEVSQNGGPSHLWGQYSGGPSMSPKRTNGVGGS 
WGVFS SGES PAI VPDKLS S FGAS CLAPSHFTDGQWGLFPGEGQQAAS 
HSGGRLRGKPWSPCKFGNSTSALAGPSLTEKPWALGAGDFNSALKGS 
PGFQDKLVmPMKGEEGRIPAAGASQLVFYQHKKLNQPNHGLALWEAK 
MKQLAJERARARQEEAARLGLGQQEAKLYGKKRKWGGTWAEPQQKEK 
KGWPTRQALAVPTDSAVTVSSYAYTKVTGPYSRWI 

4986 

A 

1 

1863 

GGGRAPPGVFAEPEGGASGASVPAAAGGPERLEERSGNWEELARLEG 
GPLRPGAGSQAPLGAKPRVFSGSQFHTRLCAVPGIPWRGAHLRGGMR 
SEKSLTLAAPGEVRGPEGEQQDAGDFPEAGGGGGCCSSERLVINISG 
LRFETQLRTLSLFPDTLLGDPGRRVRFFDPLRNEYFFDRNRPSFDAI 
L YYYQSGGRLRRPVNVPLD I FLEE I RFYQLGDEALAAFREDEGCLPE 
GGEDEKPLPSQPFQRQVWLLFEYPESSGPARGIAIVSVLVILISIVI 
FCLETLPQFRVDGRGGNNGGVSRVSPVSRGSQEEEEDEDDSYTFHHG 
ITPGEMGTGGSSSLSTLGGSFFTDPFFLVETLCIVWFTFELLVRFSA 
CPSKPAFFRNIMNIIDLVAIFPYFITLGTELVQQQEQQPASGGGGQN 
GQQAMSIAILRVIRLVRVFRIFKLSRHSKGLQILGKTLQASMREIiGL 
LIFFLFIGVILFSSAVYFAE7UDDDDSLFPSIPDAFWWAVVTMTTVGY 
GDMYPMTVGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHR.ETEQB 
EQGQYTHVTCGQPAPDLRATDNGLGKPDFPEANRERRPSYLPTPlIRA 
YAEKRMLTEV 

4987 

A 

1171 

2784 

WRGAHLRGGMRSEKSLTLAAPGEVRGPEGEQQDAGDFPEAGGGGGCC 
SSERLVINISGLRFETQLRTLSLFPDTLLGDPGRRVRFFDPLRNEYF 
FDRNRPSFDAILYYYQSGGRLRRPVNVPLDIFLEEIRFYQLGDEALA 
AFREDEGCLPEGGEDEKPLPSQPFQRQVWLLFEYPESSGPARGIAIV 
SVLVILISIVIFCLETLPQFRVDGRGGNNGGVSRVSPVSRGSQEEEE 
DEDDSYTFHHGITPGEMGTGGSSSLSTLGGSFFTDPFFLVETIiCIVW 
FTFELLVRFSACPSKPAFFRNIMNIIDLVAIFPYFITLGTELVQQQE 
QQPASGGGGQNGQQAMSLAILRVIRLVRVFRIFKLSRHSKGLQILGK 
TLQASMRELGLLI FFLF IGVT LFS SAVYFAEADDDDSL FPS I PDAFW 
WAWTMTTVGYGDMYPMTVGGKIVGSLCAIAGVLTIALPVPVIVSNF 
NYFYHRETEQEEQGQYTHVTCGQPAPDLRATDNGLGKPDFPEANRER 
RPSYLPTPHRAYAEKRMLTEV 

4988 

A 

176 

464 

NALRI PRGGLLAF Y SVL SIKISIPI FFTNKRG FFNPKEGLQAGKRGR 
QWWVKSPVPSRTCHQHHHLPRKTPSPACGPLPLPPGYPPFIAFDFKI 
NI 

4989 

A 

1 

2190 

M PE P PPAPGS CAARAS KTS AAPG S TAP S P I HRSRAEERGTTARNTWQA 
APTAAPVRDPLGEASWAPESANLVGIWRTFVSSSGIVNAPISTLSKR 
TNQLSVKQTNRLSVKWTNQQDVARGSATYREGEHGCWLLAPGGFRAL 
QLDNCRDGVCVRPPLRQVLGEAPPLQLACRFPPGTPAASPERGPGAE 
KRVCGKDVSEVTHCTHDLLHATLEDSWDVSSQLDRQQENWKRHLGSE 
ASTQKKI ITPQENFEQNKFGENSRLNTNLVTQLNI PARIRPSECETL 
GSNIA5HNADLLNE1WILAKJCKPYKCDKCRKAFIHRSSLTKHEKTHKG 
EGAFPNGTDQGIYPGKKHHECTDCGKTFLWKTQIiTEHQRIHTGEKPF 
ECNVCGKAFRHS S S LGQHENAHTGEKP YQCSLCGKAFQRS SSLVQHQ 
RIHTGEKPYRCNIiCGRSFRHGTSLTQHEVTHSGEKPFQCKECGKAFS 
RCSSLVQHERTHTGEKPFECSICGRAFGQSPSLYKHMRIHKRGKPYQ 
SSNYSIDFKHSTSLTQDESTLTEVKSYHCNDCGEDFSHITDFTDHQR 
IHTAENPYPCEQAFSQQAISHPGEKPYQCNVCGKAFKRSTSFIEHHR 
IHTGEKPYECNECGEAFSRRSSLTQHERTHTGEKPYECIDCGKAFSQ 
SSSLIQHERTHTGEKPYECNECGRAFRKK7TNLHDHQRIHTGEKPYSC 
KECGKNFSRSSALTKHQRIHTRNKL 

4990 

A 

7 

396 

HHELQYPIRFCPIKSCCKGRNSLQMKIYDLSIiQNFPPATI^RQNLALS 
PRLEC SGT I S APCNLHLPGSSDS PAS ASRV\ AGNAGMHCHKQ\ LI FV 
FLVETGFHHVAQAGLRTP* PCGSGPPRGPKSAGDLQ 
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4991 

A 

1 

2201 

MTNIAMVERDSEAGTAASRFPGNHATUCGKAQAHYKVWRPABVRCLKIi 
GPEWVTLRYTIKHRPYKLCGKRQHVFFFTSRSDVGFMLTTLKPFGSV 
SVESKMNNKAGSFFWNLRQFSTLVSTSRTMRLCCLGLCKPKIVHSNW 
NILNNFHNRMQSTDIIRYLFQDAFIFKSDVGFQTKGISTLTALRIER 
LLYTUCRLFFDSKQSLVPVDKSDDELKKVl^njNHEVSNEDVIiTKETKPN 
RISSRKLSEECNSLSDVLDAFSKAPTFPSSNYFTAMWTIAKRLSDDQ 
KRFEKRLMFSHPAFNQLCEHMMREAKIMQYKYLLFSLHAIVKLGIPQ 
NTILVQTLLRVTQERINECDEICLSVLSTVIiEAMEPCKNVHVLRTGF 

SVLNSQHMFEVLAAMNHRSLIIiLDECSKVVIiDNIHGCPLRIMINILQ 
S CKDLQYHNLDLFKGLAD YVAATFDI WKFRKVLF I L ILFENLGFRPV 
GLMDLFMKRIVEDPESLNMKNILSILHTYSSLNHVYKCQNKEQFVEV 
KA.SAJjTGYLHTISS\ENTjLDAVYSFCLMNYFPT.APT^nT."LnKD*I T <5F 

4 l/\urvu ^— * X* *-J 1. x x .x ^ 4-m» * ▼ I if 1 f 1 r V A O X wJJrUi X X Xr L, LnY Xr XT X" \j* 1 J XJ^*X\ fj J X. X O lit 

LLTSDDMKNAYKLHTLDTCLKLDDTVYLRDIALSLPQLPRELPSSHT 
NAKVAEAOjSSLLGGEGHFSKDVHLPHNYHI\DFEIRM\DTNR\NQVL 
PL/SLDVGTTSAT\DIQRLLTY1SFAGLSELKS 

4992 

A 

3 

485 

LHTLDTCLKLDDTVYLRDIAIiSLPQLPRELPSSHTNAKGGQRS*AAL 
LGGEGTPPSKDV\HIiAPHNY\HIDFEIQNGTPNRNPSAYPLS\DVDT 
TSCLQIFKE * LCYVFPRSAYCLGS SHPRGFIAMKMRHLNAMGFHVIL 
VNNWEDGQTRDGRCQS PFLKT 

4993 

A 

40 

131 

ATKAFEENEVYKTTLWYCFFKCVILKASL*FRILLTEWFYMRWLKK 
YHI VCF / PLGDAKI FC FMY ITVLFKTIjEDS C * TLWYC FFKCVT LKA 
SL 1 

4994 

A. 

2437 

2762 

GIILFWAQLFPASFFFFFFF*DGVSLCCPGWSAWRSQLTASSASRV 
OAILCLSLPSSWDYRHLPPCLANFFVFT. / CRDGGFTMT,ART.VT.NQ * A 
S*VHPPWPPQSAGDY 

4995 

A 

1 

299 

FSLIKISMMLLMKMEK*NLQFIW/KPRRLQIAKARLNASSSSSSSSS 
S S S S SSSS SVVWYWHKKFJilDHWNRLElJSNTNPH T P <5f>T,TT.TKVPGA 
NTKDHP 

4996 

A 

3 

358 

S TSHLS PLAG S RAL SSSS PRVARSDMAVAQVLGTDGNRRG CTSS WHE 
ALEDAOVGEDRHOGESGRPTRVROEWVWSGORHPPAKTKGKPSHTPP 

*■ 1 V V 1 > 1 rlVI 1^ W JLf U W4 \ « X XX V A\\^iJ ■ 1 V II lJ\J^XvAAA X/VX\X J\\J1VX Oil X JrX\ 

HVHEAPPSPFQPPFRTLSLSLSEF 

4997 

A 

76 

388 

VL P P P S S P ALHS PAP P S TC P YL PGA / P P PLL P PC AGR S P PAAAAPHC 
PAPCAPPJI*GSR*LESPAPQGPQSRAARMPAWPLPPAPPTDPTAPPA 
PRSHWPAAPPT 

4998 

A 

66 

381 

VliPPPSSPALHSPAPPSTCPYLPGALPPLE/GPPSRRPPRTFIGNPG 
GQGPGEVSPIVt»RSPSQPH*PGNQGPCPSSQPPGSPRSEHGC*HRCW 
ALYGQQEKPAPS 

4999 

C 

36 

236 

MGPTIPDXSXFFWRKPITWMPTWEGTSNVGPQPIiSSSKSI.HSXRGHP 
AP I PTGQAGPRD SGPGAS P * 

5000 

C 

231 

521 

MRAP VCQS QS VYFTQT I S LKPT I TVPGRCDAFQ VTNNKLAQATQS EG 
VAGASNPDLESHEPAHGMQREFSGVTAVSQADTVMNYPVSNGWENRG 
SG* 

5001 

A 

46 

1438 

LSTPTALVWPGHRPP * PI P * WPLCPQEGGPGGLPRGWSSGTVAP * S * 
AGGCEGKIQSYPQGSASWSEPAPSPS/PAFPAPFLMLNCSGPLALSL 
SGPKWAREQL* CVPPRALATGLPLEVWL*IiRPFPWP * CDPPPG/ PWV 
nT.T.H<5T.Q«;rwnppPMPPTTPr!TDDr!QTrDT mzT^&n i\m \ t>t)ljwt ocn 

yjjijnoiiOOV-nurf Jri'ZirJrv 1 i.n.o J. JrlrLio XV^K.JjV^.r L,r/A>VVjJj \ Jtr irVy WIjoovi 

QLPHPPGTPLCSVS*GGPDQPPLLCLSPWAPGLSSTALCHSRLV1,PA 
TPIPWTWPSCGLSTT/PPAPGATPTPAACFQQVPSAPDKPLLVSPS 
PAKHPPVPPPVGLGIiRAMDR/PPGPIPETPRD\L,PCHPTPPKSKLQA 
RGPWPGWVD * TTSG WI^RKEALMGPTI PDPKS S S WAGPLVSLPLVLG 
GNPS PWNALRGKRDPRKCGNPQPPSPKLRKSPPTA\ PGKTPAPYS PR 
EKAGPQKTLGPRGPPP * SRDPPL * L» \ RTPADPGVRTRGAGPPAPPRR 
NP 

5002 

A 

1 

874 

liDDALESCDEHGYRTAS S SCLRKGVMVKSS SNRELV* SRYTD ITI / S 
GSTSEFPDWAAGQRGSSHPRRWAARQRRSSLHIjFYSEPDTDAKHQDD 
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ALTEEET * G/ SRSAS CAHWL * SNCCTATGS S TLLPLTSRL * LPRMAV 
KSSRASAAIjPSSRFSSWHRACSASPAAATTAAPSAWGLSSNIPSGPA 
GCAPRPPTSFSLPGALPTTTRRRHSS\DRSRRSPSRGI*TSARESGSK 
SPPQG/TARDSRQALRSAMVTKTSWRMELDRECGRSSCTSTFCVSAA 
SRGFLLQTSFRWE 

5003 

A 

1 

2487 

MLRSLRLQQEWLEWEDRRRAAAQQCRSRRCPSSPRARLTRPHRSCRD 
PAVHQALFSGNIjQQVQTUjFQDEEAANMIVETVSNQIjAWSAEQGFWVI* 
TPKTKQTAPLAIATARGYTDCARHLIRQGAELDARVGGRAALHEACA 
RAQFDCVRLLLTFGAKANVLTEEGTTPLHLCTIPESLQCAKLLIiEAG 
ATVNLAAGESQETPLHVAAARGLEQHVALYLEHGADVGLRTSQGETA 
LNTACAGAEGPGSCRRHQAAARRLLEAGADARAAGRKRHTPLHNACA 
NGCGGLAELLLRYGARAEVPNGAGHTPMDCALQAVQDSPNWEPEVLF 
AALLDYGAQPVRPEMLKHCANFPRALEVLLNAYPCVPSCETWVEAVL 
PELWKEHEAFYSSALCrWNQPRQLQHLARLAVRARLGSRCRQGATRL 
PI.PPLLRDYLLLHMKRSRCRDRPQPPPPDRREDGVQRAAELSQSLPP 
RRRAPPGRQRLEERTGPAGPEGKEQPPALASQSABIAASARLPPRLG 
SEERLCLAAHRLGCEPPLCIiAAQSGNIiGDNGLPAGRQELKQGTHEET 
TALAQVRAEGLPVLVWMETEDLQVMSFRISLSRHGPKPSHPKSDSFS 
LAGSKSLLPFKDLKHTVNDEEGRVJSAIPDTDGPSARAGGQTDPEQEE 
GPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPDPAWMARLSL 
PLTTNYRDNVISPDAAATEEPRTFQTFGKAEAPELSPTGTRIASTFV 
SEMSSLLEMLLEQRSSMPVEAASEALRRIiSVCGRTLSLDIiATSAASG 
MKVQGDPGGKTGTEGKSRGSSSSSSSSRCL 

5004 

A 

227 

2119 

RCRKEMFSLCWAYYCI * KLQQI PMRLAPLPTLDPV * SP/ GGASTATN 
SGSSVTSSGVSTATISGSSVTSNGVSIVTNSEFHTTSSGISTATNSE 
FSTASSGISIATNSESSTTSSGASTATNSESSTPSSGASTATNSDSS 
TTSSGASTATNSDSSTTSSEASTATNSESSTTSSGASTATNSESSTV 
SSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSESSTTSS 
GASTATNSESSTPSSGAGTATNSESSTTSSGAGTATNSESSTVSSGI 
STVTNSESSTPSSGANTATNSESSTTSSGANTATNSDSSTTSSGAST 
ATNSESSTTSSGASTATNSESSTTSSGASTATNSGSSTTSSGTSTAT 
NSESSTVSSGASTATTSESSTTSSGASTATOSESSTVSSGASTATNS 
ESSTTSSGANTATNSGSSVTSAGSGTAAIiTGMHTTSHSASTAVSEAK 
PGGSLVPWEIFLITLVSWAAVGLFAGLFFCWSACVPLCECRDALK 
MPGGLEQLLTESRIVHWGTEGRPIATAEGWVL.LLAMKIRLAAEAKIK 
CFLDGMAAGLRRSSLSRLGSQAEALPTSQTMGGQAETIiLSSQTGSRP 
NRGLFSHLRRWAAGQRRSLTS 

5005 

A 

124 

670 

FQRTKLLNGPGDVETGTS I TVPQKKWLHVI S P I FVQS \ LTLPFIAKW 
GDRLQLLQ I ELAAREVSD I * EETV* NETYLLLLC SRKTLDTLKWAHS 
IPSYARLFYI * *FSCSLKUVFSQFLLPA\DPYGVAVGGTVGHCLCTG 
LAVIGGRMIAQKISVRTVTIIGGIVFLAFAFSALFISPDSGF 

5006 

A 

3 ! 

273 

AAAPGNGRASAPRLLLLFLVPLLWAPAAVRAGPDEDLSHRNKEPPAP 
AQQLQPQPV\AVQGPEPARVEVSGPGWGERGCRAGCAEYQAPGL 

5007 

A 

2 

986 

AAAAPGNGRASAPRLLLLFLVPLLWAPAAVRAGP\DEDLSHRNKEPP 
APAQQIiQPQPV\AVQGPEPARVEKIFTPA\APVHTNTEDPATQ\TNL 
G\FIHAFVAAIS\VII\VSEI»G\DRTFVIAAIMAMRYNRPGPCWAGA 
MLCIi/AGLMTCLFS/VLFGYATTVIPRGIiYILMFQPVLFAlFGIRMIi 
REGLK\MSPDEGQEELEEVQA\ELKKKDEEFQRTK\LLNG\PGDVET 
GTS I TVP \ QKKW \ LHFI S P I FGQ ALTLTFLAEWGDR S * \ LiTT IVLAA 
REDPYGVAVGGTVGHCLCTGLAVIGGRMIAQKI SVRTVTI IGGIVFL 
AFAFSALFISP 

5008 

A 

2 

439 

GVEKliVLSKLYEEGSNKRLFNVDRHVG^VAGLLADARSIJVDIAREE 
ASNFRSNFGYNIPLKHLADRVAMYVHAYTLYSAVRPFGCRIYIVHDA 
VKDKAFELELSWVGELTNGRHEIVPKDIREEAEKYAKESLKEEDESD 
DDNM 

5009 

A 

3 

366 

RDSWEILFEQDNKEQSVATLILDSLIQCPIDTRKQLAENLEMMFDV 
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GKTQPPLLKRAFSTEK* KFD * / TINLAS YQI FNQL* ANCTKYVGCFV 
IEDYSGSESILCLLFAIilYNSFDFVSLV 

5010 

A 

3 87 

446 

HLLQHLL YF KFLLQSNVT I 

5011 

B 

16 

348 

MVLDCGYRESLVLPIYEGIPVLNCWGAL.PLGGKALHKELETQLLBQC 
TVDTSVAKEQSLPSVMGSVPEGVLEDIKARTCFVSOLKRGLKIQAAK 
FNIDGNNERPSPPPKC* 

5012 



~J ~> 

MGCP PVGOAG LE IjLTS AR TC FV«! DT .K"R Gl ,KT Ci A A TCPU THnUMPPPTn 
TRKQLAENLWIGGTSMLPGFLHRLIjAEIRYLVEKPKYKKAIjGTKTF 

rihtppakancvawlg/ gkvfvgi *qveyamkavensstaigirckd 
gwlgvrki s p f 

5013 

A 

3 

461 

Y I EDDMNP FQGNLE \ EQR PRKS KTFL * QGAAEQI KHI LANFKNYQ \ F 

figenmis\dgm\vllldwkgva/cppymiffk\dgld^kmlnkcg 

KXiFWIYHPVIHJWASALSSHTTTPJDIjRQNWTDCHPGAIjPFIIjDWGFY 

lgvgghcfyekpcpw 

5014 

A 

3 

325 

rrvssvtqagvqwhglgslqppppgfklfclslprsqdyrcmpprpa 
nfc/ iflvetgfhpah*dklvsns * pcgppapasqsagitgvt/hra 
hlpcnv/c*ldlfhhhv 

5015 

A 

1593 

4449 

PSGLGNTCIYRRGPHLPNSHPJxAARGRISADRJRFPJILGHP^LHPAPC 
VPGRGEPHPAPCVPGHGElxHPALCVPGRGOLHS VPC I PGRGEPH/ RC 
TLCPRSWA/RCTLHPRDLRi^IREIQPGAFRRLRNIxNTL 
KRIPSGAFEDLENLKYL.YLYKNEIQSIDRQAFKGLASLEQLRLDSNT 
LHCDCEILWLADLLKTYAESGNAQAAAICEYPRRIQGRSVATITPEE 
LNCERPRITSEPQDADVTSGNTVYFTCRAEGNPKPEIIWLRNKSLER 
HAAAJ5TGSLSSRVDLTCLPNKGSLLLCPQTCCGPVKAKTAXAAAVRV 
TEG S PTADHYMRPGS EPHLS SLTCRIjAItPN I QSHLQATLTCWLAVFL 
TG I S VASNVTSLLFTCFC I KGTKVTCLQ VGNEL SMKTDS RLNI»LDDGT 
LiM I QNTQETD(^I YQCA1AJ<1WAGEVKTQEVTLR YFGS PGS FLGNRVC 
TSSVSVCTATSLQPQNTEVLVGESVTLECSATGHPPPRISWTRGDRT 
PLPVDPRVN ITPSGGLYI QNWQGDSGE YACSATNNIDS VHATAF 1 1 
VQALPQFTVTPQDRWIEGQTVDFQCEAKGNPPPVIAWTKGGSQLSV 

EEEAPLCFLLRESGSPQSLLSSVGPAAQHMPETAPPSFFDVIDSFSW 
ILYICLPVTPVFASIPSDTTVKVGAITVQLPCSSQGEPEPAITWNKIXJ 
VQVTESGKFHISPEGFLTINDVGPADAGRYECVARNTIGSASSGVST 
CC KKMVQKG PR VG P V I TNSRRHL WRS ANAF VG WTL S S VL S QTYRDA 
MRKFPAEEHLPDVSRNGDPFVATS IVEAI ATVDRAXNS TRTHLFDRY 
RGAVNTLTLKSLSFRVKGTPSMPVIVLGQGLQSQTEEIKTSALKDQT 
FQRNADGKPERGPCT 

5016 

A 

1 

3174 

GPCVAVSVSWVCPSHAICEVWSSGLGFLHGHKNPSLGWSCGKLLVAG 
GPPF IQ PSGS AGAGGEWVLVEWVLVERVIiVE SGCWWRVGAGGVGAGG 
EWMLVESGCWWSGCWWSGCWWRVGAGGAGAGGAGAGGEWVLVEWVljV 
ESGCWWRVGACGAGAGRVDAGGAGAGGEWVLVERVLVERVLVESGCW 
WRVGAGGAGAGRVDAGGVGAGGEWVLVERVLVERVLVESYHY11DLVS 
PQYLI^IAl\n_SGCTAHRRVNNCSDM 

ASLTAFERLLKS VYENG FNTPRG INPHRL YNGHALPMPRLVS TTLI G 
TExVTPDEQFxTiMLMQWGQFIJDro^ 

SOTPPCFSVMIPPNDSRARSGARCMFFVRSSPVCGSGMTSLLMNSVY 
PREQINQLTSYIDASNVYGSTEHEARSIRDLASHRGLLRQGIVQRSG 
KPLLPFATGPPTECMRDENES P IPCFLAGDHRANEQLGIiTSMHTLWF 
REHNR I ATELLKLNPHWDGDTI YYETRKI VGAE I QH I TY QHWLPKI It 
GEVGMRTLGEYHGYDPG INAGI FNAFATAAFRFGHTLVNPIjIjYRIjDE 
NFQPIAQDHLPLHKA^FSPFRIVNEGGIDPLLRGLFGVAGKMRVPSQ 
LLNTELTERLFS^lAJ^WALDLAAI^ 

AAHTFEDLKNEIKNPEIP^KLKRLYGST1_NIDLFPAJ_.VVEDLVPGSR 
LGPTLMCLLSTQFKRLRDGDRLW YENPG VFS PAQLTQ I KQTS LAR I L 
CDNADNITRVQSDVFRVAEFPHGYGSCDE I PRVDLRVWQDCCEDCRT 
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RGQFNAFSYHFRGRRSLEFSYQEDKPTKKTRPRKIPSVGRQGEHLSN 
STSAFSTRSDASGTNDFREFVLEMQKTITDLRTQPRDQVYIPQAAVR 
VDSVWQVLLPPQPRDQVYIPQAAVRVDSVWQVLLPPQPRDQIKKLES 
RLSTTECVDAGGESHANNTKWKKDACTICECKDGQVTCFVEACPPAT 
CAVPVNIPGACCPVCLQKRAEEKP 

5017 

A 

1 

4759 

SRPWWLRASERPSAPSAM\AYRSRGPGRRCLLALVLFCAWGTLAWA 
QKPGAGCPSRCLCFRTTVRCMHLLLEAVPAVAPQTSILDLRFNRIRE 
IQPGAFRRLRNLNTLLLNNNQlKRIPSGAFEDLENLKYLYIiYKNElQ 
S I DRQ AF KGLAS LEQL YLH FNQ I ETLDPDS FQHLPKLERLFLHNNR I 
THLVPGTFNHLESMKRLRLDSNTLHCDCEILWLADLLKTYAESGNAQ 
AAAICEYPRRIQGRSVATITPEELNCERPRITSEPQDADVTSGNTVY 
FTCRAEGNPKPEIIWLRNNNELSMKTDSRIjNLLDDGTLMIQNTQETD 
QGIYQCMAKNVAGEVKTQEVTLRYFGSPARPTFVIQPQNTEVLVGES 
VTLECSATGHPPPRISWTRGDRTPLPVDPRVNITPSGGLYIQNWQG 
DSGEYACSATNNIDSVHATAFIIVQALPQFTVTPQDRVVIEGQTVDF 
QCEAKGNPPPVIAWTKGGSQLSVDRRHLVLSSGTLRISGVALHDQGQ 
YECQAVN 1 1 G S QKWAHLTVQPRVTPVFAS I P SDTTVEVGANVQLPC 
SSQGEPEPAITWNKDGVQVTESGKFHISPEGFLTINDVGPADAGRYE 
CVARNTIGSASVSMVLSVNVPDVSRNGDPFVATSIVEAIATVDRAIN 
STRTHLFDSRPRSPNDLUUiFRYPRDPYTVEQARAGEIFERTLQIilQ 
EHVQHGLMVDLNGTS YHYNDLVS PQ YLNL I ANLSGCTAHRRVNNCSD 
MCFHQKYRTHDGTCNNLQHPMWGAS LTAFERLLKS VYENG FNTPRG I 
NPHRLYNGHALPMPRLVSTTLIGTETVTPDEQFTHMLMQWGQFLDHD 
LDSTWALSQARFSDGQHCSNVCSNDPPCFSVMIPPNDSRARSGARC 
MFFVRSSPVCGSGMTSLLMNSVYPREQINQLTSYIDASNVYGSTEHB 
ARSIRDLASHRGLLRQGIVQRSGKPLLP\FATGPPTECMRDENESPI 
PCFLAGDHRANEQLGLTSMHT1.WFR\EHNRIATELLKLNPHWDGDTI 
\ Y \ YETRKI VG AE IQH I TYQHWLPKI LGEVGMRTLGEYHG \ YDPGIN 
A\GIFNAFAT\AAFRFGHTLVNPI»LLPGIiDENF\QP\IAQDHLPIjHK 
AFFSPFGIVNEGGIDPLLRGLFGVAGKMRVPSQLLNTELTERL\FSM 
AHTAALDLAGINIQRGRDHGIPPYHDYRVYCNLSA\AHTFEDLKNEI 
KNPEIREKLKRLYGSTLNIDLFPALWEDLVPGSRLGPTLMCLLSTQ 

pi/TJT DTVTl'DT tilVPUDPl ft? C 15 TV AT m/\TVrtft *T R t» T T VnVTTVrMJTT«T51TAC 

F JU<ljRD<jDKijW i hJtk'sj Vr S FAQL»TQ I KQA* IiAR X I YDNADW XTRVQS 
NVFRVLEFPHGYDSCDEILRVDLRVWQDCCEDCRTRGQFNAFSNHFR 
npp qt ( p PQYn i P*T>KP*r'KTJ r pp pp ttt p QVf7T?onT?TJT. qtjqtpc a ts* qtp cna q 

Ui\i\o i-iH, r o Jt \£d±jr\Jr X JV1>< X rCxrl\.I\.X Jr O VorC^vjUtxTJLlOiMO X Onf O XI\OJL//iO 

GTNDFREFVLEMQKTITDLRTQIKKLESRIiSTTECXVDAGAESHANN 
\ TK*KK\DACTICECKDGOVTCF\ VEACPP\ AT\ CAVPVNIPGAC\ C 
PVCLQK\RAEEKAPRLPGEAPSEFCLACAIVRSGGRWQGAADLQTRK 
HPELRDISLTTSKLGAVYRKAVQEGFPTRAFCGEGRHSRCWKGSRQE 
S 

5018 . 

A 

2 

267 

ADLSAEAL*xT^EWDDIFKVLKTSS/LGQPKILYPSKLSL.INEASSS 
SSSSSSSS/RE FTTRLVL * EMLKG I LHMBAQGQ YLPS * KHTKV 

5019 


\ 

x. o *± 

TPVWFWnTFQKA AY&T.FVVT?nriV&PPT.T.Q'Dar2 C / Qf2&P& CPZVPQ CDP 
XlvVXNlxiii^yxijO rs_MJ-\ x /VXj x* X JvKyU V /VfCK J_i1j o Jr /-llj o / ob/irAbirnCoorr 

SSEFMDVN 

5020 

A 

1 

712 

EILIIHLKRFSYTKFSREKl^TLVEFPIRSGARERMAGGRQGKEGVY 
QY * P S PHPQ\DLDFSEFVI QPQNESl^PELYKYDL I AVSNHYGGMRDG 
HCMCQAVGGACPGG SGQGGDQDLPSE * LGM * ASGEGS SWGRK* TRS 
EIWTLSEEARKGRRG*LSFPFR\TTFACNKDSGQWHYFDDNSVSPVN 
ENQIESKAAYVLFYQRQDVARRLLSPAGSSGAPASPACSSPPSSEFM 
DVN 

5021 

A 

3 

3090 

I PLLQLLLRRLWRRHG&WTEDRE PQHEELPGLD SQWRQ I ENGESGRE 

RPLRAGESWFLV^KHWYKQWEAYVQGGDQDSSTFPGCINNATLFQDE 

Il^RLKEGLV^GEDYVLLPARAWHYIiVSWYGLEHGQPPIERKVIELP 

NIQiOTEvTPVELLLVRHOTLGKSHWQFSHTDSIGLVliRTARERFLV 

EPQEDTRLWAKNSEGSl^RLYDTHITvXDAAliETGQLlIMETRKra 

TWPSAQLHVMNN1TOSEEDEDFKGQPGICG 
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NVPQLTEYFIiNNCYLEEIiNFRNPLGMKGEIAEAYADLVKQAWSGHHR 
SIVPHVFKNKVGHFASQFLGYQQHDSQELLSFLIiDGLHEDLNRVKKK 
EYVELCDAAGRPDQEVAQEAWQNHKRRNDSVIVDTFHGLFKSTLVCP 
DCGNVSVTFDPFCYLSVPLPISHKRVLEVFFIPMDPRRKPEQHRLW 
PKKGKISDLCVALSKHTGISPERMMVADVFSHRFYKLYQLEEPLSSI 
LDRDDIFVYEVSGRIEAIEGSREDIWPVYLRERTPARDYNNSYYGIi 
MLFGHPLLVSVPRDRFTWEGLYNVLMYRLSRYVTKPNSDDEDDGDEK 
EDDEEDKDDVPGPSTGGSLRDPEPEQAGPSSGVTNRCPFLLDNCLGT 
SQWPPRRRRKQLVQLQ\TVNSNGHNRLHHSPCTKSNAKPVHCYSTWK 
P \ EMKK\ R YYDE * EGEGL REGNAFRRG YV * RKAP VRLQEC I ELFTTV 
EPPPFGGK\EREKPW\YCP\SCKQHQLAT\KKL\DLWM\LPEILIIH 
LKTFFPYTQVLPEKKLEHPSWKFPYPGTLDFS * /EFVIQAHQNEVEI 
REL YK\ YDL I \ AVS\NH\ YGGMRDGTLTQHLACNKDSGQWHLIiLIDN 
SVFPLFN*GIQIRVPRAAYVL\FYQRQ/DTLARRLLSPCRAHLAPSV 
LPCLHAPSPQLVSSCDVILRALiGPAPEKKKKKALSAISLLVSAPLLI* 
FVLGAPARHCRLSRGYCSPVPLNRSLPGKNRSCLLLAVRAPPCVCPS 
SSDPPF 

5022 

A 

1 

1584 

MTISTAIPAPTDLKFTQVTPTSLSAQWTPPNVQIiTGYRVRVTPKEKT 
GPMKEINLAPDSSSVVVSGLMVATKYEVSVYALKDTLTSRPAQGVVT 
TLENVS PPRRARVTDATETTITI SWRTKTET ITGFQVDAVPANGQTP 
IQRTIKPDVRSYTITGLQPGTDYKIYLYTIiNDNARSSPVVIDJVSTGL 
E PGTEYT I YVI ALKIWQKSEPLIGRKKTDEIiPQIjVTLPHPNLHG PE I 
LDVPSTVQKTPFVTHPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGF 
RRTTPPTTATPIRHRPRPYPPNVGEEIQIGHIPREDVDYHLYPHGPG 
LNPNASTGQEALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFRVP 
GTSTSATLTGLTRGATYWIVEALKDQQRHK\mEEVVTVGNSVNEGL 
NQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRC 
DSSRWCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFEFTRNS 
GLGSSIWESLI 

5023 

A 

1 

3519 

MLRGPGPGLLLLAVQCLGTAVPSTGASKSKRQAQQMVQPQSPVAVSQ 
SKPGCYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESKPEAE 
ETCFDKYTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRISCTIANRC 
HEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCKPIDRCNDQ 
DTRTSYRIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHTSVQTTSS 
GSGPFTDVRAAWQPQPHPQPPPYGHCVTDSGVVYSVGMQWLKTQGN 
KQMLCTCLGNGVSCQETAVTQTYGGNSNGEPCVXiPFTYNGRTFYSCT 
TEGRQPGHLWCSTTSNYEQDQKYSFCTDHTVIiVQTRGGNSNGALCHF 
PFLYNNHNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMAAHEEI 
CTT^GVMYRIGDQWDKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRD 
QCIVDDITYNVNDTFHKRHEEGHMLNCTCFGQGRGRWKCDPVDQCQD 
SETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQPLQTYPSSSG 
PVEVFITETPSQPNSHPIQWN/LPQPSHISKYILRWRPKNSVGRWKEA 
TIPGHLNSYTIKGLKPGWYEGQIilSIQQYGHQEVTRFDFTTTSTST 
PVTSNTVTGETTPFS PLVATSES VTE ITAS S F WS WVS ASDTVSGFR 
VEYELSEEGDEPQYLDLPSTATSVNIPDLIiPGRKYIVNVYQISEDGE 
QSLILSTSQTTAPDAPPDTTVDQVDDTSIVVRWSRPQT^PITGYRIVY 
S PS VEGS STELNLiPETANS VTLSDLQPGVQYNITI YAVEENQESTPV 
VIQQETTGTPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRV 
DVT PVNLPC5EHGORLP I S RNTFAE VTGL *5 PGVTY YFKVFAVSHGRES 
KI^LTAQQTTKLDAPTNLQFVNET 

TRRGQPRQYWGPSVSKYPLRNLQPASEYTVSLVAIKGNQESPKATG 
VFTTRHHSPRKMSGI^TPVTOTIaH^^ 

CEWLYVXi ITHYECTC PLPTMSGCACPS PAVGGCAS VTCRGWLCMYVT 
RRGWLCVSVTCRGWLCVSVACHGWLCMFCPHHHLGLQHFAIX3HGG 

5024 

A 

3 

673 

LDAPTNLQFVNETDSTVLVRWTPPRAQITGYRLTVGLTRRGQPRQYN 
VGPSVSKYPLRNLQPASEYTVSLVAIKGNQESPKATGVFTTRHHSPR 


I03UW 
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KMSGMNTPWRTLHMHRCSLSHPGFITHSERLCVPKIHCEWLYVIjITH 

YECTCPLPTMSGC^CPSPAVGGCASVTCRGWLO^ 

CRG WLCV S VACHGWLCM F C PHHHLGIjQHF ALGHGG 

5025 

A 

3 

2383 

QTEMTIEGLQPTVEYWSVYAQNPSGESQPLVQTAVTNIDRPKGLAF 
TDVDVDSIKIAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTA 
ELQGLRPGS EYTVS WALHDDMESQPIj IGTQS TAI PAPTDLKFTQVT 
PTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEINLAPDSSSWVSG 
LMVATKYEVSVYALKDTLTSRPAQGWTTLENVSPPRRARVTDATET 
TITISWRTKTETITGFQVDAVPANGQTPIQRTIKPDVRSYTITGLQP 
GTDYKIYLYTLNDN7VRSSPWIDASTAIDAPSNLRFLATTPNSLLVS 
WQPPRARITGYIIKYEKPGSPPREWPRPRPGVTEATITGLEPGTEY 
T I YVI ALKNNQKS EPLI GRKKTDELPQLVTLPHPNLHG PE I LDVPS T 
VQKTPFVTHPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGFRRTTPP 
TTATPIRHRPRPYPPNVGEEIQIGHIPREDVDYHIiYPHGPGLNPNAS 
TGQEALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFRVPGTSTSA 
TLTGLTRGATYNIIVEALKDQQRHKVREEWTVGNSVNEGLNQPTDD 
SCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCDSSRWC 
HDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDPHEATCYDD 
GKTYHVGEQWQKEYLGAICSCTCFGGQRGWRCDNCRRPGGEPSPEGT 
TGQSYNQYSQRYHQRTOTNVNCPIECFMPLDVQADREDSRE 

5026 

A 

2 

2349 

TFDNLSPGLEYNVSVYTVKDDKESVPISDTIIPAVPPPTDLRFTNIG 
PDTMRVTWAPPPSIDLTNFLVRYSPVKNEEDV/^ELSISPSDNAWLT 
NLLPGTEYWSVSSVYEQHESTPLRGRQKTGLDSPTGIDFSDITANS 
FTVHW I APRATITG YRIRHHPEHFSGRPREDRVPHSRNS ITLTNLTP 
GTEYVVSIVALNGREESPLLIGQQSTVSDVPRDLEVVAATPTSLLIS 
WDAPAVTVRYYRITYGETGGNS PVQEFTVPGSKSTATISGLKPGVDY 
TI TVYAVTGRGDS PASSKPI S INYRTB I DKPS QMQVTDVQDNS I S VK 
WLPSSSPVTGYRVTTTPKNGPGPTKTKTAGPDQTEMTIEGLQPTVEY 
WSVYAQNPSGESQPLVQTAVTTIPAPTDLKFTQVTPTSLSAQWTPP 
NVQIiTGYRVRVTPKEKTGPMKE INLAPDS SS VWSGLMVATKYEVS V 
YALKDTLTSRPAQGWTTLENVSPPRRARVTDATETTITISWRTKTE 
T ITGFQVDAVPANGQTP IQRTI KPDVRS YTITGLQPGTDYKI YLYTL 
NDNARS S PWIDASTAIDAPSNIiRFIiATTPNSLLtVSVJQPPRARITGY 
IIKYEKPGSPPREWPRPRPGVTEATITGLEPGTEYTIYVIALKNNQ 
KS EPLI GRKKTVQKTPFVTHPG YDTGNG I QLPGTSGQQPS VGQQMI F 
EEHGFRRTTPPTTATPIRHRPRPYPPNVGQEALSQTTISWAPFQDTS 
EYIISCHPVGTDEEPLQFRVPGTSTSATLT 

5027 

A 

1 

7338 

MLRGPGPGLLLLAVLCLGTAVPSTGASKS KRQAQQMVQPQS PVAVSQ 
SKPGCYDNGKHYQINQQWERTYLGNVLVCTCYGGSRGFNCESKPEAE 
ETCFDKYTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRISCTIANRC 
HEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWTCKPIAEKCFD 
HAAGTSYVVGETWEKPYQGW^IMVDCTCIJGEGSGRITCTSRNRCNDQD 
TRTSYRIGDTWSKKDNRGNLLQCICTGNGRGEWKCERHTSVQTTSSG 
SGPFTDVRAAVYQPQPHPQPPPYGHCVTTDSGVVYSVGMQWLKTQGNK 
QMLCTCLGNGVSCQETAVTQTYGGNIiNGEPCVLPFTYNGRTFYSCTT 
EGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTQGGNSNGAIjCHFP 
FL YNNHNYTDCTS EGRRDNMKWCGTTQNYDADQKFGFCPMAAHEE I C 
TTNEGVMYRIGDQWDKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQ 
CIVDDITYNVNDTFHKRHEEGHMLNCTCFGQGRGRWKCDPVDQCQDS 
ETGTFYQIGDSWEKYVHGVRYQCYCYGRGIGEWHCQPLQTYPSSSGP 
VEVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEAT 
IPGHLNSYTIKGLKPGVVYEGQLISIQQYGHQEVTRFDFTTTSTSTP 
VTSNTVTGETTPFSPLVATSESVTEITASSFWSVA^SASDTVSGFRV 
EYELSEEGDEPQYLDLPSTATSVNIPDLLPGRKYIVNVYQISEDGEQ 
SLILSTS QTTAPDAPPDPTVDQVDDTS I WRWS RPQAP I TGYRIVYS 
PSVEGSSTELNLPETANSVTLSDLQPGVQYNITIYAVEENQESTPW 
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IQQBTTGTPRSDTVPSPRDLQFVEVTDVKVTIMWTPPESAVTGYRVD 

VI PVNLPGEHGQRLP I S RNTFAEVTGLS PGVTYYFKVFAVSHGRES K 

PLTAQQTTKLDAPTNLQFVNETDSTVliVRWTPPRAQITGYRIiTVGLT 

RRGQPRQYNVGPSVSKYPLRNLQPASEYTVSLVAIKGNQESPKATGV 

FTTLQPGSS I PPYNTEVTETTI VITWTPAPR IGFKLGVRPSQGGEAP 

REVTSDSGSIWSGLTPGV^YVYTIQVLRDGQERDAPIVNKWTPLS 

PPTNLHLEANPDTGVLTVSWERSTTPDITGYRITTTPTNGQQGNSLE 

EVVHADQSSCTFDNLSPGLEYNVSVYTVKDDKESVPISDTIIPEVPQ 

LTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFV 

YS S VG Y YTVTGLE PG ID YD I S V I TL INGGES APTTLTQQTAVPPPTD 

LRFTN IGPDTMRVTWAPPPS IDLTNFLVRYS PVKNEEDVAELS I S PS 

DNAWLTNLLPGTEYWSVSSVYEQHESTPLRGRQKTGLDSPTGIDF 

SDITANSFTVHWIAPRATITGYRIRHHPEHFSGRPREDRVPHSRNSI 

TLTNIiTPGTEYWSIVALNGREESPLLIGQQSTVSDVPRDLEWAAT 

PTS LL I S WDAPAVTVR YYR I TYGETGGNS PVQE FTVPGS KS TATI SG 

LKPGVDYT ITVYAVTGRGDS PAS SKPIS INYRTE I D KPSQMQVTDVQ 

DNS I S VKWLPS S S PVTGYRVTTTPKNGPGPTKTKTAGPDQTEMTIEG 

LQPTVEYWSWAQNPSGESQPLVQTAVTNIDRPKGLAFTDVDVDSI 

KIAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTAELQGLRPG 

SEYTVSWALHDDMESQPLIGTQSTAIPAPTDLKFTQVTPTSLSAQW 

TPPNVQLTGYRVRVTPKEKTGPMKEINLAPDSSSVVVSGLMVATKYB 

VSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETTITISWRT 

KTETITGFQVDAVPANGQTPIQRTIKPDVRSYTITGLQPGTDYKIYL 

YTLNDNARS S P W I DAS T A I DAP SNLRFLATT PNS LL VS WQPPRAR I 

TGYIIKYEKPGSPPREWPRPRPGVTEATITGLEPGTEYTIYVIALK 

NNQKSEPLIGRKKTDELPQLVTLPHPNLHGPEILDVPSTVQKTPFVT 

HPGYDTGNGIQLPGTSGQQPSVGQQMIFEEHGFRRTTPPTTATPIRH 

RPRPYPPNVGQEALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFR 

VPGTSTSATLTGLTRGATYNIIVEALKDQQRHKVREEVVTVGNSVNE 

GLNQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHF 

RCDSSRWCHDNGV1JYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDP 

HEATC YDDGKT YHVGEQWQKE YLGAI CS CTCFGGQRGWRCDNCRRPG 

GEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPIECFMPLDVQADREDS 
RE 

5028 

A 

2 

7106 

KSKRQAQQMVQPQSPVAVSQSKPGCYDNGKHYQINQQWERTYLGNVL 
VCTCYGGSRGFNCESKPEAEETCFDKYTGNTYRVGDTYERPKDSMIW 
DCTCIGAGRGRISCTIANRCHEGGQSYKIGDTWRRPHETGGYMLECV 
CLGNGKGEWTCKPIAEKCFDHAAGTSYVVGETWEKPYQGWMMVDCTC 
LGEGSGRITCTSRNRCNDQDPRASYRIGDPWSNKVHRGNLLQC\ICT 
GNARGEWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHC 
VTDSGWYSVGMQWLKTQGNKQMLCTCLGNGVSCQETAVNQNYR\GN 
SNGEPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFC 
TDHTVIjVQTQGGNSNGALCHFPFLYNNHNYTDCTSEGRRDNMKWCGT 
TQNYDADQKFGFCPMAAHEE I CTTNEGVMYR I GDQWDKQHDMGHMMR 
CTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVNDTFHKMIEEGHMLN 
CTCFGQGRGRWKCDPVDQCQDSETGTFYQIGDSWEKYVHGVRYQCYC 
YGRGIGEWHCQPLQTYPSSSGPVEVFITETP\SQP\NFPPIQWNAPQ 
PSHISKYILRWRPKNSVGRWKEATIPGHLNSYTIKGLKP/GVWVYEG 
QLISIQQYGHQELTRFDFTTTSTSTPVTSNT\VTGETTPFSPLVATS 
ESVTEITASSFWSWVSASDTVSGFRVEYELSEEGDEPQYLDLPSTA 
TSVNIPDLLPGRKYIVNVYQISEDGEQSLILSTSQTTAPDAPPDPTV 
DQVDDTSIWRWSRPQXPITGYRIVYSPSVEGSSTELI^LPETANSVT 
LSDLQPGVQYNITIYAVEENQESTPWIQQETTGTPRSDTVPSPRDL 
QFVEVTDVKVTIMWTPPESAVTGYRVDVIPVNLPGEHGQRLPLSRNT 
F\AEVTGLSPGVTYYFKVFGQ*ATGRESKPLTAQQTTKL\DAPTNLQ 
FVNGTDSTVL/VLRWTPL\RAQITGYRLTVGLTRRGQPRQYNVGPSV 
SKYPLRNLQPASEYTVSLVAIKGNQENPKATGVFTTLQPGSSIPPYN 
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TBVTETTIVITWTHAPRIGFKLGVRPSNPGEAPREVTSDSGSIWSG 
LTPGV^YVYTIQVLRDGQDRDAP\lWK\VVTPLSPPTNIiHLEANPD 
TGVLTVSWERSTTPDITGYRITTTPTNGQQGNSLEEWHADQSSCTP 
\DNLEVPGLEYNVSVYTVKDDKESVPISDTIIPAVPPPTDLRFTNIG 
PDTMRVTWAPPPSIDLTNPIiVRYSPVKNEEDVAELSISPSDNAVVLT 
NLLPGTEYWSVSSVYEQHESTPLRGRQKTGLDSPTGIDFSDITANS 
FTVHWIAPRATITGYRIRHHPEHFSGRPREDRVPHSRNSITLTNLTP 
GTEYWSIVALNGREESPLLIGQQSTVSDVPRDLEWAATPTSLLIS 
WDAPAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVDY 
TITVYAVTGRGDS PASSKP I S INYRTE IDKPS QMQVTDVQDNS I S VK 
WL P S S S P VTG YRVTTT \ PKNG PG PTKTKTAG PDQTEMT I EGLQP TVE 
YWS VYAQNPSGE\ SQPLVQTAVTNIDRPKGLAFTDVDVDS I KI AWE 
SP\QGQVSRYRVTYSSP\EDGIHELFPAPDGEEDTAEL\QGLRPGSE 
YTVSWALHDDMESQPLIGTQSTAIPAPTDLKFTQVTPTSLSAQWTP 
PNVQLTGYRVRVTPKEKTGPMKEI\NLAPDSSSVWSGLMVATKYEV 
S V\ YALKDTLTSRPAQ\ gwttlenvs PPRRARVTDATETTITI S \ w 
RTKT * T I / TLGFQVDA \ VP ANGQTP I QRT I KPD VR S YT I TGLQPGTD 
YKIYLYTLJNDNARSSPVVIDASTAIDAPSNLRFLATTPNSLLVSWQP 
PRARITGYIIKYEKPGSPPREWPRPRPGVTEATITGLEPGTEYTIY 
VIALKNNQKSEPIilGRKKTDELPQLVTLPHPNLHGPEILDVPSTVQK 
TPFVTHPG YDTGNG IQLPGTSGQQPSVGQQM I FEEHGFRRTTPPTTA 
TPIRHRPRPYPPNVGQEALSQTTMSWAPAIjYTAEYIISCHPGGSDEE 
PLQFMDTGTATS ATLTGLTRG ATYNM I MEALKDQH \ RHKVREEWTV 
GNSVNEGLNQPTDDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLG 
FGSGHFRCDS SRM / WCH\ DNGV\ NYKIGEKWDRQGENGQMMS CTCLG 
NGKGEFKCDPHEATCYDDGKTYHVGEQWQKEYLG\AICSCTCFGGQR 
GWP/ CVDNCRRPGG E P \ S PEGT\ TGQS \ YNQYS Q\ R YHQRTTHNFKL 
PQLKCS WP \ LD VQADRRRFPES KS S F P I QRNQACLSAKI HLKLE 

5029 

A 

1 

389 

SMAASLRLLGAASGLRYWSRRLRPAAGSFAAVCSRSVASKTPVGFIG 
LGNMGNPMAKNUWLGIRLGLDPKLLAKILNMSSGRCWSSDTYNPVP 
GVMDGVPSANNYQGGFGTTLMAKDLGLAQDSATST 

5030 

A 

2 

554 

ADALGRPTRPASQSRCTRSQSVGPSGFGGGHRGE / SGRAYVWQPPYG 
SSE/HASGLRYWSRRLR PAAGS FAAVCSRS VAS KTPVGF I GLGNMGN 
PMAKNLMKHGYPLIIYDVFPDACKEFQDAGEQVVSSPADVAEKADRI 
ITMLPTSINAIEAYSGANGILKKVKKGSLLIDSSTIDPA\VSKELA 

5031 

A 

1 

1123 

MGNPMAKNLMKHGYPLIIYDVFPDACKEFQDAGEQLCPNLDVNHPFV 
QRLHAADTPWPI SHLLAAS VIGCCDI ACLFSGRVS IS SLCVTI S SQA 
DEVRAKENKP I PQH PTKI AEML WS S PADVAEKADR 1 1 TMLPTS INA 
IEAYSGANGILKKVKKGSLIilDSSTIDPAVSKEIiAKEVEKMGAVFMD 
APVSGGVGAARSGNTjT\ FMVGGVEDEFAAAQE\ LLGCNGSNVVYCGA 
VGTGQAAKICNhn^IaAISMIGTAEAMNIiGIK/S*GFDPKIiL\AKIIxN 
M\SS\GRCWSK*HFINPVPG\VMDGVPSAN*LFRVDFGTTLHWLKDL 
GLAQDS ATSTKS P I LLGSLAHQ I YRMMCAKG YS KKDFS S VFQFLREE 
ETF 

5032 

B 

13 

441 

MEGVEEKKlCEVPAVPETIiKKKRRNFAELKIKRLRKKFAQKMLRKARR 
KL I YEKAKHYHKE YRQM YRTE I RMARMARKAGNF YVPAE P KLAFVI R 
XRGINGVS PKVRKVLQLLRLRQI FNGTFVKLNKAS INMLRDCRAIYC 
MG* 

5033 

A 

1 

775 

RRVPA\VPETLKKKRRNFAELKIKRL\RKKFAQQML\RKAR\RKLIY 
EKAKH\YHKEYRQMYRT*NFEW^MGKKKLANF\YVPAEPQIWRFVI 
R I RG I \ NGVS PKGS ERFFQLLRLRQ I \ FNGNLL * KLNQG FR FNMAED 

f*spyiamggtpnlkvskmn*sykrglwaksnkkr\ ial\tdnal\i 
arslgkirhilawedliheiytv\gkrfk\eanwflwp\fklsfstk 
vemkkktthf\veggdagnr\edqinrlirrmn 

5034 

A 

193 

384 

EQGFDGHLAS W I S LKDTAKW KNGDELC S Y YQS CCTD YTAE CKPQ VTR 
GDVFTMPEDEYTVYDDG 
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5035 

A 

96 

1636 

ARSPAimPLRPLLILALLAWVAIJ^QESCKGRCTEGFNVDKKCQRD^ 
LCS\YYQSCCTDYTAECKPQVTRGDVFTMPEDEYTVYERLGEEKNNA 
TVHEQVGGPSLTSDDQAQSKG\NPEQTPVLKPEEEAPAPEVGASKPE 
GIDSRPETLHPG\RPQPPAEEELCSGKPFDAFTDLKNGSLFAFRGQY 
CYELDEKAVRPG\YPKLI\RDVWGIEGPI\DAAFT\RINLiFRGRPYL 
FKG * \ Q YW / RAFEDG V\ LG£ W I Y PRN I S WTAFDGH S PGQTWD \ AALG 
LPCPLAYSGRERV\YFFQRGKQYW/ESYQFPGTSPVQEECEGSSL\S 
AVF\EHFAMM\QRDSW\EDIF\ELLF\WGAERSGWVTR\QPQFHliAG 
EW\ HG VPG \ QVD\ AAMG\ GRI LHLQGMATRPLLWPKKK\ RFRH\ RNR 
\ KGYRS \QR\ AHSRGR* PETPRRP\ SRA\MWLS FVLPSEESNLGA\N 
NY\DDYRMDLAIiCLPTC\EPIQECFFFSFGDKYYRVNLRTRRVDTVD 
PPYP\RS IAQYWLGCPAPGHL 

5036 

C 

1 

300 

MRSFGQLTLCPRNGTVTGKWRGSHVVGLLTTLNFGDGPDRNKTRTFQ 
ATVLGSQMGLKGGAGSVWLQAAGLGLLPASLLWPSLLCHCYVIiPPAP 
GVPLV* 

5037 

A . 

359 

1229 

MPQPPTLGQEMTGPSQPWTGKGGLPGP*QLRAKAQEVDSHLGRKKIK 
QQNRSKSC 

5038 

A 

2 

311 

VRWNSAAPLVTSRGAPASARPRGQALPGGSAPSAPHGQLPGRAQPAP 
VSGPPPTSGLCHFDPAAPWP\PGLGLGMLPPHPQDWPAQP*HPPGIiG 
LFFEIFSAS 

5039 

A 

310 

415 

SQYFGTLRRVDHIiRSGVRDQPGQLGKTPS * PQVIHPPQCPKVLGLQY 
YHFLFFLRRSL/DSVAQAGVQWRDLGSLQRPPPGFTPFSCLSLPSSW 
DYRRPPPRLANFFVFS VETGF\TVSARMVS I S * PRDPPASASQSAGD 
TGVSQAPV 

5040 

A 

1 

114 

ARAEMIiIVQYILPRLTHCAIFTILFIFSLLT*VMLLSS 

5041 

A 

335 

478 

TPASLKIPVE* *NTLLAKMVSIS * PRDLPASASQSAGITGVSHRARC 

5042 

A 

135 

804 

GIDTILTLNQN*SLKTRQ*FTI,IIF/IFFFFLRWSIjALSPRPDCGLQ 
WRDLGSLQAPLPGFTPFSCLSLPSSWDYRCPPPRPANFFCNFFFSRD 
GGFTVLARMVSIS*PQ/CDPPTLASQRAG\ ITGLSHCTRLFFF*WME 
SPSVTQAGIQWHDLGSLQPMPPQFR*FSWLSLPSSQDYRCMPPCQAN 
F\CIFSRDEVSPC*PGWSGSPDLVIHLPWPPKVI,GLHA 

5043 

A 

63 

363 

TSQEPLRTSARIiLWAHSTDGRRGWSTRICRPSTDFPFMAIMASLASS 
MFSKVTNPNPRDLWSPCLSQPPLQRSVRTLRRPAPATAHWCQNSASN 
KSLPS 

5044 

A 

151 

384 

PRSPPPLENPRPRPRPPRKKPRPPAEPPRYPRDRLSDDLPAWSTRIC 
RPSTDFPFMAIMASLASSMFSKVRDGRGGGR 

5045 

B 

385 

479 

MAGAFRRRFASEVRAQGLESLLEHGLRCAGSLRGGQSLPTTMWSPVK 
VGFE* 

5046 

A 

3 

698 

VFFFFFSGCTRGPLFESDFHRAPHRCGQGLAAP*AAGAPQPRAPGET 
RGPEPAPHWRSASGDKPPGQAA\PPALVPGEGPSWDSDPRGHRCSQA 
TLPRVLAGPLQP*LS*LSL*DPPELRSD*PPL*IiRIiL**SLEPP*PP 
LLDSNRFPP*PRSPPPLENPRPRPRPPRKKPRPPAEPPRYPTESVSD 
DLPAWSTRISVHLQTSHSWPSWASIiASSMFFKVTKPNSREIiWSS 

5047 

A 

138 

550 

FCCCFTSSEHSLLAYGSCSPRRTEVLCAQQNPSETQAAPLAS\YMWK 
*PCNSRLKGPGPGLGEGWPES\RMTSGVTVPGGTSPGTRAGGS\PCP 
GGLS PEALRQ *GAGSGPRVS PGA/ PGCGAPAAYGAAS PCPQRCGAL 

5048 

A 

2 

555 

FFFFFFLGAHV7UJYSNPTFTGLHIWGKDWPPP*AAGAPQPRAPGET 
RGPEPAPHWRSASGDKPPGASGCLQIiLFQGEVPPGDS * PQRSSLALR 
PPFPRVLGPGPFQP\AKAKQRYSQRPPQGSQLAPP/GPPRKQPQPLR 
KVSG*SGGCDLRLRHRPACPGRVCTLAQLEPATRMGWSYVGQCGL 

5049 

A 

1246 

1379 

RCLFILS FFTGRRTHTP * PPLPGLRTGF PP * PRS PP PSREPTGP/ RP 
RPPREES PGPRQ * PPRVPHG IGLSDDLPAWS YSGS / GRP S TGL P I / P 
WPSWHP*RRQCSQR*QTQIPEISGSPCIiSQPPLQRSVRTLRRPAPAT 
A/PLVSKLSPPTKSLPSSDAM\RPLSRAGP*RKQEDSAPQ\DPEPPR 
NRGPGN 

5050 

A 

207 

253 

SASLRLPKCWDHRRG 
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5051 

A 

335 

419 

TFDFPNWRSYFSMSNWITSLPTLLSSWD\YRHVPLYPAKICIFSRDR 
VLPYWPG*SHDFTLFVSMTILTLQIYKMTKLDNYLETIYY 

5052 

A 

68 

399 

WELLIYFSGHTCSNLSPLLSLSFFFFF*DSLAVSAPGVQ\WHDLCSIi 

\RPLPPRFKRFWCLSLLSGWD\YRHVP\QYPAKICIFSRDRVLPILA 
KASRNVS FLTLNMLGGFS 

5053 

A 

2 

322 

AFFFFFFF*DGACSVT\RLECSGTITAHCNFRHLGSSNFPASTSRW 

EITGTCHHT*LIFVFLVETGFHHVGQAGLELLT\*VIHPPQSTKVLG 
LQWSHCTRPLSDF 

5054 

A 

16 

357 

TITIGQNFFFFFLEMGSCSVA\RLECSG\ISAHCNLKLLGSSNLPTS 
TS*VAGTTGVCATIPSYFYNKISM*NQSLPMLPSLGIiELLGLRPVFL 
PRPPKVLGITRHEPPHSALK 

5055 

A 

2 

346 

LRWSL/DSVAQGGVQWHDFGSLQPPPPGFKRLSGLSLMSSWDYRRPP 
PRLANFLHF* *RRSFTMLAR1WL»IS*PHDPPASASQSAGITGVSDHA 
QSQI FLKLQMTKSMGFKTKFLN 

5056 

A 

3 

401 

FFFWRQSLTLVTQAGVQ * LDHGSLQPPLPGPKGFSHLSPPSSWDHKH 
APPHLANFFL * RHGFTM/ RLVLTSLPQ/ CDPPAAASQS AGI TGVSHC 
AQPVIFCKKLDFSSTDIFTPSNIiQPRGFWDLTAPQGEIVST 

5057 

A 

142 

250 

PPTGSLLTPPLWDPQFASLFNGTL*WERE*PLPQNV 

5058 

A 

2 

775 

RCNSAQQQMTSSQKALMLELKSLQEEPVEGFRITLVDESDLYNWEVA 
IFGPPNTLYEGGYFKAHIKFPIDYPYSPPTFRFLTKM\WHPNIYEEW 
EMYGISILHPPV\DDPQSGELPSERWNPTQQCEGLILLRC*SHWLNE 
PQHLS PQPMSDAS VMFR\ KWERQ * RBKTKB YGLKFIRETKFSAT *GP 
KAGKGWGVEGSPTNPGREYCIKTEVP\RNDEQVQSLLYDELGMMTDI 
DDER*GRRKDAD\CYDDDEFWGMRKP 

5059 

A 

1 

571 

ALGKAGYID/ KYVTAMGI VAFVFF\ RSGKYNLDFRSRNDP / arh I / t 
PEQLVAL YKS F I REKH * V FPD * GLVQRTLFS L VFLVCLKN/ S FDCDD 
REVV?KKFT7^ACLQEAGNDLTV\NLRHIAEAVDENSCNFLQLTVD* I 
\ ACKLI QSNW * G IMVSHYSI/T*NTF I AI LVKHCTG * / INTGAPC * SE 
PLARYKELSF 

5060 

A 

32 

534 

GRERPLLSPYMETLYRVPFLVLECPNLKJ^KKPPWLHMPSAMTVYALV 
WSYFLITGGIIYDVIVEPPSVGSMTDEHGHQRPVAFLGLQSKMDNN 
INGKGLCIQAS * FTMGGFRFS * I LDP I ECTKYPKTS I \ DS FFCS IGI 
S S WLLEFFPWARSNSMRNGNCPGL I 

5061 

A 

1 

328 

FLKRQRFGLVAQAGL*WHNIX5SLQPLPPGFK*FSCLSLI J SSWDYRHL 

PPHPPNFL.LFLVETGFYHFGQAGLK3jLTD\ SGDPPAIiASQSAGI IGV 
SHRAGPGLCTFSNHI 

5062 

A 

2 

687 

GRVG FNVRWDRNDTNLI PNG SSI TVNQTNKRD YVS KY I NY I FND S VK 
AVYEEFRRGFYKMCDEDIIKLFHPEELKDV\MLGNTDYDWENI\EKN 
ARYEPGYNSSHPTIVMFWKAFHKLTLEEKKKFLVFLTGTDRLQMKDL 
NNMKITFCCPESWNERDPIRALTC/YPVSLLPSPKYSYNGNS*KKRF 
KKPINNNRGIWLTSLPCPNSLILLLLSLIiliLLIiLLFIiYFVLF 

5063 

A 

1 

527 

ISHDAIAQESKRYWQNLEANVPGAQVLGNQIMPGFLNMKIKFVCRPQ 
CPEKRSKS IEPDKNRKYCS AKARHS WT\ KDRRAMRVMS Y * \ RKKWMN 
I RPLPTKKQMPLQFDLCNH I ASGKKCQYVGNCS FAHS PE EREVWTYM 
KENGIQDMEQFYELWLKSQKNEKSEDIASQSNKEN 

5064 

A 

735 

1517 

S YSYFF I QLCNH I ASGKKCQYVGNCS FAHS PEEREVWTYMKENG I QD 
MEQFYBLWAQESKK*KK*RHRPVSPNKGKWENQIHMANRIMAGRLQW 
D FSLAGMCGG KTGNS E KQWQGH I S S EKHKEKVFHTE \ TD Q YCWQHRF 
PTGYFSICDRYMNGH/ AAQEGNSCK/ CLHMGNARTFMEWGRKGRRLP 
ij^VhAb I IUiKJUJliljIGPK\DNDFGK\ YSFLFKDLN* YAGFYGMIPNQ 
SIDQKNLKCSEAHSRGAARFPACIGVYRSS 

5065 

A 

237 

381 

LHQPPTTQTLAFTSNLRGR*RSAALPDALDSSKVSEMHRFLKRRGVN 

5066 

A 

1 

876 

^II^RSKLSEIAASTSVQEQNTTDEEKTQPVSLLPITIKLLHDLSP 
LPLLRNILAPEFLLPGTHVAQPESLTESKELEDSRGFSSIiTAHPVTjV 
WFPRGAFKTGSAS I FDGSALAAYDDVLVWVQYRLGIFGFFTTGDEH 
SPGNWGHLDQLAALHWVQDNIASFGGNPGSVTIFGGSVGGESVSVLV 
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MVW IHRGGLMVGTASTSDGRVLAAHENWWTI QHCLG I WGPFRNPN 
GEGL PHWPE YNQKEG YLQ I G ANTQAAQKLKDKE VAFWTNLFAKKAVE 
KPPQTEHIEL 

5067 

A 

1 

78 

LVRSGRKGRRRVFSLSEADSHGGHWV 

5068 

A 

155 

418 

IIIX3EVTEQAPKSMKERLVLCLAGFAVAQCINQHSSPSLSSQSPPSA 
SGSPSGSGSTSHCDSGGTSSSSTPRHSLSFPDTSFLAFLN 

5069 

B 

40 

411 

MQVTGVRTDSGALNITIHKLASSPHSKPFPIFQATFIFSDHIRCIIA 
EQRIoAKGRI PGKAHEDAENS CPP WTS QSS PPLKS WGLDS AP PPPLST 
CLSASTTRGAGAAATPQCSAPCLHRWTRCQ* 

5070 

A 

1 

987 

MVHGREEGGl^PEKLERIQLPVPNAAEKTTYNHPLAERLIRIMNd^AA 
QPDGKIRLATLELSCLLLKQQVLMSAGCIMKDVHLACLEGAREESVH 
LVRHFYKVIGRAQMDSSMGEDIFLDMFEDEYRSMTMKPMNVEYLMMD 
ASILLPPTGTPLTGIDFVKRLPCGDVEKTRRLLPRVDECGCEVTSPG 
PGRQQIJ\HAARKQEGENRAFVPGQLVGSAHHRCHPVVPARSVSSSIK 
S S S YNKVPTVPATPD Y YEGHL I LGD KVLADDGCS VQ A I RVFFMLRS L 
SLQLRGEPETQLPLTREEDLIKTDDVLDLRGLSPGSGKMAFSSPKIC 

5071 

A 

111 

2187 

DERRVGAADMFGRSRSWVGGGHGKTSRNIHSLDHLKYLYHVIiTKNTT 
VTEQNRNLLVETIRSITEHjIWGDQNDSSVFDFFLEKNMFVFFLNIL 
RQKSGRYVCVQLLQTLNILFENISHETSLYYIiLSNNYVNSIIVHKFD 
FSDEEIMAYYISFLKTLSLKLNNHTVHFFYNEHTNDFAIiYTEAIKFF 
NHPES^n^IATOTITLNWK^NQAMLHYIRDKTAVPYFSNLWFIG 
SHVI ELDDCVQTDEEHRNRGKLSDLVAEHLDHLHYLND I LI INCEFL 
NDVLTDHIiLNRLFLPLYVYSLENQDKVFLI IHHAPLVNSLAEVILNG 
DLSEMYAKTEQD IQRS S VLPTLSS LWQGSHLSLNQLQSGLHKCS SHL 
CGAQAAADS VTGE I PAI RSLEWL I SAGS KARTFFFLKML IGFWEKVD 
CE YQRRQVL S TRLQEALPSNRLTD VAAVHS S CMLGFGS TAPRGS W IG 
DPAAVHLPLPGELAEHLGSKGTTTVTKHQPQAKPSIRCFIKPTETLB 
RSLEMNKHKGKRRVQKRPNYKNVGEEEDEEKGPTEDAQEDAEKAKGT 
EGGSKGIKTSGESEEIEMVIMERSKLSELAASTSVQEQNTTDEEKSA 
AATCSESTQWSRPFLDMVYHALDSPDDDYHALFVLCLLYAMSHNK/G 
KSPEKEEGLSGTQSHPGKAGTFGKEGAEERKRAQV 

5072 

A 

2 

2774 

RNLLVETIRSITEILIWGDQNDSSVFDFFLEKNMFVFFLNILRQKSG 

RYVCVQLLQTLNILFENISHETSLYYLLSNNYVNSIIVHKFDFSDEE 

IMAYYI SFLKTLSLKl^raTVHFFYNEHTNDFALYTE 

MVRIA\mTITLNVYKVSLDNQAMLHYIRDKTAVPYFSNIjVWFIGSHV 

IELDDCVQTDEEHRNRGKLSDLVAEHLDH1.HYLNDILI INCEFLNDV 

LTDHLLNRLFLPLYVYSLENQDKGGERPKISLPVSLYLLSQVFLIIH 

HAPLWSIlAEVILNGDIiSE^1YAKTEQDIQRSSAKPSIRCFIKPTETL 

ERSLEMNKHKGKRRVQKRPNYKNVGEEEDEEKGPTEDAQEDAEKAKG 

TEGGSKGIKTSGESEEIEMVIMERSKLSEtAASTSVQEQNTTDEEKS 

AAATCSESTQWSRPFLDMVYHALDSPDDDYHALFVLCLLYAMSHNKG 

M\DPEKLERIQLPV\PNCGREGPPYNHPLG\ERFIQGSWNNV\AQPD 1 

GKIRL\ATLGA*GCIiVS*RQQVLMSAGCIMKDVHLACLEGAREESVH 

LVRHFYKGED IFLDMFEDE YRSMTMKPMNVE YLMMDAS I LLPPTGTP 

LTGIDFVKRLPCGDVEKTRRAIRVFFMLRSLSLQLRGEPETQLPLTR 

EEDL I KTDDVLDLNNSDL I ACTVI TKDGGMVQRFLAVD I YQMS LVEP 

DVSRLGWGWKFAGLLQDMQVTGVEDDSRALNITIHKPASSPHSKPF 

PILQATFIFSDHIRCIH/ iakqrfakgriqarrmkmqriaalldlpi 

QPTTEVLGFGLGSSTSTQHLPFRFYpQGRRGSSDPTVQRSVFASVDK 
VPGFAVAQCINEHSSPSLSSQSPPSASGSPSGSGSTSHCDSGGTSSS 
STPSTAQSPAVRSGRKGRRRVFSLSEADSHGGHWV 1 

5073 

A 

2 

1802 

NYAYPGVLLIGTDSHfPNGGGLGGISIGVGGRDRVDVMAGIPCELKC 
TKVIAMKLTGSLSGWTSPKDVILKVAGILTVKGGTGAIVEYHGPGVD 
S I SCTGMAT I CNMGAE I GATTS VFP YNHRMKKYLS KTGRAD I AS L»RE 
EFKDHLVPDPGCQYDQVIEINLNELKPHINGPFTPDLAHPVAEVGTV 
AEKEG W PLD I RVGL IG S CTNS S YEDMGRS AAVAKQALAHGLKCKS QF 
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TITPGSEQIRATIERDGYAQILRDVGGIVLANACGPCIGQRDRKDIK 
KGEKNTMVT S YNRNFTGRNDAFPETHAFVTS PE I VTALAI AGTLKFN 
PETDFLTGKDGKKFKLEAPDADELPRS S FDPGQDTYQQPPKDS SGQR 
VDVSPTSQRLQLLEPFDKWDGKDLEDLQILIKVKGKCTTDHISAAFP 
WLKFRGHIiDNISNNIiLIGAINIENGKANSVRNAVTQEFGPVPDTARY 
YKKHGIRWWIGDENYGEGSSREHAALEPRHLGGRAIITKSFARIHE 
TIKKQGLLPLTFADPSDYNKIHPVDKLTIQGLKDFAPGKPLKCIIKH 
PNGTQETILLNHTFNETQIEWFRAGSALNRMKELQQ 

5074 

A 

546 

802 

GQAGRARQVRSVPSSALITLPPLA*QKHG/ISRWWIGDB\NYGEGS 
\ SREHAALEPRHLGGRAI I TKS FAR IHGELES VPRPS S SHP 

5075 

A 

3 

2380 

SLSVHKMAPYSLLVTRLQKALGVRQYHVASVLCQRAKVAMTHFEPNE 
YIHYDLLEKNINIVRkRLNRPLTLSEKIVYGHLDDPASQEIERGKSY 
LRLRPDRVAMQDATAQMAMLQFISSGLSKVAVPSTIHCDHLIEAQVG 
DE KDLRRAKD I NQE VYN FLATAGDKYGVGF WS PGSG 1 1 HQ 1 1 LENYA 
YPGVLLIGTDSHTPNGGGLGGICIGVGGADAVDVMAGIPWELKCPKV 
IGVKLTGSLSGWTSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSMS 
CTGMATICNMGAEIGATTSVFPYNHRMKKYLSKTGREDIANIiADEFK 
DHLVPDPGCHYDQLIEINLSELKPHINGPFTPDLAHPVAEVGKVAEK 
EGWPLDIRVGLIGSCTNSSYEDMGRSAAVAKQALAHGLKCKSQFTIT 
PGSEQIRATIERDGYRQILRDLGGIVLANACGPCIGQWDRKDIKKGE 
KNTI VTS YNRNFTGRNDANPETHAFVTS PE IVTALAI AGTLKFNPET 
NYLTGTDGKKFRLEAPDADELPKGEFDPGQDTYQHPPKDSSGQHVDV 
SPTSQRLQLLEPFDKWDGKDLEDLQILIKVKGKCTTDHISAAGPWIiK 
FRGHIiDNISNNLLIGAINIENGKANSVG\NAVTQEFGPVPDTARYYK 
KHGI RWWI GDENYGEG SS REHAALE PRHLGGRAI \ I TKS FARDPRD 
EPGRKQGLLPFGPFA\DPADYNKIHPV\DKLTIQGLKDFTPG/RSPL 
KC 1 1 KHPNGT \ Q\ ETI LLNHTFJJETQ I E WFRAGSAIiQQK * RNLQQ 

5076 

A 

3 

203 

APRGSWMWAEDPACSVTLCPESLSGYIiPGGSGQGPSPAAGEACGTQ 
LGWQPLVGRCGPHVSVLSK 

5077 

A | 

52 

393 

EASAWPAPRGSWMWAEDPACSVTLCPESLSGYLPGGSGQGPSPAAG 
TS PSLPPRSLPGLEKLRS PSGRESRLSS SAPRAPS ISAKAQLQCQAE 
EEDPEIVNNV*FSSVTCLSIj 

5078 

C 

121 

396 

MALPGRLPHRKIJVGGTLEAPWPGIPSGAVRRHQPQPPTTIJCXWLGKV 
KKPLRKRIEAKFLCAEGPEHIRQGSAAVPGGGGRSRNCEQCLI * * 

5079 

A 

3 

755 

FWDFDRPHETRVX.QVSCGRAHSLVLTDREGVFSMGNNSYGQCGRKV 
VENEIYSESHTVHRMHDFDGQVDQVACGQDHSLFIjTDKG\KVYSCGW 
GADGQTGLGHYNI TS S PTKLGGDLAG VNV IQ VATYGDCCLAVS ADG \ 
DI FGWGNLEYLQLA/ APVTDSTQVNVPRCLHFSGVGKVRQ/ CLHAVA 
RAVQC*TGEGHVFVWGYGILGK\GPNLSGKCPSLKMIPPTLFGFDGF 
QPRNPGFPAFRCGIQATLLH 

5080 

A 

1 

446 

MPILFTTVSSALSGAQHIADAQCDRKGHRTTIQGSRVNLVQPSVGPK 
VEETQCWGPGRGMPGAPEAPRERRVRSGSRSGDPGVA/ PRAAS GVGG 
RYAITRGRASAAVPCRSGGARRGGAGGQAADCGPGH*KESESRGPEG 
PAEGSQPL 

5081 

A 

1 

744 

GGL I PRAQRE SKRRSRGRARAASG V/ VWAIiCYHPRQS ECCGTL * KWR 
SSPRRCWRTSG\DCGPGH*KESESRGPEGPAEGISASLKM*WFASGT 
CLSRCLTLRQRK*LKKIKIIWIKK/SEK/PAEAT*SEGQPPF*GPRQ 
TGAEGF*LEPPQPGMSLKLFKVILKG*DSRTKMGDQQPPRVHGRTHG 
PL O PLTPVKG OG S C P VTCG P S PS G T W P * KP"L T, * PT » A TTj * WOPRTP <? T, <3 
GLLG PGTPG AS VGHKI Y 

5082 

A 

7 

279 

GQVCIAETGVQWYHLSSLQPLPPGFKRFL\CSASRRAPPCPANF\*F 
LVEMGFHHVGQAGLQLLTLGDPPTSASQSAGITGVSHHALPAWL 

5083 

A 

5164 

5586 

FTAALNSWVIKLSSCFSLLSSWDCMYMPPCPASFFFFFLR*SFALVA 
QAGVQWCDLGSLPPPPSGFERFSCLGLPSSWDYRRPP/PTPG*FFFF 
VFLVEMGFHYVGQAGLELLTSSDLPTSASQSAGITGVSHHALPSLYK 

5084 

A 

2 

5438 

NMTPS ELMD EIISI RVYN SHSLRADCliMGEFKI DVGFVYDEPGHAVM 
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RKWLLLNDPEDTSSGSKGYMKVSMFVLGTGDEPPPERRDRDNDSDDV 
ESNLLLPAGIALRWVTFLLKI YRAEDI PQMDDAFSQTVKEIFGGNAD 
KKNLVD PFVBVS FAGKKVCTN I IEKNANPEWNQWNLQIKFPSVCEK 
I KLT I YDWDRLTKND VVGTT YLHLS KI AAS GGEVEDFS S SGTG AASY 
TVNTGETEVGFVPTFGPCYLNLYGSPREYTGFPDPYDELNTGKGEGV 
AYRGRILVELATFLEKTPPDKKLEPISNDDLLWEKYQRRRKYSLSA 
VFHSATMLQDVGEAIQFEVSIGNYGNKFDTTCKPLASTTQYSRAVFD 
GNYYYYLPWAHTKPVVTLTSYWEDISHRLDAWTLLAMAERLQTNIE 
ALKSGIQGKIPANQI^ELWLKLIDEVIEDTRYTLPLTEGKANATTVLD 
TQ I RKLRSR S LSQ IHEAAVRMR SE ATD VKS TLAE I ED WLDKLMQLTE 
EPQNSMPD 1 1 IWMIRGEKRLAYARIPAHQVLYSTSGENASGKYCGKT 
QTIFLKYPQEKNNGPKVPVELRVNIWLGLSAVEKKFNSFAEGTFTVF 
AEMYENQALMFGKWGTSGLVGRHKFSDVTGKIKLKREFFLPPKGWEW 
EGEWIVDPERSLLTEADAGHTEFTDEVYQNESRYPGGDWKPAEDTYT 
DANGDKAASPSELTCPPGWEWEDDAWSYDINRAVDEKGWEYGITIPP 
DHKPKSWVAAEKMYHTHRRRRLVT^KRKKDLTQTASSTARAMEELQDQ 
EGWEYASLIGWKFHWKQRSSDTFRRRRWRRKMAPSETHGAAAIFKLE 
GALGADTTEDGDEKSLEKQKHSATTVFGANTPIVSCNFDRVYIYHLR 
CYVYQARNLLALDKDSFSDPYAHICFLHRSKTTEIIHSTLNPTWDQT 
IIFDEVEIYGEPQTVLQNPPKVIMELFDNDQVGKDEFLGRSIFSPW 
KLNSEMD I TPKLLWHP VMNGDKACGDVXjVTAELI LRGKDGSNLPILP 
PQRAPNIiYMVPQGIR PVVQLTA I E II1AWGI1RNMKNFQMAS ITS P SLV 
VECGGERVESWIKNLKKTPNFPSSVLFMKVFLPKEELYMPPLVIKV 
IDHRQFGRKPWGQCTIERLDRFRCDPYAGKEDIVPQJJKASLLSAPP 
CRDIVIEMEDTKPLLASKLTEKEEEIVDWWSKFYASSGEHEKCGQYI 
QKGYSKLKIYNCELENVAEFEGLTDFSDTFKLYRGKSDENEDPSWG 
EFKGSFRIYPLPDDPSVPAPPRQFRELPDSVPQECTVRIYIVRGLEL 
Q PQDNNGLCDP Y I KI TLGKKVI EDRDHY I PNTLNP VFGRM YELS C YL 
PQEKDIiKISVYDYDTFTRDEKVGETIIDLENRFLSRFGSHCGIPEEY 
CVSGVNTWRDQLRPTQLLQNVARFKGFPQPIbSEDGSRIRYGGRDYS 
LDEFEANKILHQHLGAPEERLALHILRTQGLVPEHVETRTLHSTFQP 
NISQGKLQMWVDWPKSLGPPGPPFNITPRKAKKYYLRVIIWNTKDV 
ILDEKSITGEEMSDIYVKGWIPGNEENKQKTDVHYRSLDGEGNFNWR 
FVFPFDYLPAEQLCI VAKKEHFWS IDQTEFRI PPRLI IQI WDNDKFS 
LDDYLG FLELDLRHTI I PAKS PEKCRLDM I PDLKAMNPLKAKTASLF 
EQKSMKGWWPCYAEKDGARVMAGKVEMTLE ILNEKEADERPAGKGRD 
EPNMNPKLDLPNRPETSFLWFTNPCKTMKFIVWRRFKWVIIGLLFLL 
ILLLFVAVLLYSLPNYLSMKIVKPNV 

5085 

A 

1 

4294 

WAVFDGNY Y YYLPWAHTKP WTLTS YWED I SHRLDAVNALLAMAERL 
QTNIEALKSGIQGKIPANQLAELWLKLIDEVIEDTRYTLPLTEGKAN 
VTVIJDTQIRKXRSRSLSQIHEAAVRMRSEATDVKSTLAEIEDWIJ5KL 
MQLTEEPQNSMPDIIIWMIRGEKRLAYARIPAHQVLYSTSGENASGK 
YCGKTQTIFLKYPQEKNNGPKVPVELRVNIWLGLSAVEKKFNSFAEG 
TFTVFAEMYENQALMFGKWGTSGLVGRHKFSDVTGKIKLKREFFLPP 
KGWEWEGEWIVDPERSLLTEADAGHTEFTDEVYQNESRYPGGDWKPA 
EDTYTDANGDKAAS PSELTCPPGWEWEDDAWS YDINRAVDEKGWE YG 
IT I PPDHKPKSWVAAEKMYHTHRRRRLVRKRKKDLTQTASSTARAME 
ELQDQEGWEYASLIGWKFHWKQRS sdtfrrrrwrrkmapsethgaaa 
IFKLEGALGADTTEDGDEKSLEKQKHSATTVFGANTPIVSCNFDRDY 

TVUT O PV\7VniV DMT.T 7VT nVTlCT?CnriI?7\ UTr"T?T Uf O OVI^TT^ T TUCTT Mfl 

X XXIJjKL. I V J yAKttLliftJjUiUJJ) f isJJFJ? All J.v-r JjHJKoivl I E, _L ±rlo 1 JuIM Jr 

TVTOQTIIFDEVEIYGEPQTVLQNPPKVIMELFDNDQVGKDEFLGRSI 
FS PWKLNSEMDI TPKLLWHPVMNGDKACGD VLVTAEL I LRGKDGSN 
LP I LP PQRAPNLYMVPQG I RP WQLTA I E I LAWGLRNMKN FQMAS I T 
SPSLWECGGERVESWIKNLKKTPNFPSSVLFMKVTLPKEELYMPP 
LVIKVIDHRQFGRKPWGQCTIERLDRFRCDPYAGKEDIVPQLKASL 
LSAPPCRDIVIEMEDTKPLLASKCLSSMSTALSKMASPATVHLTEKE 
EE I VD WVJ SKF YAS SGEHEKCGQ Y I QKGY S KLKI YNCELENVAE FEGL 
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TDFSDTFKLYRGKSDENEDPSWGEFKGSFRIYPLPDDPSVPAPPRQ 
FRGLPDSVPQECTVRIYIVRGLELQPQDNNGIiRDPYIKITLGKKVIE 
DRDHYIPNTLNPVFGRMYELSCYUPQEKDLKISVYDYDTFTRDBKVG 
ETIIDLENRFLSRFGSHCGIPEEYCVSGVNTWRDQIiRPTQLLQNVAR 
FKGFPQPILSEDGSRIRYGGRDYSLDEFEANKILHQHLGAPEERLAIi 
HILRTQGLVPEHVETRTLHSTFQPNISQ\YYIiRVIIWNTKDVILDEK 
S I TGEEMSD I YVKGW I PGNEENKQKTDVRYRSLDGEGftFNWRFVFPF 
DYLPAEQLCI VAKKEHFWS IDQTEFRIPPRLI IQI WDNDKFSLDDY1* 
GFLELDLRHTIIPAKSPEKCRLDMIPDLKAMNPLKAKTASLFEQKSM 
KGWWPCYAEKDGARVl^GKVEMTLEILNEKEADERPAGKGRDEPNMN 
PKLDLPNRPETSFLWFTNPCKTMKFIVWRRFKWVIIGLLFLLILLLF 
VAVLIiYSLPNYLSMKIVKPNV 

5086 

A 

444 

797 

FHSLSPGASKPMLEPGLDWEPCKKLNDVTCRAKSVPIjTRHIjH*VYLLM 
LSLPPPPTPQQAPVYDVPIiPVSKCSHCQFPLWF*FAFL*WPVMMSVF 
SCVCWLHKCLLLRSVCSYPLPTF 

5087 

A 

4 89 

611 

SPCPCIiCPEWYCLGFLLGFLWFLV*HLSL*SILNYFLYKV 

5088 

A 

1 

336 

MAVLPLPSNPHDQPQNSMPDI I I WM I RGEKRLAYAR I PAH QVLYSTS 
GFLELDLRHTIIPAKSPEKCRLDMIPDLKAMNPLKAKTASLFEQKSM 
KGWWPCYAEKDGARVMAV 

5089 

A 

1 

1581 

MDEAGNHHSQQTNTGTENQTPHVLPHKWEL^ENAWTQGGEHHTLGP 
VGSVGSSGQGNQARERNKDIQLGKEEVKLAIiFADNMIVYLENPIISA 
QNLLKIilSNFSKVSGYKINVQKSQAFLYINKRQTEIQlMSELPFTTA 
SKRI KYLGTQ I TRDVKDLFKED YKQLLNKI KEDTNKLKN I PCS WI GR 
I NIMKMAI LPKG PS CKGS KIiHPYLPLPSQVALDLLGHMAQVAEQLAQ 
EPGLVAEPSPVLSPTQNSGNFPGHUDEQEMEARQSRCNHLLACCVVN 
INPDYTNEHSYPRLDKEGSRSPFLKNCMQVLPVEAAEQQSLLPVPSS 
GTF I PEGTHQVPAGAVLYEVS VNPCWEES P SQEAQGSRTHLRRDAMP 
REEESREAWLQWLCGTAVSSARFKLPGSFVYTVRGKPPTQASVMVD 
ALPPAKLKCPRSTSDCCAGSENFKPVDLSLIiASMGSIFTLGTGKKCF 
KVTTGEKQNCQWEKI ITISS PEECDVP I P VS PECE S PECDVPLPVSM 
CSHCSIPTYE 

5090 

A 

235 

583 

DSVGGFQIYNCELENVAEFEGLTDFSDTFKLYRGKSDENEDPSWGE 
FKGSFRIYPLPDDPSVPAPPRQFRELPDSVPQECTVRIYIVRGLELQ 
PQDNNGLCDPYIKITLGKKVLV 

5091 

A 

1 

430 

MGKDFMTKTPKAMATKAKIDKWDVIKLKSFCTAKETTIRVNRQPTEW 

EKFFATYS SDKGL I SR IHKBLTQI YKKKTNNP I KKWAKDMNRHFSKE 

DIYAA^KHMEKCPSSI^IKEMQIKTTMRYHLMPVRMAIIKKSGKNRW 

ELNNENTWTQEGEHHTLGPVVGWGKGGGKALGDTPNVNDELMAFSTT 

RPTFFPRSDPTTTRPMAVPPHEEKDDHLETHRPLRPRTTLPHAPDGR 

WLLLTLTHTSTPGKTGLRTYRPCLSRTEKVQTAGARPSHPHSSAAFT 

TWAKGPTGRRPGHKRSESATQKEVTMSPCTAI*RGQKADSKAHGQQGT 

WAARGMHTESGTCKASRGFPSLLSPRTRIPKVFVAERKGLRGFVLSL 

TLP FS C I QKHWEQTVPMS PLVEDVS E * TGNLQNGRNFLQP THLTKG * * 

YPESTKNSHKFTRKKQTTPSKSGQRI * TDTSQKKTFMQPTNTWKNAH 

HHWP S KKCKS KPQ * DTT * CQLEWQS LKSQEKTGGN 

5092 

A 

65 

163 

LEASKFSLVPTGPSV*CSPPCVHVFSLFNSHL 

5093 

A 

175 

495 

PQFFGNVLGHLLYSFLGPFQCDENIRYLS *CS PSSCPPCTNRPQYVM 
FPSLCPC^MVQLPPMSENMWCLVFCSCDSLLRMMVSSFIHVPYRFC 
YCNIi VVY FE VR CH 

5094 

A 

67 

351 

IJ1TRKSVFNCVTCLFLHCISSVLINQLYDRPWCVMFPTLCPSVFIVQF 
PPMSENMWCLVFCPCDTLLRMMVSSFIHVPPERHKSKHHDTRG*CSN 
M 

5095 

A 

2 

432 

QVFGS I VRIHQGEWFPLLYGI.LLPRVQNCLTN* LDLCWSQHQTPHVL 
THR WELNNENTWTQEGEHHTLGP WGWGEGGG I ALGDI PNVNDKLMG 
AAHQHDFLVSPKITGIVLSISQVRVPFIDHRRKARYSADLTSFIiIAI 
PG 
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5096 

A 

75 

173 

I PGFPPETCALHFLPHFCKWQPQS S S YS S KEH 

5097 

A 

1 

2349 

MLEDRDGEEDEGDBDRLDNAVRGPGPKGPVGTVSEAQIARRIjTKVKN 
SRRMLSNKPQDPQVATAWRSASLFNIiYIAHLLAIAVMFQWTYSNGQW 
LNRPALNDFIPTAALGLYNPTPVPILGMTLGGSTQLVAPGIAQSCPG 
L VLA I L I GEGNE ATPED PKD PTCWERLGNHP P S VLQ IR VRVI EGRQL 
SGNNIRPVVTCV1JVCGQTHRTRIKRGNNPFFDEVLWLQPGPHTPASYS 
RAGLNDYHNCCKNKMQGVVIiTHKWELNNENTWTQDGEHHTLGPVVGL 
WPQCLTDPEGFGEETWASTQPFAGVTCKALYLLHFTPTAIGSHCLDH 
CFSDFLG I KNYRKEKVGLRDLG I CRRPGEGRLGKH I LRNVLGDDAEA 
ELIGTATVAliKDLTGDQSRSLPYKLISLLNEKGQDTGIYNCELENVA 
EFEGLTDFSDTFKLYRGKSDENEDPSWGEFKGKIjQMWVDVFPKSLG 
PPGPPF\NITPRKAKKYYLRVIIWNTKDVILAEKSITGEEMSDIYVK 
G\WIPGNEENKQKTDVHYRSUDGEGNFNWRFVFPFDYLPAEQLCIVA 
KKEHf W£> IXJQl J5FKXPPK\XjI IQI W \DNDKFSL\DDYLGFPRNIiTIjR 
HTIHFLQKS PGGNC/RGLDMI SGTFKAMNPLKAKTAFLFEQKFMN*W 

DLPNRPETSFLWFTNP\CKTMK\FIVWRGFWWVIIG\LLFLLILLLF 
VGRAPLTbLPNYFVQWKI VKPNVLTKGKGF I SRVIQQ 

5098 

A 

1 

296 

RRS APSTSLRTTCVS S CCWPRRRAPSMTPGAG YPSGS S AGP/ PTHS P 
* PPWTRPS/ S AQRYLRK*QPQPGPPALLPLLPAP\ PKAWLSGLGPW 
LDSPGPQ 

5099 

A 

26 

253 

RWLSPWPIi/SCPQPGWTARNPQPTRELPPHPQAGKPFLPVPVPFQPT 
SPSLLPVMGGLSRPGSFSPVSVASLRCCTH 

5100 

A 

274 

2462 

RRSAPSTSLRTTCVSSCCWPRRRAPSMTPGAGYPSGSSAAHHRCDGW. 
ATTHSP*PPWTRPSWRSAYIiRK*QPQPGPTQHCCPLPSQPPAKGVAV 
RAGPCSAGLSRATIGTGQG/PG/TSPGHTWGQQRYHEATFWG*LLDC 
LVFKS EKPR * L I LLL I TMF * S TS VS LTMLSIjD * ARGRREPAHPTPGP 
SPAATN* PGALLPTVQI S * KCSSLQAPERASPPPCS PR/PPATASFR 
PSLSDPGPS/APSFGKPSPHLKPNSNIFIFLPYQLLSHL*VAQSMAT 
PCPQPGWTARNPQPTRELPPHPQAGKPLLPVPVPFQPTSPSLLPVMG 
GLSRPGS\SLPSQWIiL*GAVHAR*PVPFSILPRGTELTWTPSAKGPM 
GWAAGLAAS GGCLHS SR * PTL VFRGWWEHI PVPLPHNRCGDS PGRP * 
QGSSMVVRDSGEL*GLGDPGLPGFEINKPCCPF*AEILKWNARLARNA 
VLTQNAPSPCPYWTWPCLMPSKDPYPEAYPPYVLREAECPLQAVMGP 
EAPPAWPCS PVG/ S VTVFP/ TS VSRS PPPKKLTHCE * P WAS / S PNLI* 
VPQFPHLEKMG PPLAS S SRAAHAP FS P / S PHPAIiGRLMPLPQWAIiWE 
PTGSPALTPPEASEAC^/MSQLSSR*GPRVLCWVJGVJGMPSFLYLFHRK 

oXT%Ao/^inyoOXAjXjif o!t w X^VXjXj w vjFX^ 

LPVRGLSPSFQSNSRRGLQSPVRQTGPFPKVKPCV 

5101 

A 

1 

545 

DFPRQSS* NIjN I TGKCG I S P/ GTPLiNA\AHFVPGQY / VDVTTKTMCK 

GKWGNIYRQEYG\LKVWRIOTKPPIl\YVKWSGPGP*KWIiR*RSKDF 
KLPA\ FKDLGKNLP I PYIFFLEGDEGGTAQKIGMDEKRVS SPGAPSN 


A 



Lir <j X Xj^Jv* VUWItH) 1 JSJvPC X W J. J? X XjAXjL 1 X AK I WK.yPKCPXjVGlvWX / Q 
LCCI * KMKYYSDFKKSCQAMK 

5103 

A 

1 

422 

FFFFFLRWSFTLVA*A*VQWRDLNSLQPPPPRFKRFSCLSLPSSWDY 
KK/iorLortWr JL«Ur / xir Xj V Jc. XKr jlxi VCyA^ijiiXjJj|MoljX^Xjir/\oAoyoAVa 
IIGVSHC^PIASLFLIHIFNKCISGVRLCSRPWRYSAAQKSLDIAY 

5104 

A 

2 

375 

DGS CFVAOTGVOWHNIjG SLOPLPPRFK * FS CLS Ij PS <5 WDIjROV\ PPP 
HPANF\ * FLV* PGFHDVGQAGLELLTSGDPPTLATRSVGITGVSHCA 
RPGSIYKQRLDKKLRSSVAARTNRCPLWSLPI 

5105 

A 

1 

293 

RRLFFSFLRQSLVLVIQAGVQWGDLGSLRPLPAGFKRFSCLSLPSG* 
D YRHGPPRLANFWI FTREGM/ FAMLARLVSNS * PQGDP P AS ASQS SG 
DYRA 

5106 

A 

3 

305 

ETETCSVAQAGVQWPILSSLQPLPPRLKRFSCLSLLNSWDYRCLPPH 
AANF\*FLVETGFPHVGQSGLKLLTSGDLPTSGSQSAGITGVSHCTW 
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AGKSYIF 

5107 

A 

1 

487 

arrdaaevflvsdpsgrmvkfslqrilnshcfarekegdkpsatiha 
srtmpllslhsrggssse/ sddrlnvteeltsndktrilnvqsrltd 
akr inwrtvlsggs lyi e i pggalpegs kds favllefaeeqlradh 

VF I CFHKNREDRAALLRTFS FL 

5108 

A 

1 

506 

FGVSVYLHDYNMRTNPGEMSESTVPTHPADKGWPQGPDAQPRGGPTP 
PPLPKTLSRTCRPMKQASGLLGTRGCPGLTISKPKKLKVRSKAAAGE 
KPSEPLPTRTPRPHPPPPQGDRACGLRMKGRRRHPHAPYCRRYGSPT 
SVLAVLVEANEDMVGSQLLLSELQENWQAVLAPLGQRAARDLDVEAA 
ATQHCSPN\WRTVLSGGSL»YIEIPGGALPEGSKDSLPVLIiEFAEEQI» 
RADHVF I CFHKNREDRGR * PVS PAVGCVG VPAP PLH S QPAGTVSLRG 
WGVRAGGSGRQGF * RLLPR\ AALLRTFSFLGFE I VRPGHPLVPKRPD 
ACFM\AYTFERESSGEEEE 

5109 

A 

1 

996 

MVKSSLQRILNSHCFAREKEGDKPSATIHASRTMPLLSLHSRGGSSS 
ESESVWMARDAPGEAAH/ CRRHPG/ PSPRGPPGSWGWRTPTARQRPG 
KTPAASREDASRFRGRGCQLRLWGS/HAGSRSSFPDCVPRGGRRSAP 
DREDDRLNVNRGN* RSNRQDGGFFNVQ\ SRL\ TEAQTALNWRNNC* S 
GGSLFNIKPAGGRRWPEGSKGQLLQFLLEFAE/EKQLRADHV\FI\C 
SHKNRE \ DRS CLAPKLS AFLGLKI L\ RPGHP\ LVPKETPDACFMALQ 
RSRESLPGEEEEVGARLRGWAIPPPWGHPFVSRVGRNRRIiAHLAWVC 
PHWIVQINAHSQ 

5110 

A 

1 

125 

FFFEMESHSVIRLECSGVNSAYCNL/TLPGSSGSPASAPPPQVGGTT 
GTHHHAQL I FVFLVETGFHHVGQDGLDLL/NLVIRPPQPPKVLiG * QA 
VLLPQHRPPK 

5111 

A 

2 

280 

FFFFGDRVPLHSAAQAGVKWCNLGSLQPPAP/GLKPFSCLSLPSS*D 
YRSLPPCPTHFCIFSRDEVSPSWPGWSPTPDLMNHPPRPPKVLGIQV 

5112 

A 

2 

291 

FLEMESHSVAQAGVQWRDLGSLQPPPPGFKRFSCLSFPSSWDYRRPQ 
PRPA/NFFVFLVETGF\ TILARLVSNS * PRD P PASAS QS AG I TGVTY 
RARP 

5113 

A 

3 

690 

FFFFFLRQSL/DSVTQAEVQW/RRILGSLRP/LQLPGLK*CCCLSLL 
S S * D YS YNPLPRLANFF / VFC I F * * RRG FTMLARLLSNS * PCDLPTS 
ASQSAGITGVSH/HAPGLASPSLNNKVQSSVGAQRRHDEDNGSRA*V 
FFCFFFFKRFALVA*AGVQWRDLSSLQPLPL»RFM*FSCLSLPSS*DH 
RCVPPHPANFFVFI*\RDGVSPCQPG*SRSPDLAIRPPRPPKVLGLQ 
A 

5114 

A 

3 

282 

FFFFETESHS I LSPRLECNGTI SAHCNLRLPGPRDCPAS AS * VAG IT 
GVCHHAWLIFVFLVETGFHHVGQAGLELL/NTLVIRIiPQPPKVLGLQA 

5115 

A 

3 

198 

LSLNWCI ILYI YICI YW/ CWLFCHSCLSNMGT* FFLAGLLLFRIC 
LCLF I FYFLL * * *LLFIMI 

5116 

A 

1 

854 

MNKLFSILLGEKVDTQPNVLHNDPHARHSDDNGQNHLEGQMNFNADS 
SQFKDENTD I AEK\ LEEKVRI LCWVMTGPQ\ NLEKKAKHVKATWAQR 
CNKVLFMSSEENKDFPAVGLKTKEGRDQLYWKTIKAFQYVHEHYL*D 
ADWFLKADDDTYVILDNLRWLLSKYDPEEPIYFGRRFKPYVKQGYMS 
GGAGYVLTREPLKRFVDALKTVKCTHS \ S S I EDLALGRCMEI MNVEA 
GDSRDTIGKGNFFHPFV\ PE\HHFNI*KGYLPRTFWYWNYNYYP\ PVE 
GPGLLL 

5117 

A 

13 

290 

ECRRQRG VQ * CNH S * QQL * PP\ GLMQS SLLS S PG S SDYRQVLLHSAE 
F*VFNLVEMGSYYWQAGLKFLASSDPPALTSOWARITGMSHITRA 

5118 

A 

1 

595 

MGIHHVGQASLELLTSGDLPASASQGRGVRLYYI\EGRSSQSASVTA 
LFLS S LPTVTS AMAGTRPP S ARSHQTLQACRAQKTKTRMS SI* GTGA 
KHQASSPGKAPLSTPS.PYFWKPSLQTSPCSGSRSLWASLPSPLAALF 
LCFWQDAT*RSSTTRSSLPSWPSRSTRALRLSTS*PECAPSA*ASSK 
AGERSTGDRL 

5119 

A 

3 

496 

FFFLRCSV/NSTAKAGVQCHNLSSLHPLPPGFKQFPCFSLPSS*DYG 
CAPPHPANFYI PSR I SQAWWRVPVI PATQEA\ RQS FNFVAQVGVQWH 
NLSSPQAFPAGFK*FSCLSLPTSWDYRHMPPCPDNFCIFSRGGVSPY 
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WPSWSRTLDIMIHIPRPPNVLGLQA 

5120 

A 

3 

329 

ETVSHSVAQAGVHWHHVGSOQPPSPGSKQFSNLSLPGGWDYRHPPPH 
PAN\FIFLVETGSHYVAKADLQIiLASNDPPTSASQHAGIIGRSHCTR 
PRIY*WSNKL*YKIT 

5121 

A 

1 

537 

ERQSCSVARAGVQWHDLGSPQP/LPPRLKRFS\CLNLLSSWDYRRAL 
PHPG * FLYF* * RQGFTMLVRL/ VLNS * PCDPPTSASQSAGITGVSHR 
ARPVLRF*QNYYYFETGSGSAAQAGMQW\PIMAMQPQPPRLKQASNL 
SLLSSWDH\GMQPHPANTTVCKTERKQVFRMHLLNNTNYREQ 

5122 

A 

1 

197 

EAGSHSVAHAGEQWCDYGSLQHQP\PVLKQFSHLSLPSS*DYRCVPP 
CP ANFLI I LM I S VS IRGS V 

5123 

A 

809 

- 

1472 

KTLLLS C YLS KLWP VD YLRS SRLLRRAAS S FG S WR FALQRTPHTTRR 
LS RS S KAVQTAFPRRS PS SRHRCGS S SRATTTCRMS FLS S LTTCAQQ 
lAGWWLKRSIGLRKRSMSLMALKKWPCLMWKXRRSLPRYPQPQRTR 
GKKRSG S KIMI RRLNGQMGPRTVPAPAMKEVQI / TKLKKS KRRS CSH 
CSSKVCDSKS/VQEQGAP*VGGQQPPPRG*SYAWC 

5124 

A 

1 

504 

DSEPCSVAQAGEQ*RDLGSLQPLSLGLKQFSCLCLPSSRDYRRPPPC 
LANFCYF*YRWGFTTSARPV*NSWPCDPPTSASQSAGTTGVSHQA\R 
LLLFFKAFKVWSGDSI*CVNEFNIISKISSFPSHFTSTPQSCPIMHS 
QIKYK*F*DYP*CSKQAGGISISDTYY 

5125 

A 

2 

351 

KQGIIMTYVLRRPFFFSRWSL/DSVSQAGVQWRVFGSLQAPPPGFLP 
FSCTG/ RQPPSPANFFLYFLVETGFLF* *RQGFTVLDRMVS IS * PRD 
PPALASQSAGITGVSPRTRQEGHS 

5126 

A 

1 

1047 

MVS I SWPRDLPASASQS AGITGLIGALVXiS VGI YAEVER/ HEI *NP* 
KCLPGSSHHPHPPGRRHVHGLLHWCAGVPP*QPPELLASLRLSRGYG 
LVLSWLEPRYEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETIiKT 
RG I F ETKFL SQ I E SDRIiALLQVRA I LQQLGLNS TCDDS I L VKTVCG V 
VSRJIAAQLCGAGMAAVVDKIRENRGLDRIiNVTVGVDGTLYKLHPHFS 
RIMHQTVKELSPKCNVSFLLSEDGSGKGAALITAVGVRLRTEASS 

5127 

A 

116 

498 

YNFALKSELKFKNQAIVSLNKFFFFFFLRWSLRPVAQAGVQWRDLGS 
LQ\PPPPGLTPSSCLSLPSSWNRRRPPPCPAKSFWIF/M*RRGFTVIi 
ARMVS IS* PRDLPAS ASQSAGI TGVSHRTRQEGHS 

5128 

A 

3 

490 

GIYHTLRTGQEGRGSIKQQRLLQRPLPYFFLFIFPSFLYMIEIKH*L 
SFYILLFAFNIVS*PFSYFIFFF/RF*DSVLLCGPGWSTVA*S*IiTA 
ASTSQDQGDSPTSGPLKVTDRVCGHHT*ANFYIFCGGWGGLTII#LRR 
IPNSWAQGIFPPWPPKVLGLQA 

5129 

A 

423 

495 

LLF P I RRPFATRS SQKES S PKTP 

5130 

C 

628 

849 

MCGSVYSTIWSLIASRREETIRVIVLYIQSPNINTRHISKRGLNKGV 
NQFHKEDWNLLKRVCSHSGAEAGIFS* 

5131 

A 

350 

412 

NFELKLSPYQKKSIYWG^QVMPVAVGKPSSEWPPGSHMDNSQIRGN 
PVCGIK*FSCPSLQCSWDYRCMTPCLANLFFFKLEGGFCHVAQTSLK 
LLGSSSPFILASRSTGITGMNHHA\RQLIiN*SIRE*HSITHRLTHSR 
EKSCECNIiYGKTLSSNFPPTKKKVSTGELPR 

5132 

A 

3 

189 

HERKAQPAGEGRTHMTKSDSLPSFRVSTLPLESHHPDPNTMGGASHR 
DRAIj SGTATVRDTRGIDPAPVHAS PTRNR\ NEGRHVFKRFPDTLTDR 
SIS I YTDHHSLQIRTLRALSQL*TTPTQTPWAGPATGTGLiSRGLPPY 
GTPEG 

5133 

A 

32 

800 

FKTIIKVRFLCIRVYSLCCKLLTYCDHFHSIiDFEECAHKXiLKMEFPE 
SQTKELCNMILDCC^QQRTYEKFFGLIaAGRFCMLKKEYMESFEGIFK 
EQYDTIHRLETNKLROTAKMFAHLLYTDSLPWSVLECIKLSEETTTS 
SSR\ IFVKI FFQELCEYMGLF* / SLIARLKDGN\IiAPFF *RI \ FPRD 
DPRITRVAI \ SFFPS IASGCLTYCPVSALSS I PLRPLWALLLP I PI 
FHHDLLLELFRLPFFYNDTLVLPFV 

5134 

A 

2 

706 

RCN S AQQQMTS S QKALML ELKS LQEE P VEGFR I TL VDES DL YNWE VA 
IFGPPNTLYEGGYFKAHIKFPIDYPYSPPY/TFRFLTKMWHPNIYEN 
GDVCISILHPPV\DDPQSG\ELPSERWNPTQQCEGLILLRC*SHWLN 
EPQHLSPQPMSDASVMFR\KWR\DSKGKDKEYGLKFIRETKFSAT\K 
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ARRQEKGGS * RFPTTLGGNTC I KTEVP \ RNDEQVQS LLYDELGMMTD 
I DDER * GRRKDAD \ C YDD \ DDS GE * G S PQP WAG I PAS KLRCPCNDEQ 
VQS LLYDELGMMTD I DDER 

5135 

A 

7 

219 

EGV*VSRDW*QWS/DIGSWRHSMTGSGDSLASAF*VSGGTGESHHKQ 
LIAVNMVEMGSHRAGQAGNELSTL 

5136 

A 

2 

309 

DGVSLLLSPRLQWHNLGLLQPPPP*FKGFSCLSLPGSWDYRHV/PPC 
LANFVFLVETGFRHVGQTGLELLTSGDPPASASQRAGITGVSHRTWQ 
KMYFLSQKYF 

5137 

A 

1 

309 

SCLSLLSSWDHRRVPAHPANFVFLVETGFHHVSQV/GLELLTSSNPP 

ASASHSAGITGVNHRAWLPGLNLRF*KMIIRRLRRLLKNKFHSTILR 
KSETSNSCE 

5138 

A 

2 

625 

TYGEMAKI VDVPTKQLRAPDP I DSMLCHFCHNVS FPCTRNGCVDMEH 
FKVIKTHQIEDERERREKKLYFGYSLLGAHPILNQTIGRMQRATEGR 
KEELFALYSAHDVTLSPVLSALGLSEARFPRFAARLIFELWQDREKP 
S*TF\VRILYNGVDVTFHTS FCQDHHKRSSKPMFPLENLVRFVKRDM 
FLALGGSCTNYYDACHREGF 

5139 

A 

167 

688 

VI KI LGTEKGKKNRTNLI FLSGNRVLKWMERSHGTEKALTALLKCGA 
EDHVEAVKKLQNSTKILQKNNLNLLRDLAVHIPHSLREQSRLGRCGH 
ITQETLLFLTVGDEKGGGLFLLAGPPASVETLGPRVAEVLEGKGAGK 
KGRFQGKATKMSRRMEAQALLQDYI STQSAKE 

5140 

A 

3 

1230 

QRDSYAREFTTTVVSCCSAELQT/GEGATARNIFLSGFQVVLEDTVL 
FPEGGGQPDDRGTINDISVLRVTRRGEQADHFTQTPLDPGSQVLVRV 
DWERRFDHMQQHSGQHLIT\AVADHLFKLETTSWEVGRFRSAIELDT 
PSMTAEQVAAIEQSV\NEKIRDRLPVNVRELSLDDPEVEQVSGRGLP 
DDHAGPIRWNIEGVDSNMCCGTHVSNLSDLQW* \ ILGTEKGKKNR 
TNL I FLSGNRVLKWMERSHGTEKALTGLLKCGAEDHVEAVKKLQNST 
KILQKNNLKSAQ/RT*AVHIAHSLKEQSRLGRCGHITQEGG*FRVHE 
YHCQ * RLGQRRPS L FLTCGR * ERWWTLLYWQGHLRLWRP WGPGWLRS 
WEGKGAGKKGRFQGKATKLSRRMEAQALSPGLHQHAEC 

5141 

A 

3 

306 

SRQWTLFADGQHSGPAAPCPCGEV*RTGALNPKTFQG/LPSAPGPCP 
VPHGASHP*TPQKLQPGNKSQMKPVKPGFPPSGAGGAWASL*PSSLP 
FPDLQYLL 

5142 

A 

1 

306 

FFFFFL*RQSCSVAQAGVQWYDPGSQKPPPPRFERFSCLNLPSS*DY 

RRPPSLPA\IFVFLVETGFHHIGQDSLKLLASSDLPISASQSAGITG 
MSHCTWPV 

5143 

A 

81 

657 

TAEKGGEETKSGELGRSQRTLSQVGATRGTSEEITLIVQMRDAGGSA 
LCSG I GGGEKW S VLGVI L KKDDR I VLAYCMGHE I KRG I EDDS KVFFF 
FFLEMKSCSWQAGVQWHDLSSLQPALPRYK*FSCLSLPGSWDYRHV 
LLRLVNF\*FLVEIGFHHWQAGLKLLTSSDLPHSASRSAGITGVSH 
RARPG 

5144 

A 

3 

494 

ETEDLALSPQAGVHWAR/SLSSLQPPPPGVFKQILCLSLLSSWDYRG 
/PCHPHPAKFLAF*VETGVSPSWAR\LFLNS*PRDPPASASQSAGEP 
PRWASVCNSVKMLS SAFNWCVCRLHKEKWMRRQKTHTVMXPHSNRCS 
SRHFSRIPSSIETYNNLRHTLLAGRF 

5145 

A 

3 

123 

ESGSHS\LPKL*CSGAITAHCNLKLLGSRDPPTSASGVAGT 

5146 

A 

3 

292 

RQRL/NSVTHSGVQWRNPGSLQPLPPGFKHFSCLSLLSSWDYRHTPQ 
CPA\ IFVFLVETGFRHARLVSNS * PHNLPVSASQ/ SAWITGLSHHAR 
PCVSF 

5147 

A 

2 

303 

FLGWSFTLWQAGVQWRDLGSPHPLPPGFKQFFCLSLPSS*DYRHAP 

PHLANFVFLGEMDFLHAWQAG\LELLTSGDPPTLASQSAGITGVRHC 
SRPVSE 

5148 

A 

2 

613 

PRVRTYRFGSVGHDTQLCLWDLTEDILFPHQPLSRARTHTNVMNATS 
PPAGSNGNSVTTPGNSVPPPLPRSNSLPHSAVSNAGSKSSVMDGAIA 
SGVS KFATLSLHDRKEKAPRERSQSRNHSMGHNF * QEASDKLES S / S 
P KPKPDPAKTLGTPLCPRMEDVPLLEPL I CKK I AHERLTVL I FLEDC 
I VTACQEGF I CTWGRPG 
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5149 

A 

2 

395 

IGFQPQLTEFQVSLQDLEKLSGLVFFPHLDRTSDIRNICSVDTCKLL 
DFQBFTLYLSTRKIEGARSVLRI«DKIMDHLKNV*NE\PYDYPMRRYY 
KKLQELTAIDLTSTHISQPSLLSYTTCLISGATPPFT 

5150 

A 

644 

962 

NASSLAWTVLLKLLVGFGAGQCMGNFFFCWGGAMESPSVAQAGVQWR 

NLGSLQPPPPGFKRFSCLSFPSSWDYRHVPPHPANF\CIFSRDGVSR 
CWPGWS*TPDLR 

5151 

A 

3 

321 

RRCLCAGYFKNVARRSCWRTFCTMDGRGSPVHIVHPSSALHEOETKT. 
EWI I FHEVLVTTKVYARIVCP IRYEWVRDLLPKLHEFNAHGFEQCGP 
T*SERRCKKEMDK 

5152 

A 

237 

602 

LLITGRQYSLTFRRFYIHLRNKILNSGLGKPPLKX.HFYTEAFFARNP 
DQMEFHSVAQAGVQWHDLG*LQPLPAGFKQFFCLRFLSSWDYK\GTP 
PCLAG FC I FSRDEVS PRWRGWS * TPDLK 

5153 

C 

166 

417 

MPPWDPCFPNHPAPKPHQPHPARLTDRHHPXPFPTHOLPCLIiPAP<5P 
GS LWALSPQGHVYTNVEGTRS I RECSCTKNPFPGP I 

5154 

A 

282 

1112 

FLKRENLLKNVCRNKLDLRPNRASPTGSQEWRQHGVQSEVQFVHSLL 
KPGPPADSCRTAA/GDPGGTETADPGHTGLELRFPEPPKPTLPLALS 
EAPVP/PERGENTGPRSLPSTPMPSWGPLLSSPPRPATSPTSPSQAN 
*PPPPRPSSHPPTTVPPTRPOPWOPGCCAOPPRPrT 1 W*PnnM*W*r» 

MQLHQKSVSWPHSQWPQKREGNGSSHEIHPSCPPSGFHGPSPEGRMP 
TQAGALHSWVPGLSLSHGSGPHPPPRWTATVQAARPRVPPLTNL 

5l55 

A 

69 

734 

SRLGRDHVGPGLGGVG / VRR/ REND IGE YEGGRNEAGERHGRGRARL 
PNGDTYEGSYEFGKRHGQGIYKFKNGTU^YIGEYVRNKKHGQGTFIYP 
DGSRYEGEWANDLRHGHGVYYYINNDTYTGEWFAHORHnonTYT Yaw 

TGSKWCTWWGQQEGTAELIHLNHRYQGKFLNKNPVGP/ AKVCI *C 
WV*TTW*ISFNRYGKRRRGRGGRISNCCSKMESYPNH 

5156 

A 

4 

674 

GVQRGHFQEAKGSGRCLS S LNVPHP PP fiHMnp q c mroxroi?!? tv pp n 
LAQVPSAALAQVAPGPVPISSLLEGPT\PSLPF*ERGNSSHKQASGP 
ERGARPPALLGPQEIGGSSK/YPPPGPGSTLGKGLLASPRCGPPQVL 
GWGLRDPTRLEEGPWGSL/GNTPREPSIKCSP*MGRMGAQPLPKAEG 
APPPTQLPPTHS PRR ILGPLGAS PERPPPRWPPGGEG 

5157 

A 

3 

342 

NFFFLRQSF/NSVTQAGVQWRDLGSLQPLPSGFTPFSCLSLLSSWDY 
RHPPPHPANFLYF**RRGFTLLARMVLIS*PHDPPTSASQSAGITGV 
SHGAWPKS I FFLNAEVMLYY 

5158 

A 

1 

251 

VCVCGFCFVLETESRSVTQAGLQRHHLCSLQPPPPGFKGFSCLSLPS 
SWDYRCPPPRIiANF\ * FLVEMGFHHVGQAGLKLVASSDPPAS VSQSA 
EMTGVSHHALPG*SRSVTQAGLQRHHLCSIiOPPPPGFKGFSCLSLPS 
SWDYRCPPPRLANFCIFSRDGVSPRWSGWSQTRGLK 

5159 

C 

24 

362 

MGIWFRXXXXXXMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxs * 

5160 

A 

1 

287 

FFFF*DIRVSAQSPRLECSGAILAHCNLCLPGSSDSCASTSPVAGIT 

GMCHHTQLIFFVFLVETGVHHVG\RAGLEL\RLQVIHPPQPPKVLGL 
QA 

5161 

A 

1 

1672 

QTGLQAx^KHSGFPV^MDNAVPIVPQAPAAQPLQIQSGVLTQGSCTP 
LMVATLHPQVATI TPQYAVP FTLS CAAGRPAL VT3QDCRCTAGVAWKG 
LKQILLPSNFGQQLAWG*ALHNS\VQPTAMIPEJ^GSGQQ1^ADWRNA 
HSHGNQYSTIM^PSLLTimVTLATAQPLNVGVAHVVRQQQSSSLPS 
KPJ^K\QSAPVSSKSSLbvXiPSOVYSLVO\ «?«5PT,pttqc;vwct \ \rmrn 

DQHQPIIIPDTPSPPVSVITIRSDTDEEEDNKYKPSSSGLKPRSNVI 
SYVTVNDSPDSDSSLSSPYSTDTLSALRGNSGSVLEGPGRVVADGTG 
TRTIIVPPLKTQLGDC.TVATQASGLLSNKTKPVASVSGQSSGCCITP 
TG YRAQRGGTS AAQPLNIj SQNQQS S AAPTSQERS SNPAPRRQQAFVA 
PLSQAPYTFQHGSPLHSTGHPH1VVPAPAHLPSQAHLYTYAAPTSAAA 
LGSTSSIAHLFSPQGSSRHAAAYTTHPSTLVHQVPVSVGPSLLTSAS 
VAP\AQYQ\QQVATQ\SYIGSFRGSTIYTGYPLSPTKISQYCYL 

5162 

A 

1 

111 

ELR YKRWPGWKADT PQAQ \ TVKWVG VG KSRES M I QL F 
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5163 

A 

2 

1034 

DRAHLVFDFHQAVDGLQBVQRQAQEGKNIGTTKKGIGPTYSSKAART 
GLRICDliLSDFDEFSSRFKNLAHQHQSMFPTLEIDIEGQLKRLKGFA 
ERIRP>nmDGVYFMYEALHGPPKKIIiVEGANAALLDIDFGTYPFVTS 
SNCTVGGVCTGLGIPPQNIGDVYGWKAYTTRVGIGAFPTEQINEIG 
GLLQTRGHEWGVTTGRKRRCG\ WLGS \ MI LRYAHMVNGFTALALTKL 
D IL\ DVLGEVKVGV\ S YKLNGKXRI PYFPG * PGRCFKKGRKLKYETL 
PGWK\ADTTGRPGRWEDLPPQAQ\TYIRFVENHVGSRSGYHYQSFLL 
/VSVKWVGVGQVQESPMIQLF 

5164 

A 

3 

148 

LLSSRDHRHVPPHPANFFCRD/RGIJmLPRLALNSWA*AIHPPWPPK 
VL 

5165 

A 

3 

325 

YVFSGDLAHQILNVSLGIRRLHLADVSVGICLAKLRIAPVPPHNESV 
FNRWRCP\YSSCSY\HHLITSHQFQPSELIKYWNHLQHNKHNACANA 
A* EKAGMYRHRR1»H 

5166 

A 

373 

950 

SRGVFFFLRQSFTLVA*AGVQWCDLGSLQPLPPRFKQFSC\PASLVS 
WDYRHAPPHPANFFIFLVEDKGFSMLVRLGLELPTSGDLPASASQSP 
GITGVSHHARP * MILF I FMFS FGRRDGVS VTQAGVQWHDLGSLQASP 
SGFMPFSCLSLPSSWDYRHV*PCQANFLYF**ANLSRVSPCWSGWSR 
TPNLK 

5167 

A 

2 

334 

VRHRFTLVPQAGVQ*CSLG*QQSPPPWFKQFSCLSLPRSWDYRHLPP 
HPANFFVF/VEKTGFHHVGQAGLELLTSVT3PPASASQSAGLTGLSHH 
AQLSFNLSAKKSFTKTRS 

5168 

A 

35 . 

438 

NSEAAQAFLCKNGDPQTS I FDHLVAI ERAGRAADGNYYNARKMNIKH 
LVDPI DDLFLAAKKI PGISS TGVSNWGG YALACAL Y I LYS CAVHSQY 
LRKAVGPSRAPGDQAWTHALPSVIKEEKMLGILVQHNSRS 

5169 

A 

1 

1845 

MAGELRPASLWLPRSLAPAFERFCQVNTGPLPLLGQSEPEKWMLPP 
QGAISETRMGHPQFWKYEFGACTGSIiASLEQYSEQLKDMVAFFLGCS 
FSLEEALEKAGLPRRDPAGHSQAGAYKTTVPCVTHAGFCCPLVVTMR 
PIPKDKLEGLVRACCSLGGVSRGKGPEQGSKALAAVSGLGLQMPLRF 
KELLGIKELSKPAYGDAMVCPPGEVPVFWPSPLTSLGAVSSCETPLA 
FAS IPGCTVMTDLKDAKAPPGCLTPERIPEVHHISQDPLHYS IASVS 
ASQKIRELESMIGIDPAKLESQWKRFDHLVAIERAGRAADGNYYNAR 
KMNIKHLVDPIDDLFLAAKKIPGISSTGVGDGGNELGMGKVKEAVRR 
HIRHGDVIACDVEADFAVIAGEHSDGRPVRPATQGVSNWGGYALACA 
LYILYSCAVHSQYLRKAVGPSRAPGDQAWTQALPSVIKVTNPSQPLA 
WKQLLPWMSSRPEVPSQQAPPPTPIPGSNPPPLGLLKEMGPAPLGGE 
AGGPDLRD IHNAS VLNGTPTWG IFPGQVLSTHGDMDVEY AI S PGHS S 
HVEEKMLGILVQHKVRSGVSGIVGMEVDGLPFHNTHAEMIQKLVDVT 
TAQV 

5170 

A 

1 

212 

LRAPGDQAWTQALPSVIKEEKMLGILVQHKVRSGVSGIVGMEVDGLP 
FHNTHAEMIQKLVDKGQHFSGAF 

5171 

A 

371 

915 

NQPHFVNACCVPRTLLSALLEEKMLGII.VQHKVRSGVSGIVGMEVDG 
LPFHNTHAEMIQKAGGRHHGTGVTVHVPCEQSPYORAGLHPGRMQLL 
IiATILLVNTGLRQPTWSPVR I FSTVTYGDTCEQLTARQSMRTNI PRP 
CRIRYSRLQSSSPIHGFCYSWFQLLIANCRPKILNEKFQK 

5172 

A 

1 

2460 

MAGELRPASLVVLPRSLAPAFERFCQVWTGPLPLLGQSEPEKWMLPP 
QGAISETRMGHPQFWKYEFGACTGSLASLEQYSEQLKDMVAFFLGCS 
FSLEEAIiEKAGLPRRDPAGHSQAGAYKTTVPCVTHAGFCCPLWTMR 
PIPKDKLEGIiVRACCSLGGVSRGKGPEQGSKALAAVSGLGLQMPLRF 
KELLGIKELSKPAYGDAMVCPPGEVPVFWPSPLTSLGAVSSCETPLA 
FAS IPGCTVMTDLKDAKAPPGCLTPERI PEVHH I SQDPLHYS I AS VS 
ASQKIRELESMIGIDPGNRGIGHIiLCKDELLKASLSLSHARSVLITT 
GFPTHFNHEPPEETDGPPGAVALVAFLQALEKEVAIIVDQRAWNLHQ 
KIVEDAVEQVWGMTCRVFIiAIRGSPGPPELGRWEYEVAERLRQQSH 
LVVMGRAAPGAPSLAWRRLPDSTQSWVLSLVLMGAQRVEFAILDFTE 
LTLHLHEESVLKTQIPILTYQGGSVEAAQAFLCKNGDPQTPRFDHLV 
AIERAGRAADGNYYNARKlWIKHIiVDPIDDLFLiAAKKI PG I SSTGVG 
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DGGNELGMGKVKEAVRRH I RHGDV I ACDVBADFAV I AGEHS DGRPVR 
PATQGVSNWGG YALACALY I LYS CAVHS QYLRKAVGPSRAPGDQAWT 
QALPSVIKVTNPSQPLAWKQLLPWMSSRPEVPSQQAPPPTPIPGSNP 
PPLGLLKEMGPAPLGGEAGGPDLRDIHNASVLNGTPTWGIFPGQVLS 
THGDMDVE YAI S PGHS S HVEEKMLG I LVQHKVRS GVS G I VGMEVDGL 
P FHNTHAEM I QKLVDVTTAQV 

5173 

A 

1 

933 

LESMIGIDPGNRGIGHLLCKDELLKASLSLSHARSVLITTGPPTHFN 
HEPPEETDGPPGAVALVAFLQAL\GRRSP * SLTREPVII»HQKIVEDA 
\LSKVF*RRRSRY*LTKVDQWKLLRHSCAKIGDPQTLRFDHLVAIER 
AGRAADGNYYNARKMEHQAL / RLTPFDDLFL \ SAKKI PG I S S \ I GVG 
DGGNELGMGKVKEA/DEEAHTARGCHRLATWKAEFAVIAGVSNWGGY 
AliACALYILYSCAVHSQYLRKAVGPSRAPGRFRPGIiRPSRRSLRKKK 
CWASWCSTKSGVASRASWAWRWMGCPSTTPTPR 

5174 

A 

367 

460 

VTPEKQHEQWIPHSSIRQQNQQPKEQWS*YPAPQSG\SPIiTIMEGTV 
WQQLMSVMISKHDCLFPKDAELQSKPQDGVSNNNEIQKKATMGQLQ 
NKENWNTKDSPSRQCSWDKSESPQRNSMNNGSPTALSGSKTNSPKNS 
GHKL 

5175 

A 

96 

403 

IKSLYLLLLRPHPPPPPNILCSLEQRNLTLETEMMSLHDELDQERKK 
FTM I E I KMRNAERAKEDAEKRNDMLQKEMEQFFSTFGELTVEPRRTE 
RGNTIWIQ 

5176 

A 

74 

1506 

QQIMRATSSWQAPRMIWKTG*SQSAESYGDLSGGGTVWQQLMSVMI 
S KHDCLFPKDTVELQSKPQDGVSNI^NE IQKKATMGQVTEQGEQ * HQGQ 
P**AVLLG/PSLSH/HQRSSMNNGSPTALSGSKTNSPKNSVHKLDVS 
RSPPLrfVlCKNPAFNKGSGIVTNGSFSSSNARnT.P*inrnT T ri3TjnQT nap 

RS S S LKVSGTKMGTHS VQNGTVRMG I LNSDTLGNPTNVRNMSWIiPNG 
YVTLRDNKQKEQAGELGQHNRLSTYDNVHQQFSMMNLDDKQSIDSAT 
WSTS SCEI SLPENSNSCRSSTTTCPEQDFFWGELLRTLFWMAPPQ\ p 
DPFPTPRDYESKSDHRSVGGRSSRATSSSDNSETFVGNSSSNHSALH 
SLVSSLKQEMTK\QKIEYESRIKSLEQRNLTLETEMMSLHDELDQER 
KKFTMIEIKMRNAE\RAKEDAEKRI^MLQKEMEQFFSTFGELTVEPR 
RTERGNT I W I Q 

5177 

A 

1 

439 

GNEFS ILKSPGS WFRNGNWPI PGERIPDVAALSMGFSVKEDLSWPG 
LAVGl^FHRPRATVMVMVXGVNKL^ 

QVLHDISSLLSRHKHLAKDHSPDLYSLELAGLDEIGKRYGEDSEQFR 
DASKI 

5178 

A 

1 

553 

GNEFSILKSPGSWFRNGNWPIPGERIPDVAALSMGFSVKEDLSWPG 
L\AVGN1jFHRPRGYPSWVM\VKGSGTKLALTPQAVVISYP\LE1TRVY 
WVGKANSVF\EDLSVTLRQLRNRLFQENSVLSSLPUNSLSRNNEVDL 
LFLSELQVLHDISSLLSRHKHIiAKDHSPDLYSLELAGLDEIGKRY 

5179 

A 

3 

1111 

I LKS P\GS WFRNG\ NWP I PRE\ RDPPDVAA\ LSHGLPL * KKDLSWP 
GLAVGNI»FHRPRAT\ VMVNVNGVNKLDLP P \ G S G I S \ Y PLENAVPFS 
LDSVANS \ IHSLF \ SEET\ PVVLQLAPSEERVYM/ VKGRANS VFEDL 
S\VTLRHSRNRLVFKKTLFLSFTPPQILLSRNNEVDLIiFLSELQV\L 
HDISSLPSRPKHL\ARDHSPDLYFTGSWAGL\DEIG\KALLGEDSEQ 
FRDASKILVD\ALQKFADDHVPVFMVGNPVVELVHCPSHLNTSPPLG 
KTRDLPLRPKQAQEPQQVPYNLA\ YKYNFEYS \ WFNMVTiW IMI \ AI* 
AIoAXVIITSTOIWNMDPXGYDSIIYRMTNQKISEWIECYLWPRIRKR 
GFGNWLFC 

5180 

A 

18 

595 

NVAGTA ICS FAPGRRQTPCHTP^PQDGRPRARLTAWTRARGTCRVAH 
THTGRQNHPMPLWGFSRHHRAFSEHLSLGCAPPSATWT*TRSPAAAA 
AGP/PVPRKPNPSPPSRSPAGLRPPGRAGPLTSPRCPGPPASQASDT 
VGPVRHKQRGPTSDVSGAWDGPATAASGPWRLYPSTLGGRGRQITR 
SADKT 

5181 

A 

68 

446 

RLTS ELAQTLAVHPS KSARPRP * AGE\ KPGKKPPLQVNG AWD * * PRV 
GADPGHSPDSLDPST\GDVQGGSYSYWEGRTITMPLWGFPDRAGLLG 
SPGSSGPGKRRPILAGQPYKLLELQARFLGPGH 
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5182 

A 

2 

513 

AP PS ATWT * TRS PAAAAAG P / PVPRKPNPS P PSR S PAGLRP PGRAGP 
LTSPRCPGPPASPGIGHCGCRQTQAAWPRRRTYQGRGMGPRPRPRRP 
WPVGGERSHSHPSGPPLRSPGPEWGLTSSSCPARG/PGGCLCPQNQ 
LPNPGP/SGPASPNLPSPRLHPTDTSGPGFPRA 

5183 

A 

14 

720 

RNFPDRPTRPRTRGRTRGRTRVRTFGPGPVASAGCSRGDGRFGEAGP 

TALDSGADSGVRDTLPVHEPGSCLRYYRTPRPRSRAHPTPLIRPTSG 

PRCFGSDGTHSVRCPVNAGRPGHLGLGG/AKSSGFAGQKGRLLLAAG 

TCGVHVQGPLRAARPQEVLIFQGGPLQPAA\PEQPPATGAWLQPRLR 

CS EKAL * LP * GAP/ SQS PPRPMQ\ P ITVPASG *NAGGPGASGGTRVP 

G*KLPDTGGP/SAQHPLSPTGLPHDARP/PSSCPTGRSCVHCPQLNA 

VHL*E*PANSELSRPLSEDTGP*AAPTRPGSAGPISDPHS\PPPSSP 

SSSGTPAIiPGHMSS/GGKWYQHQPSACPVLGIi/PPDPGAEKAGSQPS 

PGHLVQAPGHPGPPGPHTPAVTPSFPNSRAALPSFSPR*PCPQSPKP 

QGSCDPRPPRKPKPWKFQGSL*GLPLQKNRPPFSPRTGATPGEPPGA . 

L/PLSGNPPKSHRDGSALPSMSMSHPARPSVLGSKLSGFLPSSWPGP 

CTF * WMKCGRTRS IRGHS CAWLKAA 

5184 

A 

2 

1558 

• 

RRSRDSGYRAPPASHSRPGQAARAPAPRPARREVCCPRPGVPRRART 
TRCTPTWAAVATGTRGASAEGGPTGLAHG/SGLAAAAGLMXGCEPAS 
GWGPGRQGSGARTAAGSSGSSSGSGPSAPTSSSVLGSGLVAPVSPYK 
PGQTVFPLPPAGMTYPGSLAGAYAGYPPQFLPHGVALDPTKPGSLVG 
AQLAAAAAGSLGCSKPAGSSPLAGA5PPSVMTASLCRDPYCLSYHCA 
SHLAGAAAASASCAHDPAAAAAAIiKSGYPLVYPTHPLHGVHSSLTAA 
AAAGATPPSLAGHPLYPYGFMtiPNDPLP\TSATGCRPTGRATSASPR 
PKSC*AT/SRTHTAFPGTDKLLSGYPSSSSLASAAAAAMACHMHIPT 
SGATGQPWDAGAAQPPPRGWDSAAATTPTPRA/ALPTPGAPVPVPAA 
TGPYYSPYAIiYGQRLTTASAAGVSVRGGREGERGIGRKRGRGGVQGE 
AASRTRLLTPARGEGLGPRKERNATLPYISYPTAATETRWDTPLLPTF 
TSHTQTL 

5185 

A 

6 

56 

WAWRGSDSFSGLTSKLVQRQVFHHQLLKTVPHQAHPY*LPRAALQQ 
IHRC*LPR/DTSQVAASLLASKSSSQTSGSLVSKSTSLASVSQLASK 
SSSQTSTSQLPSKSTSQSSESSVKFSCKLTNEDVKHEATFF/LIRLY 
KTVAWKLVAVGGFSPNVOTGELLNAAIEALKATLDVFFVPLKELADI. 
PQNKS SQES I VCELRCKSVYLGTGCGKSKENAKAVASREALKLFLKK 
, KWVKI CKRKYRG S E I EDLVLLDEE SRPVNLPPALKHPQELL * CVQN 
I TA YN I WYLKRKTD FC I V * N 

5186 

B 

133 

345 

RLSERRNWTPQAMLYIiKGAQGRRYLLRPEPANGPLRPAHCRIDHAPI 
STYYILRRPEPSYRSSSYFPDDD* 

5187 

A 

2 

2168 

KESRRTVRRMLRGRSLSVTSLGGLPQWEVEELPVEELLLFEVAWEVT 
NKVGG I YTVI QTKAKTTADEWGENYFL IGPYFEHNMKTQ VEQCEPVN 
D A VRRA VD AMNKHG CQ VH FGR WL I EG S P YWliFD I G YS AWNLDRWKG 
DLWEACSVGI PYHDREANDML I FGSLTAWFLKEVTDHADGKYVVAQF 
HEWQAGIGLILSRARKLPIATIFTTHATLLGRYLCAANIDFYNHLDK 
FN IDKE AGERQ I YHRYCMERAS VHCAHVFTTVSB ITA I EAEHMLKRK 
PDVVTPNGLNVKKFSAVHEFQNIiHAMYKARIQ 

EKTLFLFI AGRYEFFKTKG AD I FLDSLS RLNFLLRMHKSD ITWVFF 
IMPAKTNNFNVETLKGQAVRKQLWDVAHSVKEKFGKKLYDALIiRGEI 
PDLNDILDRDDIiTIMKRAIFSTQRQSI^VTTHNMIDDSTDPILSTI 
RRIGLFNNRTDRVKVILHPEFLSSTSPLLPMDYEEFVRGCHLGVFPS 
YYEPWGYTPAECTVMGIPSVTTNLSGFGCFMQEHVADPTAYGIYIVD 
RRFRSPDDSCNQLTKFLYGFCNMSRRQRFIQRNRTER\LS\DLIiDW\ 
RYLG\RYYQ\HAR\HLTLRQSFSKINFHVGT*HAPPTDRKDFKYPR\ 
PSCSYPPFSFQGSSGP"SSP\Q\SSDVEDEVEDERYDEEEEAERDRLN 
IKSPFSLSHVPHGKKKLHGEYKN 

5188 

A 

1 

1347 

MKRLLERRNWTPQAMLYLKGAQGRRFISDQSRRKDLSDRPLPESVSS 
PERRG S EAPSDTVRPRAARVGS S S SWMP S AGAGS SRVQPRANGGKGG 
PGWRCGSPSLIGLRLFSQPSGCQERRSPNPQLIiTIPEAATILLASIiQ 
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KSPEEQFSSKAIWEGAVEVQRKASESKQFEKCLHKEEQPKAVSEPEW 
REEGIRACEGVYGERLVTQKKIDPICSKLKVILHPEFLSSTSPLLPM 
DYEEFVRGCHLGVFPSYYEPWGYTPGGLGPFPPLSAYKDIGAGLKPS 
FGLF PL AAECTVMG I P S VTTNLS G FG C FMQEHVADPTA YG I Y I VDRR 
FRSPDDSCNQLTKFLYGFCKQSRRQRIIQRNRTERLSDLLDWRYLGR 
TEGFKYPRPSSVPPSPSGSQASSPQSSDVEDEVEDERYDEEEEAERD 
RLNIKSPFSLSHVPHGKKKLHGEYKN 

5189 

A 

1 

272 

QTEGFKYRRPSSVPPSPSGSQASSPQSSDVEDEVEDERYDEEEEAGF 
G/RYV*ESPSILYAVPYSDSCCQVTPSLHCSHFFILFPKTDKSS 

5190 

A 

2 

603 

PRVRERGVSDSCCQQAPGQSEKEGQIPS\NPSGAS*HSLPMPRC*AT 
VSHRSLCPEWPAPRPCNSPSPSPQEPPQPPPLWPLLGVILAPQNVTW 
SVWCAGSPTAVPGCPSCWPASMPSAPSA*SSCCACRTTPGPSPARCA 
ARSPPSPGASSAACATMRRWWGSWPSHAQRYRSVLRGWWILLTWQQD 
TPAWWERMDRMK 

5191 

A 

225 

432 

SNQ I RQMEEVKI S KKS KVG I LP FVAEFEE FAGLAE S I FKNAERRGDL 
DKAYTKL I RG VFVNVE KV ANE 

5192 

A 

16 

1420 

PS VRRNQRFL * S CKDFD YWHN * RKQEGCYT/ VLEKKVLSNRKQRVF/ 
DGS SGKLTGSTS SLNKLS VQS SGNRRSQS SS LLDMGNMS ASDLDVAD 
RTKFDKIFEQVLSELEPLCLAEQDFISKFFK\HSNIKVCLELWAGSR 
GPGMGGTIITAT*LLGTPLPVSSEKDMIRQMMIKIFRCIEPELNNLI 
ALGDKIDSFNSLYMLVKMSHHV\ *LQQNVDPASFLSTTLGNVLVTVK 
RNFDKCI SNQIRQMEEVKI SKKSKVGILPFVAEFEEFAGIAES I FKN 
AER\ RGDLDKAYTKLI RGVFVNVEKVANES \ QKTPRD WMMENFHH I 
FATLSR\LKISCLEAEKKEAKQKYTDHIiQSYVIYSLGQPLEKLNHFF 
EGVEARVAQGIREEEVS YQLAF\ NKQELRKVI KEYPGKEVKKGLDNL 
YKKVDKHLCEEENLLQVVWHSMQDEFIRQYKHFEGLIARCYP\GSGV 
TMEFT 

5193 

A 

2 

391 

FFFFLRWSLA/SV/TQAGVQWRGLGSWQRPPPGFKSFSCLSLLSSWD 
YRRPP \ HAQLI FVFLVETS FTMLAGMVS IS * PRDPPASASQSAG ITG 
VSHRARLTLL I FKRCS IRDES FEVFL YVYLFKTN I LLKA 

5194 

A 

2 

302 

KTLQSGCPRLEVQWCNLGS LQPL/ PF PGSK* FS CLSLPG \ S WD YRS V 
PPRSANF\*FLVETAFHHVGQAGLELSTSGDPPVSASQSAGITGVSH 
SARQ*RAF 

5195 

A 

2 

388 

RATPRHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLT\ADLSAE 
TLQSRREWGP/IFNILKEKNFQPRISYPAKVSFISER*IKYFTDKQM 
LRDFLTSRPALKELLKEALNMER1WRYQPLQNHAKM 

5196 

A 

1 

578 

VAAKCMHKDAKKGFIRLDMS*FQERHEVAKFIGSPPGYVGHEEGGQL 
TKKLKQCPNAWLFDEVDKAHPDVLTIMLQLFDEGRLTDGKGKTIDC 
KDAIFIMTSNVASDEIAQHALQLRQEALEMSRNRIAENLGDVQISDK 
ITI S KNFKENVI RP I LKAHFRRDEFLGRINE I VYFLPFCHSEL IQLV 
NKEC 

5197 

A 

3 

1618 

MKLLRTSEAKYQEKQRKREAEERRRFPLEQRLKEHIIGQESAIATVG 
AAIRRKENGVTOEEHPLVFLFLGSSGIGKTELAKQTAKYMHKDAKKG 
FIRLDMSEFQERHEVAKFIGSPPGYVGHEEGGQLTKKLKQCPNAWL 
FDEVDKAHPDVLT IMLQLFDEPLDGI QVHS YDQTRMCPLGALCLVDR 
WKGDMWAGILHFVCHSLSSEAYVYQFNPFNNPPKLAIITASIEQMW 
KLKQKSALWDQEVEEDLGHALPGQSPWLARLKFFDHKGS S SGSGRGS 
SCCLDCKVIHLAKCHLLAVDVPSSAAWLQTLCQNPLRKGSWTLEPTN 
NPPELSALEMLENSLCSPTWEGSQFWVTVQRTEAVEHCGLHASYVLR 
VEAERLTLLTMGAQSQTAQGNDIFQADETAIHRQKAQGKARQGHNAL 
RADSHERKWEGKFSSPPGPQELLESPPAL.YAEPLDSLHIASGPSQG 
SLYSDLlXlGTIARAGEGVQWKKPIiYWDLYEHEQQQLLKAKLTDPKED 
PIYDEPEGLAPALPQGLYDLP 

5198 

A 

495 

653 

MTLTS IDPVLTTFFNLECPL/ VFTATAS SGLGEVE * AI AJLN 

5199 

A 

1 

1433 

MPESNCSLTWRHSVGRAVAAQGGEHSSQGASLVLGEPRGGSSWLLGL 
EGDTGLWGGLLKEGRLTDGKGKT ID CKDAI F I MT SNVAS DE I AQHAL 
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QLRQEALEMSRNRIAEN\LGDVQISDKITISKNFKENVIRPILKVRG 
LSWHFLLEPKAHFRRDEFLGR\IGEIVYFLPFCHSELIQLVNKELNF 
WA\KRAKQRHN\ ITLLWD\REVADVIiVDG\ YNVHYGARS\ IKHEVER 
\ RWN\ QLAAAL * SRTLLP\GGCTLRI TVEDSDKQLLKSPELPSPQA 
EKRLPKLRLEIIDKDSKTRRLDIRAPLHPEKGRLTDGKGKTIDCKDA 
IFIMTSNVASDEIAQHALQLRQEALEMSRNRIAENLGDVQISDKITI 
SKNFKEIA^IRPILKVRGLSWHFLLEPKVLTTFFNLSVLYLHGNSIQR 
LGEVNKLAVLPRLRSLTLHGNPMEEEKGYRRARGHSLHQGIEKYICE 
SDDI FRKYCNNQFL 

5200 

A 

1823 

4712 

FFKKNIiCASAHYY/TWCVPLPFFFFLRQGL/NSIGQAGVQWCNHSSL 
QPCPPRLKLSLHLSLPSSWDHRHAPLRMT/NFFKYLRRGLPVLPRLV 
SNSWAQWLPS/SASQSIGITDMSH\*PAQLPNSLPFFKRQFRSISP 
GLGMQ*CDHSSL*PQTLGIiKQFSHQPPFYFFVQMGSCYVAPSWS*TP 
KENTTI IQHPHMYTLKTTVTFLYTELYSDI A 

5201 

A 

3 

298 

RQSCSVA*AAVQWRDLSSLQPPPLGFKRFSCLSLLSSWDYRRPPPRS 
VNF * K I FL IETG FHHLGQAG VERLTS \ D P PG S AS QS AG 1 1 G I SHCTVi 
PELFK 

5202 

A 

1 

422 

APMPAATMTARSGPAAISTRCPWRPRSPSTPRPSWSGTCCAPTRSLR 
RAATASQGPS PATNTPS PRAAS PP * RVWAPRTET I KGHTAHSI4RTGA 
TTGPKLSAP/VATESGHPDSQKPPTHPGTSD\TILPPETASHPSTRR 
P*LPPETASHPSTRRP 

5203 

A 

123 

201 

KRGFCWVPQVGGQGHDFGLWQSPFPGLRQFSGFFLWKSWDYGHPPPC 
*ISSSSSSSSS\FSSSSSSSLKLWT*GDPLA*VSQRVGIPGLSHWP* 
PRLGGRGTILVYGNLHFRD 

5204 

A 

88 

167 

KINMG ILL KLVYRFN INP VKTPAGLFS S S SHE / 1 TE IDKR I LKF I WK 
CTEHR I AKQSLKRTKLEDFLLNFKTYYEATE I KTS * YRFNINPVKTP 
AGLF 

5205 

A 

179 

474 

FGFQINVFKDPVADPNKRSKKGRLS LHRTP \ AGNFVTLE * GKGDLEE 
YG \ QDLLHTV\ * KNGKVTKAYS FDE I RKNAQLNI ELE AARSLGLMDW 
VCWCM 

5206 

A 

195 ; 

351 

LLPLQDKSEATRGALSQRPCRPRPNSLPLNLPEEETLRIARIFSSQY 
SQKD 

520? 

A 

1 

1080 

ERCTMPDEPLVRAPDKGTDSPSPPPLEETSNGGRMLHESLRHAVPIT 
RMQSSEDTEAG\PAYSDEDYEEDIIEPRTLNEITTVTDKTSPWSSV1 
SDTSEVISPQPDEVQREGPSCPSPGPFCREELMVKSSFLSSPERAVN 
PHLPRQD\ S PSQSLVACECEASKARVGGESAS ANPQP I PCPTLSGAQ 
QSSTFVGWSSPQTDQNKEPKSEAPA\EELKVATS*GSVDS\A*LAFK 
KLPLNLASQSRRENHKGPPIDSSDIRQRQVTTGSETSTKQSLLLPGP 
IWPNFFLPPQQLEASLRMLSLSATLPPAATTDQDKSEATRGALSQR 
PCRPRPNSLPLNLPEEETLRI ARI FS SQYSQKD 

5208 

A 

1 

1140 

AIVEDKHSGRSYDITSDLGNVLTSTSIAKTVNG*AESSDSGAESDEE 
DAQEDLMGA YHS D I DKKMMKI VADHKNLE V I VTNG YDKDG FVHD I QN 
DIHASSSLNGRSTVHVKP IDENLGQTGKS AVC IHQDINDDHVEDVTG 
IQHLTSDSDSEVYCDSMEQFGQEESLDSFTSNNGPFQYYLGVHSSPP 
MENSGFREDIQGPPGNGNIGNMQWAVEGKGEVKHGGEDGRNNSGAP 
\HREKRAGETDEFSNVRRGRGHRMQHLS\EGTKGRQVGSGGDGERWG 
SDRGSRGSLNEQIALVLMRLQEDMQNVLQRLQKLEMLTALQAKSSTS 
TLQTAPQPTSQRPSWWPFEMSPGVLTFAIIWPFIAQWLVYLYYQRRR 
RKLN 

5209 

C 

68 

331 

MLKNVLMRKCGVHDREKFLVLD YFCQS S VLAVLYLI LS LCY FP YQDV 
ASECEVKCMPTFQFFKJCGQKVGEFSGANKEKLEATINELV* 

5210 

A 

935 

1036 

QMFQFPPPKNPITTKTHTQITSQLHICISLTKQ 

5211 

A 

1 

3270 

MRPKTFPATTYSGNSRQRLQEIREGLKQPSKSSVQGLPAGPNSDTSL 
DAKVLGSKDATRQQQQMRATPKFGPYQKALREIRYSLLPFANESGTS 
AAAEVNRQMLQELVNAG CDQEMAGRALKQTG SRS I EAAL.EY I S KMGY 
LDPRNEQIVRVIKQTSPGKGLMPTPVTRRPSFEGTGDSFASYHQLSG 
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TPYEGPSFGADGPTALEEMP/PAVRGLPFPRSRPPRARPPAPAPTQG 
LRCQRRGS RGTL P AAGRALRAAAP AGAWGTPGLRS AAQPLL»PBQI>AA 
GDRGLRQPA\PKGQGGPPGAGLAFPPPAAGLYVPHPHHKQAGPAAHQ 
LHVI/GSRSQVFASDSPPQSLLTPSRNSLNVDLYELGSTSVQQWPAAT 
LARRDSLQKPGLEAPPRAHVAFRPDCPVPSRTNSFNSHQPRPGPPGK 
AEPSLPAPNTVTAVTAAHILHPVKSVRVLRPEPQTAVGPSHPAWVPA 
PAPAPAPAPAPAAEGLDAKEEHALALGGAGAFPLDVEYGGPDRRCPP 
PPYPKHLLLRSKSEQYDLDSLCAGMEQSLRAGPNEPEGGDKSRKSAK 
GDKGGKDKKQIQTSPVPVRKNSRDEEKRESRIKSYSPYAFKFFMEQH 
VENVIKTYQQKVNRRLQLEQEMAKAGLCEAEQEQMRKILYQKESNYN 
RL KRAKMDKSMFVKI KTLG IGAFGEVCLAC KVDTHALYAMKTLRKKD 
VLNRNQ VAHVKAERD I LAEADNE WWKL Y YS FQDKDSLYFVMDY I PG 
GDMMS LL I RME VFPEHLAR FY I AELTLAI ES VHKMG F IHRD I KPDNI 
L I DLDGH I KIiTD FGLCTG FRWTHNS KYYQKGSHVRQDSMEP SDLWDD 

■t r r<XT/~»T3 f~*f~*T\T2 T VT»T PAD J\ D VAUAD PT TVUCT \Tr"PTiVTVT TV DDITT T D VPVT 

VSNCKCoDKJLKI JjiSyKAKKyrlUKl-LiAiio JLVCj I PW x 1 APIWIjIjKJnaj x i 
QLCDWWSVGVI LFEMLVGQPPFLAPTPTETQLKVINWENTLHI PAQV 
KLSPEARDLITKLCCSADHRLGRNGADDLKAHPFFSAIDFSSDIRKQ 

DUDWDTT QRPMT)T\ ^TtfTn^P\n^T?PQPl/iTKJT>r , fyf? / PT7QTV2i WDTT.TQPM 
xr >vzr I Vrl J. t^titrriLJ X \ olN r Uc V UHjCjO trvil>4LJ\~.\Jl\./ CiV9i IvrtriL' X Xi lOrll 

NKHPEHAFYEFTFRRFFDDNGYPFRCPKP\*EQKLHKAESSDLESSD 
VDOTPG PO P VYV 

J-J V *-J\J, X IjO \^ XT V X V 

5212 

A 

3 

274 

LERLIFVGGLGQFFPPPKLRFFPKIPQLVFFSPPFRKKIFPCPPRLS 
LGPPRVLLKGPPSSSSSSSSSKKC*PLMFYCHRRWPTGVYVI,P 

5213 

A 

403 

776 

QGPDHFSLFWHLLQAPGLSPCLKWATALETPPSPNGLPIVPVFPGSS 

rCi v LA.U'I v l X \ orix J.oJSi J-kjl* JLilrlSJ ]SJltiSJSJ\S*j\j^h' exit? FK-N rbr ixr ArobulVv 

G\ YI/KVFEPKLNYSDHDVIKWVR*\ IGR\SGI Y 

5214 

A 

165 

440 

PPRQAKMQNLAAPGSHSQSPW/TLRPKAIj*LTPSQIFSA*RLKTDTA 
RSPRKPPSFQGPVSLASITWGIDGQASKPLKTPQLWCOLRQYSFK 

5215 

A 

2 

525 

AAED * HCP I ASEXY KT I T \ Elj WVTIj P VEG KS VP FL I NTEATH S TJj PS 
FQGPVSLASITWGIDSQAYKPLKSSQLWCQLGQYSFMHSFLVIPTC 
rv Fljij^yU 11j I iVubAolj x XF^xjyFrilj jLAx JUr Fo JjJvF Fori Fir J_iVoF.riJL» 
NPHIWDMSTPSLVTDHAPLTIPLKPNHPYPVQC 

5216 

A 

1 

425 

PGNEIYR/KGTISFFEIDGRKKKSYSQNLCLLAKCFLDHKTLYYDTD 
PFLFYVMTEYDCKGFHIVGYFSKEKESTEDYNVACILTLPPYQR/RG 
Y/GKLLIEFSYBLSKVEGKTGTPEKPLSDLGLLSYRSYWSQTINEIS 
EITS 

5217 

A 

25 

193 

LGGCEDSRRKGAPRCFLSRGDHSWEPRD * PPPWLFPSNSC * * LAAGR 
PSPVRSSPS 

5218 

A 

1 

1383 

MAEVGEIIEGCRLPVLRRNQDNEDEWPLAETLSVKDISGRKLFYVHY 
IDFNKRJjDEWVTHERLDLKKIQFPKKEAKT^PTKNGIjPGSRPGSPERE 
VKRKATEVVSPATPVPSETAPASVFPQNGAARRAVAAQPGRKPvKS 

gtdedsqdssdgipsaprmtgslvsdrshddivtrmkniecielgrh 

RLKPWFSPYPQELTTLPVLYLCEFCLKYGRSLKCLQRHLTKCDIiRH 

ppgneiyrkgtisffeidgrknksysqnlcllakcfij^hktlyydtd 

PFLFYVMTEYDCKGFHIVGYFSKEKESTEDYNVACIIiTLPPYQRRGY 

gklliefsyelskvegktgtpekjplsdlgllsyrsywsqtilbilmg 
lksesgerpqitineiseitsikkedvistlqylnlinyykgqyilt 
lsedivdgheramlkrllridskclhftpkdwskrgkw 

5219 

A 

1 

1617 

maevgei I egcrlpvlrrnqdnedewplae ils vkdisgrklfyvhy 

TT^PTtfVHDTT^PWV'TTOPPT.'nT.TrK' TnPPJfTTFfcTn^PTVWfiT.PfJQPPfJ^P'FPP . 

VPASAQASGKTLPIPVQITLRFNIiPKEREAIPGGEPDQPLSSSSCLQ 
PNHRSTKRKVEWSPATPVPSETAPASVFPQNGAARRAVAAQPGRKR 
KSNCLGTDEDSQDSSDGIPSAPRMTGSLVSDRSHDDIVTRMKNIECI 
ELGRHRLKPWYFSPYPQELTTLPVLYLCEFCLKYGRSLKCLQRHLTK 
CDLRHPPGNEIYRKGTISFFEIDGRKNKSYSQNLCLIAKCFLDHKTIi 
YYT>TDPFLFYVMTEYDCKGFHIVGYFSKEKESTEDYNVACIIiTLPPY 
QRRGYRKLLIEFSE\YELSKVEGKTGTPEKPLSDLGLLSYRSYWS\Q 
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1 TILE IL \ MGLKSKSGERPQITSNDI SE ITS IKKEDVISTLQYLNLIN 1 
YYKGQYILT\LSEDIVDGNERPMLK\RLLRIDS\KCLHFHSQRTWSQ 
EVDSGDQDTCPHCKCQRPASQDLGAD | 

5220 

A 

890 

1537 

MLLMFICDLLLTLSCSIALGYELSKVEGKTGTPEKPIiSDLGLLSYRS 
YWSQTI LE ILMGLKS ESGERPQ I TI K* AWRCLPGGTWHGLS VPGLSL 
SGLLGTDKGPQGT*PVLSHSEISEITSIKKEDVISTLQYLNLINYY/ 
KGREAGRGDRCVGCRVQS SVG * PPAEPI S S AQGQYILTLS ED I VDGH 
ERAMLKRLLRIDSKCLHFTPKDWSKRGKW 

5221 

A 

2 

435 

LRDGVLLCHPGWSTVAQS*LTAALNFW\VRGSSCLSLL*LVLRNSPL 

HPANLFFYFCR/DKHLTILTRLVLNFWAQVILRPWLPKGLGITGMSH 

CTQPNFKNQRKGYCQVNFNNQTKTLSLAGNTYFTPILFFGNKHVTSY 
NFPSM 

5222 

A 

3 

384 

WAAEVPS FFFFFLRTES CFVTQAGLQWHDLGS ATS I / S PGSSDS PA 
SASQVAGITGRHHYAIiA*FFVFLVDMGlHHVGSGWSRNSLNIiVNLPC 
LGPPQYWGLQGMSPPMPRPLKCPSLKGKMPNSHP 

5223 

A 

2 

322 

LYFFSFETESCSVTQAGVQ/WWCDLGSLAT\PPP\GSSNSCASAS*V 

ARITGVHHHAQLMFVFLVETGFHHFGQSGLKPLSDPPASASQSAGIN 
RCEPPGTGHHYIFSK | 

5224 

A 

1 

2535 

MQPLDFSSGGSDPNISLSEKIRDQLWGQLIPDCYVELEKIILSERK 
J NVPIEFPVIDRKRLLQLWENQLQLDENELPHAVHFLNESGVLLHFQ 
DPALQLSDLYFVEPKWLCKIMAQDVSSIFGLYIRDILTVKVEGCPKH 
PKGIISRRDVEKFLSKKRKFPKNYMSQYFKLIiEKFQIAIiPIGEEYLL 
VPSSLSDHRPVIELPHCENSEIIIRLYEMPYFPMGFWSRLINRLLEI 
SPYMLSGRGCILLGQWDHIDSLMEEWFPGLLEIDICGEGETLLKKW 
ALYSFNDGEEHQKILLDDLMKKAEEGDLLVNPDQPRLTIPISQIAPD 
LILADPPRNIMLNNDELEFEQAPEFLLDCFVCIHLYPSSDYISRHYM 
RTINIVQTGFAKCRWRVTVHGADHGDGSFGSVYRAAYEGEEVAVKIF 
NKHTSLRLLRQELVVLCHLHHPSLISLLAAGIRPRMLVMELASKGSL 
DRLLQQDKASLTRTLQHR IAIiHVADGLRYLHSAM 1 I YRDLKPHNVLL 
FTLYPNAAI IAKIADYGIAQ/ S LL *NGD * KTI QRGTPGFRAPEVARG 
NV I YNQQAD VYS FGLLLYD I LTTGGR I VEGLKFPNEFDELE I QGKLP 

DPVKEYGCAPWPMVEKLIKQCLKENPQERPTSAQVFDILNSAELVCL 
TRR I L L P KNV I VECMVATHHN S RNAS I WLG CGHTDRG QL S FLDLNTE 
G YTS E E VADS R I LCLAL VHLP VEKES W I VS GTQSGTLL VTNTEDGKK 
RHTLEKMTDS VTCLYCNS FSKQSKQKNFLLVGTADGKLAI FEDKTVK 
LKGAAPLKILNIGNVSTPLMCLSESTNSTERNVMWGRMWHKDFLLF | 

5225 

A 

2 j 

223 

LGVAFQCFFCFWFVELFGFVGLQLLSNLEKFQPLCLQIFFLSPCPIR 
SLYI/C*SSKWSQLTVTLLVFPVFFSL 

5226 

A 

236 

1626 

IAAAAHPIAGLVGGEAWIGGAKSWLSFSAGSSCMAEMATATRLLGWR" 1 

VASWRLRPPLAGFVSQRAHSLLPVDDAINGLSEEQRQLRQTMAKFLQ 

EHLAPKAQEIDRSNEFKNLREFWKQLGNLGVLG\ I TAP VQYGG SGLG 

YLEHVLVMEEISRASGAVGLSYGAHSNLCINQLVRNGNEAQKEKYLP 

KLISGEYIGALiAMSEP\NAGFWMVSMKLKAEKXGNHYILNGNKFWIT 

NGPDADVLIVYAKTDLAAVPA\SRGITAFIVEKGMP\GFSTSKKLDK 

LGMRGSNTCEL I FEDCKI PAANI LGHENKG VYVLMSGLDI*\ ERLVLA 

GGLLGL\MQAVL\DHTIPYL\HVREAFGQKIGHFQ\LMQGKMADM\Y 

TRLHGLFGQYVYNVAKACDEG\ HI I PKDCAGVT LYAAECATQVALDG 

IQCFGGNGYINDFPMGRFLRDAKLYEIGAGTSEVRRLVIGRAFNADF 

5227 

A j 

1 

867 1 

l^!AiRb\l J GWKV/u>WRIiRPPI J AGFVSQRAHSLLPVDDAIKGVLR/E 1 
S QRQLRQTMAKFLQE \ NLAPKA\ RE I DRSNEFKNLRE F \ WKQLGE PW 

AVFGASQPLVQYG\GFRPGAYLGSMLLG*WEEISPSFSGAVGL\SYG 

AHSNLSHQPSLLRNWEFRAQKEKYI»PEG*FSG\EYIGAIjAMS\KPHA 

GSDVVSMKLKAEKXGEATLNIiGAKMGRGNHYI LNGNKFW ITNGPDAD 

VLIVYAKTDLAAVPA5RGITAFIV^KGMPGFSTSKKLDKIjGMRGSNT 
CEL I FEDCKI PGK 
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5228 

A 

3 

391 

SHHHPLAAPSLGEAAISRSFQSLACSPGLPAADRLSYSGRPGSRQAG 
LGRAGDSAVLVLPPSPGPRSSRPSMDSEGGSLLLDEDSEVFKMLQEN 
REGRAAPRQS S S FRLLQEALEAEERGGTPAFLAS SLS PQSFLARLQG 
/ GLATP PKLHTCEKCS TS I ANQAVRI QEGRYRHPG \ C YTCA\ DCGLN 

LK\RRGHFVW3DELYCEKHARQRYSAPATLSSRA*APFGSCRKPWKL 
RREVARQPSWPAH 

5229 

A 

2 

384 

FVHFQSKKTMRKICLFFFFSSETGSRSVSQAGVQ/WARVIAHCNLKL 
LD S S DL PAL ALAS * VARTVGM YHRGLACVY F 1 1 Q I NL I L Y FLAQMNL 
KKHQTFLFYEMLNLIASQAGKIYSSNRNIKVFWPQ 

5230 

C 

301 

369 

MPTLYNHCFLQLFTI ISKLISY* 

5231 

C 

210 

329 

MWNLCIWRVNCTINNTNMPTLYNHCFLQLFTIISKLISY* 

5232 

A 

2 

268 

FFFLKRD\RVSFCHPGLSAVAQL*LTVALNS*\VKGSSYLSFQS/SW 
HYRYVLPYLANFKNFFFKRRELTLFLRLILTSWAQVSFLLQPPK 

5233 

A 

1 

415 

LFAGPSAMFYLAAAVSDFYVPVSEMPEH/KIQSSGGPLQITMKMVPK 
LL S PL VKDW AP KAF 1 1 S F KL ETD PA I V I NRAR KALE I YQHQVWAN I 
LESRQSFVFIVTKDSETKLVSTHSFYR*QKLK*KALIGSKIVQGS 

5234 

A 

3 

705 

PVRFLDNFSSGRRGATSAEAFLAAGYGVLFLYRARSAFPYAHRFPPQ 
TWLSALRPSGPALSGLLSLEAEENALPGFAEALRSYQEAAAAGTFLA 
VE\FTTLADYLHLLQAAAQALNPLGPSAMFYLA7VAVSDFYVPVSEMP 
EHKIQSSGGPLQITMKMVPKLLSPLVKDWAPKAFIISFKLETDPAIV 
INRARKALEIYQHQVWANILESRQSFVFIVTKDSETKLLLSEEKKK 

5235 

A 

200 

636 

GPCAMFYLAAAVSDFYVPVSEMPEHKIQ\SSGGPLQITMKMVPKLLS 
PLVKDWAPKAFI I SFKLET * PR/LI VINRARKALEI YQHQVWA\NI 
LGVKDS PLCLL * PKDS E PKLLLS EEE I EKG VE I EEKI VDNLQSRHTA 
FIGDRN 

5236 

A 

17 

424 

LWISPCGGIPGTAAAAATMSDQQLDCALDLMRRLPPQQIEKNLSDL 
IDLVPSLCEDLLSSVDQPLKIARDKWGKDYLLCDYNRDGDSYRSPW 
SNKYDPPLEDGAMPSARLRKLEVEANNAFDQYRDLWRRMKL 

5237 

A 

1 

338 

SDRQLDC/ ALDLMR/ RLPPQQIGKNLSNLIDLVPSLCEDLLSSVDQP 
LKIARDKWAKDYLLCDYNRDGDSYRSPWSNKYDPPLEDGAMPSARL 
RKLQVEACFCHTSLSVLSTFP 

5238 

A 

2 

160 

ENK IRS TLNE I YFGKTKD I VNGLRS VQTFADKS KQEALKNDLVEALK 
RKQQC 

5239 

B 

46 

426 

MSDQQLDCALDLMRRLPPQQIEKNLSDLIDLVPSLCEDLLSSVDQPL 
KIARDKWGKDYLLCDYNRIX3DSYRSPMEVTKYDPSLGRMGGNAVKL 
RAEKS LE VGKANKWP FDQ Y PKTLLLLELLD PG * 

5240 

A 

2 

93 8 

PRVR * APGPAAGGRATDAGTGTAAAATTMSDQHLDCALDLKRRLRPQ j 

QIEKNLSDLIDLVPSLCEDLLSSVDQPLKIARDKWGQDYLLCDYNI 

DGDSYRS PWSNKDDPPLEDGAMPSA\ RLRK\ LEVEANNAFD\ QYRDL 

YF*KVGVSSV\YLWDLD\HGFGLS/VILHKRKGWEDGIKRRSKGCW\ 

DS I \HWEVQEKSSGRTPHYKLTSTVMLWLQTNKSG\ SCTMNLGGSL 

TRQMEKDETVSDCSPQIS/ATIGRLVEDMENKIRSTLNEIYFGKNKG 

YRSMGLRSV\QTF\ADNSKQEALEELTLVGGFERESKQC 

5241 

A 

1 

1911 

MTLEDPPGFCERLLSRVAALFPALRSGGFQAHYRAHSSSLCPAENPW 
XHjTCCLLNNIJJEDGDLVAFSSDEELimMSYVKDDIFRIYINDSPAV 
ENISSFCFVSLKSAGRADALKERQSGSTHAHVQLRLQGSRGNSGHSQ 
RPSNYKVLAGSQRDFNGTDITAAAAAPSGLYLDNWNKQSVSPPIPH 
LRALLS S SDDP PAEVD I FELLKVS YEKFS S LRAED I EQMRFKQRLKV 
IQSLEDTAKRSVDRFSELQLSRQTQHEHFKELDNFYKISSHEQSYGS 
QPPILQPALGAAQHCASHSVTCQSTSFLQLKKGKTKLYAEKQLNSTW 
SRQTSGANLRVLHEAREAQSSGPAPSISRMDARQALVLLPSSRESSQ 
KRTRTCSPRGDEDPGKFKET^ELKMRKQFGMPEGEKLVNYYSCSYWKG 
RVPRQGWLYLTVNHLCFYS FLLGKEEKKRHLS ICKI IPVCAKWGVQF 
SSEMLHQHRWVLRQCDPKHLGSLPDKSQAPPTLTHQDQKRPGLSLLG 
PGCCAGTLPLQQLTGLPGCRFFFRSGNIEHPEDKLFNTSVEVLPFDN 
PQSDKEALQEGRTATLRYPRS PDGYLQ IGS F YKG VAEGE VDPAFGPL 
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EALRLS I QTDS PVWVI LS E I FLKKAD 

5242 

A 

3 

803 

ESVAAALSPLG\IEVDIDVEHGGKRSR\LTPVS\PEVS\ATGEKSSS 
QPSSCC\SDPSKPGGNVEGATQSLAEQMRKIALESEGRPEASLCPFP 
PPGTTGQPSDLWPAPPPHSQHLCSPTYKPEGAGCWLAVDVCCHLKSR 
WEPARTL*LLGLCEWPLFPLDPCSLNCTCACVXiPDCLPGARAMVRLG 
LLRVSPVCSW*VLFQEQMESDNCSGGDDDWTHLSSKEVDPSTGELQS 
LQMPESEGPSSLDPSQEGPTGLKEAALYPHLPPGK 

5243 

A 

2 

1511 

RGGCDRDGPFSASSPLAMASLTVKAYLLGKEDAAREIRRFSFCCSPE 
PEAEAEAAAGPGPCERLLSRVAALFPALRPGGFQAHYRDEDGDLVAF 
SSDEELTMAMSYVKDDIFRIYIKEKKECRRDHRPPCAQEAPRNMVHP 
NV I CDG CNG P WGTRYKCS VCPD YDLC S VCEGKGLHRGHTKLJVF PS P 
FGHLSEGFSHSRWLRKVKHGHFGWPGWEMGPPGNWSPRPPRAGEARP 
GPTAESASGPSEDPSVNFLKNVGESVAA\ALSPLGIEVDIDVEHGGK 
RSRLTPVSPESSSTEEKS\SSQPSSCCSDPSKPGGNVEGATQSL\AB 
Q\MRKIALE\SEGRPEEQME\SDNC/SQGGDDDW\THLSSKE\VDP\ 
STGELQS\LQ\MPESEGPKLSGTPPREG\PTGLERKLPLYPHLP\PE 
V\DPRLDLSPFSQIACSMGLSS*LKGGLGSTRVPGRPKNY*HSERLW 
DTIQ\YSK\HPPAWDHFLPTFFLRAPLLVFIWFKLCVELAGLb 

5244 

A 

24 

493 

HWQVLPRVKKRLSPLKTHNCTUjPSTKLSAVRDYLFYSENLNFPQLNK 
DS * RGC * QALK/ S PTAWFVLS LSHSFS FAVSEGCFFHRERQVASTS 
MEQLLSLLAGI PFVEDS CPLQKELS KTLPLTLEVLGVSMFFKFS WLP 
VIWVFGFISQLSGLES 

$245 

A 

1 

4547 

MRNPVHSHIGATVSHQTDFGANVPNMPALQSQLITSDTYSMQMQMIP 
SNSTRLPVAYQGNQGLNQSFSEQQVDWTQQCISKGLTYPDYRPPPKL 
YRYSPQSFLPDSTIQKQNFIPHTSLQVKNSQLLNSVIjTLPSRQTSAV 
PSQQYATQTDKRPPPPPYNCRYGSQPLQSTQHITKHLSMEVPQSREM 
LSSEIRTSFQQQWQNPNENVSTIGNFTNIiKVNTNSKQPFNSPIRSSV 
DGVQTLAQTNEEKI MDS CNPTSNQVLDTS VAKEKLVRD I KTLVE I KQ 
KFSEIiARKIKINKDLLMAAGCIKMTNTSYSEPAQNSKLSLKQTAKIQ 
SGPQITPVMPENAERQTPTWESAETNKTQCMLNSDIQEVNCRRFNQ 
VDSVLPNPVYSEKRPMPDSSHDVKVLTSKTSAVEMTQAVIiNTQIiSSE 
NVTKVEQNS PAVCET I S VPKSM S TEE YKS KI QNENMLLI*AL»I*SQARK 
TQKTVI.KDANQTIQDS/ITRQL*NESKYPNDW*PTEFEETWNSKYF* 
S/ILQILQKKRSVINSEQTQQQLEFQSLLTSTLRVLVQL/SGNSNSQ 
NKISNPSQQTALSM/VNAQL*VFRYKYNKGNRTSDSCSVTVSSVRGQ 
NIVCQRNNTCSVT*NSVSRY*RGQCLWFYKINWQKMQRQLLL*KLLL 
VDQ* QVQQHQPRFFH * LRRKSRMSQLMVI QKSHLMS IKESITN*SQ/ 
LIHSPMNDQQISQESRNSTWSSDTLQIDNICSLVEGDTSYNSQIAK 
IFSSLPLKMVEPQKPSLPNQQGIGCREPEKQLDNTTENKDFGFQKDK 
PVQCTDVSHKICDQSKSEPPLESSFNNLiET/ QQSYSREK*FGACHLK 
KAQLTIRARQLAIQEDIYPQEIDASSNYTPQDPARNEIHSDKAPVLY 
LHDQLSELLKEFPYGIEAVNTREGSVGQQTTYQTSEDQTADKTSSDS 
KDPADQIQITIIiSSEQMKEIFPEQDDQPYWDKLAEPQKEEPITEW 
SQCDLQAPAAGQSRDSVILDSEKDDIPLLCIGLALHGLRRSTPVSV* 
FHQEL I \ DEEEKQKEQC S PLDTNS CKQGERTSDRDVTWQFKS L VNN 
PKTPPDGKSHFPELQDDSRKDTPKTKHKSLPRTEQELVAGQFSSKCD 
KLNPLQNHKRKKLR FHEGNLS LQ * * NDS I L * TS FS / EKPDRRNM * HR 
THVH*KQKQLFLPNKI>VYKKHSSLGQSLSPEKIKLiKIjKSVSFKQKRK 

T.nrM^XTR TP\ *T?VT<^TrTrPT^Tfr4'Rr\K"riQ\7fiaTC , V"T CnCT CMDWP'OIlTWT? 
±-iU \J\J IN t\- .L £\- \ O V XVT«-IVrvXTljrvVl^^rvvjo V VJM.JI c 1\JjoUoJjOIM h* is* tLii\J\ a. V ivc* 

KWSNTKSVDTKASSSKFSRILTPKJSYLQRQKHKEALSNKASKKICV 
KNVPCDSEHMRPSKLAVQVESCGKSNEKHSSGVQTSKESIiNGLTSHG 
Kl^KIHHSQESETYNIIiRNVKEKVGGKQPDKIWIDKTKLDKLTNISN 
EAQFSQMPPQVKDQKKLYLNRVGFKCTERESISLTKLESSPRKI*HKD 
KRQENKHKTFLPVKGNTEKSNMLEFKLCPDILLKNTNSVE\NGRM*S 
L I LGRS KPLCKFQE * KVQKKTG 

5246 

C 

191 

397 

MLPRAGLLLCGKLVPNNFSQPSRQILLCLGCDSPHLTVGHEGWKTTL 
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Pf57xfi7xTPVPTf2PITQT.T.nPf2\7C * 

5247 

C 

598 

744 

MSKVLNQSTYITVQCGYVS INS FYVSFYYS IKIAKRHLGTFFRTVLV 
I* 

524 8 

A 

164 

1 c c 

355 

NGSTGCIGIQANQlFN xRSIjSSFESGKFVKCTEQIjE1jVPGENIH\ *L 
LEGSKKKLTLKNLFRNM 

p lyl ft 

A 

1 

■5 no 
308 

KVQ5F v If YJjr lit *DGV*JjFRPGWSAVVJjSRIjTATSA\FRFKQFSC 
LSLPSSWDYRCPPLH1oANFCIF**RRGFTMLARLVSIS*PQVIRPPW 

5250 

A 

67 

367 

SDELRTPGRESLPAENISL*DRVNHQPAIESCTA*NYLYSINAKTKG 

£«l r iXJlL>'t-Crl3l r JxW V V X JlYV t\±J\— lit X V Xjl v Jlvrt.'OV_ WpC X 1 J K 1 » t 1 j fr^ fr** f*A t 1*1 1 i 

WAVMSR 


A 

1429 

XO^ J, 

QKQP * Q WPDQGPERGR 

5252 

A 

180 

411 

IPWIPPV/CPFHGHTHLE/VHPHGIiSISLYSPPADSPVQGFVVRLIS 
FTFSAliADLPRSGPWSDTTCHGCFTVAASSESIi 

5253 

A 

288 

532 

FRNLPLPIPSQGTDLIFSAGRLSRPGVRSSSDQFHFLCSGRPSSFRS 
LVGHHLSRMIiHCGSLLREPSRPIVRAPVSDEIS I 

C O C /I 

5254 

A 

9 

257 

I FTAVAF 1 I SESWKQPQCL * TNE * / I KLW YIHTMQYYS AIKRNEKLI 
HATTWMNSE1WQW*KARHKKPHIV*LHLYEGSQKRQI 

5255 

A 

6215 

7038 

RTVTTFLSKDSHGVYCAQGGKIPDHQNPQCNRKQHPVSTIlxhtl^ 
FCQLRKRKHNLSVNCINRNPFMSLK1^SWHSSLSVTQRHQQQSKI»HF 
QGSILIiH*PSQNI1j/SNI*KCINYC*HCSSVLLSYLFIETESYSVAQ 
AGVQWHDLGLliQLLPLRFKQFSCFSLPSSWDYRSAPSCPANFCILVE 
MGr CHVGyAGljKixuASoDPPAlxASQSAGITGV 

LFSCKVLSNPYHKGRIYLGRMCFLNSTWHLVKSTLFCPLFI 

5256 

A 

2 

513 

VLIQLVWIALLLVDPSRPVIjTKDEPASSSTSSFTSQSQKKGQQSGFL 
QSRN*GCIaLCE/VFQREEIY*VGMD*DSFRIKWFFNFIFHLTVITS 
SDIjH 1 HGB. 1 W W P1MTDH VCTRFF FV YDDCL VvYAYINxTjIj*GFY* YI 
LLVIVNYTGSVHWF I FYYLVLNVIiFCFIjLY I 

5257 

A 

1 

274 

STPPHSSQRACEPLPSFAQCPR*APSPPAPHCAHCPIPKMEERP/P 
ri xGK w Kiviixjlji> 1 iLAKbCIhQAPCI xIjJjNPHNVI^xjSRxjWVKGQ 

5258 

A 

3 

437 

GYLSEVVEEOTPPKMEKEGLEIMIGKKKGIQGHYNSCYLDSTLFCLF 
ArbbVljUj. VljJjKIr'lvbJ^JL/Vx^i ibEl yoLiJjRTEIVNPljRIYGYVCATK 
IMKIiRKILEKVEAASGFTSEEKD/PAGQKVQDCYFYQIFMEKNEKVG 
VPTIQ 

5259 

A 

1 

340 

BEl^TAPVQESPPLAMPPGNSHGLEVGSLAEVKENPPFYGVIRWIGQ 
PPGLNEVLAGLELEDECAGCTDGTFRGTRYFTCALKKALFVKLKSCR 
PDSRFASLARIWSYLLFSA 

5260 

A 

215 

385 

LPYKEFFLCAIGGQNGFNIPQVTPCPEVGEYLKMSLEDLHSLDSRRI 
QGCARRLLC 

5261 

B 

347 

1742 • 

MPNTOGSIGHSPLSLSAQSVMEEloNTAPVQESPPLAMPPGNSHGLEV 

GSxxAEVKENPPFYGVIRWIGQPPGLttEVLAGLEL^ 

EGTRYFTCALKKALFVKLKSCRPDSSSVLDTVLx^ 

TQELLRTE I VNPLRI SAGQKVQDCYFYQI FMEKNEKVGVPTIQQLLE 

WSFINSNLKFAEAPSCLIIQMPRFGKDFKLFKKIFPSLELNITDIjIiE 

DTPRQCRICGGLAMYECRECYDDPDISAGKIKQFCKTCm'QVHIiHPK 

RLITOKYNPVSLPKDLPDVTOWRHGCIPCQNI^LFAVLCIET 

KYGKDDSAWLFFDS^lADRDESlmEFVQITGVIGKGKETEGRVLTLHL 

I RAG S S VHVHCR OWM S L W * 

5262 

A 

1 

1547 

PVQESPPlxAMPPGNSHGLEVGSLAEVKENPPFYGVIRWIGQPPGLNE 
Vx^GLELEDECAGCxTOTFRGTRYFTCALKl^^ 

SLQPVSNQIERCNSLAIWEAYLSEVVEENTPTQKWEKEGIjEIMIG\K 
KiCGIQGHYNSCYLDSTLFCLFAFSSVLDTVLLRPKEKNDVEYYSETQ 
ELLRTEIWPLRIYGYVCATKIMKLRKILEKVEAASGFTSEEKDPEE 
FLNILFHHILRVEPLLKJRSAGQKVQDCYFYQIFMGKK*GKLGVPTI 
SGSC*EVJSFINSNLKFAEAP\SC1 J IIQ\MPRFGKDFK\LFKKIFPSL 
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ELN I TDLLEDTPRQCR I CGGLAMYECRECYDD PD I SAGKI KQFCKTC 

NTQVHLHPKRLNHKYNPVSLPKDLP\DWEIARHGCIPCQNMELFAVIi 

C I ETSHYVAFVK YGKDDS AWLFF \ DSMAD\ RDGGQNGFNI P\ QVTPC 

PEVGEYLEDVSWKTLHSLGLPGESQGCARRLLCDAIYVPCTQSPTMS 
LYK 

5263 

A 

29 

258 

LRKYHATDLIS *GS WGEF\DCCF\ SFLDGSLEDAVDGLL\LALQPH 
KEQYKEFQDLNQEVMNLDDILKVSTFSQKLD 

5264 

A 

98 

497 

THENNYNEKKDEIKIFKISLQISAQEEFPFFRRWKSCLCCPRLGVQW 
HDLG\SLQPPTSP/GFKRFSGLKLP/RAAWIYRYVPITSG*FLYFQ* 
RRGFTHVGXQADLDLLTSGDPPVWASOSAGlTGVSHHAOPPYr 1 

5265 

A 

156 

1047 

VPAGEARVQWHDLGSLQPPPP\GSSDSPASSSRVAGITGIKTNTICK 

KCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQ 

CKQQ(^FDRKDDRKKVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSS 

RAGHQEKEQYSRLSGGGHYNSQKTLSTSSIQNEIPKKKSKFESITTO 

GDSFS PDLALDS PGTDHFVI I AQLKEEVATLKKMLHQKDQM ILEKEK 

KI TE LKAD FQ YQES QMRAKMNQME KTHKE VTEQLQ AKNRELLKQAAA . 

LSKSKKSEKSGAITSP 

5266 

A 

156 

960 

VPAGEARVQWHDLGSLQPPPP\GSSDSPASSSRVAGITGIKTNTICK 
KCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQ 
CKQQCAFDRKDDRKKVDGKLLCWLCTLS YKRVLQKTKEQRKHLS SS S 
RAGHQEKEQYSRLSGGGHYNSFSPDLALDSPGTDHFVIIAQLKEEVA 
TLKJO^LHQKDQM I LEKEKXITELKADFQYQES QMRAKMNQME KTHKE 
VTEQLQAKNRELLKQAAALSKSKKSEKSGAITSP 

5267 

A 

4 

496 

DGFPSCCQAGVQ*HDLGSRQPPPP\GSSDSPASASRVAGTTGMHH*S 
RDVL I R I LELEEGLELCPHFS VRKLPKTGQI LTLMLSLAJPVKLAPQp 
KLS WMELTRTQQAVPS PFGRWGG I GRRKVL VP I AADGWASGQARGLL 
GGSGKGEGKATGCEREGTCLLQG 

5268 

C 

190 

435 

MFCYLGPHLSHDPILFAKVVRIGKSFMKEFQSDGSKQEDKEKTEVIL ! 
SCLLS I TDQVLLPSLSLMDCNACMSEELWXMFKTF 

5269 

A 

114 

269 

QLYINHTPPPLSKSKEREMDKKDLDKSRTLKQKQKPLVNWHNSWF* 
DHSN 

5270 

A 

1 

1071 

MKGVMNVALVHSSQEPNGETPSSLYRVAAVL.LOFNT.Tm nm wm t 
PADNCIMDEHKREIAEAKQIVRKLTMWLSSEKMDEREKEKEKEEEK 
VEKPPDNQKLGLLEALLKIGDWQHAQNIMDQMPPYYAASHKLIALAI 
CKLIHITIEPLYRSVTSWAVDHAGFLESDPCDSTVGHLLSRVGVPKG 
AKGSPVNALQNKRAPKQ7VESFEDLRRDVFNMFCYLGPHLSHDPILFA 
KWRIGKSFMKEQS KKVGEKDCCVAAKLRAQATRGAQGS PDWVPRFA 
FS PVRCSGDLGLVGGGWTYVAGEWGVGNKVDGEQYQI PETMGE IGTK 
TKDLKNGNVKIPTAYPFNWPIWPMENAI 

5271 

A 

1 

1231 

GR FLCCMLETVTRWHS DRAT YE KECGNY PG FLT I LRATG FDGGNKAD 
QLD YEN FRHWH KWHY KLTKAS VH CLETG E YTH T PJJ T T , T VT ,TK TT . pu 
Y PKVLNLGQALERRVHK I CQEEKEKRPDL YALAMG\ YKI IF* CDLQD 
* MTEG F * RL * NVLQC * SLS S AFRRL WE I MRS CFHRL VR VTTSD YQLP 
VYTG* *FF*SHV*YFLIGRKYNFIIIVNRSSRVKKKKPS*FKD*FIG 
KL*FLRKLGVL*KPSRTY*KWVFFNSLEPHSRMNNLLILFLKTSCLC 
FL I WVC I FKHFVRLSS YSGQLKK* EKS IP* YPENEFPSQRPPPPRNA 
VA\SVQNGP\GGGPSSSSIGSASKSDESSTEETDKSRERSQCGVKAV 
NKASSTTPKGNSSNGNSGSNSNKAVKENDKEKGKEK 

5272 

A 

2 

3383 

ITDQVLLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETY 
NSHPLLVK^nCAQTIDRAKYIMKRLTKENVKPSGRQIGKLSHSNPTIL 
FDYVCFEILSQIQKYDNLITPVVDSLKYLTSLNYDVLAC ILSNCI IE 
ALANPEKERMKHDDTTISSWLQSI^SFCGAVFRKYPIDLAGLLQYVA 
NQLKAGKS FDLL I LKE WQKMAG I E I TEEMTMEQLEAMTGGEQLKAB 
GGYFGQIRNTKKSSQRLKDALLDHDLALPLCLLMAQQRNGVIFQEGG 
EKHLKLVGKLYDQCHDTLVQFGGFLASNLSTEDYIKRVPSIDVLCNE 
FHTPHDAAFFLSRPMYAHHISSKY\DELKKSEKG\SKQQHKVHKYIT 
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SCEMVMAPVHEAWSLHVSKVWDDI SPQFYATFWSLTff^LAVPHTS 
YEREVNKIiKVQMKAIDDNQEMPPNKKKKEKERCTAIiQDKLLEEEKKQ 
MEHVQRVLQRLKLEKDNWLLAKSTKNETI TKFLQLC I FPRCI FS AID 
AVYCARFVELVHQQKTPNFSTLLCYDRVFSDI IYTVASCTENEASRY 
GRFLCCMLETVTRWHSDRATYEK\ECGNYPGFLTILRATGFDGGX«CA 
DQLDYENFRHWHKWHYKLTKASVHCL.ETGEYTHIRNILIVLTKILP 
WYPKVXiNLGQALERRVHKICQEEKEKRPDLYALAMGYSGQLKSRKSY 
MIPENEFHHKDPPPRNAVASVQNGPGGGPSSSSIGSASKSDESSTEE 
TDKSRERSQCGVKAVNKASSTTPKGNSSNGNSGSNSNKAVKENDKEK 
GKEKET^KK^KTPATTPEARVTjGKT^CJKRKPKFFRPMKTJT^KaPPTTrPR 

wiujiujiu^iuujiv a * *» a. a. xr i-ji-\-t\, v xj\j jvjls w i\Xil\x XVCi uj\l: XM I\L> A Xv/vfv El X <\i<li\ 

TPKSDKEKEKFKKEEKAKDEKFKTTVPNAESKSTQEREREKEPSRER 
DIAKEMKSKENV*RREEKTPVLGSLE\SPVSPGSDISQSLERGNQKR 
RKIDTHPSPSHSSTVKE\SFIEL\KESSAKIiY\lNHTPPPTVPRVRR 
EKMDKKDLDKSRERSREREKKDEK\DRKEREKGVHSKQRPVKLPPDL 
/TPKRRKEENGTMGVSKHKSESPCESPYPN\EKDKEKNKSKSSGKEK 
GSDSFKSEKMDKISSGGKKESRHDKEKIEKKEKR\DSSGGKEEKKHH 
KSSDKHR 

5273 

A 

442 

1457 

KHFLYI YAOMNCFS LRLKKKKKKNOLIjIjOVF^ «?ADP * <3 *3 <5 <3 <5 <S <3 "3 
GHKRHKKHKRNRSESSRSSRRHSSRASSNQIDQNRER*VIjPSSS*YF 
S I FS FNP * TRS GRKLLGKQDRLQ YE KTQ I KEKDRCPLS S S SLE I PDD 
FGGRSEDPRDFYNSYKTQAGSSKTEKPYKSE\DIFPARRNSSDSFCR 
NSEAKIYGYRRFEKDIEGRKEHYRRWEPGSVRHSTSPASSEYSWKSV 
EKYKKYAHSGSRDFSRHEQRYRLNTNQGE YEREDNYGED I KTEVPEE 
DALSSKEHSESSVK/NKFTSEFTEYI *SDS * I * KRKRK* GQKINTPK 
DIGHPLPQNAH 

5274 

A 

2 

457 

KLIFFPR\ KPSAPQEGRQFC * KIGNRPTQLTGPV\MPVRNVYK\ KEK 
ARVITEKEKNFQSPCLVFVMARAK\ARL\ FG IRAKKS PRKPQNQDVE 
KEKIKTLPGNFEISLAVMPGSPTWCVSVGTTWALGWGFTACDFLLPR 
GFGAFLKRQVQAPG 

5275 

C 

27 

158 

MHSKPTGQVSWLGMTTASLPGTNASSMWMPWRLEDHSNLDALP* 

5276 

A 

1 

320 

FFPNGNAFATG SDDATCRLFDLRADQELMTYSHDN IICGITSVSFSK 
SGRLLl^GYDDFNCxWWDALKADRAEITSYSCNHCGSVHFSIiLLCGT 
YRKQYSARLYTL 

5277 

A 

100 

659 

CPLVIRSCSSALWDIETGQQTTTFTGHTGDVMSLSLAPDTRLFVSGA 
CDASAKLWDVYSQCFFLSVYMMS * FAFS FQFFPNGNAFATGSDDATC 
RLFDLRADQELMTLTPHDNIICGITSVS\FSKSGRLLLAGYDDFNCN 
VWD AL KADRAG VIoAG \ HDNRVS C LG VTDDGMA VATG S WD S FL K I WN 

5278 

A 

16 

495 

AFSAPDTRLFVSG\ACNASAKLWN\VGEGMCGQTFTG\HESDINA\I 
CFFPNGNAFATGS \DDATCR\ LFDLRADQELMTYPHDNI I CGITSVS 
FSXKSGRLPPGMGTTNFNClWTOALKSRTGQVFIiAGXHDNPRQPAWG 
VTD\DGMA\VATGSW\DSFPQDLGN 

5279 

A 

5 

108 

SRGFTLLARLVXJ^S * PQ/ CDPPLPLAS QS AG I / TRLGMS YHA* P 1 1 1 
FETRSRSVTQDG/GVQ*CNHGLLQPWHPGLKESSCLSLPG/GHAPPH 
LANFIFCRNRVSLCCSGWS*TSSDPPLPLASQSAGIIGWV 

5280 

A 

1 

458 

PVARSWCRKTYVTPRRPFEKSRI^QELKLIGNAIiLRRLVRIGVIiDE 
GKMKLDYILGLKIEDFLERRLQTQVFKLGIiAKSIHHARVL»IRQRHIR 
VRKQWNIPSFIVRLDSQKHIDFSLRSPYGGGRPGRVKRKNAKKGQG 
GAGAGDDEEED 

5281 

B 

403 

629 

XVHLVKASCQACGQLDRVRKQWNIPSFIVRLDSQKHIDFSLRSPYG 
GGRPGRVKRKNAKKGQGGAGAGDDEEED * 

5282 

A 

1 

637 

PVARSWCRKTYGDPRGGPFEK\SRLDQ\ELKLIGGVWGSGTKREVL 
\RVKFT1^QDSAKAAPGNWLT\LDEKDPTGVLFERQTPLLR/RGLVP 
QLGCWDEGQKMK\L\DYIPGP*RLEDFFRETPCKTQVFQSLGLAKSI 
H\QPRFACLGSRQRHIRVRK\QV\VNIPSFIVR\LDSQK\HIDFSLR 
S P YGGGRPA\ RVKRKNAKKGQGGAGAGDDEEED 

5283 

A 

3 

295 

VCIGVLLCCLGCGVRWHDHASL* P* PPGLKRSSHLSLLSTWDHRHML 
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THPLNFFi\VEPRSHYVAQTGLKLLGSSSPPASASQSARITGVSHHT 

QSKK*FLFVEPRSHYVAQTGLKLLGSSSPPASASQSARITGVSHHTQ 
SKK 

5284 

A 

2 

284 

WFLTETSPFMWSNLG IGLS I SLS WGAAWGI YITGSS I IGGGVKAPR 
IKTKNLADAQNPSLFVKILIVEIFGSAIGLFGVIVAILQTSRVKMGD 

5285 

A 

1 

785 

MTGIiALLYSGVFVAFWACALAVACWVNSGTDWAGGIWLERLLPRSRL 
GFGLAGGDPACAGVCYTIFDLGFRFDVAWFLTETSPFMWSNLG\ IGL 
AISLSWG\AAWGHSILTGSSIHWEGS*KAPRDSKTKNL\VSIIF\ 
CEAVA\ I YGI IMGNCSLSNMT\ EPF\ SATEPQGHRQFGN * PWQGYSH 
GLGAGLHRKPCLNLFCG\VCVGIVGTWGCPWQMPQNPTASLCKRFSI 
VEIFWQAPLALFGVIGRKFFKTSMSERWVTR 

5286 

A 

2 

351 

DEDSLHSVPVAQMGNYQEYLKTLASPLREIDPDQPKRLHTFGNPFKQ 

DKKGMMIDEADEFVAGPQNKVKRPGEPNSPMSSKRRRNATIIHDGHE 
EKMENGQITPDGFLSKSAPSEL 

5287 

A 

1 

999 

NLLKTHKF \ LLGQDEDSLHS VPVAQMGNYQE YL KTLAS PLRE IDPDQ 
PKRLHTFGNPFKQDKKGMMIDEADEFVAGPQNKVKRPGEPNMSYV\T 
KRRRSMSLLLRKPQTPPTCN\NHVGGKGPPSASWFPSYPNLIKPTLV 
HTDAT 1 1 HDGHEE KMENGQI TPDG FL S KS AP S EL I NMTGDLMPPNQV 
DSLSDDFTSLSKDGLIQ\NLVVTHL*EEPKTAASP*MTKKTQ*HLLW 
ELCQIHYKSLLLWHKE/VNADIKHQLMKEVRKFGRKYERIFILLEEV 
QGPLEMKKQFVEFTIKEAARFKRRVLIQYLEKVLEKINSHHLHNNIS 
HINSRSSC 

5288 

A 

329 

661 

ITLSLLSFFNLRPSFALIiAQAGVHWRDLDSLQPPPLRFK*FSYLKSP 
\RSWDYGHAPPRPANSVLLVETGSLHVSQGGLILPTSGDPPASASQS 
AGITGVTNCARPPSLFS 

5289 

A 

46 

554 

NKTRKDPFPPYTPAHGHT\PPTPSLPFG/CSAHGHLKPRGSHCDQPP 
S TCRP PVQS PVAHASQVAGRTWA * KRALHL PGGG VKRKGTGS PKAPR 
SGCGLPYKLGSTARAAGSCSPAE/EPETFSPQGQPLPVNKGKLRRVG 
QPSFQCAPRDSCSRYFQFGEEQGRAIEHQIP 

5290 

A 

228 

388 

VHYCDRQSGKECVTCLTLAPVQMTFHAIGSSIEASHDQAALSALKQF 
SEQGLD 

5291 

A 

3 

257 

GPRERETGRTATAVEPDSCEERTYSNDTKRWCSSHPGGNGGVPERQH 
LG*ATGSWGAKPLRLAAATRASGNPGHRSLKCLETIF 

5292 

A 

3 

1379 

NEFFGEGKTLQAARHFAAMKALQALQNEPIPERSPQNGESGKDMDDD 
KDANKSE ISLVFE IALKRNMPVS FEVI KESGPPHMKSFVTRVSVGEF 
SAEGEGNSKICIiSKKRAATT\njQELKKLPPLPVVEKPK\HFFKXRPKT 
IVKAGPEY6QGMNPISRLAQIQQAKKEKEPDYVLLSERGMP*RREFV 
MQVKVGNEVATGTGPNKKAKKKK\NVAVAMLLQLGYKAS-INLQDQLE 
KTGENKGWSDPTPGFPEPTNNTPKGILHLSPDVYQEMEASHHKVTS/ 
GHYSRLFVTQRYEPTFKLFSSVYLPHRIVQSTFARHLLMNGTSSTAE 
A IGLKG S S P \ I PI/VP P VQP S KQLG I F S KGFKAFRVHYCDRQKWQRVC 

DLSDISPCADGL/ shaigssieashdqaalsalkqfseqgldpidgs 

1WIEKGSLEKQAKHLIKKVDHNQAPPGSIAHDCKISNSAV 

5293 

A 

7 

351 

SSTSLIRKMQIKTTRYQNIPNRMAKIKNADNTKL/WSRILTNYWWEY 
RHLHK*KYVHKKTCTFMLIA7VLFIKFLKWKKFKHSSTGE* I IKL*DR 
HMTEYY * A I KKAQATDRMQHE 

5294 

A 

3 

1585 

GEPLI/LHHTTCH/NSKALIAFLAFLI I VTS I \ALLWLTNIYDLHR 
KRSCNVDEQQELVERYDESNLMNVQPIHADIMLETYHRRIA\DEGRL 
F\LAEFQSIPRVFSKFPIKEARKTH\NQNKNRYVDILPYDYNRVELS 
EINGDAGSNYI\NASYIDGFKEPRKYIAAQGPRDETVDDFWRMIWEQ 
KATVIVMVTRCEEGNI^NKCAEYWPSMEEGTRAFGECCCKDLTKHKRC 
P\DYIIQKLNIVNKKEKATGREVTHIQFTSWPDHGVPEDPHLLLKLR 
RRVNAFSNFFSGPIWHCSAGVGRTGTYIGIDAMLEGLEAENKVDVY 
GYWKLRRQRCLMVQVEVCSNL* \DYSHFGFLDLNLFQVI IMIFFKN 
PNS S QPQTVDDFLKTKTW * S PWPMLC I W 

5295 | A 

1 

1324 

MEDSRKAQKRDTKIEREEQLYWPSRKSRGKSKGDLVLHTTYKPSHSG 
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GRAVSGLFGSPIPSLGSWTAFLDPPWARVEPIALKGLTQRSPNGGGH 
RVCFVIPPAPPDGSEQKKRLCSFRRKEQSLPGNLENSFGSCPRLSRQ 
YVYESARTTALMRLGCPLKQIQLISQDLSPFEYRESLPMKDKQNKHQ 
RISLSINTIQETIISPNELSKSPGTNSGETEIHDLSDREFKIAVLRT 
LKE IQDNTEKEF\ KLYQINLTKR I E 1 1 KKNQAE I LELRNA IGIQKNA 
SQSFNSRNGREEEIISELENRLFENTQRRQNKRIKNNKACLQDLEYR 
LKRANLRVIGLKEEVEKAIGVESLFKGIITENFPSLEQDINIQVQEG 
YGTPSRFNPKKTTSRHLIVKLPKIRDKERILKVTRENKQVT*NGTHI 
CLAADFS WAL * ARREWH 

5296 

A 

1 

868 

PTPHRGAFGRLPEPGLVQPQLPVGPA/PGLGGPQRSTHPPDGPAPQG 
GPVPAAGAHPQTHLHHLRG PEAR PRRPGG PPAAPQRATW * PCCSGGP 
AP*LLPPIFGKKCSWRCRGKVQYRPVEGDPQTQLQDDKDPMLILRGP 
SP*GPGPGTSEVDPDP*RRTWVSGGLFLESHQAPP\PP\GGLAGREP 
EQRSGRLGSR/ PRETTVPRTDFYLPWCQRMNM*LPPQSGRI IGTAGA 
RAAASLLCSTGPQPLPAALGPCQQPWDPASSPGKRLWPQG\PPSARK 
* SPGKRLWPQGASQCQKIKSFWILKKKKK 

5297 

A 

3 

226 

GSGCEQGCPPGRYGPGCEQLCGCLNGGSCDAATGACRCPTGFLGTDC 
NLTCPQGRFGPNCTHVCGCGQGAACDPVTGTCLCPPGRAGVRCERGC 
PQNRFGVGCEHTCSCRNGGLCHASNGSCSCGLGWTGRHCELACPPGR 
YGAACHLECS CHNNS TCEPATGTCRCG PG FYGQACEHPC PPGFHGAG 
CQGLCRCQHGAPCDPISGRCLCPAGFHGHFCERGCEPGSFGEGCHQR 
CDCDGGAPCDPVTGLCLCPPGRSGATCNLDCRRGQFGPSCTLHCDCG 
GGADCDPVSGQCHCVDG YMGPTCRE\ VGPSGSPRTRP * PRGPAAAPL 
GSSGRTATSPVRRAASAPTAPTCVGVGRGRPATL 

5298 

A 

152 

990 

LLLIHSEEKPFEDHLYGKALSSNSHLHGPRRLCT/GENPEE/CEQL* 
KAFS AI SRDS CDS SH *GRKPVKVSEGFSDSHPLRHTGNHTMEQHSEM 
RKTLQQKCSHY * ANYS WFCS * LWES LECSIGF* GDLR INSGENAPF 
EHHQYWRAFWFSTFVKET/ W* S IHTTEK/ PWKF * EEEKAFSDASVLG 
K/HMELLPGCKTYEYINIGKFFSDSSLSCI/ WRELTWRNP*ECNQC* 

DA/FSLNSSLSG/HQFSLGTKKYNHCFECGISFIQCLICRLDCWLIN 
FFSLFFF 

5299 

A 

1 

181 

E I EDLN * T I TQLYLT * HL YRTLHRTPPAYKFFS S VHETFS RIDH \ Vh 
EHKTSLNEF * RiK 

5300 

A 

1 

170 

PPPPPFPLIPD*G*\PPSPLSPGQLQLSAPPP\PLPCPDITLSYPGR 
RPVLGLSLMD 


A 

84 6 

1094 

I PAL/ P * FYFI FVYLNI LKYLFFFFD/DGVLLCCPAWSAVA* SRLTA 

TFASQV\KRFSCLSLLSSWYYRSVP/ SSLANFCIFSRDG/ SFIVLAM 

LVI^S*PQVILLPWPPKVL*LPIMECSSLTIAHCSLNLPGTSNPPAS 
ASK 

5302 

A 

19 

391 j 

AYPLRGGRDFLRGGPSRSVSLLSWPARV*RRRRRLREEARGAMTSA 
S TKVGE I FS AAGAAFTKLGELTMQLHP VA0S S PAG AKWTETE I EMLR 
AAVKRFGDDLNHI SCVI KGPDKGHCETQG I 

5303 

A ; 

1 

338 

SLIFSDSLSSISGTNTLAAPLSWMIjPYIPWILQARPALGLLHIjLIjAA 
CNWGQSAT*PSPSLPLGLCSSSHSQ*GLF*PPVFKISPLITDNPNP 
FDLHLMD I F I QKT I KI SL 

5304 

A 

3 

180 

TVKRKVYEDSGIPLPAESPKKGPKKVASGVLSPPPAAPPPSSSSVPE 
AGGPPHKETEG 

5305 

A 

1 

518 

RAAQSPCSPWSRLACSGGGGVIREEARGAMTSASTKVGEIFSAAGAA 
FTKLGELTMQLHPVADSSPAGAKWTETEIEM/PEGCCEAIWGRS*SH 

VFVHLLQLAPP PKQLP VFPEGRGS P P I KETERLM 

5306 

A 

1 

705 

GAKFQRTRSVPRTTGPGQLKCLEDKVSRLETELDE^KNTVELLTDR\ 
LNRGRDQVDQLRTELMQERS / SSGRTWKCDKI SLERQNKDLKTRLAS 
SEG/SPESLVPASLSLSPRISCCRERLPAEQRVKTVLRSTNRKLERK 
VN * LS LQI EDERHHVSDT\ KDQLSLRVKALKRQV\ DEAKEE I ERLDG 
LR \ KKAQREVEEQHE VNEQLQARI KSLEKDS WRK/ GFPL S C * VS SQK 
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RRAELR 

5307 

A 

831 

1159 

DVKSKSFFFFLRWSL/NPFAQAGV*WHDLSSLQPLPHRFKQFSCLSL 
LSS*DYRRVPP*LANFLYF**RQGFTMLARLASNS*PHDPPASASQS 
AG I TGMSHRTQKKQI S 

5308 

A 

296 

527 

FNCCEEEKFKKKPCFVS FLEGLERG * VKCQEH I PDTH * * *LLGLSEQ 
INSDSRLRKELNHFILFLEQLTCPPHSRER 

5309 

A 

5424 

5957 

YNFFLNEMGSCSVAQAGVQWHNHSSLQP *TLG* SSYHPASAS * VAGT 
VCTCHHVQLVF/ IFFFLR* SFALVAQAGVQWCDIX3SLPPPPSGFERF 
SCLGLPSSWDYRRPPLHI»VNFFFLYF * W RWGFTMLARLVSNS * PHD\ 
HPASASQSAGITGMSHHVWLINFFFNCKSLIFSNVFRJJ 

5310 

A 

2 

237 

I KRF S YLS LPNSWDYKVPS SMPG * FLYF * LR * G FTKTVGQAGFE I P * P 
QNDSNPLSSQSAGITGMSH\HNRPPLFYFLK 

5311 

A 

36 

386 

LNIHRKFMLLYYH*LLIiFL\FHF*DRVSVCHPGWSaWNSQGFT\AA 
LTYW\ VK* SSCFNPPCVSWEPRGACHHAL\ ANFLI FCKNQVSLCLPR 
LVSNS\WVQAILPSW\PPRRAGIYGV 

5312 

A 

1 

310 

FFFFET/ ESHS VAQAGVQWGDLGLL * S S \ PP\ GSSDS PASAS * ASGT 
TGA/QHHAQLI F\ VFSVETGFHPVSQDGLDLLTS * SARLSLPKS WD Y 
RHEPLAPGLGFTF 

5313 

A 

598 

884 

LIFFLFSFFFFSFFFFFFFFFFFFFFFFFFFFFFFFFFLIiLLLLLLL 
LLLIiFFFFFFFFFFFFFFLRQSL/DSVAQAGVQWRHLGSIiQALPPRF 
AP 

5314 

A 

439 

1169 

QSGHRYLYNILIIQVLCFTIFLFLRQDLAVL/TQAGVQWC/DSWQPR 
PPGLK\ GTPTS AAQVAGTTGMCHH\ TR* I FFFFFFFFFFFFFFFFFF 
FFFFFFFLRQSL/DSVAQAGVQWRNLGSLQPLPPRFKRFSCLGLPSS 
WD YRNAPPHPANCCI F/M * RRGFTWLAKLVLNSGDS PAS ASQS AG 1 1 
GVSHHARPPRYFKKYIFCRDGGLTMLLGWS*TPGLKQSSHLGFPNFW 
DYRCEPPCLATLFIL 

5315 

A 

1 

222 

ETGSHFVTQAGV*WCNLGLIiQPLTPGLKQFSHLSPPSSWDHKCVPSC 
PANSFCTLGK*RRVSTMLPPGWS\QTP 

5316 

A 

1 

234 

LRQSFAGWPRLECNGTIPAYCNLHLLGSSNPPASSPR\RIiIFWLVE 
MRFLYVA*AGLELLSSSDLPPWHRKVLGLQA 

5317 

A 

1 

196 

FFLIN1jTLLIVTAQFIN*LDFIDSYSTICPKPPEYTFFSRA/PNGTF 
TT KDH ILGFKI SLNKYKS I 

5318 

A 

1 

477 

SCWSPV\WGHPPRDREPPPS*LGIQSSPGVPQAALPPWPSPASHQAS 
ATWAVLPLPPLQGPARBPPCPPHLRRHVQPTKTSAVSILSAPPIjPPP 
G*DLPEASFSQWACKGINKCLQGLGLVCRRTEHKDLGLGLFGPLVLA 
MVBVRGVDLTENNSGASVPRGR 

5319 

A 

139 

398 

FYFFYFYFLRQ5CSVNQAGVQ*/HCLNSLQPPPPGFKRFSCLSIiPSR 
WDYKCPPPCTPNFCVFS/EMGFLHVGQAGLELMTSGDPPGFG 

5320 

A 

11 

137 

RWGFTKLPRLVSNSWPQAILSPWLPKVIjaNHREALYFSLYYISKYTE 
GQFRNRLYPGTICQDYLWRYLLFLFIFETESCTVIRLECSGSLQPPP 
PRFKQFTCLRLHSSWDYR\RPP\PWPINFCISRVQTGFHHVGQG\GL 
DLVDLCX>PPASASQSAGITGREHPAPGTYLWEGFKKSFKSRA*RSTL 
FFPIiLH 

5321 

A 

1 

293 

LERMS FGHI QTKED I LQENT I F I ATEVQI QPNPD * TPEAKAFI \ RRC 
LSYRKEDRIDVHQLTCDPYLMPHMRMSVSTCSPAGAAIASTSGASSS 
SSSN 

5322 

A 

3 

291 

RKPFGHNQSQQDILQENTILKATEVQFPPKPWTPEAKAFIRRCLAY 
RKEDRIDVQQLACDPYLLPHIRKSVSTSSPAGAAIASTSGASNNSSS 
N 

5323 

A 

14 

943 

HRILEPLIFHFFFFFSSAPPQPPELLY/PASLTLPASPGREEVYPLP 
ARLPSLPRWEPLQVWLALGAVKLPLSPPWSGP\NGLGKPPGTPKRRG 
ALFCHMPG ILTP/ WMPGARWTKPPANP PST/TS PVPPKDG * KLQPGG 
LRGKPGPERGWNPPCP/RSPRPPYLLC*PQTLPRA*CFPIPKMSFTG 
HSPCRASLGHLE/VGHGIPLWPRVCQRRKSRAFFVLFFFKGACLFNV 


loam 
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NNRKP * YLFCNTE * YPNVHVLD VHNI F I TKQQEST * PWLPRGVS WLG 
AGGGYQGAWDREAPPPRP I PKLPPTLASRMRSCHI* 

5324 

A 

12 

486 

GPRPAWKRQNRQVPSTQDPLFPGYGPQCPVPLAGPPCLRPLFGGLG 
G YWRALQRGREGRTMTSRASELS PGRS VTAG 1 1 IVGDE ILKDWTYRD 
IWDFLRQLFVPYCILYDRGYTSLGSRENTVRNP7U.KCLSPGGHPTYR 
PAYLLENEEEERNSRT 

5325 

A 

51 

390 

DGVLPSLLGWEVMCGSSAPLLRSPVDRLDGMWGVCHP\GWSIVAQS\ 
QFTAASNLLASRAT/LSLPSSWDYRRVPPHSAHFKFFL*KWGLAMLP 
RLVLNSWSQVILLPQPPKVLEL 

5326 

A 

279 

380 

ASHPLQGLLCGPSLLDEHHPLLHGAQSHRPPKG 

5327 

A 

618 

818 

G I L AL VP AECDAD VP WTE I RRNLDKLQRGAP I PLA5 RS L YS RNPFLT 
S AGPDHWS ILHELSLPGGW 

5328 

A 

1 

197 

LHFYQQNYDNPSDFTFLTTLGIVNSFNNIHSNVYVVVRGDSVIiAVLT 
ALTRSQRLLCLGSHFGGT 

5329 

A 

2 

498 

CPRMTPKI)FQ*PVL/VVTNLLGHVIDIKWMSIAFINDRAWPSAFVSD 
ENHARIQCDGQHRFEVSVSELPDETDVSSTVLDLDHQGLGPPPPPAB 
PDPLQLLLQQLFGLGQLSCWGPGPWRGLSSARDKQNIMVS I IHKVK 
RPQS PLRAVYWPLRYC I LPSRRATVN 

5330 

A 

600 

811 

PIWLSLSPWHFINQRKEKKK**DIVKREKPLMGLSTLTSI*MQSKPR 
KEHQINLKSEIK*LQDLSQYFGR 

5331 

A 

1 

75 

FFFFFFFFFFKLAMFVFILFFIFSK*KEIVLSQFLFVFIj/ iylfyvf 
TSLKVALKISRDIKSTFLRVAIjFFFLNNSWEFCGSTAFPSVSMIjCMF 
*LLL*FFFFFFFFFFKLAMFVFILFFIFSK 

_> J -> 

A 

949 

1711 

VTTPTjTPDLKGSSfrVGPPKP*DCRHWTPPTjTT*TYFFPFKVFSPS 
AGVQWHDLSSLLTLSPDQV/HSPVSAF*VAGIQG /^lHHHAPvL/ IFFF 

fffflrqsl/ssvaqareqwcyhgslqpppsgfkrfsclslpsswdy 
rcppphla/dffvfliemgftmltrlvsns*pq/cdlpasasqtawi 
tglshraw\ lnffc i fs iygvs pcwpgwsltpdlk* s aclslp * cwd 
ykceppcpaqftfirkdsfadvycsvs 

5333 

C 

11 

154 

MESRSVARLECNGAISAHCNLHLLGSNDSPASASRVAGTTGTATTPG 
* 

5334 

C 

163 

270 

MNKMESL.FYCFNADILIKQDYFHHHQHLYIYTKPV* 

5335 

A 

92 

404 

WLQICVSGCICT*\^1ALYICIN\VVWKVVLFYEDVCIKAVK*AFHF 
IHNlJ^CVYICMCVCVWIEGLYIPPQRYIH^VLIxHRSNSQTPGSIR 
YKFKISFIGFI 

5336 

A 

2 

1100 

FAIxAAR LtOE FG TRKY FTES GT. VGL. T. F O AJXA S F SM AGMYEAVNEVYKV 
LIPIHEANRDAKKLSTIHGKLQEAFSKIVHQSTGWERMFGTYFRVGF 
YGTKFGDLDEQEFVYKEPAITKIxAEISHRLEGFYGERFGEDVVEVIK 
DSNPVDKCKLDPNKAYIQITYVEPYFDTYEMKDRITYFRQKLQSSVR 
FMYCTPFTLDGRAHGGTSMEQFQKGRPI\LDYVSCLFPYIKTRG/PM 
SLIKKRSSLHPF1^\AIEDMQKKTQEIxAFATHQDPADPK^QMVL^ 
\ S VGH\QVNQGPLGSCPGFFCSGNT* * PKALSD IH* \ KLATLAFKDF 
TKRCEDALRKNKSLIGPDQKEYQRELGEKljSSAIiKRALPAT 

5337 

A 

48 

1057 

PCWENCTSTCKTMKSDP YIjR / PYTTTN^KW T KDI\NVR PET TRENT GE 

Jv\* rf XjXV X«. X w X V-* XV X I * 1W X/ X X XJXV / f X X X J» Xv O XV. w% X 1VL/ XXLV V XV X C* X X DDli XVJCf 

kfld \ mtlgnd fld i tpkaqstktk/dkwd y i kpks f/ ctvketvnk 

mkq* ptd\w*kifvrhiygkglvski ykefiqlnf* kwakdlkrhfs 

eddtq>xal^yikgcst*liikemhfknhskyylspvrt j av 

nkc/w*wcgikgtl/lhctwecrlvqplwktewkpl\rxii<lpydpa 

ipllgiysk*mksppckyics/rvfiaalfiva/k/twrqlkcplt/ 

v*irklcix^pytpmeyypplktivx<nptischllptwmgl 

isqtekkilhnftiyveskkki 

5338 

A 

2 

409 

PLAIxAIxAMAYPSCPGDAHGPPRPGPLSPPS * PASVP/TS P/PQSSA* 
FLCPQS YPVEGLTPR I LPAGGGHVPCSRPS S S PGLAS VS^PRWGALL 
LPGPCPTSLSSGCRKGRASPRAHPPPTASLCTALWPPCFSPP*PGVG 
LGNGLSLLPRRCTWPAAARSOVSSLLACLCPNSPI^LLRDSSVI»SHT 
QLRGSPHGFYPLEGAMFPAPALPPHQALPRYQPPGGELCCCLDHALIi 
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PSPRDVGKGGPALELTHPPQQACVXrRSGHLASRLHNL 

5339 

A 

32 

323 

KFHLGGGGWFPPLFPGPGGARGVGS*VRGFRPPWAPPLNPPFPFKPQ 
NS PGIiWFAFRP SSSGG/ PGPGNS PHPGGQGFG * QKFRPCPS AWGTK 
GGFP 

5340 

A 

2 

570 

LKILSKMKIHQGGYNSDENACVHQHRVRALTGSMCKSKPGSPAPGGR 
♦NLPSREPGAPAGI GGSFLASAAVS PS PPS PH*CPLE * EPVAEAEGL 
LGELSKVWKEGPPERAPG* P\ PRGPPGLHQLHASQMLFIGRPG\GPG 
y lb PAPGCGGLiEGCLSRHCAIAGAGERI SETTFGSNALYHRSPNRLG 
QGL 

5341 

A 

4 

329 

I IAHCKLKRPSSNDPPQPPSSWDCRHAPPHPAN* FFF * EMGSCHVAQ 
TSSQTPALNPSRYPPTSASQGAEITVMSHCAQLYPCFLTKEK/FSHS 
* IRRTLLSETSNGEY 

5342 

A 

3 

274 

FQSQNIKLNLMGVLFCFETEAHSVT*AGLQWLDLSSLQPPPPGFKQ/ 
FSCLSLLSSWDYRSPPRGPVNFCIFSGDEVSPYWSGWSQTSDLK 

5343 

A 

1 

1075 

MVDGTLLLLLSEALALTQTWA\GSHSLKYFHTSVSRPGRGEPRFISV 
G YVDDTQFVRFDNDAAS PRMVPRAP\ WMEQEGSEYWDRETRS \ ARDT 
AQIFRVNLRTLARL/Y*QSERRPGSHTL\QWMHG\CELGPD\NASFR 
GYEQFAYDGKDYLTLNEDLRS WTAVDTAAQ I SEQKSNDASEADHQRP 
\ YLQATFVDSFPK* L YEGKETFLHLEPPKTHVIiHHP I S \ DHEATLRC 
WALGF\YPAEITLTWQQDGEGHTQDTELVETRPAGDGTFQ\KWAAW 
VPSGEEQRYTCHVQHEGIiPEPVT\LRWEARFPSPPI/LPSWASLLGL 
v JbJjvj bvvb LAV \ V AAV I WRKKS S GH FL PTGG KGG S Y F KG 

5344 

A 

58 

426 

KLQLL*FFFWFF\FCLFTYFLETVECSGAIIAHCSLKLLGSSHPPAS 
ASLVAKITGACHPAWL I * VVL I VL FKRKWQSHLKLMVEKRFIjS EGQ I 
LS VYHFRVPLLQNDRNLAQI ILGRKREMY 

5345 


3 

O *± 

JUvlijKi^lLRKSSV*I*D*SSQPEN\IP*KQFLSJra 

msimkkggrise\dfiklffpslllshllviewglyieiyltppnnt 
f 

5346 


2 


lijPCILibTSSTPWSASSQEREMSFGDG*PVG*PPLTAVP\CCSVWGL 
RRL*GERTKLGIGAAGGFHSYDGNTQT 

5347 

A 

3 

104 

WSTVARS*LTATSCSFCLSPPSSWDCRRVPPCLA/SFHFQYIALNN* 
A*AIILPQPSKVtiGL*ASALQVAGTAGVYHHAW 

c ~a 4 a 

J J T O 

TV 

A 


TGKNEIYCICFIDLEYQ* KIGLLLHIGDKEEYS / WNSGYFIGYCLEL 
PCP WKVSGN * ND S I EAAVliMTTPSKI KI * VTLQS KEAS P I KVLTQV 
1<jL>1 KG I JjENGHYKYQIj * QFFKLQKGNDNH 

5349 

A 

2 

67 

AAED*HCPIASETYKTIT\ELWVTI>PVEGKSVPFLINTEATHSTLPS 
FQGCV S LAS I TWG IDGQAS KPLKTPQLWCQLGQHS FMHS FLV I RTF 
PAPLLGRDILTNLSASLTIPGLEPHLIAAFCSSSKPPSHPPLVSLHL 
KPQV*DRLGNLQDHHRCFG 

5350 

A 

2 

275 

ETES CS VA\ RLECSGM I SAHCNLCIiPDTGNS PASAS * VAG I IGACHH 
AQLIFVFLVETEFHHVFHHVGQASLEFL/NLVIPPPRPPKALGLQG 

5351 

A 

3 

1031 

SSAPEAAKKPTPCHRCRGLVDKFNQGMVDTAKKNFGGGNTAWEEKTL 
SKYESSEIRIiLEILEGLCESSDFECNQMLEAQEEHLEAWWLQLKSEY 
PDLFEWFCVKTLKVCCV\ PGT\ YGPDCLACQGGSQRP\ CSGNGHCS \ 
GDGSRQGDGSCRCHMGYQGPLC\ IDCMDRLLQLRSRNETHS I \ CTA\ 
CDESCK\ TCFGP * PT\ GLRRSCEVG\ WVLDEGACVDVE KCAAQT \ PP 
CS AAQFCKNANGS YTCE\ EGDSS WVGCPG\ KGPGNCKQC ISGYAREH 
GQ \ CAD VERVPH * PEKTLCEEKTKTC YNTPG \ S Y \ VCVC PDG FEET / 
RRCLCAAGRRLKPQKGESPTQLPLP 

5352 

A 

206 

330 

KMLQLALQSCIKVRTLNSRHIjGAGVTAHEE*CPCCHPMIKO ' 

5353 

A 

1 

8271 

GEFGSEKKTALHEKEETI^LQSAQAQPFHQEEKESLSLQLQKKNHQV 
QQLKDQVLSLSHEIEECRSELEVLQQRRERENREGANLLSMLKT^DVN 
LSHSERGALQDALRRLIiGLFGETIiRAAVTLRSRIGERVGLCLDDAGA 
GLALSTAPALEETWSDVALPELDRTLSECAEMSSVAEISSHMRESFL 
MSPESVRECEQPIRRVFQSIiSLAVDGLMEMALDSSRQLEEARQIHSR 
FEKBFSFKNEETAQVTOKHQELLECLKEESAAKAELALELHKTQGTL 


IOVIU 
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EGFKVETADLKEVLAGKEDSEHRLVLELESLRRQLQQAAQBQAALRE 

ECTRLWSRGEATATDAEAREAGFFRVFSGTSIiTSCIALSIFALAVLL 

HLGVRCRCFSDAFSACPLLSGGSLMCLRAQPAGAGPHPAGGDHLVFH 

HLEPLDFSQSLRLSGLRADKGVCGLHSVERPALRKEVEDIiTKEQSET 

RKQAEKDRSALLSQMKILESELEEQLSQHRGCAKQAEAVTALEQQVA 

SLDKHLRNQRQFMDVR ILNNTFFC I TKSCRYSMNFFPDR YEQAAERE 

HEREEFQQEIQRLEGQLRQAAKPQPWGPRDSQVELLQQKLREKLDEF 

NELAIQKESADRQVLMQEEEIKRLEEMNINIRKKVAQLQEEVEKQKN 

I VKGLEQGGPLS ITCTWSG WPLAGHCTHFSDSCSRAAS PLLPSLSE 

RLLSLVDARDKEVLKKQQMSSLLLASTLQSTLDAGRCPEPPSGSPPE 

GPEIQLEVTQRALLRRESEVLDLKEQLEKMKGDLESKNEEILltLNLK 

LDMQNSQTAVSLRELEEENTSLKVIYTRSSEIEELKATIENLOENQK 

RLQKEKAEEIEQLHEVIEKLQHELSLMGPWHEVSDSQAGSLQSELL 

CSQAGGPRGQALQGELEAALEAKEALSRLLADQERRHSQALEAIiQQR 

LQGAEEAAELQLAELERNVALREAE VEDMAS R IQEFEAALKAKEATI 

AERNLEIDALNQRKAAHSAELEAVLIJUjARIRRALEQQPLAAGAAPP 

ELQWLRAQCARLSRQLQVLHQRFLRCQVELDRRQARRATAHTRVPGA 

HPQPRMDGGAKAQVTGDVEASHDAALEPWPDPQGDLQPVLVTLKDA 

PLCKQEGVMSVLTVCQRQLQSELLLVKNEMRLSLEDGGKGKEKVIiED 

CQLPKVDLVAQVKQLQEKLNRIjL YSMTFQNVDAADTKS LWPMASAHL 

LESSWSDDSCDGEEPDISPHIDTCDANTATGGVTDVIKNQAIDACDA 

NTTPGGVTDVIKNWDSLIPDEMPDSPIQEKSECQDMSLSSPTSVLGG 

SRHQSHTAEAGPRKSPVGMLDLSSWSSPEVLRKDWTLEPWPSLPVTP 

HSGAIiSLCSADTSLGDRADTSLPQTQGPGLLCSPGVSAAALALQWAE 

SPPADDHHVQRTAVEKDVEDFITTSFDSQETLSSPPPGLEGKADRSE 

KSDGSGFGARLSPGSGGPEAQTAGPVTPASISGRFQPLPEAMKEKEV 

RPKHVKFSRSFYGKTIDK2^IiAKQALLQ^TVRDESHQILAl^SEGIlAPPS 

GEPHPPRKEDEIQDISLHGGKTQQALWCERQPGWHSGGSLVLHAAP 

GDQREVPTACPDWRGDLLQWQEAFEKEQEMQGVELQPRLSGSDLGG 

HSSLLERLEKIIREQGDLQEKSLEHLRLPDRSSLLSEIQALRAQUW 

THLQNQEKLQKLRTALTSAEARGSQQEHQLRRQVLDSVKGNGLAVQP 

TALVPAPMPFVLSGAGGRRCAHTUjELLAYKVEQEKCIAGDLQKTLSE 

EQEKANS VQKLLAAEQTWRDLKSDLCESRQKS EQLSRS LCEVQQEV 

LQLRSMLSSKENELKTUUiQEIiESEQGKGRALQSQLEEEQIiRHIiQRES 

QSAKALEELRASLETQRAQSSRLCVALKHEQTAKDNLQKELRIEHSR 

CEALLAQERSQLSELQKDLAAEKSRTLELSEALRHERIiLTEQIiSQRT 

QEACVHQDTQAHHALLQKLKEEKSRVVDLQAMLEKVQQQALHSQQQL 

EAEAQKHCEAIiRREKEVSATLKSTVEALHTQKRELRCSLEREREKPA 

WLQAELEQSHPRLKEQEGRKAARRSAEARQSPAAAEQWRKWQRDKEK 

LRELELQRQRDLHKIKQLQQTVRDLESKDEVPGSRLHLGSARRAAGS 

DADHLREQQRELEAMRQRIiLSAARLLTSFTSQAVDRTVNDWTSSNEK 

AVMSLLHTLEELKSDLSRPTSSQKKMAAELQFQFVDVLIjKDNVSLTK 

ALSTVTQEKLELSRAVSKLEKLLKHHLQKGCSPSRSERSAWKPDETA 

PQSSLRRPDPGRLPPT^ASEEAHTSNVKMEKLYLHYLRAESFRKALIY 

QKKYLLLLIGGFQDSEQETLSMIAHIiGVFPSKAERKITSRPFTRFRT 

AVRWIAI LRLRFLVKKWQEVDRKGALAQGKAPRPGGTPAAVDRWTH 

KPFFLLLFF IHFGS FFS VDGDVI KHVYI CGFTGRSLGPVDRRRKNTG 

LFYQRIHTALNQLPRPSSIAHGSMYILTLGYFFHKKVDDQELCLKFC 

LLRGYAL I T I FQEW FLQKQTLRL S LG YEDVDMVSTPVKGRGRKQDWA 

EEEGYAPRAATKQPARLKPATLKQAQRRACRERRSPASPNGSPERHR 

KVb bAN GS EALiVXiJjGGAGRAARARD PLRTRARYPPFPPRTRSRGRPG 

AQERAGQRRRGGAIRARARAASARTLAGLAMADRGCPLEAAPLPAEV 

RESLAELELELSEGWKKKLNSDKDAVFNRNW 


5354 

A 

2 

10028 

LCVSPVTAGRPARSLREMEVEQEQRRRKVEAGRTKLAHFRQRKTKGD 
S SHS E KKTAKRKG S AYDAS VQEES PVTKED S ALCGGGD I CKSTS CDD 
TPDGAGGAFAAQPEDCDGEKREDLEQLQQKQVNDHPPEQCGMFTVSD 
HPPEQHGMFTVGDHPPEQRGMFTVSDHPPEQHGMFTVSDHPPEQRGM 
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FTISDHQPEQRGMFTVSDHTPEQRGIFTISDHPAEQRGMFTKECEQE 

CELAITDLESGREDEAGLHQSQAVHGLELEALRLSLSNMHTAQLELT 

QANLQKEKETALTELREMLNSRRAQELALIiQSRQQHELELLREQHAR 

EKEEWLRCGQEAAELKEKLQSEMEKNAQIVKTLKEDWESEKDLCLE 

NLRKELSAKHQSEMEDLQNQFQKELAEQRAELEKIFQDKNQAERALR 

NLESHHQAAIEKLREDLQSEHGRCLEDLEFKFKESEKEKQLELENLQ 

ASYEDLKAQSQEEIRRLWSQLDSARTSRQELSELHEQLLARTSRVED 

LEQLKQRE KTQHE S ELEQLR I Y FE KKLRDAE KTYQEDLTLLQQRLQG 

AREDALLDSVEVGLSCVGLEEKPEKGRKDHVDELEPERHKESLPRFQ 

AELEESHRHQLEALESPLCIQHEGHVSDRCCVETSALGHEWRIiEPSE 

GHSQELPWVHiQGVQDGDLEADTERAARVLGLETEHKVQLSLLQTEL 

KEEIELLKIENRNLFEKLQHETRLKDDLEKVKHNLIEDHQKELNNAK 

QKTELMKQEFQRKETDWKVMKEELQREAEEKLTLMLLELREKAESEK 

QT 1 I NKLELREAEMRQLQDQQAAQ I LDLERS LTEQQARIiQQLEQDLT 

SDDALHCSQCGREPPTAQDGELAALHVKEDCALQLMLARSRFLEERK 

EIAEKFSAEQDAFLQLLQERHQQQLLSVTAELEARHQAALGELTASIj 

ESKQGALLAARVAELQTKHAADLGALETRHLSTLDSLESCYLSEFQT 

I REEHRQALELLRADFEEQLWKKDSLHQT I LTQELEKLKRKHKGELQ 

SVRDHLRTEASTELAGTVAHELQGVHQGEFGSEKKTALHEKEETIjRIj 

QSAQAQPFHQEEKESLSLQLQKKNHQVQQLKDQVLSLSHEIEECRSE 

LEVLQQRRERENREGANLLSMLKADVNLSHSERGALQDALRRLLGLF 

GETLRAAVTLRSRIGERVGLCLDDAGAGLALSTAPALEETWSDVALP 

ELDRTLSECAEMSSVAEISSHMRESFLMSPESVRECEQPIRRVFQSL 

SLAVDGLMEMALDSSRQLEEARQLHSRFEKEFSFKNEETAQVVRKHQ 

ELLECLTEESAAKAELAIjEIjHKTQGTLEGFKVETADLKEVIiAGKEDS 

EHRLVLELESLRRQLQQAAQEQAALREECTRLWSRGEATATDAEARE 

AALRKEVEDLTKEQSETRKQAEKDRSALLSQMKILESELEEQLSQHR 

GCAKQAEAVTALEQQVASLDKHLRNQRQFMDEQAAEREHEREEFQQE 

IQRLEGQLRQAAKPQPWGPRDSQQAPLDGEVELLQHKLREKLDEFNE 

LAIQKESADRQVXiMQEEEIKRLEEMNINIRKKVAQLQEEVEKQKNIV 

KGLEQDKEVLKKQQMSSLLLASTLQSTLDAGRCPEPPSGSPPEGPEI 

QLEVTQRALLRRESEVLDLKEQLEKMKGDLESKNEEILHLNLKLDMQ 

N S QTAVS LRELE EENTS LKV I YTRS S E I EELKAT I ENLQENQKRLrQK 

EKAEEIEQLHEVIEKLQHELSLMGPWHEVSDSQAGSLQSELLCSQA 

GGPRGQALQGELEAAIiEAKEALSRLLADQERRHSQALEALQQRLQGA 

EEAAELQLAELERNVALREAEVEDMASRIQEFEAALKAKEATXAERN 

LEIDAI^QRKAAHSAELEAVLLALARIRRAIiEQQPLAAGAAPPELQW 

LRAQCARLSRQLQVLHQRFLRCQVELDRRQARRATAHTRVPGAHPQP 

RMDGGAKAQVTGDVEASHDAALEPVVPDPQGDLQPVl.VTLKDAPLCK 

QEGVMSVLTVCQRQLQSELLLVKNEMRLSLEDGGKGKEKVLEDCQLP 

KVDLVAQVKQLQEKLNRLLYSMTFQNVDAADTKSLWPMASAHLI,ESS 

WSDDSOTGEEPDISPHIDTODANTATGGVTDVIKNQAIDACDANTTP 

GGVTDVIKNVTOSLIPDEMPDSPIQEKSECQDRSLSSPTSVLGGSRHQ 

SHTAEAGPRKS PVGMLDLS S WS S PEVIjRKDWTLE PWPSLPVTPHSGA 

LSLCSADTSIiGDRADTSLPQTQGPGLLCSPGVSAAALALQWAESPPA 

DDHHVQRTAVEKDVEDFITTSFDSQETLSSPPPGLEGKADRSEKSDG 

SGFGARLSPGSGGPEAQTAGPVTPASISGRFQPLPEAMKEKEVRPKH 

VKALLQMVRDESHQILAIjSEGLAPPSGEPHPPRKEDEIQDISLHGGK 

TQEVPTACPDWRGDLLQWQEAFEKEQEMQGVELQPRLSGSDLGGHS 

SLLERLEKIIREQGDLQEKSLEHLRLPDRSSLLSEIQAIiRAQLRMTH 

LQNQEKLQHLRTAIiTSTEARGSQQEHQLRRQVELLAYKVEQEKCIAG 

DLQKTLSEEQEKANSVQKLLAAEQTWRJDLKSDLCESRQKSEQLSRS 

LCEVQQEVLQLRSMLSSKENELKAALQELESEQGKGRALQSQLEEEQ 

LRHLQRESQSAKALEELRASLETQRAQSSRLCVALKHEQTAKDNLQK 

ELRIEHSRCEALLAQERSQLSELQKDLAAEKSRTLELSEALRHERLL 

TEQLSQRTQEACVHQDTQAHHALLQKLKEEKSRVVDLQAMLEKVQQQ 

ALHSQQQLEAEAQKHCEALRREKEVSATLKSTVEALHTQKRELRCSL 
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EREREKPAWLQAELEQSHPRLKEQEGRKAARRSAEARQSPAAAEQWR 
KWQRPKEKLRELELQRQRDLHKIKQI^QTVIUDLESKDEVPGSRLHLG 
SARRAAGSDAJDHLREQQRELEAMRQRLLSAARLLTSFTSQAVDRTVN 

Lin lboN&AAVnoljJjJl 1 X^CiiLJjJVoJL/lioKir 1 ooUiUU v lAA£*LUr y* VJJVIjLi 

KDNVSLTKALSTVTQEKLELSRAVSKLEKLLKHHIiQKGCSPSRSERS 
AWKPDETAPQSSLRRPDPGRLPPAASEEAHTSNVKMEKLSLHYLRAE 
SFRKALIYQKKYLIitiLIGGFQDSEQETLSMlAHLGVFPSKAERKITS 
RPFTRFRTAVRWIAILRLRFLVKKWQEVDRKGALAQGKAPRPGPRA 
RQPQS PPRTRES PPTRDVPSGHTRDPARGRRLAAAAS PHSGGRATPS 
PNSRLERSLTASQDPGHSIiTEYIHHIiEVIQQRLGGVLPDSTSKKSCR 
PMIKQ 

5355 

A 

2 

5217 

APLDGEVELLQQKLREKLDEFNELAI QKES ADRQVLMQEEE I KRLEE 
MNIN I RKKVAQLQEEVEKQKN I VKGIiEQDKEVLKKQQMS SLLLASTIi 
QSTLDAGRCPEPPSGSPPEGPEIQLEVTQRALLRRESEVLDLKEQLE 
KMKGDLESKNEEILHLNLKLDMQNSQTAVSLRELEEENTSLKVIYTR 
S S E I EELKATI ENLQENQKRLQKEKAEE I EQLHEV IEKLQHELSLMG 
PWHEVSDSQAGSLQSELLCSQAGGPRGQALQGELEAALEAKEALSR 
LLADQERRHSQALEALQQRLQGAEEAAELQLAELERNVALREAEVED 
MASRIQEFEAALKAKEATIAERNIiEIDALNQRKAAHSAELEAVLLAL 
AR I RRALEQQPLAAGAAPPELQWLRAQCARL SRQLQVLHQRFLRCQV 
ELDRRQARRATAHTRVPGAHPQPRMDGGAKAQVTGDVEASHDAALEP 
WPDPQGDLQPVTiVTLKDAPLCKQEGVMSVIiTVCQRQLQSELLLVKN 
EMRLSLEDGGKGKEKVLEDCQLPKVDLVAQVKQLQEKLNRLLYSMTF 
QNVDAADTKS LWPMAS AHLLES S WSDDS CDGEEPD I S PH IDTCDANT 
ATGGVTDVIKNQAIDACDANTTPGGVTDVIKNWDS1.IPDEMPDSPIQ 
EKSECQDMS LSS PTS VLGGSRHQSHTAEAG PRKS PVGMLDLS S WS S P 
EVLRKDWTLEPWPSLPVTPHSGAIiSLCSADTSLGDRADTSLPQTQGP 
GLLCSPGVSAAALTOiQWAESPPADDHHVQRTAVEKDVEDFITTSFDS 
QETLSSPPPGL.EGKADRSEKSDGSGFGARLSPGSGGPEAQTAGPVTP 
AS I SGRFQPLPEAMKEKEVRPKHVKALLQMVRDESHQI LALS EGLAP 
PSGEPHPPRKEDEIQDISLHGGKTQEVPTACPDWRGDLIjQVVQEAFE 
KEQEMQGVELQPRIiSGSDLGGHSSLLERLEKIIREQGDLQEKSLEHL 
RLPDRSSLLSB IQALRAQX LRMTHLLN * EKLQHLRTGLR S AEARGSH 
HEHQLRRQVELVAYKVEQEKCIAGDLQKTLSEEQEKANSVQKLLAAE 
QT\AOU3LKSDLCESRQKSEQLSRSLCEVQQEVLQLRSMLSSKENELK 
AALQELESEQGKGRAliQSQl»EEEQLRHLQRESQS7UCAIiEELRASLET 
QRAQS SRLrCVALKHEQTAKDNLQKELR I EHS RCEALLAQERS QLS EL 
QKDLAAEKSRTLELSEALRHERLLTEQLSQRTQEACVHQDTQAHHAL 
LQKLKEEKSRWDLQAMLEKVQQQALHSQQQLEAEAQKHCEAIiRREK 
ERELELQRQRDLHKI KQLQQTVRDLE S KDE VPGS RLHLG S ARRAAGS 
DADHLREQQRELEAMRQRLLSAARLLTSFTSQAVDRTVNDWTSSNEK 

3XVM c ^T.T.HTT.T?PT."PC;riT,c;'RP , T , Q<3r>TrVMli2iT?T r^"C > 0"cn7T~»T7T T VTYKR/CT TW 
nouun X I tCtCf*JCtS9UXjOI^.lr X O o ^rvJvrl.MyA ii> JlJ^-T yf V JL/ Vl_ii_i XSJL/iM VoJjI A 

ALS TVTOEKLELSRAVSKIjEKLLKHHLOKGCS P <5 R ^ RP S AWICPDKTA 
PQS SLRRPDPGRLPPAASEEAHTSNVKMEKLYLHYLRAES frkali y 
QKKYLLLLIGGFQDSEQE1XSMIAHLGVFPSKAERKITSRPFTRFRT 
AVRWIAILRLRFLVKKWQEVDRKGALAQGKAPRPGPRARQPQSPPR 
TRES PP\ TRDVPSGHTRDPARR\ RRLA\ AAAS P \ HSGGRATPS PNSR 
LERSLTA\SQDPEHSLTEYIHHLERVPPKNCEGAYPHSTSKKSCHPM 
IKQ 

5356 

B 

224 

326 

XSSSDGNLRPEAITAI I WFSLLAALLI1AVGI1ALL* 

5357 

A 

87 

558 

STFFPAIKHWVDLTAQRGHCLSPRSHSNEGRQMQERGPRWAGSPAPS 
PWSPEGPGPRLHGRTERKERGLWGLLDTQRKVGSASRGVGPEPQHT* 
AAGTAKQRQHAAQSPPHGCPQPLPTSPPEPSEAGVSVAAAVYKPATR 
SPGQCWWLQQAPAFR 

5358 

A 

3 

713 

TDRGCRGRDTFHEPELG/PKTPGPGAPLQVPLAGDAVGGASARSPFL 
GAGNPPPI PWRTP\ AGAASGAGPAAPAGPLGPSLG/ RKYRPLLRMRI 
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ALFCLHPPAPAPMATCVHKPSLIiSSRSSTS/SAALLLAVGAGTVGAE 
/RFGRSGRRRAPTGPVARSSSPMQPRPGPLRTPRRRCRAACPSRSPIi 
LHL S PF I AV * PWGKAVPS LG S Q IHPVLNS REEGTS KTLPLRSREDGA 
IHFYIFI 

5359 

A 

2 

807 

RPQGFPAWRLPSRRGATPGRLPRS*PRPHPRPEPGVRVGPWVIIESV 
R/HQDCSGRSPGQTPTCEDRAPSGKGSALGRGSLGPWTLIGSGRG/R 
SPGKLGPSPGPAKAF*RRR\RASKVAPGPLDGWPRIAQAWARKGKHL 
AWGTGGSWPPSHAPGHQLGFQLKQPLGNQREWGQNLGIQDL*GQVDP 
N\RPPPLSGPGDRHLTGKPSGREGAIiSQGPQGRHSPGRPQQQASSSC 
CERSHPGLPPVTTHCPGVQWASCPGWGCIxARCSPRPA 

5360 

A 

1 

412 

FFFFLRWSL/NSV/SQAGVQWWDLSLLQPLLPGFKQFSCLSLLSSWD 
YGCPPRPANFLHF* *RQGFTMLFRLVSNS * PHDSPISASQSAG\ I IG 
VSHRARP SCLLLAY FS EFVS KWKQS STKALPCGFRMLCPVLTPVEK 

5361 

A 

3 

886 

YLS IHQVYGKEACI PCGPGSKNNQDHSVCYSDCFFYHEKENQSLHYD 
FSNLSSVGSLMNGPSFTSKGTKYFHFFNISLCGHEGKKMALCTNNIT 
DF\NVKEIVAGSDDYTNLVGAFVCQSTIIPSES/TGVSEQPYHHNPI 
ILADTFIGVTVETTLKNINIKEDMFPVPTSQIPDVHFFYKSSTATTS 
C I NGRS TAVKMRCNPTKS GAGVI S VP I KCPAGTCDGCTFYFLWES AE 
ACPLCTEHDFHEIEGACKRGFQETLYVWNEPKWCIKGISLPEKKIiAT 
CETVD FWIiKVGAGV 

5362 

A 

3 

2697 

CSVKECLQASAKLPSAPASVSLLCLLVAKVWRRLRWQEAAFSCASGE 
YLEMKNQVCSKCGEGTYSLGSGIKFDEWDELPAGFSNIATFMDTWG 
PSDSRPDGCNNSSWIPRGNYIESNRDDCTVSLIYAVHLKKSGYVFFE 
YQYVDNNIFFEFFIQNDQCQEMDTTTDKWVXLTDNGEWGSHSVMLKS 
GTNIL YWRTTG I LMGSKAVKPVLVKNITI EGVAYTSECFPCKPGTFS 
NKPGSFNCQSCKIIDPFLKRFFYVAEEGSSECTERPPCTTKDYFQIH 
TPCDEEGKTQIMYKWIEPKICREDLTDAIRLPPSGEKKDCPPCNPGF 
YNNG S S S CHPCP PGTFSDGTKECRPCP AGTE PAIiGFE YKWWWVLPGN 
MKTSCFNVGNSKClJGMNGWEVAGDHIQSGAGGSDNDYLIIiNLHIPGF 
KPPTSMTGATGSELGRITFVFETLCSADCVLYFMVDINRKSTNWES 
WGGTKEKQAYTHI IFKNATFTFTW\ AFPEN* FRVQDNRRF INDMVKI 
YSITATNAVDGVASSCRACAIiGSEQSGSSCVPCPPGHYIEKETNQCK 
EC PPD TYTj S T HO VYG KEAP T P C*G PG S KWNOCJ KTCMALPTTJN'TTDFTVK 
E I VAGSDD YTNLVGAFVCQS TI IPSESKG FRAALS SQS 1 1 LADTF IG 
VTVETTLKNINIKEDMFPVPTSQIPDVHFFYKTIYRQIIDIKCPAGT 
CDGCTFYFLWESAEACPLCTEHDFHEIEGACKRGFQETLYVWNEPKW 
CIKGISLPEKKLATCETVDFWLKVGAGVGAFTAVLLVALTCYFWKKN 
QKLEYKYSKLVMTTNSKECELPAADSCAIMEGEDNEEEVVYSNKQSL 
LGKLKSLATKTAI GVFGGPWSLGYVI SCQ YSETRS SCI I SVHAKNEI 
LAIEEE 

5363 

A 

1 

1146 

NFDSKRRITVRNLRIREDIAKYLRNLDPNSAHYDPKTRAMRENPYAN 
agknpdevs yagdnfvr ytgdt i smaoto \ iis vs sm £ h p i s ahktf 
GIV*SLFSRSIQ*CSCQLTKMFRITLVFAWEA\Y1DKGSE\VHIj\QAD 
P \ TKLELL YKS \ FKVKK\ EDFKEQQKES I LEKYGGQEHLDAPPAELL 
IxA\ QTED YVEYSRHGTVI KGQERAVACSKYEEDVKIHNHTH I WGS YW 
KEGRRGN\ KCCHS FFKYS YCTGEAGKEI VNSEEC I INEITGEESVKK 
PQTLMElxHQEKLKEEKKKK3CKKKKK^ 

NA* E A\ RIiLHVKET\MQI \ DERKRP\ YNSMY* TSRP\ I EEEMEAYRM 
K\RQRPDDPMASFLGQ 

5364 

A 

3 

1589 

APALKVILRYT*TDAPAIWWLDPELTYGCAKPFVFTQGHSVCNRSFF 
PCFl^PAAKCTYSAVVKAPSGVQElJ«SAraSAYMEEEGVFHFHMEHP 
VPAYLVALVAGDLKPGDIGPTSRVWAEPCLLPTATSKLSGAVEQWLS 
AAERL YGP YMWGR YD I VFLPPS FP I VAMENPCLTF IISSI LESDE FL 
VI DVIHEVAH SWFGNAVTNATWEEMWLSEGLATYAQRR I TTETYGAA 
FTCLE TAFRI1DALHRQMKLL1GEDS PVS KLQVKLE PGVNP S HLRNLFT 
YEKGYCFVYYLSQLCGDPQRFDDFLRAYVGE\YKFTSWAQDLLDSF 
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LSFFPELKEQSVDCRAGLEFERWIiNATGPPtiAEPDLSQGSSLTRPVB 
GLFQLWTAE PLDQAAAS AS A I D I S KWRTFQTA1* FL \ DRLLDGS PLPQ 
EWMSLS KC YS SLLDSMNAB IR IRWLQI WRNDYYSI.T / FHRVRRFP 

GRARCHACYTIPLYEDLCTGALKSFAIiEVFYQTQGRLHPNLRRAIQQ 
ilsqglgfqhrarp 

5365 

A 

1 

418 

FMGGVNRADENIDKYRAS IRGKKWYSS PLLFCFELVLQNAWQLHKTY 
DEKPVDFLEFRRRWCHYLETHGHPPEPGQKGRPQKRNIDSRYEGP* 

1 QVDS PNQGK\ QTRC\ AECHKNTTF\RCEKCDVAL\ HVKCS \ VEYHT 

E. 

5366 

A 

1 

205 

KKFSCGLKKIFHEKNCQST*QISLLSYFKKLQQPPQIiSSTTL/SQQP 

5367 

A 

1 

389 

CRFNAI S S KITADFTVFGGFLLEVHKVI LKCKWKY Y\ KSR I VKTGFK 
KTQLS/ GLILHNNKT* YTS 1 1 IKTELTKYRH\RMENPEIDLHRFDQM 
± r utnj ± j\ v i u w {j ivbKJj r NQ WCWSRLRC I CLKFS I TNGQ 

5368 

A 

2 

300 

TFPNLMTNMNLQIQEAQ*IPSWKNSE/RSQ/ SRHLIIKLLKIKGKER 
X Liiv \ byKIi I TYKGCS I RLRGDFS PETTEARK* WDDI FKVLKEQVSTK 
NLIFSKWL 

5369 

A 

199 

494 

CSKGMPCSSKRNFCRQTMPSFIHEDDKNGRLIEKASVNRKPQLPQHF 

VDAYLDEMDQGKNDPSSTFSKENIilFSVGELIIAGTETTTNVLRWAI 
LFMA 

5370 

A 

256 

514 

ELHNl^jNTFSLTGQVQKEIDLI^GPNGKPSWDDKCKMPYTEAVLHEV 
LR FCNI VPLG I FHATS EDAWRGYS I PKGTTVI TNLYS V 

5371 

A 

2 

1340 

SLDLGGISTVVVNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGL 
LK\ S RYGRGWVDHRRLA VNS FRYFGYGQKS FESKIL/ EKETKFFNGC 
YWKHTKVGPFDF* TV* *RMLFSNITNLIHFWEERFHFMQGHRFFRHM 
DWSYFRWKMWN*AASASVFLYNAFPW/ IWA/FLPFGKHQQLFRK/WT 
AWYDFLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSK 
ENL I FS VGEL 1 I AGTETTTNVLRWAI LFM\ AL YPNIQGQVQKE IDLI 
MGPNGKPSLGR/RNAKCPYTEAVLHEVLRFXGNIVPLGIFHATSEDA 
S \ LRGYS I PKGTT\ I ITKP * FLYHLGWKKVPGEDPQKVFPIPERFPG 
TSSGIFWPQEGKLWVPFFPLGRRHCPWENTLARMEMVLFFSSIASEV 
S FAFSHMN * FQ I * S PRLGMTLQPQ PYL I CAERR 

5372 

A 

135 

273 

SPLVLVSHFHLHDITRRWWGAWFPALPPPSPSSPVCLPYPHFPNL 

5373 

A 

1 

344 

LNTLPEVPFTPLSCLPSLSIiSS*LAPSPFICICYAIVP/PPFLLLPS 
DI^PSSHPGLPFLFPPHSPRHAPSAPPSKKKKKKKKKKKKTTAPVQ 
ASliGTSSLYPLGGSETGPTWS 

5374 

A 

1 

724 

GRRGRGRAG SRAGRRGGAQAAAL \ VNRGGG P I RNY \ SDVAGGAAGG \ 
GGR\NRPAPYSRPKQLPDKWQHDLFDSGFGGG/AAGVETGGKLL\VS 
NLDFG\ VS \ DAD IQE\ LFAEFGER * RRAAVHYDRSGRSLGTADVHFE 
RKADALKAM \ KQYNGVP \ LDG \ RPMNI QLVTS Q I DAQRRPAQ\ S VN \ 
RGGMTRNRGAGGF\ GGWVDGTRKGT \ RGGA \ RGRGRGAGRNSKQQLS 

5375 

A 

37 

331 

GLLCQASGPQAPGWPAGLGRRGQGSPNLPVPPRPVPA* / APGHTCHA 
Ii/GKIPF*VPLPLQ*GPPETHLREPPPGPCPPS/PVPSPYQGFIiQPG 

5376 

A 

1 

385 

FFFFFFFFETETCSVA*AGVQWRDLD*LQPPPPGFKRFLCLSRPSSW 
DYRCTSPYLAHF\ *FLVETGFHHVGQAGLELLTSSDPPI SASQSAGI 
TGVSHCTWPKLTTSTLMNVFPLNI ILLYLVILAK 

5377 

A 

21 


PTHPWPPTVLEATAPGSEYFFF*DAVLLSSRRLECSG/SISAHCDLH 
LPSSSDS PS S VPQVAWI IGVRHHVQL I FVFLVEMGFRHVDQAGLK/ 1 
A* PQVIHPPRPHKVLGLQV 

5378 

A 

2 

545 

TCLGFFQMFFLGSLIFVCYILLKFDCVFIFSSNLGSIiIFVCYlLLKF 
DCVFIFSSQLPFFTFYMDYSNPLGLSMSSLRIS*KTF*MYG*VLYF\ 
FF * KRNS QCFPPFNRDYF W I VCS S S VDL FCTLFFTSCPHQQPS CS PP 
PPGNKGPAPLPTVTLEALKMMMVFFIGLSSRIRGRSWKSP 

5379 

A 

545 

1462 

GYFPCFIFPVATS*QSIGWNF*LFEYFVYIYIFEGFWGGLICLKVS i 
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1 SVCLFQVYFSFAKIIEPPGFKTTLPPSTLVNNWELTSLHQPK/RPAN 

FVIFGKRQGFSHVGPGLNSGPHAIRPASASQSTGITGVSHCACALNF 
ITEP 

5380 

A 

2 

243 

GFKQFSCLSHPSSWDYRCAPPHPTNF\ *FLVEPGFHHVGQAGLELPT 
SGDLPALAS QS AG I TGVSHHPWPM SALNS KNVR 

I 5381 

1 A 


O *i Q 

3 2 8 

S PETGSCSGAQAGVQWHDLGSLQP * LGLK* SS YLSLLSNWDYRCA/ P 
PRLANFL F FVQAGS CHIiAQAGLELL S S CHLPALVS QNAG I TG VSHHA 
RAGSCMEENTSLWLTL j 

5382 

1 A 

2568 


RGTEGRRVKSSKAGFFCFVLFCFVLRWSLTLLAQAGVQRHDLSSLQP 
PPPTFK* FSCLSLPS SWDYRRPPPRSG * F/ SFVFLVETGFHQVGQAG 

LELLTSGDPPASASQSAGITGVSHHAQPQLDLEPPVCGQRDMALLLL 
ERLPETVTCFLFPSELCRRAR 

5383 

A 

2 

306 

FFEMESCSAAQAGVQWHNPSSLQPPLPGFKQFSCLSLPSSWDYRRAP 

PHPANF\EFLVDARFYHVGQAGLELLTSSDLLASAS*SAGITGMSHH 
GWP*ATF 

5384 

A 

141 

415 

IRITEGGAQASWFKTPQV/WLGTVAHTCSPSTLRSRGGWIT*VQEF 
KTSQSNRNQRGCNLVWSLKLTSILLTFELDSIROEEGKt.VRPWnP 

5385 

A 

2 

287 

1 GRVGNNKKFGGRISLC/QPGWSAVTQLQLIVTLNSWAERFSQLDLPS 
S *DYRCTCLANFLTFCKDECLAMLLRLVLNSWPQGIPLPQLIjKVLEL 

5386 

A 

663 

IOTA 

1 LtKH bcf^yissi IjQRKHTRQLS AS L I TNAKVFFQAS RFNVS SHI LAL 
LQENIFYFNTKPVICKPG/CVIRITF*KL* IASQVRWLTPVIPALWE 
AEAGRSRGQGI kti lantvkprl y 

5387 

A 

1 A 

3 

208 

EDVNRWRDIPSPWIGRVRIVRMSVL\PNPINNTYINTEFLILI*Y*Y 
RNTS I P I KI SGRFFVLLRENEN 

5388 


190 

261 

I S I F/ CFFKTDSSCVAQAGVQ* *NLGSLQ/ PCNLQPLPPGFK* FSCL 
SLLSNWDYRCAPP/HPNFCIFSRDGISP*LSILPTTPFIKVARKIIiI 

5389 
1 5390 1 

A 

31 

1431 

RETRSCWAHYHAESSKHQKAKLKDSRKARRSIELIGPGSSQKGGDIC 
SLIHLSSKLGILCYSKTGNMGGWEGGMQLRVSSENSMGKSSRERAAA 
DLPQGSCWHELPGRLINTRISLKPRHVWNLSSVLCQVVKKVKSMQMF 
HMPITSAMQGDRLGICVTQFDPKLLERGLVCAPESLHTVHAALISVE 
KIPYFRGPLQTKAKFHITVGHETVMGRLMFFSPAPDNFDQEPILDSF 
NFSQEYLFQEQYLSKDLTPAVTDNDEADKKAGQATEGHCPRQQWALV 
EFEKPV/YLPSAVPGDWLQARCGHS\PNTCRLAFHGQSCSTG*RTGT 
/NADSFLPRIiKVYKLKHKHGLVERAMDDYSVIGRSLFKKETNIQLFV 
GLKVHLSTGELGIIDSAFGQSGKFKIHIPGGLSPESKKDPDTPPSKT 
GPGWPVGGQPGRRRSPSGANPHKMGWLTRPFKRYVFDTHKRMVQSP 


A 


4^1 1 

FFFW*GTESQLLPQCSGAITAHCSL*LPGSSEPPTSAS*VPETIGTY 

/RHAWLFFFFNLFFVETGSHYVAQAGFEPLGSNDPP\PQPPKLLGIT 

GMRHHSRPELLPLTVPMSWESQDGETEDFCAMETWKVNITCLAQTPR 
L 

5391 

A 

1105 

1497 f 

LFRTSLLVIAFLPKKNFFF*HYA*SYNVHPNTARKIKPEFRAVRVSF 
LLSERMSNI FFFFFDTEFHS \ VT\RLECTG/ T* SSTHCDPRLPGFEQ 
I LPASAS QS S WDS PACATHARL I FVFLVETGFLHVGQAG 

5392 

A 

109 

695 

LLLIPSAVTSYPPKDLPWGGMILSKPPI*LGSMIiANNCIVAGGCS*SS 
TLPS *LSSWRVGPQHLPRHARPPHQQVQYI ISQDGVQHLLPQEYW\ 
VPEGHH I QGQKGR I TH I Q YEQGAP FLQE S Q I Q YVP VS PG QQL VTQAQ 

LEAAAHSAVTAVADAAMAQAPGPVWLQDEDKCPKHNSNKLQHPRASK 
YDVITPGR 

5393 

A 

1 

405 

YAPSDLPVK7VERSTYNYSLPITVKKTSSQLVTMHDLKQGLGPSGTSG 
ARKPAS S KYKLKVSLQ'i'LRVRTRVQS PRAVCVGDI WRRGGDKSQGS L 
APAAACGCPSRAIiWEPGPPLSHVTTPGMIGIGHTQEDRARA 

5394 

A 

203 

459 

DDASGEGARNDSFT^ERLRRGLWEYFVGRTIAGPARRTGMAAEAADLG 
LGAAVPVELRRERRMVCVEYPGWRDVAKMLPTLGGEE 

5395 

A 

2 

1185 

LILNKKSRTRRSFFRRHGPSHGPFAPWRPSRLAEPGELG*REREGYN 
NPPISGENLIGLSRARRPHNAIFVNFEDEEVPKOPLEAAAO\TWRRV 
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ct\np\vdrkveeelrkaspalriaplfdirpiwsrnavk\anitvh 
pnrfkawlpfiayymi \tgpwrslwirfgydp\ rknpdakdl/ yqvl 
df\rircgmkqgy\apsdlpvkakrstyn\yslpitvkkts\sqlvt 
^lhdiieawpwprrgrsgarkpasskyklkvslqtlrdsvy\ifregail 

PP Y \RQMFYQLCDLNV \ EELQKI I \HRNDG\ AEDFLAQERGWGGCLP 
QGPGDEF/RGDTMSPMIRE\TIGSKRPALFSSSAKADGGKEQLTYES 
GEDEEDEVEEEEEEEDFKPSDGSENEMGNQNF 

5396 

c 

95 

j j 

YL YWDKI TFLYCEHTVGAQ * 

5397 

C 

262 

390 

MLVARMNXOCXJtt^ 

5398 


JLXO 


MGVGD 

5399 

A 

109 

559 

QFLHRIiVTOSGEVTVMKIiVE**NTLIiAKMVSISWPRDLPASASQSAGI 
TGLIGALVLSVGIYAEVER/HEI*NP*KCLPGSSHHPHPPGRRHVHG 
LLHWCAGVPP *QPGKH 

^4 on 

ft 
r\. 

X 


FFFFFLRRSL/SSVPQAGVQ*HPLSSLQPP/PPGFTPFSCLSLPSSW 
DYRCPPPRPAKFFLYF* *RRGFTMLARMVSIS * PRDPPASASQSAGI 
TGVSHRARPPPPQFFIQLPLYNIHTLEGR 

54 01 


At 


NPPTSAS*VAGTTGACYHTRLILVCRDRVSPCCP/VLVFFFFF*DRV 
AQAGVQWHGLGSTCQPSCPPGFKRFLLPSDSPALSPPSSWBYRHPPP 
RP\ANFFCIFIV^TGF\TVLARMVSIS*PRHLPALASQSAGITGLSH 
LARPVDS FLATS I Y 

5402 

A 

3 

352 

SSRPGSCRRGAVRAAIRYVRLRAVPSGGGEPPSGAPLQPSWK*AFAR 
PGMLRCS PMS QAPSRSAAVLFEPLAEWMLHCA* QDE VRLGGH * GRRG 
PLLLDRSRA*RESPAEVRPARE 

C /. f"J "3 

A 

•> 

286 

SVAQVGMHGHSLGSLQPLPPGLK*SSHliNLPSSWDYRHAPPHIiANFC 

IFFVETGFRHA\SQAGLKSLSSSNWPTLASQSWITGVSHRAQPC*T 
F 

3 4 

a 
J\ 

/ J 

542 

VLIFFFFFLKLSL/DSVA*AGVQWCNLSSLQPPSPGFK*FSCLSLLS 
S WDYRCVPPS PANFFI FL VETGF\ TVLARMVS IS * PHDPPTLASQS V 

GITGMSHCPWTDLIF**CKRDTLCIPSMSLS**FKVPKGNLIWKLLG 
KFGGLPLG I QNAYFW IV I 

5405 

A 

1 

397 

EMESSS VTQA*VQWHDLALLQPPPPRFNRFSCLSLLSSWDYRCPPLS 
PANFCIFSRD/MGFI VTiARLVLNS*PNDLPASASQSAGITSVTHRIW 
SKI FFRVDNDHY YILFS SMELNVPTI VLC YHPQS SNMYS 

5406 

A 

72 

243 

EPMTPQVSMFS ILRQPMKTTGMR IAMQEKVTQNTPS S * KI FRFPS YL 
IGVSIjVISI 

5407 

A 

34 

420 

MLYSHDPELS *QRVLTALYAVFPPFVPVDFL* 1 * FPSALVS IGGSRL 
L/ PVLSHWFLDLRV\ FH * FEKAHL * KLCGNP/ WLS ALLS TFKVM * ER 
LVDQ * LSMAKTTE * EPMTPQVSMFSCGRVEEFRHHPC 

5408 

A 

3 

316 

FFFNFSRQGLCHPSWSAVTQSRLTVASIFR\VQGSSHLSLLGSW/DA 

RHVPPCLANF\*FFCRD/RGLVMLPMSVSNSWAQAILPPWLPSIGIT 
bVbnnVWLASLFVF 

5409 

A 

905 

1291 

RGADTCLGFQVSADLHHLFSFLFCSVAQAGVQWRNLGSLQPPPPGFK 
LFSCLSLLSSWDYRHAPP/RPG*FFVFLVETGFT\RVGQAGL*LLTS 
<jD FFA \ b AS QTA/ RGL *GMSHLAS P 1 YLD Y FK* NFNKP 

5410 

A 

2 

308 

FLRHNLAVTQPGAQWRLLSSLQPLPPELKPSSHFSLPSSWEYRCEPP 

HPANF\*FLVETGFHHVAQAGLEFLASSDPPTSAFLNAGVTGVNHSA 
WJjCFSYLL 

5411 

A 

3 

368 

HRGES VWN 1 CRWGTLR * GKIGKMTRAHCLE FI GAL T VP9 T T <3 P A P PP 
AEPKWKNDLD * TLTHGRPVS VVXLAKKCDHG/ KDECRNS SLKMGQI Y 
MEHSFVG* IETSAKENVNFDEASRCLDAK 

5412 

A 

73 

380 

GQALQWCVSVSMCSCIHSHPCV*VLWVCV\CLCISM\CMYSYACVLG 

YTKFGVCTEKNKQNCTGNRSTQGLPLRI^TGSFRFWIJDFHPLASPK 
GTLRSLLSP 

5413 

A 

1 

400 

NS EGAAW 1 QPPAGE PTWGQS GGTQRAGAMP PGD YS * S QGHPFLH / RA 
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PPPLTRN\RRIFLRRAPSPVLEDPRKPPSRGQQRL*DALHHVLVAQL 
REGMLGQPRPHHPPPKPIK*SQHA*QLPAGEPEVGPGAGDR 

5414 

A 

1 

895 

LRRARPRGWAGREAGPHPH* LRCRSS SAALRPAFPTRPGTGETGLDR 
TEGRDQTSVNSAT/RLPNTPSR\SAGGSSLQTQRGLPGSSPQQENRP 
GDKAEAPKGQE/RMPPGDYFVIPRTPLSPPAPPPLTRN\RRIFLRRA 
PSPVLERPPEASKQRTATPLRRTAPCASGAAQRGDVSGRKGVCKAGE 
QG + PPPNNPRLRRSQAHARKRRRGDKFGVRPLQYCAF*DFWRRLSWQ 
NPFPKATLPLLR*PLPPPLQAGTAPSPPSSSQAN*MITARVTVTGWR 
AGS GTGS KGP * EPGRAS FPS 

5415 

A 

20 

481 

PTDRPGEHSKTPSLQKCKK*KLAGCSGTCL*S * ILGRLRQKDHLSPG 
VRGYNEL* SCHCTP 5 WVAE* GPVS K* 1 1 QS PPRKE * S / L YNEKNPSN 
♦HIVDSGEWFSFVFLGADRWGAGVLYQRVRALSQMCNERGHKHICPI 

tpTVT tJVVOT PT Ti TV T 

5416 

A 

236. 

820 

RRPQQAGKMKGSRIELGDVTPHNLFNCLIIiCGVTSPNIKQL\KRLNQ 
VIFPVSYNDKFYKDVLEVGELTK\LAYFNDIAVGAVCCRVDHSQNQK 
RLYIMDTRDVWAPYRSLGIRELKC*IMCLKHIiLKKMVLFDNIFLHVQ 

T OXTCO TV T n C VO CD T T T\ Tj VVTlTVWD TriYlTi T*V T\ XXTTT AWT inrDOr*AW 

ADVQKTDN 

5417 

A 

15 

209 

KASRKMQIETTLRYYLSPMRLAKMQMLIOTSCW/RKSQETRTLIC*W 

+ T-» f> T/TJT\ 1 f~\TlT PVPTIJT1VT T/*\ 

* EG KMVQ PL S KG I W P Y 1# J. Q 

5418 

A 

1 

498 

VQRTRQLVTS PS PM S S SD/ EQS S S PQCTVPS KLL VDPLQHEFS KCGE 
NDNNIP\TPSNSNTPLKHSASFISATGTTEESRSVPQIKNGSWSLQ 
S PGSRS S S AGGTS AVEVKVEPETS SDEHPVQCQENSDEAKAPQTPS A 
LLGQKSNTDGALQKPSNEG VI E I KATK 

5419 

A 

8 

299 

LYSPDESHSDFEPSTVFPSANPQNMIDSSTSVYEFQTPSYLTKSNST 
GQ INFS PGDNQAQS E IGEQQLD FNS TVKDLLS GD S LQTNQQLKY PDV 
LGE 

5420 

A 

1 

3788 

MMRLEGAEPSGQLQNIDEEVISSACRLVCEWAQKVLSQPFDTVLELA 
RFLVKSHYIGTKSMAALTVMAAAPAGMKGITQPSAFIPTAESNSFQP 
QVKTLPSPIDAKQQLQRKIQKKQQEQKLQSPLPGESAAKKSESATSN 
GVTNLPNGNPSILSPQPIGIWAAVPSPIPVQRTRQLVTSPSPMS/C 
F*RQSSSPQCTGGHSAHAVCETGTKDSPERSSQSWWGSFCPAPLPSD 
LTQTS EHQCT\ PIRS PTTVLFTS S P I KTA WPASHMS SLNWKMTTI 
SLTPSNSNTPLKHS AS VSS ATGTTEESRSVPQI KNGS WSLQS PGSR 
SSSAGGTSAVEVKVEPETSSDEHPVQCQENSDEAKAPQTPSALLGQK 
SNTDGALQKPSNEGVIEIKATKVCDQRTKCKSRCNEMLPGTSTGNNQ 
STITLSVASQNLTFTSSSSPPNGDSINKDPKLCTKSPRKRLSSTLQE 
TQVPPVKKPIVEQLSAATIEGQKQGSVKKDQKVPHSGKTEGSTAGAQ 
I PSKVSVKVS SH IGANQPLNSS ALVI SDS ALEQQTTPS SSPDI KVKL 
EGSVFLLDSDSKSVGSFNPNGWQQITKDSEFISASCEQQQDISVMTI 
PEHSDINDLEKSVWELEGMPQDTYSQQLHSQIQESSLNQIQAHSSDQ 
LPLQSELKEFEPSVSQTNESYFPFDDELTQDSIVEELVLMEQQMSMN 
NSHSYGNCLGMTLQSQSVTPGAPMSSHTSSTHFYHPIHSNGTPIHTP 
TPTPTPTPTPTPTPTPTSEMIAGSQSLSRESPCSRLAQTTPVDSAIjG 
SSRHTPIGTPHSNCSSSVPPSPVECRNPFAFTPISSSMAYHDASIVS 
S S PVKPMQRPMATHPDKTKLEWMNNG YSGVGNSSVSGHGI LPS / VSG 
TSGRPFQETSCFCC7VWTVLSVSIQTS*H*FWSFDSCLSCAASRCHCK 
*HQQTGGFCSPCPS***RN*FICQQQLQMPECEPCCSSPT*S*WKHP 
liSSI*Yir 11* \ Vi J? r GS P v 1 PB Vnvr TN \ F TQTHVPTT * LKGAbFob 
QLQNIDEEVISSACRLVCEWAQKVLSQPFDTVIiELARFLVKSHYIGT 
KSMAALTVMAAAPAGMKG ITQPS AFI PTABSNS FQPQVKTLPS P IDA 
KQQLQRKIQKKQQEQI^QSPLPGESAAKKSESATSNGVTNlLPNGNPS 
ILSPQPIGIWAAVPSPIPVQRTRQLVTSPSPMSSSDGKVXPLNVQV 
VTQHMQS VKQAPKTPQNVPAS PGGDRS ARHRYPQILPKP ANTS ALT I 
RS PTTVLFTS S P I KTAWPASHMS S LNVVKMTTI S LTPSNSNTPLKH 
SASVSSATGTTEESRSVPQIKNGSVVSLQSPGSRSSSAGGTSAVEVK 
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VEPETSSDEHPVQCQENSDEAKAPQTPSALIX3QKSNTDGALQKPSNE 
GVIEIKATKVCDQRTKCKSRCNEMLPGTSTGNNQSTITLSVASQNI.T 
FTS S S S P PNGD S I NKDP KLCTKS PRKRL S S TLQETQVP P VKKP I VEQ 
LSAATIEGQKQGSVKKDQKVPHSGKTEGSTAGAQIPSKVSVNVSSHI 
GANQPLNSSALVISDSALEQQTTPSSSPDIKVKLEGSVFLLDSDSKS 
VGSFNPNGWQQITKDSEFISASCEQQQDISVMTIPEHSDINDLEKSV 
WELEGMPQDTYSQQLHSQIQESSLNQIQAHSSDQLPLQSELKEFEPS 
VSQTNESYFPFDDELTQDSIVEELVLMEQQMSMNNSHSYGNCLGMTL 
QSQS VTPGAPMS SHTS S THFYHP I HSNGTP IHTPTPTPTPTPTPTPT 
PTPTSEMIAGSQSLSRESPCSRLAQTTPVDSALGSSRHTPIGTPHSN 
CSSSVPPSPVECRNPFAFTPISSSMAYHDASIVSSSPVKPMQRPMAT 
HPDKTKLEWMNNGYSGVGNSSVSGHGILPSYQELVEDRFRKPHAFAV 
PGQSYQSQSRHHDTNFGRLTPVSPVQHQGATVNNTNKQEGFAVPAPL 
DNKGTNSSASSNFRCRSVSPAVHRQRNLSGSTLYPVSNIPRSNVTPF 
G S P VTPE VHVFTNVHTD ACANN IAQRS QS VPLTVMMQTAF PNALQKQ 
ANSKKITNVLLSKLDSDNDDAVRGLGMNNLPSNYTARMNIiTQILEPS 
TVFPSANPQNMIDSSTSVYEFQTPSYLTKSNSTGQINFSPGDNQAQS 
E I GEQQLD FNS TVKDLL SGDS LQTNQQLVGQGASDLTNTASDFS SD I 
RLSSELSGSINDLNTLDPNLLFDPGRQQGQDDEATLEELKNDPLFQQ 
ICSESMNSMTSSGFEWIESKDHPTVEMLG 

5421 

A 

288 

668 

VEKRQFI^STADPECIRYPLTYFVLGQSLALVTKAGVQWRNLSSLQP 
PPPEFKRFSCLSLLSSWDE\RHAPPCPS*F/ CVFFLVKTGFHRVSQD 
GLDLLTSNDPPTSASQSAGITGVTLAPSLFVLIH 

5422 

A 

3 

288 

STFFFFFK*N1ARLSRLECHGTJSAHCNFRLLGSSDSPALASQVA\G 
LRGTCHHT * L I FVPTVETG FRHVGQAGLELLA\ WVI C P P * PPKVLRL 
QA 

5423 

A 

1 

782 

MVHITGLCADNLQYQAGARIISVPEEDNSKSLENIFGGIIEENFPSL 
ARDLDIQIQEAQGTPGKCITKRSSPRHIVIRLFKVKTKERILRAVRQ 
KHQVTYRGKPIRLTADFSEETLRARRDWGHIFSFLKQNDYQPRILYA 
AKLSIIYEGKIQSYSDK\QMLREFTITNGKLLKGAVSLWALNR*NK 
SWKHIKTHQNRTSLKHKSHRTYKTKIPLHDEYNDLRSALRPASHTAY 
GSMLMILKSSLTLFKHGDEVVGRRAK 

5424 

A 

2 

392 

KTKI IQMANKHRKRH\QSQ* * VRLMQKKI IMIHHYIPQWLKTSKSDM 
IKCCSGYGTTEIFTHYIjWECK1jLHLLWRMVWL\ILLKFKTDTPHDPA 
I PLLGTYPAEC\ CTCTPKAHTAALS 1 1 VPSWEQPKYPQ 

5425 

A 

1 

181 

VSHVITNVLQPPANRPQERGDELADSALEIFKQASAFSRASQRSVRL 
APVYQKLLERMKH 

5426 

A 

1 

2162 

MALFVRLLALALALALGPAATLAGPAKSPYQLVLQHSRLRGRQHGPN 
VCAVQKV I GTNRKYFTNCKQWYQRKI CGKSTALGTADVKGVS S LAVS 
VSDLGGEQMRKLRPREMMKLAEDGPAALPLSNLYETLGWGSTTTQL 
YTDRTEK\ LR PEME / GG PG S FT I FAP \ SNEA\ WAP \ LPAEVAGTPL V 
SNVNIELLNALRYHMVGRRVLTDELKHGMTIiTSMYQNSUIQIHHYPN 
\GIVTVNCARLLKADHHATNGVVHLIDKVISTITNNIQQIIEIEDTF 
ETLRLA\VAASGLNTMLEGNGQYTLLAPTNEAFEKIPSETLNRILGD 
PEALR£)LIjNNHIL/RSQAMCAESIVAGIiSVETLEGTTLEVGCSGDMIi 
TINGKAIISNKDILATNGV\IH\YIDELLIPD\SAKTLFELAAESDV 
STAI \DLF\RQAGLGN\ HLSGSE\ rltllg\ plns vf\ kdgnp\ pid 
\ ALTRDLLWNH 1 1 \ KDQLAS KYL YHGQTLETLGGKK\ LR VFVYRNS L 
CIENSCIAA\HDKRGRYGTLFT\MDRVL\TPPMGTVMDVLKGDNRFS 
MLVAAI QS AGLT\ ETLNREGVYTVFAPTN\ EAFRALPPRERSRLLGD 
AKELAN I LKYH I GD \ E I LVSGG I G AL VRLKS LQGDKLVS S A* KTNW 
SV\1«CEP\VAEPDIMATNGV\VHVI\TNV\LQPPANRPQE\RGIDLA 
DFAV\EIFKQAFSVFPGASQ\RSVRLAPVYQKYL\ERMKH 

5427 

A 

1 

410 

FFFLKRRDLAVAQTGVQWGDHGSLQPQIPGRRQSSDLSFLNS*DHRH 
ILSCSATFTSF*RSHSAAQAGLEILATSNPPA*ASQGPGI\*GVSHH 
TWQKATFCICLPANNILSCLNIYSFFYHKTNHNFIMKTNHKQI 
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5428 

A 

3 

404 

RL F YNFT I EMRF S LFVQG CL / INS LG * SDLLVF AS AS AG I T \ G VS YP 
HALASIVFVCVCV*WGSALVAQAGVQ*RNLSSLQPLPPEFKQFSFLS 
LPSSWIYRHPPPPLSFYLPKLKSCGLCTGIWAWRLFIFIGS 

5429 

A 

2 

240 

TRSAW*SQLTAALTSL\VKQLSQLSLPRSWDYTHVPPCLSIFYFL* 
R*GLPMLPGLVSNSWTQAIFLPQPPKVLGLQA I 

5430 

A 

3 

468 

RWSFTLVAQAGVQRVILAHAQPPPPGFKRFSCLSLPSS/WDYRLAPP 
RLANF\*FLVETGFLHVGQAGLELPTSGDPPASASQSARIIGVPLCP 
G ISRI FVQQMALEDRDS I SLRLWFLSRHANLQCVTLS WACQLLPCGT 
WRAR ETKANTL PA VP 

5431 

B 

151 

344 

YENESALNLYETCKVRTVKAGTLEKLVEHLVPAFQGSDLSYVTIFLC 
TYRAFTTTQQVLDLLFKS * 

5432 

A 

1 

439 

PSASSGQAGGFWSQSLFKQESSGFVRTSAESVLRQESLRGVLGVLEL 
YWRGSEVCGSRQSLYGLSSGRGILQGPQEHP*SPLCPSPPELKIPEN 
ANVFYAMNSTANYDFVLiKKRTFTKGVKVKHGASFTLP\RMKQKGLKI 

akgif 

5433 

A 

77 

444 

glgvpselqeirvlpltvcpssvppqyenesalnlyetckvrtvkag 
tleklvehlvpafqgsdlsyvtiflctyrafsasqqvldllfkr*a* 

ALHCTGCTRGPRPHPHLAAGLGNAESPLi 

5434 

A 

1 

3190 

MVQRMWAEAAGPAGGAEPLFPGSRRSRSVWDAVRLEVGVPDSCPW1, 

HSFTQLDPDLPRPEVGGGRGLLQKSASRVGSGGCRRAPSNSRDHGWG 

FAPIjNTVPGPAVTKAAAAQVDLFLRHCPCLHRWPVPYFTQFPPRARS 

ASLAFTMQAVGLCSRQSVPYLGNRLHRCIPCTVARQALDVPAPGCRA 

LSSQSSTQEIGEELINGVIYSISLRKVQLH\PAGNKGQRWLGYENES 

ALNLYETCKVRTVKAGTLEKLVEHIiVPT^FQGSDLSYVTIFLCTYRAF 

TTTQQVLDLLFKRYGRCDALTASSKYGCIIjPYSDEDGGPQDQLKNAI 

SSILGTWLDQYSEDFCQPPDFPCLKQLVAYVQLNMPGSDLEHRAHFL 

LAHLEHSEPIEAEPEGEEDWALSPVPTUiKPTPELELALTPARAPSPV 

PAPAPEPKTNS\TSVSGSKLEETPAPAPELRRAP*PAVGLESAPAPA 

LELEPAPEQDPAPSQTLELEPAPAPVPSLQPSWPSPWAKNGLSEEK 

PHLLVFPPDLVAEQFTLMDAELVKKEGPYHWPGFIWSQRDKKGKELL 

APT\ I RATVTQFNS VAN I \ V I TTCLGNRS TKAPDRAR WEHW I EVAR 

ECRIliKNFSSLYAILSAXiQSNSIHRLKKTWEDVSRDSFRIFQKLSEI 

FSDENNYSLSRELLIKEGTSKFA\TLEMNPKRAHKRPKETGI IQGTV - 

PYLGTFLTDIiVMLDTAMKDYLYGRIilNFEKRRKEFEVIAQIKLLQSA 

O0OTSIAPDEQFGAWFRAVERLiSETESYNLS\CEL\EPPSESASNTL 

RTKKNTAI V\ KRWSDRQAPQH* AQLPVGS SHFQVPVDQFRC / SGPYF 

SSGDIADALSVHSAGSSSSPVEEINI\SFVPES\PDGQ/EKKKFW/E 

NSAFKSYSPETSGISS\ASSSTS\SP*ASTTPVAATRTHKRSVSGLC 

NS S S ALPL YNQQVGRIi WI IRVS WDVGQWATCYKS IL/WTS QDKAP 

AVI\RKAMDKHNLEEEEPE\D\YELLQILSDDRKL\KIPENANVFYA 

MNSTANYD FVLKKRTFTKGVKVKHGAS S TL PRMKQKGLKI AKG I F 

5435 

A 

1 

436 

EKINKIRWLPQQNAAHFLLGTNDKAIKLWKISERDKKAEGYNIiKDED 
GRLRDPFRITALRVP I LKPMDLMVEAS PRRI FANAHTYHINS I SVNS 
DHETP\NIVDIKPANMEELTEVITAAEFHPHQCNVFVYSSSKGTIRL 
CDMR 

5436 

A 

204 

397 

VPILKPMGSYG*EASPR\RIFANAHTYHI\NSI\SVNSDH\ETYLSA 
DDLR INL WHLE I TDRS FNI V 

5437 

A 

1 

1463 

PLRS WLPRLPDS QAD 1 1 STVE FNYSGDLLAT/ GDKGGRW I FQREQE 
VLAQPRRPAIiP*SVSSFLSTSCREVWQGCEFSLPFRERIK/GRPHSR 
Cat. i N ViS/ TAFQSHGTGGFDYLKSLE IEEKINKIRWLPQQNAAHFLL 
STNDKTI KLWKI S ERJ>KRAEG YNLKDEDGRLRDPFRITAiRVP ILKP 
MDLMVEAS PRR I FANAHTYHINS I S VNSDH\ ETYLSAR * PGG INLW\ 
HLGNHR*EAFNIVDI\KPANMEELTEVITAAEFHPHQCNV\F\VYSS 
S KGT I RLCDMR\ S S ALCDRHS / KS FFEEPEDPK/ SSRSFFS\EIISS 
IS\DVKF\SHSG\RYM\MTRGLPVRWKVWGPSTWEEAGPVGGPTRVH 
EYL\RTKL\CSL\YENDCIF\DKF\ECCWNG\SDSAIMTGSYNNFFR 
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M FDRDTRRD VTL \ EAS RES S KPRAS LKPRKVCTGGKRRKDE I S VDS L 
DFN\KKILHTAWHPVGQCYLPWLATN\NLYIFQDKIN 

5438 

A 

3 

392 

FCACFIGSKTPASCRAQGRRPGEFSPGSPSGPAGGKGVLKELEPSEM 
/ PP/ PEAGERKLERKTVPGRGAEYGPPHLRLS YPQAVNSHPLPSSVN 
GGGDSNNSSQLPESHCPPFSPKEGPRKGEVPCLHATRA 

5439 

A 

1 

298 

SLLGPPEPTTWT/PAPQDR*HGPC*GR*RKRRPSL*T*QWELASPP/ 

REPMFVPKKEMVAIQPLQEEASVGPTPELLSGSCRSPISGISILGNC 
PSLAPLT 

5440 

A 

3 

11 

HELILSPGPTQGQTPDEAGVIIRVLKNVPDLAILNSKDALDPRQPGY 
QPPNPHPGPSSPPAAP\RPRGA*GNPQLENASRSDRNPSQGLRTRIR 
RPETPDCGPPSPAGSS\PAPPPSDVPPAFHSSGPPEPTTWT/PAPQD 
R * HGP * GDKRGAPGVAGED PDQP * GTS 

5441 

B 

1 

513 

MEVAVSRIMTLHSSLGDRARLHLKKKKEEEEKEEEEEEEEEKKKEEK 
EK^KKKEKIWRRRKKTKNNTKQmCT^ 

PGVTARYATSKGDVSAAASTLPVKTPAPIQRRGEVSTIHLSQWQWQ 
WGPHALYTVWLLEEGEQLSPGDLGTGFI * 

5442 

A 

2333 

2633 

GGTPISLLFLFCFETQSHFITQAGVQGTILAHHNLHLPGSSDSPASA 
S*VAGIAGAHHCAWLIFVFLVEMGFLHVGQAGLELP/NIjVIHPPRPP 
KVLGLQA 

5443 

A 

1 

704 

MVAASAA I KFKTNMSLAI AI AGEHVEFAL I PVTAVKYTYTVI PALVM 
TWCLSYIERWVDSITPAVTKNFLKPMLIVI.IAAPIiAILLIGPIGIWI 
GSAI SALVYTIHGYLGWIiSVAIMGAI,WPLLVMTGMHRVFTPTI IQTI 
AETGKEGMVMPSEIGANLSLGGSSLAVAWKTKNPELHLDTPHTESPT 
PHDSNSHRP/PTAGVYNRSKAKSTDKDNDQTRWGNWRWSQRKYCLPL 

5444 

A 

195 

524 

RALTPRHLPCTSQVASCNLCCLFLADLDTL\HTESPTPHDSNSHRP/ 
PTAGVYNRSKAKSTDKDNDQTRWGNWRWSQRKYCLPLIVLSFFQTKQ 
NTGTFGRNFLSPFLMYV 

5445 

A 

2 

884 

VPQFKPKLHQENGPGLARCDHQQGAQPTRETWLQDRDREFQSQTKFL 

LRSRV*LVLQLQESQHSLERLREGQRLVEREQARMRAQQSLLGHWKH 

GRQRSLPAVLLPGGPEVMELNRSESLCHENSFFINEALVQMSFNTFT 

KLNPSVIHQDATYPTTQSHSDLVRTSEHQVDLKVDPS\QPSNVSHKL 

WTAAGSGHQI LPFHESSKDSCKNDLDTSHTESPTPHDSNSHRP / PTA 

GVYNRSKAKSTDKDNDQTRWGNWRWSQRKYCLPLIVLSFFQTKQNTG 
TFGRNFLS PFLM YV [ 

C A A C 

A 

269 

484 

DKTSRGTIRQQHSRFTKIRCSAATAADTQANRVWSGPLANSNRPAAE 
GPRDPPASASQSAGITGVSHRAWP 

5447 

C 

5465 

5719 

MRLGKTHSTAGAWKMGLLRQADSTRGEKRGGKKAAI I CVKS PPGYQV 
VAVWKYHS SIX3VFEFKKTCVSMMGRHDTEKKMS WERP * 

5448 

A 

57 

102 

MEFCVNSQKCDINS 

5449 

A 

3 

287 

RALYYFL*FALFGLT*M*I»VS*KLQIIEYEKKQTLGQNDTGFICDGT 
ANTFRVMFKE P I E I LPNVC YTACATLKVRVMHYVPDG FLLNVAECVF 

5450 

A 

1 

594 

MAVHPWI TVI ISILGII SS VPHTBRQQRQEGCGLHVRQLDLGRRGG 

HAAVEHGVKDGAAGRQHEAVGGDALRAPSGGGAATLAQYEAHIAQQL 

RVEEEGEALLQRRLGRLPWERFPLQGEQGPQRGRPLITKSSAPIVY 

LGHVPRPVPRSVLPGSGGAGKYGQGNTVSRVRSFWWVRGLADFKNGA 
VDLPDEFYSS 

5451 

A 

1 

768 

S VE F PCRL S G AWR WAEAECQRQQLP VTFGNKQKVXiGKAXi S L IRF PL 
MTIEEFAAGPAQSGILSDREWNLFLHFTVNPKPRVEYIDRPRCCLR 
GKECCINRFQQVESRWSYSGTSDRIRFTANRRTSIVG\LGLYGS\ IH ! 
vjjt \ \y vwiyi ±2,xz,fu\\i 1Ijo\QNIjTGFSCDGTANTFRVMFKEPI 
EI LPNV CYTACATL\KGPDSHYGHKR I *RKWHGGHLLAS KDCFFLF 
* F/ ALGINNGTS IEDGQI PE IIFYT 

5452 

A 

1 

491 

LKDVITLRRHVETKVRAKIRKRKVTT 

QRAQELRRRAEDYHKCKIPPSARKPLCNWVSLFVFLAFEHSLPGQDM 

DTFFSLQLCAQALQREMAERKAAYSITVPSLLVTVLCKNTFIPTQWA 
PDLNS KCQRFH I CNS KWALTVN 
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5453 

C 

175 

324 

MKVFLHNTVTNRDGTVMLVGSLPLCHLPLQGLSTEPVAKRLSCKGGD 
FAF 

5454 

A 

3 

190 

RAAYKPHS P I PVGTI PNEGLK* CSAKLVLVWPLYLQKLPSPMITRDG 
TCTI>EVJCHS FKTCFV 

5455 

A 

1 

716 

MVKLSIVLTPRFLSHDQGQLTKELQQHVKSVTCPCBYLRKVINTLAD 
HRHRGTDFGGSPWLLIITVFLRSYKFAISLCTSYLCVSFLKTIFPSQ 
NGHDGSTDVQQRARRSNRRRQEGIKIVLEDIFTLWRQVETKVRAKIC 
KMKVTTKVNRHDKINGKRKTAKEHLRKIiSMKEREHGEKERQVSEAEB 
NGKLDMKE I HTY/ M ITPS ARKPLCNRVRMAAAEHRHS SGLP YWP YLT 
AETL 

5456 

B 

1 

387 

MRVRWLLFWLLFWLLLGFISHQSTCVINTLADHRHRGTDFGGSPWLL 
I ITVFLRS YKFAI SLCTS YLCVSFLKTI FPSQNGHDGSTDVQQRARR 
SNCRRQEGIKIVLEDIFTLWRQVETKVRAKIRKMK* 

5457 

A 

1 

435 

MFQRGQELR WRAED YHKCKI TPSARKPLCNWVRMAAAEHRHS SGLPC 
WPYLTAETLKNRMGRQPPPPTQQHSITDNSLSLKTPPECLLIPFPPS 
PLPPSVDDNIiKTPPLATQEAEAEKSLKPKRQRLSKLRTGHCTQAWAI 
RRNL 

5458 

A 

1 

4798 

MGSTGVYKVTPRS CHRFEQAF YTYDTS S PS I LTLTAIRHHVLGTI TT 
DKMMDVTVTIKSS IDSEPALVLGPLKSVQELRREQQLAEIEARRQER 
EKNGNEEGEERMTKPPVQEMVDELQGPFSYDFSYWARVLCFVGTGPA 
KLKYINYFRSGEKITVTPSSKELLFYPPSMEAWSGESCPGKIilElH 
GKAGLFLEGQIHPELEGVEIVISEKGASSPLITVFTDDKGAYSVGPL 
HSDLEYTVTSQKEGYVLTAVEGTIGDFKAYAIJVGVTLHSQDVLMLPG 
DLVGLQHDAGPGALLHCSPAPGHPGPQAPYLSANASSWLPHLPAQLE 
GTWACPACALRLLAATEQLTVLLGLRPNPGLRLPGRYEVRAEVGNGV 
SRHNLSCSFDWSPVAGLRVIYPAPRDGRLYVPTNGSASVLQVDSGA 
SATATARWPGGSVSARFENACPALVATFVPSCPWETNDTLFSWALP 
WLGEGEHVMDVVVENSASRANLSIiRVTAEEPICGLRATPSPEARVLQ 
GVPVVLLAGSSGYLVGFKFLESHGSDSGSANSFHRLISRNEFKTPLP 
DLTRVPRYSPWEAGSDMVFRWTINDKQSLTFQNWFNVIYQSAAVF 
KLS LTASNHVSNVTVNYN I TVERMNRMQGLRVS TVPAVLS PNATLAL 
TAGVLVDSAVEVAFLWTFGDGEQALHQFQPPYNESFPVPDPSVAQVL 
VEHNVTHTYAAPAALGGGAVLTRQPSVLLHLCSVPHVAWEPGTLKAG 
PQVSTVLTVLASNAFENRTQQVPVSVCASLPSVSVCASLTGACWYPR 
VLIRSGRVPIVSLECVSCKAQAVYEVSRSSYWLEGRCLNCSSGSKR 
GG YTFTLTVLGRSGEEEGCAS I PLSPNRPPLGGSCRLFPLGAVHALT 
TKVHFECMGWHDAEDAGAPLVYALLLQRCRQGHCEEFCVYKGSLSGY 
GAVLPPGFRPQFEVGLAVWQDQLGAAWALNRSLAITLPEPNGSAM 
GLTVWLHGLTASVLPGLLRQADPQLVIEYSLALVTVLNEY^RALDVA 
AEPKHERQRRAQIRKNITETLVSLRVHTVDDIQQIAAALAQCMRKLP 

eqd i aqg s y i alpltllvllag ynhdkl i plllqltsrlqgvgalgq 
aasdnsgpedakrqakkqktrrtlatsintsrepstddqlpahnqtm 
pqrharrs appraydrktrqeenphqtrshaaakrrerp phdlqkqa 
ttrl i pag prrrdgts prrtqpp pntrrpaaaghlarfrraapgarg 
arpptarrgreeldpahiyaaapglptppragrtpptperrdrntrr 
rrtreegegefrpvs flkti fpsqnghdgstdvqqrarrsncrroeg 
i ki vl \ ed i ftlwrqvetkvraki r kmkvttkvnrhdkingkrktak 
ehlrk\lgmkerehgekerqvseaeengkldmkeihtymemfqraqa 
lrrraedyyrcki tps arkplcnrvrmaavehrhs sglp ywp yltae 
tlknrmghqpppptqqhsiidnslslktpserllypfppsalpsapp 
saddnlktravcllhplppsappsaddnlktpp/evcliitplppsap 
psaddnlktppecvcsLpfhpqlhpqrmiisrhlpsvsahspstlss 

TLSG 

5459 

A 

2 

568 

RRQEGIKIVLEDIFTLWRQVETKVRAKIRKMKVTTKVNRHDKINGKK 
KTAKEHLRKLSMKEREHGEKERQVSEAEENGKLDMN\ * IHFYMEMFQ 
RAQALRRRAEDYYRCKITPSARKPLCNRVSLLVFLAFGHSLPGQDMD 
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TF FS LRLCAS S PAEGDGREEGCLQAFTVP SLLVTVLCKNTF I PTQWG 
P 

5460 

A 

69 

419 

LKSKLSDYKINL/QKSVTFVYTNSN*SEK/QFKKEIQFTNSCKDN*K 
NYLGINVIKEVK/DLYN/ ENY* IIjIKNEKNPKK/WRAIPCSWFGRIN 
IVNKF\ ILPKVGYRSNAIPMKMPVTSRTRG 

5461 

A 

2 

216 

YLFIYLLPRLKCSGVITAHCSLYLPGSKYPPHSAS*VAGTTGPATMS 
S*YF*FFVETRFFLCSPGWS*GSSSDPSASASQSIVKIGVSHCT\CP 
RNFRNRLAQS LHFTGEETVPRKKGDLPWVYRQTQG * I PGPHRCQVLA 
F*PKDFLLSPHIYVSTSILWPPPA*AQAEVQWCDHSSLQPLPPRFKI 
S S PL S LL S S WNHRS CHHVQL I FLI FCRD KVL P VF PRLVLRELQ 

5462 

A 

59 

405 

KRGFNFGAPDKREWGEFG\*LKPWLPG*NNFSGQTLRRSGD*RKTPH 
GG VNL VF * EKTG FSNLGRGGLKP P P SGNTP AW P PKG VG I TGG AS S S P 
PVFCFLGFCENTNLRAKKKFI 

5463 

A 

1 

242 

STFFFFCETESSSVTQAGVQWHNFSSLQPPPPRFKQFSCLSLPSSWD 
YRCAPPCPSNFCVFSRDGDSPRWPGWSPTPDLK 

5464 

A 

15 

276 

ASVTQAGVQWHNLGSLQPPPPRFKQFSCLSLPSSWDYRRPPPCPTNF 
LF\ LVLARMVS IS * PHDPPASASRSAEITGVNHHARPGLFL 

5465 

A 

2 

309 

EQKFNKQKGRRALRCRGGLSGLLSCSKDVKVSFFSVARLQCSGLILA 
HCNLHLPG\ SDS PAS AS * VVGTPG\QRHHAQLGFVFLVEMGFHHVGP 
GMVFDLLTSG 

5466 

A 

1 

1209 

GG I PALDKNVAELTVMDVYDI LSLVGHEVERVIDQHGCEAIARLMPK 
WRVLE I LE VliVSRHHVAPELDELRLElA APLR\ RVRMDLI EKERKH 
QKELELVEDVWRGEAQDLLSQIAQLQEENKQLMTNLSHKDVNFSEEE 
FQKHEGMSERERQVMKKLKEVVDKQRDEIRAKDRELGIiKNEDVEALQ 
QQQTRLMKINHDLRHRVTVVEAC^KALXEQKVELEADLQTKEQEMGS 
LRAELGKLRERLQGEHSQNGEEEPETEPVGEESISDAEKVAMDLKDP 
NRPRFTLQELRDVLHERNELKS KVFLLQEELGLL * E * RNGRG KPNTP 
TPTHRPPEDGPPSRESGIKATCLASFFPEVKKRLGQTHRGNVHIQGV 
PLDKLG KHPTAMTG YTEQRTG S PCS I C 

5467 

A 

1 

388 

fkdpeavraltctllredfglsidiplerliptvplrlnyihwvedl 
i ghqd sd ks tlrrg i d i gtgas c i ypllgatlngwyflatevddmcf 
nyakknveqnnlsdlikwkvtqktllmdax j kgsi*v*smtf™\ot 
ppffanqigsqve*/ysrnprrpppssvntggiteimaeggelefvk 
riihdslqlkkrlridmaa1.ligvcsevsgrkdwiqqlaprwyscm1, 
gkkcslaplkeelriqgvpkvtytefcqgrtmrwalawsfyddvtvp 
vhegriavggvtsdsqktkgsasppskrrklekprkpitfvviiasvmk 
elslkasplrsetaegivvvttwiekiltdlk\asinefpvekrksa 

FS * RP * KTPGFI * GERKESV*DSDKSTLRRGIDIGTGAS CI YPLLGA 
TLNGW YFLATE VDDMCFNYAKKNTVEQNNLSDL I KWKVPQKTLLMDA 
LKGSI 

5468 

A 

72 

680 

LEEPRPQSLEGHTEGERAALPGKAKWEGRVADDGVPGQPAWPGAPP 
HTAAFQPVHALG S TRAS QQG S LGDC * AACR * Q * KAP * VPKQGT * QER 
SSTK/PRAVPSGHWAGGSGOTCSE*RAALG*TTEPQQSGELPGPPWP 
LEGAAHPLSAETGSCVPPRAFSGQGKQPRGPG*AAEGPHGDFPQTGD 
RHCPPSSTEPSGMTGR 

5469 

A 

259 

373 | 

FSGLFLDFI S FLG PMSGMGMNMGMDGQWHYMYPS SCKA 

5470 

A 

1 

336 

MEVGKLSKGVWEEPALPDERKTFASARGPVRKADTPVHRAAAGTAWR 
KVQCLTKDDAVWCVKENGLDISSLGLIPAQTTCVILHPYPSEEQKKQ 
LAQDTGLTI LQVNNWKAH 

1Z. A *7 T 
34 / J. 

A 

1 

1396 

gahaphpnvmpasmgsavndalkrdkdaiyghplfpllal.vfekcel 
atctprepgvaggdvcs sds fnediavfakqvraekplfs snpeldn 
lmiqaiqvlrfhlLelekghelcdnfchryisclkgkmpidlvider 
dgssksdheelsgsstnladhnpsswrdhddatsthsagtpgpssgg 
hasqsgdnsseqgdgldnsvaspgtgddddpdkdkkrqkkrgilpiv 
fp\nimrawlfqhlthp\ypseeqniqlahdtgltifqvnnwfinar 
igivqpmidqsnragflldpsvsqgaayspegqpmgsfv\ldgqqhm 


1040^ 
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G\ IRPAGTYEWDGA* IWAWMGNGTTC*PSSC*RHSQSKGEVCRAWPG 
DYVSQGGPMGMSMAQPSYTPPQMTPHPTQ\IiRHGTPNAFHILPSLPH 
HPGLD * LHGGTPY PTWD * LC * AQSPTMLNS VDPNVGGQVMDIHAQ 

5472 

A 

3 

862 

TMRPDIDNSQAAHQPPGRLPNSRKLKKTRRQMCCVQTKKRIARLPPC 
CRNTPTTARSHPGRDCAKPNT * SLRS PGGDCH * QGNTTWRM PMAR * L 
SGDSESGRSPVRTIICHQPSRSQSPVDRLQQLLPASQSTQLPCSSSP 
QETTQSRPMPPEARRLIVNKNAGETLLQRSARLGYEGS\VLYCLENK 
I CDVNHRDNAGYCALHEACARGWIA KFVRHLLEYGADVNCSAQDGTR 
PLHDAVENDHLEIVRLLLSYGADPTLATYSGRTIMKMTHSELMEKFL 
TDLFK 

5473 

A 

1 

444 

KLLRWLRQE\NCL\NLGGGSCSELKSHHCTPPWMIEGDSI*KKQNKT 
KTRKRKPQGSHS/TLSGGPDWV*QNCEVSQVLPISHFPLLQDILPPIi 
PYLGANGQAGEQGATDGH * / RTTTESHRAS VGCRI PWLCQRPLALAL 
S FFLCEMRT I P 

5474 

A 

2 

330 

EMSFALVAQAGVQRHNL»GSLLPLPPGFK*FSCLSLPSSWDYRYAPPT 
P\ AN\ FAFLVETGCLHVGQAGLKLPNSGNIiPTS ASESAGI IGVSHCA 
QPGTATHSNTVLSGLFC 

5475 

A 

2 

363 

ELDTLCDLYEP * PS PS 1 1 F INTRRK/ VDWLTEKMHARDFTVS AMHGD 
MDQKERDVIMREFRSGSSRVLITTDL\RIGRGGRFGRKGVAINMVTE 
EDKRTLRD I ETFYNTS I EEM PLNVADL I 

5476 

A 

112 

434 

ARGIDVQQVSLVINYDLPTNRENYIHR*A* IWNTPLPLHTWPSLGLK 
LLIFLIPFLVFQ\IGRGGRFGRKGVAINMVTEEDKRTLRDIETFYNT 
SI\EEMPLNVADLI 

5477 

A 

3 

724 

NSNVEREEWKLDTLCDLYETLTITQAVIFINTRRKVDWLTEKMHARD 
FTVSAMHGDMDQKERDVIMREFRSGS SRVL I TTDLL/ AE * RELI AKA 
EGRIQGDSLSKGTS VPLRKVAAWNI I WHA* GLWGTGMLI SSAFLGCP 
HGCLSVFPPG*SVLRAHAEELSFLTHARGIDVQQVSLVINYDLPTNR 
ENYIHRIGRGGRFGRKGVAINMVTEEDKRTLRDIETFYNTSIEEMPL 
NVADLI 

5478 

A 

1 

1274 

MSASQDSRSRDNGPDGMEPEGVIESNWNEIVDSFDDMNIiSESLLRGI 
YAYGFEKPSAIQQRAILPCIKGYDVIAQAQSGTGKTATFAISILQQI 
ELDLKATQAIjVIiAPTREIAQQIQKVVMALGD^ 

AEVQKLQ\ MEAPH I \ IVGTPGRVFDMLYRK/ YTLS PKYI \ KMFVLDE 
ADEMLSRGFKGQIY\DISKRLNSTTK\VVLLSATMPFD\VIiEVTKKF 
MRGPPFRILVIQCEELTLEGIRQ\ F YI \NVEPEE\ FNIJDTLCDLYENL 
DHHPRQVI FHQPPGGKVDW / LS PEKMHA\ RDFTV\ S AMHWRFWTQKE 
RRT * L * REF \ RSGS * PEF * I TPLTLPAQRAFDVAS RVS F * S FKLMTL 
PPPTRGKLLIHR\ IGSRVDRFGRKGVP INMLTE / EKTKRNT*ED I ETF 
\ YNTS I \ EEMPLNVA 

5479 

C 

114 

302 

MKEEILKKTAELYKRAMXNKXKQKHHSWVXKAPSRPFIKWETRLGLK 
TAHSPNVFLQQGQR* * 

5480 

A 

1 

2043 

MPEISLSVHAQRRDHVSIWCDGSAYTPRETKENEIQDLLRAKRELES 
KLQRLQAQGIQVFDPGESDSDDNCTDVTAAGTQCEYWTGGALGSEPS 
IGSMIQLQQSFRGPEFAHSSIDVEGPFANVNRDDWDIAVASLLQVTP 
LFSHSLWSNTVRCYLIYTDETQPEMDLFLKDYSPKLKRMCETMGYFF 
HAVYF P I DVENQYLTVRKWE IEKSSLVI LF IHLTLPSQDRVTWRS AD 
EL PWLFQQQG S KQKLHDCLLNL FVS QNL YKRGHFAELLS YWQFVG KD 
KSAMATEYFDSLKQYEKNCEGEDNMSCLADLYETLGRFLKDLGLLSQ 
AI VPLQRS LE IRET^liDPDHPRVAQSLHQLAS VYVQWKKFGNAEQL Y 
KQALE I SENAYGADHP YTARELEALATLYQKQNKTADQFLKRSLEMR 
ERVLGPDHPDCAQSLNNLAALCNEKKQYDKAEELYERALDIRRRAIiA 
PDHPSLAYTVKHLAliiYKKMGKLDKAVPLYELiAVEIRQKSFGPKHPS 
VATALVNLAVLY S QMKKHVEAL PLYERALKI YEDS LGRMHPRVGE TL 
KNLAVLSYEGGDFEKAAEDYKRAMEIKEAETSLLGGKAPSRHSSSGD 
TFSLKTAHSPNVFLQQGQRFVMTVYLENAWSAQNLLKLISNFSKVS 
GYKINVQKSQAFIYTNNRQRAKS 
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5481 

A 

153 

332 

QTFVLAEGKH* ENE / PVFKLNVSPLDEWREGAFPPKSDVSAS FI S I A 
LLYNSAAFSKSPPS 

5482 

A 

2 

904 

HILLVNLKHLQLCTRNKRNIFKLFHFRKKSFMGHQRAIKKKGNIiYGF 
ALLRRRALQVEELTLGKDTPDNARTLNELGGLYYLQNNLETADQVLK 
RFL * MRERVLGPDHPDCAQAlA NNGAGS YAMKXNQYDKPEDL YKRAL 
DIGRRALAPDHPFIuAYTVKHLAILYKKMGKLDKAVPLYELAVEIRQK 
SFGPKHPSVATAIiVNLAVLYSQMKKHVEALPLYERALKIYEDSLGRM 
HPRVGETLKNLAVLSYEGGDFEKAAELYKRAMEIKEAETSLLGGKAP 
SRHSSSGDTFSLKNQLHFS 

5483 

A 

2 

331 

PGQHGETLPLLKIQK\LAGQGGTSL* SEPLRRLRQENCLNLGGRGCS i 

ETRSCHCTLVWVTGRDSISKQKTAVDQAEERVSELEDRLFENTQSEK 

TKEKRIKISTLRGFRK 

54 84 

A 

26 

117 

AKGQVALGRPGARMQGGSKWGVLAGPGSRSEAGTVPSYPAAQTRPH 
PPPQPLL I FRS PG WPNGHQGPPNSTAS S KQCNLRCWGLPTVARpPK 
SGQRPQ/PRPTR*PGPSPQGPTt>GRKNPERP*LVQQ*PCWAREPV 

5485 

A 

111 

270 

LFQTIFEDVSGFGAWHRRWCVLSGNCISYWTYPDDEKRKVIYIVLFS 
GESPD 

5486 

A 

240 

408 

TDPSPQASDYETMETWQIQEEKSESQLSVSPGKNQESEAGELSHNQ 
LSSPLGFQ 

5487 

A 

128 

657 

ESESYWAICSRKRSPVLSSYGGHIYLKIRCPVDS\SVGRRGFLTIFE 
DVSGFG\AWHRRWCVLSGDCISYWAYPDDEKRKNPIGRINLANCTSR 
QIEPANREFCARRNTFELITVRPQREDDRETLVSQCRDTIjCVTKNWL 
SADTKEERDLWMQKLNQVLVDIRLWQPDACYEPIGKP 

5488 

A 

1076 

3513 

I KALS S S AED AS L» VNAS I SSSVKATS PVKSTTS ITDAKSCEGQNPEL 
LPKTPISPLKTGVSKPIVKSTLSQTVPSKGELSREICLQSQSKDKST 
TPGGTG I KPFLERFGERCQEHS KES PARSTPHRTP 1 1 TPNTKA I QER 
LFKQDTSSSTTHLAQQLKQERQKELACLRGRFDKGNIWSAEKGGNSK 
SKQLETKQETHCQSTPLKKHQGVSKTQSLPVTEKVTENQIPAKNSST 
EPKEVIREIEMSVDDDDINSSKVINDLFSDVLEEGELDMEKSQ/AGD 
G S S I S R / T AAKNRKMH * I S PQCL YLHHWHKQL V * V * CPHLD WN * KT P 
AEVMKVQNQENSKELVS/RRAESGDSLGSEDRDLLYRSQRFKETERP 
S I KQVI WKEDVTSKLDEramAFPCQVNIKQKMQELNNE INMQQTVI 
YQASQALNCCVDEEHGKGSLEEAEAERLLLIATGKRTLLIDELNKLK 
NEGPQRKN*G*S/APSEFIAIPKDQFTLSEIRLP*KADFVCSTVQKP 
DAANYYYLI ILKSRS \ ENMVATPLAS TSNSLNGDALTFTTTFTLQDV 
SNDFEINIEVYSLVQKKDPSGLDKKKKTSKSKKSNIHSSVMASPGGL 
SAVRTSNFALVGSYTLSLSSVGNTKFVLDKVPFLSSLEGHIYLKIKC 
QVNS SVEERGFLGCPGGGRLQPKRQT I FEDVSGFGAWHRRWCVLSGN 
CISYWTYPDDEKRKNPIGRINLANCTSRQIEPANREFCARRNTFELI 
TVRPQREDDRETLV\TNAGTHSVFTKNWLSADTKEERDLWMQKLNQV 
LCDIRLWQPDACYKPIGKP 

5489 

C 

135 

206 

MF I FAFGKS I FKI TTTFQSTIVG * 

5490 

A 

67 

388 

DAVTA/GAMPTMASVKIiSTLHPIVNHPHYEDADLRPGCSMLEIWDVE 
DPSNAANPPLRSILLEDAR\PKMPVFQNSVYRVLKVN*EGNYPLHYG 
TMPCVKNNHPLAYLH 

5491 

A 

3 

107 

SIKLSTLHPI\VNHQHYEDADLRARTKIVYSTYSRTSAKEVRDKLLE 
LHVNYTVLEEAWCVVRTKFILQDGPGVLQAAEK* CQMVTQI YKKYSN 
ALEM\PGCSMLEIWDVEDPSNAANPPLCSVLLEDARPYFTTVFQNSV 
Y\ KVLKVN*EDADLRARTKIVYSTYSR 

54 92 


3 


l iij»lrt'lioW(jAli , iSUF\AHVVVYIIFMLGSCAFFS 

DVAKQLKEQQMVMRGI^TSMVHELNRPKVKLRRWKGEGRHFTKRIL 
FY 

5493 

A 

2 

652 

RVDLPIKSARYRGQYSSYPIKLFYTSNIPIILQSALVSNLYVISQML 
SVRFSGNFLVNLLGQWADVSGGGPARSYPVGGLCYYLSPPESMGAIF 
EDPVHVVVYIIFMFGSCAFFSKTWIEVSGSSAKDVS *TA* KEQQMV\ 
MRGHRDTSMFHELNRYIPTAAAFGGLCIGALSVLADFLGAIGSGTGI 


\OH\\ 
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IiLAVT 1 1 YQYFE I FVKEQAEVGGMGAJLFF 

5494 

A 

58 

596 

KFIFRDGGSLCCPSWT*TLGLQQSSHMSLPSDWDSRVRATKPNFPFS 
S*HSFSALY\MHLPRSPLNFIQPCRFYKGPAGLSRGGRE*HGL*SDF 
SVANLWQLGFLWQALTDYRAELPGMDPIISTHLAKLYDNLLEQNLIR 
VIEPFSRVQVRTLWGLHFWPGILTVATSLPHL5RMGTHF j 

5495 

A 

2 

951 

PRVRPRVRPRVRSSRPRSRDPSPRRARLRWQLRWKPRWCPRPPKTPG 
VWKRPRTRPRSSAGGSTGFPSSPILRRSPSTRRRSSRKASPTATRAT 
GTPPRQAQRKTARAAGRRRASPGIATAGTRSMISM\RPGRKPSNPSW 
EGRTNEETSSLSRLKPVSPGTITCPLRTPGSLLKDSKIPISIKHLTN 
LPSSHPWHQQPSRSEMPRTKIPVSKVLVRRVSNRGLAGTTIRATAC 
HDSAQKWRSSRPRWMGPMPRNTTFPWETTKVS FAFPKESLL /WTPP 
VPRPAPERGPRRSLCPE*GPDNTRKRDATRGFLLSR 

5496 

A 

1 

279 

VQWHDLG*PQPPPPGFKRFSCLTPP/RSSCDHRCP\PPTPD*FLCFL 

VETGFHYVGQAGLELLTSGDPPTLASQSAGIAGVSHHARPNFLIFFF 
N 

5497 

A 

2 

291 

FLVARVAGSGGDGIiTLFPRLECTSVITAPASISWAQGDLPASASQVA 
QTAGACHHS WPVFTSCVEMGFPCVP * AGLELLGS S PP \ PQPPKMFRL 
QA 

5498 

A 

2 

345 

SHSVAQIGVQWRDLGSLQPSPPGFKRFSCLSLPSS*DYRHAPVHPAN 
FF\FLVEMWLPRVSQDGLDLLTS\DLPSSASQSTGITGMSHRARPNF 
C I FSRDRVS PCWPVWSQTPDLR 

5499 

A 

2 

311 

QVLENPHFPRG*APPPSW*YPSGFYKSNRPRACFNPRSLAPRGPHQG 
FFFPPGPL\FPPPPFLGPPYGSRFFPSEHFQKVQPLNQPPFLSLKNP 
P*RFPFFFP 

5500 

A 

1 

2500 

MEDAWG IRKETGRVKEEAEKVTGWGNNWRNVEKS SMSRKVKAARPGF 
KKLPMTSDIPPKCTIKDLLPKEKSSTEAVFH1WLERHESPDIEDFS 
FKEPQKNVHDFECQWRDDTGNYKGVLMAQKEGKRDQRDRRDIENKLM 
NNQLGVS FHSHLPELQLFQGEGKMYECNQVEKSTNNGS S VS PLQQI P 
SSVQTHRSKKYHELNHFSLLTQRRKANSCGKPYKCNECGKAFTQNSN 
LTSHRRIHSGEKPYKCSECGKTFTVRSNLTIHQVIHTGEKPYKCHEC 
GKVFRHNSYLATHRRIHTGEKPYKCNECGKAFRGHSNLTTHQLIHTG 
EKPFKCNECGKLFTQNSHLISHWRIHTGEKPYKCNECGKAFSVRSSL 
AIHQTIHTGEKPYKCNECGKVFRYNSYLGRHRRVHTGEKPYKCNECG 
KAFSMHSNLATHQVIHTGTKPFKCNECS\QVFTQNSQLANHRRIHTG 
EKPYKCNECGKAFSVRSSLTTHQAIHSGEKPYKCIECGKSFTQKSHIi 
RSHRGIHSGEKPYKCNECGKVFAQTSQLARHWRVHTGEKPYKCNDCG 
RAFSDRSSLTFHQAIHTGEKPYKCHECGKVFRHNSYLATHRRIHTGE 
KPYKCNECGKAFSMHSNLTTHKVIHTGEKPYKCNQCGKVF\IQNSHL 
ANHQRTHTGEKPYRCNECGKAFSVRSSLTTHQAIHTGKKPYKCNECG 
KVFTQNAHLANHRRIHTGDKPYRCTECGKAFRRQGVTPGPSLECNGT 
IIAYSSLKFVGSSDTHISLLRSQDDRQVIiPHLANPNFREGLLPAFLL 
LCSDSHFLQFCSQFAPRKTQGSGSVNNIQANIIAK 

5501 

A 

1 

2508 

MPHQLGPNLVPDIPPKCTIKDLLPKEKSSTEAVFHTWLERHESPDI 
EDFSFKEPQKNVHDFECQWRDDTGNYKGVLMAQKEGKRDQRDRRDIE 
NKLMNNQLGVSFHSHLPELQLFQGEGKMYEC^QVEKSTNNGSSVSPIj 
QQIPSSVQTHRSKKYHELNHFSLLTQRRKANSCGKPYKCNECGKAFT 
QNSNLTSHRR I HSGEKPYKCS ECGKTFTVRSNI/TI HQVI HTGEKPYK 
CHECGKVFRHNS YLATHRR I HTGE KP YKCNECGKAFRGHSNI/TTHQL 
I HTGEKPFKCNECGKLFTQNS HL I SHWR IHTGEKPYKCNECGKAFS V 
RSSLAIHQTIHTGEKPYKCNECGKVFRYNSYLGRHRRVHTGEKPYKC 
NECGKAFSMHSNLATHQVIHTGTKPFKCNECSKVT^QNSQLANHRRI 
HTGEKPYKCNECGKAFSVRSSLTTHQAIHSGEKPYKCIECGKSFTQK 
SHLRSHRGIHSGEKPYKC^ECGKVFAQTSQLARHWRVHTGEKPYKCN 
DCGRAFSDRSSLTFHQAIHTGEKPYKCHECGKVFRHNSYLATHRRIH 
TGEKPYKCNECGKAFSMHSNLTTHKVIHTGEKPYKCNQCGKVFTQNS 
HLANHQRTHTGEKPYRCNECGKAFSVRSSLTTHQAIHTGKKPYKCNE 



.1. 0 O e H-6 4- 3 . O S! H ib O 2 






CGKVFTQNAHLANHRRIHTGEKPYRCTECGKAFRVRSSLTTHMAIHT 

gekrykcnecgkvfrqssnlashhrmhtgekpyk\ecgevirynsll 

SHQL I \ HS * QNP YKCSDSGQSLMS * S INRYERP * ARD/ YHVNICGRG 
S IQASQVTRHQDFI SLMKRNKCNMHPEAITQ* PMVSEDS 

5502 

A _ 

3 

361 

PKKNYFPLHLKGPKTKGVFFYSSSSSSQKGVSLCNPRWNAVGPFGFP" 
STPPS \ LVK* FLCPNPWS * CDFRPPSPRPGYFCFFNKNKALLARVVF 
NS * PQVI PLFQPPKMVGFQGLTPLAG 

5503 

A 

32 

392 

FLFLS FPPLSFKMTLNDAMRNKARLS ITGSTGENRRVMTPEFP\ KAV 
HAVPFT*SPGHGNGMSSVTESLGGLIRTLLSALPQWFSCVFIVQSGE 
RHPSILEDFSFQPKNSLNMWSSILKL 

5504 

B 

473 

524 

MGEAKKTGEQRVRLPRTX * 

5505 

A 

1 

627 

LVSSFFFFFFCRDRVSPCCLGWSPTWSPGLKRSARLGLPQCWDSRRE 
LPRPAYLLFLLSWGSPSPSAWQEGG/WCKKKRGTEHPEVPSIPSCTC 
PSPTVQQ* S/ GEKQVQETGRQS/LGEAKKTGVQWHDQGSIiQP* SGLK 
* S S YLS FLSS * DHRCAPTTPD * FL * R * DPTMLPRLVLNS WAQ/ CDHP 
ALA\ SQSTGII GLSHRAWACDQNSHFF 

5506 

A 

3 

649 

RVSSVMSTSLEEIFNCIFDAHVPPLWGKAYPSQKPLAAWTRDLAMRV 
BQFELWASRARPPVIFWI»SGFTFPTGFLTAVLQSSARQNNVSVDSLS 
WEFIVSTVE*QAT*VYPPKDGC\WVRGLYLEGAGW/DPEELLLGGRQ 
S PCKLVWLMPTIHFRPAESRKKS PKGMYS CPL\ YYYPNRAGS SDRAS 
FVIGIDLRSGAMTPDHWIKRGTALLMSLDS 

5507 

B 

167 

377 

XKIjIQFGLMKNLIRRLQKYPVRVTREEQSHPARLYTGCHSYDEICCK 
TGMSYHELDERLENDPNIIICWK* 

5508 

A 

2 

1247 

TPLREERGATGLGPVIAMGSGCRIECIFFSEFHPTLGPKITYQVPED 
FISRELFDTVQVYIITKPELQNKLIT/VLESSFVSMEESKOKLVPTM 
TILLEELNASGRCTLPIDESNTIHLKVIEQRPDPPVAQEYDVPVFTK 
DKED F FNS Q WDLTTQQ I LP Y I DG FRH I QKI S AE AD VELNL VR I AI QN 

LLYYGVVTLVSILQYSNVYCPTPKVQDLVDDKSLQEACLSYVTKQGH 
KRASLRDVFQLYCSLSPGTTVRDLIGRHPQQLQHVDERSEENIiLGHL 
GVT*GKLDPLCLSGALDHGALPIQAGFLEMMGYRDKIEGRLQERVGL 
PERRPGQGVTPSSDPHPRKLIQFGLMKNLIRRLQKYPVRVTREEQSH 
PARLYTGCHSYDEICCKTGMSYHELDERLENDPNIIICWK 

5509 

A 

1 

512 

CTLPIDESNTI\H*KVIEQRQTPRVAQEYDVLVFTKDKEDFFNSQWD 
LTTQQILPYILX3FRHIRKISAEADVELNLVRIAIQNLLYYGWTLVS 
ILQYSNVYCPTPKVQDLVDDKSLQEACLSYVTKQGHKRASLRDVFQ\ 
YDEICCKTGMSYHELDERLENDPNIIICWK 

5510 

A 

1 

474 

HTIHLKVIEQRPDPPVAQEYDVPVFTKDKEDFFNSQWDLTTQQILPY 

IDGFRHIEKISAEADVELNLVRIAIQNLLYYGWTLVSIIiXRKLIQF 

GLMKNLIRRLQKYPVRVTREEQSHPARLYTGCHSYDEMCCKTGMSYH 
ELDERLENDPNI I ICWK 

5511 

A 

199 

562 

KES PGPGRHPVPL I PHP * ES * I PVRAL * KNLI QATYRKYP \ VRV\ TR 
EE \ RAHPARLYTGCHS \ YDE I CLQDRWRQAGSQGGFRAGCARPRPLT 
S PPPPHPGMS YHELDERLENDPNI 1 1 CWK 

5512 

A 

1 

1224 

EFGTRPLREERGATGLGPVIAMGSGCRIECIFFSEFHPTLGPKITYQ 
VPED F I S RELFDTVQVY 1 1 TKPELQNKIj I TVTAME KKL I GCP VC I EH 

KKYSRNALLFNLGFVCDAQAKTC^EPIVKKLAGYLTTLELESSFVS 
MEESKQKLVPIMTILLEELNASGRCTLPIDESNTIHLKVIEQRPDPP 
VAQEYDVPVFTKDKEDFFNSQWDLTTQQILPYIDGFRHIQK\ISAEA 
DVELNWVRI\AIQNLLYYGVVTLVSILQYSN\VYCPTPK\VQ\DLVD 
L>iUa L>y h. ACLi S YVP KQ \ GHKRAS LRD VFQL YCS LS PGTTVRD \ L I GRP 

PPSSCQHVDE\RKLI\QFGLMK\NliIRRLQKYP\VRVTREE\RAHPA 
R\ LYTGCHS \ YDE I \ CCKTGMS YHELDERLENDP\N I 1 ICWK 

5513 

A 

2 

313 

FETESYSVAQAGVQWWDLGSLQPPPPGFKRFSCLSLLSSWDYRRPPP 
RLANF\ * FLVETGFHHVGQAGLDLLT/S/GDPLASASQSAGITGLSH 
CAWQTTYITFNHK 

5514 

A 

731 

1156 

SVLGLISDTSNTFLSFFFFFEMESHSLAQAPLQWHYLGSLQAPPPCK 
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LRLPGSRHSPASASRVAGTTGARHHARL\ I FGFLVEAGCHRVSQDSL 

DLLTS * STCLKP/AQSAGITRREPPTPGLFFFLKMESCSVGPRLGVQ 
SW 

5515 

A 

3 

365 

LLNSRPVDFFFFFFFLSHSL/DSWQAGVQWHNLGSRL\LQSPPPGS 
MPFSCIiSLLSSWDyRHPPPYLANFFFFCIFSRD/MGFTMLARMVSIS 
* PCDLPAXiGS QS AG I TGVSHHAWPAYF YPQ 

5516 

A 

400 

619 

MPEQICPLETSQS * INNLPKKFFFMSI *CPGFERSYFFF*CC 

5517 

A 

274 

693 

KYQILLYNGDVDMACNFMG\DEWFVDSLNQ/KGKVEIPGPGIEPAGE 
EWGAGMPGSGQLAAWLWAHRGQVWRNSQP*GLAGSCEQGCSCCRLMS 
FLVGQMEVQRRPWLVKYGDSGEQIAGFVKEFSHIAFLTIKVGTGPAE 
R 

5518 

A 

3 

766 

NRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARI/VGNSGLNIY 
NLYAPCAGGVPSHFRYEKDTVWQDLGNIFTRLPLKRMWHQALLRSG 
DKVRMDPPCTNTTAASTY\LTNPYVRKALNIPEQLPQWDMCNFLVNL 
QYRVFLPSQNSR\YLKLLSSQKYQILLYNG\DVDMA\CNFMGDEWFV 
DSLNQK\MEVQRRPWL\VKYGDSGEQIA\GFVKEFSHIAFLTIKGAG 
H\MVPTDKPIi\AAFTMFSRFLNKQPY 

5519 

A 

2 

279 

FLRDVI LL/ C/HPGQSVW* S * LTVS PTELPGS SDPPSSAFNCSWEY 
RCVTPCQLMF* FTLLS IGIGSTLPRPVLNS S PQVI LAPWP\ PKVLRL 
QE 

5520 

A 

2 

1045 

CPSRPCRRRPPPPRGPKPRASYPGGPAP/PGDKKKRPPQ\GPRGSSP 
APGPPP \ PQEAAGPARPRPGQDAAAGPSGR\ PPQALPS\ PRGPQPRA 
PR PGRQ APATAQRG PPMRR PRRGRG P P PRAP P / PGAGPC / PRLRCAI* 
LPESHPRGRPGHPEASSGEAAAGAAPQALPLTLGRTQAPTRPVSPG* 
GRGPAEGRPRQR/PPTHPPPPA\GPPRASARRPAAARRPAGSPAQPA 
ARGPAADAQGVAAQRAAQVRRRGVRGGARGGGRGP/RGPRGRP/PGS 
SGRQRPLSG\PSAQAGPRGAQPDSPDGHGPPPALASRQPGPFSFTPV 
VSTPDACLCPHSGLLLASPS IHAVRGELSN *" \ 

5521 

A 

84 

202 

SYWLFTLPGCKPLQGRGIiSRVQCCIPAAGHGTWSMAFNK 

5522 

A 

3 

576 

ILGFPFFVRWGSHTVAQAGVQWCDHGSLQPRSPGVK*SSHLSLLGSW 
NHRHATTTPG * FCF FSR IRSHCYAQAGL * LLTSNHPPAXiASQT\ VG I 
TGVSHWTWPNTGFSVLTATNKNLKFFHYAISKCLVR7UCLSSRLLKIE 

ERNKALSAPWSVSIFDRVLRLLGYSASDWQPEFVETAVSNFVIYGI 
FRGQ 

5523 

A 

177 

904 

ESHLQNDTAAHPLLNGTECGVSPPPLTSRQGKVRRRTQSSTSRTC/S 
PVIPKQTFPQPVLHMSAGVAL\DSPAPPPRPWQPPPQGFAHT*DRKE 
EGDPIGIWAPEGKSCTPKPPPPSLPRTSPGWKRALQKGDTGCPGPST 
ASTPHPLILGPSQAPPKPKPP/PSPGVPACAPS/YPALPAPSISLTE 
KEQARAPGGQDGAGHTVGGGGAAGRREGQNKTFSFGFFFSFFFLPFT 
LWWLLFIiSFSL 

5524 

A 

1 

173 

FLGPSKFPPPGFPPFFPPPPRERGNQRSSSPPSSSS/P/PSQKKTGV 
PPG*PGGFFPPPP 

5525 

A 

1 

392 

L I LNTKRPAF I FDLLHl^LDPKEQAYADLVS AYTLNSMFWL YLATQGV 
NPTDHPVKHELDTT/RACMNRSSSPTDKKKAGLLYRGAASKFVKNAL 
WEPKSKNA*KVGNKGKSPTSLFGFDVHIFKKNMLPPP 

5526 

A 

1 

1548 

MPERDS EP FSNPLAPDGHDVDDPHS FHQS KLTNED FRKLLMTPRAAP 
TS AP P S KS RHHEMPRE YNEDEDP AARRRKKKS YYAKLRQQE I ERERE 
LAEKYRGRAKERRDGVNKDYEETELINTTANYRAVGPTTKADKSAAE 
KRRQLIQESKFLGGDMEHTHIjVKGIiDFALLQKVRAEIASKEKEEEEL 
MEKPQKETKKDEDPENKIEFKTRLGRNVYRVLFKN/KAYKRNELFIiP 
GRrmYV\KVDCPTMEAQTILTTiroiVISKLTQLLSYLRQGTRNKKLK 
KKDKGKLEEKKPAEADRSIFEDIGDYTPSTIKTPWDKERERYRERER 
DQ/RDRDRDRERERERDQERERDRERERDRER\GPGSAKELIKSINE 
KFAGSAGWEGTESLKIAEDKKQLRDFFGMSNSYAECYPATMDDMAVD 
SDEEVDYSKJTOQGNKKRTLSRWDFDTQEDYSEYMNNKEALPKAAFQY 
GIKMSEGRKTRRFKETNDKAALDCQWKISAIIEKRKKMEADGVEVKR 
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PKY 

5527 

A 

1 

3162 

MEQFLWQELFAEYNRQASAQVPLGEDRTEYYDQFCKDVPPKNEETEE 
\^VNNGPTWSARTVDSHLGQKDLSRLQKVFVDGFGSSDLQSIFFSSL 
N IQLTDATQKEPLCKKI ATLHSESKLNS FEKDYYY I GASRNLGCS AK 
IDEGLLSKVYSSI^SELGSSQEIFVRVVTHKVLGVEINEATQEKLVK 
RLQELLKKDDTLrSGLG YAFNVAPLLG AS AS F I ANRVEDAI VQADEVD 
GKLLQFEGGLS ITSL I INGAFGVSKTFNKPVPI TADQAVKFANYFLS 
RR S VQTAKG AHVL I E ALKT I SGADKIAPVCVQLIGNGQLDAQSPTLN 
VAWDLLGKPLS PAP KVINAKVIRKKDS SVLADKI TVAS KS SDKTTY 
VADLASLKPARGVYQAELSADGVYTQTLQFiO^GRVKVQSLEVGIAE 
SDASAATRKQSVTYPGKLQEVLSADSTQKLLLKAVLVDESTSKPLAV 
HQAFVRLYNKKTDKEI IFVAEQDSSKAYKFDMDVGNNGKNFNYQSGT 
YS I YLTVGDS SLSNS FEWLVAEVQLKFNEDKEVKPKNTSGPLPEI IH 
QFRVPDKRPPRIVSDIFTGLCITPLVLLFVFWGKLGINVSNLTLAPS 
TIGFHLGFGGILVLFFVFWLQLNMFQTLRLL I P I AVFTFLAGNRLLR 
RL YAQRNS KAS AS YNTDVLEQELKE I KECAQTTI»TTND I VI S KLTQ I 
LSYLRQGTRNKKLKKKDKGKLEEKKPPEADMNIFEDIGDYVPSTTKT 
PRDKERERYRERERDRERDRDRDRERERERDRERERERDREREEEKK 
RHSYFEKPKVDDEPMDVDKGPGSTKELIKSINEKFAGMDDMAVDSDE 
EVDYS KMDQGNKKGPLGRWDFDTQEE YSE YMNNKEALPKAAFQYG I K 
MSEGRKTRRFKETNDKAELDRQWKKISAIIEKRKKMEAIX3TLAWCSS 
LGVTTASWLDATLPLSTLSPRFVASPDPRKLESHLSHYIjVFDYIHIiV 
LLSMDRTCEERPAEDGSDEEDPDSMEAPTRIRDTPEDIVLEAPASGL 
AFHPARDliLAAGDVDGDVFV 

5528 

A 

2433 

2540 

QRKNDNKMPERDSEPFSNPLAPDGHDVDDPHSFHQ 

5529 

A 

1 

1739 

MPERDSEPFSNPLAPDGHDVDDPHSFHQSKLTNEDFRKLLMTPRAAP 
TSAPPSKSRHHEMPREYNEDEDPAARRRKKKSYYAKLRQQEIERERE 
LAEKYRGRAKERRDGVNKDYEETELINTTANYRAVGPTTKADKSAAE 
KRRQLIQESKFLGGDMEHTHLVKGLDFAIjLQKVRAEIASKEKEEEEL 
ME KPQKETKKDEDPENKI EFKTRLGRNVYR VL FKN / KAYKRNEL FLP 
GRMAYV\KVDCPTMEAQTILTTNDIVISKIiTQLLSYLRQGTRNKKLK 
KKDKGKIjEEKKPAEADMKYVS*PLASDQREGVCTFLVSGILGRC*HE 
NL>EKWS GG VKVYV* I WPASDLHFP S I FED I GD YVPS TTKTPRDKERE 
RYRERERDRERDRDRDRERERERDR*\HERERDREREEEKKRHSYFE 
KPKVILLGPWTFDKGPGSTKELIK\ SFIEKF\ AGSAGWEATES \ LKK 
PED\KKQIjGDFLGMSNSYSECYPATMDDMAVD\SDEEVD\YSKMDQG 
NQEGGP*ARW\DL*FPQEEYKRVFVTTKEA\LPRVAF\QYGIKMS\E 
GRKT\RRFKEPQ*PKQSLIRQWKKI 

5530 

A 

93 

547 

YNCRPTLGVSGCAGLDAQWCQDCAFELGKKP/PSGGEQYPSSSAIK 
QGPKEPYADFIARLQESLKKVIADLAAQDIVIjWLIjAFDNANPECQAA 
LRPIRGKAHLVDYTKVCDGIRDKLHKATLLAQAMAGLRMGKGNTPFP 
GACFNCGKHGH 

5531 

A 

1 

541 

MGQVWALVRSTLELFHTDDEEEGEYDEVTEEVTEQVYLPAKAKVAQE 
EEVHPYPSAPPHYYFEEKEWPDPPDLSFLEDTGRKWAPVTEQHLER 
LLSVLFRQEFSRLDERGPKELYADFIAWNIiLRQESLKKVISDSAAQD 
IVLQLLAFGNVNLDCQAALRPIRGKAHLVDYIKACDGIG 

5532 

A 

365 

661 

SESMTPVCICTPFKFKLDLPRSNPIVPAGKGPQTPVGTFCRGSPGRR 
LTAFVQHKSTTALPAVAGDRHCPGVREWPEPEMLPFGMGPEIGFPSL 
SKLE 

5533 

A 

3 

74 

RMGPRTGP * VFPETGFPPLSNLE 

5534 

A 

83 

262 

CXVTPPPQDTGQSKS*STRVSSSLVPRSPTVAPVTLRPGFEPP*WTP 
LAKTS S VLTQWC 

5535 

A 

636 

698 

PWHLNQPSRRPPKAGLQLY j 

5536 

A 

125 

340 

SESMTPVCTC1PFKFKLDLPRRNPIVPAGKGPQTPVGTFCGGSPGRR 
LTAFVQHKSTVALLAVMGTD I VQG 

5537 

A 

81 

1007 

QPKAEHAALNLRKTFQHASHTPPFSAQQ*NPHRPHQPWL*HAIiVQHC 
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SCSDMQRPSASGP*/PPGTFSRWSSCKMPVPPPAQVPKSLCGVLSSS 
C * PGRRSPGPAEPPSRACRPCRQDSG/ PAQAGGPCRQDSG/ PDPAGG 
PRDK\*ASQPGLPAGERAIPAGGPCRQDRGQPRLASSAGGHCRQLRG 
/ PTPAGGPCRL/ PQRAAPAGGPCRQDSEQLRLAGPAGRLAGRHGRRS 
PGPAEPPSRACRQESG/PAQAGGPCRQHSG/PDPAGGPRGQLSLPAG 
PARRRAGN PGRQ PLQTGQRAAEAGE PAFATAVS GRD 

5538 

A 

1 

766 

MGQVWALVRSTLELFHTDDEEEGEYDEVTEEVTEQVYLPAKAKVAQE 
EEVHPYPSAPPHYYFEEKEWPDPPDLSFLEDTGRKWAPVTEQHLER 
LLSVLFRQEFSRLDERDDAVEQLRGVCIRAWEKITSGGEQYPSFSAV 
KQGPKELYADFIAWNLLRQESLKKVISDSAAQDIVLQLLAFGNVNLD 
CQAALRP I RGKAHLVD Y I KACDG I GAKQD S ERFAFT I PWNNLQPAK 
HFHYFTDGSSNGKASYSGSK 

5539 

A 

74 

1178 

WLRRTPTVTSNTRLTQPPTWGQIKKLSOMAEENLRKAGOPVTMKINLM 
VAMIWITTTEEQEYSEVTEEVTEQVYLPAKAKVAKEGEVHPYPSAP 
PHYYFEENDPPNLSFPEDTGRKVVAPVTVRAAPRGTAIjSSIQAGIQQ 
ARQEGDLEAWQFPVRIHPPDQQGNIIATFEPFPFKLLKEFKQAISQY 
GPGSPFVMGLLKNVTVSSQKIPTDWDTLTRACLTPAQFL.QFKTCAMK 
QGPRE PYVDF I ARLQES LKKM I ADSAAQE I VLQLLAFDNAHPDCQAA 
LRPIRGKAHLVDYVKACDGIGEYTPSTCEKKVNCKTASCRSFRRYPE 
EGIVITGDDSSMRVWPEDLPVGQDVEVEDRDIDDPDPV 

5540 

A 

1 

857 

MGQVWGLVHFTLELFQTDDEEEQEYNKVTEEVTEHVYLPA*PDVHLN 
QPSHQPPKAGLQLY 

5541 

A 

518 

578 

PFVHLNQPNCQPPKVGLQLY 

5542 

A 

750 

797 

AVLWHHPAGGPWLAK 

5543 

A 

2073 

2846 

IVRSLSRTAQKIRSGCPCQSW*RRGRPVGSPSQSTEGPGQVRLRT*P 
LPFHTHPEGGPGAGRRSRSSLERPTSQPHRSQSGRCPAAPATPLPSP 
GPQA\PLGAPFERG/PRSRSA/QGAPGTGVRFFSMAQPFPPCSTSQP 
LKPCSPQELKMRLRPKGA\PDSSAPGRTAGTPPCLRCWA<5SVEPLRR 
VRGPRRCGRGPAAAATQWARPLCRARRGTLGGIA*PRSVAARGSAPA 
PAAPGAARPGPGMGPPAAVAAIAGRP 

554.4 

A 

3 

611 

SLPPRASRLPPGPSFSGWSPGDNRKLPNTPPPPRAGPRPPSRAPPA* 
TPGPPSAGS WP * PPGTGSAPRGPAPS APGARRPGRPGCPPAPAGRPR 
VPTAGAGACSAASRPGTRPRCTRPGAPRSRRRGGCPGPGA*PGARRP 
PRPRPPATRAPRS PRSPRGAAPPPTPRTARGAAARELRVQRGP / APL 
HAPQQHTPAAIVPYKM 

5545 

A 

3 

42 

HEEPPSSSNSPK 

5546 

A 

1 

160 

SMQQTLLLS YF * KLP * PPQPS ATMTQ I SQQPSTSRQDPP PTK/RL* L 
VEGSNNH 

5547 

A 

298 

421 

LHPHILPIDKLLHQ*KWSVSEYRQNQEVPLYSISSNIiVHS 

5548 

C 

598 

762 

MNSM I Q I CQDRAE 1 1 SS YCNLKI HQRLQNLL I S FLQSLKKLKNVNLS 
FINKNLK* 

5549 

C 

138 

296 

MNLEVTVARD YLS PVLADLAQRRGNIQE I QTRQDNKWIGFVPLAE I 
MVGTL* 

5550 

A 

2 

1575 

AMLGTRVERCLRVLDGAVAEFDASAGVEAQTLTVWRQADKHNIPRIC 
FLNKMDKTGA\NFKYAVESIREKLKAKPLLLQLPIGEAKTFKGVVDV 
VMKEKLLWELQFK* WEKTLRRKAPLGK*MIPEFAGRETT* R\ KECL* 
LEQVADLDDE FADLVLE E FCENFDLLPAE KLQTA IHRVTLAQTAVPV 
LCGS ALKNKG I QPLLDAVTMYLPS PEERNYEFLQWYKDDLCALAFKV 
LHDKQRGPLVFmiYSGTIK^QI^IHNINGNCTDRTQIALSVNNHEV 
HIYKKNGRQCVKAHELKEHNGHITDIDWAPKHDHIVTCGADHNAYVW 
SQKDGVWKPTLM I I»R INRTATFVKW S ALENKFAVG S EARL I S VC YFE 
SENDWWVSKHIKKLIRSTVXSLDWHPNNVLLAAGSCDFKHRVFSSYI 
KEVDEKL>ASMPWDSKMLFGQLMSESGGGGTGGWVHGLDIPKQSIQRN 

MSAMECFCNTDKRATTEGSNMALKMLRQNS ITQVS I YEVDEKDCRKY 
CTTGIDGEP 

5551 

A 

1137 

1288 

RTLNLLKAMASRVLRI I LTGR* CLDVS I KI PL * ENLGES IMVDSLMV 
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KVP 

5552 

A 

1 

1851 

riALKKADKQVT»EPLMNIjEVT 

DNKVVIGFVPLAEIMVADLDDEFADLVIjEEFSENFDLLPAEKLQTAI 
HKSDTKLRQQLPVL \ CGS ALKN\ KG I QPL \ LDAVTMYLPS PEE/ L * P 
MEFLQWYKDDLCAIiAFKVLHDKQRGPTGFYAPFNSRPL*KPQLAHSI 
I LMETATER I SRLLLPFADQH VE I PSLTAGN I ALTVGLKHTATGDTI 
VSSKSSAL\LQLVEPNGREKRSTDKTMKAERLLLAGSGRFQEPVFFC 
TIGTPITV*GSQIWEHAFEMSFSVRIPVLKVRLDPDSGQTVLCGMGE 
LHIEIIHDRIKREYGLETYLGPLQVAYRETILNSVRATDTLDRTLGD 
KRHLVTVEVEARPIETSSVMPVIEFEYAESINEGLLKVSQEAIENGI 
HS ACLQGPLLGS P I QDVGNYL Y I P * QIHPWAPSTTYDFCLCLKMPCK 
KALERKAD KQVLEPLMNLEVTVARD YLS PVLADLAQRRGN I QE I QTR 
QDNKWIGFVPLAEIMGYSTVLRTLTSGSATFALELSTYQAHESSRS 
KYTAQPEKWFDLNVIiVFGRGNSGRHPRLLHFSVRNNFYCLYLLGRNK 
GTVSGTFRGT 

5553 

A 

2 

499 

RCQKAKFTPVNRRHHCRKCGFWCGPCSEKRFLLPSQSSKPVRICDF 
CYDLLSAGDMATCQPARSDSYSQSLKSPLNDMSDDDDDDDM/TVTKD 
TFGS I * SGEAL* EINFGGNARL* ALSLDCS\ TMNLPEKLGTYVPQYI 
P * KEGAPAFCHS SHS STG IGFCKYIR 

5554 

A 

147 

315 

L I PQPWGQHLPLPPN ISNYD * RPQKFLGLPHPCS FGGRALTPHWSTG 
LRLTPSAQ 

5555 

B 

105 

508 

XPGTREVEELNALQEELAPFGLVILGFPCNQFGKQEPGENSEILPTL 
KYVRPGGGFVPNFQLFEKGDYNGEKEQKFYTFLKNSCPPTSEIjLGTS 
DRLFWEPQESSRHPLELWKFLVGQMDTHHALHHRPRQQRR* 

5556 

C 

4 

165 

MCLSGNVYHLCACSCWPGQCTTLSLQFSTPMIIVHLHLNPKEKPAL 
GPIVLL* 

5557 

A 

1 

1803 

MARLLQASCLLSLLLAGFVSQSRGQEKSKMDCHGGI SGT I YEYGALT 
IDGEEYIPFKQYAGKYVIiFVNEELAPFGLVIIiGFPCNQFGKQEPGEN 
SEILPTLKYVRPGGGFVPNFQLFEKGDVNGEKEQKFYTFLKNSCPPT 
SELLGTSDRLFWEPMKVHDIRWNFEKFLVGPDGIPIMRWHHRTTVSN 
VKMD I LSYMRRQAALGVKRKKVTS QYQDRDNKFQNVER I PLSGSHRT 
VCMMGVKGKVKTQAKPLSGEESLELEQSGLAMAAGADNHFIiAEVPAK 
AVALVLTTTESQQGLQEVSGGVTLAEAESRALGPLPREDGNIiMLHLQ 
RLETTLSEPDHGQLFTHLGRMALEFNRIASKVHKNEQRTSILQEYKD 
VQSTLLPQEVPSQMGNARTSQTHAVPGGKPRRGPYRIYDPGGSVPSG 
EASAAFERLVKENSRLKEKMOGIKMLGELLEESQMEATRLRQKAEEL 
VKDNELLPPPSPSLGSFDPLAELTGKDSNVTASPTAPACPSDKPAPV 
QKPPSSGTSSEFEWTPEEQNSPESSSHANAMASFHSLGKSLARRGK 
EQRPSLTVASRLQRNGGRAGRGHKEEGVGEDKEEWP 

5558 

A 

2013 

2024 

KVAG S AGHLRRTVARDQAAAQRDP ECAPTP S LARLLQAS CLLS L L LA 
GFVSQSRGQEKS KMDCHGG I SGTI YEYGALT IDGEEYIPFKQYAGKY 
VLFVNVA\ SY* \GLTGHVHLN* TAL\QEELAPFGL\ VILG\ FP\ CNQ 
F\GKQ\EPGENFRRSFPNPSRYGPTKVGGFCSLNFPALLRKGDVNGE 
KEQKFYTFLKNSCPPT\SE\LLGTSDRLFWEP\MKVHDIRWNFEKFL 
VGPDGIPIMRWHHRTTVSNVKMDILSYMRRQAALGG*RVGCPPFSFK 
ILYFPSRVYSLGGGWSILFPPQNKGSFPKFPSRFLLGPP 

5559 

C 

2 

226 

MLLPLPPSPSPSTALPGLGEMGGRPPPPYMLSVAPQTFCFISHSSPS 
SHRVSFSAFPCSVLPPPSQKFLHRTSF* 

5560 

A 

2 

1308 

ALSLLALPPASPP/AGRGRPAGPGPAPLPWHQGLPI.TRPGPEG/PLP 
QGRIiPSPPLQPTPAQPLLIiPAFPLLPPAPGPRRVGFAFWCP*SPGLS 
PPGLPSPDFQTLPCGISFL\PEMRKLESWSPAGEPSPPPAPPATRVS 
SRMQLPDPLTLLPPAGPKATFNS YGV/ PPPAPE / PSQGRVRGHPEPT 
GPLLHSSQGQEAAPLPPGPCCYGDTAF / PRTEGPPSLPPTTRARLSS 
PPPRFPADWCVLSVSLVIiCTPSLLWPM*/PHPCCTGSIiSSWLPPL 
IiMMLLPLPPSPSPSTALPGLGEMGGRPPPPYMLSVAPQTFCFISHSS 
PSSHRVSFSAYPCSVLPPP*APAWAQDPILPARVSPPGRVPFPPC/P 
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PPSHPTPGSEAPHS * PS VFLRKWLKATGPPAQPSAPPKPGNLGPHKP " 
LWKLFS CASKS FPRSFCTELPF 

5561 

A 

3 

259 

FLFWVFQIPAPFPEYVTDLCICTPTSASPNGYXiLA\GRDKAQRL\WB 
RLRWF\ QRALRDGA* AGEAGG\ STPNTVTMEMQCPPGNAS 

5562 

A 

2 

329 

AGVQWRNLCSLQAPPPGPRSFSRLSLPSSQVAGTYRRPPPSPANFLY 
V* *RRGFTTVNODGLDI J 'LiT55f3<?\ pacHQnca^TcpTrotnjnnnonmnn 

wvw * * * • *' v^wuwiju i ouo \ tr ftOnOyonuX OVj V orixvAUPSFTFF 

RLACHVFVRHCHLPT 

5563 

A 

3634 

4389 

LIQFLLLHSIGliYSYTTICLFILLLMEYWLFPVFSYYYYYFFFLRRS 
LALSPGWSAWOSRLTATS7VSOVOTTT.TiPDopt?*t r-T rwr*AT«nTic»*-c» 

LYF* *RRDFTMIjARMVIiIS*PRDLPTIjASOSAOTTr5\7QPT?awcTrT 

LFFF*TESHSVAQAGVQWHDLGSLQ\PAPLRFK*FSCLSLPSSWDYR 
RPPLRPANFFVFLVETGFCHVGQAGLELLTSSDLPALASQSVGITSV 
SHHTR PGMS FF I TSHKL 

5564 

A 

1 

369 

LQPQP\ PGLNQSSLLSLLSSWDYRHMPPYPANF\ * YFPETRFCHVAQ 
TGLELLSSSDPPYSASQSAGITGVT/HRA*PRFTLCTGLIRGQHGEK 
GLAQGHNVRASSSIQGSWTWGLRGLFFPHH 

5565 

A 

161 

231 

LLQPRTCLFFEGVDLNGWASLFL 

5566 

A 

1 

1401 

MQIFDYNS E I TDPGI QQEAAGLEGTRKPWN I SMLLTTRNI SLCQWFS 
KCGPEPDTSTSWGTPQKEHCHLVNKVGVDSLRMRGYMHGIIQPLNLF 
QESQLMPQTLFPFHNFQNCFFEEIiHKWRCPDYYENNVRKMQLPFSSK 

LIjGST'TjT'R'RRTf OPPPOrVYT.PDT HDT \TTiDnTjrtT3wr nnT-inr — — — _. . _ 

AJ - UVJO A JJ A °^^^W^i^WWW^KKijyi5JjWARRREEKIiQLDQERL 

SGSQPR/ EWDW/ TTASQRPLTWEQLEPSLEDVESMNDFDPLFSEE / 
D/TLEWRSRSP/RVOPVFNIJ^AYHOLFVRTPPTPnDTJTTwrknoT -rrtT? 
EQAG I AETLQY I LDR Y PKD I QEM\ LVQNVFLTGGNTMYPGMKS \ RME 
KELV\ G * DPS RLS FKVNIiASNPCAGM/ DWYGGLVNWALNHLDDNEVW 
I TRKE YEEKGGE YLKEHCASNI YVP I RLPMS CLPLLRCPRHPS QGS A 
AGGGGAG 

5567 

A 

2 . 

131 

SSCLSLSKCWDYRHNPOI^KKVFKNFOSSYLTOT.nPVPZVT odr 

5568 

C 

170 

394 

MKTNGYQGRVQWLTDLVLNLKSRPVQSSRPSLGNMVKPPSIQKLGKA 
WVACTCCPS YSGAGAGGWHEPRSLRLQ * 

5569 

A 

2 

583 

DRKGLESSPEPPSDCRP PPnMDT.l\& TQ\ OA7T T A nnnnT^ rnriOMnor** 

CGRQAPCVLESGHSAGAGVSSWDGPPVPCKCPPHPGLSQAHPGKTFG 
PKNLHMVPS GGAQAPGPHKANOOS WPS CP P VPR c; p p ptt? q w t a t?m T3M 

ERKSFAQGSGAGNWQRKVSNPHPWELQVFESPSRPCVKNGRAAGLVT 
GGFREG 

5570 

A 

3 

312 

FFLNRQGLALLLKLEYSGMIIDHCKPQISGAQVILPVSASWVARPTG 
VYHHA * L I FFKWGRE VETESC YVAQAGLELLAS SDPPTS TS QRAW\ I 
TGLSHCAWPA 

5571 

A 

1 

121 

IFHVIiKKCQTRSIYIJUjMSFKNEGEIKMFSNIQKLKEFITDRSK/LQ 
EMLKNAL*AERIYY 

5572 

A 

2 

228 

FETGSHS \ I FWAGVQWYNHSSLQPRLPGLR* FSHLSLPS SWDHRCTP 
PRPCNFS I FS RDGFS SC * SGWSQTLGLK 

5573 

A 

1 

1896 

MVSQGCRSREMRDAHAGWDSPTALHMARGLGRRVFSAPRHTEGGTPD 
GNS HL S PNQKME KLTEKGCKQLRQGGLCVR VSDRTVRS VQETTAI TA 
TLKGSLGTYCRGTRNLLTAEGFMPHASVCIQSKSGKVARNTVQGFTA 
HGKAVLEAAVS ASGPAGTWGCLR I KWPRS VS WLLKPERAATLVGHGC 
RKSGCPPGLGGSAKM/PPRHQSAAPAGRGRCPCWGPEPTSGSLSNQQ 
WLPGXALHAPPLPSAPLLGPPLLSRTVDTGPGGSGREVEATTCGRRI, 
KDKDSEHAGAGSRHHQSQVTGSLGEERAVRAESPGEGPAEFLRRIQG 
AVQTRAVTAGSSVEKINTKLETCDCDASEEEVLGGDNVETEERSVCV 
PEPVKMTLFEKRVFADI IRCLARWAPAATNRRDLVNTL I PGHGPSCS 
LVAFL/ APPS PRP YP * PAARCWWASDFPERGKRLRPAHS S FPVPAPP 
CVPVQSQSGPLPCTPRHFPLPFPAPAFGICRGPRAAPGG/SKAVATQ 
RPWWLYAPGTAPGGLPW*/AVPGHNKSIPPQPVSNLTETPLNAPQSS 
WAALGSKRLGCK3CSHGFFAFSFTADVCT1TODTLSNANSLTEVLATEI 
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KHDINLANEGLAWDRGWDYQGKDMLSL 

5574 

A 

1 

475 

FWKS PYSVQPRTEARAQGNKQKEFRETAPGS SSKADTKSEHLWALLP 
SRRWGGSQTQHPPLARBPQFLACGPMKDIRIjADWSTPLLSNPS*KLP 
*ICQSCPLEQEGKQKA*DARGVPSFPELQPPSGLTAQGQTHRPHPDI, 
RMTCGRTWHFFLAKTLH 

5575 

A 

2 

468 

SRGILADPPRPGGQPDFLQP*PTRVAALSGLSNQDTDLGHLMRRQPQ 
VPAGPEAETAASRTALPCAVNPWTVLRATLPDIiDCIQTEACGlyiKPSA 
VSKFLVPRQYVPRLPFKVAVMAVVSCTLRTVLSLTLTHSPPCRSCLQ 
PFSVSFSIFWFGDK 

5576 

B 

90 

1686 

X WWE PS PFQPQRDS LKVTGLQRAAFNP VAPTHS FKDQHPMPS FCHFI* 
SGPVPSSRTTIATGRAAPHSSPLKSLVQGYAPWQTSMSCDYQPPRRQ 
EHKETSRRNSEKQLQAPAQRPTPKANTSGRSCLQEDLLPEAQPGLSV 
PVCHPCPLLHAARLAVSAAAAIRGLEVAEAALVCAQKLLAACSSHSL 
ITVWSRGKVKETGLPSTVQAVSELLQVLLCKQKAGGCTPDQGPHTSK 
VLTEASSLPRISRMCLLQPGPFSDNIPLGTATLTPAATAATPGPLLP 
CTKHGNPLGHGLTCHYPI1APLPLPPAPHVRFAAI1SCLERSAGCCLCT 
PS CR I TTVGGS VGS CASELNHRACFS LPLMLGFQAP I ATHQWLFSVR 
NDKNTLDTPRSAGCDGAHRPRQEPAPDGPIVLLCLEPMPAPGSYPGG 
APVSGHRQVHPQQWQVQHIiGQSGGIRTGLGANCGKSHDSAYKKGGDV 
CFPGEMVPWHQILKGVQHPNKCRGHWGTLVQQKQPWTGSRGPTADQ 
LHPKNSSLTLVFPSR* 

5577 

A 

171 

1631 

GLPQAGADSDPRTRPFLLLGGLPGLPS PRP YP * PAARCWWASDFPER 
GKRLRPAHSSFPVPA/PPMRPRTVPVRAAPPHPPSFPTSVPSTGIRH 
LPRAEG W* TGQ * RNTPNKS RRRLS PLVT PGSCRS S PTQS PGPSGTTV 
EMPARRWCAGATGPGATS P / SSLKVTCETGLP STVQAV S ELLQVLiLCK 
QKAGGCTPDQGPHTSKVLTEAS /SSAPHLQNVIAAARSI \ SDNIPLG 
TATI*TPAATAATP/ EASPPLHQAWQPS/TAMASPAITPWPHYPCPQL 
RMSGLLPCPA*NAPQAAVS VPPPAE * RLLAALWVHVLRN * TTGPASH 
CH*CSASKPPSPPTNGYSVSAMTRTPWILPDRLAVMVLTGPGKSQPQ 
M/RPIVI,LCLEPMPAPGSYPGGAPVS/AA*AGTPPAVAGAASRAVGG 
HPHR PG C KLRQ I P * FC I QEG WGRVFSRGNGPCRAS DS QRG PAPKQVQ 
GSLGDTGATEAALDWESGPHCRPAPP 

5578 

A 

718 

2209 

IPSACFIiVLLPSWWLWAAHGSLPGRVALDQAFQWTRVRGCSACGKG 
CPAAG/ PQAQRGSGRS WAWDAGP * RS VWGPLG* TQPAAAL* P * VSPF 
VAGKGLAPTT\ LVGTAIQS ILQGLILS WNPGGEPAG * RNDLGAGGRA 
CCPDEDDQ IRGGAVLAEVT *QCSCS PMPDAAHHRHRVRVGVSGAAVC 
PQWEIjGGHGGNARPDAGRGVLRSCRGTRNLLTAEGFMPHASVCIQSK 
SG/TSGTQHCPGVHCTWQSCPLPSAPLLGPPLLSRTVDTGEEGHPVD 
ASWTPS TPREPGRWAHCLPQRKP/ PLGPREARAS VPPAHSGTVAGEA 
PPVISHFR/SPAPAFGICRGPRAGATEALPAPRADAPVKAAV/PAHT 
GAAS / EPARHVRPFPTATAPFTFPSAVHKGSGFS TSIiQTLiVI FCCVT* 
MVAI PMGVRWHLTWLWTNLAVISRVPI SVCALDLS FISVPCPLRCW 
CWWNRVS PKLAS PGCS ECDLCGQMRVMVS SHFS VE 

5579 

A 

2 

354 

LPSSWPYRRPLIFVFIiVET\GFNMLARMVSIS*PLDPPDLASRSDEI 
IGVSHRAWLILS * * KGERKSTITVSHKEGIjG* WSRATVLQVYSVGFWE 
VPFQGCPQSQSYFHNNTNMLFAF 

5580 

A 

1 

1263 

MAEEQGRERDSVPKPSVLFLHPDLGVGGAERLVLDAALALQARGCSV 
KIWTAHYDPGHCFAESRELPVRCAGDWLPRGLGWGGRGAAVCAYVRM 
v r ±i£\±ji VXjr LLt\lJr* & t JJVV VLDyVbALlFV]? Klj^VKivRKKILir \YCHr A 
DLLLTKRDSFLKRLYRAPIDWIEEYTTGMADCILVNSQFTAAVFKET 
FKSLSHIDPDVLYPSLNVTSFDSSCS*KAGMT*SPRGKNSWL»IiSINR 
Y\EREGKIWTLGTGKALVQLRGRLTSQDWERVHIiIVAGGYDERVLEN 
VEHYQELKKMVQQSDLGQYVTFLRSFSDKQKISLLHSCTCVLYTPSN 
EHFGIVPLEAMYMQCPVIAGNSGGPLESIDHSVTGFLCEPDPVHFSE 
AIEKFIREPSLKATMGLAGRARVKEKFSPEAFTEQLYRYVTKLLV 

5581 

A 

200 

705 

LSQTL/TVLYPSLNVTSFDSWPEKLDDLVPKGKKFLLLSINRYERK 
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KNLTLALEALVQLRGRLTSQDWERVHLIVAGGYDERVXtENVEHYQEL 1 
KKMV QQSDLGQYVTFLRS FSDKQKI S LL*/ RQLARVCFYT PRQ * GTLG 
IVPLGRPCYMAVPQFICCLIRVGPFGSSI 

5582 

A 

1 

259 

ERGSRS IARLEC/ SG/ AISAHCNFRPPGSSDSSPSACRVAGITGTRH 
HPQLFFVFLVEMGFHHVA*DGLKLL/NFMIHLPQPPKVLGLQV 

5583 

A 

94 

395 

PI I ITYFEFLNGYI I SQKIKFSHHQLQKIDFLP * TWHLFLIEQR*NI 

LMEGNQFSVTDDVKILFSGKLYSHSKIQSMLL*LVTRE*CYMFNCNI 
CLLSFSSLSVIINDLQNP 

5584 

A 

3 

270 

FFEAGSHSGCPGWSESGAIIAHCSLDLPGSSNPTSAS*IAGTRSMHH 
HAHIiI FVFLVETGS PYWQPSLKL\RVQA1 LPPRPPKVLGLQA | 

5585 

A 

13 

775 

YSVRGLVPAERRTPYPGS IVAPADGMVYALEGMGPDTAPQAQVRVYE 
PRRDCWLSLPSMPTPCYGASTFLHGNKIYVLGGRQGKLPVTAFEAFD 
LEARTWTRHPSLPSRRAFAGCAM\AEGSVFSLGGLQQPGAPQL\YSR 
PHFVNTVEMFDLEHGS WT/ 1 1 APQPAHEG * EGQTLWLGPLGATLWPL 

GALETSHVLWALLESFSLARRRWEALPAMPTARCSCSSLQAGPRLFV 
IGGVAQGPSQAVEALCLRDGV j 

5586 

A 

1 

936 

MAALYACTKCHQRFPFEALSQGQQLCKECRIAHPWKCTYCRTEYQQ 
ERLEC^GTISAHCNLHLPGSSDSPASSSRVAGITGIKTNTICKKCAQ 
NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQ 
CAFDRKDDRKKVIX3KLLCWLCTLSYKRVLQKTKEQRKHLSSSSRAGH 
QEKEQYSRLSGGGHYNSFSPDLALDSPGTDHFVI IAQLKEEVATLKK 
MLHQKDQM I LEKEKKI TELKAD FQ YQ E S QMRAKMNQMEKTHKE VTEQ 
LQAKNRELLKQAAALSKSKXSEKS GAITS P j 

5587 

A 

1 

1023 

MAALYACTKCHQRFPFEAIiSQGQQLCKECRIAHPWKCTYCRTEYQQ 
ERLECNGT I S AHCNLHL PGS S DS PAS S S R VAG I TG I KTNT I CKKCAQ 
NVQLYGTPKPCQYCNI IAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQ 
CAFDRKDDRKKVDGKLLCWLCTLS YKRVLQKTKEQRKHLS S S SRAGH 
QEKEQYSRLSGGGHYNSQKTIjS TSS I QNE I PKKKS KFES I TTNGDS F 
S PDLALDS PGTDHFVI I AQLKEEVATLKKMLHQKDQM I LEKEKKITE 

LKADFQYQESQMRAKMNQMEKTHKEVTEQLQAKNRELLKQAAALSKS 
KKSEKSGAITSP j 

5588 

A 

3 

345 

LNLLMYFRDNVS FCHPG WS TVM * S * LAAASNS W \ VT * S S C FS / LPS S 

WD*S*LPPCPANFFFF/CQ*RQNLIMSSRLVSNSI J \QPQVMLLPQPL 
KALGL YR VS HHA S VKVFNVL I NN ( 

5589 

A 

1 

430 

GDGV1,LCRPG*STVALSQFTAISASWV\KRSSCLNLPSSWDYRCAPP 

CPAKFCIFSRDGGFTMLARLVSNS * PQMIHPPWPPKVPPLPACVWYF 

SMACNFLYTPSFFLRLFCQQPKMEKAALCQGSPKTVKSCESQMFTFN 
FF 

5590 

A 

3 

389 

GVSPCWPGWS*TPDLK*STRLGFPRCWDYRLL»EVDPCTGHSLYL*KD 
PPVLPLPAEGCDYQGKEYALSSCCDKSYLLPDYRTKFLCCHPERGTW 
KLGTVGGCYAP IQSFGIADEQAWLQHGSGAVYLC 

5591 

A 

3 

53 

GVSPCWPGWSQTPDLK j 

5592 

A 

363 

777 

DMRPQLCSVDNHREQRAEFINIYQHQFFFLRQSLTSVAQAGVQWCSL I 
SSLQPLPTGFKPFSCLSLLSSWDYRHMPPSLANFFCIFSRDRGFTML 
ARLVLSS *PQ/ CDPPALASQNAGITGMSHHAWPTPTFS YLLNTQD | 

5593 

A 

3 

85 

FFFFFLMLHNNVYLVLHLFTISYKHFIIIVLYKQQFKFTYVY/TFIC 
LLVFQFQI ILLESFPFLKI ILL* SL | 

5594 

A 

1 

281 

FFFAPETESYSVA\RLECSGTILVHCTLCLPGSSDSPASASQVAGTT 
GACHHTWLILVILVEIGFHHVGQAGLG/ IS*LQVTRPPWAPKVLGII 
G i 

5595 

A 

2 

369 

FFLRWCFALVAQAGVQWQDLGSLQPLPPGFKRFSCLSLPSS *DYRCP 
PTRLANF / SVFLVEMGFHHVSQAGLELLTSGDLTTLASQSAGTTGVS 
SRAWPDLYLFSYLIEKTSPFKFSIPFHLEI j 

5596 

A 

432 

1492 

KKGCGKPGELCGCMPSSWKNAYCPTAQQTAQEYLSSPCGSAAPCLP / 
DRASSSSRSVEASSLDSAPGT\SAAGSPSPRPFGGEYHHPGPGANNE 
PTPPAESIVVPSWP*GCHSSVAVEKLSLAPGTPT/SDLPI I LDSSMNR | 
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KAP I P VFGL S AS S LLPLKD ARAS ET \ S S R * THQ * FFGKD VH * TQNS L 
AQSLQHSSGPQLLWGKTGNSQSSFPQAG*HSRHKEGCWREPSHKPLE 
PHSLSQHHGTASSARIPEVPPPHSLGHSPACAWRSSSPRPPRPLAAL 
PGCRRRRGAGSSHKAPDRSTARPGRCGRRRGRRGARRGCGGGSRGPS 
HSGPQRSESAPPAGAHTRLQPLRSHSIiH 

5597 

A 

1 

- 

2163 

MARM S FVI AACQL VLGLLMTS LTES S I QNS EC PQLCVCE I RPWFTPQ 
STYREATTVDCNDLRLTRIPSNLSSDTQVLLLQSNNIAKTVDELQQL 
FNLTELDFSQNNFTNIKEVGLANLTQLTTLHLEENQITEMTDYCLQD 
LSNLQELYINHNQISTISAHAFAG\LKNLL,RLHLNSNKLKVIDSRWF 
DSTPNLE I LM I GENP VI GI LDMNFKPLANLRS LVLAGM YLTDI PGNA 
LVGLDSLESLSFYDNKLVKVPQLALQKVPNLKFLDLNKNPIHKIQEG 
DFKNMLRLKELGINNMGELVSVDRYALDNLPELTKLEATNNPKLSYI 
HRIAFRS VPALESLMLNNNALNAI YQKTVESLPNLREI S IHSNPLRC 
DCVIHWINSNKTNIRFMEPLSMFCAMPPEYKGHQVKEVIilQDSSEQC 
L PM I SHDS FPNRLNVD I GTTVFLDCRAMAE PE PE I YWVTP I GNKI TV 
ETLSDKYKLSSEGTLEISNIQIEDSGRYTCVAQNVQGADTRVATIKV 
NGTLLDGTQVXiKIYVKQTESHSILVSWKVNSNVMTSl^LKWSSATMKI 
DNPH I TYTARVP VD VHE YNLTHLQ PS TD YE VCLTVSN I HQQTQKS CV 
NVTTKNAAFAVD I SDQETS TALAAVMGSMFAVI S PAS \ I AVS FPKR F 
\RRKNYHHFIKJC\^WQKTS\SIPI^L\YPPLI*PU3KVDSEKDKDG 
SADTKPTQVDTSRSYYMW 

5598 

A 

50 

824 

SLLKPYYVRIFTKEKSYKCKECGKFHKSCSRH*NHKIIHTGEKHYKC 
EECG \ KVFNHC * QL I AQKKIHAEKNSDFKECGKAFNNYYELTHQRFC 
AR*KQCKCNDTHYIYLWKDVTK*KP*STQGVMIVIHYICTWKDFJCNI 
SLRVHKS FYTEEKHYKYKDCCNTFTYVTDFWHRRIYTER*PSNSCS 
NF I PF * 1 1 YIGKKPYNCNEYEKNCLKSTP * KTLV*GRARWLTLII PA 
PWE\AKAGGSRGQEIETTMKPCLY 

5599 

A 

1 

2043 

MV S I SRPHDPPAS ASQS AG I TGVSHRLRFYFDHQDVAPECGAYFFHE 
LAKEKRKGVERLLKMQNQRGGPAVFQDILKPGQDEWGKTLEAMEAAM 
APGKNLNQSLLDLHALGSAPTDPHLCDFLESHFRDEEVTFMPRKSAD 
LHAAGEAASLGGVSLNVMKGASLHALCLRLLNSFGPGVATLCGERW 
HKNNRASALRKCIFWPVSLNSSRARPRQRTTAGGAVPTAALQRVIPA 
SSQCLFPAPMGTTEATLRMENVDVKEEWQDEDLPRPLPEETGVELLG 
S P VEDTS S P PNTLNFNGAHRKR KTL VAPE I N I S LDQS EG S LLSDD FL 

DTPDDLDINVDDIETPDETDSLEFLGNGNELEWEDDTPVATAKNMPG 
DS ADLFGDGTTEDGS AANGRL WRTVI I GEQEHR I DLHM I R P YMKWT 
HGGYYGEGLNAIIVFAACFLPDSSLPDYHYIMENLFLPPGQQTTPGP 
VGLTQPAAGVEAGRVGTAGVXiTLDPSMLGPCLIiCPVSRYVISSLELL 
VAEDYM I VYLNGATPRRRMPG IGWLKKCYQM IDRRLRKNLKSLI I VH 
PSWFIRTVLAISRPFISVKFINKIQYVHSLEDLEQLIPMEHVQIPDC 
VLHDRQIILTTLClAYGEGEALASHLFHVILPFHIHSCFPSSTETSDP 
PCGPGLDLAYTTPFSTHFAQRGK 

5600 

A 

18 

403 

FCIFSGDGVSPCWPGWSRIPDLR*SACLSLPSSWDYRRPPPRSANFF 
\ * FLVKTGFHHVGQAGLELLTSGDPPASASLSAGITGVSHCTRPMYA 
FLKQLNFHIIDKSDYRLYRKXiYLKATTMVFRYAFL 

5601 

A 

2 

497 

ETfeSHSVTQAGVQWCDLGSLQPPPPRLKRFSCLSLPSGWDYRHVPPH 
LANVCI F* * IWCFTMFARLVFNSRRTPSDLITSAS * SAG ITGVSHRA 
\RLCIFYFYFFVAEMVRMSCFVTQVGLEFVALSD\LPPWPPKVLGLQ 
A*ANSAWPVFYCNSFFHSHLNAFPN 

5602 

A I 

3 


LSCFPQATFFFFIF*DRVLIi\CPGRSTWQSQLTAALTSRAQGSSCL 
SLPTKWDYRHVPPYSAFFLFVLDKSLALLPRLVLNSWAQAILPPQPP 
KVLRLLA 

5603 

A 

122 

525 

QFFFFEIGSLSVAQVGAQWR\STSPLQPWSPGFK*FSHLSLPESWEH 
RPMPQGLDNFCI \ FVETGFRHVAQAS I KLLGSSDLPALASQSAGMTS 
MSHHAWPIVKNFVCIYYVPDIVLSMLSVKSINIHINHVRGLQ 

5604 

A 

1 

198 

LTGRL/SKCEWSWRFDVQLKDLKRWQNHLLLSQQLACIVLTSAGIM 
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DHKEMR* KHTGGKSLGFFF 

5605 

A 

156 

1047 

VPAGEARVQWHDLGSLQPPPP\GSSDSPASSSRVAGITGIKTNTICK 
KCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQ 
CKQQCAFDRKDDRKKVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSS 
RAGHQEKEQYSRLSGGGHYNSQKTLSTSSIQNEIPKKKSKFESITTN 
GD S FS PDLALD S PGTDH F VI I AQLKEEVATLKKMLHQKDQM ILEKEK 
KITELKADFQYQESQMRAKMNQMEKTHKEVTEQLQAKNRELLKQAAA 
LSKSKKSEKSGAITSP 

5606 

A 

156 

960 

VPAGEARVQWHDLG S LQP P P P \ G S SDS PAS S S RVAG I TG I KTNTI CK 
KCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQ 
CKQQCAFDRKDDRKKVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSS 
RAGHQEKEQYSRLSGGGHYNS FS PDLALDS PGTDHFVI I AQLKEEVA 
TLKKMLHQKDQMILEKEKKITELKADFQYQESQMRAKMNQMEKTHKE 
VTEQLQAKNRELLKQAAALS KS KKS EKSGAITSP 

5607 

A 

1 

354 

ETEFPSCCQAGVQ*HDLGSRQPPLP\GSSDSPASASRVAGITGMHH* 
SRDVLIRILELEEGLELCPHFSVRNLPKTGQILTLMLSLAPVKLAPQ 
PKLSWMELTRHSRLWAALSGGGQV 

5608 

A 

3 

403 

YIFSRDRVLPCFPGWSQTPGLKQSARLCLPKCWDYRREPPQLAED/S 
YFQFRAS LKMTL *GI VSMV/ PLLQKLLWMCKAQGNQAEEGTGVALPE 
REASSWSCLLQLPFPVDEPCSSFKRRESEKEPSPNGFPNLNI 

5609 

A 

340 

1015 

VGS VEEFQGQERS VI LI STVRSSQSFVQLDIiDFNliGFLKNPKRFNVA 
VTRAKALLIIVG\NPLLLG\HDPDWKV*APPPHSPS*WPQPLPRAGG 
SFIiSACLSLS*SSDVSIFTASLLYIi*VK*HPEVEQVQLRWDRTVA*P 
PVS AGHS S VPHRFL\ EFCKENGGYTGCP FPAKLDLQQGQNLLQG / LW 
SKLSPSTSG\PHSHGLTSPREREGERGPVFCKWEPEWRNEL 

5610 

B 

769 

809 

GFPII FHGVMGKDEREGNS PS FFNPEEAATVTS YLKLLLAPSSKKGK 
ARIilPRSVGVISPYRXQGASEDLEDIKDFEGGFSKKNSKAQERSVIL 
I S TVAKR PRALLQLDLD F * 

5611 

A 

99 

341 

SELGTVAHTO^ILPSSGDYRRPSPHIxANF\*FLVEMGFHHVGQSGV 
ELLSSGDSLALASQS PGITGVSHRAQVPPPFFFF 

5612 

A 

1 

1795 

MRQSHQLPLVGLLLFSFIPSQLCEICEVSEENYIRLKPLLNTMIQSN 
YNRGTSAVNWLSLKLVGIQIQTLMQKMIQQIKYNVKSRLSDVSSGE 
LAL 1 1 LALGVCRNAEENL I YD YHLIDKLENKFQAE I ENMEAHNGTPL 
T^^YYQLSLDVLALCLFNGNrYSTAEVVNHFTPENKNYYFGSQFSVDTG 
AJ^VLALTCVKKSLINGQIKADEGSLKNISIYTKSLVEKILSEKKEN 
GLIGNTFSTGEAMQPSGVLPLNGCSTTTLAVACSTGHRKCASRALEN 
RCRSPAAKVTKNKRLSNCQTGQRKLRRCDNRMQCGVLDRILEQKEDI 
SGTTS E I Q I KWGVELH S VSRWRLGKVAMEMVQCQKAG YPERGDGVG A 
VAEGPNIKVAGGLRELETRKTALPENESSQEHAFITITLQHLVQGLA 
HSGYSVliSKCLLNENRAMQKHGTVWTQQIQSSAAEKSEKTMSWDRLG 
RGTAHSWQACPVSLKKQAVEGAHQEEQCKRRDEMRPGESDLVTMASG 
RDALLCGRCCQQAKTEELTITVPRLGEQREAPLPCSTPPGSTPPAAK 
\ PRPS AAPDGPS S * S PGARRQVARPRRGQAPENK 

5613 

A 

12 

367 

LGGASEATGASAAQPPPPPPAPRAPNPAPGPAPAQR*SALDPPLPGR 
APRS FLKG PRP ATARP PG / WRLQS PL P * PE S P PDTRG VRGEAPAPAD 
PAAGPRKSKSDCLAAVIRCLCRAPR 

5614 

C 

231 

302 

MNS FLVFFPLYVQMYMLRLNFCA* 

5615 

A 

194 

737 

EGGVCLSWCGWFGPVCCDYTPPLPEPRDGCLC/PLPKMPPAPPMLPI 
QQSFQPFQERPGPSPVPETASPPAGN*ISHLSEGPPNTQWSPGFKPW 
GVCHL* / CGHVGVPPLKPAPCPLE/ CCPSGCVDSGAFPQKHSPSGSG 
PETAAGPSHLNDNWQRLGFYELLSGI FCRAPNKCI CYVS I FVLK 

5616 

A 

2 

360 

PR VRDPR VR PRVRRR I YKPLRTRDELDQFLDKMDD PD Y / W PAVDF FS 
GDVMIHPVTNRPADKRSFIPSLVEKEKVSRMGHAIKMGWIQPRRPRD 
PTPS FYDLWAQEDPNAVLGRHKMHVPA 

5617 

A 

1 

2072 

DSGSDSSEDDDEGDEEGEDGALDDEGHSGIKKTTEEQVQASTPCPRT 
EMASARIGDEYAEDSSDEEDIRNTVGNVPLEWYDDFPHVGYDLDGRR 
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IYKPLRTRDELDQFLDKl^DPDYWRTVQDPMTGRDLRIiTDEQVALVR 
RLQSGQFGDVGFNPYEPAVDFFSGDVMIHPVTNRPADKRSFIPSLVE 
KEKVSR1WHAIKMGWIQPRRPRDPTPSFYDLWAQEDPNAVLGRHKMH 

vpapklalpghaesynpppeyllseeerlaweqqep\serklsflpr 
kfpslravpaygrfiqerfercldlylcprqrkmrvnvdpedlipkl 

lwevatarcv^tvpvggwksvawnpspavclvaaavedsvlllnpa 
lgdrlvagstdollsafvppeepploparwXleaseeerovglrlri 
chgkpvtqvtwhgrg\dywplvij\prghtgviiihqlsrrr\sqspfp 
cthv * vhrvafs parpfla lvgpqrsvrl\ yhl \ lrqeltk\ klmp\ 
nckwvcqpggcnpagdn\vicgsy\dsklvwfdldlst\kpyrmlrn 
h/hrrallavaf\hpryplfasgsdd\grvivchgmvyndllqnpiil 
vpvkviikghvltrdlgviidvifhptqp\wv\fssgadgt\vrlft 

5618 

A 

1 

214 

QD S ALLQLL * 1 1 Y PVYNL S PAGQGRLDLFS CLLRHRL KLPTS EGVRT 
\ QSALQAFKGPNAQTQWI YGHDPN 

5619 

B 

58 

280 

IHCAISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMD 
KERRKS LLKPKLLRLQRD I E KAS KDKEX * 

5620 

A 

2 

785 

HFNMRDPLTDCPYNKVYKNLKEFSQ 

YAKGLQKLAS KLS KALQNTRKS CVS S AWAWAS EGMKS TADLHQKLGK 
AIELEAIKPTYQVLNVQEKKRKSLDNEVEKTANLVISNWNQQIKAKK 
KLMVSTQET* STFPSL* ESS / ARQSMTEKEKRKLLNKLTKSTEKLEK 
EDENYYQKNMAGYSTRLKWENTLENCYQSIL/ESWRRKEFNFYAITL 
NQYSQHI SLFGQTLTTCHTADSLCHQQD 

5621 

B 

13 

543 

MFLAPIAVSVRRFFXVKASVYAYTMFFSTFYHACDQPGEAVLrCILSY 
DTLQYCDFLGSGAAIWVTILCMARLKTVLKYAYRCGHRRQCYPTLWQ 
RWAFYLLPGVFMASVGIAIYTSMMTSDNYYYTHSIWHILIaAGSAALL 
LPPPDQPAEPWACSQKFPCHYQICKNDREELYAVT* 

5622 

A 

1 

1798 

MLCFYAGSCRHFRSGAPPVINPLGTSFPDDTAVQPSFQVGVPLSTTP 
RSNASVWSHPAPGDWFVAAHLPPSSQKIELKGLAPTCAYVFQPELL 
VTRWEISIMEPDVPLPQTLLSHPSYLKVFVPDYTREIiLLELRDCVS 
NGSLGCPVRLTVGPVTLPSNFQKVLTCTGAPWPCRLLLPSPPWDRWL 
QVTAES LVG PLGTVAFSAVAALTACRPR S VTVQPLLQS SQNQS FNAS 
SG L»LS PS PDHODLGR SGR VDR S P FCLiTNYPVTR EDMDWQ VH POPLD 
RVSVRVCSDTPSVMRIiRLK\ PGMDSGGSLTISLRANKTEMRNETVW 
ACWACLALPIJ\IjILSIiNCTTAFFQGYPLSLSA\GSRRANLIIPYPE 
TDNW\YLSliQLMCPENAEDCEQAVGPRWEDHIiCTWGAliF*TDCGPYG 
QC \ S LLRRHS YL YAS CSC KAGWRGWS CTDNS TAQTVAQQRAATLIiLT 
LSNLMF/LGPPSPSSVRRFFLVEASVYAYTMFFSTGCGVGGQDRAHG 
ASPQFYHACDQPGEAVLCILSYDTLQYCDFLGSGAAIWVTILCMARL 
KTVLKYVLFLLGTLVIAMSLQLEPQGHVEHLGGPASLPS 

5623 

A 

121 

291 

AFCFPKYELTINFKNFQLGVVTHACNPSILGGRGGRIT*AQEANIjAN 
mak\ PCLLNK 

5624 

A 

571 

690 

CQQGFSFLQAYGPAQHAI S \MRKFKAKYPDYEVTWANDGY 

5625 

A 

130 

943 

TGEAEHLGLARADTWI1GEEGPRDGAFPGGSL1EGKAWCEQPRGPPNCP 
VPRVPPDPAGEELLRWLISSPPIADSTFPLSGAGGAARRQGTGLRLV 
STCLGPRYNWLRRWALGS I PRRDWSSLSQSPPPAVQAAPGWGWEGGD 
SGRRNMAVADLVL I PDVD I D S DCVFKYVT, I RVHS APRSGAPAAES KB 
IVRGYKWAEYHADIYDKVSGDMQKQGCDCECLGGGRISHQSQDKKIH 
VYGYSMVSRS PVPPCRRPQYQLRGPPEPAALTRGPS 

5626 

A 

45 

473 

G I PGRRNMAVADLAL I PDVD I DSHGVFKYVL IPS PLGL PAPG IRPAE 
SKEITOGYKWA\GHHADIYDKSVGATCRKQGLRTVSVWTAGPHLPTK 
SPGQERFT*YG*FHGP1WLPKNANFNI J RKSKAKYPRLTEVTLGLNRP 
AT 

5627 

A 

3 

348 

TLRLTHLHTDTHQ/HSHSYTHTEIHTPHTPHTETQTH*YTYAIiQ/PL 
RSHLIFLRQSRPQNGNAPLCEFSPHSPDVLQTLLHLTSVWKIPSTS 
LLMCRS SGGAQCRE S LDQHEGVG P 
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5628 

A 

3 

331 

FFEMESPSV/SQAGV*G\PNliGSLQPPPPQFKQFFCIiSLPSSWDYRR 
TPPCPANFWIF*\KDGVSPSWP*WSQSLDLGLFKFIYFFKETLILCT 
QS STTTTTTKRTTRS SHR 

5629 

A 

3 

336 

SHSVAQAGVQWHNLGSLQPPPPAFK*FSCLSLPSN*YYRRP\PPHPA 
DF\*FLVETGFSHVGRAGLKLLTSSEPPACASQSAGITSVSPPCRAH 
TFELIPLCPFLFWPLFCP 

5630 

A 

1 

267 

EMESCSVT\RLECSGAILAHCNLCLLGSSDSPASAS*VAGITGMCHY 
TRPTFLFLVEMGFCHVGQAGLELLT\QVIRPPRPPKVLGLQA 

5631 

A 

3 

521 

FS FFFFPAFKMSNRGRGGSLL\GRKFPDFPLGPSGRHi * SNWCLTNT 
RS P KTLY I HL P * RGNQGDRLDQTS PLLGVGDMVMAHS / VKKGKPELR 
KK\VHPSSGSFRQPKVHYP*EKMGVFLLFFEG*LQGVIVEPFKGRDG 
KVS CHL PGTQLG KG S FAD FVAPG ICIPMPWQP LH 

5632 

A 

3 

216 

DRSHSVTQAGV/Q/WCDLSSLPPPPPGLK*FLCLSFLSSWDY\GAPP 
HQTNFYI FI RDGVS PYCPGWP * TPSLG 

5633 

A 

1 

412 

IIRSKNI*YLEINPTKHTKTFSKTLTK\EMKEDLN*WQEI\PCS\WI 
G S VS I VKM S VLiS KL I HTE I QCflTRNKY PQ I G LLL L P L LL F FLLDRL VIj 
KFTWKGTG PRMAKAI LRKS / S EDLTLTY I NTYYKATVI KLVGAKPD 

5634 

A 

276 

2589 

NKHFSFQTPKKSHQKSLSFSKTTPRRISHTPQTPLYTPERLQKSPAK 
MTPTKQAAFKESLKDSSSPGHDSPLDSKITPQKRHTQAGEGTSLETK 
TPRTPKRQGTQPPGFLPNCTWPHSVNSSPESPSCPAPPTSSTAQPRR 
ECLTPIRDPLRTPPRAAAFMGTPQNQTHQQPHVLRAARAEEPAQKLK 
DKAIKTPKRPGNSTVTSSPPVTPKKLFTSPLCDVSKKSPFRKSKIEC 
PSPGELDQKEPQMSPSVAASLSCPVPSTPPELSQRATLDTVPPPPPS 
KVGKRCRKTSDP/ MKEAS/ SECQLDASATPGVGTADSPAAPTDSRDD 
QKGLSLSPQSPPERRGYPGPGIiRSDWHASSPLLITSDTEHVTLLSEA 
EHHGIGDLKSNVXiS VEEGEGLRTADAEKSSLSHPG I PPS PPSCGPGS 
P LM P S RDVH CTTDGR OCOAS AO LiDNT. PAS AWH *5 TD <5 A <? POTYEVPTjR 
MQASGLPKLRIKKIDPSSSLEAEPLSKEESSLGEESFLPALSMPRAS 
RSLSKPEPTYVSPPCPRLSHSTPGK\TGGKPTSARPVPPPT/SPSST 
PSPFQTDGVPWTPSPKHSGKTTPDIIK\TGPGGRGRWAVAPAPLPGG 
ARS VQ\ PS WEPVTA* VRGQGPRP * TQHPQDAHLGGF * ARGSVPAPRP 
VASQE/PACLRPRKPLPGDSLG*VPGRESCWPRKKLTVEPKGSVT*E 
KIQKLVRVKRGLQVGVHGSYPPRETKRCLFPAPPHLPAVPCGAPSPA 
SALQALTQSPLLFQGKTPSSQSKDPR 

5635 

A 

29 

268 

ILINHFFLESCSVAQAAVQWHNLSSLQPPPPGFK*FSYLSFLSS*DY 

khap\prpanfcifsrda\vs pmlaswsqtpdlk 

5636 

A 

2 

118 

WSPSPDLVIHPPRPPKVLGLQALHWVLKVNGHSTDIGL 

5637 

A 

1 

262 

ETDLALSAJUjECSG\TIIoAJiRKL»RLPGSRHSPASASQVGGTTGARTT 
PG*FFVFLVETGFHRVSQDGLDLL/NLVIHPPWPPKVLGLQA 

5638 

A 

3 

309 

IDTTFFFFFFFFFFETESLSVT\RLECSGAILAHYKLRLPGSRHSPA 
SASRVAGTTGARHHARLI F/ VYFLVEMGFHHVSQDGLDLLTS * SGPP 
RPPKS\LGIiQA 

5639 

B 

49 

754 

MAGAAPRLPWSRPHGSCGWWPMQPLQGPGRRAVAAAWEVATPVFMPV 
GTQATMKASRPNSWTLWVAASAWEYIjPSGSKAGGTAAFPDGVAGVSV 
RGDGGGLRFRSPYDGNETLLSPENPCRSRMRWAPGVVTGA1.LQVNPL 
AGPVHAAHQRPDKQNLFAI IQVGWTQI SGPPALKRFGS AWHSRAFLT 
HCCTVTTRPRCTTSRPQHRLPAELMSAVRTSIVEKRFPDSCGTSWAP 
X* 

5640 

A 

3 

906 

TVKMAGAATQAS LE S APRI MRLVAEC S R S RARAGELWLPHGTVATPV 
FMPVGTQATMKGITF\EFLDALGCRICLGNTYHLGLRPGPELIQKAN 
GLHGFMNWPHNLLTDSGGFQMVSL.VSLSEVTEEGVRFRSPYDGNETL 
LSPEKSVQIQNALGSD\ I IMQLDDWSSTVTGPRVEEAMYRS IRWLD 
RCIAAHQRPDKQNLFAIIQGGLDADLRATCLEEMTKRDVPGFAIGGL 
SGGESKSQFWRMVALSTSRLPKDKPRYLMGVGYVVDREARALPVGSG 
FLGTPYPAWGGGIWGKRTQV 

5641 

A 

379 

677 

EGLRP/ELQNLFAIIQGGLDADLRATWP*RDDQARNVPGFAIRGALS 
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GE * EQSREF/ CAGWLAAEHTLG * PKDKPR YLMGVG YATDLWCVALG 
CDMFDCVFPTRTARFGSALVPTGE\LQIiRKKVFEKDFGPIRPG\CTF 
PHVPKAPARAFLHATAAQ * QHGPRCTTLTVHN\ I AYQLQLMS AVRTS 
IVEKRFPDFVRDFMGPMYGDPTLCPTWATDALASVGITLG 

5642 

A 

2 

335 

AGVQWPDLSSLQPLPPGFKRFFHLGLPSSWDY/RVHEAPRPANFL\K 
FLVEMAFCHVGQAGLELLG S SDTPVLAFQSG I TGE SHLAWPVFNF * N 
GS FNVFSLRVINEHLALHL 

5643 

A 

56 

571 

Y YQN PPKKKLAAS CGLS P VLQVLKS LKGEVLPLGETQHY FPLNRELR 
LLTQHFLLLMPLSQ/QTSKAGTIIARLIDPDYQGRTKFKIRNGKKIIi 
NGRRKEIAWNVGFSIT/PCLLWKVIGILEQSNMAIYWYIHPRMKV*A 
S P PGKEQRP FEVTPEGQRNNE WLEEERNGRCLP 

5644 

A 

2 

1085 

AERO I EEENREREWEREVLG I KRDKSD S P A T OT .P T .ttp PMhvmrpn vv 
PAFLDMVRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVR 
QLQH I VSDE I CVQVTDLYIiAENNNGATGGQLNTQNSRSLLESTYQRK 
AEQLMSDENCFKLMFIQSQGQVQLTIELLDTEEENSDDPVEAERWSD 
YVERYMNSDTTSPELREHLAQKPVFLPRNLRRIRKCQRGREQQEKEG 
KEGNS KKTMENVDSLDKLECRFKLNS YKMVYVI KS ED YM YRRTALLR 
AHQSHERVS KRLHQRFQAWVDKWTKEHVPREMAAETS KWLMGEGLEG 
LVPCTTTCDTETLHFVS INKYRVKYGTVFKAP 

5645 

A 

387 

533 

RWSD YVER YMNSDTTS PELiREHLA\ QKPVFLPQGE * AMS ICQQVHTV 
HD 

56.46 

A 

1 

169 

VFLVETGFR\ LLARLALNS * PCDPPASASQS VG ITGESHCARPGTIL 
MKHRGQLCT 

5647 

A 

2 

369 

FLRWS FTLVAQAG I QWCGVS S LQPLPPG FKQFFCLS LP S S WD YRC PP 
PCPTNF\*LLVEMGFHHVGQAGLELLTSGDLPASASQSAGITGISHR 
ARPGTLFF*AVNGGGNQVFLFLRVLNGL 

5648 

A 

1 

340 

FRPQSC*CPPADSRLFPATPEAVIjSIjLVKA t ;pPPPT.PPc;pDPK'PTf3T 

SGGALG*QRFSRAARNRGDPLTRKNSKITSLLSFLQELTKALEQKPD 
DAHILSSKSLLSHSSWELL. 

5649 

A 

2 

660 

RPPPAPAPGPQGPWPQGPYP*AESPPARSRTPHASQLPRCGPRPTAD 
PSSAPSRRLRPTSRSRHCFPAAGS/PGTPPQPPSADVRRVPPFSPLS 
SRWW*PFPQSC*RPPADSR\RSRPPLRPSFRFL*KPARPQSSSVASR 
EAKGPLVERQVDSVLAGPRETASSFATRPCATLPGKVTANCKPPQKT 
SGQLLVRGLSRSVAAYSSPFFGAPEIGEGEMPI 

5650 

C 

19 

153 

MKMTSNFLLFSSGIFSSTFQIHIiDVPLELQTQNTVMGCLMRIHP* 

5651 

A 

2 

306 

ETKSCSVA*AGVQRRNHGSM*RQPPG\ SSNPPISASRVFG/ ITGICH ! 
RTHL I FLFFVETGSHYVAOGS S * TPEAOVI RLHOPPKTA VFFYT FOT. 
FLCFLLVCY 

5652 

A 

2 

760 

FFFFFWSENDGHLDTGARFAPDRRHTGVGRTKDTQVGYRCHTAPARA 
RGGWGEDGCLAS * PGVRVKSRWS VPGAPGRGGQCHNPSGFPT* PGGR 
HRKQPRPGARVLQAGGQGQAFLPLAGWHLSGPGYWLSTPWACKGLGV 
VLSVTPIiPPRSPWVG*GIiGSR/QAVEGRAAQGRYLSFCPGTQW*WPP 
SAP I PTGT* SEVAHTKGVPGPGEPPAARGLRTLTIiAVGPGLPGSRLV 
QHPKRYRALQALTSCFCSA 

5653 

A 

1 

834 

MVS I SRPRDPPASASQSAGITGPALGAEGERAFHRHRVM I VTEGAPG 
GLSSGHLRDAMDSEAGTDDGCASHVRVVHRDLKPSNILYVDESGNPE 
CLRICDFGFAKQLRAENGLLMTPCYTANFVAPEVLKRQGYDEGCDIW 
SLGILLYTMLAGYTPFANGPSDTPEEILTRIGSGKFTLSGGNWNTVS 
ETAJCDLVSKMLHVDPHQFJ^TAKQVLQHPWVTQKDKLPQSQLSHQDLQ 
LVKG AMAAT YS ALNS S KPTPQLKP I E S S I LAQRRVRKL P S TTL 

5654 

A 

674 

1269 

PRDLPASASQSAGITGVSHRLRFYFDHQDVAPECGAYFFHELAKEKR 
KG VE * * NTLLAKMVS IS* PRDL PAS AS QS AG I TG VSHRLR F YFDHQD 
VAPECGAYFFHELAKEKRKGVERLLKMQNQRGGPAVFQDILKPGQDB 
WGKTLEAMEAAMAPGKNI^QSLI^LHALGSAPTDPHLCDFLESHFRD 
EEVTFMPRKSADLHAAGEAASLGGVSLNVMKGASLHALCIjRLIjNSFG 
PGVATLCGERWHKNNRAS ALRKC I FW P 
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5655 

A 

2 

612 
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C\ PAS PS SWDYRRPPPRPANFFLYFYS /MRRGFTVLARMVS IS* PRH 
LPALASQRAG\ITGLSHCTRLFFF*WMESPSVTQAGIQWHDLGSLQP 
MPPQFR*FSWLSLPSSWDYRRPPPRPADF\CTFSRDGVSPC*PGWSR 
SPDLVIHPPWPPKVLGLQA 

5656 

A 

140 

261 

RAEPPHLANHILFF * DRVSLCRPGWTAMA* TW\ LTVASTSRAQAVLP 
PSAS PNS * D YR\ HVS PHPL I FVFS FLIMLSRMVSNS WAQG ILLPWS P 
KVLELQA 

5657 

A 

79 

358 

AGVKWGDLGSL/ QPPPPRFKRFSCLSLPSSWGYRHAPP / * PG * FSVF 

LVKTGFLHVGQAGLELIiTSGDLPVSASQSAGITGMSHCAWPTKALIS 
K 

5658 

A 

2 

350 

SQVDR*QSEPESIRICREDHMERLQAFDANSRKQEAEWKEKAIKELE 
EWYARODEOLOKTKANNRVADEAFYKOPFADVTnYV\ PFayvMnTnp 
SSPGTEWERVARLCDFNPKSLD 

5659 

A 

3 

752 

HRCGVGGSVGFCLTVGVRAVQLPAMAELDPFGAPAGAPGGPALGNGV 
AGAGEEDPAAAFLAQQESE IAG I ENDEAFAI LDGGAPGPQPHGEPPG 
GPDAVDGVMNGEYY\QESNGPTDSYAAISQVDRLQSEPESIRKWREE 
QMERLEALDANSRKQEAEWKEKAINELEEVTfARQDEQLQNTKANNRV 
ADE\ AFYKQPFADVIG YVYVAKL I PFSTSDS F YFCLELMLLLCHKLL 
CFLNYFKLALWGLPKN 

5660 

A 

1 

348 

NSLGESANALGNQ/QPITQNVCPRFL\MQAPGHAPSPSFIMNTLE\R 
KAFLKRFP\WMSAP\lQVGLVGFCLVFATP\LCCAIiFPQKSSMSVTS 
LEAELQA*DPKRAHP\ELRRVY\ FNKGL 

5661 

A 

3 

430 

LRQSL/DSVAQAGVHWRDLGSR*APPPGFTPSSCLSLPSSWDYWHPP 
RRPAVFLYF*ERRGFTVLARMVS IS *RRDPPASAPQSAGI IGM/SHR 
ARPQILFFIHIFSTVSAGINES*VCFYCIIILEVPSKEGILES*CIS 
EAAH 

5662 

A 

3 

756 

GAISAHRI^RFLGSSDSPASASQEHWGTETLAH*RSQLTATSASWVQ 
VILLPQPPKNTGAQKHWPTEGEAKEHCVSDDSESTLKAGGQHRIKQG 
GNS S SG AREPAPNI LNSRRS WWTL VAKAAAVTVI I RVI LMAAILRGT 
LAKEVNWWVLDLTDFKI^AADPRARHKDSPSPHQTQEPS 
PGPQVELPAS PTMCAGTSQPLVSRWDWS PWSRKRHI I FAPS SHNKYA 
GESFPGIYDAIFDIENKANSRLAWKT^VKTTHTQT Ani?TT02iAZir ,r rT vr? 
VL 

5663 

A 

2 

774 

LANFC I FSRDGVS P * WS G W SRTPDLR * STRVGL PKC * D YRRE PPHPP 
GLIiPVFSSHFAFGLFORRRRLFVCFFA\ FT.Rn^T.nT.VTnzinvnX uwn 
LGSLQPPPPGFK*FSCLSLLSGWNYRCVPPRQANFCIF/M*RRDFTM 
LARLVSNS * PQ/ CDPPTS AS QS AG I TGVSH/ H/ ARLAFVS / CYTHTH 
THTHTLA*IPFPSVFPPNSLSPPLQQGFCPHHCTELTSYIiIKITNPV 
HVI RPNGHSALC I L VQVS APLHS WLS PLLE 

5664 

A 

2 

417 

LDPGSIxAGFTSYIQFMYDEFVEEYEPTIOUDSYRKK/VAQDGEEJVQIY 
1 1 NTAGQED YTAI KDNY FHCVFS ITEMES FAATVDFKEQ / ILRVKKD 
ENIPFLLVGNKSDLEDKRQVSIEEAKNRAD*W1TVIYVETSPKT*AN 

5665 

B 

151 

1312 

MMGGRLLTGGIPGEAAVWGCSRPSTAPSVFPPPPAAAAAAAAVSSGN 
xjRLHPGOVPRVVXjSRPnr?T,2iPPT?f^r , 'DQ VCPPCi7CTDt7DD 7\r»r>Dr , trc"T» 

GDRDPAPPPGPPSSSHHFSPLRIERKADKPPENSGESSHDVASREQP 
LVLLEAHFRWAILl^CHSMPWGRKQVx^ 

TYQNLWDTAKISFFPDFSYNFFMPISVIjDAGGGSETAKHHHQDVHEI 

QPHIEPQISLGSFEHPIFGSCTIKEGTYRKRVVIxDGEEVQIGILETA 

GQEDYAAIRDNYFRSREGFLCVFSITEMESFAATADFREQILRVKED 

EIWPFLLVGNKSDLEDKRPVSVKEAKNRA^ 

TRHKPGTSSQSX* 

5666 

A 

3 

955 

LLLLLLLLLLLLQPPRLPRHPSDSSFDRSRRGAFQATRTEYPPGGAT 
QPRLPEPSGRRTGSRCKFFLILWRKLRHKIVx^^ 

MGGSGGVGKSALTLQFl N TxT)EFVEDYEPTKADSYRKKVVLI^EEVQID 
ILDTAGQEDYAAIRDNYFRSGEGFLCVFSITEMESFAATADFREQII* 
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RVKEDENVPFLLVGNKSDLEDKKQVSVEEAKNRAEQWKVirrVETSAK 
TRANVD/ KEWPFLKTRWWNTCKYI S SHCPRPAPVSRKTAHWAEVFFD 
JjraKxi J. KAKlunr.U b KE KNG KKKRKSLAKRIRERCCIL 

5667 


± 

D J 3 

piaajm ivF isAjyJN oiiAljHKrv I PWGSGGVGKS AXjTIj \ QFM YDEFVED * E \ P 
TKADSSRKRV\VLDGEEV\QIGYPLDTAG\QED\YAAIRD\NYFRSG 

EGFLCVFSITEMESFAATAE\FREQILRVKEDENVPFLLVGNKSDLE 

DJCR\ OVFSPKT?rtTO\TRAPOWTJVTJV\/T7'T'Q2i VTI? 7\ "Knrri v^yctTTM mddtdk 
■i^xa-tv. W v f ^ xvrvrv \^xvxii jv/vE#%^r¥l>l VW X VOl O/VTvX x\_ftj>/ VUi\V r C L/LtV f l3t(.X!!*J. KA 

RKMED I \ YYLNGTKNTKRLAERI REGGC I L 

5668 

A 

2 

178 

WGFTMLARLVLNS*SQVIHPPWLPKVLGLQALGSGSLHRSFFISVKG 
SSGFTAPSRRL 

5669 

A 

208 

339 

RGQRLPVSRKWLLGTEEAWALR*STRLSLPKCWDYRHEPPCPAH 

-J Q / V 

A 


-coy 

DKIFFFFF*DRVSLCRPGWSSVARSRLTATSASQFKQFSCLSRPSSW 
DYRHVPPRPANFCI FSRDR/ SFTMLVRLVLNS * PQVIHLPRPPKVLG 
LQA 

5671 


1 


JU\Vx^v,l\J5Cijl\Ax* JbSCablMr TQHQRIHTGEKPYECKECGNAFSQSSQIj 
IKHQRIHTGEKPYECKECEKAFRSGSDLTRHQRIHTGEKPYECKICG 
KAYSQSSQLISHHRIHTSEKPYEYRECGKNFNYDPQLIQHQNIiYW** 
NRIYEMCREAFNYGSKQN1»SC* *FL*IE*C/SKHFICGIjQLIQPGKN 
NVGEILYE 

5672 

A 

2 

355 

lflsflffsfffffetgphsvaqawqwlnhgslqpqlsgfkqsfrl 
rllsg* d yrctplrlas f \ * ffvemgchyvaqaglqlld sgdlpasa 

SQRAGITDVS PCIWPPMSCYVKRT 

3 D / .J 

A 
r\. 

y £ X 

1 n Q iZ 

TQPWYRHGGGGPKTLRQRS WS PCQAYHVPHACPSHHKNIj S SRPRPWC 

I PVHLWHSGWPLLHTLPAR I S AVQPSFPFPKETLGQAELQNRWVPLG 

PGKQVLRERPLSHQREKREAQLTPRLljARRPQAGEAAARRQGKGTAE 

GET?R* *Df5P c JK*PQ.TA7T?ZVP'\7WKT? Zv D vr\r* dob on *T>r"»r»\ OTDMTiDTniA 
vjorttj iWJxroiNJr'oij vjc/iuK vKI\JtrturA.ybroAoA ,r rbr \ b 1 rW P P X 1 Q 

GSHWPREQAPAPTPMSSEEPNVLSQQTARP/RPSPGTGMGVGVLKPF 
VSVPnAT.afa.TMPPMffVT.HTTtfTrpaBnnun^yuPMPA'rpr* 

5674 

A 

2 

70 

WSRSPDLMIRLPRPPKVLGLQA 

5675 

A 

2 ! 

118 

WQ po py~)T .VTHPPTJ PP1T\n CZT oat 1TU\7T i<nt ns.7r*i_r c TnTPT 
rw or ori>uv JLrilrzrTKJrlrr^v i-ivj^Xj^M-Url W VJ_ixvVJ>JLjrit> lUlbL 

5676 

A 

446 

756 

ISFFLSFFLSFFFFFEAESRSFTQIGVCSGTISAHCKLCLPGSHPSP 
AS AS * VAGTTGARHHTRL I FL YF * * KTGFHRFSRDGLDLL/NLVIRP 
PRPPKVLGLQA 

5677 

A 

2 

276 

LRWSFSLWQAGVQWCDLGSLQPPPSGFK*LSCIiSLPSSWDYRSAPP 
CPANFVFLVETGFLHVGQA\ CLELLTS SDLSTS AS \ QS AG I TRREQ 

5678 

A 

60 

235 

VRVLQIYHSLRLPGSHHPPRGDRLQGRGFLTACQPC*STYC*TFPPG 
VLGNTGRDLLL 

5679 

A 

1 

2059 

MSSQEVTFEQRPEGDEGGAVELSRQQTQQERRRQLGQTQLMYIWAKP 
KICSEDLEGAVKLPASGVKTHCPPCNPGFFKTNNSTCQPCPYGSYSN 
GSDCTRCPAGTEPAVGFEYKWW1TTLPTNMG/TRPFSVGSTSSTR/CM 
TGWEVAGDHIYTAAGASDNDFMILTLVVP 

RITFVFETLCSVNCELYFMVGYNSRTNTPVETWKGS KGRQAYTYI I E 
ENTTTS YTCAFQRTTFHEASRKYTNDAAEI YS 1 1 CHPCY *MPWASYA 
GPCAQKASEEGSSCTSCPAGYYIDRDSGTCHSCPPNTILKAHQPYGV 
QACVPCGPGTKNNKIHSLCYITOCTFSRNT^ 

AGGPS FTS KGLKYFHHFTLSLCGNQGRKMS VCTDNVTDLR I PEGESG 
FSKS ITAYVCQAVI I PPEVTGYKAGVS SQPVSLADRLIGVTTDMTLD 
GITSPAELFHLESLGIPDVIFFYRSNDVTQSCSSGRSTTIRVRCSPQ 
KTVPGSLLLPGTCSDGTCDGCNFHFLWESAAACPLCSVADYHAIVSS 
CVAGIQKTTYVWREPKLCSGGISLPEQRVTICKTIDFWLKVGISAGT 
CTAILLTNHjTCYFWKJCNQKLEYKYSKLVIWATLKDCDLPAADTCAIM 
EGEDVEDDLI LTS KKSLLGEDQI I YLQEDS 

5680 

A 

1 

1098 

MGFHHVGQAGLELLISSDPPAIiASQGAGITGSWKWILAPVIIjYICER 
ILRFYRSQQKWITKVKNI^FPI^VIRVIFYRTTEl^FYVVTLKEEL 
GIWAQWVVVimPSKVLELQMNKRGFSMEVGQYIFVNCPS I SLLEWHP 
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TASEDVFQYEVAVLVGAG IGVTPFAS ILKS I WYKFQCADHNLKTKKI 
YFYWICRETGAFSWFNl^LTSLBQEMEEIiGKVGFLNYRIiFLTGWDSN 
I VGHAALNFDKATD I VTGLKOKTS FGRPMWDNEFS TI ATSHPKS WG 
VFLCGPRTLAKSLRKCCHRYSSLDPRKVQFYFNKENF 

5681 

A 

14 3 

571 

KYCSQIYYLNRAAVRCRQKGLIAQVDLFLVGQCSSQLLWASYFTQML 
TOTT ATTLVHGNVLS P P S L Y ANTEAMATRR S OTW SPGSRSACERCSW 
RSMSSSLTRSSAVTARGSAPKPRCSCCWPLRSRTSRRCWWPSGATGF 
G 

5682 

A 

3 

355 

FEIGSCSITQAGVQRHD\LCSPI»PPPPGFKQSPRPSLPSS*DHRCAP 
PCPASFCIF**RQGFHHVGQAGLEIiLTSGDPPALASQSAGITGVSHQ 
AWPPFGFLDRTLS IVSQKTNTNRP 

5683 

A 

1 

2472 

RYDIALWPRLECSGAITARCNliKLLGSSNSPTSTSCI / C*GYRLIFF 
KFFVDTGSCYVAQAGFKLPASSDPPPPKMLR\YRP\GHHAR/LCLTV 
NRTNVP 

5684 

A 

5 

298 

TGSRSVAQAGVQ*H/DYQGSLQPPPPRPM*SSHLSLPRNWDYSHTAP 
HPAKLFFFLDTGSHYVAQAGLKLLGSSDPSTSASQSAGITGMSHRIQ 
PHISF 

5685 

A 

9 

320 

F I YLF/ RDGVLLCCPGQS AVAFLxtf/TVTST/ FL * SPDGELLTL* STR 
LGLLKWWDYKGEPPRPVFLFCFLNV**RCSLAMLSRLVLNS*AOAIF 
L PQPS KVLGLQG 

5686 

A 

2 

438 

FFFFFSRLENFEFYNKPFCTNPGPRGPPRVGPPFAPRRGAFSLPQGG 
PF*SLLFSPRPNP/SGPLVRPGLPPTGAIiPCSVR*VDRSPLPYMVSP 
GPPGGLRGGAGPGPVHPRGPEAPPRARGRGREGGPAESVTRFPWQLR 
GGHFOP 

5687 

A 

243 | 

379 

PASLSTNEAEHEMSLFS FQKPLNVTVIRRGGKHQLRLVPTRWAGK 

5688 

A 

1 

867 

E YVDYNGGAGVQHIALKTEDI ITAVRTPFPVRIPLS FRARDDLCKIE 
RAGGGSPGPVSLGWETDRTPSLLLATHGQAQPHPPPESSVSTEPLGF 
GGLQNDHKAVMKQVEEALHQLHARDKEKQARDMAEAHKEAMSRKLGQ 
SESQGPPWAFAKVNS I S PG SPAS IAKTQRGSKSKLSLQLEDAETDEL 
LRDLSTQI EFLDLDQI S PEEQQI SS PERQPSGELEEKTDRMPQDELG 

STPVYPL 

5689 

A 

1 

874 

TVLAFADGCGVXiAHGRSRSYGRLGRRPYSREPGLWSRGPGFTMSDEE 
ARQSGGS S QAGAVTVSDVQELMRRKE \ E I EAQ \ I KANYD VLBS QKGI 
GMNEPLVDCEGYPRSDVDIiYOVRTARH\ NT T rLONDHKAVMKOVEE\ 
AIjHQLHARDME kqa\ RDMA\ EAHKEAMSRKLGQSE sqgp PRAFAKVN 
S I S PG\ S PAS I AGLQVDD\ EI VGVSGS VNTQNFQSLHNIGS WQHS / 
EGGPGTHHPTFCLYEFDYSRDLI *KPLNVW 
AGKGLLGCNI I PLQR 

5690 

A 

30 

512 

TPVPJDMAEAHKEA\MSRK^^ 

AGLQVDDEIVEFGSW/TPQDFQSLQ\NNGNWQ\HSEGKPPECDQ* 
S ARGE KH \ OLKLVP KNAG PGKG \ TVLG C\ NTIP SAKDEL <5 PRGN S KOR 
KHPSPCPRTWVLGDFPTCPSSPLKA 

5691 

A 

945 

1358 

KKQNFIYTVHGFSCCLGRGKGQDRDGWESRDGRFLSRSIiTFPQFAII 
R1jVGOKEFSMLPKSFTLOILPRGSLSLPDPOEOT.SK*5?MPHRAVOSP 
SIUlALGNPKTAREGSAPRSPIiTGRPRPPVAAG*CRACRGAPQVARPP 
QGDQHRGFRGAHSDWRVWRGWASPSLLPRS SS SSGIANQAS \ PVPES 
PGERS /RPALGGRSGSS PCRRP/PPKPPWRGA\GS * SGTWRRWIiAGW 
RTRRPRAPGCARPARPGPPSWPSRPQTGLAAGLSGAPSVPSCRRCTP 
RRSRPPLPARWTPPGAPACRRRPWS * SPGSTTAA 

5692 

A 

3 | 

316 

FFFFFLRQSL/DSVAQAGAQGHNPGPLQAPPPRFTPFSRLSLPSSWD 
HRRP P PC PTNF S AFPVETG F\ TVLARMVS IS* PCDLLTS AS QS AGTT 
GTSHRARP 1 1 FK 

5693 

A 

2 

769 

I^IYKHAGGVFSFILLQTLNTYGPGQDTKGSVIjAYWRLPGLCIIIYL 
VKQSAKDFFSYYDINRQSVSKL\QNVEQLPP\DEIKELRQT\TMPAG 
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VDKSSTDQKYVQAGRRLGKVIQQSRLNCFPLVS YI IET\ CLFILWRH 
LByYL\LHCMPTD\SQKSLFASR\TLF*KQKDCQGFPSPQBTQS*DF 
* KVGLAYS EANHG F * TQLSGLM P INGFLENHYPKRNFLD I EGL/YI S 
KVRFFGYS FIQALVRRIRGLLRI SRN 

5694 

A 

189 

426 

TLP1SIYQSSPSGFHNYQFLASGVGATLPITPHPTS/ELPYRVPSPP 
SPSPRPPS*FSRTFQPP/V/PGPAH*PPDEIVEPG 

5695 

A 

243 

1660 

AEPDTSPI* TPGQSGS S * GNTS / EPWTVTPPGLKMQQRVLCCTL * PP 
/ RRL PVCR PAAG * TRLDRA * ES GHPG PAVPREATGVPGAIiPAQAS VR 
S I WPAJjATCVATTVQTST/ DAEVARQG/ icpaphpayppa* palccp 
ARADPRGRVLQ* *ADAVPGPPAI * AVHRAVSHPGGAQCPHPNPPAPQ 
ARVPRETCLPAX,QLAAPVLRGAGATAASAPT\EEEEEACLEEEEEEE 
DSDEEDQRSGKDS EAWVPDS EER L I LREEFTSRMHQRFLDGKDGDFD 
Y S TVDDNPD FDNLD I VARDEEER Y FDEE E PEDAPS PELDGD * WP P P F 
PPPAPSPSPTRQLIPGLLSDPFSRGTS/AAVPHNPHTPPSHWV*SHI» 
RQPPPPLFLGFPFSSLPPCLLPLSFLVSVWASFCLFLSFCLSPYPRS 

LFPVLWLAVSPFLPLSLCLRLCLHRRAQGESPPLAGGQLAGPFVAGS 
GQFGSGADP 

5696 

A 

7 

558 

GWLDLGLTKCVC*NAEVPTCEWVRLMCMSVCASK/CGGGCGCIiC/MA 
SGHCECLGMCWSL*ACVFLSKNECLCVS\LGTCQCLCLCVPVSHHLC 
IRVAHAASLCAMVCQCLCTOGWPMSDPRQGRPGWLDPAHLPSGSLAIi 
WVRVTEAFCGLKTGCSGSAIiRVAGGLPSLLPFRLVKRTPGGQGAV 

5697 

A 

272 

1146 

GFLGLEGPLSSSPQVAWATVPLWAQLQGGLAGSCLGQILAPGPSEAR 
DWWASLAGP * ERVEGADRP WRFHLSTPEALVLAAGR/ S KARASP PGR 
SSTLSLLHVL\ kVEKRRVNLPRVLSMPPVAGTACHAYDREVHLRCEL 
S PG Y YLAVPS TFL KD APGE FLLRV FS TGR VS LS AI RAVAKNTTPGAA 
LPAGEWGTGPI PG/ CS WKFG\QTAGGHKNF\ AS YPTNP/CVPLFGS P 

KGPGPRCVRITLHQHCRA\SDTEFHPIGFHIFQVPEGGRSQDAPPLL 
LQDPLLSCVPHR\TPRR 

5698 

A 

281 

535 

DSSKYRLFYIDAFSPAPAGNQGPPP^IMGMNMNNRATIPGPPMGPGPA 
MGPEGAANMGTPMMPDNGSSCTMTDFLKGPPSQMGFT 

C£QQ 
•J \> _7 _7 

j\ 

J. 

1014 

MEPEMELISITTGKTACEDLTTESKMNSVFWKIRFTYSSSRISSDEY 

TTSWAS VFVNRNLKRR YD VH I AH 1 1 RHRLHAS YDTSN I CL S EKD Y I F ! 

PSFMKLQGESHLIQLRHGKVIAIWSTLDMGREREQPPRFAQPGTYAS 

RWKALEEMEKQQREQVDRNIREAKEKLEAEMEAARHEHQLMLMRQAN 

QMVKCDPRHGKYMACCMLYRGDVVPKDVNAAIATIKTKRTIQFVDWC 

PTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWARLDHK 

FDLl^AKRAFVHWYVGEGMEEGEFSEAREDLAALEKDYEEVGVDSVE 
AEAEEGEEY 

5700 

A 

155 

2916 

QQKAAGVLPGKFYGTVFPWQRAGQQFQTDKAKGLTFIiGGKAVHLS * / 
HLEREQPPRFAQPGTYASRWKALEEMEKQQREQVDRNIREAKEKLEA 
EMEAARHEHQLJ^l^QDPGIiALVSCMQQGQSMSRSPKKVFKHIiYEI^ 
LTKLQSLGSRSRLRWKDSPFWGEEAGGKRPPLVSLALHRSPSLLLLF 
VPKHPSKGRPDCTVQRSPKPTEKSQLLDLRECISIHVGQAGVQIGNA 
CWEIiYCLEHGIQPDGQMPSDKTIGGGDDSFNTFFSETGAGKHVPRAV 
FVDLEPTVVDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKE 
IVDLVLDRIRKLADLCTGLQGFLIFHSFGGGTGSGFASLLMERLSVD 
YGKKS KLEFAI YPAPQVSTAWEPYNS I LTTHTTLEHSDCAFMVDNE 
AIYDICRRNLDIERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTE 
FQTNLVPYPRIHFPLATYAPVI SAEKAYHEOLS VAE T TM AC FFPawn 

MVKCDPRHGKYMACCMLYRGDWPKDVNAAIATIKTKRTIQFVDWCP 
TGFKVKQGFLLGLDPLLLPSSATSLHGGTGDVCRVLWCDIjSSSLPPV 
DNS TCF S GGGRGVKQT LWESLVWL FS ALALLNAG YARS EADWGHSS 
DAFS PAPAGNQGPPPMTGMNMNNRATI PGPPMGPGPAMGPEGAANMG 
TPMMPDNGAVIMASMSFQCCLEAFGVTLDSAFSPYPCLLNSIFSQQM 
LSAliSSKCIPCPITSVKVLNDWVPTAPTPSVPTPSAPTPFVPAPSAL 
LTSTAPTPSALLTSTAPTPSAPTPFTPAPSALLTSTAPTPSAPAPSA 
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LLTSTAPTPSAPTPSVPAPSALLTSTASTPPAPILSASTSAAPTPSA 
PTLSAPTI CCTY 1 1 CTYTLCTY I LR I YTI* 

5701 

A 

37 

1542 

prbptppa/ pprlqrttrtrrcgftidiksflkpge\ ktytqrcrlf 
vgnlptditeedfkrlferygepsevfinrdrgfgfirl\esqnpwa 
eiakaelgrapflksrp\lricfrlhmgaafdcsrnlfsslffqlsc 
leq\ afsqfgs \ vekagvwddrgr\ ptgk\gfvefasknlpatkri, 

WKRCCDGAF\LLTTTPRPSPLWKPMEQFDDEDGLAQRLLMHKTQQ\Y 
HKIiR \ EQPPRFCSTWGHLKFEYQSRWKA\ LDEMEKQQREQVDRNIRE 
AKEKLEAEMEAARHEHQLMLMRQDLMRRQEELRRLEELR\NQKXQKR 
KQIQLRHEEEHRRRE\ EDMIRHQ\ KQEGLRRQQEGFKPNYMENREQE 
MRMGDMGPRGAINMGDAFS PAPAGNQGPPPMMGMNMNNRATI PGPPM 
v^iro Krtnu 1 kk& kj\ x <jJM i>y w IMEQCTNDRFPQG PPSQMGS PMGSRT 
R F * NPLKHPMS G VG P VS GG PGGFGRG S P\ GGNFEGPNNRRR Y 

5702 

A 

81 

334 

m ^ v v jru^xivoj> v±Huww\W(jDNIjX*PS*KrvSWKYESRAFKTSRP 
LDPTPLLLGIQSK*/YNKGHKDLDIRILITALFWTKKL 

5703 

A 

3 

449 

WGPEP*/PTRPTLDEGPGCPGCGCSPSLPMGPPRGRGPPTSMSLLCR 
w v ¥ LjKj& ffljC i/PKARVKLrFPHPGPDVSTS PPWAGHGAHQDP * GPV 

SGSESP/PPAGHWAPPASTSSPKIQAGL*PE\GMRTAQPPSVAVGSP 
OGS FRHJCEOWf? 

5704 

A 

1 

206 

GSL*SPSPRFKRFSCLSLLSSWDYKHVPPHLANFCIF*\RDWVSPCW 
LGWSRTPDLHLSWPPKVLGLPS " 

5705 

A 

32 

401 

At ^iJlUjFWWlrKljCQVWRCGISEWAGMSEMGMSDIQLSVGKLVAR/CW 
LQAEL*G*QFTKQER/ CPSLTRSRWHGQPREGQAERPFLCAPPEGAP 
IGFLR * HHLRVQLAMAW I KFKMLGQS PLPYI 

5706 

A 

252 

416 

F* CKf FKADb V 1 KQER/ CPS LTW I RWHGQPQGGQAERPFLRAPPEGA 
STGFLRQHH 

5707 

A 

2 

164 

FFLYLFLFLFLRWSL/DSVIQAGVQWHD\ICSLQPLPPGFK*FSDCS 
1 Vijgp^RQSETGIiKYFFITV*EMESCSVIQAGVQWHDNIjFPATSASW 
VQV I LRLLHCTPA WATE 

5708 

A 

1334 

1707 

v k fh aka(j PWjRGRGD PGLG ARAL PQPR VAVAG AACQ PPS RLRHA 
GHRGPMVHAVGGSTFLGFGRQ*bSSHSSTRQCPSRDSVWGLQPHISL 
LHCPSRGSP *GPHPCHRLLPGHPGI S IHLL 

5709 

A 

57 

376 

KRGSLSGCPRPRVQW/RKTLAQLQPSKLPGLK*TFP/CIiSLLK*TGT 
HTCAPLCPNFFVFLTYLIEM\GLTMLPRLVSYFWPQVF\PSLASQST 

5710 

A 

9 

264 

PCPPNTIN/CPLGLRLPC/PPHLCPSLCLGKVGGP*KVPQTPSLGET 
oucyiuu'f V irTl/\i3W?W 1 oOKISICjGWEKSJjRGKERRVFSQEGFP 

5711 

A 

124 

770 

QYFPTATHGNVKKSLLSGKASLKRLHRVCCSCLFYIVQKQPKLNKTL 
FRQSYICDQTTFRKSKGIINIKFIFFFFEMESRSVAQAGMQWRNLGS 
jjy / fi'rrbf WFJj&cijbijIjSSVroYKRPPPCPANFFVFIjVETGFTVIiAR 
MVS IS * PCDLPASTSQSAGITDVSHHAWHIKFKPEVTWRERGREEHV 
DRH I FRLLD I RFTDTHF I KKELMRSDTL 

5712 

A 

2 

232 

CLSLLSS * YYGRMSPRPANF\ * FLVETGFHHIGEAGLELLTLGDPPA 
SASQGAG I TGVSQRGQSLFAFLMSSTSFCK 

5713 

A 

1 

242 

QAGVQWHDLGSLQPPLPGLKQFSCLSLPSSWDYRRVPPRLANFC/DF 
UVk X tFHHVGQADLELRTSGNLPTSASQSAGITGVSHHA* PKVLGLQ 
V 

5714 

A 

1 

133 

ERLR YFRVEHLDQRLKAAENKFS Y \ CLMTWLKCG FWCLEKF I KFLNR 
NAYIMIAIYGTNFCTSARNAFFLLMRNIIRVAVLDKVTDFLFLLGKL 
L I VG S VG ILAFFFFTHRIR I VQDTAPPLNY YWVP I LTVI VGS YL I AH 
GFFSVYGMCVDTLFLCFIjEDLERITOGSAERPYFMSSTLKKIjLNKTNK 
KAAES*RLRYFRVEHLDQRLKAAENKFSYVPHDI*AQMRLiIjVPGEVHQ 

IP 

5715 

B 

166 

969 

mgsacikvtkyflflfnliffilgavilgfgvwiladkssfisvlqt 
sssslrmgayvfigvgavtmlmgflgcigavnevrcllglyfaflll 
iliaqvtagalfyfnmgklkqemggivtelirdynssredslqdawd 
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YVQAQVKCCGWVS F YNWTDNAELMNRPEVTY PCSCEVKGEEDNSLS V 
RKGFCEAPGNRTQSGNHPEDWPVYQEGCMEKVQAWIiQENLGIIIXSVG 
VGVAI I ELLGMVLS I CLCRHVHSEDYSKVPKY* 

5716 

C 

300 

347 

MIHLPRPPKVLGLQA* 

5717 

A 

316 

797 

APALPQPTGLAAS SHMVQVPLW I CLKLSHS KSRNSPQL FF * VFQPLR 
RSL/SSVPQAGVQ*HDLSSLQPP/PPGFTPFSCLSLPSSWDYRCPPP 
RPAKFFLYF* * RRGFTMLARMVS IS* PRDPPASASQSAGITGVSHRA 
RPPPPQFFIQLPLYNIHTLEGR 

5718 

A 

2 

519 

WHPQSDVQAEVQWRDLGLLQSS\PP\GFTLFSCLSLLSSWDYRRPPP 
RPG * FLYF* *RRGFTVLARMVS IS * LHDPPALASQSAGITGVSHRTR 
PGADL*PRVQKPRASLGG*NHMCNMKSLPTGQTRLNKGRRI»GHFPDH 
WRWKETPGL* CFHGNFCGALKAGRAHLDGQGRR 

5719 

A 

1 

663 

MISPFINENTRRKFLIYSGSNYQGPGGLVDYLDREVIPDFLGGESVli 
GLNYLRSFRLAGWRAVLVMLTFYLQWGDTDRGQASPFSSTVKLSGL 
GLSPFFDPCSFSPPTGRCPGCDNKLYGQIxANVKKVRTGYKYIGMGRK 
ARNNVEESQEITTAWIJUjTLAIJVFLLLISTAANLSLLLSRAERNRRIi 
HGDYAYHPLQEMNGEPLAAEKEQPGGAHNPFKD 

5720 

A 

1 

1561 

MATSTGRWLLLRLALFGFLWEASGGLDSGASRDDDLLLPYPRARARL 
PRDCTRVRAGNREHESWPPPPATPGAGGLAVRTFVSHFRDRAVAGHL 
TRAVEPLRTFSVLEPGGPGGCAARRRATVEETARAADCRVAQNGGFF 
RMNSGECLGNWSDERRVSSSGGLQNAQFGIRRDGTLVTGYLSEEEV 
LDTENPFVQLLSGWWLIRNGSIYINESQATECDETQETGSFSKFVN 
VISARTAIGHDRKGQLVLFHADGHTEQRGINLWEMAEFLLKQDWNA 
INLDGGGSATFVL^GT\LASYPSDHCQDNMWRCPRQVSTVVCVHEPR. 
CQPPDCHGHGTCV \ DGHCQCTGHFW\ RGPGCDELDCGPS * LAAQHGL 
CNGSTGLSAVD\AGW\TGSNCSEECPLGWHGPGCQRPCKCEHHCPCD 
PKTGNCS VSRVKQCLQPPEATLRAGELS F FTRTAWLALTLAIAFLLL 

I S IAANLSLLLSRAERl^RRLHGDYAYHPLQEMNGEPLAAEKEQPGGA 
HNPFKD | 

5721 

A 

1119 

2008 

ALHPPAQGELFCDYFKDPSWSEGPQSPPLTRLAPHPQTQCTELSGGR 
RGSAVLEECVPTPASAHLAASPAEAGTAGVWR*RHHF/HSTGLLGPR 
* LSMSLLMS * CKAGRCPPGEDCPRPLPGCS LS PLERKLCFSSA*GPA 
SFLGGLTAKASGCSLSLRP/HSHIQGHGLWEKL*VPARRAGGGVCFK 
SA\ PHLQSL * * KPS SKAS KREG * Q * TLPS AT I GELPFK* HAPKHLCN 

AQGFLCKIH*HHK/QTANAIKA*GGPGSLEGTLRRPGSPPSSLGVGS 
GKTEVNSQNCRHLGREGVS 

5722 

A 

3 

444 

FFFFFLRQSL/NSVI*AGVQ\QPLPPEFKRLSCLSLPSSWDYRCAPA 
HSANFCIFSKD/MGFTMLTRLVSNSWQQ/CELPTLASQSGITGVSHH 
TQPI YH\ FIFVETEFHHVAQAGVK/ PPSLK* SSRLSLPKQWNYSHEP 
LYPALFNETFFFK 

5723 

A 

1 

6738 

MRGAQKATNVNKLSEVIQGKEESPAQFYQRLCEVYRMYTPCDPDSPE 
NQHMIHMALVRQSPEDMRRKLQKQAGLAGMNPSQLIiEIASQVFVNRD 
AVSRKENGKENGGQARJ^YADLFSPVEPDCVEVLDSIHSSRPDLRDQP 
WPSGDWELYVDGS S FFNPQGERGAG YAVITLDTWEATS LPQATSGQ 
KAELIAFIGALELSEGETVNIYTDSRHGTEEGQQILKQPVSBRUDFV 
CSFLQRRNSFYMGTCQDEPEQLDDWNRIAELQQRNRVCPPHLKTCYP 
LESRPSLSLGTITDEEMKTGDPQETLRRASMQPIQIAEGTGITTRQQ 
RKRVSLEPHQGPGTPESKKATSCFPRPMTPRDRHEGRKQSTTEAQKK 
± ixv/ujiCKyijpi/vr b -LJjJU I PJU^GNSIiLRRGASKKALiSKASPNT 
RSGTRRSPRIATTTASAATAAAIGATPRAKGKLGQDDVTLDNLFRVH 
EHTGEVGAVQGRGQLLQHQGLAQLRTCAPTFPAGAEGKERYGFQEVH 
PVFYVVVIAENRKHPSSPECLVSAQKVLEGSELEIiAKMTMLLLYHST 
MSSKBPRDWEQFEYKIQAELAVILKPVLDHEDGLNLNEDIiENFLQKA 
PVPSTCSSTFPEELSPPSHQAKREIRFLELQKVASSSSGNNFLSGSP 
ASPMGDILQTPQFQMRRLKKQLADERSNRDELELELAENRKLLTEKD 
AQIAMMQQRIDRLALLNEKQAASPLEPKELiEELRDKNESLTMRLHET 
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LKQCQDLKTEKSQMDRKINQLSEENGDLSFKLREFASHLQQLQDALN 
BLTEEHSKATQEWIiEKQAQLEKELSAAIiQDKARAKGDLGNKMMGPMF 
ADVYEGLCGNLKCLEEKNEILQGKLSQLEEHLSQLQDNPPQEKGEVIi 
GDVljQLETLKQEAATIAANNTQLQARVEMIiETERGQQEAKLLAERGH 
FEEEKQQLSSLITDLQSSISNLSQAKEELEQASQAHGARLTAQVASL 
TSELTTLNATIQQQDQELAGLKQQAKEKQAQLAQTLQQQEQASQRLR 
HQVEQLSSSLKQKEQQLKEVAEKQEATRQDHAQQLATAAEEREASLR 
ISRDAALKQLEALEKEKAAKLEILQQQLQVANEARDSAQTSVTQAQRE 
KAELSRKVEELQACVETARQEQHEAQAQVAELELQLRSEQQKATEKE 
RVAQEKDQLQEQLQAIiKESLKVTKGSLEEEKRRAADALEEQQRCISE 
LKAETRSLVEQHKRERKELEEERAGRKGLEARLQQLGEAHQAETEVL 
RRELAEAMAAQHTAESECEQLVKEVAAWRERYEDSQQEEAQYGAMFQ 
EQLMTLKEECEK7VRQELQEAKEKVAGIESHSELQISRQQNELAELHA 
NLARALQQVQEKEVRAQKLADDLSTLQEKMAATSKEVARLETLVRKA 
GEQQETASRELVKEPARAGDRQPEWLEEQQGRQFCSTQAALQAMERE 
AEQMGNELERLRAALMESQGQQQEERGQQEREVARLTQERGRAQADL 
AliEKAARAELEMRLQNALNEQRVEFATLQEAIiAHALTEKEGKDQEliA 
KLRGLEAAQIKELEELRQTVKQLKEQLAKICEKEHASGSGAQSEAAGR 
TEPTGPKLEALRAEVSKLEQQCQKQQEQADSLERSLEAERASRAERD 
SALETLQGQLEEKAQELGHSQSAIASAQRELAAFRTKVQDHSKAEDE 
WKAQVARGRQEAERKNSLISSLEEEVSILNRQVLEKEGESKELKRLV 
MAESEKSQKLEERLRLLQAETASNSARAAERSSALREELEHTSTQAI* 
VSELLPAKHLCQQLQAEQAAAEKRHREELEQSKQAAGGLRAELLRAQ 
RELGEL I PLRQKVAEQERTAQQLRAEKAS YAEQLSMLKKAHGLLAEE 
NRGLGERANLGRQFLEVELDQAREKYVQELAAVRADAETRLAEVQRE 
AQSTARELEVMTAKYEGAKVKVLEERQRFQEERQKLTAQVEQIjEVFQ 
REQTKQVEELSKKLADSDQASKVQQQKLKVGQLGEWVWPLDKGGKLF 
fc> iUjWvjKijrtjlj V X KWFJjLiI YKI S YATG I VG YMAVMFTLFGLNLIjFKIKP 
ED AMDFG I SLLF YGL YYG VLERDF AEMC AD YMAS TIG FYS ES GMPTK 
HLSDSVCAVCGQQIFVDVSEEGIIENTGAQLPRVLHPWLVHRGKEAN 
VSLLQREGRPQEDVQQSVSFIGVWGSGLWEESTGQCDLHLGPPGAL 
&nriv\jLvFi&vtriiL\j\ji\AJ I liOlJl^rilJj^JJJJLJKiiiiGlMNPY 

5724 

B 

107 i 

467 

MRLHETLKQCQDLKTEKSQMDRKINQLSEENGDLSFKLREFASHLQQ 
QGKLSQLEEHLSQLQDNPPQEKGEALX* 

5725 

A 

149 

579 

GLLSTRPISCRHTL*GPLQSSWEYMDVPWDLRASEGNGKLLYSARDC 

RLGSPDYGNSALLSLPGYRPT\TRSSARRSQAGVSSGAPPGEGTTGT 
YY 

5726 

B 

24 

219 

MPP *?QPT , K^OT?QQr*r3ZiWQCf^MlfT7NrV"E > T D M?7\ tttd T pun /-\r\T\ r*T vrnxr-ry 

n tr rv o o ft ± ivo *~ xvo o l. w M.V* o o y^rl r^Jun iN^lji^A-biAJiRJjGHEJjQQAGljKXKE 
AEQTCRHLTAQVRSLGGTX * 

5727 

A 

272 

683 

REGLYLSRFPFHSKLPRTQPDGTSVPGEPASPISQRLPPKVESLESL 
YFTPI PARSQAPLESSLDSLGDVFLDSGRKTRS ARRRTTQI INITMT 
KVRLLGQGLQGSNSHFLSRASLHWALSQEQGGMSSQSMLQWES 

5728 

A 

1 

406 

QELQEQLRSLEQLQKENKELRAEAERLGHELQQAGLKTKEAEQTCRH 
LTAQVRSLEAQVAHADQQLRDLGKFQVATDALKS REPQAKPQLDLS I 
DSLDLSCEEGTPLSITRSGGSLPPYVCLWSACCLSGCILVR 

5729 

A 

211 

5508 

IWCLFPLSVHMGLIYSQKCLEEKNEILQGKLSQLEEHLSQLQDNPPQ 
EKGEVLGDVLQLETLKQEAATIAANWTQLQARVEMLETERGQQEAKlj 
LAERGHFEEEKQQLS SLITDLQS S I SNL SQAKEELEQAS QAHGARLT 
AQVASLTSELTTLNAT I QQQDQELAG LKQQAKE KQAQLAQTLQQQEQ 
ASQRLRHQVEQLS S S LKQKEQQLKEVAE KQEATRQDHAQQLATAAE E 
REASLREPJDAALKQLEALEKEKAAKLEILQQQLQVANEiVRDSAQTSV 
TQAQREKAELSRKVEELQACVETARQEQHEAQAQVAELELQLRSEQQ 
KATEKERVAQEKDQLQEQLQALKESLKVTKGSLEEEKRRAADALEEQ 
QRCISELKAETRSLVEQHKRERKELEEERAGRKGLEARLQQLGEAHQ 
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AETEVLRRELAEAMAAQHTAESECEQLVKEVAAWRERYEDSQQEEAQ 
YGAMFQEQLMTLKEECEKARQELQEAKEKVAGIESHSELQISRQQNE 
LAELHANLARALQQVQEKEVRAQKLAD^ 

TLVRKAGEQQETASRELVKEFARAGDRQPEWLEEQQGRQFCSTQAAL 
QAMEREAEQMGNELERLRAALMESQGQQQEERGQQEREVARLTQERG 
RAQ7UDLALEKAARAELEMRLQNALNEQRVEFATLQE70AHALTEKEG 
KEQELAKLRGLEAAQIKELEELRQTVKQLKEQLAKKEKEHASGSGAQ 
SEAAGRTEPTGPKLEALRAEVSKLEQQCQKQQEQADSLERSLEAERA 
SRAERDSALETLQGQLEEKAQELGHSQSALASAQRELAAFRTKVQDH 
S KAEDEWKAQVARGRQEAERKNSLI S SLEEEVS I LNRQVLEKEGESK 
ELKRLVMAESEKSQKLEERLRLL\QAETASNSARAAERSSALREELE 
HTSTQALVS ELLS A\ KHLCQQLQAEQAAAEKRHREELEHS KQAAGGL 
RAELLRAQRELGELIPLRQKVAEQERTAQQLRAEKASYAEQLSMLKK 
AHGLLAEENRWLGERANLGRQPLEVELDQAREKYVQELAAVRADADT 
RLAEVQREAQSTAREL33VMTAKYEGAKVKVLEERQRFQEERQKLTAQ 
VEQLEVFQREQTKQVEELSKKLADSDQASKVQ*QKLKAVQAQGGESQ 
QEAQRLQAQLNELQAQLSQKEQAAEHYKLQMEKAKTHYDAKKQQNQ* 
LQEQLRSLEQLQKENKELRAEAERLGHELQQAGLKTKE\AEQ\TCRH 
LYLPRLRSLE\AQ\VAHARPSSFRDLGKFQVAT\DALKSREPQAKPQ 
LDLSIDSLDLSCEEGTPLSITSKLPRTQPDGTSVPGEPASPISQRLP 
PKVES LE S L YFTP I PARSQAP \ LE S S LDSLGD VFL \ DSGRKTRS ARR 
RTTQI INI \ TMTKK\ LDV\ EEPD/ SAPNLSF YS \ TRSAPASQASLRA 
T \ S S T * S LT&LG \ S PDYGN S ALLS L PG YR PTTRS S ARRS QAG VS SGA 
PPGRNSFYMGTCQDEPEQLDDWNRI AEL\ QQRNSELWPPNLKNWLS P 
WESRP\SLSLGTITDEEMKTGDPQETLRRA\SMQPIQIAEGT\GITT 
RQQRKTGLP*EPHQGPGN\P*VLKKATS\CFPRPMTPR\DRLEGAQT 
EHLLEAQK3CAAPAFLLKQA\DRRQSMAF\SILQPHPRSLG\NSLLRR 
GAS KKAL S KAS PNTRS GTRRS PR IATTTAS AATAAAI GATPRAKGTA 
NHLKGQVPVSGPTLCPRCLPSPGPPPTVPLSAFSQLPRPNSSLTS 

5730 

A 

111 

407 

NLLRTMIFWFGFEDRVWLCCPVLTATSVLTATS/TFPGFKQFSCLSL 
LSSWDYRCIPPHPGNLWYF**RQGFTMLARLDSNSGPQAIHPPQPPK 
VLGLHV 

5731 

A 

3 

615 

GLRALRRGQGLPGL*CCSRPQPSPRAQGHPSMGGPMQRVTPPRGMAS 
VGPQSYGGGMRPPPNSLAGPGLPAMNMGPGA7RGPWASPSGNSIPYSS 
SSPGSYTGPPGGGGPPGTPIMPSPGDSTNSSENMYTIMNPIGQGAGR 
ANFPLGPGPEGPMAAMSAMEPHHVNGSLGSGDMDGLPKSSPGAVAGL 
SNAPGTPRDDGEMAA 

5732 

B 

134 

471 

MYAKGGKGSAVPSDSQAREKLALYVYEYLLHIGAQKSAQTFLSEIRW 
EKNIMPSPGDSTNSSENMYTIMNPIGQGAGRANFPLGPGPEGPMAAM 
SAMEPHHVNGSLGSGDMDG* 

5733 

B 

223 

692 

MEPSPRAQGHPSMGGXMQRVTPPRGMASVGPOSYGGGMRPPPNSLAG 
PGLPAMNMGPGVRGPWASPSGNSIPYSSSSPGSYTGPPGGGGPPGTP 
IMPSPGDSTNSSENMYTIMNPIGQGAGRANFPLGPGPEGPMAAMSAM 
EPHHVNGSLGSGDMDG* 

5734 

B 

228 

292 

XLARDDHERVMGRQPRASLRA* 

5735 

A 

1 

1753 

MYAKGGKGSAVPSDSQARENLSAFQAATELTSLDRLALYVYEYLLHI 
GAQKSAQTFLSEIRWEKNITLGEPPGFLHSWWWYGLGCCGVFWDLYC 
AAPDRREACEHSGEAKAFQDYETPERPAHRHGNAGRAGS S SGMIDVG 
GSSDPQILRPTTPVCSLTNASSLSHGAGHLQRGGGTPQSAAAAPSPV 
MGSMAPGDTMAAGSMAAGFFQGPPGSQPSPHNPNAPMMGPHGQPFMS 
PRFPGGPRPTLRD\GSQPPAGPPWVPSPSSPG\AMEPSPRAQGHPEH 
GRPNARGVTPPRG/MGPAWGPRAYGGGMRPPPNSLTRPRACLPMNMG 
PRKFVGPWAQPPVEYSIP\YSSSSPGSYTG\PPGGGGPPGTP\IMPS 
PGDS\TNSSENMYTIMNPIGQGADRANFP\LGPGPEGPMAAY\G\GM 
EPHHVNGSLGSGDMDG\LPREFPQAPVAGLSNAPG\TPRDDGEr4AAA 
GTFHAPSSQSE\NYSP\GMTMSRVNVJAAAPGALCGPRLLPRRPCLRA 
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KGLKVTPSGTLDSLANQGLPMLGGPTRKTLTILLKTQGPRETFFSVW 
TLPAICILVPERKALWGGPSSPGRQGGGAH 

5736 

B 

315 

458 

NGPSRTQQPSLTDVLYCPEAIVSLVGVLRRLAACQEHKRAPEVYVAF 
T* 

5737 

A 

202 

402 

RRVLHTHTGCFWFCKISRSSSESDESLNFASKLWQERRGFQHGDPS 
RLPLDSREPRGPLRSPAATK 

5738 

A 

1 

3156 

LLLWDWVICQLSKELAENRCNQSEPRSRSACVEIAAFTFNMAKMFSF 

NDSNAARIHQDTQELMCFGRWCWKDGIPGSLCIPCSQPMSISTVQQG 

KKAASSHYSRGGAKYEGEAVTCRSLVESYTHPNSNETERRENIDTVMN 

CFSTEDFDFVTLYYREPDNVGHRFGPEAEKRKLMIQQIGRTIVYLVG 

ATEKHSLQSTSAPWGDHREEETQRQQDPLTNYIKFRVWVKFDIAGYG 

GFGLPLPKLGQEEALYQALKNAHPHLQVYKKEEFPEHFHLAKHDRVIi 

PIVIFQRIIMCFNKGSHGFDNVLMDMNIIFRAFGPDFKRNRLAEPFN 

S I H I YP FV S PG S HPQTHNG S LAVTQEMLMNS YDQQPGREL I KTAKAK 

QCPGPVLKPTFLNHPDPSQSRLGFRPVSRQRCGPAPVLVPEENAGTE 

LLLQSFERRFIJVARALRSFPWQTVKHPVCVKHRLSVKYARCFLSELT 

KKHEAVHTELLDlOiYEDIiAETLMAKESTQGHPSYIiLNSVVSGMFDEW 

NPWPEMSVFHATTVLELGSGASLTGLAICKMCRLQAYIFSDCHSQV 

LEKLRGNVIiLNGLSLEAD I SANLD S PRVTVAQLDWDVATVHQLS AIQ 

PD W I AAGNAQ PRALC AGDVL Y C PES I VSLVAVLRRLAAYREHQRAP 

QVYVALIVRNPETCQLFTTELSFAPDHPEHSPSWQPCAQMHAQQPLP 

AHRDVGPMPVHVGQRVNYRANKQASTRRHTGFHDKHFLCANKNVCIK 

YDVPPNVDTLGS SQS QARCS CH I CGHTRFLTKPS PLQLPHRWTHS VP 

HKAKPAAAATSLDTLGSSQSQAHCSCHIPGHTRFLTKPSPLQLPHRW 

THS VLHKAKPAAAATS VDTLDS SQSQAHC S CH I PGHTRFFTKPS PLQ 

LPHPWTHSVLHKAKPAAAATSLDTLGSSQSQARCSCHIPGHTRFLTK 

PS PLQLPHRWTHS VLHKAKPAAAVTSVDTLGSSQSQARCSCHIGGHT 

RFLTKPSPLQLPHPWTHSVLHKAKPAAAATSLDTLGSSQSQARCSCH 

I PGLMVQQVLFSRQESKC 

5739 

A 

1 

1470 

MAPEENAGTKLLLQSFERRFLAARTLRSFPWQVGGGASGEACGACGS 
PGADGMGLPAETPGAPATEGSHSALPTHSIACFNVPVNAPLRFLSLP 
STRSLEAKLRDSSDSELLRDILQKHEAVHTELLDELYEAIAETLM7UC 
ESTQGHWSYLLPRYPSCTDKVKTPRKARSHSPSTAIISHGTTGLVTW 
DAARYLAJ2 WAI KITLAAFTNRTVLELG SGAGLTGLS I CKM / WPPPD IH 
L\ HDCHSRVLEQLRGNVLLNGLSLEADI TANXiDSPRVTVAQLDWDVA 
TVHQLSAFQPDWITADVLYCPEAIVSLVGVLRRLAACWEHQWAPEV 
YMAFTVRNPETCQLFTTELAPSTCEGVLSLSHLTDKNTRTHRGQETP 
ELPQIAARPPGFSAFTVHPSPPLPPDFAPAPPEHSPSWQPCAKMHPQ 
QPLPAHRDMDDPVPVHVGQPVNYRANKQASTRRHTGFHDRVPHNLAQ 
EAGQAWGSGLSSQHFRRLRQED 

5740 

A 

2 

414 

PWGDHREEETQRQQDPLTNYIKFRVWVKFDIAGYGGFGLPLPKLGQE 
EALYQALKNAHPHLQVYKKEEFPEHFHLAKHDRVLPIVIFQRIIMCF 
NKGSHGFDNVLMDMNI I FRAFGPDFKRNRLAEP FNS IHI YPFV 

5741 

A 

3 

233 

QARLPPGLPATLRPRPRLGARGERGDRTLAAEFRAPLPGGARTAFLP 
LADCEASCVCEAPAVSQVCPVLSLRTHQK 

5742 

A 

83 

1503 

SWNTPYNHWATKQ/LSSHYSRGGAKYEGEAVKQSLVESYTHPNSNET ! 

ERSANIDTVMNWFTKEDFDFVTLCYREPDNVGHRFRPEAENRKLMIQ 

QIDRTIRPWDDHREEETQCQQDPLSNYIKFMDLVKFDIVGYGGFGMP 
LPKLG0EEALYOALKNAYPHTiHTYK'K'Pr , T?DT?tJT?tJT MomDirr ht^tviv 

ANFGYSINGISLLYFYTHLCDKYFNVHFFHEPLSLWRAQTTPSLSHR 
Y I S LVRHMDATNLDASEAQLPGS PYPD S APGPANLDS TQDLPQRRL S 
WPYPRPRPPNAHPGPALQPAPRTLLLTLPPGPAPSLPPRTALGPPPL 
QS R PG FR P VS R QRCG P AHFMAPEENAGTELLLQS FERR FL AARTLRS 
FPWQGGGGRAERPAGLAGVQGQTGWVSVLKP PALL PRE FLGSPRSGP 

RDPVPVPGAWPKSLAPLLPRELAAEDEGVSTALGGQTRARCGVQFTI 
IFTTF 
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5743 

A 

266 

477 

DFSLSRPHGA*KQN*ETHQILSCC/ GDILHKTVK/HSCVCEAPAVRQ 
ICPVLSLRTHQKARGCPHRAFGRAV 

5744 

A 

14 8 

^ 

SHVALGQLGWLTRAVRSSWRWELCVSSTGSGLPAFSLKQPKSCRCW* 

EEVKTGSCVGISHWQTST/VFPQAVFHLPPVLLPRSLACLSCPRGCA 

QIR\CQ*TPPYPLFPIILTYSFGMLSSTVLIRHILETNKKEKSLLAA 
EGVRHLQVS 

5745 

A 

164 

667 

ERGT Y I CQ I TTS L YRAQQ 1 1 QLN I QAS PKVRL S LANE ALL PTL I CD I 
AGYYPLDVWMWTREELGGSPSPSLWVSSFSKPQAKALAGTYSISSS 
jji/uif Kif kJjUVFJjX iCQVTHISLEEPLGASTPGCPTRAENSLGSHLC 
QQSLPSCTDVPWGFRDGQAPNRTWAASG 

5746 

A 

1 

790 

VEELSIGCLADSDQASKVQQQKLKVGQLGEWVWPLDKGGKLFSKLWGR 
GGLVYKWFLLIYKISYATGIVQYMAVMFTLFGLNLLFKIKPEDAMDF 
GISLLFYGLYYGVLERDFAEMCADYMASTIGFYSESGMPTKHLSDSV 
CAVCGQQ I FVD VS EE / EDH * EHRCS / C FHE FC I RG WC I VG KKQTCP Y 
CKEKVDLKRMFSNPYPLLGSLLGVGCGKKVLASVTCIWVLLGPSLLP 
PTRTCQTMNNKKI MKVMMR I KRMMTDTMAS 

5747 

A 

x 

X V/ v 3 

FRGRAVKMAAVVEVEVVGGAAGERELDEVDMSDLS PEEQWRVEHARM 
HAKHRGHEAMHAEMVLI L I ATLWAQLLLVQWKQRHPRS YNMVTLFQ 
rirv v v fij i r 1 VAijrtWWKr LVXWILFSAVTAFVTFRATRKPLVQTTPRL 
VYKWFLLIYKISYATGIVGYMAVMFTLFGLNLLFKIKPEDAMDFGIS 
LLFYGLYYGVLERDFAEMCADYMASTIGFYSESGMPTKHLSDSVCAV 
CGQQIFVDVSEEGDH\ETTYRLSCNHVSHEFCIRGWCIVGKKQTCPY 

CKEKVDLKRMFSNPWERPHVMYGQLLDWLRYLVAWQPVIIGWQGIN 
YILGLE 

5748 

B 

99 

236 

XTTS TS S TVTVTAPG P AATG S PVKKHR PLLPKETAP AVQR WWNS SX 
* 

5749 

A 

1 

783 

MAEl^QSLEQERDRLIAEVKKQLELEKQQAVDETKKKQWCANCKKEA 
I FYCCWNTS YCD YPCQQAHWPEHMKS CTQS ATAPQQEADAEVNTETL 
NKSSQGSSSSTQSAPSETASASKEKETSAEKSKESGSDSJVHDGYFS 
S SLLCFGPPCI S PCVT I PGRSGLLLCPLQGLRVS KRCDKQPAYAPTT 
TDHQPHPNYPAQKCKYEPVLRKLPGLLPDHSRSNKSSWSSSDEKRGS 
TRSDHNTSTSTKSLLPKESRLDTFWD 

5750 

A 

1 

1626 

MDLKELSESVQQQSTPVPLISPKRQIRSRFQLNLDKTIESCKAQLGI 
NEISEDVYTAVEHSDSEDSEKSDSSDSEYISDDEQKSKNEPEDTEDK 
EGCQMDKEPSAVKKKPKPTNPVEIKEELKSTSPASEKADPGAVKDKA 
SPEPEKI)FSEKAKPSPHPIKDKLKGKDETDSPTVHLGLDSDSESELV 
J. u lAjfcDHS GREGRKNKKEPKEPS pkqdgktpp s ttvgs hs ppetpvl 
TRSSAQTSAAGATATTSTSSTVTVTAPAPAATGSPVKKQRPLLPKET 
APAVQRWWNSSSKFQTSSQKWHMQKMQRQQQQQQQQNQQQQPQSSQ 
GTRYQTRQAVKAVQQKEITQSPSTSTITLVTSTQSSPLVTSSGSMST 
u v j>o vrtfUjLth' x A 1 iU>AU VAAD1 AKYTS KMPFFRVYLLKHLESWGSW 
I PGLEATAMHLAS S VMQEQLGWGRRPACRWL WAPHWLTP WVLS PQLW 
EWREGALEVFGANVYIADTISHLLKPCPYGPWGSRHCGNQEDSQMS 
PEEGAVPPLVKNLCSRELQNSLDYG 

5751 

A 

1 

542 

CGWNTGYCD/ YPCQQAHWPEH I KSCTQS \ ATALQQGADVEVNTEPLN 
KSSQGSSSSTQSAPSETA\SASKEKETSAEKSKESGSTLDLSGSRET 
PS S I LLGSNQGS VS KRCDKATLLCPNHHRPPAAPQLPRPEVPFPEL * 
NSRLSPPVFKKKGIRRVPIPNTSTSTKSLLPKESRLDTF\WD 

5752 

A 

1779 

3628 

MEISEDVYTAVGHSDSEDSEKSDSSDSEFISDDEQKSKDEPEDTEDK 
EGCQMDKEPSAVKXKPKPTNPVEIKEELKSTSPASEKADPGAVKDKA 
SPEPEKDFSGKAKPSP^PIKDKLKGKDETDSPTVHLGLDSDSE\NEL 
VIDLGEDHSGREGRKNKKEPKEPSPKQDWGKTPPSTTVGSHSPPET 
PVLTRSS AQTSAAGATATTSTSSTVTVTAPAPAATGS PVKKQRKL\ I 
PKE\TAPA\VQRVRVELIK*SFKRPPKSGTCRRCSVSSSSSSSKTSS 
SSHQSSQGTRYQTRQAVKAVQQKEITQSPSTSTITLVTSTQSSPLVT 
S SGSMSTLVS SVNADLP I ATAS ADVAAD I AKYTSKL \MDAI KGTM\ T 


l 04 VI 
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E I YNDLSKNTTGSTI AE IRRLR I E IEKLQWLHQQELSEMKHNL>ELTM 
AEMRQSWEQERDRLIAET\nCKQLEIiEKQQAVDETKKKQWCANCKKEAI 
FYCCWNTSYCDYPCQQAHWPEHMKSCTQSATAPQQEADAEVNTETLN 
KSSQGSSSSTQSAPSETASASKEKETSAEKSKESGSTLPLSGSRETP 
SSILLGSNQGSDHSRSNKSSWSSSDEKRGSTRSDHNTSTSTKSLLPK 
ESRLDTFWD 

5753 

A 

2 

1711 

RSEDSEKSDSSDSEYISDDEQKSKNEPEDTEDKEGCQMDKEPSAVKK 
KPKPTNPVEIKEELKSTSPASEKADPGAVKDKASPEPEKDFSEKAKP 
SPHPIKDKLKGKDETDSPTVHLGLDSDSESELVIDLGEDHSGREGRK 
NKKEPKEPSPKQDWGKTPPSTTVGSHSPPETPVIiTRSSAQTSAAGA 
TATTSTSSTVTVTAPAPAATGSPVKKQRPLLPKETAPAVQR\SCGTS 
STVQQKElTQSPSTSTITLVTSTQSSPLVTSSGSMSTIiVSSVNADLP 
IATASADVAADI AKYTSKMMDA I KGTKTE I YNDLSKN\ TTWKAQLAE 
DSQGIiRIEIEKLQWLHQQEL\SEMKHNIjELTMAEMRQSLEQERDRLI 
AEVKKQLELEKQQAVDETKKKQWCANFKKEAIFYCCWNTSYCDYPCQ 
\QAHWPEH\MKSCTQSAYCSSSRKADAE\VNTETLKLSPSQGSS\SS 
TQSAPFRKRPAPSKEK\ETFSLRKAKESGSTIiDLSGSRETPSSILLG 
SNQGSDHSR\SNKSSWSSSDEKRGS\TRSDHN/TPSTQHGRSLLPGK 
ESRAGTPFLGTSK 

5754 

A 

1 

191 

MQK* ITAWAPAPMKIKI IASPERKYSWIGGS I WPQLST/ FQQMWIS 
KQEYDESGPSIVHRKCF 

5755 

A 

2 

1203 

LSRRCQLSHSVLPPLRRRVSLPVAMEEEIAALVIDNGSGMCKAGFAG 
DDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGILTLKYP 
I EHG I VTNWDDME KI WHHTF YNELR VA PEEHP VLLTEAP LNP KANRE 
KMTQIMFETFNTPAMYVAIQAVLSLYASGRTTGIVhmSGIXSVTHTVP 
IYEGYALPHAILRLDLAGRDLTDYLMKILTERGYSFTTTAEREIVRD 
I KEKLCYVALDFEQEMATAAS S S SLEKS YELPDGQVTTIGNERFRCP 
EALFQPSFLGMESCGIHETTFNSIMKCDVDIRKDLYANTVLSGGTTM 
YPGIADRMQKEITALAPSTMKIKI I APPER KY S VW I GG SI LAS LS TP 
QQMWISKQEYDESGPSI\VHRKCF 

5756 

A 

3 

412 

SRFPEGLFQPFFPG\MKSCGIHETTFHSH/lIKFDVAIR\KDIiYANT 
LLPGGNHQVSGALIiTGMQKE IHAPAAQATLRFKI I APPGSASTRW/ V 
GS VGS I LASLSTFQ\ QMW\ I SKQE \ YDESGPLHS STAKCFLNGLSQ I 
A 

5757 

A 

1 

2297 

MGRDVmSLESLEEDRKMRESLEHARDLLNCCDQNADHHMDSEVQAEV 
VSEGDEELIGKWS KAYCHRCEQKDDLKWEL I FKREAEHKSLE ILQPD 
HWEKKNPGGGRNQS TS S S WVGTVAE I TT I EAVKCKI QVLQQQADDA 
EERA\ ERSQREFREERRARE\ QAEAEVASLNRR1 Q\ LVEEELDRAQE 
RLATALQKLEEAEK/ SLADESERRY*RLFEN\RALKRLKEKIGTSRE 
IQLK\EAKHIAEE\AD\RKYEEV\ARK\LVIIEGDLERTE\ERA\EL 
AESRCREMDEQIRLMDQNtfKCLSAAEEKYSQKEDKYEEEIKILTDKL 
KEAETRAEFAERSVAKLEKTIDDLEDTNSTSGDPVEKKDETPFGVSV 
AVGIiAVFACLFLS TLLLVLNKCGRRNKFG I NRPAVLAPEDGLAMSLH 
FMTLGG S SLS PTEGKG SGLQGH 1 1 ENPQYF SD AC VHH I KRRDI VI*KW 
ELGEGAFGKVFLAECHNLLPEQDKMLVAVKALKEASESARQDFQREA 
ELLTMLQHQHIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDA 
KXLAGGEDVAPGPLGLGQLI^VASQVAAGMWLAGLHFVHRDliATRN 
CLVGQGLWKIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYR 
K^TTESDVWSFGVVLWEIFTYGKQPWYQLSNTEAIDCITQGRELERP 
RACPPEVYAIMRGCWQREPSNATASRMCTPGCKPWPRHLLSTWMSWA 
RGPAQGLGWSRNTGACPQHPP 

5758 

A 

3 

636 

WAWNQAEAEVAS VNGR I QLVEEELDCAQERIiATALQKLE EAGKAADE 
SERGMKVIENRALKDEEKMELQEIQLKEAKHIAEEADRKYEEVARKL 
VIIEGDLERTE\ERAELAESRCREMDEQIRLMDQNLKCLSAAEEKYS 
QKEDKYEKE I KI \ LTNKLREAET\ RAEFAERS VAKLEKT I DDLED KL 
KCTKEEHLCTQRMLDQTLLDLNEM 
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5759 

A 

1 

239 

QCCSGGTHGNPAIGDR\MQKQILPWAPAQMKIRFMAPPERKYSVWIA 
APILASLSTSSRMWISKQEYDESGPS IVHRKCF 

5760 

A 

3 

1458 

ADPPPVHTRRQLTMDDDIAAI.WDNGSGMCKAGFAGDDAPRAVFPSI 
VGRPRH QG VMVGMGQ KDS YVGDEAQS KRG I LTLKYP I EHG I VTNWDD 
MEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETFN 
I PAn x V A 1 y AV u b Jj i Ab GKTTQ I VMD S GDG VTHTVP I YEG Y AL PHA I 
LRLDLAGRDLTDYLMKILTERGYS FTTTAERE I VRD I K/ EEAVLRRP 
GLRARDGHG\ASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSFL 

C*KKITALEP\ATMKI\RIIA\PPEAQSTPLWNRGAPIL\ASLVHLP 
AKMWDQKSREYDESGPLPFVHRK\ CFF 

5761 

A 

171 

32 6 

SPSR 

5762 


3 

JO J 

x\J l^i->TJ ^ ^ IVUDHr ZZ»X\.I\JM iltil LxvULiXivX i-J x\^/\ t> JCjXjXv III 2\_L) X X"^Xj1\XjX UO W 

MGKGAAKLRVAGWAPTIiPREDLEKIIAHFIilQQYLKEDYSFTAYAT 
ISYLKIGPKANLLiNNEAHAITMQVTKSTQNSFR 

5763 

A 

2 

314 

FFFFISALKALFAFLQILLFQVlWLRITAHIVISFINLLSVTPSKAF 
LLLAFIFCREDYSFTAYATISYLKIGPKANLt,NN\EAYVIT\MQVTK 
STQNS FRVNG Y 

5764 

A 

1 

176 

MRTFALLTAMLLLVA/HAQAEPLQARADEAAAQEQPGaDDQEMAHAF 
TWHESAALPLSA 

5765 

A 

1 

2674 

GKHKLLSTGPTEPWSIREKLCLASSVMRSGDQNWVSVSRAIKPFAEP 
GRPPDWFSQKHCASQYSELLETTETPKRKRGEKGEVVETVEDVIVRK 
LTAERVEELKKVI KETQERYRRLKRDAELI QAGHMDSRLDELCNDI \ 
KKKLEEEEAEVKRKATDAAYQARQAVKTPPRRLPTVMVRSPIDSASP 
GGDY PLGDLTPTTMEEATSGVTPGTLPS TPVTS FPG I PDTLPPGSAP 
LEAPMTPVTDDSPQKKMLGQKATPPPSPLLSELLKKGSLLPTSPRLV 
NESEMAVASGHLNS TGVLLEVGGVLPM I HGGE I QQTPNTVAAS PAAS 
ESVSQATIVMMPALPAPSSAPAVSTTESVAPVSQPDNCVPMEAVGDP 
HTVTVSMDSSEISMI INSIKEECFRSGVAEAPVGSKAPS XDGKEELD 
LAEKMD I AVS YTGEELDFETVGD 1 1 A 1 1 EDKVDDHPE VLDVAAVEAA 
LSFCEENDDPQSLPGPVTCHPIQQERDKPVPLPAPEMTVKQERLDFEE 
TENKG I HELVD I REPS AE I KVEPAE PE P V I SGAE I VAG WPATSMEP 
PE LRSQDLDEELGS TAAGE I LEADVA I GKGDETPLTNVKTEAS PESM 
LSPSHGSNPIEDPLEAETQHKFEMSDSLKEESGTIFGSQIKDAPGED 
EEEDGVSEAASLEEPKEEDQGEGYLSEMDNEPPVSESDDGFS IHNAT 
Xj\j b n X JLiAL> is ± F b fa FAb £> Q VCS ED QEA I QAQ K I WKKA I MLtVWRAAA 

VTUO V2iKr\7T?T.Ol>\7 T TTYr»T 71 DflVUOT^fAD T>MT"\T CPTirwiTrMnT TncrnT? 
iMriivx/vn vr ijyr V XJJUJLJ\k'\3 IrioX V yK ir nXJij o X x IvKJN X h>N \jLj1. K.o X A_b» 

FQRDIMLMFQNAVMYNSSDHDVYHMAVEMQRDVLEQIQQFLATQLIM 
QTSESGISAKSLRGRDSTRKQDASEKDGGTRGRRCAIEaDMKMKK 

5766 

A 

1 

419 

YANVFLQP\ VQDD IAP\ GS PS IGQR\ PMDLST/ IKKNI * NG * SEAPA 
DFQR*HMx^FQNLLMYHSSDHYFYHIAI\EMQ\RDVLEQIQQFLATQ 
LIMQT\SESGI\SAK\SLRGRDSTR\QQDASEMDGGTRGR\RCAIEA 
DMKMKK 

5767 

A 

2 

111 

ISAKSA\RGRDSTRKQDASEKDSVPMGSPAFtiLSLF ! 

5768 

A 

3 

2884 

GKHKLLSTGPTEPWS IREKLCLAS S VMRSGDQNWVSVSRAIKPFAEP 
GRPPDWFSQKHCASQYSELLETTETPKRKRGEKGEVVETVEDVIVRK 
LTAERVEELKKVI KETQERYRRLKRDAELI QAGHMDSRLDELCNDI A 
TK'KKT.PF'PP AP VTCRKZiT'na zi voziT?r > i2i\7'Pr*T , t>o'D'DT DTTTvnrD coTncac 

x i\s\s\±JiuJCjiui^t\iu v JvxvlvrlX 1/nHI V JVX r^xr rCrCLilr X Vrl VKor lUO/w 

PGGDY PLGDLTPTTMEEATSGVTPGTLPS TPVTS FPG I PDTLPPGS A 
PLEAPMTPVTODSPQKICMLGQKATPPPSPLLSELLKKGSIjLPTSPRL 

vne s emavasghlns tgvllevggvlpm i hgge i qqtpntvaas paa 
sgaptlsrlleagptqfttplasfttvaseppvklvpppvesvshat 
ivmmpalpapssapavsttesvapvsqpdncvpmeavgdphtvtvsm 
dsseismiinsikeecfrsgvaeapvgskapsidgkeeldlaekmdi 
avsytgeeldfetvgdi iai iedkvddhpevldvaaveaalsfceen 
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DDPQSLPGPWEHPIQQERDKPVPLPAPEMTVKQERLDFEETENKGIH 
ELVD I REPS AE I KVEPAE PE PVI SG AE I VAG WPATS ME PPELRSOD 
LDEELGSTAAGEIVEADVAIGKGDETPLTNVKTEAS PESMLSPSHGS 
NPIEDPLEAETQHKFEMSDSLKEESGTIFGSQIKDAPGEDEEEDGVS 
EAASLEEPKEEDQGEGYLSEMDNEPPVSESDDGFSIHNATLQSHTLA 
DSIPSSPASSQFSVCSEDQEAIQAQKIWKKAIMLVWRAAANHRYANV 
FL\QPVTDDIAPGYHSIVQRPMDLSTIKKNIENGLIRSTAEFQRDIM 
LRFQNAVMYNSSDHDVYHM\AVEMQRDVLEQIQQFL\ATQLLMQTS\ 
ESGISAKSLRGRt>S\TRKODASEKDSVP\MGSPAF\LLSLFDGGT\R 
GRPLCPLKPDMKDEKSESPSELPPL 

5769 

A 

3 

627 

PRGE^FKAQPEWRKTPCQMMLSQSTFRKT\YIGKIFTTIlii\L\PYV 
G\ KELNMI IMLPDE\ TTDLSTV\ EKDSTHAKFVEWTSLDMMAEEEVK 
LSIPRFKLEESY\DMESVLRNLGMTDAFEAGARANFSGMS\QTDLSL 
VQRSCTKSICGRSNEGRPREAASPPKVAI\MMMRCA\RFCPPPPAPD 
HPFLF\FIQHR\KTNG\ILFCGRFSSP 

5770 

A 

3 

626 

YNLVTFR/LAVLSFEKDDDHNGHIDFITAASNLRAKMYSIEPADRFK 

TKRIAVKIILVKPTTTGTVSGLVCFE\MIKVTGGYPFAAYKNCFLNIi 

SHS/lFVVFTETTEVRKTKIRNGISFTIWDRWTVHGKEDFTIiLDFIN 

AVKEKTYGIEPTMVVQGVKMLYVPVMPGHAKRLK^ 

YVDLTVSFAPDIDGDEDLPGTSK 

5771 

A 

92 

265 

SFELFADK\TPKTA/WLDGKHWFGKVKEGMNIVEAMERFGSRNGKTS 
KKITIADCGQLE 

5772 

A 

1 

1310 

GNS PPSELKWKAKS EDLRHRGLKAQAE I KGS TQQ IG FTTDPRMARSS 
pyptdvarwnapi fhvnsddpeav^^yvckvaaewrstfhkt)vvvdl 
VCYRRNGHNEMDEPMFTQPIJ^YKQIRKQKPVLQKYAELIjVSQGVVNQ 
PEYECVSMHGVRNKPSYNSTKSSMDGLILHPATGLVFVLSKQCEEIH 
QPVVWTCEQREAENATAEENRVLLA>TVNPTVFFDIAVDGEPLGRVSF 
EVGRAAACGNGAQKVGRGRENFPCL * ATGEK\ GFWL* G VPCFHRLFP 
RVLCVQGGELQQRH*WPLVGKPHSMGKKI * KIEELPSLKHTGSPGIL 
S PWQNAGPQTTNGFPS FS I CTA\ KT\ EWLEWPSHWFWHK* KEG\MN 
IVEAHWSRFG\SR\NGKTQQRRSPFADCGQLIiISLTCVFILTTKIIS 
FCRLRRAPLQPHLLAVS 

5773 

B 

47 

204 

XQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDEGTVRSMVTEBFNGS 
DWEKA* 

5774 

A 

2 

402 

CGDRGALRPPSCAGRSGPGPPRPPRPPRPLPWHPAPPAHGAPIiARPG 
ARARRSEKPPSEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPP 
TPA\ PRRPRRP\ PGPGRAAG ALGDVPEAAEAYLERVPPS SC 

5775 

A 

2 

241 

FSCLSLLKSWDYRALPPSPAN\LFLVETGFHHVGQASLKXLTSGDPP 
AS ASQTVG I TGMSHHAWP VS S FY I WSHWAPRNS 

5776 

A 

22 

643 

WEFPVLTMAATS GI LS ALGNFLAQM I EKKRKKENSRSIiDVGG PLR Y 
AVYGFFFTGPL\ SHFFYFFMEHWI PPEVPLAR\ VRRLLLiDRL\ FFAP 
\AFLMLFFLIMNFLEGKDAS\TFAAKMRGG\FWPAAEDETCRVWTP\ 
LQFINl\imrPFESSGCFFGOPGQLLFWYAYLALLGEVTTRWENIRC 
TCGRGSGGLHPPKREQKPIQSGCH 

5777 

A 

23 

448 

ELESRSVTQAGVQWCDLGS\LQPPPL/SFKQFSFLSIiLSSVnDYRHVP 
PCPANVC ILVEMGLVGQAGLELL I S SDTP I SVSQSAGX I S VSHGTQP 
HIGNFLEQLKWQLCYTSFLCFTKSSTVKHIQLIIIKITIiRNTKNKA 
RI 

5778 

A 

25 

396 

ISGRSTFSLFSRQGLSALSPRLE\CSVAISAHCNLRLPGSSN\GTTG 
A*HHTRLILYF/LAEMGFHHVGQAGFEVLTSSNPPASASTSARITGM 
SNRT\GHLFNFHP*MCYKYRCGSLAGRGGSRW 

5779 

A 

80 

460 

Y I SLNVTTHLI FFFFFLRQGL \ CS VTQAGV\ QWCNLGSLQPLPPRFK 
\NWDYRCVTPHLANFVFLVEMGF/LPASASQSAGITGVSHCTQLGVF 
I C I CYG S SHGWQS WHQFCS S KLL VKSGRVGVGLLG 

5780 

B 

1 

3811 

MVGELRYREFRVPLGPGLHAYPDELIRQRVGHDGHPEYQIRWLILRR 
GDEGDGG S GQVDCKAEH I LLWMS KDE I Y ANCHKMLGEDG QV I G P SQE 


low? 
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5781 1 A 1 


3135 


5782 A 


5783 A 

5784 A 


214 
221 


696 


348 
537 


SAGEVGALDKSVLEEMETDVKSLIQRALRQLEECVGTIPPAPLLHTV 

HVIiSAYASIEPLTGVFKDPRVLDLLMHMLSSPDYQIRWSAGRMIQAIi 

SSHDAGTRTQILLSLSQQEAIEKHLDFDSRCALLALFAQATLSEHPM 

SFEGIQLPQVPGRVLFSLVKRYLHVTSLLDQLNDSAAEPGAQNTSAP 

EELSGERGQLELEFSMAMGTLISELVQAMRWDQASDRPRSSARSPGS 

IFQPQLADVSPGLPAAQAQPSFRRSRRFRPRSEFASGNTYALYVRDT 

LQPGMRVRMLDDYEEISAGDEGEFRQSNNGVPPVQVFWESTGRTYWV 

HWHMLEILGFEEDIEDMVEADEYQGAVASRVLGRAIiPAWRWRPMTEL 

YAVPYVLPEDEDTEECEHLTIAEWWELLFFIKKLDGPDHQEVLQILQ 

BNLDGE I LDDE I LAELAVPIELAQDLLLTLPQRLNDS ALRDL INCHV 

Y KKYG PEALAGNQ AYPSLLE AQED VLLLD AQAQAKD S ED AAKVEAKE 

PPSQSPNTPLQRLVEGYGPAGKILLDLEQALSSEGTQENKVKPIiLLQ 

LQRQPQPFLALMQSLDTPETNRTLHLTVLRILKQLVDFPEALLLPWH 

EAVDACMACLRSPNTDREHPVHRVHSGTCYCSAPVAFALFHSSPPLC 

PHPGAPGTDFFLHRLTSVSRDYAWLNQLGTVRDAISKALEKHLGKLE 

LAQELRD^fVFKCEKHAH] J YRKLITNILGGCIQMVI J GQIEDHRRTHQP 

INI PFFDVFLR YLCQGS S VEVKEDKCWEKVEVS SNPHRAS KLTDHNP 

KTYWESNGSAGSHYITLHMRRGILIRQLTLLVASEDSSYMPARVVVC 

GGD S TS S LHTELNS VNVM PS AS RVI LLENLTRFWP 1 1 QI R I KRCQQG 

GIDTRIRGLEILGPKPTFWPVFREQLCRHTRLFYMVRAQAWSQDMAE 

DRRSLLHLSSRLNGALRQEQNFADRFLPDDEAAQAIiGKTCWEALVSP 

WQNITSPDEDGISPLGWLLDQYLECQEAVFNPQSRGPAFFSRVRRL 

THLLVHVEPCEAPPPVVATPRPKGRNRSHDWSSLATRGLPSSIMRNL 

TRCWRAWEKQVNNFLTS S WRDDDFVPR YCEHFN I LQNS S SELFGPR 

AAFLLALQNGCAGALLKLPFLKAAHEQQEMAQQERKQQQELHLALKQ 
EX* 


MLRWTVHIiEGGPRRVNHAAVAVGHRVYSFGGYCSGEDYETLRQIDVH 

IFNAGWERAADEVWLWSLGPRGLGTNIWFGKSGQPGGRFPGLSRAVP 

TVSLRWTKLPP^AIRGQAPWPYMRYGHSTV1,IDDTVLLWGGRND 

TEGACNVLYAFDVNTHKWFTPRVSGTVPGARDGHSACVLGKIMYIFG 

GYEQQADCFSNDIHKLDTSTMTWTLICTKGSPARWRDFHSATMLGSH 

MYVFGGRADRFGP FHS NNE I YCNR I R VFDTRTEAWLDCP PTP VL PEG 

RRSHS AFG YNGELY I FGG YNARLNRHFHDLWKFNPVS FT WKKIEPKG 

KGPCPRRRQCCCIVGDKIVLFGGTSPSPEEGLGDEFDLIDHSDLHIL 

DFSPSLKTLCKLAVIQYNLDQSCLPHDIRWELNAMTTNSNISRPIVS 

SHGRGRGCRERKRRHSSSERYCQLMPGANYRAGAGAGAGARRPRGAR 

DREEDGGGLEPAAVARDLLRGTSNMSFEELLELQSQVGTKTYKQLVA 

GNSPKKQASRPPIQNACVADKHRPLEMSAKIRVPFLRQWPISKKVS 

EQFARHIDQQIQGSRIGGAQEMERLAQLQQCLQAVLIFSGLEIATTF 

EHYYQHYMADRLLGWSSWLEGAVLEQIGPCFPNRLPQQMLQSLSTS 

KELQRQFHVYQLQQLDQELLKLEDTEKKIQVGLGASGKEHKSEKEEE 

AGAAAWDVAEGEEEEEENEDLYYEGAMPEVS VLVLSRHS WPVAS I C 

HTLNPRTCLPSYLRGTLNRYSNFYNKSQSHPALERGSQRRLQWTWLG 

WAELQFGNQTIJIVSTVQMWLLLYOroLKAVSVESLIAFSGLSADMLN 

QAIGPLTSSRGPLDLHEQKDIPGGVLKIRDGSKEPRSRWDIVRLIPP 

QTYLQAEGEDGQNLEKRRNLLNCL I VR I LKAHGDEGLHI DQLVCLVIj 

EAWQKGPCPPRGLVSSLGKGSACSSTDVLSCILHLLGKGTLRRHDDR 

PQVLSYAVPVTVMEPHTESLNPGSSGPNPPLTFHTLQIRSRGVPYAS 
CTATQSFSTFR 


GGGLEPAAVARDLLRGTSNMGFEELLELQSHVGTITYKQLVAGNSPK 
KQGSRPP I QNACVADKHRPLEMS AKIRVPFLRQWPI SKKVARDPRF 
DDLSGEYNPEVFDKTYQFLNDIRAKEKELVKKQLKKHLSGEEHEKLQ 
QLLQRMEQQEMAQQERKAQQE\RHLPLEPKRRAQAQQGHRPYFLKKS 
EQRQUUAEKFKELKRSKKLENFLSRKRRRNAGKDRRHLPLSKE 


ILKWSDFFARSWKNTFHFQILHKGLSLHHGLEDAYPVASRGSFH 
PMGASPNGPAHGLKGCKGWGAVPPMGFRSSGKFGKVTTRDSOEPAW 
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FRKGATEFSDRATNRSSNGF* SRERI/YPAS/ CEW*LDDSRAIFTFL» 
VIDNRCFRHDRSM 

5785 

A 

1 

1761 

MVL VWAVWWL WAV I VVVVVWAA VWG AVVVVAAAAmVVVVVV 

VWEEDNQHKTGAINNNOTAKNPQQSPFHSPATSTGAEATQMRRNQK 

TNPHNMTKQVSLTPPKITLAHQQWIQTKKKYLIYLKKHSGVNKIPRN 

PTYEGCEGPFQGELQTTAQQNKGGHKQTEDHSMLMDRKKQYCENGHT 

AQAVPNPYTLLSQIPEDAEWFTVIiDPKHAVFCIPVHPDSQFLFAFED 

PSNPM S QL I WTVLPQG FRNS PHL FGQALAQDLS QFS YLDTLVLRYMD 

DLLLATHSETLCHQATQALLNFLATCGYKVSKPKAQLCSQQVKYLGL 

KLSKGTRTLSEERIQPILGYPHPKTLKQLTAFLGITGFCQIWIPRYS 

KIARPLNTRIKETQKAlTrHLVRWTPEAEVAFQAIjKKALTHAPVI>S 

VGQNFSLYVTEKNRNSSRSPYTGTSAQLAELIAIiTRAPELGEGKRVN 

I Y ANS I GREREFLTS KGTL VKHQEAI KRL»I>LAVQKP KE VAVLHCVJGH 

QKGKEREIEENRQADIEARRAARQDPPLEMLTEGPLAFELAMATARA 

ELSLAIHHCCLPPPPQTRCWLPSLRIRQGVCCIPDPARAITLTAWPK 

IPFbGIRKAKNPRSEKTRLATILEAACCHFGSGPPPSWELWEQGPPV 

TVQTHILRS*VGFLGILFTPECFFR*IRYFFLVWIHCW*ASVILGGV 

KETCFVIL 

5786 

A 

1 

507 

MWSLVFSDLGFLASRIPRNGIFGHAEPRPAIiGLTPEHKGNDGDAGR 
GPLESSSSDPFLCGQERRENRVTVDSDTLALDAARLTCRHGLGNQVF 
CPHNGAQNVVDKQYTAAIHSRHTTAHGGYSVIjQTPAYSGLNDFSSSN 
KSDFRIGMPLEVTjYY 1 1 CVESPLS WLS I 

5787 

A 

143 

358 

KHSLKCLSLSSSSRWRATLSSVGWHLSSAAMRCTVSAGSSRDEGSIT 
SCSLKPGPSTHSPACSPSHKQLSK 

5788 

A 

184 

290 

VCLFKNSMYLLLLPGGPCSQSSQDGGRPLPKWWQASCSLiTWGFLASR 
I PRNGILGHAVS VI ALLEAMGHGREPWNP * GQE PPGQRT/ WRLATI I» 
EVACHHLGSSGSKDPLVTWDT 

5789 

A 

136 

351 

SSEVPIAAPDQAKGLPLFPHS*VPRFIPKFHEPNSRGAEERHSRGEK 
RRSI *TWRGVWIiGTVGEIGRGMAEL 

5790 

A ; 

253 

417 

WFSHPQAMGLPKFHEPNSRGAEERHGRGEKRRS I * TWRGVWLGTVGE 
IGRGMAEL 

5791 

A 

415 

2879 

PCRNNGEPLARLGAAMGALLYQKRSGHPAISGGVEVSDGSATVANSS 
GGRYVKEAKEATKNGDIiEEAFKLFNIiAKD I FPNEKVLSR IQKIQEAL 
EEIAEQGDDEFTDVCNSGLLLYRELHNQLFEHQKEGIAFLYSLYRDG 
RKGGILADDMGLGKTVQI I AFLSGMFDASIiVNHVliLIMPTNI,INTWV 
KEFIKWTPGMRVKTFHGPSKD/DTDQKPQSDSAKEW\VIITTYQMLI 
NNWQQLSSFRGQEFVWDYVIIiDEAHKIKTSSTKSAICARAIPASNRIj 
LLTGTP IQNNLQELWSLFDFACQGSLLGTLKTFKME YENP I TRAREK 
DATPGEKALGFKISENLMAIIKPYFLRRTKEDVQKKKSSNPEARLNE 
KNPDVDAICEMPSLSRKNDLIIWIRLVPLQEEIYRKFVSLDHIKELL 
METRSPLAELGVLKKLCDHPRLI*SARACCIjLNLiGTFSAQDGNEGEDS 
PDVDHIDQ\TTODTIjMEESGKMIF3jMDLLKRLRDEGHQTLVFSQSRQI 

i^iierllk^jrhfktlridgtvthllerekrinlfqqnkdysvfllt 
tq vggvgltltaatr wi fdps wnpatoaqavdrvyr i gqkenvwy 
rlitcgtveekiyrrqvfkdslirqttgekknpfryfskqelrelft 
i edlqns vtqlqlqslhaaqrksdi kldehi ayiiqs ix3i ag i sdhdl 
mytcdlsvkeeldvveeshyiqqrvqkaqflvefesqnkeflmeqqr 
trnegawlrepvfpsstkkkcpklnkpqpqpspllsthhtqeediss 
kmaswiddlpkegekqdlsqykr 

5792 

A 

16 

301 

IFASTAPKRSWARLGPPQILVSVATKASFGAPTIPSR\PGPQSVFAN 
SDLFPAPPQIPSRPVRIPPGIPPGVPRRPPAAPSRPTIIRPAEPSLL 
D 

5793 

C 

78 

209 

MSVFXXXXXXXXXXXXXXXXXXNEETNHPLGWYRLSDSTQYI * 

5794 

A 

119 

231 

QTTMCYGKCARCIGHSLVGLAIiLCISNKRPFELPCVli 

5795 

A 

69 

909 

E WCLLS P PEGHTFHLI WG I TELKTKRRGRKP S RP PMC YG KCARC I GH , 
S\LVGLAIIjCIAANILL\YFPKGETKNAS\ENHLSRF\VWFFSGIVG 
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RL/ VLLMLPAQHFVLHWGLEQDDC \ CG \ CCGHEN\ CGKRCAMLS S V I 
GWLSIGI \ AGSGLLWS I V/ VQPLGLARKDPLCLDS /LSGQWKLQPLP 
STEGPVTFWI TSTWS \ ECTEPKHI \ VKWNVSLFS ILLA\ LGGPELIL 
CLIQVINGS/VLGGICG\FCCSHQQQYGLAKRTNPGTEPQSFLYFHC 
NLSYIiHLYSIC 

5796 

A 

1 

200 

RGPVLIjiDTS / TW ARCNEPKP I / VEWNDTLFSN/ LLASGG IEFI LCLI 
QVINGVLGGICGFCCSHQQQYDC 



2 

424 

RLSLTLVAQAAVQWHNLGSLQPPPPGFKRFSCLSLPSSQDHRHTPPC 
PA/NSFVFLVETRLHHAGQAGLKLPTSG/DTPSSASQSARITGVSHR 
ARCWDSNSTIFLGSLANSNWRNVFLIVGGERIKGKHTHFNTHSMSEF 
EE 

5798 

A 

46 

626 

PPPPLPPPPSPPPPGPFAAARGERGGAAGAAMEPDSVIEDKTIELMN 
TSVMEDQNEDESPKKNTLWQISNGTSSVIVSRKRPSEGNYQKEKDLC 
IKYFDQWSESDQVEFVEHLISRMCHYQHGHINSYLKPMLQRDFITAL 
PEQGLDHIAENILSYLDARSLCAAELVCKEWQRVISEGMLWKKLIER 
MVRTD 

5799 

A 

1 

1004 

MWLi I IxAGLAW AGLiWV SDGLKD VLVMAKGKS RRGKS APKH I MS LGPLL 
MIAFAICHFASVMGFHYLEVNLRLQQCFKYQLIBTAFAVLVQSVLRI 
YSSRIWDKTSLECIiKVXiTGHTGSVLCIiQYDERVIVTGSSDSTVRVWD 
VNTGEVLNTLIHHNEAVLHLRFSNGLMVTCSKDRS I AVWDMAS ATD I 
TLRRVLVGHRAAVNVVDFDDKYIVSASGDRTIKNRDKAEVAYTNLVW 
STSTCEFVRTLNGHKRGIACLQYRDRLWSGSSDNTIRLWDIECGAC 
LRVLEGHEELVRCIRFDNKRIVSGAY/DMGFSSLELLATWLPNA*GS 
SFTAEL 

5800 

A 

1 

3084 

MAAS AWLQLWQKAKGKRALRVAGAGGRES AHL ILSPCSG PRIxLGWPG 
PVAASCHAGLLPGTGRGWRATVLQRFTYPCSNTSVMEDQNEDESPKK 
NTLWQISNGTSSVIVSRKRPSEGI^YQKEKDLCIKYFDQWSESDQVEF 
VEHLISRMCHYQHGHINSYLKPMLQRDFITALPEQGLDHIAENILSY 
LDARSLCAAELVCKEWQRVISEGMLWKKLIERMVRTDPLWKGLSERR 
GWDQYLFKNRPTDGPPNSFYRSLYPKIIQDIETIESNWRCGRHNIiQR 
IQCRSENSKGVYCLQYDDEKIISGLRDNSIKIWDKTSLECLKVLTGH 
TGS VLCLQYDER V I VTG S S D S TVR VWD VNTGE VXiNTL I HHNEAVLHL 
RFSNGLMVTCS KDR S I AVWDMAS ATD I TLRRVLVGHRAAVNVVDFDD 
KYI VSASGDRTIKNRDKAEVAYTNLRPGLTLS PRLVFS ILRNCQTVF 
QRRCTILQSCQWYMRVSACSEEPPFHNQHWIALLYIiAPOITKCVASG 
FALDS WEAKQLGF I CGNLLPQSHQLVWS TSTCEFVRTLNGHKRGI AC 
LQYRDRLWSG S SDNT I RLWD I E CGACLRVLEGHBELVRC IRFDNKR 
IVSGAYDGKIKVWDIiOAJVLiDPRAPASTIjCL^ 

M PKG VR S QL S YEAG I G S RR WVD AKQ PNS STQGLCTHCMH F P S YLHGC 
FLIIQVSAQFHSKAGLLPHGIQVAASSPKSMSYPLSVSGGAKSLPSS 
RKSMSYPLSICGGHSGVADTITHRSGPKLRQEALLLPIHPSSLLSFD 
KLCRVHTPVPGAWGGANTNGTIWWGTGEEWFFRGKRGSYLKRGLPRS 
PVAILERGGQILEGHSSGHCPIFSCQFDVSLCQSSLKGHSDTAKPPN 
VFFLILLRRTITRIIMKPERVRMROTPLQPADHLVSAQFCLFKLFAG 
RTPAQGQS SMAFTGRCS GLLQGPTLKAGAGELVAGIQALGAEGTSRR 
KEREFREGVQGGEGLGMKTTAHC PFANG I YRPHFMHENTEA 

5801 

A 

3 

267 

FAI CHFAS VMGFH YIjEVNIiRLQQC FKYQL I ETAFAVLVQ SVLRI YS S 
RIWDKTSLECLKVLTGHTGSVLCLQYDERVIVTGSSDSTV 

5802 

A 

242 

401 

DLYAATYSHSHT/ ILWSTSTCEFTOTLNGHKRGIACLQYRDRLVVS 
GSSDNTI 

5803 

A 

2 

386 

FVFAAARGERGGFAGAVMEPDSVIEDKTIELMLPKCMFGILLKKAGY 
SKRKESWKMLDCCLFSD*SERENPHSFWIGGSEVQNIFQNTSVMEDQ 
NEDES PKKNTLWQ I SNGTS S VI VSRKRPSEGNYH 

5804 

A 

1 

1248 

MAGCRSRAMPRREAAKAGEKSSPVPVGGTALSTGPPQPLALVLSPSIi 
HGAGRACRLLRVRARQAHAKLELQLARKCPHSPCSRSCLSLHTSLQA 
EGASSGLGQPRKGLPQCSGGLKGSSSAAKVGAQAEEVPRASEASEDC 
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QHAVTSHWDYSCSLMLLSFTLKDFLWRFLKGKTVVTSTDSAIGVGIT 

CYSKSRSPIPGLQTSTSLWTARNRATWQEPCGTNTSVLEDQDEDKSP 

KKNTPWQISNGTSFVIISRKRPSEGNYQKEKDLYIQYFNQGSESDQL 

EFVEHLISRMCHYHHGRINSYLKPMLQQDFITALPEQGLDHIAENIL 

LYLVTRSLCAAELDERVIVTGSSDSTVRVWEVN^ 

VLHLRFS SGLMVTCSKDRS I AVWDMAS ATDITLRGVLWH 

5805 

A 

2 

1653 

RGGAAGAAMEPDSVIEDKTIELMCSVPRSLWIrGCANLVESMCALSCL 
QSMPSVRCLQISNGTSSVIVSRKRPSEGNYQKEKDLCIKYFDQWSES 
DQVEFVEHLISRMCHYQHGHINSYLKPMLQRDFITALPEQGLDHIAE 

nilsyldarslcaaelvckewqrvisegmlwkkxtlerlwrtdplwkg 
lserrgwdqylfknrptdgppnsfyrslypkiiqdietiesnwrcgr 
hnlqriqcrsenskgvyclqyddeki i sglrdns i kiwdktsleclk 
vltghtgsvlclqydervivtgssdstvrvwdvntgevlntlihhke 
avxihlr f snglmvtc s kdr s i avwdmas atd i tlrr vlvghraavnv 
vdfddkyivsasgdrtikvwststcefvrtlnghkrgiaclqyrdrl 
wsgs sdnti rlwdi e cgaclrvlegheelvrc i rfdnkr i vsgayd 
gkikvwdlqaaldprapastlclrtlvehs\grvfrlqfdefqiiss 
shddtiliwdflnvppsaqnethspsg\tytyisr 

5806 

A 

1 

180 

NFKRFHKTIAl\ETASYWPK/DRHlNQRN/RESPVINPTTYGQIiIVD 
KGAKNTQWGKDNYFN 

5807 

A 

1 

355 

RWASGMSTEPGGSRHSLGIQVRGGWGVTGGEEESLTVPVADTWQAGS 
FKVATQERNPQRVQ^LRJRQKKGNTNK^SKEVRVLQEMQLLQVAAMN 
YRLRPLEKFGTYFTRMEQLSDKER 

5808 

A 

81 

474 

LGERRRASLSLSLTPGRRRGFKVATQERNPQRAQMRL\RRQKKGWP 
FWGDFLTELQ/MGLDSAIPGRPWMATPNKRSKEVRVIiQEMQIiLQVAA 
MNYRLR PLEKFVTYFTRMEQLSDKES YKLS CQLEPENQ 

5809 

A 

3 

383 

YGGKEYWDTRRSMVFARHLREVGDEFRSRHLNSTDDADRIPFQEDWM 
KMKVKLGSALGGPYXGVHLRRKDFIWGHRQDVPSLEGAVRKIRSLMK 
THRLDKVFVATDAVRKETVPFSSVAFSRLRAV 

5810 

A 

1 

146 

VLQSYAEGWKEGTWEEKVDERPCIDQLLYSQDKHESSLSSHFPHLLE 
EQ 

5811 

A 

3 

1219 

PGAMATLSFVFLLLGAVSWPPASASGQEFWPGQSAADILSGAASRRR 
YLLYDVNPPEGFNLRRDVYIRIASLLKTLLKTEEWVLVLPPWGRLYH 
WQSPDIHQVRIPWSEFFDLPSLNKNIPVIEYEQFIAESGGPFIDQVY 
VLQSYAEGWKEGTWEEKVDERPCIDQLLYSQDKHEYYRGWFWGYEET 
RGLNVS CLSVQGS AS I VAPLLLRNTSARSVMLDRAENIjIjHDHYGGKE 
YWD TRR SMVFARH LRE VGDE FR S RHLNS TDD ADR I P FQEDWMKMKVK 
LGSALGGPYLGVHLRRKDFIWGHRQDVPSLEGAVRKIRSLMKTHRLD 
KVFVATDAVRKEYEELKKLLPEMVRFEPTWEELELYKDGGVAIIDQW 
ICAHASS * SAAGEPETSCAGAGGTGRPLT 

5812 

A 

75 

130 

IHRE * VPAAQG ELRGAAE 

5813 

A 

1 

1940 

MATLSFVFLLLGAVSWPPASASGQEFWPGQSAADILSGAASRRRYLL 
YDVNPPEGFNLRRDVYIRIASLLKTLLKTEEWVLVI,PPWGRLYHWQS 
PDIHQVRIPWSEFFDLPSLNKNIPVIEYEQFIAGAQSLSHAAVSFPK 
CFSEMDLKWMSFVPPQLEPPLHPKKAATPGCYSDRTRLPVRKAVLRS 
GRRTACGVCFWSQCSARDKELHLRDAPSRFVPAPEFHLRGAPSRFV 
PAPELHLRGAPSRFIPAAFEQILIADLLGVTVPPSWCGSGQSGGPFI 
DQVYVLQSYAEGWKEGTWEEKVDERPCIDQLLYSQDKHEYYRCLLRIi 
LPLPQGWRSCQSQVSIAIKGSDTTRPPVEKLSVNSLHVSSARAAGSK 
AWQGSASIVAPLLLRNTSARFERWDYSLQRVFSVGVDFRIDGVTFHS 
HPWFCDRPADVFTRAAACKAGTCLQETQSLLTWDFVQTRRSMVFARH 
LREVGDEFRSRHLNSTODADRIPFQEDWMKMKVKLGSAL\GGPYLGV 
HLRRKDF I WGHRQDVPSLEGAVR K I RSLMKTHRLD KVFVATDAVRKE 
YEELKKtiLPEMVRFEPTWEELELYKDG \ G VAI IDQWI CAHARFF IGT 
SVSTFSFSDS*GKEKFLGLDPKTDVQQVLRRPREGV 

5814 

A 

3 

1286 

PGAMATLSFVFLLLGAVSWPPASASGQEFWPGQSAADILSGAASRRR 
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YLLYDVNPPEGFNLRRDVYIRIASLLKTLIiKTEEWVLVLPPWGRLYH 
WQS P \ D XHOVR I PWS EFFDLP *5 T .NKTJ T P VT "FVEniP t a t? ennoi? rnrwr 

YVLQS YAEG WKEGT WEE KVDERP C I DQLL YSQDKHE Y YRG WFWQ YEE 
TRGLNVS CLS VOGS AS I VAPLLLRNTSARS VMLDR aPWT.T.Mnwvrsr'ir 
E YWDTRRSMVFARHLREVGDEFRSRHLNSTDDADR I PFQEDWMKMKV 
KLGS7VLGGPYLGVHLRRKDFIWGHRQDVPSLEGAVRKIRSLMKTHRL 
DKVFVATDAVRKEYEELKKLLPEMVRFEPTWEELELYKDGGVAIIDQ 

WICAHARCLPTSLSAESGSGGFQRFFCPKYSVSEQMVACVHSGHFHT 
VCLLV 

5815 

C 

164 

391 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

5816 

C 

321 

341 

MAAliAS* 

5817 

A 

128 

526 

FFFFIFKTASHSWQAGVQWRSLSSMQPPPPGLKQFSHLSLLGSWIK 
GH\ LPHPSNFC I FSRDGVS PCWLGWSQTPGLRPQPS PKCLGLTGCEA 
TSPWPASLKFLKNCVFHLPGYVYLLHVLGPRSCKYLQKW 

5818 

A 

3 

273 

RRPRQRPTDQTQWFSILPDFSLDLQEGPSVESQT\HSDPHIPPVDPT 
THLTFNHLSIVCVSLTLSLPHLGSPCLWSPALVSQEAASHQDRR 

5819 

A 

1 

861 

MLDSLLALGGLVLLRDSVEWEGRSLLKAIiVKKSALCGEQVHILGCEV 
SEEEFREGFDSDINNRLVYHDFFRDPLNWSKTEEAFPGGPLGALRAM 
CKRTDPVPVTIALDSLSWLLLRLPCTTLCQVLHAVSHQDSCPGDSSS 
VGKVS VLGLLHEELHGPGPVGALS SLAOTEVTLGGTMCJOAc? AH TT.rP 

RPRQ\RPN\YQTQWFSI\LPDFQPGSPRGAXCRVPAXIPIPHIPPKE 

REARDSLILPFQFSSEKQQAIiLRPR:PGQATSHIFYEPDAYDDLDQED 
PDDDLDI 

5820 

A 

2 

494 

LGLLHEELHG PG P VG AL I SLAOTE VTLGfiTMno acaun r*p o -dt> no d 

TDQTQRVSILPDFSLDLQEGPSVESQPYSDPHIPPVSKNAKARTRKC 
SLVSGHGRENKSCRGWGWGQGF*GWGRVAASFVP\QVDPTTHLTFNL 
HLSKXEREARDSLILPFQFSSEK 

5821 

A 

1 

498 

VRIWSVLGLLHEELHGPGPVGAi^ 

\CRRPRQRPTDQTQWFSILPDFSLDLKEGPSVESOPYSDPHTPRVDP 
\TTHLTFNLHL\SKKEREARDSLILPFQFSSEKQQA\LLRPRPGQAT 
SHIFYEPDAYDDLDQEDPDDDLNILPGQI 

5822 

A 

3 

322 

HERQMALLKANKDL I PDGMTELTPLLQPLGFEDP WCEQDWTAGQD 

WVTYYDRDYGQQVTGEPQEFJDKALQELRQ 

S\HELPSHPPPSS 

5823 

A 

3 

276 

MTLPRLVSNSWLKQSSHLGLLKHWD 

FHAnEGSGGCRMAVVmRSGEEEEGSKVCSRFHWDSGVI.PKPMTN 

5824 

A 

151 

204 

SGEPFISCGRFWWLQDG/DLMTLPRLVSNSWLKQSSHLGLLKHWDYM 
S EAPHLAKNFLKAYEVVS LS FHVEGS GGCRMA 

5825 

A 

175 

457 

I HFVL I S AHAVGCEG I LWGG W S LRG VGVPHS S AKR S * VG I RGPGKGR 
PGAQLPAGRARASTEVP\SGAPCPLLGAK1TORGLHAPKRTGTGPSNH 

5826 

A 

188 

269 

IHFVL I SAHAVGCEGI S P I EEMGVS AL S I OEACSPGLOWATS FT.Ac: a 
WVSAPLLQ^lAi^ICPIILLGSLKGARATNRFRQSLFV<^VSFPARFPH 

LDLLLQ*P*KTOWTDVSHJjEQRGTNPG*RQE*SGPLQPWPAGFLDG n 
C 

5827 

A 

1 

542 

RHERNISHLDLKPQNILLSSLEKPHLKLADFGFAQHMSPWDEKHVLR 
GSPLxT4APEMVCQRQYDARVDLWSMGVILYEAIiFGQPPFASRSFSEL 
EEKIRSNRVIELPLRPLLSRDCRDLLQRLLERDPSRCISFQDFFAHP 
WVDLEHJ^IPSGESLGRATALWQAVKKDQEGDSAADLSLY 

5828 

A 

2 

889 

RGLRIPTWALQELAGRVTRRGPETGEGRREPGGSASPMRAAFPAGGA 
GGSVEPPSARPAPQPAGTAARSEEAPARAQAAGMAGPGWGPPRLDGF 
ILTERLGSGTYATVYKAYAKKDTREWAI KCVAKKSLNKASVENLLT 
EIEILKGIRHPHIVQLKDFQWDSDNIYIjIMEFCAGGDLSRFIHTRRI 
LPEKVARVFMQQLASALQFLHERNISHLDLKPQN\ ILLS SLEKPHLK 
LADFGF \ AQHMS PWD \ E KHVLRG \ S PL YMAPEMVCQRQ\ YDARVDLW 
SMGVI LYGETS FPCFS P 
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5829 

A 

297 

331 

RLLWGHHAGQR 

5830 

A 

3 

407 

TVYTFRSAIVTARAVWVRPRMDRRADLSSATQSASAEKFGGRVSAGH 
CALPLPARPVTASVYGRLARLRGCLEDSYPSALSAQVFLDSPAVGCG 
LE TRL» F I EAALG P P CRATVTS R GHIj "LD T S T T V P nP P f* FT . 

5831 

A 

3 

410 

RVSAGHCALPLPARPVTASVYGRIiARLRGCLEDSYPSALSAQVFLDS 
PAVGCGLETRLFIEAALGPPCRATVTSRGHLLDISITKS PGRPCFLS 
VCLHGSDQQKRKGAAATAKRKSKGGGVNVEGRLCTWPPEDP 

5832 

A 

1 

530 

QTIQEDSAATSEIIoDVMASQKRPSQRHGSKYLATASTMDHARHGFLP 
RHRDTGILDSIGRFFGGDRGAPKRGSGKDSHHPARTAHYGSLPQKSH 
GRTQDENPWHFFKNIVTPRTPPPSQGKGAEGHRPGFGYGGRASDYK 
SAHKGFKGVDAOGTLSKTFKTiOf5Rn^P«?riQPMlVPP S 

5833 

A 

198 

1175 

RANPFRMGNHAGKREIxNAEKASTNSETNRGESEKK\RNLGELSRTTS 
EDNEVFGEADANQNNGTS SQDTAVTDS KRTADPKNAWQDAHPADPGS 
RPHLIRLFSRDAPGREDNTFKDRPSESDELQTIQEDSAATSESLDVM 
ASQKRPSQRHGSKYIiATASTMDHARHGFLPRHRDTGILDSIGRFFGG 
DRGAPKRC?^C2KVPWTiKPf3R *3PT.PQU1\P QODC1T rKTMVvnciniDaTJTau 

YGSLPQKSHGRTQDENPWHFFKNIVTPRTPPPSQGKGAEGQRPGFG 
YGGRASDYKSAHkGFKGVDAOGTLSKTPin'.(^pn^R<5f3^PMi\PP 

5834 

A 

1 

651 

MGlOTAGKRELNAEKASTNSETiraGESEKiG^ 

E ADANQNNGT S S QDTAVTD S KRTAD PKN AW QDAHP AD PG SRPHL I RL 
F S RD APGREDNT FKDRP S E S D E LOT T OED 9 A AT <? F T ,n VM A <? HTTP P <3 
QRHGSKYIiATA\ STMDHARHGFLPRHRDTG ILDS IGRFFGGDRGAPK 
RGSGKRLTPPGKNCSLRVPAPEVTRPDPR 

5835 

A 

3 

210 

YEMLSDPVKRRAFNSVDPTFDNSVPSKSEAKDNFFEVFTPVFERNSR 
YNFDSWREFSYLDEEEKEKAE 

5836 

A 

71 

396 

APSAJ^VSRCAHRARPGAIMLLLPSAADGRGTAITHALTS^ 
ELSEESEDEELQLEEFPMLKTLDPKDWKNQDHYAVLGLGHVRYKATQ 
RQI KAARKYLVI LK 

5837 

A 

146 

1910 

APSAJ^VSRCAHRARPGAIMLLLPSAADGRGTAITHAIiTSASTIiCQ\ 

VEPVGRWFEAFVKRRl^nOJASASFQELEDKKELSEESQDEELQLEEFA 

MLKTLDPRIJWKNQDITYAVLGLGHVRYKATQRQIKAAHKAjW 

KKKAAGEPIKEGD1STOYFTCITKAYEMLSDPVKRRAFNSVD 

PS KS EAiCDNFFEVFTPVFERNSRWSNKJ<lWPKLGDMNS S FEDVD I FY 

or « i xv r WKJC»r o x XjiJj£iirji\Jii\>\x^C.KxJxi»i<KW x liKyjNoAl KAyKivlvbii 

MITOIRTLVDNAYSCDPRIKKFKEEE]^^ 

EKQRQAJSLEAARlxAXEi^EEEVRQQAilxAXKE 

RNSCKIEEINEQIRKEKEEAJELAR 

DDLQLLIKAVNLFPARTNSRWEVIAi^xTWIHSSSGVKRTAKDVIGKA 
KS LQKLDPHQKDD INKKAFDKFKKEHG WPQADNATPSERFEG PYTD 
FTPWTTEEQKLLEQAXiKTYPVx^TPERWEKIAEAVPGRTKKDCMKRYK 
ELVE^1VKAKKAAQEQVLNASRAKK 

5838 

A 

1 

1909 

DVSRCAJxRJU^PGAI^LLPSAADGRGTAITHAiTSA^TLCQVBPVGR 
WFFAFVKRRNRJJASAJSFQELEDKK^^ 

PKDWKNQDHYAVLGLGHVR YKATQRQ I KAAHKAxWLKHHPDKRKAA\ 
GEPIKEGD1TOYFTCITKAYEMLSDPVKRRAFNSVDPTFDNSVPSKSE 
AJCDNFFEVFTPVFERNSRWSNKKxWPKLGDMNS S FEDVD I FYS FW\Y 
NFDSWREFSYIJ)EEEKEKAJECRDERRWIEKQN 

AlXivuiMAI o v^lJlr rt X IVJvr !\JioX1»l\>\I\J\JiAx^ivKA 

QAELEAAJRIxAXEKEEEEVRQQAiLAXKEKBI^ 

KTWlTOFSDREA^RVKiyWEEVEKX^ 

KAALEKQIEEIxTOQIRKEKEEAR.A^ 

SEDDLQLLIKAVlNnjFPAGTNSRWEVIANYMNIHSSSGVK^ 
KAJCSLQIOjDPHQKDDINKXAPDKFKXEHGVVPQADNATPSERFEGPY 
TDFTPWTTEEQKLLEQALKTYPWTPERWEKIAE^VPGRTKKDCMra 
YKELVEMVKAKKAAQEQVLNAS RAKK 

5839 

A 

2 

812 

RVLELQPVTLLLGDHGWEHVDNGIRYKLDVTQCMFSFGNITEKL 
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ASLS CAGEVLVDLYAG IGYYTLPFLVHAGAAFVHACEWNPHAVVALR 
NNLE INA\ VADRCQ IHFGDNRKLKLSN I ADRVI LGL I PS S EEGW P I A 
t_y v LKQDAGG \ I LH IHQNVES FPGKNIiQALG vs kvekehwlypqqit 
x im w w ri i\w is jj ±r* vj j. ukj jc. raj jj t> t> a 1 JvF y JjAKVGR I CRNSNRHSSSAGAWE 

tmedtnsahptseilcspcgshspgsgmlplsfswl 

5840 

A 

3 

231 

OpppTpOPCVRUDnanUrTlTCCT DDDnnr^V OOrtrmnon nrMrik/s-rmAm 

or rex toto vMjrii^OAVVVwiJXooJjirl^i^J^l^o \ boDCPASASR VAGITGT 
HHHA W P I FVPWETG FH CL PQ P P I VLGLQ A 

5841 

A 

2 

300 

RRRLALSPARVTTWT-TfST.nciT.nPPPDWI?!?© PCPT r*T T CPC.TnPDni\nnnn 

ASFFVFLVETGFRHVGKDGLDFLT/S/GDPPALASQGAW\IAGVS\H 
RTRPQIHFF 

5642 

A 

227 

291 

NI I I FLLEQINFGFWFFFEME 

5843 

A 

1 

225 

HCNFPLPVQAI LGS SNSRAS ASRVAGTTGALQHMQLI FC I LVETGFH 

5844 

A 

3 

147 

LEIFFFFFSFTFF/CRWGLTMFIRLVLNFWPQMILLPLPLPPKVLGL 
QA 

5845 

B 

47 

241 

GGKDn^PLTISRITPGSKAAQSQLSQGDLWAIDGVNTDTMTHLEA 
QNKI KS AS YNLS LTLQK* 

5846 

A 

3 

409 

CGGI PSQAT I FTLSTL I S S VTS LNSTS FS TKVHTLSQNL YVFREPLK 
LTIiFLTL*ASAAFIALSN*SRT*RASWGVIC*DCMSSSR\PPEDCFP 
K*YFCITDMPWLRRKN\FFSTFQSKCRF*GTPICQKDKTSKK 

5847 

A 

1 

2209 

AAADSTSMSYSVTLTGPGPWGFRLQGGKDFNMPLTISRITPGSKAAQ 
SQLSQGDLVVAIDGVNTDTMTHLEAQITOIKSASYNLSLTLQKSKRPI 
PISTTAPPVQTPLPVIPHQKDPALDTNGSLVAPSPSPEARASPGTPG 
TPELRPTFSPAFSRPSAFSSIjAEASDPGPPRASLRAKTSPEGARDLL 
GPKALPGSSQPRQYNNPIGLYSAETLREMAQMYQMSLRGKASGVGLP 
GGSLPIKDIiAVDSASPVYQAVIKSQNKPEDEADEWARRSSNLQSRSF 
R I LAQMTGTEFMQDPDEEALRRS S TP I EHAPVCTSQATTPLL PAS AQ 
P PAAAS P S AAS PPLATAAAHTA I AS AS TTAPAS S PADS PRPQAS SYS 
PAVAAS S APATHTS YS EG PAAPAP KPR WTTAS I R PS VYQPVPAS TY 
SPSPGANYSPTPYTPSPAPAYTPSPAPAYTPSPVPTYTPSPAPAYTP 
SPAPNYNPAPSVAYSGGPAEPASRPPWVTDDSFSQKFAPGKSTTSIS 
KQTLPRGGPAYTPAGPQVPPLARGTVQRAERFPASSRTPLCGHCNNV 
I RG P FLVAMGRS WHPEE FTCAYCKTS LAD VCFVEEQNNVYCERC YEQ 
FFAPLCAKCNTKIMGEVMHALRQTVraT^ 

EDG\ EPYCEKDYINLFSTKCHCCH\ FPVEAGDKVI EALGHTWHDTCF 
I \ CAV\ CHVNL\ EGQPF YS KKDRPLCKKHAHTINL 

5848 

A 

1 

537 

RTRGGD VKTDLNNI EE FLEETLTPEKYP I LAAKHRE YNTED I D I FYK 
F S AY I K1>ITKQQNNAALERGLTKALKKVDD YLNTPL PQERDAYTCGED 
KGTRRKFL\DGDEL\TI^\DCi^\LPKLHW\K^ 

mtglwrylknayarXdeftntcaadseielayadvakrlsrs 

5849 

A 

1 

654 

WEGPTCSMCKVIGIHNACEEAPLHSVFHGQKT^ 
vn 1 ± ± lyijfcjJbRRVTKENSHHVlm 

P T^OTrOT?TfK"T.TT?TI* , 2iT.T/'VYVY^'E*/'^T rwpTirr t» hm»tv T/\nr t-\t-»t^*-»«h mi-t-v 
^xow^W^iNJ^J-i? lii/UilWiybyLUJ^TKLVETAIQSLDEPGGATFL 

LTAKQLIKSIVEASKGCQLGKTEQGFENMDFFTIjDLEHIADALRAID 

FGTDEEEEEFIEEEDQEEEESTEGKEEGHQ 

5850 

A 

141 

626 

LLWILIAAPPPPPPVIRNGARGCSPSPTTIPNAWGQEPPRPRERPPP 
PPSRDA\AGATPSFLPPPLRNGHRDSIHHCSGL/CLDDFESKYSFHP 
VEDFPAPEEYKHFSED\YPSKTNRAARG\APPLPPILQVKPGLVPFL 
IKRMDLLFFSYGPLHSSELLHDSS 

5851 

B 

23 

160 

MQHVC AMRQVD IDAYTTCLYASGTTPVPQLPLLLMALLGLCTLVL * 

5852 

A 

221 

1645 

APRMRLLAWL I FLAN\ WGGARAE PGKFWH I ADLHLD P \ DYKVS KDPF 
Q\ VCQSAGSQPRGPTQAPWGDYLCDSP\WAL\ INSS I YAiMKEIEPEP 
DFILWTGDDTPxW\P\DEKLGEAAVLEIVERLTKLIREVFPDTKVYA 
ALGNHDFHPKNQFPAGSiWIYNQIAELWKPWLSNESIALFKKGAFYC 
EKI»PG P SG AGR I VVXiNTNL YYTSNALTADMAD PGQQ FQW LED VLTD A 
SKAGDMVYIVGHVPPGFFEKTQNKAWFREGFNEKYLKVVRKHHRVIA 
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GQFFGHHHTDS F\ RMLYD YAGVS I S AMFI TPGVTPWRTTLPGWNGA 
NNPAI\RVFEYDRATLSLKDMVTYFMNLSQANAQGTPR\WEI»EYQLT 
EA\HGVPHATAHSMHTVLGRI\VGDKST1*QRYY\VYNS\VSYSAGVC 
uj&/\v^ Xoi^wrlv^AriKy VJJ 1DAI 1 rcXjYASGTTAVPQLPDLLMALIjG\Ij 
CTTRAVTCQAHHSSW 

5853 

A 

3 

126 

A Yf^rJQMfZl \ Vr*\7QriP TTTH2i'P'R'T'KT70V*T T\7V^7T vn^APOAVAV 
ojL'orJV-.i^ \ i\.v_ v o i/Jv J. ivri>\r r XiVEtyzvxx, V J\.VJL#l\AQAGSQKAxC 

5854 

C 

306 

386 

MLFLKAIWGGSDSWSREAACPLNCNL* 

5855 

j± 

7 09 

11 QQ 
X X 7 7 

AbvbVK^l AOKJj VVKlVf 1j Ttil FGNP1jFL»VAPPKPHSEWSQRLTY\RR 

RPSPYNTAT.*OK /RPPR ^T-TNTOT POP T"\7VT .T vdddt pvutjm\ cnpptrr* 
nror ihi/uj V*^-/ -Tvxtxrxv o m>J \^ J. ir iri\ i V I IjXj jl rKl^ljLjlvH ir.N \ 0/VL-L3 VL- 
PROEFEGVRAVRPX KV\ LMRL S KTKKHVS R AYfifi CMvr T ,tJV FP np T K" 
RAFL I \ E EQKI \ WKVLKAQAQSQKAK 

5856 

A 

1 

219 

HRG YLLFNLFVEVI S SEQS VLAPPVRR * TC I VANLRGVGDPDTSRHC 
AQGLGFKIiLTLYSLLVGSESWGGARA 

5857 

A 

112 

314 

LCTVCLWALKAGVGPEPERLIYSVPVFV*/HAVCAGIADGGSFSSKV 
RCLE I I FFYTG I S I KMFLYI K 

5858 

C 

133 

237 

MFMDXXXXXXGGRFKGSLGGPKFTRACKVKFFSL* 

5859 

C 

61 

177 

MRIHKRIilDLHSPSEIVKQITSISIEPGVEVEVTIADA* i 

5860 

c 

100 

153 

MS PPKSKGPFPRFPGRF * 

5661 

Q 

142 

IOC 

J- _7 J 


5862 

A 

11 

90 

RPRVRALKAGVGPEPERLIYSVPVFV 

5863 

A 

3 

759 

TVPLEVVLLRKVGAAGGARGVIRLUDWFERPDGFLLVLERPEPAQDL 
FD F I TERGALDE PLARRF FAQVL AAVRHCHS CGWHRD I KDENLLAV 
LRS GELKL IDFGS GALLMDTVYTDFDGTRVYS PPEWI RYHR YHGRSD 
TVWSLGVLLYDMVCGDIPFEQDEEILRGRLLFRER\VSPECQQLIRW 
CLSLRPSERPSLDQIAAHPWMLGADGGAPEQ/DCDLRLCTLDPDDVA 
oil bbbbl IjljRSIjHijTGS 

5864 

A 

1 

422 

I7UjGAEDAKEEKRAHYR£LAAQITKTCHESYARSDTKLGPEAFWFNS 
GREAVATQLSESYYILRPEVVESYMYLWRQTHNPIYREWGWEVVIiAL 
EKYCRTEAG FSG I QDVYS S TPNHDNKQQ S FFLAE TLKDALP FQDLR 

5865 

A 

2 

435 

ILAEFGSLHLEFLHLTELSGNQVFAEKATRNlRKVIiRKIEKPFGLYPN 
FLSPVSGNVTVQHHVSVGGLGDSFYEYLIKSWLMSGKTDMEAKNMYYE 
ALET/ HKLGPEAFWFNSGR EAVATQL S E S YYI LR PEWES YMYLWRQ 
THNTK 

5866 

A 

1 

837 

MPliVMKAG ENQRRQGDRGL S G ATV IDS LDTL YLMELKEEFQE1AKAWV 
GESFHLWVSGEASLFEVNIRYIGGLLSAFYLTGEENRKMTEGAEKAS 
VFFRAYAQPSYLAVVSGDGLQEGRGEDTAQGSRGGLGMKLQRETPCW 
VGQSYREPVPMGVTRITTAAHVFRIKAIRLGEKIiLPAFNTPTGIPKG 
WSFKSGNWGWATAGSSSILAEFGSLHLEFLHLTELSGNQVFT^EKAS 
LLPTLLPAEQRDERAAATGFSEASGSSLMDQQEGSGEPGSPSYH 

5867 

A 

1 

415 

TKIXSPEAFWFNSGREAVATQLSESYYILRPEVra 

YR\EWGWEVVLALEKYCRTEAGFSGI QDVYS STPNHDNKQQSFFLAB 
TLKYLYLLFS \ EDDLLSLEDWVFNTE AHPLPVNHSDS SGRAGGRH 

5868 

A 

1 

1187 

ESFHxjNVSGFASLFEVNIRYIGGLLSAFYLTGEEVFRIKAIRLGEKL 
LPAFNTPTGI PKGWS FKSGNWGWATAGS S S IIiAEFGS1jHL\EFIiHL 
TELSGNQVFAEKVRNIRKVPQVSFEKPFGLYPNFLSPVSGNWVQHHV 
SFGGLGDSFYEYLIKSWLMSAKTD\LEAKW 

VS PGGLTYIAEWRGG I LDHKMGHLACFSGGM I ALGAEDAKEEKRAHY 
RELAVOITKTCYESYAR\ SDTK7jGPEAFWFMQf5RTrAV2\Tryr.c*RR/Y»r 
Y I LR PE WE S YMYLW \ RQTHNP \ S YRI2 WG WEVVXALEKYCRTEAGFS 
GDSKTCYSRHPQPRTTSRQSFFKAETLNRCLVSAOJGPDTGLEARDT 
AESETVISPAVKGSRGEKAMVRETD 

5869 

A 

1 

1561 

MIFVVIILIWVLSPEGGSGLDSSPFLSFJ^AERIVQTLCTVRGAALK 
PYPPTGPNFRYSHGVPPHLFAYFPPGSTVSQDNSFISPQLQHIFERV 
RQSADFMPRWQMLRVIiEEELGRDWQTUCVASLEEVPFAAASIGQVHQG 
LLRDGTEVAVKIQYPGIAQSIQSDVQNLI^VLKMSAAIiPAGLFAEQS 
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LQALQQELAWECDYRREAACAQNFRQLLANDP F FRVPAWKELCTTR 
VLGMELAGGVPLDQCQGIiSQDLRNQICFQLLTLCLRELFEFRFMQTD 
PNWANFLYDASSHQVTLIiDFGASREFGTEFTDHYIEWKAAADGDRD 
CVLQKSRDLKFIiTGFETKAFSDAHVEAVMILGEPFATQGPYDFGSGE 
TARR I \ QDL I PVLLRHRLVS P \ PEGDLW PWHRKL\ AGAFLACAHLRD 
HIACRDHKPGHLPPLLGQSPARRSHCRQPPHQRGTPGWIPHDSLHGG 
FS PQSRP YPAWPLI PS PS ALGQRSPLGLPSLAWLSSLAPKTQESLG 
LGNSP 

5870 

A 

1 

1532 

MPQALERADGSWAWVVLIAT^1VTQGLTLGFPTCIGIFFTELQWEFQA 
SNSETSWFPS ILTAVLHMAGPLCS ILVGRFGCRVTVMLGGVLASLGM 
VAS S FSHNLS QLYFTAG F I TGLGMCFS FQS S I TVLGFYFVRRRVLAN 
ALASMGVSLG ITLWPLLSRYLLENLGWRGTFLVFGGI FLHCCI CGAI 
IRPVATSVAPETKECPPPPPETPALGCLAACGRTIQRHLAFDILRHN 
TGYCVYI LGVMWSVLGFPLPQVFLVPYAMWHS VDEQQAALLIS I IGF 
SN I FLRPLAGLMAGRPAFASHRKYLtFSLALLLNGLTNLVCAASGDFW 
VLVGYCLAYSVSMSGIGAIilFQVIJ^IVPMDQFPRAIXSLFTVLDGLA 
FL I S PPLAGLLLDATNNF * DVFQQCPS FFP * S KAPLLKGGSFPPPPQ 
KKEQGKQAVAADA \ LiERDLiFLiE \ AKDG PGKQRS P \ E IMCQS\SRQPR 
P \ AG VNKHL WGC P AS S RTSHE WLFWPKAVLQAKQTAItGWNS PT 

5871 

A 

946 

1279 

QTELGVRFPFTLPFLLGSARGCSGFRVLFVDFRVQPGFRKENLIQGD 
LFFM1^CQSSRQPRP\AGVNKHLWGMSCP/ASRDPAMKWI,LWPKAVL 
QAK\ QTAXiGWNS P YLECPV 

5872 

A 

3 

297 

ESCSVTQAGVHW/CNLGSLQPSP\PDSSDSPSSVSRVAGITGMCHHA 
QLIFVFWETGFHHGEAGLCLRLPKCWDYRCEPPRJLHLGLQLNFTYK 
DRCliA 

5873 

A 

276 

880 

KADSEQHGEAKRRYSSSIJUCKEVSKAPEDKKKRLEDDKRKKEDKERK 
KKDEEKVKAEEESKKKEEEEKKKHQEEERKKQEEQAKRQQEK\EAAA 
HMKE KEE S I QLHQEAWERHHLKKELCS KNQNAPG S R PKENFFSRLDS 
SLKKNTAFVKKLKTIPEQQRDSLSHDFNGLNLSKYIAEAVASIVEAK 
LKI SDVNCAVHLCS 

5874 

A 

2 

619 

LDLDLFNAILIVTSLAPFFQDCMSENALHELNIELLRNKIiYKSYLEA 
FYKFCKNHGDVTAEVMCPILEFEADRRAFIITLNSFGTEIjGQE\DRE 
TLYPNLGQLLSEGFAL\LAQAEDFDQMKNVADHYGVYKPLFEAVGGI 
GG KTLEDVFYEREVQMNVLAFNRQFHYGVF YA YVKLKEQE I RN I VW I 

TV T^f* T PAD tm tpy TVTCVTnTT 

AECISQRHRTKINSxIPIL 

5875 

A 

1842 

23 06 

GPRRIjAAF.LT S AGDG V YTR P W FHD CTCAT FHREDAYRLiPFIPTNHH 
PTASPE*VPPHPHLGALQLPGDPSHDIHSIGTPNSNADAPKAGGCKW 

EHTS LAWALGTAD 

JO / o 


J \J JL 

ATI 

Tk / J. 

AOQPPP WT?MT?nP "Pf^f^P 7AP*P \7C!nP^Pl\ Q TD TTLvT\7I*7 7A <TP V C CnwanTVrUT 
/^yorvXX^rVXM , XE»vjri, * vjv^iCrtJirv v o ^i\&jr jHo Xr r rV V nrtoij X oounAu l VLVl 

AQQAPHLARP 

DO// 

TV 

/ D J 


X r Ooir on xr /-vri rnu trf-\J-\J-±-r\\j x X v«,x_lm.v w /XjXjV_Xj X uKyy X yv^f 

5878 

A 

524 

605 

VSCPGSNSPLGSSSASGASSLEFHHTA 


A 

X.O / 

A ft 1 

T7CrDf!CMCDTyiCJ OC7\Cr27AQCT MPT DCA/TDATO * C\7CPT KTT "DXTT37AMAT3 7A\T" 

TI SGDVTKAWVSGFASLRPVKFLL 

5880 

A 

667 

890 

AHTVH IRRSASRLPAGPASRLVDLSH * DLGHVP / AGFFL IMNVQQEG 
LADLSTSAHLLGAPLVKGSRPHAWADAW 

5881 

A 

1 

163 

GLVDE S FVDTS PTGPS S ADATTS VRAAKKRAP YTLVGS I S EPGLG\ L 

XOrinl ©AM 

5882 

A 

183 

428 

TNSKNSCGKKSQNTGhiVIAHTKALDPSQPVTFVTIWTYAADKGALYV 
DVIRVNSYYSWYRNYGHLELIQLQLATQFENWCK 

5883 

A 

1 

1153 

MHNSDGIEVl^QQKAEIGRVGGLEGIQLGVTEVIVNGARxMLESYNCK 
AELGATGLVOTQISVKCSNQFKLEWLLNAENKVVDNQAGTQGQLKV 
LGT1STLWWPYLMHEHPAYLYSWEGRPDGAQAVGALTPGTIA 
AQKSLGP/ SDFYTLPVGLRTVAVTESQFLI SGKPFYFHGVNKHEDAD 
I QG KG FNWPLLVKDFNLLCWLGANTFCTSHYPYTEEMLQ I CYRYG IV 
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VIDECPAPSGHT\GPSVPSLLARWQLFNNVSMHHHMWVVEEPVLRDK 

NHPAMVMWSLAKEPASFLESAGYSFKSLTMEQTARVLDLDTGEAVLQ 

YRSLPRGAHKTLGKKRKISSYNVDLTSCQLAKEKCLKGPSSFLQSRQ 
ERMNSELRDN 

5884 

A 

401 

882. 

CTNTPPTWTRGRMVIAHTKAIjDPSQPVTFVTITVTYAADKGALYVDVT 
P\LNSYYSGYRNYGHL\ELI\QLQLAA\QFENWCKTSQSHYSERVWS 
GNAL*GFHQDPPLMFQ*RGPGKVLLEQYHLGLDQKRRKY\WGELIW 
NFADFMTNQCVS FLVGNENSANL 

CDQC 

n 


1184 

ETSHQVMDRSNPVKPALDYFtjgRLVNYQISVKCSNQFKLEVCLLNAE 
NKVVDNQAGTQGQLKVLGANLWWPYLMHEHPASLYSWEGRPDGAQAV 
GALTPGALAV\EVRLTAQKSLGP/SDFYTLPVGLRTVPVTESQWALY 
VD V I R VNS YY S W YRNYGHLEL I QLQL P AQFENWCKTS QSHYS ER VWS 
GNAYRVSPGGGNLPHALPCCSQNLALPTFSPGTWISLALSLIGLWA 
QETGVLFIWSQDPPLMFSEEYQKSLLEQYHLGLDQKRRKYWGELI 

WNFADFMTNQCKWQFSAWDNVPVLIFSGCLAHSGHFGCKNIGNKGGK * 
LV 

5886 

A 

1 

657 

MADFG I S AGQFVAWWDKS S PVEALKGLVDELQVKELQREPI/TPEE I 
QSVREHLGHESDSLLFVQITGKKQNFEVGSSSQLKFSITKKSSPSVK 
PAWPAAAKLWTLSANDr^DSMDLIDSDELLDPEDLKKPDPASLQA 
ASCGEGKKRKACKNCTCGLAKELEKEKSREQMSSQPKSACGNCYLGD 
AFRCASCPYLGMPAFKPGEKALLSNSNLHDA 

5887 

A 

76 

615 

VCQYCTARMAD FG IS AGQFVAWWDKS S PVEALKGLVDKLQALTGNE 
GRVSVENIKQLLQCL\VPGSTT\LHSA\EILA\EIARILRPGG\CLF 
LKEPVETAVDNNSKVKT7^SKL\CSALTLS\GLVEVKELQRBPLTPEE 
VQSVREHLGHESDNLLFVQITGKKPNFEVGSSMQLKLSITKKS 

5888 

A 

102 

1292 

ACQYCTAR I VD FE I SAGQFVA * VWDKSS PVEALKGLVDKLQAF / TPG 
NEGRVS V\ ENI K\ QLLQ\ SAHKESS F\ DI I h\ SGLVPGKAPLWHSA* 
DFWAGNPPGFLRPGWMFFFLKEPVETAVR*0*AKWKTASKLCSALTL 
SGLV/EKLKELQREPLTPEEVQSVREHLGHESDNLLFVQITGKKPNF 
EVGSSRQLKLSITKKSSPSVKPAVDPAAAKL\WTLSANDMEDDSMCI 
FCGCSLTHRWPLEHVVRLNMMINQKEDRVDTFFTLDSKFPLEACSHF 
SFSLAETTTVSLIALNTLQDLIDSDELIiDPEDLKKPDPASIjRAASCG 
EGKKRKACKNCTCGLAEELEKEKSREQMSSQPKSACGNCYLGDAFRC 
ASCPYLGMPAFKPG\EKVLLSDSNLHDA 

5889 



•7 A *7 

MGFRYVAQAGLELLSSDPS I PSSKTLQEQNDQS YPSTLPVFPEVALH 
TLLWGEDANNSKVKELQREPLTPEEIQSVREHLGHESDNIjLFVQITG 
KKPNFEVGSSRQLKLSITKJCSSPSVKPAVDPAAAKLWTLSAN\DMED 
DSMDLIDSDELLDPEDLKKPDPAS\LRA\ASCGEGKKRKACKNCTCG 
IiAEEL\ EKEKSREQMSSQPKS A\ CGNCYRGAMPSGCASCPYLGMPAF 

JVf Ijli JV v Jjij o U o V* LttUJ A 

5890 

A 

2 

152 

YRHMAPHPTNFLKK/FFCRDGGLVMLPRLVFNSWPQGILLPQPPKVL 
GLQV 

5891 

A 

1 

581 

EKQSCKSCGETFNSITKRRHHCKLCGAVICGKCSEFKAENSRQSRVC 
RDCFLTQPVAPESTEKTPTADPQPSLLCGPLRLSKSGETWSEVWAAI 
PMSYPQVLHLQGGSQDGRLPRTIPLPSCKLSVPDPEERLDSGHVWKL 
QWAKQS WYLSAS S AELQQQWLETLSTAAHG\ DTAQ\ DS PGALQLQVP 
MGAAAP 

5892 

A 

1 

1459 

MKENFKHRHWAFISSQEAQEDSREEEDRLESRFYFLFIiLELSYKTT 
GLNREPTVSLNSKSGTPMDONEH^HwnPWAKnonnGTJCTrT dttt \rr>v 

TGTGKSAAGNSILRKQAFESKLGSQTLTKTCSKSQGSWGNREIVIID 
TPDMFS WKDHCEALYK\ EVQRCYLVAGP\ KPHVLLLVTQLGRYTSQD 
QQAAQRVKEI FGEDAMGHTIVLFTHKEDLNGGSLMDYMHDSDNKALS 
KLVAACGGRICAFNNRAEGSNHDDQVKELMDCIEDLLMEKNGDHYTN 
GLYSLIQRSKCGPVGSDERVKEFKQSLIKYMETQRSYTALAEANCLK 
GAL I KTQLPKVDKTTKIGKKQSRKTGNS KKQSTS PPPKERS S S PAME 
QSWTENDFDELREEGFRRSHYSELPEDIQTEGREVENFEKSLEECIT 
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RITHTEKCLKELMELKTKARELREECRSLRSQCDQLEERVSAMEDEM 
NEMKREGKFKEKKNKKK 

5893 

A 

2 

1126 

SYKTTGLNREPTVSLNSKSGTPMDQNEHSHWGPHAKGQCASRSELRI 
ILVGKTGTGKSAAGNSILRKQAFESKLGSQTLTKTCSKSQGSWGNRE 
IVIIDTPDMFSWKDHCEALYKEVQRCYLLSAPGPHVLLLVTQLGRYT 
SQDQQAAQRVKEIFGEDAMGHTIVLFTHKEDLNGGSLMDY\MHDSDN 
KV \ L FKLVAACGGR I CAFNNRAEG SNQDDQVKELMDC I EDLLMEKNG 
DH YTNGL YS L I QRS KCG PVG S DER VKE F KQS L I KYME TQRS YT\ ALA 
EANCLKGAIiIKTQLCVLFCIQIiFLRLIILWLCILHSMCNLFCCLLFS 
MUiSiLir Ui>AjJ-»r lXFiUUjMlrIjRTVIRLE\RKTPRFIVTDPRLLFTSPI 
I 

5894 

A 

1 

303 

PCGARQGCGGPPPRL/ CSSRRTSQELLS KPVPPPAPGGPASRVGRGS 
v bbALb PJb. WW*jfc.KO±liiRG VVLTHPQS PNHQAGS PNPNTRQALATLS V 
PNKPFIE 

5895 

A 

1 

284 

ATESCSVAQVGVQWCDLSSLQAPPPGFTPFSSLSLQGSWDYRSPPPH 
PANFFVVLVE/MASASQSAGITGVSHRTRPRIAIFNLTENTNGSVDE 
D 

5896 

A 

3 

338 

GIHGWSYGGYLSLMALMQRSDIFRVAIAGAPVTLWIFYDTGYTERYM 
GHPDQNEQGYYLGS VAMQAEKFPS / E I YPQERHS IRVPESGEHYELH 
LLHYLQENLGSRIAALKVI 

5897 

A 

1 

1200 

MKNPAISITENVLHFKAQGHGAKGDNVYEFHLEFLDLVKPEPVYKLT 
QR QVNITVQKKVSQWWERLTKQEKR PL FLAPD FDRWLDE SDAEMELR 
AKVQLVNNRFKG VKYFRLNTLAS LG YWWIDNRGSCHRGLKFEGAF 
KYKMGQ I E IDDQVEGLQ YIiASRYDF IDLDRVG I HGWS YGGYLSLMAL 
MQRSDI FRVAIAGAPVTLW I FYDTGYTERYMGHPDQNEQGYYLGSVA 
MQ AEKFPS E PNRLLLLHG FLDENVH FAHTS I LL S FL VRAGKP YDLQ I 
AFCDNFAFY I CLELANMEEDARS AGVATFVLQEE FDR YS GYWWCPKA 
ETTPSGGKI LRILYEENDESEVE I IHVTS PMLETRRADS FRYPKTGT 
AN PKVTFKMSEI M 1 DAEGRGNFCK 

5898 

A 

2 

594 

NRGSCHRGLKFEGAFKYKMGQIEIDDQVEG\LQYLASRYT>FIDLDRV 
GIHGWS YGGYLSLMALMQRSDI FRVAI AG\ APVTLW I FYDTGYTERY 
MGHPDQNE QGYY LGS VAMQAEKFPS E PNRLLLLHG FLDENVH FAHTS 
ILLSFLVRAGKQ\YDLQIYPQERHSIRVPESG\EHYELHLLHYLQEN 
LGSRIAALKVI 

5899 

A 

74 

250 

KTLWMVNRTVLAGICSQQLSAGTAT/DVSKHRLQGFIPCRIPS/ASP 
QTQRKFS LQFHRNC 

5900 

A 

258 

552 

CMVFHD/VSGACCCNGETTGKISGGGNQLGSSPIGGPNPS/SGSSQT 
S ISGDWEACCS VLS MVCADPVYKVYVAALLLERELPGVFSGKI FNL 
LSDCLLW 

5901 

A 

1 

1660 

MDDRPEERMIREKLKATCMPAWKHEWLERRNRRGPVPPNNKWSPPRI 
QIKHSRNPPPIYPFTLETKDWVSKNQIKQAAQQQTVQQQPLAGSRRN 
QESNFNLTHYGTQQIPPAYKDLAEPWIQVVATPYLMAINAASSLRTC 
NNHRPASISYLDSVRLLCHKRPSKSSVSALITVCSSFAGFMRPPYSL 
RATVHVSCSPLAALHSIAPSSVKVRVQACGTSASPQIRIMAFRYPGS 
RAIFPIIPCSAKPVRMSTCSWRATVIAMSLRLTEVLGQQNRYGSNNP 
YAHGANLCGELPLELYVFPIRPSFSTRPKTVAVDIASIFKIQQFPSH 
FTTPQLIDLVRFNSPVLLRVGALFHQCSISRSPLRRRSLSDATSSRA 
RKAVLTRTFGLCYADLKNHLNATFVAVLKTGPLAAMQEQYRPEBIES 
KVQLHWDEIQiTFEVTEDESKEKYYCLSMLPYPSRRT.WMnHVPTJYTTG 
D V I AR YQRMLG KNVLQ P I G WD AFGL P AE GAAVKNNTAP AP W TYDN I A 
YMKNQLKMLGFG YDWSRELATCTPE YYRWEQNS S PS 

5902 

A 

1 

1947 

MALRRLSHDVSGALLLANGESTGNSGGSSGSSPSGGATSGSSQTSIS 
GDVVEACCSVLSMVCADPVYKVYVAAL\QCMLLVTLEDPSSHFTRMR 
RRLM/AYADEVEIAEAIQLGVEDTLDGQQDSF\CRHLFPTTIWKPQR 
TVP/LECTIHLEKTGKGLCATKLSASSEDISERLASISVGPSSSTTT 
TTTTEQPKPMVQTKGRPHSQCLNSSPLSHHSQLMFPALSTPSSSTPS 
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VPAGTATDVSKHRLQGFIPCRIPSASPQTQRKFSLQFHRNCPENKDS 
i^ivj_»o f v r x VisK.irjjir'oolM XilKfivf t>Kir rPGNTSKQGDPSKNSMTLDLN 
SSSKCDDSFGCSSNSS/NCCYT\SDETVFTPVEEKCRLDVNTELNSS 
IEDLLEASMPSSDTTVTFKSEVAVLSPEKAENDDTYKDDVNHNQKCK 
EKMEAEEEEALAIAMAMSASQDALPI VPQLQVENGEDI 1 1 IQQDMTF 
FRHIIPPIQWIYKKESANLLIDSTGQRLRIADFGAAARLASKGTGAG 
EFQGQLLGTIAFMAPEVLRGQQYGRSCDVWSVGCAIIEMACAKPPWN 
AEKHSNHLAL I FKKLLDFANTACDGDKES EVEDVETDSGNS PEDLRK 
E IM IGLQYQAEI PP YLGE YDGNEKDS PQPKKMTGVQNAKEVLS T 

5903 

A 

1 

4491 

PSPEAGGGGGALKASSARAAAAGLLREAGSGGRERADWRRRQLRKVR 

SVELDQLPEQPLFLAASPPASSTSPSPEPADAAGSGTGFQPVAVPPP 

HGAAS RRG AHLTE S VAAPD S GAS S P AAAE P GE KRAP AAE PS P AAAPA 

GREMENKETLKGLHKMDDRPEERMIREKLKATCMPAWKHEWLERRNR 

RGPVWKP I PVKGDGSEMNHLAAES PGEVQASAAS PASKGRRS PS PG 

NSPSGRTVKSESPGVRRKRVSPVPFQSGRITPPRRAPSPDGFSPYSP 

EETNRRVNKVMRARLYLLQQIGPNSFIilGGDSPDNKYRVFIGPQNCS 

CAHGTFCIHLLFVMLRVFQLEPSDPMLWRKTLKNFEVESLFQKYHSR 

RSSRIKAPSRNTIQKFVSRMSNSHTIiSSSSTSTSSSENSIKDEBEQM 

CPICLLGMLDEESLTVCEDGCRNKLHHHCMSIWAEECRRNREPLICP 

LCRSKWRSHDFYSHELSSPVDSPSSLRAAQQQTVQQQPLAGSRRNQE 

SNFNLTHYGTQQIPPAYKDLAEPWIQVFGMELVGCLFSRNWNVREMA 

LRRLSHDVSGALLLANGESTGNSGGSSGSSPSGGATSGSSQTSISGD 

WEACCSVLSMVCADPVYK\^AALKTLRAMLVYTPCHSUVERI 

RLLQPWDTILVKCADANSRTSQLS I STLLELCKGQAGELAVGREIL 

KAGS IG I GGVDYVLNC I LGNQTESNNWQELLGRLCL I DRLLLE FPAE 

FYPHIVSTDVSQAEPVE IRYKKLLSLLTFAIiQS IDNSHSMVGKLSRR 

I YL S S ARMVTTVPHVF S KLLEMLS VS S VS THF TRMRRRLMAYADEVE 

IAEAIQLGVEDTLQRQQHNSFCRHLFPTTIWKPQRTVPLECTVHLBK 

TGKGLCATKLSASSEDISERLARISVGPSSSTTTTTTTTEQPKPMVQ 

TKGRPHS QCLNS S PLSHHS QLMFPALSTPS S STPS VPAGTATDVSKH 

RLQGFIPCRIPSASPQTQRKFSLQFHRNCPENKDSDKLSPVFTQSRP 

LPSSNIHRPKPSRPTPGNTSKQGDPSKNSMTLDLNSSSKCDDSFGLS 

SNSSNCCYTSDETVFTPVEEKCRLDVNTELNSSIEDLLEASMPSSDT 

x v x c i\od v^v jjo±rc.l\-rt-Cii>llJjj X x iSJLJJJ ViNliijN^JKV-KKKMRA KK 
MAMSASOVALPI VPOLOVKNnRDT T T TOOFi r PT3T? r rT dtutv»vodvod 
Dx^WLKGQQIGLGAFSSCYOAODVGTGTLMAVKOVTVVPTJTQQPnRT? 
WEALREEIRMMSHLNHPNI IRMLGATCEKSNYNLFIEWMAGGSVAH 
LLSKYGAFKESWINYTEQLLRGLSYLHENQIIHRDVKGANLLIDST 
GQRLRIADFGAAARIxASKGTGAGEFQGQLLGTIAFMAPEVLRGQQYG 
RSCDVWS VGCAI I EMACAKPPWNAEKHSNHLAIi I FKI AS ATTAPS I P 
SHtiSPGLR\DVALRCLRTSTFTDRPPSRELLKHPVFRTTW 

5904 

A 

1940 

2062 

RPLKSQETTETWMKLETIILSKLSQRQKTKHRMFSLIGGN 

5905 

A 

785 

892 

PKQHGTWMKLETI ILSKLSQGQKTKHRMFSLIGGN 

5906 

A 

1 " 

2745 

MGDFNTPLSTLDRSTRQKWKDTQExxNSALHQADLIDIYRTLHPKST 
EYTFFSAPHHTYSKIDHIVGSKAIjLSKCKRTEII ITfYLSDHSAIKXiE 
LRI KNLTQS HS TTWKlxNNLLLND YWVHNEMKS E I KMF FE TNENKDTT 
YQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQT 
HSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPL 
ARLIKKKREKNQIDTIKxTOKGDITxT^PTEIQTTIREYYKHLYANKLE 
NLEEMDK^LDTYTLPRlxNQEEVESLhmPITGAEIVAIINSLPTKKSP 
GPDGFTAEFYQRYKEEXiVPFLLKLFQSIEKEGILPNSFYEASIILIP 
KPGRDTTKKENFRPISLMNTDAK^ 

FIPGMQGWFNIRKSIIWIQHII^TKDKNHMIISIDAEKAF^KIQQPF 
MLKTLNKLELEKTTLNF I WNQKRAR I AKS I LS QKNKAGG I TLPD FKL 
YYIUVTVTKTAWYWYQNRDIDQWNRTEPSEITPHTYNCLIFDKPEKNK 
KWGKDSLFNKWCWENWloAICRK^ 
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KTIKTLEENIfGITIQDIGMGKDFMSKTPKAMATKDKIDKWDLIKLKS 
FCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNBLNQIYKKKT 
NNP I KKWAKDMNRHFS KED I YAAKKHMKKCS S SLAIREMQI KTTMR Y 
HLTPVRMAI I KKSGNNRCWRGCGEIRTLLHCWWDCKLVQPLWKSVWR 
FLRDLELE I PFDPAI PLLG I Y PND YKS CC YKDTCTRMF I AALFT I AK 
TWNQPKCPTMIDWIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLEII 
ILSKLSQEQKTKHRIFSLIGGN 

5907 

A 

1 

3297 

MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLQPKST 
E YTFFS APHHT YS KIDHIVGS KALLS KCKRTE 1 1 TNYLSDHS A I KLE 
LRIKNLTQSRSTTWKLNNLLL^YWVHNEMKAEIKMFFETNENKDTT 
YQNIiWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQT 
HSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPL 
ARL I KKKRE KNQ I DT I KND KGD I TTD PTE I QTT I RE YYKHLY ANKLE 
NLDEMDKFLHTYTLPRLNQEEVESLNGPITGAEIVAIIDSLPTKKSP 
GPDGFTAEFYQRYKEELVPFLLKLFQS IEKEGILPNSFYEASI ILIP 
KLGRDTTKKENFRPLSLMNIDAKILNKILAKRIQQHIKKLIHHDQVG 
F I PGMQGWFNIRKS INVIQH INRGKDKNHM 1 1 S I DAEKAFDKIQQPF 
MLKTLNKLG IDGTYFKI I RAI YDKPTAN 1 1 LNGQKLEAFPLKTGTRQ 
GCPLSPLLFNIVLEVLARAIRQEKEIKLISNFSKVSGYKINVQKSQA 
FLYTNNRQTESQIMSQLPFTIASKRIKYLGIQLTIUDVKDLFKENYKP 
LLKE I KEDTNKWKNI PCS WVGR INIMKMAI LPKVT YRFNAI P I KLPM 
TFFTELEKTTFKFIWNQKRACITKSILSQKNKAGGITLPDFKLYYKA 
TVTKTAWYWYQNRDIDQWNRTEPSEIMLHIYNYLIFDKPEKNKQWGK 
DSLFNKWCWEWWIAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIK 
TLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTA 
KETTI RVNRQPTKWEKI FATYS SDKGL I SR I YNELKQ I YKKKTNNP I 
KKWAJCDMNRHFS KED I YAAKKHMKKCS SS LAI REMQ I KTTMRYHLTP 
VRMAIIKKSGimRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRD 
LELEIPFDPAIPLLGIYPKDYKSCCYKDTCTRMFIAALFTIAKTWNQ 
PKCPTMIDWIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLEIIILSK 
LSQEQKTKHGIFSLIGGN 

5908 

A 

1 

2202 

MKAEIKMLFETNENKDTTYQNLWDTFKAVCRGKFRTLNAHKRKQERS 
KI DTLTSQLKELEKQEQTHS KAS RRQE I TKI RAELKE I ETQKTLQK I 
NESRSWFFGRINKIDRPLARLIKKKREKNQIDAIKNDKGDITTDPTE 
IQTTIREYYKHLHANNLENLEEMDKFLDTYTLPRLNQEEVESLNRPI 
TGSEIVAIINSLPTKKSPGPDGFTAKFYQRYKEELRIKYLGIQLTRD 
VKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIY 
RFNGIPIKLPITFFTELEKTTLKFIWNQKRARVAKSILSQKNKAGGI 
MLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIF 
DKPEKNKQWGKDSLFNKWCWENWLAICRKLKMDPFLTPYTKINSRWI 
KDLNWPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKM 
DLIKLKSFCTAKETTIGVNRQPTKWEKIFATYSSDKGLIFRIYNELK 
QI YKKKTNNPIKKWAKDMNRQFS KED I YAAKRHMKKCSTSLAIREMQ 
IKTTMRYHLTPVRTAIIKKSGNNRCWRGCGEIGTLLHCCWDCKLVQP 
LWKS VWRFLRDLELE I PFDPAI PLLG I YPKD YKSCCYKDTCTRMFTA 
ALLTIAKTWNQPKCPTMIDWIKKMWHIYTMESYAAIKNDEFMSFVGT 
WMKLEIIILSKLSQGQKTKQRIFSLIDGN 

5909 

A 

1 

2868 

MKAEIKVFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERS 
KI DTLTS OLKE LE KOE OTH S KAS RP OE T TK T r a p t . m? t r>Tfi ttpt .n v t 

NESRSWFFERINKIDRSlxARLIKKKREKNQIDTIKNDKGDITTDPTE 
IQTT IRE YYKHLYANKLEl^EEMDKFLDTYTLPRLNQEE VESLNRP I 
TGAEIVAIINSLPTKKSPGPDGFTAEFYQSWAETQPKKENFRPISLM 
NIDAKILNKIIJUa^IQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVT 
QHINRAKDKNHMI I S I DAEKAFDKIQQPFMLKTLNKLG IDGTYFKI I 
RAIYDNPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLAR 
AIRQEKEIKGIQLGKEEVKLSLFADNMIVYLENPIVSAQNLLKLISN 
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FSKVSGYKINVQKSQAFLYTNNRQTESQIMSQLPFTIASKRIKYLGI 
QLTRDVKDLFKENYKPLLKEI KEDTNKWKNI PCSGGRKNQYRENGHT 
AQELEKTTLKFIWNQKRAHIAKSILNQKNKAGGITLPDFKLYYKATV 
TKTAWYW YQNRD I DQWNRTEP SE I TQH IYSYLI FDKPEKNKQWGKDS 
LFNKWCWENWLAICRKLKIjDPFLTPYTKINSRWIKDLNVRPKTIKTIj 
EENLGI TIQDIGMGKDFMS KTPKAMATKDKIDKWDLVKLKS FCTAKE 
TTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKK 
WAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR 
MAI I KKSGNNRCWRGCGETGTLLHC WWDCKLAQPLWKS VWRFLRDLE 
LE I PFDPAI PliLG I YPND YKS CC YKDTCTRMF I AALFT I AKTWNQPK 
CPTIIDWIKKMWHIYTMEYYAAIKNDEFVSFVGTWMKLEIIILSKLS 
QEQKTTHRIFSLIGGN 

5910 

A 

1 

3237 

MKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFI71LNAHKRKQERS 
KIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKI 
NESMSWFFERINKIDRLLARLIKKKREKNQIDAIKNDKGDITTNPTE 
IQTTIREYYKHLYANKLENIiEEMDKFLDTYALPRLNQEEVESLNRPI 
TGAEIVAI INSLPTKKS PGPVGFTAEFYQRYKEELVPFLLKLFQS IB 
KERILPNSFYEASIIVIPKPGRDTTKKENFRPISLMNIDAKILNKIIi 
ANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSVNVIQHINRTKDKNH 
MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAICDKPTANI 
II^GQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGI 
QLGKEEVKLS LFADDM I VYLENP 1 1 S AQNLLKL I SNFS KVSGYKINV 
QKS Q AFL YTNNRQTESQ IMS ELP FT IAS KR I K YLG I QLTRDMKDLFK 
ENYKPLLKEIKEDTNKWKHIAFSWVGRINIMKMAILPKVIYRFNAIS 
I KLPMTFFAELiEKTTLKF I WNQKRAR ITKS I LSQKSKAGG I TLPDFK 
LYYKATVTKTAWYWYQNRDIDQCNRTEPSEIMPHIYNYLIFDKPEKN 
KQWGKDSLFNKWCWENWLAICRKLKIJ5PFLTPYTKIHSRWIKDLNVR 
PKT I KTLEENLG I T I QDIGMGKDFMS KTP KAMATKDKI D KWDL I KLK 
SFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKK 
TNNPIKKWAKDMNRHFSKEDIYAAKKHMKKC^^ 

YHLT P VRMA 1 1 KKS GNNRCWRGCGE I GTLLHCWWDCKLVQ PLWKS VW 
RFLRDLELEI PFDPAI PLIiGIYPEDYKSCFYKDTCTRMF I AALFTIA 
KTWNQPKCPTMIDWIQKMWHIYTMEYYAAIKNDEFMSFVGIWIKLEI 
IILSKLSEEQKTKHCIFSLIGKRTFAGKLPVIKPSDLVRLIHYHENS 
MGETDP 1 1 RLS PPS YALDTWGLLQLKILDGGLPVTS VPDGSKKNH 

5911 

A 

i ! 

2001 

MPH I QVMLMQEMDSHS LGKLCP YGFAE YS PTS S CFHGF VLS VCGS SR 
DKATQRVDALVRQV I VNGGI VLDQFAVHDRVALADL I DLLVDLSAMM 
LCGVPTTGDP FVAFALGHPDDFDHLI LPKHLVDRYLLLE PLLGPVQL 
LSHSASVHLDLHQFCSDFSYGIQLTRDVKDLFKENYKPLLKEIKEDT 
NKWKN I PCS WVGR IN IMKMAMLPKVI YRFNAI P I KLPMTFFTELEKT 
TLKFIWNQKRARIAKSSLSQKNKAGGITLPDFKLYYKATVTKTAWYW 
YQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCW 
WJbAJ. CKiiljiUjLiir'r JLiTPxTKIHSRW I KDLNVRPKT I KTLEENLGLT 
IQDIGMGKDFMSETPKAMATKAKIDKWDLMKLKSFCTAKETTIRVNR 
QPTKWEKIFTTYSSDKGLISRIYKELKQIYKIKTNNPIKKWAKDTNR 
HFSKEDIYAAKKHMKKCSPSRAIREMQIKTAMRYHLTPVRMAIITKS 
GNNRCWRGCGE IGTLLHCWWECKLLQPLWKSVWQFLRDLELEIPFDP 
AIPLLGIYPKDYKSCCYKDTCTRMFIVALFTIAKTWNQPKCPTMIDW 
IKKMWHI YTMEYYAAIKNDEFMSFVGTRMKLEI I ILSKLSQGQKAKH 
RIFSLIGGN 

5912 

A 

i 

2982 

MVDIHKPYHPFTWLGVSVSPCGPVLRYSNFFLASSTVQVELLVCGGS 
SSGSSEQGFVTMRTElilQEEVESLNRPITGSEIVAI INSLPTKKS PG 
PDGLTAE F YQR YKEEL VP FLLKLFQS IEKEGILPNS FYE AS IILIPK 
PGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGF 
IPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFM 
LKTLNKLGIDGTYFKI IRAI YDKPTANI ILNGQKLEAFPLKTGTRQG 
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CPLSPLLFNIVLEVXiARAIRQEKEIKGIQLGKEEVKLSLFADDMIVY 
LENPI I SAQNLLKL I SNFS KVSGYKINVQKSQAFL YTNNRQTESQIM 
S EL P FT I AS KR I KYLG IQLTRDVKDLFKENYKPLLKE I KEDTNKWKN 
IPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI 
WNQKRAR I AKS I LS QKNKAGG ITL PDFKL YYKATVTKTAWYW YQNRD 
I DQWNRTEP PE IMPHI YNYL I FDKLEKNKQWGKDS LFNKWCWESWLA 
ICRKIiKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIG 
MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTW 
EKIFATYSSDKGLISRIYl^LKQIYKXKTNNPIKKWAKDMNRHFSKE 
D I YAAKKHMKKC S S S LAI REMQ I KTTMRYHLTP VRMA 1 1 KKS GNNRC 
WRGCGEIGTLLHCVmDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLL 
G I YPNDYKS CC YKDTCTRMF I AALFTI AKTWNQPKCPTMIDW I KKMW 
H I YTME YYAAI KNDEFMS FVGTWMKLETI I LS KLSQEQKTKHRI FSL 
IDRLDQRNFLLGSSGAFPLSHDLAERDQASGTGKASPHRQLNPSHPA 
QPSVDSGPLTANHNTASATFSLPCLWAKAPPAGHLEHGTFRVKSFLG 
HSPAINQ 

5913 

A 

1169 

3269 

VHCRFWILALCQMSRLQKSPLLFNIVLEVLAKAIKQEKEIKGIQLGK 
EEVKLSLFADDMIVYLENPTVSAQNLLKLMSNFSKVSGYKINVQKSQ 
AFLYTNNRQTESQIMSGLPFTITSKRITYLGIQLTRDVKDLFKENYK 
PLLKEIKEDTNKWKNIPCSWVGGRINLVKMAILPKVIYRFNAIPIKL 
PMTFFTELEKTTLKFIWNQKRALIAKSSLSQKNKTGGITLPDFKLYY 
KATVTKTS WYWYQNRD I DQWNRTEPS E IMPHI YNYLI FDKPDKNKQW 
GKDSLFNKWCWENWIJVICRKLKLDPFLTPYTKINSRWIKDLHVRPKT 
I KTLEENLGNTI QD I GMGKDFMS KTPKAMATKAKIDKWDL IKLKSFC 
TAKE TTIRVXTOQPTEWEKIFATYSSDKGLI SRI YNELKQIYKKKTNN 
P I KKW AKDMNRHF S EED I YAAKKHMKKC S S S LA I REMQ I KTTMRYHL 
TPVRMVI I KKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKS VWQFL 
RDLELEIPFDPAIPLLGIYPEDYKPCCYKDTCTRMFIAALFTIAKTW 
NQPKCPTMIDGIKKMVmiYTMEYYAAIKKDEFMSF\AGTWMKLETII 
LRKLSQGQKTKHRMYSLIGGNLTMRTFGHSAGSHHTPGPIMRCGAGG 
GI ALGE I PNVNDELMGTANQHGTCI PMQQNCTLCTCTLKLKV 

5914 

A 

2821 

5794 

DAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKAS 
RRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK 
KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEM 
DTFLDTYTLPRLNQEEVES LNRP ITG AE I VAI INSLSTKKS PGPDGF 
TAEFYQRYKEELVPFFLKLFQS IEKEGILPNS F YEAS 1 1 LI PKPGRD 
TTKKENFRPISLMNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGM 
QG W FN I R KS INV I QH INRAKD KNHM I VS I D AEKAFDKI QQP FMLKTI 
NKIGIDGMYFKIIRAIYDKTTANIILNGQKLEAFPLKTGTRQGCPLS 
PLLFNIVLEFLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVDLENP 
IVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP 
FT IAS KR I KYLG I QLTRDVKDLFKENYKPLLKE I KEDTKKWKNI PCS 
WVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQK 
RAR I TKS I LSQKNTKAGG I TL PD F KL YYKATVTKT AWYWY QNRD I DQ W 
NRTEPSEITPHIYNYLIFDKPDKNNKKNKQRGKDSLFNKWCWENWLA 
ICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQAIG 
MGKDFMNKTPKAMVTKAKIDKOT)LTKLKSFCTAKETTIRVNRQPTEW 
EKIFTIYPSDKGLISGIYNELKQIYKKKTNNPIKKWVKDMNRHFSKE 
D I YAAKKHMKKC S S SLAIREMQ I KTTMR YHLTPVRMA 1 1 KKSGNNRC 
WRGCGEIGTLLHCWWDCKLVQPLWKSVWRFLRDLELEIPFDPAIPLL 
GVYPKBYKSCCYKI)TCTRMFIAALFTIAKTWNQPKCPTMIDWIKKMW 
H I YTME YYAAI KKDEFTS FVGTWMKLET 1 1 LSKLFQRRKTKHCM FS L 
IGGN 

5915 

A 

457 

651 

LWKTFFHQKKIFGLTSYSGMFVGTWNGNTKVAIKTLKPGTMSPESF 
LEEAQIMKKLKHDKLVQL 

5916 

A 

3 

639 

KGDHVKHYKIRKLDNGGYYITTRAQFETLQQLVQHYSERAAGLCCRL 
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WPCHKGMPRLTDLS VKTKDVWE I PRESLQLI KRLGNGQFGEVWNGT 
\WN\GNTKVAIKTLKPGTMSPESFLEEAQIMKKLKHDKLVQLYAVVS 
E\EPIYIVTEYMNKGSLLDFLKDGEGRAIjKLPNLVDMAAQVAAGMAY 
I ERMNY I HRDLR SAN I L VGNGL I CK 

5917 

A 

1 

2523 

MHWKRPWQVGGYLTLAATDTVQMLNYMSVRQLSYLNFPNTWQFLSWS 

QM PTQQAY P CVTHTHTP P VLGQMACC I SNQRRTAAL I GTGRPERNVH 

VNLRQGFWNAHEPWVNEKVARQEPEDAGRGAVNRAWGFSPRLIHWS • 

EGPQAAALGDRDPRWLPYHHAAALSGSHAAALSLAPSRSSALAVXiIA 

LL P AAQLHHHVPR S KACD I A YVGGHRRQ VW YGQEDGQQ I L WP VLLS 

TTQQKA^SALWADRLE\RALLARSL7UVRLAPPSVHPCWV/VTFPSAVA 

W INAG SLLDFLKDGEGRAL KL PNLVDMAAQVAAGMAY I ERMNY I HRD 

LRSANILMGNGLICKIADFGLARLIEDNEYTARQDPTRSRGHRSSAM 

NPYGSRAGRGRVESGLTLQMAATQVTVQWSAHLVSGEAVAWNTFPTA 

NQDRKELAALSCSCPSTL.CFMLLGLYPTLDHLQVRQSHCLTVSCAWQ 

GL KP LFLWLA I NKAE E I CLTLLKYWQ ADC ELLAS AYQHAWHTCARTQ 

SSNLSIADIESLELWKTFFHQKKIFGLTSYSGMFVGTWNGNTKVAI 

KTLKPGTMSPESFIiEEAQIMKKLKHDKLVQLYAWSEEPIYIVTEYM 

NKGGDEGTIREEIGQVKAPRRDNIPKPSAAHAAQLVMWHEVRLTRAL 

CPFWGWLDSWHHFCVELHPTHPLITQKPKGKELTDVVHRSQFSGAQS 

RVESIWIQRSMTKTPGAKLQKQAYQGVSNPHHHHHTGQQLKAVPHPL 

MEAPDAIDGYRKILVNFTKIYSAFAVTNSDYEWQPTLVAIGYRCGDP 

Q(jTERRTQl\JPPTr.VV ITQIjQPQPWIjIjRGHVGLVLIjDHPIFKEKP 

5918 

A 

74 

1002 

SNESNAVALSTVERTARRDAPQHGTQPRGLPRSSEPHPAQRTAATGK 
ARQRRKTTASAGAREGRRRIiAGTTS I PGP / FS AVP VHRHRPQEPRQG 
PRTPRRAEEKRDQMASHVECRPLGVFECELCTLTAPYSYVGQKPPNT 
QSMVLLEESYVMKDPFTSDKDRFLVLGSCCSLCSRLVCVGPVGRMEV 
EQHG FHS VPS VLRVTPES EKSLSI CHRLTHTKG SC INSQGDEMGQGG 
RSLRSVAWEYLRLMEELWQEKLGKEDCKGNSARLGEKESSIKEDPQ 

5919 

A 

3 

401 

RSRGTRAAMASHVECRPLGVFECEIiCTLTAPYSYVGQKPPNTQSMVLi 
LEESYVMKDPFTSDKDRFLVLGSCCSLCSRLVCVGPVG\CSLFYSKR 
FCLPCVRENINAFPQEIRQDLEKRKAPSKRTPSQPGSRT 

5920 

A 

96 

615 

NI ILRNS I ITSRKIKYTTLSTTHS IFLFPQIS PQMSLIGFFYESEAN 
KGSHFPFGYVSCLNIiAETT/SLFYYICFLKSPDQGQNAQTMAYNITP 
LRRA\VECVQDQFG\FINYEVG\DSKK\LFFHMKEV/SRDGQCSAWN 
VWPSLLRGPKA\VAAPRP\DRL\VSRIiKNITL\D\DASDPR 

5921 

A 

1497 

2853 

EMVLVS S S VW I VMFVCS STSVKFWMGTS SH I ADEVE FTWPDMLS AQ 
RNHAIRIKKXiPKGTVSFHSHSDHRFLGTVEKEATFSNPKTTSPNKGK 
EKEAEDG 1 1 AYDDCGVKLTI AFQAKDVEGSTS PQI GDKVEFS I SDKQ 
RPGQQVATCVRLLGRNSNS KRLLG YVATL KDNFG F I ETANHDKE IFF 
rixz>h,r &K3UvU£>UEtlj\jUFiVb* i oJLoKOKGNKVSAEKVNKTHSVNGITEE 
ADPTIYSGKVIRPLRSVDPTQTEYQGMIEIVEEGDMKGEVYPFGIVG 

QFGFI\NYEVGDSKKL\FFH\VKKVQDG\IELQA\GDEVEFS\VILN 
PAATGKVOAPC\ NVWR\ VCEGPPRTjTiOAP\ P PDRVnQTPT .TCNTTTX T.D 
DASAPR\LMV\LRQPKGDQITSMGF\GAERKI\RQAGVID 

5922 

A 

23 

206 

GVGQRVSGTGRTRAVSPHPWDGFPEWPGRIiGAVASTANLLDDVEGHA 
CGEAGGE I EAPTS L 

5923 

A 

1 

267 

ALFRDKLKHMGKSTRRKLFELAFJVFSEKT 

AAASTANIiLDDVEGHACDE\NFRGQASRRAPQGGGRPCREGQ 

5924 

A 

1 

267 

FFFFFLTDILLF/CLFLRWSFTVTQAGVQWSDLGSLQPPPPRLKQFS 
CLSLLSSWDHKCLAPRPANFCSFSKDGVSPCWPGWSQTPHLR 

5925 

A 

1 

326 

MGTSSGSWPRWS VTAMPPTSMRPCSRRSS SCWPCI PPJbPASTAP * SM 
RPHSGPRMTTRSCljKT/RSYADGYSPPLNVAQRYVVA/W^GTQEKW 
PPGRLAI PQRHRAGQQR 

5926 

A 

366 

489 

QETTWPDLETKWLPG* RPWMGMSSGFWARWS VTAMPPTS E 
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5927 

A 

228 

756 

KVVQTEERKKANQFDAP IS I YEVHLGS WRRHTDNNFWLS YRELADQL 
VPYAKWMGFTHLBLLHINEHPFDGSWGYQPTGLYAPTRRFGTRDDFR 
YFIDAAHAAGLNVILDWVPGHFPTDDFALAEFDGTNIiYBHSDSRESY 
HQDWNTL I YNYGRREVS Y FLVGNAL YWMNRFG I D 

5928 

A 

1 

2709 

MGLDDGVARNLGLALSLARLADAVAGD V I S ALLVNAVG 1 1 GL I GLFA 
TAVAKIVGERWFARRHFFDIAPWMEEIAIKKRGMKLMRHFTGDRRFA 
AARYAHQDVNWIMVSRAQSGAPTIQELKMLRVTVIGNIVQPDNIRS 
NIHPCLKCISIRHEYAPAGVQRQYCPSAHAASARSQWAPSPDQCHD 
RGNRQSPYRQELL»TPD7U)IQTPRRPPGHHSYRTRHPAADIPAAPGES 
VQYRKKQTELHTGIFNWTIRQFLLGFRLVKRVTVTYRHARDGKARK 
QHRVNRQTCHHQEEQNPDVQVGNHPTRTDWHNRKGQQQRGKGNHRCQ 
GKDDAlREFRDPVFFKEHFDHVRNQLERTAPANTVRSVTVLEQTQQA 
TFAKAHECP AGDNHQQDHHRFEDDPQDARQPAPEHAD I S SEATCLAA 
GIX3EFTGAGRDMGGLGFWYKWNLGWMHDTLDYMKLDPVYRQYHHDKL 
TFGILYNYTENFVLPLSHDEVVHGKKSIIiDRMPGDAWQKFANLRAYY 
GWMWAFPGKJO^LFMGNEFAQSREWNHDASLDWHLLEGGDNWHHGVQR 
LVRDLNLTYRHHKAMHELDFDPYGFEWLWDDKER S VL I FVRRDKEG 
NEIIVASNFTPVPRHDYRFGINQPGKWREILNTDSMHYHGSNAGNGG 
TVHSDEIASHVRQHSLSLTLPPIiATIWLGYNFTLFSAHAERVELCVF 
DANGQEHRYDLPGHSGDIWHGYLPDARPGLRYGYRVHGPWQPAEGHR 
FNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYRDNAAIAPKCVW 
VDHYDWEDDAPPRTPWGSTI I YEAHVKGLTNLHPE I PVE I RGTYKAL 
GHPVMINYLIQLGITALELLPVAQFASEPRIjQRMGLSNYWGYKAGGD 
VCAASGVCLiIj 

5929 

B 

76 

925 

xqcfflsadsriaelltelhqlskqtqeersrsehnlvniqktherm 
qtenki s p y yrtklrgl yttakadaeaecni lrkaldmi aei kslle 

ERRSAAKIAGLYNDSEPPRKTMRRGVLMTLIjQQSAMTLPLWIGKPGD 

kpsqpssplvlaaskalqvllgfllynlpclshrhhpqflsrlrlpl 

YSGDCISQECLNLVFLIJVDVWFGFLPSIYLVFIjIILYEGLLGGAAYV 

ntfhnialetsdehrefamaatcisdtlgislsgllalplhdflcql 
s* 


A 


1066 

gswg yqptglyaptrr\ glnvt ldwvqahfptddfalae fdgtnlye 
hs d preg yhqdwntl i yn ygrrevsnflvgnal yw i erfg i d ajlrvd 
avasmiyrdysrkegewipnefggrenleaieflrntnrilgeqvsg 
avtmaeestdfpgvsrpqdmgglgfwykwnlgwmhdtldymki^ 

KQ x HHD KLiTFG 1 1» YNYTEN FVL P LS HDEVVHGKKSIIjDRMPGDAWQK 
FANLRAYYGWMWAFPGKKLLFMAAKIAGLYNDSEPPRKTMRRGVLMT 
LLQQSAMTLPLWIGKPGDKPPPLCGAIPASGDYVARPGDRLLARLKP 

5931 

A 

53 

507 

FFVFLVEMGFHHVGQAGLELLNSSDLPSSASQSAG/IYRCEPLCPAF 
FFFFSETESRSVAQAWQWCDLSSLQPPPPHFKRFFCLSLLSSWDYR 
HVPPCLANFCIF\VEAGFHHVGQAGHKLLISSDPPASVSQSAGITGM 
SHCIQPHSPVFS 

5932 

A 

1 

346 

ST^ATTKTSETNHTSRPRLKNVDRSTAQQIJVVTVGNVTVI ITDFK\E 
KTRSSS\TSSSTVTS\SAGS\EQQN\QSSSGVQRAPDKGIiPPRSLPT 
FiS-^jUMb \AVJNlJiilr PJilATWNCEIvL 

5933 

A 

1 

915 

MGKEKTHINIVVNGHIDSGKSTSTGHLIYKCGGINKRSIKKFEKKAA 
EMGKGS FKS FVCDS S W YI PGHRDF ITN I FTGTSQADCAVIj WAAGVG 
E F EAALLLODVYKI GG I GTVPDGOVETVAT, K PCIWT TPA PVNTTTFV 
KSVK>lHHEAIiSEAFSGDNVGFHVK3^SVKDIHCSNNAGNC 
AAGFTAQVI\IL1TOPGQ/DOTMAHIACKFAELKEKIDRSSGKKLEDG 
PKFLKSGDVAIIDIW^GKPMCVESFSDYPPI^CFAAC^MRQTVAIiGV 
I KAADKKAAGAGKVTKYPQKAQKAK 

5934 

A 

1 

759 

WQTQISKTADELISYWGTSFPPPFAASLTLYELEYCITIDISIjWKF 
E TS KY YATV I DAPGHRDF I KNM I TGTSQADCAVL I FAAG VGEFEAVT 
FAPVIWTTEVKSVEMHHEVLSEALPGDIWDFDV^ 
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«v 1 & ACjCAP VliDCHMAH X ACKFAKLKKKTG S TSGKKLEDGPTFLKSG 
DAAI VDMVPGKPMCVE S FS VYP PLSRFAVCDMRQTVAVGVIEAMDKK 
AAGAGKVTKS TQKAQKAK 

5935 

A 

30 

707 

KLPLKAKMGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIB 
KFEKEAAEMGKGS F KYAWVLDKLKAERERG I T I D I SLWKFETS KYYV 
TI IDAPGHRDFIKNMITGTSQADCAVLI VAAGVGEFEAVI FKNGQ\T 
x\r*ru\ijijj\i i jjovjjvyjj J. vvjr viMJvrlJUo 1 bJPri SQKKY EEX VNEVSTYIKK 

5936 

A 

1 

706 

QSEQPCALPSRESTKLGGIGTVSSAPMET\GFSNPGMWTFAPSPR* 
W AAV 1N - \>3"Jvi'in \ ncj/u; \ oxi/\jJooxiyXjv*Moi>l V Jsjn VJt* C-QGMFRPWQTVA 

G* PAKNDPTQWEASWLSLLQVI IL\NHPRPK*GAGPMPIiYLDC\HTA 
\HIACK\ FAELKEKIDRR\ SGKKT.PnnPKlTT w cnnaa T\rniwnrorvD\ 

MCVES FSDYPPLGRFAVRDMRQTVAVGVI KAVDKKAAGAGKVTKSAO 
KAQKAK 

5937 

A 

364 

458 

VIEHLVSQDGLDFL/NLVICPPRPPKVLGLQA 

5938 

A 

1 

266 

EMESRSIA\RMECSGSISAHCKLRLPGSHHSPASVSRVAGTTGTCHH 
ARL I FLYF / LLETGFHCVSHDGLHIiIj / NT,VT PP PP ppttvt .tjt .n a 

5939 

A 

1 

201 

PPTGP/PKLPDPVHLPPFGPPPSAWALPGMSPPSPDPDRNKLRIIRI 
LKNASLSYLQVEVFPYPKMP 

5940 

A 

2 

142 

ESIQDYKHLCQDLSFCQXLPPDPPPLTVPQTHNARDQWLQDAFHISL 

5941 

A 

1 

432 

IHDELSRAARA\ PDGPRHAAGAANAGPAAGPRRPVNLDS ALAALRKE 
njji> a \ v^3ijK\2LiSJrlt> l»Jjv_y IjWGIjYESIQDYKHLCQDLSFCQDLSSSLH 
SDSSYPPDAGLSDDEEPPDASLPPDPPPLTVPQTHNARDQWLQDAFH 

5942 

A 

3 

696 

VSMGALGLEGRGGRLQGRGSLLLAVAGATSLVTLLLAVPITVIJVVLA 
LVPQDQGG/LGFQKLPEEEPETDLSPGLPAAHLIGAPLKGQGLGWET 
TKEQAFLTSGTQFSDAEGLALPQDGLYYLYCLVGYRGRAPPGGGDPQ 
vji*£>v iXjK&^JUxKAL»LjAxuFljTPjEijIjIjEGAETVT 
WYTSVGFGGLVQLRRGER\TAmiSHPDMVDFARGKTFFGAVMVG 

5943 

A 

1 

743 

MG ALGLEGRGGRXiQGRG S LLLAVAG AT \ S LVTLLLAVPI TVLAVLAL 
VPQDQGGLVTETADPGAQAQQGLGFQKLPEEEPETDLIPGLPTAHLI 
GAPLKGHGL\GWETTKEQ\AFLTSGTQVSDAEGLALPQDGIiYYIYCL 
vl* ± \«AJb'J^o^ijUF^^K&VTIjRSSIjYRAGGAYGPG\TP\ 
AETATPVLDPARRQGFWPLWYTSVGFGGLVQLRRGERVYV\NIS\HP 
DMVD F\ ARGKTF FGAVMVG 

5944 

A 

1 

588 

MGFLHVGQAGLKLLTSGHYQKKRMFSENEENVKRMKTSEQINENICV 
oijjiKViAf J-iCvvirCior c»liJjirFJjPl!#PPAPIjPEIjVDKTRDTIjPPQKPEIi 
K\TKRVFRPNGIALTWNITKINPKCAPEEK\YPLFLCHENSNNKIiIWK 
KIGEIKALPLPMACTLSQFLASNRYYFTVQSKDIFGRYGPFCDIKSI 
PGFSENI/T 

5945 

A 

3 

497 

EDTGTFRIY\ESAGAVKKARGFLEFVEDFIQVSKNIiIGKVIGKNGKV 
IQEIVDKSD1WPVRIEGDSENKLPREDKDDRDSRHQRDSRRCPGGRC 
RSVSGRRGRGGPRGGKSSISSVPKDPDSNPYSVLDN/TESDQTADTD 
AS KSHHSTNRHTRSRRRRTDEDAVL 

5946 

A 

1 

2065 

MAELTVEVRGSNGAF YKGF I KD VHEDSLTWFENNWQPERQVPFNEV 
RLPPPPDI KKEI SEGDEVEVYSRANDQEPCGVWIjAKVRMMKGEFYVI 
EYAACDATYNEIVTFERLRPVNQNKTVKKNTFFKCTVDVPEDLREAC 
ANENAHKDFKKAVGACRIFYHPETTQLMILSASEATVKRVNILSDMH 
LRSIRTKLMLMSRNEEATKHLECTKQIjAAAFHEEFVVREDIjMGIiAIG 
THGSNI QQARKVPGVTA I ELDEDTGTFR I YGESADAVKKARGFLEFV 
EDFIQVPRNLVGKViqKNGKVIQEIVDKSGVVRVRIEGDNENKLPRE 
DGMVPFVFVGTKESIGNVQVLLEYHIAYLKEVEQriRMERLQIDEQLR 
QIGMGFRPSSTRGPEKEKGYATDESTVSSVQGSRSYSGRGRGRRGPN 
YTSGYGTNSELSNPSETESERKDELSDWSLAGEDDRDSRHQRDSRRR 
PGGRGRSVSGGRGRGGPR\GGKSSISSVQYRSNIHNCSTLKRIFLAS 
DMNIVLKDPDSNPYSLLDNTESDQTADTDASESHHSTNRRRRSRRRR 
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TDEDAVLMDGMTESDTASVNENGLDDSEKKPQRRNRSRRRRFRGQAB 
\DRQPAIDFIYKEVEKWSLWQAKDVIEBHGPSEKAINGPTSASGDD 
ISKLQRTPGEEKINTLKEENTQEAAVLNGVS 

5947 

A 

1 

1876 

- 

MADVTVEVRGSNGAFYKGFIKDVHEDSLTWFENNWQPERQVPFNEV 
RLPPPPDIKKEISEGDEVEVYSRANDQEPCGWWLAKVRMMKGEFYVI 
EYAACDATYNEIVTFERLRPVNQNKTVKKNTFFKCTVDVPEDLREAC 
ANENAHKDFKKAVGACRIFYHPETTQLMILSASEATVKRVNILSDMH 
LRSIRTKLMLMSRNEEATKHLECTKQLAAAFHEEFVVREDLMGLAIG 
THGSNIQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFV 
EDFIQVPRNLVGKVIGKNGKVIQEIVDKSGWRVRIEGDNENKLPRE 
DGMVPFVFVGTKESIGNVQVLLEYHIAYLKEVEQLRMERIiQIDEQLR 
QIGSRSYSGRGRGRRGPNYTSGYGTNSELSNPSETESERKDELSDWS 
LAGEDDRDSRHQRDSRRRPGGRGRSVSGGRGRGGPR\GGKSSISSVL 
KDPDSNPYSLLDNT\ESDQTADTDA\SESHHSTNRRRR/SIRRRRTD 
\ EDAVLMDGMTDSDTAS VNENGLVTVA\ DYIS RAESQS I QRNLP\ RE 
TLAKKKKEMAKDVIEEHGPSEKAINGP\TSASG\DDISKAT/RVLRG 
EEKINPLKEENTQEAAVLNGVS 

5948 

A 

1 

392 

TESERKDELSDWSLAGEDDRDSRHQRDSWRRPGGRGRSVSGGRGRGG 
PRGGKSSIS\SVLKDQPSNP\YSLLDNTESDQTADTDASESH\HSTN 
RRRRSRRRRTDEDAVLMDGMT\ESDTASVNENGLGKRCD 

5949 

A 

269 

973 

PAACPSPANNI CFYGECSYYCSTEHALCGKPDQIEG\ SLG\AFLPDL 
SLAKRKTWRNPWRRSYHKRKKAEWEVDPDYCEEVKQTPPYDSSHRIL 
DVMDMTI FDFLMGNMDRHHYETFEKFGNETFI IHLDNGRGFGKYSHD 
ELS I LVPLQQCCR IRKSTYLRLQLIiAJCEE YKLSLLMAES LRGDQEAP 
VLYQPHLEALDRRLRVVLKAVWDCVERNGLHSVVDDDLDTEHRAASA 
R 

5950 

A 

1 

106 

LVCILNMEGRKALTWKYYAKGFEGKKSGFTPLVIP 

5951 

C 

146 

2B0 

MVFKDQKMRFCFCNQXXXXLMNLXXXXXXVLREKKGGS PPFVFL * 

5952 

A 

2 

1929 

MAAAAVDST^MEVVPALAEEAAPEVAGLSCLVNIiPGEVLEYIIjCCGSIj 
TAADIGRVSSTCRRLRELCQSSGKVWKEQFRVRWPSLMKHYSPTDYV 
NWLEEYKVRQKAGLEARKIVASFSKRFFSEHVPCNGFSDIENLEGPE 
I FFEDELVC I LNMEGRKALTWKYY AKKI LYY LRQQKI LNNLKAFLQQ 
PDDYESYLEGAVYIDQYCNPLSDISLKDIQAQIDSIVELVCKTLRGI 
NRRHPSLTFIAGESSMIMEIEIjQSQVLDAMNYVLYDQLKFKGNRMDY 
YNAIiNIiYMHQVLIIlRTGIPISMSLLYLTIARQLGWLEPVNFPSHFL 
LRWCQGAEGATL\DIFDYIYIDAFGKGKQLTGKECEYLIG\QHVTAA 
LYG\VVNVKXVLQRMVGNLLSLGKREGIDQSYQLLRDSI>DLYIA\ffY 
PDQVQLLLLQARVYF\HLG\IWPEKSFCLVIjKVLDILQHIQTL\DPG 
QHGA\VGYL\VQHTLEHIIj/ERKKEEVGVEVKL\RSDEK\HRDVCYS 
I \ GLHYGRHKRYGIi/ N* LC * FYGWGPHLAWMGHELDSRNMNV\HSLP 
HGHHQPF YNVLVEDGS CRY A\ AQEN \ LE YNAEP \ QE I SH \ PDVGRVY 
SQRFT\RTHYIPN\AELEIRYPEDLEFV\YETVQ\NIYKCKRKENIE 

5953 

A 

3 

364 

FFSPBTESHCIAQAGVQWQMLGSLQSPPPFSCLSLPSSYDYRHAPLH 
RANFLLFXLRDGVSACWPATAPGLLLIFFGRAPWLMPVIPAPREAEA 
GGSLEARSSRPASLFQVSFFSPFFHMF 

5954 

A 

2 

114 

FFFFGRDK/ SLTMLPRLVSNCWAQGILPSWPPEVLRLQA 

5955 

A 

3 

139 

FFFFFETETHTVARAGVQWCHLASLQPLPAG\SSHSP\AAASHVAS 

5956 

A 

274 

363 

LATWMKSNTSCLVTTLSKWPLSRPQRSHSA 

5957 

A . 

7 

1047 

ARPGPDMAALYACTKCHQRFPFEALSQGQQIiCKECRlAHPWKCTYC 
RTEYQQERLECNGT I S AHCNLHLPG S S DS PAS S SRVAG I TG I KTNT I 
CKKCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSC 
EQCKQQCAFDRKDDRKKVDG KLLCVTLCTL S YKRYLQKTKEQRKHL S S 
SSRAGHQEKEQYSRLSGGGHYNSQKTLSTSSIQNEIPKKKSKFESIT 
TNGDSFSPDIiALDSPGTDHFVIIAQLKEEVATLKKMLHQKDQMILEK 
EKKITELKADFQYQESQMRAK3WQMEKTHKEVTEQLQAKNRELLKQA 
AALSKSKKSEKSGAITSP 
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5958 

A 

7 

960 

ARPGPDMAALYACTKCHQRFPFEALSQGQQLCKECRIAHPWKCTYC 
RTEY QQERLECNGT I S AHCNLHL PGS SDS PAS S S RVAG I TG I KTNTI 
CKKCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPySC 
EQCKQQCAFDRKDDRKKVDGKLLCWLCTtiS YKRVLQKTKEQRKHLS S 
SSRAGHQEKEQYSRLSGGGHYNSFSPDLALDSPGTDHFVIIAQLKEE 
VATLKKMLHQKDQMILEKEKKITELKADFQYQESQMRAKMNQMEKTH 
KEVTEQLQAKNRELLKQAAALS KSKKSEKSGAITS P 

5959 

A 

2973 

3400 

GIIFMCPDMYT\SKTFLKACLFALFFFIIYPIYFFLVGIMEEYIKMC 
NI PQVGI FS FKYLP * SLS V* FF FETES CFVTQAGVLWCYLSSLQPPP 
PGFKRFSCLSLSSSWDYRHLPPCPANFC/IFSRDGVSPCWPG*SQTP 

DLR 

5 960 

A 

C. A c: Q 

/ W / *i 

LEARAGSYRPDLPGSHYLLAFFFFFFRRSLTLF\SQAGVK\WCDLGS 
SQPPSPGFKQFSCLSFLSSWDYRHAPPYRG*SFFFFFFFVFLVETGV 
SPCWARL\\TLNARPQVIHPPRASQSA\GTTAVSHHAQPIjCLFIYLFI 
YFY/CIFLRLSFALGPQARVQWHNLSSLQPPPPGFKQFSCLSLPSSW 
DYRWPPPHPANFVFLLRYLR 

5961 

A 

1 

361 

vcypdrdcgeslvaasyrqpvphqcgwfhplkpa/ shpdtvecvfrk 
tpkipqkhsgspr\pppspsyrgrflfrfhppqhgaqgpsppplgrk 
hrpshtgsaalracpppgaaglhswgql 

5962 

B 

27 

ZOO 

MRKVHVSTVTPNYAGGEPKRFRTAYTRQQVLELEKEFHYNRYLTRRR 
RVEIAHALCLSERQIKIWFQNRRMKWKKDHKLPNTKIRSGX* 

5963 

A 

1 

825 

MSSFLINSNYVDPKFPPCEEYSQSDYLPSDHSPGYYAGGQRRESSFQ 
PEAGFGRRAACTVQRYAACRDPGPPPPPPPPPPPPPPPGLSPRAPAP 
PPAGALLPEPGQRCEAVSSSPPPPPCAQNPLHPSPSHSACKEPWYP 
WMRKVHVSTGRYSLSEVGSWGGVEEVEEEKEEESNPNYAGGEPKRSR 
TAYTRQQVLELEKEFHYNRYLTRRRRVEIAHALCLSERQIKIWFQNR 
RMKWKK\DHK\LPNTKIHSGGAAGSAGGPPGRPNGGPR70j 

5964 

A 

656 

1165 

PPSPLSVGCLSFSFFFPFFFFIQSSLPKSFKRKISWSTKGVPAGNS 
DTEGGQPGRKRRWGAStATTQKKPS I S ITTESLKSL I PD I KPLAGQE 
AWDLHADDSRISEDETERNGDDGTHDKGLKICRTVTQWPAEGQEN 
GQREEEEEEKEPEAEPPVPSQVSVEVAli 

5965 

A 

1 

4033 

MWRRKHPRTSGGTRGVLSGNRGVEYGSGRGHLGTFEGRWRKLPKMPE 
AVGTDPSTSRKMAELEEVTLDGKPLQALRVTDLKAAIjEQRGLAKSGQ 
KSALVKRLKGALMLENIiQKHSTPHAAFQPNSQIGEEMSQNSFIKQYL 
EKQQELLRQRLEREAREAAELEEASAESEDEMIHPEGVASLLPPDFQ 
SSLERPELELSRHSPRKS SS I SEEKGDSDDEKPRKGERRSSRVRQAR 
AAKLSEGSQPAEEEEDQETPSRNLRVRADRNLKTEEEEEEEEEEEED 
DEEEEGDDEGQKSREAPI LKEFKEEGEE I PRVKPEEMMDERPKTRSQ 
EQEVLERGGRFTRSQEEARKSHLARQQQEKEMKTTSPLEEEEREiKS 
SQGLKEKSKSPSPPRLTEDRKKASIiVALPEQTASEEETPPPLLTKEA 
SSPPPHPQLHSEEEIEPMEGPAPPVLIQIiSPPNTDADTRELLVSQHT 
VQLVGGLSPLSSPSDTKAESPAEKVPEESVtiPLVQKSTLADYSAQKD 
LEPESDRSAQPLPLKIEEIiALAKGITEECLKQPSLEQKEGRRASHTL 
LPSHRLKQSADSSSSRSSSSSSSSSRSRSRSPDSSGSRSHSPLRSKQ 
RDVAQARTHANPRGRPKMGSRSTSESRSRSRSRSRSASSNSRKSLSP 
GVSRDSSTSYTETKDPSSGQEVATPPVPQLQVCEPKERTSTSSSSVQ 
ARRLSQPESAEKHVTQRLQPERGSPKKCEAEEAEPPAATQPQTSETQ 
TSHLPESERIHHTVEEKEEVTMDTSENRPENDVPEPPMPIADQVSND 
DRPEGSVEDEEKKESSLPKSFKRKI SWSATKGVPAGNSDTEGGQPG 
RKRRWGASTATTQKKPS I S I TTESLKSL I PD I KPLAGQEAWDLHA\ 
DDSRI S \ ED\ ETER\NGD \DGDP *QRGLKICRTVTQWPAEGQENGQ 
REEEEEEKEPEAEPPVPPQVSVEVALPPPAEHEVKKVTLGDTLTRRS 
ISQQKSGVSITIDDPVRTAQVPSPPRGKISNIVHISNLVRPFTLGQL 
KELLGRTGTLVEEAFWIDKIKSHCFVTYSTVEEAVATRTALHGVKWP 
QSNPKFLCADYAEQDELDYHRGLLVDRPSETKTEEQGIPRPLHPPPP 
PPVQPPQHPRAEQREQERAVREQWAEREREMERRERTRSEREWDRDK 
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VREGPRSRSRSRDRRRKERAKSKEKKSEKKEKAQEEPPAKLLDDLFR 
KTKAAPCIYWLPLTDSQIVQKEAERAERAKEREKRRKEQEEEEQKER 
EKEAERERNRQLEREKRREHSRERDRERERERERDRGDRDRDRERDR 
ERGRERDRRDTKRHSRSRSRSTPVRDRGGRR 

5966 

A 

3 

5979 

KSSSISEEKGDSDDEKPRKGERRSSRVRQARAAKLSEGSQPAEEEED 
QETPSRNIiRVRADRNLKTEEEEEEEEEEEEDDEEEEGDDEGQKSREA 
PILKEFKEEGEEIPRVKPEEMMDERPKTRSQEQEVLERGGRFTRSQE 
E/^RKSHLARQQQEKEMKTTSPLEEEEREIKSSQGLKEKSKSPSPPRL 
TEDRKKASLVALPEQTASEEETPPPIiLTKEASSPPPHPQLHSEEEIB 
PMEGPAPAVL I QL S PPNTDADTRELLVSQHTVQLVGGLS PLS S P SDT 
KAESPAEKVPBESVLPLVQKSTLADYSAQKDLEPESDRSAQPLPLKI 
EELALAKGITEECLKQPSLEQKEGRRASHTLLPSHRLKQSADSSSSR 
SSSSSSSSSRSRSRSPDSSGSRSHSPLRSKQRDVAQARTHANPRGRP 
KMGSRSTSESRSRSRSRSRSASSNSRKSLSPGVSRDSSTSYTETKDP 
SSGQEVATPPVPQLQVCEPKERTSTSSSSVQARRLSQPESAEKHVTQ 
RLQPERGSPKKCEAEEAEPPAATQPQTSETQTSHLPESERIHHTVSH 
PPPSTPNPQPFSPCWCLQILRISRPLLIKQLIKHSEEKEEVTMDTSE 
NRPENDVPEPPMPIADQVSNDDRPEGSVEDEEKKELESLRRCQPQLS 
EEKYSDLAAECLPGPGVFTYPQATFIRGSMIPLAATKGVPAGNSDTE 
GGQPGRK\RRWGT\STA\TTQKKPSISITTESLKSLIPDIKPLAGQE 
AWDLHADD SR I SEDETERNGDDGTHDKGLKI CRTVTQ WPAEGQEN 
GQREEEEEEKEPEAEPPVPPQVSVEVALPPPAEHEVK\KVTLGDTL/ 
TLRRSISQQKSGVSITIDDPVRTAQVPFPPRGKl\SNIVHISNLVRP 
FTLGQLKELIiGRTGTLVEEAFWIDKJKSHCFVTYSTVEEAVATRTAL 
HGVKWPQSNPKFLCADYAEQDELDYHRGLLVDRPSETKTEEQGIPRP 
LHPPPPPPVQPPQHPRAEQREQERAVREQWAEREREMERRERTRSER 
EWDRDKVREGPRSRSRSRDRRRKERAKSKEKKSEKKEKAQEEPPAKL 
LDDLFRKTKAAPCIYWLPLTDSQIVQKEAERAERAKEREKRRKEQEE 
EEQKEREKEAERERNRQLEREKRREHSRERDRERERERERDRGDRDR 
DRERDRERGRERDRRDTKRHSRSRSRSTPVRDRDLQRSAAPAPIGSG 
LCPGTRRQARLWRFPRPGPLAVPGSREHPGPALLRTRRSWVWNQFFV 
IEEYAGPEPVLIGKLHSDVDRGEGRTKYLLTGEGAGTVFVIDEATGN 
IHVTKSLDREEKAQYVLLAQAVDRASNRPLEPPSEFIIKVQDINDNP 
PIFPLGPYHATVPEMSNVGTSVIQVTAHDADDPSYGNSAKLVYTVLD 
GLP F FS VDPQTGVVRTAI PNMDRETQEE FL WIQAKDMGGHMGGLS G 
STTVTVTLSDVNDNPPKFPQSLYQFSVVETAGPGTLVGRIiRAQDPDL 
GDNALMAYSILDGEGSEAFSISTDLQGRDGLLTVRKPLDFESQRSYS 
FRVEATNTL IDPAYLRRGP FKD VAS VRVAVQD APEP PAFTQAAYHLT 
VPENKAPGTLVGQI S AADLDS PAS P I RY S I LPHSDPERC F S I QPEEG 
TIHTAAPLDREARAWHNLTVLATELDS S AQASRVQVAI QTLDENDNA 
PQLAEPYDTFVCDSAAPGQLIQVIRALDRDEVGNS SHVS FQGPLGPD 
AN FTVQDNRDG S AS LLLP S RPAP PRHAP YL VP I EL WD WG Q PALS S TA 
TVTVSVCRCQPDGSVASCWPEAHLSAAGLSTGALLAIITCVGALLAL 
WLFVALRRQKQEALMVLEEEDVRENI I TYDDEGGGEEDTEAFDITA 
LQNPDGAAPPAPGPPARRDVLPRARVSRQPRPPGPADVAQLLALRLR 
EADEDPGVPPYDSVQVYGYEGRGSSCGSLSSLGSGSEAGGAPGPAEP 
LDDWGPLFRTLAELYGAKEPPAP 

5967 

A 

16 

208 

LLPYSTNPPASASRLAVIAGVHHHTQL\IFFLVETGSHCVTQSGLKL 
PAS SNPPTGLTMLGLQV 

5968 

A 

1 

932 

MKFLLDILLLLPLLIVCSLESFVKLFIPKRRKSVTGEIVX.ITGAGHG 
IGRLTAYEFAKLKSKLVLWDINKHGLEETAAKCKGLGAKVHTFVVDC 
SNRED I YSS AKKVKAE IGDVS ILVNNAGWYTSDLFATQDPQ I EKTF 
\EVNVIJUiFW\TTKAFLPAMTKNNHGHIVTVASJ\AGHVSVPFLLAYC 
S SKFAAVGFHKTLTDELAALQITG VI S \ LCLCPNFVNTGF I KNPSTS 
LGPTS S CSRVGPKLVPEEWNRLMHG I LTEQKMI F I PS S I AFLTTLE 
RILPERFLAVLKRKIS VK\ FEAVIG YKMKPH 
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5969 

A. 

1 

1202 

IHTGENPYECHECGKAFSRKYQLISHQRTHAGEKPYECTDCGKAFGL 
KSQLIIHQRTHTGEKPFECSECQKAFNTKSNLIVHQRTHTGEKPYSC 
NECGKAFTFKSQLIVHKGVHTGVKPYGCSQCAKTFSLKSQIiIVHQRS 
HTGVKPYGCSECGKAFRSKSYLIIHMRTHTGEKPHECRECGKSFSFN 
w yxi x VHQR InTGENP YECSECGKAFNRKDQL I SHQRTHAGEKPYGCS 
ck^kdisjw tooivto x JjXXJ1MK1H^(jEKPYECNECGKAFIWKSLLIVHERTH 
AGVNPY\KCSQCEKSFSGEITPSLLHQRMHTTEKPYECSECGKAFIR 

TECGKAFCWKSQLIMHQRTHVDDKH 

5970 

A 

153 

571 

ajaj vi iur x ir *3±jiwj\^txji\.\^jaui ioijxj\*.\£Xj1 obibo XyjC* X JSa3AC-QAASSPN 
WTYALENGFFDEEEEYFPEQNSLHDRRDRGPPRDLSLP\APPSPAAT 
GFWSPSRGSWEGCDCWEALPTGHAVIICCFSCKKAPPLWTVLGHRG 

5971 

B 

223 

347 


5972 

A 

2 

283 

LHSRLDGAAICLPGHCHGSRVEVTYETHQCGLVGLKLLRGGREQSDR 
GGSLGLT*DGQLGPPAEQRTTQAVIPCCAVIMSLILRLEPTDGQGG 

5973 

A 

3 

231 

RRQAQIAAGRVLVVALFGGCGGLHSRLDGAAI/CLPGHCHGSRVEVT 

5974 

A 

2 

385 

APFMRDPAAVASVGF/PLLGRQAQVAAGRILWALFGGFGGLHSHLD 
G AA I CL PGHCHG SRVE VT YETHQCGL VGLKLLRGG WEQS DRGG SLGL 

iCiUOVVVjrtaoJllJoX X onr iulAvi V oXiXLRXjEPGDE f 

5975 

A 

3 

539 

RRQAQ I TAGRVL WALFGGCGGLHS RLDGAA I CL PGHCHG SRVEVTY 
ETHQCGLVGLKLLRGGREQSDRGGSLGLT*DGQLGPSAEHPLITTTI 
ftiu i v x VLjijxir'wXjUi^wUw^^wRVPKAGA\ElTOSGIPAGRSAS * *MTN 
TGARPGFCWYHCTLLSPIFSPPQVILAVFPGATDTEGG 

5976 

A 

1 

351 

RGGWEQSDRGGS LGLTEDGQLGAS S EHRTMQAVI PCCTVI I SL I LRL 
c £> i JJuvv^'bAKJjbAKlSbVSDP'GPL X XTIDBEFGGPS WDLLVPVYN 
INSDVGVASRTRDGDPLPGGPRR 

5977 

B 

78 

448 

XQRVTISCTGSSSTIGAGYDVHWYQQLPGTAPKLLIYGNSNRPSGVP 
jjk i* w w b Aw w 1 oAS LjA I TGLQAEDE AD Y YCQS YD S S LS G YWVFGGGTK 
LTVLGQPKAAPSVTLFPPSSEELQANKATL* 

5978 

A 

1 

393 

RVLWALFGGCGGLHSRLDGAAICLPGHCHGSRVEVTYETHQCGLVG 
L KLLRGGREQNDRGG S LGLT * DGQLGPS AEH PS GTTTS S I *TAI IIS 
LGLSLEPINCQGGRGGTLGSIEIYNKPLRADYRPHKS 

5979 

A 

3 

389 

LPG<5PGTMVAFGGLPTVQG\RVQGALAPVLVGACLEGCGGLilSRLDG 
AAICLPGHCHGSRVEVTYETHQCGIiVGLKL\LRGGREQSDRGGSLGL 
uxjvjKoAWw IjKK X PHDKDHWCLIj IDSNSQPRLLPGRH 

5980 

A 

1 

908 

AGHRGRRAFRGKGQGQRGSRLGAGFSGWDKKWWVKGVSVRAEIEGVT 
GVPARDNSIQAGGREVGGYALRRRGRNERAEGLVFSDQTSPIRASFT 
KKllFwKXjl_Ab PbOPDSHWVPGFPHP 

F S LTMR I F I E KGWMTWDGGRDPS PGNLQLQAP VGGVRVGGLKAFCRG 
n^uLinurtr r i v iKXJi<H>W/\o VwXjPXiXjljrRQAQ VAAGRIL WALFGGFGGIi 
QS HLDRAA I CL PSH \ CTG PR VKI HFKKNHVGL VGLKLLRGG WEQSDR 
GGSLGLTEDGQLGASSEHSTQ 

5981 

C 

60 

392 

MGYKKGEGGEGWGENAX^VlxAECKRCWPFXXXXXXXXXXXXXXXXXXX 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

XX R KKKA A V CiDU A D A W 

5982 

C 

176 

385 

MCVSGxTC<XXXXXXXXXXXXXXXXXXXXXXEGSTSFTFCKHICIFTP 
PFPSFSLFISHFYIDLLFYNKTL 

5983 

A 

1876 

2540 

FLQLGSWLGRGSCQVSRGPGQPRSDNLLVEPKDLKGNLTHFTTLGGF 
HLLYMMIWIHPPINLCSGSNFFFLFFFFFSFSLRLGLALLPQAGVEW 
RDLGLLQPLP \ PRLEQS C\ PS SGS S WDRRFMP PWPANFCM / FL * RWG 
LTVLPRLVSNSWAQ/ CDPPVSASQSAGITIVSHHVQLEGSTSFTFCK 
HICIFTPPFPSFSLFISHFYIDLLFYNKTLLPKKKKK 

5984 

A 

1 

393 

AEVAELKAEKENSEAQVENAQRIQVERLKQECRNFRSQAEKAQLEAE 

KTLEEKQIQWLEEKHKLHERITDREEKTOQAK^ 

LHENKLKRLQEKWVLEAiw<EELETENQVlxNRQNVPF 
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5985 

A 

1 

1427 

^fRERFRNLDEEVEKYRAVYNKLRyEHTFLKSEFEHQKEEYARILDEG 

KI KYE S E IARLEEDKEELRNQVLNVDIjTKDS KR VEQLAREKVYLCQK 

LKGLEAEVAELKAEKENS EAQVENAQR I QVRQLAEMQATVRSLEAEK 

QSANLRAERLEK/DSLQSSSEQNTFLINKLHKAEREINTLSSKVKEL 

KHSNKLEITDIKLETARAKSELDRERNKIHSELD\YLHLDNEILKAA 

VEHHKVLLVKKDREL I RKVQAAKEEG YQKL WLQDE KLELENRLADL 

EKMKVEHDVWRQSEKDQYEEKLRASQMAEEITRKELQSVRLKLQQQI " 

VTIENAEKEKNENSDLKQQISSLQIQVTSLAQSENDLLNSNQMLKEM-* 

VERLKQECRNFRSQAEKAQLEAEKTLEEKQIQWLEEKHKLHERITDR 

EEKYNQAKEKLQRAAIAQKKRKSLHENKLKRLQEKVEVLEAKKEELE 

TENQVLK 

5986 

A 

2 

1831 

NYKTLI I ICALFTLVTVLLWNKCSSDKAIQFPRRSSSGFRVDGFEKR 
AAASESNNYMNHVAKQQSEEAFPQEQQKAPPWGGFNSNVGSKVLGL 
KYEEIDCLINDEHTIKGRREGNEVFLPFTWVEKYFDVYGKWQYDGY 
DRFEFSHSYSKVYAQRAPYHPDGVFMSFEGYNVEVRDRVKCISGVEG 
VPLSTQWGPQGYFYPIQIAQYGLSHYSKNLTEKPPHIEVYETAEDRD 
KNKPNDWWPKGCFMANVADKSRFTNVKQFIAPETSEGVSLQLGNTK 
DFIISFDLKFLTNGSVSWLETTEKNQLFTIHYVSNAQLIAFKERDI 
YYGIGPRTSWSTVTRDLVTDLRKGVGLSNTKAVKPTKIMPKKVVRLI 
AKGKGFLDNITI STTAHMAAFFAASDWLVRNQDEKGGWP IMVTRKLG 
EGFKSLEPGWYSAMAQGQAISTLVRAYLLTKDHIFLNSAJLRATAPYK 
FLSEQHGVKAVFMNKHDWYEEYPTTPSSFVI^GFMYSLIGLYDLKET 
AGEKLGKEAR SL YERGMES LKAML \ PLYDTG SRT I YDLRHFMLGI AP 
NLARWDYHTTHINQLQLLSTIDESPIFKEFVKRWKSYLKGSRAKHN 

5987 

A 

34 

332 

LLRQGLALLPRLECSGTISAYCNI>CLPGSNHPP\VSVSQVAGTTGVH 
HHAQL I FVFWVETGFCHVAQYGLEFLGS SNPPALTS PS AG ITGVSHC 
IWAQVF 

5988 

A 

3 

396 

PAGGPAAGRAAARAGAMAKLLSCVLGPRLYKIYRERDSERAPASVPE 
TPTAVTAPHSSS / WGYLS LS KWPFSHYAGTLLLLLAGVACLRS IGR 
WTNPQYRQFITILEATHRNQSSENKRQLANYNFDFRSW 

5989 

A 

3 

416 

GTCPFYLCPPETWSRPFLLTSNRGNDLLLKLLQHRYPRVMAEEGLR 
VVRQWLEASSQLEEASIYSRWEVEEDWCLSVLRSYQAEHGPDFPWSV 
GEDMSADGRRQLALFIiARKHLHNFEGTYWNPIPKPKTFKRPWH 



■> 


LIALRTCTESPGTTHASADAWEAKLLSCVLCPRLYKIYRERDSEGAP 
ISETETPTADTAPHSSSWDTYYQPRALEKHADSILALASIFWPISYY 
SCTFAFFYLYRKGYLSLSKWPFSHYAGTLLLLLAGVACLR\GIGR\ 
WTNPQYRQFITILEATHREPSLQKTKRQLANYNFDFRSWPVDFHWEE 
PSSRKESRGGP\SRRGVALLRPEPLHRGTADTLIiNRVKKL,PCQITSY 
LVAHTLWRRMLYPGSVYLLQKALMPVL 

5991 

A 

461 

660 

HTRTVTQSLPCWYFSQTSGRSPRLVKECNGPWIQSIKHLGCLLNQGH 
SGWGCLPSSSSGSQVQDHV 

5992 

A 

1 

1774 

MSSSEEVSWISWFCGLRGNEFFCEVDEDYIQDKFNLTGLNEQVPHYR 
QALDMILDLEPGIGRWTNPQYRQFITILEATHRNQSSENKRQL\VYY 
FFDFRSWPVDFHWEEPSSRKESRGGPSRRGVALLRPEPLHRGTADTL 
LNRVKKLPCQITSYLVAHTLGRRMLYPGSVYLLQKALMPVLVQGQA\ 
RLVEK\CNG\RRAKL\LACDGNEIDTMFVDRRGTAEPQGQKLVICCE 
GNAGFYEVGCVSTPLEAGYSVLGWNHPGFAGSTGVPFPQNEANAMDV 

VDQFAIHRLFFHPQDIIIYA\LAIGGFTATWAAMSYPDVSAMILDAS 
FDDLVPLALKVMPDSWRGLVTRTVPOHT /NTT .WMzxprnT HDvrv^mrr ttd 

KTQDGNLTPTVP\KDICSNRANCLLVKAI^\HRYPG\VMAEEGLLVA 
RQWLEASSQLEEASIYSRW\EVEEDWCL\SVLR\SY\QAEHGADFPW 
\SVGEDMSA\DG\RRQAGPLFPGLGKASCHNFGGPTHCTP\LPSPQN 
FPRCPWHPLGTQLGLIMEEWGERRHEERPFLFVILLCFMFAVYKFCG 
KVGGPFPLLTTVPLARFPSFMWLYLTFSNIHPALHEMG 

5993 

A 

3 

180 

DLKKHEVKHLIDKLEEKFRLNRRELIAFEFPVLVALEFALHLPEHEV 
MPHYRRLVQSS 
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5994 

A 

1 

1015 

MI APLGLG VLARDQ YRQ AALADWDLKLDGGRQS TG AVS LKE 1 1 GLEG 

VELGADGKTVSYTQFLLPTNAFGARRNTIDSTSSFSQFRNLSHRSLS 

IGRASGTQGSLDTGSDLGDFMDYDPNLLDDPQWPCGKHKRVLIFPSY 

MLCTQLPCSQAFKCSTVHSSDLSLHERNEHPCKKHNCLLQEGHSPLQ 

APGFRRAL VVAPMQVF PLLMALG VTTV I D YVKP S \ DLKKDMNKFFKE 

KFPHIKLTLNKIRTLKRE\MRKIiAQEDCALEEPTVPMAF\VYFEKLA 

LKGKLNKQNRKLGAGACVnjVSSQYKFGSD 

RLNRARTG F AF 

5995 

A 

48 

327 

EELEALRR\ QRLAELQAKHGD PGDA\ AQQEAKHRE AEMRNS I LAQVL 
DQSARAR\VSEQGLIEILKKVSQQTEKTTTVKFNRRKVMDSDEDDDY 

5996 

A 

52 

281 

VSKLKHLLKPEKTK\AVENY\LIQMARYGQLSEKVSLDSLEELYCYL 
LYQNMAS KGQLHLHW I TE FLLTLRRNC WRE 

5997 

A 

1 

280 

GNPLF FLELHWN PSLR I QACDR I YRVG \ QQK\ DVS I PRF VCGGTVEE 
SILHFQEKKIDLAKQFLSGSGESVTKLTLSDLRVLIGIYPPVDRVSE 

5998 

A 

36 

512 

LFVLLQVl^SGNELCLMTSHLESTRGHAAERMNQLKMVLKKMQEAPE 
SATVIFAGDTNLRDREVTRCGGLPNNIVDVWEFLGKPKHC\QYT\WD 
TQTELLILGITA\ACKLRFDRIFFR\AAAEEGHIIPRSLDLLGLEKL 
DCGRFPSDHWGLLCNL\DIIL 

5999 

C 

205 

519 

MGKEG VHGGL I NKKC YEMAF PLQ AFPVLTS S VP S P S PL PTALHPFXX 
HTHTNHFIFWAITPYPLLLPKPHGLGARAGWTDTSPYPYPSRVWLEN 
FCFLGFFFSE* 

6000 

A 

3 

939 

FLKLHLQSSVSGLASGGGREWGAGSGVRGLLHSERSRGPDGSACPEL 
SASPRPASRGSSSPVRGRQQQRPRAAPVAAPWPCWN\AYIDNLMADG 
T\ CVTGAI VRYQDSEPPVWA\ AVPGKT\ FVNI \ TPA*GGLSLVGKD\ 
RSSFLR*MGLTLWGGQK\CSVIRGLTCLQDGEF/ SAMDLRYQEPPGG 
APTFQCSLFTKT\ DKDG * VLADGAKEGVHGGLT\ NTCKC YEMASHLSG 
FPSTDHRLSLPLHRSPTAFAPLSSPYTHHPFLFFGPLPHTPYCCQNH 
MGLGGQGLMGQTPSPTHIPSRVWYGKLLFFGVFFFSE 

6001 

A 

172 

407 

RKTFFFLKCSLYKQKLWWISVAIHRSLSRKVILSSVFIKKQSDFYQL 
PFFNKS CFKEVFS I S VYSMETNTWNL YYS YT 

6002 

A 

1 

185 

ESSCLSLPSSWDHRHGRPWLADFSVFF/ CYRUS LTMLPRLGL\NSWP 
QAILPPWPPKVLGLQA 

6003 

A 

1 

618 

MVRTQCLLGLRTFVAFAAKLWSFF I YLLRGQ IRTVI Q YQTVRYDI LP 
LSPVSRNWLAQWIDKHSVRFFVRKRPHVDFFLEWSQWYELWFTE 
SMEIYGFAVAGKLDNSRSILKRRYYRQHCTLQLSSYIKDLSWHSDL 
SSIVILDNSLGAYRSHPDNAIPIKSWFSDPSDTALLNLLPMLDALRF 
TADVHSVLSRNLHQHRLW 

6004 

A 

1 

478 

HPVRFFVHRGPH/VDFSLEWSQWYELWFTASMEIYGSAVADKLDN 
SRS I LKRRYYRQHCTLELGS Y I KDLS WHSDLS S I V I LDNS PG\ AYR 
SHPDNAIPIKSWFSDPSDTALLNLLPMLDALRFTADVRSVLSRNLH\ 
QHRPLTGTTGGTLGEPNQCL 

6005 

A 

2 

833 

WNSAELGRGGPGAGGAGVIGMMRTQCLLGLRTFVAFAAKLWSFFIYL 
LRRQ I RTV I Q YQTVR YD I LPLS P VS RNRLAQVKR KI L VLDLDETL I H 
SHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVHKRPHVDFFL\EV 
\ VSQWYELWFTASM\EI YG\ SAVGRLNWDNSRS I L *GGRY\ YRQPA 
PLELG\SYIKGPLCWFHSDLS\SIVIL\DNSPGAYRSHPGYGGR\DN 
AIPIKSWVSVTPS\DTALLNLLPMLDALRFTADVRS\VLSRNLHQHR 
LW 

6006 

A 

2 

25 8 

ISAISoKo Vir \KJuxl»V-oljlr Jl J-iAVjUriJUrlljJt'vjoCxlr Jr Ao/\oy VAkj J. LrU\xsJtrx 
TQLI F\ AFLVENGLC * PGWSRSPDLVI RLPWPPKVLGLQA 

6007 

A 

60 

429 

IRGQYMWGPVSS SLFG? SMGYLVFGFFS F I FFFCDEVLLLS TRLECNG 
TISG\HCNLCLPGSSHSPVSASRVAGTTGARNKARLIFLYFFLVEMG 
FHHISQDGLDLL/NLVIRPPWPPKVLGLQA 

6008 

A 

1 

1047 

SCGWTFTSMS KLLRHRRKHDDDRRFTC PVEGCG KS FTRAEHLKGHS I 
THLCTKPFECPVERCCIARVSARSSLYIHSKKHVQDVGAPKSRCPVST 
CNRLFTSKHSMKAHMVRQHSRRQDLLPQLEAPSSLTPSSELSSPGQR 
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ELTNMDLAALFSDTPANAS GSAGG SDEALNSG I LTIDVTS VS S SLGG 
!^PAmSSLGPMEPIiVLVAHSDIPPSL\DSPLVIiGTAATVLQQGSFS 
VDD VQTVS AG ALG CLi VAXi PMKN Jj S DD PLALT \ S N SNLAAH I TTPTS S 
bl PR£N A£> VPr.JjJ_iAPl iv V JbPiJS PSKPGAVGQQEGSHGL»PyoTLiPSPA 
EQHGAQDTELS AGTGN FYLV 

6009 

A 

66 

423 

MEVACRQSTSWQPMDLASSSDFRPRMQNPTTNRHSQANSCLQGDSGP 
RRMSASKRYFEKTOAICRPSSFTKHSRESSGSPGVSRSCTGTRF*NV 

TVAOVPTC C VLTD V r T t f- , V\TV\rO Y OTHJ 

livyrjf ioolVrlKl\.lvjl\.V ivVKi.0 V V 

6010 

A 

44 

283 

FSCLILHDAFPCI.ILHDAFPCSLCSVTGSLVLSRVPIiRGTHQKSAlA 
TSTKIALSNM/KILKHHTDAYFKKQQLFFDVSKK 

6011 

C 

107 

118 

MAV* 

6012 

A 

197 

683 

SLEVPLGFLLYLSFILLYSCFPLSNRVLLCHRDPFLSFLDLFLLPAF 
SPDYVQFLFL*C\PQFSIRH*TL*MLHCKCLDFFFFFFFFETRFHSP 
\ S LRLECSGMI S AYCS \ PTSWTQVRGTTGTCHHAWliV F I F SVETGFH 
HVAQPDLQLLP\ PRPPKVLRLQV 

6013 

A 

1 

415 

LWREGPKWN/AARLNESTTFVSVSRPTIACGMVGTIVRFYIKSP\HIi 
FKYAADPRDKHWL\ AEQHHMRATGG\ KMAYIililEEDI PGPCGPVMIT 
EDALDLKLEGIEIPLSLPSWMVEKMRKYMETLRTENEHRAVEAP\PQ 
T 

6014 

A 

1 

1256 

VGDNSIDSWKNAGRVFKDSDKFDANDPILKDQTQEWSGSATFTSDGK 
IRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKINGVEDHKTIFDGD 
GKT YQNVQQF I DEGNYTS GDNHTLRD PHYVEDKGHKYLVFEANTGTE 
NGYQGEESLFNKAYYGGGTNFFRKESQKLQQSAKKRDAELANGALGI 
IELNlTOYTLKKVMKPLITSNTVNEGKEEPGRPAGEVSMGCIiASEKGG 
KSSRS PEDPEKLPIAKPNNAYEFGQI INALSTRKDKEACAHLLAITA 
PKDLPMFLSNKLEGDTFLLLIQSLKNNLIEKDPSLVYQHIiLYLSKAE 
RFKGFCRFRNHHQTGFSPAGANQRGPLAATIiSGPGGEGQSAVARLTG 
EKKNHPGAQYANRL S PRVGRF I NAAGTTGF PTGKRAVS ATQL 

6015 

A 

2 

1594 

GRPARGAPQRGQTPEAGADKRATAGLCGGGGGRRRHRASGRRAGRGE 
P\AGLKSQGQRAVPKRAVARGGRQ\YSAAIALLEPAGSEIADDLSIL 
YSNRAACYLKEGNCSGCIQDCNRALELHPFSMKPLLRRAMAYETLEQ 
YGKAYVDYKTNOiQIDCGLQLANDSVNRLSRII^ELDGPNWREKLSLI 
PAVPASVPLQAWHPAKEMISKQAGDSSSHRQQGITDEKTFKALKEEG 
NQCVNDKNYKDALSKYSECLKINNKEC^IYTNRALCYLKLCQFEE^ 
QDCDQAIiQIxADGNVKAFYRRAlxAHKGL^^ 

IEAKMELEBVTRLIjNLKDKTAPFNKEKERRKIEIQEVNEGKEEPGRP 
AGEVSTGCLASEKGGKS S R\ S PEDPE KLP I AKPNNAYEFGQ I INAL S 
TRKDKE ACAHLIiA I TAPKDL PMFLSNKLEGDTFLLL I QS I»KNNL I EK 
DPbitV i yHlili I lib KAKKr KTaMLi rjbl SKGQKEIjXEQLFEDJjSDTPNNH 
FTLEDIQAIiKRQYEL 

6016 

A 

1 

449 

MAEGEITTFTALTEKFNLPPGNYKKPKLLYCSNGGHFLRILPDGTVD 
GTRDRS\DQHIQLQLSAESVGEVYIKSTETGQY\LAMDTDGLLYGSQ 
TPJNEEUJLF1j&KIjE£i \NHYNTYI SKKHAEKISWFVGijK\GPKTriYGQKA 
ILFLPLPVSSD 

6017 

A 

1 

153 

FNLPPGNYKKPKLLYCSNGGHFLRILPDGTVDGTRMDTDGLLYGSQT 
PNEE 

6018 

A 

1 

528 

MAEGEITTFTALTEKFNLPPGNYKKPKLLYCSNGGHFLRILPDGTVD 
GTRDR S DQH S GLNY S WTMHQRLQALRTNDHFAG AHA VPS DAQVMGMD 
AG VYTRKP I PTS TT I TLO TTKS T YVHDAIj FRAG P KMWP YT5SHVTGAG 

w x x mvr x x ^? x x jl, x xj^^ x x iv^j xxv nx/AU x jr i\i u< xr x jo w i i v x unvj 

YETYLRQQWECPQVNKLEHVQNRAGNYQPLPSKKI 

6019 

A 

223 

515 

QRGPLPEDPSGWHSGWDKGQERPAHRHQMRNVCSWKGWRRTITTPIY 
PRSMQRRIGLLASRRMGAANAVLGLTMARKQSC*A^lAEGEITTFTAIi 
TEKFNLPPGNYKXPKLLYCSNGGHFLRILPDGTVDGTRDRSDQHXQT 
PNEECLFLERLEENHYNTY I S KKHAEK1WFVGLKIQJGSCKRGPRTHY 
GQKAILFLPLPVSYNSRDLFWEG 

6020 

A 

1 

812 

HAEGEITTFTALTEKFNLPPGNYKKPKLLYCSNGGHFL»RILPDGTVD 
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GTRDRSDQHSGLNYSWTMHQRLQALRTNDHPAGAHAVPSDAQVMGMD 
AGVYTSSKTIAAFVIVLLYHSQVHLSVSLIMQKPQVQGPWLALSQQIi 
QEDVGLENWNEVTLESWTPSQLHKPLSLVQLQ\LSAESVGEVYIKST 
ETGQY \ LAMDT\DGLLYGSQTPNE\ EC\ LFLGGLEENHYNTY I S K\ K 
HAEKNWFVGLKKNGSCKRGPRTHYGQKAIIiFLPLPVSSD 

6021 

A 

1 

1471 

MESMFSSPAEAALQRETGVPGLLTPLPDLDGVYELERVAGFVRDLGC 
ERVALQFPDQLLGDAVAVAARLEETTGSKMFILGDTAYGSCCVDVLG 
AEQAG AQAL I H FG P ACLS P PAR PLP VAF VLS S TFCGLGTLCQDL WGP 
KPRPQSACGAA\GEPACAHAIiEALATIiLRPRYLDLIiVSSPAFPQPVG 
SLSPEPMPLERFGRRFPLAPGRRLEEYGAFYVGGSKASPDPDLDPDL 
SRLLLGWAPGQPFSSCCPDTGKTQDEGARAGGLRARRRYLVERARDA 
RVVGLLAGTLGVAQHREALAHLRNLTQAAGKRSYVIJUjGRPTPAKIA 
NFPEVDVFVLLACPLGALAP\QLSGRLFQPILAPCELEAACNPAWPP 
PGLAPHLTH Y ADLLPG S PFHVAL PP PE S EL WET PD VS L I TGDLRP P P 
AWKSSNDHGSIiALTPRPQLELiAESSPAASFliSSRSWQGLEPRLGQTP 
VTEAVSGRRGIAIAYEDEGSG 

6022 

A 

1 

343 

PTRPPTRPRTHGQECPLLDPVDFLLFRTRAGDPLRRVS S S \ FNKNLI 
FFSIKPQPPPCLAFHPRDPPGGSKRPLFWDPFKGPPILAPIIaSLTQI 
FFRWSCFFPKSRIAQGWALS 

6023 

A 

3 

919 

VCGLIVPQEDMPFCDSGICPDIIMNPHGFPSRMTVGKLIELIiAGKAG 
VLDGRFHYGTAFGGSKVKDVCEDLVRHGYNYLGKDYVTSGITGEPLE 
AYIYFGPVYYQKLKHMVLDKMHARARGPRAA^^ 

PTEGRSRDGGLRLGEMERDCLIGYGASMLLLERLMISSDAFEVDVCG 
QCGLLG YSG WCHYCKS S CHVS S LR I P Y ACKLLFQELQ SMN 1 1 P STAR 
TGSVQLYNFIRQGLGKANPEFPFEESLETRTKCRTHRILQKRFVKAA 
LFNNDGRKEGGGLSEHTSLEQPF 

6024 

A 

3 

275 

GLRLGKI GRD \ CL I G YGASMLLLERLM I S S DAFEVD VCGQCGLLGYS 
GWCHYCKSSCHVSSLP\ IPYACKLLFQELQSMNI IPRIiKLSKYNE 

6025 

A 

455 

680 

AQEYQKWGPEGGGSRAHGTRELSVRCCLGHWSGHGSPRPAIiPPYQL 
LSAASLFQLDALQRHCEILCSQTLSMES 

6026 

A 

1 

2065 

MTASIRRYHTCATDGEPDSSVLVGGDGDLTLLVAALGLDLGLPFMLL 
PPLME WMRVA I T YAEHRRSLTVD S GD I RQ AARLLLP / G P EHCFS S FR 
\ RLDARAATEKFNQDLGFRMLNCGRTDL INQAI EALG PDGVNTMDDQ 
GMTPLMYACAAGDEAMVQMLIDAGANLDIQVPSNSPRHPSIHPDSRH 
WTSLTFAVLHGHISWQLLLDAGAHVEGSAVNGGEDSYAETPLQLAS 
AAGNYELVSLLLSRGAPPLLSMIiEAHGMGSSLHEDMNCFSHSAAHGH 
RG I WGLVTLGPLACLEE EDHETPS PRVPQS S PSGQEGTGGQLRNVLR 
KLIiTQPQQAKADVLSLEEILAEGVEESDASSQGSGSEGPVRLSRTRT 
KALQEAMYYSAEHGYVDITMELRALGVPWKLHIWIESLRTSFSQSRY 
SWQSLLRDFSSIREEEYNEELVTEGLQLMFDILKTSKNDSVIQQLA 
XI FTHCYGS SPIPSIPEI RKTLPARLDPHFLNNKEMSDVTFLVEGKL 
FYAHKVLLVTASNRFKTLMTNKSEQDGDSSKTIEISDMKYHIFQMMM 
QYLYYGGTESMEI PTTD1LELLSAASLFQLDALQRHCE I LCSQTLSM 
ESAVNTYKYAKIHNAPELAIjFCEGFFL»KHMKALLEQ\MPSGSSSTAA 
AAKCRAWI HCRTCRTPWQS ACTL S TS P PG S AA 

6027 

A 

2 

144 

LFYEMES CS I AQAGVQWLS LGIiLQAPP P \ GSCHS PASAS CELMFFKI 

6028 

A 

163 

465 

RSMRRGRVRILl^GQNRDGIVFSTDDYFHHQDGYRYNVNQLGDAHDWN 
QNRAKQAIDQGRSPVIIDNTNIQAWEMKPYVEVAIGKGYRVEFHEPE 
TWWKF&P 

6029 

A 

1 

1614 

MSYGEIEGKFLGPREEVTSEPRCKK1,KSTTESYVFHNHSN7U>FHRIQ 
EKTGNDWVP VT 1 1 DVRGH S YLQENK I KTTDtiHRPLHDEM PGNRPDV I 
ES IDSQVLQEARPPLVSADDE I YSTSKAF IGPI YKPPEKKKRNEGRN 
EAHVLNGlNDRGGQKEKQKFNSEKSEIDNEIiFQFYKEIEELEKEKDG 
FENSCKESEPSQEQFVPFYEGHNNGLLKPDEEKKDLSNKAMPSHCDY 
QQNLGNEPDKYPCNGQVIPTFCDTSFTSFRPEWQSVYPFIVPYGPPL 
PSIiNYHI>NIQRFSGPPNPPSNIFQAQDDSQIQNGYYV™CPiVNWNCM 
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1 TFDQNNEYTDCSENRSSVHPSGNGCSMQDRYVSNGFCBVRERCWKDH 
CMDKHNGTDRFWQQFQEEKUITKLQKLLILLRGLPGSGKTTLSRILL 
GQNRDGIVFSTDDYFHHQDGYRYNVNQIiGDAHDWNQNRATSRLMGMR 
FWYKMmTSMSDNDLGAGIKDMTKSSKNKRETDTLITFSTFDLLRGD 
LTE FCMNFVSNPVQTTMP I IR 

6030 

A 

163 

1917 

SKMSYGEIEGKFLGPREEVTSEPRCKKLKSTTESYVFHNHSNADFHR 
IQEKTGNDWVPVTI IDVRGHS YLQENKIKTTDLHRPLHDEMPGNRPD 
VIESIDSQVLQEARPPLVSADDEIYSTSKAFIGPIYKPPEKKKftNEG 
RNBAHVLNGINDRGGQKEKQKFNS EKS E IDNELFQF YKE IEELEKEK 1 
DGFENSCKESEPSQEQFVPFYEGHNNGLLKPDEEKKDLSNKAMPSHC 
DYQQNLGNEPDKYPCNGQVIPTFCDTSFTSFRPEWQSVYPFIVPYGP 
PLPSLNYHLNIQRFSGPPNPPSNIFQAQDDSQIQNGYYVNNCHVNWN 
CMTFDQNNE YTDCS ENR S S VH PS GNGC S MQDR YVSNG FCE VRERCWK 
UWCl^iOlNGTDRFVNOQFQEEKIiNKLQKLLILLRGLPGSGKTTLSRI 
LLGQNRDGIVFSTDDYFHHQDGYRYNVNQLGDAHDWNQNRAKQAIDQ 
^Ki> fy x x UN i N lyAWEMKP YVEVAIGKGYRVEFHEPETWWKFDPEEL 
EKRNKHGVSRKKIAQMLDRYE YQMS I S I VMNS VEPSHKSTQRPPPPQ 
GRQRWGGSLGSHNRVCVTNNH 

6031 

A 

133 

424 

1 ^^ij^uijrnjjVX^XSDSIiFLMVIDFVNGTFMISFINRIIjIjGQN\RDGI 1 

i vro iui/i rnnyuoixiw vwyLUUAHDWNQNRGLFWAKCPRIEyLARS 1 
QKT 

6032 

A 

3 

270 

FKMESGSVA\KLECRGTISAHCNLRPPGSSDPPASMQIAATTGVHH 
i nivKjaxrAt ±jVZsiL,t CHVTjQAGLiELIiT/ LVIHPPRPPKVIiGLQA I 

6033 

A 

407 

720 

1 RHNSSNFFFFFFFFETESRSIAQACMQWCYLSSLQSLPPGFKRFSCL 

SLPSSWDDRCPPPCIAN\FVFLVETGFCHVGQAGLELLISGDPPASA 
1 SOSTGTTGMT 

6034 

A 

1 

.135 

KEAPGLKKENN\CFVCAICQINMEGKTLCSKKDRPICKSHAFFHV | 

6035 

C 

96 J 

410 ! 

I nrutusMWljl: VKUSFN1KGXXXXXXXXXXXXXXXXXXXXXXXXXXXX | 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

XXXXXXXXXT.V 

6036 

C i 

108 

422 

MPLKKLPHLFVKGTPKXXJOCXXXJCXXJCXj^ 

AAAAAAAAAAAAAAAAAAAAAXXXXXXXXXXXXXXXXXX^^ 

XXXXXXXXXLV 

6037 

A | 

2 

195 

CLPRFMP I LPASTSQVAGNTGAHHHARIiI FVFLVEVGFHHVGQAGLL 
/ ISLPQMIHPPRPPKVLGL 

6038 

A j 

251 j 

377 

QKQYFSSHAKNGG*REGVAA*APTGPLEPPLAYTVNLILiKSKA ~ | 

6039 

A 

175 

393 

ARPTFRLYLSLPVSQAGQQREAER/HPRLVPTGPTHREPSPVRYDNL 
SRHIVDLSVAPRGEESRRCQGLVVHL j 

6040 

A 

1 j 

537 

MYKYRPGYSSSSTSAAMPHSSSAKVLSTLRARLAQQREAERHPRLVP 
TGPTHREPSPVRYDNLSRHIVASLQEREKLLRQSPPLPGRPTAPYLG 
RSMSYSSQKACfPGVSETEEVAIiQPLLTPKDEVQLKTTYSKSNGQPKS 
LGS AS PGPGQPPLS S PTRGGVKKLLTTE IAS S AGAKCD " | 

6041 

A 

1 

1557 

627 1 

MPYSLNIAIRDTFVNASRTLYSSSPRVLSl^SDANLELINTWVAKOT 
NNKISRLLDSLPSDTRLVLI^AIYLSAKWKTTFDPKKTRMEPFHFKN 
SVIKVPMMNSKKYPVAHFIDQTLKAKLSRGDSLKEPTS IAESSRHPS 
YRSEPSLEPESFRSPTFGKSFHFDPLSSGSRSSSLKSAQGTGFELGQ 
LQSIRSEGTTSTSYKSLANQTRNGSLSYDSLLTPSDSPDFESVQAGP 
EPDPPLGYTSPFLSARLAQQREAERHPRLVPTGPTHREPSPVRYDNIi 
SRHIVASLQEREKLLRQSPPLPGREEEPGLGDSGIQSTPGSGHAPRT 
SS S SDDS KRSPLG \ KTPLGRPAVPRFGKPDGL \ RGRGVGFPEPGPTA 

PYLGRSMSYSSQIAQPGVSETEEVDLRPLLTPKDEVQLKTTYSKSNG 
QPKSLGSASPGPGQPPLSSPTRGGVKKVSGVGGTTYEI SEFDHHCPW 
VNNC I GRRNYRYFFLFLLSLTAH I MGVFGFGLL YVL YH I EELSGVRT 
AV 

1 6042 J . 

A | 1 |_ 


TRREAERHPRLVPTGPTHREPSPVRYPNLSRHIVASLQEREKLLRQS | 
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PPLPGREEEPGLGDSGIQSTPGSGHAPRTSSSSDDSKRSPLGKTPLG 

RPAVPRF\ gkpdglrgrgvgs pe pgptap ylgrsms ys s qkaqpgvs 

ETEEVALQPLLTPKDEVQLKTTYSKSNGQPKSLGSASPGPGQPPLSS 
PTRGGVKKVSGVGGTTYE I S V 

6043 

A 

3 

2267 

PTWLERGGGEPAARPQTPQPTAPESRGPSGASALRCRGPTARSLPAA 
SMLGAPDESSVRVAVRIRPQLAKEKIEGCHICTSVTPGEPQVFLGKD 
KAFTFD YVFD IDS QQEQ I Y I QC I EKL I EGCFEG YNATVFAYGQT\ GA 
GKTYTMGTGFDVNIVEEELGIIS\RAVKHLFK\SIEEKKHIAIKNGL 
PAPDFKVNAQFLELYNEEVLDLFDTTRDIDAKSKKSNIRIHEDSTGG 
NL YCGRLPTRTVNTE S EMMQCLKLGAL S RITAS TQMNVQS S R SHAI F 
TIHVCQTRVCPQIDADNATDNKIISESAQMNEFETLTAKFHFVDLAG 
SERLKRTGATGERAKEGIS INCGLLALGNVI SALGDKSKRATHVPYR 
DSKLTRLLQDSLGGNSQTIMIACVSPSDRDFMETIiNTLKYANRARNI 

INDMFHENAMLQTENlWIiRVRIKAMQETVDTVLRSRITQLVSDQANHV 
UU^GEGNEEISNMIHSYIKEIEDLRAKLLESEAVNENIiRK\NLTRA 
TlvPZlPVF^fi^QTF^PTTLSSDKETTEl IDLAKKVDLEKLiKRKEKRKK 
KSVAGKEDNTDTDQEKKEEKGVSERENNELEVEESQEVSDHEDEEEE 
EEEEEDDIDGGESSDESDSESDEKANYQADLANITCEIAIKRKLIDE 
LENSQKRLQTLKJCQYEEKLMMLQHKIRDTQLERDQVLQNLCSVESYS 
EEKKKK 

6044 

A 

2 

323 

EDCK5LYSIRMEGARGCLAKSLAIj*GAGRGFYSGERTWTLSLPSGNCA 
P\YQRGGWWSHA*AHSNLNGVWHHGGHYRSRYQDGVYWAEFRGGAYS 

T OVaaMT.TTJPT.ITT. 

6045 

A 

3 

902 

PFFCSTISYRLCPEGAAGQKKVLPPPPLVPWPVRLVGSTRDTSRMR 

ma B DT? DAD n^TrtP HOP DM Q PMP A fiHP RD PTTCPVfi P WOD CAEAROAQ 

HEQSGVYELRVGRHWS VWCEQQLEGGGWTVI QRRQDGS VNVFT\ S W 
QHYKAGFGRPDGEYWL\GLEPEHH\LTSRGDHELLVLKEDWGG\RGA 
RAHYDGF S LE PE SDHYRIjRLGQYHGD AGDSLS WHNDKPFSTVDRDRD 
SYSG\NCALYQRRGWWYHA\CAHSNLNGVWHHGGHYRSR 
E FRGGAYS LRKAAML I WPLKL 

6046 

A 

3 

218 

KAVSLSVTQAGVQW\LISLQSPPPRFNRFSCLSLPSSWDYRQVPPRP 
ANFC I FSRDRVS PCWPGWS PTPNLK 

6047 

A 

1 

297 

MVVSLYFPTHPDSFPVLLCLYLCPGRLILRDCNPPRLPFV\IDF\CL 
GLWGMPVmSFGGKKREDigVFLHCSHSDLGYVLLAGTASLQDFSFFQ 
KSPPP 

6048 

A 

1 

8157 

MTKHYDYIAIGGGSGGIASINRAAMYGQKCALI 

VPKKVMWHAAQIREAIHMYGPDYGFDTTINKFNVJETLIASRTAYIDR 
I HTS YENVLGKNNVDV I KGFARFVDAKTLEVNGET I TADH I L I ATGG 
RPSHPDIPGVEYGIDSDGFFALPALPERVAWGAGYIAVELAGVING 
LGAKTHLFVRKHAPLRS FDPMI SENQSRSGAVRDE IQFYATGARPDL 
AKEMGFIGGKMPTHWGPHDGDAGIRKDAAMVADMRVKCAERCDRI SL 
^lHAGKVIxASGTPQELVEKRGAASLEEAFIAYLQEAAGQSZ^raAEAPHR 
ATAAAKRWNCDAIQYVRTIiALMGTVILMIjIMGYGI SMMWKTCALKC^ 
RDFCGSDFRLIKGTPRSSDDSIKILPPILKTSNSGPKGQQSSTKGNS 
SMATDSTQCVKKSRGRPKVFDRDAALDKAMKLFWQHG YEATS LJ^DLV 
EATGAKAPTLYAEFTNKEGLFRAVLDRYIDRFAAKHEAQLFCEEKSV 
ESAlxADYFAAITVNCFTSKDTPAGCFMINNCTTLSPDSGD IANTLKSR 
HAMQERTLQQFIjCQRQARGE I P PHCD VTHLAE FLNC 1 1 QGMS I S ARE 
GASLEKLIAPEPRTPPASFRKLRTYPAATLSTGTAADEIYVWRGQTP 
FVMS I AATRQRS SHPTWFNGNTQHAPNFCRFARFAFSVERLGNRRAA 
FFDFTIRLRFRQNGVPNERFDVNVRSQLSSQAIiTTTRIPHTIRSASI 
TKPLSQPSTCRDMSAAHRQAIADITPKYMEDlxDQRWiTOYGVKFIiDKM 
AKSDKPFFLYYGTQFATFAPGCPARSSAWRLTSTISISGMELPMAPA 
FP I PRQNP W SMAP ACLLTGTSGHNGG SG AKG EDRG VFGGRP P I TE FP 
FGSTWNFRVCGLRSPVTALCRGEGGQPQGAQVQQRPGKITGENFRQG 
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NSRGGHQNAQGHGHPLVQRHTGGPRHKKGATEHPGYTSAPANPSABIi 
KIYDTSSQPLSQILSQVQQDGASIWGPLLKNNVEELLKLSKALYNI 
DWHRHAQVKCNGLQINMLMAAAVCAVI ASNLGMFVVMAE IMALCAQQ 
QRGQLWFALGRLGTLLI^IACWLLWQRYGTIiDLRLLDMRMQQLPLGS 
D I WLLGV I GFGLLAG 1 1 PLHG WVPQAHANAS APAAALFS TVHVYKRV 
ILTGDHITDSTSSQDEYNQPQWISLDSAGGNIMSNFTKDNIGKPMA 
TLFVEYKDSGKKDANGRAVLVKQEEVINIANIQSRLGNSFRITGINN 
PNEARQLSLLLRAGALIAPIQIVEERTIGPTLGMQNIRWGYGRSQPA 
ELTKLSLFCYIANYLWKGDEVRimLRGFLKPMGVIIjVIiAVLLLAQP 
DLGTVVVLFVTTLAMLFLAGAKLWQFIAI IGMGISAWIiLIIiAEPYR 
IRRVTAFWNPWEDPFGSGYQLTQSLMAFGRGELWGQGLGNSVQKLEY 
LPEAHTDF I FAI IGEELG YVGVVX.ALLMVFFVAFRAMS IGRKALEID 
HRFSGFLACSIGIWFSFQALVNVRAGAGDVPTKYNSLSCFKETRNAI 
LGRLMTPAHGHVWLDGEHIQHYASKEVARRIGLLAQNATTPGDITVQ 
ELVARGRYPHQPLFTRWRKEDEEAVTKAMQATGITHLADQSVIPFRS 
RLVKNRKAAWHSGSRDTAATLMLFVLGWLGRRRGLGCCGIWGGFSQQ 
GLPPAMQKVMYKGLVPEDKTLREIKVTSGAKIMVVGSTINDV1AVNT 
PKDAAQQDAKAEENKKEPLCRQKLHCCLFTLVAS I IVPAAFVLEDVD 
FDQMVSLEANRSSYNASFPSSFELSASSHSDDDVIIAKEGTSVSIEC 
LLTASHYEDVHWHNS KGQQLDGRS RVTAIjVPVTPAI QPNARIRRHRV 
VAERQRQIGANFLRGKLTVI AAVVHI EAALEI IVHSQGNRFVEGQYQ 
AVQHAWRIPVGPGRGSGAGSLVAYALKITDLDPIiEFDRATVRVIiNG 
DNTRQIVFVRQMQV7VHYAKCSFVRHADITNFSRALVLCQCVQRIQQR 
NRRRRIRPAITEFAEAVGRALRPVHLIKIEVNALSDGIVELCVGVGF 
AVLFILSGKSAKHRYHRKLGINRVNANCPGFYVNRILAPYINEAIRM 
LTQGERVEH IDAALVKFGFPVGP I QLLDEVG IDTGTKI I PVLEAAYG 
ERFSAPANWSSILNDDRKGRKNGRGFYIiYGQKGRKSKKQVDPAIYP 
LIGTQGQGRISAPQVAERCVMLMLNEAVRCVDEQVIRSVRDGDIGAV 
FGIGFPPFLGGPFRYIDSLGAGEWAIMQRLATQYGSRFTPCERLVE 

AEQTLEE VGDCLNLP S S AEVLPELTPCGDVGLMW I GDKKQLLTPLRD 
QFTRAPLDND I GV S EATR ID PN AWVERWKAAGHYQAEAALIjQCTADT 
IJVDAVLITTAHAWQHQGKTLFISRKTYRIDALAVIMFSFGGLELIGI 
TAAEARJDPEKS I PKAVNQWYR I LLF Y IGSLWLLAL YP WVEVKSNS 
SPFVMlFHNLDSNVVASAIiNFVILVASLSVYNSGVYSNSRMLFGLSV 
QGNAPKFLTRVSRRGVPINSLMLSGAITSLWLINYLLPQKAFGLLM 
ALVVATLLLl^IMICLAHLRFRAAMRRQGRETQFKALLYPFGNYLCI 
AFLGMILLLMCTMDDMRLSAILLPVWIVFLFMAFKTLRRK 

6049 

A 

1126 

1355 

PQDEAPLVLLSKDVWQGSAEATQVQEACGAGSEPHSHRNPGISAQV 
GLAPSYGAARGRRRPLALQQSPQERRHVG 

6050 

A 

1 

1627 

MGSRCLNPPPPAHSDTTGKDSFGNIRGAETGQGASACSVTSARVTCG 
AGSEPHSHRNPGISAQVGLAPSYGAARGRRRPLALQQSPQERRHVGW 
NSTRGLLPASLPGTASSQSASATASAALPLKVTGPLARNPTPPWTAA 
AAIaATRGORPEKGLFPGPAPFSLGKRKRGRGRTWERRRRVSIETSTC 
FRPGCERLGAAAGANX.SQLASSQRPLRERWVLYTI IMAAAGAPDGMB 
EPGMDTEAE WATEAPARPVNCLEAEATAGAGAGTVAED s gtargs l 
QP A\ P APAQ APGDPVSQAS VSNGEDSGGS ADSELVDLRI I WNKTKHD 
VKFPLDSTGSELKQKIHSITGLPPAMQKVMYKGLVPEDKTLREIKVT 
SGAKIIWVGSTINDVIiAVNTPKDAAQQDAKAEENKKEPIiCRQKQHRK 
VLDKGKPEDVMPSVKGAQERLPTVPLSGMYNKSGGKVRIiTFKLEQDQ 
LWI GTKERTEKLPMGS I K\NW\ SDP I \ EGHED YHNDGRFQLAPTEA 
\ S YYWVYWVPTQYVDAI K\DTVLGKWQYF 

6051 

C 

47 

298 

MITSTSQTKGSSSPRISHSCPSSTAYVGRWRGPCGPGAQLPPNIGQP 
CPYPGPPLPSSYVYNNDQSVWXXXXXXXXXXXXXXXX 

6052 

B 

64 

362 

XCWHRRKVAWLAXLKIiFRHYYV>1VICYVYFTRIIAIIjIjQV 
WQWLYQPTGl^PYLQLPQEDEEDVQMEQVMTDSGFREGLSKVNKTAS 
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GRELL* 

6053 

A 

1 

342 

AAGIFWVS I LCRNTYS VFKI HWLMAALAFTKS I SLLFHSVRALDWGA 
AGGGLKGPPGPSPKSLQTLGLWEGHPPAIHVSTHFTSVASHPSPLST 
SPAGQLCPPLSLGSAVTVRIi 

6054 

A 

1 

485 

FPVVWSIRHLQDASGTDGKVAVNIiAKL 

AILLQVAVPFQWQWLYQLLVEGSTLAFFVLTGYKFQPTGNNPYLQLP 
QEDEEDVQMEQWCGEVEGVHVDQAPSS PGPRFPDKPI WKKS PFLPP 
NIGQPCPYPGTTPPFQLCVQ 

6055 

A 

696 

3195 

SGRRTPMASCRQRRCPFSSSTWSCPPASWPLASSGCPSSAGTRTASF 
KIHWLMAALAFTKSISLLFHSVRALDRGAAGAPITSFESHPPSSLFN 
FLTTHLIWTV^VPNPPPPPSASHPSVRLTVRLSIHPQINYYFINSQG 
HPIEGLAVMYYIAHLLKGALLFITIALIGSGWAFIKYVLSDKEKKVF 
GIVIPMQVXANVAYIIIESREEGASDYVLWKEILFLVDLICCGAILF 
PWWS I RHLHGCVWHRRRMVAVNLAKLE \ LFRHYYVMVGARLTHSGF 
GGITGLLHAHPSPSAAAPSRGPHPGSPAPILQSSPMPARAPTVLAPR 
LLAPLQVICYVYFTRIIAILLQVAVPFQWQWLYQLLVEGSTLAFFVIj 
TGYKFQPTGNNPYLQLPQEDEEDVQMEQVEAHTLKLLVKDIKSTVSN 
WSEMKENMDKELKQIMRMIYERRICVADPFEVTVMQDFFIDLRLPY 
SVVRNEQV^IRAVLYNYRQNQELKVRVELLHNPAFCSIATTKRRHQQ 
TVTIPPKSSLSVPYVIVPLRPACRKWKSRLLSTIISSVTVSGKGVQK 
EDIPPADLSDQVPDTESETRILLQGTPVAQMTEDAVDAERLKHLIVT 
PSGCGEQl^IGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGY 
TQQLAFRQPSSAFAAFVKRAPSTWLTAYV\nCVFSIAVNLIAIDSQVL 
CGAVKWL I LEKQKPDG VFQED APVT HQEM I GGLRNNNE KDMALTAFV 
LISLQEAKDICEEQVNIHNRYAATRQLMVAKISGKPSKIwnDKSKTAGIi 
AD CHS VXiDVSNKGVIjRD VASTS SGI VDWRGSQIQT 

DUDO 

A 

1 

516 

IiRQPTRPVFGIVIPMQVLANVAYI I IESREEGASDYVLWKEILFLVD 
LICCGAILFPLVWSIRHLKDASGTDGKVPVNLPKLKLFRHYYV\IPA 
PTPCAPLQVI CYVYFTRI I ANP \LQVAVPFQWQWLYQLLVEGS \TLA 
FFVliTGYKFQPTGNNPYLAVAPGGRGGCSRWSK 

6057 

A 

3 

528 

LSCRVQVSVAPPRPRPAIiPVSAVPGTPGTLTPALLHP/WLADCDVSD 
S S CS SLAATLLANHSLR ELDLSNNCLGDAG I LQLVES VRQPGCLLEQ 
LVYVQRAGRWRPGLPREEGGGGGGHWAHASLLACVQPVRQFTGLRR 
WRTGLQGPGERTSTSLRGHLLRLFLLLAAALDDRP 

cnco 

A 

A 

X 

"i "5 A 

334 

EKINIXZAEVTCQELAASDRERAQSRIQELEEKVASLiLHQVSWRQDSRE 
PDAGRIPmGSKTAKYLAADALELMVPGGWRPGTGSQQLEPPAEGGHP 
GAAQRGQGDLQCPHCLH 

6059 

A 

225 

843 

WTRCWSRLRASGSSCEGRRAAPPRPRCARYEGGYQRSCGSESACGGR 
RVLGSSLRQHQGSCRNTRAGMFCGRQYIPHPSAASPRRPLGRSRNTQ 
MGTPLSRVLLRSCRKKID/ SLKQKVTHLRGLQIPHEPEIiMRKEISRL 
NKTVGRENK* LSRSEAGAGGLQ\ EARDAALERVQMLEQQVSVPRAPD 

/~1 T*T "N/^v O A C* T5 TVTT.1 n \T f~* MT\T> T 

\j 1 iJyMoACjKWWPKS WDRI/G 

6060 

A 

300 

957 

GSTK^HAETHALGCSVADSTSRTHQQLHRGAHWGEVGTHRW\HTSVQ 
SVIEKLQEENRLLKQK\miVEDI^AKWQRYNASRDEYVRGLHAQLRG 
JjQ I PHEPELMRKE I S RLNRQLEEKI ND CAEVKQELAASRTARDAALE 
rev vjnjjj&yyj. J^JJJcnbJiKAIJKiiKAQSRIQELEEKVAS 
QDSREPDAGRIHAGSKTAKYLAADALELMVPG 

6061 

A 

2 

271 

r r iiici>K^ VAXKLhLbbAIbAY C-rL»CL»PGSSDSIASASR\VAGITGE 
HHHARLILVFLVEMGFHHVGQAGLGLLDLVIHPPRPPKVLGLQA 

6062 

A 

3 

376 

DYSLGLPVVTTGCWGHMSNAPLRWALNPSYWGKNQLLCPESIiALCCG 
LCCKPRFLEVTSVRSADLYLSSPCLVRRCPLFRGKEKPAVGMRMPDN 
KGWRRALS S SLS PR \ VP \ APVAAS VLTCVMN 

6063 

A 

2 

70 

WSRSPDLMI RL P R P P KVLGLQA | 

6064 

A 

3 

258 

ADPPSPSFCFCFFFFETESCSVT\RIiECSGAILAHCNLC/LPTCHHA 
WV I F VFLVETEFHHVGEDGLNIiL/NFM I YLPRPPKVLGLQA 

6065 

A 

1 

408 

G FDE W FG S PNCHFGP YDNKAR PN I P VYRD WEMVGR YYEE F P I NLKTG 
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EANLTQIYLQEAXiDFIKRQARHHPFFLYWAIDATHAPIYASKPFLiGT 
SQRGRYGDAVREIDDFQFRLSFVHKEFPWTLIMREACSFLSL 

6066 

A 

3 

25 

KKCLWSH 

6067 

A 

3 

505 

RSEEQPQAGFVPGQDRLRKETRTAGLGARASGHRGVACRAAVGRRMR 
LGATRGHSPYSVIjEGARLTKNFVKVVSVAAQNFRVSPRGVPEVSWVP 
VSRHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWB 
MVGRYYEEFPINTLKTGEANLTQIYLQ 

6068 

A 

. 1 

1740 

MAAWAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLG 
VYGEPSRETPNLDRMAAEGLLFPNFYS7VNPLCSPSRAALLTGRLPIR 
NGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKW 
HLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVG 
RYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDAT 
HAPVYAS KPFLGTS QRGR YGDAVRE IDDS IGKI LELLQDLHVADNTF 
VFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGH 
VTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGR 
LMDR P I F Y YRGDTL \ MAATL \ GQHKAH F W \ TWTNS WENFRQG I DFCP 
GQNVSGVTTHNLEDHTKLPLIFHLGWDPGERFPLSFASAEYQEALSR 
ITSWQQHQEALVPAQPQLNVCNWAVMNWAPPGCEKIjGKCL\TPPES 
I PKKCL WVPLAPAQTQARPR I S S WPCKCLEEGWLWPR S S PNPAPSQT 
DSTCRRRGTCTI SNLPRT 

6069 

A 

3 

2945 

AALKGGTPPPGALAARAPREPVAAMPPRAPPAPGPRPPPRAAAATDT 
AAGAGGAGGAGGAGGPGFRPLAPRPWRWLLLLALPAACSAPPPRPVY 
TNHWAVQVIX3GPAEADRVAAAHG YLNLGQ IGNLED YYHF YHS KTFKR 
STLSSRGPHTFLRMDPQVKWX.QQQEVKRRVKRQVRSDPQALYFNDPI 
WSNMWYLHCGDKNSRCRSEMNVQAAWKRGYTGKNVVVTILDDGIERN 
HPDLAPNYDSYASYDVNGNDYDPSPRYDASNENKHGTRCAGEVAASA 
NNSYCIVGIAYNAKIGGIRMLDGDVTDWEAKSLGIRPKYIDIYSAS 
WG PDDDGKTVDG PGRLAKQAFE YG I KKGRQGLGS I FVWASGNGGREG 
DYCSCDGYTNSIYTISVSSATENGYKPWYLEECASTLATTYSSGAFY 
ERKIVTTDLRQRCTDGHTGTSVSAPMVAGIIALALEANSQLTWRDVQ 
HLLVKTSRPAHLKASDWKVNGAGHKVSHFYGFGLVDAEAX.VVEAKKW 
TAVPS QHMCVAASDKRPRS I PLVQVLRTTALTSACAEHSDQRWYLE 
HVWRTSISHPRRGDLQIYLVSPSGTKSQLLAKRLLDLSNEGFTNWE 
FMTVHCWGEKAEGQWTLEIQDLPSQVRNPEKQGKLKEWSIilLYGTAE 
HPYHTFSAHQSRSRMLELSAPELEPPKAALSPSQVEVPEDEEDYTGV 
CHPECGDKGCDGPNADQCLNCVHFSLGSVKTSRKCVSVCPLGYFGDT 
AARRCRRCHKGCETCSSRAATQCLSCRRGFYHHQEMNTCVTLCPAGF 
YADESQKNCLKCHPSCKXCVDEPEKCTVCKEGFSLARGSCIPDCEPG 
TYFDSELI RCGECHHTCGTCVG PGREEC I HCAKNFHFHD WKCVPACG 
EGFYPEEMPGLPHKVCRRCDENCLSCAGSSRNCSRCKTGFTQLGTSC 
I TNHTCSNADETFCEMVKSNRLCERKLF I QFCCRTCLLAG 

6070 

A 

2 

3078 

GNAPRPPPPPRLPPAAVLAAAVAAAAAX.PRRGAALKGGTPPPGALAA 
RAPREP VAAMPPRAP PAPGPRP P PRAAAATDTAAGAGGAGGAGGAGG 
PGFRPLAPRP WRWLLLLALPAAC S AP PPR PVYTNHWAVQVLGGPAEA 
DRVAAAHGYIjl^GQIGNIiEDYYHFYHSKTFKRSTLSSRGPHTFLRMD 
PQVKWLQQQEVKRRVKRQVRSDPQALYFNDPIWSNMWYLHCGDKNSR 
CRS EMNVQAAWKRGYTGKNVWT I LDDG I ERNHPDLAPNYDS YASYD 
VNGNDYDPSPRYDASNENKHGTRCAGEVAASANNSYCIVGIAYNAKI 
GG IRMLDGDVTD WEAKSLG IRPNY I D I YS AS WG PDDDGKTVDG PGR 
LAKQAFEYGI KKGRQGLGS I FVWASGNGGREGDYCS CDG YTNS I YTI 
SVSSATENGYKPWYLEECASTLATTYSSGAFYERKIVTTDLRQRCTD 
GHTGTS VS APMVAG I ^ ALALE ANS QLTWRD VQHLLVKTS R PAHLKAS 
DWKVNGAGHKVSHFYGFGLVDAE^VVEAKKWTAVPSQHMCVAASDK 
RPRS I PLVQVLRTTALTSACAEHSDQRWYLEHVWRTS I SHPRRGD 
LQIYLVSPSGTKSQLLAKRLLDLSNEGFTNWEFMTVHCWGEKAEGQW 
TLEIQDLPSQVRNPEKQGKLKEWSLILYGTAEHPYHTFSAHQSRSRM 
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LELSAPELEPPKAALSPSQVEVPEDEEDYTAQSTPGSANILQTSVCH 
PECGDKGCDGPNADQCLNCVHFSLGSVKTSRKCVSVCPLGYFGDTAA 
RRCRRCHKGCETCSSRAATQCLSCRRGFYHHQEMNTCVTLCPAGFYA 
DESQKNCLKCHPSCKKCVDEPEKCTVCKEGFSLARGSCIPDCEPGTY 
FDSELI RCGECHHTCGTCVGPGREE C I HCAKNFHFHDWKCVPACGEG 
FYPEEMPGLPHKVCRRCDENCLSCAGSSRNCSRCKTGFTQLGTSCIT 
NHTCSNADETFCEMVKSNRLCERKL F I QFCCRTCLLAG 

6071 

A 

3 

3204 

AGRGRLWGGTASREPQKSAGKFPRWGAFRASRTALGPGARSQERGPG 
GGNAPRPPPPPRLPPAAVLAAAVAAAAALPRRGAALKGGTPPPGALA 
ARAPRE P VAAMP PRAP P APGPR P P PRAAAATDTAAGAGGAGGAGGAG 
GPGFRPLAPRPWRWLLLLALPAACSAPPPRPVYTNHWAVQVI.GGPAE 
ADRVAAAHG YLNLGQ I GNLED YYH F YH S KTFKRS TLS S RGPHTFLRM 
DPQVKWLQQQEVKRRVKRQVRSDPQAIiYFNDPIWSNMWYLHCGDKNS 
RCRS EMNVQAAWKRG YTGKNVVVT I LDDG I ERNHPDLAPNYDS YAS Y 
DVNGNDYDPSPRYDASNENKHGTRCAGEVAASANNSYCIVGIAYNAK 
IGGIRMLDGDVTDVVEAJCSLGIRPNYIDIYSASWGPDDDGKTVDGPG 
RLAKQAFEYGIKKGRQGLGS IFVWASGNGGREGDYCSCDGYTNS I YT 
ISVSSATENGYKPWYLEECASTLATTYSSG\AFYERKIVTTDLRQRC 
TDGHTGTSVSAP^AGIIAIJ^EANSQLTWRDVQHLLVKTSRPAHLK 
ASDWKVNGAGH\KFSHFYGFGSVDAKSLGEEAKKWTAGPSQHMWVAA 
SDKRPRS I PLVQVLRTTALTS ACAEHSDQRWYLEHVAVRTS I SHPR 
RGDLQIYLVSPSGTKSQLLAKRLLDLSNEGIiTNGEFMTAHCWGEKAE 

r-, ^\nrnT T>T r\T\T T% C fW TT3 XT Ve^if* ITT VTC WOT TT VY"* r P7\ T?U DVUTTTO IMJflCT? 

GQWTLE I QDLiPS QVRN FE K^Cj KJ_i JvJb WbLiJ-ijiUl AJirtFilllr oArlUoK 
SRMLELSAPELEPPKAALSPSQVEVPEDEEDYTGVCHPECGDKGCDG 
PNADQCLNCVHFSLGSVKTSRKCVSVCPLGYFGDTAARRCRRCHKGC 
ETCSSRAATQCLSCRRGFYHHQEMNTCVTLCPAGFYADESQKNCLKC 
HPSCKKCVDEPEKCTVCKEGFSLARGSCIPDCEPGTYFDSELIRCGE 
CHHTCGTCVGPGREECIHCAKNFHFHDWKCVPACGEGFYPEEMPGLP 
HKVCRRYGPPGGERQATVSSKGVPGGQSLSASSPGAGEGMLHHPTVD 
RSPFTELLRGLRPFVHWMHICWVPAVGRHRAAAG 

6072 

A 

3 

3002 

AALKGGTPPPGALAARAPREP VAAMP PRAP PAPG PR PPPRAAAATDT 
AAGAGGAGGAGGAGGPGFRPLAPRPWRWLLLLALPAACSAPPPRPVY 
TNHWAVQVLGGPAEADRVAAAHGYLNLGQIGNLEDYYHFYHSKTFKR 
STLSSRGPHTFLRMDPQVKWLQQQEVKRRVXRQVRSDPQALYFNDPI 
WSNMWYI^CGDKNSRCRSEMNVQAAWKRGYTGKNVVVTILDDGIERN 
HPDLAPNYDSYASYDVNGNDYDPSPRYDASNENKHGTRCAGEVAASA 
NNSYCIVGIAYNAKIGGIRMLDGDVTDWEAKSLGIRPNYIDIYSAS 
WGPDDDGKTVDGPGRLAKQAFEYG I KKGRQGLGS I FVWASGNGGREG 
DYCSCDGYTNSIYTISVSSATENGYKPWYLEECASTLATTYSSGAFY 
ERKIVTTDLRQRCTDGHTGTSVSAPMVAGIIALAIiEANSQLTWRDVQ 
HLLVKTSRPAHLKASDWKVWGAGHKVSHFYGFGLVDAEA1>VVEAKKW 
TAVPSQHMCVAASDKRPRS I PLVQVLRTTALTS ACAEHSDQRWYLE 
HVWRTS I SH PRRGDLQ I YLVS P SGTKSQLLAKRLLDL SNEGFTNWE 
FMTVHCWGEKAEGQWTIiEIQDLPSQVRNPEKQGKLKEWSLILYGTAE 

Tin vtimPC JV unCD CDMT UT C7iT5T?T UDDV7VM OTDC?r\17~C\7 , DT?T*i'CT?T^V r riVO 

HFYHTr SAHUoKoKMljliLioAiriiJbJbF 1«W 
STPGSANILQTSVCHPECGDKGCDGPNADQCLNCVHFSLGSVKTSRK 
CVSVCPLGYFGDTAARRCRRCHKGCETCSSRAATQCLSCRRGFYHHQ 
EMNTCVTLCPAGFYADESQKNCLKCHPSCKKCVDEPEKCTVCKEGFS 
LARGSCIPDCEPGTYFDSELIRCGECHHTCGTCVGPGREECIHCAKN 
FHFHDWKCVPACGEGFYPEEMPGLPHKVCRRYGPPGGERQATVSSKG 
VPGGQSLSASSPGAGEGMLHHPTVDRSPFTELLRGLRPFVHWMHICW 
VPAVGRHRAAAG 

6073 

A 

1 

1181 

MEVSRAGLSTVLEAVLCPADLTAASLDAASPMPSSSHLPVLGYPVGK 
SGARPGSTQEVVLTAAAVTQEVVLTAAAVTQEVVLTAAAVTQEVVLT 
AAAVTQE VVLTAAA VTQE VVLTAAAVTQEVVLTAAAVTQ E WLTAAA 
VTQEVVLTAAAVTQEVVLTTAGAENKPVS APHS SS TCARTGS PGRPL 
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HTQSPPRGHILFGKEKVCYKFKHDILKHLLTQLTVDGIVSIPLVQVL 
RTTALTS ACAEHSDQRWYLEHVWRTS I SHPRRGDLQI YLVSPSGT 
KSQLIJUCRLLDLSNEGFTl^EFMTVHCWGEKAEGQWTLEIQDLPSQV 
RNPEKQG/DVFCTKCHSHQSFEESQDTKSVVAWEGWGCTHGVHCEVP 
KNLG I RSGKHHPLKI KLV 

6074 

A 

3 

656 

RSAFELWTQEGGLPSPSGW/VIIQSLKNLNRTDSRGRRSSPLISCLA 
TGAGTSRHLLPLDWMSTIGCPGSQASRPELSLNHTSGFPGFPAFHPR 
WWAFSGPQSHINQIPHPKSPLCVGS\LLQRAYTNTLPLFPLQFLRSP 
TKESTYSQLHPSGSPPGGGPGPRQLGLTSPKASEQPLPGFDWEGQGS 
DRRAGADGSGRESSGTPGTAVLQRSSCGTSC 

6075 

A 

3 

n *7 

AAEGLPPVR I PPFSETTANGTI S FTEMVQDMGAGLAWPLMGLLES I 
AVAKAFASHNNYRIDANQELLAIGLTNMLGSLVSSYPVTGSFGRTAV 
n Ay jd^j vl x fa<jCjJLi V 1 u V1jVLL*SLDYLTSLF\ YYI PKSALG\ AVI IMG 
RGPAVSTPRSFRNALGFLKRLDLLPLCVTFLLCFWEEQDGILAGALG 
SLLMLLHSAARPETKGSEGPVLVLQPASGLSFPAMEALREEILSRAL 
E VS PPRCLVLECTHVCS I D YTWLGLGELLQD FQKQGVALAFEGLQV 
PRLRVLLSR 

6076 

A 

3 

256 

HASGMKNDLVYTGVDPETLPNRNGDGV/Pt^LSPDPQFRTRKIQASL 
LTLEETMKQLEELEEEFRRLKPLLANLGGNSVPHPGCT 

6077 

A 

1 

1966 

MAEPLLRKTFSRLRGREKLPRKKSDAKERGHPAQRPEPSPPEPEPQA 
PEGSQAGAEGPSSPEASRSPARGAYLQSLEPSSRRWVLGGAKPAEDT 
SLGPGVPGTGEPAGEIWYNPIPEEDPRPPAPEPPGPQPGSAESEGLA 
PQGAAPASPPT/NSLPHQVPGPRQAPLHKDEEAAGTAAPPEPARPPG 
WQGAREGCPCGLRHQPLPPGQQRGGPRAGSRAWGHPEPEGRLPQRRG 
LTGAPSWAPITHLLPALRGGSRSPGTPGRTLGPPQPAPVRSRGAAAS 
AGGHPQGPLLPTASGWGGQGPNRATARGAGLPAAGPHLPPGAGGRQA 
PARPGACVGPWREKAPAL,CPGHRAAAHGI*PR\CQAQQLAVRLEPQGIj 
LYAKLTLSEQQEAPAT7VEPRVFGLPLPLLVERER PPGQVPLI IQKCV 
GQIERRGLRWGLYRLCGSAAVKKELRDAFERDSAAVCLSEDLYPDI 
NVITAVPSIAIiEASGALTRGDPVFTGILKDYLRELPTPLITQPLYKV 
vjjtATiAKUFt'iNiKviyFi lliCj XKGLLSCLPDVERATLTIjLIjDHLRXiVSS 
FHAYNRMTPQNLAVCFGPVLLPARQAPT\RPRARSSGPGLASAVDFK 
HH I EVLHYLLQS W P \ I PAC PDNIjQM SRLTCD PND S KLCTCRWQTP KW 

6078 

A 

2 

82 

WCDLGSLQPPPPRFKQFSCLSLPRHS 

6079 

A 

1 

347 

FFFFFFFFLETASHSVAQAGGQGHDLSSLQPPPPRFKQFSCLSIiPSS 
WDHRHKPPHPP\NFVFLVETGFHHVGQAGLELLTSSDPPASASQMAG 

6080 

A 

1 

779 

FNFVGRILGPRGLTAKQLEAETGCKIMVRGKGSMRDKKKEEQNRGKP 
NWEHLNEDLHVL I TVED AQNRAE I KLKRA VEEVKKIiL VP AAEGED S L 
KKMQLMELAI LNGTYRDANI KS PALAFSLAATAQAAPRI ITGPAPVL 
PPAALRTPTPAGPTI MPLI RQ IQTAVMPNGTPHPTAAI VPPGPEAGI» 
IYTPYEYPYTLAPATSILEYPIEPSGVLGAVATKVRRHDMRVHPYQR 

X v j. /^tvJvolCrlrty Xj X iUJjX>Xo.fc>ljr X%2 

6081 

A 

1 

214 

HPTAAIVPPGPEAGLIYTPYEYPYTLAPATSILEYPIEPSGVL\VWL 
SQRKAKNSRTVLTEPSSDLNLTNA 

6082 

A 

3 

435 

x-'iofoDC 1 rKA x IjESCRSMIALMDTVGSGKLNLQEFHHLWNKIKGW 
Q\KIFK\HYDTDQSGHHQTATKMRNAV/i^ 

RYADKHMNIDFDSFICCFVRLEGMFRAFHAFDKDGDGIIKljlTVLEWb 
QLTMYA 

6083 

A 

2 

339 

GKPTLREFTHL/WDKIKGWREIFNNITPSQSGTITGYEMRNA/VNDA 
GFra^PLYDIl\TSRTADNHMNIDFESF\lCCFVR\LEGMFRAFTl^ 
FD\KDGDGI IRLNVLEWLQLTMYA 

6084 

A 

2 

775 

PSASDQSTWYLDESTLxxONIKKTLHKFCGPSPVVFSDVNSMYX»SSTE 
PPAAAEWACLLRPLRGREPEGVWNLLSIVREMFKRRDSNAAPLLEIL 
TDQCLTYEQITGWWYSVRTSASHSSASGHTGRSNGQSEVAAHACASM 
CDEMVTLWRl^VLDPALSPQRRRELCTQLRQWQLKVIENVKRGQHKK 
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TLERLFPGFRPAVEACYF1WEEAYPLPGVTYSGTDRKLALCWARALP 
SRPGASRSGGLEESRDRPRPLPTEPAVRPKEPGTKRKGIiGEGVPSSQ 
RGPRRLSAEGGDKALHKMGPGGGKAKAIiGGAGSGSKGSAGGGSKRRIi 
SSEDSSLEPDLAEMSLDDSSIiALGAEASTFGGFPESPPPCPLHGGSR 
GPSTFLPEPPDTYEEDGGVYFSEGPEPPTASVGPPGIiliPGDVCTQDD 
LPSTDESGNGLPKTKEAAPAVGEEDDDYQAYYLNAQDGAGGBEEKAE 
GGAGEEHDLFAGLKPLEQESRMEVLFACAEALHAHGYSSEASRLTVE 
LAQDLLANPPDLKGKiO^KVSTSRQTWVATNTLSKAAFLLTVLSERPE 
HHNLAFRVGMFAIiELQRPPASTKALEVKLAYQESEVAAl.LKKIPLGP 
SEMSTMRCRAEELREGTLCDYRPVLPLMIiASFIFDVI,CAPGSRPPSR 
NVTNSETPGDEELGFEAAVAALGMKTTVSEAEHPLLCEGTRREKGDIA 
IALMITYKDDQAKLKKILDKLLDRESQTHKPQTLSSFYSSSRPTTAS 
QRSPSKHGGPSAPGALQPLTSGSAGPAQPGSVAGAGPGPTEGFTEKN 
VPESSPHSPCEGLPSEAALTPRPEGKVPSRLALGSRGGYNGRGWGSP 
GRPKKKHTGMASIDSSAPETTSDSSPTLSRRPLRGGWAPTSWGRGQD 
SDSISSSSSDSLGSSSSSGSRRASASGGARAKTVEVGRYKGRRPESH 
APHVPNQPSEAAAHF YFELAKTVL I KAGGNS S TS I FTHPSSSGGHQG 
PHRNLHLCAFE IGLYALGLHNFVS PNWL S RTYS SHVS W I TGQAMEIG 
SAALTILVECWDGHLTPPEVASIADRASRARDSNMVRAAAELALSCL 
PHAHALNPNE IQRALVQCKEQDNLMIiEKACMAVEEAAKGGGVYPEVL 
FEVAHQWFWLYEQTA\GGSSAAREGA\TSCSASGIRAGGEAGRGMPE 
GRGGPGTEPVTVAAAAV\ TAAATWPVI S VGSSLYPGPGDGHGHSPG 
LHPYTALQPHLPCS PQYLTHPAS PLPTPW PHMPRP AVFPV\ P S SAYP 
QGVHPAFVWGPGGIPGPAPTPSY 

6085 

A 

193 

388 

NRNNPFYNSGSSNFLENSIWSHQLSLLWKLNSLSLSLSLSCFTAKLG 
DTKELEAF I ADLDKTLAK 

6086 

A 

2 

353 

AAAEDLSDALCE \ FDAVLADFASPFHERHFHYEEHLERMKRRSSASV 
SDS SGFSDSES \ ADSL\ YRNSFS \ FSDEKLNS PTDSTPAX»LS ATVTP 
QKAKLGDTKELEAFIADLDKTLASM 

6087 

A 

315 

413 

TFERKWGSKNLHSCNGWSWKGVLLEISQQFSE 

6088 

A 

2 

284 

ASSLHSVSSKSFRDFLLEEKKSVTSHSSGDHVKKVSFKGIENSQAPK 
IVSPWLSSSVTAPSMVAPVTFASIVEEELQQEAALIRSREKPLALIQ 

6089 

A 

3 

189 

PSKAPKPVNAWASSLHSVSSKSFRDFLLEEKKSVTSHSSGDHVKKVS 
FKGIENSQAPKIVR 

6090 

A 

1931 

3137 

KMEISSLKEEINl^PNHSETMFKKAKTKAKKKPRKRSDSSGGYNLSD 
IIQSPSSTGLLKSGKTNSVESIiPEr.LTSDSEGSYAGVGSPRDLQSPD 
FTTGFHSDKI EAKVKPYVNGTS PVYS REDLKPWEKS P ILKI S APQPI 
PSNRIDTTSSASWVAGSFSPVSPPWDLRTIMEIEESRQKCGATPKS 
HLGKTVSHGVKLSQKQRKMIALTTK\DNNSGLNSMQTVICIPSDAP* 
TVNE*GSSLHSVSSKSFRDFIjLEEKKSVTSHSSGDHVKKVSFKGIEN 
SQAPKI VRCSTHGTPGPEGNHI SDLPLLDS PNPWLS S S VTTP SMVAP 
VTFASIVGRGTSTKEAALIRSREKPLAL\IQIEEHAIQDL>LVFYEAF 
GNPEEFVIVERTPQGPLAVPMWNKHGC 

6091 

A 

336 

410 

LLPQFQSLFLVYSE IQLLPGLVFC 

6092 

A 

40 

921 

TEVPGVHS EJTOKNQKKKIRKLD PKKKQVDTVQI I QGS KKG S S F L I F F 
FSPLI ILTVELQVGNWAQELS IQLELFWRTRASQSVTEPTAGLCLIi 
NPGEQEMRALRGRRKPLLL I P \ S KQDLEWTS S KLQQTC S * GF * \ R * K 
EN*QTERTSTPKPHLVGSSGQGNQAGEGNKGYPVRKRGSQIVPVCR* 
HDCISRKPHCLNPK/YSLS* * ATAAKSQDTK/ FNVQKSQAFLYTNNK 
QTDSQIMS ELPFTI AS KR I KYLSNPTYKGCEGPLQGKLQT I AQRNKT 
EPNKWKNI PVMGEE PM 

6093 

A 

1 

1960 

MPPHS EQPHS PS TRQKRKVP ti FVRQC P S FV I QAP KAKMDT FDGRKKQ 
FTRFLFLFSMDQLGQGRFSMQTLSFAL»AVYRKDSPLVACQVQA1»GNL 
EPSSVEAHVSSHGIAIDRKGWHVSLFSIQPGDCFPKALVEDSPRDRA 
RRQQSSKEEGWCRDRRRLKFHSKKGAAAIWKSKKYESPSFGVCFES 
LIiNPPRLTSRREKTISSSKRCRQCHAEETTWFWAKESQTGEQTGRG 
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AGQRRMGMIICKACSMLPAERQDLRPTSLVISHGVSVKDRREHCQCP 
NYYPPLRNMSPLEHSIKICGGKKPQNDSQRITTITKVIVNQDFKQPD 
DTELSQGSRKETSEASLVTLLWELVTAGQTPPLVTPRQTGSGVDLQQ 
TPTDLQLRVLTIRRKTNKQKGHPPKPHIiYVTIIKDQRTLHPKSTEYT 
FFSAPHPTYSKIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRI 
KKLTQNR S TTWKLNNLLLND YWVYKEMKAE I KM FFETNENKDTT YQN 
LWDTFKAMYSGKFMALNAHKRKQQRSKIDTLISQLTELEKQEQTHSK 
ASRRQETTKIRAELKEIETQKTLQKINESRSWFFEKINKIDRTLARL 
IKKKREKNQIDTIK\KIKVISPPIPQKYKLLFILFGTGTQIN 

6094 

A 

275 

1839 

CPSVGFSLQAGWFGCKLLTAKDF*QQTQGFSQ/RLSQLHTHGHRAL 
QKKAISSKNEFTIMGQISEATSALFGEKESKEATLRGEGARNRDDRV 
DSNCRLQPPGQDGDEGQKVPVKYSAGDTQANRVWSGLOQTPTDLQLR 
VLTVRRKTNKQKGH PHQNP I CTS AS S KTKE I QTT I RE YHKNL Y ANKL 
ENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSKIEAIINSLPTKKS 
PG PDAFTAE FYQRYKEELR I KYLG I QLTRDVKDLFKENYKPLLNE I K 
GHKQMEEHSMLMDRKNQYCENGHTAQELEKTALKFIWNQKRAHI7UCT 
ILSQKNKAGGIILPDFKLYYKATVTKTAWYRYQNRDIDQW1JRTEPPE 
1 1 PHI YNHLIFDKPDKNEKWGKDPLFNKWYT^ENWIiAICRKLKLDSFL 
TPYTKINSRWIKDLNGRPKTIKTLEENLGNTIQDIGMGKDFMTKTPK 
AMATKAKVGKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYPSDK 
GLISRI 

6095 

A 

78 

1986 

EKTKSASDWCT*K*RG\DGTKLENTLQDIIQDNFPNLGRQANIQIQE 
IQIMPQRYSSRRATPRHIIVRFTKVKMKEKILRAAREKEIQTTISEY 
YKHLYANKLENLEEMGKLLDTYTLPRQNQEEVESLNRPIAGSEIEAI 
INSLPTKKSPGPDGFTAKFYQRHKEELVPFLLKQFQSMEKEVILPNS 
FYEAS I ILIPKPGRDTTEKENFTPISLMI IDAKILNKILANRIQQHI 
KKL I HQDQVGF I SGMQGWFNEHKS VNVIHH INRTKDRNHM 1 1 SVDAE 
KAFNKIQQPFMLKTLNKLGIQLTRDVKDPFKENYKPLLNEIKEDTNK 
WKNIPYSWIGRINIVEIAILPKVSLERERQHDQTHVQSQLEKLDLLE 
QEYNKLTTMQALAEK3CMQELEAKLHEEEQERKRMQAK7\AEHPRLCLY 
IJ\AIFSALKmQDKIRRLELERIQAEESVKTLSRETIEYKKVIiDEQI 
QERENSKNEESKHNQELTSQLLAAENKCNLiLEKQLEYMRNMIKHAEM 
ERTSVLEKQTPLLIPRQRGSRVDLQQTPTDLQLRVLTVRRKTNKQKG 
HPHQNPICMSPSSKTKEERSSSPTTERSWTENDFDELREEGFRRSNY 
SELKEELQTHGKEVKNLEKKIRRMAN 

6096 

A 

870 

1860 

TRKSRRNG*IPGHIHPPKTK\QEEAESLNRPITGSEIEAIINSLPTK 
KS PGTDGFTAKFYQR YKEELVPFLLKLFQS IEKEEILPNS FYEAS 1 1 
LIPKPDRDTTKXENFRPISPMNIDAKTLNKTLANRIQQHIKKLIHHD 
QVGFI/HCNARLVQHMQINKRNPAYK*NQRQKPHDYLNRCRKGL*QN 
STALHAKTSQ*ISVGSSGQGNQAGEGNKGYSIRKRGSQ\MSLFAEDV 
IVYLENPIVSDQNLLKLISNFSKVSGYKINVLNSQAFLYT1SINRQTES 
QIMSEVPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTTF 
HAHG 

6097 

A 

453 

545 

VGEVLLDNILWSLMMV*TYRWGFAVDVliSVC 

6098 

A 

1 

235 

REVLITTNVCARGIDVKQVTIWNFDLPVKQGEEPDYETYLHSIGRT 
GRFGKKGLAFNMIEVDELPSLMKIQDHFRKS 

6099 

A 

3 

706 

SAGDTQQTGSGVDLQQTPTDLQLRVLSVRRKTNKQKGHPHQNPICTS 
PPS KTKGVNAMIiRKVAVAAAS KPHVE I RQDGDQ FY I KTS TTVRTTE I 
NFKVGEGFEEETVDGRKCRFFDMLRDLLKNVDLKGFEPDVRILLTKY 
SNSNG S QS PWMEEQ IRDAWGSMVLKNWRETDEVGKVPQ7VKGNNW I 
TSLMTNRGFFEEGEATFAPSFLMNIKGNKTSVVKNSILEQGQLTVN 

6100 

A 

368 

798 

LSYIYMHP IQ/RAPRFIKQALRDSQRDLDSHTI IMGDFNTPLSTUDR 
PTRQKVNKDIQELNSVLHQVDIiXDIYRTFHPKSTEYTFFSAPHRTYS 
KIDHIVGSKALLSKCKRGEIITKIKLELRIKKQTQNRSTTWKLNNLL 
LNDY 

6101 

A 

1 

666 

MPVTLGTSMRSDSLGEPQTEVEPPLVIPRKTGSGVDLQQTPTDLQLR 
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VLTVRRKTNKQKGHPHQNP ICTS PS SKTKAQNLLKLil SNFSKVSGYK 
1NVQKSQAFLYNNNRQTESQIMSELPFTIASKRVKYLGIQLTRDVKD 
LFKENCKPLLKBIKEDTNKWKNIPYSWIGRINIVKMAIQPKVIYRFN 
AIPIKLPITFFTELEKTTLKFIWNQKEPAFPRQS 

6X02 

A 

412 

681 

GDIIPKCSQPLNIHFHYVFKMR*GSSRPGRGPEIRCFSPAARLLLTA 
HQQWPQLWAI FQQVALIiLPHAGGADHVI LFPS PVS KATEKIVG 

6103 

A 

489 

587 

EVPNFPTFPCLLLSPPNCLNLCLLTSAKVAST 

6104 

A 

1 

1474 

MDMLDKGMIQVPVESLECPEHFVQEPVLPSELRSEKQEPWNQPSLDIi 
EPSFATYMACSMLPAERQDLRPTSLVISHGVSVKDRREHCQCPNYYP 
PLRNMS PLEHS I KI CGG KXPQNDSQR I TTI TKVI VNQDFKQPDDTEIj 
SQGSRKETSEASLVTIiLWELVTAGQTFQDDRMGTAPVYSSQRERRRR 
RVISACPTEPPLVTPRQTGSGVDLQQTPTDLQLRVLTIRRKTNKQKG 
HPPKPHLYVTIIKDQRTLHPKSTEYTFFSAPHPTYSKIDHIVGSKAL 
LSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKI^NIjLLNBY 
WVYKEMKAE I KMFFETNENKDTTYQNLWDTFKAMYSGKFMALNAHKR 
KQQRSKIDTLISQLTELEKQEQTHSKASRRQETTKIRAELKEIETQK 
TLQKINESRSWFFEKINKIDRTLARLIKKKREKNQIDTIK\KIKVIS 
PPIPQKYKLLFILFGTGTQIN 

6105 

A 

1 

903 

MNGIYSRDTRELFPLCAMGALKEKSVACNVERPSPEPNHDGQGVDEP 
LSETGFKQAAAAGIFLNNVKFTHAFSSDLMRTKQPPLLIPRQTGSGV 
DLQQTPTDLQLRILTVRRKTNKQKGHPYQNPICTSPSSKTKELEKQE 
QTHSKASRRQEITKIRAEIiKEIETQKTLQKINDSRSWFSEKIDKIHR 
LRARL I KKKRGKNQ I DA I KDDKGD I TTD P TE I QTT I RE YYKHL YANK 
LENIiEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIEAHNSLPTKK 
GPGPDRFTAEFYQRYKEEL 

6106 

A 

803 

1184 

YS LMWC I S L VFDDGD VHMG FW CGC PFCLLVFLIiTVRTLi S CRS VG VC 
WRSTPDPVCLGISSGGRASAKVGAPPAASLPPCSLISDSCAGSEQGS 
VGVGPSRPGAGYNLLVCRIiLRPLGKRS IRVRVT 

6107 

A 

1458 

1568 

TQTNLQEKNQTTQS KSGQRV * TDTS RKKTFMQPTDT 

6108 

B 

24 

2456 

MGVSSGDLLKGLLHPRVRVGNEYVTKGQSVEQWFAVGALAKATYDR 
LFRWLVSRINQTLDTKLPRQFFIGVLDIAGFEIFEFNSFEQLCINFT 
NE KL.QQF FNQHMFVLEQEE YKREG I DWVF I DFGLDLQP C I DL I E KEE 
RLKPSRKKITKKHTKKRTASLILHAMICCRSLiNSSKTKNTKCLNSIN 
QRLKIWSLQKDLMCGTAGRCKTLHGTSQIim^ 

DDTSIYMFYQKMILYCIVDSECKSRDVLQSYFDLLGELMKFNVDAFK 
RFNICYINTDAKVTLKQINSSLVDSNMLVRCVTLSLDRFENQVDMKAA 
LLSIPGFVERLCKLATRKVSESTGTASFLQELEEWYTWIiDNAIiVLDA 
LMRVANEESEHNQVFWFEQHGLVSGAEKAPGTTIDQAPRIYRWSDLT 
RTQAQWQSAMTWSMSAFTETFLTERDKQSKWSGIPQLLLKLHTTSHL 
HSDFVECQNIIiKPLLVIPRQTGSGVDLQQTLTDLQLRVLTVRRKTNK 
QKGHPHQNPICMSPSSKTKEAKNI.DKRLDEWLTRINSIEKTLNDLME 
LNTMARKLRDACTSFSSQFDQVEERVSVIEDQMNEMKREEKFREKRI 
KRNKQSLQEIWDCVKRPNIjRLIGVPESDGENGTKIjENTIjQDIIQENF 
PNIiARQVNIQIQEIQRTPQRYSSRRGTPRHIIVRFTKVEIKEKILRA 
AREKGRVTHKGKPIRLTAVLIAETLQARREWGPIFNILKEKNFQPRV 
SYPAKLSFISEGEVTCSFTDKQMLRDFVTTRPAIiQELLKEALNMERNN 
RYQLLQKHAKL * 

6109 

A 

82 

1210 

LVEGKLTNRKNIHTKTPSVHHHHQRPKCGREMWGQSPHTESLLGHHL 
VELWEEGHCPLDPRMVDPPTACTMYLEKPQTLNASL/WKAARSQVPQ 
GGGKGTKKS E TE I QATIRE YYNKHL YANKLENLEETEKFIiDTYTLPR 
LNQEEVESLNRRI TGSKIEAI INSLPTKKS PGRDGFTAE FYQGYKEE 
LVPFLLKLFQS IEKEGILPNSFYEASI ILI AKPGRDTATTTKNFRTI 
SLMN IDAKI LNKILENQIQQHI KKLIHHNQVGFI PGMQGWFNIHKS I 
NVIHHIKRTKNKNHMIISIDAGKAFDKIQQPFMLKTLNKLGIDGMYlt 
KIIRAIYDKPTANIILNGQKLEAFPLiKTGTRQGCHSYSTIiTTPIQHS 
V 
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6110 

A 

1261 

2396 

LNALYFLLLPDCPGQN* *TASKTNKEEKREESNRCNKK* *S\NITTN 
PTEIQTTIREYYKHLYANKLENLEEMDNFLDTYTIiPRLNQEEVESLN 

KGHS IRKRGSQIVPVCR*HDCISRKPHRLSPKSP 

6111 

A 

1 

819 

MCHSAI FTVLQPPLWI PRQTGSQVDLQQTPTDLQLRVLTVRRKTKK 
QKGHPHQNPICMSPSSKTKATFEDSVWHFAERIIPDPNVIKLRKEEL 
TLNNIRQYYVLCEHRKDKYQALCNI YGS ITIGQAI IFCQTRRNAKWL 
TVEMIQDGHQVSLLSGEL\TVEQRAPFl\QRFRDGKETVIiIT\TNVC 
ARG I D VKQVT I WNFDL P VKQGEE PD YET YLQR I GRTGRFGKKRJLAF 
NMIEVDELPSLMKIQD\HFNSSIKQLH\AEDMDEIEKIDY 

6112 

A 

1 

1068 

MASWEGKDLTVPOPDTPKO^VT.PP TQt?T.VDDVT?iniT)nT ot wr*'Krvr , m\ 
SLSGGESSSQNTNVI^SLEDLKPFILLNDMEHLWSLLSNCKKTREKA 
SATRVYIVLDNSGFELVTDLILADFLLSSELATEVHFYGKTIPWFVS 
DTTIHDFNWLIEQEGVEAEKKAISLLSKLRNELQTDKPFIPLVEKFV 
DTDIWNQYLEYQQSLLNESDGKSRWFYSPWLLVECYMYRRIHEAIIQ 
RHR INLiS SDFLi I TPDFYWDRENLFnTjY7*)KTr'OFT. Q T HPP WT .WZinpp 

FGFLCDCTVAIGDVYFKAHKSVlxASFSNYFKMLFVHQTRFISLPLTA 
ATTGLRLTHSQVPFGKTGFNKSHRKAS 

6113 

A 

3 

655 

LLSlSTOQRCSISFNICrREKASATRVYI 

SELATEVHFYGKTIPWFVSDTTIHDFIWLIEQVI^ 

WEE Y I KMGKWVYHNHI FWPVPHE YCAMPQVA\ HDLYAELQKAHLILF 

KGDIxNYRKLTGDRKWEFSVPraQAlaNGFHPAPLCTIRTLKAEIQVGL 

QPGQGEQLLASEPSWWTTGKYGIFQYDGPL 

6114 

A 

3 

225 

GACGRPPPFPRPLPRGAPP/PAMAAPFSPPPASAPGAAPTFPPSLGP 
AWHPALFLTGINLTCFSKEREKEAQSWY 

6115 

A 

2 

624 

NRGSCHRGLKFEGAFKYKMGQIEIDDQVEGLQYLASRYDFIDLDRVG 
IHG WS YGG YLSLMALMQRSD I FRVAI AGAPVTLW I FYDTGYTERYMG 
HPDONEOG YYT-G SVAMDAPKPPQ P PNTR T .T .T .T .Hfz x?t rM?xn rtn? & ut ctt 

JLS FLVRAGKPYDLQI YPQEPJiS IRVPESGEHYELHLLHYLQENLGSR 
IAHWLFNQMRRFNQQKTQN 

6116 

A 

2 

594 

gihgwsyggylslmalmqrsdifrvaiag\spvtlwifydtgytery 
mghpdqneqgyyi^svamqaekfpsepnrllllhgfldenvhfahts 
illsflvragkqXydlqsypqerhsirvpesgXehyelhllhylqen 

LGSRIAALKVI j 

6117 

A 

4 

310 

P VLCG I L EW S OTLMTVNE F I S I Y T VNE PP nvfiPT , Zi n WO T . pnn T PPT . IC 
QDISI PD YCS LGDGEEEE IT INAW FGPQGT I S PLHQDPQQNFLVQVG 
AAAGIVAF 

6118 

A 

1 

601 

L 0 KW S L E Y I OE I AG CRTV P VEVG S P YTD PP w c; ott .mtvijt? p t <5 irv T v 
NEPRDVG YLAQHQLFDQI PELKQD I S I PD YCS LGDGEEEE ITINAWF 
GPQGTISPLHQDPQQNFLVQVMGRKYIRLYSPQESGALYPHDTHLLH 
NTSQVDVENPDLEKFPKPAKAPFLSCIIjSPGEILFIPVKYWHYVRAL 
DLSFSV\SFWWS 

6119 

A 

3 

306 

EI IAFS/DRAED/ FRKIX3CEVLGVSVDSQFTHLAWI1^PRKBGG1jGP 
IxNI PLLADVTRRLSEDYGVLKTDEG IAY\ VCPAGWKPG <3DTTTCPNVT> 
DSKEYFSKHN 

6120 

A 

1 

774 

PTRTKGFAGQGSLGAENAGPRWIVRASRGLPTQLSVMASGNARIGK 
P APD FKATAWD \ G AF KE VKI» S D YKG TCYWT , FP Y PT . PT , W PPf? Z\ PTP T 
IAFSNPCRGTSRKiGL*SAGASRWDS\HFTHL\AW\INTPRKRGEAL 
GPL\NIPPALLNGEPERLV*GIYGRC*KTD*GAFAYKGASLFIRLGK 
GCPFAKI TLL I DL \ PVGTLPLDEGFCGLVQGLSQVHxCTS HGGKFVPA 
\ GWKALPVATD *APTWMTARKYFSKHN 

6121 

A 

3 

306 

KNWPPPRFF/YKGPPPSSSSSSSSSSSSSSSPLTPFFPSSSPGVCPL 
KGSSSSSPPPKKP\GRPPSSSQKGGKPPFFKPPNFFPKxNTSPTPGGAP 
LKPFWKNF 

6122 

A 

1 

200 

PPH1WPRRWSANLYLTP\ Sl^TWLLT^ 
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KKADDRE KRQEAHR FH FD AM 

6123 

A 

2 

298 

NREEQNLVAYPHDGKI FFCTS QD I PPENELLF YY SRD Y AQQ I GQKNY 

RCTLCDKSFTQKAHLESH^IHTGEKNLKCDYODKLFMRRQDLKQHV 
LIHT 

6124 

A 

1 

211 

VSPHVLII^QSSRHLGSLYLQAPFV*MAGCLLETGSHV*TYKDLHVT 
ITRHLPKLWKTRSTLNFQVARSS 

6125 

A 

1386 

2151 

DYLCEKHQYRQLCGYLQPCQGSPC^TPRAGSGRAALRTGMAVLARA 
AGQRLRARLPPQAQRARAAAGALRERGLCAPPRPPAPGAGRQSPQQS 
GELRVPSTTSSTCRSCWSARPWGSRRGRAVPQRRRNATATGSPRPLR 
RLRPAASPRRGAASERPNWPDPPRP\RTARSRALNLRSSSKVNATLV 
PCHKETEAQRGVWTGETIPVRTCFGPLIGQQSHSMEVAEWTDKAVNH 
IWKAPRNTALKMTQTYNDPAD 

6126 

A 

1 

1337 

MRNKAVGQARAPPAQLAGWAAPFPKLLDGRPQVFPEMPQPVAGCTAT 
GPREKLGGPGRPRGRVAASVQRLMNEMNLSPVGMEQLTSSSVSNAIiP 
VSGSHLGLAASPTHSAIPAPGILSYLADRPPPQYIHPNS INVDGNTA 
LSITNNPSALDPYQSNGNVGLEPGIVSIDSRSVNTH\GAQSLHPSDG 
HEVALDTAITMENVSRVTSPISTDGMAEELTMDGVAGEHSQIPNGSR 
SHEPLSVDSVSNNLAADAVGHGGVIPMHGNGL\ELPWMETDHIASR 
VNGMSDSALS\DSI\HTVAMSTNSGPRESSAVAPVASHFSVHVFLFS 
F I WVS PADAETLGE S LLGLTVYGSNDQD P YVTLKDTEQ YERED FL I K 
PSDNIiIVCGRAEQDQCNLEVHGNYIAVGNMTPVIEVWDLDIVDSLEP 
VFTLGSKLSKKKKKXGKKVKKLINI 

6127 

A 

1 

2412 

MKMNLS pvgmaqlts s s vsnalpvs g s hlglaas pths ai PAPGLP V 

AIPNLGPSLSSLPSALSLMLPMGIGDRGVMCGLPERNYTLPPPPYPH 

LESSYFRTILPGILSYLADRPPPQYIHPNSINVDGNTALSITNNPSA 

LDPYQSNGNVGLEPGIVSIDSRSVNTHGAQSLHPSDGHEVALDTAIT 

MENVSRVTSPISTDGMAEELTMDGVAGEHSQIPNGSRSHEPLSVDSV 

SNNLAADAVGHGGVIPMHGNGLELPWMETDHIASRVNGMSDSALSD 

SIHTVAMSTNSVSVALSTSHNLASLESVSLHEVGLSLEPVAVSSITQ 

EVAMGTGHVDVSSDSLSFVSPSLQMEDSNSNKENMATLFTIWCTLCD 

RAYPSDCPEHGPVTFVPDTPIESRARLSLPKQLVLRQSIVGAEVGVW 

TGETIPVRTCFGPLIGQQSHSMEVAEWTDKAVNHIWKIYHNGVLEFC 

I ITTDENECNWMMFVRKARNREEQNLVAYPHDGKI FFCTSQDI PPEN 

ELLFYYSRDYAQQIGVPEHPDVHLCNCGKECNSYTEFKAHLTSHIHN 

HLPTQGHSGSIGPSHSKERKWKCSMCPQAFISPSKLHVYFMGHMGMK 

PHKCDFCSKAFSDPSNLRTHLKIHTGQKNYR\CTLCDKSFTQKAHLG 

V\HMVTHTGE\KNLK\CDY/ CVDKLFYGGSRDLQGRHVLiIH\TQER\ 

QIKCPKCDKL\FLRTNH\LKKHLNSHEGKR\DYVCEKCTKAYLTKYH 

LTRHLKT\ CKGPTS S F VQAPEEEEEDDSEEEDLADS VGTEDCR IN\ S 

AVYSADESLSAHK 

6128 

A 

1 

1767 

MQALIPLESGSGMTQTSGVSTETGDWKDMGVNNQSKEGRCPWKDHE 
AAPWISEKPKKRGNEGKSKKFKNNYSTQPARMERKEEILNPPFEGKD 
GDTGSIPHKSKEIGFTFPKMHDSSFSHTPDTPTVEAVDRKGGNFQVN 
FVEI^TLGENKISTVTCASTVTEPPAKVTDVSCQEQIQGAGFVPSVVS 
EENKTD7VANRYTAVADKPSERSNDGKSSGSAACEKLPTPTPQVVKEG 
DSFPDTLAKNGQEIAPAQISKSLMVDNYTKDGVPGQERPKGPSAVVP 
STSTGGVALPITTAIETVNIHGDHSLKNKAELADSMKNEAGIDEGHV 
IGESESVHSGASKHSVEKVTELAKGHLLPG/CASRRPEPTRRGQSPR 
R/T/HADRGNFPAHPVNEEKETKEGSVAVQIPDLLEDKAQKLSFCED 
Ayuxw £> Js\a bXJb LN KKVDLTLLS PKS END KL KE I SLACK I TELES V 
SLPTPEIQSDFLHSKVEAPPSEVADTLVIMTASKGVRLPEPKDKILE 
TPQKMTEKSESKTPGEGKKEDKSRMAEPMKGYMRPTKSRGLTPLLPK 
STIQEQERHKQLKSAGMNLPWGNVCACGF 

6129 

A 

893 

3822 

GKGNHATVNLLS FLQRPQVGSLVHPPVVG/HGNQRKS IHVDSLEPQR 
DLGREAWDIESTPIMMKKKKKKPKQKRYSQPRAGGPSDDDNADKPKG 
HPFAADTQKSGVLPSQPTTMGTEYGLVSGENLKRECLVNSSAARLVA 
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ENFVSESLRIPLYPSEEAPKTAISSQSKLRVEEESKSNKSVIjQNQDK 
KLLKQHEYKPQPAPHIjKTPVDKSQSVGPLNIjKGPLAEVSAYITVETPL 
D I RLKEGCS FFlfDQEVMGVVSKPTAAKEI PNLVPTLI ASNPLECNLK 
EGNNESKMTKLQNVKLKEFPEGAEEDKELKKEAFPNERQEISIFTSE 
QDQGQVLVQVPGVENEPFKRMAGDGKSRKGRGSSGKMRTDSGKVKAK 
S ELPFLLD S QKDGRAVL IPS EPVS KTEGMTTQDKS EELGLNS S KQPG 
TKADLTEAWMGEPKEMTQPKVAGTMQALIPLESGSGMTQTSGVSTE 
TGDWKDMGVNNQS KEGRCPWKDHE AAPW I SEKPKKRGNEGKS KKFK 
NNYSTQPARMERKEEILNPPFEGKDGDTGSIPHKSKEIGFTFPKMHD 
SS FSHTPDTPTVEAVDRKGGNFQVNFVELGTLGENKI STVKASTVTE 
PP AJCVTDVS COEO T OGAGFVPS WS EENKTD AANP YTAVADKP S KRS 
NDGKSKKVKNSSPEKHILENKIDATKIHVPMETTGDQGIEG\WPIWT 
KIEIL/RFTCPRTPSEIilNKSSPLEVLESAACEKLPTPTPQWKEGD 
SFPDTLAKNGQEIAPAQISKSLMVDNYTKDGVPGQERPKGPSAWPS 
TSTGGVALPITTAIETVNIHGDHSLKNKAELADSMKNEAGIDEGHVI 
GES ESVHS GAS KHS VEKVTELAKGHLLPG / CAS RRPE PTRRGQS PRR 
/T/HADREPKDKILETPQKMTEKSESKTPGEGKKEDKSRMAEPMKGY 
MRPTKSRGLTPLLPKSTIQEQERHKQLKSAEPLYGGCEINLIQF 

6130 

A 

1 

1965 

MQ AL I PLE SG S GMTQTSG VS TETGD WKDMGVNNQS KEGRCPWKDHE 
AAPWISEKPKKKGNEGKSKKFKNNYSTQPAPvMERKEEIIiNPPFEGKD 
GDTGS I PHKSKE IGFTFPKMHDSSFSHTPDTPTVEAVDRKGGNFQVN 
FVELGTLGENKISTVKASTVTEPPAKVTDVSCQEQIQGAGFVPSVVS 
EENKTDAANRYTAVADKPS KRSNDGKS KKVKNS S PEKH ILENKIDAT 
KIHVPMETTGDQGIEG\WPIWTKIEIL/RFTCPRTPSELINKSSPLE 
VLE S AACEKLPTPTPQ WKEGDS FPDTLAKNGQE I APAQ I S KSLMVD 
JN X i j\U\j v lrOvi!»lv Jr iv\j roi-i. V Vfo X o X V /\_Li Jr J. 1 X /\X £j X VJM inbUrtolj 

KNKAEZjADSMKNEAGIDEGHVIGESESVHSGASKHSVEK\n*EIiAKGH 
LLPG/ CASRRPEPTRRGQSPRR/ T/HADRGNFPAHPVNEEKETKEGS 
VAVOI PDLtLEDKAOKLiS FCEDONAODRNS KGSDSLNKXVDLTliliS PK 
SENDKLKEISLACKITELESVSLPTPEIQSDFLHSKVEAPPSEVADT 
LVIMTASKGVRLPEPKDKILETPQKMTEKSESKTPGEGKKEDKSRMA 
EPMKGYMRPTKSRGLTPIjLPKSTIQEQERHKQLKSAGMNIjPWGNVCA 
CGF 

6131 

A 

1632 

2783 

VILLSKSLIIQENFFFFF*QSFALVAQAGVQWSNPCS\LQPPPPGFN 
GFSCLILPSSWDYRRVPPCPA\NFVFLVETGFHHVGQAGL»EIjLTSGD 
PPTSASQSAGITGVSHHAQPAEDIANNVTLFFPLHLFL 

6132 

A 

240 

564 

NGWEFKHGPSWPGGFFFFFFFETESRSVA/TRLECSGVIPAHRNLCL 
PGSSDSPASASQLTGTTGAHHHARLIFVFLVETGLRHIjSQAGLELL/ 
NLI IHQPWNPQELGLQA 

6133 

A 

1 

1898 

MAASGRGLCKAVAAS P FPAWRRDNTEARGGLKPE YDA W I GAGHNGL 
VAAAYLQRLGVNTAVFERRHVIGGAAVTEE I IPGFKFSRASYXjLSLL 
RPQIYTDLELKKHGLRLHLRNPYSFTPMLEEGAGSKVPRCLLLGTDM 
AENQKQIAQFSQKDAQVFPKYEEFMHRIiAIoAIDPLLDAAPVDMAAFQ 
HGSLl^PJ^SLSTLKPLLKAGRILGAQLPRYYEVLTAPITKVLDQWF 
E S E PLKATLATDAV I GAMTS PHTPGS GYVLiLHHVMGGLEGMQG AWG Y 

GTEVRSKMVLSl^SPQITFLKLTPQEWLPEEFLERISQLDTRSPVTK 
INVAVDRLPSFLAAPNAPRGQPLPHHQCSIHLNCEDTLLLHQAFEDA 
MDGLPSHSDLGSAKIHTPKDRKCIPGEKSEDEGRPVIELCIPSSLDP 
TLAPPGCHVVSLFTQYMPYTKAGGKAWDEQERDAYADRVFDCIEVYA 
PGFKDSWGRDILTPPDLERIFGLPGGNIFHCAMSIiDQLYFARPVPL 
HSGY\AGPLQGI*YLCGSGAHPAPYSEILTSLTKRVPIQHVHPQPLRL 
WAVDIoAP PQTFGAGSLEGAR I Y 

6134 

A 

163 

513 

CLLFFLTPSHFSVFGTPVFDCNRVYAPGFKDSWGRDILTPP\DLER 
I FGLPGGNI FHCAMSLDQLYFARPVPLHSGYRCPLQGLYLCGSGAHP 
GGGVMRAAGRNAAHVAFRDLKSM 
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6135 

B 

1 

720 

MGCKHWMTCGRARKEG KDD S YMQTQWLHRAFAAPPBTCPFPLRDH I Q 
TGRDLYILGTFEKCVFWGGAEREVWRERHQREPGLRAAFAGQLEFPA 
GVGLAGPALGAAGQPCWPRGNEGLSTRASGCGGCTGSSSSASPPVLC 
SISHQALAAFLRGRSRDLQRAMPEPPTPSMGSCAAGASPTSTTPCSK 
EPSPIDHPRAEECECTARDWQAAPPAALVRDPLGEASWAPESGGDME 
SLYI* 

6136 

A 

133 

311 

QLHADKVSRFATHA/GVPWCDHGLFQPQPPPLKRSSCLSLWSSWDYR 
HAPPHPAI I CLLL 

6137 

A 

1 

1098 

MGFHHVGQAGLELL I S SDP PALAS QGAG I TG S WKW I LAPV I L Y I CER 
ILRFYRSQQKWITKVKNT^FPLAVIRVIFYRTTELNFYVVTLKEEIi 
GNWAQW VWMHPS KVLELQMNKRGFSMEVGQ Y I FVNCPS I S LLEWHP 
FTLTS APEED FF S I H I RAAGDWTENL I RAFEQQ YS P I PR I EVDG PFG 
TASEDVFQYEVAVI^VGAGIGVTPFASILKSIWYKFQCADHNLKTKKI 
YFYWICRETGAFSWFNNLLTSLEQEMEELGKVGFLNYRLFLTGWDSN 
I VGHAALNFDKATD I VTGLKQKTS FGRPMWDNE FSTI ATSHPKS WG 
VFLCGPRTLAKSLRKCCHRYSSLDPRKVQFYFNKENF 

6138 

A 

2 

241 

PPRFKRFSCLGLSSSWDYRCTPPRSANFCI\LVEMGFHHVGQAGLEL 
LTSDDPPTLASQSAGITGVSHHAGPHRFLINIY 

6139 

A 

3 

335 

YFISSHFIISFHYVISFHLITFHLISFH/HHFISSFHFISSFHLFIS 
FH/HHHLINSFHLISSFHHFISSFHLISYLISFQF/HFHYFMSSHLI 
SSFHL.ITSHHFINFIISSFH 

6140 

A 

3 

137 

GVSPCWPGWSRAPDLK* SAHLGLPKCWHYRRPIHDCGQAPHLHC 

6141 

A 

967 

1386 

KKKKIFIGHQLRWL*RFFFFFFFETGSHSVPQAGVQWCDHGSIjQPQS 
PWAQRSSNLSLPSSWDYRHLPPCLANF\FIFFVEM\GLTMWPRLVLN 
S\G\PSDLPASVSQ/SAGITGVSHRNWWPMKIFFFFQTASHSVAQAG 
VQWCNLGSLQALPPAFMPFSCIiLPQPP\SSWDYRSLPPCLANF/CFG 
FLVETGVS PCWPGWSRTPDLR 

6142 

A 

278 

643 

GRQERGERERS SRPLGGRKCQ/ HWIjAVNAQQKLLAIjQVE KAKVEVGV 
APALGILWAGCSFAIRRYQKKSDSLKQPQNPVLEAAVGVPVEMSLP 
HASSSLTLVPCLRRSLDPFLCFPLSLAKV | 

6143 

A 

18 

479 

QRTFCKGKWKGAAKILIDLNHLFLCKLEQKGTDLXLVIAMENYGSLG 
-SKKGVDLPG\AVDLIAMSEKDIQDLKFGVEWDVDMVFASFIHVASNV 
HE IRKAI IANWSKVPTCPPELGS YLPRGTETHS YFYIS YSSDQIQIC 
KPLHYSYGRNYDTL 

6144 

A 

3 

282 

RSPPA\GSPPPSAPAPRLPPPPAPPPALPFPASRPTPFPPLPVSLPV 
LPSSRSLSLPPAFLFSFFCIGVLGLLHTRALSIAAYIKAGAIMQIiYC 

6145 

B 

26 

2671 

MAAPG AP AE YG Y I RTVIiGQQ I LGQLDS S S LAL.PS EAKL KLAG S SGRG 
GQTVKSLRIQEQVQQTLARKGRSSVGNGNLHRTSSVPEYVYNLHLVE 
NPFVGGRS PVPKTYDMLKAGTTATYEGRWGRGTAQYS SQKS VEERS Ii 
RHPLRRLEI SPDS SPERAHYTHSDYQYSQRSQAGHTLHHQESRRAAL 
LVP PRYARS E I VGVS RAGTTS RQRHFDT YHRQ YQHG S VS DTVFDS I P 
ANPALLTYPRPGTSRSMGNLLEKENYLTAGLTVGQVRPIiVPLQPVTQ 
WRASRSSWHQSSFHSTRTLREAGPSVAVDSSGRRAHLTVGQ7^AAGGS 
GNLLTERSTFTDSQLGNADMEMTLERAVSMLEADHMPPSRISAAATF 
I QHEC FQKS EARKRVNQLRG I LKLLQLLKVQNEDVQRAVCG ALRNLV 
FEDNDNKLEVAELNGVPRLLQVLKQTRDLETKKQITDHTVNLRSRNG 
WPGAVAHACNPSTLGGQGGR I TRS GVRDQ PDQHGLL WNIj SSNDKLKN 
LM I TE ALLTLTEN 1 1 1 PFSGWPEGDYPKANGLLDFDI FYTtfVTGCLRN 
MS S AGADGRKAMRRCDGliIDS LVHYVRGTI ADYQPDDKATENCVCIL 
HNLSYQLEAELPEKYSQNIYIQNRNIQTDNNKSIGCFGSRSRKVKEQ 
YQDVPMPEEKSNPKGVEWLWHSIVIRMYLSLIAKSVRNYTQEASLGA 
LQNLTAGSGPMPTSVAQTWQKESGLQHTRKMLHVGDPSVKKTAISL 
LRNLSRNLSLQNEIAKETLPDLVSIIPDTVPSTDLLIETTASACYTL 
NN 1 1 QNS YQNARDLLNTGG I QKI MA I S AGDA YASNKAS KAAS VLL YS 
LWAHTELHHAYKKAQFKKTD FVNSRTAKAYHS LKD * 

6146 

A 

2 

355 

FYFLRWSFTLIAQAGVQWRDLGSPQPLPPRFKQFSCLSIjLNSWDYRH 
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APPRPANFLCF/LVBTRFLHAGQAGLELLTSGDPPTLASQSAGITGA 
SHRNRPVQSILRLEGLLETIVQPPT 

6147 


3 

277 

FFLBTEFRSTQSPRIiECSGAISAHCNLQLPGPSHSPASASRAAETTG 
LCHHARL I FVFLVEMRFHHVGQSGLElA RQAI CLPRPPKVLGPQA 

6148 

A 

20 

364 

LHHFPKPICPAHFSACIHWPQHPHTCACTQAHMCKPKH/THTHACTH 
/ ICVLSGTHPWVHTGVCARSSSSS / PSSSSSS LR YTRQ VHALLHTAQ 
PQRL.KASEPTPAQLQPGRQEGLTG 

6149 

A 

3 

301 

STATWAGVQWCNLSSLQPLPSGFKPFSCLSLPGSWDHRHIiPPCPANF 
LYCFFLVEMGFHYVGQAGLKLLT/ s /GDLCASAPQSAGSTGVNHRVR 
LGLLIYIP 

6150 

A 

15 

352 

DSIFTLVCAEHINFLRAGQHWGCVIDCLHYNKQAVTGDHSSYSAERS 
STPENGGREGTREQGERRGGTWGRGGG/ QERPS IPDRQKNQGRGGGG 
NPLSGRGKVKAITI ILLI I 

6151 

A 

20 

416 

LVEDFVWYQAAPTLRALLPSLIGFAGCPSLLHSPLIGHSTAALRGAG 
SAPVGCLWWGCGVADPAGSARTMGTRDDEYDYLFKWLIGDSGVGKS 
NLLSRFTRNEFNLESKSTIGVEFATRSIQVDGKTIKAQ 

6152 

A 

24 

156 

S SFKVMELCPELVGS WSH *LQE * SRGPLR* RTHSSRYMVKRQE 

6153 

A 

128 

880 

HYLDVAPFLFPDRALLPSLIGFAGCPSLLHSPLIGHSTAALRGAGSA 
PVGCLWWGCGVADPAGSARTMGTRDDE YD YTiFKAOCLOALGS RHLST 
PPHRWQPRWCSTQGTGGKWGAWGGVWGFTEPRLPAQWCSSGTQAW 
ARATCCRAS PATS S TWRARAPS AW S S PPAASRWTAR P SRRRSGTPLA 
RSATAPSPPQLSLLPSSLVmiATSSPTITIALAMAAGNLCPLPSSCR 
RRRRWCQASQQRGCIi 

6154 

A 

310 

719 

LRTYPTIRGPVREEKWGYVQTESRSVAWAGVQWHC\FTEL.RGFLVSL 
TS RMKPRTliAVS VPALKVARLE F VS SDVQRWG FHHVG P AGLELLTS S 
DliPVSASQMAGITGTRHRAWPWEHVYMDFVHFLKQTEGl^IiLDL 

6155 

A 

1 

169 

FFETGSCSVAQAGMQCSDHGSLQPCPPG\SWDPPAIASQSAGITSMS 
HHNHPKLFI 

6156 

A 

525 

972 

NHLGGFKQILMPRLHPMPMKSFLGVSARQLYFFFFFFFETESHSIVQ 
AGVQWRDLGSLQPLPPGFKQF\SCLSLPSSWNYRHTLPRPANFLYFL 
VETGFHCVAQAGLKI.LSSGNLPAXASQSARITGMSHHARLAVVFFKA 
LLSSSRTRI 

6157 

A 

300 

500 

DGLCS KQT *NGVP S ITNP I KTANQHQG KKQHPI QGNRTAPS S S VS YM 
KRCELLPKGTGLEWRESG 

6158 

A 

1 

1889 

MGLLQGLLRVRKIiLLWCVPLLLLPLPVLHPSSEASCAYVLIVTAVY 
WVSEAVPLGAAALVPAFLYPFFGVLRSNEVAAEYFKNTTL.L.LVGVIC 
VAAAVEKWNLHKRIALR1WLMAGAKPGMLLLCFMCCTTLLSMWLSNT 
STTAMVMPIVEAVLQELVSAEDEQLVAGNSNTEEAEPISLDVKNSQP 
SLELIFWEDRSNT^LTTLMHNENLNGVPSITNPIKTANQHQGKKQH 
PSQEKPQVLTPSPRKQKLNRKYRSHHDQMICKCLSLSISYSATIGGL 

SWFWIViHWLFIX5CNFKETCSLSKIO<XTKREQLSEKRIQEEYEKL 

YPEM^n^FFFILMTVLWFTREPGFVPGWDSFFEKKGYRTDATVSVFL 

GFLLFLIPAKKPCFGKKOT)GENQEHSIjGTEPIITWKDFQKTMPWEIV 

I Ij VGGG YAIiAS G S KS S G L, S TW 1 GNOMLS T. S S LP P W AVTLTJVC I LiVST 

VTEFVSNPATITIFLPILCSLSETrffllNPLYTLlPVTMCISFAVMLP 

VGNPPNAIVFSYGHCQIKI)MVKAGLGVNVIGLV\IVMVA\INT\WGV 

S \ LFHLDT YP AWARVSN I TDQ A 

6159 

A 

41 

570 

SLHDSTADSIRTTCQGVISTLKAHYLRRTFQHIIiEAADGEDPSYIK\ 
EFWRITYSIMDAVDI^IAWEELKPALMNSMWKKIWPECVQAQRFSQAD 
NIAQLQKNIVTLARNVAFEEVAEAAVDQLLQSHEEDLSNEELMRLEQ 
ELAVGEEEREDGPWALWQLTTGRL.SAALSHFEAGL 

6160 

A 

3 

367 

HASAHAFGQLSCVSKLMRDVCGQPCFSPRG1W/LYLQWGKRKYSR\G 

NSSWQIKEKVWRFSTAFCSViraWKFADILSM^ 

EAI PLPCMCVTRELTKEGRS LRS VLKP VL 

6161 

A 

74 

295 

AEDKSQ^LDLRWSYIRTSSSSADRSHASGPRIAHKWSTVDRNRPRY 
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6162 

A 

32 

342 

LMKKI WS L YLGDRAG I CLKRKKI GRVWWHVPVVPAARGAGVGGS LEP 
GS LRLQRAVI ALLHS S LGDRVTVS KKKKKI KHTS VFHFS FNHNHALK 
NTLEHMEQI 

6163 

A 

184 

556 

QTVARRCTALTFIASEEKVIiVADKSGDVYSFSVLEPHGCGRLELGHL 
SMLLDVAVSPDDRFILTADRDEKIRVSWAAAPHSIESFCLGHTEFVS 
RI S WPTQPGLLLS SSGDRHPEALGVQERP 

6164 

A 

251 

X JL J J 

i'WiJb KJ<Li 1 lit KTKP WQCLS VR TVARRCTAIjTF I AS EE KVLVADKS GD 
VYS FS VLE PHGCGRLELGHLCMLLDVAVS PDDRF I LTAARDE KMRVS 
WAAAPHS I ES FC\ LGHTE FVSR I S WPHS ARA\ LLS S SGDGTLRLWE 
I KJ>VjKUf Aliijb FWP VCRS S VDPQAPQKFAASRI AFWCQENCVALLCD 
GTPCGLH LPAGR PQTAVG VQAAAG VPAP S VGRG FRGD PGAVGAPGLA 
RKPPWCSYRHVGDQWQSVPESTVLKKVSGVLRGNWPMLERLCRRRRQ 
LSAVSYKAHVPTT 

6165 

A 

1 

392 

APMAGCLSFYYLIQQGDDKGFSLYTRDGAGLYKEIWKADRPGNAAWN 
LAEGE FNAP Y PME V I FEDAFNG PKGG YVALDD I S FS P VHCQNQTELL 
FSAVEASCNYEQDLCNFYQDKKRPGWT\RVKGKPNML 

6166 

A 

3 

1459 

KLPPPPGECTFEQDECTFT/AGEKKPEQLAQEEG\ETPTSYTGPKGD 
HTTGVWSSHI^ECPFGLNSDALLPKKICSLVliLLTSFLAASADSG 
TMXiLRWGCLKSKIKMSVGLVPSEISIiLGLQMAAFLLHMAFPLCQCTP 
DMYLLLTVGNRNHGRQKQIKGALLYWTFTTCRHCARNLNILSTTCKD 
KVGEFAKLS KIELLEATEKS IGPPEMHKYHCELKNLREKEKQLEDLIi 
ATKIASKVAKLGILLGMPTLEIGGKEGGGWGTGYWLRAEKEDGDYLC 
IEGSEGLNTEKIVPVMQERNSININKKDVHSETPAKGHQLQRPKVDK 
i^MtWHiKN yHKKAENS KTQ I TS S PP KD PKS S PAREQTCMENEFDKLTE 
VGFRRWAITNSPELKEDVLTQCKEAKNLEKRLDKLLTRITSLEKSMN 
DLMELKNTGRELCEAYTS INS R I DQVEER I S E I EDQLNE I KQDDKIR 
EKRVKRNKQS PKKYGTL 

6167 

A 

1 

393 

MYGGGTGLLS VYLKKEEDSEESLLWRRRGEQS I S WLRALIE YSCERQ 
HQLLRVKYTAMGQGDFGGCCSSPGKRSDGLGYAAEIIFEAIRGVSIR 
SDIAIDDVKFQAGPCGEMEDTTQQSSGYSEDLNEIEY 

6168 

A 

1 

1137 

MKRKHVPIDDKMLQQKAIiRLHEDFSKGSPETSDTKPFTASPEQLQSS 
EISAQKGGDEVTWQFPSSTSSMSGSLKPPPLDQGHYI^TYVDSVYVKHF 
QE VAQI. I S PLTTAPMAGCLS F YYQ I QQGNDNVFSL YTRD VAGL YEE I 
WKADRPGNAAWNLAEVEFSAPYPMEPFPLGKDKLLGFPVDFGEFRYQ 
VLMGPLLLRLLTHTRGMHTWVVFPAQVKAVLTSDS PMVTGRRWLTEL 
TVKVAGREESKTKGVI FEVAFNGPKGG YVALDD IS FSPVHCQNQTEL 
LFSAVEASCNFEQDLCNFYQDKEGPGWTRVKVKPNMYRAGDHTTGLG 

KSEICLCGDILGSRELYWPFLLHTCKDIDLKKKGSCEEAKTISSLVF 
DHF 

6169 

A 

1 

1793 

MAETAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMUYLFVDQIR 
KSDGTLQEHDG I CE IHVAKYAE I FGLTS AE AS KD I RQAL KS FAG KEV 
VFYRPEEDAGDEKGYESFPWFIKRCATAVGRQRTHADKERGAREKGG 
EKKAKAKKTSGTRKRGGGRERKKKGEQEGKDGQRNAQTKGEKRQPHK 
ANTPPKTLQQRKEKKTGDSHRRKKKLFKESEKVLSQEWEHDFVIETM 
ATGARPQLLFSAVEASCNFEQDLCNFYQDKEGPGWTRVKVKPNMYRA 
VjD\HTTGIjGYYLIiANTKFTSQPGYIGRLYGPSLPGNLQYCLRFHYAI 
YGFLKMSDTLAVYI FEENHWQEKIWSVLES PRGVWMQAEITFKKPM 
PTKWFMSLCKSFWDCGLVAIiDDITIQLGSCSSSEKLPPPPGECTFB 
QDECTFT/ AGEKKPEQLAQEEG\ ETPTS YTGPKGDHTTG VGYYMY IE 
ASHMVYGQKARLLSRPLRGVSGKHCLTFFYHMYGGGTGLIiSVYLKKE 
ED S EES LLWRRRGEQS 1 S WLRAL 1 EYS CERQHQ 1 1 FEAI RGVS IRSD 
I AIDDVKFQARP\ CGEMEDTPQQSSG YS EDLNEI EY 

6170 

A 

2 

392 

RNKKQGYPAFHVFSTFFYPK* KSGGYQAVKRWTKGVNLFEQEI ILVP 
IHRKVHV/SLV\QYLQDESKTKRNSDLNLLEWTHHSMKPHEIPQQLNG 
SDCGMFTCKYADYISRDKPITFTQVSEDPLHPFTCY 
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6171 

A 

2 

386 

MNLLVERNKKQG YPAIiHVFSTFFYPKLKSGG YQAVKR\ WTKGVNIiFE 
QEI IIiVPlHRXVHWSLVVlDLRKXCliicyT^n<;MnnTcr;trp tcvmk \tt r*v 

LQDESKTKRNSDLNLLEWTHHSMKPHEIPQQLNGB 1 

6172 

A 

3 

162 

GGG VLTLLVE VSN I FLT I RMMMK I SNAQDHLL YR VNKG P WAPS CWVS 
CSCWT 

6173 

A 

3 

407 

TRRADPLRTWRWHNLLVS FAHS I VSG I WALLCVWQTPDMLVE IE TAW 

SLSGYLLVCFSAGYFIHDTVDIVASGQSASLL\EYLVHHVMAMGAFF 
SGIFWSSFVGGGVLTLLVEVSNIFLTTRMMMTfTQTsjnrknxjT t 

6174 

A 

13 

367 

KYLFFFLKDHTKVGKRJ^SIRl^QNLGRGRTKSCLKKLVAGANCRDQ 
DDPSPRNQQPRSATFRPSPNRCH*IPQ*PYSAQCPHPGSGGQFRSLR 
GLIVEYRSSERHISYRRYAGLCEY 

6175 

A 

3 

413 

AAIEELCGROAARAEAVRGLRGPRC5"LR /WPT.DUmD7\DD7\upAt>T »do 

GARARRSEKPPSEKPRLRRSSPRAQEEGPGEPPPPELALLPPPPPPP 
PTPATPTS S ASNIiDLGEQRERWETFOKROKLTS KGC R OA P APHT, 

6176 

A 

2 

445 

RPLPVmPAPPAHGAPLA^PGARARR/CEKPPSEKPRLRRSSPRAQEB 
GPGEPPPPELALLPPPPPPPPTPATPTSSASN1UDLGEQRERWETFQK 
RQKLTSEGAAKLLLDTFE YOGLVKHTGGCHCGAVR ppvwa <: nm w T p 
DCKYRNYI 

6177 

A 

2 

265 

FLMESRS VTRLECS S T I TAHCNLCLPVS SDS AVS ASQVAGTTGMRHH 
AQLIFVFLVETGFHHVGQDGLHLL/NIVIHLPRPPKVLGLQA 

6178 

A 

1 

394 

WLVGFGFGFFEIESRPVAQAGVQWRNLGSLQPSPSKFKRFSFLNLPS | 

SLDYRHTPPSLANFLYLVQMGFHHVGQDGIjELLTS\DPPTIiASQSAE 

ITGVSHRTQPRSAILLQSQSSLLPIjLRTFISSVPVTE 

6179 

A 

3 

1306 

HE I SLGAI CAVLIiVIMGLFATRCNREKKDTRS YNCRVAESTYQHHPK 
RPSRQIHKGDITLVPTINGTLPIRSHHRSSPSSSPTLERGQMGSRQS 
HNSHQSI^SLVTISSNHVPE^FSLELTHATPAVEQVSQLLSMLHQGQ 

YOPPPfiPPfSKTOTVCD CVTJVRT rir»MnVI>PT im^nnrk^nnnnnfMrrM' 

x yr o r «. uim i\ x oKis i K x AljyUMJL) I\_r SIjKDSGRGDS EAGDSD YDLGR 
DSPIDRLLGEGFSDLFLTDGRIPAAMRLCTEECRVLGHSDQCWMPPL 
PSPSSDYRSNMFIPGEEFPTQPQQQHPHQSLEDDAQPADSGEKKKSF 
«-» i r uiuj^r \ivi miuuui o x oojjjjbtrloo Vr yKljljPPSLiD TY SECSEV 
DRSNSLERRKGPLPAKTVGYPQGVAAWAASTHFQNPTTNCGPPLGTH 
S S VQP S S KWLPAME E I PENYE EDDFDlWIilWl^GKHELMDASELVA 
EINKLLQDVRQS 

6180 

A 

21 

376 

GVbTVGlGRGSRLEL»WAPPEKYTPPnPP<5UPT ljt wrrnrpoTDDUTn 

TTSSPHQVPVQSWFDDMGl^ELLl^ 

GPLDCPAFKRRPFHYGTFPHCAFTI 

6181 

A 

3 

471 

HASAHASAHQVYVLKKPYVDEFLRGMGELFECVLFTASLAI^ 
DLIiDRRGVFRARl»FRE S CVFHQGC YVKDLS RliGRDLRKTL I LDNS PA 

SYIFHPENAEPVQSWFDDMAYTELLNLIPIFEELSGA*DVYTSL/GA 
AAGPLACPASKRRPSQ 

6182 

A 

1 

1953 

IWKLRQS FRRKKDVYVPEASRPHQWQTDEEGVRTGKCS F PVKYLGHV 
EVDE SRGMH I CEDAVIOILKAERKFFKGFFGKTGKKAVKAVLWSADG 
LRWDEKTKDL I VDQT I E KVS FCAPDRNFDRAFS Y ICRDGTTRRW I C 
HC FMAVKDTGERLSHAVGCAFAACLERKQKREKECGVTATFDA5RTT 
FTREGS FRVTTATEQAEREE I MKQMQDAKKAJSTDKI WG S S VAPGNT 
APSPSSPTSPTSDATTSLEMlWPi^IPRPJiAPIEQLARQGSFRGFPA 
LSQKMSPFIO^QLSLRINELPSTMORKTDFPIKNAVPEVEGEAESISS 

LiCSOITNAFSTPEDPFS A PMTTTP'VTT^n/a r>r%0 r>«-p t\ xt/~« «-m^ r» a mn rr 

AJ^AHTAloAPVAMPVRETNPWAHAPDAANKEIAATCSGTEWGQSSGA 
A5PGLFQAGHRRTPSEABRWLEEVSKSVRAQQPQASAAPLQPVLQPP 
PPTAI SQPAS PFQGNAFLTSQPVPVGVVPAIiQPAFVPAQS YPVANGM 
PYPAPNVPVVGITPSQMVANVFGTAGHPQAAHPHQSPSLVRQQTFPH 
YEASSATTSPFFKPPAQHLNGSAAPNGVDDGRLASADRHTEVPTGTC 
PVDPFEAQWAALENKS KQRTNPS PTNP FS SDLQKTFE I EL 

6183 

A 

3 

240 

PPTAI SQPAS PFQGNAFLTSQPVPVGWPALQPAFVPAQSYPVANGM 
P YPAPNVP WGFVCSNCHF I FTLLDMGTVWV 


IOH80 
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6184 

A 

1 

462 

MNNPHAIPRRHAPIEQIiAROGSFRGFPAIiSQKMSPFKRQIiSLRINEL 
PSTMQRKTDFPIKNADSGLVHKYSLPFRTVSQAFVWPLYSYASLYRS 
ILVIKSVAKVLDLDLIVDQTIEKVSFCAPDRNFDRAFSYICRDGTTR 
RW I CHC FMAVKDT 

6185 

B 

1 

546 

MVKPLSLQMVPLHSSLGNRGSLSENKNTGVWESAASCCMAGCGGPAD 
G I CSEPLGHDQDGRAQS LLLWAKLTRKDR FS SQPRS IPS FDFLKAKA 
LAKQRETIWEKVFSPFQAGVLSPPQSATLLSTLEQGAVGQKPPSVPC 
LMGLFVEHLTTWQLTSSVQASKSKREDSPETVIAKQKAQSF* 

6186 

A 

1 

1846. 

MWSVPCLEKVDVLAWTVLCLAVKIKMNKLRQSFRRKKDVYV^ 
HQWQTDEEGVRTGKCSFPVKNWQRLFIQSLLAMRSLSASCCHQPAVY 
S SHGTQAVHAKGCLQAHAHRACCPSCTS PAAAAI FAAATPDGPLLPS 
RPLVNLIEKSQALVRETLTSQASAPNSGRAGQPVTVCLS/GANAAWR 
PALEAPVDTGLWFETQAVSQCLG * E\HSACLERKQKREKECGVTATF 
DAS RTTFTREGS FRVTTATEQAEREE I MKQMQDAKKA\ QTSVN * TAF 
PRGRTSTTSTGRLWLMGRPASLARGMSVWMAAAGLSAVITSWTRPSR 
RTSVCGVGVTARPATPSQAPLTLMTSAEL*RMFVGNTTSMGTGPSRR 
PGPCQQPAPSCITSGVLRGT/SAPER/PPCPGPTSEPLVI/VAPASA 
GATAHGVTAARS VAEVTS PAWC SAPSTMRPTPTTCAS ASHGQLTGVP 
AIF TLARR PS AGLLGWWRTWLCMG PQGC S / S PLGPCTRGLQS \ PPRM 
AGRQGHGHPAQPPVEEAPSPAPCTASRLTGPASRRPWRRLSVPGCLG 
S PLPFRPVTCS AVQPGARS PGESVXiS WALASGSGALLAGVKGVLCS 
IPQRAPWKTGHL 

6187 

A 

1 

1542 

MVILIiPGEILIYNEESPRKFSKHIISTFLhTITIEPSKVKQTGKJCAVKA 
VLWVSADGLRVVDEKTKI)LIVDQTIEKVSFCAPDRNFDRAFSYICRD 
GTTRRW I CHCFMAVKDTGERLS HAVGCAFAACLERKQKREKECGVTA 
TFDASRTTFTREGS FRVTTATEQAEREE IMKQMQDAKKVPEVEGEAE 
SISSLCSQITNAFSTPEDPFSSAPMTKPVTWAPQSPTFQANGTDSA 
FHVLAKPAHTALAPVAM PVRETNPWAHAPDAANKB IAATCSGTEWGQ 
SSGAASPGLFQAGHRRTPSEADRWLEEVSKSVRAQQPQASAAPLQPV 
LQPPPPTAISQPASPFQGNAFLTSQPVPVGVVPAIjQPAFVPAQSYPV 
ANGMPYPA PNVPWG I TPS QMVANVFGTAGHPQ AAHPHQS PS LVRQQ 
TFPHYEAS S ATTS PFFKPPAQHLNGSAAFNGVDDGRIiAS ADRHTEVP 
TGTCPVDPFEAQWAALENKSKQRTNPSPTNPFSSDLQKTFEIEL 

6188 

C 

539 

1039 

MPYPAPNVPWGITPSQMVANXFGTAGHPQAAHPHQSPSLVRQQTFP 
HYEASSATTSPFFKPPAQHLNGSAAFNGVDDGRUVSADRHTEVPTGT 
CPVDPFEAQWAALENKS KQRTNPS PTNAFLHVTYRRRLKLNFKQS LW 
LC I LS I PDREQG VAVKGAKQTLCPD * 

6189 

A 

V 

303 

2235 

VKIKMNXLRQSFRRKKDVYVPEASRPHQWQTOEEGVRTGKCSFPVKY 
LGHVEVDE SRGMH I CEDAVKRLKATGKXAVXAVLWVS ADGLRVVDEK 
TKDLIVDQTIEKVSFCAPDRNFDRAFSYICRDGTTRRWICHCFMAVK 
DTGERLSHAVGCAFAACLERKQKREKECGVTATFDASRTTFTREGSF 
RVTTATEQAEREE IMKQMQDAKKAETDKI WG S S VAPGNTAPS PS S P 
TSPTSDATTSLEMNNPHAIPRRHAPIEQLARQGSFRGFPALSQKMSP 
FKRQLSLRINELPSTMQRKTDFPIKNAVPEVEGEAESISSLCSQITN 
AFSTPEDPFSSAPMTKPVTWAPQSPTFQANGTDSAFHVIAKPAHTA 
LAPVAMPVRETNPWAHAPDAANKEIAATCSGTEWGQSSGAASPGLFQ 
AGHRRTPSEADRWLEEVSKSVRAQQPQASAAPLQPVLQPPPPTAISQ 
PAS PFQGNAFLTSQP VPVG WPALQPAFVPAQ S Y PV\ ANGMPYPAPN 
VPWGITPSQMVANVFGTAGHPQAAHPHQSPSLVRQQTFPHYEASSA 
TTSPFFKPPAQHLNGSAAFNGVDDGRLASADRHTEVPTGTCPVDPFE 
AQWAALENKSKQRTNPSPTNPFSSDLQKTFEIEL 

6190 

A 

795 

2728 

GLSNFFRGRVPVQPLtfQFKILIYIHRLEYWWASNPQTPHLGQAVKIK 
MNKLRQS FRRKKDVYVPEASRPHQWQTDEECARTGKCS F P ANDLGHA 
EFFESRGMHICEDAVKRLKATGKKAVKAVLWVSADGLRVVDEKTKDL 
I VDQTI EKVS FCAPDRNFDR\ AFS Y \ I CRDGTTAS HGFCR\ CFMAVK 
/DTQGERVEHMQ*ICA\FAALFRRARQKREKECGVTATFDASRTTFT 
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REGS FRVTTATEQAEREE IMKQMQDAKKAETDKI WGS SVAPGNTAP 
S PS S PTS PTSDATTS LEMNNPHAI PRRHAP I EQLARQG S FRGFPALS 
QKMSPFKRQLSLRINELPSTMQRKTDFPIKNAVPEVEGEAESISSLC 
SQITNAFSTPEDPFSSAPMTKPVTVVAPQSPTFQGTEWGQSSGAASP 
GLFQAGHRRTPSEADRWLEEVSKSVRAQQPQASAAPLQPVLQPPPPT 
AISQPASPFQGNAFLTSQPVPVGWPALQPAFVPAQSYPVANGMPYP 
APNVP WG I TP S QMVANVFGTAGHPQAAH PHQS P S LVRQQTFPHYEA 
SSATTSPFFKPPAQHLNGSAAFNGVDDGRLASADRHTEVPTGTCPVD 
PFEAQWAALENKSKQRTNPSPTNPFSSDLQKTFEIEL 

6191 

A 

1 

1825 

MNKLRQSFRRKKDVYVPEASRPHQWQTDEESVRTGKCSFPVKYLGHV 
EVDE SRGMH I CEDAVKRLKAQRKFFKG F FGKTGKKAVKAVLWVS ADG 
LR WDEKTKDLI VDQTI E KVS FCAPDRNFDRAFS YI CRDGTTRRW I C 
HCFMAVKDTGERLSHAVGCAFAACLERKQKREKECGVTATFDASRTT 
FTRERS FRVTTATEQAEREE I MKQMQDAKKAETDKIVVGSSVAPGNT 
APSPSS PTS PTSDATTSLEMNWPHAI PRRHAP I EQLARQGS FRGFPA 
LSQKMSPFKRQLS\LRINELP/ SPTMQRKTDFPIKNAVPEVEGEAES 
ISSLCSQITNALSP\PEDPF\SSAPMTKPVT\VLAPQSPTFQGNE\W 
GHSSG\AASPGLFQAGHRRTPSEADRWL\EEVSKSVRAQQPQAPAAP 
LQPVLQPPPPTAISQPASPIQGNAFLTSQPVPVGWPAHMQPAFVP\ 
AQSYPVANGMPYPAPNVPWGITPSQMVANVFGTAGHPQAAHPHQSP 
SLVRQQTFPHYEASSATTSPFFKPPAQHLNGSAAFNGVDDGRIiASAD 
RHTEVPTGTCPVV\PFEAQWAALENKSKQRTNPSPTNPFSSDLQKTF 
EIEL 

6192 

A 

164 

282 

LMKYIYSKKESNHLKMFIilTKIKSILKALSPFVIRTLPN 

6193 

A 

193 

274 

HCSGIQRFIHCLLIVKMHRALWKHSLG 

6194 

A 

3 

247 

FFFFFNRGFIAALFKITKIWKQPRCPSVSEWDKLWY/YTIEHNSALK 
RNGLSCPQKMRKLKSILLSKRRQSEKATYYITATM 

6195 

A 

70 

1130 

ASRCVAPGRG A/ PAS PHD VDLCL VS PCE FEHRKAVPMAP APAS PGS S 
NDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDP/DCP/SAPGAA 
DSDEDTEGFGVPRHDPLPDPLKVPPPLPDPS S ICMVDPEMLPPKTAR 
QTENVS RTRKPLARPNS RAAAPKATP VAAAKTKGLAGGDRAS RPLS A 
RSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKVP\V 
YliDI*AY\LHSGS S AHLVDEEFFQRVRALC YV I SGQDQRKEEGMRAGP 
DAATGQQSS I WDRDLQVRVTPGPL WTL I PTFDSVAMHTWYAETHAR 
HQALGITVLGSNSMVSMQDDAFPACKVEF 

6196 

A 

57 

361 

GLTm)TQIODVQPPK\QQPMI\YICGECH/TPENEIKSR\DPI\RCRE 
CGY\RIMYKKRTKRIjGRFLMARWNAGEFQREWPSPYTLGFGS1iPHFW 
IVCYSFSIFALQ 

6197 

A 

2432 

3682 

NSLILLFFFFFRQRSLTLECSGVISAHHNLHLPSSSNS / P/ASAS *V 
AGITGMHRHAWPICIFLVETGFRHVGQAGLELLTSGDPPAPTSQSM* 
ATTSGLILLLSKILFPFHSTKVFK*SCPS*KILKEENCSVINEWFNS 
ES S YTS KEKNNLVPNAC * E I \ TM * VATNCDFT/ S LL * RS ENYPKLFH 
NNETTSQKIHLKIKISRPATSGQW\ILFSLVWWRARKG*GILMIHNG 
ILY*TIC*IKLHRLP*GMDYPNQPD/MKSGWDKRMSHTL*F*GKEIV 
FDFQNQQNKLSTLYLSVQE**HEEFFFRDLKYNKPGSTIKSLVSF/H 
SliTLFFFFFFEARSHSASQAGVQWGNIGSLQPAPPGLKRSSYIiSLLS 
SWDYRRVPPHPANFCIFCR/GWGFGVSPCCPGSSGTPVLKWRAHLSL 
PNC 

6198 

A 

1 

632 

KYRQDLTVPPGYCTGEQGAGGRPGETEPWLRPPALLPSXDFLLCASS 
v VKI {jurr JjF x PPRACPQGQAASSATTSRPCPEVKEAGWGSP 
GHLARHALSHVPPPARWLVLRERWHFCGTC 

MGYQLL/PGEWPWPGPWRILTPSLRVTCRPLLSPQLPFEBLESQRG/ 
LCPQLKSYLRQKIiQALGLRWGPEGG 

6199 

A 

2 

313 

EVRVYDLTQYEHCPDDVLVLGTDGLWDVTTDCEVAATVDRVLSAYEP 
NDHS R YTALAQALVLGARGTP \ RDRGWRLPNNKLGSGDD I S VFVNPP 
GRARQLLLRG 
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6200 

A 

4 0.1 

532 

KALISPFSLPLSVVPSSPTMASSTSLPSNCSSSSGIFSFSPAN | 

6201 

*\ 

■X. 

oo4 

MAGVPVGSTQPPPCQPQKILKERLSSPKTEGALNEPTIDYGFQRLQK 
VIPRHPGDPERLPKLGEPPWSISSSSNPTRVLAALVGRATCQNWPCL 
GEIHPSGHYTEPLDRARAGRSPPQEALSVTQSRVAVLIALNVKVERE 
FKGRHHS PS TFHGAIiVLEDANECYQKEVGVSHS VI FGKDWMTY I KPL 
LCQAHIVILQTGDSLFPTPMGKTWSFLYPGFLPWCSGRIGSRVGLEN 
ECKVSLSGSSSQPMGEPEGRWSSPEVGPU\SPGSPLIAWAKLRFVPP 

vULfLtk'V j 

6202 

A 

9 

811 

YIVTTLRQLGTRGTPGRFIYTALNEPTIDYGFQRliQKVIPRHPGDPE 
RLPKEVILKRAADLVEALYGMPHNNQE I ILKRAADIAEALYS VPRNH 
NQLPALANTSVHAGMMGVNSFSGQLAVNVSEASQAT\DQGFTRNSSS 
Vb ^ Wb x VFb 1 1 PQQTNYNS VTTSMNGYGSAAMSNLGGS PTFLNGSAA 
NSPYAIVPSSPTMASSTSLPSNCSSSSGIFSFSPANMVSAVKQKSAF 
AP WRPQTS PPPTCTSTNGNSLQAI SGM I VPPM 

6203 

A 

299 

826 

LEGHCHNLSSLQISASLGFKRFS\CLTSPSSW\DSRNTPPCPANFC/ j 
EFLVENGFHHVGQAGLKLLTSSHPPSMASQSAGITGMSHQAWPTFPV 
LINFNSLKMCRMCPMATHLGQCPSREYKIHSCSCAELEGQAHHEPDV 
GVTIS IRKCPVPNALWWKELVPGAQGHGPHLEPEGLG 

6204 

A 

1161 

2627 

RSDATNWGLSHLCFWFWFLFLRLSFLFSLPRLEC^TVTISAHCNLFCI, 
LGS SNS PVS ASLVAGI TGTRHHAQL I /LL YFFFFLVKTRFHHVDQAD 
LELLTSGDPPTSASQSAGITGVSHCAR/LSHLCFLPPCLLFY/CYFI 
YVMLLCY*WSLFLVAQVGVQWHDLSSLQPLPPGFK*FSCLSLLSSWE 
YRRSPPHPANFFFFFFLRR/RSRFVAQVGVQWCNLDSLQPPPPSFKH 
FSASACRVAGITVACHHAWLIF/ IFLYF* *RQGFTILARI#VLNS *PR 
DPPPSASQSAEITGVSHHAWPILFIYLRQSL/DSVAQAGVQWYNLGS 
LQPPPPGFKQFSCLSFLSSWDYRRVPPCPANFIiCF* * RQDFTMLARL 
VSIS*PHDLPASASQSAEITGVSHRAWPIFCIFF/CFFEMESRSVAQ 
AGVQWRDLGSLQAPPPGFTPFSCLSLPSSWDYRCPPPRPANF\CIFS 
RDGVSPWSLSPDLVICLPRPPKVLGLQA | 

6205 

A 

24 

617 

DPVSTKNTQISHMWWCTPWPATQEAEAREFLENFLSPSTGFECSSL 

FFFFFFFFLSQGSHSVTQAGVQWCDHGSLQPQPP\GRKPSSPLCLLS 

S WDNH/ RPANFF YKTTVKS PEE / FLQLAWGCFACARVYS AFQTEKP 

QHSSSVGLHRSSFEISAFSHFPI*VFVPWPSPPLQYEEPTLSFPPSS 
SIPHCP$TSPPND 

6206 

A 

239 

1789 

ATAPGQI FFFFFFLRWSLTLS PRLECSDAIMAHCNI1KLL1GSRNPPTS J 
AS*V\AKDYSTHQQAWLISYF\FFIETGFRHVAKAGLKLLTSSDPPA 
PASQSAGIKRVRHHVWPSFSYA/NKHLFSRGLRALSYKLK*QMQLFFN 
FLS ILSHDFVAVPS CHD\ HLRS PETCH * * CS VTHLEGLQVHF * L* DV 
IPVLIQGVSLNSSSSSFLEESQ*DNIiILSREWGSHSNLSY\SKVIM* 
HLTATS S CDTELASAS V * VLS * PL * LRLSR * S F * TFLWXiLFRDR I IQ 
1 1 FCLFLKRWSL/DS VTQAGVQW/ RNHGSLQPQTPGLGSDTESFFF* 
AKVSIjCHPGIjEVQWCNHGSLQASNS*AQVILPP*LPN*LRLQACTIM 1 
PG*FCF*ILVEM\GLAMMPSLVSNS*AQ/CNHPV r LASQSSGITGMRH 
CARLGRES*FDPLH*TVCPICNFFFFLRRSFALVAQAGVQWRDLSSL 

QPLPLGFR*FSCLSLLSSWDYGHPSIiRSANF/CVFIiVEMGVSPCWTG 
WS*TPDLiT? I 

6207 

A 

2 

208 

LEHALRPSSKMLLKCSQGGTPQEPPNPRIKRASSLNFLNKSVEDPTQ 

6208 

A 

1 

419 

SAFS VAVTKKLPTGAACPI KF\ TKI LMNEGGHYNAS SGKFR\ CGVLG 
IYYFHL\HITLGNK\DLAIGLGAQRPSTASGPLMPNTGNHDVGSGST 
ILALKQGDEVWLQIFYSEQNGLFYDPYWTDSLFTGFLIYADQDDPNE 

6209 

A 

756 

966 

VGRRILLRKNFGQAQWLMPIIPTLWEAEVGGLLEQSSSLGKFFKKLA 
GHGRPQNKTLSLKKLKEKNLSK 

6210 

A 

24 

365 

PTEY/ENL/FPCIKEAF/WEEWVKETLAVL/WPAKQYPFVTPIEER 
ILMEEGKAFPP5RSTAKQKLDGNPVSPTPV\IGLSPTPNKEEKQPDP 


X O O S: »+ i& «+ 3: ,, O 2! 2 6. 0 ifi: 






WPLEPTGHLDGARDTAGPSWLHHRF 

6211 

A 

32 

188 

KIFFFFFFLPPQFKILFKRPTGLRVPGKGRRALGGSPKGAKKAWPTR 
RLPS FNG YLEG S SNGAS TE S PGNFVPGKTLQP PGVELGRGRWTAQGQ 
PSRPVALGA* kl 

6212 

A 

122 

339 

EKGFWFC^QGGKOTiPGGNSLEPSASGLKEIFGLNLLNNWE*RGGPKT 
PGNFWIWKKGGV* PLWPGWG*NPGI* 

6213 

C 

56 

118 

MKALPSSQSCGPTSVNSPFH* 

6214 

C 

345 

410 

M KACHLGTPQFTG VS CG VQKA F 

6215 

C 

7 

438 

MQVETGRGLGVGRREGRQGLAKPRQLPESLLHSARHPSKLGRPKVAS 
LHEGTAQIiTILWANFSQLSFPLDLFLIKRPLXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSSKLXRD 
DHA 

6216 

A 

2 

80 

FLHFYFQTYSKQNNHLNCQKNSNFTQ 

6217 

A 

3743 

4059 

GCAVFFFFFFFLRWSLLPRLE/CSGTVSAHCNLHLLGSSSSRAAASQ 
VAGTTGMHHHARL I FVFLVEMGFHHVSNS * PQA IHPPQRPKVLG I IG 
RDNRHLGLMEWSF 

6218 

A 

3 

4 02 

DTMVHLTPVEKGAVTALWGKVNVDEVGGEAIiGRLLVGYPWTQRFFES 
FGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHIJDNLKGTFATLSEL 
HWDKLHVD P * NFRLLG \NGLD * MLAHKFGKKI S PNQGKLG 

6219 

C 

295 

618 

MARKCSVAFSGGPGCTWDNLKGTFAHTEVSLHCDKAAPWIPEELSGS 
WGWVAWSCVIiGPFTFGKRISTPPVAGIiPNQENWLAWCWIiNALGPTSN 
HLSLAFLAGPISN* 

6220 

A 

2 

338 

IKKAL*HE*LREKKKSPIRTSQSSKAP*NLPRSSSSSESSNNYQLL* 
AHKLQKVKEMGKLLETPNLPLLSQKVAEPI^QPITSSKIETLIKKPY 
HPEKSYGHDEVTAKFSRA 

6221 

A 

1 

2349 

AAAMDEQAGPGVFFSNNHPGAGGAKGLG PLAEAAAAGDGAAAAGAAR 
AQYSLPGILHFLQHEWARFEVERAQWEVERAELQAQIAFLQGERKGQ 
Em.KKDLVRRIKMLEYAIiKQERAKYHKLKYGTEIiNQGDMKPPSYDSD 
EGNETEVQPQQNSQLMWKQGRQLLRQYLQEVGYTDTILDVKSKRVRA 
LLGFSIDVTDREDDKNQDSWNGTEAEVKETAMIAKSELTDSASVLD 
NFKFLESAAADFSDEDEDDDVDGREKSVIDTSTIVRKKALPDSGEDR 
DTKEALKEFDFLVTSEEGDNESRSAGDGTDWEKEDQCLMPEAWNVDQ 
GVITKIiKEQYKKERKGKKGVKRPNRSKLQDMIiANLRDVDELPSLQPS 
VGS PSRPS S SRLPEHE INRADEVEALTFPPS SGKS F I MGADEALESB 
LGPGEIAGLTVANEADSLTYDIANNKDALRKTWNPKFTIiRSHFDGIR 
ALAFHPI EPVL ITASEDHTLKMWNLQKTAPAKKS TSLDVEP I YTFRA 
HKGPVLCVVMSSNGEQCYSGGTDGLIQGVJOT^PNIDPYDSYDPSVL 
RGPLLGHTDAV>/GLAYSAAHQRLLSCSADGTLRLWNTTEVAPALSVF 
ND I KELG I PAS VDLVS SDPSHMVAS FSKGYTS I FNMETQQR I LTLES 
NVDTTANSSCQINRVISHPTLSISITAHEDRHIKFYDNNTGKLIHSM 
VMLEAVTSLAVDPNGLYLMSGSHDCSIRLWNLESKTCIQEFTAHRK 
KFEES IHDVAFHPSKCYI ASAGADAIiAKVFV 

6222 

A 

2 

1802 

YLQEVGYTDTI LpVKSKRVRALLGFS SDVTDREDDKNQDSWNGTEA 
EVKETAMIAKSELTDSASVLDNFKFLESAAADFSDEDEDDDVI)GREK 
SVIDTSTIVRKKALPDSGEDRDTKEALKEFDFLVTSEEGDNESRSAG 
DGTDV^KEDQCIjMPEAWNVDQGVITKLKEQYKKERKGKKGVKRPNRS 
KLQDMLANIiRD VDELPSLQPS VGS PSRPS S SRLPEHE I NRADEVEAL 
TFPPSSGKSFIMGADEALESELGLGELAGLTVANEADSL.TYDIANNK 
DALRKTWNPKFTLRSHFDGIRALAFHPIEPVIilTASEDHTLKMWNIiQ 
KTAPAKKGPVLCWMSSNGEQCYSGGTDGLIQGWNTTNPNIDPYDSY 
DPSVLRGPLI/SHTDAVWGl^YSAAHQRLLSCSADGTLRliWNTTEVAP 
ALSVFNDTKEIX3IPASVDLVSSDPSHMVASFSKGYTSIFNMETQQRI 
LTLESNVDTTANSSCQ INRVI SHPTLPI S I TAHEDRH I KFYDNNTGK 
LIHSMVAinjEAVTSIiAVDPNGIiYLMSGSHDCSIRLWNLESKTCIQEF 
TAHRKKFEES I HDVAFHPS KC Y I AS AG ADALAKVF V 

6223 

A 

5175 

5804 

IANNKDALRKTWNPKFTLRSHFDGIRALAFHPIEPVLITASEDHTLK 
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MWNLQKTAPAKK* EYS TLTLFE YFK* HLS I LLYS I 

6224 

A 

3 

115 

HETDYHKQPWQGEISGIVSTESQ*WGLGGGEVKVKWE 
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HEKVTKQEPTRRSARLSAKPAPPKPEPKPRKTSAKKEPGAKISRGAK 
GKKEEKQEAGKEGTAPSENGETKAEEIHISRSTVNVSTSRGTPPSTL 
HRKLNIt 

6306 

A 

1 

122 

TRPE FPGRTTP S QKG * GKKEEKPET * KERTG PS EKGETKA 

6307 

A 

2 

392 

CSCSAEAASSGGAAVPVRCFTFCFTDI VIMPKGKSPENTEGKDGS \ K 
VTKQEPTRRSARLSAKPAPPKPEPKPRKTSAK\KEPGAKISRGAKGK 
KEEKQEAGKEGTA\ PS \ EHGETKA\ EE A\ QKTES VDNEGE 

6308 

A 

3 

426 

AGNSSIQGHGGWTLRHSILSFLSLLPFSRGPQCMGAVQPLVLKQPRD 
ASLRGCLLGYDSTATGLY*ILCNLRRHPGVMPSCGARVLRDHNPEDI 
EWLCSCSLLEVLSGSLSSTPGLTLADQAGHSGGIMPLEGSLRSDCC 

6309 

A 

3 

374 

HAGLGAHPLHHPVIQQEGFLPAPRGFGYRSEVD*IRIPAESTGQNSQ 
CQLRK* KDDS YFHCG YFCGCVCTCRGRLQS STSHQCQAAL * LLLPVC 
IiTMLRCISSLIYT*NLKTVHSVRLNFIYN 

6310 

A 

1 

230 

S GRPFLFFFQG KERVYF I PVWLF YTWVQ * LRKNWRWQKTMEPPR 
SWLMQLGHTEEGPGYPLLGETETKLFRTA 

6311 

A 

2 

100 

RRPFFFFFFASC I *NL I S PNVQLSNFTC I LRN ~~ 

6312 

A 

2 

192 

PDFFFIQYRCKWAQRLWVIAFLRLMTSITSLVQLLQTNADFSLPLA 
FPLLiAIGP I PS I SHFP 

6313 

A 

1 

451 

NTF^n^bQVIKFTKDL^VFRSIlPIEDQISLLKGAAVEICHIVLNTTFC 
LQTQNFLCGPLRYT I EDGARAP YLTDR PGVTQRDE I DQLQEEMALTL 
QSYIKGQQRRPRDRFLYAKLLGLLAELRSINEAYGYQIQHIQGLSAM 
MPLLQEICS 

6314 

A 

231 

1335 

EKQESVTATPTRDVMASREDELRNCWCGDQATGYHFNALTCEGCKG 
F FRRTVS KS IG PTC PF AG S CE VS KTQRRHCPACRLQKCLDAGMRKDM 
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ILSAEAIiALRRAKQAQRRAQQTPVQLSKEQEELIRTLLGAHTRHMGT 
MFEQFVQFRPPAHLFIHHQPLPTLAPVLPLVTHFADINTFMVLQVIK 
FTKDL P VFR S LP I EDQ I S LLKGAAVE I CH I VLNTTFCLQTQNFLCGP 
LRYTIEDGARVGSQVELLELLFHFHDTLRKLQLQEPEYVIiLAAMALF 
S PAP YLTDRPGVTQRDE IDQLQEEMALTLQS Y I KGQQRRPRDR FLYA 
KLLGLLAELRSINEAYGYQIQHIQGLSAMMPLLQEICS 

6315 

A 

636 

852 

FRGSGPRDKTPNCLISLLFSWQPHYFLQRFLYAKLLGLLAELRSINE 
AYGYY IQH I QGLS AMMLLL * E I CS 

6316 

A 

1 

690 

MCPSEMKLPEERSGSNICCSAIFAVLQPLLVIPRQTWSGVDLQQTPT 
DLQLRVLFVRRKTNKQKGHPHQNPI CMSQS S KTKARQAS IQIQE I QR 
MPQRYSSSRATPRHIIVRFTKVRMKEKMLRAAREKGRVTHKGKPIRIi 
TVDLS AETLQARREWGP I FNI LKEKNFQPRI S YPAKLS F I SEGE I KS 
FTNKQMLRDFVTTRPALQELLKEALNMERNNQYQPLQKHAKIj i 

6317 

A 

3 

302 

G FRGC KPQALAAS TWC * LC S CTE VKN * GLGTS TW I S ED VRKHLDVQA 

EVCCKGGALVDNLCQGSTEGKCGVRAPTQSPQWGTAWWSYQKRATIL 
QIPEW 

6318 

A 

1919 

2245 

PMQRSP*RT* WS *KPWQBNYVMRAQDSVADS INWKKG* QRLKIK*MK 
* SKKRSLEKKE * KEMNKASKKYGTM * KDQ I YVGLVYLKVTGRMEPSW 
KTLCRILSRRTSPT 

6319 

C 

395 

1096 

MAPDDTSGLLEFVGVPLQTIiFAWVSPAEAAEQRILIiNSKCCCLIGPIi 
EASSQRGTRPCLLLKKFTFHKAIADIDSESCDGAGQSK^KTIWKGFT 
ILDAIKNICDSREEVRILTLTGVGKKFILILMNEFEGFNTSVEGVTA 
HVVEIARELELQGHSEDWELLQSHDKTWMNEELLLLDKQPKWFLEM 
ESTLGEDAVNTVEMTTKNSEDYINLVDKAVAGFEKISSSFERSSTV 

6320 

A 

304 

475 

PRQTGSGVDLRQTPTDLQLRV\*LLEGKITNTRKGHLHQNPICTSPSS 
KTKGKDQIYV 

6321 

A 

224 

1767 

HCLSNVSAGLHPFTSYCLLHEKKFKAAVLSLGEGGCDGADGADGGDG 
SGMAKSGPLAAGLLEFAGGPLQTIiFAWSAAEAAEQPILLNSKCCCIi 
IVPLEASSWRGTRPCEVIVSEAGASIYSVSPEANKEMPGLDPNLRSA 
VSIARRVQDPLAELVKIEPKHIGVGMYQHDVSQTLLKATLDSWEEC 
VS FVG VD INI CSEVLLRH I AGLNANRAKN 1 1 EWREKNGPFINREQLK 
KVKGLG PKS FQQCCWAF I RNQPGL YPNVLQS AJJLKLQAKFQGVAVTS 
SADVEVTNEKQGKKKSKTAVNVLLKPNPLDQTCIHPESYDIAMRFLS 
S IGGTLYEVGKPEMQQKINS FLEKEGMEKIAERLQTTVHTLQVHHRW 
VFSQPEGFDFRTDF**TWIFKEEGIVCLEDLQIGTVLTGKVENATLF 
GIFVDIGVGKSGLIPIRNVTEAKLSKTKKRRSLGLGPGERVEVQVLN 
IDIPRSRITLDLIQGVMSIPRRPDADFIFSFPQIDKDKSWCKL 

6322 

A 

2 

64 

AFRFS S SVR IHERSHTGEKP YECKQCGKAF I S SSHFRLHERTHMGEK 

V*EYKQHSEAFGCSSSF*IHKRVYTGQKLCEYKHCSKALD*SSFF*A 
RKNSHW*ETIi 

6323 

A 

168 

501 

VCSSAKQ*QTCFSLVSPTM*NLIKEFLNNIKFPQQDGTSQALVSGEN 
VDYRKIVGVSGVGSYLGSKIFRLGAVAHTCNPSTLGGRGGQITRSGD 
GDHPGQQGETPSLLKI 


A 

165 

548 

SPSQHSLVPSTPLPGSVLWGIGGCLPCAAEGSRSQQGDGWLRRSSVD 
LAVGDGEGKEVPRPLAIjGCDLARDGSRTLSPLGPCPKPLiGQRWES*E 
LRKVPGLAPDLS VE S APRS WEAPECQQMLQLS P 


A 

18 

225 

TLGLK*SSHLSLLSSWDYRCMPPHIAN*NFCLFVF*RWGLAVLPRLL 
LNSWPSAILPPWPLKVLELQV 

6326 

A 

724 

900 

MTSFSDKFQSFQNRGSTIVTIAHFWSRKYLSDKECS*NACVVFIHSI 
* KW ANTKTF I PD 

6327 

A 

15 

185 

VPFENPWVDPRVRPRVRFFSTGTFRCCAH* IS* LLQSGS SQASLCRY 

NEESQKHAALSLSKEHNPTQYEERMRIQKAGGNVR*PRQACWIMRR 
VKNMQP 

6328 

A 

199 

544 

PARLGLEEGRYL I TGRGWVW WLTP V I PAF WEAEVARSQG QKI ET I LA 
NMVKPHL Y * KYKKKLAGGGGG YL * S * LLGRLRQENG VNLGGGACSEP 
RSCHCTPAWATEQDSVSKNKK 
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6329 

A 

837 

1096 

v w v wv»uJ5 J- ixfuiij yx vu v JbPNPFLiIj WVGTHLAVLSGDKDDCWVGALAT 
GVE ILLDVGTGQWAGNVGKAAGCGVRGI WGGRCQERRGQ 

6330 

A 

454 

526 


6331 

A 

258 

488 

NTVSTRKYKKLARRSGGCL * SQLLGRLRQENDVNPGGGACSEPRSRH 

6332 

A 

3001 

3080 

YE I IKFHKSEYLKLIKVIRKCKQIGL 

6333 

A 

6181 

62 92 

Jjia x 1 b t M&NPGPSMLLS PEHENDRREEAGCGGSRL 1 

6334 

A 

3 

222 

M^vi/\CKi^uyuut>PC:QPFCFPHLAAPLP*VPNPQLETPFP*ARTQR 
IPLPSG*PGTPSPKPTNHIPLTDPL 

6335 

A 

2 

227 

i? t«t iiibKbijiUA(JMQWPNL»GSL»QPPRPGFKQFSCLSLPSRWDHRHMP 
SGLANFCIFSRDGVSPY*SGWS*TPDLR 

6336 

A 

2 

218 

1 WKJ^VAQA(aVQWRNLSSIiQPLPPGFK*SSCLSLPSSWEYRLLPPRP 
ANFYIFSRDRVSPCWPGWSQTPDLR 

6337 

A 

31 

165 

u^rti v iyA^My*bNHGSIjQLYLPSLRRFSYLNIjPSSWDQRSNKEG 

6338 

A 

122 

376 

KMGPLSSSSLPEAECPDPRAATLGMNKMLHSHWVSSIAPVDPGMG*K 
AASGPDFETGQAEARLSPPPSLQESSVWPGDFPHQIC 

6339 

A 

3 

268 

GRVAVRKEKGSFWLWFNWLSDLNWPLPPLPGELA*TFLKSSLRRPA 
PAPRPPPPQTPSPTPRSLAVSSEGWRLLLSMTLESDLDPDL 1 

6340 

A 

2 

218 

IYTHIH*IDPMFSL*GHMEGEVWGLATHPYLPICATVSDDKTLRIWD 
LS PSHCMLAVRKLKKGKRLSHLNVD 

6341 

A 

1 

5748 

MPLKLQNCLAiHPERVLVATGQVGKJEPYICIWDSYTVQTISVIjKDVH 

THGIACIJIF1DLDGQCSVKAVIDNARRNECGYIPVKLYLQKHSFSLIC 

LPFYSFLDFLDINQRLVSVGLDSKNAVCWDWKRGKMLSMAPGHTDR 

FWSLCGNAliTPKRGVFGKTGDLQTILCIiACARDELTYSGAljNGDIYV 

WKGINLIRTIQGAHAAGIFSMNACEEGFATGGRDGCIRLWDLTFKPI 

TVI DLRETDQG YKGLS VRS VCWRGDH I LVGTQDSE I FE I WQERNKP 

FLIMQGHCEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHAIiIARC 

NMEE P I RCAAVN ADG I HLALGMKDG S FTVLR VRDMTE WHI KDRKEA 

IHELKYSPDGTYLAVGCNDSSVDIYGVAQRYKKVGECLGSLSFITHL 

DWSSDSRYLQTODGNGKRLFYRMPGGKEVTSTEEIKGVHV^ASWTCVS 

GLEVNG IWPKYSD IND INS VDGNY IGQVL VTADDYG 1 1 KLFRYPCLR 

KESLADSHSDESDSDLSDVPELDSEIEQETQliTYRRQCSAGCTSRAP 

ASAQLLVKPQEAFTHGRRQRGSSYRGYDCRSNLFYTQIGEIVYHVAA 

VGVIYNRQQNTQRFYLGHDDDILCLTIHPLKDYVATGQVGRDPSIHI 

WDTETIKPLSILKGHHQYGVSAVDFSADGKRLA\OLA*MIAILLCSG 

TGRKERNFQ * QEEVKI RFLL * R * TPMCL IN* LQLELNT * NFGVKQGE 

D * LEEKAT * AHWG KMTQ * CVQCMDGLKRW/ HFSGTSTGDVC I WRD I F 

LVKTVKAHDGPVFSMHAIiEKGFVTGGKDGIVALWDDSFERCLKTYAI 

KRAALAPGSKGLLLEDNPSIRAISLGHGHILVGTKNGBIIiEVDKSGP 

ITLLVQGHMEGEWGIATHPYLPICATVSDDKTLRIWDLSPSHCMLA 

VRKLKKVQKGNVGLEPPYRVPTGAPPSGVMRRGQPSFRPQNGRTTDS 

LHHVPGKATGTECQPVKAARKGAILCKTTGAELPKTMGTHLLHQCGL 

DLRFAGGPLQTFFAWVPPTPRG I TSGGCKTVKRVTEL I P VETPLYKV 

SSNPCWGGLTQSGGTGSRTHIjMKHSDYPLAEGVHYTGGNPTCPDCPD 

SSELGAQSSEAADSQSDDSRPFPQELGSLRQSVAKWPLRICAAIiCLG 

LKELVAWAHGGDLLIHQLHRSMEKVWFPGQDSTITHCLSWLGVGAPL 

VLCSSQKAATLVLRESEQ\^GDSDLFGNGVAATRTFSELVAJCKLARG 

CSPQVmSESSHLVKEKLARGSRPEI^GGRCCCFSPDGKAIaAVGLNDG 

SFLMANADTLEDLVSFHHRKDMISDIRFSPGIWPVIGEVTDVTASCIi 

TS D KMVLiATGDD LG FVKLi FR Y PTKO K Ffi Y pkt? w a u c Tin /TMrrowrv 

DDSMLVTLGGTDMSLMVWTNEMEGYREKRPCDSEESDIDSEEDGGYD 
SDVTRENEISYTIRALSTNIRPMLGIKPHLQQKEPSIDERPPVSRAP 
PQPEKLQTNNVGKKKRP I EDLVLEL I FG YRGRDCRNNVHYLNDGDDI 
IYHTASVGILHNVATGSQSFYQEHNDDILCLTVNQHPKFINIVATGQ 
VATAPS IHI WDAMNKQTLS I LRCYHS KGVCS VS FS ATGKLLLS VGLD 
PEHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTQFVSVGVKHV 
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KFWTLAGRALLS KKGLLS TLEDARMQTMLAI AFGAJJNIiTFTGTl SGD . 
VCVWKDHILCRIVARAHNGPVFAMYTTLRDGLIVTGGKERPSKEGGA 

VKTjW'nOF'LPPPR AFPT J FTnOATi^r , VPQVCTOtT^nTf TT.VnTPMAFT TF17 

GEKNAACN I LVNGHVDGP I WGLATHPSRDFFLS AAEDGTVRLWD IAD 
KKMLNKVNLGHAARTVCYS PEGDMVAI GMKNGEF 1 1 Ll» 

6342 

'A 

2 

252 

PPE*LGPNGCQGPPTPLPKFW*ILSKKNGGSPPIIGPGPILKPP*PL 
RWIPPCPLPSPKCLRLPGHKPSAPQPEN 

6343 

A 

289 

395 

SRICPHRDTRKEYNSYTIAFLINLLKFAELDCIVYFRLNDN*M*IFR 
CSTIFYNPALAFLHILRKPPTSFF*GRFLLSEMPC*MLLLII*IPNY 
LIQIIKRSI 

6344 

A 

262 

389 

PQVTFS I AS RYATI LLL * FF * PCLVFYSLPNQFYVQGKLANA 

6345 

A 

1 

103 

QSFTYCFESF*NLSVFLYCLIYRITN*VYYLYVE 

6346 

A 

7 

156 

LFSFHVSGTKAHTCNPSTI J TG*GRIT*GLEFKTSLGNTVRSHRYRKK 
KIA 

6347 

A 

94 

469 

NCTYVIKVYNLIRHMYTPITATKI ISTRFKI YWY^T.TFQ* TCTHTFF 
TRSLTP I AQAGVWCHDHS SLQPLPPGLR * S S YLSLSS S WD YRHTPPC 
S ANLCVCVCRDRRVS PCY PGWLLN I LNVC I I 

6348 

A 

1 

226 

FFEMESHSVTQAGVQWPDIiGSLQPLPPGFK*FFCLSFPSSWDYRHTP 
PHPAYFC I FSRNGVS PCWPGWPRTPDPR 

6349 

A 

3 

308 

KDEKGEEDGKEDKNGNEKGEDAKElSTDDGKEKGDKKEGKDVX\nCEDEK 
E R FDG ICED EGGNF, E F AC? K"F KPTjT . K" F P FFfi TCP pn F T TfFnTWKTCP' P PO Q 
IV*NCPM 

6350 

A 

1 

239 

MDGGTEKPFACPVL * CTKRYKNVNG I KYHAKNGHRTQIRVRKPFKCR 
GG KSYKTAQGLRHHT I NFHPPVSAE I IRKMQQ 


7V 
n 


O _7 U 

Ki<rlrl/\x V o LtU F LiC/Kf f r rr r b Lit b b r r r r t LKKbrAljVAyAUVnWKD 
LGSPQPPPPRFK*FSCLSLPSSW1WRHAPPRPANFVFLVETGLQLPT 
SGDLPASASQSVGITGVSHCAWPGNWLLKETRCGSSVGRQITGPITP 
D A WVD PG T PDP YT »0 A 

6352 

A 

3251 

3700 

KWPYWLKEESSKLLHHQETS*HSNLGN*ASGLRTLPYRAJSQGKE*GM 
W J- o vj rv i a j\x->/AJjr o x x x i^^r^r\.xr.rt v o oxxx^xAvj±^xj^oo^yiv.i^-Mx ai^xj 
RGEDVLADWSMGGYGWPRRGPRIPTLVGRGAHSMATTTTDPHRVLPA 
YHQCSPCI 

6353 

A 

3 

347 

DAWEQTQDTELVETRPAGDGTFQKWAAWVPSGEEQRYTCHVQHEGL 
PQPL I LRWEQS PQPTI P I VG I VAGLVVLGAVVTGAVVAAVMWRKKS S 
DRNRGSYSQAAV*DSFLVWD 

6354 

j\ 

3 

174 

WA/PQfiFFOP YTr*HvnHPf!T.P]f 'pt.tpt.pwtm? vnpcvcni\ a ccncanno 
DVSLTACKV 

6355 

3 

10 

720 

YVDDTQFVRFDSDAAS QRMEPRAPWI EQEGPE YWDGETRKVKAHSQT 
DRENL R I ALiR Y YNOS EAG S HTT.OMM FGCDVG Q DR P PT .P ("3 YWft Y A YDR 
KDYIALKEDLRSWTAADI*1^ 

LRRYLENGKETLORTD PP KTHMTHHP I SDHEATLPCW AT.GFYPCGDH 
T* 

6356 

A 

3 

223 

GEKPHKCGDCGKAFNSPSHIjIRHQRIHTGQKSYKCHQCGKVFSLRSL 
LAEHQKI PFGDNCFKCNEYSKPS S IN 

6357 

A 

1 

179 

RSQVIPJiQRIPTGQKSYKOTKC/GOGFSLRSLLAEHQKIHF*DN*CT 
GNEYSKPSSINCH 

6358 

A 

38 

294 

LHLMQNRRLHTGDKLHI<YDDC\GKAFTSHSHLIRHQRIHTGQKSYKC 
HQCGKVFSIiRSLLAEYQKIYF*DNSSECNEYSKPSSIN 

6359 

A 

2 

244 

DTS KG I ADVPEW FKG SRLK^ AENLL.RHKENDR VAL Y I AREGKEE I VK 
VTFEELRQEVAXFAAJ^I<^G^m^SVQR^lNIiYL 

6360 

A 

339 

1079 

I LS LPLPLLAGGTD 1 1 SCFMGHNFSLPVYKGE I QARNLGMAVEAWNE 
E*KAGWGESGElxACTKPIPCQPTHFWNDENGNKYRKAYFSKFPSIWA 
HGD Y * R INPKTGG I VMLGRSDGTLNPNGVRFGSSE I YNI VES FQEVE 
DSLCVPQYNIOrREERVILFLKMASGHAFQPDLVI^ 
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HVP S L I liETKG I PYTLNGKKVE VAVKQ 1 1 AG KAVE0GGAFSNPETLD 
LYRDIPELQGF 

6361 

A 

1 

1047 

MVSISWPRDLPASASQSAGITGLIGAi.VLSVGIYAEVER/HEI*NP* 
KCL PGS SHHPHP PGRRHVHGLLHWCAG VP P * QP PEI/LASLRL S RGYG 
LVLSWLEPRYEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETLKT 
RG I FETKFLS QI ESDRLALLQVRA I LQQLGLNS TGDDS ILVKTVCGV 
VSRRAAQLCGAGMAAVVDKIRENRGLDRLNVTVGVDGTLYKLHPHFS 
RIMHQTVKJELbPKCNVor IjljoJbDtj^GKGAAljXTAVGVRIj^ 

6362 

A 

2 

55 

FFLRRSFTLVAQAGVQWRDLGSLQPPPLRFR*FSCLSLPSSWDYRCIj 
PPLLANFLHFQQRRSFTILARLLVSNS*PRDLPASASQSAGITGMSH 
HACPVI*FFFETEFHSCCPGWSAVA 

6363 

A 

167 

293 

vvkkllgillnqmktkmnfgdfltvmtqkmsekdtkeeilkssqf™ 

HQRA*KLiFDDDETGK.ISGr AjL» 

6364 

A 

3 

226 

FLR*SLAVNQAGVQGRHLRSLQPLPPRFKLFSLLYL,PSSWDYSHAPL 
FPANFC I FSRNDVS PCWPGRSQTPDLK 

6365 

A 

1 

648 

MVD VGKWP I FTLLS PQE IAS IRKACVFGTS AS EALYVTDNDEVFVFG 
IiNYSNCIiGTGDNQSTLVPKKLEGLCGKKIKSLSYGSGPHVLLSTEGK 
E YGKADAR WVY FDPT I VS VE I LTVALDG S LALFL I YAI VKEKYYRQV 
RSWKCESLALLYQVSNSLGIKRQSWREQQVCGLSSPSGGLTGSAALG 
GGPFCGHPWTPKSLAKHLYPVTRQFDS 

6366 

A 

26 

612 

VGAGGRGWRFAAAVRRAAGGGLRPGPAPGPRAGGGGPRGAHLALLRR 
AG ALRAGKE YGKADAR WVY FDPTIVSVE I LAVALDVSLALFLI YAIV 
KEKYYRHFLQITLCVCELYGCWMTFLPEWAHPEAPOTiQQPATGWLYW 
LGFTLFFF*RCVGFIiIPRNWLLWAVHGPRTSRKWHQKGNQFQWKEVF 
SGTFQNP 

6367 

A 

3 

292 

TPWLRDFHDPIVEVEVSVFPGEPLLFSAPGLPEPRAGHSVPSGLSW 
AFPSFSSDNVPGSLEHLLVLHFPGGDVSLLRPLCLPI*ETFMlPliSR 
LRCPCSRVSPSFSQHQACLSLGLVILSPVASVWLLFHRFLLTMSQVP 
WS I FLFS IS PGATCPSCADCAFLSRSEELRSHLCHR 

6368 

A 

237 

1202 

LS PR VQVG PGAAASAACAG EGTMWL PWALLLLWVPG C FAL S KCRTVA 
GPVGGSLSVQCPYEKEHRTLNKYWCRPPQIFLCDKIVETKGSAGKRN 
GRVS IRDS PANLS FTVTLENLTEEDAGTYWCG VDTPWLRDFHDPWE 
VEV S VF PASTSMTPAS I TAAKTS T I TTAF PPVS S TTL FAVGATHSAS 
I QEETEE VVNS QLPkLLS bliALLLLLLVG ASLiLAWRMFQKW I KAGDH 
SELSQNPKQAATQSELHYANLELLMWPLQEKPAPPREVEVEYSTVAS 
PREELHYASWFDSNTNRIAAQRPREEEPDSDYSVIRKT 

6369 

A 

2 

58 

PRVRCF I YTGKAPNLDKMADDLLAAADKVR * EG S KPRQNG 

6370 

A 

1 

1136 

MSSGPVAESWCYTQWHFLFNCFLFFYQQIKWKFSYMWTINNFSFC 
REEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEESKDYLSLYLLLV 
SCPKSEVRAKFKFSILNAKGEETKAMESQR\AYRFVQGKDWGFKKFI 
RRDFLLDEANGLLPDDKLTLFCEVS WQDSVNI SGQNTMNMVKVPEC 
RLADELGGLSENSRFTDCCLCVAGQEFQAHKAI LAARS PVFS AMFEH 
EMEESKKNRVEINDVEPEVFKEMMCFIYTGKAPNLDKMA\NDLLAAA 
DKYAL\ERLKVMCEDALCSNLSVENAAEILIIjADLHSADQLKTQAVD 
FINYHASDVLETSGWKSMWSHPHLVDEAYRSLASAQCPFLGPPRKR 
LKQS 

6371 

A 

21 

358 

GCRHS AS CAAPRALGPARAPQR * PKAGRVFPERTS * T * G * PAPGRRH 
PGPRIGGRRRRSGCSRVRRSSVRDC/PSSRSKASNPPLSLSSST\GC 
DGARrPCSV*KWt>Kf FbAKK 

6372 

A 

1 

1458 

SLPRNLPVTI I SQDFGDAS PRTNGQEADDSSTSEESPEDSKYFLPYH 
PPQRRMNLKG I QLQRAKS P I SLKRTSDFQAKGHEEEGTDASPS SCGS 
LPITNSFTKMPPRSRSSIMSITAEPPGNDSIVRRYKEDAPHRSTVEE 
DNDSGGFDALDLDELAAGETVAQSPPGVPCQPPLFQGSPSLCQLRLP 
TDETKDEWSSLMGKHQRYQVLKRDDSHERYSFGPSSIHSSSSSHQSB 
GLDAYDLEQVNLMFRKFSLERCRNRGPQRKGLAKAMQHGRGKEVGPR 
YPAVPIHDTVEPDPSGLRSPLWGTCGAQGPRCSTRR*MATASPPSSP 


o q e h-6i "+ iii 1 !; . o h 2 6. o e 






CWGATREGASCTRSSLALWPRKPASVRATSCCC*KAASEARGRVSRW 
THAPKRKPTGPSRGAAAPSRCT\ KGQPRS PAENP WACR I *CGQHLQS 

PELRSSTALQVWNSRERKFIPAGVPEAGEGHGGRPDHIGGLVLHPAE 
PEHLQPAGRLHHVPEV 


TV 


226 

CCFEILKLMKILISEMHLMLFFL*QLRKSA* IPYVPELFLS 

6374 

A 

112 

303 

SGVMVSFNRVLLIILEWNPFLTYSPRRLGLEIFFLFYFNVFFFRKTP 
LLLCT I KYFI SLNC IDF 


n 
/\ 

•a 


AWKFFFYF I LLFFFLEKHHFY YVQ * NI S LA* KKKK 

6376 

A 

22 

175 

THDHVS VY F FFLGLKS TE I QCD I KMQ I FL F I * VENHLP VS I Y I LKYF 
LWIL 

Oj / / 

n 

H. 

1 c o c 

1878 

TMSDTTFKALR*QLPVTRTKIDWNKILSYKIGKEMQNA 

6378 

A 

394 

626 

SDSVNPQLRVFKILN*SLGILiCTHIQIKLFFFFFSALTFALGTLLVC 
VSIYLYGLPRQDTTSIQQGETASKERVIGV 


A 

2 

101 

KTLVPL I LP I I TTLANPCKKD * YP YYVKI S I AC 

6380 

A 

1 

297 

SRFKRFFCLSLPSSWDYRHAPTHALANFVFLIETGFLHVGQAGLKLL 
T*GDLPALASQSAGITGGNHCTRQVQCFFRCWGYTRESGQKLDPARV 
MFENK 

6381 

A 

1 

3 94 

TGTHFHF YLLLVLLFLEMGPQS ITQADVQGHNHS SLQPQTPGLKQS S 
RLSL*NSWDYRHTPPHPANFFRIL*RQGSHYVAQG*SQTPGLKQFSR 
LGFPKRWGYRHESPHMARDRYSYCPHFPDEETGDPER 

6382 

A 

3 

160 

QL*WSHLLKMLQRYSSCNV**KRRGLIRTYGLDMCSQSLCQYAKDIG 
FIKLD 

6383 

A 

1 

1240 

MRRRKRAPAGILFLRPRRRGWPRGSGYRALPLGDFDRFQQSSFGFLG 
SQKGCLS PERGGVGTGADVPQS WPSCLCHGLI S FLGFLLLLVTFPI S 
GWFALKPEEAERVKAQVQALGLAEAQPMAWQSVPGAHPVPVYAFSI 
KGPSYGEDVSNTTTAQKRKCSQTQCPRKVIKMESEEGKEARLARSSP 
EQPRPSTSKAVSPPHLDGPPSPRSPVIGSEVFLPNSNHVASGAGEAB 
ERWVISSSEDSDAENSPAEQAATPDAEPHSEPPDHQERPAVHRGIR 
YLLYRAQRAIRLRHALRLHPQLHRAPIRTWSPHWQASTPAITGPLN 
HP ANAQEH P AQLQRG ISP PHR I RG AVR S R S R S LRG S SHLS QWLNN F F 
ALPFSSMASQLDMSSVGGPHAPQLIPDTHLPTLIEGR 

6384 

A 

3 

3129 

KLGSMEPAPARSPRPQQDPARPQEPTMPPPETPSEGRQPSPSPSPTE 
RAPASEEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPI 
CQAPWPLGADTPALDNVFFESLQRRLSVYRQIVDAQAVCTRCKESAD 
FWCFECEQLLCAKCFEAHQWFLKHEARPLAELRNQSWEFLDGTRKT 
NN I FCSNPNHRTPTLTS I YCRGCS KPLCCS CALLDS SHS ELKCD I S A 
E I QQRQEELD AMTQALQEQD S AFG AVHAQMHAAVGQLGRARAETE E L 
IRERVRQWAHVRAQERELLEAVDARYQRDYEEMASRLGRLDAVLQR 
IRTGSALVQRMKCYASDQEVLDMHGFLRQALCRLRQEEPQSLQAAVR 
TDGFDEFKVRLQDLSSCITQGKDAAVSKKASPEAASTPRDPIDVDLA 
EEAERVKAQVQALGLAEAQPMAWQS VPGAHPVPVYAFS I KGPSYGE 
DVSNTTTAQKRKCSQTQCPRKVI KMESEEGKEARLARSS PEQPRPST 
SKAVSPPHLDGPPSPRSPVIGSEVFLPNSNHVASGAGEAEERVWIS 
SSEDSDAENS/ CEWPRSSAQDSCLPPFQVPGGTATAGDSQTCLAPGE 
FSSEALSPKLC * GQS PNDS CMPGPPQPSHYTRARS S S EMLI ACVQS T 
VSFLECPRPLSGASLCIFCPLSPQ*GGLDKNPFLGLLQPVGASEGVS 
GHRAAI *GPNK*GLTPSM*KDI * IHSTVKQVASWVVTLIiSLEVYY* R 
tiDYHLSRGKGDLNRVKGWTGWPLRSLPALSLRFCIGKCMEPMETAEP 
QSS PAHS S PAHS SPVQSLLRAQGASSLPCGTYHPPAWPPHQPAEQAA 
i ruAiLftib t, V fVtiiu JbRPAVHRG I R YLLYRAQRA I RLRHALRLHPQLH 
RAPIRTWS PHWQASTfPAITGPLNHPANAQEHPAQLQRG ISPPHRIR 
GAVRSRSRSLRGSSHLSQWLNNFFALPFSSMASQLDMSSWGAGEGR 

AQTLGAWPPGDSVRGSMEASQVQVPLEASPITFPPPCAPERPPISP 
VPGARQAGL 

6385 

A 

2 

416 

ACCLLYRGDMVPKDVNAAIVTIKTKRRIQFVD*CPTGFMDGINYQPS 
TWPVENLDKVQRAVCMLSNTTAIAEAWARLDHKFDLMYAKRAFVHW 
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YMGEAMEEGEFSDAREDMAALDKDYEEVDADS VQR * GDEDGQE Y 

6386 

B 

95 

412 

TNR S I QFGDWCPTG FKVG INYQPPTAVPAGDIiANVQRAVCMLSNTTA 
I AEAWARLDHKFDLMYAKRAFVHW YVGEGMEEGDFS KAREDMAALEK 
DYEEVCVDSVE* 

6387 

A 

2 

614 

G ALNVDLTE FQTNLVP Y PR I H FPLAT YAP V I S AE KAYHEQL S VAE I T 
NACFEPANQMVKCDPRHGKYMACCLLYRGDWPKDVNAAIAAIKTKR 
SIQFVDWCPTGFKVGINYQPP1VVPGGDLAKVQRAVCMLSNTTAIAE 
AWARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAAIiKKDYE 
EVGADSADGEDEGEEY 

6388 

A 

151 

1605 

RG KTLRGLWRFGK I PTHS RENMREC I S I HVGQAGVQ I GNACWEL YCL 

EHG I Q PDGQM PS DKT IGGGDDS FNTFFS ETG AG KHVPRAVFVDLE PT 

VIDEVRTGTYRQLFHPEQL I TGKEDAANNYARGH YT I GKE I IDLVLD * 

RIRKLADQCTRLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKL 

EFSIYPAPQVSTAWEPYNSILTTHTTLEHSDCAFMVDNEAIYDICR 

RNLDIERPTYTNI^NRLIGQIVSSITASLRFDGALNVDLTEFQTNLVP 

YPRIHFPLATYAP\VISAEK\AYHDQLSVAEITNACFEPANQMVKCD 

PRHGKK\MACCLLYRGDVVPKECHMLAILPPS \ KTKRTYPCFVDWVR 

PSGFHRCGISTIQP\PTLIiVPG\GDTGARYQRSCVHC*ANTTAI\AE 

AWARIiDHKFDLMYAKRAFV\HWY\VGEGMEEGEFSEAREDMAALEKD 

YEEVGVD\ SVEGEGEEEGEEY 

6389 

A 

1 

251 

CLKEISFLNGSLTPGAAVKSWLTATSACSFKGFSCLSLSSSWNYRCA 
PPRPTNFCIFSKDRVSLFLLLFSLFLPI>LARLVIiNY*PQVIHLPWTP 
KVLGL*AAVKSWIiTATSACSFKGFSCLSLSSSWNYRCAPPRPTNFCI 
FSKDRVSLFLLLFSLFLPLLARLVLNY 

6390 

A 

3 

446 

RLFFFFFVFLVEMGFCHVGQAGLKLiVTSCDPPTSASQSAGIAGVSHR 
A* PNTSLK* KLTDS KS PG I CFFVCFE I ES 

6391 

A 

2 

176 

CKFAHGNAEI^EWEERRDALGMKLNKARKDHLIGPNDNDFGKYRFLF 
KDLiN*YAGFYV 

6392 

A 

186 

374 

G WQP YWGLWVLRAAGRM PAHQAI*PTYARS FLS VS VGAFVKTWLPF I L 
LLGIILTVSLVFNLR 

6393 

A 

872 

1122 

LRPWKPLCSSVERLWLPAVELLPPPPAGVLPQPSAQYGERRTFGLTC 
QGGPGTRAGPSMGCTGYT*RVSISQVASGPFHHWPI» 

6394 

A 

3 

415 

ERLWLPSVELLPPPPAGVLPQPSAQYGERRTFGIiTCQGGPGTRAGPS 
MGCTGYT*RVSISQVASGPSPLASRMVQRGWIiGPFVRLLPAAALGDM 
CAHLAS S S PCS PLLTVPRAS QVPPKELDP I S PCS LRPRKVSGP 

6395 

A 

3 

349 

VSWEFLCSKRFAQWSHLAQHQLLHTGEKPFPCLECGRCFRQRWSLA 
VHKCS PKAPNCS PRS AIGGSSQRGNAH * KGKDCLRS FHFMEGPRKGK 
EEPQVIQGRVRTKPGSPAAQS 

6396 

A 

2 

352 

EGAAPSSGGPLRDFWGFPRNTNKPCSGTG*SLQRICWMASLAPPGPA 
PRGQAPHTHRPLGRPSRGAQVQPAAQAGQATSKQSPGPGFWSQLPQD 
LTTASVSPSVNGASKACLECKS 

6397 

A 

2 

361 

GGEDPPALCRKPTAQTQTGKKHSGTAAVRSRERHKEVRAGHSLSALD 
SPPSLLPGILSPVHQCDVTRVDPIPSIiLTAIjLWIiPSAPK^RSEPHKP 
GVGDPVWSAPPTSGEDLILLATSAH 

6398 

A 

11 

967 

GS WSGLGARPPGGWNYTGS KKLS PGGEAL * MKRS KEL I TKNHSQEET 
S ILRCWKCRKCI ASSGCFMEYLENQVIKDKDDSVDAQNI CHVWHMNV 
EALPEWISCLIQKAQWTVGKLNCPFCGARLGGFNFVSTPKCSCGQLA 
AVHLS KSRTD YQPTQAGRLMRPS VKYLSHP I VQSGCDKE ALIiTGGGS 
ENRNHRLLNMARNNNDPGRLTR^ 

KYQLFVPQLVTGRCATRAFHRKSHS LDLN I S EKLTLLPTjLYE IHS KT 
TAYSRLNETQPIDLSGLPLQSSKNSYSFQNPSSFGS 

6399 

A 

105 

465 

SRAYCSLTLDFPGLKQSSHLSPQVAGPTGACYHAWPI F * ILAQMRSH 
YVARAGLELLS SRD PPAS ASQS AGVTG I SHHAWPVLLLYLAGTQSPE 
HFIYDGLDNVSSCLHANFSSLSYST 

6400 

A 

1 

463 

HKSDGSTVSVPMMAQTNKFNYTEFTTPDGHYYDILELPYHGDTLSMF 
IAAPYEKEVPLSAI,TNILSAQLISHWKGNMTRI,PRLLVIjPKFSLETE 
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VDIiRKPLENLGMTDMFRQFQADFTSLSDQEPLHVAQALQKVKIEVNB 
SGTVA\SSSTAVI 

6401 

A 

3 

569 

NQASLSFLHLRPYSGHLIRGTPGRGWGKVGAASQRGPRLEQPLAAIC 
RFSLETEVDLRKPIiENLGMTDMFRQFQ7UDFTSLSDQEPLHGPQAL*K 
VKIEVNESGTVASSSTGESGSGEAPRVSPPSPFRITGPQTRKGPRSP 
LGTEQLCLCSAITHSPPVSQPGKRAWDHGTLFPRLMAPKFPDCEKAS 

6402 

A 

3 

194 

F F F FTCTQM F I TALF I T APNWKQPRC P S I GQ * TNL FNEAVLS REE S I 
KTKTRDRCYHMDRI SK 

6403 

A 

2 

223 

EMESCSITQAGVQW*DLSSLQPPPPGFQRVSCLSFPSSWDYRYLPPR 
LANFCI FSRDGVSPC* PGWSRTPDLR 

6404 

A 

2 

247 

RLFFFF/CFFEMESHSVTQAWQWRDIiGSLRPPPPGFKQFSCLSLLS 
S WD YRHTPPHPANFC I FSRDG VS PCWPG * SRTPDLR 

6405 

C 

65 

211 

MSPNWRWGPPCLATPQGVTPSDTQPCQALGLGPHPRNVQPPIIISGT 
G* 

6406 

A 

1 

112 

GGRGCNQS * SRHCTPAWVTKQGPI SD ID I L I RPPLT I 

6407 

A 

259 

380 

GCVQQG VNVQHCAGQQHL * ENGGPGS SDCGWHRGHPLCRL | 

6408 

C 

5 

160 

MGFEGGPKILNWKFKKVXQSXSCLXKGQXESIFLNTXKVIRAGDTXK 
SVLG* 

6409 

A 

117 

207 

KKTM FRQ KL FY KRKQLQ KG PR P * GAG FKV 

6410 

C 

155 

394 

MI»KW I YR I F VNLFL VFVRF FNCS FLCAEC I S LPGQECGGAEVS S FS R 
TFPQHLLSLYLMXXXXXXXXFLQALGLCYPIQ* 

6411 

A 

68 

547 

LDNRCERCLTEGGWLTSHTAPRGKGTWGQKKEPASWGPWGCQW*RGA 
♦WPTGCKQGRGGSPSPQPLGSGGRHEPGQAKATKASPSQGEGDKGRP 
RQQGP$WRSKSSPRPISDPPTPGGDKGPSPSTSLRL*TRLEPDLVSL 
G I SAG PTRWTLL S AVQLQ 

6412 

A 

11 

321 

KNFYLFF*DGVSLVAQAVQWRDLGSPQPPPPGLKLLSCLSLPSSWDY 
GHVPTMPG*FCTFSRDGVSPCWSSWSRNSRISDASACLGLPKCWEFT 
RRDNSAPRS 

6413 

A 

17 

237 

YVAQAGLNL3^SSQSSCLSLPKHWDYECEPLCLAFFFFLL*RQGIaAM 
LPR I L PP I KS I S I RWRNGQKKMHQVL 

6414 

A 

806 

1681 

AMGIHCITTTRTERSMPTTSAAVl.PGPMRQWQLQLPPCIITQSJIPCP 
SCHKSRIQPWPVTSPPPLSIPTIDTCCRPQELLPQLLLQQQPLLLLL 
QLPLHITGGIGAPCVALQPQSPLLERATVTGMRVSCPKLQQPRGILC 
TTWPGMSGSSMP I GRVTATNAKARCRHAADTALPAASASCVGTTTAA 
CWAAAWASATTGPSCACCFMPPASCSTSLCCWALHCRPCCEPTRPST 
WLPSSCFPGSC/ CRDHHCRLLGRCVGFGNYRPFLCLIi/ SSCRR/PSC 
STSLCCWALHCRPCCEPTRPSTWLPSSCFPGSCCSQPHCPTTLWSTL 
TPLSHGRLWLQSQTLSSPWALWKSRVSGSLH 

6415 

B 

61 

516 

MTTLKKNXiADDDAVS CLVLGTENKELLVLDPEAFT I LAKMSLPS VPV 
FLEVSGQFDVEFRLAAACRNGNIYILRRDSKHPKYCIEIiSAQPVGLI 
RVHKVL WG STQD S LHG FTHKGKKLWTVQMPAA I LTMNLLEQHSRGL 
QAVMAGLANGE* 

6416 

A 

360 

861 

RCGS FLQLLLPMPKPADTALPAASASC^GTTTAACWAAAWASATTGP 
SCACCFMPPASCSTSLCCWALHCRPCCEPTRPSTWLPSSCFPGSCCS 
QAECLWHSLPWPS * RTRAWRVRCCAGLGCSSMGCCCCGARPHGSGLG 
ASTPMTWVPATTCRQPWAPARRLGP 

6417 

A 

2 

263 

NPRDGPTTAACWAAAWASATTGPSCACCFMPPJ^CSTSLCCWALHCR 
PCCEPTRPSTWLPSSCFPGSCCSQP*SPLSSCKPLKPPVF 

6418 

A 

18 

211 

NEPRASTLTS S KAGQRQAPWTLFPNS YPL * E YPPANPFQGGPHDCRA 
QTKDMGAQAPRVPEAGP 

6419 

A 

2 

549 

NGFRG AHSQCPPACVLPGQAMSTGGRQLPHPCMNPPS PGDSMDAVI K 
GAAVDLGGTWKGAKPLGLWHCGG * P I ENRAGAGET * S LH I * TPVRS P 
GFLGTSPMGSPGPALKNHGSCARPLIiSPLPYPPKERPLPPPWHLQSE 
DLPPPRGPE ALMPQELGLERC * YPLPQVPRFRGTGAGL VQA 

6420 

A 

368 

623 

LVCVFFFFF*ERESCSVTQAGVQWCDLGSLQPPPPRSKQFSCLVLPS 
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SWDYRHTPPCLANLCLIFKSRNPKYFIFIPFPNCYEWD 

6421 

A 

42 

179 

CYAFNFCIYELVHLKIKDTYCHIFKLNLYRGKPPVKEKQYFSDKN 

6422 

C 

28 

219 

MQILLKSKPKGSTKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXX FRCHD * 

6423 

A 

2 

161 

KNLH*MESYSVAQAGVQWHDLGSVQPLPPGFKRFSCFSLLSSWDSRL 
Y FRCHD 

6424 

A 

363 

750 

REQDPACAWSSS/APNSSWSLHGEAVSHRECLSSPVLPGPDSSKALS 
SAAAALSMSSGRGGGSLVPFSGPVSGVARWIASCVAGESCLA*AGTR 
TRCTAPS EAT * S STRGLSRGAASLAAS VHHGSRMSL 

6425 

A 

890 

1879 

LWGLVTSSCLSFSRRNFTTEIFQEDLMLSDPLNLLSLVMWLCTLWTS 
IiAALSMSSGRGGGSLVPFSGPVSGVARWIASCVAGESCLA*AGTRTR 
CTAP S EAT * S S TRGL SRGAAS LAAS VHPG S RMS L * VAG KPKG PGPTE 
AAPSCRSSSITTCTPLPTLSHSSSTADRALKASPGLSLRGPGSSGGE 
LCARLS PS SSGMDTF PCLR S PTS S WRRFNNSLLALARPRAAS AKPTA 
LSTRW*ITCRSCSEL*TAKPCPVTVMGVLGGGFPPWEPVLPPLVSLI, 
QAQQQHLASLPFPFSMRMVRGAQAPSCRVRLPRARGETPPRLPSGSR 

6426 

A 

8 

363 

AEVQWRNLSSLQPPPPGFK* FLCLSLPRS *DYRLAPPCSANFCMVKT 
E FHHVGQ VGLELLAS SD PP PALAS AS QS AG I TG VSHHAW PRAS LLS L 
VIALKQASFFCSHAKPQVSQHFTN 

6427 

C 

87 

287 

MIFFSFCVNTWCHSPFC^TVTEYHSLIXXXXXXXXXXXXXXXXXXXX 
XXXWEGLFCC I I TQRKVEG * 

642 8 

C 

11 

328 

MHMYGXXXXXXXXXXXXXXXXXXXRISEHILCIFRSlsrYRMIFFSFCV 
NTWC3SPFC7VTVTEYPRLIXXXXXXXXXXXXXXXXXXXXXXXWEGLF 
CC I I TQRKVEG * 

6429 

C 

76 

276 

MIFFSFCraTWCHSPFCATVTEYHRLIXXXXXXXXXXXXXXXXXXXX 
XXXWEGLFCCI ITQRKVEG * 

6430 

C 

7 

324 

MHMYGXXXXXXXXXXXXXXXXXXXRISEHILCIFRSNYRMIFFSFCV 
NTWCHSPFCAPVTEYHRLIXXXXXXXXXXXXXXXXXXXXXXXWEGLF 
CCI ITQRKVEG* 

6431 

A 

1 

336 

FFFFFFRQSLSVAQAGVQWCDLCSL*SPPPTLSTPLASASQVAGTTG 
AHH*TGIIFFVFVETVFCHVAQAGLELLGSSDPPASASQSAGITATV 
PSLL*LFIKQKQQKNRTY 

6432 

A 

3 

1105 

DKDCSRAFLLYGSYSRAPALGGTARTLLGLLNEAKLEFHSVVIjLEPG 

NKHAVTELSKIKKELIEKGHWDDVFLDSTQRPNGGKPIDNPPHPGSS 

NPLRKVIIEETGNLIQTIDVPDSTTAAAPENNPINLADVIAATGTTS 

KKNSSQDDLFPTSDTPRAKVLKIEEVSDTSSLQKEKPLIiIFEILQRIi 

SELIO^FDMAVMFMSETEKXNRYGSEQEFVDDLKNMWFGIiYSRGNEEG 1 

DSSGFEHVFSGEVKKGKVTGFHNWIRFYLEEKEGLVDYYSHIYDGPW 

DSYPDVLAMQFNWDGYYKEVGSAFIGSSPEFEFALYSLCFIARPGKV 

CQLSLGGYPLAVRTYTWDKSTYGNGKKYIATAYIVSST 

6433 

B 

52 

936 

MAYLKIQKYEEAEKDCTQAILLDGSYSKAFARRGTARTFIX3KLNEAK 
QDFETVX»LLEPGNKQAVTELSKIKKELIEKGHWI)DVFLDSTQRQNVV 
KPIDNPPHPGSTKPLKKVI I EETGNL I QTND VPDSTTAAAPENNP IN 
LANVIAATGTTSKKNSSQDDLFPTSDTPRAKVLKIEEVSDTSSLQPQ 
ASLKQDVCQSYSEKMPIEIEQKPAQFATTVLPPIPANSFQLESDFRQ 
LKSS PDMLYQYLKWLLSHCQDNTRAQRLEKAGWQL AFLR YYYHLLDV 
QKI TVCMWQKAF * 

6434 

A 

1 

309 

LPPIPADSFQLESDFRQIiKRSPDMLYHD*KQIEPSLYPKXiFQKNLDP 
DVFNQ I VKI LHDF YH * GKKS PLL I FE I LQRLSELKRFDMAVMFMSET 
EKKIARGII 

6435 

A 

1 

936 

MAALYACTKCHQRFPFEALSQGQQLCKECRIAHPWKCTYCRTEYQQ 
ERLECNGTISAHCNLHLPGSSDSPASSSRVAGITGIKTNTICKKCAQ 
NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQ 
CAFDRKDDRKKVDGKXLCWLCTLSYKRVLQKTKEQRKHIiSSSSRAGH 
QEKEQYSRLSGGGHYNS FSPDLALDS PGTDHFVI I AQLKEEVATLKK 
MLHQKDQMILEKEKKITELKADFQYQESQMRAIQviNQMEKTHKEVTEQ 
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LQAKNRELLKQAAALSKSKKSEKSGAITSP 

6436 

A 

1 

1023 

MAALYACTKCHQRFPFEAJjSOGOOLiCKECPTAHPWK'PTYr'P Tpvnn 
ERLEOTGTISAHCl^HLPGSSDSPASSSRVAGITGIKTNTICKKCAQ 
NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQ 
CAFDRKDDRKKVDGKLLOVLCTLS YKRVLQKTKEQRKHLS S S SRAGH 
QEKEQYSRLSGGGHYNSQKTLSTSSIQNEIPKKKSKFESITTNGDSF 
S PDLALDS PGTDH FV 1 1 AQLKE E VATLKKMLHQKDQM I LEKE KKI TE 

LKADFQYQESQMRAKMNQMEKTHKEVTEQLQAKNRELLKQAAALSKS 
KKSEKSGAITSP 

6437 

A 

602 

745 

TPASLKI PVE * * NTLLAKMVS IS* PRDLPAS AS QS AG I TGVSHRARC 

6438 

A 

1447 

1931 

RKVFKWLSAPLDVSNCPSLCKD*W*RV*RFMPETFPAR*GAENPTLP 
CCHYS IN/HFTVFS FLF/ 1 FFFWR* SLTLVPQAGVQWRDLGS LQPLP 

PGFKRFSCLSLPSSWDYRCLPPCLANFLYF**RQGFTVLARMVSIS* 
PHDL PAS AS QS AVFSYL I FFLPY 

6439 

A 

1 

74 

FWRPVRRQSEYKQVlWGQDSTFPI*CLQTlTroYRFRVCA*RRCLDTS 
QELSGAFSPSAGFVLQRSEVMLTGDMGSLHDSKMNS^1MPTDEQIAAI 
IALGFATIiAILFGFI*QSEYKQVNKGQDSTFPI 

6440 

A 

2 

174 

FFFIFFFLLENLF*PSNFGSYKPVHCCVFFCCCCFFFLAVFFFVCFC 
FFFSFSFLFF 

6441 

A 

1 

271 

PPRFFPEFFYSGPPKPPFFKTPVFLGVKPGVSSSPPYQKKPTNFGPK 
MGAL * R I PLFETP I WVFP I KGFHKKKP PVLN * PPTRKPPPDKI 

6442 

A 

3 

128 

FFNKKPPPFGRGWPFLVPATLRF*G*KIFPPPPPPEVRFPG 

6443 

A 

1 

154 

LVWS * EASKI PGGAEAAHP PTTF 

6444 

A 

2 

230 

FFFESGSRSVTQAGVQ*HSLGSLQPLPPRFK*FSCLSLPSSWDYTHV 
PTRPANFCIFSGDGVSSCWPGWSRTPNLR 

6445 

A 

70 

550 

NQVIWESFQPAGRHGHPFAPSLAPSASSFLSSFCPSGKAEETLESI* 
SSGKPGVGEGATSLFPS IS *MFWEGPENGTQEPAGMP I PHTPPSCSN 

PAIHSPGGTGAPGRREEALHCPP*HIPSHPPSRPPWLAPLSLRFPFT 
PMAVNDQDRSHVCFPLGLI 

6446 

A 

77 

485 

NLVYVSSICFTWTFHFERKKTYVSHIVLYKVASPIMKCLYCHSGHTQ 
WQYLNGEAARIiGQGLLC*lWIIFICLLSIiSLSLFFSFFGPMLKM*N 
FVFK*CFYNFIRKPKFVFFVTLDFNSLYIGTSNVITVLNKFH 

6447 

A 

3 

349 

KVEDPS PVWRNS I FWKFVI TFKVYFFCLFVFLETESLS WQAGVQWH 

SLGSLQPLPPGFKQFSCLSLLSS*DYRCTPPRPADFCIFGRDGVSPY 
WSG * S * TPDLMLCPAWLPKVL 

6448 

A 

1 

189 

FFF*IGPHSVAQAGVRWCDLGSCSLNLPGSSDPPASASQVAGTTGVH 
YYTQLIFKFFIEMRAP 

6449 

A 

2 

304 

LDTPLGAAPAAFFLPSLNQCLIYNPMPVRPLKAWRLKTGILTEPRVK 

GEA1^AGMMPTGVSTASSPSYPIAPLGPRLPLQDCMHQALS*APLSD 
ALCIAL 

6450 

A 

21 

359 

SLLQPS PAFFS PAFCGLL* PS WQPS PAFS SLL* PS PAFSSLL * PSL 

LWPSFSGLL*PSLTFSNLL*SSLLPSTEETPAGQRQELQLFPVGGAA 
DAYFPFKCFKI LS SECLK 

6451 

A 

2 

170 

LSLLSS*DYR*APSQLANFSIFL*RWGLAVLPRLVSDSWPQMILLPP 
VPDVLELKA 

6452 

A 

3 

112 

EKKMADFHKEEMDDQDKDKAKGRKETL I LERENKSM 

6453 

A 

4041 

4215 

GRVVI^WWQGFFLGFFFFFl^FLTLSCLKLKHSSDVWGETSIiLKWV 
LVLAFWGGGP 

C A C A 

A 

3 

309 

PPPPK*IL*RAPNPPLSSSSFSNPPTWGPGPQPPPSKFSKFARFPFL 

PPPFPFKKEPPSSSLFFPTKEGTVI *KNPPFSGFQSPDS IK*GRVEP 
PGGTKRG 

6455 

A 

2 

439 

RTS AREKRDIoNRLKKLGDS S KNSDC * S VS SNTDADATQEKNNATSNR 

KSSVGVKKNSKSRTLTRQSMSRIPASSNSTSSKLTHI1WSRVPKKLK 

KPAKPLLSKIKLRNHCKRLEHKNASRKLEMGNLVLKEPKVVLYNKLP 
HKKR 
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6456 

A 

1 

146 

IX3EWGLCKIDREGKPRKWGCSCVVVKDYGKESQAKDVIEBYFKCK 
K 

6457 

A 

59 

431 

PTAMAEEG I AAGG VMDVNTALQEVLKTAL IHDGLARG I REAAKALDK 
YVYQSQYCGPLQPEQNCHPREEGMEFMVLAQKF*MASVIiPET*EK\R 
QAH LCVLASNCDE PM YVKL VEALCAEHO INTj I KVDDNKKX.GKWVf;T.T . 
*ILWVLATGTKLPPKGRRHGVHGVSAKVLNGIRPTGNLGKGAKPIFV 
CLHPTVMSLCMSSWWRPFVLNTKST 

6458 

A 

242 

431 

LHFHCNLNFRRQAHLCVLASNCDEPMYVKIiVEALCAEHQ INLI KVRL 
LRIVLGLMFSDACIWG 

6459 

A 

65 

473 

P T AMVE E G I AAGG VMD VNTALQE V \ L KT AL I HD G LAR G I REAAKALD 
KRQAHLCVLAFNCDEPMYVKLVEALCAEHQINLI KVDDNKKLGEWVG 
LCKI DREGKPRKWGCS CVWKDYGKESQAKDVI EE YFKCKK 

6460 

A 

32 

485 

HLVLASLLACLASIFLPSVGLGQCDLVGSQKTIPRNEGLPRKQTFLS 
DLLLPSCLWFPYSPPRLAIKTTIPLHQKPAIjKPKNITLTFLFLSV*N 
WAIGKLFGPLSGL*GQEGPPFQRESCAI*GRKECCAKRPKRILTDRP 
RWVPHSVYQH 

6461 

A 

3 

336 

FFFFDIGSCFFA*AGVQWHDHGSLQPQPPGLK*HDIPTTAS*VAGTT 
GAHHHA*LIF*LSVKMGSSPLVAPGWSMKSWTQANPSLLQPPQSAWD 
YRCEPPCLVRNTFEYV 

6462 

A 

2 

280 

VLGIWRVPGGSGKMEKRESEKQILPRPIGCIWERGKSPGEAPSALPM 
PWSNPPPGTCTPSCPTYRCTPLPR*TDAPSRHTHIiAPRNGETIFA 

6463 

A 

1 

171 

MWGWKEKIKINTFVYCWWEFKLVQNFWSTTWQHIL»KF*MSFDAIjLGI 
YPKYILTNVT 

6464 

A 

21 

404 

LSTGTSSAMVSRTEGNIDDSLIGGNASAEGPEGEGTESTVITGVDIV 
MNHHLQETSFTKEAYKKYIKDYMKSIKGKLEEQRPERVKPFMTGAAE 
QIKHILANFKNYQFF IGENMNPDGMVALLDYRE 

6465 

A 

42 

266 

KLILLKIQYFNLLMKCCFRIKGKLEEQRPERVKPFMTGAAEQIKHIL 
ANFKNYQVNTLS I WI KGLYNFNCKSKN 

6466 

A 

60 

656 

RRRRL P S VAI M 1 1 YRDLi T SHDFMF T Y KT P T? T anf2T.r , T ."RVT7f3TnvT\7Q 
RTEGNIDDSLIGGNASAEGPEGEGTESTVITGVDIVMN\HDLQETCF 
TKEAYKKYI KDYMKS I KGKLEEQRPERVKP F\ MTGAAEQ IKHI LANF 
KNYQFFIGENMNPDGMVALLDYREDGVTPYM I FFKDGLEMEKCVSTR 
KWVKINNVKKTF 

6467 

A 

176 

552 

MRS S KQMKPNFPRCS S I TVLS VRGMVIiFLTL AC PR FKM ^RTDPR FC5 
NPQVT*GSTICSIFRLPSILSMRTTK/DQGMFIWQFQGMRFHF*SSB 
THLVTFLPPGIM*E*FQSLKPEPFSFLLSCYCH 

6468 

A 

98 

509 

RATEKWKGRKNHELSWKCFPLKIFFRRSLAMVLCTSKLRPSVLSMRT 
TKARECLSGSSNA*GFTSSRLRHSLSRFC\ARNHVGMIPVA*T*ALF 
LSSAPSLPKMPALPGWLCGLDKPSSFSGDFLEPKGFFFALAPP 

6469 

A 

204 

401 

WHEFTGAENHCKTJjPKENF*WCLCKSHPSEPKNAAYNRTLCDLGISK 
SEVSLGTSFEMWTSQGQE 

6470 

A 

249 

906 

RIHFPRVSGPSQSNPKFAVASRGFFSLSLSSAQPDPIiPPPLGEALAli 
S LHPVPRRSTETVAGDS S ELQLGLRS PQQPI1AGI1AFLARLFLLFPP P 
*RCKSKJ>N*NDRRRSSVDSQIHLVGRESAHPI^GLRVCVSLPLLARC 
FGQVLQGVPGLWIPSPGGS/AGVSGRRREERHMGVVVMRVRVEARVS 
S * ES KI / SRALR * STHLGLPKCWDYRREPPCPAH 

6471 

A 

401 

659 

LIDLRFIYLFIYLR*GLRLVP*AGVHWCNLGSLQSIiPPGFKRFSCLS 
LS S S WDYRHAP PHPAHFCI FSRDAVS P CWSG * SQTPDLR 

6472 

A 

1271 

1538 

ARGCKRPCRQIASLMLQLCILHACWALVPQLHAAPGLAIQRLCICPY 
LGSWPSTLGPRVPPLSPSTSPRPSPAFGASLNPPRHSSGPLR 

6473 

A 

1 

369 

ACSSCFHAGLEHVRVRGSCHSGVAAARRSEAPDLFC*PPSLPVPSTP 
FSPS PAPAMNQATSDPTHTCPTRLPTCPNAAP * AGPTADTPS * GI PD 
LPAWLKTPQHLDLPVCSLCGPPLGPTET 

6474 

A 

3 

340 

ETRS YS VTQAGAEWHDHRSL * P * PPGLR* S SHFS LPSNWD YRCVPPH 
SANFLVIFVEIVFHHVAQAGLELLGSSDLPT*ASQSTGITGVNHCPA 
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EKEVLPREKAMQRGGSQA 

6475 

A 

21 

273 

GMNERGNITKE I KKI I WEYYB * LGTSQLDTLD * TNSQKDTNYQNQLT 
KYRKYGLEPITSDYISNFKRLPKKS PGLDGFTGEFI 

6476 

A 

2 

3204 

FGEPTSEQTGTAAGKTIAQTTAPVSWKPQDSSEQPQEKLCKNPCAMF 
AAGE I KTPTGEGLLDSPSKTMS I KERLALLKKSGEEDWRNRLSRRQE 
GGKAPASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAW 
KTRGRGAANDSTQFTVAGRMVKKGLAS PTAITPVASAICGKTRGTTP 
VSKPLEDIEARPDMQLESDLKLDRLETFLRRLNNKVGGMHETVLTVT 
GKSVKEVMKPDDDETFAKFYRSVDYNMPRSPVEMDEDFDVIFDPYAP 
KLTSSVAEHKRAVRPKRRVQASKNPLKMIiAAREDLLQEYTEQRLNVA 
FMES KRMKVEKMS SNSNFS EVTLAGLASKENFSNVS LRS VNLTEQNS 
NNS AVP YKRLMLLQIKGRRHVQTRL VE PRAS ALNSGDCFLLLS PHCC 
FLWVGEFANVIEKAKASELATLIQTKREIiGCRATYIQTIEEGINTHT 
HAAKDFWKLLGGQTSYQSAGDPKEDELYEAAIIETNCIYRLMDDKLV 
PDDDYWGKIPKCSLLQPKEVLVFDFGSEVYVWHGKEVTLAQRKIAFQ 
LAKHLWNGTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTE 
HNETILFKEKFLDWTELKRSNEKNPGELAQHKEDPRTDVKAYDVTRM 
VSMPQTTAGT I LDGVNVGRG YGLVEGHDRRQFE I TS VS VDVWH I LEF 
DYSRLPKQSIGQFHEGDAYVVKWKPMVS\RLYCVRGEVPVEGNLLEV 
ACHCSSLRSRTSMVVL1TVNKALIYLWHGCKAQAHTKEVGRTAANKIK 
EQC PLE AGLHS S S KVT I HE CDEG SE PLG FWDALGRRDRKAYDCMLQD 
PGSFNFAPRLFVLSSSSGDFAATEFVYPARAPSWSSMPFLQEDLYS 
APQPALFLVDNHHEVYLWQGWWPIENKITGSARIRWASDRKSAMETV 
LQYCKGKNLKKPAPKSYLIHAGLEPLTFTNMFPSWEHRED IAEITEM 
DTEVSNQITLVEDVLAKXCKTIYPLADLLARPLPEGVDPLKLEIYLT 
DEDFEFALDMTRDEYKALPAWKQVNLKKAKGLF 

6477 

A 

250 

5816 

DFNKKNLSFLGFPRQRKVSSFQKEFSLEDKEQLANHERGIDAQIjLVA 
LPKVAELRQIFEPKKKEFLEMKRKERIARRLEGIENDTQPILLQSCT 
GLVTHRLLEEDTPRYMRASDPASPHIGRSNEEEETSDSSLEKQTRSK 
YCTETSGVHGDS P YGSGTMDTHS LES KAER I ARYKAERRRQLAEKYG 
LTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYP 
GTETMGLRTCAGESKDYALHAGDGSSDPEVLLNIENQRRGQELSATR 
QAHDLS PAAES S STFS FSGRDS S FTE VPRS PKHAHSS SLQQAASRSP 
SFGDPQLSPEARPRCTSHSETPTVDDEEKVDERAKLSVAAKRLLFRE 
MEKSFDEQNVPKRRSRNTAVEQRLRRLQDRSLTQPITTEEWIAATL 
QASAHQKAliAKDQTtfEGKEIJUSQGEPDSSTLSL 

SKAISTRKRIDTRQRRMNARYQTQPVTLGEVEQVQSGKIilPFSPAVN 
TS VS TVAS TVAPMYAGDLRTKPPLDHNAS ATD YKFS SSI ENSDS PVR 
SILKSQAWQPLVEGSENKGMLREYGETESKRALTGRDSGMEKYGSFE 
EAEAS Y P I LNRAREGD SHKE S KY AVPRRG S LERANPP I THLGDEPKE 
FSMAKMNAQGNLDLRDRLPFEEKVEVENVMKRKFSLRAT^EFGEPTSE 
QTGTAAGKTIAQTTAPVS WKPQDS S EQPQEKLCKNPCAMFAAGEI KT 
PTGEGLLDSPSKTMS IKERLALLKKSGEEDWRNRLSRRQEGGKAPAS 
SLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKTRGRGA 
ANDSTQFTVAGRMVKKGLASPTAITPVASAICGKTRGTTPVSKPLED 
IEARPDMQLESDLKXiDRLETFLRRLNNKVGGMHETVLTVTGKSVKEV 
MKPDDDETF AKF YR S VD YNM PR S P VEMDED FD V I FD P YAP KLTS S VA 
EHKRAVRPKRRVQAS KUPLKMLAAREDLLQE YTEQRLNVAFMES KRM 
KVEKMSSNSNFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSAVPY 
KRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCFLWVGEF 
ANVIEKAKASELATLIQTKRELGCRATYIQTIEEGINTHTHAAKDFW 
KLLGGQTSYQSAGDPKEDELYEAAIIETNCIYRLMDDKIjVPDDDYWG 
KIPKCSLLQPKEVLVFDFGSEVYVWHGKEVTLAQRKIAFQLAKHLWN 
GTFDYENCDINPLDPGECNPLIPRKGQGRPDWAIFGRLTEHNETIIiF 
KEKFLDWTELKRSNEKiaPGEIiAQHKEDPRTDVKAYDVTRMVSMPQTT 
AGT I LDGVNVGRG YGLVEGHDRRQFE I TS VS VDVWH I LE FD YS RL PK 


IQSOO 


,1 O O S 4-6 3 . O H H 6. 0 ES 






QS IGQFHEGDAYVVKWKPMVSTAVGSRQKGEHSVIUVAGKEKCVYFFW 
QGRHSTVSEKGTSAIiMTVELDEERGAQVQVIiQGKEPPCFIiQCFQGGM 
VVHSGRREEEEENVOSEWRLYCVRGEVPVEC3NljLFVa(^HPcjQT dpdt 

SMWLNVNKALIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAGXiHS 
SSKVTIHECDEGSEPLGFWDAIiGRRDRKAYDCMLQDPGSFNFAPRLF 
ILSSSSGDFAATEFVYPARAPSWSSMPFLQEDLYSAPQPALFLVDN 
HHEVYLWQGWWPIENKITGSARIRWASDRKSAMETVLQYCKGKNLKK 
PAPKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQITLV 
EDVLAKLCKTIYPI^LLARPLPEGVDPLKLEIYLTDEDFEFALDMT 
RDE YNAJj PAWKQVNLKKAKGL F 

6478 

A 

553 

7399 

NFNKKNLSFLGFPRQRKVSSFQKEFSLEDKEQLANHERGIDAQLLVA 

L PKVAELRQI FE P KKKE FLEMKRKER I ARRLEG I ENDTQP I LLQS CT 

GLVTHRLLEEDTPRYMRASDPASPHIGRSNEEEETSDSSLEKQTRSK 

YCTETSGVHGDS P YG SGTMDTHS LE S KAER I ARYKAERRRQLAEKYG 

LTLDPEADSEYLSRYTKSRKEPDAVEKRGGKSDKQEESSRDASSLYP 

GTETMGLRTCAGESKDYALHAGDGSSDPEVLLNIENQRRGQELSATR 

QAHDLSPAAESSSTFSFSGRDSSFTEVPRSPKHAHSSSL.QQAASRSP 

SFGDPQLSPEARPSTGKPKHEWFLQKDSEGDTPSLINWPSRVKVREK 

LVKEES ARNS PELAS ES VTQRRHQPAP VH YVS FQSEHSAFDRVPS KA 

AGSTRQPIRGYVQPADTGHTAKLVTPETPENASECSWVASATQNVPK 

PPSLiTVLEGDGRDS PVLH I CES KAEEEEGEGEGEEKEEDVCFTE ALE 

QSKKTLLALEGDGLVRSPEDPSRNEDFGKPAVSTVTLEHQKELENVA 

QPPQAPHQPTERTGRSEMVTjYIQSEPVSQDAKPTGHNREASKKRKVR 

TRSLSDFTGPPQLQALKYKDPASRRELELPSSKTEGPYGEISMLDTK 

VSVAQLRSAFLASANACRRPELKSRVERSAEGPGLPTGVERERGSRK 

PRRYFSPGESRKTSERFRTQPITSAERKESDRCTSHSETPTVDDEEK 

VDERAKLSVAAKRLLFREMEKSFDEQNVPKRRSRNTAVEQRIjRRLQD 

RSLTQPI TTEE W I AATEP I PASCSGGTHP VMARLPS PTVARS AVQP 

ARLQASAHQKALAKDQTNEGKELAEQGEPDSSTLSLAEKLALFNKLS 

QPVSKAISTRNRIDTRQRRMNARYQTQPVTLGEVEQVQSGKLIPFSP 

AVNTSVSTVASTVAPMYAGDLRTKPPLDHNASATDYKFSSSIENSDS 

PVRSILKSQAWQPLVEGSENKGMLREYGETESKRALTGRDSGMEKYG 

S FEEAEAS Y P I LNRAREGD SHKES KYAVPRRG SLERANP P ITHLGDE 

PKE FS MAKMNAQGNLDLRDRLP FEE KVEVENVMKRKFS LRAAEFGEP 

TSEQTGTAAGKTIAQTTAPVSWKPQDSSEQPQEKLCKNPCAMFAAGB 

IKTPTGEGLLDSPSKTMSIKERXiALLKKSGEEDWRNRLSRRQBGGKA 

PASSLHTQEAGRSLIKKRVTESRESQMTIEERKQLITVREEAWKTRG 

RGAAOTSTQFTVAGRMVXKGLASPTAITPVASAICGKTRGTTPVSKP 

LEDIEARPDMQLESDLKIjDRLETFLRRLNNlWGGMHETVIiTVTGKSV 

KEVMKPDDDETFAKFYRSVDYNMPRSPVEMDEDFDVIFDPYAPKLTS 

SVAEHKRAVRPKRRVQASKNPLKMLAAREDLLQEYTEQRIjNVAFMES 

KRMKVEKMSSNSNFSEVTLAGLASKENFSNVSLRSVNLTEQNSNNSA 

VPYKRLMLLQIKGRRHVQTRLVEPRASALNSGDCFLLLSPHCCFLWV 

GEFANVIEKAKASEIiATLIQTKREIiGCRATYIQTIEEGINTHTHAAK 

D FWKLLGGQTS YQS AGD PKEDEL YEAA 1 1 ETNC I YRLMDDKL VPDDD 

YWGKI PKCS LLQP KE VL VFD FGS E VYVWHG KE VTLAQRK I AFQLAKH 

LWNGTFDYENCDINPLDPGECNPL I PRKGQGRPDWAIFGRLTEHNET 

ILFKEKFLDWTELKRSNEKNPGELAQHKEDPRTDVKAYDVTRMVSMP 

QTTAGTILDGVNVGRGYGLVEGHDRRQFEITSVSVDVWHILEFDYSR 

IiPKQSIGQFHEGDAYWKWKFMVSTAVGSRQKGEHSVRAAGKEKCVY 

FFWQGRHSTVSEKGTSAIjMTVELDEERGAQVQVLQGKEPPCFLQCFQ 

GGMWHSGRREEEEENVQSEWRLYCVRGEVPVEGNLLEVACHCSSLR 

SRTSMWLNVNKALIYLWHGCKAQAHTKEVGRTAANKIKEQCPLEAG 

LHSSSKVTIHECDEGSEPLGFWDALGRRDRKAYDCMLQDPGSFNFAP 

RLFILSSSSGDFAATEFVYPARAPSWSSMPFLQEDLYSAPQPALFL 

VDNHHEVYLWQGWWPIENKITGSARIRWASDRKSAMETVL.QYCKGKN 

LKKPAPKSYLIHAGLEPLTFTNMFPSWEHREDIAEITEMDTEVSNQX 
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ITLVEDVLAKLCKTIYPIJU)LLARPLPEGVDPLKLEIYLTDEDFEFA 
LDMTRDEYNALPAWKQVNLKKAKGLF 

6479 

A 

380 

999 

AGAKGGGPGGRGMKKTNSLTPLLSAYRLLPVELSKGWGILGFSVCPF 
APGLMAKGVSPCDTAVKRDNIH*VGGGVS*KCP*RSS*WSGPVPALT 
ALFLVDNHHEVYLWQGWWPI ENKI TG SAR I RWASDRKS AMETVLQYC 
KGEAPSNMPAARNRWGSPSGMRSQETPFPFCACQPASIHKAWQVVDE 
NPIFSPLKPFEDVIKFLD 

6480 

A 

78 

200 

KKGSPFFPKVGGQGGNLGLLEPLPPKLKKFFCFKPPKRGDYSSSSSS 
S PD S S S S KKKGVS PFWPGGV * TPKEGGL 

6481 

A 

179 

384 

FNS YYM YVYVYVH I CVFL YVC I C I YR Y I YtiC I AV I TCLMNE W * * FKC 
KGNDARG YEKKLPS KYE I EKE 

6482 

A 

76 

205 

RKKPEPE KTCFDN I TGNTY PCLS T YEHDSD * YYVNRT * LCAER 

6483 

A 

2 

774 

HSYSHSHGHCGSPAGDTEQGYKPVWPVCSLFPDGSHPGV*QPIHEPA 
/QGRGGLPPWGAA*TPRAWRLA*RPRG*AALPWA*TSPGRPASAPLA 
HTGSGCPSRPTRAPGPSP/IPIQNIKRPYPGEAFVPSRAG\PTVGVT 
RSFHLAPSLPPFPSS*LSPSLPPRTTTSCTRAILTPSS*QKLIiYPPS 
RP\VVLLVRRARPPAAAPTSEEPPERSPWETPHAAPSQLHELHBTHS 
VAQKSDLLPAPEAM*PGSVSSRFLLY 

6484 

A 

2 

732 

GRRRPGPKPPRAPGRDRDRCNF*PCCPPGAKSSAACCCSDIRRAARA 
IPLGNSPCSPLSAP*AP*NPLCRTVMGRSPMTLSLGSRTPTSLPAPF 
PWSPRFGE * PTHPRARSLGTLWPMPTTP* IQAATPWRRA* PPATQAS 
TPHSLRG SSSSS QPRLAPLRPTS SRACMAGPTTPAAGALGPVTLGVL 
TPPQAWASLRTPGRLLTSLSPRQPLQQRQWQQQQPQLQPQPRPLWQP 
CRRHRTRI 

6485 

A 

2 

338 

CTTAIPTPEEMTRLRSMl^QLQINVDCALTEVDLLQSRGNFDPKAMN 
NFYDNI * PGPDVPPKPSKKGPTDSEPKTSSGPCSNIGCLLSCKFFHA 
CSRFQNPPICIHQSTLAA 

6486 

A 

84 

300 

RVGARGKMTALASGQLGPLMCQFGLPA * AVEAANKGDVEAFAKAMQN 
ISKPEQI * GDTKDKTD * KEDLSLD 

6487 

A 

91 

668 

SRAQVVLHCVLREWAGGRRSPAPPSPWPVLHVPDPQSWQLSSVALGC 
SPGWSSALAWLFLQALGMFSAALASGQLGPL.MCQFGLPAEAVEAANK 
GGKVAAPGTPRPEPGAGS I PVPLDLQKWG *GGKGHSPS ALDPIALI P 
SFGARVRHHLVSLTCS \DVEAFAKAMQNNAKPEQKEGDTKDKKDEEE 
DMSLD 

6488 

A 

1 

414 

LQPGPAGAQQVDLA/SW*PPEIMAPI\LANADVQERLDPYI.PSGE\ 
SLPQTADEI \QNTLTLAPS SQQALGMF\ SAA\ LASGQVGAPFMCQFG 
LPAEA\ VRA\ANKGDVEAFAKAMQNNAK\ PEQKEGDTKDKKDEEEDM 
SLD 

6489 

A 

205 

553 

LSRPRHHKASTTGAQMQGRTRSMRQEPEKAILTLRGSREPTPRAPQL 
VPSVPSSWLFLSI^SIAQTPLPHTHSTHTPRSQPALPSQAPVPVPQH 
PPPHPE*ALSPDNLFSMILKSP 

64 90 

A 

3 

347 

SFFFVX»RRSLAVAQAEVQ*RNLGSLQPPPPEFKRFSCFSliPSGWDYR 
RPPPRPANFLYF * * RGVSTMWGQGWSRTPDLRGDIiAHLiGS PKVLGMT 

64 91 

j± 

9 ft n 

*± \J o 

Jf r v r Ju^a ttKv&yJUKjUNljUl 1j w oAK1joIjPRCWDYRREPPH 

6492 

A 

1665 

1789 

FFVLLVETGFHRVSODGLDLLTS * SAHLGLPKCWDYRHEPP 

O *± J 

A 
xi 

i,4Jl 

1 c c c 
1 J J J 

r r VljljViilijr HKVSOIXjLDLLTS*SAHLGLPKCT 


Tv 
M. 


1 Q A 

r r VJLJjViilC*rHRV&ODGLDLLTS*SAHLGLPKCWDYl?HE 

6495 

A 

1173 

1297 

FFVLLVETGFHRVSODGLDLLTS * SAHLGLPKCWnYPWP! PP 

6496 

A 

1027 

1149 

FFVLLVETGFHRVSODGLDLLTS * SAHLGLPKCWDYRHEPP 

6497 

A 

631 

744 

VIEHLVSQDGLDFLT^*SARLGLPKCWDYRREPPRPVH 

6498 

A 

166 

352 

I FFVFLVETGFHHVSQDGLDLLTS * STRLGLPKCWDYRREPPRPAPS 
SPIFHTVAPIQYTHP 

6499 

A 

2250 

2374 

FFVLLVETGFHRVSODGLDLLTS* SAHLGLPKCWDYRHEPP 

6500 

A 

150 

370 

EKELAFFPQGEMOGGNLG * LKPPPLR *RDFS CLALPRGGNGRGAP PS 
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PTNFGFLGGNGVYPSGPGGFBTPDLK 

6501 

B 

174 

350 

MDEKTKJCAEEMALSLTRAVAGGDEQVAMKCAIWLAEQRVPLSVQLKP 
EVSPTQDIRFLM* 

6502 

A 

1 

498 

EFRAPGPLGPRTRRPARRVAGSGGPG *G\ TI FPAPPPSPAPLPGPPL 
SLSLPPKPLPLRRTGASREERGLCLETRDARGAPASGGPPGLGTPRL 
GGVRASFPRFFPPRLVPFPPAPSADLSRRPLPSHRPTQDPGLVIGQC 
DASRLPRGQRGTHRAWAAPEATRPG 

6503 

A 

919 

1761 

RAFSFNREEPTADEQGLPTSVAGRPAVIAAVPSWHARGTERRAPGRT 
D*HHPPHLGAQPSAALLPGLP*HCGHISAGGRREAGNIPGRKIIYPP 
PQGFYGSNNGQHQAYIKLSDA/RSLTR*IQSSMTSCRVLR*GPSLPS 
AGSS PGLGMSCIjTSGTSCGYMTSSWP\ PPTDADLLiCSRDCVVSRAGS 
PGL* L*HGLGPPPV/SPRSLRTCPMRH* SAEQETFLFSFPHPNLIiGR 
PLPNSKLRGRQPLLSKTLSWHQPSRGLIWCCGSGFGDFCGLKIEQKM 

6504 

A 

131 

575 

VFALKPGTPGALPQVGVLRDLERPGWWSGRPFPASSHLGLVPFPPA 
F1\A_G I XjbKKFxjFoHKF 1 yDFGXjV I GQIj* LF PTAAGTATaHTGxjGFKK 
.KPHRPWLSCSWSPRLFPGDPEVSIFPRKARSPPGEWGTATPDGREDQ 

6505 

A 

2 

291 

HEPQLNSNNVALHGAVI PGPPFLFGAC INLMNFMVALFI PE YKIARG 

vyJVrll X J^rtTviooJj xn X F w KlsoXJ&XJl&FxjxjKXJb X XWhiiJr or x3xjFG.NvL_ F 

EL 

6506 

A 

1 

267 

PRFKLFPCLSLLSS*DYR*VPAITPG*FLYFF*VKTGFHHVGQTGLE 

XjXjXoVjUF f 1 oHoyoRb X. XoFloriV- X Wr lof XjjLXjWII X GolVJ\JVC&V» 

6507 

A 

1 

216 

HEWMGTRVGEWMDEWKEGWKMDGKVDK*MNGWMEGCLYQVnjJOT 

6508 

A 

74 

357 

VWPWDQPGSQGEGRDQREPSGCRLQAPLSLSL.CSFQASCSSRSSGAA 

o J. rL.rCoJUoM-H.v_Kc> iJuljlHVobW X JNIKvjilortX^KvjlVo^AKirKJrlrX^J^ 

C C ft o 

A 

*> 

A 

nc 
i 1 o 

KiirS V KFy W X FG w U 1 JJ HXj FN Xj b VAXjGF TKQPGRRQKPEGTQVPblPAS 
SIPASQSQPPQSQPPNPSLLHPRTPTPSLPIPASLIPASPKTQTL*I 

Ir/A O Cj VVJi V O i» tr X tr V* X r\.f\-xjvjJEr \jH»/tJM JCxJjO XjI 1 V 

6510 

A 

3 

148 

HERREVTVIRRGGKHQLRLVPTRWAGKGLLG*SICFWSFSLGHHWSV 

c 


a 

en 

^ j j 

Jjorlvo X VXj V X X J.l v Jx\Xiv,o x In. yjLVo rVXjirV_XjXlxXlvXi^XrlXrtXrlXrlX XlA. 

VLHTI/TCKQYYAI IGNVLS FY 

6512 


268 

355 

£»f X? XJUtr I\X»1M X XT X ll\llMIV\XSiUjn X\^IVX.Hi' r l 

6513 

A 

2 

206 

RKGSIClxDILRSQWSPALTMSIOnjLSICSLLWDPNPDDPljVPEIARI 
YKSDRDKYNR I CREWTQKYAM 


TV 
A 

i 

X 

"1 "7 
.3 J. / 

X 4 r x* r r r FXjr KvV-XjXXjoFKXj12»x bul 1 d AnL otioyruobXJrri o/iuK V A 
ETTGTRHHAWL I F VF * VETGLHHWLAGLELMGSRNQPTS ASQSAG I 
TGVNHRAWLTW 

O D 13 

A 



IvbAxr FbFvjf l\-riFFFFFr FJjKGivRXr r SFPRl\i>JWPPQ-R2r IvKAP 
PSSSSSSSSSSSSAQI *SFNSP 


TV 

en a 

you 

WJ? r r WXjLHFGWoAVAWWHDXjooXjQPPPPRXiM w FSCLSLLSNWDYRKA 
PPFlxASFVFLVDTArT^LGQAGLELLTSSDPPTxxASQSAGITGVSHR 
TQLRSSFDS 

6517 

A 

2 

337 

HVXISSSALILIISCLLI^U_ELVCCCLSRSSRNTFSRCTYSGLSKTP 

Fil WAAKAXXjXJJJ Vr oXjKX r oorloxjJVrlCtXj VyULil AbAFMx So 1 oKUFFA 
SASQSAG I TS VSHRARP 

6518 

A 

3 

260 

CLGLLVI FVCNFQS IRDYRRP PLH * L I FFVFLVEMGFYYVG * AGLEL 
LTS*SARLLASIiRAGITGVSHRARPRSFVFGFGWQDHG 

6519 

C 

38 

478 

MSEI KKRNSFI^QARK^TlvrLYXXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIFKEWT 
l^QMICEHFDEFOA'IPJRIQHK^NCKHAYLGRVGGRVGXRGFAMTI^ 
ASFXGA 

6520 

C 

33 

317 

MLTNHLIixxWPFFKl^XXXXXXXXXXXXXXX 
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XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRCPRITTFFLRG 

6521 

A 

1 

253 

RRLFFFFLRWSFALVAQAREQWCNLG*LQPPSPGFKQFFCFSLLNSW 
DYRRAPS CPANFVFLVEMGL Y I FKEWDNDGRMI CEAF 

6522 

A 

2 

411 

GVLRLS SAC PQACPS STSTGP * AALAGFTTTGSGLCWAWTPPS S S FV 
SGSSREAPSGTATTSWTRNVTATSATTANRMAMCTSTRTARLLLAPT 
TNCSPRSKTLTPIDPCSGGCRLTCKDNTPRLGTGQGRRQGTF 

6523 

A 

2 

222 

THIGLPKC*DYKCEPPVHRQFIi*F*VETRFHHISQAGFKLTTSGDPP 
ALASQSGRITGVTTRTQPLNGFYKVM 

6524 

A 

2 

261 

WWSQYAAERPSWAHACNPSTLGGRGRWLT*GQQVETSLASGYLP 
TLAS PKCWDYRREPLRLACFNFFVETRNI.LTVL P I S VQQ 

6525 

C 

203 

355 

MVALSTRGQVHLSGAWPSSFLIFLIAPSGEQEVDIAMVAGYAVTSV 
VLL* 

6526 

A 

2 

671 

GVGGVGRGMLPPAPPRAPAPGFPSTILWPSPPLL*LPPPGLGTGFLP 
FGHGCPPLSQSPWILFPWGGAQAQPPL* P * PRHPCPPHPRQSYGRQIi 
PTFWLAPPSFCLSLVFSSLSLILSLSSVSRSVLLP*HPPPHISFHPIi 
GFLS * AS PI SGVGG I KAFLPLRFQPP F * PL I PTTPLS L PKMGALWGR 
L * HS TPAQAMGSRAP FSGLAQASTYLLQP FG WGS 

6527 

A 

3 

477 

AGKFPLRSGHFPCFLTVESVCSGPFKPSVIPLSPCGASAPSPRFIiLS 
PSPGLVRAVLHCTL1,VAS*APPPASPPGVAVFSPPVSIWHPSRLPGS 
YGLCGQHLRDPTKPRSRTHFRAGVPPLVSCSLPSPNI.PLMTSGVQRP 
PKGLENS LLQLADLNK 

6528 

A 

1 

381 

VKAAWGKVGAQAGEYGAEALERMFLSFPTTKTYFPHFDIiSHGSAQVK 
GHGKKVADALTNAEAHVDDMPNALTALSDLHAHKLRVDPVNFKLLSH 
CLLVILADHLPAEFTFAEHASMDKFMASVSTGA 

6529 

A 

1 

2654 

MS FF S TLKTALSLKEKLAATG VTjVLI CALVGAG FAWERHQLKQAI EK 

IGSLDQAVKERDKSIMDLNQTIETMNKAEQHFHSQEVKNESEQAKYA 

DRQMERKAEVQKQLiVAAGNVRQRI PADTQRLIjRES ISEFNADADKGI 

FMSDKVTVKQTINKATSIYKIEQITVGKPGSEQYRRAFELADQLGLK 

HPDCIEHVFPTYADEQCTHVLTEEDFFSTEEREGVDRCIGVICSSVS 

DELFPNVPE YGG I G YQFL YEGDELKC YEHDI ES KAKELTVNSNNTVQ 

PVALMRLGVFVPKPS KS KGESKE I DATKAFS QLE I AKAEGYDD I KIT 

GPRLDMDTDFKTWIGVIYAFSKYGLSSNTIQLSFQEFAKACGFPSKR 

LDAKLRLTIHESLGRLRNKGIAFKRGKDAKGGYQTGLLKSWRADSKL 

WELFQLDYRVLLQHHALRALPKKEAAQAIYTFIESLPQNPLPLSFAR 

IRERLALQSAVGEQNR 1 1 KKAI EQLKTI G YLDCS IEKKGRESFVIHT 

YLIPAIPPQHIIQCDFPLSHILDFYSPSSTHRSCRASRSHQLIESEA 

WLP I EAGNHQNDSGYNLNVEE I RVIjY AWRMPKHVVTHLAKDQLHKVA 

QRANRMLNVLTEQVQLQKDELHANEFYQVYAKAALAKLPLLTRA^ 

YANI I D I YEHRG VP KYRDR YS ERG VS KTVS TVS LAHAMRAHPHLLME 

DLRIIiVIDLDPQSSATMFLSHKHSIGIVNATSAQAMLQNVSREEIjLE 

EFIVPSWPGVDVMPASIDDAFIASDWRELCNEHLPGQNIHAVLKEN 

VIDKLKSDYDFILVDSGPHLDAFLKNALASANILFTPLPPATVDFHS 

SLKYVARLPBLV/KLISDEGCECQLATNIEKQTNNWANP 

6530 

A 

1 

945 

MPASIDDAFIASDWRELCNEHLPGQNIHAVIiKENVIDKLKSDYDFIL 
VDSATVDFHSSLKYVARLPELVXLISDEGCECQLATNIGFMSKLSNK 
ADHKYCHSLAKEVFGGDMLDVFLPRLDGFERCGES FDTV I SANPATY 
VGSADALKNARIAAEDFAKAVFDLEQVFKLSTGRQATFIEEVIPPNQ 
VJbbDI r VUQIINNGKDQASDTPKSJjKS IRS T I KHQQFYPAIGVRRATG 
KIE I LDGS RRRAS AILENVGLRVLVTDQE I S VQEAQNLAKDVQTALQ 
HSIREIGLRIiMRMKtlDGMSQKDIAAKEGLSQAK 

6531 

A 

146 

4368 

LGFLLRLSEMPRKQGDYRT/SDMEIRGRVEQRVGYTIEQINHMRDVF 
GTRIiRRAEDVFPPVIGVAAHKGGVYKT S VSVHLAQDliAL KGLRVLLV 
EGNDPQGTASMYHGWVPDLHIHAEDTLLPFYLGEKDDVTYAIKPTCW 
PGLDII PS CLALHR I ETELMGKFDEGKLPTDPHLMLRLA I ETVAHDY 
DVIVIDSAPNLGIGTINWCAADVLIVPTPAELFDYTSALQFFDMLR 
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DLLKNVDLKGFE PDDLA I LFHVTAVHMF S LEG I ENLPFELQRNFQLM 

RDLDQRTEGVKRATRKNENFGCSPASSFSHLRERRDPGFSHDPGDQR 

QEAQNLAKAVQTALQHS I RE I GLRLMRS KNDGMS QKD I AAKEGLSQA 

KVTRALQAASAPEELVAIiFPVQSELTFSDYKTLCAVGDEMGNKNLEF 

DQLIQNISPEINDILSIEEMAEDEVKNKILRLITKEASLLTDKGSKD 

KSVVTELWKFEDKDRFARKRVKGRAFSYEFNRLSKEKSKLSNLREYQ 

NR I AG I AKRS KAVLGWAS TAQFGTDNQF I KDDAARAAS I LEAARKDP 

VFAG I SDNATAQI ATAWAS ALAD YAAAHKSMPRPE I LAS CHQTLENC 

LIESTRNSMDATNKAMLESVAAEIWSVSDGVMRLPLFLAMILPVQLG 

AATADACTFIPVTRDQSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVY 

SQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYVNRIK 

SWDNGSGSLLHSFTNAAGEQITVTCSLNYNIGQIALSFSKAPDKGT 

EIAIETEINIEAAPELIPLINHEMKKYTLFPSQFVIAAEHTFRTLKE 

YLSHEQDMLRLRIMIWRTLATDTFDIALPVNQSFDVWATIIRGKFOT 

VYRDIIERVKSSGAMGMFAGADAASFFQQLPKISSSQPKTISRLHLL 

FLPNFLKLTDTFLETIEENGWLGIHDTAHKGSLAWLAFLLACFLGE 

GS S FAKTARYETVGKGLLLLLVTAGEVRI VSEHTFSAAATSSHLMCL 

FNLCQVS QGKNVLKP FPQ IEALVRAPNPQEHQEEAKETDTADS LCKL 

DLGIARTRVRIRRPHTSPALSPQHPTPAVHAHLPVDEPHASAVQTSV 

STSVGEPTRWVPGAPSGVAGTAGWRRAGRLKAVATELLPMNGSLWST 

QARISPAFLLKVNQTLMDYRTGRVQPAIPQSLPEALRIiAADLAEQKQ 

RLEQKMLMDAPKVEFAERVATASGVLIGNYAKVLGLGQNYLFTWLRD 

NGILIATGERRNVPKQDSISHAGHILSQCANLAATTSEYFIHKPHRL 

IAAETGYSQSTVVRAFREAVNKGILSVEIVIGDHRERRANLYRFTPS 

FLAFAQQAKNAL IESKLKIS SAATKS KVTAWE ARARGH YYHKGDSR 

6532 

A 

820 

2011 

TTTLVRLPCRS PKRR I KALRSQLRGLPMMRVPS FRLRVTS PTFMKSL 
TWQVHLLVLMLL VMF WTCNPS VCTAS YVAAMCWWQAPMAPAKPQPSR 
WKTSKARMYQSQKVAKTSTLTVLSLLLITVPAAYFTRLLMLLLPLMR 
VPS FRLRVTS PTSMKSLTWQVHLLVLMLLVMFWTCNPS VCTJ\S YVAA 
MCWWQAPMAPAK^QPSRWKTSKARMYQSEKVAITSTLTVLSLLLITV 
PAAYFTRLLMLLLPLMRVPS FRLRVTS PTSMKSLTWQVHLLVLMLLV 
MFWTCNPSVCTASYVAAMCWWQAPMAP7VKPQPSRWKTSK7UIMYQSEK 
VALTS TLTVLSLLL ITVPAAYFTRLLMLLVS KS L/ CTCS LNYN IGQI 
ALS FS KAPD KGTE I AI ERGANGD VLDMQS VG VYS QLRRRYVLVAS SD 
GTSKTATFKMEDFEGQNVPIRKGRTNIYVSRIKSWDNGSGSLLHSF 
TNAAGEQITVTCSLNYNIGQIALSFSKAPDKGTEIAIERATDDACTF 
I PVTRDQSD I YEVFNVAGS S FGS YAAGD VLDMQS VGVYS QLRRRYVL 
VASSDGTSKTATFKMEDFEGQNVPIPKGRKNIYVNRIKSWDNGSGS 
LLHS FTNAAATADACTF I PVTRDQSD I YEVFNVAGSS FG S YAAGDVL 
DMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRN 
N I YVNR I KSWDNG SG S LLHS FTNAAATADACTFI PVTRDQSD I YEV 
FNVAGSS FGS YAAGDVLDMQS VGVYS QLRRRYVLVAS SDGTSKTATF 
KMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAGEQI 
TVTCSLNYNIGQIALSFSKAPDKGTEIAIERGSST 

6533 

A 

1 

3039 

MLNVLTEQVQLQKDELHAl^FYQVYAKAALAKLPLLTRANVDYAVSE 
MEEKGYVFDKRPAGSSMKYAMS IQNI IDI YEHRGVPKYRDRYSEAYV 
I F I SNLKGGVSKTVSTVSLAHAMRAHPHLLMEDLR I LVI DLDPQS S A 
TMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSWPGVDVMP 
ASIADDAFIASDWK/ELCNEHLP\LVKLISDEGCECQLATNIGFMSK 
LSNKADHKYCHS LAKEVFGGDMLDVFLPRLDGFERCGES FDTVI SAN 
PATYVGS ADALKNAR IAAEDFAKAVFDRI S YAKI I KKFRSTI KHQQF 
YPAIGVRRATGKIEILDGSRRRASAILENVGLRVLVTDQEISVQEAQ 
NLAKDVQTALQHSIREIGLRLMRMKNDGMSQKDIAAKEGLSQAKVTR 
ALQAASAPEELVALFPVQSELTFSDYKTLCAVGDEMGNKNLEFDQLI 
QNISPEINDILSIEEMAEDEVKNKILRLITKEASLLTDKGSKDKSW 
TELWKFEDKDRFARKRVKGRAFSYEFNRLSKEKSKLSNLREYQNRIA 
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D I AKRS KAVLGWAS TAQFGTDNQF I KDDAARAAS I LEAARKDP VFAG 
ISDNATAQIATAWASALADYAAAHKSMPRPEILASCHQTLENCLIES 
TRNSMDATNKAMLES VAAEMMS VSDGVMRLPLFLAM I LPVQLGAATA 
DACTFIPVTRDQSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLR 
RRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYVNRIKSVVD 
NGSGSLLHSFTNAAGEQITVTCSLNYNIGQIALSFSKAPDKGTEIAI 
ETE INIEAAPELI PLINHEMKKYTLFPSQFVI AAEHTVQAAYEAQRE 
FGLDLGSLQFRTLKEYLSHEQDMLRLRIMIWRTIATDTFDIALPVNQ 
S FDVWATI I RG KFQTVYRD 1 1 ERVKS SGAMGM F AGADAAS FFKQLPK 
DFFQPAEDYIQTPYVHYIGTPFRTTHST 

6534 

A 

1 

2607 

MS F FS TLKTALS LKEKLAATGVLVL I CALVGAG FAWERHQLKQAI EK 
IGSLDQAVKERDKSIMDLNQTIETMNKAEQHFHSQEVKNESEQAKYA 
DRQMERKAEVQKQLVAAGNVRQR I PADTQRLLRES I S EFNADADKGT 
VKQTINKVISIYKIEQVTVGKPGSEQCRRAFELADQLGLKHPDCIEH 
VFPTYAQEQCAHVXiTEEDCFSNAERDGDDRCIGVICSWEFAKACGFA 
SKRLDVKLRLAIHESLGRLRSKGIAFKRGKDAKGGYQPGLLKVERFD 
AALAHIFDTSDPSTAHNSMRLHQLIESEAWLPIEAGNHQNDSGYNLN 
NVSREELLEEFIVPSWPGVDVMPASIDDAFIASDWRELOJEHLPGQ 
NIHAVLKENVIDKLKSDYDFILVDSGPHLDAFLKNALASANILFTPL 
PPATVDFHSSLKYVARLPELAASAPEELVALFPVQSELTFSDYKTLC 
AVGDEMG YKNIiE FDQL I QN I S PE IND I LS I EEMAEDEVKNKI LRL I T 
KEASLLTDKGSKDKSVVTELWKFEDK3DRFARKRVKGRAFSYEFNRLS 
KEKSKLSNLREYQNR I AD I AXRS KAVLGWAS TAQFGTDNQF I KDDAA 
RAASILEAARKDPVFAGISDNATAQIATAWASALADYAAAHKSMPRP 
E I LAS CHQTLENCL I E S TRNSMDATNKAMLES VAAEMMS VSDGVMRL 
PLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNVAGSSFGSYA 
AGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPI 
RKGRTNIYVNRIKSVVDNGSGSLLHSFTNAAGEQITVTCSLNYNIGK 
IALWFSKEPDKGTEI PFEGENQF 

6535 

A 

1 

4557 

M S F FS TLKTALS L KE KLAATGVLVLI CALVGAG FAWERHQLKQAI EK 
IGSLDQAVKERDKSIMDLNQTIETMNKAEQHFHSQEVKNESEQAKYA 
DRQMERKAEVQKQLVAAGNVRQR I PADTQRLLRES IS EFN AD ADKG I 
FMSDKVTVKQTINKATSIYKIEQITVGKPGSEQYRRAFELADQLGLK 
HPDCIEHVFPTYADEQCTHVLTEEDFFSTEEREGVDRCIGVICSSVS 
DELFPNVPEYGGIGYQFLYEGDELKCYEHDIESKAKELTVNSNNTVQ 
PVALMRLGVFVPKPSKSKGESKEIDATKAFSQLEIAKAEGYDDIKIT 
GPRLDMDTDFKTWIGVIYAFSKYGLSSNTIQLSFQEFAKACGFPSKR 
LDAKLRLTIHESLGRLRNKGIAFKRGKDAKGGYQTGLLKVGRFDADL 
DLIELEADSKLWELFQLDYRVLLQHHALRALPKKEAAQAIYTFIESL 
PQNPLPLSFARIRERLALQSAVGEQNRIIKKAIEQLKTIGYLDCSIE 
KKGRESFVIHTSKVPDKRSSQLIESEAWLPIEAGNHQNDSGYSLNVE 
E I RVLYAWRMPKHVVTHIjAKDQLHKVAQRANRMLNVLTEQVQLQKDE 
LHANEFYQVYAKAALAKLPLLTRANVDYAVSEMEEKGYVFDKRPAGS 
SMKYAMSIQNIIDIYEHRGVPKYRDRYSEAYVIFISNLKGVGELISD 
EGCECQLATNIGFMSKLSNKADHKYCHSLAKEVFGGDMLDVFLPRLD 
GFERCGESFDTVISANPATYVGSADALKNARIAAEDFAKAVFDPSGD 
RVEQVFKLSTGRQATFIEEVIPPNQVESDTFVDQHNNGRDQASLTPK 
SLKSIRSTIKHQQFYPAIGVRRATGKIEILDGSRRRASAILENVGLR 
VLVTDQE I S VQEAQNLAKDVQTALQHS I RE I GLRLMRMKNDGMSQKD 
IAAKEGLSQAKVTRALQAASAPEELVALFPVQSELTFSDYKTLCAVG 
DEMGNKNLE FDQL I QN I S PE IND I LS I EEMAEDEVKNKI LRL I TKEA 
SLLTDKGSKDKSVVTELWKFEGKDRFARKRNRIADIAKRSKAVLGWA 
S TAQFGTDNQF I KDDAARAAS I LEAARKD P VFVH I FDNATVQS ATAW 
AS ALADYAAAHKSMPRPE I IiAS CHQTLENCL I ESTRNSMDATNKAML 
ES VAAEMMS VSDGVMRL PL FLAM I LPVQLGAATADACTF I PVTRDQS 
D I YE VFNVAG S S FG S YAAGD VLDMQS VG VYSQLRRR YVL VAS S DGTS 
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KTATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSIiLHSFTNA 

AGEQITVTCSLNYNIGQIALSFSKAPDKGTEIAIETEINIEAAPELI 

PLINHEMKKYTLFPSQFVIAAEHTVQAAYEAQREFGLDLGSLQFRTL 

KEYLSHEQDMLRLRIMIWRTLATDTFDIAIiPVNQSFDVWATIIRGKF 

QTVYRDIIERVKSSGAMGMFAGADAASFFKQLPKDFFQPAEDYIQTP 
YVHYIGTPFRTTHST 

6536 

A 

3 

2847 

ELHAYAMS IQNI IDI YEHRGVPKYRDR\LAHAMRAHPHLLMEDLRIL 
VIDLDPQSSATMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVP 
SVVPGVDVMPASIDDAFIASDWRELCNEHLPGQNIHAVXjKENVIDKL 
KSDYDFILVDSGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYV 
ARLHELVKLISNEGCECQIATTIGFMSKLSNKADHKYCHSLAKEVFG 
GDMLDVFLPRLDGFERCGESFDTVISANPATYVGSADALKNARIAAE 
DFAKAVFDQNVGLRVLVTDQEISVQEAQNLAKDVQTALQHSIREIGL 
RLMRMKNDGMSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQS 
ELTFSDYKTLCAVGDEMGYKNLEFDQLIQNI SPEINDILS IEEMAED 
EVKNKILRLITKEASLLTDKGSKDKSVVTELWKFEDKDRFARKGVKG 
RAFSYEFNRLSKKKSKLSNLREYQNRIADIAKRSKAVLGWASTAQFG 
TDNQF I KDDAARAAS I LEAARKDPVFAG I SDNATAQ I ATAWASALAD 
YAAAHKSMPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAE 
MMS VSDG VMRLPLFLAM I L PVQLGAATADACTF I PDTRDQSD I YEVF 
NVAGS S FG S YAAGDVLDMQS VGVYSQLRRR YVX.VASSDGTS KTATFK 
MEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAGEQIT 
VTCS LNYN IGQ I ALS FS KAPDKGTE I AI ETE IN I EAAPEL I PLINHE 
MKKYTLFPSQFVIAAEHTVQAAYEAQREFGLDLGSLQFRTLKEYLSH 
EQDMLRLRIMIWRTLATDTFDIALPVNQSFDVWATIIRGKFQTVYRD 
IIERVKSSGAMGMFAGADAASFFKQLPKDFFQPAEDYIQTPYVHYIG 
TPFRTTHST 

6537 

A 

1 

4236 

MSFFSTLKTALSLKEKLAATGVLVLICALVGAGFAWERHQLKQAIEK 

IGSLDQAVKERDKSIMDLNQTIETMNKAEQHFHSQEVKNESEQAKYA 

DRQMERKAEVQKQLVAAGNVRQR I PADTQRLLRES I SEFNADADKGI 

FMSDKVTVKQT I NKATS I YKI EQ I TVGKPG S EQYRRAFE LADQLGLK 

HPDCIEHVFPTYADEQCTHVLTEEDFFSTEEREGVDRCIGVICSSVS 

DELFPNVPEYGGIGYQFLYEGDELKCYEHDIESKAKELTVNSNNTVQ 

PVALMRLGVFVPKPSKSKGESKEIDATKAFSQLEIAKAEGYDDIKIT 

GPRLDMDTDFKTWIGVIYAFSKYGLSSNTIQLSFQEFAKACGFPSKR 

LDAKLRLTIHESLGRLRNKGIAFKRGKDAKGGYQTGLLKKEAAQAIY 

TFIESLPQNPLPLSFARIRERLALQSAVGEQNRIIKKAIEQLKTIGY 

LDCSIEKKGRETHIFDTSDPSTAHNSMRLHQLIESEAWLPIEAGNHQ 

NDSGYNLNNVSREELLEEFIVPSVVPGVDVMPASIDDAFIASDWREL 

CNEHLPGQNIHAVLKENVIDKLKSDYDFILVDSGPHLDAFLKNALAS 

ANILFTPLPPATVDFHSSLKYVARLPELKNRPTIGRTLNPSILSGFD 

SSSASGDRVEQVFKLSTGRQATFIEEVIPPNQVESDTFVDQHNNGRD 

QASLTPKSLKSIRSTIKHQQFYPAIGVRRATGKIEIUDGSRRRASAI 

LENVGLRVL VTDQE I SVQEAQNLAKDVQTALQHS I REIGLRLMRMKN 

DGMSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQSELTFSDY 

KTLCAVGDEMGYKNLEFDQLIQNISPEINDILSIEEMAEDEVKNKIL 

RLITKEASLLTDKGSKDKSVVTELWKFEDKDRFARKRVKGRAFSYEF 

NRLSKEKSKLSNLREYQNRIADIAKRSKAVLGWASTAQFGTDNQFIK 

DDAARAAS I LEAARKDPVFAG I SDNATAQ I ATAWASALAD YAAAHKS 

MPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDG 

VMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNVAGSSF 

GSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQ 

NVP I R KGRTN I YVNR I KSWDNGSGS LLHS FTNAAGEQ I TVTCS LNY 

NIGQIALSFSKAPDKGTEIAIETEINIEAAPELIPLINHEMKKYTLF 

PSQFVIAAEHTVQAAYEAQREFGLDLGSLQFRTLKEYLSHEQDMLRL 

RIMIWRTLATDTFDIALPVNQSFDVWATI IRGKFQTVYRDI IERVKS 
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SGAMGMFAGADAASFFKQLPKDFFQPAEDYIQTPYVHYIGTPFRTTH 
ST 

6538 

A 

65 

316 

S SAGWKKS FGNCLKKDAAS APANI P I APEDLTRS IM SR YTVWNL PRM 
MVAHTSKDWLTGRAMSKVSVARVRQIMMRRRSISCS 

6539 

A 

54 

265 

KKDAASAPANIP I APEDLTRS I MS R YTVWNLPRMMV AHTS KDWLTGR 
AMSKVSVARVRQIMMRRRS ISCS 

6540 

A 

2034 

2175 

EMPGHARCYADQQRYRFLPGSEVLP/ASINVAWRPVDNLNTCSTRSP 
E 

6541 

A 

33 

285 

SSAGWKKSFGNCLKKDAASAPAN I PI APEDLTRS IMSRYTVWNLPRM 
MVAHTSKDWLTGRAMSKVSVARVRQIMMRRRSISCS 

6542 

A 

1 

2448 

MLNVLTEQVQLQKDELHANEFYQVYAKAALAKLPLLTRANVDYAVSE 
MEEKGYVFDKRPAGSSMKYAMSIQNIIDIYEHRGVPKYRDRYSEAYV 
I F I SNLKGGVSKTVSTVSLAHAMRAHPHLLMEDLR ILVIDLDPQS SA 
TMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSWPGVDVMP 
ASIDDAFIASDWRELCNEHLPGQNIHAVLKENVIDKLKSDYDFILVD 
SGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARLPELVKLI 
SDEGCECQLATNIGFMSKLSNKT^HKYCHSLAKEVFGGDMLDVFLPR 
LDGFERCGESFDTVISANPATKSKLSNLREYQNRIADIAKRSKAVLG 
WAS TAQFGTDNQ F I KDD AARAAS I L EAARKDP VFAGI S DNATAQ I AT 
AWASALADYAAAHKSMPRPEILASCHQTLENCLIESNRNSMDPNNKA 
MLESVAAE^SVSDGVWRLPLFLAMILPVQLGAATADACTFIPVTRD 
QSDI YEVFNVAGS S FGS YAAGDVLDMQS VG VYSQLRRR YVLVAS SDG 
TSKTATFKMEDFEGQNVPIRKGRTNIYVNRIKSVVDNGSGSLLHSFT 
NAAGEQI TVTCS LNYN I GQ I ALS FS KAPDKGTE I A I ETE INI EAAPB 
LIPLINHEMKKYTLFPSQFVIAAEHTVQAAYEAQREFGLDLGSLQFR 
TLKEYLSHEQDMLRLRIMIWRTLATDTFDIALPVNQSFDVWATIIRG 
KFQTVYRD 1 1 ERVKS SGAMGM FAG ADAAS FFKQL PKDFFQPAED YI Q 
TPYVHYIGTPFRTTHST 

6543 

A 

1 

2979 

MLNVLTEQVQLQKDELHANEFYQVYAKAALAKLPLLTRANVDYAVSE 
MEEKGYVFDKRPAGSSMKYAMS IQNI IDI YEHRGVPKYRDRYSEAYV 
IFISNLKGGVSPTVSTVSLAHAMRAHPHLLMEDLRILVIDLDPQSSA 
TMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSWPGVDVMP 
AS IDDAF IASD WRELCNEHLPGQN IHAVLKENV I DKLKS D YDF I LVD 
SGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARLPELKNRP 
TIGRTLNPS ILSGFDS SS ASGDRVEQVFKLS TGRQATF I EEVI PPNQ 
VESDTFVDQHNNGRDQAS LTPKSLKS I RST I KHQQFYPA IGVRRATG 
KIEILDGSRRRASAILENVGLRVLVTDQEISVQEAQNLAKDVQTALQ 
HSIREIGLRLMRMKITOGMSQKDIAAKEGLSQAKVTRALQAASAPEEL 
VALFPVQSELTFSDYKTLCAVGDEMGYKNLEFDQLIQNISPEINDIL 
SIEEMAEDEVKNKILRLITKEASLLTDKGSKDKSWTEIiWKFEDKDR 
FARKRVKGRAFSYEFNRLSKEKSKLSNLREYQNRIADIAKRSKAVLG 
WAS TAQ FGTDNQF I KDDAARAAS I LEAARKD P VFAG I S DNATAQ I AT 
AWASALADYAAAHKSMPRPEILASCHQTLENCLIESTRNSMDATNKA 
MLESVAAEmSVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRD 
QSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSDG 
TS KTATFKMEDFEGQNVP I RKGRTNI YVNR I KSWDNG S GSLLHS FT 
NAAGEQITVTCSLNYNIGQIALSFSKAPDKGTEIAIETEINIEAAPE 
LIPLINHEMKKYTLFPSQFVIAAEHTVQAAYEAQREFGLDLGSLQFR 
TL KE i LSxi EQDMLRLR INI WRTLATDT FD I AL P VNQ S FD VWAT I IRG 
KFQTIY 

6544 

A 

1 

3276 

MLNVLTEQVQLQKDELftANE F YQ VYAKAALAKLPLLTRANVD YAVSE 
MEEKGYVFDKRPAGSSMKYAMSIQNIIDIYEHRGVPKYRDRYSEAYV 
IFISNLKGGVSKTVSTVSI^HAMRAHPHLLMEDLRILVIDLDPQSSA 
TMFLSHKHS IG I VNATS AQAMLQNAMLQNVS REELLEEF I VPS WPG 
VDVMPAS I DDAF I ASDWRELCNEHLPGQN IHAVLKENVI DKLKSD YD 
FILVDSGPHLDAFLKNALASANILFTPLPPATVDFHSS3JKYVARLPE 
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LVKLISDEGCECQIATNIGFMSKLSN^a^HKYCHSLAKEVFGGDMLD 
VFLPRLDGFERCGE S FDTV I S ANP ATYVG S AD ALKNAR I AAEDFAKA 
VFDPSGDRVEQVFKLSTGRQATFIEEVIPPNQVESDTFVDQHNNGRD 
QASLTPKSLKSIRSTIKHQQFYPAIGVRRATGKIEILDGSRRRASAI 
LENVGLRVLVTDQE I SVQEAQNLAKDVQTALQHS IRE I GLRLMRMKN 
DGMSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQSELTFSDY 
KTLCAVGDEMGNKNLE FDQL I QN I S PE IND I L S I EEMAEDE VKNK I L 
RLITKEASIiliTDKGSKDKSVVTELWJCFEDKDRFARKRVKGRAFSYEF 
NRLS KE KS KL SNLRE YQNR I AD I AKRS KAVLGWASTAQFGTDNQF I K 
DDAARAAS ILEAARKDPVFAG I SDNATAQI ATAWASALADYAAAHKS 
M PRPE I LAS CHQTLENCL I e s trns md atnkamles vaaealmms vs 
DGVMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNVAGS 
SFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFE 
GQNVPIRKGRTNIYVNRIKSVVDNGSGSLLHSFTNPAGEQITVTCSL 
NYNIGQIALSFSKAPDKGTEI AI ETEINIEAAPEL I PLINHEMKKYT 
LFPRKLWGLC I M I WRTLATDTFD I AL PVNQS FDVWAT 1 1 RGKFQTVY 
RDIIERVKSSGAMGMFAGADAASFFKQLPKDFFQPAEDYIQTPYVHY 
IGTPFRTTHST 

6545 

A 

1 

67 

MTKCSHLCNNYLFSVTLRDNGILI ATGERRNVRKQEY I SRGYFTLKET 
VIDTSNGSRISFTTRITGKGQQWLMKRLLDADVVRTTSIVWLTIICVTF 
LSCITMSDFTFSGYELACFVTHSGLSRSAGHILSQCANLAATTSEYF 
IHKPHRJjITAETGYSQSTVVRAFREAVNKGILSVEIVIGDHRERRAN 
LYRFTPS FLAFAQQAKNAL IESKLKI S SAATKVKAVLAKTLALFNFL 
STPPCQNDTPSPCQDDVAIKNKKSQVKKTKRSVSGGAGTTSLKKLTS 
WI AKAKAKADNLRLSKKRTQKHEFKQKVEAAARKYAYLKNKRS PD IG 
GI SNFDNLPHCMTVl^AIiNAVLAKNKDNEQWGAVAGAYIAD I TDGED 
RARHFGLMS ACFGVGSGVYWQTRYRGKLVS QGF WQM I LTPRDADGT 
FITSTDSETVHYHYGPKDLVTILFYIFITIILHAVVQEYILDKISKR 
T^LSKVKHSKPNESGQLWFHFTSVIWCFYVVVTTPQAIiTRVLNCLL 
ALGSNAAVMGECGFAQLQQTWASALADYAAAHKSMPRPEILASCHQT 
LENCL I ESTRNSMDATNK/ ADAG I CRSRDDERF * RC YAS AF I PRDDP 
ACSVGGSYR*CVITEF 

6546 

A 

1 

3435 

MIJm.TEQVQLQKDELHANEFYQVYAKAALAKLPLLT^ 
MEEKGYVFDKRPAGSSMKYAMS IQNI IDI YEHRGVPKYRDRYSEAYV 
I F I SNLKGG VS KTVS TVSLAHAMRAH PHLLMEDLR I LV I DLDPQS S A 
TMFLSHKHS IG I VNATSAQAMLQNAMLQNVSREELLEEF I VPS WPG 
WVMPASIDDAFIASDWRELCNEHLPGQNIHAVLKENVIDKLKSDYD 
FILVDSGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARLPE 
LVKLISDEGCECQLATNIGFMSKLSNKADHKYCHSLAKEVFGGDMLD 
VFLPRLDGFERCGESFDTVISANPATYVGSADAIiKNARI AAEDFAKA 
VFDPDLIVLQPLAIESRRYSSYQLVARPHLLKRVIPPNQVESDTFVD 
QHNNGRVQASLTPKSLKSIRSTIKHQQFYPAIGVRRATGKIEILDGS 
RRRAS A I LENVGLRVLVTDQE I SVQEAQNLAKDVQTALQHS IRE I GL 
RLMRMKNDGMSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQS 
ELTFSDYKTLCAVGDEMGNKNLEFDQLIQNISPEINDILSIEEMAED 
EVKNKILRIilTKEASLLTDKGSKDKSWTELWKFEDKDRFARKRVKG 
RAFSYEFNRLSKEKSKLSNLREYQNRIADIAKRSKAVLGWASTAQFG 
TDNQF I KDDAARAAS I LEAARKDP VFAG I SDNAT AQ I ATAW AS ALAD 

MMSVSDAEMMSVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRD 
Q S D I YE V FNVAG S S FGS YAAGD VLDMQS VG VYSQLRRR YVLV AS S DG 
TSKTATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFT 
NAAGEQITVTCSLNYNIGQIALSFSKAPDKGTEIAIETEINIEAAPE 
LIPLINHEMKKYTLFPSQFVIAAEHTVQAAYEAQREFGLDLGSLQFR 
TLKEYLSHEQDMLRLRIMIWRTLATDTFDIALPVNQSFDVWATIIRG 
KFQTVYRDIIERVKSSGAMGMFAGADAASFFKQLPKDFFQPAEDYIQ 
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TPYVHYIGTPFRTTHST 

6547 

B 

9377 

49839 

MUTVLTEQVQLQKDELHANEFYQVYAKAALAIOjPLLTRA 
MEEKGYVFDKRPAGS SMKYAMS I QNI IDI YEHRGVPKYRDRY SEAYV- 
I FI SNLKGGVSKTVSTVSLAHAMRAHPHLLMEDLRI LVIDLDPQSSA 
TMFLSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSVVPGVDVMP 
AS IDDAF I ASDWRELCNEHLPGQNIHAVLKENVIDKLKSDYDFI LVD 
SGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARIiPELVKLI 
SDEGCECQLATNIGFMSKLSNKADHKYCHSLAKEVFGGDMLDVFLPR 
LDGFERCGESFDTVISANPATGFDSSSASGDRVEQVFKLSTGRQATF 
I EE V I P PNQVLDGL PGKMKI LDVPRRRAS P I LENVG I RVLVTTQE I S 
VQEAQNLAKDVQTALQHSIREIGLRLMRMKNDGMSQKDIAAKEGLSQ 
AKVTRALQAASAPEELVALFPVQSELTFSDYKTLCAVGDEMGNKNLE 
FDQLIQNISPEINDILSIEEMAEDEVKNKILRLITKEASLLTDKGSK 
DKSVVTELWKFEDKDRFARKRVKGRAFSYEFNRLSKEKSKLSNLREY 
QNRIAD I AKRS KAVLGWASTAQFGTDNQF I KDDAARAAS I LEAARKD 
PVFAG I SDNATAQ I ATAWAS ALAD YAAAHKSM PRPE I LAS CHQTLEN 
CLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQL 
GAAT ADACTF I PVTRDQS D I YEVFNVAG S S FG S Y AAGD VLDMQ S VG V 
YSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYVNRI 
KSVVDNGSGSLLHSFTNAAESPRNSMDAPNKAMLESVAAEMMSVSDG 
VMRLPLFLAM I LP VQLGAATADACTF I PVTRDQSD I YEVFNVAG S S F 
GSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFI01EPFEGQ 
NVP I P KG S TN I YV I R I KS WDNG SG S LLH S FTNAAG W ANAMAD YAQA 
HKSMPCREILASCPQTLE^CLIESTRNSMDATNKAMLESVAAEMMSV 
SDG VMRL PL FLAM I L PVQLGAATADACTF I P VTRDQSD I YEVFNVAG 
S S FG S YAAGDVLDMQS VGVYS QLRRRYVLVAS SDGTS KTATFKMED F 
EGQNVPTRKGREAS TLTVL S LLL I TVPAAY I TRLLMLLTLENCL I EP 
PPNSMDPFNKAMLESVPPEMMSVSDGVMRPPLFLEMILPVQLGAATA 
DACTF I PVTRDQSD I YEVFNVAG SSFGS YAAGDVLDMQS VGVYS QLR 
RRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGSTNIYVNRIKSWD 
•NGSGSLLHSFTNAAAEMISVSDGVMRLPLFLAMILPVQLGAATADAC 
TFIPVTRDQSDIYEVFNVAGSSFGS YAAGDVLDMQS VAWASALADYA 
AAHKSMPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAEMM 
SVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNV 
AGSSFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKPATFKME 
DFEGQNVPIPKGSTNIYVNRIKSWDNGSGSLLHSFTNAAGEQITVT 
CSLNYNIGQIARRFQSANKALVAMKLGAATVDACPFIPVTRDQSDIY 
EVFNVAGS S FG S YAAGDVLDMQS VGVYS QLRRRYVLVAS SDGTSKTA 
TFKMEDFEGQNVPIRQGRENIYVNRIKSWDNGSGSLIHSFTNAAES 
TRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGAATA 
DACTF I PVTRDQS D I YEVFNVAG S S FG S YAAGDVLDMQS VGVYS QLR 
RRYVLVASSDGTSKTATFKMEDFEGQNVPIRKAHKSIPRPEILASCH 
QTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRIiPLFLAMI 
LPVQLGAATADACTF I PVTRDQSDI YEVFNVAGSS FGS YAAGDVLDM 
QS VGVYS QLRRRYVLVAS SDGTS KTATFKMEDFEGQNVP I RQGRTNI 
YVNR I KS WDNGSGS LHHS FTNAAGEQ ITVTCSLNYNIGQI ALSDGT 
SKTATFKMEDFEGQNVPIRQGSDGIYVNRIKSWDNGSGSLKHSFTN 
AAATANACTF I PVTRDQS D I YEVFNVAG S S FGSYAAGDVLDMQSVGS 
EDIYVNRIKSVVDNGSGSIIHSFTNAAGEQITVTRSLNYNIAMILHV 
QLRAATADACTF I PVTRDQSD I YE VFNVAGS S FG S YAAGDVLDMQS V 

vtkilrpgsreisglvdhtggrqialigfpptggmaiplkktklgnl 
fqssgaatadactfipVtrdqsdiyevfnvagssfgsyaagdvldmq 
svgvysqlrrryvlheimsvsdvvrrlplffamillvqlgpatadac 

TF I PVTRDQSD I YEVFNVAGSS FGS YAAGDVLDMQS VGVYSQLRRRY 
VLVASSDGTSTPATFKMEDFEGQNVPPPQAWKPINKAKVGICSHQKI 
MSVFDGVMRLPLFLAMILPVQLGPATVNACTFIPVTRDQSDIYEVFN 
VAGS S FGS YAAGDVLDMQS VGVYS QLRRRYVLVAS SDGTS KTATFKM 
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EDFEGQNVPIRKGRTOIYVSRIKSVVDNGSGSLLHSFTNAAGEQITV 
TCSLNYNIGQIALSFYKAPEKGTEIAIERGAAGKIPPLAHESMMSAP 
NQILYTLGQVKLSVIRLQHIKSKDASDKFWPASLTPWKTCVIESTRD 
SMYATNKAMLKSVAAEMMSVSDGVMRLPLFLANILPVQIjGAATDDAC 

tf i pvtrdqsd i yevfnvag s s fg s yaagdvldmqs vgvys qlrrr y 
vlvassdgtsktatfkmedfegqnvpirkayftrllmllhiilcrvg 
kfwlpapqtlkncliestgnsmdatnkamlesvaaqimsvshgvmrl 
plflamimpvqlraatadactfipvtrdqsdiyevfnvagssfgsya 
agdvlhmqs vgvysqlrrryvlvas sdgts ktatfkmedfegqnvp i 
rkgrnniyvsriksvtongsgsllhsftnaaesprnsmdatnktmle 
svaaemmsvsdgvmrlplflamilpvqlgaatddactf i pvtrdqsd 
i yevfnvag s s fgs yaagdvldmqs vgvysqlrrryvlvas sdgts k 
tatfkmedfegqnvpirkgrtniyvnrikswdngsgsllhsftnaa 
estrnsmnalnkamlesvaaeimsvsdgvmrlplflamilpvqlgaa 
tadactf i pvtrdqsd i yevfnvags s fgs yaagdvldmqs vgvysq 
lrrryvlvas sdgts ktatfkmedfegqnvp i rkgrtn i yvnr i ksv 
vdngsgsllhsftnaagssfgsyaagdvldmqsvgvysqlrrryvlv 
assdgtsktatfkmedfegqnvpipkgrnniyvnrikswdngsgsl 
lhsftnaaawasaxadyaaahksmprpeilascpqtlencliestrn 
smdatnkamlesvaaemmsvsdgvmrlplflamilpvqlgaatadac 
tfipvtrdqsdiyevfnvagss fgs yaagdvldmqs vgvysqlrrry 
vlvas sdgts ktatfkmedfegqnvp i rkgrtniyvnr i ksvvdngs 
gs llhs ftnaaawas7vlad yapahnlmprpe i las chqtlencl i e s 
trn smdatnkamles vaaemm svs dgvmrl pl flam i lpvqlgaata 

DACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGDVLDMQS VGVYSQLR 
RRYVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRNNIYVNR I KS WD 
NGSGSLLHSFTNAAGSSFGSNAAGDVLDMQSVGVYSQLRRRYVLVAS 
SDGTS KTATFKMEDFEGQNVP I RKGRTN I YVNR I KSVVDNGSGSLLH 
SFTNAAAWASARVDYPEAHKFMPRPKILASGHQTLKTCLIESTRNSM 
DATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGAATADACTF 
I PVTRDQSD I YEVFNVAGSS FGS YAAGDVLDMQSVGVYSQLRRRYVL 
VAS SDGTS KTATFKMEDFEGQNVP IRKGRTN I YVNR I KS WDNGSG S 
LLHSFTNAAAWASARAEYPAAHKSMPRPEILASATQTLENCLIESTR 
NSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGAATADA 
CTF I PVTRDQSD I YEVFNVAGSS FGS YAAGDVLDMQS VGVYS QLRRR 
YVL VAS SDGTS KTATFKMEDFEGQNVP I RKGRNN I YVNR I KSWDNG 
SGS LLHS FTNAAE STRNSMDATNKAMLKS VAAKMMSVSDGVMRLPLF 
LAM ILPVQLGAATADACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGD 
VLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKG 
RTNIYVNRIKSWDNGSGSLLHS FTNAAE STRNSMDATNKAMLESVA 
AEMMS VSDGVMRLPLFLAM I LPVQLGAATADACT F I PVTRDQSD I YE 
VFNVAGS S FGS YAAGDVLDMQS VGVYSQLRRR YVLVAS SDGTS KTAT 
FKMED FEGQNVP I RKGRTN I YVNR I KSVVDNGS G S LLHS FTNAAAWA 
SALADYAAAHKSMPRPE I LAS CHQTLENCL IESTRNSMDATNKAMLB 
SVAAEMMSVSDGVMRLPLFLAMI LPVQLGAATADACTFI PVTRDQSD 
I YEVFNVAG S S FG S YAAGDVLDMQS VGVYS QLRRR YVL VAS SDGTS K 
TATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAA 
AWAS ALAD YAAAHKSMRRPE I LAS CHQTLENCL I E S TRN S MDATNKA 
MLESVAAEMMSVSDGVMRLPLFLAMI LPVQLGAATADACTFI PVTRD 
QSDI YEVFNVAGS S FGS YAAGDVLDMQS VGVYS QLRRRYVLVAS SDG 
TS KTATFKMEDFEGQNVP I RKGRNNIYVNR I KS WDNGS GSLLHSFT 
NAAAHKSMPRPEILASCHQTLETCLIESTRNSMDATNKAMLESVAPE 
MMSVSDGVMRLPL FLAM ILPVQLGAATADACTF I PVTRDQSD I YEVF 
NVAGS S FG S YAAGDVLDMQS VGVY S QLRRRYVLVAS S DGTS KTATFK 
MEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAGSSFG 
SYAAGNVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQN 
VP I RKGRKN I YVNR I KS WDNGSG S LLHS FTNAAVG AS Y PLMRVP S F 
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RLRVTSPTSINQLRRRYVLVAS SDGTS KTATFKMEDFEGQNVP I PKG 
RKN I YVNR I KS WDNG SG S LLHS FTNAAG S S FGS YAAGDVLDMQS VG 
VYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIPKGRKNIYVNR 
I KS WDNG SGS LLHS FTNAAATDDACTF I PVTRDQS D I YEVFNVAGS 
SFGSYAAGDVLDMQSVGWSQLRRRYVLVASSDGTSKTATFKMEDFE 
GQNVPIRKGRKNIYVNRIKSWDNGSGSLLHSFTNAAAEMMSVSDGV 
MRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNVAGSSFG 
SYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQN 
VPIRKGRNNIYVNRIKSWDNGSGSLLHSFTNAAAWASALADYAAAH 
KSMPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAEMMSVS 
DGVMRLPLFLAMILPVQLGAATADACTF I PVTRDQSD I YEVFNVAGS 
S FGS YAAGDVLDMQS VGVYSQLRLRYVLVASSDGTS KTATFKMEDFE 
GQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAAWASALADYA 
PAHKSMPRPEIIASCHQTLENCLIESTRNSMDATNKAMLESVAAEMM 
SVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNV 
AGS S FGS YAAGDVLDMQS VGVYSQLRRR YVLVAS SDGTS KTATFKME 
DFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAATADACT 
F I PVTRDQSD I YEVFNVAGS S FG S YAAGDVLDMQS VGVYSQLRRR YV 
LVASSDGTSKTATFKMEDFEGQNVPIPKGRNNIYVNRIKSWDNGSG 
S LLHS FTNAAGS S FG S YAAGDVLDMQS VGVYS QLRRR YVLVAS SDGT 
SKTATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTN 
AAAKMMSVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDI 
YEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKT 
ATFKMEDFEGQNVP I PKAWAS ALAD YAAAHKFMPRP E I LAS CHQTLE 
NCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQ 
LGAATADACTFI PVTRDQSD I YEVFNVAGS S FGS YAAGDVLDMQSVG 
VYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYVNR 
IKSWDNGSGSLLHSFTNAAGEQITVTCSLNYNIGQIALSVSKAPDK 
GTEIAIERGDSTXLGTATADACTFIPVTRDQSNIYEVFNVAGSSFGS 
YAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNV 
P I PKAEMMS VSDG VMRLPL FLAM I LP VQLGAATADACT F I PVTRDQS 
D I YEVFNVAGS S FGS YAAGDVLDMQS VGVYS QLRRR YVLVAS SDGTS 
KTATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNA 
AGS S FGS YAVGNVLDMQS VGVYS QLRRRYVLVAS SDGTS KTATFKME 
DFEGQNVPIPKGRKNIYVNRIKSWDNGSGSLLHSFTNAAAWASALA 
D YAAAHKS I PRPE I LAS CHQTLENCL I ES TRN SMD ATNKAMLES VAA 
EMMS VSDGVMRLPLFLAM I L PVQLGAATADACTF I PVTRDQSD I YE V 
FNVAGS S FGS YAAGDVLDMQS VGVY SQLRRR YVLVAS SDGTS KTATF 
KMEDFEGQNVP I PKGRTNINVNRIKSWDNGSGSLLHS FTNAAGS SF 
GS YAAGDVLDMQS VGVYSQLRRR YVLVAS SDGTS KTATFKMEDFEGQ 
NVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAESTRKSMDATNK 
PMLE S VAAKMM S VSDGVMRLPL FLAM I L PVQLGAATADACTF I PVTR 
DQSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSD 
GTS KTATFKMEDFEGQNVP I PKGRKN I YVNR I KS WDNG SGS LLHS F 
TNTU^AEMMSVSDGVMRPPLFLAM ILLVQLGAATDDACTF I PVTRDQS 
D I YE VFNVAGS S FGS YAAGDVLDMQ S VGVY S QLRRRYVLVAS S DGTS 
KTAT FKMEDFEGQNVP I P KAHKSM PRRE I LAS CHQTLENCL I E S TRN 
SMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGAATADAC 
TFIPVTRDQSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLRRRY 
VLVAS SDGTS KTATFKMEDFEGQNVP I RKGRTN I YVTR I KSWDNGS 
GSLLHSFTNAAAM I PHVQLGAATADACTF I PVTRDQSD I YEVFNVAG 
S SFGS YAAGDVLHMQSVGVYSQLRRRYVLVAS SDGTS KTATFKMEDF 
EGQNVPIRKGRNNIYVSRIKSWDNGSGSLLHSFTNAAAGPSALADY 
APAHKFMPGRKILGSLPPDAENCLIESTRNSMDATNKAMLESVAAEM 
MS VSDG VMRLPL FLAM I LP VQLGAATADACTF I P VTRDQ SD I YEVFN 
VAGS S FGS YAAGDVLDMQS VGVYS QLRRRYVLVAS SDGTS KTATFKM 
EDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAGEQITV 
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TCSLNYNIGQIALQLCRRYVLVASSDGTSKTATFKMEDFEGQNVPIP 
QG S KNMYVNR I KSWDNGSG S LLHS FTNAAAKD YERFLTVFMRLPL S 
LAMFLPFKLGAATADACP F I RVTRDQS D I YE VFNVAGS S FG S YAAGD 
VLDMQSVGWSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIPQA 
KMLSVSDGVMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEV 
FNVAGS S FGS YAAGD VLDMQS VGVYSQLRRR YVLVAS SDGTS KTATF 
KMEDFEGQNVPIRKAPVDACTFIPVPRDQSDIYEVFNVAGSSFGSYA 
AAWASALADYAAAHKSMRRRKIIASCHQTLENCLIESTRNSMDATNK 
AMLESVAAEMMSVSDGVMRPPLFLAMILPVQLGAATADACTFIPVTR 
DQSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSD 
GTSKTATFKMEDFEGQNVPIRKGRTNIYVSRIKSWDNGSGSLLHSF 
TNAAGS S FGS YAAGDVLDMQS VGVYTQLRRR YVLVAS SDGTS KTATF 
KMEDFEGQNVPIPQGRKNINLGAATADACTFIPVTRDQSDIYEVFNV 
AGS SFGSYAAGDVLDMQSVGVYS QLRRRYVLVAS SDGTS KTATFKME 
DFEGQNVPIRQGRNNINVNR I KS WDNGSGS LLHS FTNAAGEQITVT 
CS LNYNI GQ I ALS S PKRR I KALLGAATADACTF I PVTRDQSDM YEVF 
NVAG S S FGS YAAGDVLDMQS VG VYS QLRRRYVLVAS SDGTS KTATFK 
MEDFEGQNVPIRKGRKNINVNRIKSWDNGSGSLLHSFTNAAATAYA 
CTFIPVTRDQSDIYEVFNVAGSSFGSYAAGDVLNMQSVGVYSQLRRR 
YVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRNNI NVNR I KSWDNG 
SGSLLHSFTNAAAHKSMRRPEILASCHQTLENCLIESTRNSMDATNK 
RMLESVAAEMMSVSDGVMRLPLFLAM I LPVQLGAATADACTF I PVTR 
DQSDIYEVFNVAGSSFGSYAAGDVLDMQSVGVYSQLRRRYVLVASSD 
GTS KTATFKMEDFEGQNVP I RKGRTNI YVNR IKS WDNGSGSLLHS F 
TNAAESTRNSMDATNKAMLLSVASEMMSVSHGVMRLPLFLAMIMPVQ 
LGAATADACTF I PVTRDQSD I YEVFNVAGS S FG S YAAGD VLDMQSVG 
VYS QLRRRYVLVASSDGTS KTATFKMEDFEGQNVP IRKGRNNIYVTR 
IKS WDNGSGS LLHS FTNAAAWATEMMSVSDGVMRLPLFLAM I LPVQ 
LGAATADACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGD VLDMQSVG 
VYSQLRRR YVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRTNI YVNR 
IKSWDNGSGSLLHSFTNAAAEMMSVSDGVMRLPLFLTMILPVQLGA 
ATDDACTFIPVTRDQSDI YE VFNVAGS SFGSYAAGDVLDMQSVGVYS 
QLRRRYVLVAS SDGTS KPATFKMEDFEGQNVP I PKGRNN I NVTR I KS 
WDNGSGS LLHS FTNAAGEQ I TVTRSLNYN IGQ I ALS VS KAPLGAAT 
ADACTF I PVTRDQS D I YEVFNVAGS S FGS YAAGDVLDMQ SVG VYSQL 
RRR YVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRKNI YVNR I KSW 
DNG SGSLLH S FTNAAVARKD P VFGG I S EKPTVQ I ATAW AS ALPD YAP 
AHKSMRRPKILASCHQTLETCLIESTRNSMDATNKAMLESVAAEMMS 
VSDG VMRLPL FLAM I LPVQLGAATADACTF I PVTRDQSD I YEVFNVA 
GS S FGS YAAGDVLDMQS VGVYSQLRRR YVLVAS SDGTS KTATFKMED 
FEGQNVPI RKGRTN I YVNR I KS WDNGSGSLLHS FTNAAAAS KS PGF 
SGIFDNAHGQIRYRVGKWHWLLTPQHINLCRVGKFWLPATQTLENCL 
IESPRNSMDPTNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGA 
ATADACTF I PVTRDQ SD I YEVFNVAGS S FGS YAAGDVLDMQS VGVYS 
QLRRRYVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRTN I YVNR I KS 
WDNGSGS LLHS FTNAAESTRNSMDATNKAMLESVAAEMMSVSDGVM 
RLPLFLAM I LPVQLGAATADACTF I PVTRDQSD I YE VFNVAGS S FG S 
YAAGDVLDMQS VGVYSQLRRR YVLVAS SDGTS KTATFKMEDFEGQNV 
PIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAATADACTFIPVTRD 
QSDIYEVFNVAGSS FGS YAAGDVLDMQS VGVYS QLRRRYVLVASSDG 
TS KTATFKMEDFEGQNVP I RKGRNNIYVNRIKSVVDNGSGSLLHS FT 
NAAAEMMSVSDGVMRLPLFLAM I LPVQLGAATADACTF I PVTRDQSD 
I YEVFNVAGS SFGSYAAGDVLDMQSVGVYS QLRRRYVLVAS SDGTS K 
TATFKMED FEGQNVP I RKGRKN INVNRI KS WDNGS G SLLHS FTNAA 
AWASALADYAAAHKSMPRPENWASCHQTLENCLIESTRNSMDATNKA* 
MLESVAAEMMSVSDGVMRLPLFLAMI LPVQLGAATADACTF I PVTRD 
QS D I YE VFNVAGS S FG S YAAGDVLDMQS VGVY S QLRRRYVLVAS SPG 
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TSKTATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFT 
NAAESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQL 
GAATADACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGDVLDMQS VGV 
YSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYVNRI 
KSWDNGSGSLLHSFTNAAAWASALADYAPAHKSMPRPEILASCHQT 
LKNCLIESTRNSMDATNK3MLESVAAEMMSVSDGVMRLPLFLAMILP 
VQLGAATADACTF I PVTRDQSDI YEVFNVAGSS FGS YAAGDVLDMQS 
VGWSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYV 
NRIKSWDNGSGSLLHSFTNAAAEMMSVSDGVMRLPLFLAMILPVQL 
GAATADACTF I PVTRDQSD I YEVFNVAG S S FGS YAAGDVLDMQS VGV 
YSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNIYVNRI 
KS WDNGSGS LLHS FTNAAGEQ I TVTCSLNYNIGQI ALS FS KAPDKG 
TEIAIERTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLP 
L FLAM IL P VQLGAATADACTF I PVTRDQS D I YEVFNVAGS S FGS YAA 
GDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIR 
KGRTNIYVNRIKSWDNGSGSLLHSFTNAAAEMMSVSDGVMRLPLFL 
AM I LP VQLGAATADACTF IPVTRDQSD I YEVFNVAGS S FGS YAAGDV 
LDMQS VGVY S QLRRRYVLVAS SDGTS KTATFKMED FEGQNVP I RKGR 
NNIYVNRIKSWDNGSGSLLHSFTNAAAEMMSVSDGVMRLPLFLAMI 
LPVQLGAATADACTFI PVTRDQSDI YEVFNVAGSS FGS YAAGDVLDM 
QSVGVYS QLRRRYVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRTNI 
YVNR I KS WDNG SG S LLH S FTNAAAEMMS VSDGVMRLHLFLAM ILP V 
QLGAATADACTF I PVTCDQSD I YEVFNVAGSS FGS YAAGDVLDMQS V 
GVYS QLRRRYVLVAS SDGTS KTATFKMED FEGQNVP I P KGRTNI YVS 
RIKSWDNGSGSLLHSFTNAAAWASALADYAAAHKSMPRPEILASCH 
QMLENCLIESTRNSMYATNKAMLESVAAEMMSVSDGVMRLPLFLAMt 
L PVQLGAATADACTF I PVTRDQSD I YEVFNVAG S S FGS YAAGDVLNM 
QSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRTNI 
YVNRIKSWDNGSGSLLHSFTNAAADMMSVSDDVMRLPLFLAMIMHV 
QLGAAPADACT F I PVTRDQS D I Y EVFNVAG S S FG S Y AAGNVL YMQS V 
GVYSQLRRRYVLVASSDGTS KTAPFKMED FEGQNVP I PKGRNNIYVI 
RIKSWDNGSGSLLHSFTNAAATDDGCTFIRVTRNQSTFYEVFNVAG 
S S FGS YAAGDVLH I QSVGVYS QLRRLYVLVAS SDGTS KTATFKMEDF 
EGQNVPIRKGRTNINVTRIKSWDNGSGSLLHSFTNAAVPHSIKSLT 
VAS S S I G S YAAGD VLDMQ S VGVY S QLRRRYVLVAS SDGTS KTATFKM 
EDFEGQNVPIRKGRTNIYVNRIKSWDNGSGSLLHSFTNAAGSSYGS 
YADGDVLDMQS VG VYSQLRRR YVLVAS SDGTS KTATFKMEDFEGQNV 
P IRKDGDALNMQS VGVYSQLRQPLCAGGKLRWHQQI RNLQDLGAATG 
DACTF I PVTRDQSD I YEVFNVAGSS FGS YAAGDVLDMQS VGVYSQLR 
RRYVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRKN I YVXPSYVACY 
ELVAS SDGTS KTATFKMEVFEGQNVP I PKGRKNI YVNR I KS WDNG S 
GSLLHSFTNAAATADACTFIPVTRDQSDIYEVFNVAGSS FGS YAAGD 
VLHMQSVGVYSQLRRRYVLVAS SDGTS KTATFKI LLHS FTNAAGEQI 
TVTCSLNYNI GQ I ALS FS KAPNKGTE I AI ERE I S QKVTGGXLGAATD 
DACTF I PVTRDQSD I YEVFNVAGSS FGS YAAGDVLDMQS VGVYSQLR 
RRYVLVAS SDGTS KTATFKMEDFEGQNVP I PKGRKNINVTR IKS WD 
NGSGSLLHSFTNAAVWANALANYAQAHNLIPRPKILASCHQKLKTCL 
IKSTRNSMDATNKAMLESVSAEMMSVSDGVMRLPLFLAMILPVQLGA 
ATADACTF I PVTRDQSD I YEVFNVAG S S FGS YAAGD VLDMQS VGVYS 
QLRRRYVLVAS S DGTS KTATFKMED FEGQNVP I RKGRNN I YVNR I KS 
WDNGS GS LLHS FTNAAAEMMS VSDGVMRLPLFLAM I QHVQLGPATG 
NACTFI PVTRDQSDI YEVFNVTGSSFGSYADGNVMNMQS VGVYSQLR 
RC YVLVAS SDGTS KTATFKMEDFEGQNVP I RKGRKN I YVNR I KSVVD 
NGSGSLLHSFTNAAAHKFMPRPEILASCHQTLDNCVIESTRNSMDAT 
NKAMLESVAAEMMSVSDGVMRLPLFLEMILPVQLGAATADACTFIPV 
TRDQSDIYEVFNVAGSS FGS YAAGDVLNMQSVGVYSQLRRR YVLVAS 
SDGTS KTATFKMED FEGQNVP I RKGRTNI YVNR IKS WDNGSGS LLH 
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SFTNAAEMMSVSDGVMRVPLFLAMILPVQLRAATDNACTFIPVTRDQ 
SDIYEVFNVAGSSYGSYVAGDVMDMQSVAWASALVDTPQAHKSMPRP 
KIRASOTQTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRL 
PLFLAM I LPVQLGAATADACTF I PVTRDQSD I YEVFNVAG S S FGS YA 
AGD VLDMQS VGVYS QLRRR YVL VAS SDGTS KTATFKMEDFEGQNVP I 
RKGRTN I YVNR I KSWDNG SGSLLHS FTNAAESTRNSMDATNKAMLE 
SVAAEMMSVSDGVMRL PL FLAM I LPVQLGAATADACTF I PVTRDQSD 
I Y E VFNVAG S S FG S YAAGD VLDMQS VGVYS QLRRR YVL VAS S DGTS K 
TATFKMEDFEGQNVPIRKGRTNIYVNRIKSWDNAYFTRLLMLLREQ 
I TVTCSLNYN I GQ I ALSFSKAPDKGTEIAIERAWASALAEYAAAHKS 
MPRPEILASCHQTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDG 
VMRLPLFLAMILPVQLGAATADACTFIPVTRDQSDIYEVFNVAGSSF 
GSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQ 
NVPIRKGRTNIYWRIKSVVDNGSGSLLHSFTNAAAEMMSVSDGVMR 
L PL FLAM I LPVQLGAATADACTF I PVTRDQSD I YEVFNVAGSS FGS Y 
AAGDVLDMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVP 
IRKGRTNI YVNRI KS VVDNGSGSLLHS FTNAAGEQITVTCSLNYNIG 
QI ALS FS KAPDKGTE I AI ERGS STCFRRLLEWFPEENLWLAWAS ALA 
E Y AAAHKFM PRPKI LAS CHQTLETGL I E S TRNSMD ATNKAMLES VAA 
EMMSVSDGVMRLPLFLAMILPVQLGAATADECTFIPVTRDQSDIYEV 
FNVAG S S FG S YAAGDVLDMQS VGVYS QLRRR YVL VAS S DGTS KTATF 
KMEDFEGQNVPIRKGRNNIYVNRIKSWDNGSGSLLHSFTNAAATAD 
ACTFI PVTRDQSD I YEVFNVAGS S FGS YAAGDVLDMQS VGVYSQLRR 
RYVLVASSDGTSKTATFKMEDFEGQNVPIPKGRNNINVNRIKSVVDN 
GSGSLLHSFTNAAAEMMSVSDGVMRLPLFLAMILPVQLGAATADACT 
F I PVTRDQSDI YEVFNVAGS SFGS YAAGDVLDMQS VGVYSQLRRRYV 
L VAS SDGTS KTATF KMED FEGQNVP I RKGRTN I YVNR I KS WDNGSG 
SLLHS FTNAAGMPLNKPNAG I LSQQKMMRVSAGVMRLPLFLAMI LPV 
QLGAAPVDACTF I PVTRDQSD I YE VFNVAGS S FG S YAAGD VLDMQS V 
GVYSQLRRRYVLVAS SDGTS KTATFKMEDFEGQNVP I PKGRNNIYVN 
RIKS\A/DNGSGSLLHSFTNAAGEQITVTCSLNYNIGQIALSFSKAPD 
KGTEIAIERAEMMSVSDGVMRLPLFLAMILPVQLGAATADACTFIPV 
TRDQSD I YEVFNVAGS S FGS YAAGDVLDMQS VGVYS QLRRR YVLVAS 
SDGTS KTATFKMEDFEGQNVP I RKGRTNI YVNR I KSWDNGSGSLLH 
SFTNAAAHKFIRRPEIRASCHQTLKNCLKESTRNSMDPTNKAMLESV 
AAEMMSVSDGVMRLPL FLAM I LPVQLGAATADACTF I PVTRDQSD I Y 
EVFNVAGSSFG S YAAGDVLDMQS VGVYS QLRRR YVLVAS SDGTS KTA 
TFKMEDFEGQNVPIPKGRNNIYVNRIKSWDNGSGSLLHSFTNAAGE 
QITVTCSLNYNIGQIALSFSKAPEKGTEIAIERESTRNSMNATNKAM 
LESVAPEIMSVSDGVRRPLIFLAMILLVQLGTATADACTFIPVTRDQ 
SDIYKVFNVAGSSFVSYAAGDVLDMQSVGVYSQLRRRYVLVASSDGT 
SKTATFKMEDFEGQNVPIRKGRNNI YVNRI KS VVDNGSGSLLHS FTN 
AADLKNGLIEPTANSMNALNKAMLESVAAEIMSVSDGVMRLPLFLAM 
I LPVQLGAATDDACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGD VLD 
MQS VGVYSQLRRRYVLVAS SDGTS KPATF KMED FEGQNVP I PKGRKN 
INVNRIKSVVDNGSGSLLHSFTNAAAWASALAEYAAAHKFMPRPBIL 
ASCHQTLENCLIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLF 
LAM I LPVQLGAATADACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGD 
VLDMQSVGVYSQLRRRYVLVASS DGTS KTATFKMED FEGQNVP IRKG 
RTNIYVNRIKS VVDNGSGSLLHS FTNAAGEQITVTCSLITyNIGQIAL 
S FS KAPNKGTE I AI ERGVLTKVGAEH INLCG VRKI LAS CHQTLENCL 
IESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLGA 
ATAD ACTF I PVTRDQ S D I YEVFNVAG S S FGS YAAGDVLDMQS VGVYS 
QLRRR YVLVASSDGTS KTATFKMEDFEGQNVP IRKGRNNI YVNRI KS 
VVDNGSGSLLHS FTNAAGEQ I TVTCS LNYNI GQI ALS F S KAPDKGTE 
I AIERGLGAATADACTF I PVTRDQSD I YEVFNVAGS S FGS YAAGDVL 
DMQSVGVYSQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRN 
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NIYVNRIKSWDNGSGSLLHSFTNAAAHKFIRRREFLASCHQTLKNC 
LIESTRNSMDATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLG 
AATADACTF I PVTRDQSD I YEVFNVAG S S FGS YAAGDVLDMQS VGVY 
SQLRRRYVLVASSDGTSKTATFKMEDFEGQNVPIRKGRNNIYVNRIK 
SWDNGSGSLLHSFTNAAAHNFMRVPEILPSCHQTWKTALIKSTPNS 
MNATNKAMLESVAPEMMSVSDGVMRLPLFLPMIWPVQLGAATADACP 
FI PVTRDQSDI YEVFNVAQSS FGS YAAGDVLDMQS VGVYSQLRRRYV 
LVASSDGTSKTATFKMEDFEGQNVPIRKGRTOIYVTRIKSWDNGSG 
SLLHSFTNAAGEQITVTCSLNYNIGQIALSFSKAPDKGTEIAIERDA 
S LNF AQRGQ VH WLTTPP AH KS M PR PE I LAS CHQTLENCL I E S TRNSM 

DATNKAMLESVAAEMMSVSDGVMRLPLFLAMILPVQLG7UVTADACTF 
I PV.TRDOSD I YEVFNVAGS S FGS Y AARDVT .nwin Qunw cnT.'DOD v\rr 

VAS SDGTS KTATFKMEDFEGQNVP IRKGRTNI YVNR T KS WDNGSGS 
LLHS FTNAAGEQ I TVTCSLNYN IGQI ALS FS KAPDKGTE I AIEREST 
RNSMDATNKAMLESVAAE^SVSDG\^RLPLFLAMILPVQLGAATAD 
ACTF I PVTRDQSD I YEVFNVAG S S FGS YAAGDVLDMQS VGVYS QLRR 
RYVLVASSDGTSICTATFKMEDFEGQWPIRKGRTNIYVNRIKSVVDN 

GSGSLLHSFTNAAGEQITVTCSLNYNIGQIALSFSKAPNKGTEIAIB 
RGSSTX* 

6548 

A 

3 

127 

LFHPCQDSQQHH*CVCCRLTGHGAA*VHGPCQAVQTYRASH 

6549 

A 

2 

323 

G VGAGGCLREAAAGHVTGRAPHGRRPS S ARPERPAR PR PWGAATAAR 

NEDSDQDRRGAEAVCLDPSQPPPTKALNGAEPNYHSLPSARTDEQAL 
LFFHPCQDSQQHH 

6550 

A 

2 

94 

WRDLGSPQPPPPGFKRFFCLILPTRCRSPR 

6551 

A 

171 

632 

YVKVREMGLAS KKWNLI SMF YGNSDLMGS KLVHISRNYF I FSRQS FT 
LVGQAGVQWRDLGSPOPLLPGFKRFSCLSLP^I^YPHT'PPHPZVTJTrv 

FLVETGFLHVGQAGLELPTSGDLPASASQSPGITGVSHCTWPIPSHL 
LKKKI KKINIHL 

6552 

A 

3 

325 

TFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTTMYPGIADRMQK 
EITALAPSTMKIKIIAPPERKYSVWIGGSIIxASLSTFQQMWITKQEY 
DEAGPS I VHRKCF 

6553 

A 

2 

171 

LAPSTMKIKIIAPpERKYSVWIGGSILASLSTFQQMWISKQEYDESG 
PS I VHRKCF 

6554 

B 

104 

1164 

MCDEDETTALVCDNC?^f?T.VKafTT?l\r2riniVD'Di\'\7T?DC! T'irr'D DDunm/Mxr 

GMGQKDSYVGDEAQSIOIGILTLKYPIEHGIITNWDDMEKIWHHTFYN 
ELRVAPEEHPTLLTEAPLNPKANREKMTQIMFETF1TVPAMYVAIQAV 
LSLYASGRTTGIVLDSGDGVTHl^PTYEGYAT.PHnTMPT.'nT anpm t 
DYLMKI LTERGYS FVTTAERE I VRD I KEKLC YVALD FENEMATGRHP 
SILPWKRATSCQTGQVITIGNERFRCPETFFQPSFIGMEVGGHFTRP 
PYNSIMKCDIDIQEGPCMPTNVMFGGHPRLYPWEFADPHARKRFTGA 
GTPGTHEESKDKD S FAPRRSGKI LG * 

6555 

A 

1 

308 

LRPWC/GHLRT*AOYHTLRAARLDPNFATA*MFAKATnr:PPPi?R /\ro 

gfptiyf\spannrljhprtseggrdlsdfisylhveatnppviqbv* 
pkkk\kkaqedl 

6556 

A 

2 

589 

airtaxgekfvmqeefsrdgkalerflqdyfdgi^krylksepipes 
1togpvkvvvaenfdeivnne 

ekls\kdpnivia\kmdatan\nvhspyevrgf\pshillskpnkkl 
npk\ kyeggrelr* fllai yqreatkpppviqv* fipkyrrrrhqed 
llkpvaqtptl 

6557 

A 

1 

129 

YDKPHVLVDILVDDSGKESLIGYRPTEDYDLRNCILVGINQME 

6558 

A 

23 

116 

SVLSKPKVKMPHILF7LSCTVSHSIWLIPIM "1 

6559 

A 

1 

1582 

GRGWRAVLGWSRRRSGLEPATVGSSMALLFLLPLVMHGVSRAEMGTA 
DLGPSSVPTPTNVTIESYNMNPIVYWEYQ™ 

NSEWIDACINISHHYCNISDHVGDPSNSLWVRVKARVGQKESAYAKS 
EEFAVCRDGKIGPPKLDIRKEEKQIMID I FHPSVFVNGDEQEVDYDP 
ETTCY I RVYNVYVRMNGS E I Q YKILTQKEDDCDE I QCQLAI PVS SLN 
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SQ Y CVS AEGVLHVWGVTTEKS KEVC ITIFNSSIKGSLWI PWAALLL 
FLVX.SLVFICFYIKKINPLKEKSIILPKSLISWRSATLETKPESKY 
VSLITSYQPFSLEKEWCEEPLSPATVPGMHTEDNPGKVEHTEELSS 
x x__ v v x xdCtn xr uv viro^ \«^a*'Xc#RESSSPxjSSNQSEPGSIALNSY 
HSRNCSESDHSRNGFDTDSSCLESHSSLSDSEFPPNNKGEIKTEGQE 

LITVIKSPPPSFC\YDKP\HVLVDLLV\DDSGKESL\IGYRPT\EDS 
QRNFHEISLSCTQL 

6560 

A 

69 

523 

PLGCASSQS ISASRNTLCTTASSCCPQVIiAHS\ KAABYMTRWKVQQM 
PHSQDRALQSVFCAPFHS * *LVALPTGHR*MTPAQFSECFQATSGGS 

D*DP FLAPS FL\VPGLPVAPGLLLPLGPVHSRATMEEGQATHEELiTV 
FIGLRPGVRGS 

6561 

A 

101 

1469 

RTHPTFPHPGTGPTSAPPSGALEGTAGTITSNEWSSPTSPEGSTASG 
GSQALDKPIDNDGEGVWSPDIEQSFQEALAIYPPCGRRKIILSDEGK 
rn i bKW -bJjl AR Y 1 KxjR TGKTRTRKQ VS SHI Q VLARR KARE I QAKLKDQ 
AAKDKALQSMAAMSSAQIISATAFHSSMRLARGPGRPAVSGFWQGAL 
PGQAETSHDVKPFSQQTYAVQPPLPLPGFESPAGPAPSPSAPPAPPW 
QGR\SVASSKbWMLEFSAFLEQQQDPDTYNKHLFVHIGQSSPSYLRP 
YLEAVDIRQIYI)KFPEKKGGLKDLFERGPSNAFFLVKFWADlaNTNIE 
DEGS S FYGVSSQYES PENM IITCS TKVCS FGKQWEKVETE YARYEN 
GHYSYl^IHRSPLCEYMINFIHKLKHLPEKYMMNSVLENFTILQVVTN 
RDTQETLLCIAYVFEVSASEHGAQHHIYRLVKE 

6562 

A 

1 

1290 

MPAAACRQDKKSGRFYS FTEAEGTVAVTQGKAEGRAAPWSLPS VKKA 
QEKVWGEERRPPALGRNELIARYIKLRTGKTRTRKQVGIiKRRVGVPG 
V VPQHQS APKVS SH I QVLARRKAREI QAKLKDQAAKDKALQSMAAMS 
SAQI I S ATAFHS SMALARGPGRPAVSGFWQGALPGQAGTSHDVKPFS 
QQTYAVQPPLPLPGFESPAGPAPSPSAPPAPPWQGRSVASSKLWMLE 
FSAFLEQQQDPDTYNKHLFVHIGQSSPSYSDPYLEAVDIRQIYDKFP 
xir_vLAjXjf_jjjr t_KVj_'c>XSl/_r r JjVKr wVRGREGTSQRQKAGPGLBAKTQA 
YQLIPPQKPPPAEPERTLAVGHPTCRFVGEENGLKKPIDLPKFPQQL 

QSRKASWRRKYLSHPREKDSCRSEGHAHIMRDWVSELRVTRRPGKTA 
EVQVLLL 

6563 

A 

604 

1047 

SNNRTDN PTS VAYLS KE TD WAKG WPHCLWVWAVA I LVX»EAI KI I Q 

GKDFTVWTSHDVNGIIXSAKGNSWLSDKRx.LRYQALLLEGP^ 

VAIjNPAIFLPExX3EPIKHDYQQIVAQTYVT*EDLLEVPLANPD1^LY 
TNGSSFV 

6564 

A 

1 

1410 

METPGKAFRQIDRKTSAASWFSFPRTSVTGLRGGGDKMKGAI^GDPSA 
PPVPAI^RLANSTWSDGSVTVTIWGLPEEPDILVHIIYIW 
_> _> K-biiVP 1 KKKQRKTTDGDWQAAPPATGCRIHWVKP^ 
TFMSSQGIVNTPVSTLHLAQVRPRTHQFWTDFGDHEGTITNRQASSG 
x ivx y/_r vu\jxjj\_h.x ori/___»_ij\.i y VoCF liGKGSLiMTPGLFGVGTVGGQA 
RVQ VF KNVS VS AAKSDL PWS S I WS GRKTS VS AAAS LLGGGE FG PTR I 

HIPFSLSDLKQIKAGLGRFSDDPDRDQEEQAQKEKRDQRKAAALVMA 
LROTNLGGSERTENGAGOSPFPAf!T AT.nnT Tn~^riirTM_rvrrtrA-/T ovo 

w — " - w«u4jr»xoi»_nuyor _>X\_-l_rX__X_\jJLIXj J. y X KAjiVl PyWAYtigKB 

IDAVAKVWPHCLKLVAAVAVFVSEAI KI I QGKDLTVWTTHEE P VG S S 
AQQSQHQPDQPRYTCKLLEDLHLIiFRKEISQTKKAPILFDPVIKREK 

6565 

A 

510 

755 

LDTECPHKGSPSPHQSRGPTLGGRPJ^GGKMAAETOYW-CLPESRTCA 
E EAVHL W PAVLRD PRWGNP GDG A W P C YMP n pp t .p 

6566 

A 

59 

413 

SWPSDKQTLWQRGQKMEQANHPDPTDHMSQLMWT/VI-PQGFRDSPH 

LFGQAI_AX_DIyGHFSSPGTLVLQY^ 

LANQGYK/LSRSKAQLCLQQVKYLCL 

6567 

A 

1 

565 

KKTS KE I S KGPQKPPGYRLRPLQAVGGGEFGPTQVHVHFSLSDLKQI 
KVDLGKISDDPDRYIDVLQGLGQT^ 

Ax_^y_\QEFGDTWYLSEVNDRMTAEERDKFPTEDGEPIKCDCQQ 
TYAAQDD I LEVPLANPDLML YTDGS S FVENG I RRAG YAI VSDVTILE 

6568 

A 

126 

492 

CNNSMTSLQVRLJCVCPRPCRTSM/ LPIRI I *KLPQVCLDIjL* IREGE 
WDMYPCWAKFPCPYSLKGT*PIAWGILWSFGDFFAY/IP1_MQQQKH* 
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FSSQNTRRNEEGQI *WPLLMHSQKPAPLPVV 

6569 

A 

1 

342 

FLFAFEDLTDHTSQLTWTILPQGFRDSPHLFGQALAQDLGHFSSPGT 
L VLQYVDDLLLATS S EAS CQQATLDFLS FLANQG YEAS R S KAQLCLQ 
QDRGYQATDDLTNGTPNELN 

6570 

A 

55 

219 

LGNKHLLGSIDPRGSWVTGEYIFLRPPIAA*GRQ*DFLPFPELWWTS 
LNNSWAFS 

6571 

A 

339 

1371 

ASHPLRGLLCGPSLPNEPCPLLHGTQSHRPLKG*GVRAHSTGLAGSS 
TCRPLRDPLGEAS WAPES AQGL * I HQS ALC I * LKXiKPAGQRAKLGDR 
LDIPDRRKSSTILGRAGDLQPAMPEPPTLSVGSCAAOASPMNPAPrc; 
TAPS P I DHSRAEECGR I ARDWQAAPPAG PCG I HWVKPAG LLSLLRDC 
KYTNRHS VS S S RFANTNQHP AS SSG FVNAP I DTLYLANLVGTWRTFV 
ASSGIVWAPISTLSKQTTWLYQSAGCGKESTQASGAQTGGTSFKSLE 
TTG SIS EAS S A KHCTEL FS R FTTFN PD S VP S DG WGDTAGTVW PG VL 
KGEPCHLGTCYRCVLDPHPTPSESDTIQGIHVQVCYMSTLHDADVCD 
TNDPVTHNTKKYIYSTEIFTSNNPELRSEDETVFRALEKWKTSEQTI 
GEMDFYICNDPHPDSALYQNGLSKMQDTVSLSVFSPSVAA 

6572 

A 

749 

1003 

QLKKGTNSLLVSKPSPVWIPTGTLTQIMGTGVANIC*PVF*KD*GEL 
GKMNYAMMSTITQGKEENPAFLKWLWEALRKYTPLSP 

6573 

A 

1 

1632 

WCPEFVPSDVRMCPEFLPSVGFWSLTSGVKLQTFTLDTKVLHVPF 

S LSDLKQ I KADLGKFSDDPDR YKDVLQGLGQTFDLAWRD VMLLIJDQ I 

IJVFIEI^AALAAAREFGDTWYLSQVNDPJ^TAKEPJDKFLTGQQAIPTM 

DPHWDFDSDHGDWSHKHIiLTCVLEGLKGIRKKPMNYSMMSTITQGQE 

EKPSAFLERLREALRKYTPLSPESLEGOLILKDKFTTO^AAnTPPnT. 

QKQALGPEQNIiQELLNLATLVFYNRDQEEQAQKEKiro 

LRQTNLGGS ERTE YGAGQS PRRD I LAKAGA 1 1 YMNMGNKLP I CCPLL 

EEGINPEWAiERQFGRAKNARPVQIRLKDPTTFPYQRQYPLRLEAH 

KALQDIVKHLKPQGLVRKCSSPCNTPVLGVQKPNGQWRLVQDLRLIN 

EAVIPVYLVVANPYTLI1SQIPEEAEWFMVLDI1KDAFFCIPLHSGSQF 

LFAFEDPTDHMSQLMWMVLPQGFRDSPHLFGEALAQDLGHFSSPGTL 

VLQYMDDLLLATSWEASCQQATLELSS 

6574 

A 

307 

543 

DENIHPLNPLSKQDTPTAVGNWTMTAVATSCWIGVKKGPCSCSILQR 
GALGREFVPFFSCHS I I YLTKIPGISKLLGCS 

6575 

A 

1 

1149 

MDGTGDHYVKRKKPDTERQTSHLLTSLWDLKIKTIELVEIESRRMAT 
KAWEGPAMEHTCHLiASSYQTSVETAXjGHPIEEEAAEAEPKQNSGINV 
VLAVISELLRLPPVSIIQSNEARVAGPHDQNHHSFCSRGHLWPLYSG 
GNWGLLFGLSELKS I PWEKLSHVYMSRGLSQGPMGDERYMREKRQGC 
SKGSEEMGASYSGGDFENLCVGTLYLANIiVGTWRTFASSSGIVNAPI 
SILSKRTNQLSVKWTNQLSVKWTNQQDSTLPLPJ\ATLTAKVCSFTPE 
ARETTNPPGGMSNSPJtAAJjRVVTLTVKVCS FI PEPVRPGTHQKEETL ! 
NTSEHQKEQTLDTPPLRTVTLTPRVHGFILEVSETKNPP I PDTPALW 
EAEAGGS 

6576 

A 

119 

409 

GDICHHLGLTPVGSHSLLSCSR*QVA*VGAVTAATIGTGILLQQLAF 

LVCNWLLLSGSSENFPRSALICFKSEREKGTCIQVGPNS PPPTACKG 
HN 

6577 

A 

1 

2433 

MPQVDHLVFVVHGIGPVCDLRFRSIIECGLRRIRKKPMNTSMMSTIT 
QRKEENPS VFLEWLWEALRKYTPLS PDS LEGQL I LKDKF I TQSATD I 
SRKIjQNRALVPEQWLEVLLNLATLVFYTPJDQEEQAKKEKRDQRKAAA 
LVMALKQTNLGGSERTENGAGQSPGMAFYQCGLQGHLRKDCPTRNKL 
PPRPCPLCRGNHWKATAPEDKEDGEPIKHDCQQIIVQSYAPQEDRLE 
VP LANPDLNL YTDGS S FVENG I RRAG Y A I VSD VT I LEMR KCAGQERD 
RFPVSGWPGTRRRVSDGGPQPWPRENLTVAAAAPPLPHRALTYFSP 
PR^RSSASMCPSSQSPRPPLLRPPCSYRELSRVLDIHSYPGIPQIFA 
EYPLLVRQVLAGNANKLNSGPLLGYSSLSPTRRLLGYEDSTDVGEED 
S FLGQTS IHTS APQTFS YFS QVS S S SDPFGNIGQS PLTTAATSVGQS 
GFPKPLTALPFTTGSQDVSNAFSPSISKAQPGAPPSSLMGINSYLPS 
QP^SLPPSYFGNQPQGIPQPGYNPYRHTPGSSRANPYIAPPQLQQCQ 
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TPGPPAHPPPSGPPVQMYQMPPGSLPPVPSSVQSPAQQQVPARPGAP 
SVQVPSPFLLQNQYEPVQPHWFYCKEVEYKQLWMPFSVFDSLNLEEI 
YNSVQPDPESWLGTDGGRYDVYLYDRIRKAAYWEEEPAEVRRCTWF 
YKGDTDSRFIPYTEEFSEKLEVPSLNEVPLHYIiKPNSFVKFRCMIQD 
MFDPEFYMGVYETVNQNTKAHPQQELDLNSPRNTTLERQTFYCVPVP 
GESTWVKEISFI 

6578 

B 

97 

1065 

MHAVHTSLLVERTILTTTKERGSTLQYPLRPGAHKGLQDIVKRFKAQ 
ALI RKCS S PCNTPI LGVQKPNGQWRLVQDLRL INEAVI PL YP VVPNP 
YNLLSQIPEEAEWFTVLDLKDAFFCIPLHSDSQFLFAFEDPTDHTSQ 
LTWTVIiPPGFRJDSPPLFGQAIiAQDLGHFS S PGTLVLQYVDDLLLATS 
SEASCQQATLDLLNFLANQGYKTHSRDSDAPVGFTDRHLVATEKIPR 
GREPLSSLAVYTRGRPTRTGARRTIVSRIRRDFRFFYRYLAQREPPL 
YQQSLIiIHLQLDSQPALSWIGKVPNWELTPNFGTAAQV* 

6579 

A 

1 

2403 

MSMGFLVEESEPVWRGLMVMSAIEKLLRQVSGYLLAQGVSRNVVHE 
PGVGS S QVCSVPCPTVAKLNSGAKYRAPVGQLKATS AATRLKTRVSG 
FLGRGSLTSPDSLELGALVCLEPASTFPVLLGRAKGRQRGKPFSSGV 
RTTRGP I SLPPYPSLP I APRP INDNPPL I S PAQKE I CEE I SKGQQKP 
LGYRLCPLQAVGGGEFGPTWVHVPFSLSDLKQIKADLGNFSDDPDRY 
INVLQGLGQIFYLIWRDVMLLLDQTIJVFNEKNAALAAAQEFGDTWYIj 
SQVNDRMTAEERGKIPNGQQAVPSMDPHWDLDSDHGDCSCKHIiLTCV 
LEGLRRIRK3<DQEEQAQKEKQDQRKTAALVMALRQTNFGGSERTENG 
AGQSPERDTLAKAGAIIYMNMGNKIiPICCPLPEEGINPEVWALEGQF 
GKAKNAHPVQIRLKDPTTFPYQRQYPLRPEAHKGLKDIVKYLKAQAL 
VKKCSSPCNSPILGVQKPNSQWRLVQDLRLINEAVIPLYPWPNPYI 
LLSQIPEEAEWFTVLDLKDAFFCIPPHSDSQFLFAFEDPTDHMSQLM 
WTVLPQGFKDSPHLFGQALAQDLGYFSSPGTLVLQYVDDLLLATSSE 
ASCQQATLDLLNFLANQGYKRLLTMDPRTQRDSQAPLYSNQGENQRD 
KKS KKEVQREEETDKEGVRKRERERQRRSQRERKRDGTS VSTLAVLE 
EPFSLPMRYGGPYLGLAKTRADSLCLWGGVEGEARVGAGASCSTRRP 
VRVPGEHRLGRPCTGHGQPAPAGLDRGMCSFWAARVPGLGGAKSHGD 
CH 

6580 

A 

1 

2838 

MSGTLKGMLPKRDLPIYPIYPDLCPLGPNACETNFLSPLFSEKEISK 
EISKGPQKPLGYRLCPLQAIGAGEFGPTRVHVPFSLSDLKQIKADLG 
KFSDDPDRHIDVLQGLGQTFDLAWRDVMLLLDQTIJVFYEKNVALAAA 
REFGDTWYLSQVCLTIEGQEIDFLLDSGTAFSVLISCPGQLSSRSVT 
IRGILGQPVTRPEAHKGLQNTVKHLKAQGLVRKCSSPGNTPILGVQK 
PNRQCRLVQDLRLINEAVI PLYPWPNP YTLLSQ I PEEAEWFMVLDL 
KDAFFC I PLHSDSQFLFAFEDPTDHTS QLMWMVLPQGFRD S PHLFGQ 
ALARDLGHFSSLGTLVIiQYVDDLLI^TSSEAIjCQQATLDLLNFLANQ 
GYKVSRSKAQLCLQQDGEPIEHDCQQIIVQTYATRDDLLDVPLTIPD 
LNL YTDGS S WENG I QRTG YT I VSDVTI LE S PVAA I LLLLAFGAC I F 
NLL VKFVS SRI EAFELQMVLQM EPQMS S TNNF YRGPMDRPAGTS PGX» 
ES S PLKDTTTAGAVTLTTE VCS FTPEPARPRTHQKE ETPNTS EHQKE 
QTPDTLPLRTVTLTAS VHGF I LEVS FDQEPTNSRH I LVTTKGPIil I A 
KWRKTNVS AAVLVS S T I L I S R VQGLLQ VLGQGC FCCC I GERNYSNQQ 
GPGTIAGSWAGWSTAASNATLAAAPKFGDTWYLSQVNDRMTAEERD 
KFPTGQQAIPSMDLHWDLNSDHGDWSRKHL^ 

YSMLSTVTQGKEKNPSAFLEQLWGALRKYTPLSPKSLEGQLILKDKF 
ITQSAADIRRKLQKQALGPEQNLEALLNLATSVFYNRDQEEQAQKEK 
RDQRKAAALVMALRQTNLGGSERTENGAGQ S PERAG I ALGDL I QTRG 

KTPQPVAYLSKE IDAVAKVWPHCLKLVAAVAVFVS EA I KI IQGKDLT 
WTTHE 

6581 

A 

1908 

4698 

SNDRTEDDCGKHPFMS S PP \ TEPWVCL I IEGQB IDFLLD TGTTFSVL 
IPCLGRLSSRSVTIQGILGQPVTRYFSHLLSCNWETLLFSHAFLVMP 
ESPTPLLGRDILAKAGAIISMKTGNKLPICCPLLEGINPEVWALEGQ 
FGRAKNAHPLQIRLKDPISFPYQRQYPLRPEAHKGLQDIVKHLKAQD 
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S VRKCS S PCNTP I LGVQKLNS QWRLVQDLRL INEAVI PLY P WRNP Y 
TIiLSQVPEEAEWFTVLDLKDAFFCIPLHSDSQFLFAFEDPTDHTSQL 
MWMVLPQRFRDS PHLFGQAQAQDLGHFS S PGTLVLQ YS E I AKTLYTL 
IKEMERANTHLVEWEPEAETAFETLKQALVQAPALSLPTGQNFALYV 
I ERAG I ALGVLTQTHRTTPQPVAYLS KE I DWAKG W PHCLRTWAVA 
VL VS E A I K 1 1 QGKD FTVWTTHD VNG I LG AKGGLW P S DNCLLR YQ ALL 
LEEPVLQIHMCIAINPATFLPEDGEPIKHDCQQIIVQTYATQDDLLE 
VPLANPDFNLYNSGSSFVENGIQRAGYAIVSDVTVLGSKPLSPGTST 
QLAELVALTRALELGKGKR I NVYTDS KYAYL I LHAHAA I WKERE FLT 
SGGTPIKYHKEIMELLHTVQKPKEVAVLHCQSHQKGSARRIFCLTRI 
PGTRVCKNPHVSLSGISEWPLRDHTPLCFRPKALLAWVHEEICSMGC 
KD PG WNS LKVS EEDRKMQE S LETS RDLLNG FD QNVDNDMD S KVQAEG 
ASDGDEELIGKWRKGLSMLWKILSIENMMNVWKDGTIKDAIIVIEKA 
VKAI KPETINS CQRNLCPD WHDFTRFTTEP I KD I KE I GDLA I KLG I 
FESSNSTQFLNQFVPDDRLSPHNKLLKFYKPEFYSSYPLVSCLDHYV 
DQVGARISMSFCVTTVTEASGSGSWTCGVSQLKLSYAD 

6582 

A 

1529 

2945 | 

TWQPQCSI IGTKRNRPKKEKRDQRKATALVTALRQTNLGGSERTENR 
AGQS PERBTLAKAGAI I YMNMGNKLP I CC PLPEEGINPEVWALEGQF 
GKAKNAHPVQIRLKDPTTFPYQRQYPIJIPEAHKGLKDIVKYLKAQ7VL 
VKKCSSPCNSPILGVQKPNSQWRLVQDLRLINEAVIPLYPWPNPYI 
LLSQIPEEAEWFTVLDLKDAFFCIPPHSDSQFLFAFEDPTDHMSQLM 
WTVLPQGFKDSPHLFGQALAQDLGYFSSPGTLVLQYVDDLLLATSSE 
ASCQQATLDLLNFLANQGYKRLLTMDPRTQRDSQAPLYSNQGENQRD 
KKSKKEVQREEETDKEGVRKRERERQRRSQRERKRDGTSVSTLAVLE 
EPFSLPMRYGGPYLGLAKTRADSLCLWGGVEGEARVGAGASCSTRRP 
VRVPGEHRLGRPCTGHGQPAPAGLDRGMCSFWAARVPGLGGAKSHGD 
CH 

6583 

A 

3 

154 

KSEGNEKSEDAEVIJ5ATPSGIMKTDNTEVLLSADFTGAIKVFVNKRK 
NVS 

6584 

A 

3 

4 25 

GDS CAVC I EL YKPNDLVR I LTCNH I FHKTCVD P WLLEHRTCPMCKCD 
I LKALG I E VDVEDGS VS LQVPVSNE I SNS AS SHEEDNRS ETAS SG YA 
S VQGTDEPPLEEHVQSTNES LQLVNHEGKFCGQWDV I PS CLTNPTL 

6585 

A 

3 

223 

RLSQSVTQAGVQWHDLSSLQPLPSKSKRFSCLSHPNSWDYRHTPQHP 
ANFCI FFFCRDG VS PCWPCWS CPGLK 

6586 

A 

3 

330 

MWEEVTCPICLDPFVEPVSIECGHSFCQECISQVGKGGGSVCPVCRQ 
RFLLKNLRPNRQLANMVNNLKEISQEAKEGTQGERCAVHGERLHQTH 
LDQPKTCFCLLYGT 

6587 

A 

50 

1740 

RS PHCCLTAHLTMASAARLTMMWEEVTC P I CLD P FVE P VS I ECGHS F 
CQECIS QVGKGGGSVCAVCRQRFLLKtJLRPNRQLANMVNNLKE I SQE 
AREGTQGERCAVHGERLHLFCEKDGKALCWVCAQSRKHRDHAMVPLE 
EAAQEYQVRPKRHLEKLQVALGELRRKQEIJVEKLEVEIAIKRADWKK 
TVETQKSRIHAEFVQQKNFLVEEEQRQLQELEKDEREQLRILGEKEA 
KLAQQSQALQELISELDRRCHSSALELLQEVIIVLERSESWNLKDLD 
I TS PELRS VCHVPGLKKMLRTCAVH I TLD PDTANPWL I L SEDRRQVR 
LGDTQQSIPGNEERFDSYPMVLGAQHFHSGKHYWEVDVTGKEAWDLG 
VCRDSVRRKGHFLLSSKSGFWTIWLWNKQKYEAGTYPQ\TPSHLQVL 
PCQVGIFLDYEAGMVSFYNITD\HGSLIYSFSECAFTGPLRPFFSP\ 
GINDGGKNTAPSNPLVPLNIGSQGSTDYWMAFSGTLPLSPHWAPASS 
ATKP WPLFPHGTLEPPLS S AEASRDPQQAS FS REVTS PINI PAPDGV 
V 

6588 

A 

2 

472 

LKAVHRCTGPEFPGRQRLRHEQGRLRGRRCPPGRLPLHREALFQPSF 
LGMEACGIHETTFNSiMKGDVDIRKDLYANTVLSGGTTMYPGIADRM 
QKEITALAPSTMKIKIIAPPERKYSVWIGGYILGWLSTFHQMWISKQ 
EYDESGPFIVHTTCF | 

6589 

A 

1 

239 

QCCSGGTHGNPAIGDR\MQKQILPWAPAQMKIRFMAPPERKYSVWIA 
APILASLSTSSRMWISKQEYDESGPSIVHRKCF 
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6590 

A 

1 

834 

MDDDIAALWDNS SGMCKAGFAGDDVPWAI FPS IVGCTRHQVMMLHV 
APEEHPVLLTEAPLtfPKANCEKMTQIMFETFNTPAMYVAIQAMLSLY 
ASVRTAHVQGLHTTAERE IVRD I KKLC YVALD FEREMAMVAS SSSLE 
KS YKLLDGQ VI T IGNERFHCPEALFQP S FLGMES CG IHKTTFNYIMK 
CDVDICKDLYTNTVLSGGTTMYPGITNRMQKKITALAPSMMKIKIIA 
PPERKYSWISGSILASLSTFQQMWISKQKYDQSGPSIVHECF 

6591 

A 

1 

702 

LHVAPEEHPVLLTEAPLNPKANCEKMTQIMFETFNTPAMYVAIQAML 

SLYASVRTAHVQGLHTTAEREIVRDIKKLCYVALDFEREMAMVASSS 

SLEKSYKLLDGQVITIGNERFHCPEALFQPSFLGMESCGIHKTTFNY 

IMKCDVDICKDLYTNTVLSGGTTMYPGITNRMQKKITALAP 

1 1 APPERKYSVWISGS ILASLSTFQQMWI SKQKYDQSGPS IVHECF 

6592 

A 

1 

1623 

MVP S AAAAAGMRCL I LKG VDS I GKTERTLQE I G S PCVALAS PELLS S 
RDPVAVDSQGWGFLHSLLYRSFGICCAYHTQCNDHASHVWLKVPVS 
GSTAQLSLSCK2CEKDILHENSTLREEIAMLRLELDTMKHQSQLREKK 
YLEDIESVKKRNDNLLKALQLNELTMDDDTAVLVIDNGSGMCKAGFA 
GDDAPRAVFPSIVGRPRQQGMMGGMHQKESYVGKEAQSKRGILTLKY 
PMEHGIITNWDDMEKIWHHTFYNELRVAPEEHPVLLTEATLNPKANR 
EKMTQIMFETFNTPAMYVAIQAVLSLYTSGRTTGIVMDSGDGVTHTV 
PIYEGNALPHATLRLDLAGRELPDYLMKILTEHGYRFTTMAEREIVR 
DIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQVITIGNERFRC 
PEALFQPCFLGMESCGIHETTFNSIMKSDVDIRKDLYTNTVLSGGTT 
M YPGMAHRMQKE I AALAP SMMKI R 1 1 AP PKRKYS VWVGG S I LAS LST 
FQQMWISKQEYDESGPSIVHRKCL 

6593 

A 

1 

1132 

MEEEIAALVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGM 
GQKDS YVGDEAQS KRG I LTLKYP I EHG I VTNWDDME KI WHHTFYNEL 
RVAPEEHPVLLTEAPLNPKANREKMTQIMFETFOTPAMYVAIQAVLS 
LYASGRTTG I VMDS GDGVTHTVP I YEGYALPHA I LRLDLAGRDLTD Y 
LMKI LTERG Y S FTTTAERE IVRD I KEKLC YVALD FEQEMATAAS SS S 
LEKS YELPDGQVIT I GNERFRCPEALFQPS FLGMES CG I HETTFNS I 
MKCDVDIRKDLYANTVLSGGTTMYPGIADK\MQKEITAX#GP\STMKI 
KIIAPPERKYSVWIGGSILASLSTF\QQMWISKQE\YDESGPSIVHR 
\KCF 

6594 

B 

635 

646 

MCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQGVMV 
GMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNVTODMEKIWHHTFYN 
ELRVAPEEHPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAV 
LS LYASGRTTG I VLDSGDGVTHNVP I YEGYALPHAIMRLDLAGRDLT 
DYLMKI LTERGYS FVTTAERE IVRD I KEKLC YVALD FENEMATAAS S 
SSLEKSYELPDGAGHHHRQRALPLPGDPSFIGMEVGGHFTRPPYNSI 
MKCDIDIQEGPFMPTTFMSGGTHDVPWESLTACRKEITRAGTPAPME 
DQDSFAPAGAQNTRVWIGGS ILASLSTFQQMWITKQEYDEAGPS IVQ 
PQM LLDTLHLQ ARD F SRTTNLLNGGGG * 

6595 

A 

1 

1284 

MGDTPRQFGGARPRSGGALRGCRSRGGGNRRGLCLSRALANGDRRLT 
MDDD I AALWDNG S GMCKAGFAGDDAPRAVF PS I VGRPRHQGVMVGM 
GQKDS YVGDK\AQSKRGILTLKYRIEHGIVTNWEDMEEIWLHTFYSE 
LHVAPEEHPLLLTEASLNPKANHEKVNQIMFETFNTPAMYVAIQALL 
SLYASGRTTGIV>TOSGDGVTHTVPIYGG\YALPHTILHLDLAGPDLT 
DHLMKILTERGYSFTTTAKREIVHDIEENLCYVALDFEQEMAMVASN 
PLLEKS*ELPDGQ\VITIGNE\RFRCPEALFQPSFLGMESCG\IHET 
TF\NSIMK\CDVDIRKDLY\ANTVLSGRTPPCTLGIADRMQKEITAL 
APSTMKMKIIAPPERKYSVWIGGSILASLSTF\QQI\WISKQEYDES 
GPS\IVHRKCF 

6596 

A 

1 

370 

ETESRSVAQAGVQWHNLGSLQPSPSGFKRFSCLSLPSNWDYRHPPSC 
PANFCIFVETRFHHVGQAGLKLLTSGDPPTLAS * IAGITGMSHCAWP 
SLH I WTLLLFAKCPRRWTGQVLLAAI FSQ 

6597 

A 

2408 

2810 

IHTHTHTHTHTI FFLKRS FALVAQARVQWRDLCSLKPPP PEFKQFS C 
LSLPSS*DYRCPPPCPANF/SFFFVFLVETGFHHVGQAGLKLLTSSD 
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S PTLAS QNAG VTGM S HCAW PQSNKYYQNNFTM I QEQQQQKB 

6598 

A 

1 

1157 

APPVPANALGARGEAVRIiQLQGEELRLQB\ESVRLHQINIYLSDRIS 
LHRRLPVRWNPLCKEKKYDYDNLPRTSVI IAFYNEAWSTLLRTVYSV 
LETSPDILLEEVILVDDYSDREHLKERXANEIiSGLPKVRLIRANKRE 
G\LVRARLLGASAARGDVIiTFLDCHCECHEGWLEPLLQRIHEEESAV 
VCPVTDVIDWNTFEYLGNSGEPQIGGFDWRLVF\RWHTVPERERIRM 
QSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFR 
IWQCGGVLETHPCSHVGHVFPKQAPYSRNKALANSVRAAEVWMDEFK 
ELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHVP 
EDRPGFFGMLQ 

6599 

A 

1 

1992 

MG FRHVDQ AGLELLTS G I QHELGHS ERLEATLRS KARG P VTTWQLTS 
SKPAADTLSRWPLGAEVLSPTTTRNEFCQKTCQPGLDPEFLKFRCIC 
RCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEE 
VILVDDYSDRGCSAALVLMDQAGFSPPTHSGQPEDLKGKESHTLPPE 
GTAWKFSTHPIEQSWAWPQHLKERLANELSGLPKVRLIRANKREGL 
VRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRLSWPASLITS 
HQFLGELAFARGQLELAQQERRMDVLGAFRIHEEESAWCPVIDVID 
VINTFEYLGNSGEPQIGGFDWRIiVFTWHTVPERERIRMQSPVDVIRSP 
TMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVXiET 
HPCSHVGHVFPKQAPYSRNKAJ^ANSVRAAEVWMDEFKBLYYHRNPRA 
RL.GIACDECSIKAGWWLEPFGDVTERKQLRDKLQCKDFKWFLETVYP 
ELHVPEDRPGFFGMLQNKGLTD YC FD YNP PDENQ I VGHQVI L YLCHG 
MGQNQFFEYTSQKEIRYNTHQPEGCIAVEAGMDTLIMHLCEETAPEN 
QKFI LQEDGSLFHEQS KKCVQAARKES SDS FVPLLRDCTNSDHQKWF 
FKERMJb 

6600 

A 

111 

426 

RIYKDRAGSPAHSCQHRDPLRTSPRGPHSLPWGQ\AAQSDAETEATH 
MGAPSCLGPAQLPLEPGGRGSHTTSVDSRHDSVEGFRDSRQTESPWN 
DEN*NLLDYGSL 

6601 

A 

1 

2219 

MASILLRSCRGRAPARLPPPPRYTVPRGSPGDPAHLSCASTLGLRNC 
LNVPFGCCTPIHPVYTSSRGDHLGCWALRPECLRIVSRAPWTSTSVG 
FVAVGPQCLPVRGWHSSRPVRDDSVVEKSLKSLKDKNKKLEEGGPVY 
SPPAEWVKKSLGQRVLDELKHYYHGFRLLWIDTKIAARMLWRILNG 
HSLTRRERRQFLRICADLFRLVPFLVFVWPFMEFLLPVAVKLFPNM 
LPSTFETQSIiKEERLKKELRVKLELAKFLQDTIEEMALKNKAAKGSA 
TKDFSVFFQKIRETGERPSNEEIMRFSKLFEDELTLDNLTRPQLVAIi 
CKLLELQS IGTNNFLRFQI/TMRLRS IKADDKLI AEEGVDSLNVKELQ 
AACRARGMRAIiGVTEDRLRGQLKQWIxDLHLHQEIPTSLLILSRAMYI. 
PDTLS PADQLKSTLQTLPE I VAKKAQVKVAE VEGEQVDNKAKLEATL 
QEEAAIQQEHREKELQKRSEVAKDFEPERWAAPQRPGTEPQPEMPD 
TVLQSETLKDTAPVLEGLKEEEITKEEIDILSDACSKLQEQKKSLTK 
EKEELELLKEDVQDYSEDIiQEIRKELSKTGEEKYVEESKASKRLTKR 
VQQMIGQIDGLISQLEMDQQAGKLAPANGMPTGENVISVAELINAMK 
QVKHIPESKLTSLATUOiDENKDGKVNIDDLVICVIELVDKEDVHISTS 
QVAE I VATLELEEKVEEKEKAKEKAEKEVAEVKS 

6602 

A 

1 

221 

LEMESHSVTQAGVQWRDLGSLQPPPPRFEQSSCLSLPSSVfDHRHTPP 
RLAKCLLLNTQNALNQQKQSPRSEEL 

6603 

A 

2 

218 

PCGTWRTPGEGPDEGAGGRAHYHVYLQLKAYLMVAVPRGDLGSRAQE 
AERY I S ECKKRAEEKKLGTERI ERK 

6604 

A 

101 

456 

GWERAHRLVQNYAANGGEAEAGANQGRCGLVRGGEHACRRTAFAAAR 
YTRIWKNSAAICCGPRGGGGGPRPSPLPDTKASSKH71PRDRNDSRPN 
ATPEDPL.PVPTLNPRAFPAPVLCP 

6605 

A 

62 

558 

PSCGLCRGRGTEGQEPWAPRGGGClMDHIiGASLWPQVGSLCLLLAGA 
AWAPPPNLPDPKFESKAALLAARGPEELLCFTERLEDLVCFWEEAAS 
AGVGPGNY S F S YQLEDEPWKLCRLHQAPTARGAVRFWCSLPTADTS S 
F VPLELRVTS SLRRS AIS PCHPHQ 

6606 

A 

2 

178 

QTPMPFLCRPSPAALLAARGPEELLCFTERLEDLVCFLRSMRSFSSF 
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FSSARFLHSLM 

6607 

A 

1 

194 

PARPFVES I ATQYMTNKQDHDRMASQWTKG IRS INGGFTILTFFCRS 
QKRAAYDFEGVREGGSW 

6608 

A 

59 

132 

KAVDKKAAGAGKVTKSAQIAQKAK 

6609 

A 

3 

104 

RNSWVDPPPEAVDKKAAGAGKVSMSAQKAQKAK 

6610 

A 

2 

620 

GFRPGDPRVRETGVLKPGMLVTFAPVNVTGGVKSVEMHHEALSEALP 
GDNVGFNVKNVSVKDVRRGNVAGDSKNDPPMEAAGFTAQVI ILNHPG 
QISAGYAPVLDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGD 
AAIVDMVP\ARPMCVESFSDYPPLGRFAVRDMRQIVA\VGVIKAGDK 
KAAGAGKVTKSAQKAQKAK 

6611 

A 

293 

384 

AAQRHETLTSLNLEKKARLKBEAAMKAKTE 

6612 

A 

39 

543 

QVAMGNLSGLRLAAGSCFRLCERDVSSSQRLT\RSSDWKG\INGFC\ 
TKPTGKVPGAP I PALTNR VP FNTKP\ TDW \ QKKRPL I W SRS PS KMED 
EIPQ\SLSPLEDALNACQRTKMPSERSSLSQWICPDGGMDCI\FMVI 
EGQK\AA\QRHDDFTQA*TLEKKARLERGKQL 

6613 

A 

1 

194 

YEPSCTQNVNHGVLWGYGDLNGKEYWLVKNSWGHNFGEEGYIRMAR 
NKGNHCGIASFPSYPEI 

6614 

A 

71 

1145 

NLRRE PTNS RTLTVGATAGS ITMKRLVCVLLVCS SAVAQLHKDPTLD 
HHWHLWKKTYGKQYKEKNERAVRRLIWEKNLKFVMLHNIjEHSMGMHS 
YDLGMNHLGDMTS EEVMSLMS S LRVPSQWQRN ITYKSNPNRILPDS V 
DWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQN 
LVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKC 
QYDSKYRAATCSKYTELPYGR\*DVLKEAVANKGP\VSVW\DARQP 
S FLPLQESGVYYDPS \ CTQ\NVNHGVLVVGYGDLNGK\ EYWLV*NS W 
GP\NFGEEGYIR/LWARNKGNHCG\ IASFPSYPEI 

6615 

A 

2 

289 

VLKELWPNKGQCLFGVDARfTLPFFLYRSGVYYEPSCTQNVNHGVIiVV 
GYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPE 
I 

6616 

A 

2 

125 

PDQGQYCMKRMPAYSGPGTAPGALDYPAHCTALYGESDLRC 

6617 

A 

9 

106 

QSCSVAQAGVQWHELGSLQSSLGKKSETPSQK 

6618 

A 

2 

505 

THASDCLYRAVSKTVYGDQSLHRELREQTVHYIADHLDHFSPLIEGD 
VGE F 1 1 AAAQDG AW AG Y PELLAMGQMLNVN I HLTTGGRLE S PTVSTM 
IHYLGPEDSLRPSIWLSWLSNGHYDAVFDHSYPNPEYDNWCKQTQVQ 
RKRDEELAKSMAISLSKMYIEQNACS 

6619 

A 

1 

104 

RCSRYLYTLVITDKEKAEKLKQSLPPGLAVKELK 

6620 

A 

1 

109 

KVRCCRYLYTLVITDKEKAEKLKQSLPPGLAVKELK | 

6621 

B 

99 

285 

MPRKIEEIKDFLLTARRKDAKSVKIKKNKDNVKFKVRCSRYIiYTLVI 
TDKEKAEKLKQSLPXX* 

6622 

A 

239 

449 

NCGNQRTSLRHRPTKRMPNSCQDPRKIRTLVKFKVRCSRYLYTLVIT 
DKEKAE\ KLKQSLPPGLAVKELK 

6623 

A 

1 

104 

RCSRYLYTLVITDKEKAEKLKQSLPPGLAVKELK 

6624 

A 

1 

109 

KVRCCRYLYTLVITDKEKAEKLKQSLPPGLAVKELK 

6625 

B 

99 

285 

MPRKI EEIKD FLLTARRKD AKS VKI KKNKDNVKFKVRCS R YLYTLVI 
TDKEKAEKLKQSLPXX * 

6626 

A 

239 

449 

NCGNQRTSLRHRPTKRMPNSCQDPRKIRTLVKFKVRCSRYLYTLVIT 
DKEKAE\ KLKQSLPPGLAVKELK | 

6627 

A 

2 

129 

YGPTHASGAMLRSCAARLRTLGALCLPPVGRRLP\EASRDPS 

6628 

A 

1 

289 

MLRSCARRLRTLGALCQPPPVGRRLPG S EPQPALRS FS S EEVILKDY 
SVPNPSWNKDLRLLFDQFMKKCEDGSWKRLPSYKRTPTERIQDFKTH 
FL 

6629 

A 

6 

418 

RRLMRDFKRLQEDPPVGVSGAPSENNIMQWNAVIFG\TFKLVIEFPE 
EYPNKPPTVRFLSKMFHPNVYADGSICLDILQNRWSPTYDVSPILTS 
IQSLLDEPNPNSPANSQAAQLYQENKREYEKRVSAIVEQSWNDS 

6630 

A 

3 

330 

ARRRLMRDFKRLQEDPPVGVSGAPSENNIMQWNAVIFG\YADGSICL 
D I LQNRWS PTYDVS SILTS IQSLLDEPNPNS PANSQAAQL YQENKRE 


:1 0 01 S <+3 . 01 ES iS! ifo CI 2 






YEKRVSAIVEQSWNDS 

6631 

A 

51 

420 

KLTAGQLRSMSTPARRRLMRNFKRLQKDPPGGVRGAPFENNIMQGNA 
VIFGPKGTPFEKGTFKLVKEFFEKYPNKPPPVRFLSKRFHPNVYAEW 
S I CLN I LONRWS PTYEG S <5 T T *T Q T O P T .T , 

6632 

A 

3 

141 

ECVTNFS I VFAVYADGS ICLDI LQNRWS PTYDVS S I LTS I QVS VC 

6633 

A 

1 

581 

TT^Ti^. 2121 21 PftifSm'S Zl ZM?T*^'PT?l?T?^ , r r r^'D OP*" 1 ?! /^pmnit t>i-> V nr » i-it^v^ •» 

xiai^sA/^rijv^ijrvjMAKJJr r r r v a UKKObCCjACSTPARR\RLMKJ)FKRL 
QEDPPVGV\SGAPSENNIMQWNAVIFGPG\GTPFEDG\TF*L\VIEF 

SEEYPN\ KPP\ TVPVPT PlCMFPCPXnTVIi ★ TjJT7/"lTr»T \ fit\ r Avmtinni*. 

* cr * \ * v * r * \ lrwir iroJriN v w VviXL-Ju \DI \ LQNRWES Q 
HMD VS FLSLTFNS VSG WDEPN\ P * QS S P IAQAAQL YQENKRE Y\ EKR 
VSAIVEQSWNDS 

6634 

A 

2 

266 

YCGPL*SVWVARNPPGFAFVEFEDPRDAAGVV*ELDGRTLCGP\PSW 
GHRP*DDYCRRGLPPRRRPPRRRNLSCSRSRSLYSPWTLSL 

6635 

A 

3 

164 

GRSHLSNGEKRSRNRGPPPSWGRRPRDDYRRRSPPPRRRYLRBSLLR 
GIGVME 

6636. 

A 

1 

94 7 

o.r jjvr r ± ii^rimcD^uFIjiJCiw Y VGNLGNNGNKTELERAFGYYoPLRS 
VWVARNP PG F AFVE FED PRDAA * V * YLC PLRTLCGCRVRVELSNGE K 

RSRNRGPPPCJ Wf3PP PPrVnVPT?"DCMT POT CT T OXTT VT/-\ TMnnirr nnmtT 

rVk - , - CVL ^* v -°' t ^ r ^ ~ "^rtxtjr XKaJLf Xi\r F&rlLtr oJjoJjJjSNXjNQIGSSHLiDRPHI 
PGQSAQLFIYQMSSQQLQQQPSANKKAGKIHNTPFANQLNPTATSGQ 
NLFRQILPGPSVRAGLTFTISSLLKPKTNLQEGEASLAAGAGPFLEI 
GEERDRCLGRE I TS R PDPS LGL WDLGQMKGNRRQFARE WYRKLLH 
LTGVCTENFKFCtjRI\HKTinAPT.Pr , T?c3r , Q'KrT mro 

6637 

A 

1 

282 

PLRSVWVARNPPGFAFVEFEDPRDAADAVRELDGRTLCGCRVRVELS 
NGEKRSRNRGPPPSWnRRPPnriYT?PPQDDDi>DDTrrTMCT t t-«-pt ovtt t» ' 

6638 

A 

116 

561 

PLDLEMHRDSCPL\DCKVYVGNLGNNGNKT\ELERAFGYYGPLR\SV 
WVARNPPRPLAFVEFEDPPRWQLDAVPRS*DGKNTNVACPCKEWELV 

RMGEKXRSRXNRGPPPCJT./WfiPPPV Dnr'PTDDUPimTi'PT rnnrnnv^ 

GRFLFAARSRSLS 

6639 

A 

126 

376 

VLIaARXiQANGANTVELSAEWTFPSGYVGSVLPQGHREPNAL / CLS FS 
KFRVLGSAPFLINRRTASNCVNRYCRLAATWRAVLPL 

6640 

A 

464 

781 

AFAFTKDRQLGSSKKRRMSSRF7VSSAAKCRAVRPLLRSRRFPPVPMP 

PGPWVSSTKIjGSRLGRHQASCRG*SQRAR*SGLTGPTHMEPN1CVRST 
GLKFLLPAQQS 

6641 

A 

201 

318 

EINLTNFICFSSRSDVS*RESPFSRELHRISSISLEPG 

6642 

A 

1 

1714 

MKLPEEGAGSNICCSAIFAVLQPPLVIPRQTVSGVDLQQTPTDLQLR 
VLTLRRKTSKQKGHTHQNPICMSPSSRTKGRQRDKLEEILRGLTPRK 
ND IGDAMVFCLNNAEAAEE I VDCI TESLS I LKTPLPKKACAPRWSG 
AGPAGTWLALGAGAAFRPGLHGAAGGHPPGRGEVGGVRTRRTRPPRA 
PDRATGVSSPPGPASAGNPPKSYRVHVHKVQVCYIGIHVPCWFAAAV 
KLSCTLD I CPNAMV S AARQVPTGAGTP VRRQEGS I EVYEDAGSHYLC 
jjuivM^jir «x/£>il» i. i l Aoi\JAyL>yijbCS WTLQ VETHPSFLIPHLSLRPS 
ATRNPGFRFSSFSHRPRPLPLALGPRTRHVPLSSLTLQVCFRWGRGR 
QVXSPAGGLCVPRALLRWDLRGDTQSPEK1.YMPKTIKETSLSTYHVP 
GITLNDRS I INKIDPGVEESPLDLHVCHGNNYVDEKPALLLLSGKLG 
CPSLTSCYSNCLSYGNRCRLLQELVVNQHVEQQPHYTHKPTLEYSPF 

PIPPQSPAYEPNLFDGPESQFCPNQSLVSLLGDQRESENIANPMQTS 
SSGOAAK 

6643 

A 

1 

789 

MG FRHVGQAGLKLLTS SDL PAXASQVQCKL WPAS AERH I S PLQRGL 
I AF S LRPDPGL AGDRAS LG VPLCG YR I GNPRRS P KRTRHCARTAAV S 

CSAYSKCIjYOFRPNVG'H c ;2VQ r rPVZiMrT , » r P»T 1 » t\ r*x?/~f 7\ TTTk nrk-nnvrnrvmrtT 

GARRGTPQRYTGGAETKNRAFRSYPKMETSWPRGGGSTPPVALDAQV 
ERECALGSGAQGKHS CIiAVRWXi I FC INEHYPQTLTYRIjPFVIjPLRS 
GGTPTS WRALFFPS S ^D SNCKLCGGTS S 

6644 

A 

1 

293 

SPEMESHPITQAGVQWHNHSSLQPEI PGLKQSSHLSLPS SWDKR* EP 

L\PWPKNIFKMLIL*IGFSHVIGNNGTYLYQSINNVEGPLKRSQKDR ' 
SCGN 

6645 

A 

1 

252 

GLTLP PRLGCHAT I TAHYS LNLPGS SNPPTSAS QVAGTTHS RPCPE I 
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TQTALQRGPTGLTTKASRPKARRGSKVSWTIAILPPQ 

6646 

A 

1 

1974 

MERLETINILTLEKADLKTTLYHTKKAARHFEEESKDLAGRLQYSLQ 
RIQELERALCAVSTQQQEEDRGHCLSSPDQNFSLFTIQSSSCREAVL 
HRRIiQQTIKERALLNAHVTQVTESLKQVQIiERDEYAKHIKGERARWQ 
ERMWKMSVEARTLKEEKKRDIHRIQELERSLSELKNQMGKMGLLQDE 
AKHLRQEVEGLEGKLQSQVENNQALSLLSKEQKQRLQEQEEMLREQE 
VQRVREQERLCEQNERLREQQKTLQEQGERLRKQEQRLRKQEERLRK 
EEERLOKOEKRLWDOEERLWKKEERIiOKOEERt^AT.QnTJWKT.nTfOT.aT? 
PQCS FEDLNNEKKS7VLQLEQQVKELQEKLDEVKEMQYVATYQQLTS E 
KEALHRQLLLQTQLVDQLQQQEAWGKAEHLEAASQQNQQLETQLSLV 
ALPGEDPLMLLVSHLLIALAPHPFREPWRHFVTCDQQLGIYQKAFE 
HLGNALTYDPTNYKAISCLTOANTLAPFDWKILYNLGLVHLTMQQYA 
SAFHFLSAAINFQPKMGELYMLLAGLLSTCNPLVNLNYAVIiLYNQGE 
KXNALAQYQEMEKKVSLLKI>NSSLEFDSEVCLLLAPKSHNQSNQTRE 
RKGIQISKEEVKLSLFADDMIVYLENLKDSSKKLLELYLSEYWSFSV 

6647 

A 

3 

322 

LLSTCNPLV^NYAVLLYNQGEKKNALAQYQEMEKKVSLLKDNSSLE 
FDSEVCLLLAPKSHNQSNQTRERKGIQISKEEVKLSLFADDMIVYLE 
NLKDSSKKLLEL 

6648 

C 

238 

324 

MADLPIFLVEFSCFIIYFNGSFMFSQKK* ] 

6649 

A 

1 

1088 

MOG FVYNHOVNIjETTKVIjTS TG S TLOTSGSOKE VT .P PTf? <3 TP K7 . An K 
AQDQLHNALNLNRHDLTY I MLGKI HLLEGDLDKAI EVYKKAVEFS PE 
NTELLTTLGLLYLQLG I YQKAFEHLGNALTYDPTNYKAI LAAGSMMQ 
THGDFDVALTKYR WACAVP ES P PLWNN IGMCF FGKKKYVAAI SCLK 
RANYLAPFDWKILYNLGLVHLTMQQYASAFHFLSAAINFQPKMGELY 
ML LAVAL TNL ED I ENAKRAYAEAVHLD KCNPLVNLNYAVLL YNQGEK 
KNALAQYQEMEKKVS LLKDNS S LEFDS EMVEMAQKLGAALQVGEALV 
WTKPVKDPKSKHQTTFNQQTWPVSSSLWALIKL 

6650 

A 

1 

368 

GTSLKQEEAFRSYFEIFNGPGEVDAQSLKNILLLMGFSVTLAQVEDA 
LMSADVNGDGREDFKDFLAVMTDTTRFFCFAEQNAXiSDMALSNPHTL 
LFE I LHL PL YMLALPNARLCD IPDCILI 

6651 

A 

3 

243 

HERNEYFRSEDLKLQISTETITLSHGSSTFSLSWSDTAQVTNQRVQI 
SVKKEKVVTITLDKEMSFTVVLTRVWNHAIFI 

6652 

A ! 

24 

288 

VLE I KKKE 1 1 PTSTPYSTLPPLKTDLPPLYPQVPPPSQCQTPAIHRM 
ATPPLKPQNFKALLRKWHHGLFIFPFFKGKSLKKFPSENS 

6653 

A 

2 

272 

PDRPTRPAVPMSPSSHSPEGRPPPLLPGGPVCKAAASAPSSLLDQPC 
LCPAPS VRTAVALTTPD I TLVLPPDVT QQEAS ALREETEAWAR 

6654 

A 

965 

2737 

AASPGVLVCPQARTYIQSLPPALDVQLTPLPTTGFPAAIDLLGRMLV 
LDSDQRVSAAEALAHAYFSQYHDPEDEPEAEPGPGCGRHKLTLSLHL 
LELTYQEVLSWDEAAGRRSPPPALPPRLRPSPGSGPPPAAAASGFRR 
WAAPGPAMSS P P PARS GF YRQEVTKTAWEVRAGACAGC PAFPLSGP P 
RSSAVDGRTGAKVAIKJCLYRPFQSELFANPARVLCCILNPMWPPQVI 
GHAEAG S VR * QRS G P SHS YL VMPFMGTDLG KLMKHEKLGALRGCAEA 
CHHAWEGGAQWPPFLCPHRSLGVP*NPPPQFFPQILDFGLARQADSE 
MTG YWTRWSHP VPLPTELTRAS I S VD IWSVGCI MAEM I TGKTLCS C 
PGGGSGGPDLWPPSCVPDLGADLDQLKEIMKVADLGLGVGPSPFLPT 
QAKbTYMKGLPEWSVGPPVWPDAPSPAAVNLLEKMLVLDAEQRVTAG 
EALAHPYFESLHDTEDEPQVQKATTDGPSLPSDPHASPGHPGLGDVI 
AISSGATLTCGMAAVHTGPKVWAHRWPEDWGVAFALQTPRPPRLSPQ 
VFLRPGRVGLGAALDPDPGWGAGRDL 

6655 

A 

3 

231 

WAAPGPAMSSPPPARSGFYRQEVTKTAWEVRAGACAGCPAFPLSGPP 
RSSAVDGRTGAKVAIKKLYRPFQSELFA 

6656 

A 

27 

144 

QRSGPSHSYLVMPFMGTDLGKLMKHEKLGALRGCAEAC 

6657 

A 

3 

308 

TKRAPAQYSGRTRLCTRSSGPTRGDGQERVPLRGSWAAESLGAQPGG 
SRVLACTSTRSPGLTRGKCPRLPLSPPPSRHSRASVKSTRAGAPEIR 
HGNAGSA 

6658 

A 

192 

310 : 

ARGSTKPKMQLRKPPVICGDSRAQLLIYGHLLAPWAGVXi 
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6659 

A 

88 

1506 - 

ACGRFGFPEGPEGGRGRHVRAARGPVGPRAGEGPCAAWVGVGPGSAP 
GTSLREGPGAGAPQRPAGGRASGRAARSRPEARAEGAPSPVVGAPPA 
GSAGWAAPGPAMSSPPPARSGFYRQEVTKTAWEVRAVYRDLQPVGSG 

*»■ A VJrt. V V — <~>^\ V JL/V7XV X V3ni\ V nJ. XVTvJ-l X J\. .£ F \£OE*lJP J\T\1\J\ X KMiKhliN 1*11*1 f-t |-j 

ENVIGLLDVFTPDETLDDFTDFYLVMPFMGTDLGKLMKHEKLGEDRI 

QFLVYQMLKGLRYIHAAGIIHRDLKPGNLAVNEDCELKILDFGLARQ 

ADSEMTGYVVTRWYRAPEVILNWMRYTQTVDIWSVGCIMAEMITGCT 

LFKGSDHLDQLKEIMK^TGTPPAEFVQRLQSDEAKNNMKGLPELEKK 

DFASILTNASPl^VNLLEKMLVLDAEQRVTAGEALAHPYFESLH^ 

DEPQVQKYDDSFDDVDRTLDEWK\RVTYKEVLSFKPPRQLGARVSKE 

TPL 

6660 

A 

71 

921 

NPPPQFFPQILDFGLARQADSEMTGYWTRWSHPVPLPTBLTRASIS 
VD I WS VG C I MAEM I TG KTLC S C PGGG S GG PDLW PPS CVPDLG ADLDQ 
LKE IMKVADLGLGVGPS PFLPTQAKNYMKGLPEWS VGPPWWPDAPS 
PAAVNLLEKMLVLDAEQRVTAGEALAHPYFESLHDTEDEPQVQKATT 
DG P S LP SDPHAS PGHPGLGD VI A I S S G ATLTCGMAAVHTG PKVWAHR 
WPEDWGVAFALQTPRPPRLSPQVFLRPGRVGLGAALDPDPGWGAGRD 
L 

6661 

A 

2 

310 

GDKVAHS VTKVGVQWHNHC S PQPRPSGLKRS S SLS LQSS WDHRRMPP 
CLAHFLIFCRDGVLLGLPRLGVALLGLK3*SSHLGLPKVLRITGSEAT 
ILGPQGFS 

6662 

A 

5517 

5831 

LPGNFFFFFSDGVSLLLPRLGGQWHHLGSLQPPPPGFQRFSCLSLPS 
SWDYRIiLPPRLARKFFVVNMTFGIjISSFNSTSFVHKSQECFRQKTGF 
TLLIIRRSVL 

6663 

A 

1 

625 

CKFIRVMAHTRLRLLPLRRKKAHLME IQVNEGTVAEKLDWARERIiEQ 
QVPVNQVFGQDEMIDVIGVTKGKGYKGVTSRWHTKKLPRKTHRGLRK 
VAC\KDGKLIKNNASTDYDLSDKSINPLGGFVHYGEVTNDFVMLKGC 
WGTKKRVLTLRKS LLVQTKRRALEKIDLKF IDTTS KFGHGRFQTME 
EKKAFMGPLKKDRIAKEEGA 

6664 

A 

1 

1261 

MSHRKFSAPRHGSLGFLPRKRSSRHRGKVKSFPKDDPSKPVHLTAFL 
GYKAGMTHIWEVDRPGSKA^KKEVVEAVTIVETPPMVVVGIVGYVE 

Arn.v7UA.xrAi v r ^-vx^n jl ox^i^ v_r\Xvxv r 4 X Jvi\J rvri X\.o r\J\J\J\r J. Jvx \—i\J\.ri\JLJtLL} 

GKKQLEKDFS SMKKYCQVI RVI AHTQMRLLPLRQNKAH \ LM * I QVNG 
GH CG PR S WD W P PRE GL KOOGTVD P S VXiG RNKM T D V T G \ VTDGP KGY K 
GGHPWWHTQESCPRKD/HHPRACRKGGHVLGAW\HPA\RVAFSVAR 
AG \ QKG \ YHHRTE I TKKD/ 1 YKI G PGLS PL. FRTAKVI KGTI AS TDYD 
LF * QEASTPPGGFVH\ YGE \ VTNDFVML KG C WGTKKRVLTLRKS LL 
VQTKRRALEKIDLK\FIDTTSKPGHGRFQTMEEKKAFMGPLKKDRIA 
KEEGA 

6665 

A 

1 

168 

PTRPATQSNICDEDSATETCYTYDRNKCYTAVVPLVYGGETKMVETA 
LTPDACYPD 

6666 

A 

3 

203 

SLCTELPPSSSAFSSSSSSPAFPPFWSSSSSISRASTSSSSSSPSSW 
SPLCTSFQPLDKAAPJDRAS 

6667 

A 

3 

523 

LQ F FRR SEVKM KNHLL FWGVLAVF I KAVHVKAQEDERI VLVDNKCKC 
ARITSRIIRSSEDPNEDIVERNIRI\IVPLNNRENISDPTSP\LRTR 
FVY\ HLSDLCKK\ CDPTEVEL\ DNQI VTAT\ QSNI CGWMTVPQSTCY 
TL S TGNKC YTA WPL \ VYGG E TKMVE TALTPD AC Y PD 

6668 

A 

3 

276 

EDSATETCYTYDRNKCYTAWPLVYGALWPRRWRRRLRSVPAAAAAG 
LLEDPAVPGLTAALPAVELPERRRRQQRQRQAPGHQLLPRSGR 

6669 

A 

2 

152 

FFFVFLVETEFYLVGQAS LELVTSGD PAASAS QS AG I TGVSHACSPF 
QAF 

6670 

A 

1 

936 

M7VALYACTKCHQRFPFEALSQGQQLCKECRIAHPVVKCTYCRTEYQQ 
ERLECNGTI SAHCNLHL PG S SDS PAS SSRVAG I TGI KTNTI CKKCAQ 
NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQ 
CAFDRKDDRKKVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSSRAGH 
QEKEQYSRLSGGGlTraSFSPDLALDSPGTDHFVIIAQLKEEVATLKK 
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MLHQKDQMILEKEKKITELKADFQYQESQMRAKIWQMEKTHKEVTEQ 
LQAKNRELLKQAAALSKSKKSEKSGAITSP 

6671 

A 

1 

1023 

MAAL YACTKCHOR F P FEAL S OGOOTj C JCP prtah pvwptv r*p tt? vnn 

ERLECNGTISAHCNLHLPGSSDSPASSSRVAGITGIKTNTICKKCAQ 
NVQL YGTPKPCOYCN I IAAFIGNKCORCTNSEKKYnpPY^P pnrifnn 

CAFDRKDDRKKVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSSRAGH 
QEKEQYSRLSGGGHYNSQKTLSTSSIQNEIPKKKSKFESITTNGDSF 
SPDLALDSPGTDHFVIIAQLKEEVATLKKMLHQKDQMIIjEKEKKITE 
LKADFQYQESQMRAKMNQMEKTHKEVTEQLQAKNRELLKQAAALSKS 
KKSEKSGAITSP 

6672 

A 

106 

125 

KYLEQCLKHGKSS*HSILQANFFASENDRAKMK*KPCFYKCF*AITF 
LREDWD YRRE PLRLAYPVYFLTCKI R I M YTHL KD YTGH 

6673 

A 

3 

155 

GAI SAHCNLRLPGSSDS PDSASNL IQMSLLRVLLLEMVLTLRTTRKP 
SEI 

6674 

A 

3 

266 

GAISAHRNLRLPGSSDSSASSSQPGWPRGRWSPTSCSHGKHHQRPG 
VRGAPRRQGCTLRGRRRPCPGEGAQDHGGEYGRPQRVQPP 

6675 

A 

5 

160 

RDLGSLQPPPPGFK*FSCLRLPSPQVQRCCVPSPWLLEQMITAWGAS 
IHRN 

6676 

A 

1586 

2220 

PSCCSGIS HATRS S FTLL VS QANDTVDCTAF I CTAWTL F I PRASEL V 
AVTWI S S FGSLFHLLSS ALS SMGTCPVDS YWTLMNS EEKS SEPLVLC 
DLWYFIALPSPLVTFYCYLPYIGSCQGPSLSLFSFFFFFFFFLRRRT 
ESCSVAQVGVQWRDLGSLQPPPPGFKRFSCLSLPSSWDYRRAPPRPA 
NFYIFSRDGVSPCWPGWSRTPIS 

6677 

A 

3 

441 

CGVGLHDRPWLPRNRCSPASALRSACWNSPRARGALWPESACASTY 
PNPRGLSLGAAPTQSAPGPSGLCCSLCTGQPLSCLLWHHPGHSLRAP 

HSPARLPSCCTQSPGAGAGSDLQGGTPGPAPCSTAAPSGPHAWRWAC 
IHIP 

6678 

A 

1 

91 

SFATLLRLVSNSWPQMILLPRSPKVLGLQA 

6679 

A 

2 

226 

FFEMES CCDAQAR VQWHEL S S LKPLLCS PGSRD FS AS AS QVAGTTGM 
HRHAQL I FVFRVCVC FG 1 1 VLEKQ I KY 

6680 

A 

3 

187 

EMGKGIDSLFNEXIAENFPGLARDIDIWKLETQGVPNRFNLKRSSQR 
QMIVKLSKVKNREF 

6681 

A 

2 

209 

GEEGRTQKGERNGGGGSERKEGEEHEARRTGRGEGGGGEAKGRPRSA 
RRRGRGSRGSKSQTPSRHLPAH 

6682 

A 

2 

702 

FVSVPSVLLGLQFWRSERD/TARLQSARWRVERGRLKELLSRQRPRR 
REEVWGREVCRTEMEVRASLQKIVSNGDEQLEKAMEEILRDFEKRP 
S SLLVDCQS S S E I SDHS FGD I PASQTNKPSLQL ILDPSNTE I STPRP 
S S PGGLPEEDS VL FNKLT YLGCMKVS S PRNEVE ALRAMATMKS S S Q Y 
PFPVTLYVPNVPEGSVRHILSLLGGGEDVIASSGVELTWSHCRHAE 

6683 

A 

3 

499 

SFGLQRWRSERDSARLQSARRRVERGRLKELLSRQRPRRREEVVVGR 
EVCRTEMEVRASLQKVSGSSDSVATMNSEEFVLVPQYADDNSTKHEE 
KPQLKI VSNGDEQLEKAMEE ILRDSEKRPS SLLVDCQS S SE I SDHS F 
GD I PAS QTNKPS LQL IMD P SNTET 

6684 

A 

1 

224 

RRLFFFWLI DMG FHHVGQAGLKLL S S SD PPTLAS QS AE ITSMS HRA 
RPQCSPSTIiHCIEKLINTLGDKLTYYL 

6685 

A 

1 

789 

MGFRHVGQAGLKLLTSSDLPALASQVQCKLWPAS AERH I S PLQRGL 
IAFSLRPDPGLAGDRASLGVPLCGYRIGNPRRSPKRTRHCARTAAVS 
CSAYSKCLYQFRPNVGHSASTPKAMCQTTAAGEGAHAADEGYTQRW 
GARRGTPQRYTGGAETKNRAFRSYPKMETSWPRGGGSTPPVALDAQV 
ERECALGSGAQGKHSCLAVRVVLIFCINEHYPQTLTYRLPFVLPLRS 
GGTPTS WRALFFPS SSDSNCKLCGGTS S 

6686 

a : 

1 

7500 

MLFLNLATAHQEIEYISPPLEHGSNQALRSTALETLHESALLSGELA 
LKTLCNKMDTGDTALGQKATSRSGETDKASGRWRQEQSAVIKMSTFG 
SHEGQRQPQIEPEQIGNTASAQLFGSGKLASPSEWQQVAEKQYPPH 
RPSPYSCQHSLSFPQHSLPQGVMHSTKPHQSLEGPPWLFPGPLPSVA 
SEDLFPFPIHGHSGGYPRKKISSLNPAYSQYSQKSIEQAJ2EAHKJ0EH 
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KPKKPGKYICPYCSRACAKPSVLKKHIRSHTGERPYPCIPCGFSFKT 

KSNLYKHRKSHAHAIKAGLVPFTESAVSKLDLEAGFIDVEAEIHSDG 

EQSTDTDEESSLFAEASDKMSPGPPIPLDIASRGGYHGSLEBSLGGP 

MKVPILIIPKSGIPLPNESSQYIGPDMLPNPSLNTKADDSHTVKQKI. 

ALRLSEKKGQDSEPSLNLLS PHSKGSTDSGYFSRSES AEQQI SPPNT 

NAKSYEEIIFGKYCRLSPRNALSVTTTSQERAAMGRKGIMEPLPHVN 

TRLDVKMFEDPVSQLIPSKGDVDPSQTSMLKSTKFNSESRQPQIIPS 

S I RNEGKL YPANFQG SNPVLLEAP VDS S PL I RSNS VPTS S ATNLTI P 

P S LRGSHS FDERMTG SDDVF Y PGTVG I P PQRMLRRQAAFEIiP S VQEG 

HVEVEHHGRMLKG I S S S S I»KE KKLS PGDRVG YD YD VCRKPYKKWED S 

ETPKQNYRDISCLSSLKHGGEYFMDPWPLQGVPSMFGTTCENRKRR 

KEKSVGDEEDTPMICSSIVSTPVGIMASDYDPKLQMQEGVRSGFAMA 

GHENLSHGHTERFDPCRPQLQPGSPSLVSEESPSAIDSDKMSDLGGR 

KPPGNVISVIQHTNSLSRPNSFERSESAELVACTQDKAPSPSETCDS 

EISEAPVSPEWAPPGDGAESGGKPSPSQQVQQQSYHTQPRLVRQHNI 

QVPEIRVTEEPDKPEKEKEAQSKEPEKPVEEFQWPQRSETLSQLPAE 

KLPPKKKRLRLADMEHSSGESSFESTGTGLSRSPSQESNIiSHSSSFS 

MSFEREETSKLSALPKQDEFGKHSEFLTVPAGSYSLSVPGHHHQKEM 

RRCS S EQMPCPHPAEVPEVRS KS FD YGNLSHAP VSGAAASTVS PSRE 

RKKCFLVRQASFSGSPEISQGEVGMDQSVKQEQLEHLHAGLRSGWHH 

GPPAVLPPLQQEDPGKQVAGPCPPLSSGPLHLAQPQIMHMDSQESLR 

NPLI QPTS YMTS KHLPEQPHLFPHQETI PFS P I QNALFQFQYPTVCM 

VHLPAQQPPWWQAHFPHPFAQHPQKSYGKPSFQTEIHSSYPLEHVAE 

HTGKKPAEYAHTKEQTYPCYSGASGLHPKNLLPICFPSDQSSKSTETP 

SEQVLQEDFASANAGSLQSLPGTVVPVRIQTHVPSYGSVMYTSISQI 

LGQNSPAIVICK^ENMTQRTLVTNAAMQGIGFNIAQVLGQHAGLEK 

YPIWKAPQTLPLGLESSIPLCLPSTSDSVATLGGSKRMLSPASSLEL 

FMETKQQKRVKEEKMYGQIVEELSAVEIiTNSDIKKDLSRPQKPQLVR 

QGCASEPKDGLQSGSSSFSSLSPSSSQDYPSVSPSSREPFPPSKEML 

SGSRAPLPGQKSSGPSESKESSDELDIDETASDMSMSPQSSSLPAGD 

GQLEEEGKGHKRPVGMLVRMASAPSGNVADSTIjLLTDMADFQQILQF 

PSLRTTTTVSWCFLNYTKPNYVQQATFKS S VYASWC I S S CNPNPSGL 

NTKTTLALLRS KQKI TAE I YTLAAMHRPGTGKLTSS SAWKQFTQMKP 

DASFLFGSKLERKLVGNILKERGKGDIHGDKDIGSKQTEPIRIKIFE 

GGYKSNEDYVYVRGR6RGKYICEECGIRCKKPSMLKKHIRTHTDVRP 

WCKLCNFAFKTKGNLTKHMKSKAHMKKCLELGVSMTSVDDTETEEA 

ENLEDLHKAAEKHSMSSISTDHQFSDAEESDGEDGDDNDDDDEDEDD 

FDDQGDLTPKTRSRSTSPQPPRFSSLPVNVGAVPHGVPSDSSLGHSS 

LISYLVTLPSIRVTQLMTPSDSCEDTQMTEYQRLFQSKSTDSEPDKD 

RLDIPSCMDEECMLPSEPSSSPRDFSPSSHHSSPGYDSSPCRDNSPK 

RYLIPKGDLSPRRHLSPRRDLSPMRHLSPTKEAALRREMSQRDVSPR 

RHLSPRRPVSPGKDTTARRDTiQPPPPPPVMTTTDaDCDDDBT VTrWDD 

LSMGQYIjQAEPIVLGPPNLRRGLPQVPYFSIiYGDQEGAYEHPGSSLF 
PEGPNDYVFSHLPLHSQQQVRAPIPMVPVGGIQMVHSMPPALSSLHP 
SPTLPLPMEGFEEKKGASGESFSKDPYVLSKQHEKRGPHALQSSGPP 
STPSSPRLLMKQSTSEDSLNATEREQEENIQTCTKAIASLRIATEEA 
ALLGPDQPARVQEPHQNPLGSAHVSIRHFSRPEPGQPCTSATHPDLH 
DGEKDN FGTS QTPLAHS TFYS KS CVDDKQLDFHS SKELS SSTEESKD 
PSSEKSQLH 

6687 

A 

1 

8175 

MKKENVSQGIKTASGSQKRQENDCPLEPLERRQPSSHLAFTPVRAVW 
GFRHLELMLVRQILVASVSFQCLQTYRSCGNFRVLFPSQSATSVI 1 1 
ITVKFDVREKIAHGNI^HLAFGWTFLQWRELCYSPGPRTLLAN/UiRLW 
CACGRFQREGAISKFADTIiEHGSNQALRSTALETLHESALLSGELAL 
KTLCNKMDTGDTALGQKATSRSGETDKASGRWRQEQSAVIKMSTFGS 
HEGQRQPQIEPEQIGNTASAQLFGSGKLASPSEWQQVAEKQYPPHR 
PSPYSCQHSLSFPQHSLPQGVMHSTKPHQSLEGPPWLFPGPLPSVAS 
EDLFPFPIHGHSGGYPRKKISSLNPAYSQYSQKSIEQAEEAHKKEHK 
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PKKPGKYICPYCSRACAKPSVLKKHIRSHTGERPYPCIPCGFSFKTK 

SNLYKHRKSHAHAIKAGLVPFTESAVSKLDLEAQFIDVEAEIHSDGE 

QSTDTDEESSLFAEASDKMSPGPPIPLDIASRGGYHGSLEESLGGPM 

KVPILIIPKSGIPLPNESSQYIGPDMLPNPSLNTKADDSHTVKQKLA 

LRLSEKKGQDSEPSLNLLSPHSKGSTDSGYFSRSESAEQQISPPNTN 

AKS YEE 1 1 FGKYCRLS PRNALS VTTTSQERAAMGRKGIMEPLPHVNT 

RLDVKMFEDPVSQLIPSKGDVDPSQTSMLKSTKFNSESRQPQIIPSS 

IRNEGKLYPANFQGSNPVLLEAPVDSSPLIRSNSVPTSSATNLTIPP 

SLRGSHSFDERMTGSDDVFYPGTVGIPPQRMLRRQAAFELPSVQEGH 

VEVEHHGRMLKGI S SS SLKEKKLS PGDRVGYDYDVCRKP YKKWEDSE 

TPKQNYRD I SCLS S LKHGGE YFMDP WPLQGVPSMFGTTCENRKRRK 

EKSVGDEEDTPMICSSIVSTPVGIMASDYDPKLQMQEGVRSGFAMAG 

HENLSHGHTERFDPCRPQLQPGSPSLVSEESPSAIDSDKMSDLGGRK 

PPGNVISVIQHTNSLSRPNSFERSESAELVACTQDKAPSPSETCDSE 

ISEAPVSPEWAPPGDGAESGGKPSPSQQVQQQSYHTQPRLVRQHNIQ 

VPEIRVTEEPDKPEKEKEAQSKEPEKPVEEFQWPQRSETLSQLPAEK 

LPPKKKRLRIjADMEHSSGESSFESTGTGLSRSPSQESNIjSHSSSFSM 

SFEREETSKLSALPKQDEFGKHSEFLTVPAGSYSLSVPGHHHQKEMR 

RCSSEQMPCPHPAEVPEVRSKSFDYGNLSHAPVSGAAASTVSPSRER 

KKCFLVRQAS FSGS PE ISQGEVGMDQSVKQEQLEHLHAGLRSGWHHG 

PPAVLPPLQQEDPGKQVAGPCPPLSSGPLHLAQPQIMHMDSQESLRN 

PLIQPTSYMTSKHLPEQPHLFPHQETIPFSPIQNALFQFQYPTVCMV 

HLPAQQPPWWQAHFPHPFAQHPQKSYGKPSFQTEIHSSYPLEHVAEH 

TGKKPAEYAHTKEQTYPCYSGASGLHPKNLLPKFPSDQSSKSTETPS 

EQVLQEDFASANAGSLQSLPGTVVPVRIQTHVPSYGSVMYTSISQIL 

GQNS PA I V I CKVDENMTQRTLVTNAAMQG I G FN I AQVLGQHAGLEKY 

PIWKAPQTLPLGLESSIPLCLPSTSDSVATLGGSKRMLSPASSLELF 

METKQQKRVKEEKMYGQIVEELSAVELTNSDIKKDLSRPQKPQLVRQ 

GCASEPKDGLQSGSSSFSSLSPSSSQDYPSVSPSSRBPFPPSKEMIiS 

GSRAPLPGQKSSGPSESKESSDELDIDETASDMSMSPQSSSLPAGDG 

QLEEEGKGHKRPVGMLVRMASAPSGNVT^DSTLLLTDMADFQQILQFP 

SLRTTTTVSWCFLNYTKPNYVQQATFKSSVYASWCISSCNPNPSGLN 

TKTTLAIiLRSKQKITAEIYTLAAMHRPGTGKLTSSSAWKQFTQMKPD 

ASFLFGSKLERKLVGNILKERGKGDIHGDKDIGSKQTEPIRIKIFEG 

GYKSNEDYVYVRGRGRGKYICEECGIRCKKPSMJjKKHIRTHTDVRPY 

VCKLCNFAFKTKGNLTKHMKSKAHMKKCLELGVSMTSVDDTETEEAE 

NLEDLHKAAEKHSMSSISTDHQFSDAEESDGEDGDDNDDDDEDEDDF 

DDQGDLTPKTRSRSTSPQPPRFSSLPVNVGAVPHGVPSDSSLGHSSL 

I S YLVTLPS IRVTQLMTPSDSCEDTQMTE YQRLFQSKSTDSEPDKDR 

LDIPSCIWEECMLPSEPSSSPRDFSPSSHHSSPGYDSSPCRDNSPKR 

YLIPKGDLSPRRHLSPRRDLSPMRHI,SPRKEAALRREMSQRDVSPRR 

HLS PRRP VS PGKDI TARRDLS PRRERR YMTT I RAPS PRRALYHNPPL 

FVYVQNLNDSHTVKFPPFPAYEVWTIVLMIAPFHSHFQENPCFQLA 
YDISQRRNIiRRGLPQVPYFSLYGDQEGAYEHPGSSLFPEGPNDYVFS 
HLPLHSQQQVRAPIPMVPVGGIQMVHSMPPAIiSSLHPSPTLPLPMEG 
FEEKKGASGESFSKDPYVLSKQHEKRGPHALQSSGPPSTPSSPRLLM 
KOS T S ED S LNATE R E OE EN I OTCTKA I AS LR T A TF F A A T . IjR PDOPAR 
VQEPHQNPLGSAHVSIRHFSRPEPGQPCTSATHPDIiHDGEKDNFGTS 
QTPLAHSTFYSKSCVDDKQLDFHSSKELSSSTEESKDPSSEKSQLH 

6688 

A 

3 

598 

QCGGIRATTI>^MSFEWPWQ\YRFPPFFTLQPNVDTRQKQLAAWCSIjV 
LSFCRLHKQS SMTVMEAQES PLFNNVKLQRE\ LPVES I Q IVLEELRK 
KGNLEWLDKSKSSFLIMWRRPEEWGKLIYQWVSRSGQNNSVFTLYEL 
TNGEDTEDEEFHGlA EKPLLLRALQ/ CPYSREHKAEIHHCSGDGRGV 
QVLLGRDLS pfts 

6689 

A 

2 

356 

SGQNNSVFTLYELTNGEDTEDEEFHGLDEATLLRAXQALQQEHKAEI 
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ITVKRWPRRSSSSSRDLSPLYFLPPTFPGLSKGDQTPVSPKRLESVT 
PPELKKDSSPEGWGPWGMGFLKPP 

6690 

A 

2 

123 

FCNVGQAGLKLLTSSDLPASASQIAGITVVIHHTWLECRF 

6691 

A 

161 

464 

APP PCATQTAPHQHGQHSRDALLRFGCHCE I LLFS SAG I YVQEMADM 
SMAKLY SGLLG VGDE I LEVNGAKVAGLGLAH I KELLAHSESLS I RVL 
RQRPVPR 

6692 

A 

3 

86 

HSAPRHFHTQTQSLQQSVRYLSLFCLF 

6693 

A 

3 

85 

HSAPRHFHTQTQSLQQSVRYLSLFCLF 

6694 

A 

2 

201 

RIAESLYDDRMFDIMRALDLTLKHHILPKELRTEYEEEYFYLEPYLK 
QDIREGIEREEWAKKWSCK 

6695 

A 

1 

119 

AVSRPAKAAESRLLTSPESLSRDLDAPEVQESYRQQVPP 

6696 

A 

1 

3414 

MEPNDSTSTAVEEPDSLEVLVKTLDSQTRTFIVGAQMNVKEFKEHIR 
AS VS I PSEKQRL I YQGRVLQDDKKLQE YNVGGKV IHLVERAPPQTHL 
PSGASSGTGSASATHGGGSPPGTRGPGASVHDRNANSYVMVGTFNLP 
SDGSAVDVHINNEQAPIQSEPRVRLVMAQHMIRDIQTLLSRMETLPY 
LQCRGGPQPQHSQPPPQPPAVTPEPVALSSQTSEPVESEAPPREPMB 
AEEVEERAPAQNPELTPGPAPAGPTPAPETNAPNHPSPAEYVEVLQE 
LQRIiESRLQPFLQRYYEVLGAAATTDYNNNHEGREEDQRLINLVGES 
LRLLGNTFV7ALSDLRCNLACTPPRHLHVVRPMSHYTTPMVX.QQAAI P 
I QI NVGTTVTMTGNGTR P P PTPNAEAP P PG PGQAS S VAP S S TNVE S S 
AEGAPPPGPAPPPATSHPRVIRISHQSVEPWMMHMNIQDSGTQPGG 
VPSAPTGPLGPPGHGQTLGQQVPGFPTAPTRWIARPTPPQARPSHP 
GGPPVSGTLQGAGIX5TNASLAQMVSGLVGQLLMQPVLVAQGTPGMAP 
PPAPATASASAGTTNTATTAGPAPGGPAQPPPTPQPSMADLQFSQLL 
GNLIiGPAGPGAGGPGVASPTITVAMPGVPAFLQGMTDFLQATQTAPP 
PPPPPPPPPPAPEQQTMPPPGSPSGGAGSPGGLGLESLSPEFFTSW 
QGVLSSLLGSLGARAGSSESIAAFIQRLSGSSNIFEPG/U^GAIjGFFG 
AliLSLLCQNFShTTOVVMLLHGHFQPLQRLQPQLRSFFHQHYLGGQEP 

NSIAAHVLHCTDSGFGARLLELCNQGLFECLAIJ^LHCLGGQQMELAA 
VINGRIRRMSRGVNPSLVSWLTTMMGLRLQWLEHMPVGPDAILRYV 
RRVGDPPQPLPE EPMEVQGAKRAS PE PQRENAS PAPGTTAEEAMS RG 
S\ PLAPEGG\SRDE\ODGASAETEP\ WRAA\VPPPWVPTTnnnTn\ <2 
QRKLKPQPPL\ SDAYL\ SGMPA\ KRRKTMPG *GPPSCFSTEAVSRAA 
KAAGARPLTSPESLSRDLEAP\EVQESYRQQLRSDIQKRLQEDPNYS 
PQRF\ PNAQRAFADDP 

6697 

A 

1 

441 

RKS FQKELGMAJjLKKDCQGL WRL I QDFVXjLTTAVEVAHRWRELAEK 
LANVSKQQMDAYESPHRDRNGVVDSEAMRKPAYDFLLTWSHQIGDSY 
RDVIQELHLGLDNNKNPITKRWKHLTGTLILVNSLDVLRAAAFSPAD 
QDDFEI 

6698 

A 

2 

180 

DVWWESERHQLKTKVVHLSKLPKDTALLLDPNIYRTMPQKRLKRTL 
IRKVFNLYLSKQ 

6699 

A ! 

1 

14 3 

EVQEVQGKKVARVANPGLDTFSREIFRNEKLKKCRRFLDELDAVKMD i 

6700 

A 

2 

1069 

^TiVFRTiKMAA^nJWTRPMPCPWT <^!Prrt!\rPKnyuT w T''nppp\TVOPvr\ , nMj 
o u v r*iujxu T inMD y/i v ddrj no R V v Lj\j Ht r V? VrCXM VXl L L L/rirvrN X o\j x LjLJj\Yi 

DODRFEKNFRVDVVHMDENf5TjFFDMVnTnaaTtt"KrzVT?PP TT.T.2i'P r \7"DTM 

AVBKVliVYNNTS I VQDE ILAHRLGLI P IHADPRLFEYRNQGDEEGTE 

IDTLQFRLQWCTRNPHAAKDSSDPl^Lx^mHKVYTRHMTWIPLGNQ 

ADLFPEGTIRPVHDDILIAQLRPGQEIDLLMHCVKGIGKDHA\KFSP 

VATASYRLLPDITLLEPVEGEAAEELSRCFSPGVIEVQEVQGKKVAQ 

SCPTPRLDT\ FS \ REIFRNEKLKKVVRLARVR\DHYIFSVES\TGVL 

PPDVLVSEAIKVLMGKCRRFLDELDAVQMD 

6701 

A 

197 

310 

WLCYI IYK* YDNWWHSMCFSKPLYPEIVSLNYILLKG 

6702 

A 

3 

67 

GHRSDDSSSHGCKANTAPQES 

6703 

A 

3 

517 

DGSKPVKVPENPPSMVFKQMEQVAQFLKAAEDYGVIKTDMFQTVDLF 
EG KDMAAVQRTLMALG S LAVTKNDGHYRGD PNWFM KKAQEHKRE FTE 
SQLQEGKHVIGLQMGS SSLCPQPQPNFLPES ITALAPPSRLPPS PLL 
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S P P WAKQGR S GLGVAWMWAKS TVLLGAKAH 

6704 

A 

150 

840 

GSLLSPDMANKGPSYGMSREVQSKIBKKYDEELEERLVEWIIVQCGP 
DVGRPDRGRLGFPVWLKNG\ VI LDRL * TTFTPDAS KPV\ KVPE KPPP 
SCS S S KTEPLPHFLKAAE\ DYGVRPKSN I FQDL * TSLKGK\ DMGSSS 
KUPlJ^FGQIiWgVTKN\DGHYRG\DP\NWFYERKPQQHKREFHR 
LQKGKACPLALPDGAANRRGP PQARH * QG YGTTLGQ 1 1 KLEAERAKP 

6705 

A 

3 

842 

TRAADPARPTRTRTRGRTRGRQHSLKKSRMENDMAQSLHSDTADDTA 
HRPGKSNLKLPKGILKKKVSASAEGVQEDPPELSPIPASPGQAAPLL 
PKKGILKKPRQRESGYYSSPEPSESGELLDAGDVFVSGDPKEQKPPQ 
kj JbJjXjriKiUj X JjKJjNGJVI F r QTAl»ELiAAP TTFGSLDELAPPRPLARAS 
RPSGAVSEDSILSSESFDQLDLPERLPEPPLRGCVSVDNL.TGLEEPP 
SEGPGSCLRRWRQDPLGDSCFSLTDCQEVTATYRQALRVCSKLT 

6706 

A 

1 

570 

CDGIWDVMSNEQIiCEYVKSRLEVSDDLENVCNWWDTCLHKGSRDNM 
SIVIiVCFSNAPKVSDEAVKKDSELDKHLESRVEEIMEKSGEEGMPDL 
AHVMRILSAENIPNLPPGGGLAGKRNVIEAVYSRLNPHRESDGMAVL 

GTSICKPAESSGDKKGSGWQTFQALRSTRTKRFFCKPERKINPALYN 
YK 

6707 

A 

1 

1437 

MGAFLDKPKTEKHNAHGAGNGLR YGLS SMQGWRVEMEDAHTAWG I P 
HGLEDWSFFAVYDGHAGSRVANYCSTHLLEHITTNEDFRAAGKSGSA 
LELSVENVKNGIRTGFLKIDEYMRNFSDLRNGMDRSGSTAVGVMISP 
KH I YF INCGDSRAVL YRNGQVCFS TQDH KPCNPRE KER I QNAGGS VM 
I QRVNG S LAVS RALGD YD YKC YDG KG PTEQLVS PE PE VYE I LRAEED 
EFI ILACDG IWDVMSNEELCEYVKSRLEVSDDLENVCNWWDTCLHK 
GSRDNMSIVLVCFSNAPKVSDEAVTCKDSELDKHLESRVEEIMEKSGE 
c*^rit'ULt£\Jri VflKX JbbAiiW 1 FNJjPPGGGLAGKRNVIEAVYSRIjNPHRES 
DGASDEAEESGSQGKLVEALRQMRINHRG^RQLLEEMLTSYRLAKV 
EGEESPAEPAATATSSNSDAGNPVTMQESHTESESGLAELDSSNEDA 
GTKMSGEKI 

6708 

A 

3 

286 

CFSNAPKVSDEAVKKDSELDKHLESRVEEIMEKSGEEGMPDLAHVMR 
j. xj x i^xm i-i f f\jyVj\j JjAvjW KJSi V IfcAVYSRLNPHRESDGGAGDLEDPW 

6709 

A 

172 i 

364 

F YL YL VMLC FIE P VKLLL I VG A Y I L * LLKT I HE FHNQC F VDTKS LKG 
xvnr ivci x v c r xc XuMPir rlK 

6710 

A 

1 

683 

KTGREPRGRFLLCLLFFTVITYFFVVIGIAPIFILYELDSPLCWNEV 
F I G YGS AliG S AS FLTS FLG I WLFS YCMED I HMAF I G I FTTMTGMAMT 
AFASTTIJmFLARVPFLFTIVPFSVLRS^SKVVRSTEQGTLFAClA 
FLETLGGVTAVS TFNG I YS ATVAWYPGFTFLLS AGLLTNFQPS VYVL 
»->»-> v x^*iorJXXXY/\ril>i r Ij X IyinJNJ t? V rJ v lijy 1 h>ir Vi\J_tJ_iLi 1 VGAYTTi 

6711 

A 

18 

292 

cuij vorr Xiiurx J.irjfxv^vjl_r JrKxr^PlviDISyPVPIiCDPRTQVSDCEQM 
WPCPETPAW PEGHAS A YY F I S PGS LPAVPRVS S S PTLGTRDATT 

6712 

A 

1 

687 

PTRPVFDQVRTLVLCECAHADRSGGAPSLCPHAAEPRGSPAPQACGS 

AZVK'f" r \/P<" 1 Zi QW\7T.T.r , 2\T»CVrJT*» r P/rLr* , '\7T , T riDVD Pnvnnmnrtt mT rkntmnn 

YGGTGGVGLDRYRSDYGGTGGVGLDRYRSDYGGTSGRGVDKVPVRLR 
WHRRRGVRQVLVRCTGSLNPCCG FGS KDRTHPCHKRSTGRGVGVGAA 
LRVSRPQPHSHSSLGRKGRVPGVSWWVSQGAAPPWEVTGQB 

6713 

A 

1 

119 

«Ai\.o x lnui a. v rLiCbi jvi v i o V X iJijivrl 1 VlMVISGHVKljAr 

6714 

A 

1 

339 

GTRKFIPLTAWSNWAIVKAASEPFQSENl^YPPAERISAEEGGDAAE 
GGEDSEEDFEENTDKPGDELISFEEHVGPSAAPKIEKPYMPRCLKQR 
RALRS SRLLIG 1 1 RS GRLQ 

6715 

A 

3 

470 

HERLEKNVKEVLED FAEDGE KKI KLLTGKRVQLAEDLTRTQHYTPGH 
HMAPHPALTLLLPCLGSDIRARLLFVMESLLLSSALTLCARVPLHLH 
PSHPLSLCPTPDRFLSSPGSQTMAFVGDTLFTLPVLRQPAPGHPPAQ 
ILSSVLDQATNSPM 

6716 

A 

2 

2942 

TGAPAWPSRRLRDLPAGGMWRLRRAAVACEVCQSLVKHS SGI KGSLP 
LQKLHLVSRSiraSHHPTLKLQRPQLRTSFQQFSSLTNIiPLRKLKFS 
P I KYG YQPRRNFWPARLATRLLKLRYL I LGS AVGGG YTAKKTFDQWK 
DM I PDLS E YKW I VPD I VWE I DE YI DFEKI RKALPNS EDLiVKIjAPDFD 
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KIVESLSLLKDFPTSGSPEETAFRATDRGSESDKHFRKVSDKEKIDQ 
LQEELLHTQLKYQRILERLEKENKELRKIiVIiQKDDKGIHHRKLKKSL 
IDMYSEVLDVI,SDxT)ASyNTQDHLPRVVVVGDQSAGKTSVLEMIAQA 
RIFPRGSGEMMTRSPVKVTLSEGPHHVALFKDSSREFDLTKEEDLAA 
LRHEIELRMRKNVKEGCTVSPETISLNVKGPGLQRKVIiVDLPGVINT 
VTSGMAPDTKET IFSIS KAYMQNPNAI I LC I QDG S VDAERS I VTDLV 
SQMDPHGRRTIFVLTKVDLAEKNVASPSRIQQIIEGKLFPMKAIiGYF 
AWTGKGNSSESIEAIREYEEEFFQNSKLIiKTSMLKAHQVTTRNLSL 
AVSDCFWKhlVRESVEQQADSFKATRFNLETEWKNNYPRLRELDRNEL 
FEKAKNE I LDEVI S LSQVTPKHWEE I LQQS LWERVS THVIEN I YLPA 
AQTMNSGTFNTTVDIKL.KQWTDKQLPNKAVEVAWETLQEEFSRFMTE 
PKGKEHDDIFDKLKEAVKEESIKRHKWNDFAEDSLRVIQHNALEDRS 
1 £>IJKyy WDAA JL Y r Mbfc»AijUAKxj KxJ I fc»NAl bNMVGPDWKKRWLi I WKNR 

tqeqcvhnetkn\elekmlkcneehpaVijvsdeittvrknlesrgve 

\7T^PQT.TKT^ r PWWrV\7VPP'H'PT TT r T2VT "KTTJr*>JT PDTJPU WVfYDUTTXmCPT T?r* 
VJJroJjl 1\U X wxiy V IrCxCrir J_ii\. L*\±jiixriL-vi LiLKKbr X X XyKrii? v JJo£*J-i£>\-. 

NDWLFWRIQRMLAITANTLR(MLTNTEVRRLEKNVKEVL 
EKKIK\LLTGKRV\QLA\EDLKKVREIQEKLDAFIEALHQEK 

6717 

A 

38 

359 

PVIDEILKFREVWKSFDRAVGPEWFPYAKAARHPFSSKITATTFLHL 
AS AAL YRTAETS STMQNYHQDWPTPGG I PETTLQGAYRKKI KRSDAA 
AAGTHSSPWCIPL 

6718 

A 

122 

282 

KPRSGASGAGWLPRGETKSLMCPPSGGCEIVEAWLRADRYAPWDVPL 
DSPPLA 

6719 

A 

2 

1010 

ASGEWRVSGGRPAGAGRPE EALAAG S D PRG AAARLACS APTPGGGTM 
PFDFRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFI 
TTEVVNELYVDDPDKDSGGKIDVSl^ISLPN 

HEVGHIDNSMKI PLNNGAGCRFEGQFS INICVPGNFHVSTHSATAQPQ 
NPDMTHVIHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYIL 
KIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRIIPAIWFRYDLS 
PITVTOTTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAWKK 
IQLGKMH 

6720 

A 

326 

514 

KPRSGASGAGWLPRGETKSLMCPPSGGCEIVEAWLRADRYAFWDVPL 
REETELSRRVPRSGL 

6721 

A 

1 

1076 

MSLCAQPSPPVLRGSx^CLSPSVNELYVDPPDKDSGGKIDVSLNISL 
PNlixiCE LivGuUl QDEMGRHBVGHIDNSMKI PxaNNGAGCRF liGQFS XN 
KNQELQ I LSQVGERS ALGRAEAPGGS WQGFAQHVPAFLRAR YCHSGA 

GGQDGVPGNFHVSTHIATAQPQIPDMTHVIHKLSFGDTLQVQNIHGA 
FNAiGGADRLTSNPIiASHDYILKIVPTVYEDKSGKHRDSYQYTVANK 
EYVAYSHTGRI IPAI WFRYDLSPITVKYTERRQPLYRFITTICAI IG 
GTFTVAG I LDS C I FTAS EAW KKT OT.GKMH 

6722 

C 

11 

79 

MliPILRHTFLRVTLRGKKACLL* 


a 
/a 


x ^ o 

VTf ^KT.MTTTJT.r , T.r>TJQPT.nVV'lVHTVKrPQ , rT T?f^t>T?r > »r , TlQT CDUfrvnTQT 
X XVOXVJ-ll'lJ. J. -l^ J-i\— tJLJVt O XVLjVj V V/xtl X Jlvr O X liabr KyoiVOliO f KuVyi Oil 

GNIARSCLYPKNIFFLN 

6724 

A 

i 

X 

*± O £. 

PP YQPf3f2QMPT^Vf , T^QPWr!PT^O\7PT^CT^T^\7^ATrTTW^ T 

AKFDYVGRTARELS FKKGAS LLL YQRASDDWWEGRHNG I DGL I PHQY 

T WODTRDRWPR 9 ^PTC^F TFVT ^PPPFPIfVTZiP Zir5ZiWr , PQf2r!TTVZ*'n 

IYL 

6725 

A 

123 

336 

RCFLCCCCFPSSLPACPFSPSEDG WERS S PKSE IEVI S EPPEEKVT 
ARAGASCPSGGHVADIYIiANINK 

6726 

A 

1 

227 

FGFLVETGFLQVGQACLEIiLTSGDSPAPASQNGGITGRSHRAGPTIF 
LKK^FKl^HSLCSSl^SIIGSB^SNTY 

Sill 

A 

1 

131 

MPVLLTSDEIWKSSVKAGFAKVDQQQAPKSVQLLTVKTRDSTSNRGN 
SGPRSQATDRICRTPYHFEDC*SGSTASPQVSSASDSEDKRLNK 

6728 

A 

3 

364 

GSCSVAQAGVQWCDLSSLQLLPPGSSSSPASACQIAGITGVYHHSQL 
I FVFLVEMGF YHVGQ AGLELLAS GD P P AIxAS QSAEI TG VNYRAW PHV 
LYFFQVILKSTTRNVGMGFGMYNGNR 
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6729 

A 

2 

197 

QAGVQWLVLSSLQTLPPRFKPFSCLRLPSSWDYRCLPPCLAKFCIFS 
RDGVS SCWPGWSQTPDVR 

6730 

A 

2 

422 

NGMIiGNMLKSSLIFKBDLQETAMETWSIDQVCSWLEEKNLRBtiVYRF 
QEEKLS GAALLAIiNDRMVQQL VKK I GHHAVLMDL I KKYKHNTQGLKS 
P ENP KKAAWM OTE AARJD YRD E E S S S PARHOPOM P pyp ZaWNTT ,nMP 

6731 

A 

33 

294 

SQHFGRSRWVDHLRSGVRDQPGQHGETPSLLKMQKFSQAWCWDYRHI 
TPH PANS C I FVEMC3 FR YVAOTd F KHT . A 9 <?DPP A cv 0 nMT . 

6732 

A 

2 

670 

RSRGIKDFENDPPLSSCGIFQSRIAGDALLDSGIRISSVFASPALRC 
VQTAKL I LEELKLE KK I KI RVE PG I FEWTKWEAGKTTPTLMSLEELK 
EANFNIDTDYRPAFPLSALMPAESYQEYMDRCTASMVQXVNTCPQDT 
GVILIVSHGSTLDSCTRPLLGLPPRECGDFAQLVRKIPSLGMCFCEE 
NKEEGKWELVNPPVKTLTHGANAAFNWRNWISGN 

6733 

A 

1 

171 

YELWYLLNG 

6734 

A 

1 

789 

MGFRHVGQAGLKLLTS SDLPALAS QVQCKL WPAS AERH I S PLQRGL 
I AF S L» *R PD PC5 "L A CJD R A 9 T .fi V PT . Cd VP T HUPP P C t> TTD td up a "D t a n \/ c 

CSAYSKCLYQFRPNVGHSASTPKAMCQTTAAGEGAHAADEGYTQRW 
GARRGTPQRYTGGAETKNRAFRSYPKMETSWPRGGGSTPPVALDAQV 
ERECALG SGAOGKHS CLAVRWL T FC INPHY POTT .TVP T . PPVT .PT .P Q 
GGTPTS WRALFFPS S SDSNCKLCGGTS S 

6735 

A 

3 

363 

FFFFFFFFTjPO^T.QTi^PPT.PPPnTT QZiTJr r klT.PT t rcTMOcacBcnTrn 

i i i r r f r r DAyoJU^JJO tr rvxjIZ>v_.r\.vj X X 0>\T1v~INIXjIvXjXjv?0 IWo oAdAo^Vn 

GTTGACHHVQP I VP VFLGETG FHHVGQAGLKLLTS SDP PAS ASQRVR 
ITGMSHQIWPKLSLIKSLSDAQIRE * 

6736- 

A 

2 

379 

S CER F S T S O VIiC S OAL S S G KH YWEVDTP Nr WW A Vf5 V A Q W PM ^ P nnv 

LGRxTyiDSWCVEWKGTSQLSAWHIWra^ 

IxACDTEDDQDMLLSGVPHLSSYLGSCIGMIW 

6737 

A 

lie 

373 

E I FGEQAVLRGSPGGPKDPG S FQCRLP VLQTWGGLRHDALDHAYGMP 
GGICVHASGYIGGHRTREGALSMARATLAQRSYLPQIC 

6738 

A 

1 

164 

RAKAJ<P.RlTrTVSCPJTOHLKIVYRRFRYRLYV 
CKKITVP 

6739 

A 

3 

122 

R I MI^RKVLKHHQLLGE VLTQLS S R FKPR VP V I KVQD FT | 

6740 

A 

83 

2455 

IRIAALDDFRTSLTMSSTRSQNPHGLKQIGLDQIWDDL.RAGIQQVYT 

RQS^KSRYMELYTHVYITTCTSVH^ 

AQFVGLELYKRLKEFLKbTYLTO^ 

FSSKVIxNGICAYlxNRHWVRRECDEGRKGIYEIYSlx^ 

LNKQ VTN A VLi KX. I E KE RNG E T I NTRL I S GWQ S YVELG LNEDD AF AK 

GPTLTVYKESFESQFLADTERFYTRESTEFLQQNPVTEYMKJCAEARL 

LEEQRRVQVYLHESTQDELARKCEQVLIEKHLEIFHTEFQNLLDADK 

NEDLGRMYNLVSRIQDGLGELKKLLE 

P KM YVQTVLDVHKKYNALi VM S AFNNDAGFVAALDKACGR F INNNAVT 

EDKDVFQKF YAKMIxAKRIiVHQNSASDDAEASMI S KLKQACGFB YTS K 
LQRMFQD I GVS KDxxNEQFKKHLTNSEPLDIjDFS IQVLS SGS WPFQQS 
CTFALPSELERSYQRFTAFYASRHSGRKLTWIiYQLSKGELVTNCFKN 
RYTLQASTFQMAI LLQ YNTEDAYTVQQLTDS TQ I KMD I LAQVLQ I LiL 
KS KLLVTjEDENANVDEVELKPDTL I KL YLGYKxSTKKLRVN INVPMKTE 
QKQEQETTHKNIEEDRKIjLIQAAIVRIMKMRKVx^KHQQL 
SSRFKJPRVPVIKKCIDILIEKEYLERVDGEKDTYSY1A 

6741 

A 

83 

2450 

IRIAALDDFRTSLTMSSTRSQNPHGLKQIGI»DQIWDDIjRAGIQQVYT 
RQSNiAXSRYMELYTHVYNYCTSV^ 

KDGEDLMDES VLKFYTQQWED YR FS S KVLNG I CAYLNRHWVRRECDE 

GRKGIYEIYSLALVTWPJDCLFRPlxNKQVTNAVLKLlEKERN 

RL I SGVVQSYVELGLNEDDAFAKGPTLTVYKES FE SQFLADTERFYT 

RESTEFLQQNPVTEYMKKAEARLLEEQRRVQVYLHESTQDELARKCE 

QVLIEKHLEIFHTEFQNLLDAD^ 

LETHIHNQGLAAIEKCGEAAIxNDPKMYVQTVL 
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NDAGFVAALDKACGRFINNNAVTKMAQS S SKS PELLARYCDS LLKKS 
S KNPEEAELEDTLNQVMVVFKY I EDKDVFQKFYAKMLAKRLVHQNSA 
SDD AEASM I S KLKQACGFEYTS KLQRMFQDIGVS KDLNEQFKKHLTN 
SEPLDLDFSIQVLSSGSWPFQQSCTFAliPSELERSYQRFTAFYASRH 
SGRKLTWDYQLSKGELVTNCFKNRYTLQASTFQMAILLQYNTEDAYT 
VQQLTDSTQIKMDILAQVLQILLKSKLLVLEDENANVDEVELKPDTL 
I KL YLG YKNKKLRVN INVPMKTEQKQEQETTHKN I EEDRKLL IQAAI 
VRIMKMRKVLKHQQLLGEVLTQL\SSRFKPRVPVIKKCIDILIEKEY 
L\ERV\DGEKDTYSYLAIiTLSGKGLDLCDPQQIVHVGKE 

6742 

A 

3 

318 

EHNTPLTSGFVFQEWALTNYYSVKHMDLCLTVVDRAPGSLIKI^GCR 
ENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVSGPALSQ 
QWKFTLNLQQ 

6743 

A 

17 

174 

P PA WPHEDRR F S PHFLDWAAFGVMTL P S I G I PLLLWYS S KRKYDTP 
KTKKN 

6744 

A 

3 

711 

LSSCLCGSGKAFGMPTMRLLSFVAnaAIiFAVTQAEEGARLLVSKSLLN 
RYAVEGRDLTLQYNI YNVGS S AALDVELSDDS F P PEDFG I VS GMI»NV 
KWDRIAPASNVSHTVVIiRPLK\AGYFNFTSATITYLAQEDGPVVIGS 
TSAPGQGGMLV\QREFDRRFSP\HF\LDWAAFGGHDPFPPSGIPLL\ 
LWYSSKRK\YDTPKTKKNLIWGFHKPSSPKKSKAPLPKKPQVLSQTP 
KGYP 

6745 

A 

74 

472 

GDDETKLTLHGLQQYYVKLKDDEKNRKLFDLLDVLEFNQVVIFVKSV 
QRC I ALAQQFKDFQRRIDVATNLFGRGMD I ERVNIAFNYDMPEDSDT 
YLHRVARAGRFGTKGLAITFVSDENDAKILNDVQDRFE 

6746 

A 

1 

1371 

MAENDVDNELLD YEDDEVETAAGGDGAEAPAKKDVKGS YVS IHSSGF 
RDFliLKPELLRA I VDCGFEHPS EVQHEC I PQAI LGMDVL.CQAKSGMG 
KTAVFVliATLQQIiE PVTGQVS VLVMCHTRELAFQ I S KE YERFSKYMP 
NVKVAVFFGGLSIKKDEEVLKKNCPHIWGTPGRIIJU^UmKSLNI*K 
HIKHFILDEODKI^EQIXiMRRDVQEIFRMTPHEKQVMMFSATLSKEI 
RPVCRKFMQDPMEIFVDDETKLTLHGLQQYYVKIiKDNEKl^RKIiFDLL 
DVLEFNQVVIFVKSVQRCIALAQLLVEQNFPAIAIHRGMPQEERLSR 
YQQ\FKDF\QRRILVATN\LFGRGMDIERVNIAFNYDMP\EDSDT\Y 
\LHRVARA\ORr C3TK(jXjAlT.r VSDh.NDAKllaNiJVQ\DRr IiVNl£>r.ljF 
DEIGHLLPTLEQDTVEDSPILGMLTVCSFQGGGTPGWG 

6747 

A 

2 

139 

AGSELGEGEEEVGLSYL/DERTNSEEGGLRGEKRKRDAENDGEEEDE 

6748 

A 

3 

270 

S YLMKEE I QDEEDDDD YVEE GEE E EE EEEGGLRGE KRKRDA VKEE I Q 
DEEDDDDYVEEGEEEEEEEEGGLRGEKRKRDAVKEEIQDEEDDDDYV 

ppprrrrpn l.' L.'f/***T dppvtjvtjtmi Dnnrrpunn 
EEGEEEEEDEEG(j±jR(jEi^^ 

6749 

A 


301 

QKGPDENRGALGPGGDQPRNTPVPSRPGSLELSSPIKLMVAPRKKG 

6750 

A 

4 

366 

RHRVSLYPSE/WSAVAQTHSSL*P*APWELK*SF*LSLPSS*K*RSI 
QLCPG*LRNFFFF/CGRDGVLMLSMLVFWSWTQVMLLPQPPQVLGLR 
VELDDTFLKVS S LVLFCLKQG YKKR I WNP 

6751 

A 

3 

263 

KMHARDFTVS ALARG I D VQQVS L V INYDL PTNRENY I HR I GRGGRFG 
RKGVAINFVTEEDKRILRDIETFYNTTVEEMPMNVADLI 

6752 

C 

54 

161 

MLPS 1 0 ITVEKRMNVFQYS QDAGFS P VN I VFT I GR * 

6753 

A 

161 

1122 

RSLSFV7VSMLTVLRSLPLFSRELLFPVLKIQKVILALGDYMGATCHA 
CICGTNVRNEMQKLQAEAPHIVVGTPGRVFI)MLNRRYLSPKWIKMFV 
LDE^EMLSRGFKDQIYEIFQKIJ^TSIQVVLLSATMPTDVLEVrKKF 

MDnOTDTT MVVL'L'T T'T t»pTV/^'C , VT\nn7DI?I7I»JVT r^rpT pnT VPTT.TTTY^ 

NKLfriKlLiV J\J\JiEljlIjEOXl\.y.r x XJN VEKEEWJvLj1J11jL.1JXjXE1 XjJ. X IV 
AVIFrJTCRRKVDWLTEKMHARDFTV^^ 

S SRVL I TTDLLARG ID VQQVS L V INYDL PTNRENY I HR I GRGGRFGR 
KGVAINFVTEEDKRILRDIETFYNTTVEEMPMNVADLI 

6754 

A 

1 

891 

MDPDGVIESNWNEIVDNFDDMNLKESLLRGIYAYGFEKPSAIQQRAI 
I PC I KGYDVIAQAQS GTGKTATFAI S I LQQLE I EF KETQALVLAPTR 
El^QQVVLLSATMPTDVLEVTKKFMRDP I RI LVKKEELTLEGIKQFY 
INVEREEWKLDTLCDLYETLTITQAVIFLOTRRKVDWLTEKMHARDF 
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TVSALHGDMDQKERDVIMREPRSGSSRVLITTDLIiARGIDVQQVSLV 
INYDLPTNRBNY I HR I GRGGRFGRKG VAINF VTEEDKR I LRD I ETFY 
NTTVEEMPMNVADLI 

6755 

A 

2 

104 

L I TTDLLARG IDVQQVSLVINYDLPTNRENY I HR 

6756 

A 

957 

1285 

HGTRRRRDVIMREFRSRVKVRVTjITTDLLGSRGLMCNNVSLVIN/YM 

T^T.'DT'IJX PTTMV VTM'PTr2Pf2f2QKIT.r2f3K'f3\72iT'KIT?r , VT VVTDPPT DnTPTP 
LHjf xIN \i\JC»lM \ X JLnXXoKVjuoWJubo/V.'jVHlWf V_ I i-i rvJx 1 K r LKU 1 b i r* 

LPILQWRRMPMKCGLTLF 

6757 

A 

1 

716 

TPKRAPKKKMAI SGVPVLGFF 1 1 AVLMSAQBSWAI KEEHVI I QAEFY 
LNPDQSGEFMFDFDGDE I FHVDMAKKETVWRLEEFGRFAS FEAQGAL 
ANIAVDKANLEIMTKRSNYTPITOTKFTPPVV1WTWLRNGKPVTTGV 
SEWFLPREDHLFRKFim,PFLPSTEDVYDCRVEHWGLDEPLLKHWE 
FDAPSPLPETTENWCALGLTVGLVGIIIGTIFIIKGLRKSNAAERR 
GPL 

6758 

A 

2 

835 

FLSVLPHSRALLTPKRAPKKKMAISGVPVLGFFIIAVLMSAQESWAI 

KPEIWTTOAEPYLNPDO^nEPMPDPDRnPTPinmMttTnfFTVWPT.PPP 
ivDCiii v x -L \^/-vxjjt x i ni c i/your»r nr -Lv r uui/Ci _L v xt v < - 1 1 l n i\ r n x V r* t* 1 ir *" r 

GRFASFEAQGAxxANIAVDKANLEIMTKttSITC 
PVELREPlWLICFIDKJFTPPVVlSrV^^ 

ENTLFR/KFriYFP\FLPPTEDVxT)FRVE\lWGLDEPLLKHWEFDAPS 
PLPK\TTENVV\CALGLTVGLVGI I IGTI FI IKGI \RKSNAAERRGP 
L 

6759 

A 

759 

1022 

PTQGQRKGTSHLGGGADAGCTAAEEPGGTAAASGAGAGRAGD * HPGT 
GAQHHHPPAVPGEAPGEETQGQVQEEPAEAQGWQPHQPPF 

6760 

A 

2 

689 

CISTSTGSDNMQVRKGTEGPKIPPWREREERGGGLVASAYQKDPLVH 

QKIVVLGEQELPSGVSSLLLLLTWLSALQPVKPDSFLTYKIETVIVP 

TLEYSEDYISNTCKTMRPASVQGKHSTNIGHDGDDDDEDDDEEMMIA 

SACLLQAELVNYERVKEYCLKVLKKEGENFKALra^ 

KALYYLKEARTQQPTDT1WIRYIQLTEMKLSRCSQREKEAM 

O/Ol 

A 


1U11 

AAAAGAASGSSGENGSAGRQTAAGCHGRHEACEARGRRPRLLNPGRR 

PLCVPGGGGGAPARGQVGAAAEPAELIRRAHEFKSQGAQCYKDKKFR 
EAIGKraRALLELKGLLPPPGERERDSRAASPAGALKPGRLSEEQSK 
TVEAI E IDC YNSLAACLLQAELVNYERVKE YCLKVLKKEGENFKALY 
RSGVAFYHIjGDYDKALYYLKEARTQQPTDxTWIRYIQLTEMKL 
QREKEAM 

6762 


2 

151 

CKTJl^KGG FOFA7ARHL 

EV 

6763 


3 

74 0 

R P T GE FR SN AD YOYOLLRCNVDL.LK T T OT jRT^TFNTKrPOrJF* YPPRT^TW 
tvc xucr nomrvu x ^ x \^ jj u rv v — i v v xjxjxj jxx. xyuvjju x x l liM Lyuc* x rrvj loin 

QFNFKFNLTEDMYAQDSIELLTTSGIQFKKHEEEGIETQYFAELLMT 
SG WLCEGVKWLS FHSG YDFGYLI KI LTNSNLPEEELDFFE I LRLFF 
PVI YDVKYLMK\ SCKN\ LKGGL\QEVAEQL\ ELERIGP\ QHQAGSDS 
L\ LTGriAF FKMREMF FEDHIDDAKYCG\ HLYG\ LGSG S S YVQNGHRG 
MHMEEGSQQGS PWTWK 

6764 

A 

296 

522 

I WVFHNLKGLFE I FCC I LKY FLE ELL I LF * KTKL * KHAS FLKNNPS I 
QDMESSTTVHRSGLGGSLCADVLSLQWL 

6765 

A ' 

3 

333 

F FF FFL I PFFF 1 1 OH F I HNKYKLFTATV1QCAHLHR G P. T .R ALTCALWC 
WIPYPGWRGYFLKRRHVFTTLFFKIKLGSSSKKYFKxTTTKGFQITLL 
RFVEKPKFKGQFGLG 

6766 

A 

1326 

1982 

TSEASRQSERVDSAALSALSLLSRSSKCRPWGTARVARAGGCLTPLL 
SRRFLETRPFTGPWDPGLGVTCWCCPPKRRLKSTPRPKFSVCVLGDQ 
QHCDEAKAVDATFEVCLGDQVLSNANGFLSLSAKKYDAFIASESLIK 
QIPRILGPGLNKAGKFPSLLREEGQAICYSSTNLT*SSLPSSQVLCL 
AVAVGHVKMTDDELVYNIHLAVNFLVSLLK 

6767 

A 

11 

713 

EAMSSKVSRDTLxT5AVREVLHGTQRK3^ 

\KDKRF\SGT\VRLKSHSPALSFSVCVLGD\RQH\CD\EAKA\VDIP 
\HMD I \ EALEKTSTKNKKLV\ KKAGQRKYDAVFGPQSLLIKQIP\RI 
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PRAPGLNKAGK\ FP\ SLLTHNENMV\ AKVDEVK\ S PIKF\QMKKVLC 
IAVAVGHVKMTODELVFNIHLA\VNFL\VSLLK\KNWQKCSGAIiYYQ 
cIAFMGQo PKRLi Y 

^ •"» C O 
O/DO 

A 

z 

2 77 

PKVKGRVGAFPTYGTDIjEASIjLiS FEKJjDRAS pylwpeqlpgvaefaa 
SFKSPITS SPPKWMAE I EFJDDIDMLKATEMTRGKFLNILEKPKK 

T H O 

6/69 

A 

1 

"5 "5 if 

MAb P rDGTDJjEASLiJjS FEKLDRAS PDL WPEQLPG VAEFAAS FKS PIT 
S S P P KWMAE I ERDD I DMLKELG S LTTANLMEKVRGLQNLAYQLGLDE 
o j\.jc<* i x rv vj r\ r j. xjcj jvtr rvtv 

6770 

A 

132 

347 

NAKQVGDRGKPCVCHYGLSLAWDPCMLKYCHSRDRPTPYKCGIRSCH 

6771 

A 

3 

109 

CAPSPVRGAMASARLFALLLFFVGGVAESMTWKTP 



c 

X O \J 

nor liooir rvx JUIN wxvr jSAVAUbAoLbAl^oUAbolr JL*N JL AKVIKA^jUX AK 
SVLG* 

6773 

A 

52 

310 

YIIRTQTQISHHPLHYIl^SAKKKLCFGKSYFIRGSSFRKGQGHEER 
GLKYKKKTGEGV* * KRTKKQTKTKTKTIKYKLSRCCGNV 

6774 

A 

149 

333 

E FMLKW IYRIFVNL FLVFVRF FNCS FLCAEC I S LPGQECGGAEVS S F 
SRTFPQHLLSLYLM 

6775 

A 

1 

1616 

LFFFFFLESTRFY*FFFFFFFFFTIQTFKGGPLSPFFFLLFFIKPLP 
QFFFFIL 

sue 

A 

2 

138 

AS GHRE VQDS YEA YGQDDWNGT I TSLK/ AP PAR P VKGAYREHP YGR Y 

6111 

A 

1 

1212 

MQRRDDPAARMSRSSGRSGSMDPSGAHPSVRQTPSRQPPLPHRSRGG 

GGGSRGGARASPATQPPPLLPPSATGPDATVGGPAPTPLLPPSATAS 

VKMEPENKYLPELMAEKDSl^PSFTHAMQLLTAEIEKIQKGD 

EENYLDLFSHKNMKLKERVL I PVKQYPKEEELRKGGDPKYAHIiNMDL 

HVFIEVFGPPCEAYALMAHAMEEVKKFLVPDM^TODICQEQFLEIjSYL 

NGVPEPSRGRGVPVRGRGAAPPPPPVPRGRGVGPPRGALVRGTPVRG 

AITOGATVTRGVPPPPTVRGAPAPRARTAGIQRIPLPPPPAPETYEE 

YGYDDTYAEQSYEGYEGYYSQSQGDSEYYDYGHGEVQDSYEAYGQDD 

WNGTRPSLKAPPARPVKGAYREHPYGRY 

6778 

A 

3 

150 

YYDYGHGEVQDSYEAYGQDDWNGTRPSLKAPPARPAKGAYREHPYGR 
Y 

6779 

A 

2 

1622 

ASVATAPAJLPPPPTAARASVAAASLSRSLDRTSSQMQRRDDPAARMS 

RSSGRSGSMDPSGAHPSVRQTPSRQPPLPHRSRGGGGGSRGGARASP 

ATQPPPLLPPSATGPDATVGGPAPTPLLPPSATASVKMEPENKYLPE 

LMAEKDS LDPS FTHAMQLLTAE I EKI QKGDS KKDDEENYLDLFSHKN 

MKLKERVL I PVKQ YPKFNFVGKI LGPQGNT I KRLQEETG AKI S VLGK 

GSMRD KAKEEELRKGGDPKYAHLNMDLHVF I EVFGP PCE AYALMAHA 

MEEVKKFLVPDMMDDICQEQFLELSYLNGVPEPSRGRGVPVRGRGAA 

PPPPPVPTG\RGVGPPRGALVRGTPVRGAITRGATVTRGVPPPPTVR 

GAFC \ * RARTAG \ IQRI PJjPPPPAPETYEEYGYEDTYAEQS YEGYEG 

x xbyDybUbb x lUibHbb \ V y \lJb x EAJjWPRTNW \NGTRP \ oXiKAPXi 

APPVKfJ\ ZVYPPPPYrJUT.T.KTTrRPf^TrTvJTQVTJOQT.T T "ntnWT .Df3T "D^Zi. T 
tr V rvAJ \«ll\Dr r XOI\XJXJlvX I\XuZ»IjXY1>I X O X XZi^OXiXjijXVX 4 Xj X Lirul ir VxvLf 

PTTDQGNWSKCFFFRGSPSFSPTLIPFIj 

6780 

A 

1 

76 

RGFFFFFFFFFFFFFFFQQKSQIY 

6781 

A 

1 

287 

CFL*DRVLLCOTGWSAVAHNHSSLQP*TPGIKQFSCLRL/SSS*DYR 
CTS PYLAI FFFF * \ FFCRKQSLATVPRTI SNS WPQAIIiDSWSKHVE I 
TGF 

6782 

A 

3 

64 

AASTFNFYFYFET TVDLHAV 

6783 

A 

2 

69 

FFFLCFFNFYFYFE I I VRFTCS 

6784 

A 

1576 

3458 

KINLl^LNPKKKKKKKW 

MICEKNAXEELIRWEEGKXWQAXIEGIRNKLKEK^ 
KDLFAKADKEKLTLQRKI.rvTTGMTVDQVLGIRAL 
LDLFJTOILYMRAHQALPRDSVVEDLH^ 

YSKPSISGIESDDHCQREQELQK^LKLSSENIELKFQIjEQANKDLP 
RLKNQVRDLKEMCEFLKKEKAEVQRKIiGHVRGSGRSGKTIPELEICri 
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GLMKXVVEKVQRENEQLKKASG I LTS EKMANI EQENEKLKAELEKLK 
AHLGHQLSMHYESKTKGTEKI IAENERLRKELKKETDAAEKLRIAKN 
NLEILNEKMTVQLEETGKRLQFAESRGPQLEGADSKSWKSIVVTRMY 
ETKLKELETDIAKKNQSITDLKQLVKEATEREQKVNKYNEDLEQQIK 
ILKHVPEGAETEQGLKKELQVLRLANHQLDKEKAELIHQIEANKDQS 
GAESTIPDADQLKEKIKDLETQLKMSDLEKQHLKEEIKKLKKELENF 
frot r &a±EtULil\jLi4x niihV JvKI>J XXiX>E*JbiJ\ V ivKJjbr^Q-L»G VEXiTS P VAAS 
EEFEDEEESPVNFPIY 

'6785 

A 

3 

841 

GKRLLGPQWLDYDCDRAVIEDWVFRVPHVAIFLSWICKGFLVLCSS 
LDLTTLVPERQVDQGRGFESILDVLSVMYINAQLPREQRHRWRLLFS 
SELHGHSFSQLCGHITHRGPCVAVLEDHDKHVFGGFASCSWEVKPQF 
QGDNRCFLFS I CPSMAVYTHTG YNDHYM YIiNHGQQT I PNGLGMGGQH 
NYFGLWVDVDFGKGHSRAKPTCTTYNSPQLSAQENFQFDKMEVWAVG 

T~>D CTTTTr^T.Ti W^MVC TT.T" , lAT»"D'I?Af'\7\T T T7T CPUCDUPur'T T5 dt 7T>T\T^T7 
i-> ir o JCj Hi V XjMJvvjIV iVb X JjJJ/vlJr*li/\yiVLiijii«L ol^rtoKJloEoXrKE Vlrl/LiE 

6786 

A 

1 

960 

MGKDFMTKTPTPMATKAJCIDKVTOLNKiKSFCTAKDTIVRVNRQPIEW 
DA.lrAi x Ft>L»ls_KL»lAl X xisJiljJvQI xlvl^XNNLiIKKQTGCGADLiQQTP 
ADLQQTGLSVRKIOT)KQKGIASTSTKRTSTQKPHPKITNMKTKEIQT 
T I KE YYEHL YTNKLENLQEMDKFLDTYT FPGIiNQEE VQS LNRP ITS S 
E I EAVI NS LPTKKS PGPDRFTAE F YQRDMDEAGNHHS QQTNTGTENQ 
TLH I LTHKWELNNENTWTQGGE YHTPGP VRVWGNRGG I ALGE I PNEC 

^QCipATTincjWMPTJnPT^PT.TnPPfiPPPTA/TT^TPCPT TTCnTf 

6787 

A 

507 

755 

NKRGYKQMEEHSMLMGRKNQYRKNGHTAQGNIi*IQCHPHQATNDFLH 

6788 

A 

274 

882 

PDQVDRGG AHWQTD ADD PWLADHLQHCQRCGE I LRHYSGG I RGNVS A . 
VKCAEHHVPAFARLRNPQCGDFQRLDYRLFDS PGVKRREL* TAYRFC 
HVAP * LMDLRSGWAAGAVYRYQSH * FTADRLRS \ VDMDEAGNHHSQQ 
T\SQDQKIKQRTFSLISGS*CTRTHGHREGNITHRGLSRPNTARILC 
AS KAHLGLPLRS SLP 

6789 

A 

114 

329 

NKRGH I QMEKHSMLMDRKNQYHE YGHTAQSNLWI QC YPHQATIDFLH 
R I RKNYFKVHTEPKKS P YSQDNPK 

6790 

C 


1983 

MGDGTSDFDP IGDIGIRQSNYSCGGCPR I PYVWRVTGRAHGTQTI VV 
FMAKVYTGNVMRREKDSGGAWGNPWAGILLPPPSTRNCTVHSSFSNE 
NI FPQHTS SCVWHNQYIGADFPWSNFS FWE YNYVEDS CWTKHCKE 
LNIPYMAFKVILPDTVLERSTLLDRFGGFLLEIQIPYVFFASEGLLN 
TPD I LQLLES KDMDEAGNHHS EQTVTRTENQTLHVLTHRWELNNENT 
WTQDGEHHTPG PWGWRELI I APG VE ATAL I IRQ I ADH S LMTS KRDP 
HEWLDKSWLKVSPSEENRNQISTLSSQSSASDLDSVIQEHNBYYQYL 
G XiG E TVQED KTT X LNDN S S I ME L» KE I S S FL P P VTS YNQTSYWKDSSC 
KSNIGQNTPFLINIESRRPAYNSFLNHSDSESDVFSLGLTQMNCETI 
KS PTDTQKRVS WPRFINSQKRRTHEAKGFINKDVSDPI FSLEGTQS 
±rXiJiY¥i\Jr JVJvJM X Wii»yJiIMJli , r , IMXiv xOAyy X AL-NKXix SQKvjNljr TDQvKCIj 
SDESEGLTCESSKDETFWRELPSVPSLDLFRASDSNANQKEFNSLYF 
YQRAGKSLGQKRHHESSFNSGDKESLTEEKTEAQEKMTTEDRGRGWG 
YATTNRRMPRIAGNHQEPGERRGTPPPSEIAPEGSNPANTLILDFPPP 
EL* 

6791 


190 

4 86 

PT.T.WPPT.WT1* T VQCNTlfT HIT17W7CTDDTPDCT f' , 'D/T7T Ti;nT r>T MCTTKIM 

RCLVFCSCVSLLRMMVSSFIHVPLNFFNAFLLl^FFRFCPLLSCHMI 
KKSSS 

6792 

A 

1 

597 

MDENLPSELKANPTQTS YMPS I KS YRGAGQLKI AQESGVSVQGHWSM 
KREGFQKLERTHKNFQMKFSRWGIAVGIOjNEQHDCHKEEAVTGTSCM 
KELKl^K^YRRIQQHRDMDEAGNHHSQQTIRRTENRTLHALTHKWELN 
NEOTWQEDPSTPWSTATNRIQVTSSGLWQKASSVSPLLPVPTLPW 
SRFHIIAQGIF 

6793 

A 

1 

721 

MTS YNEQNKKP VTD PNEMA I HED SNQE F KI AWNETQR S PRS S PTEH 
LMTPPPLPTREQQPPLTVIFHYLPKSYKTAPPLSPFPDSLFGLSQPA 
PRDMDEAGNHHSQQT\ KQEQKTKHCMFSIiTSGS * TMRTHGHREGTV I 
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HRGLLGQFSQQLINKSEFSNTVCREKSGRVLQVNMESPL*MIQDALQ 
DQTTPKGPITVFL*LFYLDHLLRSSHKVASYLSTESRSRGSITVI»FI 
RCSEP 

6794 

A 

417 

698 

DRNRKANTACFQHSQVG VEQ* EHMDTGKV* SI* YGNYEHTSTCERME 
I QKATTEHLNL * S * SGS FLRFTWKMLQFS KI SRSFYQVXKHQQKIM 

6795 

A 

325 

1489 

RSPTLLMHVHYIKVISWRELLLIWLRAANASINPGVEQSPSGDPKTV 
TGFPFLRLEAADWVEEASVPKGVHIALSSQEQRAGDPRDLEAPSNLV 
ISERTHRSFRVSWTPPSDSVDRYKVEYYPVSGGKRQEWRTPGVPDL 
RLPAIIPQAAWLLAHLPAPSAPPTELPTKHTEAQSQRCWSLLLCSVT 
EVAAPSQWKEAPPSFHTGTENGYQGEESLFNKAYYGGGTNFFRKES 
QKLQQS AKKRDAE IjANGALG 1 1 ELNNDYTLKKVMKPL I TSNTVTDE I 
ERANVFKMNGKWYLFTDSRGSKMTIDENKIPRNPTYKGREGPLQGEL 
QTTAQGNKRGYKQMEEHSMLMDRKNQYRENGLTAQA\PQPPPSGI*G 
YTAPCFPWENL I 

6796 

A 

274 

718 

QACYIYTTEYYAAIKNDEF/TVLCRDMDEAGNHHSQQTVTRTKNQTP 
HVLTHRWELNNENTWTQEGEHHTPGPVMLDSAPPTLGHQTPGSSAFG 
LWDLHQRLRLPRPQTKGSTVGFPGSEAFKLGLGHYWLSFFPSLQTAY 
RGTLPFNH 

6797 

A 

1 

870 

MDEAGNHHSQQTITRTENQTMHVIiTHKWELNNENWTQW 
AAAAQGAGKKKGFVNYTGPQRFGKGRKVHTDQIGLALLKNEMDAKGT 
LSLMPEFKVRERALLEAIiHRFGMTEEGCIQAWFSLPHSMRIFYVHAY 
TSKIWNEAVSYRLETYGARWQGDLVCLDEDIDDENFPNSKWLPVL 
G YN I Q YPKNKVGQWYHD I L S RDGLQTCR FKVPTLKLN I PGC YRQ I LK 

HPCNLSYQLMEDHDIDVKTKGSHIDETALSLLISFDLDASCYATVCIi ' 
KEIMKHDV 

6798 

B 

328 

2283 

MKLEAI ILSKLTQEQKNQTLHVLTHRWELNNENTWTQGGEYHTLGPV 
DGFQKKEEAAIGAILQPLLVIQTVLGVDLQQTLADLQQRGIiLEEKLT 
NRKK I AHTVRDH I QREHNS S PAREQNWMENTFDELTQVGFRRWVINS 
SELKEHDLTQCKKGTWMMLETI I FS KLTQEQKNQTLHVLTHKWELNN 
ENTWTQGGEHHTLGPWRMFQIYWAERMLDKIPSLAFGLLDLRQWF 
ARGCWSFGHRLKAARITAPRLAREQNWMENEFDKLTEVVFRRWVITK 
SKIjKEHALTQCKEDKNLEKKELATQQKMSSRTASETVSSVFTATPHCY 
HYYLSSASYHISGSIRFPVLEKIRLHMGLENKCKVLLSGGSSSQQMY 
GDLEGGWNGKWFPWSWAVQEPASPPTTLGQISLTVCCSTVNGLPAS 
VDVFFCREAIiPRNLQKFPAKDIXSCILLQYIDNIiLLGYSTAVGCARRM 
D ALLRHLEDCG YKVS KKKAQ I CRQQTEGEQTI RERWI KLSDGR I AVP 
QLLGAAVVLAVHKTIHLGQESLEKLLGQYFYISHLPALAJCTVAQQCV 
TCQQRHAKQG P S VPPG I QA YRVAS FENLQVD FTEMPKCGDQGLERG S 
TVALVEGTPDHYPDHSHSYKGRRNPKLDSHSCVKPAT7VKT* 

6799 

A 

2 

472 

LKAVHRCTGPEFPGRQRLRHEQGRIiRGRRCPPGRLPLHREALFQPSF 
LGMEACGIHETTFNSIMKGDVDIRKDLYANTVLSGGTTMYPGIADRM 
UK-b 1 T AliAPS TMKI KI I APPERKYS VW IGG Y I LGWLSTFHQMW I S KQ 
E YDESGPF I VHTTCF 

6800 

A 

x 

239 

yLLbbb I HGNPAIGDR \MQKQILPWAPAQMKIRFMAPPERKYSVWIA 
APILASLSTSSRMWISKQEYDESGPSIVHRKCF 

6801 

A 

2 

441 

TGIVMDSSDGITHTMPIYEGYTLPHAILPLDLAGQDLTDYFMKILKE 
GGYSFTTTAEREIMLDIKEKLCYITLDFEQEMATAASSSSLEKS*EL 
ijWCjy VI riSNQWFHCP/ EYLFQPSF\ FTS IMKCDVDICKDLYANTVTi 
SGGTTMY 

6802 

A 

2 

212 

PGSTHASTRREITALAPSTMKIKI I APPERKYS VWIGGS ILASLSTF 
QQMWISKQEYDESGP§ I VHRKCF 

6803 

A 

3 

445 

IVEFEQEMATAASSSSLEKSYELPDGQVITIGNERFRCPEALFQPSF 
LGMESCG IHETTFNS I MKCDVD I RKDL YANTVLSGGTTMYPG IADRM ! 

QKE I TALAPS TMKI KI I APPERKYS VWIGGS I LAS LSTFQQMWISKQ I 
EYDESG 

6804 

A 

1 

1470 

MSTRPRRPPQLGRSTGRARLASRVAAAAGPPGARAPLAGDRAPAAGA i 
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PRRAGAGPCRPDPFPPAAGPTRGSPREGGKTGNRGKEREKGRRAQSIi 
AFADP PP VATRRQLTMDDD IAAL WDNG SGMCKAG FAGDDAPRAVFP 
S I VGRPRHQGVMVGMGQKDS YVGDEAQS KRG I LTLKYP IEHG I VTNW 
DDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFET 
FNTPAMYVAIQAMLSLYASGRTTGIVMDSGDG\rrHTVPIYEGYALPH 
AI LRLDL AGRDLTD YLMKI LTERGYS FTTTAERE I VRD I KE KLCYVA 

LDFEQEMATAASSSSLEKSYELPDGQ\VITIGNE\RFRCPEAIiFQPS 
FLG\MESCG\IHETTFNSIMKCDVDIRKDLY\ANTVLSG\G\TTMYP 
G\IADRMQKEITAL\APSTMK\IKIIAPS*RKYSVWIGGSILGL\LS 
TF\QQMW\ ISKQEYDESGPSI\ VHRKCF 

6805 

A 

171 

393 

GKPKPKPKPLPKPKSREISHIP*GL*KAGPFRAPGRPGCKPCWLGPS 
\ \j FOvjK w O C P * KA S PE KVIiG EN LTL KN P 

6806 

A 

3 

264 

SRRGSSPAQGSLGRPQLLSAQYKWGVGGDRVSL*TMDPSLAPKTNQ 
RASQSKLDLQPA*APWP/PQLCGHPVPSPASCPKSPTFKPT 

6807 

A 

1 

632 

DHYARALELQIFKTLHYLSNLIQSIMTPLGTKENPARVCRDLMDCEQ 
TMVDGTYWVDPNLGCS SDT I EVS CNFTHGGQTCLKP I TAS KVE FA I S 
R VQMN FLHLL S S EVTQH I T I HCLNMTVW QEGTGQTP AKQAVR FRAWN 
GQIFEAGGQFRPEVSMDGCKVQDGRWHQTLFTFRTQDPQQLPIISVD 
NLP PAS S RES S TAWEVGPAC FL 

6808 

A 

1 

591 

MTSVHSADDGNIWQHPLQCGGDDRDLVEFNLGSPKNVGPFLAVDQKH 
NILLKYRCHGPPIRFTTVFSNELHYVANELNGIVGGKTQCTFPIjEVY 
IRRLRNIRRAVVRLLDRSPQTVVVIRTANAQELGPEVSLFNSDWYNF 

QLDTILRRMFSGVGVYIxVDAWEMTLAHYLPHKLHPDEVIVKNQLDMF 
LSFVCPLET 

6809 

A 

783 

1516 

RNHI SSNFRETSHDFNPE ILLLLGQDKVALQKLND7VATKLQACGDLV 
EFNIiGSPKNVGPFLAVDQKHNILLKYRCHGRPIRFTTVFSNELHYVA 
NELNGIVGGKNTWAIAVWSHFSTFPLEVYIRRLRNIRRAWRLLDR 
SPKTVVVIRTANAQELGPEVSLFNSDWYNFQLDTILRRMFSGVGVYL 
VDAWEMTIJUIYLPHKLHPDEVIVKNQLDMFLSFVCPLETYPVLEGTG 
GIIFNDLLN 

6810 

A 

1 

951 

GTRQVLNINDHAPEFSQYYETYVCENAGSGQVIQTISAVDRDESIEE 
HHFYFNLSVEDTNNSSFTI IDNQDNTAVILiTNRTGFNLQEEPVFYI S 
IL I ADNG I P SLTS TNTLTI HVCDCGDSGS TQTCQ YQEL VLSMGFKTE 
VIIAILICIMIIFGFIFLTLGLKQRRKQILFPEKSEDFRENIFQYDD 
EGGGEEDTEAFDIAELRSSTIMRERKTRKTTS7VEIRSLYRQSLQVGP 
uz>j\.x v KZvr x ijJbiUjlih^iNTDPCAPPFDSLQTYAJPEGTGSIjAGSLSSLE 
SAVSDQDESYDYLNELGPRFKRLACMFGSAVQSNN 

6811 

A 

122 

271 

FAPSDLFHYLLLQHLHLILWPLQMPRQGETSGNEPQESGHLRVPGDD 
AS 

6812 

A 

3 

132 

VWNDDTQS YVLNFHGR VTQAS VKNFQ I IHGNDQTQRCDTILI | 

6813 

A 

3 

302 

GAHPASSESRGVPGARWHRGRSGRPGSQEGEAAAPSGPDWSSKSGGY 

PRSSGPGRWLQPREGIAGSNSGKGKQRHGTSAHWMRIHLPFELGIDF 
VKDIM 

6814 

A 

2 

2158 

STGGQEEQEEQEQEEQEEQEEQEEQEEQEQEEEEEEKEEQEEQEEEB 

QEQEEQEGEEAATPSGGGRNRS AS S S WVG TMAG I TTI EAVKRKI QVL 

QQQADDAEERAERLQREVEGERRAREQAEAEVASLNRR I QLVEEELD 

RAQERIiATALQKLEEAEKAADESERGMKVIENRALKDEEKMELQEIQ 

LEEAKHIAEEADRKYEEVARKLVI IEGDLERTEERAELAESRCREMD 

EOIRLMDONLKCLSAARF.K^QOK17nTrVT?C't?T VTT T»nirr w r T?7\T?T«n a cr? 

w v» un.vJJonn£iDi\ i o\JJVC»UI\ I CiC*C J. 1\. JL JLi 1 1 J K 1 1 K K A R X iv/Vo.!? 

AERSVAKLEKTIDDLEDTNSTSGDPVEKKDETPFGVSVAVGLAVFAC 
LFLSTLLLVJ^KCGRRNKPGINRPAVLAPEDGLAMSLHFMTLGGSSL 
SPTEGKGSGLQGHIIENPQYFSDACVHHIKRRDIVJLKWELGEGAFGK 
VFLAECHNLIjPEQDKMLVAVKALKEASESARQDFQREAE lltmlqhq 
HIVRFFGVCTEGRPLLMVFEYMRHGDLNRFLRSHGPDAKIiLAGGEDV 
APGPLGLGQLLAVAS QVAAGMVYLAGLH FVHRDLATRNC I»VGQGL»W 
KIGDFGMSRDIYSTDYYRVGGRTMLPIRWMPPESILYRKFTTESDVW 
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S FGWLWE I FTYGKQPWYQLSNTEAIDC I TQGRELERPRACPPEVYA 
IMRGCWQREPSNATASRMCTPGCKPWPRHLLSTWMSWARGPAQGLGV 
VSRNTGACPQHPP 

6815 

A 

2 

186 

HELKEAETRAEFAERSVAKLEKTIDDLEDELYAQKLKYKAISEELDH - 
ALNDMTS IASLOPT 

6816 

A 

1 

866 

MMEAIKKKMQMLKLDKENALDRAEQAEAEQKQAEERSKQLEDELAAM 
QKKLKGTEDELDKYSEALKDAQEKLELAEKKAADAEAEVASLNRRIQ 
LVEEELDRAQERLATALQKLEEAEKAADESERGMKVIENRALKDEEK 
MELQEIOLKEAKHIAEEADRKYEEVARKLVIIEGDIiERTEERAELAE 
SKCSELEEElA KNVTNNLKSLA EAOAEKY^OKPDKYPPPTlfTT.TnKT 

KEAETRA\EFA\ERSVAKLEKTl\DDLEDELF\AQKL\KYKAIi\SEE 
LDHALNDMTSI 

6817 

A 

1 

639 

MLSIQSTHAEKELENFFQNTEDWYFHTSHHLATWIKQKSEKDPVHLQ 
KGKNGPSCTKGQCNPLELVITNPLDPCWKKEERVTLGIDGAGLDPQV 
H I LVRGE VYKHS PE PVFQTFCDELNVPVPE I PGKTRNLFLQLAWHVA 
QCLNVTSCYVCGRTIIGDQWPQEAQELVPTDPVPDEFLAQKNHPDNF 
WVLKPS I IGQYCIAREGKEFTHPIG 

6818 

A 

1 

1710 

MAVGVLTQTVRLWPRPVAYLS KQLDGVS KGWPPCLRALAAMAIjIiAQE 
ADKLTLGQNLN I KAFHTVVTLMNTKGHHWLTNARLTKYQ S LLCENPR 
IITDVCNTLNPATLLPVSESLVKHNCVDMLDSVYSSRPDLRDQPWAS 
ADWELYMDGSNFINPHGERCAGYAWTLNAVIEAKPLPQGTSAQKAE 
L I ALTRALELS EVHGALYKEKGLLNSRGKD I KYQQE I LQLLEAVWKP 
vrLV/ivruiLAbnyKAo l o V/AJj\jiNJoy llJoc»Ai<isJU\bTPYClAS VAAPLiLiP 
QTPDLVPTYSKEEKDFFHTEVGQVIKEGWIRLPDGKVAVPQLLRATV 
VLA^ETTLLGQESLEKLLGRYFYISHLPALAKAVAQCIQAYGAAPF 
EGLQVDFTEMPKCEGNKYLLVLVCTYSGWVEAYPTQTEKAYEVTRVL 
jjjvlalj x xrjvx' \jxjxt xii\. luoiinur v x* v>ixvxjJJ^ VEiXJNIVJJILjVJL W>\ 1 WXiV-NhiK. 

DPVQLQKGKSGPSCTKGQCNPLELVITiSTPLDPRWKKGERVTLGINGA 
GLDPRVNILVRGEVYKHSPEPVLOTFYDRT/h^ t?t n 
IiAKHVA 

6819 

A 

764 

1203 

FWLWVSFWVPCNTHTTSKSTSFSSPSTQATRFGVL/WFVIKASIIGQ 
YCIARERKEFTHPVGRLSCLRQKIiYNGx^ETVTSWSSNHTERNPFSK 
FPKLRTVWraPESHRDWTAPTGLYWICGHRAFAKLPDESAGIGGPKQ 
GAIGIKE 

6820 

A 

24 

225 

ARQAQAW FWAWRWAACLWF YFL WFS * NR Y * VSNN\ KNN I RRSLS S L 
NSSCKF/WSTRQQK*IPKMIiS 

6821 

A 

1 

1752 

MLSIQSTHAEKELENFFQNTEDVfYFHTSHHLSSLRFFSARESNCSMK 

LYEEAYMERYRDFLPATMWSAHTEVDVPGQATWIKQKSEKDPVHLQK 

GKNGPSCTKGQC^PLELVITNPLDPCWKKEERVTLGIDGAGLDPQVH 

I LVRGEVYKHS PEPVFQTFCDELNVPVPE I PGKTRNLFLQLAWHVAQ 

CLNVTS C YVCGRT 1 1 GDQW PQEAQELVPTD P VPDEFLAQKNHPDNFW 

VXiKPS I IGOYCTARRfiKPPTHPVfJPT.QPT.nr»VT vivir» r r r ri7'T T \7 , T , t»ri<jocia 
* W1V * u XNJ v*V' xrtxvxivjxvxir' inr V OxvXjO i - 1 iv Tt,^/ tv i ■ xlM\jix X £• A V InnobW 

HIERNPFSKEPKLOTVT^THPESHODWTAPTOT.YWT rrtWP A. va VT.pnw 
WAGSCVIGTIKPSFFQLPIKTGELLGFPVYASRKKRS I AI ENWKDDE 
WPP PER I IQYYGPVTWAQDGLWGYQTPIYVLNRIIRIiQAAIiEI I TDK 
I S RALTI LAWQETQMRNAI SQNRLALDYLlxAAEGGVYGKFNLTNCCL 
HVDDQGQWED I VRDRTKLAHVPVQVWHE FD PGAMFGKW F PALGG FK 
TLIIGVLIVIGTCLLLPCLLPVLIjQMIKSFIATLVHQNASAQVYYVN 
HYQS I AQKDISSKNKSENSH 

6822 

A 

110 

203 

GTPNSANTQGNSTPNPQVIFFPCRAYIiKIRIj 

6823 

A 

76 

353 

FFSLSSVSLPxxAKIIPIVNGQIHSVCSETPSHFVQVRRKLGSVIKSK 
LYLWVWSCLSNIKLHSLLRGFFKKSQNLELHVEYGRVSHESNNGP 

6824 

B 

64 

930 

^RSVWNFLKRHKKKCIFLGTVLGGVYILGKYGQKKIREIQEREAAE 
YIAQARRQYHFESNQRTCNMT^SMLPTLREALMQQLNSESLTALLK 
1TOPSNKLEIWEDLKI I SFTRSTVAVYSTCMLWLLRVQLN I IGGYI Y 
LDNAAVGKNGTTI LAPPDVQQQYLSS IQHLLGDGLTEL ITV I KQAVQ 
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KVIiGSVSLKHSLSLIiDLEQKLKEIRNLVEQHKSSSWINKDGSKPLLC 
HYMMPDEETPLAVQACGLS PRD ITT I KLFHKTEDMLGS PDFRQFLNT 
CFKPRF* 

6825 

A 

385 

666 

LSANPVWSTLiYIHRDKTNTSTCSFFSCLSI J Sl?PPT.avr , QT?QUT T?Trm/ 
HTCSAAHPHWQWRYHDHEGGFPRARLIHRTPDMLTPAIFPNVGNSTA 

6826 

A 

701 

930 

VAERQTGTGQGTVGD I TDRATSM * MRQRRGCCFATCC / FRHEGVPVP 
WQGRYHDHEGGFPRARLIHCTPDVLTPYHVP 

6827 

A 

248 

719 

TDEPSIiLSHTFCPKGALLSACSGACPLSSGTQAGMPPSTEALQPCRG 
IiTVPHQQDKGHRACC/VWKALAADWSVEWQKGRRGLGKALSVTSR/1 
RATSM * MRQRRGCCFATCC / FRHEGVP VPWERG YHDHEGGFPRARIjI 
HCTPDVLTPAISPNVGNSTA , j 

6828 

A 

1 

2489 

MTVPELAGETEHCQCLINNVLMYGQPYRVTLELELPESPVNQDLGMF 
LVTISCYTRGGRIISTSSRSCNVYNSGQRRKLLLFKTFSRKVVVVVP 
NEEDWKKRLELRKEVEGDDVPESIMLEMKAPVPSQAPAPAEAELVQV 
KVEAIVISDEETDVSDEQPQGPERAFPSGGAVYGAQPSQPEAFEDPG 
AAGLEEVGPSDHFLPTDPHLPYHLLPGAGQYHRGLVTSPIiPAPASLH 
EPLYLSSEYEAAPGSFGVFTEDVPTCKTCGKTFSCSYTLRRHATVHT 
RERPYECRYCLRSYTQSGDLYRHIRKAHNEDLlAICRSKPDPEVGPLLG 
VGLFTEIGPMSCFI SRHDI VIQQDDE IRLKI VGTRVDKNDI LHLLLH 
FPVPQEPWATVMEKRLQEAQLYKEEGNQRYREGKYRDAISLEHEILL 
HPRYFGPNLIJmnCQKLFTEVEGTCTGKYGFVIAVTTIDNIGAGVIQ 
ir ^kij r v ±j x .P V iv x J\A IVrKrr KAje* W D A WTQVNKFQD P CAG YAG S LA 
DSCIiADRCLWDRLHAQPRLGTVPTFDWFFGYDEVQGLLLPLLQEAQA 
AS PLRVLDVGCGTS S LCTGLYTKS PHPVDVLGVDFSPVAVAHMNSLL 
c»vjo Jr vj v A rJL#v,±r vjilr'.M.o o Lnr i f LriJ\UJ\\J vt Xjv?A V A.o bob FQXjjjIjDKGTWD 
AVARGARGPAPELRLARGRPGRSPADPALSGSTDRELEWNRDYGSSG 
GKDQPAPNGDCVGAATLFPQAFLSPFISHEMGSELKKKLFKRRRVLN 
RERRLRHRVVGAVI DQGL ITRHHLKKRANHQLLHPMARGGVQVAEQL 
GTQGLNLGPDTLLSRPWSTTLQYLPYCGPR 

6829 

A 

227 

548 

FHNKCPRRLESLRQQISIFGMPENHLLKMQILRSAPDSLDFPCAGPL 
HLGEELLLHRVEQVGAE VAR VQQDFVL * GDMVEHLPRPGRQAESTP P 
TVPGNPPETLRRR 

6830 

A 

3 

435 

FQVSKDRYGGQPLFSEKFPTLWSGARSTYGVTKGKVCFEAKVTQNIiP 
MKEGCTEVSLLRVGWSVDFSRPQLGEDEFSYGFDGRGLKAENGQFEE 
FGQTFGENDVIGCFANFETEEVELSFSKNGEDLGVAFWISKDSIiADR 
LS 

6831 

A 

37 

360 

VFPVDSQGLPEWTLPAYLVWEDVL,PCEQGSGWRQGIiGGKEEQGQAP 
ARD S PCR P I TCQRLAWS P S HFS PQ I S LEHE I LLHPR YFG PNLLNTVK 
QKLFTEVEGTCTGK 

6832 

A 

280 

1816 

KPRFPRKIHATSVGHAVLPGVPLFSDSSLWPPVCLRIiNREDTEIQIS 
GL.QGAGPGLVRAGAAILRPEEGGRDHIENAGCLGAAILREALYGKLR 
DVGAAMLVRPSWNGSGCGRSHLRYKIGSSRGEDEFSYGFDGRGLKAE 
woyr jar*r vjy i r viuLr ANr r# 1 Jbxi V r>±ji> J? oKNG£»DJjGVArWISK 
DSIiADRALLPHVLCKNCWELNFGQKEEPFFPPPEEFVFIHAVPVEB 

AETVl^Q^PRSNTVKVHCRPPMKANCSRLSVDCLSSWKWSIKSVT 
D S TAAE PC KT S A PG V S S EKALH FR OL OR VI , F F T .F T .T .fil? Tf3 T .P hvt . T . 

PNDLQDWACTSHQHSHKRPSDSA7UCAYFSVLCTSRCFSSFSTLMGDQ 
CTVSHVSTGTHPSSTTATRSCGQGPAH7VEVRGEWVNEQKEHSGSRRQ 
ISLEHEILLHPRYFGPNIiLNTVKQKLLHRGGRGPGTGKYGLC 

6833 

A 

3 

251 

GGALRLHQVPPALPLRGAVSGAAAVQGMSDCTPCCEVHLLQD*VPAG 
EARVQWHDLGSLQPPPPRFKRFSCLILPSSWDYRH 

6834 

A 

3 

251 

GGALRLHQVPPALPLRGAVSGAAAVQGMSDCTPCCEVHIiLQD*VPAG 
EARVQWHDLGSLQPPPPRFKRFSCLILPSSWDYRH 

6835 

A 

2 

340 

SFTLVAQAGVQVmDLGSLQPPPPRLKQFSCFNLPSSWDYRCPPPRLA 
NFCIFNRDGFHHVGQAGLKLLTSSDPPTSASQSARTTGMSHHTRPSI 
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FFFKRWGLTMLPRLVSNS 

6836 

A 

220 

412 

LQTFWAVAVGSSRYLCFHVPFVCLEFVAVISLPKNHLLQSLDGEHSP 
ARHPVLAVLPVP I PS FW 

6837 

A 

425 

619 

ijv-x' riJbilr VL-xjlsi* ViiVloJjFI^DIjIiQRLDAIjVAEEHLTVDARVYSyAL 
ALKHANAKP FEVPFLKF 

6838 

A 

1 

684 

nc» c> ir* i n y Wb j\y x 1 1 Jd kit. li X b EG KWVKLEKTTYMDPTGKARTWE 
S VKRTTR KEQTRDG VAV I P VLQRTLKYE C I VLVKQFRP P \ MGG YC I E 

SPA\GL\ TDDR"RTPPKOT.T.QT?'PT."PT?T? , rr2T AOPDT DDD/C«t Tnvirr-Mun->fk 

•^■c-rt. \kj xj \ iijuuD x r^i^r^^jjjjoixJc.l-j.Cjiili ± ^-LiyrvCjt^ljlrfcjC. / SLPAVCMGP 

QACPNCT\ IH I VTV/ TP INGDDAENARPKP/ KS PGDGEFVE \VI SLP 
\KNDLLORLDA2jVAEEHPHSGTPRVY<5\ YAT.ziT \ vwawaifDV ommi? 
LKF 

6839 

A 

3 

194 

EMESHYVAQSGLKIjLSSSKAPTIxAFQSAWITDI^ 
H I LD FKS KPR1TYLNKS 

6840 

A' 

1 

168 

PLSEDQRTALQEVYTIxAEHRRTVAEAKKKIEQQGGFTFENKGVIiSAF 
NFGTVPSNN 

6841 

A 

3 

191 

R I FH I VENDL Y I DFGGKFHCVCRR PE VDGE S P I S W YAVWRRFS Q VM I 
TCFSSNSDFGKHRKR 

6842 

B 

15 

377 

MAAIjCRTR AVAARSHPTiP VTTT^T^PlDl^ori^/riT^ir cr , ot?c?r'c , ovT7v tn?nxr*r> 
* M-u-u_iwn x jvn vxvujunr jun vr jjt r nr r rcvj V v» X iIiouoISovjo bnAJvclrKT 

RAGGFA^AYGAARRSFYRRWSPYRKVEPGRGSPKKCGILCIYAETIL 
PLTTDGTLQTTKLVHWTGS F I FVENDL * 

6843 

A 

2 

205 

TFLRRWGFTILPRLFLNSWAQEICLPWPPACLGITGMSHCAQSSWLF 
KNNFKFLWFQRARDHKPSHS 

6844 

A 

1260 

162 8 

br JjLijVbCr oNFS*NCSRPVFSLSLKFDFVFVLFCDGV1jLCHPGWIA 
VAQSRLTTASTSRI\KRFSCLTIRSSWDYRRPPPCPADFCIFSR/DR 

6845 

A 

3 

629 

ws,iA iri&KXiOijVbXiii%JbL>xjL>M VWKii I AHEREMQTAMQISQSWDESLS 

SSQS KTDFQAGEVRVQVKC I KTQVFIX3PLKKKKVKWRPE SQPQETLS 
KGTTNML S PDAAOL S DL S S CSDI LDG S S S S S GT. c; qr»PT . z\ Q zv ta tt c 
PVACSNSCSSFILMDDLSPK 

6846 

A 

15 

241 

I Ixl^XVxjTTTIOroKYVRFLTTVDVPHRKS FS FKMCSCPMGS SCPLHKKI 
S S WRP WQNS FQ I QEKSGREKQ ICS KFCS 

6847 

A 

93 

172 

DRVSQCCLGWCQTPRLKRSSGPHPPE 

6848 

A 

3 

82 

KGREG VHQI RRNEKRRWERRERERGS 

6849 

A 

2 

237 

GTRVTjLSEGWQRQKYCLNPGGRGCTELRSHHCTQAWATEQDCVSKKP 
NQTKTKIHEQVDLPS SNHCCSKGS I VYYFHW 

6850 

A 

407 

529 

YIjPMVKTrtATCKT.QTT.ZiTPTf CVT \7MCDT>ntJT nupT ncr>mr(? 
* iJrijvxvivjAiuNXioiiiHir JvoISJjVIVo IrKXJWJuo Wr JjCjoCJGNS 

6851 

A 

2 

383 

DFADFGTTIKQDFRLLGQTSVX>RLLQLSQGQAV1CGNQLLPVSLFVTL 

FVGGVRGF55 TT*T.QAFVF!OTnQ AVWVT nPCTPm t/-»t TTDi\vriirvT?t>M 
*: v ovj v Aor o j.Ltijo/i£i V f ruUJoA V W I ItlJuo lb VijUjii llf AY GVKEFM 

STELGQLDQLWKFDQLPKGHSMVNFTRGPNDGR 

6852 

A 

1 

444 

Ai^VllSAHREYIAC^ILGVIFLLSSICIW 
LFSVSILSGILCSIxaAVXiKFMLGKVXTSRALITDGFNSL 
ILLSAEVFKHDSAVWYLDGS IGVL IGLTI FAYGVKLLIDMVPRVRQT 
RHYEMFE 

6853 

A 

1 

279 

RW IAADAANALLPANRAMAYLKI QKYEEAEKDCTQAI LLDG SYS KAF 
±\vlh\j i ir XjoivjjWiiAAyiJr r* I VIjIjIjGTWEIRQAVTELSKIKKGIN 

6854 

A 

3 

991 

LEEFAAKKDYERVLELEPIWFEATNELRKISQAJoASKENSYPK^ 

VIKSTEGERKQIEAQQNKQQAISEKDRGNGFFKEGKYERAIECYTRG 

XAJU5GANALLPANRAMAYLKI QKYEEAE KDCTQAI LLDGSYSKAFAR 

RGTARTFLGKLNEAKQDFETVLLLEPGNKQAVTELSKIKKKPLKKVI 

IEETGNLIQTIDVPDSTTAAAPENNPINlxANVIAATGTTSIG<N 

DLF PTSDTPRAKVLKI EE VSDTS S LQPQAS LKQD VCQS Y S EKM P I E I 

EQKPAQFATTVl.PPIPANSFQLESDFRQLKSSPDMLYQYLKQIEPSIi 

6855 

A 

1 

318 

DIVT^TQSPSTLSASVGDRVTITCRASQSISSWIxAWYQQKPGKAPKLL 
IYKASSLESGVPSRFSGSGSGTEFALTISSLQPDDFATYYCQHRGAF 
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6856 

A 

1 

397 

VPAQLLGLLLLWLPGAKCDIQMTQSPSTLSASVGDRVTITCRASQSI 
0 0 vi liaw i yyjs.Jb'uiSuHt'jsxiij j. x l\-A±>»x>Jb©GVPSRFSGSGSGTEFTIjTIS 
SIiOPDDFATYYCOOYNSY<3P TTFf;nnTRT.PTVTJT\ra n 0 

6857 

A 

15 

95 

SVTSIEERTRPILSKTLKGILHKGTY 

6858 

A 

3 

350 

GRHTLGEN I ADNGGLKAAYRAYQNWVKKNG AEHS LPTLGLTKKKE KR 
NSCHLPQDPNPLSEHRHPPHGWAWLPGPVQSSSALSEAAAAQSPPQG 
RTAPAPRMPIiASAVQPPGVIjY 

6859 

A 

118 

325 

PSREKEKFLQDITDAEKYPIGLIYHREEKRWRWINNSVFNGKYVNMP 
OFPGDLGLLOKTKPE IAGFTLE 

6860 

A 

1 

135 

PR VRCH YDAGD CC S S TL S S KKV I P F A AD PD T . n P P Tr»p n p ir a t? T?xm 

6861 

A 

1009 

1140 

RQGIiPLVAQDGLELLGSSNPPTLASQSAEITRHEPLCPlSTLSRMT 

6862 

A 

3 

416 

HASETLLFISNDKKWGCLIAEHIQWGYRVIEEKLPVIRSEEEKVRF 
ERQKAWCCSTLPEPAICGISRIWVFSMMRRKKIASRMIECLRSNFIY 
GSYLSKEEIAFSDPTPDGKLFATQYCGTGQFLVYNFINGQNST 

6863 

A 

4 

252 

NS S LRKLYPNS GKAQFLLNHTLVPGLC IGLLLHVNIiLLLPGNVDTWL 
PGVQRPWHLGGFELDRSLETKASRSSGCRESQKGRT 

6864 

A 

3 

130 

TTTALGLLTIPIIIHPIDRSVDFLLDSSLRKLYPTVGKPSSS 

6865 

A 

1 

1590 

MVTVTQCSVRPQWPESEVGIMAP>VYISEREAQRGYA2^^ 
PAAWWLS YG VAS S YVLADAI DKGKKAGEVPS PEAGRS ARVTVAWD 
TFVWQALAS VAI PGFT INRVCAAS LYVLGTATRWPLAVRKWTTTALG 
LLTI PI I IHPIDRDHPLSSDESGSSSLQVGGFPPGLQPAQALPNSGE 
AQLLLIILWYLACASA5CFMSTSYSCQGMWTPGSLVSKDPGTWLREV 
ERWEHRAEADLGPSQPCPKHQI SSHQDPGP I PCS PVPAI PGYEVFTA 
PFKAYGFIQLVRGLLEAHEDKAAKSQGVPGRFGGEELGCLGWKEGCL 
VNGKNFHSHFVPQEVPKRLELSFQAFSFTGTPADVPSDATKKACSFH 
RGI^GWYTCTLSHVNNIWIjGNGGCHPV 

VYLLPLLGRFLPRLVYLLPLLVHFLPPIiMHFLiPIjLVHFIxAIjIj^^ 
LLVHFlJUjLAJiFLPLLWFLPLLVYFLPLLVHFLPLLAHFLPl^ 
SLSLLYMEVFPVH 

6866 

A 

1 

595 

GTRRLAPTAQRKRDENQSVKPEPGRGASVRGGNLQTLPAGGDFLTCP 
SARD FRRVPAGQTQVPAAETAVEPVPAWAGARGGRVMS E PQPRGAB 
RDLYRDTWVRYLGYANE\VGEAFRSLVTARVVWLSYGVASSYVLADA 
XUlNAjlvb/ r*Xjh.KUFiUjlLAbKSARVTVAVvDTFVwQA^ 
RVCAASLYVLGT 

6867 

A 

2 

4 53 
^ j j 

UbL'^ciOAo A^^^^KUKlatjKVMSEPQPRGAERDljYRDTWVRYXiSYA^ 
VGEAFRSLVPAAWWLSYGVASSYVLADAIDKGKKAGEDGGFPPGLQ 

•c riynuriii i\nuQrrtr run X JLi V xrOXiV-XVjrXjXiXjtll V rHjxjljLitrKjN V U L V* I >rWV 

QRPWQPGWV 

6868 

A 

1 

265 

EEGKAGAAGGGGGSCGAGSREQPSSHNCGGGVPvESGRRDSSPPSSPP 
RLRGASAHRRDPPPRPTiRT.T.QPT.T.nnfiDTDBDDnivn a at do 

6869 

A 

46 

194 

KDHAAFFTIPKRWRQPSSPSIDEWRNIMWSSHTVEYYTAMKRNKQLY 
SQ 

6870 

A 

4 

228 

DRFC^AEAEVQRYDHSSVQKPRPPMLKLSSCLSLPSNWDYEITGAPP 
HPANFLTFCRDKGLTLCCPGWSQTRGLQ 

6871 

A 

1 

288 

ALIYWQRKISIWPALTQVAIQYLSCPMCSWQSECIFTKNSHFHPKQI 
MSLDFDNIEQIiMFLKlWLKNVNYDYSTLVLSVroPEQNEVVQSSEKEI 
LP 

6872 

A 

1 

885 

MDDFLQKLRQKIKIGVVGGSDFEKVQEQLGNDAFKPQSDRSSYCPSD 
SPPHLPITETESPAQDPEKPERLIDPFTGESDLIGLYLQSYSLYDVN 
DDDASATRVMRCQTRYDSVLLLEIVSYXIIGVHQEDELLECLSPATS 
RTFRIEEIPLDQVDIDKEl^MLVTVAHFHKEVFGTFGIPFLLRIHQF 
KIAIVMMDRHQYINEDEYEVNLKDFEPQPVCSAKETGLSAQLCSTMQ 
VEVAARTGRNRAAQMQLGQGMPLPGWRSVraKSQRIOTENAGPQCHE 
KHKRLTNRLHLPR 
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6873 

A 

532 

687 

OVPGLIiCGLTCFSSSPPGNMSHPRPWLGLDHFNKAPKRSRYTYLEKA 
IKIHN 

6874 

B 

158 

3331 

PRPPAAGEAQAAADMNHQQOQQQQKAGEQQLSEPEDMEMEAGDTDDP 
PRITQNPVINGNVALSDGHNTAEEDMEDDTSWRSEATFQFTVERFSR 
LSESVLSPPCFVRNLPWKIMVMPRFYPDRPHQKSVGFFLQCNAESDS 
TS W S CHAQAVLKI INYRDDEKS FSRR ISHLF FHKENDWGFSNFMAWS 
EVTDPEKGFIDDDKVTFEVFVQADAPHGVAWDSKKHTGYVGLKNQGA 
TCYMNSLLQTLFFTNQLRKAVYMMPTEGDDSSKSVPLAIjQRVFYELQ 
HSDKPVGTKKLTKSFGWETLDSFMQHDVQELCRVliLDNVENKMKGTC 
VEGTIPKLFRGKMVSYIQCKEVDYRSDRREDYYDIQLSIKGKKNIFE 
SFVDYVAVEQLDGDNKYDAGEHGLQEAEKGVKFLTLPPVLHLQLMRF 
MYDPQTDQNIKINDRFEFPEQLPIJDEFLQKTDPKDPANYILHAVIiVH 
SGDNHGGHYVVYLNPKGDGKWCKFDDD WS RCTKEEAI EHNYGGHDD 
DL S VRHCTNAYMLVY I RES KLS E VLQAVTDHD I PQQL VERLQEEKR I 
EAQKRKERQEAHLYMQVQIVAEDQFCGHQGND^1YDEEKVKYTVFKVIJ 
KNSSLAEFVQSLSQTMGFPQDQIRLWPMQARSNGTKRPAMLDNEADG 
NKTMIELSDNENPWTIFLETVDPELAASGATLPKFDKDHDVMLFLKM 
Y D P KTKb IjN x CGn 1 x T P I S CKI RDL»L> P VM CDRAG FIQDTSIjIxjYEEV 
KPNLTERIQDYDVSLDKALDELMDGDIIVFQKDDPENDNSELPTAKE 
X t KJUJb I UK VUVli'L UK 1 JLFNDFvj«b VV I JjblMRMNx FQVAK.TVAQRL1NT 
DPMLLQFFKSQGYR1X3PGNPLRHNYEGTLRDLLQFFKPRQPKKLYYQ 
QLKMKITDFENRRSFKCIWLNSQFREEEITLYPDKHGCVRDLLEECK 
KAVELGEKASGKLRLLEIVSYKIIGVHQEDELLECLSPATSRTFRIE 
EIPI^QVDIDKENEMLVTVAHFHKEVFGTFGIPVFAEDTPGRAFSRS 
DEANPEPAGHQGEGVLRSLNLQL * 

6875 

A 

coo 

/ 3 Z 

WDPxjKFJlNr IjPKW I L. I Qbb lb FQ x IM XCEQMVQTSDDl»LiRFl FQLtAR 

S 

6876 

A 

1 

799 

GRLNVKLLYNSSVEQIW1TVLQCRDLSWPSSYGDTPTVSIKGILTLP 
KPVHFKSSAKEGSNAIEFMETFVFAIKLQNLQTVRLGFKIQTQTPRK 
K.1 xOiiCbr'ibljKx.Lib IQbMDi blxDITPPSKIS VCHAELifiljGTCr QAVN 
SRIQLQILEARYliPSSSTPLTLSFFVKVGMFSSGELIYKKKTRLLKA 

V A.VVVjJc.XWXr krxjx\J&r*lSJc*xvr xjx JyxjX 0K00 VKKJVJrtX? Vtjyj. WJLoJS 

DSNNIEAVNQWKETVINPEKVVIRWHKLNPS 
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TV 
i\ 
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HRELSPLWLLAELRKLYLEGNPLWFHLEHRAATAQYLSPRARDAATG 
FLLDGKVLSLTDFQTHTSLGLSPMGPPLPWPVGSTPETSGGPDLSDS 
LSSGGWTQPLGAGQ 
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WLGRLHLRVMNAPFSLVHS 

6879 

A 

367 

551 

EHSTLKLPYHTQEAGAAPGPTGTDSHEVDHLEGGAGKEAGPCA*SLG 
TMVAPVRSNSGSPE 

6880 

A 

24 

547 1 

VLSFSFVFYTFGNI ILRI VHRLNQTAKGTMAPKWLQAMVQRDHWREP 
VmLLSLHVPAHLGADLPTLVTRVYISLYASHQLTLLRGPLLNSWFliY 

PC/^lvT\rVIf Pl"l r TV*'OP 1 G»'tJ7A TT UTtltniT TYT^n^irc:XTD'Dr^"C'A/~»TV TV D^DTPTTlOU 

PbON V x i\.r U. x bbKr rtA x L*WriJ\MLiUUJ\{^i<J6N L tor X Uorl 
EVDHLEGGAGKEAGPCAI»KAWAPMVAPSQEQQW 

<^ ft r i 

TV 

i 
j- 


MVTiWPPPPiTTfWPPrspr2Pr2Pwr , r , pppPT^T vcvtt ta a t> a m a *td nr\ni> a o 

ITlISXJVVJvr'K^lVIN.rTi: X^VJX^VJxrO*rilv^V^J&J&lrKXJlj V O X X Xj/vVIr/\lTl>^I\J\.ljl^\JiCttX\. 

VW Q "P f3 7x TV Wf) T . PWfiVP P VR 71 OlfC* p T KT .MTvTTVnP PTTKITXTT'QT'rJQPTNO 
v v v OAUnnnyur *ivj vijjr v om^ivojv x rvxjiv lm x v xxim/t. x x 0 UNorvxny 

AE ER I S ELENWLSE IRQSDKNKE KRI KRNEQNLQEVGD YVKKPNLQ I 
x^IPERKGEKANSLElSrVFQYIVHENFHNIJ\REDNSQIQEIQTTPARF 
YTGRSSPRHIIVRFSKDPDSLEGDLHRLRI^LWHSATOKYRLHIiGF 
S SVGLLAASDEGLFKGFHGG PLTLTLG FKLWNR FDPG I YT 

6882 

A 

2 

270 

NMPTKLQKLDLSSYPEATEKNVERILGLLQTYFRKYPDTPVSYFEFV 
IDPNSFSRTVENIFYVSFIVRDGFARIRLDEDRLPNIRADEC 

6883 

A 

2 

114 

KGLLDVTCKl^ANMIKGKTPEEIRKTFNIKNDFPSFL 

6884 

B 

75 

293 

MPS I KLQSSDGE IFEVDVEI AKQS VTI KTMLEDLGMDDEGDDDPVPLi 
PNVNAAILKKVIQWCTHHKDDPPSS * 
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6885 

A 

446 

639 

NGYLVFPRKNSFLLIFGLFVYLETNIJDSLPLVDTHSKRTLLIKTVBT 
RDGQVINETSQHHDDLE 

6886 

A 

3 

1503 

ARQRRSRAPTTHTHRALVRLFSGSQSAPPPPPRPSPPSAAMSTRSVS 
SSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYA 
SSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADAIigTEFKNTRT 
w t,rw tiij^nLirvUKr AIM x iDAVKi' JjJb%A2NJvX LLAELEQLKGQGKSRLGD 
LYEEEMRELRRQVDQLTNDKARVEVERDNI^AEDIMRLREKLQEEMLQ 

PPP3XPTJTT.OQPPr>nVnTJAQT.ZlPT.r»T.17T?Tr\7T?GT rNtrir T A PT WT TJ'C»r'T?T 
ivd£iM£iiv x uyor i\.\^a-/ v l^j.m.m.o xjm^x^l/xjji* rcrv. v /vr IiKKIiHKKK \_ 

OE L O AO I OE OHVO I D VD V S KPD LT AS LRDVT? noYF QV aa\ mJT .apap 

EWYKCQVC*PLLRASNRN\NEPLRQAK\QES\TELPENRVQSLTC\E 

VDAL * RKPM S SLER \ QLREMEENFC/ AVKAANYPR PLLARL\ QDE IQ 

NY * RREMAS LTFVEYQ * PALNVKMAP * HLRF ATLQE/ MLLGRRGERG 

FPLAS FPN F S LP * TLRGNLI WDSLPSGL I PHS KRDTS 

6887 

A 

3 

232 

FEAESYSVTQAGVQWHDLSSLQPPPPRIKQLSYLSLQSSCDYRHLPP 
CPANFCIFSRNGASPCWPCWSRTPARWIH 

6888 

A 

252 

.384 

DSWSEQVIKPPFQRGSCPIPRRKECCSERPRRIWTDRPCWFS 

6889 

A 

3 

196 

LIMICFSFFLLSVLENKNSGPYLFFQRQFWSSVKVRVHRLSKRLRFR 
LTSYITFFRVFCFLLTP 

6890 

A 

1 

2994 

MAQVAMSTLPVEDEESSESRMVVTFLMSALESMCKELAKSKAEVACI 
AVYETOVFWGTERGRAFVNTRKDFQK^ 

S TTQANRMS VDAVE I ETLRKTVED YFCFC YGKALGKSTWPVPYEKM 
LRDQS AWVQGLPEGVAFKHPENYDLATLKW I LENKAG ISFIIKRPF 
LEPKKIWGGRVMVTDADRSILSPGGSCGPIKVKTEPTEDSGISLEMA 
A\nrVKEESEDPDYYQYNIQAGPSETDDVDEKQPLSKPLQGSHHSSEG 
NEGTEMEVPAEDS TQHVPSETS EDPEVE VT I EDDD YS PPS KRPKANE 
LPQPPVPEPANAGKRKVREFNFEKWNARITDLRKQVEELFERKYAQA 
IKAKGPVTIPYPLFQSHVEDLYVEGLPEGIPFRRPSTYGIPRLERIL. 
LAXERIRFVIKKHELLNSTOEDLQLDKPASGV 

DQLFCKKFAETUjGSTEAKAVP YQKPEAHP1TOL YVEGLPENI PFRS PS 
WGIPRLEKIIQVG1TOIKPVIKRPELLTHSTTEVTQPRTNTPVKEDW 
NVRITKLRKQVEEIFNliKFAQAlrGLTEAVKVPYPVFESNPEFLYVEG 
LPEGI PFRS PlWG I PRLERI VRGSNKI KFWKKPELVI S YLPPGMA 

O V T VPT 1 V TV T /">0 DVO TJDC TU/*""* PMCinrtlf t ttti / ii ii r i^t/^i TWTvnkTVYT , »/'"v,rT*o TV « rn mrvn 

oMW 1 JSJVljy&FiU<^KbFvjSWbKVPEIEVTV^ 

QTNGSNVPFKPRGREFSFEAWNAKITDLKQKVENLFNEKCGEALGLK 

OAVTCVPFATiFP^T?PPT}PVA7Pf^T.PP(^17'PPPPPGT n 17fxTO'DT T?WTT nvrvn 

KIKFIIKKPEMFETAIKESTSSK5PPRKJNSSPNVNTTASGVEDLNI 
IQVTIPDDDNERLSKVEKAJ*QLREQVNx^^ 

YRKITINPGCWSnDGMPPGVSFKAPSYLEISSMRRIlxDSAEFIKFTV 
IRPFPGLVINNQLVDQSESEGPVIQESAEPSQLEVPATEEIKET1X3S 
SQIKQEPDPTW 

6891 

A 

75 

350 

TGIPVNSYRVKINWYTLQFVSLFRPFPGLVINNQLADQSESEGPVIQ 
ESAEPSQL\EVPATEEIKETDGSSQIKQEPDPTWETSFIiLGEISF 

6892 

A 

1 

2935 

MAQVAMSTLPVEDEESSESRMVVTFl^SALESMCKELAKSKAEVACI 

AVYETDVFWGTERGRAFVNTRKDFQKDFVKYC^EEEKAAEm 

STTQANRMSVDAVEIETLRKTVEDYFCFCYGKALGKST\A^VPYE^ 

LRDQS AVWQGLPEGVAFIOaPFJmJlxATLKWILENKAGI SF 1 1 KRPF 

LEPKKHVGGRVMVTDADRSILSPGGSCGPIKVKTEPTEDSGISLEMA 

AVTVKEESEDPDYYQYNIQGSHHSSEGNEGTEMEVPAEDSTQHVPSE 

TSEDPEVEVTIEDDDYSPPSKRPKANELPOPPVPEPANAGKRKVREF 

NFEKWNAR I TDLRKQVEELFERKYAQAI KAKGPVT I P YPLFQSHVED 

LYVEGLPEGIPFRRPSTYGIPRLERILLAKERIRFVIKKHELLNSTR 

EDLQLDKPASGVKEEViryARITKLRKMVDQLFCKKFAEALGSTEAKAV 

P YQKFE AHPNDLYVEGLPENI PFRS PS W YGI PRLEKI I QVGNRIKFV 

IKRPELLTHSTTEVTQPRTlTrPVKEDWNVRITKLRKQVEEIFNLKFA 

QALGLTEAVKVPYPVFESNPEFLYVEGLPEGIPFRSPTWFGIPRLER 

IVRGSNKIKFWKKPELVISYLPPGMASKINTKALQSPKRPRSPGSN 
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S KVPE I EV T VT3GPNNNNPQTS AVRTPTOTNGSNVPFTCPPap pt? q i?r» a 

WNAKITDLKQKVFJ^LFNEKCGEAIjGLKQAVKVPFALFBSFPEDFYVK 
GLPEGVPFRRPSTFG I PRLEKI LPvNKAKI KF 1 1 KKPEMFETAI KES T 
SSKSPPRKINSSPNVNTTASGVEDLNIIQVTIPDDDNERLSKVEKAR 
QLREQVNDLFSRKFGEAIGMGFPVKVPYRKITINPGCVWDGMPPGV 
SFKAPSYLEISSMRRILDSAEFIKFTVIRPFPGLVINNQLVDQSESE 
GPVIQESAEPSQLEVPATEEIKETDGSSQIKQEPDPTW 

6893 

A 

214 

301 

LIGTVLFLLIEIKETDGSSQIKQEPDPTW 

6894 

A 

1 

2879 

MAQVAMSTLPVEDEESSESRI y fVArrFLMSAI/ESMCKELAKSKAEVACI 
AVYETDVFWGTERGRAFVNTRKDFQKDFVKYCVEEEEKAAEMHKMK 
STTQANRMSVDAVEIETLRKTVEDYFCFCYGK/VLGKSTWPVPYEKM 
LRDQSAVWQGLPEGVAFKHPENYDLATLKWIGENKGGISFIIKRPF 
LEPKKHVGGRVMVTDADRS ILSPGGSCGPIKVKTEPTEDSGI SLEMA 
AVTVKEESEDPDYYQYNIQGSHHSSEGNEGTEMEVPAEDDDYSPPSK 
RPKANELPQPPVPEPANAGKRKVREFNFEKWNARITDLRKQVEELFE 
RKYAQAI KAKGPVT I P YPLFQSHVEDL YVEGLPEGI PFRRPSTYG I P 
RLER I LLAKERI RFVI KKHELLNSTREDLQLDKPASGVKEEWYARIT 
KLRKMVDQLFCKKFAEALGSTEAKAVPYQKFEAHPNDLYVEGLPENI 
PFRS PSWYG I PRLEK 1 1 QVGNR IKFVI KRPELLTHS TTEVTQPRTNT 
PVKEDWNVR ITKLRKQVEEI FNLKFAQALGIjTEAVKVP YPVFESNPE 
FLYVEGLPEG I PFRS PTWFG I PRLER I VHG SNKI KFWKKPELVI SY 
LPPGMASKINTKALOS PKRPRS PGSNSKVPE T FATrvpn PKnsjrjMPfYrci 

AVRTPTQTNGSNVPFKPRGREFSFEAWNAKITDLKQKVENLFNEKCG 
EALGLKQAVKVPFALFESFPEDFYVEGLPEGVPFRRPSTFGIPRLEK 
ILRNKAKIKFIIKKPEMFETAIKESTSSKSPPRKINSSPNVNTTASG 
VEDLNIIQVTIPDDDNERL\SKVEKASQL/REDQVNDLFSRKF\GEA 
I\GMGFPVKVPYRKITINPGCVVVDG\MPPGVSFKAPSYLEISSMRR 
ILDSAEFIKFTVIRPFPGLVINNQLVDQSESKGPVIQESAEPSQLEV 
PATEEIKETDGSSQIKQEPDPTW 

6895 

A 

1 

334 

PTRPPTRPPTRPLFSPLVKDWAPKAFI I S FKIjETDPAI VINRARKAL 
EIYQHQVGGANILESRQSFVFIVTKDSETKLLLSEEEIEKGVEIEEK 
IVDNLQSRHTAFIGDRN 

6896 

A 

3 

444 

V I VHARP I RTRAS KYY I PEAVYGL PAY PA YAGGGGFVLS GATLHRLA 
GACAQVELFPIDDVFLGMCLQRLRLTPEPHPAFRTFGIPQPSAAPHL 
STFDPCFYRELVWHGLSAADIWLMWRLLHGPHGPPCAHPLPVAAGP 
FQWDS 

6897 

A 

117 

1009 

REPGAFSVLFKEPQAIINYFSSLMQQRVQEKGETAEVNEENRIQSTE 
KKNLAQKNDKPGTPGVKRSSHLSLSVLGLRTRSTSRSRIPAFKADGR 
EQGQGDIPTFYTYEEGLSHLTAEGQATLERLEGMLSQSVSSQYNMAG 
VRTEDSIRDYEGGWSPGPRETCDPRCRRENFSPEGKSAL.RDPAPLPS 
SATDPERS YRM I LETAHFRNHGWFRGGTS I L I SHTGS EVRGEEQLAG 

SRSFEGAEPETNTFCPSQGLPLVAIVGGGPAHSHHTLNGHATGALDD 
GPFCQGPLDRPLREI 

6898 

A 

5 

808 

LPLPGRAVWCTHCCAPTPSGPVLPHSAAMSFLKSFPPPGPAEGIiLRQ 
QPDTEAVLNGKGLGTGTLYIAESRLSWLDGSGLGFSLEYPTISLHAL 
SRDRSDCLGEHLYVMVNAKFEEESKEPVADEEEEDSDDDVEPITEFR 
FVPSDKSALFAMFTAMCECQALHPDPEDEDSDDYDGEEYDVEAHEQG 
QGDIPTFYTYEEGLSHLTAEG\QATLERLEGMLSQSVSS\QYNMAG\ 

vrt\edsirdy\edgdgrldttp\tvclgqf\edadv 


B 

233 

411 

XALNGLSAQDNWAKLQRDYPDALITNYYLWPAVQLANFYLVPLHYRE 
DPASTTLVSDPT* 

6900 

A 

109 

435 

hgkpfrwdysfiprmgtinreaitqlftfflcaevgvgpgktvslgs 
gggaaiqplhfllqlwpavqijusifylvplhyryvaplphpsrkthvt 
nswrqndshfnlelp 

6901 

A 

34 

— . 

408 

RQWLQEEAY I TKEQKYS FLHDSQTS FCFSDSI PTS SNM E ETQQ / KSN 
LELLHI SLLLN\ QSRLBPVRFLRSTFTNNXiVYDNSDSDD YQIiLKDLE 
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GG IQMLMGRLEAATN* LGGPQANWKFDKTRTT 

6902 

B 

80 

495 

X PG S RTS LLLAFALLCLP WLQEAGAVQTVPLSRL FDHAMLQAHRAHQ 
LAIDTYQEFEETYI PKDQKYSFLHDSQTS FCFSDS I PTPSNMEETQQ 
KSNLELLRI SLLLIESWLEPVRFLRSMFATTWCMTPRTAMTITS * 

6903 

A 

1 

597 

MAAGSRTSLLLAFALLCLPWLQEAGAVQTVPLSRLFKEAMLQAHRAH 
QLAIDTYQEFISSWGMDSIPTSSNMEETQQKSNLELLHISLLLIESR 
LEPVRFLRSTFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGRLEDGSH 
LTGQTLKQTYSKFDTNSHNHDALLKNYGLLHCFRKDMDKVETFLRMV 
QCRSVEGSCGF 

6904 

A 

3 

552 

QTS FCFSDS I PTS SNMEETQQKSNLELLH I SLLLIESRLEPVRFIiRS 
TFTNNLVYDTSDSDDYHLLKDLEEGIQMLMGVRVAPGVPNPGSPLAS 
RAGGEKHCCPLFSSQAIjTQENSPYSSFPLVNPPGLSLHPEGEGGKWI 
KTERGRELGPSAGPLLLFLHFAEAGRRQPPDWADPQADLQQV 

6905 

A 

3 

716 

DPKAQLPEPLRVLWTAHLAAMATGSRTSLLLAFGLLCLPWLQEGSAF 
PTI PLSRPFDNAMLRAHRLHQLAFDTYQEFEEAYI PKEQKYS FLQNP 
QTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRS 
VFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQT 
YSKFDTNSHOTDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSC 
GF 

6906 

A 

359 

648 

WIPRPNSPNHSGSCGQAHLVAMAPGSRT\SLLLAFALLCLPWASRRL 
GAVPKPFR I SRLF \ DHA\ MLOAHRAHOI^AT \ DTYORPRET\YTPTCDO 
KYS FLA* \ PQTS FCFSDS IPT\ PSN\ RR I TQQKSNLELLR I \ SIiLL\ 
TESW1jEPVRFPQEYCSANNLGVLTPRTAIDYHLLKDLEGGHPTV*WN 
I CREVCFSACFSRGIRDD I EEEDDQEAYFRYMAENPTAG WQEEEED 
NLEYDSDGNPIAPTKKIIDPLPPIDHSEVDQHGLRPEGTHLTLKERT 
SWVESVADTLCTILTTLVMDPKAQLPEPLRVLWTGSPSGNGSRLPDV 
PCSWLLPCSACPGLQGGWVPCQNRSVYPGFFDHAIAPSPSRAPAGHC 
TPTRSLKKPHIPKDQKYSFLA 

6907 

A 

113 

412 

GSSQGAKPQDRGWGPRTLWTPVPFMRGPQHATCLLWQPAGRVEAFCK 
FHMWPQGMTSLMKAALDLTYPITSMFSGAGFNSSIFSVFKDQQIEVH 
PHPRK 

6908 

A 

3 

1162 

GGGARGCAHVVVLKALAECGIRVDMVGGTS IGAFVGAXiYSEERNYSQ 
MRIRAXOWAEGMTSI^KAAIjDIjTYPITSMFSOAnFTT^^TF^VPKTJOO 
IEDLWIPYFAITTDITASAMRVHTDGSLWRYVRASMSLSGYMPPLCD 
PKDGHLLMDGGYINNLPADVARSMGAKWIAIDVGSRDETDLTNYGD 
ALSGWWLLWKRWNPLATKVKVLNMAE I QTRLAYVCCVRQLEWKS KD 
YWEYLRPPIDSYSTLDFGKFl^ICEVGYQHGRTVFDIWGRSGVLEKM 
LRDQQGPS KKPASAVLTCPNAS FTDLAEI VS R I EP AKPAMVDDESDY 
QTEYEEELiLDVPRDAYADFQSTSAQQGSDIiEDESSLRHRHPStAFPK 
LSEGSSDQDG 

6909 

A 

2 

265 

RLMQCVS S FS KNGALQARLE I CALRDTVAVYLTPE SKSS FKQAIiEAL 
PQLSSGADKKLLEELLNKVKSSMHLQLNCFQAASSTMMKT 

6910 

A 

3 

151 

ILPLPLQNATVEAGTRCQVAGWGRKWSQGRVALVTGASWGPLGAEKS 
AT 

6911 

B 

58 

359 

SIJUU^QAYSHWIAQYCSEVHRQNTQQFVTLISTTMDAITPLISTKVQ 
DKLLLS ACHLLVSLATTVRPVFLI S I PAVQKVFNR I TDASALRLVDK 
RMSSSGP* 

6912 

A 

4 

329 

S S R PATAWGHGLI»LLP PAPALPS TWKSRGVAKADQAHVAHGAELPRT 
AHDSATFSPFQPRRDVTLELLWHFCQASPAPIALRSLLFVRDLQRLT 
FLLFSHHS IVILRN j 

6913 

A 

2 

257 

YEDALVLLLTEVLNRIQFRYNQAQLEELDDETLDDDVQDKLLLSACH 
LLVSLATTVRPVFLIS I PAVQKVFNR I TDASALRLVDK 

6914 

A 

1 

350 

AQVLVCRALSN I LLLPWPNLPENEQQWP VR S I NHAS L I S ALSRD YRN 
LKPSAVAPQRKMPLDDRHRLNITAENDCRRLHCSLRDLSSLLQAVGR 
LAEYFIGDVFAARFNDALTVVE 

6915 

A 

257 

2659 

VDCRMS / FS YI EKFTDFLRLFVSVHLRR I ES YSQFPWE FLTLLFKY 
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tfhqptheg yfs cld i wtlfld yltski ksrlgdkeavlnryedalv 
llltevlnri qfrynqaq\ leel\ ddetldddqqtewqr ylrqs lev 
vakvmellpthafstlfpvlqdnl\evylglqqfivtsgsghrlnit 
aendcrrlhcslrdls sllqavgrlae yf i gdvfaar\ fndaiitwb 
rlvkvtlygsqiklynietavpsvlkpdlidvhaqslaggcrlsahw 
laqycseihrqntqqfvtllsttmdaitplistkvqdklllsachlii 
vslattvrpvflis i pavqkvfnr itdasalrlvdkaqvlvcralsn 
i lll pw pnl peneqqwp vrs i nhas l i sal s rd yrnlk\ ps agapqr 
kmplddtkliihqtlqrlrkilwenisg\estksstdfayqfgcres 
v\qvslglfpaf\ihqsdvtdemlsffltlfrglrvqmgvpf\teqi 
i qtflnmftreqlaes i lhegstgcrwekflki lqvwqepgqvfk 
pflpsiialcmeqvypiiaerpspdvka\elfellfrtlhhnwryff 
kstvlasvqrgia\eeqmenepqfsaimqafgqsflqpdihlfkqnl 
f\yl\etlntkqkl\yhkk\ifr\tam\lfqfvnvllq\vl\vhksh 

DLL\QE\EIGHRHSYNMASVDF\DGFFAAFLPEFL\TSC\DGV\DAN . 
QKSVLGRNF\KMDRVRRERGRAKRRAEWARKPGTCAARRGHIEASGR 
GLCPPCSLAAAHEMPADLVL ' 

6916 

A 

1 

321 

NS KIMELLQVVPSCVASLDDVAETDRKEYVTVKIRKKWRQSVTLRQN 
NLPAQSGSHAAEKGNSDWPERPGLTQTGPGHRRMLRRHTVEDAVVSQ 
GPEAAGPLSTPQE 

6917 

A 

1 

259 

HGAS INASNNKGNTALHEAVI EKHVF WELLLLHGAS VQVLNKRQRT 
AVDCMQNSKIMELLQVVPSCVASLDDVAETDRKEYVTG 

6918 

B 

1 

3045 

MPDCTSKCRSLKHALDVLSVVTKGSENQIKAFLS SHCYNAATIKDVF 
GRNALHLVSSCGKKGVLDWLIQKGVDLLVKDKESGWTALHRSIFYGH 
IDCVWSLLKHGVSLYIQDKEGLSALDLVMKDRPTHVVFKNTGCLLDP 
NGEKCVTAPRQVS ALHHKD I ALS LVAAS DGATVCVTTRGD I YLLAD Y 
QCKJ<MASKQLNLKKVLVSGGHMEYKVDPEHLK^ 

GRVFCWRS VNS SLKQCRWAYPRQVF I SD I ALNRNE I LFVTQDGEGFR 
GRWFEEKRKSSEKKEILSNLHNSSSDVSYVSDINSVYERIRLEKLTF 
AHRAVSVSTDPSGCNFAILQSDPKTSLYEIPAVSSSSFFEEFGKLLR 
EADEMDSIHDVTFQVGmLFPAHKYILAVHSDFFQKLFLSDGNTSEF 
TDIYQKDEDSAGCHLFWEKVHPDMFEYLLQFIYTDTCDFLTHGFKP 
RIHLNKNPEEYQGTLNSHLNKVNFHEDDNQKSAFEVYKSNQAQTVSE 
RQKSKPKSCKKGKNIREDDPVRiytLQTVAKKFDFSNLSSRLDGVRFEN 
EKINVIAKNTGNKLKLSQkKWYNVYKEKVM 

VLWADQLLITRLKEICEVALTEKLTLKNAAMLLEFAAMYSAKQLKL 
S CLQF IGLNMAALLEARSLDVLSDGVLKDLSEFYRKMI PAMDRRVI T 
P YQDG PD I S YLEVEDGD I FLKEE INMEQNHS ETMFKKAKTKAKKKPR 
KRSDSSGGYNLSDIIQSPSSTGLLKSGKTNSVESLPELLiTSDSEGSY 
AGVGSPRDLQSPDFTTGFHSDKIEAKVKPYVNGTSPVYSREDLKPWE 
KSPILKISAPQPIPSNRIDTTSSASWVAGSFSPVSPPWDLRTIMEI 
EESRQKCGATPKSHLGKTVSHGVKLSQKQRKMIALTTKENNSGMNSM 
ETVLFTPSKAPKPVNAWASSLHSVSSKSFRDFLLEEKKSVTSHSSGD 
HVKKVSFKGIENSQAPKIVSFVGGERR* 

6919 

A 

1 

1149 

GETMFKKAKTKAKKKPRKRSDSSGGYNLSDIIQSPSSTGLLKSGKTN 
SVESLPELLTSDSEGSYAGVGSPRDLQSPDFTTGFHSDKIEAKVKPY 
VNGTS PVYSREDL KPWEKS PILKI SAPQP I PSNRIDTTS SAS WVAG S 
FSPVS PP WDLRT IME I EESRQKCGATPKSHLGKTVSHGVKLSQKQR 
KWIALTTKENNSGMNSMETVLFTPSKAPKPVNAWASSLRSVSSKSFR 
DFLLEEKKS VTS LS SGDHVKKVS FKG I ENSQAPKIVRCS THGTPGPE 
GNHI SDLPLLDS PNPWLS S S VTAPSMVAPVTFAS IVEEELQQEAALI 
RSREKPLALIQIEEHAIQDLLVFYEAFGNPEEFVIVERTPQGPLAVP 
MWNKHGC 

6920 

A 

426 

2967 

HS AP PAVLRGGM S EAGGAGPGGCGAGAGAGAGPGALP PQ PAALPPAP 
PQG S PCAAAAGG S GACG PATAVAAAGTAEGPGGGGS AR I AVKKAQLR 
SAPRAKKLEKLGVYSACKAEESCKCNGWKNPNPSPTPPRADLQQIIV 
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SLTESCRSCSHALAAHVSHLENVSEEEMNRLI^IVI^VBYIiFTCVHK 
EEDADTKQVYFYLFKLLRKS ILQRGKPWEGSLEKKPPFEKPS I EQG 
VNNFVQYKFSHLPAKERQTIVELAKMFLNRINYWHLEAPSQRRLRSP 
NDDISGYKENYTRWLCYCNVPQFCDSLPRYETTQVFGRTLLRSVFTV 
MRRQLLEQARQEKDKLPLEKRTLILTHFPKFLSMLEEEVYSQNSPIW 
DQDFLSASSRTSQLGIQTVINPPPVAGTISYNSTSSSLEQPNAGSSS 
PACKASSGLEANPGEKRKMTDSHVLEEAKKPRVMGDIPMELINEVMS 
TITDPAAMLGPETNFLSAHSARDEAARLEERRGVIEFHWGNSLNQK 
PNKK I LMWLVGLONVFS HOL PRM P KEY I TRI/VFD P KH JTTT.a T. T TcnfiT? 

VIGGICFRMFPSQGFTEIVFCAVTSNEQVKGYGTHLMNHLKEYHIKH 
D\ILNFLTYADEYAIGYFKKQG\YCKEIKIPITKYGGYIKDYEGATL 
MGCE\LNPR\IPYT\EFSVIIKKAE/EGIIKKLIERKQAQIPKKFTL 
GLSCFKDGVRQIPIESIPGIKKA\GWNPSGKEKSKEPRDPD\QLYST 
\LKSILQQVKSHQSAWAFMEPVKRTEAPGYYEVIRFPMDLKTMSERL 
KNRYYVSKKLFMADLQRVFTNCKEYNAAESEYYKCANILEKFFFSKI 
KEAGLIDK 

6921 

A 

1 

184 

GKEKSKEPRDPDQLYSTLKSILQQVKSHQSAWPFMEPVKRTEAPGYY 
EVIRFPMGNTINIF 

6922 

A 

114 

384 

FWHEYTKAFFFFFSMCFLLFGSLCIYYFSLFLVFFFFLFLFCLVFCS 
CLHCFRYFFTPLDS PRAGSRWSS YAQLLPPPPPPLVEHS CDC 

6923 

A 

141 

1741 

WEPAQFPPEQQWAGWGHSRCQQLGTPWQGRWGLRAEPPPGLVYPCGA 
CRS EVNDDQDAI LCEAS CQKW FHRECTGMTE S AYGLLTTEAS AVWAC 
DLCLKTKEIQSVYIREGMGQLVAANDGPSVAHYTPCVLQLSAPLFPC 
LPGLIEGKLEKAQSPIPHFLHDKRIQSIiPVETLGPASRMDPESERAL 
QAPHSPSKTDGKELAGTMDGEGTLFQTESPQSGSILTEETEVKGTIjE 
GDVCGVEPPGPGDTWQGDLQETTWTGLGPDTQDLEGQSPPQSLPS 
NPKAVNYAKGAGKGRVRAGAAATAAAPDSRRPRAGAAKGVGQGLFQL 

lvnpmniyeeqkimflyslvpstqdplfpgygpqcpvdlagppclrp 

LFGGLGG YWRALQRGREGRTMTSRASELS PGRS VTAG 1 1 IVGDEILK 
APCMKSNNALIVILGTVTLDAVGIGLVMPVLPGLLRDIVHSDSIASH 
YGVLLALYALMQFLCAPVLGALSDRFGRRPVLLASLLGATIDYAIMA 
TTPVLWIYPLVNSPSC 

6924 

A 

1 j 

153 

RECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQLVA 
ANDG 

6925 

A 

1 

200 

IDNYGTKPGNIFLCTFLTLGAALTAFIKADLRRQKANKETLENKLQE 
EEEESNTSKVPTAVSEDHL 

6926 

A 

2 

242 

W I KDTNLKPAT I KI LKDD I GNTLLDNGLGKDFMTKNPKAKATKTKVA 
KENNAVKHWVEHTLLTYRRDRARSASLVDFSLP 

6927 

A 

132 

251 

VPAGEARVQWHDLGSL.QPPPPRFKRFSCLILPSSWDYRH 

6928 

A 

132 

251 

VPAGEARVQWHDLGSLQPPPPRFKRFSCLILPSSWDYRH 

6929 

A 

2231 

2578 

PLPPGSRTLHWMAPDEEDFRIGN*TMNAIFFFIiFLFFFFFFFLRQSF 
ALWQAGVQWHDLSSLQPPPPRFK*FSCLSLPSGWD*RHAPPRPAIF 
VFLVEMGFHHVNQDGLDLLTW 

6930 

A 

2 

94 

WRDLGSL* PPPPGFKRFS * LSLKSHPNVTS 

6931 

A 

130 

295 

GIISNADFVSFFFF*DGVFALVAQAGVQWRGLGSLQPLPPGFKRF\S 
CLSLPSTW 

6932 

A 

3 

403 

TYKILDFIVNSIRHRRGTPRFPGHTEVTSMTLDFIGAG* /SNLQ*PA 
ST* PGDQY * I * LLQST*NYVLHKFQLPTWKGRSE I SEQAVSAKEKT* 
KS/T*LAXiVK*ARTLESNWPG/Li* ILILISLSCMI WTNDSNXiF 

6933 

C 

78 

335 

MWS PLTKE I FS CPTPQTCG AFKGTLKRAPXXXXXXXVKI GXXXXXXX 
XXXXXXXXVSPRPLSARVLAPCLGCDFFVSFNRTTVWK* 

6934 

A 

i ! 

143 

GFRWFSCLGLPSCWDYRHAPPHPANFCIFSDRVSPCWPGWSQTPDLE 

6935 

A 

i 

1676 

MAIiKGQEDYIYLFKDSTHPVDFLDAFRTFYLDGLFTDITLQCPSGII 
FHCHRAVLAACSNYFKAMFTADMKEKPKNKIKLSGIHHDILEGLVNY 
AYTSQ I E I TKRNVQSLLEAADLLQFLS VKKACERFLVRH LD IDNC I G 
MHSFAEFHVCPELEKESRRILCSKFKEVWQQEEFLEISLEKFLFILS 
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RKNLSWKEEAIIEPVIKWTAHDVENRIECLYNLLSYINIDIDPVYb 
KTALGLQRSCLLTENKIRSLIYNALNPMHKEISQRSTATMYIIGGYY 
WHPIiSEVHIWDPLTNVWIQGAEIPDYTRESYGVTCLGPNIYVTGGYR 
TDNiEALDTVWIYNSESDEWTEGLPMLNARYYHCAVTLGGCVYALGG 
YRKGAPAEEAEF YDPLKEKW I P I ANM I KGVGNATACVLHDVI YVIGG 
HCGYRGSCTYDKVQSYNSDINEWSLITSSPHPEYGLCSVPFENKLYb 
VGGQTTITECYDPEQJTEWREIAPMMERRMECGAVIMNGCIYVTGGYS 
YSKGTYLQSIEKYDPDI^KWEIVGNLPSAMRSHGCVCVYNV 

6936 

A 

1 

161 

YILLVFLTLWLLIEMIYCYRKVSKAEEAAQENASDYLAIPSENKENS 
AVPVEE 

6937 

A 

61 

742 

RLKRKSKKATKQSLQDSFLLDLKSPPSFPVEISDRLPAASWEGQESC 
WNKQTSKSEGTQPEGTYGEQLVPELCNOSESSGEDFFLKSKLQEQDV 
WRRSTSFYTHMCNPWGLLVGAGGSLLIMFVIQWVYTLVNMGVAAIVY 
FY I GRAS PGLHLG S ASN F S F FRWMRSLLLP S CRS LRS PQEQ 1 1 LAP S 
LAKVDMEMTQLTQENADFATRDRYHHSSLVNREQLMPHY 

6938 

A 

7 

238 

PGGSQGVPAGGRGSGRWSSALPGPLPRPGSVGSPTLLATWLPAGPGR 
GPARGTVRKTPIPPPRCWVPGPPWFLSQTG 

6939 

A 

1 

213 

RNTDNAWIETAGGTVHLQDQNDGEIiNRLNSNLHTCEWRASIPWQGGD 
RR I PL YANHKTLLHMAAAE I GANY 

6940 

A 

1 

257 

RRSFALVAQAGAQWRNLSSPQPPPLGFEQFSCLSLSSSWDYRHVPPR 
PANFVFLVEMGFLYWQAGNPPALAS QSAE I TGTSHHA 

6941 

A 

3 

279 

GSGWDGRGQSGRVADSS*ARVLSAL\REVNK\RLQDLRSCLSPKPPQ 
GQEQQGQEDEWLVEGPTLPETPRLFPLKIRCRADLVRLPLRMVSA 

6942 

A 

73 

816 

FLGFVGPVS LHCLRPAVS CF YRDLDNS PLS P PS PRTKSRTHTRALKK 
LSEVNKRLQDLRSCLSPKPPQGQEQQGQEDEWLEEGPTLPETPRLF 
PLKIRCRADLVRLPLRMSEPLQSWDHMATHLGVSPSRILLLFGETE 
LSPTATPRTLKLGVADIIDCWLTSSPEATETSQQLQLRVQGKEKHQ 
TLEVSLSRDSPLKTLMSHYEEAMGLSGRKLSFFFDGTKLSGRELPAD 
LGMESGDLIEVWG 

6943 

A 

1 

484 

MEPTQMPINQQMDKETILKLLKGKVWGHALHNDFQALKYVHPRSQT 
RDTTYVPNFLSEPGLHTRARVSLKDLALQLLHKKIQVGQHGHSSVED 
ATTAMELYRLVEVQWEQQEARSLWTCPEDREPDSSTDMEQYMEDQYW 
PDDLAHGSRGGAREAQDRRN 

6944 

A 

1 

1758 

MDFQLKKXQVYVDKVEKMQQALVQLQAACEKREQLEHRLRTRLEREL 
ESLRIQQRQGNCQPTNVSEYNAAALMELLREKEERIL^EADMTKWE 
QKYLEENVMRHFALDAAATVAAQRS I FEGHI LYVHNLI VRERSHFIW 
DTTV I SHS PNTS YDTALEAR I QKEEEE I LMANKRCLDMEGR I KTLHA 
QI IEKDAMIKVLQQRSRKEPSKTEQLSCMRPAKSLMS I SNAGSGLLS 
HSSTLTGSPIMEEKRDDKSWKGSLGILLGGDYRAEYVPSTPSPVPPS 
TPLLSAHSKTGSRDCSTQTERGTESNKTAAVAPISVPAPVAAAATAA 
AITATAATITTTMVAAAPVAVAAAAAPAT^AAAPSPATAAATAAAVSP 
AAAGQ I PAAAS V AS AAA VAP S AAAAAAVQ VAPAAP A P VP A PAL VP VP 
APAAAQASAPAQTQAPtSAPAVAPTPAPTPTPAVAQAEVPASPATGP 
GPHRLS IPSLTCNPDKTGNEREYADFYSHFDI ANDLVYI CSLLYLQS 
PEECLMQTMFLRPETQMRDPARSLLFQTRAQKLVTQLRQCDGKRAVI 
KNTHTPEPECLELYLLYHLQAV 

6945 

A 

1 

1420 

QLQAACEKREQLEHRLRTRLERELESLRIQQRQGNCQPTNVSEYNAA 
ALMELLREKEER I LALEADMTKWEQKYLEENVMRHFALD AAATVAAQ 
RDTWISHSPNTSYDTALEARIQKEEEEILMANKRCLDMEGRIKTLH 
AQI IEKDAMIKVLQQRSRKEPSKTEQLSCMRPAKSIiMS I SNAGSGLL 
SHSSTLTGSPIMEEKRDDKSWKGSLGILLGGDYRAEYVPSTPSPVPP 
STPLLSAHSKTGSRDCSTQTERGTESNKTAAVAPISVPAPVAAAATA 
AAITATAATITTTMVAAAPVAVAAAAAPAAAT^PSPATAAATAAAVS 
PAAAGQ I PAAAS VAS AAAVAPS AAAAAAVQVAP AAPAP VPAPALVPV 
PAPAAAQASAPAQTQAPTSAPAVAPTPAPTPTPAVAQAEVPASPATG 
PGPHRLSIPSLTCNPDKTDGPVFHSNTLERKTPIQILGQEPDAEMVE 
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YLI 

6946 

A 

186 

209 

QLWPN*LLSRNKEATLRYSQLPTCHSICY*MPAF*HTISDLSFNKNI 
NCKS YHLRVF YKYV* I VCHF * VPKVYWTKYYQTS CRT * WR * SGVE * P 
PIISSSS SACCLLRRLPTEGTCPE I FWNPRLG I S AECRDS CGPTS F 

6947 

A 

2 

2948 

CLWRKWSVALGCRYFFGLLRGARIQLLPSGAFSCLRVSLSPSPPPGT 
PLSTMRKGLRATAARCGLGLGYLLQMLA/LPALALLSASGTGSAAQDD 
DFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQ 
IYFKCNSEWVHQKDHIVDERVDETSGL.IVREVSIEISRQQVEELFGP 
EDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEPLGKEVSLEQEV 
LLQCRPPEG I P VAE VE WLKNED 1 1 D P VEDRNF Y I T I DHNL 1 1 KQARL 
SDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGR 
GYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTPWSKW 
STCGTECTHWRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQTAP 
DSDDVALYVGI VI AVIVCLAI S VVVALFVYRKNHRDFESDI IDSSAL 
NGG FQPVN I KAARQDLLAVP PDLTS AAAMYRGPVYALHDVSDKI PMT 
N S P ILDPLPNLKI KVYNTSGAVS PQDDLSEFTS KLSPQMTQSLLENE 
ALSLKNQSLARQTDPSCTAFGSFNSLGGHLIVPNSGVSLLIPAGAIP 
QGRVYE^TYVTVHRKETMRPPMDDSQTLLTPVVSCGPPGALIJTRPVVIi 
TMHHCADPNTEDWKILLKNQAAQGQWEDVWVGEENFTTPCYIKLDA 
EACHILTENLSTYALVGHSTTKAAAKRLKLAIFGPLCCSSLEYSIRV 
YCLDDTQDALKEILHLERQTGGQLLEEPKAIiHFKGSTHNLRLSIHDI 
AHSLWKSKLLAKYQEIPFYHVWSGSQRNLHCTFTLERFSIjNTVELVC 
KLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP/^NTITTVTGPSAF 
S I PLP IRQKLCS SLDAPQTRGHDWRMLAHKLNLDRYLNYFATKS S PT 
GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETVVSLAAEGQY 

6948 

A 

1 

616 

MREVEEDSPHESPLVLVSDWDVGIVTLWPCLLGSFLHHVTTRGYRRIi 
S FVLEKAAAFQDWWGEENFTTPC YI QLDAEACH I LTENLS T YALV 
GHSTTKAAAKRLKLAIFGPLCCSSLEYSIRVYCLDDTQDALKKLVGP 
QLS RHSAE I PNLGFQKX SGQVPS VGKRLSRQQADEE LL 1 1 GGLAVPS 
TLAEYLLVCPADAPQPC 

6949 

A 

1 

1913 

EPSIGEFNEQVLRLMHSLGIDQQKTIESLQNKSYNHFAAIYFLLVER 
LKSHRSSFPVEQRIiDGRQRRPSTIAEQTVAKAQTVGLPVTMHSPNMR 
LLRSALLPQ7VSNVEAFSFPASGCQAEAAFMEEECVDTPKVNGCLLDP 
VPPVLVRKGCQSLPSNMMETSIDEGLETEGEAEEDPAHAFEAFQSTR 
SGQRRHTLSEVTNQLWMPGAGKIFSMNDSPSLDSVDSEYDMGSVQR 
DLNFLEDNPSLKDIMLANQPSPRMTSPFISLRPTNPAMQALSSQKRE 
VHNRS PVS FREGRRASDTS LTOG I VAFROHLONLAR TKR T T^FT.WKVn 

LLYEQIGPEADPNLAPAAPQLQDLASSCPQEEVSQQQESVSTLPASV 
HPQLSPRQSLETQYLQHRLQKPSLLSKAQNTCQLYCKEPPRSLEQQL 
QEHRLQQKRLFLQKQSQLQAYFNQMQIAESSYPQPSQQLPLPRQETP 
PPSQQAPPFSLTQPLS PVLEPSSEQMQYS PFLSQYQEMQLQPLPSTS 
GPRAAPPLPTQLQQQQPPPPPPPPPPRQPGAAPAPLQFSYQTCELPS 
AASPAPDYPTPCQYPVDGAQQSDLTGPDCPRSPGLQEAPSSYDPLAL 
SELPGLFDCEMLDAVDPQHNGYVLVN 

6950 

A 

3 

185 

EFPGDPSPASSRRYPGGLGDIGRIiPPAPPPRPQPLVTAKPPFLLPKS 
RRRKLQCQLEGCS 

6951 

A 

112 

569 

KVtIjUSTACGRWNSKMAPQGLPLPPAVNMPCINIISGTGTQRDFIYL 
FYFYFLRWSFTLVAQAGVQWRDLASLQPPPPGFKQFSCLSLSNCWDY 
RNAPPRPANFVEMGFLHVGQAGLKLLTSGDPPTSALQNAGITGMSHH 
AQPCLGLYFIS 

6952 

A 

1 

1210 

NHITKGTSHGWDEDTSPCKVYHAGSTTSPPSTACNRLAGKCSCSWAR 
BGGAGDKSEAQSGQT^VGGSTAIiRCSSSEGAPKPVYNWVRLGTFPTP 
SPGSMVQDEVSGQLI LTNLSLTSSGTYRCVATNQMGSAS CELTLSVT 
EPSQGRVAGALIGVLLGVLLLSVAAFCLVRFQKERGKKPKETYGGSD 
LRCEAKEERKRDIMQNIVQILESVQLKWELFQSWTDFSRLHLSNKLA 
IFGIGYNTRWKEDIRYHYAEISSQVPLGKRLREYFNSEKPEGRIIMT 
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RVQKMNWKNVYYKFLEITISEARCLEIjHMEIDWIPIAHSKPTGGNVV 
OYLLPGG I PKS PC3LYATRYT?Pr T PR PT . Q PWMTIC JCVDCVru^ircnxyxmr 

GNPFSTSLIYRWEIGTHPNYLLLPRDC 

6953 

A 

75 

333 

QQKGVPSSPSSFTSQGALGPQVPKTQPQAWHALAFPGF 

6954 

A 

271 

512 

GAVGRPRSQOKSIGVPGWVSPGF*ECGKPPMTi«*Vi' T ioitmt t pdadd 
QQ I MRNI PRRLAEHMQQPLFS S CPS LMRAFEVT 

6955 

A 

2 

184 

FCSEPRSCHCTPAWATRAJCLCLKNKNKQTKNTUjKEEKGDNNIATTSK 
VLPGPSPNPTALS 

6956 

A 

13 

163 

LQVSSTYSRADPRVRQVALGDPLLPGASLPIAQPTGLLQAVATGSSR 
KGK 

6957 

A 

627 

849 

WECPSVFTTMIPCTSLLASAVAFSVSSNIiCPGWSAVVQSRLTVALTS 
WAQAI S PLQPFPTDP VAPRSEHLAS P 

6958 

A 

3 

144 

FPTIYFAPSGDKKNPVKFEGGDRDLEHLSKFIEEHATKLSRTKEEL 

6959 

A 

1 

2008 

PAAADARRPRSAPATMRPRKAFLLLLLLGLVQLLAVAGAEGPDEDSS 

1WENAIEDEEEEEEEDDDEEEDDLEVXEENGVLVLNDANFDNFVADK 

DTVIiLEFYAPWCGHCKQFAPEYEKIANILKDKDPPIPVAKIDATSAS 

VLASRFDVSGYPTIKILKKGQAVDYEGSRTQEEIVAKVREVSQPDWT 

PPPEVTLVLTKENFDEVV1TOADIILVEFYAPWCGHCKXLAP 

KELSKRSPPI PLAKVDATAETDLAKRFDVSGYPTLKI FRKGRP YD YN 

GPREKYGIVDYMIEQSGPPSKEILTLKQVQEFLKDGDDVIIIGVFKG 

fc. £> DFAYQQ YQDAAJNNIiRED YKFHHTFS TE I AKFL KVSQGQLWMQPE 

a.* UbKYEPRSHMJTOVQGSTQDSAIK^ 

YraRPxVW\nTi r SVDFSF\DYRAATQFWRS 

DEEDYAGEVKDLGLSESGEDVNAAILDESGKKFAMEPEEFDSDTLR\ 
ELVTAFKKGKLKPVIKSQP\VPKNNKGP\VKVWGRTFDSIVMDPKK 
\ D VL I E \ F YAP \ WCGHCKQLEP \ VYNS LA\ KKYKGQKGL VI AKMEAP 
v xr\^i\.±rj^.d.i\ v Hi\jr tr X \ llr Utr ir vKjUJsjSJJk'S / VK.F EGGD\RDLiEP 

fepsfieeha\tklsrt\keel 

6960 

B 

137 

214 

ASOFPIEWRPVKDAATVD^PPAT.Pnn* 

- 6961 

A 

1 

625 

CCSRGPx^AJ^GILFEDIFDVTOIDRKTUa^ 

DL I LDVTI O I YPVDLGDKFR\ LVTA^TT.YPnnTT .nnnPYMPTnnp d q 
RADQFE YVMYGK\ VYR I EGD\ ETSTEAATR \ LS AYVS \ YGGLL\ MRL 
QGDANl^\HGFlWDSRVYLLMKKIiAF 

6962 

A 

1 

273 

AAAARAGTPGRVGCAAQEVPLSPHNRPGLQPLPPPPPPPPYSGSGPG 
LRRATLTS FI PTVKAQPWLPDPP WSNS VS PVKGQRSLGSRRHH 

6963 

A 

135 

233 

PRDLCSRVYLLTFS PLLS YD PDTDPS PRNTPE 

6964 

C 

198 

449 

MGPSPAFYKlxXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXSPNRFSLGCLGSFFRIPPDW 

6965 

A 

1 

1278 

MGDFNTPLSTLDRSTRQKVNKDTQELNS AQHQVDL I D I YRTVHPKST 
EYTFFSAPHHTYSKIDHIl^SKALLSKCKRTEIITNYLSDHSAIKLE 
IjX X iVN .f iyb 0 X 1 WJUL^NLkLiroYWVrore 

YONLWDAFKAVCRGKFIALNAHKRKQERSKTDTLTSQLKELEKQEQT 
HSKASRRQEITKIPJ^LKEIVTQKTLQKIOTISRSWFFERINKIDRPL 
-h-kxi x iNJSJNjKJ&ivwy x ijaxiunjijivCjL>± TTDPTE IQTTI RBYYKHxjYANKLE 

NI.PRMDTCPTiTj r rY r rTiPPT.MOT?"R\rPCT XTD "D T ty^* 7\ it t "\ 7 a ttmct •nrr*xm\Tr> 
iMjjCi&i iui\r xjxj xxx uitaXjJMUJoJQp VCiOXjiMKlr X XAjA-Ti JL VA1 IWSJjPXKRVQ 

DQ^SQPNSTRJENKIPRNPTYKGCEGPLQGELQTTAQGNKRGYKQME 
EHSMLMGRKNQYRENGHTAQGKPPASAEVIAEEERGFGRETQD15YRG 
AAP 

6966 

A 

1 

325 

KTFFFFFFFQTNSRSVAQAGVQWRDLGSLQPIiPPGFQRFSCLSLLGS 
WDYRRRLPRPANFCIFRGFRHVGQAGLELLTPGDPPASASQSAGITG 
VSHRAGLFHSVFIF 

6967 

A 

525 

661 

TEILSGSLGSEIFTTSES*RPGLGPSFVIFL,FSSSVCLNIFFRVL 

6968 

A 

1 

293 

MAGTPPPALLPPCSLISDCCANNQRDSVGVGPSEPGAGYNLWRSFL 
S PS EKRS VR VGVTRLFQGRPS PLSLTQKGNP\ LTPCASQFTS TI SDG 
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RIMV 

6969 

A 

1 

1372 

MKSPFPLVSTKAEWGTWIFNPEAEESLEPRSLISDCCANNQRDSVGV 
GPSEPGAGYNLVVRSFLSPSEKRSVRVGVTRLSRCRPSPLSLTQKGN 
PLTPCAS Q WQCLALLRLRHVYFYNFGWKD YGVAS LTT I LDMVKVMT 
FALQEGKVAIHCHAGLGRTGVLIACYLVFATRMTADQAI IFVRAKRP 
NSIQTRGQLLCEAQQSGAFSADVSGSHSPGEPVSPSFANVHKDPNPA 
nwvoni-yLrvi rivjvvj£>Fijc> VKyNbRTPRSPLDCGSSPKAQFLVEHET 
QDSKDLSEAASHSALQSELSAEARRILAAKALANLNESVEKEELKRK. 
VEMWQKELNS RDGAWER I CGERDPFI LCSLMWS WVEQLKEPVITKED 
VDML VDRRAD AAEALFLLE KGQHQT I LC VLHC I VNLQT I PVD VE EAF 
LAHAIKAFTKVNFDSENGPTVYNTLiKKI FKHPLE 

6970 

A 

2 

277 

KNEMTLDAYIFLADMAAYQQEEQMQLPRADAIRSRIilDTFSLIEHLQ 
GLSQAVPRHTIRELLGQQPISSLI*I*DSYPHPLHLRVENSGEV 

6971 

A 

434 

564 

ETTF YNEQRGPQPHLPEVKKKYRGN * EAKTK* LRPENWKGDI L 

6972 


197 

1 *3 A *7 

ALVRLDS WMGLNTTANLGGGMTEMARGVTPW ICPRS SAERARSG I PA 
AMVPISVKISGPQLLIGPNPPGSTKRWARNSSCSSEHCR/SQGAQEE. 
^L_ljAyHil/TPSAPSQMT*TPNC*CAGAPLAAVHR\GITQRCQMNEV^ 
VAHMPTTLP* ETGDHNAELSGAPAI YLPQGAQHSPVH*DQLL* FPI S 
a i rtf ± is_i_»yujjjjK 1 K£> PLKj rPQJbFCFAH/ ALPAVPQAQVLGLLQIM 
SGSPQSLRAPSLQSGLQTRLLGMSLVHLHNIiAQVLEDPQQLVAPP/T 
/DYQGCFLGSKLWLPTQLSN/VCSAPSSAAFLEAAPGSLDPRPPGFV 
VPSLPGLPGGGGSVPAAALELFAPAPPPT 

6973 

A 

232 

476 

I FVS S YYHVNLFFRFT * AHS SSH* TNPQDL YP YQHGNNKVKD I PGTL 
PENRKVG*PLKKDLQDENKQDKPSCHLQGQLQWP 

6974 

A 

442 

698 

DVNEWHQLGEAASALLVGKRPCLS PGSLRPLS PDFTLPPPAQAPPEP 
G YP YAGRAQGRLPASLRP I S RPARPPGFGS PRDPALAP 

6975 

B 

546 

1820 

LLLNCPLFCYGLTSVLHVTCFFSHHGYYDTSKDNGAXQQPKDEIDAA 
EDATDFRHLPDVLKQCIPMIQASQPADSAVPATSGKVYGTPETELTY 
PTN I EAAPLAEEEHAS SATS I TECDKLS S FATS VAEDQS VAS LTAPQ 
TEETGKSSLLLDTVTSIPSSRTEATQGLDYVPSAEDKGFKSPPCEDF 
S VTGESEKRGE I IGKGLSGERAVEEEEEETAHVEMSEKLCSQYGTPV 
FSAPGHALHPGEPALGEAEERRLSPDDSTVKMASPAPSGPPSGTHTP 
FHESPVEEKSEPQDFQEADSWGDTQRTTSVGKRKAAEELRCLVRTGR 
SIRGLSLPPACTRAERREVERVWDALSGLKGDLAGRYYRLSEMTEA 

EQQQLIDDHFLFDKPVSPLLTAAGMARDWPDARGIWHNNEKSFLIW 
N* 

6976 

A 

2 

'■ 

480 

YFLAETGFHHAGQAGLELPNSGDLSASVSQSAGITGVSHRARPRHLY. 
FFNFNEQ ILFYLNVY I FNSLFPTI PF 1 1 FCWAQGYWKPIHCFTCLPS 
SSVPALQTLVTSYVPLGYTEAFLATQNIGRVSLWAKHGHPDPFPLAR 
ADFRAQESPSPNDPSWLL 

6977 

A 

1 

2043 

MVSISRPHDPPASASQSAGITGVSHRLRFYFDHQDVAPECGAYFFHE 
LAKEKRKGVERLLKMQNQRGGPAVFQD I LKPGQDEWGKTLiEAMEAAM 
APGKNLNQSLLDLHAIiGSAPTDPHLCDFLESHFRDEEVTFMPRKSAD 
LHAAGEAASLGGV S LNVMKGASLHALCLRLLNS FGPGVATLCGERW 
HKNNRASALRKC I FWPVS LNSSRARPRQRTTAGGAVPTAALQRVI PA 
SSQCLFPAPMGTTEATLRMENVDVKEEWQDEDLPRPLPEETGVELLG 
SPVEDTSSPPNTLNFNGAHRKRKTLVAPEINISLDQSEGSLLSDDFL 
DTPDDLDINVDD I ETPDETDS LEFLGNGNELEWEDDTPVATAKNMPG 
D S ADLFGDGTTEDG S AANGRLWRTV I IGEQEHR IDLHM I RPYMKVVT 
HGGYYGEGLNAIIVFAACFLPDSSLPDYHYIMENLFLPPGQQTTPGP 
VGLTQ PAAG VEAGR VQTAG VLTLD P SMLG PCLLCPVSR YV I S S DELL 
VAEDYMIVYLNGATPRRRMPGIGWLKKCYQMIDRRLRKNLKSLIIVH 
PSWFIRTVLAISRPFISVKFINKIQYVHSLEDLEQLIPMEHVQIPDC 
VLHDRQIILTTLCAYGEGEALASHLFHVILPFHIHSCFPSSTETSDP 
PCGPGLDLAYTTPFSTHFAQRGK 

6978 

A 

1259 

1497 

DYRHLPPLLANFVFLVEMGFHHIGQAGLKLPTSGDPPASASQIARIT 
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GVSHRARQVLFIFVSPDSSTRMEAWDVLCEGC 

6979 

A 

2 

425 

SKNDPPDHPILSSLSILFFYHQGTIVKGSFLISVVRIPRIIVMYMQN 
ALKEQHGAL S RYL FRCC YCC FWGLDKYLLHLNQNAYTTTA INGTD FC 
TSAKDAFKILSKNSSHFTSINCFGDFIIFLRKGLRVCFTGFGRTQCF 

6980 

B 

259 

329 

XFRKKEQQI KQCKGTAGQAL I KE * 

6981 

A 

1 

1251 

MFVFETLGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILAC 
QQMTIAGAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFL 
I S WRI PR 1 1 VM YMQNALKEQRQS WE I PGVCEEERGRKGEGLNES VY 
QHGALSRYLFRCCYCCFWCLDKYLLHLNQVRLYLLPQEHTEGSKGGG 
GG S QS PAPAGERTMHCLGAE YliNAYTTTA INGTDFCTS AKDAFKILS 
KNSSHFTS INCFGDFI IFLGKVLWCFTVFGGLMAFNYNRAFQVWAV 
irXjljJU V/vr r/ii JL»V/\Jtlor Lovr c.1 V JjJJALir .bur AVlJxjJtL l/NJDGoociiVJrir 
MDQEFLV9KHFISNCNLHARNLESFVKLHTRKRLIPSLSLLEGSEAS 
TTPGPESGICAELSKAEPTSLSENIiLPEEVKLNKNADQMDS 


7V 

1 
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VFnnT.MATTIJYTJP AFOVW AVPT.T.T.VIi'FTPliVTA/lYWC'PT CVW'PTT.TT.nZiT.T? 
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LC FAVDLETNDG S S E KP YFMDOE FL S FVKRSNKLNNARATAGTSTH 

6983 

c 

120 

371 

MSAHSHPCTCINGHTPVSITYXXXXXGGPFKRTLGGPKFNGGGQGKI 
FFFMGGLFKPSLGFLEKNLFXXXXTYGATPPPKFKP* 

6984 

A 

76 

251 

ALKFCRGVFPIFSPPQKKFLFQNSPGGFFFPPIjKEKNFSFPP*KSSS 

ppkplnfvggffpffppp/nkkflfqnspggfffpplkeknfsfppp 
vkfgppqgsf*tppp 

6985 

C 

193 

315 

mggqlkpslgfleknlffggeknghxxxxkfkplgxxxxf* 

6986 

C 

363 

413 

mkvicgvilpkqggnsa 

6987 

A 

1 

1088 

yklvcyytswsqyregdgscfpdaldrflcthiiysfanisndhidt 

WEWlTOVTLYGMLNTxxlsT^^ 

RRTFIKSVPPFLRTHGFDGRDLAWLYPGRRDKHHFTTLIKEMKAEFI 
KEAQPGKKQLLLSAAiSAGKVTIDSSYDIAKISQHLDFISIMTYDFH 
GAWRGTTGHHSPLFRGQEDASPDRFSNTDYAVGYMLRLGAPASKLVM 
G I PTFGRS FTLAS S ETGV PAPISGPGI PGR FTKEAGTLAYYE I CD FL 
RGATVHRTLGQQVPYATKGNQWVGYDDQESVKSKVQYLKDRQLAGAM 
WAIxDIxDDFQGSFCGQDLRFPLTNAIKDAIxAAT 

6988 

A 

64 

1279 

LAPGAVGKGRGHTLPCSAAARMGVKASQTGFVVLVLLQCCSAYKLVC 
YYTSWSQYREGDGSCFPDALDRFLCTHIIYSFANISNDHIDTWEWND 
VTLYGMIxNTLKNRNPI^KTLLSVGGWNFGSQRFSKIASxTTQSRRTFI 
KS V P P FLRTHGFDGIjDIjAW L Y PGRRD KQH FTTL I KEM KAE F I KEAQ P 
VjlsJvv2ijijijo/iAijo/Vvjl\.V 1 lUob iL> lAivXSQHLiDr ISIMTYDrHGAWKG 

TTTIT-n-TQ PT.PPriO'PT^ A Q PT^P QMT*F>V Zi\/n YTvIT T>T nUDS CITT \7Mf2TI3 r P1? 
x x uimoirijr nuyEivnOJr uzkm? oin xi/i M. vvj x J'liji^xjo/^JrAo JVxj V 1*1 vj lr ir 

GRS FTLAS S ETGVGAP ISGPGI PGRFTKEAGTLAYYE I CDFLRG ATV 
HRTLGQQVPYATKGNQWVGYDDQES VKS KV\ QYLKDRQLAGAMVWAL 
DLDDFOGS FCGODLRFPLTNAI KI1ALAAT 

6989 

A 

3 

174 

FPGRRFRG I PTFGRS FTLAS S ETGVGAP T SGPG T PGR PTKEAGTLAY 
YEVMELGRR 

6990 

C 

116 

181 

MCSTSRSRFTNILKSCNLLKY* 

6991 


20 

111 


6992 

A 

120 

426 

EVFSLLNVLCKKSTVGHKLTSLQHLLTPSAVIQSPSSLLDTSFLLFA 

xVix^XxrYlr r rt. V tr XjU C xjxlo V Lin V lr r uv_ f IjXj v r r r o i rCxr 1 j i bur JL r la 

CIIQGES 

6993 

A 

2 

112 

RRSLALSPKLECSGVIIxAHCKLCLPGSRHSPAPARD 

6994 

A 

24 

316 

flflffetescsva\rlecsgaisahcklclpgsrhspasassqvag 
stgachqa/qanfvfVflvetgfyrvspg*sg/lslelvirpprppk 

VLGLQV 

6995 

A 

80 

326 

KKFSLGSQGGRAGVFPRPPLGKLQTTALNSGAKGTPPFIjFPEPRKRG 

gpppapgwegifcphffpmgqkgegglwhtgegph 

6996 

A 


233 

qnfslvqavaqwrslgslqplpprfkqflclslpgswdyrrmppcpa 
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NHLRLMSMWARVCTQQKVAGKAPSLLLKS P 

6997 

A 

1941 

2223 

RTPRMKQSPAPIPCOPTTCSGCMASRATRTGWT<3VPAY\ r , T?T.FT?nwa 

\CVPVALPEMVRALSAQQQTLKQIVICGDRQAKDTKALGAVRPLCLH 
S 

6998 

A 

211 

544 

WG PLELPVP PAAFPE YPPTVGRPRPLHMCVRI KPCS VP I TDPCELRK 
TTTSIDMPIPLGVGLEGEASQLDVETPRPCRAWEPCGHPWSTLPLR 
LTLGQYVTCPPLGTHSP 

6999 

A 

31 

230 

CPPSSAVSPAPGAAAGPPPGLSSAESRNTATSLPLPECPLIKPIPAA 
LSALGPPDFHNEQGKGLRT 

7000 

A 

1 

861 

EIGVNHTHLLFHTEVRWLSQGKVLSRVYELRNEIYIFLVEKQSHLAN 
IFEDDIWVTKLAYLSDIFGILNELSLKMQGKNNDIFQYLEHILGFQK 
TLLLWOARLKSNRPSYYMFPTLIjOHTEFNT TMPnPT.ifPTKT t?tt.t wt 

tslsqtfnyyfpeekfeslkeniwmkdpfafqnpesiiei>nlepeee 
nellqlsssftlknyykilslsafwikikddfpllsrksillllpft 

TTYLCELG FSI LTRLKTKKRNRIxNS APDMR VAL S S CVPD WKELMNRQ 
AHPSH 

7001 

A 

55 

406 

PEMLFCATSQCQLTGGGAWLSSTARSVGTTSDGVPVPLLRVRLPRVS 
NTDPPPALGPVPGS PQNPHSLLLS CRLLQPARAPRAWD I LCSPQEPE 
AEPPAQSAPPQQPSHYQGLHCGH 

7002 

A 

1 

361 

CSGFLTFLPCQAGFGRPDGEYWLGLEPVYQLTSRGDHELLVLLEDWG 
GRGARAHYDGFSLEPESDHYRLRLGQYHGDAGDSLSWHNDKPFSTVD 
RDRDS Y SGKENS YSGER I GEAGLLFW 

7003 

A 

1 

1725 

ARAKTNSWGCERPRPGPESTPAQSPTWGARLPTMRKIRANAIAILTV 
AWILGTFYYLWQDNRAHAASSGGRGAQRAGRRSEQLREDRTIPLIVT 
GTPSKGFDEKAYLSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDT 
RHYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLITOTPANIjI 
QEIILVDDFSSDPEDCLLLTRIPKVKCIJ^roRREGLIRSRVRGADVA 

YIuAASADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGI 
FVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRV 
GHVFRKKIPYNFPEGNALTYIRNTKRTAEWI^EYKQYYYEARPSAI 
GKAFGS VATRI EQRKKMNCKS FRWYLENVYPELTVPVKEALPG 1 1 KQ 
GVNCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDHLIQQQG 
KCLAATSTLMSSPGSPVILQMCNPREGKQVSIvIjASGPEAQQPEGPCL 
RVADLGRRAPD 

7004 

A 

1 

1271 

MRRLMKGDEVLHWYDI I YHPKKEASKINI IHQGFGFGFVFWEVMRG 

KMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFE 
LSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFPEGNALTYIRNTKRT 
AEVWMDE YKO YYYEARPS A I GKAFGS TTP AKYKVT .MVPFTPamann 

EGRQPVAIWGVPEEGQSPSPASSYSVATRIEQRKKMNCKSFRWYLE 
NVYPELTVPVKEALPGIIKQGVNCLESQGQNTAGDFLLGMGICRGSA 
KNPQPAQAWLFSDHLIQQQGKCLAATSTLMSSPGSPVIIiQMCNPREG 
KQKWRRKGS F I QHS VSGLCLETKPAQLVTS KCQADAQAQQWQIiL PHT 

7005 

A 

3 

329 

LIKXCKSKAKSCENDLEMGMLNSKFKK^ 

KPTRY/OGELKEIKODISSLRYELLEEKSOATOKT.AnT.TOnT.QPTrpn 
KNIxNKDHLRVNKGKD I 

7006 

A 

1 

801 

MPIRHPSGDIKLVIGYGEERVEESFKTLFWSIFGLSEVISVVLKYDH 
KF I EN IGYVLYGVYNVTMVVVXrLNML I AM INNS YQE I EEDADVE WKF 
ARAKLV^SYFDEGRTLPAPFNLVPSPKSFYYLIMRIKMCLIKLCKSK 
AKSCENDLEMGMLNSKFKKTRYQAGMRNSENLTANNTLSKPTRYQKI 
MKRLIKRYVLKAQVDRENDEVNEGELKEIKQDISSLRYELLEEKSQA 
TGEIiADIiIQQLSEKFGKNLNKDHLRVNKGKDI 

7007 

A 

21 

373 

AERLEAGAPTICMPGRWTFILRSEPHFCPLRPAAQEPRIGADRGPGR 
NEAPAPDLPGLPRLLRSGSGVLPGKGLGAKWKGMLSGRPSPCSGALG 
PLGPENPFPWQVXTESWPSRQLA 
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7008 

A 

3 

63 

L I KKAVAVRKHLERNRKVS 

7009 

B 

374 

447 

X FC FLLADM I MQSGDAI SGFPVCL * 

7010 

A 

1 

860 

MELCRSLALLGGSLGLMFCLIALSTDFWFEAVGPTHSAHSGLWPTGH 
GDIISGYIHVTQTFSIMAVLWALVSVSFLVLSCFPSIiFPPGHGPLVS 
TTAAFAAAISMWAMAVYTSERLSGMGVKRSLQSGGILLSLVANVLM 
VLSTATNY WTRQQEGHSGLWQECNHG I CS S I PCQTTLAVTVACMVLA 
VGVGWGMVMGLRIRCDEGESLRGOTTSAFL.FT.nfiT.T.T T.Tar Tnvw 

KNAWKNI^FFSWSYFSGWLTUjPFSIIjAGFCFLLADMIMQSTDAISGF 
PVCL 

7011 

A 

1 . 

691 

KPASCDKAQDPSLPQLLAVLSFLGSKDPGVWVHSLLLSEIQESDQQP 
LPPPGPRSPELIPMELCRSLALLGGS\LGLMFC\LIALSTIFWFRAG 
GPTHLA\HSGLWPTGHGDIISG\YIHV\TQTFSIMARSVGPWVSVSF 
PGSLSCFPSL\FPPGHGPLV*T\TEAFAAAISMWAMAVYTSERWDQ 
PPHPQIQTFFSWSFYLGWVSAILLLCTGALSLGAHCGGPRPGYETL 

7012 

A 

1 

521 

MSSKRTKTKTKKRPQRATSNVFAMFDQSQIQEFKEAFNMIDQNRDGF 
IDKEDLHDMLASLGKNPTDEYLDAMMNE\ APGPINPPOWP^PMT.VP 
LNAPDP \ ENV I RNAFAC FDE \ EAPG P I QED YLRELLPTMGDRFTNEE 
VDELHREAPIDKKGNFNYIGFTRILEHG\AKDKDD 

7013 

A 

106 

693 

TTTMSSKKAKTKTTKKRP\QRATS1WFAMFDQSQIQEFKEAFNMIDQ 
NRDGFIDKEDLHDMLAS \LGKNP\TDAYIiDAMMNEAPGP\ INSTMF\ 
LTMFG\ EKFKSHRSPEDVI / RANGLCFCSDEEATGHPFQED/ YTLRE 
LLTNPWGD\RFPDGGKWMNI»YQRKHLIGPKKGEFSNYIGVHHGILTG 
RPPKHKDDLKELLA 

7014 

A . 

78 . 

188 

ALMAPSHS SLGNRVRPPFKLTKPGRAQWLTPVI PAL 

7015 

A 

2 

93 

GGSCL*SQHFGRPRWADHEVRRSRPSWLTR 

7016 

A 

2 

93 

GGSCL*SQHFGRPRWADHEVRRSRPSWLTR 

7017 

A 

2 

93 

GGS CL * SQHFGRPRWADHE VRRSRPS WLTR 

7018 

A 

2 

93 

GGSCL* SQHFGRPRWADHEVRRSRPSWLTR 

7019 

A 

2 

94 

GGSCL* SQHFGRPRWADHEVRRSRPSWLTR 

7020 

A 

1 

418 

RRLRTAMKLRSHRPDQTWHDQQYNKAHLGAALQANPLGGAAHAKGIV 
LEKVRVEATQPK\ SAIMK/ CVRVQL I KNGKKI TAFVPNDGCLNF I EE 
NDEVXiVAGFGRKGHAVGD I PGVRF KWKVANVF I LAL YKGKKERPRS 

7021 

A 

2159 

2334 

SATKTYRNLTVIiEl^NSTHRHSHKTNST^ 
QSHICQSNCPF 

7022 

A 

1748 

1821 

SQHFGRPRWADHEVWSLRPSWLTQ 

7023 

A 

2179 

2485 

P PHLQDRGKP VRAP PDVRTRRNQQEGS P S S LLGRP PG PAEPCPTQLH 
RAVLPPHGRRAEETTRSGEQGHEGGDLSTQVFRGTLS S SGSASGGHL 
PPCGSRRA 

7024 

A 

3 

502 

TARAWLLLGPWPCVSERWSKKPSPRGGRDPSDRDPAAFAARSTVPP 
RISAYERPVPWPGEWNDPRGPGRRASAWSPREGNWGVLRDPRLQAR 
KPRMVRSRQMCNTNMSVPTD/ GCCNHLTDSSFGTRDPG* TKAIAFEV 
I KVCWCTKRHL YYERGS FLS WPVYYD 

7025 

A 

89 

291 

CKGVSSHES I FVI ATNTRNKLLILFQENDEVLVAGFGRKGHAVGDI P 
GVRFKWKVANVSLLALLIF 

7026 

A 

1295 

1429 

GQLYEKLGRRGPGAVAHAYNSQHFGRLRWADHEVRRWRPSWLTQ 

7027 

C 

332 

763 

MTGCLNFIEENDEVLVAWIWSQKVMLLVIFLESXFKVVQSSPMVSLL 
ALYKWLYGETKIINflNGENTWINFHMPKNVCILLSPVLTTKIMFIT 
TTTPPYFFYPKPVNAGPVPILGNNKTGLFQDSLELITFVTWHSFPHP 
VE* 

7028 

A 

89 

184 

IHILTLQELLKEALNMERDKQYEPLGKHTKL 

7029 

A 

2 

275 

LDLDKS I KI VKKLKLTG FPYKI FKNTS FI/KD I VFMRTW YPVS I PAF 
YNPVTSLLKP/VGEKDTWSGMWTTGHLGL/AHGVRLKTNKDSLYKGL 
V 

7030 

A 

178 

621 

PPRHVQIjNHAEFEDQDDEARVQYEGFRPGMYVCVEIENVPCEFV*NF 
DPRYPIILGGLGNSEGNVGYVQ/ IPITPQGTGFLAIQSVSGIMPDFR 
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IAATGVVLDLDKSIKIVKKLKLTGFPYKIFKNTSFIKGMFNSALEVA 
KFEGAVF 

7031 

A 

159 

486 

DPITPQGTGFIAIQSVSQIMPDFRIAATGVVIiDIiDKSIKIVKKLKLT 
GFPYKIFKNTSFIKGMFNSALEVAKFEGAVIRTISGIRGQIKKAL*A 
PEGAFRAS FEDKLL 

7032 

A 

19 

3881 

IATMEAKDQKKHRKKNSGPKAAKKKKRLLQDLQLGDEEDARKRNPKA 

FAVQS AVRMARS FHRTQDLKTKKHHI PWDRTPLEPPP I VVWMGPP 

KVGKSTLIQCLIRNFTRQKLTEIRGPVTIVSGKKRRLTIIECGCDIN 

MM I DLAKVADLVLML IDAS FG FEMETFE FLN I CQVHGF PKI MGVLTH 

LDSFKHNKQLKKTKKRLKHRFWTEVYPGAKLFYLSGMVHGEYQNQEI 

HNLGRF I TVMKFRPLTWQTSHP Y I LADRMEDLTNPED I RTN I KCDRK 

VSLYGYLRGAHLKNKSQIHMPGVGDFAVSDISFLPDPCAIiPEQQKKR 

CLNEKEKLVYAPLSGVGGVLYDKDAVYVDLGGSHVFQDEVGPTHELV 

QSLISTHSTIDAKMASSRVTLFSDSKPLGSEDIDNQGLMMPKEEKQM 

DLNTGRMRRKAI FGDEDESGDSDDEEDDEMS EDDGLENGS SDEEAEE 

EENAEMTDQYMAVKGIKRRKLELEEDSEMDLPAFADSDDDLERSSAE 

EGEAEEA&ESSEEEDCTAGEKGISGSKAAGEGSKAGLSPANCQSDRV 

NLEKSLLMKKAALPTFDSGHCTAEEVFASEDESEESSSLSAEEEDSE 

NEEAIRKKLSKPSQVSSGQKLGPQNFIDETSDIENLLKEEEDYKEEN 

NDSKETSGALKWKEDLSRKAAEAFLRQQQAAPNIiRKLIYGTVTEDNE 

EEDDDTLEELGGLFRVNQPDRECKHKADSLDCSRFLVEAPHDWDLEE 

VMNS IRDCFVTGKWEDDKDAAKVLAEDEELYGDFEDLETGDVHKGKS 

GPNTQNEDIEKEVKEEIDPDEEESAKKKHLDKKRKLKEMFDAEYDEG 

ESTYFDDLKGEMQKQAQI*NRAEFEDQDDEARVQYEGFRPGMYVRIEI 

ENVPCEFVQNFDPHYPIILGGLGNSEGNVGYVQMRLKKHRWYKKILK 

SRDPI I FS VGWRRFQTI PLYYIEDHNGRQRLLKYTPQHMHCGAAFWG 

PITPQGTGFLAIQSV\SGIMPDFRIAATGVVIJDLPKSIKIVKKLKLT 

GFPYKIFKNTSFIKGMFNSAL*VAKFEGAVIRTVSGIRG\QIKKALR 

APEGVFR\ASFE\DKLL\MSDIVFMRTWYPVSIPAFF\NPVTSIiL\K 

PVGEKDTWSGMRTTGQIiRLAHGVRLKANXDSIi YKP I LRQKKHFNSLH 

I PKALQKALPFKWKPKTQAKAGKVPKDRRRPAVI REPHERKI LALLD 

ALSTVHSQKMKJKAKEQRHIjHNKEHFRAKQKEEEEKXiKR 

R I QGQKERRNQKS S LKGAEGQLQ 

7033 

A 

182 

374 

PDFR I AATG WLDLDKS I KI VKKLKLTGF PYKI FKNTS F I KVC I S I Y 
SHI YKS PYWLRKGMKYL 

7034 

A 

2 

198 

QARIiCLKKJOOCKMWHIYTMEYYAAIKKNGIMSCAATWMELEAIILNE 
LTQKQKTKYHMFSLISRS 

7035 

A 

1 

457 

IFLRQFLSPRLEYSGVIKAHCSLNLLGSSDPPASAFKVAGTTGIRHH 
AWLIFVFFFAETGFHYVAQGGLELLNLRNIiPATAFQSAGIIGISHHD 
LLGIQITTWLLQKLSYLQKKICKDQETGLLHAFHLYPYPYPMPSSPQ 
LLQGPDILTNG 

7036 

A 

2 

145 

VETGFHHVGGAGLELLTSDDPPASDYQSAGITGISHCARPMAPFLTL 

7037 

A 

55 

193 

MLSFSFLSFFFFRGMVLTMLLPRVVLNCWPQAILPPRPPKVLGLLA 

7038 

A 

564 

884 

GLLMSPITLCLVDLLPLLLLLLLLLLLLLLLLLLLIjIjLLiLLLLLLLIj 
LLPPPPPPPPPPPPPLLLLLLIiliLLFFFFFETESRSVAQAGVQWRHL 
GSLQALPPRFAP 

7039 

A 

257 

515 

KFFSFLSSACCLEQVLFCFVLFFVFFLLLIIFSFFFSFSFSSFFFSS 
SSSSSSSFSSSSFFFLRWS/PRSVAQAGVQ/WS*FGSLQPP j 

7040 

A 

936 

1317 

I LVLHGCF I CWWYLCLSMYFFFNFFEMESRSVAQAGVQWCA/ DLCSL 
Q71PPAGFTPFSCLSLPSSWDYRHPPPRLAKFFVFLVETG\FHRVSQD 
GLDLLTSGDPPTSASQSAGITGVSHRARPLKHIiS 

7041 

A 

1 

1218 

MGKVWKQQMYPQYATYYYPQYLQAKQSLVPAHPMAPPSPSTTSSNIW 
SSSSSNSGWDQLSKTNLYIRGLPPHTTDQDLVKLCQPYGKIVSTKAI 
LDKTTNKCKG YG FVDFD S PAAAQKAVS ALKASGVQAQMAKQQEQDPT 
NLY I SNLPLSMDEQELENML KPFGQVI STR I LRDSS GTS RGVGFARM 
ESTEKCEAVIGHFNGKFIKTPPGVSAPTEPLLCKFADGGQKKRQNPN 
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KY I PNGRPWHREGEVRLAGMTLT YD PTTAAI QNGFYPS PYS IATNRM 
I TQTS I TP Y IAS PVS AYQVQS PS WMQPQP Y I LHD PGAVLTPSMEHTM 
S LQPASM I S PLAQQMSHLS LGSTGTYMPATS AMQGAYLPQYAHMQTT 
AVPVEEASGQQQVAVETSNDHSPYTFQPNK 

7042 

A 

1 

1169 

imPPSPSTTSSNNNSSSSSNSGWDQLSKTNLYIRGLPPHTTDQDLVK 
LCOPYGKIVSTKAILDKTTNKCK'GYnFVriPnQoanzvnv'ftiJCTVT vaer» 

VQAQMAKQQEQDPTNLYISNLPLSMDEQELBNMLKPFGQVISTRILR 
DSSGTSRGVGFARMESTEKCEAVIGHFNGKFIKTPPGVSAPTEPLLC 
KFADGGQKKRQNPNKYIPNGRPWHREGEVRJLAGMTLTYDPTTAAIQN 
GFYPSPYSIATNRMITQTSITPYIASPVSAYQVAKETRENKYRGSAI 
KVQSPSWMQPQPYILHDPGAVLTPSMEHTMSLQPASMISPLAQQMSH 
LSLGSTGTYMPATSAMQGAYIiPQYAHMQTTAVPV\EEASGOOOVAVE 
TSNDHSPYTFQPNK 

7043 

A 

1 

743 

VFLVEMGFHHVGQPSLELPS S SDAPAS ASQTAG I TGVSHCSRPVRFF 
FAGMLLTCCISWRCFASRALCRLQHPIISSSQQPLAFQCFWSTDVPK 
TRVYQIGTILPLPSKLLDSWAEGYPATFESHVPGIYSRVGVSSIWHL 
GCV>7SEPGVSRFLPMRWDGPSAREWMVLCHARKAAPAPCSRFPAQPV 
LSQAPPPATVALCHVLTSLIPLFNLPQVLSYWLPSRDEVGVSAWDSE 
AWLLLHSAPKPQ 

7044 

A 

2 

171 

Y IGGWWDKAGAHS I LNGVWYRGGHYKI KHQDGI FWAE YPGGS YSLRA 
AHMMIRPID 

7045 

A 

1 

350 

EGGTGVRS LS FYQH 1 1 TVGTGHGS LLF YD I RAQKFLEERAS S S LDSM 
PGPAGRKLKLACGRGWIJ^QDDVWVNYFGGMGEFPNALYTHCYNWPEM 
KL FVAGGPL PS GLHGNYAGLWS 

7046 

A 

2 

1685 

QT YDDG AATRLMS TVKPLRE PAP S ED V I D I KPD PDDL I DEDLNFVQE 
NPLSQKKPTVTLTYGSSRPSIEIYRPPASRNADSGVHLNRLQFQQQQ 
NSIHAAKQLDMQSSWVYETGRLCEPEVLNSLEETYSPFFRNNSEKMS 
MEDENFRKRKLP WS S WKVKKFNHDGEEEEEDDDYG SRTG S I SS S V 
S VPAKPERRAFYKOASTTTRPFT.Pn ^r*r*r , l7 , Q Dirr* V"A t nrTrcTuiiivcrv 

GKCRYSRGSVFFSSQKRYDFRYSIFTWSKEGGAVKLWDQEIiRRCRAF 
RLETGQATDCVRS VCRGKGKI LVGTRNAE T TP VfSPKM A A m T T .vmrh 
VDGP I WGLATHPS RDF FLS AAEDGTVRLWD IAD KKMLNKVNLGHAAR 
TVCYSPEGDMVAVGMKNGEFIILLYCFGYLEPLAGPCEFEDWLFISM 
KKHIGI VI ATAPS I HI WDAMNKQTLS I LRCYHS KGVCS VSFS ATGKL 
LLSVGLDPEHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTQFV 
S VGVKHVICFWTT^GRALLSKKGIjLSTLEDARMQTMIAI AFGAVA 

7047 

A 

47 

218 

GHMEGEVWGT^THPYLPICAWSDDKTLRIWDLSPSHCTvlT^VRKriKK 
GKRLSHLNVD 

7048 

A 

3 

1716 

GRCCCFSPDGIxALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIR 
FSPGIWPVIGEVTDVTASCLTSDK^TVLlATGDDLGFVKIJFRYPTKASS 
EQGPTTAJ^TPDKQCRQEKETYRATAPSIHIWDAMNKQTLSILRCYH 
S KG VCS VS FS ATGKLLLS VGLD PEHT I T I WRWQEGAKI ASRAGHNQR 

IFVAEFRPDSDTQFVSVGVKHVKFWTI^GRALLSKKGLLSTXEDARM 
QTMLAI AFGAGKI LVGTRNAE T T EVflPKMAAfUT T r> Tum 

ATHPSRDFFLS AAEDGTVRLWD IADKKMLNKVNLGHAARTVCYSPEG 
DMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAV 
GSSENSVDFYDLTLGPTLNRISYCKDIPSFVIQMDFSADSSYLQVSS 
GCYKRHVYEVTSGKHLMGS\SAIDRITWATWTSILGDEVLGIWSRHA 
EKADWCACTVSHSGISLVTGDDFGMVKLFDFPCPEKFAJOIKIvFLGHS 
PHVTNIRFTSGDRHVVSAGGDDCRLVTFSAJiEEQTMLLVKRLNTTTP 
TALVYEIS 

7049 

A 

1 

5853 

MPLKLQNCLALHPER VLVATGQVGKEP YI CI WDS YTVQT I S VLKDVH 
THGIACLAFDLDGQCSVKAVIDNARRNECGYIPVKLYLQKHSFSLIC 
LPFYSFLDFLDINQRLVSVGLDSKNAVCVWDWKRGKMLSMAPGHTDR 
FWSLCGNALTPKRGVFGKTGDLQTILCLACARDELTYSGALNGDIYV 
WKG I NL I RT I QGAHAAG I FS MNACEEG F ATGGRDGC I RL WDLTFKP I 
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TVIDLRETDQGYKGLSVRSVCWRGDHILVGTQDSEIFEIWQERNKP 

FLIMQGHCEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARC 

NME E P I RCAAVNADG I HIiALGMKDG S FTVLRVRDMTEVVH I KDRKEA 

IHELKYSPDGTYIiAVGCNDSSVDIYGVAQRYKKVGECLGSLSFITHIi 

DWS SDSR YLQTNDGNGKRLF YRMPGGKEVTS TEE I KGVHWAS WTCVS 

GLEVNGIWPKYSDINDINSVDGNYIGQVIiVTADDYGIIKLFRYPCLR 

KESLADSHSDESDSDLSDVPELDSEIEQETQLTYRRQCSAGCTSRAP 

AS AQLLVKPQEAFTHGRRQRG S S YRG YDCRSNliF YTQ I GE I VYHV AA 

VGVIYNRQQNTQRFYLGHDDDILCLTIHPLKDYVATGQVGRDPSIHI 

WDTETIKPLSILKGHHQYGVSAVDFSADGKRLASVGIDDSHTWIiWD 

WKKGEKLSIARGSKDKIFVVKMNPYVPDKLITAGIKHMKFWRKAGGG 

L IGRKG Y I GTLGKNDTMMCAVYG WTEEMAFS GTS TGDVC I WRD I FLV 

KTVKAHDGPVFSMHALEKGFVTGGKDGIVAIiWDDSFERCLKTYAIKR 

AALAPGS KGLLLEDNPS I RAI SLGHGHILVGTKNGE ILfiVDKSGP I T 

LLVQGHMEGEVWGLATHP YLP I CATVSDDKTLRI WDLS PSHCMLAVR 

KLKKVQKGNVGLEPPYRVPTGAPPSGVMRRGQPSFRPQNGRTTDSLH 

HVPGKATGTECQPVKAARKGAILCKTTGAELPKTMGTHLLHQCGLDL 

RF AGG PLQTF FAWVP PTPRG ITS GGCKTVKRVTEL I PVETPL YKVS S 

NPCWGGLTQSGGTGSRTHLMKHSDYPLAEGVHYTGGNPTCPDCPDSS 

ELGAQSSEAADSQSDDSRPFPQELGSLRQSVAKWPLRICAALCLGLK 

ELVAWAHGGDLLIHQIiHRSMEKWFPGQDSTITHCLSWIiGVGAPLVL 

CS SQKAATLVLRES EQVKGD SDLFGNGVAATRTFS ELVAKKIiARGCS 

PQWNSESSHLVKEKLARGSRPELNGGRCCCFSPDGKALAVGLNDGSF 

LMANADTLEDLVSFHHRKDMISDIRFSPGIWPVIGEVTDVTASCLTS 

DKMVLATGDDLGFVKLFRYPTKGKFGKTKRYVAHSTHVTmn^WTYDD 

SMLVTLGGTDMSLMVWTNEMEGYREKRPCDSEESDIDSEEDGGYDSD 

VTRENEISYTIRALSTNIRPMLGIKPHLQQKEPSIDERPPVSRAPPQ 

PEKLQTNNVGKKKRP I EDLVLEL I FG YRGRDCRNNVHYLNDGDD 1 1 Y 

HTASVGI LHNVATGSQSFYQEHNDDI LCLTVNQHPKFINIVATGQVA 

TAPSIHIWDAMNKQTLSILRCYHSKGVCSVSFSATGKLLLSVGLDPE 

HTITI WRWQEG AKI ASRAGHNQRI FVAEFRPD SDTQFVS VGVKHVKF 

WTLAGRALLSKKGLLSTLEDARMQTMLAIAFGANNLTFTGTISGDVC 

VWKDHILCRIVARAHNGPVFAKYTTLRDGLIVTGGKERPSKEGGAVK 

LWDQEIiRRCRAFRLETGQATDCVRSVCRGKGKILVGTRNAEIIEVGE 

KNAACNILVNGHVDGPIWGLATHPSRDFFLSAAEIX3TVR1»WDIADKK 

MLNKVNLGHAARTVC YS PEGDMVAI GMKNGEF 1 1 LLVS SLKI WGKKR 

DRRCAIHDIRLVNKWNRVSS IYTjI 

7050 

A 

1 

426 

LKQQ I EEQRVQVQ WERAQQVAVQEQE I ARAEAEQMAKKAE/ AFQLY 
QEAAQLDMLLEKLPQVA\ EE I SGPLTS ANKITLVS SGS GTMGAAKVT 
GEVLDILTRLP\ESVE\RLTG\VTILPRLNHKAF*RTALSLQPSQMP 
SLIAE 

7051 

A 

162 

1449 

TMFFTCGPNEAMVVSGFCRSPPVMVAGGRVFVLPCIQQIQRISLNTL 
TLNVKS EKVYTRHGVP I SVTGIAQVK I QGQNKEMLAAACQM FLGKTE 
AE I AHIALETLEGHQRAI MAHMTVEE I YKDRQKFSEQVFKVASSDLV 
NMGIS WSYTLKD I HDDQD YLHS LGKART AQVQKD AR I GEAEAKRD A 
GIREAKAKQEKVSAQYLSEIEMAKAQRDYELKKAAYDIEVNTRRAQA 
DLAYQLQVAKTKQQI EEQRVQVQWERAQQVAVQBQE I ARREKELEA 
RVRKPAEAERYKLERLAEAEKSQLIMQAEAEAASVRMRGEAEAFAIG 
ARARAEAEQMAKKAEAFQLYQEAAQLDMLLEKLPQVAEEI SGPLTS A 
NKITLVSSGSGTMGAAKVTGEVLiDIIjTRIjPES VERb I Lj v t» X byvrJn^ 
PLRTA 

7052 

A 

57 

389 

GWELIiVTKVRKDDQWNEDLKTQSKLFYEECGQIMGTEWVSLSCQVAE 
E I SGPLTSAIHCITLVSSGSGTMGAAKVTGEVLDILTRIiPESVERIjTG 
VS I S QVRS QVGAEENC 

7053 

A 

162 

1581 

TMFFTCGPNEAMWSGFCRSPPVMVAGGRVFVLPCIQQIQRISLNTL 
TL1WK5EKVYTRHGVPISVTGIAQVKIQGQNKEMLAAACQMFLGKTE 
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AEIAHIALETLEGHQRAIMAHMTVEEIYKDRQKFSEQVFKVASSDLV 

GI REAXAKQEKVSAQYLSEI EMAl^QWDYELKICAAYDI EVNTRRAQA 
DLAYQLQVAKTKQQIEEQRVQVQWERAQQVAVQEQEIARREKELEA 
RVRNPAE\ AERYKLERPAEAEKSOLTMOAEATC\ A A«3\tp\ MPPC&cnr 

CP*GPEPRAE\AEQMA\KKAEAF\QLVPRGVAQLDMLRRESCPRWQE 
EI \ SGPLNFSPIRS PLVSSG\SGTMGAA\ KVT\GEVLDILNSPCP\E 
SVER\ LTGREHLPRVNH\ KPLKNSLEPSALHRCPASLAEVAIUNDPPV 
ACNPLGLPEHVH 

7054 

A 

1 

327 

FFFFFFERESYSLAQAGVQWHNLSSLQPLPSGFKRFSCLSLPSSRDH 
ERAPS HLAWFLVEMGFHHVGQAS LELLTS SDP P ALAS QS SG I TWS 
HHARPGILNSINRFT 

7055 

A 

1 

202 

MELRLSANA1WPALTGGAWPRRSHQASSLGYCCVSSSAVCSASLSGR 
RRGRHEPLVSPPPYRHLTML 

7056 

A 

1 

493 

FPASRLVTLCSEPEDAAMERSAFMELDAGSRLVMHLREWPALLVSST 
GWTEFEQLTLDGHNLPSLVC^ITGSVDLGVCLHMSSSGLDLPMKVVD 
MFGCCLPVCAVNFKCLHELVKHEENGLVFEDSKELAALQMFFSHFPD 
PAGKLNQFWKDLRESQQLRWDES 

7057 

A 

409 

521 

ALLFI FLQMLFSNFPDPAGKLNQFRKNLRESQQLRWD 

7058 

A 

296 

869 

PYSSCPMTRASSPKELQQFVVSGSPMRAPEEEPEDPDTERSAFMERD 
AGSGLV^1RLRERPALLVSSTGWTGSVDLGVCLHTSCSGLDLPMKVVD 
MFGCCLPVCAVNFKWQEQNPNLSGDSFTDPPLRRKQCRASCPHLHEIi 
VKHEENGLVFEDSEELAAQLQVLFSNFPDPAGKLNQFRKNLRESQQL 
RWD 

7059 

A 

1 

848 

MPGLRVKMPCGPGISQDEAVWSMDDNFEV1WQHPVWRAATLAKDCKN 
QSAQAVTVYNKPASFFKEAPLDLQHRLFMKLGSTHSPFRARVFFCSF 
ISEPEDPATERLAFTERDAGSRLVTLLHERPALLVSSTSWTGKGPLR 
EYYSRLIYQKHFQHIQVCTPWL\GAEDYPLLIGSADLG\VCLHTSS\ 
SGLDLP / L* KWDMFGCCLPVCAVNFKVFT\ ELVKHEENG \ LVFEDS 
EELAAQLQMLFSNFPDPAGKLN\SFRKNLPESQQL*WDESWVQTVLP 
LVMDT 

7060 

A 

738 

1902 

FHHLTETNS PNYF I TMALNVAPVRDTKWLTLEVCRQFQRGTCSRSDB 

EPKPAHPPKqOO\rRNf^P\7T2if , *I«T>c:T VPOPCDCMPirvT unn'i'u i Tr*T>f\T 

EINGR1WLIQQKTAAAMLAQQMQFMFPGTPLHPVPTFPVGPAIGTNT 
A I S F AP YLAP VT PGVG L VP TE T L PTT P V T vpr q p p vrnrDrc q t a Tmc t 

LRTDKLE VCRE FQRGNCARGETDCRFAHPAD S TM I DTSDNTVTVCMD 
Y I KGRCMREKCKY FHP PAHLQAK I KAAQP / QAAATVMTQSTAKAMKR 
PLEATVDLAF PPGALH PLPKROALEKSNGTS AVFNPS VLHYOO ALT <3 
AQLQQHAAFI PTGSVLCMTPATS I VPMMHS ATSATVSAATTPATSVP 
FAATATANQULK 

7061 

A 

1 

240 

ALHPL PKRQALEKSNGTS AVFNPS VLHYQQ ALTS AQLQQHAAF I PTG 
MCPYCPTSCALLVMCFLLISLSCLVASSLLLKV 

7062 

A 

3 

354 

AARVMTO S TAKAMKRPLEATVDLAFPPfiAT.H PT.P K"R HAT ,RK<3wnT <5 a 

VFNPSVLHYQQAiTSAQU3QHAJ^IPTDNSEIISRIX3MECQESALRI 
TRHCYCTYYPVS SS VELPRTAC 

7063 

A 

1 

621 

MTAFFVKP AJ^^VCOLPHWND 1 1 SNSLYWER YKKRFWft q q <=: ctp q qnnv 

SFLEKKVLAMEDKHIIQLQSIKEEKDQLQVLVSKQNSIIEELEKKIV 
TATVITOSVLQKQQHDLMETVN^ 

CRTPNKEWKFSGGGGWRGQRGCRVKPTAVTPEAJHPSPRVGTNVENL 
GNGEQASFGTYPCFRFTPE * 

7064 

A 

362 

720 

SDVTFSGKPSLVTLFPLYPPGKFSFSSVDVSMLQSTIKHGSCSLWAS 
PSEEFDILEWPLFCVLFRPTASFISSRLPSKGVKSSSCPFGKEAGV 
LNGCKEKAI FPIX5ACVFFPSLSTTP j 

7065 

A 

1 

2610 

MAETAVINHKKRKNS PRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIR 
KSDGTLQEHDG I CE IHVAKYAE I FGLTS AEAS KD I RQALKS FAGKEV 
VFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIG 
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LQNRFTQFRLSETKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWI 
I ERYTAT I ATTQRCAQ I QTKRGVREL P AGEEAFR I ELGGHYTGSNLB 
HVGPVSTEHTPSWEEDTVRSRDFQCTGLKSLKIVRDSSSAYNNFRK 
SMDS I GKKQYQVQHGS CS YTFLLPEMDNCRS S S S P YVSNAVQRDAPL 
EYDDSVQRLQVGDNSIDSWKNAGRVFKDSDKWDANDPILKDQTQEWS 
GSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKINGV 
EDHKTIFDGDGKTYQNVQQFIDEGNYTSGDNHTLRDPHYVEDKGHKY 
LVFEANTGTENG YQGEE S L FNKAY YGGGTN F FRKE S QKLQQS AKKPJD 
AELANGAIiELEKTTLKFIWNOK^^ARTAK<?TTi^OIOJK'nr:nTk'T Dnrv-T 

YYKATVNKTAWCAQIQTKRGVRELPAGEEAFRIELGGHYTGGCSETL 
SQGSLGDPVGLLTSQRGGGRLSPRVRTTLRRPCLPRLFGMAAIiWDHC 
LNKLPACLPLSQALLLGKPKLRQCENLLWPLRRGLRTACEPRNFVG 
VVEAKAERVNSIYNHGVPRKKRSQSl^HVFPANDPTVAKEEQISFRD 
CAEVFKSGHTTNGIYTLTFPNSTEEIKPTAVTPEAHPSPRVGTNVEN 
LGNGEQASFGTYPCFRFTPEGRACGALSLPPLTFQQVLEFDSHQTEI 
AKICKNSIGQLFCINDCTGSKGQK 

7066 

A 

122 

1840 

AGLFFPLQSDSLLHAWREHSSKNQVCYWKKRKEKTFIDGPSHGSVAA 
IiRFRRQQLGTLDVCLPSSIiLSCWFITEERMWQIVFFTLSCDLVLAAA 
YNNFRKSMDSIGKKQYQVQHGSCSYTFLLPEMDNCRSSSSPYVSNAV 
QRD APLE YDDS VQRLQVLEN I MENNTQWLMKLENY I QDNMKKEMVE I 
QQN AVQNQTAVM I E I GTNLLNQTAEQTRKLTDVEAQVLNQTTRLELQ 

TiTiRH^Ti^TTJTCTi'PTCOTT^FjriT'Q'R'TKrtfT OTiVKJCPT T?vinrT a Mirr^ iru t t r\r 
uuoiiOJJO l ii) iujd ai^ i uu V 1 Oft X IN Z\X>%^ UIxJM o r J_iC> JtVJV VXinTOISLf J\fl X X U1j 

qs i keekdqlqviivskqns i ieelekki vtatvnns vlqkqqhdlme 

tvnnlltmmstsnsakdptvakeeqisfrdcaevfksght™ 

tfpnsteeikaycdmeaggggwtiiqrredgsvdfqrtwkeykvgfg 

npsgeywlgnefvsqltnqqryvlkihlkdwegneayslyehfylss 

eelnyrihlkgltgtagkissisqpgndfstkdgdndkcickcsqml 

TGGWWFDACGPSNIjNGMYYPQRQNTNKFNG I KWYYWKG SGYSLKATT 
MMIRPADF 

7067 

A 

1 

1433 

MPNRRGRQGRRRWRTRGLMRPPPRWLPTAVTPEAHPSPRVGTNVEN 
LGNGEQASFGTYPCFRFTPEGSFLEKKVliAMEDKHI IQLQS IKEEKD 
QLQVLVSKQNS 1 1 EELEKKI VTATVNNS VLQKQQHDLMETVNNLLTM 

IKAYCDMEAGGGGWTI I QRREDGS VDFQRTWKEYKVMTDVSRI FAIQ 
LTMVQLK I FRL YD V I CGTMLPHDARQRQRATG P S QS CD PQGEQQTLR 
APCCOMALSDCGLMIHLiKGLTGTAGKTS ^ T O PfTKm F TKTinnfm KP 
ICKCSQMLTGDEYFAGGLHSSFDDLCGNSGCGNQERKLEGSINDIKS 
DVCISSLVLKANNIHSSPSFTHLDKSSPQEISEVIFQREEINLAKEI 
LQGKVVHPSPKRQGCRVCLQGGRLEERYPFVFLPLFFQQKGHIjLiDTF 
QDPGVPQ 

7068 

A 

3 

326 

HS LHSALS LEVTGS TKLPKP Y I T INNLNPRENKDVLNFTCE PKSENY 
T Y I WWLNGQS Ij P VS PR VKR P I ENR I L I L PS VTRNETG P YQCE I RDR Y 
GGIRSDPVTLNVIi 

7069 

A 

3 

423 

ETPKPS I SSSNLNPREVMEAVRLI CDPETPDAS YLWLLNGQNLPMTH 
RLQLS KTNRTLY L FGVTKY I AGP YECE I RNPALLNDCHNTNTEKKKQ 
PEKVESSDDIENSNQPFSHPKAFKNIRVQHGHYGIHRKXITKLET 

7070 

A 

3 

554 

SLLNFWNLPITAQVTIEALPPKVSEGKDVLLLVHNLPQNIJVGYIWYK 

GOLMDLYHYITSYVVDGOINIYGPAYTGPJETVY<5^ 

AGS YTLHI I KRGDRTRGVTG YFTFNLYHTS YLWWMNGQSLPMTHRLQ 

LSKTNRTLFIFGVTKYIAGPYECEIRNPVSASHSDPVTLNLL 

7071 

A 

3 

602 

SLIJtfFWNLPTTAQVTIEAQPPKVSEGKDVIj^ 

GQIRDLYHYITS YWDGQI 1 1 YGPAYSGRETAYSNASIjLIQNVTRED 
AG S YTLH 1 1 KRGDGTRGVTGYFTFTL YREAMETVILTCDPETPDTS Y 
QWVJMNGQSLPMTHRFQLSETNRTLFLFGVTKYTAGPYECEIRNSGSA 
SRSDPVTLNLL 

7072 

A 

3 

550 

ETPKPSISSSKLNPREAMEAVSLTCDPETPDASYLWWMNGQSLPMSH 
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RLQLS ETNRTLFLLGVTKYTAG P YECE I RNPVS ASRSDPFTLNLLHG 
PDLPRIYPSFTYYRSGEVLYLSCSADSNPPAQYSWTINGKFQLPGQK 
LFIPQITTKHSGLYACSVRNSATGKESSKSMTVKVSDWTLP 

7073 

A 

1 

102 

AGSYTLHIIKGDDGTRGVTGRFTFTLHRIQTSEE 

7074 

A 

3 

1451 

GAVRTWGRGFQTEKCQASLLNFWNPPTTAQVTIEAEPTKVSKGKDVL 
LLVHNLPQNTLAGYIWYKGQMKDLYHYITSYWDGQIIIYGPAYSGRE 
TVYSNASLLIQNVTREDAGSYTLHIVKRGDGTRGETGHFTFTLYRHS 
LDSALSLEVTGSSQPL*DTWENCPTLWLHCLMTELTSGLDSVLPCVI 
SAEVPSPRSGFPVSKGFKDNWKFHHPSLGCPWQGKLQRKHTSGGKVR 
LKLRRFQHCMLQVRTTS\ETPKPSISSSNLYPREDMEAVSLTCDPET 
PDASYLWWMNGQSLPMTHSLQLSKNKRTLFLFGVTKYTAGPYECEIR 
MPVSASRSDPVTLNIiLPKLPKPYITINNLNPRENKDVLAFTCEPKSE 
NYTY I WWLNGQS L P VS PR VKRP I ENR I L I L P S VTRNETG P YQCE IQD 
R YGG I RS YPVTLNVL Y\ TTKHSGLYACS VRNS ATGMES S KSMTVKVS 
APSGTGHLPGLNPL 

7075 

A 

3 

437 

ETPKPSISSSNLYPREDMEAVSLTCDPETPDASYLWWMNGQSLPMTH 
S LQL S KNKRTLFL FG VTKYT AG P Y E CE I RNP VS ASRSD P VTLNLLQH 
SWLIDGKFQQSAQVFFIPQITKTYRGVYVCFIHNSATAGTNLIIKRI 
IVP 

7076 

A 

79 

1516 

EASLLEDDDGMVYLRPLTSHQYRTAQLRAMLRKFLDPRLSSTEENTQ 
AAETMGPLSAPPCTQRITWKGLLLTASLLNFWNPPTTAQVTIEAEPT 
KVS KGKDVLLLVHNLPQNLAG Y I WYKGQMKDLYHYI TS YWDGQ III 
YG PA Y SGRE TVY S NAS L»L I QNVTRED AG S YTL H I VKRGDGTRGE TGH 
FTFTLYLETPKPSISSSNLYPREDMEAVSLTCDPETPDASYLWWMNG 
QSLPMTHSLQLSKNKRTLFLFGVTKYTAGPYECEIRNPVSASRSDPV 
TLNLLPKLPKPYITINNLNPRENKDVLAFTCEPKSENYTYIWWLNGQ 
SLPVSPRVKRPIENRILILPSVTRNETGPYQCEIQDRYGGIRSYPVT 
LNVliYGPDLPRIYPSFTYYHSGENLYLSCFADSNPPAEYSWTINGKF 
QLSGQKLFIPQITTKHSGLYACS VRNS ATGMES SKSMTVKVSAPSGT 
GHLPGLNPL 

7077 

A 

9970 

11210 

MKDACGGIKGATQDNLLIiSTQ/ PKLPMPYITINNLNPREKKDVLAFT 
CEPKSRI^TYIWWIjNGOSIjPVSPRVKRPIENRIIiTLPSVTRNETGPY 
QCEIRDRYGGIRSNPVTLNVLCEYPLFLCGPGHQLNSKRPEARPLSIi 
S PVQV * TTLLLD IRPGHDSLP WE I LGRHSLNQEYKGRGALVMGH * GP 
TACNERNRGIPQAWA* * T*MGFGCHLRLCLGSEGHCVPLRDQEHPLP 
LDDITCGFILFPPDGPDLPRIYPSFTYYRSGENLDLSCFADSNPPAE 
YSWTINGKFQLSGQKLFIPQITTNHSGLYACSVRNSATGKEISKSMI 
VKVSGKWIPASIiAIGF*VESIWLSEKSQENICIPRLCPMGTSKSQIL 
LLNPPNLSLQTLFCLFFCFLMADLVSGLRKVGRGLYQP 

7078 

A 

460 i 

1832 

QPVLREFLlDPRL I STEENTQAAETMGTLS APPCTQR I KWKGLLI/TAS 
LLNFWXKTLPTTAQVTIEAEPTKVSEGKDVTjLLVHNLPQNLTRATIWX 
YK\GQMRHLYHYITS YWDGEI 1 1 YG\ PAYSGRETAYSNASLLIQNV 
\ TRED \ AGS YTLHI I K\ RGDRTRGVTG Y FTFNL YIiETPKPS I S S SNL 
NPREAMETVILTCNPETPDASYLW\*LNGQ\NLPMTHSIjQLSKANRT 
LYIiFG\VTNYTAGPY\ECEIWELRECQARSDPVTPEFSSRKIiPKPYI 
TINNI J NPGEE*GCL*TFIC\EP\KSENYTYIWWLNGQKPP/VSSPRV 
KR PH * KTG S L I PTP VFTRKWKQG P YSM WNYGDR YGG \ I RS LP S S PLN 
CSSYGSRTSPRIL»PFHFTY\YRFREENLDLFLGFTGIFNPP\AQYSW 
DN*WKSFQLTRGQKLFYSGHITTKHSGLYVC\SVRNSATWQGKAPNP 

7079 

A 

2 

315 

RWSLCPLQAGVQWRDLCSLKPPPPEFKQFSCLSLPSS*DYRCPPPCP 
ANF/SFFFVFLVETGFHHVGQAGLNLLTSRDLLTSASQSAEITGMSH 
CPRPTLSNSRVL 

7080 

A 

1 

534 

MSWLEENVHEVLQAVDAGDPAVEACENRRKVLYQRAPRNIHRHVILS 
EIKEAVT^ALPPPHPKLMRKPHGPPGESRACAQLIiliLCPLAAPPCPCE 
GGSWQAAPLSPISHGNTIALFFRSLLPNYTMEGERPEEGVAGGLNRN 
HGLNRLMLAVRDMMANFHLNDLEAPHEDDAEGEGEWD 
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7081 

A 

1217 

1545 

TRHLVLRASSSQGPWGGFRVRCWGKGPPRAPQCLLLG PRAPS PS TDS 
PLLRHVPLLQGERPEEGVAGGLNRNQGLNRLMLAVRDMI1ANFHLNDL 
EAPHEDDAEGEGEWD 

7082 

A 

2 

1665 

KKKKKQKNKKSSTGEASENGLEDIDRILERIEDSTGLNRPGPAPLSS 
RKHVLYVEHRHLNPDTELKRYFGARAIIiGEQRPRQRQRVYPKCTWLT 
TPKSTWPRYSKPGLSMRLLESKKGLSFFAFEHSEEYQQAQHKFLVAV 
ESMEPmiVVLLQTSPYHVDSLLQLSDACRFQEDQEMARDLVERAIiY 
SMECLAP r HP'LFS'L.TfiGAC*R'LiDYRRPFMRc;pi*YT.aT.Y'K'nMCI?T PVDPPDD i 

■9riC«w\f HC xj l. vj J_l x JwnV,I> 1 11/ X JVTvXr XjIMXvO X7 X XinXi X JV^iMO Xr J— I J— > i\_ivvJ ' XT X"C 

TALE YCKL I LSLEPDEDPLCMLLL IDHLALRARNYE YLI RLFQEWEA 
HRNLSQLPNFAFSVPLAYFLLSQQTDLPECEQSSARQKASLLIQQAL 
TMFPGVLLPLLESCSVRPDASVSSHRFFGPNAEISQPPALSQLVNLY 
LGRSHFLWKEPATMSWLEENVHEVLQAVDAGDPAVEACENRRKVLYQ 
RAPRNIHRHVILSEIKEAVAALPPDVTTQSVMGFDPLA\PSDTIY\S 
YVRPERLKSLSSH\GNT\IALFFRSLL\PNYTMEGERP\EEGVAGGL 
NRNQGLNRLM\LAVR\DMMANFHLN\DLEAPHEDDAEGEGEWD 

7083 

A 

420 

627 

IILWVLNAFLQPPGRHMIAIRERQPGRTNSDYETADGVYMTLNPRAP 
TDDDKNIYLTLPPNDHVNSNN 

7084 

A 

2 

380 

FRVDDFVOKLOAGS I AI VROTFNKE I KTMKKFES PNT L.R T FG T WT DE 
TVTPPQFSIVMEYCELGTLRELLDREKDLTLGKRMVLVLGAARGLYR 
LHHSEAPELHGKIRSSNLLVTQGYQVKIWVPG 

7085 

A 

2 

657 

VKCRGIPNILRIFGICIDETVTPPQFSIVMEYCELGTLRELLDREKD 
LTLGKRMVLVLGAARGLYIU^HHSEAPEIjHGKIRSSNFLVTQGYQVKI* 
AGFELRKTQTSMSLGTTREKTDRVKSTAYLSPQELEDVFYQYDVKSE 
T YSFC5TVT.WRT ATf?nTP , POnPMQTJ , K"TPTrr.\721\/in?orv'n , 'DT r*T?'nr , X>CT7T 

J- 1 Qf \J1 ViinQinlVJi/ir C \^0\_pXVOAXVXIxr\Jj V>\V XVKyyiiiir XJVSlSiUv-JrOlSlJ 

REI IDECRAHDPSVRPSVDEILKKLSTFSK 

7086 

A 

2 

842 

MATSELSCEVSEENCERREAFWAEWKDLTLSTCQVQSCPRAFAWAVH 

LACESERRMDGAS PCHLS PCTCYPCRGAAPATFTLQKRVLGLREGIK 
WPEVTWRSKWPSLQPSEGAALVEKATLGTGEPVWSMSQTSLLPSHGP 
GLSVSARLCPWPTPSPPRLGPSGPAGGVNEPSCSSAALAWHVGSDEW 
QVPGRQRGRVRGQGLRRHCCLCVKTLGRPCPYPAPTGTGQRLQVG 

7087 

A 

174 

463 

GSLKQGMFPSGLNRRLPTESPLQILRATFL*KEVA*GLHQYHLPL\Q 
LLSGPVFGLTHLQSAPVFREKPDDPPELPSLRVLRRTLDSWGARTH1T 
YGI 

7088 

A 

69 

194 

RALTRKTNSKAEPCLLPAQPTCNFPPSVARRRNINIiARPRD 

7089 

A 

1348 

2204 

TWRLDPQITSSPKPQPGGTYTLEVVKSSKSKKVIjSPHP*WPPLRLWQ 
R\GGSPEGGTQAPDGSLPPPPPRPKSERVGSPKLSGGKR/EGSHPGG 
PPHITHP/DGEEKAKSSWFGLREAKDPTQKPSPHPVKPLSAAPVEGS 
PDRKQSRSSLSIALSSGLEKLKTVTSGSIQPV^QAPQAGQMVXJTKRI* 
KDS AVLDQS AKYYHLTHDELI SLLLQRERELSQRDEHVQELES Y IDR 
LLVRIMETS PTLLQ I PPGPPK 

7090 

A 

2 

762 

RDPPSLSSASPPGSRESSIHSGPEELPTPPEPDFPPPPT.PPWAQWHR 
GG P S PPCS PLS EAW PLTTS SAPPGE PALLPG PHEPS P PGGS PALLRE 
DIAAATPASPLVLLPLETRPAEEPQPSASPHPVKPLSAAPVEGSPDR 
KQSRS SLS I ALS S GLEKLKTVTS GS I QPVTQAPQAGQMVDTKRLKDS 
AVL1X2SAKYYHLTHDELISLLLQRERELSQRDEHVQELESYIDRLLV 
RIMETSPTLLQI PPGPPK 

7091 

A 

2 

218 

DFADDDLVLKELRGHS SFVNEATFTQDGHYI I S AS SDGTVKVXYLLS 
VYFGMSTPSIiSFEEFSWIMGMIVli 

7092 

A 

7 

78 

WGRLCICEGRGYMGNLCTFCPILL 

7093 

A 

553 

954 

SRSRRGQKRRRWSTVWLSAGACALSMQTPSRTSWPTAPFRAIWRAE 
TVAAWCRLRRPWRVWSWSCLPTPITRATPLGARSWPGWRMWPPLQP 
AGSAVPTLR*RPLK\FPLPRPVPGRRPSGAQSHREQCLQT 

7094 

A 

1 

1215 

MTLKEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHWGDIVmDL 
KLEN I MVKS S L IDDNNE INLN I KVTDFGLAVKKQS R S EAMLQATCGT 
P I YMDRGWNS LEG S EQDRKMWDS LELPRDLLNVFEQNADNDMDDEIQ 


.::!!,. Oi O S Mh& «MI»3; ,„ O E! E! 6, 0 E! 






AEVLSDGDEELLGNWSKGDSCYVLLKQliAAFCPCLRDLWNFKLERDD 
LGYLVEE I S KQQS IQEATWVLLKAFS FIREAEHKNIiENLQPDNAI EK 
KIPFSEEKFKLAAEICISNQEANVNPQDNEKNVSRACQRSSQQPLPL 
PRVLGGKSGFVGWAQDS HAE I YTFGR ILGKGS FG I VI EATDKETETK 
WAI KKVNKEK\ LLLMKSN FLQPVRTTL I CAVQVS HLANS FQLKS REK 
WRKPL*LKEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHWVGDIV 
rlKJJ U YsAj k,N IMV Jvo o L 1 DDNN B X NLN I KVTD FG JjAVKKQ S RS E AMLQA 
TCGTPIYMDRGWNSLEGSEQDRKMWDSLELPRDLLNVFEQNADNDMD 
DE IQAEVLSDGDEELLGNWS KGDS C YVLLKQLAAFCPCLRDLWNFKL 
ERDDLGYLVEEISKQQSIQEATWVLLKAFSFIREAEHKNLENLQPDN 
AIEKKIPFSEEKFKLAAEICISNQEANVNPQDNEKNVSRACQRSSQQ 
PLPLPRVLGGKSGFVGWAQDSHAEIYTFGRILGKGSFGIVIEATDKE 

r PT? r PVT*lJV T VTHTMlTUVCTa VTT VV»T?'D7v r T , ClfnMt?nM/ , 'P C?C1?Te PITT T T»7V "n» T 

KGEMEKTPVTPSQGTATKYPAKSGALSRTKKKL 

7095 

A 

1 

770 

MVSLLIGELKSHQDLHTSHCAQCPCVPPIHPQMVQPSWCSPWFLWSA 

AQTETTFSLTSHGLSRSSPEVISAHDYSQQCDIWSIGWMYMLLRGE 
PPFLASLKKKLFEHRKGELHFENAVWNS I SpCAKSVLKQLMKVDPA 

EKNKPSTEEKLKSYSTLGKCP 

/ V -7 O 

j± 

/ ■£> O Z> 

7684 
/ o o t 

T v^T .pt .P T T .T .vpwwwp p pp p ppwT .t . p q pp t .csrv& vwrv q i\ wr'TOPr 1 P 

x v our ur x xjju vr rvnrvv^r rr r r r r Cii v xjxix o rivbyLKiJ v x oMxii_ivf w it 
PGSSDSRASAFRVARITGAHHDTWLIFCIFSRDRVIiSCWSGWSRTSD 
LR* SALPKCWDYRCEPPCRA* FFFFLYPLT T PTSTjPAPHYPP^LW 

7097 

A 

1 

344 

MDKSLLLELPILLCCFRALSGSLSMRNDAVIEIVQCRMCHLQFPGEK 
CSRGRG I CTATTEEACMVGRMFKRDGNP WLTFMGCT ,KTJP ADVTCG T R W 
SVYLVNFRCCRSHDLCNEDL 

7098 

A 

3 

285 

YGPlxNLLTFx^FYHNEHHDLPNIPGKSLPLVRKIAAEYYDNL 

W I KVL YDFVMDDTI S P YSRMKRRPPGRDGAGPN 1 1 1 ALG I LLQNFR 

7099 

A 

346 

517 

RGLVGSPSSAPSAAPSWSSEKQGCRGHCGLGGSLVPHGRIHHSSNFP 
SCSGDPSQDS 

7100 

A 

3 

173 

L YSQS S FHSL1WFATKPFTVAFVKLETI KMS I SRGLGTL I RTVGNS V 
PEGGMRKTY 

7101 

A 

256 

352 

LLFKVTVEYRPI IDKTLNEADCATVPPAIHSY 

7102 

A 

1 

203 

VPIHEFDHSMPFRGHRKKPFGGHWGKliTLSYVDVSTGKVTVKCRPII 
DKTVNEADCAHCPPQP IHS Y 

7103 

A 

1 

112 

PVGQTGLELLTSGDLTAS AS LNAG I PGMSHHAWLKLF 

7104 

A 

1 

1213 

MEAGEEPLLIiAELKPGRPHQFDWKSSCETWSVAFSPDGSWFAWSQGH 
CVVl^VFWFJjfc*xt.Qr IPKCjFJbAlvSRSSKNDPKGRGSIjKEKTxjDCGQ 
WGLAPSPWPSPPSRKLWARHHPQAPDVSCLILATGLNDGQIKIWEVQ 
TGLLLIxNLSGHQDVVRDLSFTPSGSLILVSASRDKTLRIWDIxNKHGK 

EGHQS SWS CD FS PDS ALL VTAS YDTS V I MWD P YTGERLRS LHHTQL 
EPTMDDSDVHMSSLRSVCFSPEGLYlxATVADDRLLRIWAliELKAPVA 
FAP^!TOGLCCTFFPHGGIIATGTRIX3HVQFWTAPRVLSSLKHLCRKA 
LR S FLTTYOVLAT.P T P KKM K"R PT.T YP TP 

j-ix\. o x xj xxx s< * u/vur x t i\i\j ii\xi x xj x x xv x x 

7105 

A 

3 

413 

FRHEQAS KDRNWFS P YGVAS VIxAMLQLTTGGETQQQ I Q AAMG FKI D 
GEPRDTRGGGWHAEOTYOKPRKGWLWLSRAKPHTAVT.DRPPHLLPTT 
LQVTGPLNLQGCLTSRFTAYSPPCDLNLVKFAHLQNEDYLLS 

7106 

A 

45 

537 

GQERCQED PHAPLQQTVI PS SVSRPPPERGANRQSQGTS ENFRMQMS 
PALTCLVLGxxAiVFGEGSAVHHPPSYVAHlxASHFGV^ 
DRSVGFSPYGEDSVLA^QLTTGGETQQQIQAAMGYKIDGEPRDTRG 
DKGMAPALRHLYKELMGPWEQG 

7107 

A 

2 

224 

FFFESHSVAQDGVQLCDLSLVQRPPPGFKQFSCLSLPSRWDYRHTSP 
HPANFCIFSRDGVLPYWSGWSRTPDLR 

7108 

A 

179 

394 

DKWRRLPWKRGVEVGTAGSIxAKAFGFFAEAKTEPREGCS ESGVRI S Y 
Q * VQQTNS KCLKY * MEPARPTFS P 


,:1 0 O S H-\B "Hi- 2S . ill r£ H 6. 0 12 


7109 

C 

21 

410 

MOCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHN 
GISGFHYLENQEPTSQPCCFYKTELRKQSLAVVLAL 

7110 

A 

3 

178 

PPQPRVAGTTGVSRAPPVLIFVFFVDRRSHHVAQDGLELLGSSDLPT 
SASQSVHIVIS 

7111 

C 

134 

295 

MNSVPGLAKSVGLLRTRYDSPKQPLLCSEAWLICKDLSRGWQKPGT 
QGIGG** 

7112 

A 

3 

864 

LYPAKLSGGWIAMPKYWRAPNCSNTGGRLGANNRPGSFYKFPIiKDGP 
RLQAWLQHMG CEHWVPS CHQHLC S EHFTP SC FQWRWGVR YLRPDAVP 
SIFSRGPPAKSQRRTRSTQKPVSPPPPLQKNTPLPQSPVFPVSGPER 
LEVLGPTSGSPKTVATMLLTPIiAPAPTPERSQPEVPAQQAQTGLGPV 
LGALQRRVQRLQWCQERHQAQLQALERLAQQLHGESLLARARRGLQR 
LTTAQTTjGPEESQTFTI I CGGPDI AMVIjAQDPAPATVDAKPEIjIjDTR 
IPSA 

7113 

A 

3 

155 

HEGPLALGFHIAYITMRLFLALI ITTI YHCEYHLPFLTSPECHDYHH 
IKQ 

7114 

A 

1 

122 

TRVDPRVRKKTTMKI I PFNRLTIGEGQQHHLGGAKQAGDV 

7115 

A 

84 

579 

NWKAGMGRPSTADYAMFKWHEVD\KYRFTYAYFAGGDAEDAFDGYD 
FGDDPSDKFFTS\HNGMQFSTWDNDNDKFEGNCAEQDGSGWWMNKCH 
AGHLNGVYY\ QGGTYSK\ ASTPNGYDNGI IWATWKTRWYS\MKKTTM 
KliPFNRLTIGEG\QQH\HLGGAKQAGDV 

7116 

A 

19 

418 

VEMGFCQADQAGLELLTSGDPPASVSQSTGITVLSLSFFFETESRSV 
AQAG VQWRDLG S LQRP P PG\ S RHS PAS AS RVAGTTGARRHAWL I FLY 
F/ LVETGFHRVS QDGLDLLTS * S ARLSLPKCWGYRREPS AR 

7117 

C 

207 

401 

MGXXIFXPLXKTPPFFPGVWGKNFSPGPGXPXPPPRGGPPPIiFFPQK 
KGXXPPPPFPKVLGLQA* 

7118 

A 

2 

681 

FFEMESRSVAQAGVQWRDLGSLQAPPPGFTPFSCLSLPRSWDYRRPP 
TLLANFFVFLVETGFHCVSQDSLDLLTLRSARLFFFFFETESRSVAQ 
AGVQWRALGSLQAPPPG\ SRHS PAS AS QVAGTTGARH YARL I F\ VFL 
VETGFHRFSRDGLDLLTS * SARLGLPKCWD YRREPPRPAVRVYFYNR 
SVFIEYVQFCKLLFTFKINVRSQNILGSPNILKSLLKNLN 

7119 

A 

3 

265 

FFFETESR S VAQAGVQWCHLGS LQ PLP P\ GS SD S PAS TS RVAGV \ T I 
TAT*VAGTTWRHHAWLIFVFLVETGFHHVGQAGLEHLDLR 

7120 

C 

359 

583 

MLPRLVSNSWAQAILLPRPPEVLGMLSSYPNNSGLEMSTIWLIKVTL 
QNILIWGLSHPKEEKRFLFLRRSFACRP 

7121 

A 

279 

566 

FRRRFPLISLFFFFLFFFCFFFFETESRSVAQAGVQWRDLGSLQAPP 
PGFTPFALSPRLECSGAILAHCKLLLPGSSHSPA\SANRVAGTTGTR 
HH 

7122 

A 

237 

524 

PQEFKTSPGNVARPQLYNFFFFFFETESRSVSQAGVQWCDLGSLQAP 
PPG\ SHHSPASAS * VAGTTGAHHHARLL\ FVFLVETGFHCVSQDGLD 
LLT 

7123 

A 

1023 

1459 

KGTRKRGQGIKVHLPIPILLFNLICFCTFGLFLVFLFLFPFFFFFFF 
EIESHSVAQDGVQWCNLGSLQPTPPG\SSDSPASAS*VAEITGTRHY 
AWL I FLFL / VEMRFHHVGQAGLELTSGDPPASASQS AG I RVMSHRGL 
VFLYYK * 

7124 

A 

1794 

4614 

RFFSFFFFFETESHSVAQAGVQWCNLGSLQAPPPG\SRHSPASASRV 
AGTTGAHHHARLIF\VFLVETGFHRISQDGLDLLTS*SARLGIPKCW 
DYRCEPPHLASI 

7125 

A 

2 

441 

FFFFFETESRSVAQAGVQWRALGSLQAPPPG\ SRHS PAS ASQVAGTT 
GARHYARLIF\VFLVETGFHRFSRDGLDLLTS*SARLGL,PKCWDYRR 
EPPRPAWVYFYNRSVFIEYVQFCKLLFTFKINVRSQNILGSPNILK 
SLLKNLN 

7126 

A 

3 

226 

CETESRSVTQAGVQWSDLGSLQAPPP\GSRHSPASASQVAGTTGAHH 
HARL I F\ VLLVETGFHRVSQDGLDLLTS 

7127 

A 

153 

504 

KHFFFFFFFETESRSVAQAGVQWRDLGSLQAPPPG\SRHSPASASRV 
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OKCLKVYFSHCS RS E I TVTGKTNTjI* 

7128 

A 

3 

313 

FLFFETESCSVAQAGVQWHDLGSLQAPPP\GSCHSPASASQVARTTG 
TRHHPRiIFF\VFIiVEMGFHHVSQDGLDLLTLGSGHLSLPKCWDYRC 
EPPHPSSPSY 

7129 

A 

3 

307 

FPPPPRTESR^VAn&nVnWPDT.rjCT.OZiDDDrjX ODUCnA OA CT\1 TT\ nrnrr* 

tr c c c r jzj l r\o v v vvv/vx^Xj^o Xjv^rtXr^ JrO \ oKrlo Jr AoAoQ VAGTX 

GARHYARLI F/ WFLVETGFHRFSRDGLDLLTS * SARLGLPKCWDYR 
REPPRPAYF 

7130 

A 

2 

91 

EMESRSVAOATVnWPT"vr.r5PT.nAPPPPPK"E» 

7131 

B 

3686 

14131 

MTNAFQWQKNRGIDSEDAYPx^GQSYx^LVSSSQEESC^mTPTGKA 
AKCRGYREIPEGTtfPTCAT.TTR AVAP\/r!P\7Q\7aT"nj\ ot Tcunrvcvmrw 

DESO^SDNLNHAVLAVGYGIQKGNKIIW 

GYljDKGKxjKRKDKAGSLOWAYMATART.nr?PMnQMPT»nT2\TMr , aT y* 

7132 

A 

8044 

8378 

YIDVKTFFFFFLPETESHSVAQAGVQWHDLSSLQPLP\PGSSDSPAS 
ASRVAGITGTHHHAWF I FVFLVGTRFHHLGQAGLELLTS SDPPASAP 
QSAG I TGVNHHTQPQKF I 

7133 

A 

3 

283 

HER PR VflA T R OROn <? T ifJWriDT .n PD V c TTPT i?t onD r PT?nr>T'a^n\rovTnf 
n " Ar xv v v^xAyvjy^^-'-'«nyjJi»uV«.r i o X lo-Ur loyK 1 liFlr x x INRYTV 

ESTRTPJIALSDKQSAAVILELGAFFYIMNAJI IQYFAJFI<^KYNS IL 

7134 

A 

2 

177 

RQNSLRGSFHSAJIVLKMDDFGAVPCTDLVVQSISPHQSKQSQPVELD 
PFGAAPFLYRQ 

7135 

A 

1 

491 

KDPESFFKVXMHLKDLGIjN 

jjiiwo x IjFo IvUJj i F QVxjCMADVVI STAKHEFFGVAMLEAVYCGCYPLC 
FSWAALHGKFRSLLTTEPREDL 

7136 

A 

3 

612 

LQYQKVDKLFLHPKFDDWLLDND I ALLLLKS PLNLS VNR I P I CTS E I 
SD I QAWRNCWVTGWG I TNTS EKGVQPT I LQAVKVDLYR WDWCGYI LS 
LLTKlTOLCAGTQDPGKDACQGDSGGALVClviaQvNTAIWYQVGIVSW 
MGCGKKNLPGVYTKVSHYVRWISKQTAKAGRPYMYEQNSACPLVLSC 
RAI LVL YFGMFLLT 

7137 

A 

2 

137 

SALGGAPPVPSRPGASPDPLGPPPQVPSGPNRAPPGVPRITISDP 

7138 

A 

1 

2631 

MGNRGMEDLIPLVITOLQDAFSAIGQNADLDLPQIAWGGQSAGKSSV 
LENx^GRDFLPRGSGIVxTvRPLVxjQL^ 

EEVRLE I EAETDRVTGTNKG I S P VP INLRVYS PHVLNLTL VDLPGMT 
KVPVGDQPPDIEFQIRDMLMQFVTK^NCLILAVSPANSDLANSDALK 
VAKEVDPC^QRTIGVITKLDLMDEGTDAPJ^VLENKLLPLRRGYI 
NRSQKDIDGKXDITAALAAERKFFLSHPSx^^ 

QQLTNH I RDTL PGLRNKLQS QL»L S I EKE VEE YKNFRPDDPARKTKAL 
LQ^QQFAVDFEKRIEGSGDQIDx^ELSGGAJ^ll^iFHERFPFELVK 
MEFDEKEIxRREISYAIKNIHGIRTGLFTPDI^FETIVKKQVKKIREP 
CLKCVDMVI SELIS TVRQCTKKLQQYPRLREEMER I VTTHI PJEREGR 
TI<EQVMLLIDIExxATN^ 

X L» V X RKG WtiTINN I G I MKGG S KE xT^FVxvTA£l^sWYKDDEEKEKKYM 
xj o v UN xjxvxjKU v AivLv r MiSfa KH x r AJj FI^EQRNVYKD YRQLi ELACE TQE 

E\^SWl\ASFLRAGVYPERVGDKEKASETEENGSDSFNxHSMDPQLERQ 

VETTRNTiVn^YT^ATVMTf*T\7PriT.MPV*rTMtiT MTXTw»r»Tnc* , e»TT?or»T r j\vrr 
v *-» A vuo x 1'Lrt.x. vxm JVX VrtX J /XJl*J±r JvX XI Y LliXji v J XiNIV 1 Khf X rSKI | } f AN] r 

YSCGDQNTLMEES A^QAQRRDEMLRMYHALKEAXS I IGNINTTTVST 
PMPPPVDDSWLQVQSVPAGRRSPTSSPTPQRRAPAVPPARPGSRGPA 
PGPPPAGSALGGAPPVPSRPGASPDPFGPPPQVPSRPNRAPPGVPRC 
SRRSTLLTANVS SDS SG YLAPRSMVRCE I CD 

7139 

A j 

2 

112 

DIKAHTQPVTKAPGPGETKIPVTLSRVQNVQPLLAT 

7140 

A 

67 

245 

AYNLSDGQYGLVDPEFEYYFWLCCWLPL*I J GGKCLWVIiPVSVSGDRN 
DNDQAERNGSHL 

7141 

A 

356 

2252 

RSFLDEEFPHYDSLSCADAIC^PLQEIviKQCCHKLYGGQEARIHQTP 
LTLKHTCWYTPLLDALSLDSFTAVPTIiESTPFSGVANQIHTLCERPT 
YGEVKDGAXDVKSQHKCPGPTSGPSPGxTOiSGC^ 

ERVCPESLLQSRGYSSLPLPRHTSSTOGTITSSDPGLEILNMASCDL 
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DRNSLCKKEEDTRS AS PTI EAQGTSPAHDN I AFQDSTS KDKTILNLE 
AKEEPETIEEHKKBHASGDSWSPLPVTTVKSVNVRQSENTSANEKE 
VEA^FLRLSLGFKCDWFTLEKRVKLEERSRDWAEENLKKEITNSLKL 
LES LTPLCEDDNQAQE 1 1 KKLEKS I KFLS QCAARVASRAEMLGAINQ 
ESRVSKAVEVMIQHVENLKRMYAKEHAELEELKQVLLQNERSFNPLE 
DDDDCQ I KKRS AS IoNS KPS S LRR VTMALLFL PR \ NYLGNAG \ MVAGM 
ENNDRF\SRRSSSWRILGSKQSEHRPSLPRFISTYSWADAEEEKCEL 
KTKDDSEPSGEETVERTRKPSLSEKKNNPSKWDVSSVYDTIASWATN 
LiKS S I RKANKALWLS I AF I VLFAALMS FLTGQLFQKS VDAAPTQQED 
SWTSLEHILWPFTRLRHNGPPPV 

7142 

A 

1 

835 

MRYHFTPTRMAMVKEERKLSTISSVHIPTINLTEESRVSKAVEVMIQ 
HVENLKRM Y AKEHAELEELKQVLLQNERS FNPLEDDVKMSTD I TDST 
LGSFEALGGETSKGVLEVSDLIAYVADLHFNKRKLEEENNKFKLALE 
TLEETNSQLSEDCTELRLQVKSAHQAIMRTNLLKEELEELKLSMNAS 
EEQKSMIVAQSKQLETENRALILKIRILQEENFKNIMDIDRLEKKIE 
DLS KTETEHQVS TS Q I LMQLHTYENTLLNKDTS LQKKWDP VEC 

7143 

A 

1 

3735 

MMCRGTVFQAGGTAGAKALS P YPDEDGGDGGLGTGAAPGPRLRHS KS 
IDEGMFSAEPYLRLESAGSGAGYGGYGAGSRAYGGGGGSSAFTSFLP 
PRPLALDPASPLGLALAARERALKESSEGGGAPQPPPRPPSPRYEAP 
PPTPHHHSPHAHHEPVLRLWGASPPDPARRELGYRAGLGSQEKSLPA 
S PPAARRS LLHRLPPTAPGVGPLLLQLGTEPPAPHPGVSKPWRSAAP 
EEPERLPLHVRFLENCQPRAPVTSGRGPPSEDGPGVPPPSPRREENG 
LPLLVLPPPAPSVDVEDGEFLFVEPLPPPLEFSNSFEKPESPLTPGP 
PH PL PDTPAPATPLPPVPPPAVAAAPPTLDS TAS S LTS YDS EVATLT 
QGASAAPGDPHPPGPPAPAAPAPAAPQPGPDPPPGTDSGIEEVDSRS 
SSDHPLETISSASTLSSLSAEGGGSAGGGGGAGAGVASGPELLDTYV 
AYLDGQAFGGSSTPGPPYPPQLMTPSKLRGRALGASGGLRPGPSGGL 
RDPVTPTSPTVSVTGAGTDGLLALRACSGPPTAGVAGGPVAVEPEVP 
PVPLPTASSLPRKLLPWEEGPGPPPPPLPGPLAQPQASALATVKASI 
I S E LS S KLQQ FGGS S AAGGALP W ARGG S GGGGD SHHGG AS YVPERTS 
SLQRQRLSDDSQSSLLSKPVSSLFQNWPKPPLPPLPTGTGVSPTAAA 
APGATS P S AS S S STSTRHLQGVEFEMR PPLLRRAPS PS LLPASEHKV 
SPAPRPSSLPILPSGPLYPGLFDIRGSPTGGAGGSADPFAPVFVPPH 
PGISGGLGGALSGASRSLSPTRLLSLPPDKPFGAKPLGFWTKFDVAD 
WLEWLGLAEHRAQFLDHEIDGSHLPALTKEDYVDLGVTRVGHRMNID 
RALKFFLERYPALPRPLTQQTLTSQPDPSSEERPPALPLPLPGGEEK 
AKL IGQI KPELYQGTGPGGRRSGGGPGSGEAGTGAPCGR I S FALRYL 
YGSDQLVVRILQAIiDLPAKDSNGFSDPYVKIYIiLPDRKKKFQTKLAQ 
RKLH F S VYDFDRFS RHDL I GQVVLDNLLEIiAEQPPDRPLWRD I VEGG 
SEKADLGELNFSLCYLPTAGRLTVTIIKASNLKAMDLTGFSDPYVKA 
SLIS EGRRLKKRKTS I KKNTLNPTYNEAL VFD VAPES VENVGLS IAV 
VD YD C I GHNE VI GVCR VGPD AAD PHGREHWAEMLANPRKPVEHWHQL 
VEEKTVTSFTKGSKGLSEKENSE 
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2 
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ECVRLKPTD I QPDI FS YLLHLMYTGKMAPQL IDPVRLEQGI KFtHAY 
PL I QEASLAS QGAFSHPDQVFPLAS SL YG I Q I ADHQLRQATKIASAP 
EKLGRDPRPQTSRISQEQVPEASQLSQLTSNLAQVNRTNMTPSDPLQ 
TSLSPELVSTPVPPPPPGEETNLEASSSYEQPASLTIAHVKPSIMKR 
NGSFPKYYACHLCGRRFTLRSSLREHLQIHTGVPFTSSQQGESRVPL 
TLC SNAADLG KDAME VP EAGM I S D S ELQH I S DS P 1 1 DG QQQ S ET P P P 
SD IAD IDNLEQADQEREVKRRKYECTI CGRKF I QKSHWREHMYIHTG 
KPFKCSTCDKSFCRANQAARHVCLNQSIDTYTMVDKQTIiELCTFEEG 
SQMDNTAAAH 

7145 

A 
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NFSSNKWNTSQNFANMDNSAQKNERTGKHPRRASEVQK\NDIPGPGF 
YNVIHQSPVSNSVSLSKKGTCMFPSMCARLDTIVSKYPAANAYTIPS 
DFISKRDFSNSCSSMFQLPSFMKALKFET/PLAPNYYNASVSCCKRR 
NNVCTRAGF 
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7146 

A 

1 
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MDNS AQKNERTGKHPRRAS EVQKALR I C IHNTRVLLGSRPGVLE YLA 
PQCLLD PASLSMAVS S S S VHRGS FS PKAGI STP WTSHGARNDI PGP 
CjJt YNVlHQSPVSNSVSLSKKGTCMFPSMCARLDTIISKyPAANAYTI 
PSDFISKRDFSNSCSSMPQLPSFMKALKFETPAPNYYNASVSCCKQR 
NNVCTRAGFMSKTQRGSFAFADKGPPPEVTSSSSHKTEEAALFPEHP 
XxrioiillJXi\ir-oxjVJvyoJri>i xljMov-r Ivoivl JNJKtjJblvUTSTGPGPGYYNPSD 
CTKVPKKTLFPKNP ILNFS AQPS PLPPKPPFPGPGQYE I VDYLGPRK 
HFISSASFVSNTSRWTAAPPQPGLPGPDWWHPESVQPRSMEEEFEWIi 
PWLGAMGPKEKRILSSSTGGRNDQGKRPQQAGGRLCGQLPVTHH 

7147 

A 
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KKDKIMFLAGTWMDLKAIILSKLMQEQKTKYCMFSLISGS 
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GGG S GG WGGG WNGG W S G GG S GG G S Rn W f2 rjn c rjn w e nn t? q r!/2W c nnr% 
SGGWSGGWSGGGSGGWGNVHQPSPASPGPPALQRCWTGTHPRQRRQF 
WPLCSLRVAPSRPAAPAPQRPPQGACRAASLTGSKDVPGCPHHVAGG 
PASFPLSAGSSCR 

7150 

A 

2 

212 

DTPACTRLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGIiFKLLHKAD 
RDPNPPKRPLIFDTNEFHIPLVT 

7151 

A 

3 

91 

HCGGTLELSDDDTDSKTSDVNETQPPQSE 

7152 

A 

1 

115 

PTRPGGS ILASLSTFQQMWI SRQEYDESGPS IVHRKCF 

7153 

A 

1 

1279 

MGDTPRQFGGARPRSGGALRGCRSRGGGNRRGLCLSRALANGDRRIiT 
MDDD IAALWDNG S GMC KAGFAGDDAPRAVFP S IVGRPRHQGVMVGM 
GQKD S YVGDEAQS KRG I LTLKY P I EHG I VTNWDDMEKI WHHTFYNEL 
RVAPEEHPVLLTEAPIiNPKANREKMTQIMFETFNTPAMYVAIQAVIjS 
LYASGRTTGIVI^SGDGVTHI^PIYEGYALPHAILRLDLAGRDIjTDY 
JjlMrLXLi lhKbi bi 1 1 1 1 Ar#RIilVRDIKEKljCYVAIjDFEQEMATAASSSS 
LEKSYELPDGQ\VITIGNE\RFRCPEALFQPSFLGMESCG\IHETTF 
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STMKIKIIAPPERKYSVWIGGSILASLSTF\QQMWISKQEYDE\SGP 
S\ IVHRKCF 

7154 

A 

1 
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PT.TiVCJ^TJC'P'T'CJIi QT?TJ2ilS , T'T , 'nT Ct-TH r TOT TTXTTT T OTC XPT 1 
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7155 

A 

143 

295 

PSCEVQPSRLLMEPSGPYLSIIMPLIARFDHWIHSSAFKFAVMVEI 
LAC 

7156 

A 

186 

439 

HRCFREEKEKALDPEAACPIPAPASLTGVRPQRTALSQDGVEDRKPD 
I PVPYLYFDMGAAVLCAS FMS FGVKRRWFALGAALQL 

7157 

A 

3 
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CI7DCMT niVlfUUMT AMPLJ O TDTaTTaT'D VT^ T* T't Tccni\ m f tvt't t ttatitiat lju^t 

or iroWijJJMX ISJlxlVWxjv^^«olrWWKx lr± I Vr oPATSXFIjIjHQHIKIFTIj 
HVEKALNBSSTELVLVSDEFAHLRTI^QNQMALKMLTAPQGGACTL 
LQTECCVYISGNSHNMTLLGKPC 

7158 

A 

1 

220 

ETEHRSFSQAGVQWCYLGSLQPPPPGFKLFSCLSLiPSSWDYRCLPPC 
SANFCIFSKDGVSPCWSGWPRTPDLR 

7159 

C 

102 

407 

rlDAGLEAMQKYGKAAPGDRTMIJDSLWAAGQELQAWKSPGADLLQV^ 
KAVKS AEAAAEATKNMEAGAGRAS YI S SARLEQPDPGAXAAAAILRA 
ILEVLQS* 

7160 

A 

1 

2302 

RP^GIPVEMWIGDDSAFTVXKKAGRRGLCGl^IHKVAGAKA^ 
VGLEEIAKQVlNTVArrKAMGSEEDEAHEGCSRENLGRIQFSVGYNFQES 
TLWKIMKAQELPAKDFSGTSDPFVKIYLLPDKKHKLETKEVDQASL 
EDEGV^EHEEhIDDnVRT^Tjr'MTT.na\/riTirirCT ,, 7 1, MT?r , T /^•ttt^'D'd 7iT7/ n,r PT t 
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I VKxTxTTGDRLNFGLAREQARAEG I PVEMWI GDDS AFTVLKKAGRRG 
LCGTVL IHKAP VE KADKM YEQMRNS S RVGNS KS QMEMLKTNTVADMQ 
NGFNRPIMRFDIIKERISGRQWTTDQQLIQVIRSIGVYDWELKFAE 
NRANGQS KGYAEVWAS ENS VHKLLELLPGKVLNGEKVDVRPATRQN 
LSQFEAQARKLLPVPKGGTARGSWSNGRRTGCGEGQRARNGVGSLRA 
SSEDTGAPPPPAHGRIRHIRRRRDFPETPKPJUjSSGSCGRWRKEARR 
CSGGGASFGWRPPJVSGVLQEASVALEYRRAAPSLDMSYlTYVArrAQKP 
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TAWGCVTGHFTSAVDVYLLIVTNTRIjEIYVVTPSGFRRQRVCLPSR 
AHVACLGLPSAS PLGI PACHRCMCNA/TAAVVGLTVAS AI LSLPLWMS 
VGLMCMLPAGGKAVNTAPVPGQTPHDESDRRTEPRSSVSDLVNSLTS 
EMLMPTQCEDNKDEDFYDDPLPLNEYSGVSEGDLGGLSAFNRSGRYF 
ENIPKGLDQEGWTRGGIQPQMPGGYALSQPVSCHGGHPQPHGEPAAP 
AS PRGKGNTPAQLLS 

7161 

A 

1 

178 

VCVCVCVCVCVCVCS VPTR I CLGAHC I FLCYGWY I KHDRTRDNT I S i 
HRSPLGSALKEE 

7162 

A 

60 

413 

SGGAVGTARSPALGMALLVLGLVSCTFFLAVNGLYSSSDDVIELTPS 
NFNREVIQSESHNRGFRPFSFLGSQWIFPAVKGHSLHSDDSTSGPFN 
QEWLDWS FENTATVNFC ISRKH 

7163 

A 

2 

1420 

NSGVGAGARRAARCRAEAAAAVGTARS PALGMALLVLGLVS CTFFLA 
VNGLYSSSDDVIELTPSNFNREVIQSDSLWLVEFYAPWCGHCQRLTP 
EWKKAATALKD WKVGAVD AD KHHS LGGQYGVQGFPT I KI FG SNKNR 
PEDYQGGRTGEAIVDAALSALRQLVKDRLGGRSGGYSSGKQGRSDSS 
S KKDV I E LTDDS FDKNVLDS EDVWMVE F YAPWCGHCKNLE PEWAAAA 
SEVKEQTKGKVKLAAVDATWQVLASR YG IRG FPTI KI FQKGESPVD 
YDGGRTRSDIVSRALDLFSDNAPPPELLEIINEDIAKRTCEEHQLCV 
VAVLPH I LDTGAAGRNS YLEVLLKLADKYKKKMWGWIjWTEAGAQS EL 
ETALGIGGFGYPAMAAINARKMKFAIiLKGSFSEQGINEFLRELSFGR 
GSTAPVGGGGFP\TIVEREPWDGRDGELPVEDDIDLSDV\ELDDLG\ 
KDEL 

7164 

A 

547 

980 

CPQGVSEECSAAREKKCLCINKVGRLLFLKLFILLHQQSIVNVCVCF 
GHFSLRELSFGRGSTAPVGGGAFPTIVEREPWDGRDGEVSAREDSCA 
GQQILVYSLEPVHLCTWESSPSHQTSAIiGAPVGLFKRFFSEDPSSSL 
SQV 

7165 

A 

3 

1290 

L YS S SDDVI ELTPSNFNREV I QSDS L WL VEF YAPWCGHCQRLTPEWK 
KAATALKDVVKVGAVDADKHHSLGGQYGVQGFPTIKIFGSNKNRPED 
YQGGRTGEAIVDAALSALRQLVKDRLGGRSGGYSSGKQGRSDSSSKK 
DVIELTDDSFDKNVLDSEDVWMVEFYAPWCGHCKIs^EPEWAAAASEV 
KEQTKGRVKL\AAVDATVNQVtiASRYG\lRGFPTIKIFQKGESPVDY 
DGGRTRSDIVSRAIiDLFSDNAPPPELLEIINEDIAKRTCEEHQLCW 
AVLPHILDTGAAGRNSYL\EVLLKI^KYKKKMWGWLWTEAGA\QSE 
LETALG \ IGGFG \ YPAMA\ A INARKMKFALLK\ G S FS EQG I NEFLRE 
L\SFGGGSNGTL*GGGGFSLPIVEREP\WDGQGCRGFPWGDDH*PSV 

7166 
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7167 

A 

2 

386 

RQSLFSIIQAGMHWHQLVSLQLLPPRFKRFSCLSLPSSWDCRCVPPC 
c t\sxr\j xz oicuo v oMWJliHjCLi V J_iNo WAoCIJLPTIxA.SvSAGMTGVSHCTQ 
SEFFFILIALTALIVLHQGKPYTNSVRHRLGNFC 

7168 
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wx J-X iriisJr nvJSJiJbJ.L.CLVLLljLj. TVIjYRIYHGLLTAEKGTVEFSHTK 
DYTGNRLLLLNVP 

7169 

A 

2 

82 


7170 

A 

51 

347 

LPRIJX5VQWCDLGSLQPLPPRFKQFSCLSLPSSWDYRHAPPRPANFV 
FLVEMGFLHVGQAGLELLTSGDPPASASQSAGITSVSHRARPLLHIL 
NQPR 

7171 

B 

18 8 
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AU V V vf VANL-Lt VQfc. YNSN PKKHVTLRDYITYWKEYIQAGYSSPRGCL 
YLKDWHLCS* 

7172 

B 
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353 

XDVWPVANCGVQEYNSNPKEHMTLRDYITYWKEYIQAGYSSPRGCL 
YLKDWHLCS* 

7173 

A 

753 

1216 

QWEVGPPRGRHPSAAGCPSPA*RRWKTRRAGWAARSPAQSRRRRRR 
PAPRRRLPGPEPGRRRGHRPPGLGSGCRRGRASRGPSSSARVEGPSS 
SLPPERKR*GPASEPEDAGRRVSAPPAPGA\RACTHARGSPRSSLRR 
RRAAEAARSALGLV 

7174 

A 

1 

1777 

MPVLPVTATEIRQYLRGHGIPFQDGHSCLRALSPFAESSQLKGQTGV 
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TTSFSLFIDKTTGHFLCMTSLAEGSWEDFQASVEGRGDGAREGFLLS 
KAPEFEDSEEVRRIWNRAIPLWELPDQEEVQIJUDTMFGLTKVTDDTIi 
KRFSVRYLRPARSLVFPWFSPGGSGLRGLKLLEAKCQGDGVSYEETT 
I PRPSAYHNLFGLPL I S RRDAEWLTSRELDS LALNQSTGLPTLTLP 
RGTTCLPPALLPYLEQFRRIVFWLGDDI.RSWEAAKLFARKLNPKRCF 
LVRPGDQQPRPLEALNGGFNLSRILRTALPAWHKSIVSFRQLREEVL 
GELSNVEQAAGLRWSRFPDLNRILKGHRKGELTVFTGPTGSGKTTFI 
SEYALDLCSQGVNTLWGSFEISNVRLARVMLTQFAEGRLEDQLDKYD 
HWADRFEDLPLYFMTFHGQQ\NIRTVIGTM\NHAVYVYDICHVIIDN 
LQFMMGHEQLSTDRIAAQDYIIGVFRKFATDNNCHVTIiVIHFRKEDD 
DKELQTAS I FGS AKGGEWS APHSTS SRPSAQPKSDSTVAEAAPLASD 

P'PAVMTMPT TUPOPO R CDHDUPPVT TV TV T XT 

r xAx Wlwr Lii. VVQiKCjRASK-URBr (jxxjAAxjN 

7175 

A 

1 

666 

GTSDHWADRFEDLPLYFMTFHGQQSIRTVIDTMQHA\VYG\YDICHV 
I IDNLQFMMGHEQLSTDRI AAQDYI IGVFRKFATDNNCHVTLVIHPR 
Ki^DDDJvbljQ xAS 1 GL»SQRQAR \ EADNVIjDPAGTGKLGNRGQGKRYF 
AGVPKEPF*WEM*GVFPLEFNKNSLTFSIPPKNKARLKKIKDDTGPV 
AKKPSSGKKGATTQNSEICSGQAPTPDQPDTSKRSK 

/ JL /o 

TV 
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O Q C 

285 

V FF F F F F FLRR S L»ALL PRL EC S GT I VFLVETGF\TMLARMVSIS*PR 
DLPAIiASQSAGITGVSHRARPASFNFLTINILPPLLLWTF 

7177 

A 

12 

336 

S PVQL * F * LFLVFC * LWSWSAWYLGPLGTPS ADAHT/ AGLSKTPPH 
WAARARIjDD V F S IjRTFS SHSLtmEljVQD 
GLPKCWDYKREPPRPAH 

TITO 

7 17 o 

TV 

A 

74 

463 

H I SQHDRS YLQVLLAS YKTAS FLiFFFLRQSL/ DS VTQAGVQWRDLGS 
LQAPPPGFTPFSCPQSPK*L/SSWDYRRPPPRPANFFCIFIVEKECF 
TLLARMVS I S WPRDLPAS ASQSAGITGMSHRARPAQVS 

7179 

A 

1 

412 

TRMGLPDASRRRTCRMDPEGWQEAMSSA*GRITLQRLSTG\PEGQGG 
REKVGPEGGSENPQQPKAAGVLSKHL.PGAPAQPPQRPPSSPPPLAGP 
LTERVEKVCDFLDAAGDYLN/ GTPG* PSPGESPAAQDPPVPPWPP 

7180 

A 

117 

295 

TFFPHTTPPPPELiPDPKRPNTFEKKGKHENKAPGCSLKEEKEGAERP 
AGALSPCGPNW 

7181 

A 

279 

1066 

IQHPRLSADDFRVNCETEPAM/*HAVENDIHGLCELQLETEIQAFKE 
ELL / FMKKNHEEE VKGLQAQI AS S GLTWVDAP I SQD FAKIMAD I WA 
QYDEVAQKNREELDNNRSQQIEESTTVVTTQSTEVGAAKAMI.TELTR 
TVQFLE INLDSMRKTLKASL.ENS LRE WAR YALQMEQLNG I LLHLESE 
l^QTWAEGQCQAQEYQALl^IKVKLEAAIATYRRLLEDGKDFNLGDT 
LDCSNSMQTIQKTTTR\R*TRQIVDGKVVS 

7182 

A 

60 

847 

IQHPRLSADDFRVNCETEPAM/ *HAVENDIHGL»CEL»QLETE I QAFKE 
ELL/FMKKNHEEEVKGLQAQIASSGLTVVVDAPISQDFAKIMADIWA 
QYDEVAQKNREELDNNRSQQIEESTTVVTTQSTEVGAAKAMIiTELTR 
TVQFLEINLDSMRNLKASLENSLREVVARYALQMEQLNGILLHLESE 
LAQTWAEGQCQAQE YQALLN I KVXLEAAI ATYRRLLEDGK13FNLGDT 
LDCSNSMQTI QKTTTR\ R* TRQI VDGKWS 

7183 

A 

645 

861 

KYYRKRGIHS AIDASQTPDVVFAS IIjAAFS * ATCKDIjVMFI * CWVQE 
GI S FHPCEAMGGNDRTGKEKLPSG 

7184 

A 

2 

1327 

RPQSLS PVLSLS PDSMS FTTRSTFSTNYRSLGS VQAPS YGARPVS S A 
ASVYAGAGGSGSRISVSRSTSFRGGMGSGGLATGIAGGLAGMGGIQN 
EKETMQSLNDRLASYLDRVRSLETENRRLESKIREHLEKKGPQVRDW 
SHYFKI IEDLRAQI FANTVDNARIVLQIDNARIiAADDFRVKYETELA 

rll\\J O V XMN X riVJXjivJV V JLlJU X JM X X rCXjyXjXI* X IS X£*/VuI\JSI!»1jXjX* ivISJNJ XI H. Xj Xj 

VKGLQAQI AS SGLTVEVDAPKSQDLAKIMAD IRAQ YDELARKNREEL 
DKWSQQIEESTTVVTTQSaEVGAAETTIjTELRRTVQ\SLEI\DLD\ 
SMRNLKA\ SLENS L\ REVEA/ RRTPLQMEQL\NG \ I LLHLESELAQT 
\ RAEGQRQAQE YEAL\LNI KVK\LEAEI ATYRRLLEDGEDFNF\ gd a 
LDSSNSMQTIQKTTTPPG*VGIiGKWS 

7185 

A 

2 

345 

FFFFFFETESRSVAQAGVQWRDLSSLQAPPPG\SRHSLASAS*VAGT 
TGVCHHARLI F\ VFLVETGFHHVSQDGLNLLTS * SAGLGLPKCWDSE 
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KLFFFFGDKSFRFCCPGWSTMV 

7186 

A 
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389 

rt *^- c, * r i~trx.tr /vim Lj Xj u r jr r jt r r UriJc.or'o V AyAvjVQWKJDLiSSIjQAPPP\GSC 
HS PTSLS * VAGRLRNKNCL.NPDAW 

7187 

A 

546 

850 

FSVLFYFFESESCSVLQAGVQWRDLGSLQPPPP\GSSHSPTSAS*VA 
GTTGTCHHTQLI FI FLVETGFHHIGQASLKLLTS * FAHLGLPKFWDY 
RCEPPHPA 

7188 

A 

4 

412 

PIERPHELKHRLPHVQKLVEDVGDGVIPAALEEGQAGWSHGFLVFLV 
EIKSAEGP/PGPADSRVRGVRPQRSQSAGPA*RAGRQDGVCRSLGQG 
RGGGRATSLVLHPPGYPGCWFSVGVPSLPQYTGIRSSSPRTRG 

7189 

A 

148 

268 

LQPR * PLTCAPPPCPVQTQTHPISVSQTLLGLEFCCSLSL 

7190 

A 

397 

1440 
JL ** t 

Ki>n^t>^? J. tsKK JL fc>KJjr.b WjV 1»GVSCKKSKVREGPLGPA* * AG*RGVRPQ 
RSQSAGPA*RAGKRQDGG/ SPEPRAGVGGEVGTQSP /GPSGASGYPG 
MIiVSKPVGGTSASWLPAGCPIPS*LS*QRSQGSPS^CLIGHLLAIjWA 
AFGLGVTSLPQYTGIRSSSPTA*ATVQGDSGHKGGTGGTGLGRGRNT 
QPDAACGRGHRS S PTR / RPHGNPNGNTGCAEGGQARSLL PKLAPKLP 
GWPVSVPVAVG\ PGDFGWRQAQYQS SLWDLSS PRNTLGRSATSAGPA 
PPALLGAGSGRSSGTSPAAPGCSSRCHCWASAPAGVSGGPG\GRGAE 
APPS PTLAQRGS PPG\ AAI FPPACGI PP 

7191 

A 

47 

1654 

LHVPARPPGCGGAFLAGGPCPGWGQPRAQLWPQ/SAAPGPPHP\PAIi 
PSPVFPTWPRLLSH*S/RPA/VEQCPPEPHVYP\DPASSWVPKSASP 
PRVACPCPPAL* / PFPGDSCLIjCTAGLRGPGHDTELPQVWVRP\HPG 
ASAEM*AHKPPPLAQKPPRPSISVPQPSAQLWYPAGPWVPT*AGTWIj 
o x l» a 1 * b pygli f s f yemehg vp as lnppkwe phdr v * lgggr * gks e 
VPRGTPSLTGLFRGVPQGPPAITGTQKHPFPNDMVSFLISFFCLFCF 
GFQLG*/PPGL*SIHIEPIiTGIjPGCQG*A*P* VLLSTAGSAHAV* *G 
SELVHLQS*LHFPSFCPPPVLAIiDLQVCRPALSAFLACGPPAIPGSW 
£> Pista P V & Jj WMb V KL» W PL CG Lt S u YP AL*NIiARRGPGQPFTGK/IGEG 
TPTPHKPCNHQVRKLGVLPQGGVKTKI EVGFWGPTR / PKAGETRG * V 
GRVPQSLA* PQCVS PEMLP * LSGTPGR SWG/ PVLSPTPAHSQALRLC 
GHS P PCWS LV I GLELY FD AMKTLF I SNVY I F 
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A 

4 
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ra k k i Hi KiA* K w u P v_ JL kj A W PA P VN IjAJvLAP S/HDSEQVGSGCVPLAQSM 

APPGPAWRAGHH\RRRSENPRPGPFFGSFFKRFGTNRGWSRRDDDPG 

GPGSGSRWPPAPRRGGQWRCPREPG*RRSSSAGRGSSPCALRHRATK 
ASSSCSSPCPPTPTKPFPaT.ZiZkaQr* 

7193 

A 

3 

516 

SVPKREELQGHHRVRGGEHRVWTRQDVLQHRLPATSWTHPVLPPTA 

RaPaLGRaSGAARRTAARAaPL\PAVPAAAAaPGRaRPTTTWPASPP 
SPRSASPPTAPQ^ACW^PTP /r2QP"P2VT.T?T3T T>Rr»r PMrvrDOBnuntsn 

PLPAHLCVLRHRATKAS S S CS S P CPPTPTKR 

7194 

A 

2 

326 

GGGFRPT-iPPPTiWPPKPVf^PPP'\7T7T"Pr3P2ir'PDijJi7KTPTyoT vrmrr rvrim 

GPSDYSRF\ SEGLARKLPLTRKGRFR * PKFPPLPS FLGKKREIRFPK 
KTKKKIFINSSSSC 

7195 

A 

3 

345 

ERQKLHYKYS KQFLTLFQDWNTN \ VKAEEPEEQKLATMFREQQKSCL 
* ARMV QRQRH * NKLN * NQQFLRS LEDLEENQS S \ FTDEQ SDLRKEMA 
RLQTKLMKETLQQDLALIHSC 

7196 

C 

68 

319 

MWFPWACTLGGQGLSGVTPDHAWDLFRAWLSHLSRVPVSRGPLSASS 

PPP'LiTjPCSTT.OOPT.P Qf^nTTM^T PDDTR cct oddptjt on 

7197 

A 

56 

947 

SRLGPF*SJUAEPPF/PRVPVSRGPLSASSPPPLSSRDAGIiQTSQLLP 
/ S WGWS \ PNS 1 1 PELDLGPLSPTGRLSAAFPCRSPPSTGNGKDATSQ 
VDSPHP\PSFRSPGLELSRCPGDPVTATSPHEWTQQGKDWRPCSCPL 
VMQAPPTSHIFLPPPSRSSLRRPRAPSGCPWGCLG/RPPAGPWATPL 
AGLPSWS\GVRLPG*GVjLPPASRGSG*GPPRHRPDAGSERPAAAAAG 
FAGP*VCSNSSALPSVYEGLL 

7198 

C 

161 

262 

MLKGIPCLHPPLASSSQSDNSQKCLQTLPSLPR* 

7199 

A 

132 

1007 

NPQAPQATKKEGRKNARKGGSVCSGVPWGPPPRKVPTVPWRAESPGP 
GESHAGFESLRHGHAGAGPKGMPRRLPTENTVSAEGSTGGESGARVP 
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DTGKGIi * PATHPLPWGSLSGF * ETQRL * G/ PCLEGG/ YPGEAGQSLE 

IGEAAVWCQPHIGRRALQEACGEHGDRGRFGGNHAE/PDRGGNEAAP 
TNNNPTHPDGHTSL*G/VTGPSP/PGRWGN*GPESSEPGLDPGPPHP 
PTfiWCGI.PT.PSPRT.HS 

tr X O ri\»VJf AJ t XJJvO f XvXJ.nO 
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A 

59 

326 

TGLSPGSEIGALPGNIiV*APFFFNPKNLPGAVGP\PGVPPFWGG*AR 
GLGG PRS S E F PLVGLG PLAFRLGTKLGPVFKKNQE S KKKTS V 

7201 

A 

3 

263 

TLRKKFFQPGKTTRFSPHTP\FSSFFPQKKEKL*GGNPPKFPPPKKK 
PPPKICP /HHPSSSPPPKSSSSOSHAPPKKnPPPl?PT.V]fWPP 

7202 

A 

1 

348 

RYR SG I PGRRTRAGETPS LLKIQQPG / GDGARS * S FJL.LRRLRQENRL 
NPEGRGCSELRLCTCTPAWARE*DSLSKKKKXEFKIPIYFIiTLLWDV 
F P TO YTY S T PM C T.T iPG EMVTD T 



j 

44 ? 

fi 9 r* or c d pj, PT.T, PNWLE T "L. JkG A T P D / HTTP / T Tf fX TH T mCVKVSTK" 
LTEVINI^IIWA*YKIN/S/NKSWCLYxTmEQLEKKFKISSSSSSS 
S S S S S S PNH * GKDL YNENLKTLLKEMKENKW IN I PCS W IRRLNNVNI 
SILSKVTYKF 

7204 

A 

3 

310 

TINCVDITDIYR1jLHPSTADCTFCPSSNGIFTNIGHILG\KTHINKF 
I S I EFI * CLLSDHRI KLE INNKT\ WKI PKYL * INTL I KEKCEEQF *N 
TVN*NPNTI 

7205 

A 

3 

90 

AI FFLPDEGKLQHLENELTHDI ITKFLG 

7206 

A 

2 

1053 

TM P S SVS WG I LIjLAGIj CCL VP VS 1^ 

FTWITPl^AEFAFSLYRQLAHQSNSTNI FFS PVS I ATAFAMLSLGTK 
ADTHDE I LEGLNFNLiTE I PEAQIHEGFQELLRTLNQPDSQLQLTTGN 
G Jj F Jj S E G Li KXj VD KF Li E D V KKL YH S EA F TVNFGDTEFJUCKQxN\DYXjE 
KGTQGKIVDLVXEL\DRDTVFALVirc 

fhvdq\attv\kvpyelkr\lgmfn\ iqhl\kkl\sswgaa**kyl\ 
palihftqivpllgtygflksvlgfnwgih 

/ Z \J / 

TV 


O O A 

T?'n'DDr;Pr2Dl\TT7AT?P'PTf f^A. VT4A OCT UnTT^TTUT VT *\7"\7T 7AOT TT UTJDT 

x<AJtrir\jtr\jt'i\l. \xroX»ziy X XV XriXjlNXj" V V J_i/\yij 1 Xjr.lix-' X 
HCHANPDLIFEDMP*AMTDLELHELCQQDT^I1TOFQLDMDLILDSGKP 
QWGC I LSELKQKDS SN 

7208 

A 

1 

306 

lrrs/lcsvaqagvqwwdlsslqplppgpmpfprlslpsswdhrrpr 

PLPANFLYF * * RRGFTVTARMVS IS* PCDPPASASQSAGITGVRHRT 
QPPRYVFM 

7209 

A 

156 

331 

QPLILRCLPPRS IYRFNS IPIKIQVTFF\ *EKEKSIjLKFIQNIjKGP* 
I TKT I LRKKKVD 

7210 

A 

50 

271 

FRN I KHQRDYNEQ* YG/NKLDNPEDMNKF * ETRYT* / PNIxHQEEIGM 
LNRPVTNKVL * LVI KI PLTKKS PEPDGFV 

7211 

A 

584 

1001 

ILKLWGVG1X3*TKVPPQVPPTPHSPJV*GPPRRECLRRQ*PRP*SPGF 
GPLSFPTSVELMGV1CTVLSQGASRERGSWG*KGSPLNGRQFWAQGGA 
/T.^wawrwrifWWpri€;MRKr!*npnT?r^<5QT.oPT.TT)T)PPTWPMnp^ 

7212 

A 

79 

317 

ERDFLFVPRVGFQGPNLG * LKFPLPGLPLFSGIiTLP/ R1STWGLRGPFT 
TPG*FLDFLKKRGFNIVAQVGLKFWTSGDPPPL 

HOI "1 

/ z ±o 

TV 

A 1 

o z o 

V"VT ,T? T\7T T.T TTV* UMQ T?Cr^\7W T T VPCCCT TT T T?T TMlftfVTT fT TVT*T 

Lx^QTVTQWIFVCPPKTF*S*AIFlvrKDV*IILIILIW 
ILSFLILLKl^I*DVCFVLLH*VLFLKNSLCVYELILDLiFLYSMIKD 
M * KPOT\ PT.G PWKFVF * Y PT,P M S T YPVTTWKT . 2\ T .YMKMRT .Mf) T TCKVMT. 

i» XVX7 V- X \ X" UOr » » Ivr V r X X XJXvi J O X X x V XIX 1 * " 1 ' X I T 1 JVX'i v7 XjIM \^ X XVXV V 1 1XJ 

LSSKKK 

7214 

A 

1 

311 

LNREKLKAFPLTSGT * QE / CPLS PLL FNWREVlxATAIRQEKE IRG I 
Q I GKEE VTLSLFVDDM IL YLtKKPKDCTKKPLEL / NTFS KVA* YIINI 
QISVAFLNAKSK 

7215 

A 

6 

375 

ESLRSPSRSFS*PSPAYRGHGCSRPAS0T1NGDPCRAEAXSDTPVPQS 
SGSGDHNEKGPSRSPFQLH*PGPWGPAGCRGRPAPTPWRL\PAGPLR 
DSQEDPPSPEEPGSGGAAGQGGG*GLSCFP 

7216 

A 

80 

251 

GKFHISKTL**NINILL/VQLSWAIGINIYRTSNVEVLLQFLEDQN 
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TFPNFEHLAYR 

7217 

A 

161 

334 

GKFHI S KTL * * N INILL/ VQLS VVAID INI YRTSNVEVLLQFLEDQN 
\ TFPNFEHLAYW 

7218 

A 

3 

299 

P PFRTHQARR FAPTQD WQ IGFTQH/ RP / G VRKQKYLL V * VDTFTG * V 

KAFPTRSDXATAVITSLLSDIIPRIGLPTCIQSSSTLALISQISHPS 
CQAATIQ 

7219 

A 

526 

1108 

TQTKNSKI YFN* LLF* ETGPHSLSPQAGVWWHSHISLQPQPPGLK\H 
PPHPSLLSSWDYRHTSLRPANF*KIFVETGSLHVAQDGLEFLGSSSP 
PVSASQSARIAGMSLGARARPCLK 

7220 

A 

181 

272 

IQSWFNILKSIHVIHHSNRLK/NVNHIVSSSSSSSSSSSSPPFLIKK 
Q*KG 

7221 

A 

2 

336 

YDEEWRR S TVH PDEVGNLK* EDR * ALQNRLRVFLS LMDTGWLDNIjLIi 
DIDKADVIVKMLDA\SVIKMEGGTDNDIiNILDQDVEDEHA*KPGEPS 
KNEEGRAVSGLGDGERK 

7222 

A 

1 

324 

WS YLLVFESSSK*KKI I YQDCIGFVQDCLNI \ IDPIDSLRGKNVI I 
SLDTEKWGKI Q * PFMTKI FSKLGIEGNFLKWLKYQQSSTVSLLNV* ' 
ILKAFLSRTE*GCM 

7223 

A 

2 

692 

ATY I VDFGFQYNI \ REGQMLTAFCGMYP YVAPERS LGQACQ * PARDI 
QS LS VI L YFRNTVGRRARTLPF YS / AEAS KLQEKI LTGRYHAPPLFA 
LQLDS L / 1 KLLMLNARKGPSL * LMKNP WVKR I QKMPL I P YEEPL/ RG 
PPQTIQLMVAMGFQAKNISVAIIERKFNYPMATYLIIj*HTKQERKCS 
TIRELSLPPGVATSPSPSTELSTFPLSLMRAHREPAFNVQPPEESQ 

7224 

A 

2 

340 

S S SYTPEYWQ * RSRSHS PMSNQRRHTGRRANPDPNTCLRVFGLSL YT 
TDRDLREVCS*YGPLNGANWYNQQTGRSQGFAFVHF*GIDDSKEAM 
VRANRKELDDRR\ IRVDYS 

7225 

A 

604 

842 

RPGGQFCLMEASPSGFNPFFLLNI J PREWELRVPPPTRGNFF*FLKKN 
GF I / EVNPGG FDFLN/ SNDPPAFAS KGVGLTALTP 

7226 

A 

3 

343 

HEPNGFYAEIECERCNSPEIiDEEGYSIIPDEPGSTNGKHFYSSSESE 
Y*E*SHKI\YIQIKPSLSTDILNNAAPVDELKASIGNIALSP*PEGA 
IKRNLSREEVARPRRFTPSP 

7227 

A 

3 

304 

HEEKNEKIRNVGLCEAIVQFTR*YL*VCL*RSVNS*NCCWRIVPNIF 
F*PL*\TFSPSKPAKSLHTQKNRQFFNEPEENFWMVOTFTYTVYLSE 
IVWVELW 

7228 

A 

1 

167 

SARGPDGFTAEFYQTFKEELVQILLKQFQRIKGE\SSLKSIi*ETSIT 
LI PKTRKGT 

7229 

A 

2 

334 

ARDRAALETV I RRAR PELRRPG VQG I PRVHELKI EE YVQRGLQRVGL 
DPQLPLNLAALYAHVA*EKRWSILSRTILIVIL\VESLILLIIVLT 
YGYQLS ILCTCS VGNI F 

7230 

A 

1 

792 

FFYREVKPSLFVDCKILITEIPKES/NRVLQITRELSKMLAYKVSMIi 
KS ILFIL»T\ REYKKVKMPFNIVY/NNTKYSRINLMKDI YDLYTAI YP 
STAALKYRYMCNLQNIARRN*RPPLNKWW/REE*YSWIEGLVVKILV 
FPKQI YRFNAS PNKKS PPLFNKLI PKFP* KYKGRR I SQTNLGRI / Tli 
QDLKIYYKTTLK\NIVVm^YEVRQRLLPPAPQSAPIPRTCWQLSPSD 
LQGLKGAEEGLTDWGGKQSGTEITSGHLPPLSTFG 

7231 

A 

1 

2541 

MKPPRPVRTCSKVI,VLLSLLAIHQTTTAEKNGIDIYSLTVDSRVSSR 
FAHTWTS RVVNRANTVQEATFQMELPKKAF I TNFSM 1 1 DGMTYPG I 
IKEKAEAQAQYSAAVAKGKSAGLVKATGRNMEQFQVSVSVAPNAKIT 
FELVYEELLKRRLG VYELLLKVRPQQLVKHL.QMDIHI FE PQG I S FLE 
TESTFMTNQLVDALTTWQNKTKAHIRFKPTLS QQQKS PEQQETVUDG 
NLIIRYDVDRAISGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKS 
GSMSGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPA 
SAENWKARSFAAGIQAI^GTNINDAWLMAVQLLDSSNQEERIiPEGS 
VSLIILLTDGDPTVGETNPRSIQNNVREAVSGRYSLFCLGFGFDVSY 
AFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYP 
SNAVEEVTQNNFRLLFKGSEMVVAGKLQDRGPDVLTATVSGKLPTQN 
ITFQTESSVAEQEAEFQSPKYIFHNFMERLWAYLTIQQLLEQTVSAS 
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DADQQAIiRNQALNLSLAYSFVTPLTSMVVTKPDDQEQSQVAEKPMEG 
ESRNRNVHSETTMTTQTPACPSCSRSRAPAVPAPIQAPSAILPLPGQ 
S VERLCVD PRHRQGPVNLL S DPEQGVEVTGQ YEREKAG FSW I EVTFK 
NPLVWVHAS PEHVWTRNRRS S AYKWKETLFS VMPGLKMTMDKTGLL 
LLSDPDKVTIGLLFWt)GRGEGLRLIiLRDTDRFSSHVGGTLGQFYQEV 
LWGS PAASDDGRRTLRVQGNDHS ATRERRLD YQEGPPGVE I SCWS VE 
L 

7232 

A 

560 

901 

GNPATPSHCSLPQPPTPQEGAA/ YNPDFFFF * KQP * QPKPQVSSQPP 
WPLSTSLPPACPSCSRSRAPAVPAPIQAPSAILPLPGQSVERLCVDP 
RHRQGPVNLLSDPEQGVEVTG 

7233 

A 

13 

2931 

EEGRQQSPAADVKTGLRAPLAASAWPHRCCVRRLLARITMKPPRPVR 
TCSKVLVLLSLLAIHQTTTAEKNGIDIYSLTVDSRVSSRFAHTVVTS 
RWNRANTVQEATFQMELPKKAFITNFSMI IDGMTYPGI IKEKAEAQ 
AQYSAAVAKGKSAGLVKATGRNMEQFQVSVSVAPNAKITFELVYEEL 
LKRRLGVYELLLKVRPQQLVKHLQMDIHIFEPQGISFLETESTFMTN 
QL VDAJLTTWQNKTKAH I R FKPTL S QQQKS PEQQETVLDGNL I IRYD V 
DRAI SGGSIQI ENGYFVHYFAPEGLTTMPKNWFVI DKSGSMSGRKI 
QQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPASAENVNKA 
RSFAAGIQALGGTNINDAMLMAVQLLDSSNQEERLPEGSVSLIILLT 
DGDPTVGETNPRSIQNNVREAVSGRYSLFCIiGFGFDVSYAFLEKLAL 
DNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPSNAVEEVT 
QNNFRLLFKGSEMWAGKLQDRGPDVIiTATVSGKLPTQNITFQTESS 
VAEQEAEFQSPKYIFHNFMERLWAYLTIQQLLEQTVSASDADQQALR 
NQALmiSIlAYSFVTPLTS^m^^KPDDQEQSQVAEKPMEGESRNRNVH 
SGSTFFKYYLQGAKI PKPEASFS PRRGWNRQAGAAGSRMNFRPGVLS 
SRQLGLPGPPDVPDHAAYHPFRRLAILPASAPPATSNP\DPAVSRVM 
NMKIEETTMTTQTPAPIQAPSAILPLPGQGVERLCVDPRHRQGQVNL 
LSDPEQGVEVTGQYEREKAGFSWIBVNFKNPL\VWV\HASPE\HWV 
v&ij i CtCboAi 1\*VGKJ5 1L»1? oVMPGIjKMTMDKTGLLLtiSDPDKVTIGLIj 
FWDGRGEGLRLLLRDTDRFS SHVGGTLGQF YQE VL WGS PAASDDGRR 
TLRVQGNDHSATRERRLDYQEGPPGVEISCWSVEL 1 

7234 

A 

3 

338 

DAWVSVLCCNLSEAKRRINSLSYNQCENIRRIjPSF/RDVWNGENSRY 
VEKQASEQQVELLTNERYLMEESQLLLQYLHVYHMNYFLDLTCRPTF 

7235 

A 

147 

380 

ASCRFDFFCLLCYRVLFSNLTVFCFAYRYFNIYNSFCFTFFLYFWWL 
/FCYDFLL/DYVF*TLLIVIiVCFFPVSFVMFLI 

7236 

A 

1 

257 

PTRPDFFAQQQHLTPDKIYHVTVMPCYDKKLEASRPDFFNQEHQTRD 
VJJCVJJTTGEVFRIiIiEEEGVSLPDLEPAPLDSIiCSGAYA*EPTSHR\G 
GGSGGYLDHVFRHASLQLFDS/DW*VSSIHLR**DFFAQQQHLTPD 
KIYHVTVMPCYDKKLEASRPDFFNQEHQTRDVDCVLTTGEVFRLLEE 
EGVSLPDLEPAPLDSLCSGAYA 

7237 

A 

2 

502 

PVWWNLQRRTS AG PGTAVPGLAVA * FRQTGLTRS PS SGRDGAGVRDL 
WNPPPPVPIPLXPLGFHLRGPDSTLAEQPSACLPGS/PPCRNGIiGQP 
QQHGHIj WPG * I GIQFRLGFHGNQGPPGPSNLP YPPFLFAAPLHTRHV 
PC/ LHHCCPLTWAALPTPLCLWRCSAI qd 

7238 

A 

1 

828 

MGKDFMSKTPKAMATKAKIDKWDIilQLKSFCTAKETTIRVNRQPTEW 
EKIFAIYSSDKGLISRIYNELQQIYKJCKTNNPINKWAKDMNRHFSKE 
D I YAAKRHMKKCS S S LA I REMQ I KTTMR YHLTP YRMA 1 1 KKSGNNRK 
KKNN I KL YVCR VF I TDNCEEL I PE YLNF I RVLVDSEDLPLN I SHKML 
(MSKILKVIRKNLVKKCLELFTELAEDKENYKKFYEQFSKNIKLGIH 
EDSQNQKKLSELLRWLTKRREDGAAILNMIGTTDGIYVCLA 

7239 

A 

2 

775 

NISREMLQQSKILKVlRKNLVKKCLELFTELAEDKENYKKFYEQFSK 
NIKVGVNNH*FSNWPL*FFFFFFNSESLCKEHTLFQLGIHEDSQNRK 
KLSELLRYYTSASGDEMVSLKDYCTRMKENQKHIYYIT\GKRTLCYS 
HTRRSYNLVGSVGVFSTQVALLQLVLI*AR*L,T*TRSFSVLGSA*VS 
G * QEKSELDETI LNC * KV * K* LCNTSGETKDQVANS AFV\ ERLRKHG 
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LEVIYMIEPIDEDCAQQLEEF*GK 

7240 

A 

1 

2303 

QLLQRPGVAVPLVLCGHLAKMPEETQTQDQPMEEEEVBTFAFQAEIA 
QLMSLIINTFYSNKEIFIiRELISNSSDALDKIRyETLTDPSKLDSGK 
ELHINLIPNKQDRTLTIVDTGIGMTKADLINNLGTIAKSGTKAFMEA 
LQAGAD I SMIGQFGVGFYS AYLVAEKVTVITKHNDDEQYAWESSAGG 
SFTVRTDTGEPMGRGTKVILHLKEDQTEYLEERRIKEIVKKHSQFIG 
YP ITLFVEKERDKEVSDDEAEEKEDKEEEKEKEEKESEDKPE I EDVG 
SDEEEEKKDGDKKKKKKIKEKYIDQEELNKTKPIWTRNPDDITNEEY 
GEFYKSLTNDWEDHLAVKHFSVEGQLEFRALLFVPRRAPFDLFENRK 
KKNNIKLYVRRVFIMDNCEELIPEYLNFIRGWDSEDLPLNISREML 
QQK/ SKILKVIRKNLVKKCLELFTElA AEDPANYQK\ FYE\QFS\KN 
LKALEYHEDSXQNRKEAFPELLRYXYTSASXSDEMDTLKDXYCTNRM 
KENQKH I Y YI TGETK\DQVA\NS S LCGTVFRKHGl»\ E VI YMIEPIDE 
YC\ VQQLKEFEGKTLV\ SVTKEGL\ ELPED\ EEEKKKQEEKKTKFEN 
LCKIMKI)IK\EKKVEKVVVSNRL\VTSP\CCI\VTSTY\GWTANM\E 
RIHGKLQAIiKETTSTMGFTWQPKKHLEINPDHS I IETLRQKAEADKN 

DPTADDTSAAVTEEMPPLEGDDDTSRMEEVD 

7241 

A 

158 

405 

RMNliDSDGMDDI I SQESSLDMEGNYKKAQKNEREAIRLILAL*SFFD 

nnoDTVTC /pt , ncpvnT'cot)ti**PTXTi?nTpr ceTuicti 
DGSRIiIo/ Cr#OoljJs.yir bFii w w viXr4i? y Xvj1j£>oTWSP 

7242 

A 

3 

442 

TLLM FNNNVS LLTLTFCMFLE FSLPCTEDVPTCGD S CDKVLECG IHR 
CSQRCHRGP\CT*VIYFHAVFQAGPECLHCEEGCSKSRPLGCLHPCI 
L RCH P G EC P P C VQMLiGT AE S AVIj PQACHC P Q FCLQS P P FL FTHKEPQ 
SSWSSG 

7243 

A 

.2 

324 

SGWS LTEQDP YNN I VRTA 1 EAMAAVFGGTQSLHTNSFDEAIiGIjS \TV 
KSSLIPFSLHFLFS*QFSLSYISIFLFILIFFYFYLSLLIHFFYIFY 
LISIFLFFFLFIY 

7244 

A 

1 

3 61 

RELiIjArWQNr IsJbRPQGPr r oP\(iPGGEl\.KR* SJjPKTPPKGGPPGP/ 5 
PNGRLPSGGQI>PPPRGG/SPQGSSSLPKGPQGWGPPFP*RSPPCPPH 
RIGVPQVSSSPTPGLVFPRGPLNPGNLGG 

7245 

A 

115 

3 14 

LiVRSJjTSliPrlbPPrJP IJbbJjPb 1 * JjPPYvjPVYijbHYPPSPP JLTbLiIxiPP 
PSPSPPTSLPPP\SPITPTS 

7246 

A 

2 

326 

NEGECR KDVEMEPVLQAEKTNFQNHKGHE F I PTL \WHFPANCDACAK 
PLVmVIKPPPALECRRCHVKOTRDHLDYTDDLIFPCNIRYDVSSTLR 
YS * XjIjHFSDEI TI X 

7247 

A 

1 

364 

EHTRAEDHMLEEREER * RAERAQREE * L PE YQERMKTL * EADRNIRP 
RELEIEERER\RTEEERILGDSSLSRKDSRWGDRDSEGAWRTGPEAH 
S EWT I G S PQME WR P * E WL YDDRS YTGD 

7248 

A 

86 

512 

RGRVEAAPGYGSHLCHNFLPCTYWQKFGQPPGLQRTAASYSSDTMLG 
TKAETHPTFAFFPAQGPKCESWQSRDTSARSETLGTACSYNTGNNWA 
GAAAHGDTEDLQWIjQRKicB / RSERKTRTDLKG VRTSNGTARDDNCAY 
SP 

7249 

A 

2 

703 

PDCPSRPT/YARGTSPGNKTENAHKGPKSvS SAPKMR*S1jQDTG/RE 
DTREPVRFHTDTGTKEQTVI FAD * KI AGRCKT* GQFPRQRPPHGGGS 
TPQLFSAAIGTSFPARFPAVKVHGFKTSPTGRGGGHWANTR\SFSWL 
A 1 J\Js_Kh,biiKJvl K 1 JjJj-KAj VK 1 bWb 1 AKJJDrJ(_A I b FHKKkKOAJb'PFArlv* 
LYRRSVCARKNHIVLSKTGGRSVRRHDHRRSRSWRLRPGVSHHMRRS 

7250 

A 

148 

359 

LGIDLTKKVK3a^SKKNYKTLMKKIEDDSN*KDNSCL* IR/MTILPKA 
IYRFNTISVNIPMRFLPDAWADAW 

7251 

A 

1 

389 

RGAQAI I IVF/NLNDVASLEHPSKWLADALKENDLPSVLLFLT^ 
ALMEKDALQVAQEMKAE YWAVS SLTGENVRE FFFRVAALTFEANVLA 
ELEKS GARR I GD WR INSDDSNL.YLTAS KKKPTCC P 

7252 

A 

1 

144 

YALMEKDALQVAQEMKAE YWAVSSLTG INSDDSNLYLTAS KKKPTCC 
P 

7253 

A 

174 

810 

DQRGG^AYECQWEGPCRLTHPLLSTSHGRQVnaADALKENDPSSVIiL 
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FLVGSKKDLSVSVPVGGLPSLVRECPPLTLAFPQTPAQYALMBKDAL 
QVAQEMKAE YWAVS S LTG / EVGG FRALL * HS CLALLS LLHLQLCPVS 
PDPALPLSTYLLSVSGENVREFFFRVAALTFEANVLAELEKSGARRI 
GDVVRINSDDSNLYLTASKKKPTCC 

7254 

A 

1 

• 524 

MERFEVLGIPFSLQLWDTAGQERFKCIASTYYRGAQAI I IVFNLNDV 
ASLEHTKQGVAECLKEE*PSRWLF\LLGFKKDLSTPAQ\YALMEKD 
AFPGGPRRLKAEYWAVSSLTGENVREFFF\RVAALTF\EANVLAELE 
KIGGLDVIGMFVRIN\SDDS\NLYLTASKEEAPHVCP 

7255 

A 

1 

258 

FFFFLPEMTSTVYCRYTNEPSLWLGTVHGPADEHGQPTPFPTPESCA 
VIHSRQKVAVSNTGHRFPWLGGASLIHRLAHLSEWPTSR 

7256 

A 

2 

564 

RC FAR YR P VHAD PL YCQGHAAP VG Y S TVS RTHAPGG / VQLP P PP P YT 
A / GRG AQ VGRGC S PAR S G AW PHR S G AVDHPLTQGDHT APDHS S GLND 
HSHHGAGARHLPPPQGHAAWPP*DLNPCRTQPS*PSHSPRPGSPLAP 
VAPPWALGWNQEYFPKSDSLLLIAKLYLIQWVFPCTVDFPNVINFNI 
K 

7257 

C 

30 

323 

MRLQLPSTSHLPVKRGXKEGEAVVLPEVEPGLTAREQEATPRPRETT 
QLPTTHQALGWNQEYFPKSDSLLLLAKLYLIQWVFPCTVDFPNVINF 
NIK* 

7258 

B 

249 

374 

RLRWPLRE VGQS VDQRCTTKPRE PVS G I R YCD FL PG VYDCT * 

7259 

B 

77 

867 

XAG ALQD I TLSQQT P S TWKDTQLLTA I PTS P E PTGLE ATAASTS TLP 
AGEGPKEGEAWLPEVEPGLTAREQEATPRPRETTQLPTTHQASTTT 
ATTAQEPATSHPHRDMQPGHHETSTPAGPSQADLHTPHTEDGGPSAT 
ERAAEDGASSQLPAAEGSGEQDFTFETSGENTAVVAVEPDRRNQSPV 
DQGATGAS QGLLDRKEVLGGVI AGGLVGL I FAVCLVGFMLYRMKKKD 
EGSYSLEEPKQANGGAYQKPTKQEEFYA* 

7260 

A 

93 

1050 

TSGSHSGITQFLPCFSGAQPTGQSP\*KRPSPGPQHRAARCTHFPEG 
S PFFG S S C SRPRNRAWVS AWWAG S * R PAQWS QELTQTPKQAGGQWRG 
PAPSRTPMPRRLEPPQHPKEQERHGVALEQSR/PRTLPPAliGHTCV* 
DQPSAFPVPAQPWGNPSMGSA/HERSWCHVL*LSSHPLIQGRRGSP 
CPPPVGLGISPSPCPGGQPRPQPD\SSLLGA*ASGFSGTPKFKGPGT 
HHGPCQPLPSPAGLGSGLGEDLEAVGRTQPSNA/PLAPGQ*ATGGSA 
LQPPVHTASNAPGPVSPDAPPPLPDQAQRLQDQQAKAKPCPR 

7261 

A 

2 

643 

PRVRLNGI VDR * VLGCLPCYDLGI FTDRDLGVHADAHDLVENL \ KDL 
IA* * IPFLAEGIRIHGDKVTVALRPFHERMEACFKQLKEMVENEYGV 
RIMPSSLDD*RGIRPRSMVRAFTMPSSSRPLSVASVSSLSSDSTPSR 
PGSDGFALEPLLPKKMLSRSHDKLYKDDLDTDKKDYMTYNWTCRQFQ 
ISRDKMKRTDVSLYIMLVSPNPNLSY 

7262 

A 

59 

340 

SLLSEIFATSSSPDSSSMIPPMWLTSMMFCVGGWAQRKLGLCGGTNT 
DDDGDRGCRFLESGCWAQRKLGLSGDPPASASQSAGITGMSHHARP 

7263 

A 

2 

519 

ILFLFLFYFISFLRWSFTFVAQAGVQW*DLDSLQSPPPRFK*LACLS 
LLSSWDYRHMPPRPANFLYF**RWGFTMLARLVLDS*PQ/CDPPASA 
SQSVGITGMSHHAGPTMKLLNPQSEVIFSPRNSHSTLLTPLSRHLPH 
P I S I STWC IHYAPLHTVS I S AEEAPWLFACPQS 

7264 

A 

19 

658 

VQPLGDAGMP * DGQRAAPRS P ALGARALAQGSTGATAGASRKQEVAA 
GPAGHRLVSTAAAGGGPSLRQVCSA/GPPGRAVRSGGRGQVRETPPG 
CLGGAWHPREPNGHSAFRFRPAGPIGVLGPQPGP\PGK*PCTKAFPP 
LPPNATAKAGPPPRCFPCQPCLILGPPSPPGTLEGPIPLSETLPTPH 
S S ALGA I FAQGGGDGGAGATE PAHLNL 

7265 

A 

3 

317 

EESFTAHMLRHRGVQPFPCGQCDKAYGTQRDLKEHQWHSGARPFAC 
DQCGKAFARRPSLRLHRMTHQ\GQLPMPLAHALCGAAPGSQGFLWNH 
MRLHT * KKPFL 

7266 

A 

285 

673 

PLCTREKSEYMCTSYSLARSRVGQTLCKGVQWRHLSSL*PPPPGFKQ 
F/SCLSLPSRWDYRCPPPHLANFFYF*QRQGFTILARLALNS*PRDP 
PASASQ I AGITGVSHRDRLI INFLDFEGYT * S ATDS 

7267 

A 

3 

231 

QIEQLHRRF*AA*/RGDQPTIRKENFNNVPDLELNPIRSKIVRAFFD 
NRNLRKGPRGLADEINYEDFLTIMFYFLSI 
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7268 

A 

88 

362 

KINMGILLKLVYRFSIHPVKTPAGLF/TKNEQ**TEFDKRILKSIWK 
CKE / PYSTRTMFKKTEAKGLPS / P/DFKTYYEATEFKTS * DQQKDRY 
KB 

7269 

A 

2 

355 

RRLFFFFFFLRRSL/DSVAQVGVQWHNIGSLQAPPPGFTPFSCIiSLP 
S S * DHRR PPPCPANFLYF * * RQGLTMLAKLVSTS * PCDLPAPAHQS A 
G I TDMSHRARL I VSLFKRQKTYGNN 

7270 

A 

1 

337 

RGDTGRTLAQRGARSSG / LPTV/ LNPGRQLTS AG* PH * YGKARAAMP 
TPSLVRPVLKVGHSPSPEVSGSAAEIAPWTISSPLYNPSSLPGPASS 
PMPIPNSS PLAS PVSYTVLV 

7271 

A 

1 

371 

RSRPELNDYKDKKAATKNVQKK* MQRQKG/ DIAIDATETQRI IRDCY 
EHYNKMSSSSSSSSSSSSSSS/PSSSSSSSSSSSS/PSSSSSSSSSS 
QSLPSKKNSGSEGFTAEFYQRYKTPLILKLFKK 

7272 

A 

2 

406 

ARAARVFLHRTS LNLSNGGTQC * KTMWHDDE YSEPCMTPS KEGTLI 
VRQTQS AS S TLQKHKS SSSFTPFI DPRLLQ I S PS SGTTVTS WG ISC 
NWMRP * S I \ KH Y PTQKGS LS F * I STH\ TYPQS AFCVI FLHYC 

7273 

A 

1 

165 

GSLLPLLFNIILEVLAKAIRQEKSIQNEKEKV*LSLLTDHMI/YM*N 
PTVKKLTT* KLTT 

7274 

A 

110 

409 

PWYLAQAGYEWGLDWTVAPKKARECVGKTVTLQGNLDPCALYASEE 
EIGQLVKQMLDDFGPHRYIANX.GHGLYPDMDPEHVGAFVDAVHKHSR 
LLRQN 

7275 

A 

53 

930 

ARSYRQLTMEANGLGLQRIHTDTYPHPHLIARPQGFPEIjKNDTFLRA 
AWGEETDYTPVWCMRQAGRYLPEVRETRSAQDFFSTCRSPEACCELT 
LQALRRFPLDAAIIFSDILWPQMFSPPP\ALGM\EVTMVPGKGPSF 
PEPLREEQDLERLR\DPEV\VASELG\YVFQAITLTRQR\LAGRVP\ 
LIG\ FAGAPW\ TLMTYMVEGGGS ST\ MAQAKRWLYQRPQASHQLLR I 
LTDALVPYLVGQWAGAQALQLFESHAGHLGPQFFNKFALPYIRDVA 
KQVKARlA REAGLAPVPM 

7276 

A 

86 

552 

KQPQPTAPLWQPLKLSSKLGLAPGHGPCCLHLLSALGSLPGLVPWLP 
SRVGC/SS*GLLLTKSPQPSADPGQRRVAIiHHVL*VEPFASQEAQPP 
PSLP /WW/ PLRGDTESPQGPS PLSARCTHPGCRSVRGRWGRARGWPA 
RCQGAGGPGGWRTTCPCR 

7277 

A 

176 

779 

NSVSKLGLAPGHGALLPPPAERSREASQIPRGRAPGVHYLCLKQPRP 
RGLLGPQGNERPLPSKPHLCAGLCLGGPGPTSPLPFETIRNWARRPP 
EA* PQGACI/LLLSLGDPRGVG* PWV* TPGGLVFWMSLWHFGPQLKP 
PPLPRSLRPRPSLDLLGRKRHQGQDEKPGGAAPQMRAL*PVTjPLPPH 
PGTAPQLVTATRKP 

7278 

A 

32 

494 

FALVGQAGTQ*CDLGSVQPPPLGFK*FSCLSLLSS*DYRHVPPCWLI 
FVFLVE\QGFVIIiAKLVSNP*PQ/CDPPIIiASQ/SAGITGVSHCAW/ 
L*LSFSFFC*DRVSVTQAGMQWHDLSSLQPLPLGFN*FSCLSLPSSW 
DYRCVPPHPANFFI FSR 

7279 

A 

19 

253 

LENS FLKTLPLAI ACRN I KYMNLTKCESCKLREI \ KDLNKWGSTPCS 
TIGKLRIVKMSFLPKLIYKSQKTFFLQTLIK 

7280 

A 

3 

1023 

FLNWKSKILYSTDDMirYIENAKESRKKLLE*VN*SKLLDTRSMYKS 
NAFLYTSNNLGKPAF * KP YPLAI ACRK\ I KYMNLTKCES CKLREI \ K 
DLNKWGSTPCSTIGKLRIVKMSFLPKLIYKSQKTFFLADIDKSCFF* 
DSFTPGLPRLEWPWPRSQAH*KPSNLPGSSNSPGLSLPSSWDSTSAC 
HRRLANF FVFLVEMW * CMPTVPTTWEAKERAHLS PDVRG Y I S CDCAT 
TLQPGQQNEISASNKTTVINIVSYWCKVIHTR\KWCPEQTTSMSLRV 

£* ilJlj KJjOflWCjOKVO X INK. X. HtJT XlMxlX UXVJx V Vi UUtr lXlxn X <L Ivi li" *J ivn -t- x 

DLNVKGKTIKVP 

7281 

A 

1 

408 

DVCKPDLTVGLRYAPHQYKSWATNLPEAKYVFLNWIADLSKAAPRN 
ND \ ALGTS AQAY YG YRKQEQ I LL * EEDAxYTGTNES LERQMREMEENF 
AVEAANYQDTIGRLQDEII^KEEMARHLREYQDLLNVKMGP 

7282 

A 

1 

3003 

MNIDAKIHNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNTSKSINV 
IQHI^TNDI<I01MIISINAEKTFDKTQQPFMLKTIjNKLGIDGTY1jKI 
IRAIYDKPTANIIPNGQKLEAFPLKTGTRQGCPLSSLLFNIVLEVLA 
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RAIRQEKEIKGIQLGKEEVKLSLFADDMIVYIiEKPIISAQNIiLKLTS 
NFSKVSGYKINVQKSQAFLYTNKRQTESQIMSELPFTIASKRIKYLG 
IQLTRDVKPLFKKIYQSLLNDIKEDTNKWKNIPCSGIGRINIVKMAT 
LPKCLPTTCIHMIRPQFRVSSVRRDFNSEVAMEELKQTRKPI FLARE 
VRTLTSFPIFQPYGFlilLRGHRQEQFTSTWLRITWTKGILVIQVNP 
WLTTLEAG I S KAPDKPWNPS QVAPALPQKPAQVIiGLWLAFS VNRTWM 
QLETIILSKLMQEQKSKYCMLSLISGSESVSKRRIALGSALHVEGPT 
AHSKAMAQLSPRQRRSRAPTTHTHRALVRLFSGSQSAPPPPPRPSPP 
SAAMSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGDAL 
RPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADA 

T "NTT P P VTtfTP TKTP KVP T .HP T .MHP P JXTtfY T n'KA/PTI'T.'P'nnMK'TT.T.aRT.POT. 

KGOGKSRLGDLYEEEMRELRRQVBQLTNDKARVEVERDNLAEDIMRL 
REKLQEEMLQREEAEMTLQSFRQDVDNASLARLDLERKVESLQEBIA 
FLKKL.HEEE IO\ DLOAOIOEOHVOIDANVP * HDFTT\ ARRDEROOYE 
SVAA\ KNLQEAEEWYKCQVC* PLLRASNRN\N\ DPLRQAKAGVPLS Y 
RRTGC S PLTCE S GC P * KGTK* SPLERQ\MRE\MEENFCR* KLLNYPR 
PLLARL\QDEDSGIWKEEMARHLR\EYQ\DLL\NVKMGP*ALEIAT\ 
YR\KLLEGEESRISLP\LPNFSSLNIiRET\NL\DSLPL\VDTHSKRT 
hit I KTVGTRRLDRL I NETS \ QHHDDL 

7283 

A 

2 

178 

PRVRWN/ RKVQKQI I S IHYGQ * IVDKGSRTIQ * RKNS VFN* *C *DK* 
IFTCKRMNL»YLYFVSSSSSSSSSSSSSSSSSSSSSL*EENTGINIiHD 
LGFGNG FXiDMAP KAQATTA* LYLYF 

7284 

A 

699 

2528 

MQYNIFQVLYILDELHGILGNLTINKFIHRSIVANNLDDLDEKTHQIi 
TKLAQKE I KNPNRPVTS KE I E PVI KNLP \ TKE S PGLNEFTGQ / YYGQ 
TY*RKELTLILLELSQKMEEKG/V/IPVTFYKADITLTPKPDKDLTG 
\NYRPISLIWIDAKKL\KILAGQIQQIjIKMITQHD\QVRFIKEMQGW 
F \ HIQMS I D VI I YH I NRMKGKK\ HM I TS I D KKKHLTKI QHPFMVKTL 

V_ IVL10K.1VL3W r JLUMLi -LJS.X o lA.lnlj w yiUNolxl JUxvif Ir Li J. o Ivx" iilo f Juxxr H 

IILEGIJ^^IRQEKEIK*I\QIGRSKTTFAENMVLYIKNPKESTK*F 
RLI \ * KFSNVAGCKINTQKSKKNLRGN* /RQF/ PFTIT* KRIKYLEI 
NI j TKKVQDLDTEKYKTLLKEIKEDIJn'KWK/NILCWQVGRLNIVKMAL 
IiPKLVYRFNTIPIKIPVT/LFTETEKLTLEFIWGLQV/N^RYSSLFK 
T T T .NTOJTCVPnT* T P *5 UP K T YPKAT *5 T KT\TLP WYK7J T * QNP VE WKVOKO 
ITPI\HLWSIDFWPRGSRTIQ*RKNSVFN**C*DK*IFTCKRMNLYL 
YFIPYIK\INS\R*IKDPKCKSVNPIKL*EENTGINLHDLGFGNGFIi 
DMAPKAQATTAKKKKK 

7285 

A 

2 

577 

KKI KNIKGS IQQEGLTVVNMYAPNDKPS TYMKQKLT * LKGE\DNSTL 

I VGVFNITI S WDRT/ SQ YLSKEI *DFHNTINQLDLTTI YRTLYPTM 

KYTFFSSIHRINSKIGHKLSLN*F*KTA\NIFPDHNGMKrLVIIPKKK 

TGKFKNHRN*HTLNH*IKXET^ 

PQLTAR 

7286 

A 

17 

377 

QRKACAPRAWSGDPSACRPSVRS*GSTT*G/KPVNFDDYGDIYIPAV 
lTTjlCrFTjRPTjAESIjTjTPOAYDOTIjGTTCEYTjPWRRSGGIjSPHMIjSLiIY 
WPMVRLWGGRGDVYP SCYCCHMAIjYGM 

7287 

A 

1 

893 

SISSQQT*SSWAPCYRSLPASTWASSLCSSSSLSISFSFSSVYSSPP 
GSSTPLASILSAFF^PLDFSSlxASVSSLLPSIiNLTFVSWLPPASPLS 
SIxAFPASTLVSGLPSASSLMSSLPPGSACLSSPTSQHACSS/STIrSQ 
SLPSTSCCLSLQTP*TLPDWGFRSQKGT*V\S*ALRKTSSPFSSSTS 
GLGPAESREQQLGAKTNCASSRDRSSRCGSTENSST*GPTPRSSSR* 
PIPGSSSPGEKAAPAASAWAASPSSAPTOSGYELLESSRKEKESCTS 
WDSESSRAGAKSSGPES 

7288 

A 

827 

1393 

ARCRQAREDGRLVQRLCRRGHFVPLQHRRGAQERAPGIAHSSCAAEP 
RPLSLRTDISSALVWPRASCARSCPEMGPPZnTLKWASGA*TLEECPP\ 
LSGSPRKNLiD*PPGLEEPSRLEGGTPSSAIjLKG/PAPTYliPPYAltED 
LPKPTSA/VQHSPSTYPPHVLPIPPTPSSWRWAVGALCSRQDNLHD 
QPRL 


/or?* 
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7289 

A 

104 

464 

HLS VKLKLKFWEHDMLQGQKEWI I S IPIxLLPHPIxLQQNPHQPPHNPS 
WCCSACTQLSVPWPGLHQAPQWEAGS . 

7290 

A 

2 

394 

PRVRNHPIQKWAKKGIi\DSHIiSKEGPQ/AAISHMKRCPASPATRE/V 
LRCD/PVATPLLSV/WPQACKPGTQRDLHTTVFTAALFTVAEGQ 

7291 

A 

2 

344 

DTRTHVAEEDKLAIIVEPEKRGKYWWFDPLDGSSNIDCLVSVATIF 
GFYITNS S \D * PS FNHA\ LST* RNLYLVSLPFHLAFLSTFHLS FLFF 
ITT . «; FT . ^ i\FS YIj F FLiFS S F FS S 

XT XjO JjO JL» v w X XJ X. IT JUL uui X- »»? »— » 

7292 

A 

1 

348 

RYRSGIPGRRTRAGETPSLLKIQQPG/ GDGARS * SRLLRRLRQENRI* 
NPEGRGCSELRLCHCTPAWARE *DSLSKKKKKEFKI PI YFLTLLWDV 
FT? TOYTYS I PMCT.T.CGEMVTD I 

7293 

A 

10 

436 

E Y YRFLE AKENRTAT Y *NL/ MDHAAKAELRGKF I S 1 YA Y I KKLE KFQ 
T KTJT.TMO^ TOCT.FKF * T KKSSNVFORARKEOTKPOI SORK* I SEKE * T 
KLRLNQCHKKSLKQPLLS * TKKDQNK* MRKEKGD I KLSGRVGRPEYL 
QT 

72 94 

A 

2 

A t\A 

r , PT , QT*c:^CWTiMWnPAPPFT?t J *P'LiPTFP\ PGOOP1TTIjP/RPTL»RP/SP 
*TPTSACS/RP*/PAVPSRPSHPSHLPPSCPPQFTFisrvPHL**GGPH 
PTHNLPPNPCPALLQLPGFSTLPRLSP*TPPPTPPICHCVSRFLAP 

7295 

A 

27 

88 

CREEDDDLV^SEDS*QWGESLKRVSFTLIxDDi^^ 
EVKS\ FEKRQEKESA* vgrkmmtwlkvktvnngeka 

7296 

A 

109 

411 

KFK1WWKLNDTL / LNNHWVKEE I KKE I RKYLKRNF^na^TTYQNIi * DS 
PutsJrv xjRttt VV ±£4J\x XlvXjoijLjo x JLoxvaj lxjxxjJ\xJljjfcijrVE»x JV^MJVVIMKKjrujJ. 
RITEEIN 

7297 

A 

13 

'811 

VCRRWSGGLARRFGSGCCCSSCPQRRAARRSQVQNGKYIi/FDQINRS 

T T?*TVT?Tir'COrkMPOr , Vlin^rTT?T7nT p PTjT?D/T" , T 0"DlTMM?*T? , \r\7PDTrr.f! , T r U"V 
ijKJM x JairwooyiNi^o^xriO V JLCiJMJJj 1 jfr K<jvjX oKJSJ v 1T1/Lc.V V jrCKlvLiljr X rl i 

QITKlSTRLYRENDCMFPSRCSGra^ 

\7P * mtjg aphp «r/ pci p c vphoq /tittd tmypawtpwe / wgtcht^antls Y 
RS WTEGPLQRRSGKVS \ TQWPWKKKNSTAYFRG SRTS PERDPL I LLS 
RKN PKL VT) AE YTKNQAW KS MKDTLG KP AAKI3 VH L VDH 

7298 

A 

22 

A O "7 

X, c oil I jc r ^XvjT xiivxjjtr jt xnr xi'tnxjxjow-ivv uuc J. c \^\~. r xtxxxj vr ir vjjui x x 
LKLFCYTFKFTOCFLYSFSFTVHLI*YIFPSDIMFLYVEV*FKSFF* 
PYtPT J CTTVlIAFISFI J NTI J i\ ISLFSS** *LTIFYCPYT.T,FYH 

7299 

A 

131 

353 

I FLPEKKLYKLEKWDRAFDOT*NF* T* K/ IHTGEKS * KCKKWDRAFK 
GWPHFI IGKIVHTEKNFYKGEQCGKTFN 

*7 1 A A 

A 

lJ x 


T T T.TFKKT .YKT.FK / PGOGF* OOTjKTYLNT KTvTHTGEKS* KCKKWDRAFK 
GWPHFIIGKIVHTEKNFYKGEQGGKTFNEGSHLIAQESIYT*KKIiSK 
YKD 

7301 

A 

305 

326 

KTFRSQKKK* IHW/DYIKILNFCARKlTriSjWKPvQPTE*EKIFANHI 

QZV*G 

7302 

A 

3 

416 

FLR* IGSHSVTQAGVLWRDFGSLQPLPPGLKLFSCLS FLSS *DYRRV 

PPCxxAKFCIFS/VRRGFAMIxARLVIxNS*L*WCPP\Cm 

QSAGIKGVSHHTWPHK^LKNLITVFSYQSKSTVxWQLS^ 

7303 

A 

3 

476 

PESIKRAKEMDLVGIjGLHPLFSSRFEEEELQRIjRLVDSII^^ 
IFSEWGRKKQRSGPlSrRGAKRRREEARQRDQEFYIPYRPKDFDSERG 
LS I SGEGGAFEQQAAGAVLDIxMGDEAQNLTRGRCK3LKWDR 
QSGQEDKKKIKTESGR 

7304 

A 

2 

1050 

REDTKAAVxJIJHxJxaJ^lJ^^^^ VA1 jTUixiAxl* x L* 1 £»xj1j 1 1 yxvoL 
AHI YSALDPTARKINIxAKFTIjGKCSTLIVTDIxAARGLD I PLLDNVIN 
YS FPAKGKLFLHRVGRVARAGRSGTAYSLVAPDE I PYMQDLHLFIiGP 
LPHPRPTPQGALRVWAGVIX3HAGVGVPQSVVDEEDSGLQSTLEASLE 
LRGLARVADNAQQQYVRSRPAPS PES I KRAJCEMDLVGxjGLHPIjFSSR 
FEEEELQLLD\ LVNS I KNYRSRATI FEINASSRDLCSQVMRAKRQKD 
RKAIARFQQGQQGRQEQQEGPVGPAPSRPAJLQEKQPEKEEEEEAGEK 
CEDIFSEVGGRKRQRSGPNR 

7305 

A 

3 

872 

PKSWNGPPHPPGHRHSRSHSNFSGCSVDEERSRTSKDSSSPVNDHRG 
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RQ*MR*KNPFAASSRSTSSSSSSSSPSPPLWSTAGRTPTTRSGPSPA 
LtDlvVQ X x MJbi hLb L1ATA0 I / B FAG S AHVPGHLi FRAE * VLPAL * GRALLG 
TP VTNNLLGE I AKP S L S QS KRLKAAS / AAKYPHD VVNKLS CDEARNH 

lurCKoVrnrrnrKi \ IV xU^uor 1 rKoLKiiKLLo 1 V 0 irKK X KlvAr'Vjrx/ 

PLVLPRP*GPSPGPLNTSVDSSQKPASNPGDVQNHPGGLLVGNCNSE 
PG APS AES PAV 

7306 

A 

99 

433 

KLKVKGGKRYRMLTL I KRK*H * I KI QNS CVFLYTSN* ELENK/ IFGR 
YVSFMIA*KIKS/LGIILMKDV*NLHSEDYERLLTKIKDDSNKWTDM 
QFTRIR*YHKMLILK*PIIY 

7307 

j± 

1 a 

110 

T.'tmQ'Krn? CDni?nif CT,m?CTm?uDr , D\7* T?f"*T / td cmt , t , ^7\ 01 jr^MnTuo c*t 
xjitxjoim 1 Kgryf ^JVoJuVjr 0 l\jrCirrt^.Jr V w c*vjJj/ Kb V X y/VJVy Wlr X xiooJj 

QPRPPGLQSDPPASACQAARTLCRWY 

7308 

A 

1 

240 

LVCLSKSECLQPLFYHHLTLKNCQFHLCR I LLHPQC * KQALRHGKAC 
PLCHCCSHAALYMICLAKED*KTFDPQELMYSL 

7309 

A 

2 

1790 

ANQLQGEGVRFFWDCRPAEQYGAGHLSTAFHLDSDLMLQNPSEF/D 
TAIL*SLLEAQKQSIESGSIAGGEHLCFMGSGREEEDMYMNMVLAHF 
LQICNTEYVSIASGGFMAIjQHHLADINVDGPENGYGHWIASTSRSRSS 
INSVDGESP/ KWLCKLRWILKHQV 

7310 

A 

1 

1426 

ARILLDNDHYAMEKLKKRVLEYLAVRQLKNNIiKGPILCFVGPPGVGK 
TSVGRSVAKTLGREFHRIALGGVCDQSDIRGHRRTYVG\SMPG\RII 
NGLKTVGVNNPVFLLDE\VDKLGKSLQGDPAAAIjL*/EVLDPEQijHN 
FTDHYXiNVAFDLS Q VX> F I AT ANT \ LAT I PAALLDRME 1 1 QVPGYTQE 
EKIEIAHRHLIPKQLEQHGLTPQQIQIPQVTTF*\IITRYTIIEAGVR 
b IxUK \ NXAjPIjCRAVAV KV A* GQRTE \ P I WTVIjM * LREKVAENTS * KM 
KNI^S/RDTTDLALPPEMPILIDFHALKDILGPP^EMIWSQRLSQ 
PGVAIGLAWTPLGGEIIFVEASRIiDGEGQLTLTGPSSGTVMKESAHI. 

EVTI SNLS RPQLFSGRLVRSDVAMTGE I TLRGLALP VGG I KDKVLAA 
HRAGLKQVI I PR 

7311 

B 

169 

379 

VSQRLSQPGVAIGl^WTPLGGEIMFVEASRITOGEGQIiTLTGQIjGDVM 

7312 

A 

3 

413 

HHFLTCRIKNKIlTxHTIATKTIK^ 
NKWKNISYSWLR*F1\Xj™ISVLPI^ 

r*lVr X w JVNJSJlVlV XoKJSy X X JNJViVo X JNAjr X X» 1 J-i 0 V X i\. 1 x X JVVO X XJK.X V 

7313 

A 

2 

423 

SFPLPFPLPPCSSPQGPQLLRKAITRPWPGTCPYPASTQVGEFTLRT 
RPQLSVGV*EEGCPYPSPADSPMSEDPGAAPPHHPRSPL*RNR/C/S 

QKnTCTAVTTAWPAQWT.AQPT.OPTfST.PrciTPlsjTvj^ 

E 

7314 

A 

2 

417 

ILSTLFQICHCLWSSQLLLTSHCQx^YCCSSKN^ 

WT.'F'A *PTTTT.7XVTT?T.VT.Qr , T.I^VT , 7XT.VT!*Ti*\r\/r*T?T T T \7T DCVCVT T 17T CI? 

x>i i j r.t/-\ r inu/iv irui iiouiior x/\xj x c*r v v j* XjXjXj vLKo x 0 vxjXjx: JUor 
/LFWNFS\THILELLCPICIiLNFVLLFMYFSFSVLQFGYFLFLCLQ 

7315 


2 

*± X JL 

KyoxjXXjotxijirlSVywriL/Jjoo \Xjyx^xjFfc'UXjlvlvXr irFoAoKV/VijxM XL/VxiH 
HAWLVFFPFFV\RRGFAIVAQAVLQNPGLQAICPPMASPKCWDYRRG 
PPHPAQE*EHFLRSLPSTGESYFGEL*HISTRKIKILVHFEETN 

7316 


2 

333 

TF!PAT»CfPRTi'RPQflTTQAWPKT.P / T.PRPMT,T?c;rT,QT.D\ CQLJTIVPPPPP 

HPAGFFVFLSRRTGFHC/ RliARDGLDLLTS * SARLGFPKCW\DYRRD 
DPMPGPVSFNRMENLMETVD 

7317 


3 

232 

AA c ?TPRPT.PPT.P t ?PTPfiT.VPW*P.PPl?PPTQQa A\ TT'Da OT?P7A D'PQZiP 

PxTRSNxTOU^LSPWSSRPVTEEMQSAAJ<HNG 

7318 

A 

1 

386 

IPWSRVG*1WPVKNAHLPKAVSKFKTISIKKPPLFFPRMEKTILKFIj 
WDSSSSSSSLPV*SSSSSSSSSSSSS\FSSSSSSSSSSSSSSSSSSS 
SSSSSSSSSSSSSSP7?HKIKPlTx^RQLIFDKANKN 

7319 

A 

2 

411 

DRKEERLRSPRSGAAATMSSGGKGVAEWDPLPCLDPELWDQFDNLEN 

\HSLEKFIISAKERTEIELSYTKQLR1^LCTCQPKRIWKE1\^ 

SPSLPWKMSDHARQH*VIFFEVLNFQITVELQGHVQNQKQE 

7320 

A 

58 

502 

DPRVRPGTRGPRSGTAGGEl^WEGPFGVIVRVLMFLSxnSXiRAAEIiGE 
VGAGGPRGARRPSQTETAA*PASRFVRGLGPQTIiRPTSPGPRRSRPIi 
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LGDEPPRRRVGGASFPPTRNGSGAW\AIiWRRPCWECSSRSFL.RTTKL 
FLFFGFG 

7321 

A 

381 

2378 

AASMEHIQGAWKTISNGFGFKDAVFIX3SSCISPTIVQQFGYQRRASD 
DGKLTD PF * TSNT I RVFLPNKQRTVVNVRNGMS LHDCIjMKALKVRGL 
QPECCAVFRLLHEHKGKKARLDWNTDAASLIGEELQVDFLDHVPLTT 
HNFARKTFLKLAFCD I CQKFLLNGFRCQTCGYKFHEHCS TKVPTMCV 
DWSNIRQLLLFPNSTIGDSGVPALPSLTMRRMRESVSRMPVSSQHRY 
STPHAFTFNTSSPSSEGSLSQRQRSTSTPNVHMVSTTLPVDSRMIED 
AII^SHSESASPSALSSSPNNLSPTGWSQPKTPVPAQRERAPVSGTQE 
KNKI \ RPRGORDSS YYW\ E I EASEVMlA STRIGSGS \ FGTVYKGKWH 
V\NVAVKILKVVDPTPEQFQAFRNEVAVLRKTRHVNILLFMGYMTKD 
NLAIVTQWCEGSSLYKHLHVQETKFQMFQL\IDIARQTAQGMDYLHA 
KNI IHRDMKSNNI FLHEGLTS * KFGDF\GFATVK\ SRWSGSQQVEQP 
T\GSVIi\WMAPEVIR\MQDNNPFSFQSDV\YSYGHRICMELDGRGEL 
PYSSH/INNRGFRSIFHGWGRGY\ASPDLSKLYK\NCPK\AMKRLVA 
DCVKKVK\EERPLF\ PQILA\ S IELLQHSLPKINRSASEPSL\HRAA 
HTEDIN\ACTLT\SSPRLAGLPS 

7322 

A 

3 

419 

EPQWIQAIiMPEKEPARQSRGGAIjRTDSSEPSRNEVKKTISFIIIjAKR 
TYLE I YATQDA*NLCTEKYKI FLKG I KEYRTKWKD I LQS WITRHNVK 
MAKLP/KFIYS FTICIKMP\LSFYKLILIFMRKYKGCRIANTI*RER 

7323 

A 

3 

222 

DAWVTSTNADKEFEKIQHPLMIKLSSK*VY\NMSYIYPLFKIIKGIY 
DKS LVN 1 1 LNVE KLKAFF\ LRTGTRER 

7324 

A 

2 

395 

GRQPRWRLWGQLLSSGRRPLEPGWHRPSRGERGHHASFPEPGPCPGP 
QQRPQPG*VPVPEP\PPKGGPEPRSSGRKASGPSAQPPPAGDGARER 
RQSQQLPEEDCMQI^NPSFKGIAFNSLIiAIDICMSKRL 

7325 

A 

63 

452 

KNQEQESEETLPNLFYKAS I TL I PKLDTQK* KKRKEKKRKKKK\ EN * 
KPITHRNIYAKILNKILAHQIQQYIGKIIHHDQVGCIP/GAFDKIQY 
RCMIR/TLQKMGIEGTHLNI IKAI YIRPTDSI IENREKP 

7326 

A 

2 

137 

SKAKINGWDYI * LKCWYTAKKAI SGVKRQPTEW/ EKI FAN* SSDRR 

7327 

A 

755 

1287 

GGWDFAPAEGRERLAPCQIWDPSCLPTHW*PLSGLLSRPPPGTQSPQ 
LSPYSCAKSWCPCPSLSPGSPMN*PSVPRTTGPSHPEPSAHGSHTSP 
IVATTERAPERI SGGDSLGLS PSGLEGVNREPLFT\ PPS SSGSSVSL 
MSSTP\PPAGHSAA*SPQPGRCSGPAQGQATSRSVRGP 

7328 

A 

1 

2240 

MEERKGHKE IHLRRLKVR I LS VRMS AI KES PRLL IGNQE PLKQLKQE 
SKimRYVFNTDYSGCIRLIQSRLSHFPESSGQQNPKPYMRKFKTPIiM 
IAEEKYRQQKEEIEKQKQESSYYNIVKTQSQNQHITEVEKEMPLQKT 
NEEVSLSGIDSECTWQPSPGSQSNARILGVCSDNQLSTTSPETVAA 
KRLHHVIiAASEDKDKMKKEVLQSSRDIMQSKSACEIKQSHQECSTQQ 
TQQKKYLEQLHLPQSKPISPNFKQSEIDVQTFTKKQYLKTKKTEAST 
ECSHKQSLAERHYQLPKKEKRVTVQLPTESIQKNQEDKIiKMVPRKQR 
EFSGSDRGKLPGSEEKNQGPSMIGRKEERLITERKHEHLKNKSAPKV 
VKOKVIDAHLDSOTONFOOTO\ NOTAES KAEHKKIiPOP YNSLOEEKC 
LEVKG IQEKQVFSNTKDS KQE I TQNKS FFS S VKE S QRDDGKGALNI V 
EFLRKREELQQILSRVKQFEAEPNKSGLKTFQTLLNTIPGWIjISEDK 
REYAVH I AMENNLEKVKEE I TH IKTQAEDMLVS YEN 1 1 QTAMMS S KT 
GKPANGA/ALVSTKTSPKVSNVHVSNNENSEQKEN* IAKEKTVQHQV 
AAHDEATVRSHVKTHQEIKLDDSNIP\LPL*KHSPPSPTFITIESTA 
RRTENPTKNELSQS PKKDS YVEPPPRRPMSQKS E IHRANTSPS PPRS 
RS EQLVRLKDTTAKLS KGAI PC PAATPVP I VE KRSE I IMS PATIi 

7329 

C 

110 

298 

MPWISEGRATRPCLRVPSARRGDEGLHQRATANSVYKSRNTWLCWIC 
KPPQSSSPKISEKRF* r 

7330 

A 

127 

744 

LVRVFPWREDKVWWHtQE/CRPKR*KSKPGLSGPV/APGHSWDSTET 
TEATGLLLIPGPTRRPPTLYHTPCCLSVPRWCRPSSPIiliAEGTRRQG 
LVALPSLIHGIQPMGLEVWGLASPPR\PGRKWHR*KDKMATQALSVC 
RA*TCILPPSTKTSRPRPHSTPRGPAPTTVQRNSRRVNKGSPESPLP 
RPGLPVSEPHVAGTRTARDG 
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7331 

A 

2 

433 

FED I K* VIGKVKKMMS KQRGN INKGKI I KRNQKB I VELKS TVTKIKN 
LLGGFKNRL*QTDK* I S YF/ EDTTVE I SQSGKQS S SRLEGREQSLKN 
> LWNTIKLTRVYTVGVSEKSKKEQIV*KISAGNFSNIiIKNMNINIQEA 
QRTP 

7332 

A 

1 

281 

PR/PDRDSEAPGADGAGPRGALPVPGGHGQDPAEDGPGVHAGGARDG 
GPDPARGAYPPMPSTRGCRKGSLANLP/PVGFQYVHPFF* FPGVTAK 
I QRRMDQVFTLAVRETEAL I PPEVPTRQCPAHEAVGKVHWLI CHS WF 
SICPSLFLISRAQYFYP 

7333 

A 

33 

428 

GSPAPGLPKGWGFRGGPPGPGKFVFFKLPKGGFPGAEIFLGEFFQRG 
KKGLGPIFPPVFLKTEEGGTLS/NSFFKGRVFLVFKPKAGSSSSSSS 
FN i JvJf X X JjNN I D 1 Kr FNN M LANK I * QC I KE VMQ YDRTRG 

7334 

A 

491 

731 

DRGLCLPTLEVQWAPILGSLQPPPPGFKQFSCLSLPNSWDYRCMPTC 
PANFFVFFCIF* *\SRFHQVGQAGV\QLLTSSDL 

**J "i "5 c 

73 35 

A 

2 

575 

AGQVGQLGFPGWAGGEGRGRPPLL*IiSLPGRQLPREEGLTSVSGHPC 
WP/DSFPGLPGVEKPQGNIi\AGGALLPKLQRQSGPGNHKTIjPISRGP 
AD A V P V CHS TV G PRG * H S iiALHTVP P E KFN I YFFTCPENSSFNSIi 
PL P FRS S L EG I Q AG VS S AWGH S AG PGTLGL S GDGRQRQARRFS LHS C 
LPCSP 

7336 

A 

19 

422 

GEKTDKIQHPLMIKTLTNLGI EKAYI S 1 1 KVIHNRPMACI I INGAKV 
KAFPVRSERQRCSPSPLLFNIGLKVPVRAIEKEKEMKGI*IGKKEIK 
Lv_ Li b GNU \ M I L»Y L KH P KNS TKT I RTDT FS KVAGYKNTDAWV 

7 337 

A 

33 

193 

GNXGETLMD I GLGKE FMTKTS KAQATKPKI / DWDLN * KS FCTDTNIM 
HMVLIYL 

7338 

A 

87 

713 

DIYYNCFGFASRKTDCLSTRIPCFERTSSNS*TQNTjTAIjF*IKYDCA 
TVQACRSRNKWEVRDALNKFSGTNSIFYIIKRFYIYIYIFFFFFFFE 
TESHSVAQAGMQWRNLGSLPAPPPGFTPFFCP\SLLKWLGPTRRPP/ 
RQTLANFFVXiIjVETGF \ TMLA\ RMVXj is* prdpp/ as as QSAEITGV 

shctrlkkirfakghvefffeshve 

7339 

A 

2 

105 

engwcwiatyg\k^kldsylslhtu^wsr/wdldlgtetmeilee^ 

G EVIiLD I GLG / KEFMTKTP KADS TE I K INR * DS Y I WKNENW I P I S HY 
MQELTQEGLGSRN 

7340 

A 

2 

362 

STHETFSKIDHM\MSTSLSKIKETIVLSIPSSSSSSSSSSHSKSNLQ 

NYTNTWK/I^LLQNT)F*VNNEIKMET*KFFEMNT)N^ 

A VIjK * NK KDHKIjTT * HXiKEIjQKHEQTQ S 

7341 

A 

58 

341 

IKKFRWGPKPPLKEPPGV/ SPDFPI I / NFGNPGPVPGKDFKFPNFSS 
S S PWGGRAPP WPTTPGG * MGRF P * PPAFG VPRGGDG S P PPGAPKEG 
PLS 

7342 

A 

2 

428 

FLRRVSLYCQAGVQWPTSVTEPPPPGFK*F/SC1,TL,PSSWDYRRAPP 
RPAKFCIF/M*RLYFTILARMVSIS * PCEPPASASQSAGI\TGVSHR 
AGLViFI^KPVTGLGFDSHHPYHSQISEKDPPPISSlFWKKSIHTNTj 
LIPN 

7343 

A 

2 

688 

PGPPPEVRQHIRVPGTRTGRAAIWPGSFPVGFPLHRAQGRGQSPFPF 
PQGTKGLGLEWKGCCHAGAQETGS PLP IMVHFSGS WEMVSRGKPS S * 
QLEA* GRGRAPGLGS KAGPRLHQDTPASQCMWVCLNTPS RQRVRGPD 
TQGYVCVPIiPPQPHAPRMNQGQLPPGAPPPPPPPSLVPKYFPPSQSS 
WDRKIPRLQAPPCPPRH/GPPS/AVTNALPNGPRLSAEAPWLG 

7344 

A 

1 

603 

SGCSRRCPGPWACWTPQKGFRGGRKRRSGLWGLCCWC\RPHSHRRAG 
CHHQDGECDRWGLLPRDHSRLWI FLRNYWHQLGGE* KANAGGSDRVY 

nPWf! / Z\.r?T3pC;J\Z\PQr22l\ Zi. VT .nMQTT .TsTD (IDC T>DT7T*\7\ O DTDT 7"/ r ^tAjr"T*r ,f T T MC 
W r rturroH/ir ouHHI Jjyrlo 1 xjTJrCijiro JrrCiiL'Aor llr V v7VyIjX L. 1 rib 

TRQRSPVTPWTASAS*R*EATPVTAVTSMPAGAQSPRPPTMSSSASA 
AARTGCTWNGWFWAL/ | 

7345 

A 

32 

773 

F WSGP ALGP S PLS G E / GDMVATDETVCVCSGCEG APDPG S * PQP APD 
QRLGQCPV/WLPVT*PTSTRGTWTASWP*LTDSSVTGPTS*SL*CSG 
REKRRQWAQPWTMATSDSSRTGGLPAAPST\ADASRARDVPGAEAAP 
GVHGLAATAGCLCQGEPVGPHVAVPVQESGADPQPQDEEEGPQPAQG 
PGCEGA/EGKFPSSSSATSVAAPSGSASCPAIiAATGS*PAASTARPR 


;l O O S »+3i >,, O 5~? ifi 1& til id! 






PGPSSTRRTAGTWWPS 

7346 

A 

19 

1562 

GR YWRWDHGRESRVSGPRFR P\ S FLAALPPGRHPWPWDAAAGGS PA 
S *GGAA* SALGLSQPRPPGASDAAPDSTPAGV* TASSSALI PSRAR* 
KGSGTLEETKSRPPHSHRSGPVDLSPRSSHSRDSTPVHGPYFRSAPD 
P\GAPPRPSAGQSQLGGGSSGSPGNSIPCAFVQQ*LERA*RSHS/SP 
G S EHRRA* SGGLROG VOG I E VFGLGHPPGG Y ^ PfiPO pop if QD PP n wo 

LGVEIWCRGLYIGWKSHSPHPALAEDWQPVPPAAMAVGAPGLRRIQR 
TSRRWGQCPNEAGHAADPRDSGPPGNYQPVSLHILHMDPGTC*PE*Q 
QHPRPPL *MAGKQWPCT * LDKGQSLGLAGSHQHQYCQWW * QQLQSHT 
SLCRSPPPR*RASASPC*ASWGRSSAGAAGFRALGR*HLGTSLPGSA 
GSLPPCTAEPPADASP*LSPTVLAELGSEGASEECHSCPHGSPQSLS 
GSHRPAESQYHVER/WKQDMGPISVKSPTPRRGPTSGSGLLRIiARSS 
L 

7347 

A 

473 

792 

MLARALDARFLRWCRRDQPLVYHSP*PPGASDAAPDSTPAGV*TASS 
S APAGEKDANQMEWARR/ VP RAPPPGSPGPSLI PSRAR* KGSGTLEE 
NKSRPPTQSPFGAG 

7348 

A 

3 

411 

LCAAVNRALDEG I PLTS AL YYATVTLDQVRNI LRSDTDVSMPLVEER 
HR>OjYDTGKILLEKFGGCFLNCVRESENSAQKLMHSW\AESCPSYRD 
VTLVEGKRASFYKRTQILVSYTCIVLEGK*DACIiQDILSITMIi 

7349 

A 

2 

331 

AGGPGLPCVPHLPAGARAWRCQQPPGAACCPGRRLPLP/GKPSPGSS 
RSSRQRGQSCRTVSVLPPSSPGC/RGSQVRPDLGPPKAEGCSPWGLR 
AESCGATGARVRCHRCW 

7350 

A 

1 

1062 

EAG PGGOAARS ORG S S LG L VL P P T .RLOR CG A P PR W A ^ PT P ZVH Pf! Zi PM 
QLAA^WLIiWP<^*PPPGLCYPA^pppppQKHP/GIiLPAPRALPSPRE 
LPGEEPSAHPVHQG/PSCRATRPASASFKSHSGEFRQGLISAAHILQ 
ELPGPAG\ ENFQKVTNGAAGPAARHGPS SRS SC/ PCTHGTSGNREKP 
lstrskkntcksawoattoaggpglpctvphpako 

CCPGRRIiPLP/GKPSPGSSRSSRQRGQSCRTVSVIjPPSSPGLPGTKV 
RPW*GHLASLVWPGPPGKSPGQSTRPVGTLKRGVHPLPQVVGQSRFA 
I LVLFGRCQGEGRAGMLGTCCLLAKPEVTVA 

7351 

A 

423 

826 

OKVCAHT / CLRGTI C * I FTHLC VLRGLCWLLGCEMCMM * MCKYVCMC 
EK\ CCTFVGRC/M * VC VCAGDREHLAECVRG C VRHWCTRA * VRMC I / 
CTCTC/LCV/CV*VSGVCICKSPC^CKSIVN/C^*A*THTWFCVHV 
CV 

7352 

A 

310 

863 

STHLSLPKCWGLQA*ATLAQPNSRSLIiKYKFIYMSKGHYSVKVE\KQ 
PTEWGRKIFWQITWSG *GGLESRI YKELLQLSNKKTNA* LKSGAKAQ 
HRHFSREHTHMGQQH\QQRCIiIASLIREMQIQMT/LRGYFTIiIRMII 
TRKSDNN* CWQECAEI GTLL / HYWWECEMVHGRGKI VWQFLKNIQLP 

7353 

A 

3 

358 

SISNENFPTKESQGPETFTGEFYQTFI/DLSPILLKLFQITD*AGTL 
PSSFYEIS ITI I PKPE * KSQElTif SAMYLMNIDTKIF\HKILATQI * P 
YINVI IHYDQVG\ FIPG\MQSSFNVIN 

7354 

A 

2 

207 

GRVDHTHPGLPRPLCPHSPDP / PQRM* SHSAVGASGLADPYERI WS 
* DGNS S S PNHQGDGAS QTSGEQ I 

7355 

A | 

1 

803 

QPRDPSSGTHHPDGDRGGQPAPICPRQGSCSHSPLEHAPCPILGPGS 
WPRWHL/DLLSLLGDPKVGGPIACPPGSPPHAWTG\PVACPGLGQPC 
SCTPILIPHPIPCQHPKAPGGHPLCSKLLTCLPATPAVPSQPGHHHP 
LFIMSGLMGAWPSPEAGPGVIjPSWTTAE*GTRHWTHEEEAALGATDA 
VTSP\PGFGRQVPAPRMPAWKCPCIRGLLHSL\PSEKPIiQVHRECVQ 
PCEGLGPLAQTGAPQAQAWGPTDLSPQLPHRKHLK 

7356 

A 

3 

927 

SSGTRHPDGDRGGQPAPICPRQGSCSHSPLEHAPCPlSrLGPGSWPRWH 
L/DLLSLLGDPKVGGPIACPPGSPPHAWTG/PLWPVRAIxASPAAAPP 
ILIPHPIPCQHPKAPGGHPLCSKLIiTCLPATPAVPSQP\AIITPCSL 
CQASWEPGLLQKLAPA\PPKLDHVGPRSQPGGPDVGVPVCTPKSRPS 
TCDTTPTPLPGLNEERATGHmKRLPWELLM/PVTSPLWLWAAGPCT 
*DACLEMSLHSWPPSQPPPREKPLRSAQGVCAAL*RAGTTLPRLGPL 
RHKAWGPTDLSPQLPPQKASKINTRGMESVK 
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7357 

B 

73 

255 

XGSSEQQDTCSNPGPCSSPTGNPEWGLRDTALLEDLGKASGLLLERM 
VDMPANNKCL I FRKN * 

7358 

A 

1 

710 

PIEITSQCGGIPRFAPITVGVRVSSLPHHSRGEGLQPPPSQSG*GSP 
AS/HHHSRGEGLQPPPSQSG*GS\QPPPSQSG*GSPASGPGFCWGQ/ 
RP/PLPSCSYSPGALQSSRTPAQTQGPAHHLRG\SGPVHGGPPAPSP 
SCCLLP/SLRVLPPGLPAPPPATSTVCPPLPVCTPPWVICQPPYPLA 
HQAGLG * LPLGDREGVPFHS PNPSLQVCPR \ PLR\ KPRPWS SGLGPH 
AHNPMPSMGRSPPRATWTLT 

7359 

A 

3 

750 

GRGAEQGSHLARAAAVPGSGPAWRPRPRGGLGLRPARSALRAGLPPG 
RVAAVGRGPALPGQG/HLQSSRTPA/PNPGPCSSPTGEGVPFHSPNP 
S/H/GKSVPGSSGKYPDLGPVGLCPHAHSPMPSMGRSPPR/GKRGL* 
PDAQMQEP/SKWGLSGHSPPGGPGKGQWPAPGEDGGHASQQQMPDIiP 
EKLSPSFTPAHLHSLPEA/PVRHEPCLPR/HGPCGRQPHPVCARHRR 
ARRAQPAQNKAC PAAGKKKKKPVPN 

7360 

A 

102 

380 

GKKIPMLYFAEIEQCILKFV\KRS*KAK*ILPKNNKAGGLKFPDFKT 
YYKAAASKHQVGGLWKELRSTLQAQQTWVPREELRSTSPSYQTQVQ 

7361 

A 

190 

375 

VI YRVS 1 1 S I KI PTTFFTE I EKTYNI/3/ WNDNRP * I PKAI LSL *NKV 
GD I PLPDYE YHLNMN 

7362 

A 

102 

345 

GKKIPMLYFAEIEKCILKFV\KRS*KAX*IIiPKNNRAGGLKFPDFKT 
YYKAAASKHQVGGLWKELRSTLQAQQTWVPREELR 

7363 

A 

12 

380 

DPTVCC^QE\HLTFKDTHRLKVKS\WKKILHANRNQKKAGIAKLISD 
KIDFKSKTVKRDII*MIRASIHPRRLINSYI/WAPNIGNT*IHKILK 
DLKGGINCNTIIVGDFNT\HR*SRQKINKHWA 

7364 

A 

3 

298 

RRSI/NSVAQAGVQWRDLSSLQAPPPGFTPFS*LSLPSSRDYRCLPP 
CLANFLYF* *RLGFTVLARMVLIS * PSDPPTSASESAGITGVSHCAW 
PISFS 

7365 

A 

82 

526 

NHSELQLKLRVA* KMKCTADLSLLEKDT* IKLEENR*QERIMLRAEV 
NEIENKIENINKTGGSSSEKYYKIDNPIjTRS irrk\ * T* ITSFRNEK 
GSWTNPTEIKRTVKEY/YYEKLYANKFraLDEMKKFLKHNLPKLTQ 
KINRKSECS 

7366 

A 

146 

589 

LPQVMAEFRNNPGEVEGRKAKSMKGQTTGKNQDNPVIDEIDFLEAFK 
NIQPS S / LFEASLGLMG I KPVDWEE I GGLEDVKPEVKTAH/ WSLRQK 
SGHCPV/ GARLPTGLLATLGSGSGSGRATEAVSGPAG* KRAS IGGSS 
QRPRRFPTVP 

7367 

A 

2 

1041 

CREAAMHALLHSEKDNPVIDEIDFLEAFKNIQPSSFRSVIGLMDIKP 
VDWEEIGGLEDVKLKLKQS I \ ESPLTFPWEFVRMGLTQPKGVLLYGP 
PGCAKTTLVRALATSCHCS FVS VSGADLFS P WGDSEKVLSQI FRQA 
RASTPAI LFLDE IDS I LGAS / RQPS KTGCDVQERVLS VLLNELYGVG 
LKTIERRGSKSSQQEFQEVFNRSVMIIAATNRPDVLDTALLRPGRLD 
KIIYIPPPDHK\ARLSIYKRLYKKPCPIGPDVSLENLAAETCFFSGA 
DLRNLCTEAALLALQENWTRPQLTVKQEHVF * NHLRL * NRS VKVART 
WLYMKT YLRKKDFLTVEG I 

7368 

A 

1 

2046 

MAANRKDFIHRRCGPELSQSEAVPQGLCRAERSSALAMLSGSVPRRT 
QLRPCKGTQRRCRRGGEADSGGEAVGGGAAERRSAAARRRGGRRRGG 
GEAVGGGAAERRSAAARRRRGGRRRRGGGEAGGGGAAEARRAAAARR 
AAAARRAAAARRAAAARRAAAARRIAESGGEAHSRGEAHSGGEAHSG 
GAGAERRRPRLHSPALKTHPGQSSEGHVEYKIiNHVEFEDQDDEARVQ 
YEGFRPGMYVRVE I ENVPCE FVQNFDPL Y P 1 1 LGGLGNS EGNVGHVQ 
MRLKKHL WYKKI LKS QDP 1 1 F S VG WRRFQT I LL Y Y I EDHNGRQRLLK 
YTPQHIHCGAAFWDIVFMRTWYPVSIPAFYNPVTSLLKPVGEKDTWS 
GMWTTGQLRIAHGVRLKTNKDSL YKWWDTAAS QS LKGQAQHQQG PQN 
GGALLWLEPSGAGTSTATTSLPGE7UDISFSTIVLKALQMSNCKSYKK 
SVSKLLNRRTQR7VPNVHLQILQKECLNLLNQKKCSTRPQSEQNVRLQ 
ILQKECFQTAQPKGRSVCFFPAKAEAGPTETVPAAYSDGVYEVIQAD 
EADISSASRMEVENLGLGPRTVDHAGLPWTVTDSCSHQDPQNRPLKK 
MLPSGQVALADIILINKTDLVPEEDVKKLRTTIRSRSSAGRGSRRMP 


ie>5*4 
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FSVSPLQPYTKCAPCISNSIVESL 

7369 

A 

1 

4340 

MSAFESVPDDEEVDVDAVPENKLTFDNIAEGFRLFKTACDFLYHMDL 
SMNEDIEKEVKEEIDPDEEESPKXKHLDKKRKLKEMFDAEYDEGEST 
YFDDLKGEMQKQAQCLGGRSSVSMQAHRRPVVHGGDSCSDPAAMLTT 
VKAEFTTGGVLQRGGLQQSWQAPIFMDVGEWEQRNWRLNCRKLIiWGF 
AAERS REMKPLLVEE VES KGFE I RERTEGRAVG I KSRKSGWCLGCPR 
RTENQPACLQSDLNSDPRALVIRTSTKVKETLSIiEEGTVKNTDYSEL 
GHMGREQWSSEQPLEQTLQDYTKRYTQQHYTEIDQDGGTQDLKTKKH 
H I P WDRT PLE PP P I VWVMG P PKVGKS TL I QCL I RNFTRQKLTE I R 
GPVTIVSGKKRRLTIIECGCDINMMIDLAKVADLVLMLIDASFGFEM 
ETFEFLNICQVHGFPKIMGVLTHLDSFKHNKQLKKTKKRIjKHRFWTE 
VYPGAKLFYLSGMVHGEYQNQEIHNIiGRFITVMKFRPLTWQTSHPYI 
LADRMEDLTNPEDIRTNIKCDRKVSLYGYLRGAHLKNKSQIHMPGVG 
DFAVSDISFLPDPCALPEQQKKRCLNEKEKLVYAPLSGVGGVLYDKD 
AVYVDLGGSHVFQVSDEVGPTHELVQSLISTHSTIDAKMASSRVTLF 
SDSKPLGSEDIDNQGLMMPKEEKQMDLNTGRMRRKAIFGDEDESGDS 
DDEEDDEMSEDDGLENGSSDEEAEEEENAEMTDQYMAVKGIKRRKLE 
LEEDS EMDLPAFADSDDDLERSSAEEGEAEEADESSEEEDCTAGEKG 
ISGSKAAGEGSKAGLSPANCQSDRVNLEKSLLMKKAALPTFDSGHCT 
AEEVFASEDESEESSSLSAEEEDSENEEAIRKKLSKPSQVSSGQKLG 
PQNFIDETSDIENLLKEEEDYKEENNDSKETSGALKWKEDLSRKAAB 
AFLRQQQAAPNLRKLIYGTVTEDNEEEDDDTLEELGGLFRVNQPDRE 
CKHKADSLDCSRFLVEAPHDWDLEEVMNSIRDCFVTGKWEDDKDAAK 
VLAEDEELYGDFEDLETGDVHKGKSGPNTQNEDIEKEVKEEIDPDEE 
ESAKKKHLDKKRKLKEMFDAEYDEGESTYF\DDLKGEMQKQAQLSRA 
EFEHQDDEARVQYEGCRPGMYVRI E I ENVPCEFVQNFDPHYP 1 1 LGG 
LGNSEGNVG YVQMRLKKHRWYKKILKSRDP 1 1 FSGM FNS ALEVAKFE 
GAV I RTVS G I RGQ I KKALRAPEGAFRAS FEDKLLMSD I VFMRTW YP V 
SIPAFYNPVTSLLKPVGEKDTWSGMRTTGQLRLAHGVRLKANKDSLY 
KPILRQKKHFNSLHIPKALQKALPFKNKPKTQAKAGKVPKDRRRPAV 
IREPHERKILALLDALSTVHSQKMKKAKEQRHLHNTCEHFRAKQKEEE 
EKLKRQKDLRKKLFR IQGQKERRNQKS S LKG AEGQLQ 

7370 

A 

18 

731 

KEELYGDFENLQTGDVH*GKFGPDSLNEDVQ\IQEEIDPD\EEESAK 
KKHIJJKKRKLKEMFDAEYDEGESTYFDDLKGEMQKRSTGKQHTRFPP 
RAPRHVQLiNHAE FEDQDDE ARVQ YEG FR PGMYVCVE I ENVPCE FV * N 
\FDPRYPIIIXK3LGNSEG\NVEHVQMRLKKHRWYKKI\LKSQDPFIS 
S VGGRR FQT I P\ LYY I EDHNGRQRLLKYTPQHMHCGAA\ FWGKI *LQ 
♦IiAYCRD 

7371 

A 

97 

373 

TYESEYFLLLLFLFFRRQSLALLPRLECSVKI IAHS * \NVKLLDSSD 
LPDSASRVAGIAGACHQAQLIFIFIFVEMGSCLSSFLSVGLNQLF 

7372 

A 

1 

466 

LVFFSWCLFVCLFVLKTESHSIAQAGVQ*RDLSSLQPPPPGFNRFSC 
LSLLSSWDYRRA/PTMPN*FYCWPAGLKLLTSGDPPASASPSVGITG 
VSHCAQPRIVSFTSNIILKFKNSLSKGCIilEAYFDKTVKATVIiEYLA 
LRNPKPEEHIWGFLY 

7373 

A 

181 

654 

qnrdvhp aas pnpsaaqg s * rakths \ egpls vpg ps paa/ parsl * 
eqnpk\pkhslp**lssqlgspledvgndtllsrhkpqsppsrgrpr 
pegtrlhfXtepqrlrssrqptitahwlpmpplptppaahacpplp 
thargtsskqaepppsacs 

7374 

A 

56 

482 

TRPTEHLKI;SSAVSPGFCRWPPSAPKAELSPAVPRS\PAAGAATIAV 

VRSRWS PGEPRCPLQTTASVRCAPMHRRPVDGACS PPW * ACRAAAR 

LGPREGPPWDPQSPQSRAEQKMLPVFGRASDAPFIDKKGFFCLISCC 
• 

G 

7375 

A 

265 

2253 ; 

LSILSHCKRGVGCWLFTVALGWDHEQPCAPSTHGRAQPE/GPRVXrPG 
RSRRP/ GTQTLAAPQPEPGECTSQARSGGPVSPHKAGSEAQES*EGS 
EDPQD*ELQQERTAKNILGKRLPQ*IILYCHSHLGEGPQT*NPGERR 
V*PPEFGLGCG*D/STLLLAEEGSGWRGP/PWPAGASTPLEAQAPV* 
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AGGCGPPAELPPQPGGAGHPGR\ PGGGSLGHQAGG * GQ * * PHPSGPA 
*PSA*GPRSRR\PSGGRSQPAVSGPGWAWHCPASGPPGTSSFEPQLW 
LWPLPGLPGQQRPPRLSAARGPQAHGPPRGPFSSPGDRHGPAGGPAV 
HPARHQR*PAVPAGVTQPGAVQLLLPPHRAGPL,QGPGLGSLAWASCE 
G P P S LLLAL PRA S WLQTENWCQWL P PP S QWS CLGNRQ * RPDFAQG I 
PLRGAPLGPRLPWRGLARAPAPGPWGAPSQCQLPWGRHPSGHLSACP 
GLP/GGSVSGPPSGGPAGSPPTGPS\GAPGSAG*GPGMAQGGPQEL* 
GQGPKAGEADSPPSPWPCLQGPPGLWAPLPRALDAPPPAWVYPPRPP 
AS AGG S PC AAPRS P P AAS PAG PAS G F / S AAP AC S L I TLKQDGGGPS I 
SGPASFLQLAWPPGFHPARLHGPSAREARAGSPGPGAGFSAGRGGFP 
PWSLSSAGCPVSPAA 

7376 

A 

58 

414 

RPAGCGSTRGTCEWARPSAAPCQTAAGGACSPWTPAPRSPAGRSWAS 
SSWPG*CWAPPSGHGAQQPSPSSGQWWVMRWAPAS/WLPSGATPGPR 
FLALGTIGVPAGPRSPSPSFPTQRF 

7377 

A 

1 

1548 

DKDNSGPTVLHLT7VRFGHPKVVNWLLHHGGGDPTAATDMGALPIHYA 
AAKGDFPSLRL.LVRHYPEGVNAQTKNGATPLYLACQEGHLEVTQYLV 
QECGADPHARAHDGMTPLHAAAQMGHSPVIVWLVSCTDVSLSEQDKD 
GATATHFAASRGHS KVLS WLLLHGGE I S ADLWGGTALYDAAENGELE 
CCQILVWGAELEVRDRDGYAAADLSDFNGHSHCTHCLRTVENLSME 
HCVLSRDPSVELEAKQPDSGMSSPNTTVSVQPLNFDLSSPTSTLSNY 
DSCSS SHS S I KGQHPPRGLS / KH * SCRHTELHGHAEPRAGPALGHDW 
EAHTPTTPTQLPPATPAPRHPTAPTPT* LP I SQASCGTTGS *HLHAD 
QEQTPPRGDRGPQEGAE/PPATATTGYGGRTPAASPALSASSPARGT 
TTGSWAAAPARRWSHARAWRTARRRNSLGTTCLTAAPRTPRRPGSSS 
CRRRRRRRPCPRPRVRRRRSRLCPSRALALAAGSAAPPRPPAAPSLS 
T 

7378 

A 

2601 

4760 

LTAASSCCYPREGWRYPREGWRYSREHNAILWPFGELMTEADILRIB 
QQSSPCSITAPPTSCAHWTRRPRAVRAASPRSPLAPRSASPSCRRTT 
WRP ALAS PAP P P PTARW PTG S P WTPW ARLRHR I ARRR YLS P S S WRAG 
RPSARNCAASRTTSTCARSASFTSSWSTGASGPSSDRAFRGPGAPRQ 
TAPWRDGRPCWPELEATDAPRLPVSKGEAHSPNERLRQLLRQRQAVG 
KLLHHWRS LRRHVP PS PGLAHGVYWPQH FLS PLDGGAPPRYES LTLD 
LFMLGYFQLPEMGLSREDRKFRHLLCYEMFHRLDSHPWERIRLFHRV 
VLEEVEAGRRGWSDGFEDLRHRFFGNGLEAGPAPEEQAKKKEEKGKE 
QERTEEAAPVQKGDPPKGQREALAPVPQPPPPPARPPARRASPPRLP 
GSLVFQEEEEARLASMPAWRWDLLRKKLSAFSPPGSTQHLRSQLSQC 
KQRYQDLOEKLLLSEATVFAQANELEKYSSHAGEPLLKQDSKQVQVD 
LQDLGYETCGQSKNEAEQEETWPPAPTWSRALPRVAPRSSSRRGRRY 
PERSQRRREVAATAMPKNKGKGGKNRRRGKNENESEKRELVFKEDGQ 
E Y AQV I KMLGNGRLE AMC FDG VKRLCH I RG KLRKKVW INTSD 1 1 LVG 
LRDYQDKKADVILKYNADEARSLQAYGELPEHAKINETDTFGPGDDD 
EIQFDDIGDDDEDI 

7379 

A 

1 

741 

MSEGWEIBEDKEKGKVVVETVVAKEGLSEPLLKQDSKQVQVDLQDLG 
YETCGQSKNEAEQEETTSPAPTWSRALPRVAPRSSSRRGRRYPERSQ 
RRREVAATAMPKNKGKGGKNRRRGKNENESEKRELVFKEDGQEYAQV 
IKMLGNGRLEAMCFDGVKRLCHIRGKLRKKVWINTSDIILVGLRDYQ 
DNKADVILKYNADEARSLQAYGELPEHAKINETDTFGPGDDDEIQFD 
D IGDDDED I DD I 

7380 

A 

3 

5650 

LLEKLRQRIHDKAVALERAIDEKFSALEEKEKELRQLRLAVRERDHD 
LERLRDVLSSNEATMQSMESLLRAKGLEVEQLSTTCQNIiQWLKEEME 
TKFSRWQKEQESIIQQLQTSLHDRNKEVEDLSATLLCKIiGPGQSElA 
EELCQRLQRKERMLQDLLSDRNKQVLEHEMEIQGLLQSVSTREQESQ 
AAAEKLVQALMERNSELQALRQYLGGRDSLMSQAPISNQQAEVTPTG 
RLGKQTDQGSMQIPSRDDSTSLTAKEDVSIPRSTLGDLDTVAGLEKE 
LSNAKEELELMAKKERESQMELSALQSMMAVQEEELQVQAADMESLT 
RNIQI KEDLI KDLQMQLVDPEDI PAMERLTQEVLLLREKVASVESQG 


,:L O O S 4-6 iMi»"3 ,, 0 12 S 6. 0 ic i; 






QEISGNRRQQLLLMLEGLVDERSRLNEALQAERQLYSSLVKFHAHPE 
SSERDRTLQVELEGAQVLRSRLEEVLGRSIiERLNRLETLAAIGGAAA 
GDDTEDTSTEFTDS I EEEAAHHSHQQLVKVALEKSLATVETQNPSFS 
PPSPMGGDSNRCLQEEMLHLRAEFHQHLEEKRKAEEELKELKAQIBE 
AGFSSVSHIRNTMLSLCLENAELKEQMGEAMSDGWEIEEDKEKGEVM 
VETWTKEGLSESSLQAEFRKLQGKLKNAHNIINLLKEQLVIiSSKEG 
NS KLT PELLVHLTS T I ER INTELVG S PGKHQHQEEGNVTVRP FPRPQ 
SLDLGATFTVDAHQLDNQSQPRDPGPQSAFSLPGSTQHLRSQLSQCK 
QRYQDLQEKLLLSEATVFAQANELEKYRVMLTGESLVKQDSKQIQVD 
LQDLGYETCGRSENEAEREETTSPECEEHNSLKEMVLMEGLCSEQGR 
RG S TLAS S S ERKPLENQLGKQE E FRVYGKS ENI LVLRKD I KDLKAQL 
QNANKVIQNLKSRVRSLSVTSDYSSSLERPRKLRAVGTLEGSSPHSV 
PDEDEGWLSDGTGAFYSPGLQAKKDLESLIQRVSQLEAQLPKNGLEE 
KLAEELRSASWPGKYDSLIQDQARELSYLRQKIREGRGICYLITRHA 
KDTVKSFEDLLRSNDIDYYLGQSFREQLAQGSQLTERLTSKLSTKDH 
KSEKDQAGLEPLALRLSRELQEKEKVIEVLQAKLDARSIiTPSSSHAI* 
SDSHRSPSSTSFLSDELEACSDMDIVSEYTHYEEKKASPSHSDSIHH 
SSHSAVLSSKPSSTSASQGAKAESNSNPISLPTPQNTPKEANQAHSG 
FHFHSIPKLASIiPQAPLPSAPSSFLPFSPTGPLLLGCCETPWSLAE 
AQQELQMLQKQIiGESASTVPPASTATLLSNDLEADSSYYLNSAQPHS 
PPRGTIELGRILEPGYLGSSGKWDVMRPQKGSVSGDLSSGSSVYQLN 
SKPTGADLLEEHLGEIRNLRQRLEESICINBRLREQLEHRLTST7UiG 
RGSTSNFYSQGLES I PQLCNENRVLREDNRRLQAQLSHVSREHSQET 
ESLREALLSSRSHLQELEKELEHQKVERQQLLEDLREKQQEVIiHFRE 
ERLSLQENDSSGPCLSLVRLQHKLVLLQQQCEEKQQLFESLQSELQI 
x r*AJLi x bN S KJnAjJj KA Y S L>D ACHQ 1 PLSSDLSHLVAEVRAXjRGQLEQSI 
QGNNCLRLQLQQQLESGAGKASLSPSSINQNFPASTDPGNKQLLLQD 
SAVSPPVRDVGMNSPALVFPSSASSTPGSETPIINRANGLGLDTSPV 
MKTPPKLEGDATDGSFANKHGRHVIGHIDDYSALRQQIAEGKLLVKK 
IVSLVRSACSFPGLEAQGTEVLGSKGIHELRSSTSALHHALEESASL 
LTMFWRAAJLPSTHIPVLPGKVGESTERE1.LELRTKVSKQERLLQSTT 
EHLKNANQQKJSSMEQFIVSQLTRTHDVLKKARTNLEVKSLRALPCTP 
AIj 

7381 

A 

1 

1131 

MKVTFANKPEGGGRLAKQRPPGRGARPRPKHEGGQSVIiGTRRPALLQ 

VoLlLi V oJLioCtt^i^lsJJoA X A X rt r A/VoKLj no JvV Jj»b WXjIjXjHGGE 1 o AJDLi WG 

GTALYDAAENGELGCCQILVVNGAELEVRDRDGY7VAADLSDFNGHSH 

EENEWPSRGQGLVPSAPTAVAQSMEHCVTjSRDPSVELEAKQPDSGMS 
SPNTTVSVQPLNFDLSSPTSTLSNYDSCSSSHSSIKGQHPPRAPNPQ 
ILQYKKRFSELEQLLERSGELEQQQLRDAEHSQDLESALIWLEEEQQ 
GGPGLAAWPPGRAPTDPLCPIQECQPGPGECHALRTAGPGRFGQPGS 
E 

7382 

A 

421 

582 

LPGAQGMPRPGVQ*PSGLHIPQGGSTRTSGLKAMRSPPKNHQVPDSR 
AGPWAYC 

7383 

A 

3795 

5290 

WNWTVHGDGS AGQGPWLHLFCHKRQHGG I SWPLVQREGRDEHFRNQR 
EIAPPAGREQTAVQKPQREMFCNSTGRLPGQPTEEIQV*RV*RPHKI 
YAEE\ERQFKEEKLAEQLKQAEELRQYKVIiVHSQERELTQLREKLRE 

SPENDEDEDEDVQVEEAEKVLESSAPREVQKAEESKVPEDSLEECAI 
TCSNSHSPCDSNQPHKNINITFEEDKVNSTLWDRESSHDECQDAVN 
ILPVPGPTSSATNVS^n^VSAGPLSSEKAEMNILEI^rEKLHPQLAEKK 
QQFRNLKEKCFVTQLACFLANQQNKYKYEECKDLIKSMLRNERQFKE 
EKLAEQLKQAEELRMGSGKYLNEHFVFTENDEDEDEDVQVEEAEKVIi 
ESSAPREVQKAEESKVPEDSLEECAITCSNSHGPCDSNQPHKNINIT 
FEEDKVNS ALWDRES SHDECQDAVNI LP 

7384 

A 

125 

605 

EAEALENQSQPCDTG/ PQSAFS PPGSTQHPRSQLSQCKQRYQDLQEK 
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LLLSEATVFAQANELEK*RVILSBPLLKQDSKQVQVDLQDLGHBTCG 
QSKNEAEQEETWPPAPYLVPRAPSCRPSEQ\SRRGRRYPERSQRRPR 
GRPPPPCPRIKVKDGKNKTQG 

7385 

A 

310 

329 

TVKLL*HVGNISLFPSPSP*MLNTQWGKKS\FNEWWWENWISTCRRI 
KLDPYLLPYIKTN*KSIKDLNCKTTK 

7386 

A 

310 

329 

TVKLL*HVGNISLFPSPSP*MLNTQWGKKS\FNEWWWENWISTCRRI 
KLDPYLLP Y IKTN* KS I KDLNCKTTK 

7387 

A 

3 

642 

GKGLFDDEDEESDLFMELAPQDRQAGASVKEESSSSKPGKKIPAGAVS 
VFLGDTDVFGAASVPSLKEPQKPEQPTPRKSPYGPPPTGLFFFSAPH 
S KPS KTRKVQS TAD I FGDEEGDLFKEKAVAS PEATVSQTDENKARAE 
KKVTLS YSKNLKPS SETK\TQKGLFSDEEDSEDLFS SQS ASNLKGAS 
LLPGKLPTSVSWFDDEDEEDNLLGVQ 

7388 

A 

1 

3096 

MVTHQQPAARKPNMTS KKPKPMGPKAHG I FSGTRKNNLE I YMDQTRT 

GIAKTKLSKNNKSGGMTLPDFKLYYKAIVAKIACGGSMYNSDTDEDE 

STEPS S SGQQI I ENS I TMNKMKLLKAKMKNMNLSKKH ITQVSDEEED 

DDGCDLFADSEKEEKDIEDIEENTRPKRSRPTSFADELAARIKGDAV 

GRVDEEPTTVSYEDDRRGKNQDAYTEGGLSTIKTVNCSSSL\PSGEA 

KLRKTL/ KEKKERRTPSDDEEDNLFAPPKXiTDEDFS PFGSGGGLFSG 

GKGLFDDEDEEDIEESSNPLWSGYSLRWAREPPQQPETHTWSTNEK 

PEQWLKGENWGSVSTPLQKRSRPTSFADELAARIKGDAMGRVDEEPT 

NEEDNLFAPPKLTDEDFSPFGSGGGLFSGGKGLFDDEDEESDLFTEA 

PQDRQAG AS VKEG KLGLGS CV S VCR VPKLS FS LGDGTLRRHGCAAS V 

PSLKEPQKPEQPTPRKSPYGPPPTGLFDDDDGDDDNDFFSTPHSKPS 

KTGTEKPFRKIGMDCDLGDTSSYHTKVSTVHITKNRNGGGSLNNYSS . 

SIPSTPSTSQEDPQFSVPPTANTPTPVCKRSMRWSNLFTSEKGSDPD 

KERKAPENHADTIGSGRAIPIKQGMLLKRSGKWLKTWKKKYVTLCSN 

GVLTYYSSLGDYMKNIHKKEIDLQTSTIKVPGKWPSLATSACAPISS 

S KSNDL S KDMDTGLGDS I CFS PS I S STTGPKLNPPPS PHANKKKHLK 

KKSTNNFMIVSATGQTWHFEATTYEERDAWVQAIQSQILASLQSCES 

SKSKSQLTSQSEAMALQSIQNMRGN/^HCVDCETQNPKWASLNLGVLM 

CIECSGIHRSLGTRLSRVRSLELDDWPVELRKVMSSIGNDLANSIWE 

GSSQGQTKPSEKSTREEKERWILSKYEEKLFLAPLPCTDLSLGQQLIi 

RATADEDLQTAILLLAHGSREEVNETCGEGDGCTALHLACRKGNWIi 

AQLL I WYGVDVMARDAHGNTALTYARQAS S QEC I NVLLQ YGCPDECV 

7389 

A 

1 

238 

SDLFTEASQDRQAGASVKEGDTDWGAASVPSLKEPQKPEQPTPRKS 
PYGPPPTGLFDDDDGDDDDDFFSAPHSKPSKT 

7390 

A 

1 

1374 

DD I FATEAI KPS QKTREKEKTLESNLiFDDN I D I FADLTVKPKEKS KK 
KVEAKS I FDDDMDD I FSTG IQAKTTKPKSRS AQAAPEPR FEHKVSNI 
FDDPLNAFGGQLRSPSRCCQADCCPPDSGLWVWGALVGLGGARAAQH 
ITQVSDEEEDDDGCDLFADSEKEEKDIEDIEENTRPKRSRPTSFADE 
LAAR IKGDAVGRVDEEPTTVS YEDDRRGKNQDAYTEGGLSTI KTVNC 
SSSL\PSGEAKLRKTL/KEKKERRTPSDDEEDNIiFAPPKLTDEDFSP 
FGSGGGLFSGGKGLFDDEDEESDLFTEAPQDRQAGASVKEGKLGLGS 
CVSVCRVPKIiSFSLGDGTLRRHGCAASVPSLKEPQKPEQPTPRKSPY 
GPPPTGLFDDDDGDDDNDFFSTPHSKPSKTGKVQSTADIFGDEEGDL 
FKEKAVASPEATVSQTDENKARAE KKVTLS YSQNVKP 

7391 

A 

1 

419 

MDSSRSVGNGFTYQDKLSSSKLTSVLEAVAGEYAIiVINGHSL/ARHR 
YP*LGQYLSSEHFCPGLLYLVLISPDFREDWSQRGLGGAETPRCLLE 
RLAL SG FS VLQAHALEADMELE FLETACAC KAV I CCR VT PLQKAQV 

7392 

A 

3 

701 

LVKS YLAQFAARAI ISELVS ISELAQPLERGTHFPLFLLCLiQQLAKL 
QDREWLTELFQQSKVNMQKMLPGKRKLTCLiFCFFNIISRDFRYLEWI 
YVGQTV* LSCSDNDKHYHRSHL *RE I YNLTTF I * SRC * VFFICWVG S 
LLCYKDLKYLGDI ICEALLSGQDHLVK* T IR* CRWLCLF \ ESDQNKD 
RMLEILEGKGLSFLFPTPQLEKDLFEQ\IKLDPSPQTIYKWIKDNI 

7393 

A 

83 

3064 

GRPRLLPPPSLPLFFVFRSPFPSLPYPRRPDPEEAAAVAAAEFLGEG 
ISFLLSPPLPTPSINIILLKILRCQAAKVESAIAEGGASRFSASSGG 
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GGSRGAPQHYPKTAGNSEFLGKTPGQNAQKWIPARSTRRDDNSAANN 

SANBKERHDAIFRKVRGILNKLTPEKFDKLCLEIiLNVGVESKLILKa 

VILLIVDKAIiEEPKYSSLYAQLCLRLAEDAPNFDGPAAEGQPGQKQS 

TTFRRLIilPKLQDEFENRTRNVDVYDKRENPIiLPEEEEQRAIAKIKM 

LGNI KF IGELGKLDL I HE S I LHKC I QTLLEKKKRVQLQDMGEDIjECI* 

CQ I MRTVG PRLDHERAKS LMDQY FARMCS LML S KE LPAR I RFLLQDT . 

VELREHHWVPRKAFLDNG PKT I NQ I RQDAVKDLG VF I P APMAQGMRS 

DFLLEGPFMPPRMKMDRDPLGGLADMFGQMPGSGIGTGPGVIQDRFS 

PTMGRHRSNQLFNGHGGHIMPPTQSQFGEMGGKFMKSQGLSQLYHNQ 

SQGLLSQLQGQSKDMPPRFS KKGQLNADE I SLRPAQS FLMNKNQVPK 

LQPQITMIPPSAQPPRTQTPPLGQTPQLGLKTNPPLIQEKPGKTSKK 

PPPSKEELL^TETVVTEYIiNSGNANEAVNGVREMR\AHKHFLREML 

SKVI ILSLDRSDEDKEKASSIA ISLLKQEGIATSDNFMQAFIJTvLDQ 

CPKLEVDIPLGEIPILAQFAARAIISEIiVSI/ SQNI*AQP\IiESGTHF 

P/LSFLL\CLQQLAKF\QDREWVNQELFSTKPRSNMQKMLPEIDQNK 

DRMLEIL\EGKGLSFLFPLLKLE\KELFEANKSWDPSPQTIYKWIKD 

NISPKLHVDKGFVEHL*MTSF\LQLHF*WS*TPPADETDSSSAPFQR 

NS * EQEKQLLL S FKPVMQKFLHDHVDLQVS AL YALQVHC YNSNFPKG 

MLLRFFVHFYDMEIIEEEAFLAWKEDITQEFPGKGKALFQVNQWLTW 

IiETAEEEESEEEAD 

7394 

A 

259 

1159 

EARWQDVGRLQLTKTV*PWGRGWSR\ASDPRXiHAYFSYTAATGKAEP 
RARTPAPCK\ ARERLGSCLWD * SPPPPPGWRPLRYPTPRRAGPAQFG 
KK\ RS AS ARSTWP * / VMQRLAPKKTRKEQS ANDHP I GGPEGRJLFTSQ 
JjQIiKFRAL S ER I S WL»E VSRAVTPTS AAVTS TPSTS KPRQKRPTNSQS 
RS AAKPTPGRRCQPGGRCPTCtRS P * LR *ARGRRHGEARRLPIiREAR 
RLPLREARRLPLRAPRS FRKVPS PGQGR S APQAGVGAGTGRRRRSMD 
AWGRARSLAGSAGGVI SLGGR 

73 95 

A 

3 

433 

HIKRVKJ^HMIISVGTEKAFDKIQHSFI IF FKFRKK*EWGHFFNIjIK 
SFS*KAn^ITLSGDAFSLILVIRQGCLLLPLLFDKVIiKVLDSAVKQ 
EKEIRDI*VGKEEIKLSLFEDDMIVYIGNNK*/SYRKLPEFINEFSK 
VAG 

7 396 

A 

3 

504 

AAS TGGG WVS E FE S VA PG S PD 1 1 AAPTC PQRT PR S LE / L PH FQS * TG 
PCLNPISPERVSEPGQNRGKPRPHPYPGPNQWPGPA/GGCWSKPGST 
PALSDDQANTQVDRN/ DLQVASEGEGA* PAELFPKKMKKRLGDLGPG 
FEGKNGLSTEDKNLIPQAWVPALCGYHVL 

7397 

A 

376 

419 

VRKYSLCFYIL*NLFQNIKAEGILPNSFNEARITLILKQTRHSKENK 
PYRPIS\MDRHAEILNKISAN* IR*RMKRI IHHGQVRFS *G/MWGWF 
NIRKQINV/ IHHTHSLKKKNHMIISINAEKEFDKIRKLPKLRNIYKQ 
NI*LTS*VMVRNS 

7398 

A 

205 

1464 

VYKK^EASRNEKGWVCGTHAGAFPDEGEGPEEAAFLGAETAAGAP/G 
ARDHPPFNHGYAGLLEGPDLQVGS PHSPGRRQA* VCG* RAPQSDCPG 
GSGRGSMGEAGMPPR/ QGPGGLQPPQEGSVFAVMYS IGGQGAVRRI P 
VRPQP \ GREGVGQGSLQPMKPDGEAAS I PTAGSRSGRCGPAGRGRRA 
LVGRSLRP*PPC*PSPR*SPPPSS\GASSPGHPRPALAHNG*SGWAT 
RGS * *DTAPS * HIiLVPKQIiAGTCLIiCRCPKS T * * AKLEE I VPGSFHS 
PRGSYSRGLGAQHQLPSPASVAHPRCQPAPSAASH\G*RRPRPACLS 

t uf^r^x PiyiTU* o * 7v r> o i"*^ r/^ 7\ riPUTunnp t~>t~\ x c* tv "en a nnrc *t Of\T fv* t> 
JUrltryiiQjWX b w AFoOVyAytali lxiFFCaPDl SAbRAPAGb * Jjoyiiyo*' 

G*DSGGLGSGGCAVPPVAAGHPGSGLAWQLPPPR*SPAPGSGSPAGR 

P 

7399 

A 

2 

681 

EAALTDLRVQARSQPGTMS IGTENRGSGLS AVAVQEVI SDWEVRLA 
NRPGGLEVLLNQEVLSFTEQSWMDLKGMFLSVAAGDRVCIMLASWAV 
LFVSLLGPFLSVSVLtPEKFLTHTHGLLGTLNNDPTDDFTLHSGRVL 
PPGTS PQELFLFGANWTVHNAS SLLTYDS WFLVHNFLYQPKHDPTFE 
PLFPSETTLNPSLAQEAAKLCGDDHFCNFDVAATGSLS 

7400 

A 

658 

1657 

PGILSLRNALLTGVDCKWVPVPPGHTHPQLRLFHFHPKTCSGWGEW 
GDHGWWAGPAPA\YHLHRDVQALWTNDHALAW\P*AMTSERTLWPG 
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HELS ARPGRS WRI S CPTGlA PQGGDDRTRGHWVTATCCSADGLRLCR 
YGD\ GVRGGLPTVAWP * LTGSCSLRYGSGQQCCYTADGTQLLTADSS 
GGSTPDRGHDWGAPPFRTPPRVPSMSHWLYDVLSFYYCCIiWAAPDPG 
KDRAGLGCTPT*PSTLTLASAFGDPHFVTFDGTNFTFNGRGEYVGMG 
*NQGGRLEPSGSRSAPTTAGTETRGTGLTAVAVQEGNSDWEVRLAN 
RTGPER 

7401 

A 

1 

423 

RPKPKNSF*LGGEKANKIPHPLMIKTLTKLGIEKTYVNIIKVIPNRP 
MAC 1 1 INGTKVKAF PVR FERQRC S P S PLLFN I GLKVPARA I EKEKEM 
KGI*IGKKEIKLCLSGND\MTLYLKQPKNSPKTIRTDTFSKVAGYKN 

7402 

A 

247 

660 

NWSGRRLRMWPS AALS P AVS S PALALTS PPKPLVKGRDGKPPAGRPP 
GREGPGAWR*GPSPPTTLGPAKAGSGQRGGRCQP*GPGWRPAASQPL 
SGCPWCPG ITELLLES PSETAEQDRGPGS GEDS * TPDF YTPTRP 

7403 

A 

229 

1047 

HSVQKAKADKTGVDGGSECGPPLLSLLQSHPRP*P*PLPQKPPEREE 
VWPKDGKLLASRAGGLVC I SLL * EPRVHSCTGPGLPVKAELGLLRAQ 
GLPHPAS PARLGNTG/ VGLATE * TG I SEHKLGPLGN * GVMGDAVR* G 
LCSEGLCPPSRPWGPMGVCGGIiAAKARPH*QG*VCAPGTHLEGSSVW 
SLPSPQLFPTPGPPAVCSALAPAPTPARTDGTIQNHLWPVTRKSALP 
VQEWGGDSWGTGQAQEEGPSVGYPGCP*LPGEGRIPPL 

7404 

A 

595 

626 

CPMPLVPPDISLINEQTEKEAGDLVCPS/AGPVPDSFPLGSEPSSHG 
VSELKTRAKRQRERHKKGGTEKDRKAERGT/PPGPLLPFLFVH*SG* 
CQVAQGALDI ISWLKI I VPKS VFPGC PLKS WAAAATPCS PCHPS SHC 
SPVGVPGRASKFCACPLECP*SPPSPI*PSLQGPHLCPPPFHASPPDG 
NPGGQGVPLCPLRGS PGHLCAMS RCPLRWNWP 

7405 

A 

1 

603 

EMESRSVT*AGVQ*QDLSSLQ/PLPLHPGFKQFSLPPVSPSKLGITR 
HTAYLNFFVFLVGDGDFAMLARLVSNSSSQ/CDPPSSASQSVGITGV 
SHGACLTCSLCCQTMCFLVASRYF* FWDGNLFCYNLQIGDVYFI FI I 
YLF\MRPSYSITQARMQWCD\LGSLPSPPPRFK*FSCLSI*PSSWDYT 
QHAPPCLANFFVFLAEM 

7406 

A 

211 

389 

PNLG * LNPPPPGLKNFSCLGLP/ S KWE * KGQCLPPGPVNFGFLTKLG 
VFPCGPGWLEIYN 

7407 

A 

3 

326 

LSGVMLSPGFPGYYLFST*CIMDNKATHRYCVYISSS*IFLQK/HIH 
DYLEVRRGSSETSTGIGRLNGPQIPCFIiliSTTHETSLYFHSDYSQNK 
QGFHIEYQGERSNKK 

7408 

A 

1 

555 

SDPRTETESHKMLTD*RSSYTjSLTALPF/PLiPKSEAITRNRASRAAS 
LVRLTPSKKDVGFLQKPRLRSCSPSSRCPPGSAARST*AAGTACGCC 
GTSPRQR*APGALGLRRPPQS*GRRSSPRPSIiTHPSSSAIjPPPPPTS 
PRTRYRG*SRAAPLLSLRLPSGKLFQPTCRPRVSLR*RPSQRVB 

7409 

A 

3 

379 

KLSPSKPILASTLMGPIYHSKVKF/IPGMKGQFNIQKSIIMTYHINI 
TKNKNDMVISSSSSSSSSSSSSSSSSSSSSSSPIEGNYFPITKE/YE 
KS I ANI ILDS * KLKAFS / LRSGKRQG YLLSLLLFN 

7410 

A 

1 

411 

NKVFHNNKRSIHQENKTILNVHVPNNKDPK*LK/ QLTKLTEDTDSSI 
IIVKDFJWPLS/TL/DRISTQNIS*DLENIiKNTMNQLDQINIYRT/L 
M P SNTK I EHMMGHKI S F I KFQR I KI I QNTFS ENGVTKSNR YLENPKIj 

7411 

A 

3 

414 

KGQDLYGEV*KVLLKVIKQVLNKWKDMSCSHN*HTNFISN*NCIKVF 
F * LDE I / CS FV * * NKC * RIAKEVLiS S SQ * CLGGHL/ TS PHIRRS YNT 
TFSQSIWCCYRTKQID*WTRTQNPEKDLRI*GDLIYDLRCCSSSRA 

7412 

A 

3 

333 

L I KEGQ I QKLS AKNGTPQDRHLFLFNSM I L YCVPKLRLMGQKFSVRE 
KMDI SGLQVQD I VKPNTAHTF 1 1 TGRKRSL \ ELQTRSSRATI EKHKQ 
NSETF\ KAFGGRLLARD 

7413 

A 

3 

392 

EEDIVNPANDLIKEGQIQKRSAKNGTPQDRHLFLFNSMILYCGPKLR 
LMGQKFSVRENMDIS^QGQDIVKPNTVHTFIITGTKRSIiELQTRTE 
EEKKD \ W I Q 1 1 PAT IGKHNQN\ GETFKAFGGPLNQGW 

7414 

A 

3 

1811 

R Y I D VLGRRY TAPGLG * LWRHPAQSAVPTAHCGDPVSLAAAGDGS PD 
IGPTGELSGSLKIPNRDSGIDSPSSSVAGENFPCEEGLEAGPSPTVL 
GAHAEMALDSQVPKVTPQEEADSDVGEEPDSENTPQKADKDAGLAQH 
SGPQKLLHIAQGFLAP\EETY\VKRLHL\LDQ\VFCTRG*RMRGIPP 
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ERSSWGHNSLNIFLPFQPASHGQFLLPELKTRITEEWDTNPRLGDIL 

AVT 7A TDT?T T<TM\T t T7 VTTTnsTTTT^'D TAl/flT VCTUTniJCDT T?irn\7\7TJGTrwrM7^rr» 

yj\Jj/vx-'r J-iiVrll Vjji x vlsJNr jJK/ivtjJj v o 1 W lyKorbr zvUV VHoXUJvv« VL- 
GNLTLQHHMLEPVQRVPRYELLLKDYLKRLPQDAPDRKDAERSLELI 

CI T 7A ANTT q u 7\ a t p inn? TCMTT "KT .T ,PVYT?OT .HHP PTl T \7>JD7ATJT?T.T VIST'S OTP* 

KLSAKNGTPQDRHLFLFNSMILYCVPKLRLMGQKFSVREKMDISGLQ 
VQDIVKPNTAHTFIITGRKRSLELQTRTEEEKKEWIQIIQATIEKHK 
QNSETFKAFGGAFSQDEDPSLSPDMPITSTSPVEPVVTTEGSSGAAG 
LEPRKLSSKTRRDKEKQSCKSCGETFNSITKRRHHCKLCGAVSPGRG 
GQGQEATLSSALTEQLSPHQQHTSHAHQPCTEGLPDPGSWI 

7415 

A 

189 

392 

KQKWE KKQL KKS PG PGAAAHTCN P S TLGG * GGR I TRS GNRD Y PG \ QH 
SETPSLT*NGKNX»GRGMVGGR 

7416 

A 

2 

325 

SHAGCL IR FWRKSMTPTHS LPLTPTFLGTCEAS FLEPRAS PVPPQCS 
MALRRYRLDMGQSFWGGIjPSSHPPDPSRPGFV/PGVGHVPGQEGPGG 
KPAPDSS*H*DPTGG 

7417 

A 

9 

1431 

NDNASFGEVYYHPYPPVGSQCHDESGAGFPPGPSCPGTWPTPALKPG 
REGSGG WLLGQAS P VTTAPCPAC I F * VPCCTGEGQGWLGAPGKDASH 
/GA* KM*GVKGRLCVGAPEGAG/ PSQGI * S FFTEP* * AHPQNGLYED 

JattAJNoKVVljXifc'riJr xr \ w y WXMWJJK1 o IroKr 1 ' V IrrlFW VKLAuL&rAu 

PGPHRTRTPSQVGVILFLQKRIKQPACEPYALFTPLSPHSS\PASAR 
PGS PSD *MCRAPEDSCPEMALLPGMGPPGSGERCS RAPS PGAQVTGG 

SPGAQVRAGVPRPITWGSGESRCSRPPSPGAQVRTGV\ PGPFTWGSG 
ESRCSGPLHLRPGS\YPVFLAPERKGPPQRSQEKGHKRGGSQAAPKP 
SSR / POES PS PLTPSGGPGS \ GRNGA* GVSEG * S APGSGGEEEDVFG 
C * GPPALGPEG 

7418 

A 

1 

390 

HLT FCHHHYDDRVRGL YKGQKEC PG I PR VYR I RHIGEGRHL YVLQFK 
DHPGIHQPLEPQVKYVGNMHSNKAFGR * LMLQLSEFLCEEIRNKNQR 
IVQLIQDTRIHILPSMNPDCYQGG\VAKGPNKPWLC 


TV 

J X 

O *4 

KT-TMT? 71 q P P P q * K\C CCC P\7 T TTK'fSP \ VT" VT'PHT^n^/TJOr! * nDDVVflT? ClflU* 
X\jXWlvM.oir rr o x\-rs.o oor v x X \ l ±rrl X SJ V Jr\^vs y^ttr f ± i\.s^rtvj\j x 

SSSSPFSFIFSYSIVLSSSR\PPHPGPTSPRLHPSPRSHIiSEVPPLT 
QVPPLQGPVPHPDPTS PWLYLSPS SYLY 

7420 

A 

2 

704 

TDSKCPFLVHQAHAAM*KERHFLTANGSCIKFHQDINRLLSSVFLP* 
ELVVRHCRGHQKKVDEIAJCGNRIADQRAKSAVRRPQGPKTLEAPLIW 
EGYIREIKPQYSP/DRDRMGHLSRYTFQPSGWLQSEDCKVRLPASSQ 
W K T I. K T IjH * A FHLiG KHKT SVH PKTVLRRKQTKT^GnTGP* YQ*TJPT.KKf 
NPF1^*LLPHHNQRMGSYSGEDWQIDFTHMPKINGIQCLLEWVHTFI 

7421 

A 

2 

707 

FSRDEVSPSWPGWSQTPD/LRDTP/CLGLPKC*DYRCGPPHLABSKF 
YFPII * KLYHSMLCL* ADSKS *KPMSLFLLFLFLFFLTQNFALVAQA 
GVQWHGFGSLQPLPPGFKRFSCLSLLS S WDYRCLPPRLA* / FFVFLV 
QMGF\TMLVRLVSNS*AQ/CDPHALASQSVGTIGISHCSWPNVIIYI 
ITTM*TFFSIPPSPLLIRH\ LPPLFPPPETESPSVA* AGVDWHTJTiGS 
LQPPRL 

7422 

A 

33 

111 

VMGPA^AGEQLRGATGEPEVMEPALEGTGKEGKKASSRK^TLAEPPA 

KGLLQPVKLS RAELYKE PTNEELNRLRETE I LFHS \ SLL.RLQVEELL 

KK\7RLSEKK30DRIDAFLiREVNORVVRVPSVPETELTDOAWL 

PI»\ PKCPMP * RACFR\ FLAPKPRITVCEAS YLSGQPS IRPDINVDVA 

LTMPREILQDKDGLNQRYFRKRAXYT^ 

NGCHLKPSLLLRPQ 

7423 

A 

74 

576 

LCWQPIRWWSPTLRANGHQK* / SGVAILISDKTNFKATAVERDKEGH 
YIMLKGLVQQENITILNIYAP1TTGTLTFIKRLLIDLRNEIDSNTIIV 
GDFSTPLTALDRPPRQK^KExT^Ll^TLEQMDLIDIYRTFH 
YTFYSTAHGTFSKIDkRIGHKISLNTFK 

7424 

B 

158 

34 6 

XLSQGEWPQIPVKSGGPHGAGVLGLHLEGPFISREKRGAHPEAKLR 
S FEADAFQDLLATQGQX * 

7425 

A 

1 

1770 

FFFVTFVMYFTTI FKKQTKTNLKSAS PTPQNHP/GSSSIiAQ/ PLSHP 
DPGPLSHNPGLAL,WDQPSLPAPPRPSSQGGGAIjCGEWGQ\PGPKAGP 


/051| 
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GLTSS/ GDWGVPSEVGDDQHASSGF* / P*GWGTFKH*QPASI#GTGRC 
STSNTSAENLTCLSLPGSPGKTAPLPGPAQAGAGQPLPKGCAAVKAB 
VGI PAP\ PP \ PGGQDPHPS PAPLG\ S PGACGLRSTPCALPQALPQAR 
PCPGRWFFPWTjFPFOTC^APnWPTQT. , nPf2'nTQP>XWZiT r^Trr^trDTC a d 

A/RGAYLLCPRTAP/GKQGGREPPTPELKWRESjQDNHP*RGAAPIPD 
V* PHCGCPQSQEGSRS PPASKLFQGESRPVFATWVCSRAKGTQKMAG 
*H * WGEQGTLVPRHTALQHQAGS PQPASTGPGDV* RDRVIEKRKQNA 
GHPWHSGLGSPGS\KNRGPGPSLPPGGQSGAPGNCALAQEPGCGF/T 
RCTPQTQEQKANP * LSTLPVTGPLP AGS SPPAHGGFTPW AP AT * LPA 
PPSH* *HFPGVCRSPVP/PSCRQALSPTGAQGPG*TTAQPSHPRP*M ! 
PSPGGQP\GWCHM*AQAKPGAKASPAGCTF\SQPPPPGRR 

7426 

A 

36 

62 0 

QHLKLLNI KNHTGEKS * KCKECDKAFKWLSHFI IGKIVHTEENFYKC 
EQCGKTFNECSHLIAQESIYT*KKMYKYKDHEKAINIYSHL/NHWRV 
HT**KEYKCNYCQKIFQ/ININL*SEEGIYFEDEHYKCEEGCSTFIC 
ITDFIVHILY*KKTLK\CCSNFVQHQGIYIEEKLCKFNKF*KTLFQK 
VQLRRHQSSY 

7427 

A 

3 

276 

PKKSTEKLLEVXSNNSKFTRQKANVQKSITFLQYNSNEQVKFKM*NT 
LP/YKMKYLGISVTKHMQDPYEEKYKTLMKDNKEEQNK*RDIPCL 

7428 

A 

125 

456 

LADWAHRAPAS PORNGADYAVY INTAOE FDG SD S GAR PDEAVSWG K T 
RVDAQPVK/VKRWLGGA*GLWDDECGSHGLPRFPPTQVYAAASLVFP 
LLEAETFAQKMDAFTHEK 

7429 

A 

1 

571 

LSGKEVRENGIHR\ IGNLlA VPN\ ENYCKFEDWLMPILDQMVMBQNK 
RGG FGLKK\ WT P S KM I ARLG KE T NNP E S VYYW A r>K"WTJ T PV P Q P A T .TD 
GS LGDM I FFHS YKNPGLVLDI VE AERG / AD YAVY INTAQEFDGSDS \ 
GARPDEA / VP PWGKI RVDAQPVKVYANAS LVFP \ LLVAETFAQ \ KMD 
AFMHEKNED 

7430 

A 

1 

311 

RRVSSESRWRSLESRGRSLEIQGRMERPRCQRFRENSCVAPRHCKGP 
GGKGLQALF * VPVAQLGEPGAQLGDPGAHGEATVPEVQGE / PAALLP 
GTAK\ PGGEGS PGFI PPRHCRGTGEEGS PGFTQGRETS KAQQRTGCL 
LSCRGCLCLSVSGDS 

7431 

A 

250 

504 

DDEKKSKIEE\VWVGSDEEEDNGKDKKKKTNKIKEK/YIDWEELNKI 
KNIWIRNSDDI \TQEEYREFYKSFTSD* EDHLAVKHFSVE 

7432 

A 

3 

542 

KRRGSFKMAJSLYQLPDESSSAJCAiVSLKEGSLSNTWNEKYSSlXJKTP 
VWKGRNTSSAVEMPFRNSKKSRLFSDEDDRQINTRSPKRNQRVAMVP 
QKFTATMSTPDKKA^QKIGFPXRNLLKLS*P^*WGIL/AKWFYSNI\ 
DKPLFEGDNDFCVCLKESFPNIiKTRKLTRVEWGKIRRLMG 

7433 

A 

3 

322 

FFLRWTL/DTVTRGGIQWCNLGSPQPP/PPRFKRFSCLSLSSSWDYR 
RPPPCPA2^FLYF**RilGFTMLAJ^LVI^NP*PRDPPASASQSAGITGVG 
HRAWPMP I FENRFDL 

7434 

A 

22 

475 

PRMISCRPTRPPTRP*SHSAPFRHFLLVHPVKAKYFLSAVMTTHHLA 
SASHFPPFAA*SATFPAAPQPMWTSDLPQPPSTCLPLGTSPLPSRPL 
LPVSSCS/SFPDPTQMPPLPST\FPGLPFL*PHSSALSL.PLL*LPHG 
FALSPSSVPFPT 

7435 

A 

14 

362 

LFYQTFSNVN* KT/ RHKMNK* TEDFNNTI EQLDLTVTF I SLHSTVE Y 
AILLITHATIFQIDHLLS*KTHLSK/CSADGITQIIPNYDTIKQEVK 
NTNMSKNK* KLNNKFSNTLLFKRL 

7436 

A 

1 

442 

SPFPPKNFYFSLRPLIFLRGVGPKLPPPKKRVFSKNPPEVFKSPPLK 
KKNFSFPPPVNLGPPKDLLKG/PPPSSSSSSSSPSSSSSSSSSSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSPDRERPPTI 
*N*PTRP 

7437 

A 

8 

488 

DFA^SLVAQAGVQWHNLGSLQPLLPGFKRSSYLSLLSSWDYRCPPPCL 
ANF\ FI FSRDGGFTML\ ARLVLNS * PQ/ CDLPASASQS WITGVSHC 
AJ^\RAPSF*MSASGVTICVGWELFGMSHPVA^AjgDHLSGAL.VPFSAL 
F\ SHFMPGLLPTMFCPPLRGCSE 

7438 

A 

1 

445 

IDCWHKCQDNSMGKDNLFN* S YWDYWI S I * KKRTSNL ISP IHKIES / 
RWI INLNVNSKAiFRRKQRKTLHDCGLGNSFVEETQKALTN/WDI IK 
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LQ/ SFCSSKYTTKKLKR* ATDWGKI PTLFAKNNNDKICEHTHNKRRY 

V* /"MUTT Y1 TT/T»TTil * 

M FG QMHVKWWA 

7439 

A 

131 

763 

PGISADSRGQQKLS*WGPSHQPPPGPLKGSAPGG\LCGLVGPGAAKH 
ALRPS FHSGAPS AGNLGCFAAWASS LNSNQGS PPS S KGHI PKDRGEP 
RGTVPSLiRKATATGRKAERLSDEGRS SVHG * AG I PGHEGNAKENRDR 
HPWQLPGIASAAWIASGRGALPPGGQAGPQKLAPPFSSLSVPGGAR 
WG*AEGAPSKQRGKVTGEPGRQC 

7440 

A 

47 

265 

VHTVCARHTAPHPRAPAASPGGHGDCPQPNPPTPQGGAPARSMQGWA 

O DP DO PTJITT/TV T?T t, P/r*/l/ r * f P P TJTJ"KTO PPT 

7441 

A 

1 

330 

HHHLAPFSSLPFNIVIj/EBVIjARPINKEAK*KAP*LERKEVKLPLFA 
EEIQEMIWYIDNSKESTKE\LLELINKFIKATRYRFHIQ*LVFLYTC 
NNLK* KLIKYLS INLTKV 

7442 

A 

3 

423 

NNALLNN PWVKGAI PKEVRP YLD/ TRENKVTA* QNIiWDALKK* / CSG 
KL I ALKE * CQ I NNLSFPLKI PEK* EQ I KS PVN I KKNVI FKKLKIKKG 
LKV* KK* RKISETKSWFLEKVI KI Y* SQGRHARKKNEKSGITRDSRT 
RG 

7443 

A 

1 

689 

YIYIYSYINIQFVLFL*EHLLINKFLSFVYQLML 

LIRTM* AFLFYYQI ICI S YVIAAVLFCI * I FFP *HLCLIjWIjFMYFF 
NILRFIPLLRSDITSFHYMPLFGRFLSC*SKQEHLHNCQPGIYFINL 
LVWKHFFLDVI FTLVYYFVG/ FALI LYDFL IMS VC * TFYLSLF I SR/ 
ACLLFYPHI*LIFIWAYISRLEIISFHTF*GIYTFASEIVFIIF 

—1 A A A 

7444 

A 

3 

C A C 

545 

TNttiLFTLQNRCWTSSHCPMPAPALCPGKXiTVGTGSMGPR\SPWPPPS 
SANA*PVDKPSGQEG*GQCTAAAPPCRVAVSLSKVRAPGDGLSCGSS 
CPPG/RQGRGVPCPAAGERREGREDLKPRE*IGSCIKNKIiGISPAK* 
D*KNSQDKSKKNPIGD\RTSLAVGTVPQSTRPRVPLGYHYCY 

7445 

A 

243 

768 

NLPNK*QDEGRGGGAEPPPPWPG/SLPPRDVEAJ>SPPRLGAPAWPPA 
GRGWPPREGAWLG*VLCFRKRSPLPVLGVAGA*ASVGGGCVGRCPLS 
GAWPPPGVGDKPSPYCSERPAASFP/PRTCVFPNAEGGGERAPRDSR 
GEGSLPCCRGEERGQGGLKTTRINRFVYQEQARDFTS 

7446 

A 

596 

1335 

LEQITFSVATA^^DSRFPDFQDLDIJEPERACHDAAGTLEVDFFLFLNF 
GTVDLAGRKETTTEQALPPALLLGIRRPEFLSRPSVQTEDAPSKVNY 
VLCLLIAECSPGPSDYI.PCAIPSAYR/RWYIiDFCPR*FLiPPYY*RRH 
PFQWTTVTQEAFSHHDVAFTSTPVLFYPDSAQPFIVKSESSSQIAKA 
AljoyUKFoJbl YiiKALiHr t b*SLQRHTIDLDQGIF* ILMIiSEERQHLF 
ESSIGPHPTISKD 

1 A A *7 

TV 

T QQ 


J2.KS FS KKM P * N P PGGGE PP * PTLG I PS S PPPHLLGPKFLEKIFGLPG 
LPGGKIFP/P/GFAFGPNIrGSGGEIPGGINPRSLPLGER 

7448 

A 

154 

695 

GLLSNIiL I S I FF I PPKKS APKYQHKMHSQS FHFYCFNTHFNFQLNKF 
/ r JNJ&JLiy/ JS. w J^1jJ^e.*&K11JjiNQNKPSVTYR 

VKHKRTWQGLKQG V/ YWGF YVNVIGKS FFYFWDKYF I F I DWAKNTLR 
IhWIGPERPTVFLPNFIYSQSTKLALNWVINIMGQFIPSYSVSA 1 

7449 

A 

203 

443 

FFVKGP KNTHW / RKDS I VNKWC * KN * I S TCRRMRLDTCLSLYTKIK* 
KLRPIEESGTEPR 

7450 

A 

8 

312 

QTLALSPGLGVQWRDLGSPQALPPGFMPFLLP*ASQISRDYRRPAN/ 
LMPGPNFFIFQAETGFSTVLAPGWC\DLPDLVIHSASASPSAGITGV 
SHRCPARTF 

7451 

A 

1 

2554 

MKTTOLLLGLLLVHEALEDWGQHLPKN 

KIPKMKDRBSANSAPKTQSIMMQVLDKGRFQKPAATLSIiLAGQTVEL 

"D f~~* V/ - ' C? P TPUCVDR VT PlTPVTiCDT C*T "VTTvT O T> T T T5 - C^^\7Vi~*T^Utr^C/*MTV^7XT 

JVOoK lOWo I FAX IjU I J? JvUo 

PPTMARVPQVEPPVGETWESSTLrjADGAVDKRSSFALCSSFISTWKL 
ATLNDPVHLLSLVCTWTLRTAGRSRTARHVAVAASRCGHLTFAWQNQ 
YISLERLSVYMCRFVTWVNCMSRGFGVDYFIIQVLKLIQVI#PPDSVH 
PRSHKHLLSLRSEIAKTTS YCPKHSSLS I PLPRNQMRSGVSVTLEQR 
SESTSSPFPVTAL.LFVNLTSWLGKGPILPKYVAVSVQL.SRAGACFSD 
VTHRE VLMS FC YMWGGD FLLGGS L VCQVW FWFWSRG YTLL I F S E S 
REYLKEEEELFLCKGPGYAGQRVKKNDDPSSVKQNERYGQLTLVNST 
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SADTGEFSCWVQLCSGYICRKDEAKTGSTYIFFTESGGRISVLVNLV 
LTAQELLAHEPFDDVSIWSAPNLKHHLTVTQRRQVGKKAKKNYIjFQE 
HCGENTRKHVIRIGVSLALKISLCKYCNIPSQNDPEKQRIQCTMPKK 
KSHQHQQKYPQKPSPEFLACPKDRIEKGELFVPSPSYFDWYLNPDR 
QAWPC/PGDRAVGQSHAPQGIPSQGDPSQWNGHCL*HEAGLCVSAT 
S FRAPGCGLLQGGGRGQ I SDLRQVPAALRGG PNNS SKP AFCS S CHMQ 
APW*V/LAALSRRPQFPPLPSPPLPSLLGHQVTGIPSGPPSTTILAS 

SNKVKSGDDISVLCTVLGEPDVEVEFTWIFPGQKPPSRVTGLWAGTW 
WCLYRVL 

7452 

B 

X 

2586 

MKVWLLLGLLLVHEALEDVTGQHLPKNKRPKEPGENRIKPTNKKVKP 
KIPKMKDRDSANSAPKTQSIMMQVLDKGWFQKPAATLSLLAGQTVEL 
RCKGSRIGWSYPAYLDTFKDSRLSLYNSRGLLREGQAGPMGSQPKVN 
PPTMARVPQVE PPVGETWES STLLADGAVDKRSS FALCS S FI STWKL 
ATLNDPVHLLSLVCTWTLRTAGRSRTARHVAVAASRCGHLTFAWQNQ 
YISLERLSVYMCRFVTWVNCMSRGFGVDYFIIQVLKLIQVLPPDSVH 
PRSHKHLLSLRSEIAKTTSYCPKHSSLSIPLPRNQMRSGVSVTLEQR 
SESTSSPFPVTALLFVNLTSWLGKESREYLKEEEELFLCKGPGYAGQ 
RVKKNDDPSSVKQNERYGQLTLVNSTSADTGEFSCWVQLCSGYICRK 
DEAKTGSTYIFFTESGGRISVLVNLVLTAQELLAHEPFDDVSRNFAP 
NLKHHLTVTQRRQVGKKAKKNYLFQEHCGENTRKHVIRIGVSLALKI 
SLCKYCNIPSQNDPEKQRIQCTMPKKKSHQHQQKYPQKPSPEFLACP 
KDR I EKGELFVPS PS YFDVVYLNPDRQAVVPCRVTVLS AKVTLHREF 
PAKE I P ANGTD I VYDMKRG FVYLQPHS EHQG WYCRAEAGGRS Q I S V 
KYQLLYVAGWMFSFLVMTLTENPSSPFQPNPSIVHSGAASVLDQQSQ 
QFIQACFLLFLSHAGSMVSAAALSRRPQFPPLPSPPLPSLLGHQVRG 
± t? t» F P fc> r r I LiAS SN KVKS G DD X S VL CTVLGE PD VE VE F TW I F PG QK 
DERPVTIQDTWRLIHRGLGHTTRISQSVITVEDFETIDAGYYICTAQ 
NLQGQTTVATTVE FS * 

7453 

B 

520 


M TTIPXTUP A T?T RPPT PUTT T T /-» i.ti t ir»T T x 

WiUL Vr LaAiiJUAJiSLSr ILGILiGFTFLL* | 

7454 

A 

78 

624 

TNKSQGSQGLCVAGPRACPQSSPVQQKRNVPPLSILSAPFSGS*GL* 
PCGGVLGATGGAELMGLLRSWCPPGVCTVSLGQQVGSLSLESEPPAS 
SLNNSRGLLREGQAASVGGQPKVTSPTMARVL*VEPPVCETW\GPHP 
SWQMGPLPRG/VWAGPPQTPPCGVTQLAANGKQPVRKPGSYP 

7455 


3 

ODy 

X o f KjN LiO F F Kfc. r i KR P P S S S S S KKE F P F F F PGE S KGG VL S PRKG S 
SSRVKPFSAPTPPKKGGPRSSSSRGGKSSSSRNFKKKGGFPG*PRRF 
FFPGPGKPPPRPPKKAG/H/YRGKPPGPGPK 

7456 

j± 

2 

rT CT A 

QTLPFKGRLPQEDGLGIRRPSKRPALPWLCQGPAACPLPASLQVVPG 
HQMDP I QLVPP FLS PAR PLLEL * EGRQKGRWGRP S QAGGQEKQ\ LS S 
SSASPTAERARPPRAGTYG*CLALP*VTGGVGPMASTLP\VTTGMGQ 
DSRGNR*EGPGVR*/DWGCSPRER*EGGTGVALEERGLLQRQREKTS 
GVGVSEPGFVFQRPALPGPWVCPRLHKRLA 

7457 

C 

89 

376 

MFCVSEVQKLXXXXXXXXXXXXXXXXXXXXXXXXFSLXXXXXXXXXX 

XXXXXXXXXXXXXXXXXXSSWARLSLGVSSEFDAIQVPEFTDFTFS 
ox? 

7458 

A 

3 

866 

MNFSPLFPSNFGDFSQNISTPSEQQQPLAQNSSGKTEYMAFPKPFBS 
SSSI GAEKPRNKKLPEEE VES SRTPWL YEQEGEVEKPF I KTGFSVSV 
EKSTSSNRKNQLDTNGRRRQFDEESLESFSSMPDPVDPTTVTKTFKT 
RKASAQASLAS \ KDKTPKSKSKKRNS YSS WKSRV\ KNIRYESASM\ S 
STCEPCKSRNRHSAOTEEPVOAKVFS RKNHEOLEKT T Knra «5TR T <? <5 
ETGSDFSMFEALRDTI YS EVATLI SQNESRPHFL IELFHQLQLLNTD 
YLRQRAL 

7459 

A 

2 

743 

KASAQASLASKDKTPKSKSKKRNSTQLKSRVKNIRYESASMSSTCEP j 

CKSRNRHSAQTEEPVQAKVFSRKNHEQLEKIIKCNRSTEISSAHARR 

ILQQSNRNACNEAPETGSDFSMFEALRDTIYSEVATLISQNESRPHF 

LMEL\FHELQL\LNTDYLRQRALYALQDIVSRHISESHEKGENVKSV 

NSGTWIASNSELTPSESLATTDDVFIQEK*KYASQKHLHINVHSIFS 
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HDSLKLKHQEING 

7460 

A 

1 

336 

IYFPTPEKFGPPKETLKXGAPSSSSSKQKFPFFGPGGKQKGGFGSLQ 
TLPPS / RSSPFPPQPPQEKGFQGRSSSSG* ICIFKKKGGFPGWAGGV 
QNPNPKGFGPPGPLKPWG 

7461 

A 

1 

3010 

YRIPGGGTWQSARPRVGSRRAVDGEGARRGLCSPSSRRWRPGPPQPH 
CPGPRAPALSCAAAAPARRPRGHAESRRDGGLGSAEEEESWYDQQDL 
EQDLHLAAELGKTLLERNKELEGSLQQMYSTNEEQVQEIEYLTKQLD 
TLRHVNEQHAKVYEQLDLTARDLELTNHRLVLESKAAQQKIHGLTET 
I ERLQAQVEELQ AQVEQLRGLEQLRVLREKRERRRT IHTFPCLKELC 
TS PRCKD A FR LHS S S LEL P AAPG AGERAAAD PGGG AAL PGE PGAAAQ 
GAGGARVHRGAAGVLGAGAPAVRDGGLS PACAGAGGRAAGAAADEAG 
QDLPTGSGTTTWPRPCSHPSRRPLRPTIPSPAAGTTWAPRTGSPHRQ 
PLQATWCARAAATLRSTPSWPKTQPAGTRATSHCTPTALRKR/EHVH 
PAGGGRAVPRAAGEVRGAAEQVPAARGRSARRRRADLAPHLPGQLVE 
GPARG *GGPG *GQGRREEPEPARGGRGQAAGTEPARVQGALQRDLLQ 
DPED/ARLTSTPPKSRRTAASDPSPACSLPQGGSRGVPQAWAVQLPE 
SERPLAACHHSTRPPDP\KHAACSLLSGGSPPVLPPRSPWPPRASPK 
AQL * VQSQMS P/PTPGI PQLPQPLAS * PCAS PSDS * PGF * KPFW I S W 
T.PFFFWTi^T.PPTiRDT.OCKVQT.TPC* /ROTjF/rT.FPfiPOnOTjFOGFPnPP 
TAS PQP * CAD I PLAPAVP PPGVLTR SEMS PAMQSRKAS AGPGVSLLQ 
PCOGOVSPWI LIiAHRRSVASORRRRRS PS AEWCS PAOSLSPWIjPTPE 
MTCWALSFPGAGAVGSPHSL\PPPPTKPKLGIi*LPSLCDPPSQGP*C 
LSHSTLPTGQPWHRSCLQPPESSGHAPP\PWPMSTPASPSLGTGVF 
APCCIVVVIiGWG*GTCSLPPSHKMPAIiHIiPPLWGAFEDPAASGVLLQ 
DVRKS S S AKRHPGPQLALS S PSGLLVPS AALPETLETRRAIQS VEGG 
GAPGLLHRDPKWEPFMPYLFLNG 

7462 

A 

2 

190 

SSNKKKPPNKNHPEPDSFKTEFYQTF/ INTP* IPILKLFQKTELQEI 
LPNSFCEVSITNSSTRR 

7463 

A 

1 

395 

QYGLYNGGGAIM7VALHESFTSCLNYGGPQKFSPEGDLCSSPDPKICY 
VCKKS FKS S YSVKLHYRNVHLKEMHVCTVAGCNAAFPSRRSRD\ RKD 
*KADLIVTRHSANINLHRKLLTKELDDMGLDSSQPSLS 

7464 

A 

359 

1095 

PKGENGNLWCGASLLNARMRAGNKSPPjLPRIiPPGATKSLPQGAIiQFQ 
PLLAEPDSAAGLGTGGTPRLTEVGGAIj/S * *RSGSGFPENOKGPPDP 
ASGYI^LMHTPPFSQ\DPISPASHMPIPSWSTLFSVEDRRSRQKTTP 
GRWGSRCGLPHTCS I LVGGVTREGVEMLPMGTCGKCAPFAPHLSAAS 
SNSVLTAGDPAPGAGRLGSAAQEAALGQGGGRRSQAPPRRAIiQRPHA 
WVETTSQWRSAG 

7465 

A 

16 

405 

VGAIQRAPPAPPSCARCLHTPDIRATIiTQTPGGLTPQQEKDHEHGHD 
GRAHSGAVS VWVLDPGTCSRRRR\ WFS PL YLELLRS SGECRG* S PAS 
PSLSHHTAHVTPKAFIEHLLCARCKEDSGITGQGAF 

7466 

A 

16 

255 

ARPSCPCSWSFSCCGVSPGA/LVTEAAIFYETQPSLWAESESIiLKPL 
AJCLMTYFKNSTYLIRLFMIYRCKPVKSKKKKRN 

7467 


x 

1026 

MR ARR LPWALTLVAELGWDTOGGDOT^ PC5GMDRM <5 MP. A ECESTTVS P 
LSCSIPTGCXSQTREEVSARATPPPSLGASLLQTIiTPDTHCTGVSATI 
MSMLWFLLLWPFSHSSTLAKHKRIHTGEKPYKCEECGKAFSRSSTL 
AKHKRIHTGEKPYKCKECGKAFRQSSTLTKHKIIHTEEKPYKCKECD 
KAFKRLSTLAKHKI IHAGEKLYKCEECGKAFNRS SNLT IHKFIHTGE 
KPYKCEECGKAFNWSSSIjTKHKRIHTREKPFKCKECGKAFIWSSTLT 
RHKRIHTGEKPYKCEECGKAFSRSSTLTKHKTIHTGEKPYKCKECGK 
LLSTPQPLLNIK* 

7468 

A 

1255 

2710 

ARPSCPCSWSFSCCGVSPGA/LVTEAAIFYETQPSLWAESESLLKPIj 
ANVTIjTCQARLETPD FQLLKNG VAQEP VHLD S PA I KHQFLLTGDTQG 
PLPLPLGLVHRMDPAEQAPWS*QGPKAIjFNSTPPTESLPAPWIiSMAP 

vswitpgpetts\rcrgvlrgvtfllrregdheflevpeaqedveat 
fpvhqpgnyscsyrtdgegalsepsatvtieelaappppvl\stmes 
pprscti^nkvtltXcvaplsgsgisslrrggeraagnpggstspdr 
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IFFHLNAVALGDGGHYTCRYRLHDNQNGWSGDSAPVELILSDETLPA 
PEFSPEPESGRALRLRCLAPLEGARFALVREDRGGRRVHRFQSPAGT 
EALFELHNISVADSAl^SC\ryVDLKPPFGGSAPSERLELHVDGELAG 
HQRGRARVQCPSGPPVFPLFPWASDGGALGLGSAPIAYPGGEQAIGG 
RGSGDAWNFGLFPTDAS P 

7469 

A 

3 

1312 

GEAGRGGSTGLGAGQGCATSQGLAARSSWGHRSVGHQLGSRGPTRSS 
AQ*GAAAADLQGAGRHSDHNPPPT*AA*G*NLPAEE*GGPRAGAPAH 
AS C PTD P S AG E PG TG AG AARRWHQ PG LC S P S G V * PQAS PG S S PRAQT 
QS PGAAQEHHE PRE * AAQAGERPAGGS E VGAGASGG PAAGTGDACGN 
PGPAAAGSPRGGQGSRaATGHTGCGAVQLPRPAPSGRG*KCPAAAAA 
QETEG*VRPTAATliRLAGSGARRWCRPSASHHGPPDIPGGDSGGHPG 
SAPQP*AAAGPGCPQLPQEAGGSEPQA*RATGCLQGPGRCII/GPDPP 
EAPGLL PQH PGRAGTGAGTAAGPGHDG * RATF * ATGVRGPAP/ EAGT 
STKS*G*GSWQVQGTPGKWGLCLQPSPSIQGPAATRPSPK\GRAEQS 
SSASTEPSHPAPRKT 

7470 

A 

1 

1988 

MWE PCG S QGDT/ GVFGCRDR * GG * G AAGCAGRP S PGDRGQWPHACGG 
GGRGHPSTSTGAPAGHGEEAASEQRLLSGGPLRPPGPH*AHRHVGPG 
AGRGLC * LGRRE PQHRVPSRSAPAAPGRLPALPFLDEDKG * AEPREA 
APRRPRAAGHSPGHVSHCGEAPGGLDHIjHIiPQ/ spagmgsehdgrsg 
QC*PSRHTSP\PTRLQPSPPDTHPGGSSLPAPRPALSCWARVFASLG 
DQPAFLGGHTG PPDASGEPQHKH * PS /GPLS S S S FPGAP I QGRQR\ T 
ESLPGTC*GMEWPTAALG*RGPRPVTATPGTPGGAPTS\GFP\GCQA 
QKPEAGLWANGTMCCPAKHTWRSGPKI PTL I S I LPLLLLHGGAVSL 
GSPPQGSALRPAPASLGPCVTDKSIPSLAQNTKYSPDLP*ET**HLP 
ECSRG WGS PLLVLHPLLGHI LVS AH I PLPGYKACPRPLNLLGRFGA\ 
QGPRGSIiSRSSRGG * / PGPAPSALGVSGS PS SHPPN/ GLYAEAHRMG 
FLPDSKSKN\GLISKCKLTFHPQTSKPPAPISP*SKTSGGRKNRIiPG 
LHSQQSFGRMPPGMP*APSVYIiLPLLSHPLPLRTAGKPSSIjPQPRSP 

CT.V 1 ? CTTTT? ZiOOT.CT. Q\7D*f2f%'Df , M7T3r*TJ r PO r T v Q/ , viT nntJT ctt t-»7\ ttit nAr» 
ojjvoonx Jxrtyv ij ^-i J J v Jr " K^yjir^v ¥\jn X o lKy^ijirPrlljolljPAEl»PQS 

SFANRGIKASLRIRK 

7471 

A 

2 

353 

\HSLEKFIISAKERTEIELSYTKQLRNLLKTCQPKRNWKEKEYIT*N 
SPSLPVNKMSDHARQH*VITLGSIKLSSLRS 

7472 

A 

2 

390 

PLDPQRVKIEDKVNKSRS S * KN/ LGLEDLGTYS VI VTDADEDI S ASH 
TLTEEELEKLKKLSHEIKNPVIKLISGWNIDILERGEVRLWLEVQKL 
S PAAELHL I FNNKE IFSS PNRKINFDRE KGLVE VI IH 

7473 

A 

3 

223 

DAWVTSTNADKEFEKIQHPLMIKLSSK*VY\NMSYIYPDFKIIKGIY 
DKSLVN I I LNVEKLKAFF\ LRTGTRER 


n 
/■v 

o 

lUJi 

CPLPFSL*YSTCGEKQPQMSWRASAASPGQSAPSSSAVGCWKASCKQ 
YVWGTPS YS YTNRFPRRTPHS PGVWKAC I S RAG * PARKNS I»I CGLAG 
KTGALKDT\SGKAPVFPLCHRHSDPGGGFHCAELGTGNS PLGRRLLK 
^Juir Fi\±\v r\~h'kr\j±\t'r\. \ 1j Ir y KKlr AO Kfl r PFAQSSPPQSVWSRSRC 

TRMPGPTGPGRWGPAAGSWPATSIiASGCPSWEVCGATNP*APRIiSTA 

LGSCCPSPPEGKRRPNT\PATRPPCVPAG*APPPGHSRGNGGLBKKS 
• SF\ PA^PRT? *0^(^A<^PQPT.nT?PQ^T3Cl?T.TJC\TOO^i5X?T>T? T T , CTJ , Dr^'\rr%a e 

SVSEKGEKPLRSYDPRR 

7475 

A 

63 

389 

KNQEQESEETLPNLFYKASITLIPKLDTSSSSSSS\FSSSSSSSSP* 
A* * EPS KNGDRRNTP 

7476 

A 

3 

335 

HASGKDRHTDQRNR I KNPETDT * I YS TF * QKC / RL I * WRKDS LVNKW 
C*Sm^A\SPMKXIKLDLSSSSSSSSSSSSSSSSV*NVKLLGNNVGGN 
LQYRGLRVHTVDVKAQHI 

7477 

A 

50 

379 

LEAWAGLSQVQTPSKFLENPSQSYRLTVPLRKHVRPKKQHEIRRLA* 
VGNNIiSDFTG*/HPGCIRGLRPG\HLSRLMDLGLGLMVKSMERDQRL 
VERAQRLDQELLQALEK 

7478 

A 

62 

389 

KRDPRKTFITPPPVSGERQIRRGPTCKWGFLGLVGRGGPPEGEKSNL 
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LLWNPGPPLGPNIKKGPRLTTPSSSSSNFQREKFHPGGGAPP*NPT/ 
ILGRPGGPNKLGWGVP 

7479 

A 

18 

374 

RGEPPSPAPIGIFKSPKRGFPGPKTFPGGFLPKGKKKPGPIFPPGFb 
ETKEGGTPPNPFKK/ARVFLVFKPKAGSSSSSSSPNYKPIYLNNTDT 
KFFNNMLANRI *QCIKEVMQYDRTRG 

7480 

A 

2 

1289 

FFFFFSCYAVNAAGNEGNDTALSFPERSQSRLCGFSQNTLCLLPAFT 
RSNFPGSCRPRQPPGSLPALALAAGDSPGEAGPPGEGSRGIQHTFP* 
IPRPRRVQGSQSSPSPTSHPQQFINGEGQTS* *QNLGLRTQEFECNP 
RLISRL*WRFRSLFAFFPLFAFHSPSPPFF*GNV*ARLQGQKMESIi 
GAEGLHHRRPRQGWRKTKQKAPSAYVGKNGNPAP/PWFPPKVFRTGA 

HVD*PPT3MDTODVD*DUOT T\T 7\ T/"P''D/"'» T"»T^T}0 T\ T5T>r"TT y\ ri <-*iT A T3T UDCT) 

AJvir w Ljr FWFAoFJNJr ^KiaoXjIJXjAlV^ LWroK 

ELITPGCHVLIGLGSRLRGALFGG*GTAKRLSHKGRQSGP*N*P\GR 

urpADT T PQ217XQfiP\rP\7f2T.OP"Kn?PTA DOMOTT DT O CDT T "Df!/"2 T tf\T\CT?'T 
it X/VJt'ljXiroMM.oOfC VK VvaijyKJNJir' Ju \ ryw F X AjKIjo o F XjAjFvj\JX Jft. V JVC* 1 

VGI**GVGMR/DLGS*NEVSAFIGN*ESRGSGSSNRMVFLPLPNARE 
SPPSRFETVN 

7481 

A 

3 

352 

Q FS I E * KKRKI FPTQ A * DKQRLNVQKY K I LL VTRKEG PNGKRGKPC S 
♦IRKI^IVKTSIFPKLIDSYNKFPNY/LSASLLLELTSCILKLIWE/ 
QNPFMKMQRARTVKTLSKKKNKVEK 

7482 

A 

2 

253 

SMKKVERSKISILSFHIRKLEN/ QLNPK* /RRRREIKIGAEINEIEN 
RK* IEKINETKSYF\WKISKPLAKLIKEKTQITNTRNRAY 

7483 

A 

2 

1114 

WSHRDW I CPAS GFQMLNVGS VGQ I LRKKLMTYAL*LLQIjTVWRDSRIj 
LCREGAPK*SL/SSLPESRRQSSEFEKAKTCQLFINAAVDSPAIDYH 
1 S IiAQ S ALiQ I C DTH P ELiQNE I CC QJb I KQTRRRQPQNQPGPxjQGWQLLi 
ALCVGLFLPHHPFLWLLRLHLKRWADSRTEFGKYAIYCQRCVERTQQ 
NGDREARPSRMEILSTLLRNPYHHSLPFSIPVHFMNGIYQWGFDAS 
TTVEEFLNTLNQDTGMRKPAQSGFALFTDDPSGRDLEHCLQGNIKVX 
PSPFQEPSPTNSKEATLYKKSKLCKXFT^SGVFQNFNQSNLHDKKKKM 

OT^'K , \7Tr'DT"P , 'R , T.T WTJT.VT VT , l?TITTrPPlfr*\7'CM/^Krn , T , T CTDTTW/TIM 
\£LJr V I\Jr A Utr JUXiTivJuI 11 If XIJ. tvHrCrsAj V oiMuX^JJX LolKX J.KV JJn 

7484 

A 

4 

254 

GDVEY*SIYSLAPIRHWLRNGPRASTLWLFFFFFFFFF/ETQSCSVT 
\RLDCNGMISAHCHLRLPGSSDCPAS 


TV 

A 

95 

o / J. 

r Lih,L>KaYiJ\Ht r FFFrvlSAjb r PKxPJ\Ji(JI , KxPPKRKJ\IFPSHPPIxNlA3PP 
RDLLKGPP/HSSSSSSSPEVINIYCSPFREGHGRR*SKVSSWSTEVN 

V"DW*TMf J7PWFPC V / TArPOfPT DnnVDTTiTrUCVCDDV/UMrvGT DTT?r\YR 
Arv L/rvr rnlrol / MirCrbryvjlKlAVf blorrA/ ItJm^oXjF 1 t\JJ\J\ 

CSFPYSFSHHSDQSPTLVAPFFFPCLMRSSDKLMNDVPAPVFLPSFK 
GNHACSSR 

7486 

A 

2 

340 

ENG WCW I ATYGRTKTGFLS LTTCKS * LKRDLDLGTETME I LEENLGE 
VLLDIGLGK/NFMTKTPKADSTEIKIim*DLIKliKS*CTAKKS*T*/ 
SNRQSTEWKKI FGNYASDRTS 


TV 
n 

117 
XX/ 

■3"! A 

±j XMXlrvr iNiNJ-tl/CfriiJ^r ljvrtxlTJ A JrlVJjxyJilS/ rlKXiX V AJV W A IvOX VXWAjFJV 

QKA*GPDGFSGEMYQLLKEEIIYNVF. 

7488 

A 

325 

354 

CILKDRCR*LYAHKFNNLDEMDQFLQRHNTPKX»IQEETDIiL/ SIK*I 

AO 1 V IWijrivylVH OXrJU/OJT OU&rl X UXlXtJvdJSX X XIM VX* KlJJViiVj 

7489 

> 

A 

3 

458 

KSRE I TTLANNQCMATE S VDHELGRCTNS W\ KYELMRP SNKAPLLV 
LCEDHRGRMVKHQCCPGCGYFCTAGNFMECQPESS I SHRFHKDCAS \ 
QFN\NASYCPHCGE\ESSKAKEVTISLKQTPPRPWTPVPGQE*GSSL 
VGRADTTPGSGAR 

7490 

A 

49 

457 

MHNPDG S AS PTAD PGS ELQTLGQAARRPPPP / PRGTRR P PQDTP ISP 
ETP\GCFSRRPMPTREPPKTRGSRGHLHTHPPGPGPPLQGLAPRGLK 
T/RRPPPSVPAPAGTPOGK1^0FDCV*n*VAT.PnPPnPnF*AYW < 5HP 

/ Ar r r u v rrtrnu i r yuiwy Di/U v vj VrtJjrorroRyDrti nonr 

7491 

A 

1 

816 

MDEAQPQEALGASD WQTC S RTQCLQRCKARS QS DHFKG E LGVS S PRV 
HPQAARRPPPP/ PRGTRRPP/ PGPAHQPGNP*AASPGGRCPPGSPQR 
LAAP/ EGHLHTHPPGPGPP/ PPAGTPQGKDQEDCV*G * VALPGPPGR 
QEAYVJSHP*GA*A*APPSARL*A*VPASEQ*EGTEPLCRSVPGPVRR 
VLGAPARGGVCTGKAQAAGGPAELPAPTPKPEGGPSCSPGLEGV*DE 
ANGSWLPGQAGSLPLPEG*TEASQDSDPEIR*PRRTRPSARKPLSCF 
SRRPMPTREPPKTRGSRGHLHTHPPGPGPPPQPGPHKAKTKKIVFED 
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ELLSQALLGAKKPIGAIPKGHKPRPHPVPDYELKYPPVSSERERSRY 
VAVFQDQYGEFLELQHEVGCAQAKLRQLEALLSSLPPPQSQKEAQVA 
AKVWKLr lirlJS±<J v llJirtj»r JjJJlvy/VKv-rl x Jj JSXj rvL RHXj KTQ X QIvFDDQGDS E 
GSVYF 

7492 

A 

27 

447 

MHNPDGSASPTADPGSELQTLGQAARRPPPP/PRGTRRPPQDPPISP 
ET\LSCFSRRPMPTREPPKTRGSRGHLHTHPPGPGPPVSLDREGVPR 
F^jJj w jHJvL»JjAi^/\ V w bifvf Fl irKr NoRPRr PVAAAGTGAPRPQNQRPP 

7493 

A 

1 

368 

FQQNLTLISAHCNLCLRFK*FSCLSWDDRHVPTCLDNFIiYFW*RWGG 
r XrtvJjyA^ljJvijijlJoo IroAb/ or Pi\CW\DYRHEPCTWP*KYSF1jTEH 
*LLPVY*K* IAKPFGPKQIVKTFS *GCKNI 

7494 

A 

10 

505 

EGDKGTPVIiRGFSSVSGSWSRRMPPFLLLTCIiFITGTSVSPVALDPC 
SAYISLNEPWEGTIDHQLDESQGPPLCDNHVNGG\WYHFHGHGRGDA 
MPYLSFFTRKPLWEPTAPVWLNGSHPL*GDGIVQRQACASFNGNCCIj 

rVIX X X V JD V iroVj X X V X rCXJ X JVir o V 

7495 

A 

84 

250 

NR I EN PE I KLNTDS QL I LPR * FNWER / DS LFNKCWWNW I FKCKTMKIi 

U tr V if X Jr XylvlM 

7496 

A 

63 

743 

sscllaafrgpahashc\p*spahsskrpihaqlfplvaspgpeipq 
fgfar*scclpvpspgpvlppswvyspis*lkttyfgsapaqhlaaf 

VGLKLPQVKRSRPRSCCPGAFTGPATA*RWLPQAQVLPSPSIiPRPSP 
CLx^AFPVHVTACYPTA1jTDQTLPSHWTV*DKLIRLPASPGQSSCLT 
LAS IGPGAESAMVCLGPSHAFLKTLQATIWPDTCFCGPM 

"7 /I 0*7 

747/ 

A 

"i a f\ 
.54 0 


VPT I CQVMS * H FH P S S S S S S S S S S V INYY * / FGLLLC / QVAHFYNS I 
DQQMIQSQRPMMLQSALAFEQI IKVNGLLI 

7498 

A 

2 

826 

LVKKTCEVPLQAGRLYLDHHQSKPFLKLHSKrav*RCPLGIQTTGPV 
TLRR / AFNLRP VTiS PQQRVP VEARPRKCETHTES FKNS E ILKPHRAK 
PYACNECGKAFSYCSSLSQHQKSHT\GEKPYECSECGKAFSQE\SSL 
IQHQRIHTGEKPYKCSECGKSLSARNANIiTKHQANPHREEEALTGCS 
JbCh»KAr t> w xjJ_i£> \ljVyHyRlHTGEKPYECSDCGKAFRHSAx^TNHQ 
HTGEKPYKCSECGKAFSYCAAFIQHQRIHTGEKPYRCAACGKA 

7499 

A 

64 

372 

KLSFFHRESVNEIKKKTIDWENICNTYE*KPVTYK*FLQIN*KEIN/ 
SITGKNYKE 

7500 

A 

2 

414 

RQGL/ DSVTQARVQVf SDQ/ GSLQPRS PGLERSSHLSLLS SW* FEISF 
Kiijyyj_»vi ivjJM±xjLiiiyivx 1<jXWHK w FMIjYSS \FCIJFFE^lESHSVVQA 
GVQWHDLSSLQPPPLGFKRFSCLSLWSSWDYRHPSSCIaAYFCIFGST 

/ J w X 

A 
£\ 

1 ft 

/ J 

i> x kr\jPa\ V t> FAfWb F(jA£> PIjKS £> PExjHxjRIjS S S AS SLAXjlAj\NAL»SKAS 
PSPAAQ/ALPSPSPLLWPLTSSPGRQASPVAPPSASNGT*WGTC*PE 
AILGLQSAQPPGAPGPPH 

7502 

A 

3 

365 

KTASPVQVKFCAGVALPNLKQRPLREKAQQCI * PLLMAF\RNTGRIP 
CFSCNIPVLPGQKHGTEDYWFVQDLRATRQIFKAIYLVIPDAYTIiFM 
TLTSELYW/WSVLNLKDAFICIPLSPES 

7503 

A 

47 

231 

NCSSVFCFPGSQRVGSILGKSW*GWWATGVWLLNPLKAVGQEEYSL 
KMEGPRPEVTCRHL 

7504 

A 

1 

3099 

MGSCSGGERLGTSPKYSMGKWEFVAKEQCGVPCPVQLGTLRNDSLEA 
QLHE YVKQGNYVKVKKI LKKGSOPHS PRVH S LFTEGTLDPQAPDPCL 
MARETQNQDAPCPAPFMAEEASSPSTGQPSLCSFEINEIYSGCIiILE 
DDIEEPPGAASSLEADGPNQVDELKSMEEELDKMEREACCFGSEDES 
SSKAETEYSFDDWDWQNGSLSSLSLPESl^EAKSNLNNMSTTEBYLI 

^Kr^TiT^TiKTM0TTMTTPTS3T)'nPT.T?MTT7OTT nTr\n?MVriV1?rM?T?DMCT WAT 
oi\a_ v uj^jjixiriy x x i*ixi iii>i JL/X^J\.XjJ^>iXiiy A Xjly£»Vxl» 1*1 r\^rvJl^ri*ril\lMoJUr(AX 

SR^FTNAYKXiPLAVGPPSLNYIPPVIiQLSGGQKPDTSGNYPTLPRFP 
RMVRDFQGGHFGKRSRKRNGCGWKPFTQVSKVNTGDQSEISHQLPTL 
CDPGKQNTDEQFQCTQGAKDSLETSRIQNTSRSLTDIQDLSSISYEP 
DSSFKEASCKTPKINHAPTSVSTPLSPELLKQQQGSSTVLHE\TQQV 
MEEALQMIKGT* KNKKLTVKKKI WCFCPKKLKILERTQRE/PHSTIi 
DEDLERWLQPPEESXGSYKTFPRALKVAVRAAVPVLQDLDTMQYPLL 
GDPRLDQEQNHPAIP/PPHLKSQKTQTLASHASSRQWKVSPPREESS 
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TEEAVRASQGAEVGSSCFINPRCQI YS S VEFSRPEE*I*TITNLRR* P 
TIQGPEPPRSQSSP*DFVLPKG*SSTN*AHPKNYDY**DQISKESLS 

SLLKLSVIiKDSDGVHRVSARVDPESLHKYSAYPETKPSAKVLVSSQV 
ES NVRTP IRGNS EVGRRVT I S PGV/ PVSRANS PCDS S I S V* GLPQVI 
HVCYSRAHL*T\QTQKPPEITYMSQGPTPRYPELSQKPSIHAELELT 
PRPLPPRSLPRYGPDSSWWPLLNPEVETPQSQLTTPDFEPKCSPSLD 
LLLSGFKIDSSPFCEDLKFQREKASLSPPSPPKEFPSWAPLSEVPQT 
PKTJTnKOPTOPFTAFFT.AWGTPFFGPFFfiT.HTTWVT.lJT.FlTr ITVnQzW/ 

SSPVS 

7505 

A 

2 

708 

LTTH* I IHAGEKPYKCEKCGKAFNQFSNTLTKHKITHTGEKSYKCEEC 
GKAFIQSSTPSK\QINDGEKPYNCEECGKAFNQASNLY*/HKIIHTG 
EKP YKCEECDKAFNQS S I FTKHKKI HTGEKP YDCEKYGKGFN* S S VL 
NTHTT TTjTAEKLYKPVRCDSASDNTSNF^NHKPNHTfiEKS *KC*GECN 
KVFKWLSHLIIGNIHIGKISYK*EQCGKVFN*YTIiLHRKSFIFEKNC 
RN 

7506 



683 

CPCF*QPSDGAQT*NRIPH™CPIFroAA15KSYP*IARAl^LI J *NLM 
AYEQVNPRL/ CQVA I CP I KGKI PPGGD VLTS Y I KVCEGVRGTLRTAM 
VMAOAMVSTRMPGOFPGKCFLCSOSGHSKRNCPWHTDRCSFOHOOPK 
FFQQQAPLSTICPRCQKGNHWVAQCHSKFDIDGNPLRSIiP 

7507 

A 

2 

342 

STVKGPCSMCRTIHHVPHIMGFI FGVQGWHDM/ C/KS INVNHSVNRM 
KNKNHMI I / SNDAEKAFDKT * * S VS IKT/L/ NKLEVKGMYLYI I KVA 
YDRLPARI I ADD * RAES FPLRLGKR 

/ _> u o 


x 

J J -7 

KKANI RDQWP * EAKELMPQDNFN * LFPQTDATRS S I W/ LLKTS 1 1 KK 
YHVTHYEKALKTHEKELTCLRHDRVGTC 

7509 

A 

2 

681 

WEAiCQQKAAEAAPPTQEAQGETEPTEQAPDALEQAADTSRRNAETP 
EATTOQETDTDLPEAJPPPPLEPAVIARPGC^LSLHGIVEDRRPKER 
ISFEVIWIxAELFLEMLQRDFGYRWKMLLSLPEKVVSPPEPEKEEAA 

TfFPaTKT'PPttTKFFWKPPKTJPanMFnPATF^KZiDT.K'FnRT .T.PVPT.Q 

SGGEEEEKPRGEASEDLCEMALDPELLLLRDDGEEEFA 

7510 


670 


IVtVXrv. Jr Xr rylt fuf X O J.M V Xi V Xr Avdur 2r V_ \_ Xi ±r V O VlO XtXVnn V 

LFA^SDVSDPJVFE\WLSAFPIi\DSPYSIHHPRRIQV\FIiKREGSSRR 
WVLEPITADSDPAYSSKVLLLSSPGLEELYRCCMIiFVDDMAEPRETP 
EHPLKQIKFLLGRKEEEAVLVGGEWSPSLDGLDPQADPAG/SWVRTA 
I RLCAGPELG I DLS \ AVTKWWRFAE VS VPAAGD P PGGLQTVWYLPG 
CIxDHHAYFGRKWEGPGAKQKAARGSSPNPGRHKG 

7511 

A 

155 

361 

VNSSHRDQWNRIGGP/ ETNAYI YGQLVFGKEAKFI *WRKNSLFNKWC 
WET/ WIS LQKRMELDS YLTAYVKI 

7512 

A 

2 

495 

RKD I L YS W ARRFN I VKMLM LL P KA I YR FTV I P I FLLQND * TAFFAEM 
EKPVLKFI*SYRRP*IAKQS*SSSSSSSSSSSSSSSSSSSSSSSSSS 
S PVNS SHRDOWNRT GG P / ETNAYI YGOLVFDKF A KF T * WRKNST.FNTC 
WCWETGFPYKKE * SVDSYLiTAYVKI 

7513 

A 

155 

361 

VNS SHRDQWNR I GG P / ETNAY I YGQLVFGKEAKF I * WRKNSLFNKWC 
WET/ WISLQKRMELDS YLTAYVKI 

7514 

A 

2 

495 

RKI)ILYSWARRFNIV}WIjMLLPKAIYRFTVIPIFLLQND*TAFFAEM 
EKPV1*KFI*SYRRP*IAKQS*SSSSSSSSSSSSSSSSSSSSSSSSSS 
SPVNSSHRDQW1STRIGGP/ETNAYI YGQLVFDKEAKFI * WRKNSLFNK 
WCWETGFPYKKE* SVDS YLTAYVKI 

7515 

A 

1 

401 

NVliIWEDCIAGQAEVLHDS YGI I IDWSPKGMFSMNCTSQS\CPWPHY 
VQLV*x , ERSDSRlTOKKYGKSSx^LETWQYSG\PQPQMIWPAVGAKHK 
DLWKLLIAFDKIKIWEGKYTMPSQYNPNYILELVE1WTIW 

7516 

A 

10 

225 

YCNTI IVGDFNTPLSGMDRSSR* KT/NKETSELNYTVDQRSLTDI YR 
TFHPNNLM IHLKELE KQGQTKS N I N 

7517 

A 

37 

350 

WKMGQLPGRPQLPQLSQEKILSLNS PNVFSSSPLK* * *NILPKKKRP 
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DCFTHKFYQTFKEEIIPFLKTH\QEFEKKEILLKSF\YEGSVV*AAK 
LNADINKKTTH 

7518 

A 

3 

353 

CRLGGLKGRSREQGAVIREGEDTLG/SETEAGRPAGSLRQQQRRGRP 
WLNPVPGKHGVPTGEEHWAGEPAGWGGSVSGQAG*AESRLSSSGHSG 
QKGGAVMLLAADRRRGQRRRGGSG 

7519 

A 

54 

284 

NHKTPEENIGEKLYDIGL/NNDFLDMTPKAQAMKAKVDRWYGIKL*N 
LCTSKETINRI/NNRQPMELKKLMANHISDKR 

7520 

A 

29 

261 

NHKTPEENIGEKLYDIGL/NNDFLDMTPKAQAMKAKVDRWYGIK1»*N 
LCTS KETINR I /NNRQPMELKKLMANHI SDKR 

7521 

A 

5 

2089 

LSSPSQGRGADSA/PGEFSGTQLPPQGGELVPQHLAQPHAVPAA/EV 
GPAWWPGAGP*TSPSGRGRPRLCPPAPGPPLPLPAAGPG\SHGQPPF 
MLLDGL\PGPPGGSGAS\GLPGSSALP\RPRGEGVTPGIRGEGLRHT 
GA/GAPSHSHSTCPSRRWQAGCPPTLTAPPPPGWATSPPPPESRCRV 
SGTSR*A*NPGHAAASSGRWDRGPGAAAGPRPG*AHPA*GSGASGSS 
PGPEGGRARGT* PCSTE PS PGH* STGPSPCPAGGPRGRLLLPLPGPD 
QGRWAPTSGIPLRTG/ PKLQTPTSCPGR/ KLAEGEAPGLMGVQVRQC 
EAQHTQRVPWLWLQEGQDAVCVELEPVRVPAGPRGAADGRGCHRRQ 
A/ P * RAG PGRRPLCS CTAGAAGS FPPAPRSGQGQGLLGLAPGTAAEQ 
AAHRAEPPRCKMPA* / PGPCVSTTCPCVSASELGTPGSEPIiAGRWRE 
AGS SQAVGLAA\ GGGGC * DQRTPAGFGS * GAGKGLRGP PGLVTSLSA 
VGRCASEPGGASGAAGRRGRSGSPREPRVGSGASRSPCG*AGCRRAS 
SAGCRPRPGRA/ GSAGEKADTYSCCRAAS\GRPERPSRPGARS *tpv 
RVYTA/GMVGGQGLATPGSPCCCP\GPPGGRPRSRPE/SFPRADPSG 
YRGLPADRHWSKGP*GPALPGPGRSQAWALLGPQHPRSPAPAGAGRG 
RAGHRAR 

7522 

A 

45 

301 

AGHHHWRKGPEAPRHEMCPGSSKNQGLVQGLAGGWGTKGKILKGS\S 
TS PGEPGSGQIHLKKAGTCCGGS SKDRGVRNYSGRLLRR 

7523 

A 

453 

704 

AGHHHWRKGPEAPRHEMCPGSRSGSEGATCFARGR*RPPGNSLNVSR 
MGTRSSYSCRRHSPAFLSWGKGRCHLCNKAHLLAGA 

7524 

A 

3 

543 

TCC I QAAGTGGSGAPGLAS WGGAAPLWS S F * NSLP * Q I RFALVPLP 
GCRAPGTFPA\PLYSQWPFPGPQRWARSLARGEEGVGVXJ\PHPQPPT 
WPGRHGEQSPRRHHGVGHHHQHVMALGADMRRGLAAVAALVAS * AGR 
VLGWCGRPARRAAQLGQEGAQGTVLALNGLFFFPSRRHGC 

7525 

A 

218 

533 

AARRAGRPHHPKTRPAQLATSAATAAS PRR * GRPWGRGLG PAWRPGG 
RGAALGAQRCGHGVRAVPVARAELGAAGFGGAAGPGAARGQCPALLP 
GRVE PRAARG VRTTPRPGQLS SQQAQPQRRARAAYPRRE P * HADDDD 
LLHGADRSSRFGEVEHPVQGHTANPSDGGPVQQHQGHLPHRYHLHLL 
V*HSRPSGPPGATG 

7526 

A 

3 

1185 

GPRR * GRPWGRGLG PAWRPGGRGAALGAQRCGHGVRAVPVARAELGA 
AGFGGAAGPGAARGQCPALLPGRVE PRAARG VRTTPRPGQLS S QQAQ 
PQRRARAAY PRREP * HADDDDLLHGAGGVS VPRGAPAR WAAGGVGRG 
RRRPLLPWPGCERSFSSSL* I *HSQICFF* SSLEQCGEWEGVYG\DP 
SDGGPVQQHQGH/PASQPLPEWKVAEVTDGARSTEDTSWGKGAGGSS 
*EGGPSGPPGPPGAGGLPGHNGLDGQPGPQGPKGEKGANGKRGKMGA 
TG * L * RYGWHLGGAS FALL * AGHHHWRKGPEAPRHEMC PGS KGS FRE 
NGHTPSKPAQSSEGQ*LPHSPYQSGRWQKLLMGQGVQRTHHGAREQV 
AVPRKVDLLDHQDLREPAGCQDTTDWMDSLVLRAQKEKKEQMEKEEK 
WGLQDNYEDMAGI S EELPLPCYEQAI TTGEKAQKHQGTRCAQGARAA 
S E KTGTHR PS QLNPQKASNYHWNYTAAMDR SYR 1 1 1 1 1 IHEHRCRIM 
MPIGLSWEVLQKCDSVLRPRQWA 

7527 

C 

35 

217 

MXDSRGRXPGARTYRESPGPXXRMXGRSREIARWPRRRPLAPLLHRL 
LPSPRPGICGGAA* *' 

7528 

A 

2 

342 

RALPPPKDPSLLSSIGSYRVPAPESVEGPL/SPWERKQPGERPLSCM 
S/ GLPQLRVGGVRG* *EPQGLPQDGDPAHTRCGAEPSLLDSLPQDGV 
QDAAAAS FLHTPSQTPCVQRE 

7529 

A 

2 

441 

PRVRPRVRQSSHLSWRCRVPPPPPTMCLSAHPAPGPDPGTSCESHRV 
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I PMTS I AQWGN * V * RGE I TCLKSQE \ GKS * GHGAS C PMTGELKC PHC 
WG * TGPHQLPG/MSHMFVCI LRGLAGS PARTAARGFLEVAAFFKGTI 
PERKPLM 

7530 

A 

3 

471 

RALAAAERPLAPLLHRLLPS PRPG I CGGAA* PWERKQPGERP\ FPVS 
QYLPSAC/GASPPG/PLGTGRGHGADDAPRRTS * ISEVGSVSVRDPG 
♦IEQGSDGRLPIGHSRWDERGLRGLGFVTGK/PWIKGGAILPVGPSK 
WGLGYTGRGAGLARSLQGY 

7531 

A 

2 

1022 

FFFFFSCETTIENLIFFSGEVSNLILPIPVGHK\GL*GGVKEEGCGC 
VLGLRPAVGSPAGMAQPSAPGVQAGSSILEEGLEVLSQPRAPSRGSQ 
GPVSPPSWKVLQQPVRRTPTPHASWGKPTHASSFVGGLGAQGGKRMP 
LGGPGCRPAPGLGAHLYVVCPEFSAPIRGLRSGMVPLKKAATS*DTS 
GSSPGLGEPASPQLHLFN*SLRGASSVPLSTAS\PPEPGGLRTQALG 
RY*ETGKWPFTGLLPLPRLSGPSTDSGAGTR*EPMEERSEGSLG/ER 
PARDPPGSVPTCGWRGRGSPGTCG/LPGQRPLGRALRAAKTRPGTCR 
ATDRGVPPGLPRLCR 

7532 

A 

40 

405 

QWHRLAACSNIASAVSFRIFLYYGERRPSGGARPMLTSPSLTAWIIN 
PLSQQLL*YLAAEQPQLHRGPPRTPAPVSGAE*RWTFL/IARTHCLR 
RGQGRQPPPVSGATGAPSFPLQQPMPRR 

7533 

C 

182 

289 

MRARRWI LNLRAFVHAVPSTKGALS SHHVNANVW I 

7534 

C 

50 

223 

MEQLCTKVIIRFKITTLLALILXXXXXXXXXXXXXXXXXXXXXXLFSI 
LYEWYPCHLK* 

7535 

A 

3 

399 

SNLASAVSFRIFLYYGERRPSGGARPLLTSPSPTAWIINPLSQQLL* 
YLAAEQPQLHRGPPRTPAPVSGAE *RWTFL/ IAHTHCLRRGQGRQPP 
PVSGATGAPSFPLQQPMPRFTPGASSSHQEATKSSTREF 

7536 

A 

287 

568 

CLRDGS PEPSCGLHCHCPACQ*PTPHPHYKHRPAGES I PGNTPMSAS 
RKRPASPTSRLRAPPALILSFLLCKMGTLPHASRGCWKERSEIVIiK 

7537 

A 

574 

1268 

SIPCPSSSFSTWLQSSRNFTEDILASVLISASTQSAQSRSPQNQNLC 
KGGSQAAPPASSHPISSHGWGLCLDDPPAKDIIDFPLVLPGIPYAVS 
HOCRLQYGAYSAFCEDMDGDSSLISPWAKYPKQRPGQGVFIRVPKKH 
MQKGSACPLWPWGKSLPVKMHNPG\LAGRWKSS*STWT/IQS/RSC 
GLHCHCPACQ*PTPGPHHKHRPAGESIPGTTPNFCFEKEACVPYFM 

7538 

A 

1 

785 

NRQLSCGTALSTTGGIYIGERSVRVWGHRFHCLGNESLLDNCQMTVL 
GAPPCIHGNTVSVICTGSLTQPLFPCLANVSDPYLSAVPEGSALICL 
EDKRLRIiVDGDSRCAGRVEIYHDGFWGTICDPGWDLSDAHWCQKLG 
CGVAFNATVSAHFGEGSGPIWLDDLNCTGMESHLWQCPSRGWGQHDC 
RHKEDAGVI CS KFTAFRL YSETETESCAGR I EVF YNGTWGS VGRRN I 
TTAIA\GLCCR\HIX3CGGRNGC*PPCIHGNTVSVICTGSLTQPLFPC 
LANVSDPYLSAVPEGSALICLEDKRLRLVDGDSRCAGRVEIYHDGFW 
GTICDDGWDLSDAHWCQKLGCGVAFNATVSAHFGEGSGPIWLDDLN 
CTGMESHLWQCPSRGWGQHDCRHKEDAGVICSKFTAFRIiYSETETES 
CAGRI EVFYNGTWG S VGRRN I TTA I AR I VLQATLAVGEEMDANPS PI» 

7539 

A 

1 

1113 

GLVFSWGDGDFGKLGRGGSEGCNIPQNIERLNGQGVCQIECGAQFLL 
ALTKSGWWTWGKGDYFRLGHRSDVHVRKPQWEGLRGKKIVHVAVG 
AQHCLAVI^SGQVYAWGDNDHGOQGNGTTTVNRKPMIjVQGIjEGQKIM 
CVACGSSHSVAWTTMDVATPSVHEPVLFQTARDPLGASYLGGRVYSH 
KFS PMTQS FQEWW YVNLNSLMDQALTPQCGSGEDLY I LTGTVPSDYK 
VKDKSFTGSALAALVKGLPEALQRQFEYEDPIVRGGKQLtJiSPFFKV 
LVAI^CDLELDTLPCCAETHKWAWFRRRPDDWNLSAGGSGTIYGWGH 
DHRGQLGGIEGAKVKIPSPCEAL.TTLRPMQIiIGGBQTL.CCDS 

7540 

A 

1 

216 

VAFFKSAGASVQEKQAQLQEQVKEQRVCCQRLAHPVASAQKEPEAAR 
GPGAP / WAWGRV* QLYGPPEGEGRP 

7541 

A 

3 

143 

GGGRDGTARDMPPECHQCPRTGPPMGPGYQGPLLLPADVPLVSSW 

7542 

A 

39 

212 

EWAEVPPPPHCPDVLLHRSAADRRPPAADRWPRTAAGDPGNGLAAAA 
DSEPLGSSAP 

7543 

A 

72 

353 

QRCPGGHIYSHKPSPNDSVLPGTVVCESPQPNGPGFDPTVWQWGRPI 
YPHRHSALRLQS*RQSGSP*VCLAGSLLCCPWRRLGHGLCQAHPGQ 
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7544 

A 

5 

728 

AAILYSGRLSSQPVVVESIHPYTYDTSTSGTVKIPGAEGLRVEFDRQ 
C \NTERRHDPLTVMDGVNR I VSVRSGREWSDWSSELRI PGDELKWKF 
ISDGSWGWGWRFTVYPIMPAAGPKELLSDRCVLSCPSMDLVTCLIiD 
FRLNLASNRS I VPRLAAS LAACAQLS ALAASHRMWAIjQRIiRKLLTTE 
FGQSININRLLGENDGETRALSFTGSALAALVKGLPEALQRQFEYED 
PIVKGA 

7545 

A 

441 

1009. 

GLSHGRNTGPLTTSQASTS PRGNNPRACVARTRVGVPATELLWMEEM 
RCVQLFCYSKARSQCSHQG*DV*VFYQPACWVSDTWHYLQPLTDSG* 
YA*KLS/D*VMSGPILATGPCTSHLA*LLDSEPLERSVSSFCKWDVR 
LLG KTS LCCVGLEAXj S S S E I FNCLLGVFVWNLAS CL.CG S ACCRVFLR 
DT 

7546 

A 

59 

378 

KFLEQLFDDGDVYLLLLGSQAQQNNGTQQLLPKPGCSWWSCTIGCAV 
GLSRRGSSLASSWAEEVRLTSRRSPCLSAAPSSWGAGAPGPGSSAGL 
WAAARNFLCPLL 

7547 

A 

31 

302 

KGRRVLWWGDKNHERSWHKGVMGQRVQDRQKRKLCQLMGKKRSQRA* 
ILSGTEHLHVHSHLL\CQQQVSTGRAIiHHLLP*RIWRARAKNEQ 

7548 

A 

1171 

1731 

SPRWSAPRSPSGAPSDASCDGLVPMNGRKMACWPVDIPKTDWSPNPW 
QEKRE I VSSEDAVTPSAVTPS APS AS ARPF I PVTDDLGAAS 1 1 AETM 
TKTKEDVESQNKAAGPEPQALDEFTSLLIADDTRVVVDLLKIiSVCSR 
AGDRGRDVLSAVLSGMGTAYPQVADMLLELCVTEL*NPRSRTP*CFS 
CRACDEEVSLALECWEPEELPRTQSPALTTQRAQNTMNKIiNFHNRVM 
QDRRSVCIFLPNDESLNII/DKCLVFSWGDGDFGKLGRGGSEGCNIP 
QNIERLNGQGVCQIECGAQFSLALTKSGWWTWGKGDYFRLGHGSDV 
HVRKPQVVEGLRGKKIVHVAVGALHCLAVTDSGQVYAWGDNDHGQQG 
NGTTTVNRKPTIjVQGL*GQKITRVACGSSHSVAWTTVDVATPSVHEP 
VLFQTARDPLGASYLGVPSDADSSAASNKISGASNSKPNRPSLAKIL 
LSLDGNIAKQQALSHI LTALQ IMYATDAAVGALMPAAM 1 APVECPS F 
S \ S G AP SDAS \ AMG * S P * MGEK\ CLLAC * YS ENRL\ S PNPWQEKRE I 
VSSEDAVTPSAVTPS APS AS ARPF I PVTDDLGAAS I IAETMTKTKED 
VESQNKAAGPEPQALDEFTSLLIADDTRWVDLLKLSVCSRAGDRGR 
DVLSAVLSGMGTAYPQVADMLLELCVTELEDVATDSQSGR 

7549 

A 

2249 

5398 

KCHLRAGRCRDVTAIFWIjQS S FMDQLEEKADLRKS itf YQNQGGRAK 
DAALGGGHHQAGAQGGDEGEAAGAAADGVAAYSNYNNGHRKFIiATAQ 
NPADEPGPGAPAPQELGAAGKHGGLPNLRIiSFKLALIRDMAILPFQW 
NLLISFNAVLGLVFSWGDGDFGKLGRGGSEGCNIPQNIERLNGQGVC 
QIECGAQFLLAIiTKSGWWTWGKGDYFRLGHRSDVHVRKPQWEGLR 
GKKIVHVAVGAQHCLAVMDSGQVYAWGDNDHGQQGNGTTTVNRKPMIi 
VQGLEGQKIMCVACGSSHSVAWTTMDVATPSVHEPVLFQTARDPLGA 
SYLGVPSDVNSSAASNKISGASNSKPYHPSLAKILLSLDGNLAKQQA 
LSHILTALQIMYARDAWGALMPAAMIAPVECPSAAASDAFAMASPM 
NGEECMLAVDIEDRLSPNPWQEKREIV\SSEDWTPSAVTPSALSAS 
AGPFITVTDDPGAASIFAETMTKTEEVKRHSFFAIiYCEITKWQVEPQ 
IDDPNSNLEEVINEAEAITSENSLGLI SRWPRLFSQIQPDDSVLPG / 
DGGT* ISTA* WTRL * PHSVAVADMLLELCVTELEDVAT\ DLQSGRLS 
SQPVWBSSHPYTDDTSTSGTVKIPVPSSARSSSLKENSCASARTPL 
TPDVRVLHTKRFSSSLDTPEVEVLLTSNAAHSPGAEGLRVEFDRQCS 
SEWHHDPLTIMDGVNRIVSVWSGPVEAWNRTQRTVPAVQLSGSHSSS 
PGVEG WAGVI LRLLVYSGS VS VKS KAHRLAVACVGTPAVAFWWVGL 
GGATLVCWLLMGWSLACETQGCLQCSFTGPKELLSDCCVLSCPSTDL 
VTCLLDFRLNLASNRSVVPCLVALLAACAQLSGIiATSHRMWALQKIiR 
KLLTTE FGQS IN INRLLGENDGEARAL S FTG S ALAALVKGLPEALQR 
QFEYEDPIVRGGKQLLHSPFFKVLVALACDLELDTLPCCAETHKWAW 
FRRRPDDWNLSAGGSGTIYGWGHDHRGQLGGIEGAKVKIPSPCBALT 
TLRPMQL I GGEQTLCCD S 

7550 

A 

1 

609 

AAATTTTKLAAARQPLPGGGGESAVGAGAGREVFIYLFATASAKRGG 
GRAKRTPGAQSSGR/AVLPSLGPPGRFLAGVC*GSGGPSPFSSPPPN 
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HSG/Q/PGAGGEPGNRGPKMGVLAYSIiHL*VPCGVSRNTLPAGDGNP 
VP*SPFLSFTWSP/VP*PWG/ANGPWWADPPPSTHTVP*PSGPPGAS 
GAGPPPSWPRPAKGFAPPRGL 

7551 

B 

222 

360 

XAVIALRPWDVEAEQFLEVSFLLFFFLFSDPRPRDRLRLLERLREPT 
* 

7552 

A 

1 

603 

RDTSRNCSASTSQCRKASTAPGAEG/DGPQASGRGGAVTTRGSRASS 
GRGRSVMPLCSRA*DVTUU^RRSQARGSSGKLHLCSGKPLRKRSPCS 
RSVPAPGQSRGRRFTSSPPGFPASPSPCITERSRQKARRRTRSSSSS 
SSSSSSSSSSSSSSSSSSSSDGRKXRGKYKDKRRKKKKJO^KKLKKKG 
KEKAEHSRQS I IR 

7553 

A 

42 

384 

LKSAALSHGGAKQAGDPNLHPLPLRRDWAGPETNPRGKGHQHPSGFP 
LAP * NLPRLGRQR S S TGLGQRGP S G KQPS SHLEGQAVP SGRLAGEES 
TQDGRGTS QTDAGGWMAGM 

7554 

A 

1 

713 

AKGAGLEARGAMAHVGSRKRSRSRSRSRGRGSEKRKKKSRKDTSRNC 
SASTSQGRKASTAPGAEASPSPCITERSKQKARRRTRSSSSSSSSSS 
SSSSSSSSSSSSSSSDGRKKRGKYKDKRRKKKKKEEESLKEEGQRKK 
AEAQ/ RGRAS S AS WDP *DGAHQAY * REM/ DEGPKRKS * PKNDTERS 
TSQATQRGTADGHCQMQSDGYASREGPPHGQGPDGRQLAAQPGQVSG 
CQWEA 

7555 

A 

2 

643 

HALTTAFNIVLEVLVSAVRQEKEIKS\IVIGKKEIKLSLFTDDMIV\ 
YVE YPKIDKNA/ LLE 1 1 SNFT\ RWG YKVNTQNS VTFRVYHQ * AS / B 
YIjKLKKILFAIjEPPK\rEYIjGSNLTKHAQDLYEKNCKTLIKEIKELNK 
WRE I LCG * KY S VFKM S VLLNFVYRFNAVP I Q I P VS C FVG I / EQT/ S L 
KFIW\KAKRPRIANSILIENNKVTGLTLPFFKA 

7556 

A 

3 

578 

TSNECGKTFRHNSVLVIHKTIHTGEKPYKCNECGKVFNRKAKIARHH 
R IHTGKKH * KYEECDKVY\ I VNQVLKDRR IHTGEKAYKCKS \ CDKNL 
GRDSHLAQQTRSHTGEKPYKCTECGKAFSGQSTLIHHQAIHGIGKLY 
* CNDYHKVFSNTTTVSNHWRIHNERF * KCNKCGKFFRHCS YLAVHR* 
THAGE 

7557 

A 

286 

336 

GL * * NTTS / PKI STEYI S P * AKPPVMRGQDSVLYHSDYDEEEWNRVW 
ANVGKSLSCS I AMVDKLI ERDGGSEASGG INDGEKERS LTDAI PSLP 
REDWYEQLYPFILTLKECV 

7558 

A 

2 

246 

RPLKHTLVTCDKGGL*FEPSFVSG/PPRCA*GDGTKPEVPFIGTATF 
YHYLPTAYGCMLTST*RKVEETELDDFYDGNSRRGR 

7559 

A 

3 

349 

GWRLVRPLFMGHRQPVI SGLQPEQPLHLKL * FLC S GVSDCAS FP V/ Q 
S PEPKVVFAKEQLiARRKLQAEAGAS ATLS CEVAQAQTE VTWYKDGKK 
LSSSSKVCMEATGCTRRLWQQ 

7560 

A 

16 

221 

HREMKTGNDLKILQKVNKLWHLYTTAY*KRAIKKSKLIjVHTK\TWVN 
LKGWLSE* SQSQRVKYWSTRP 

7561 

A 

204 

870 

KGNQVSHE YS WKFRTLS LS * KQF * VL»KR\ YYRRAVKLVKMSNI AEHP 
GFMKTGECLRCMFWNNLGCIHFAMSKHNLGI FYFKKALQEE *QCLCT 
APVQVAIiIQVKNFQGRPMCTLLNP IRGYE\ LLYNCG I S \LLHI GRPL 
AAFECLIEAVHVYHANPRLWLRLAECCIAANKGTSEQETKGLPSKKG 
I VQS I VGQGYHRKI VLASQS IQNTVYNDGQSSAI P 

7562 

A 

1 

1047 

MVSISWPRDLPASASQSAGITGLIGALVLSVGIYAEVER/HEI*NP* 
KCLPGSSHHPHPPGRRHVHGLLHWCAGVPP*QPPELLASLRLSRGYG 
LVLS WLEPRYEKM I SGMYLGE I VRNIL IDFTKKGFLFRGQI SETLKT 
RGIFETKFLSQIESDRIiALLQVRAILQQLGLNSTCDDSILVKTVCGV 
VSRRAAQLCGAGMAAVVDKIRENRGLDRLNVTVGVDGTLYKI^PHFS 
RIMHQTVKELSPKCNVSFLLSEDGSGKGAALITAVGVRLRTEASS 

7563 

A 

77 

2246 ' 

DKTSRGTIRQQHSRFTKIRCSAATAADTQANRVWSGPLANSNRPAAE 
GPRDPPASASQSAGI^GVSHRLRFYFDHQDVAPECGAYFFHELAKEK 
RKGVERLLKMQNQRGGPAVFQDILKPGQDEWGKTLEAMEAAMAPGKN 
LNQSLLDLHALGSAPTDPHLCTDFLESHFRDEEVTFMPRKSADLHAAG 
E AAS LGG VS LNVMKG AS LHALCLRLLNS FG PG VATLCGERWHKNNR 
ASALRKCIFWPVSLNSSRARPRQRTTAGGAVPTAALQRVIPASSQCIj 
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PPAPMGTTEATLRMENVDVKEEWQDEDLPRPLPEETGVELLGSPVED 
TSSPPNTLNFNGAHRKRKTLVAPBINISLDQSEGSIiLSDDFLDTPDD 
LDINVDDIETPDETDSLEFLGNGNELEWEDDTPVATAKNMPGDSADI. 
FGDGTTEDGS/U^GRLWRTVIIGEQEHRIDLHMIRPYMKNATTHGGYY 
GEGLNAIIVPAACFLPDSSLPDYHYIMENLFLPPGQQTTPGPVGLTQ 
PAAGVEAGRVGTAGVLTLDPSMLGPCLLCPVSRYVISSLELLVAEDY 
MIVYI^NGATPRRRMPGIGWLKKCYQMIDRRLRKNLKSLIIVHPSWFI 
RTVLAISRPFISVKFINKIQYVHSLEDLEQLIPMEHVQIPDCVLHDR 
QIILTTLCAYGEGEALASHLFHVILPFHIHSCFPSSTETSDPPCGPG 
LDLAYTTPFSTHFAQRGK 

7564 

A 

1 

475 

HIKNMTFIYFSSFIFCFPPRWSL/DSVAQAGVQWRDLGSLQAPPPGF 
TPFSCLSL/LQSSWDLQAPRHRARLIFLYFFLVEMGF\TTLARMVSI 
S * PRDPPALASQS AG I TGVSHRARPETYNFCS Fill S SLPVEPEARK 
MTKNSDIGEIWINLGTCLIV 

7565 

A 

3 

364 

SAIIV*DINT\PIMGGTTRQMINKEIEDVNSAINQEDLLVIYRTLHT 
AAAEFTFFSSLHEVSSSI \DMMLGH*TSLNKF*S IEI IQNI\SDHKG 
I KFE INNKE I * KIHKYVE I S STLLSKQG 

7566 

A 

188 

1057 

VIKAAEGDSGSAWVLDEADPTGGPGLGCRGTKYLLRSGAGGLSD/GG 
PCGPGTDLALS PKP PGLHGP PS GPALAG * D \ LSRLLPRAHSENWPGA 
AGGGSHRA*VGS/ADQADNDTEGNNKDPGRPQAQPSCRGSLGPATGR 
SGWPYGSP*PTWGQGPVSTPREGPAGVRIEAKGKS/NVGGTGGMAPS 
PSRTCNLLSLPKLSGW*EAWNPGRADVHIMHAVGLIKTSHEWLDPP 
ADSQAGSCGASGQEPQPAASGSAAWRRCSQGGLGRL.RHQQQEEGPSS 
QHVGVGCVPCGG 

7567 

A 

2 

380 

AR* LFSRVNHMVGHKLSLNRF* KIDI IQS I FYDHNSMKLENNNKRKT 
GKFINLW/RI I * 1 1 YL*TTHK* KKKI TRE I EKYL EMTENRNKT YRNL 
WDAAKWPKG\EFIAIGA\SL*KGGKSPIQVS 

7568 

A 

1 

371 

GTSAWPQYKLDNDSK*PENGTFEFSILQVIiDNSCHKMGKWSBVPDVQ 
AFF\SHWSLPSLCSQC/GLIPNLSSFSPFWSFG/PPPQVPSP/TESF 
FSMDSSDLPPSPQAAPRQAEPGPNSHLASAPPPY 

7569 

A 

3 

369 

HEVKTMKI LEENLGEYLLDLVGKDF INRT/ PETI S *NKI YKFNF I KI 
KNFCSLKDILKEIN\RIDWEKIVTKHSDKGLVSRIKKFHDSV/YKMR 
TQFFPPKDKDLGRYFTNKET*MVASGHLKMC 

7570 

A 

1 

2722 

MGSAVPSQACLEALLLIVPWGACGVSQEEEESPAEGSKDEPGEQVEL 
KEEAEAPVEDGSQPPPPEPKGDATPEGEKATEKENGDKSEAQKPSEK 
ARAGPEGVAPAPEGEKKQKPTU^KRRMVEEIGA^ELVVLDIjPDLPEDKL 
AQSVQKLQDLTLRDLEKQEREKAANSLEAFIFETQDKLYQPEYQEVS 

TEEQREE I SGKLSAAS twledegvgattvmlkeklaelrklcqglff 

RVEERKKWPERLSALDNLLNHSSMFLKGARLIPEMDQIFTEVEMTTIi 
EKVINETWAWKNATIiAEQAKLPATEKPVLLSKDIEAKMM70iDREVQY 
LLNKAKFTKPRPRPKDKNGTRAEPPLNASANR7VFPSPLTPWADQNVL 
STVSPYPKILGKGETIWVPRVFGRMVGKSREAVAQAMVLiEMFREEDY 
YNDDVLDQMGASILGVEGPRRHPDEPPEDEVFELFPMFMGGLLSAHN 
RAVLAQLGCP IKNLDALENAQAIKKKLGKLGRQVIjPPS ELLDHLFFH 
YEFQNQRFSAEVLSSLRQLNLAGVROTPVKCTWAAVLGSGRHALDE 
VNLASCQLDPAGLRTLLPVFLRARKLGLQLNSLGPEACKDLRDLLLH 
DQCQITTLRLSNNPLTAAGVAVLMEGLAGNTSVTHLSLLHTGLGDEG 
LELLAAQLDRNRQLQELNVAYNGAGDTAAIjALARAAREHPSLEIjLHE 
VQRNLNSWDRARVQRHIiE/HP/ SCGIWKIAGVPPLILGARPSCCEWR 
ARAPSKSPLRRCPNSHPEPASVM*PA*TNLSIPWCCVASSLLRRCGS 
QSL*PSS/HPAAVSMETYHVTLTLPPTQRGEEQVELSTIEELIK\VP 
*SAPSQP*WSTSGLALPQEAWYPVRSHP*CPRIjSQPSPDLTGYSYGS 
FGHLTWEMSWKAPRNCAV j 

7571 

A 

2 

395 

LSNNPLTAAGVAVLMEGLAGNTSVTHLSLIiHTGLGDEGLELLAAQLD 
RNRQLQELlTVAYNGAGDTAALAIiARAAREHPSLELLHEVQRNLNSWD 
RARVQRHLELLLRDLEDSRGATLNPWRKAQLLRVEGE 
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7572 

A 

1776 

3257 

TNLSIPWCCVASSLLRRCGSQSL*PSSPPLPSPWRPTTSL*HCHQHR 
EVKNKWSSPQLRN* SRMP * S APSQP * WSTSGLALPQEAWYP/ REKPP 
MMPQAQPAI PRPNRLFLRQLRASH/ CGK/ WS WKAPRNCAV* LNSTTP 
CSRSGPSPRGLDPRWSGQKTWHLLGSQA*ATTPGPLMEL*/ELFNPD 
ACGRR VQTWL YGTVGTGKS TL VRKMVLDWC YGRL PAFELL IPFSSS 
LCQLVAQRYTPLKEVLPLMAAAGSHLLFVLHGLEHLNLDFRLAGTGL 
CSDPEEPQEPAAI I VNLLRKYMLPQAS ILVTTRPSAIGRI PSKYVGR 
YGEICGFSDTNLQKLYFQLRLNQPYCGYAVGGSGVSATPAQRDHLVQ 
ML SRNLEGHHQ I AAAC FL P S YCWL VCATLH FLHAPTPAGQTLTS I YT 
SFLRIiNFSGETLDSTDPSNLSLMAYAARTMGKLAYEGVSSRKTYFSE 
EDVCGCLGGWASGRRRSFSWLHIFRSGWP 

7573 

A 

201 

1436 

PCDPREYP**ATSPTRRSWPWSISHPRPGS/HPIJVLSQLFNPDACGR 
RVQTVVLYGTVGTGKSTL VRKMVLDWC YGRL PAFELL I PFS CEDVSS 
VGRAPASMCQLVAQRYTPLKEVLPLMAAAGSHLLFVTiHGLEHLNLDF 
RLAGTGLCSDPEEPQEPAAI I VNLLRKYMLPQAS ILVTTRPSAIGRI 
PSKYVGRYGEICGFSDTNLQKLYFQLRLNQPYCGYAVGGSRVSATPA 
QRDHLVQMLS RNLEGHHQ I AAACFL PS YCWLVCATLH FLHAPTPAGQ 
TLTS I YTS FLRLNFSGETLDSTDPSNLS LMAYAARTMGKIiAYEGVS S 
RKTYFSEEDVCGCLEAGI RTEEEFQLLH I F RRD ALRFF LAPCVEPGR 
AGTFVFHRARHAGI PRCPLHCAGFAQDDPAKGGQGSG 

7574 

A 

1 

377 

PFLPPYMKINSR*SKDINVNFKSGSSSSSSSSSSSSSSSSSSSSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS \FSSSSSSS 
SSSSSSSSSSSPL*KKNTPIKKWVKDMDRPRA 

7575 

A 

288 

468 

KAEAGGWLEPRS SRPAWRREAGAGQSPGPG\GRGCSEL * LCHCAPAW 
TTE * DLAS KKKKQK 

7576 

A 

1 

378 

YRKP/ TADNI INGERLDAFPLRPGTRQG/ TPFLLLLFNIVPEVLVRA 
IKQDL\ ETI I IVI * IRKEEGKLSLCTDDMI FYVENPKEFIKILLEQI 
SS\QVAEYKINIQKSWFLYTWNEQSKYEIKIISRA 

7577 

A 

640 

1093 

QHKNLVL * S GRR * LLS QQEE S / G I F FTL V I TTFC WER * NRR I LS Q I * 

% M m /**V *vs TV T T TT^/\n f\t T V~ T\ TV T»T.TT> TV T/ 1 TV O t* TV T" TV ^> ^V TV TV T\ MT\rtlt.1 

RMTG AR I H FQEP V I P AP WEAKAGG S LE I RS S R P AWPTW 

7578 

A 

254 

701 

KTGKEGPVRWLMPVILAPWEAKAGGSLEVRSSRPAWPT**NPISS/K 

r% 1/\T T T^"¥* ▼ Ti t^T^iT ✓~ffT> TV* TT t"T Tfc FT* W TV ~t\ TV m ^ TV f> T TV TV **"V V^l TV /*V ^**V #V TV T^T\ *V ^"TT T ^ m TV y% 

NS KN I KI S LAWW CT P WLATEEAET * E S LEPGGRGCS E PRS CHSTPA 
WVIGAPSQKKKKKEKQARRSWLAGHSGLHLYASYAGRCGRVGGLLEP 

KJM oKAAWA 1 

7579 

A 

12 

154 

CLCMSTCMCA.CVCVSVCMHICVCDCVC/CVCVCACAYI*LRVCVLVC 

7580 

A 

139 

351 

AIMAPNFPS*VKT*SQEDGQTPNRIKLKKSTSRHII/IIAEN*/LAK 
*NILKAVREKWLITCRRTVISMTMDF 

7581 

A 

17 

245 

GG*GCSCSEL*SCHCSPAWVT\SKTLTQKKNPQF*QW*NI*IPCLL 
IDFSEFGLFLPKSYCYQCKVTLQCSKSLL , 

7582 

C 

82 

300 

MLGIFEGLQINQLVETGLKRWLRLRNS FLKRFCYLLKSLSRSSRTTT 
TTAFVCDGRWIMQIPGSVLPFPQCW* 

7583 

A 

1 

772 

KENLITFPRIQPISKDDFRRYRGAIARKRIRLGRKYVISHKEEVPLC 
IHWDISKQASLSDSNVEALKKLAAKRGWEVTSTVEKIKIYTLEEHDV 
LS VWVEKHV/ VEVQH I * L I RLLSDFTKRPLW\ DPHFVS L * S HRLG * V 
KDDQLY\HITCPILNDDKPKDLVVLRIHEGKPLKDGNTYHS/VAVKS 
VILPSVPPSPQYIRSEIICAGFLIHAI*QQFMHSYLTLTICLLGILP 
YFGWEILGGLGPKSIGRKQQPLVYSS 

7584 

A 

3 

307 

RFTEHHRTLTGEKPYECKECGKRLSSNTALTQHQRIHTGENPFECKE 
TQKTPVQA 

7585 

A 

1 

375 

TGVTPAATQASWTPP^IDGPTTQEKLS*ANTAPGTEGEQQVHGEKKE 
DPAVPSAPP*YEESTSGEGRKAWAFPPAPTGEPFHPNWAYVDPNSNS 
RYDNGVP\TGDHEL*TTFSWDDQKVRRVFVR 

7586 

A 

1 

512 

AAPESAPRLPGRRFRRGGGKRRRQAAVTQAATRASGTPSPRDGTMTQ 
GKLSVANKAPGTEGQQQVHGEKKEAPAVPSAPPSYEEATSGEGMKAG 
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AFPPAPTAVPLHPS WAYVDPS S S S S YDNGFPTGDHELFTTFS WDDQK 
VRRVFVRKVYTI LL IQLLVTLAWALFTF 

7587 

A 

1 

401 

FFFFNYITRYNKHYRYY KMSKKRCHKI FQMIDMLNAASCAALGRHSR 
WVLGGAHPTPP/GSAQAPAPWSHAMGARTLPPRPFRS \ FGHSSATPK 
VS PSAAMPTEAQRP/LPGCGPGSDFAGHPQC * PPQVQI PTSC 

7586 

A 

49 

430 

KWWKVDCVGDMLECASIGRSEAFKTSSFQFSLATSEVCGHSSATPKV 
S PS AAMPTEAQ/ QTPPGCGPGS \ ALLGIRSADHPKCRSPLRVPPPPR 
STSKVIiAPFLAS.SDHLEHTCSSQVRLPAAVRGAST 

7589 

A 

467 

1518 

LANCRC^PGGHWMQEMVRELWMWNVEEEEHEVG I CTWGGQHCGCPAK 
SLPGPHPGGVSAPQSASQLMVKLLVWQKSVHKLRKVSATSSIAVYPC 
PGQSSGGAESPAPGPGLAGWSHLCGAALAEVQAAPVSQAAVSDASLG 
PEWSQEGCRPGLTSGQHGGRDGR*RGSQRQGDP\QSPSPGIPASSAA 
L*VGLVLVKWRPVHKAYDPVPEASPLSANFRNPGPAEAPGPMGKTKG 
RGS AGGA\ G AG S AGLAPPPRS S QGAAGI GQ * PGLPEEAVHPCKGS CP 
ASSLSAGTPRAPPPHCPCLGPRRQGPCSHGVTPGCRSLG*AGGVG*A 
PPRTQRE WRPRAAQEAAFNMS 1 1 

7590 

A 

296 

423 

MVT * KL/ PYKKSQGLGTFTDEFYQILKEELiI S ILLTLFQKIQKE * VT 
YKKSQGLGTFTDEFYQILKEEL I S I LLTLFQKIQKE 

7591 

A 

3 

168 

VYTKILRADRDMLTGYPCCSTGLLTADD* * *DNTLEV/ IALRRGPFY 
LEGLDLSPK 

7592 

A 

19 

406 

PLQEIiGGQSLSLHYVGGRAPGPPAPGHPPVQSLQRPCVPTQQLLTPE 
FWCPAIi/ RQQQ*GHRQGTG\ PVPTPGPPLKGQWQQGPPQKEWQMRPK 
AFGHHPGGLCQSLGGAPRRWNRDTYQAGPRQGCGALM 

7593 

A 

2 

3532 

SGSGVRCALIfGATCLFACVPLSSVALESRRAAPSLDMSYNYVVTAQK 
PTAVNGCVTGHFTS AEDLNLLI AKNTTRI* E I YWTAEGLRPVKEVGMY 
GKIAVMELFRPKGESKDLLFIIiTAKYNACILEYKQSGESIDIITRAH 
GNVQDRIGRPSETGI IGI IDPECRMIGLRLYDGLFICVI PLDRDNKEL 
KAFNIRLEELJIVTDVKFLYGCQAPTICFVYQDPQGRHVKTYBVSIiRE 
KEFNKGPWKQENVEAEASMVIAVPEPFGGAI I IGQES ITYHNGDKYL 
AIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDG 
TVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDS 
NEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSL 
RIIRNGIGIHEHASIDIiPGIKGLWPLRSDPNRETYDTLVIiSFVGQTR 
VLMLNGEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQ 
EPKALVSEWKEPQAKNISVASCNSSQWVAVGRALYYLQIHPQELRQ 
I S HTEMEHEVACLD I TPLGDSNGLS PLCAIGLWTD I S AR I LKLPS FE 
LLHKEMLGGE 1 1 PRS I LMTTFE S SHYLLCALGDG ALFYFGLN I ETGL 
LSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFS 
NVNIjKEVWYMCPI^SDGYPDSLAIjANNSTIjTIGTIDEIQKIjHIRTV^ 
LYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSS 
SVSSSKLFSS STAPHETS FGEEVEVHNLL I IDQHTFEVLHAHQFLQN 
EYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIWFQYSDGK 
LQTVAEKEVKGAVYSMVEFNGKLLAS INSTVRLYEWTTEKDVRTBCK 
HYIWIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNP 
NWMSAVE I LDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLF 
HliGEFVNVFCHGSLVMQNLGETSTPTQGSVIiFGTVNGMIGLVTSLSE 
SWYNLLLDMQNRLNKVI KS VGKI EHS FWRS FHTERKTE PATGF IDGD 
LIESFLDI S RPKMQEWANLQYDDG SGMKREATADDL I KWEELTR I 
H 

7594 

A 

3 

3 00 

KGPWKQENVEAEASMVIA VPEPF GGAI J IGQES ITYHNGDKYJUAIAP 
PIIKQSTIVCHN\RVDP*WPQDTLLGDMEGRLFTALQKNPTEKKIPQ 
ALQQAN 

7595 

B 

1 

819 

MARGSAGLVLGRGRLHDWQLGSGHSAPGMLGQVAIiAGRTELAVPSLD 
TPGGGEQRASDLSRTLSSSHESDHTVRAAHSPLPAEAIiTRSLPCLQN 
VAHVRIj Y I KCQLDFGGDS IAVGTPWYPMVWI PWWIPWWIPWWME 
HTVLIHLPVGGHVGSSHLSAPVNNVCGNVAEEPPALCGTKASKPPLT 
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PPPIRSKAKFLGGSVCLGRHNKLSQTRWLKQQTFISHDPGGWKSNIK 
LTRPAQDLLMSVCMNCNTHGSEDMDVI SHLIKIRLKPK* 

7596 

A 

2 

3235 

SPASPAGRSAAHGSASPSGSPHPSWCTVGRSSCWSSPSP*ASSGSGT 
PPTQPGRCSLG*MRRRPAGGAQAFQLKGKKWRRFGGW*PRWEAGCFQ 
GEW*GATDSRTASNQLGPPHLQHLHINPPARQPPAFNRVVKQAPWRP 
QARAGAPG\KPDDQDAHQRVLQNGHLGAQTQEEVAVTLGWPWPLW 
GLRLLCGTAPCAGSARSP\RGELADGGDHSVEPPDGIPLDG/EQSGS 
GMSGALPGSSFLLCPRRCRPPSPAI,*CPASWTRPGRR*CAPRSAPAG 
AARGLYPEGTPGSPTPHPRGRGTGS*TPSPGTAAPCAPA*APGRRPG 
S*SRAWSA**SP\*LGPRGPQVWLVGQ/QGPGAPWPGTWAPALSCA 
AVRYRGPACCSGSGRWRAH*RSPT*WWCPGCPCARSAAP*CTA*SG* 
GGPWA*GGS*SGVR*TSMSSEPC\GCSSCRRTSAGPGPAGSAWGSAA 
TS PASAPPAAAPPYSRRWR * GAAGGSWRRCRWAGAGSAGPAACPGHT 
PPRSPRALRTWA*AGPP/GIGAQLLLRDQQLRLGCPPPPLREGCGLIi 
PAPRSPATRP/GLPLQQHAPGGHSGVE/GPSGCCPQTSYSEGCPPRS 
APPGSPRCTP*QRSRASSRPPRRRP*GGYRPAQTPGCSPWP*S*WRR 
PAAGG YTEPQ* RPQWRRCWA* S WPGT * GSGRGCCQAWRG* PSAASRP 
RTARPRSCHRCPPA/ paas isssstvgsts IGTTDGSLIDGEMARFS 
ATAASAR/ PPQGTQRGCGGEFGAFGETGEGRGTGQRGRRIPIjA\ PEG 
RPEGAGFSRVDAVFDS LA* PFEPRAMKS PGGGCP / GGFGRGAGPAHR 
GGLGPVHPAPSLAAPGGCGVTSGPSRPERRRRHPATPESBTDLTALE 
DPTS GTTGRKTVRREAGPEGGARGP/ PAPA* CLQRPMAAGGGLAGAA 
RMWSGPGGSAARGR\ PGL\ PFTGGAGLPGKSSDPLAS/NERGPVPVP 
PPGRNSLLPEAGGCPGSMHQALAGWCPPAQAVLWRRGSQRIiFTIAGC 
SCWARAPPGSEPPNMQTPASKQKLRGS *VGEQ/ PSGWRLGSAFQQLS 
GGFCPAGADDTAGAGA 

7597 

A 

2 

1066 

HMILCV^NSKDSTKJO^LDLINEFSKVAGYRINIQKSMAILFTNNNX, 
AEKSRKQCPF/TDNIKHLRIHIiTKKMKVHQYSLHTENHKTLMKYIEE 
DTNK*NDIPCS*IGRINWKVSILPKAIYRFNAIPIKISSAFFTEIE 
K\IIQFLWIHKKSQITKAILRKKNPGGIILPNLNLYYKTRVIKTIQY 
WHKRYTGQ/ WK* PE I NPNMYG * * I FDKS TKKTK \ WEKGSIjFNK\WCW 
DF\ TCKRMKLG\ PYITPYTKINPKWI KDLNVRT* SHKSRKENIRGKA 
PWTLGFGQCFIiGYHTKTSGYKSNNK*MKLYQSKKLLHRTKTINKMKR 
QCTDKEKI FANHI SGKVLI FRI * EELIQLNS 

7598 

A 

3 

333 

TLLPE I QAYTQEQWLGS PEEATALAERjjQQES AGPGLQMNGGijVLtGA 
WVCLPP/ SSRPPAEPSTGAQS WVGG* EGPGGCGKVMVTGGSLWLAW 
TAESEISSHTCVLRIVS \ 

7599 

A 

252 

686 

HLFRQLFLEVLMSLQSPCLVAHSASRWCTAPVTADAPSLWRALALWL 
WLAHGQGPGTGWPVLSLRRRSASRLAASPSENPPSRNIH*SPQRPVC 
SRGPSGPATCPLGLPRTSPGATRPGPVSPLPSPRSGIAAAPGLAPAS 
TPP 

7600 

A 

156 

889 

FRSSVSPGSGPPFSVSVTGSGCPS**HSGRPGVLV1.KLPGQS*ASLP 
PAFPGGSDVAAGRGLSSAADCFSLTGLGTLiATPVIjFLTSNIiKIjLQES 
VPK*IjNSASS/PGVPRP*LQMHRLCGG\PRSIiAVVGTRPGARNWLAS 
AFS / VGGGQLLGLQHPLQKI PPSRNIH * SPQRPVCWKAGRRKTKPRV 
DLQALETCPLGLPRTSPGATRPGPVSPLPSPRSGIAAAPGLAPASTP 
PLSPGPTPPLPP 

7601 

A 

231 

457 

QWLSLLPLGLGEDTKSLRANTSNLELHSQATIWEEGGPTSSSLWRPF 
NFPGSPTNRNPKLMPAVQPPHPSG*TPSVT\WLHISWRWSP*GQSKA 

PPTSCSSTDREPPAWAHSTDPPSCDGTAKVPSSAGSKLGRKHHPRAA 
VGSLGAS SHNIiKAMGEEPTL/VTALRGSKVTTV/SETTRPRJCKLSTD 
QYT * APPTGSHPKASTPKIRCCVS PFSRCYEE I PETG * L I KERGL ID 
SQFHMAGEPSGNLQPWRKAKEKQAPSSQSGRTE*VPAGEMLDTYKTI 
RSHETHSLSREQHGENHPYDPITSTWSHP*HVGIMGITIQGEIWVGT 
QSQTISLTYPSP*NQTKEVSNK*RPPYGRREDQPLPPCGGLSTSLVP 
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QQTGTP S SCQQC SRP I P VAEHPQ 

7602 

A 

2 

289 

IACYSPPFGGP/PSSPPPRAGGSNPPGPPGLTPFPPKK/AKKLL/RP 
SSSPPLFPPLGRVGPGDSFYPGGEGFP*PRVPIT*NWGPFPPPRGTK 
KKLPF 

7603 

A 

3 

805 

DRVSLLLPRLECNGAISPHCNFRRPGSSHSLASASRVAGITGVCHHA 
*LIFVFLVEMEFHHAGQAGLELLTSGDPSPPWSPKV/YRITRHGATA 
TWPYIFKNFYPTRPS*NSLSLKRTS*IQMPAL*MERLQFVCPS/LAE 
DLTKF*EKQDSKFTSQNREN*AFSRRS*GMLEKD*KWTTSDIV*VFV 
LFKSHVEMLS PVLEVGSSRRFLGHGGRSLMNGWVLSLW * * VSS YS IS 
SCESWF*KESSVFLTFLLLPFWPCDMPAPPLRLPP 

7604 

A 

75 

485 

EIQLVLSLKTSLNKFKKIEIISSILSDHSGIKLEINSKGKLQNHANT 
WKLNNLLLNEHWVKNKI KME I KTF/ FELNDNNGTAYQNLWDTAKAVL 
RGKFTALNAYIKKYERAQTDNLRSHLKELEKQQTKPKSSRRKKK 

7605 

A 

3 

1105 

HQNL S TCACFS S TCTG I RMMQRRLAWS LHKDD AQ I RE VFH I KTKKKK 
KQLQQNPPKETLSSDTPSP7VPTQWSYPTHPVHGTFSKTDHMIGYKAS 
LNKFKENEIISSTLSDHSGIKXEINCKRKLQNYANTSKLNNLFI^JDH 
WLKNEIKMEIKKFFELNDNSDTAYQNLWDSAKAVIiRGKFIALKAYIT 
KSERAQTDNLRIKNKNHMIISIDAEKAFDKIQHPFMIKTIiSKIGIQR 
TYLNVIKAI YDEPTANI ILNGEKHAVGIWNKEAKGKILAKRINVRI 
EYIKHSKSQDSFLKRMRENDQKKEAKKRGTWVQLKCQAAPPREVYFV 
RTNGKESGLLEPIPYEFKHNRCKNIKDLWTVKMFLFIK 

7606 

A 

802 

1204 

VLEVLTRAVRQEKEIKGIQIGKEEVKLLLFADDMIMYLENPKDSSKK 
LLGLMNEFSRVSGYKINVHKSVALLYTNSDQAENQIKNSAPFTW/S 
KKK* LND * I KYLG IHLTKEVKDLYKENYKTLLKES IDDISK 

7607 

A 

3 

421 

RVECWGLE YDRS AAASNTI TQVWTH I QALSHEQ/ S AAYTS FSNKH 
ESFSRIHLLLGHKLGLNHF*NVGIM/PN/IFSNPNRITLEINTRSKT 
EKSTNLWKLNNTLLNSQQSREE I PRE KENT * NENENAAAQHLQDAAK 
AA 

7608 

A 

1 

341 

RMGTCVNPRSLKGFGPRPGGKKKPQSSPLKS/SPKMALMGGNPGGAP 
PFFRFLFFNFRFGPKKSSSPAPIFFFLKKGL*INFGPPREPGGFNQK 
PQFPSAPGFEPWNPPQGPKP 

7609 

A 

1 

409 

VWCNS LRHLTNTMET I D WKDFE S WMLHWLL YEMS RHS LEQKPTDAPP 
KVLTKCQEEV I H I RAVHPGS FKP KRDENGNYI* PLQC YGS I R YCWRVL 
PNGTE VPNTRS RGHH I CS ETLELEDPS SGLGGTKHDLGP VPM 

7610 

A 

3 

644 

GDQRDLISNNEQLPMLG/ERPGAPESQRPAQPSRDTRPPLPSPKIAS 
PMCSLKKG*KAITPMPIiLPQSPKTPKP/VSKMRMATPLLMQALPM/D 
AL PQG PMQNATKY/ GNMTEDHVMHLIi / TVLTKC / QEEVS HI P AVHPG 
S FRP / KCDENGNYLPLQC YGS / TG YCWCVFPNGTEVPNT / KSRGHHN 
CS APS PS P I SHLYL I P * D PGAWLFRHPGQDKPSRNSR 

7611 

A 

1 

1032 

MHRRRSRS CREDQKP VMDDQRDL I SNNEQL PMIX3RRPGAPE S KCS RG 
ALYTGFS I LVTLIiLAGQATTAYFL YQQQGRLDKLTVTSQNLQLENIiR 
MKLPQAS \ KPVSKMRMATPLLMQAIjPMGALPQGPMQNATKYGNMTED 
HVMHLLQNAD\PLKVYPPLKGSFPENLRHL\KNTMETID\WKVFE\S 
WMHH\WL\LFEMSQPLPCSKSPLDAPP\KVIiTKCQ\EEVSHIPAVHP 
GS F / RGPKCGRGTGNYLA I PVAYWGASATC\ WCCLFPNG / TREVPNT 
RSRGHHN\CSESLGTWKDPSFLGI,GVLTK\QDLGPVPIVRASRGGLQ 
HPASPTQLQLSCSLQPPAPPPSPTL 

7612 

A 

59 

321 

LSGSPLVTATSQVPGLSRGKTGGPDPAL/SALPAPPTSPISPPPPSR 
SSLRRPRAPSGCPWGCPGSLQQAPGPLHWQDYRPGVRVRLPG 

7613 

A 

1 

465 

DEEDGENAHPYRNSDPVIGTHTEKVSLKASDSMDSLYSGQSSSSGIT 
SCSDGTSNRDSFRLDI?DGPYSG\PYCDQATLHTDFTPSPYDTDSLKI 
*KGDIIDIICKTPMRMWTGMVNNKVGNFQFIYVDVISDEEAAPMTIK 
ANRRS IS ILSKTLQ 

7614 

A 

1 

2119 

LSLFLDLNENDFLSNNIHTYQGKTLQGTSYQAIKSGSDPVESMGTLK 
RLQKLWTKKARVQNLDEVKPTLINLQDEDDTLISCLKIjTKSREKKV 
NSVSTRRKEEMEIRLDTLSASLGRSSTLNNCl^EDKLAWYEGEAYMW 
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HHWKPFPENPLWTCLDFQIAQVGPWDHCSSCIRHTRLKSSCSDMDLL 
HSWVTLCRQESSNITCWTKLREKRGDLFCILFVCFIDRKTPRKFSDQ 
QALEILKIGRKQKVHSLGWRIQNAQEKAIQCFREGETSKTKEYESV 
IKFYPKQRSSSFGNFDRFRNNSLSKPDDSTEAHEGDPTNGSGEQSKT 
SN\ KEAVWGKKMRAI S WTM\ RKKWGKKY I KALSEEKDEEDGENAHPY 
RNSDPVIGTHTEKVSLKASDSMDSLYSGQSSSSGITSCSDGTSNRDS 
FRLDDDGPYSGPFCGRARVHTDFTPSPYDTDSLKIKVIGEQGCLVTW 
KGD 1 1 D 1 1 CKTPMGMWTGMLNNKVGNFKFI YVDVI SEEE AAPKKI KA 
NRRSNS KKS KTLQE FLER I HI/QE YTS TLLLNG YETLEDLKD I KESHL 
IELNIENPDDRRRLLSAAENFLEEEIIQEQENEPEPLSLSSDISLNK 
SQLDDCPRDSGCYISSGNSDNGKEDLESENLSDMIiliTDPLQLSPQLP 
QRG AGN I NRR KTG F S LGHG KLNKEQRKDKKRIi I QG WEVAEEIX3PS AA 
FG 

7615 

A 

3 

452 

QTQREPTMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFP J ri l 
KTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDL 
HAHKLRVDPWFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVS 
TVLiTSKYR 

7616 

A 

1 

475 

PDSSGPHRLRENPPMVAVSCPTKTNVKGPPGGKVGAHA\AQYG\SEA 
LGR\IFLSFPT\TKTYFPHFDL\SHG\SAQV*GATGKKVADALTNAV 
AHVDDMPNVAVRP * SDLHAHKLRVDPVNFKL»LSH\ CLLG * PWPAHLP 
RPSFTPCGGTPSLEQSSWASC 

7617 

A 

26 

461 

WESCGPHQRTG*AGSSRCGWRRRAAGPGCHQPRAGVPGRTCRAGRSP 
AAD AAGRC WLGG P P P PGCG PRCLR * CAGHPP AG S ARVWS WAGAS TG 
LGRLWGRRWSPSCHRCQSPGCQAGRRGTARLSWPLSRRTSCRIPAAR 

VYPR 

7618 

A 

3 

420 

LLDPSDGKSCGPTSKDCRRRIRQYSVEALPRPALPHTTRGSGRKEAR 
GEGQSGAGAPADPDKLEAAVAVGEDVQQAPGCHQPRAGVPGRTCRAG 
RS P ATRGTRDGTA/ QGRAG PATVP I P PG PWGGPTCRRCRTLI»ARRS 

7619 

A ! 

2 

227 

AGVFCDAGLTFTS S SGQQTAQRAELLCPQPAALRRRSEP FS VHLLDI 
S \ GE YLLGTF Y YCVENLRDG PCWADAV 

7620 

A 

11 

3 94 

SEFCDWVADLDVQKYEQPS / HEALTSMGSTEARVDYMGS S I LMGI FS 
NADLKLQDEWKWLYNTLDSSITDKSEIFVHGDLKWDIFQVMISRST 
TPDL I KI GMKLQE F FTQQFDTSTRAL S TWG P VP YI» 

7621 

A 

3 

185 

FFFFFER/CPTLAQARVLWCLHSSPQPQSPW/VSSNPAS/ASQAAGT 
AGMCHRAWtLKTEVGLD 

7622 

A 

1543 

1745 

FVKSKALAFFLSFFFFFFLKQSLSVTQAGVQW\PVDSLQPLPPRFKR 
FSCLSLPSSWDYRCVPQCPAN 

7623 

A 

1 

259 

SENQEQEEVITVRVYDPRVQNEGSWNSYVD\YKIFLHTNSKAFTAKT 
S CVRRR YRE F VWLR KQLQR YAGL VP VS EL PGMPTL YGL Y 

7624 

A 

3 

265 

CCHPRLACSG\MIL\AYCQS\RLPGSKRFPCLSLPSSWDYKQPCHHT 
RIILVFLVET\GFHHVGQTGLE/MLRNSGDPPTLASQKCWDYKA 

7625 

A 

2 

376 

RAALTPSTYKGAPS PK/ GAPHST\GVTS AS PKEAPTPPAVTPPSPEK 
GPATPAPKGTPTSPPVTPSSLKDSPTSPASVTCKMGATVPQASKGLP 
AKKGPTALKEVLVAPAPESTPIITAPTRKGPQ 

7626 

A 

2 

427 

IFPPPKRGFFSKKKGWFNPPASSSRFFARPPPGVGPPPQVSSSSSS 
PFYFYASSSSYSSSSNFPS/HSSSSSSSSSSSPRQQHPHLYPHCRLG 
WAERAW / P PGTPLYHYKTQRKQGTSASPGRGTTCHTPQS PPLPGHHR 
CPW 

7627 

A 

3 

255 

TFIRTVGWAGQTRLAPGTPLYHYKTQRKQGTSASPGRGTIiSRREVTV 
I KTLLGRTLQP PLGATKPAPPR S KRFMGDTAS LDDW 

7628 

A 

48 . 

557 

AGLGKARLAPGTPLYHYKTQRKQGTSASPGRGTTVTQRGHCYQNAPG 
RTLQPPPCGQQTSGH*GPGWPYESQRIiWKTPQRS*Q*FL.HPPPR/HC 
HTPACGAASASPCVGRYCPRCSCAPARSASLAALTRETAPLLPSPSW 
APGDPTLEGWPGNGSPLPPRGM*PHLYPHCRLGWAKRAWPRGHHCIT 
IKPRGNKEQVQVRGEGPLSRKEVTVIKTLIiWHFSHHLAASKPAAIE 
GLDGliM S LRG YGKLHKDLDNDFC I LLLAL PHS CLRS SFCI TMCRALL 


low© 
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PEMLMCSCSLSISGRSDSGNSSSSSFSFLGSRRPHVGGLAREWFTTP 
TEGDVTGIEDGL 

7629 

B 

122 

1028 

MENGQSTAAKLGLPPLTPEQQEALQKAKKYAMEQS IKSVLVKQTIAH 
QQQQLTNLQMAAVTMGFGDPLSPLQSMAAQRQGALAIMCRVYVGSIY 
YEI/GEDTIRQAFAPFGPI KS IDMSWDSVTMKHKGFAFVEYEVPEAAQ 
LALiEQMNSVMLGGRNI KVGRPSNIGQAQP I IDQLAEEARAFNRI YVA 
S VHQDL SDDD I KS VFEAFGKI KS CTLARDPTTGKHKG YGF I E YEKAQ 
S S QDAV S S MNLi FDLGG Q Y LR VG KAVT P PM PL>L T P ATP GG Li P P AAAVA 
AAAATAKI TAQE AVAGAAVLX * 

7630 

A 

3 

408 

KSMTEAEQQQLIDDHFLFDKPVS PLLLiASGMARDWPDARG I / W I ETL 
FKS KDYEFM WN PHLG Y I LTCPSNL1GTGL1RAGVH X KHEKr ob VJL» JvKJjK 
LQKRGTGGVDTAAVGGVFDVSNADRLGFSEVELVQMWDGVN 

7631 

A 

1 

170 

KFSEVLKRLRLQKRGTGGVDTAAVGGVFDVSNADRLGFSEVELVQMV 
VDGVPTMHP 

7632 

A 

1 

412 

S YEVFKELFDPI I SDRHGGYKPTDKHKTDLN\ PNLKGGDDLDPNYVLi 
S SRVRTGRS I KGYTLPPHCS RGERRAVEKLS VEALNS LTGE FKGKYY 
PLKSMTEKEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGI 

7633 

A 

2 

1250 

LPTDGAPPPPPGRPPAAAMPFSNSHNALKLRFPAEDEFPDLSAHNNH 
MAKVXjTPELYAELRAKSTPSGFTLDDVIQTGVDNPGHPYIMTVGCVA 
GDEESYEVFKDLFDPIIEDRDGGYKPSDEHKTDLDPDNL.QGGDDLDP 

ny\vlissrvrtgrsirgfclpphcsrgegra\ieklavb\aiissl\d 
gdlagryyalksmte^qqqliddhflfdkpv\splllglghgrrnw 

PDARGIWHNDNKTFLVWVNE\EDHLRV\ISMQKGG\NMKEVF\TRFC 
TGLTQI ETLFKS K\ DYEFMWNPFHWATSIjP\ CHCNLG\ TGLRA\GLH 

I \ KLPNLGQGMRKFSEV\ lkpaaelsksrrppggvehglrwagvfdv 

FQTLNRPGPSQSVEIi\VQM\WDGVK\LL\lEMEQ\RLEQGQAIDDIi 
HAWPKK 

7634 

A 

1 

175 

EQLP YFVTF I SMPATT/ EGRRGFSLS VESACSNYATTVQVKWNRMH 
ISPRALGTLHHN 

7635 

A 

1 

83 

SVTLFSMPATMKGRRGFSVSVESACSN 

7636 

A 

61 

2028 

GELLCPPPPPPHFRAPGTMSALAGEDVWRCPGCGDHIAPSQIWYRTV 
NETWHGSCFRCSECQDSLTNWYYEKDGKLYCPKDYWGKFGEFCHGCS 
LLMTGPFMVAGEFKYHPECFACMSCKVIIEDGDAYAIiVQHATIiYCGK 
CHNEVVIjAPMFERLSTESVQEQLPYSVTLISMPATTEGRRGFSVSVE 
S ACSNYATTVQVKEVNRMH I S PNNRNAIHPGDR I LE INGTPVRTLRV 
EEVEDAISQTSQTLQLLIEHDPVSQRLDQLRLEAftliAPHMQNAGHPH 
ALSTLDTKENLEGTLRRRSLRRSNSISKSPGPSSPKEPLLFSRDISR 
S E S LRCS S S Y S QQ I FRPCDL I HGEVLGKGFFGQAI KVTHKATGKVMV 
MKELIRCDEETQKTFLTEVKVMRSLDHPNVLKFIGVIjYKDKKIiNIjLT 
EYI EGGTLKDFLRSMDPFPWQQKVRFAKGI ASGMAYL»HSMCI IHRDL 
NSHNCL I KLDKTVWADFGLSRL I VEERKRAPMEKATTKKRTLRKND 
RKKRYTWGNPYWMAPEMLNGKS YDETVDI FS FGI VLCEI IGQVYAD 
PDCLPRTLDFGLNVKLFWEKFVPTDCPPAFFPLAAICCRLEPESRPA 
FSKLEDSFEALSIiYLGELGIPLPAELEELDHTVSMQYGLiTRDSPP 

7637 

A 

3 

295 

LWPSRPTTDPLQPRPADPGLRRLPPD/SPSPSRPARPTLCPFSPAAR 
FPVAPSSALASSEDLQPPSPSSSGSGLPGQAPPCYAPTYFPPGEEPP 
PYAP 

7638 

A 

517 

1038 

N KR I YNCMM I S S RR VI S L S P I VLLQ I FD CPRLKFS E I PQRLTALLLP 
PDPIVINHVl\SEV7^PACFSDQGMISFLPSLPLPIiLPLPSLtiPPCWI 
CPSPGAJjOPVWoHKoxixi.t3J-LlJr Iroo VJUIrolJs^iSJS.XMv, a UXLJ VD vdCf ajjv 
GQMSSFIiLSTANQQEISALDSKVGPKPRAGVGH 

7639 

A 

154 

1594 

LSRCRTLPSMTPGLQHPPTWQRPGMPSGARMPHQGRPWAPRAPRTW 
AAPPCDPAWPPRDGARPQASSAPARQSQAQSQGQPEPTAPARSRSAK 
RRKMADKILPQRIRELVPESQAYMDLLAFERKLDQTIMRKGVDIQEA 
LKRPMKQKRKLRLY I SNTFNPAKSDAEDSDGS I AS WELRVEGKLLDD 
P S KQKRKFS S FFKS LVI ELDKDL YG PDN \ HL VEWHRTPTTQETDGFQ 
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VK\RPGDLSVRC\TLLLiMLDYQPPQFKIiDPRLARLLGLHTQSRSAIV 
QAIiW\QYVKTNRLQDSHDKE\YTMGDKYFPQNF\DC\PRLKFT*IPQ 
RVTAVL\MPPDPIVINHVIS\VDPSD\QKKT\ACYDIDVEVEEPIK/ 
GGQMS S FLLSTA/NNQQE I S ALDSKIHETIES IN\ QLKIQ\RDF\ML 
SFLRRPHKLCARTWLRSQEPGTSR* LTDVAGNPEEERRA\ EFYHQ\ P 
WSQEAVSRYFY\CKIQASAGKEAW 

7640 

A 

256 

427 

HPICPQTTKAFLPRMLELQNGHIVCLNSVIALSAIPGA\IDYCTSEA 
ADFGFMESLT 

7641 

A 

1 

985 

RIARPGEFRPLPGSAARRMAWKRLGALVMFPLQMIYLWKAAVGLVIi 
PAKLRDLSRENVLITGGGRGIGRQLAREFAERGARKIVIi\WGRTEKC 
LKETTEEIRQMGTECHY\FICDVGNRD\EVYQTAKAAREKVGDITI1p 
VK\NAAVV/HMGKSLMD\SDDDAFLKAQ\NINTLG\QFWNHQGPFSG 
KLCWKQQNGPH/LVCLQLPLLGIVLAI/LPGAIDY\CTSKASAFAFM 
ESLDPG/V1AGTCPGV\SATTVLALSTTSTEMFQGH*EFRF\PNIjFP 
PLK\ PETVARK/ TQVE AVQLNQ ALLLIj PWTMHAL V I LKS ILPQAALE 
EIHKILKETYNLE 

7642 

A 

1 

376 

kqaasnqldslmglg/dshpveergiilcefcgvqlfydvlvhyqdq 
wdlrpgtaffrvtegiprlhsqXaqetwpel/plrrvrrqenlssgy 
lddtlletangptfclparpf\nnmtatynqlsrs 

7643 

A 

1 

2753 

mkefsstaqgntevihtgtlqrherhhigdfcfqemekdihdfefqw 

KEDERNSHEAPMTEIKQIiTGSTNRHDQRHAGNKPIKDQLGSSFHSHL 

pelhmfqtegkignqveksinsaslvstsqriscrpkthisknygnn 
flnsslltqkqevhmreksfqc^sgkafnyssviirkhqiihiigakq 
ykctvcgkvfnqkryiachrrchtgkkpykcndcgktfsqeltltch 
hrlhtgekhykcsecgktfsrnsalvihkaihtgeksykcnecgktf 
sqtsylvyhrrlhtgekpykceecdkafsfksnlerhrkihtgekpy 
kcnecsrtfsrkssltrhrrlhtgekpykcndcgktfsqmsslvyhr 
rlhtgekpykceecdeafsfksnlerhrrihtgekpykc1?dc\gkaf 
tshshlirhqrihtgqksykceecdeafsfksnlerhriihtgekly 
kcnecgktfsrkssltrhcrlhtgekpyqcnecgkafrgqsaliyhq 
aihgigklykcndchqvfsnattianhwrihneersykcnrcgkffr 
hrsyixavhwrthsgekpykceecdeafsfksnliqrhrrihtgekpyr 

CNECGKTFSRKSYLTCHRRLHTGEKPYKCNECGKTFGRNSAIjIIHKA 
IHTGEKPYKCNECGKAFSQKSSLTCHLRLHTGEKPYKCEECDKVFSR 
KSSLEKHRRIHTW/REKPYKCKVCDKAFGRDSHLGT\HTRIHTGEKP 
YKCNECGKNFRHNSALVIHKAIHSGEKPYKCNECGKTFRHNSALEIH 
KAIHTGEKPYKCSECGKVFNRKANLSRHHRLHTGEKPYKCNKCGKVF 
NQQAHLACHHRIHTGEKPYKCNECGKTFRHNSVLVIHKTIHTGEKPY 
KCNECGKVFNRKAKLARHHRIHTGKKH 

7644 

A 

1 

150 

DSFG7VSRFLRIHAE\TYFTLRAYFAESRCIEALDELASLQVTMHQAQ 
KHT 

7645 

A 

81 

331 

KGARFLLLILYLLDAFIiF\NLDLFFLEYPSYFIiLCFCSIiLLFFFCHC 
FSLYFFVCRPSSSLFSSSILLILLFFLVRFLLFCLFL 

7646 

A 

701 

1116 

ATRHSMIiSCI I YTYI YIKHTHTHVFI YI YTYTYIHTHI YTYTYICIH 
THIHIYIYIYIHTHFFFFFFFFETESRSVTQAGVQWHDLGSLQAASW 
GHA/ D S PASTS QAAGTTG AHHHAQ/ L I FFFFVFLVETGPHRAS QDS 

7647 

A 

1 

207 

RSVFRERGKLSAG/SLVERLYRVRRFDRFERILKMARKAVETHVLRS 
PHLVSDYRDS I IPSESLLALPC 

7648 

C 

244 

432 

MFFTNIKMIAYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

7649 

A 

898 

1145 

I FQLLLMEG P AN 1 1 GNTG S SNTGRSNTGPSNQS S YL I S FLFLFCF FE 
TESRSGRPGLGVQWAWISGSL\QALPPGSTPFSCIiS 

7650 

A 

1 

476 

ATRELVTSSGLRRKTGLWARPREKPTAWHLASLSRPDELVRNPQEPQ 
PGPREEGSLGHVARTPAPGGGGQRHLSRAGHSRPGRRFKATLWL\SB 
EHPLS LGDQVTP I IDLMAI SNAHFAKLRDF I TLRLPPG \ FPVKI E I P 
LFHVLNARIHLSGNLCGC 
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7651 

A 

1 

1071 

MTSAKTEIVLGKRDLLEEVIDILPAPVQAFIiYETACQERFCAEIiCDA 
LRERHDSGE I LS YLQAHQVFLVP VTPAI* I FAI TVAT IGS FQFGYNTG 
VINAPETIIKEFINKTLTDKANAPPSEVLLTNLWSLSVAIFSVGGMI 
GSFSVGLFViraFGRRNSMLIVNLLAATGGCLMGLCKIAESV 
RLVIGLFCGLCTGFVPMYIGEISPTALRGAFGTLNQLGIVIGILVAQ 
IFGLELILGSEELWPVL.LGFTILPAILQSAALPCCPESPRFL.LINRK 
KEENATRILQRLWGTQDVSQDIQJb»Jnl^ 1 v uajjr kvo 
S YRQPI IIS I VIiQLSQQLSGINAVS VIL 

7652 

A 

2 

240 

SCSIGSr QFG/ Y I irijVXWAJrlil i. J. JSJvr JLJNJV/ obl/iA/iWnr irorj v 1 
NLWSLSVAIFSRRGYESASLFPSGLFVN\RFGQVLT 

7653 

A 

2 

391 

QTGRGMWCSLVL.FLLILPATPRKQSEPKTTDAAAPGTADTAGPGTMD 

TA VPRKAETA V PCj.KKj>J lAVro lAJJirAolrlj 1AU 1 /ULrTUi A 1 i. ir i. v uno 

WPSLTQST\KAPSPRLLPAPSPGSCSATSTSCSRSP 

7654 

A 

1 

200 

SSGHAEPGSGGPGFLSPHCYEEVCGFFPSLSSFSPSPPECSPQIiVPA 

n 71 mrn* t "Pits 7\ D TTJT UT TV V 

7655 

A 

3 

416 

CLFPVQLLLDHGADPNQRDGLGNTPLHLAACTNHVPVITTLLRGG\S 

nT?T?nrr pcrcDCDDrPCDHT \Tt3Tvr , ZiO\7T^TiT.TYP AfZ'PTPT.'HT.AESTCLiNT 

LLEGHAQCLEAVRLEEKHI IHMLKEHLELLGRHYQLINAWMTLD 

7656 

A 

1 

397 

KVQSRCS S K\ ENILRASHS AVDI TKVARRP \ RMAS FALTSMDKAF I T 
VXiEMTPVIjGTEI XNYRY GMAHOljVy i VVoUUlvr or i-ir o-Ujj x jc i r 
SFHFSLSLFIYLSLRSFFKLRESFFCFSSIFLSSTFALS 

7657 

A 

210 

441 

LNPILYSPGRDHRVQNFHljPRIjCIQlvJ-ir FJv\i>l\A-r 1 x liijlr/uiKivxvijv 
QLEPLTDDELEPDFVRQVTESMSYIFSHSMI 

7658 

A 

8 

399 

RVGVAGPAPWWLRVSLGGR\DFHTPIS\VTKVAERGKAEDADLRPGD 
1 1 VAINGESAEGMLHAEAQSKIRQS PS PLRLQLDRSQATS PGQTNGD 
SSLEVLATRFQGSVRTYTESQSSIjRobxbbFifaJj^toKA 

7659 

A 

2 

260 

WEPPWPGPVLGLPLPPLPPPSSPSSLPIPPPPRGCSLGASSA/WTAW 
VSSPKEPVRPQEPPPGGEGSNENHRRKWHPPF S VMLiKbU i 

7660 

A 

2 

442 

PRVRVYYFRQGHEAYVEMARKNKI YS INPKKQPWHKMELREQELMKI 
VGIKYEVGLPTLCCLKLiAF LDPDTLyKljlXjVjbr IMr^i rLUWiruv xur J_>v 
LRQQFDDAKYRRWNIGKGE/CSDRFRSVIDDAWWFGTIESQEPLQLE 

YPDSL»r Q 

7661 

A 

965 

1232 

TGVQWPDLGSLQPLPPEFKQFSCLSLLRGWDCRRPLPQIxADFCIFSR 
DGG YCYVGQAGLKLLTSN/ DLPAS ASQSAG I TGVSLCTQPKS S 

7662 

B 

111 

374 

MVAAATG S E I LLWALQAEGGGS E I GDSGNW I E IAYGTS LGGVRVT VQ 
HPEWGSGPQLLQTFTVHRSPVTKIMLWEKHIilSVCADNNH* 

7663 

A 

387 

545 

RQDGQGSVPSEAAEPAQPSGASAARRTASPCPTEQHDAPQPWQCRAA 
GPNAG 

7664 

B 

181 

4716 

MGRPAPAVPRPARPATPPAWTAALPAGRPRGDPGFRAFLCPIilCHNG 
GVCVKPDRCLCPPDFAGKFCQLHSSGARPPAPAIPGLTRSVYTMPLA 
NHRDDEHGVASMVSVHVEHPQEASVVVHQVERVSGPWEEADAEAVAR 
AEAAARAEAAAPYTVLAQSAPREDGYSDASGFGYCFRELiRGGECASP 
LPGLRTQEVCCRGAGLAWGVHDCQLCSERLGNSERVSAPDGPCPTGF 
ERVNGSCEDVDECATGGRCQHGECANTRGGYTCVCPDGFLLDSSRSS 
CISQHVISEAKGPCFRVLRDGGCSLPILRNITKQICCCSRVGKAWGR 
GCQLCPPFGSEGFREICPAGPGYHYSASDLRYNTRPLGQEPPRVSLS 
QPRTLPATSRPSAGFLPTHRLEPRPEPRPDPRPGPEFPLPSIPAWTG 
PEIPESGPSSGMCQRNPQVCGPGRCISRPSGYTCACDSGFRLSPQGT 
RCIDVDECRRVPPPCAPGRCENSPGSFRCVCGPGFRAGPRAAECLDV 
rvTrmro VPPPmT/?R C FNTPGS FLiCVCPAGYOAAPHGAS CODVDECTQ 
SPGLCGRGACKNIiPGSFRCVCPAGFRGSACEEDVDECAQEPPPCGPG 
RCDNTAGSFHCACPAGFRSRGPGAPCQDVDECARSPPPCTYGRCENT 
EGSFQCVCPMGFQPNAAGSECEDVDECENHLACPGQECVNSPGSFQC 
RACPSGHHLHRGRCTDVDECSSGAPPCGPHGHCTNTEGSFRCSCAPG 
YRAPSGRPGPCADVNECLEGDFCFPHGECLNTDGSFACTCAPGYRPG 
PRGASCLDVDECSEEDLCQSGICTNTDGSFECICPPGHRAGPDLASC 
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LDVDECRERGPAIiCGSQRCENSPGSYRCVRDCDPGYHAGPEGTCDDV 
DECQEYGPEICGAQRCENTPGSYRCTPACDPGYQPTPGGGCQDVDEC 
RNRSFCGAHAVCQNIiPGSFQCLCDQGYEGARDGRHCVDVNECETLQG 
VCGAALCENVEGSFLCVCPNSPEEFDPMTGRCVPPRTSAGMFPGSQP 
QAPASPVLPARPPPPPI.PRRPSTPRQGPVGSGRRECYFDTAAPDACD 
NILARNVTWQECCCTVGEGWGSGCRIQQCPGTETAEYQSLCPHGRGY 
LAPSGDLSLRRDVDECQLFRDQVCKSGVCVNTAPGYSCYCSNGYYYH 
TQRLECIDNDECADEEPACEGGRCVNTVGSYHCTCEPPLVLDGSQRR 
CVSNESQSLDDNliGVCWQEVGADLVCSHPRLDRQATYTECCCIiYGEA 
WGMDCALCPAQD SDDFEALCNVLRPPAYS PPRPGG FGL PYE YG PDLG 
PPYQGLPYGPELYPPPAliPYDPYPPPPGPFARREAPYGAPRFDMPDF 
EDDGGPYGESEAPAPPGPGTRWPYRSRDTRRSFPEPEEPPEGGSYAG 
S LAE P YEELEAEECG I LDG CTNGRCVR VPEG FTCR C FDGYRLDMTRM 
ACVD I NECDEAEAAS PLCVNARCLNTDG S FRC I CRPG FAPTHQPHHC 
APARPRA* 

7665 

A 

209 

1288 

EE KNERRDRDTERKGEARTS KTS QKRETKETTGGKTKQKRKKEKTTD 
KKTKRKKRTSKTEENKKRKQEKEDRERDESTKRRKEGERERRGNARR 
RKKGGEHSKKRGERRNNSKGEGEEEERGRKTKAKTRRTRQATNTDSK 
NEKGRKGKGQTRARRPAETGRRRRKRKRQRKAGETTPGETGRWCASV 
RSVDGSPTTAPTVIjECEGSRRLGS\RPRRYLIiNCQANGSIiAMWDIjTT 
A\ MDGFGQ AP AGGLTEQELMEQLEHCELAP P / R/ CAS S S LNGAVS P S 
PSPRISLT/NASTQASSNTSFVWPTVGSPKPPARLKARRRGWGEALW 
KAARNWCGVGQTSDGHPHQAPWPSKRSRHSPHTFQE 

7666 

A 

3 

1704 

GGRRAAALG V A I G AAGAARAAAWTG P ARRAS PRALH PGRVRPALRP W 
ADRLPLYPDLRAPQRHQRGQRRSRRGGPGGTRLPRIiVPSEAAEPAQP 
SGASAARRTAQPLSDGATRCPPTLAMHGCWAECWMRKPIjPHPQVRFY 
IiVEGQPEEPGMVRLVCGHHNWIAVAYTQFLVCYRLKEASGWQLVFSS 
PRLDW P I ERliALH \ ARVHGGALGEHDKMVAAATG SEIL L WALQAE GG 
GS EIGVFHLGVPVEALFFVGNQL IATSHTGRIGVWNAVTKHWQVQEV 
QPITSYDAAGSFLLLGCNNGSIYYVDVQKFPLRMKDNDLLVSELYRD 
P7VEDGVTALSVYLTPKTSDSGNWIEIAYGTSSGGVRVIVQHPETVGS 
GPQLFQTFTVHR\SPVTKIMLSEKHIiISVCADNNHVRTWSVTRFR\G 
MISTQPGSTPIiASFKILALESADGHGGCSAGNDIGPYGERDDQQVFI 
QKVVPSASQLFVRLSSTGQRVCSVRSVDGSPTTAFTVLECEGSRRLG 
SRPRRYLLTGQANGSLAMWDLTTAMDGLGR/SPAGGIiTEQELMEQLE 
HCELAP 

7667 

A 

8 

567 

TLVTVGNSQGREGLAFIjVS CPEGRAD CASES ACS SDSLDEARSSGSE 
GTPADTGDLSPGHGASAPSVSREARQTVPPLTVRLHTQSVSECITED 
GRTVAVGDIVWGHRR*GRTCWPGASDLCSGTL*AFWPAACRAHWARW 
SAWCEVPRGPAVCSGRGWT*AQRLiLPLSVFVPTTQRMAALAHEAV 

7668 

A 

2 

610 

SPEI LVDPRLRPRVRIiAFLVSCLVGRADCAS ES ACS SDSLDEARS SG 
SEGTPADTGDLSPGHGASAPSVSREARQTVPPLTVRLHTQSVSECIT 
EDGRTVAVGD I VWGKIHGFPWWPARVLD I SL/ WPEGGRRAVLAR\RK 
VSWI G SPTTS FLS I SKLS PFSEFFKLRFNRKKKGMYRKAITEAANAA 
RHVAPEIRELLTQFET 

7669 

A 

3 

392 

ADAAVPTRCVTSGHSLPISGHAAESELFHR/ p/ agi ilpsrrnvlpi 
RG AS STRRAR PLERPATPVP VAPS S RAARS RH I D PLLQ S S S PTP PG P 
TGSAAYVAVPTPNSSS PGPAGRPAGAMLYHSWVSE I TE 

7670 

A 

5 

395 

GFRNPPRDSHHPHSTRRRRAFLRRWEKDPGRGKGRAPE PAQQRDAS P 
PTRRPLEGP / PREHLRQWERLP FPKS PWKE VFAFVS ANKACSALS GP 
VAPVETAAEPSPPS/PRVRGRSGPRTKGARPRNRCCDD 

7671 

A 

2 

399 

ARTNTGGRPYCFIRM\ SNLIHRTQQGYNNPRPLSTNI I LSDIQTETK 
LRPPYQISMCELGSAKGVTSAFSADCKGAEKISSGHQRLEPATLSGI 
A\GFILSLLWGALNIjIPGFHAIQRLLQREGDDFNSFIAXjF 

7672 

C 

151 

339 

MSTFPIHISPKLRGDDVRGRXLXWRWPLRGFCATVRWCPPGAAGWCA 
XPVRAES AFXRPAKS * 
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7673 

A 

19 

444 

SVRWNSCRAEAPQWPAPAAAPAAHSPHIiSLGESGLGKLlLINSLFLT 
DIiYSPEYPGPSQRIKKPVQVYILVFLIDDKLE*Y*YTQSTCCNFHyA 
S\QSWQPAINYIDSKFEDYLNAESRVNRCQMPGNRVQGCIiYFIAPSG 
H 

7674 

A 

152 

532 

QALRANRPAPVAEGATDRTVAQKLLNGQRQLQPRPPTRLI * AWVEQS 
KVLIK\DGGVQLLLTIVDTPRFGDAVPNSNCWQPVINYIDIKFEDYL 
NAESRVNRCQMPGNRVQ* *LYFIAPSGHGPLHN 

7675 

A 

1 

628 

GSARSAAAEERSWSSTMVAQQKNLEGYVGFANLPNQVYRKSVKRGF j 
EFTLMWGESGLGKSTLINSLFLTDLYSPEYPGPSHRIKKTVQVEQC 
GV\LIKEGGVHLLLTIVDTPGF\GDAVDN\SNC\WQPV\IDYIDSKF 
EDYLNAESRVTRRQMPD\NRVQCCLYFIAPSGHGIi\KPLDIEFMKRIi 
HEKVNI I PL IAKADTLTPEECQQFK 

7676 

A 

2 

373 

VLELEERALLRE ARLGRARS S GGMQATPATEGLARPQAPS S SAFRCP 
YCKGKFRTSAERERHLHILHRPWKCGLCSFGSSQEEELLHH\TCQVC 
GQSFTQS WFLKGHMRKHKAS FDHACPVCGR 

7677 

A 

44 

985 

RGSMEGSRPRAPSGHLAPSPPAFDGELDLQRYSNGPAVSAGSLGMGA 
VSWSESRAGERRFPCPVCGKRFRFNSILALHLRAHPGAQAFQCPHCG 
HRAAQRALLRSHLRTHQPERPRSPAARLLLELEERALLREAPTGESP 
KLRGHAGHPLPLMVW / PRPQAPS S SAFRC P YCKGKFRT\ RLERERHL 
HILHRPWKCGLCSFGSSQEEELLHHSLTAHGAPERPIiAATSVAPPPQ 
PQPQPPPQPEPRSVPQPEPEPEPEREATPTPAPAAPEEPPAPPEFRC 
QVCGQSFTQSWFLKG\HMRKHKAS FDHACPVCGR 

7678 

A 

1 

583 

GNAGLSSLTEGVLDLFAVKAVYMGHPGIDIHTVCVQNKLGSMFLSET 
GVTLLYGLQTT\DNRLLHFVAPKHTAKMLFSGLLELTRAVRKMRKFP 
DQRQQWLRKQYVSLYQEDGRYEGPTLAHAVELFGGRRWSARNPSPGT 
S AKNAE KPNMQRNNTLG I STTKKKKKI LMRGES GEVTDDEMATRKAK 
MHKECR 

7679 

A 

2 

429 

SVGSESDSSKE\GPSMTRSGPLRRSRAVPG/PEESQGNTLVIRWIH 
DLQQTKC I RFNPD ATVWVAKQR I LCTLiTQ S LKDVLNYGLFQPASNGR 
DGKFLDEERLLREYPQPVGEGVPSLEFRYKKRVYKQASLDEKQLAKL 

HTK 

7680 

A 

1 

248 

VIFLVNSGSEANELAMIiMARAHSiraiDIISF/RTMCPDVFRGPWGGS 
HCRDS PVQT I RKCS CAPG PTGQGGCQWS VG S WKGHE 

7681 

A 

1 

501 

MTL I WRHLLRPLCLVTS APR I LEMHPFL S RKWKPGGRG S VEDQREEH 
AWDRGDLHREGCERSCPERMIPNSNGEGPQQRAADCGPMCEEKTTTL 
ESPSWQLQHREAEEKNFSTDQTLSLRRPNPNTASAGLHGRQCLKRTS 
RTSVTKLSLHTKPRMPPCDFMPERYQ 

7682 

C 

60 

380 

MELPGGTGCQPTMCPDVFRGPWGGSHCRDSPVQTIRKCSCAPDCCQA 
KDQYIEQFKDTLSTSVAKSIAGFFRRTYSRCEWSCPVPKGVFKGSLL 
SLVRARGRPCALQ 

7683 

A 

3 

649 

LSPSYFHSISFPHSLFLPLEGTVTRNRELWGLLQAMGVHHSRIPLLA 
PPPHLNIiKVQQDSEQLEPGLQSASMGVGFLSGRNDSNLETFAETLVP 
GSIPLPGILEMHPFLSLGTSRTSVTKLSLHIKPRMPPCDFMPERYQV 
IFLVNSGSEA\NE\liAMIiMARAHSNNIDI\ISFRGAY\HGCSPYTLG 
LTNVGTYKM\ KLPGGTGCQPVSLE I FLLFL 

7684 

A 

27 

211 

FHIVQYFNFFVCLFVCLFETESTLSPRLEGSGVISAHCNL\TLPPGF 
NNSCASASRVAGTTQ 

7685 

A 

1 

312 

RPRRRGDHREVKGGRGGKGGEGRPQGGQRWAVGKGGEGRPQGGQRWA 
VGKGGEGRPQGGQRWAEGKGGEGRPQGGQR\GQWGKGGEGRPQGGER 
\GPGGKGGEG 

7686 

A 

2 

320 

KYVFTD I S YS I PHRERFI WREPSGTLRKAS WEERDRM IQVYFPKEG 
RKILTPIIFKEEN1>RFEPDSTEYIKVHHKTYEDIGKRGK\YDLLRST 
RYFGWEWVWVFC 

7687 

A 

22 

1071 

GVLRAWNGSVSGLESSPGTVACSNRYLALSRWGCHSRRINSYGAESG 
SPETKKPTFMDEEVQ\RVLTKMTGLNLQKTFKPAIQE1»KPPTYKIjMT 
QAQLEEATRQAVEAAKWLKMPPVLEERVPINDVLAGDKDFWKGTET 
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TKYVFT\DISYClPHRERFl\WREPSGTLRKASL\EDRDRMIQVyF 
PKEGRKILPPI I FKEENLRTMYSQDRHVDVLNLCFAQFBPDSTEYIK 
VHHKT YED I DKRGKYDLLR S TR YFG\ EMVWYFVNNKKIDGLL IDQ I Q 
RDL IDDATNIiVQIiYHVLHPDGQS AQGAKDQAAEG INLI KVFAKTEAQ 
KGAYIELTLQTYQEALSRHSAAS 

7688 

■ 

A 

2 

383 

FSAPAASEPLPFSGPPPSEPLPFSGPPPSDPLAFAAPPPSEPLTFSG 
PPPPSEPLTFSGPPPSDPLCFTAPPPSEPVPFSGPAPSDRLPFFGPT 
TSEPPNR/PGMPTQPHPICLSKNTPLPEPTRLHP • 

7689 

A 

3 

2485 

PAIKIRRRRVRDLQDPPPQMAPEIQPPSHHFSPEQRALLYEDTLYTV 
LHRLGHPEPNHVTEASELLRYLQEAFHVEPEEHQQTLQRVRELEKPI 
FCLKATVKQAKGILGKDVSGFSDPYCLLGIEQGVGVPGGSPGSRHRQ 
KAWRHTIPEEETHRTQVITQTLNPVWDETFILEFEDITNASFHIiDM 
WDLDTVESVRQKLGELTDIiHGLRRIFKEARKDKGQDDFLGNVVLRLQ 
DLRCREDQWYPLEPRT\ ES YPDRGQGHLQGQLVHKRRATS ASR\ SQP 
S YTVHLHLLQQLVSHEVTQHEAGSTS / WGRVAES \ QAATVLFLHATQ 
KDLSDFHQSMAQWVAYSRLYQSLEFPSSCLLHPITSIEYQWIQGRLK 
AEQQEELAASFSSLLTYGLSIiIRRFRSVFPLSVSDSPARIiQSLLRVL 
VQMCKMKAFGELCPNTAPLPQLVTEALQTGTTEWFHLKQQHHQPMVQ 
GI PEAGKALLGLVQDVIGDLHQCQRTWDKI FH\NELI S STLKIHLFS 
MAFRELQWLVAKRVQDHTTWGDWS PEMGE S LFQL Y I S LKELCQLR 
MSSSERDGVLALDNFHRWFQPAIPSWIiQKTYNEAIARVQRAVQMDEL 
VP\LGELTKHSTSAVDIjSTGFAQISHTARQLDWPDPEEAFMITVKFV 
EDTCRLALVY \ C Sit I K\ ARARELF FRARRTKARAANMLCSTVVNDMEQ 
LRLVI \GKLPAQIAWEALEQRVGAVLEQGQLQNTLHAQIiS \GAJjAC3lj 
GHEI\RTG\VRTIAE\QLEVGIAKHIQIOiVGVRESVLPEDAILPLMK 
fle velc ymntnlvqenf s s lltllwthtltvxveaa 

7690 

A 

31 

511 

GDHASDKETITIVNIYAPNIRACSKYIKQTLTELKGEVDSNVITAIS 
ILRFQQLPTLIKTESLRNRNSSSSSSSSS/PSSSSSSSSLKKSLPTK 
ESIGPDGFTDEFYQTFKEELiK/ PKSSSbbobbobboboboooooaso 
SSSSSPEKDATREENYRPISL 

7691 

A 

2 

189 

TYHIDRIKDKSHMIISIATEKAFDKIEHPFMIKALSQLEWG\MTNV 
I KAI YKKSMAN I TQQ 

7692 

A 

3 

411 

NTANRAARVSASIEDSPDNSETIiPPDESFRNGENAKRFAIDIGGTLT 
KLAYYSTVQHKVAKGRAFDHSGKDTER\DTSRPMRVQFKKRAI#LDGT 
S LRRRKPTPKPAWTS S KT I S S TQRP / EVI QATGGGAYKVKDLI 

7693 

A 

88 

482 

KEELYGDFEDLET/G/DVHKGKSGPDTQNEDIEK\QDEIDPPBKKSA 

KKKHLDKKRKLKEMLDAEYDEGESTYFDDLK^ 

YKKILKSQDPIIFSVGWRRVHTILLYYIEDHNGRQRLLKY 

7694 

A 

1 

1068 

MAANRKDFIHRRCGPELSQSEAVPQGLCRAERSSALAMLSGSVPRRT 
QLRPCKGTQRRCRRGGEADSGGEAVGGGAAERRSAAARRRGGRRRGG 
GEAVGGGAAERRSAAARRRRGGRRRRGGGEAGGGGAAEARRAAAARR 
AAAARRAAAARRAAAARRAAAARRIAESGGEAHSRGEAHSGGEAHSG 
GAGAERRRPRLHSPALKTHPGQSSEGHVEYKLNHVEFEDQDDEARVQ 
YEGFRPGMYVRVEIENVPCEFVQNFDPLYPIILGGLGNSEGNVGHVQ 
MRLKKHLWYKKI \ LKSQDPFI S SVGGRRFQTI P\ LYYI EDHNGRQRL 
LKYTPQHMHCGAA\ FWGKI * LQ * LAYCRD 

7695 

A 

66 

844 

KEELYGDFEDLETGDVHKGKSGPDTQNEDVEK\KEEIDPDEEESAKK 
KHLDKJO^KLKEMFDVEYDEGESTYFDDLKGEMQKQAQITAYRFSPPT 
P S CT\ LNHAEFEDQDDEARVQ YEGFRPGMYVCVE I ENVP CEFV* NFD 
PRY P 1 1 IX3GLGNS EGNVGYVQVGPFAAYLVPEAIjW I S P P S I ILPSHP 
PQMRLKKHRWYKKILKSRDP 1 1 FS VGWRRFQTI PLCYI EDHNGRQRL 
LKYTPQHVHRG/AAFWDKICLQ*IiAYC 

7696 

A 

1 

1177 

LRMRLGTRSALWACRGWGQQHMHAWLTEQPLGVLFPEELYGDFEDLE 
TGDVHKGKSGPDTQNEDIEKEVKEEIDPD\EEESAKKKHIiDKKRKLK 
EMFDAEYDEGESTYFDDLKGEMQK\KAQLN\HVEFEDQDDEARVQYE 
GFRPGMYVRIEIENVPCEFWQNFDPHYPIILGGLGNSEGNVGYVQM 
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RLKKHRWYKKI LKSRDPI I FS VGWRRFHTIPLYYI EDHNGRQRLLKY 
TPQHMHCGAAFWGP I TPQGTGFLAI QS V\ SG IM PD FRVAATG WLDL 
DKSIKIVKKLKLTGFPYKIFKNTSFIKGMFNSALEVAKFEGAVIRTV 
SGI RG\ Q I KKALRAPEG VFRAS FEDKLLM SD I VLMRLS TCS I PA/ FY 
/NPVTSC*TIGEKHLVRMRPRP 

7697 

A 

163 

371 

KGTTIO^WKLSDFKSYFKAIVIKTVRN/WHKYK\HIDQWNRIQNPEI 
NPHI YGQM I FKKRAKNKQWKKDGI* 

7698 

A 

270 

1479 

PGCAFSSSGTSRQLAPVLGYLGSRQKHSLPDLPYDYGALEPHINAQI 
MQLHHSKHHAAYVNNLNVTEEKYQBAIJ^ 

GDVTAQIALQPALKFNGGGHINHSIFWTNLSPNGGGEPKGWIYCAPL 

STTSCTVGIDVISWAL*QK1F*IHVI*HFTVIIGNLFICGWFWDFFF 

NR\ELLEAIKRDFGSFDKFKEKLTAASVGVQGSGWGWLGFNKQRGHL ■ 

QIAACPNQDPLQGTTGYI * IGLSWPVWKTSPLLKNVDSKVKGRKTV 

YQGDGHYVNHHPNQVFWGRGAVKALNRNI»KGAPDLELDGESRANCGE 

KGREESERSGR*AWQLCSVTGAPFFLTGLIPLLGIDWEHAYYLQYK 

NVRPDYLKAIWNVINWENVTERYMACKK 

7699 

A 

94 

330 

KSRS YKKMCLHFNFSGDVTAQI ALQPALKFNGGGH\ INHS I FWTNLS 
PNGGGEPKGWYILVHPYLHFVHSRNRLLIYWR 

7700 

A 

33 

807 i 

GPRAAQERHSWLWWLRQRLQQIGGISGSTSTSSMLSRAVCGTSRQLA 
PVLGYLGSRQKHSL\PDLPYDYGALEPHINAHIMHLHHSKHHAAYVN 
NLNVTEEKYHEALA\ RGDVTAQI ALQPAlA KFNGGGH INHS I FWTNIi 
SPNGGGEPKG/EELLEAIK\R\DFGSFDKFK\EKLTAASVGVQGSGW 
GWLGF\NKERGHLQI AACPNQDPLQGTTG\ LI PLLGIDVWEHAYYLQ 
YKNVRPDYLKAIWNVINWENVTERYMACKK 

7701 

A 

2 

436 

SGRGASLRPGVRPPSVSWEPYSGPGSTPPPPSSRSKDVGKPAQSWVR 
GGSTQPNTGQLPKEGLSTPHPVPRAGIPSSRQALLSSGNPGTRAPSA 
PALRPEAQASPAEFPGAP/SPELGGERLGGARFQEPPPQPAASFPGA 
AATQA 

7702 

A 

1 

389 

PGSSRQRCWHQPCFACQACGQALINI.IYFYHDGQLYCGRHHAELLRP 
RCPACDQL I F S WRCTEAEGQRWHENHFCCQDCAGPLGGGRYALPGGS 
PCCPSCFENRYSDAGSSWAGALEGDAFLGE/HWKVTHSL1GRI1DSTN 

7703 

A 

1 

1217 

MGGHLANVPTGTSLQQTGLMEKHFTSGKQIiLKTLQNN I PQTTERTHN 
GWHQAWLLVFSCVRSFYVAALFAVGLGCGIHWFCLAVFSVCAGHSTA 
WSLPGSQQHLSANMFVALHSYSAHGPDBLDLQKGEGVRVIiGKCQDG 
WLRGVSLVTGRVGIFPNNYVIPIFSTPSQLSGCREVGPTLVKPWHK 
MGVYGWEEGS ISEGD\HGKAVPSNPS / WPTAI VNPVRSTAGPGTLG 
QGSLRKGRSSMRKNGSLQRPLQSGIPTLWGSLRRSPT/MGPSASAV 
PILPATGDPLLPLSRGGGDGVQA/ SPSRGSPPSRASAGAVRPGSTPR 
PAPSLWKTKKSPSRVS/liLPKPPASAPPSILVKPENSRNGIEKQ/DQ 
NRE I S E IQALLPPNITLP YPTSGKPEQPSLPQCVPA 

7704 

A 

27 

211 

FHIVQYFNFFVCLFVCLFETESTIiSPRLEGSGVISAHCNIj\TLPPGF 
NNSCASASRVAGTTQ 

7705 

A 

27 

211 

FHIVQYFNFFVCLFVCLFETESTLSPRLEGSGVISAHCNIj\TIiPPGF 
NNSCASASRVAGTTQ 

7706 

A 

1 

514 

LSQSEKNNYYSSRFSPDSQYIDNRSVNSDRASQ/ARERAPRLNHPPE 
QIDSHSRLPHSAHPSGKPPSAPASAPQNVFSTTVSSGYNTKKIGKRIi 
N IQLKKGTEGLGFS I TS RD VT I GG S AP I YVKN I LPRGAAI QDGRLKA 
GDRLIEVNGV\DI*VGKSQEEWSLLRSTKMEG 

7707 

A 

65 

517 

DPRVRAAGILYEFTWNQFCTWYLELTGPV1WGGTEAKLRGTRHTLVT 
VLEGLLRLAHPIIPFITETIWQRVKVLCGITADTIMLQPFPQYDASQ 
VDEAALADTEWLKQAIVAVRNIRAEMNIAPGKPLELIiVAG\CTRNPN 
TSVNENRGFb 

7708 

A 

2 

507 

EPVKKIFNYIIHVFRNIMDFKLFLVFVAGVFLFFYARTLSQSPTFYY 
SSGTVLGVLMTLVFVLLLVKRFIPKYSTFRALMVGCWFASVYIVCQL 
MEDLKVOjRYDNRIYVLGYVLIVGFFSFXWCYKHGPLADDRSRSLLM 
WMLRLLSLVLVYAGVAVPQFAYAAI I L 
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7709 

A 

2 

419 

EPVKKIFNYMIHVNRNIMDFKLFLVFVAGVFLFFYARTLSQSPTFYY j 
S SGTVLGVLMTLVFVLLLVKRFI PKYSTFWALMVGCWFASVYI VCQL 
MBDLKWLWYENR I YVLS YVLiI VGFFS \ FWGTSMGPIiADDKSRNI* 

7710 

A 

14 

428 

RP IRMAVKQKI SMLCHVNPEQVI CIHDVS STYRVPVLLEEQS I VKYF 
KERLHLP I GDS ASNLLFKWRNMADRYERLQKI CS IAL\VGKYTKLRD 
CYASVFKALEHSALAINHKLNLMYIDSIDLEKITETEDPVKFHE 

7711 

A 

61 

204 

GRYAYVWHCPQGPGLETPL\VADTSGAYFRREGLGSNYLGGRSPTEV 

s 

7712 

C 

537 

710 

MCHHTQPIFVFWEMRFYYVGQAGLEVLTSGDPPVSAAQSTGITGLS 
1JRAQPSSVRF* 

7713 

A 

788 

896 

KHG\WIiGTVAHTCNPSTLGGQGGWIMRSGVQDQPGQ 

.7714 

A 

1 

453 

SPVPQLIVPTVLSVVLLEPFLGAPLPANRLWTHIiWAPRAWKQSPWQA 
p\GLGRLPSSLGIGSCGAPGAYTRPDIRLHDKACPAEHGGGRGPGEA 
ASRPLTAPSPGGCPERGLGTTPSPQSRTASSGGSYRSFTDPPPPPPI 
CHHRRF FLTL 

7715 

A 

48 

421 

DLQSSQGGQQPRVQSWSPPGRGIPQLPCAKAIiYNYEGKEPGDLKFSK 
GD 1 1 1 LRTQVDENWYHGEVNG I HGFFP \ TNFVQ 1 1 KPLPQPPPQCKA 
LYDFEGKDKEADKDCLPFAKDDVLTVIRRVD 

7716 

A 

2 

494 

LTLLNTITPSLARGNFYPLEGGRVIjLDGKPISAYDHKYIjHRVISLVS 
QEPVLFARSITDNISYGLPTVPFEMVVEAAQKANAHGFIMELQIX3YS 
TETGEKGAQLSGGQKQRVAMARALVRNPPVLILDEATNAIiDADSRDIj 
I QHALS \ GNLRKHRDFIjS P P WJ P 

7717 

A 

2 

721 

EGKVEEPENPAAKEKCEGKEEEEETDGSGKESKQECEAEASSVKNEL 
KGVEVGANTGSKSISEKGSEE\VKRKSWRMTNKSEESSQPEAGAVSR 
GKNFDEESNASMSTARDETRDGFYMEDGDPSVAQLLHERTFAFSFWP 
KDRVMINRLDNI CEAV\ LKGKW\ PVNRRQMFDFQGLI PGYTPTTVDS 
PLQKRSFAELSMVGQASISGSEDITTSPQLSKEDAI>NLSVPRQRRRR 
RRKIGN 

7718 

A 

111 

383 

DRGLAS FAQGLEVQWHNLS S LQPS \ PPVLRRF / S PCLGLPRGCD YRP 
V P PHQ AN FC I VS RDG F YHVGQAGLELLTS SD PPTS QS QS AG I TG V 

7719 

A 

1 

190 

ENKEEGQLPSSFYEASIILTLKPDKDSTKKS/SYRPT/SIiMNFDAK\ 
IFNKMFTNRIQQYIERIIH 

7720 

A 

1 

2499 

MAAAVAAPLAAGGEEAAATTSVPGS PGLPGRRS AERALEDAVATGTL 
NLSNRRLKHFPRGAARSYDLSDITQADLSRNRFPEVPEAACQLVSLE 
GLSLYHNCLRCLNPALGNLTAIiTYLNLSRNQLSLLPPYICQLPIiRVL 
IVSNNKLGALPPDIGTLGSLRQLDVSSNELQSLPSELCGIiSSIiRDLN 
VRRNQLSTLPEELGDLPLVRLDFSOTRVSRIPVSFCRLRHLQVILLD 
SNPLQS PPAQVCLKGKLH I FKYLSTEAGQRGS ALGDLAPSRPPS FS P 
CPAEDLFPGHRYDGGLDSGFHSVDSGSKRWSGNESTDEFSELSFRIS 
ELAREPRGPRERKEDGS ADGDPVQ IDF IDSHVPGEDEERGTVEBQRP 
PELSPGAGDRERAPNSRREEPAGEERRRPDTLQLWQERERRQQQQSG 
AWGAPRKDSLLKPGLRAWGGAAAVSTQAMHNGSPKSSASQAGGCSG 
AGSP\APAPASQEPLPIAGPATEPAPRPLGSIQRPNSFIjFRSSSQSG 
SGPSSPDSVLRPRRYPQVPDEKDLMTQLRQVLESRLQRPLPEDLAEA 
LASGVILCQI4ANQLRPRSVPFIHVPSPAVPKLSALKARKNVESFLBA 
CRKMGVPEADLCSPSDLLQGTARGLRTALEAVKRVGGKAIiPPLWPPS 
GLGG FWF YWLMLLL YVTYTRLLDPRS PQVAWEVAPS RMTPLAPWD 
PKYEAKAGPRPVWVSWGQTCGTGWGAQGAVRWPEAPVIiCPPHPRGPT 
VAQEPRSQAGRCVTPHSGRCMKQPRAGVSGPWPIiPQGTGMDSRRPQM 

QG S RWCAVKM S S S RTLCC PGGS V F PC 1 <~ FKF F t> K 

7721 

A 

1 

647 

RPPPVPQVPDKKDLMTQLRRVLESRIiQR\PLPEDLAEAIiASGVILCQ 
LiANQLRPR\ S VPFIHVPS PAVPKLSALKARKNVESFLEACRKMGVPE 
ESLCQPHHILiEEEGAPGRG\ lrli iaawqrpagpalgvkvgaqgpp 
RPGPREQKTRLGSALSEKHVVRVAWAWARU3LPRLAASGVEGLGGRT 
CGCRRCRVSRNKQKPEALWKRGCKI KKKL 

7722 

A 

3 

451 

SRSLVQRGGIPLGGWRMGVRRTGQVGPTMHPPPVSGASP/PPPPPPP 
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PPPPHHHPHLLRTSCVPSGLYAQSSSSSSSSSSSSSSSSSSSSSSSS 
S S SQLNKLPKFQVS S PNS PTLS S S FS FLPFFPAS SLTRQFFI ECLPG 
AGMSQVLGK 

7723 

A 

3 

461 

ADRPWHEPGAGCAADCEDPHRRPGACEPGAPPAARAAGLGRGTRHGN , 
GDILS FEDANRAMQTGVTG IM I ARGALLKPWLFTE I KEQRHWD ISSS 
ERLDILRDFTNYGLEHWGSDTQGVEKTRRFLLEWLSFLCRYVPVGLL 

ERXPQRINERP 


TV 
/\ 


834 

LDIRGKLYLAPLTTCGNLPFRRICKRFGADVTCGEMAVCTNIiLQGQM 
S EWALLKRHQCEDI FGVQLWSGGDAL I HLWG LMAV AGTVPRRQTWLL 
PGRPRERRPLAQGSSVGRQRPSGYSSDSPLTLLFCHYPPTAGGAFPD 
TMTKCAELL S RTVEVDFVD I NVG C P IDL VYKKGGGCALMNRS TKFQQ 
I VRGMNQPTHG I LSWERQAS FGFLSVTSHPPSGFAAVQVAVARVFLG 
CPCSLHSPAPCLCPRWRKDSFQEVIDAEYVLLEVNGQFSYSYS 

7725 

A 

2 

258 

RPPYYLGRDYLETLMASQKAADWIRI / SVPGWAGVGELQSGARGHGG 
YFAGPGPDLAFHLLSHSEMLliGPVPPSFAFLPKHKANAYK 

7 /<£0 


JL 

1034 

MSHQARPIITESSVGQGYSGDVSSICKVLEEFGVTLRTLPATGTAPP 
PTQLKS SEIJ\NKNPRHPVKFGFQIDHESRLGVIJ3VPLTVKI RTGVQE 
RVNIJ^HRLLPELRDWGVTUjVTIjHGRSREQRYTKLADWQYIEECVQT^ 
SPMPLFGNGDILSFEDANRAM/LDWCHRDHDCPMGGAESDVRSCSCV 
AAEGTGNCCCCGI**LWVESPGLWPVAPCSSPGLFTE\IKEQRHWDI 
S S SERLDI LRDFK/ RTTGLKHWGSEHAGA* KRPGRFLLEWLSFLCRY 
VPVGLLERLPQE\INERPPYYIX3RDYLETIiMASQKAADWIRISEMLL 
GPVPPS FAFLPKHKANAYK 

7727 

TV 

J. 

1 057 

JL \J -J f 

AQQVPAGPGTSTPPSSPVRTCGPLTDEDWRLRPCEKKRLDIRGKLY 
IJVPLTTG\GFLFFRRICKKFGADVTCGEMAVCTRLLQGQMSEWAI.LK 
RHQCED I FGVQLEGAFP \DTMTK\ VPELLSRTVEVDFVD I \NVGCP I 
DLVY KKGGG \ RALMNR S TKFQQ I WGMNQVLD VPLTVK\ VRTGVQG A 
CEPGR/NRLIARAAGTGAWHSFTIJiGPLSGSSANTKLADWQYIEECV 
QA / AAAPCPLFGNGD I L S FED / WPTAPMQNWCSTG I M I ARGALLK\ P 
GL FTE I KEQRHWDI SSSERLDI LRDFTNYGLEHWG \ SDTQGV\ EKTR 
RFL\ LEWLLIjPGAGYVPRGARLERIjPQR INERT 

7728 

A 

3 

369 

LLGQAVKYGV\NPGPYGGTT\RKLYEKKLLRLREQGPESRCSTPVPT 
ISSSAENARQNGSNDSGRYSVNEEDSKIELKLEKRDPLKGRAKTPVT 
RKQRRVELS ES YS QAGMDEGVWS S GS SKG 

7729 

A 

1 

1492 

ALMDEILMLQDBINELQSSIiTVEELVSESCEADPAEQIiAIiQSTLTVliA 

ERMSTIRMKASGKRQLLEElOJNrDQLEEQRQEQALQRYRCEADEIiDSW 

LLSTKATLDTALSPPKEPMDMEAQLMDCQNMLVEIEQKWAIiSELSV 

HNENLLLEGKAHTKDETVEQLAGKLRRLKGSLLELQRALHDKQIiN^ 

GTAQEKEESDVDLTATQS PG VQEWIiAQARTTWTQQRQS S LQQQKELE 

QELAEQKSLLRSVASRGEEILIQHSAAETSGDAGEKPDVLSQELGMB 

EEE\TSAEDQMRMKWESLHQEFSTKQKLLQNVLEQEQEQVLYSRPNR 

LLSGVPLYKGDVPTQDKSAVTSLLDGLNQAFEEVS SQSGGAKRQS IH 

LEQKLYDGVSATSTWLDDVEERLFVATALLPEETETCLFNQEILAKD 

IKEMSEEI^KNKNLFSQAFPENGDNRDVIEDTLGCLLGRLSLLDSVV 

NQRCHQMKERLQQILNFQVSKILSKDT 

7730 

A 

1073 

1233 

PYSGLGIFFFFLRHGLTLSPRLGCSELRLCHCTPAWVT\SKTVSKNK 
KQKNSR 

7731 

A 

34 

446 

STHASGDHSKPIICS/TG/SYFFANSLPSKGPTTLMPVNTAALTVPA 
NPASVITVPSKLPTSSKPPGAVPSNALTNPAPSKLPINSTRAGMVPS 
KVPTSMVLTKGCASTVPTDGSS INEETPMAPTPAGAVGS SLAWLDT 

7732 

A 

3 

418 

THASEFIGLAGSGAATVIFVSAYELNGLFSVLMLAWSFIiPIYIAGQV 
DMYAGAI FI QQS LHLDLYLAI VGLLAI TAVYTVAGGLAAVI YTDALQ 
TL I ML I G ALTLMG Y S F AAVGGMEGLKE KYFLALASNRS ENS SCG 

7733 

A 

1 

2115 

MNNHVKLIFKILSSLNTTLPSTSAWSSIRASNYNVPLSSTAQSTSGS 
SWGESSSGRITNWLVLKNLTPQIDGSTLRTLCMQHGPLITFHLNLPH 
GNALVRYSSKEEVVKAQKSLHMCVnjGNTTIIiAEFASEEEISRFFAQS 
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QSLTPSPGWQSLGSSGAKPVYGFQCMTSDHAPTCQVDEPGNDDKVKL 
GPLTQR I KFRLNGMKKQQVEVFENTDSE I PUDS S PS FVTCQPCD FGH 
DLNSQSHHS PCKDPEVS FRTMESGTS S PQPPQLDPLDAFPQKGLEPG 
D I AVIjVIiYFLFVLAVGIjWSTVKTKRDiVKGYFLAGGDMVWWPVGASIi 
FASNVGSGHFIGLAGSGAATGISVSAYELNAHALKLPRVRHAKKFGF 
QIPIVAFKDTFVQEVTPFVGAISTCKIiVHGTVPCQRQLPEGINVLKD 
S VKY AVFVNDLTALAVTQTGLGGHVS RTGTHAHWTQH I QP I PRLQER 
IQDPEEQPRRIDGHRCKVTCRMIPVLALRTGSSGGTSSLPCKEQCPT 
EKTRTQTLI VLRDPPVS AI KS PRGVRTPVPGRQNADPPRAKQKQRKA 
TMGLMCLPYLLPAECRAVTAHFTRPFLRFSRPPLFSHLTSAGPGREV 
NPHSATPSGSTTPTGPQDGGPSNWTRRLGSLSVPQFGAVLWHNGRTS 
RNHIGPTQQVDRTRTDTPVCQIAPSGHSHRQKRNELIDPLCDMRKAI 

7734 

A 

1 

662 

SLFASNVGSGHFIGLAGSGAATGISVSAYELNGLFSVLMLAWFLPI 
Y I AGQVTTM PE YLRKR FGG I R I P I I LAVL YLF I YI FTKI S VDMYAGA 
IFIQQSLHLDLYXAIVGLIAITAVYTVAGGLAAVIYTDALQTLIMLI 
GALTLMCYSFAAVGGMEGIiKEKYFLALASNRSENSSCGLPREDAFHI 
FR\NPLTSDLPWPGVLFGMS I PSLWYWCTDQGG 

7735 

A 

3 i; 

315 

PPATPAPGFPGGPGPPRSPTGLPRQPQGSLQQPAAPPAPLKSSGRPR 
HGGAGQPSQPGPGAGHRRGRTRAVPQAGPGPQRSSGHGPSWRRGGRA 
APGLCPPPP 

7736 

B 

107 

388 

SKKLLFAGSRSQLVQLPVADCIKYRSCADCVLARDPYCAWSVNTSRC 
VAVGGHFGSIiLIQHVMTSDTSGICNLRGSKKVRPTPKNITWAGTDL 
★ 

7737 

A 

98 

473 

I RGDT I S YTVRS PPPRAGRE YA\ MSTAGPPS PFS PLTQPFPALYREE 
GKYKI PLNPGAK/ YVRAPPNPS PQTHSRLHF PS PHPEN IHRPWSRTR 
PTPSPALPLKGHSTLSTPPPYVHRCPTFGGSGG 

7738 

A 

2 

433 

GDTGHRERSLGLLR£KHTWGPPHPQATPQAAVPSSRYTQPGPQ\GWV 
TVQGQPPSRGEHPRPAAGLSQLLHLGVPDIGLGQSGLDCEVTPKPSY 
SFPGTSGAGPPGTMGSSLPGNGNWDEEAHTLPQGQEGMNEGGVPWRL 
IFR 

7739 

A 

328 

497 

HNTDLFIY/CIYEMESCSVTQAGVQWRDLGSLQPPPLGCKQCSCLSL 
PS S WGI/QVPATT 

7740 

A 

1 

374 

MEKEEPGVERKKENRVKEVIHKNRKGAEGPPAGAALPLGGRGEGP/ S 
PEPPGGAAAPRAGERCGQLAGASPEPAETGTREPRPRGASAARKLAA 
AAPVGLLRPGCPRAGRRRAPWPGKRIiSEPRV 

7741 

A 

34 

367 

ERPGPVGTLTTPTQGPHPPAGP/ S /GRPGSLGFTWPYPGPCTWRPAW 
FPARPPPAGSITPPGFCTHTLVSLDPGRDCPSPPTGDCDSLCPPHPD 
ASPGI KAFCAQGKKKKKKK 

7742 

A 

2 

464 

IRARNYKLRKVMVDGDIPPRV\KKDAHEIiILDFIRSRPPLKQVSERR 
LRRLPPKQRSLHEKILEEIKQERRLRPVRGE/ ELGCPRVWLS / SPAS 
STPAPEMPSPPPASTCQSQMLGAAPSAR/VPRVIiLKAPTLAEMEEMN 
TS E I RAQGPRNVTAHNQ 

7743 

A 

2 

1004 

TGPGVPMCQVGEDYGEPAPEEPPPAPRPSREQKCVKCKEAQPVWIR 
AGDAFCTIDCFKAFYVHKFRAMLGKNRL^ 

VWQDLEGLS QDFAKRLRFVAGVI FFDEGAACGQSLEERS KTLAEVKP 
ILQATGFPWHWALEEELVGSEGAYKAAVDS F/ RPAAACAGGPGWIi 
ARLKGRNSHPSP/ SLDPQNLARPPAPAQTEALSQLFCSVRTLTAKEE 
LLQTLRTHIilLHVARAHGYSKVMTGDSCTRLAI KLMTOLALGRGAFL 
AWDTGFSDERHGDV\ATWPMRDHTLKEVAFYNRLFSVPSVFTPAVDT 

7744 

A 

2 

397 

APDTPDSSKFQRSK/NYKGPUDPQRGKIEDKVNKSKVILKEPGLEDIi 
GTYSDIVTDADEDISASHTLTEEELEKLKKLSHEIRNPGIKLISGWN 
IDI LERGEARLWLELEKLCPAAELHLI VHNKEI FRLPNR 

7745 

A 

1 

850 

QKJCELGHVNGLVDKSGKRTTSPSSDTDLLDRSASKTELKAIAHARIL 
ERRASRPGTPTSSASTETPTFEQNDVDEDIIDVDEEPVAAEPDYVQP 
QLRRPFELLIAAAMERNPTQFQLPNELTCTPALPGSSKRRRKEETTG 
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K2JVKKTQ\RELDHNGLVPLPVKVCFTCNRRCRVAPLIQRDYCPLIiFH 
KOXrLEPPLTAMPLGRVmCSNHIEHV^^ 

VSQHWKVDFLNRIHKKHPPNRRVLQSVKRRSLKVPDAIKSQNQVST 
P 

7746 

A 

1 

201 

MKKKKEEEEEEEEEEEEEEEEKEEEEEEEEEEEEEEEEE/EEEEKEE 
EEEEEEEEEEEEEEEKKKKKKHNIFKILSNLYIFIIIFYCSKK 

7747 

A 

3 

112 

KRTRLGriMAHA/CNLSTLGGQGGRVTRSGVRDQPDLK 

7748 

A 

1 

1478 

LTVIRRVDENWAEGMLADKIGIFPISYVEFNSAAKQLIEWDKPPVPG 
VDAGECSSAAAQSSTAPKHSDTKKNTRKRHSFTSLTMANKSSQASQN 
RHSMEISPPVLISSSNPTAAARISELSGLSCSAPSQVHISTTGLIVT 
P PPS S PVTTG PS FTF P S DD P I QAALiO 1 L»N P PJj P P P PLi IiAAi vliAolF 
PGATAAAAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPYTPRJCE 
DELELRKGEMFLVFERCQDGWFKGTSM\HTSK\IGGFPGN\YVAPVT 
RAVTNAS QAKVPMSTAGQTS RGVTMVS PS TAGG \ PAQKA\ QGNGVAG 
SP\ S WPAAWS AAH I QTS PQAKVLLHMTGQMTVNQARNAVRTX VAA 
HNQ \ ERP \ TAAVTP I Q VQNAAGLS PAS VGLS HHSLAS PQPAPLMPG \ 
PATHTAAI S I SRASAPlA ACAAAAPLTS PS I TSASLEAEPSGRI VTV 
LPGLPTSPDSASSACGNSSATKPDKDSKK 

7749 

A 

139 

6503 

LSGITKMTLHATRGAALLS WVNSLHVADPVEAVLQLQDCS IFIKI ID 
RIHGTEEGQQILKQPVSERLDFVCSFLQKNRKHPSSPECIjVSAQKVL 
EGSELEIJUa^TMLLLYHSTMSSKSPRDWEQFEYKIQAEIAVILKFVL 
DHEDGLNLNEDLENFLQKAPVPSTCSSTFPEELSPPSHQAKREIRFL 
ELQKVASSSSGNNFLSGSPASPMGDILQTPQFQMRRLKKQLADERSN 
RDELEliEIiAENRKLLTEKDAQ I AMMQQR I DRLALLNEKQAAS PLEP K 
ELEELRDKNESLTMRI^TLKQCQDLKTEKSQMDRKINQLSEENGDL 
SFKLREFASHLQQLQDALNELTEEHSKATQEWIiEKQAQLEKELSAAL 
QDKKCLEEKNEILQGKLSQLEEHLSQLQDNPPQEKGEVLGDVLQLET 
LKQEAATIiAA^TQLQARVEMLETERGQQEAKLLAERGHFEEEKQQIi 
S S L I TDLQS S I SNL S QAKEELEQ AS QAHGARLT AQVAS LTS ELTTLN 
ATIQQQDQEIiAGLKQQAKEKQAQLAQTLQQQEQASQGLRHQVEQLSS 
SLKQKEQQLKEVAEKQEATRQDHAQQLATAAEEREASbRERDAALKQ 
LEALEKEKAAKLEILQQQLQVANEARDSAQTSVTQAQREKAELSRKV 
EELQACVETARQEQHEAQAQVAELELQLRSEQQKATEKERVAQEKDQ 
LQEQLQALKESLKVTKGSLEEEKRRAADAliEEQQRCISELKAETRSL 
VEQHKRERKELEEERAGRKGLEARLQQLGEAHQAETEVLRREIAEAM 
AAQHTAESECEQLVKEVAAWRERYEDSQQEEAQYGAMFQEQLMTLKE 
ECEKARQELQEAKEKVAGIESHSELQISRQQNEI^LHANLARALQQ 
VQEKEVRAQKLADDLSTLQEKMAATSKEVARLETLVRKAGEQQETAS 
RELVKEPARAGDRQPEWLEEQQGRQFCSTQAALQAMEREAEQMGNEL 
ERLRAALMESQGQQOEERGQQEREVARLTQERGRAQADLALEKAARA 
ELEMRIiQNALNEQRV^FATLQEALAHAL^ 

QIKELEELRQTVKQLKEQLAKKEKEHASGSGAQSEAAGRTEPTGPKL 

EAIiRAEVSKLEQQCQKQQEQADSLERSLEAERASRAERDSALETLQG 

QLEEKAQELGHSQSALASAQRELAAFRTKVQDHSKAEDKWKAQVARG 

RQEAERKNSL I S SLEEEVS I LNRQVLEKEGESKELKRLVMAESEKSQ 

KLEE\ S CACCRQRQP ATVP E LQN AALLCGRRCRAS G / REAEKQRVAS 

ENLRQELTSQAERMELGQELKAWQEKFFQKEQALSTLQLEHTSTQA 

LVSELLPAKHLCQQLQAEQAAAEKRHREELEQSKQAAGGLRAELLRA 

OP E "LGET, T PT.RO KVAEOERTAOOL.RAE KAS YAE OL SMliKKAHGLIiAE 

ENRGL^ERANLGRQFLEVTSLDQAREKYVQELAAVRAD^ 

EAQSTARELEVMTAKYEGAiCVKVLEERQRFQEERQKLTAQVEQLEVF 

QREQTKQVEELS KKUADSDQAS KVQQQKLKAVQAQGGESQQEAQRLQ 

AQLNELQAQLSQKEQAAEHYKLQMEKAKTHYDAKKQQNQELQEQLRS 

LEQLQKENKELRAEAERLGHELQQAGLKTKEAEQTCRHIjTAQVRSLE 

AQVAHADQQLRDLGKFQVATDALKSREPQAKPQLDLSIDSLDLSCEE 

GTPLS ITS KLPRTQPDGTSVPGEPAS PI SQRLPPKVESLESLYFTP I 
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PARSQAPLESSI>DSU5DVFQDSGRKTRSARRRTTQIINITMTKKIiDV 
EEPDSANSSFYSTRSAPASQASLRATSSTQSLARLGSPDYGNSTUiLS 
LPGYRPTTRSSARRSQAGVSSGAPPGRNSFYMGTCQDEPEQLDDWNR 
I AELQQRNRVCP PHLKTC YPLESR P S LS LGT I TDEEMKTGDPQETLR 
RASMQPIQIAEGTGITTRQQRKRVSLEPHQGPGTPESKKATSCFPRP 
MTPRDRHEGRKQSTTEAQKKAAPASTKQADRRQSMAFSILNTPKKLG 
NSLLRRGASKKALSKASPNTRSGTRRSPRIATTTASAATAAAIGATP 
RAKGKAKH 

77S0 

A 

1 

5290 

MTLHATRGAALLSWVNSLHVADPVEAVLQLQDCS IFIKI IDRIHGTE 
EGQQI LKQPVS ERLDFVCS FLQKNRKHPSS PECLVS AQKVLEGS ELE 
LAKMTMLLLYHSTMSSKSPRDWEQFEYKIQAELAVILKFVLDHEDGL 
NLNEDLENFLQKAPVPS TCS STF PEELS PPSHQAKRE I RFLELQKVA 
SSSSGNNFLSGSPASPMGDILQTPQFQMRRIiKKQLADERSNRDELEL 
ELAENRKLLTEK3DAQIAMMQQRIDR1JUjIiNEKQAASPLEPKELEELR 
DKNESLTMRLHETLKQCQDLKTEKSQMDRKINQLSEENGDLSFKLRE 
FASHLQQLQDALNELTEEHSKATQEWIiEKQAQLEKELSAALQDKKCL 
EEKNEILQGKLSQLEEHLSQLQDNPPQEKGEVLGDVLQLETLKQEAA 
TLAANNTQLQ7VRVEMLETERGQQEAKLLAERGHFEEEKQQLSSLITD 
LQSSISNLSQAKEELEQASQAHGARLTAQVASLTSELTTIiNATIQQQ 
DQELAGLKQQAKEKQAQIiAQTLQQQEQASQGLRHQVEQLSSSLKQKE 
QQLKEVAEKQEATRQDHAQQLATAAEEREASLRERDAALKQLEALEK 
EKAAKLEILQQQLQVANEARDSAQTSVTQAQREKAELSRKVEELQAC 
VETARQEQHEAQAQVAELELQLRSEQQKATEKERVAQEKDQLQEQLQ 
AI.KESLKVTKGSLEEEKRRAADALEEQQRCISELKAETRSLVEQHKR 
ERKELEEERAGRKGLEARLQQLGEAHQAETEVLRRELAEAMAAQHTA 
ESECEQLVKEVAAWRERYEDSQQEEAQYGAMFQEQLMTLKEECEKAR 
QELQEAKEKVAGIESHSELQISRQQNELAELHANLARALQQVQEKEV 
RAQKLADDLSTLQEKMAATSKEVARLETLVRKAGEQQETASRELVKE 
PARAGDRQPEWLEEQQGRQFCSTQAALQAMEREAEQMGNELERL.RAA 
LMESQGQQQEERGQQEREVARLTQERGRAQADLALEKAARAELEMRI* 
QNALNEQRVEFATLQEALAHALTEKEGKDQELAKLRGLEAAQIKELE 
ELRQTVKQLKEQLAKKEKEHASGSGAQSEAAGRTEPTGPKLEALRAE 
VSKLEQQCQKQQEQADSLERSLEAERASRAERDSALETLQGQLEEKA 
QELGHSQSALASAQRELAAFRTKVQDHSKAEDEWKAQVARGRQEAER 
KNSLISSLEEEVSILNRQVIjEKEGESKELKRLVMAESEKSQKLEEXS 
CACCRQRQPATVPELQNAALLCGRRCRASG / REAEKQRVASENLRQE 
LTSQAERAEELGQELKAWQEKFFQKEQALSTLQLEHTSTQALVSELL 
PAKHLCQQLQAEQAAAEKRHREELEQSKQAAGGLRAELLRAQRELGE 
LIPLRQKVAEQERTAQQLRAEKASYAEQLSMIjKKAHGLIjAEENRGLG 
ERANLGRQFLEVELDQAREKYVQELAAVRADAETRLAEVQREAQSTA 
RELEVMTAKYEGAKVKVLEERQRFQEERQKLTAQVEQLEVFQREQTK 
QVEELSKKLADSDQASKVQQQKLKAVQAQGGESQQEAQRLQAQLNEL 
QAQLSQKEQA7VEHYKLQMEKAKTHYDAKKQQNQELQEQLRSLEQLQK 
ENKELRAEAERIiGHELQQAGLKTKEAEQTCRHIiTAQVRSLEAQVAHA 
DQQLRDIiGKFQVATDALKSREPQAKPQLDLSIDSLDLSCEEGTPLSI 
TRSGGSLPPYVCIiWSACCLSGCILVR 

7751 

A 

241 

6574 

LSG I TKMTLHATRGAALLS WVN S LHVADPVEAVLQLQD C S I F I KI I D 
RIHGTEEGQQILKQPVSERLDFVCSFLQKNRKHPSSPECLVSAQKVIi 
EGSELELAKMTMLLLYHSTMSSKSPRDWEQFEYKIQAELAVILKJVI* 
DHEDGLNLNEDLENFLQKAPVPSTCSSTFPEELSPPSHQAKREIRFL 
ELQKVASSSSGNNFLSGSPASPMGDILQTPQFQMRRIiKKQLADERSN 
RDELELELAENRKLLTEKDAQIAMMQQRIDRLALLNEKQAASPLEPK 
ELEELRDKNESLTMRLHETLKQCQDLKTEKSQMDRKINQLSEENGDL 
SFKIiREFASHLQQLQDAIiNELTEEHSKATQEWLEKQAQLEKELSAAL 
QDKKCLEEKNEILQGKLSQLEEHLSQLQDNPPQEKGEVLGDVLQLET 
LKQEAATLAANNTQLQARVEMLETERGQQEAKLLAERGHFEEEKQQIi 
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SSLITDLQSSISNIjSQAKEELEQASQAHGARLTAQVASLTSEIirii^ 

ATIQQQDQBIiAGLKQQAKEKQAQLAQTLQQQEQASQGLRHQVEQLSS 

SLKQKEQQLKEVAEKQEATRQDHAQQLATAAEEREASLRERDAALKQ 

LEALEKEKAAKLEILQQQLQVANEARDSAQTSVTQAQREKAELSRKV 

EELQACVETARQEQHEAQAQVAELELQLRSEQQKATEKERVAOEKDQ 

LQEQLQALKESLKVTKGSLEEEKRRAADALEEQQRCISELKAETRSL 

VEQHKRERKELEEERAGRKGLEARLLQLGEAHQAETEVLRRELAEAM 

AAQHTAESECEQLVKEVAAWRDGYEDSQQEEAQYGAMFQEQLMTLKE 

ECEK7VRQELQEAKEKVAGIESHSELQISRQQNKLAELHANLARALQQ 

VQE KEVRAQKLADDLS TLQEKMAATS KEVARLETLVRKAGEQQETAS 

RELVKEPARAGDRQPEWLEEQQGRQFCSTQAALQAMEREAEQMGNEL 

ERliRAALMESQGQQQEERGQQEREVARLTQERGRAQADLALEKAARA 

ELEMRLQNALNEQRVEFATLQEALAHALTEKEGKDQELAKLRGIjEAA 

QIKELEELRQTVKQLKEQLAKKEKEHASGSGAQSEAAGRTEPTGPKL 

EALRAEVSKLEQQCQKQQEQADSLERSLEAERASRAERDSALETLQG 

QLEEKAQELGHSQSAIiASAQRELAAFRTKVQDHSKAEDEWKAQVARG 

RQEAERKNSL I SSLEEEVS I LNRQVLEKEGE S KELKRLVMAES EKS Q 

KLEESCACCRQRQPATVPELQNAALLCGRRCRASGREAEKQRVASEN 

LRQELTSQAERAEELGQELKAWQEKFFQKEQAIiSTLQLEHTSTQALV 

SELLPAKHLCQQLQAEQAAAEKRHREELEQSKQAAGGLRAELLRAQR 

ELGEIiIPLRQKVAEQERTAQQLRAEKASYAEQLSMLKKAHGLLAEEN 

RGLGERANLGRQFLEVELDQAREKYVQEIAAVRADAETRLAEVQREA 

QSTARELEVMTAKYEGAJCVXVIiEERQRFQEERQKIjTAQVEELSKKLA 

DSDQASKVQQQKLKAVQAQGGESQQEAQRFQAQLNELCAQLSQKEQA 

AEHYKLQMEKAICTHYDAKKQQNQELQEQIiRSLEQLQKENKELRAEAE 

RLGHELQQAGLKTKEAEQTCRHLTAQVRSLEAQVAHADQQLRDLGKF 

QVATDALKSREPQAKPQLDLS IDSLDLSCEEGTPLS ITSKLPRTQP0 

GTSVPGEPASPISQRLPPKVESLESLYFTPIPARSQAP\LESSLDSL 

GDVFL\DSGRKTRSARRRTTQI INI\TMTKK\LDV\EEPD/ SAPNLS 

FYS\TRSAPASQASLRAT\SST*SLTRLG\SPDYGNSADLSLPGYRP 

TTRSSARRSQAGVSSGAPPGRNSFYMGTCQDEPEQLDDWNRIAELQQ 

RNRVCPPHLKTCYPLESRPSLSLGTITDEEMKTGDPQETLRRASMQP 

IQIAEGTGITTRQQRKRVSLEPHQGPGTPESKKATSCFPRPMTPRDR 

HEGRKQSTTEAQKKAAPASTKQADRRQSMAFS I LNTPKKLGNS LLRR 

GAS KKALS KAS PNTRSGTRRS PRI ATTTASAATAAAIGATPRAKGKA 

KH 

7752 

A 

775 

1128 

REGLYLSRFPFHSKLPRT\QPDGTSVPGEPASPISQRLPPKVESLES 
LYFTPIPARSQAPLESSIiDSLGDVPLDSGRKTRSARRRTTQIINITM 
TKVRLL.GQGLQGSNSHFLSRASLQ 

7753 

A 

331 

531 

LKIFFFFPSGGDMSKNVSQSQMAKLjNQQMAKMI^PRVL\HHMGGMAG 
LQSMMRQFQQGAAGNMKGMM 

7754 

A 

1 

1517 

. MVLADLGRKITSALRSLSNATI INEEVLNAMLKEVCTALLEADVNIK 
LVKQIiRENVKSAIDLEEMASGLNKRKMIQHAVFKELVKLVDPGVKAW 
TPTKGKQNVIMFVGLQGSGKTTTCSKIiAYYYQRKGWKTCIilCADTFR 
AGAFDQLKQNATKARIPFYGS YTEMDPVI IASEGVEKFKNENFEII I 
VDTSGRHKQEDSLFEEMLQVANAIQPDNIVYVMDASIGQACEAQAKA 
FKDKVDVAS VI VTKLDGHAKGGGAIiS AVAATKS P 1 1 F I GTGEH I DD F 
EPFKTQPFISKLLGMGDIEGLIDKVNELKIiDDNEALIEKLKHGQFTL 
RDMYEQFQNIMKMGPFSQILGMIPGFGTDFMSKGNEQESMARLKKLM 
TIMDSMNDQELDSTDGAKVFSlWSGRIQRliARPSCVb l i=>ijvyr»ijjj J-w 
YTKFAQMVK*MGGIKGLF\KGGDMSIQIGARSQIAK\LNQQMAKMMDP 
R\ VLHHMGGMARLQSMMRQ\ FKQGAAGNMKGMMGFNNM 

7755 

A 

42 

451 

FGL FNRL PYPPQHPKPHYS RI PGA/ HAPPGPQNP S T\ FL S PL YCWG P 
SSKSPGAPKRCPSSISTPPPSATPPRTLTGSWPGQRCQKDGEAQSYQ 
VRQDPTASPQPAGQSRLPSWSGNQTYIiQGLHGRGPRARTLQLPG 

7756 

A 

3 

359 

LLIHSSTHPPTYPPIHPPGHHFFLPLFFPFFFSFSPSFHPSTHPLIH 
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LSTYPPIHPSVYPPVPLSLS VYS S IHLPTPEYFLFQPVAH/RRSHGR 
I S SMPGEQS P I PRLWKS PSSKPPFPW 

7757 

A 

1 

1668 

MLVHTQPPE\mQYNHSSAATRSRFQRLQLQLIWGWSQVLPMSAFYQI 
IIQASGLRLHACVTPGAGTHSLPFLPSVGNLPSNKGNSRVYCFFAAD 
LFD I LLPMLN I YQE FVRNHQ YS LQVLANCKQNRD FDKLLKQ YEANPA 
CEGRMLETFLTYPMFQIPRYIITLHELLAHTPHEHVERKSLEFAKSK 
LEELSRMIVEGCDILLDTSQTFIRQGTMKNSNTIIVKHEWGRQLHSD 
WTLLLSGPD I hCSS KEDRS AWPTAVAVPS ADDTQAKAQEKLAI#\ SKP 
FHIRLGTMSEQS*SLHQRTEQE\GKLPCSGHHRTFTLQIPLNSSAPP 
LSTAWRTDGGQCPLLCFTGFCFCNSHSCSRTWE\PPEEEDDICLPTS 
QARCNEYVGLRKSIMLVQKKDKKNITCRSLTPSEEGLGTIVKVLVFS 
EIMFNLDFKIVVEPPDAAAFT\A^liAPSRQEKAAWMSDISQCVDNIR 
CNGLMT I VFEENS KVTVPHM I KAKVNGCTMSHLDNSLTLPFTDTEQG 
RS S S WTAGHVKAPCPLHAQLPLLASLFLQELQAFARMS PYP 

7758 

A 

1 

2247 

MTILPKAIYKFNAIPIKIPPSFFTELDKIILKLIWNQKGAHIAKARL 
SKKNKSGDITLPDFKLYYKATVTKTAWYWYKNRHVDQWl^IENPEIN 
PNTYSQL I FNKRNKNVRCREGTAVQRMLS S KVS EAG Y I CNQALPTMT 
G VRRLALETAS LGD S PL AHAKATD FELNNELKMNVLRRARRS IiARPR 
P S S PRKE S RRE Y PQ AVY FS S VHVFAS PS LMQQHQMLVCDVLATGLAL 
SQDDQDDIHLKLEDIIQMKHSLRVSANPPQQQGSAALAVLEVDIIPK 
AGCVYVC PLS S CL I WGWS QVLPMS AF YQ III QAS GLRLHACVTPGAG 
THSLPFLPSVGNLPSNKGNSRVYCFFAADLFDILLPMLNIYQEFVRN 
HQYSLQVLANCKQNRDFDKLLKQYEANPACEGRMIiETFLTYPMFQIP 
RYI ITLHELLAHTPHEHVERKSLEFAKS KLEELSRVMHDEVSDTENI 
RKNLAI ERM I VEGCD I LLDTS QTF I RQGTMKNSNT 1 1 VKHEWGRQ Y I 
LTGRSSSAARTFSAAPRRTEEEDDICLPTSQARCNEYVGLRKSIMLV 
QKKDKKNITCRSLTPSEEASDPRERIiTKMKGAKLNSSAPAIiRNRFPA 
VG S KPLL PMSNIiVASQ IMNYADVS SRANAI EKWVAVAD I CRCLHNYN 
G VLE I TS ALNRS AI YS AAAAAWWAGWRGVKLKLHFPD SLAAMVLDV 
NSVPLIDALGKTQKAEVGAIFLWLKVSHGDVSALGQCQHPALGF 

7759 

A 

1 

3711 

MQKSVRYNEGHAliYLAFIiARKEGTKRGFLS KKTAEASRWHEKWFAL Y 
QNVLFYFEGEQSCRPAGMYLLEGCSCERTPAPPRAGAGQGGVRDALD 
KQ YY FTVLFGHEGQKPLELRCEEEQDGKEWMEAI HQAS YAD I L I ERE 
VLMQKYIHLVQIVETEKIAANQLRHQLEDQDTEIERLKSEIIALNKT 
KERMRPYQSNQEDEDPDIKKIKKVQSFMRGWLCRRKWKTIVQDYICS 
PHAESMRKRNQIVFTMVEAESEYVHQLY I LVNGFLRPLRMAAS S KKP 
PIS HDDVS S I FLNS ET IMFLHE I FHQGLKAR IANWPTL I LADLFD I L 
LPMLNIYQEFVRNHQYSLQVLANCKQNRDFDKLLKQYEANPACEGRM 
LETFLTYPMFQIPRYIITLHELLAHTPHEHVERKSLEFAKSKLEELS 
RVMHDEVSDTENIRKNLAIERMIVEGCDILLDTSQTFIRQGSLIQVP 
SVERGKLSKVRLGSLSLKKEGERQCFLFTKHFIilCTRSSGGKIjHLIiK 
TGGVLSLIDCTLIEEPDASDDDSKGSGQVFGHLDFKIVVEPPDRAAF 
TVVLLAPSRQEKAAWMSDISQC^NIRCNGLMTIVFEENSKVTVTHM 
IKSDARLHKDDTDICFSKTLNSCKVPQIRYASVERLLERLTDLRFLS 
IDFLNTFLHTYRI FTTAAVVLGKLSDI YKRPFTS I PVRSLELFFATS 
QNNRGEHLVDGKSPRLCRKFSSPPPLAVSRTSSPVRARKLSLTSPLN 
SKIGALDIjTTSSSPTTTTQSPAASPPPHTGQIPLDLSRGLSSPEQSP 
GTVEENVDNPRVDLCNKLKRS IQKAVLESAPADRAGVESSPAADTTE 
LS P CRS P S TPRHLR YRQPGGQTADNAHC S VS PAS AFAI ATAAAGHG S 
PPGFNNTERTCDKEFI IRRTATNRVLNVLRHWVSKHAQDFELNNELK 
MNVLNLLEEVLRDPDLLPQERKAAANILRALSQDDQDDIHLKLEDI I 
QMTDCMKAECFESLS^ELAEQITLLDHVIFRSIPYEEFLGQGWMKL 
DKNERTPYIMKTSQHFNDMSNLVASQ IMNYADVS SRANAI EKWVAVA 
D I CRCLHNYNG VLE I TS ALNRS AI YRLKKTWAKVS KQTKALMDKLQK 
TVSSEGRFKNLRETLKNCNPPAVPYLGMYLTDLAFIEEGTPNFTEEG 
LVNFSKMRM I SH 1 1 RE I RQFQQTS YRIDHQPKVAQYLLDKDLI I DED 


.± o o a «■!•• & «+ 3; „, o a a & o la 






TLYELSLKIEPRLPA 

7760 

A 

1 

226 

SLEFSQSKLQYLSRVKH\DEVSDTENIRINLAIERMIVEGCDILLDT 
SQTFIRQWDHENQQHNHSKARVGQAVTF 

//OX 

ri 


X w O J 

GRL.GPHWS POREGEAOAWOAAGPEPCPAGROLRPGEKS STAAPGPGA 
KLPPTARGQRGRPAALSAGPTEPAEPTPTRNSGWPASAPPV/ PRWGQ 
PLLGVPGPEAGRYSA\APSLSALRFPAWWPQDVPVAWQSATDDAYEA 
LS PS S CA FAM\ PS S VPAY P S I TVTPDEEQNLNHY I QVLENLVRS VPS 
GEPGREKKSNSPKHVYSIASKGSKFKELVTHGDASTENDVLTNPISE 
ETTTFPTGGFTPE IGKKKHTESTPFWS IKPNNVS I VLHAEEPYI ENE 
EPEPEPEPAAKQTEAPRMLPVWESSTSPYVTSYKSPVTTLDKSTGI 
GDLYRIRRCSSALR 

7762 

A 

3 

344 

KERSREKTGGGGDLQREVAREDRRWRRS \ QREDAREDRRWRR S S ERG 
RLRRQAVAEI FRERDAREDRRWRRS SERGRPRRQ/ RGGGDLQREDAR 
EDRRWRRSSEREMPEKTGGGG 

7763 

A 

205 

340 

NPYIYSQLFFDKGTKNIHWG\RDSIiFNKCCSENLISII\RKIKLDP 

7764 

A 

1 

668 

GDPGAGPGDHNRFDCGPQPPPPPKCELLHVAIVCAGHNSSRDVITLV 
KSMLFYRKNPLHLHMVTGAVARNIMETIiFHTWMVPAVRVS FYHADQL 
KPQVS W I PNKHYSGLYGLMKLVLPSAIjPAEIiARVI VLDTDVTFASDI 
SELWALFAHFSGVILLRLDRLRQAGWEQMWRLTARRELIi/GLPATSL 
ADQDIFNAVIKEHSGLVQRIiPCVWNVQLSNHTLAE 

7765 

A 

3 

1289 

INFPTTSGLDGDPGAGPGDHNRSDCCPQPPPPPRCELLHVAIVCAGH 
NS SRDV I TLVKSMLF YT^KNPIjHLHIjVxxOAVARNI LETL FHTW1WPAV 
RVS F YHADQLKPQVS W I PNKHYSGL YGLMKLVLPS ALP AELARVI VL 
DTDVTFASDISELWALFAHFSDTQAIGLVENQSDW\YLGNLWKNHMP 
WPALGRGFNTGVILLRLDRFR\ KAGWEQMWRIiTARREL\ LSLPATSL 
ADQDIFNAVIKEHPGLVQRIjPCVWNVQLSDHTLAERCYSEASDLKVI 
HWNS PKKVlA VKNNHVEFFRKI YLILLENQGNLLLEEVFLCPSQPPT 
\GADQLQ\RPLPQLDEEDPCFEFRQQQLTVHRGH\LLFIiPHEPPPPR 
PHDVTLVAQLSMDRLQMLEALCRHCPGPVSLALYLTDAEAQQFLHFV 
EANGVG I DRV 

7766 

A 

530 

820 

RVSRGRKWFFIALKRMPAMKKAMNLFLGLSNVRTVHPEGFTVYISra 
I S FPSLSGYRTGLRS FGLVKQKKS P I RMPCVYTNPCVS I VSRMAQAS 
SL 

7767 

B 

1 

954 

MATAAT I P S VATATAAALGEVEDEGLLAS LFRDRF PEAQWRER PDVG 
RYLRELSG SGLERLRRE PERLAEERAQLLQQTRDLAFANYKTF I RG A 
ECTERIHRLFGDVEASLGRLLDRLPSFQQSCRSPRAARNAVSIMAAG 
AGTAGPASGPGWRDPAASQPRKRPGREGGEGARRSDTMAGGGGSSD 
GSGRAAGRRASRSSGRARRGRHEPGLGGPAERGAGEARLEEAVNRWV 
LKFYFHEALRAFRGSRYGDFRQIRDIMQ7VLTFFRGTLDQSSKQELSL 
DLSSESIKNIQVNITYSLRSRDCANFDILYDIKPF* 

7768 

C 

97 

408 

MKTKVQPRLTVRVARNCSPKNKAHDNKQNWSWEEDSQITEPSAGLNS 
SQEAVSAPPSKP1VFNQPLPGEKNPKYEDLLCRNFGAGWWSWLGLVL 
LPGIKETFS* 

7769 

A 

1694 

1868 

PLIACSFFLFLFFFFLRQSLGSVHPGLGV\QWRDLGSLRPPFPGLTQ 
FSCLSLPRSWN 

7770 

A 

106 

317 

CSAETEREIGRRK*CIQESSSPSPLSIKKCPICKAD*YLi*SHI*GATA 
DAAPLFQLSSL*QDLPSYREADL 

7771 

A 

321 

451 

KIMbCFQLTTCPRMMS I TS S AMWMRMVWS VEQVFLSNS VQK>1R 

7772 

A 

2516 

2719 

FVKSKALAFFLSFFFFFFLKQSLSVTQAGVQW\PVDSLQPLPPRFKR 
FSCLSLPSSWDYRCVPQCPAN 

7773 

A 

3 

282 

LVIiRLRNSRKELNVFLFEFTPGRDTADGVSQELFSAGLVDGHDVFIV 
AANLQKIVDDPKALKx^LTFKLASGCDGSEIPDEVKIilGFAQVSVS 

7774 

A 

1 

1126 

MRPGSLTDWLEPRSEGGKEKSIPERRNGPCKDPGAAACKDCIiTSVIi 
FAximHMYLHNLLLRQGVSRASRQSALTQSSGTPAMYQCSKT^IFQ 
GIQVDYQEVTHTGPLSIEGSMLDIIKYI\VNRGEHKNGVLEEAIIAT 
ILKEV\ LEGLDYLHRNGQIHRNSRKELNDIRFEFTPGRGPKGNIX3EC 
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DEKHTTVLMRVEQQWDHTERKTQTQMSTEAKVNDTADGVSQBLFSAG 
LVDGHDWIACPSRNIiRLFIGIISTFPGAQAKTLGIVLDFSFSHTSH 
PTHLNVPR I LPLLT I S H P I TFQAPLNS QI*S SCI HPF AAI PAS VLASL 
QPVLI STARDPGTSHYS SAIjGFIGQQI PAYMELSPVIGELCLSKVTE 

7775 

A 

2 

1622 

AATGKQALHCIRMSMKWTSTUiLLIQLSCYFSSGSCGKVIiVWPTEFSH 
WMNI KT ILDELVQRGHEVTVLAS S AS I S FDPNS PSTLKFEVYPVSLT 
KTEFEDIIKQLVKRWAEIiPKDTFWSYFSQVQEIMWTFNDILRKFCKD 
IVSNKKXiMKKLQESRFDVVLADAVFPFGEIiLAELLKIPFVYRPRFSP 
GYAIEKHSGGLLFPPSYVPVVMSELSDQMTFIERVKNMIYVLYFEFW 
FQIFDMKKWDQFYSEVLGRPTTLSETMAKADIWLIRNYWDFQFPHPI* 
LPNVE FVGGLHC KPAKPL P KEME E FVQS S GENG VWFS LG SMVSNTS 
EERANVIASALAKIPQKVLWRFDGNKPDTLGLNTRLYKWIPQNDLLG 
HPKTRAF I THGG ANG I YKA I S PR I PMVGVPL F ADQPDN I AHMKAKGA 
AVSLDFHTMSSTDLIJ^AIjKTVIiroPLYKENAMKLSRIHHDQPVKPLD 
RAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFLLACVATVI 
FI ITKCLFCVWKFVRTGKKGKRD 

111S 

A 

1 

698 

MAAKLVGIEFDPCWVKMLMEKGHQKTREHNYPSMEQSEKAGVTDNIQ 
GKTKALAGSRQSVANGKNSHCKENVTWLLI IHYVQPMKP YEVFWIEF 
VTHQKATKYXjCQLPTTS PD AVF P FGELLAELLKI PFVYS LRF S PGYA 
IEKHSGGLLFPPSYVPVVMSELSDQMTFIE/RGRPTTLSETMAKADI 
WLIRNYVTOFQFPHPLLPNVEFVGGIjHCKPAKPLPKTIjGHGWDRAP 

1111 

A 

1 

1584 

MSMKVTTSALLLIQLSCYFSSGSCGKVLVWPTEFSHWMNIKTIIiDELV 
QRGHEVTVLASSASISFDPNSPSTLKFEVYPVSLTKTEFEDIIKQLV 
KRWAELPKDTFWSYFSQVQEIMWTFNDILRKFCKDIVSNKKLMKKLQ 
ESRFDWLADAVFPFGELLAELLKIPFVYRPRFSPGYAIEKHSGGLL 
pppq YVPWM^PT.tJnflMTP T RR VTCKTMT YVT.YFEFWFOT FDMKKWDOF 
YSEVLGRPTTLSETMAKADIWLIRlTx^ 

KPAKPLPKEMEEFVQSSGENGVVVFSLGSMVSNTSEERANVIASAIiA 
KI PQKVLWRFDGNKPDTLGLNTRL YKW I PQNDLIiGHPKTRAF I THGG 
ANG I YKAI S PRI PMVGVPLFADQPDNI AHMKAKGAAVSLiDFHTMS ST 
DLLNT^KTVINDPLYKENAMKLSRIHHDQPWPLDRAVFWIEFVMRH 
KGAKHLRVAAHDLTWFQYHSLDVTGFLLACVATVIFI ITKCLFCVWK 
FVRTGKKGKRD 

7776 

A 

2 

1627 

AAxXSKQALHCIRMSMKWTSALLLlQLSCYFSSGSCGKVLVWPTEFSH 
WMNIKTILDELVQRGHEVTVLASSASISFDPNSPSTLKFEVYPVSLT 
KTEFED 1 1 KQLVKRWAELPKDTFW S YF SQVQE IMWTFND ILRKFCKD 
IVSNKKLMKKLQESRFDVVLADAVFPFG 

PGYATEKHSGGLliFPPS YVPWMS ELSDOMTF I ERVKNMI YVLYFEF 

wfqifdmkk>toqfysevlgrpttl\se™;^^ 

PLLPNVEFVGGLHCKPAKPLPKEME\ EFVQS SGENGVWFSLGSMVS 
NTSEERANVIASALAKIPQKVLWRFDGNKPDTLGI^^ 
LLGHPKTRAFITHGGANGIYEAIYHGIPMVGVPLLADQPDNIAHMKA 
KGAAVSLDFHTMSSTDLl^ALKTVINDPLYK^NAMKLSRIHHLXJPVK 
PLDRAVFW I E FVMRHKG AKHLRVAAHDIjTWFQ YHS I*DVTGFI*IjACVA 
TVIFI ITKCLFCVWKFVRTGKKGKRD 

7779 

A 

1 

187 

SGRPFFFFFGLPFLFFFFFF*DRVSLLI»PRI*QCNGTISAHCNliCLPG 
SSDSPASASRVAELP 

7780 

B 

436 

711 

DVYKIGG I G xVPVGR\^TGVLKPGM\AH^FAP VN\TTTFA^SVEMHHEA 
L S EALPGDNVGFNVKKVS VKDVRRG1WAGDS K1TOPPMEAAGFTAQ * 

7781 

B 

26 

364 

MHHEALSEALPGDNVG FNVKNVS VKDVRRGNVAGDS KND PPMEAAGF 
TAQV 1 1 LNH PGQ I S AG YAL YW I AI VDMVPGKPMCVES FS D YPPLGRF 
AVRDMRQTVAVGV I KAVDKKAAGLAS * 

7782 

A 

3 

523 

FAP VNVTTE VKS VEMHHE AL» S EAL PGDNVGFNVK^A^S VKBVRRGNVA 
GD / SKNDPPMEAAGFTAQVI I LNHPGQISAGYAPVXiDCHTAHIACKF 
AEIiKEKIDRRSGKKLEDGPKFMVPGKPMC\VESFS\DYPPLGRFAVR 
DMRQTVAVGVI KAVDKKAAGAGKVTKSARKLQKAK 
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7783 

B 

32 

188 

MVVTFAPVNVTTEVK5VEMHHEALSEALPGDNVGF 
FLASPX* 

7784 

B 

60 

209 

MHHEALSEALPGDNVGFNVKNVSVKDVRRGNVAGDSKNDPPMEAAGF 
TAO* 

7785 

A 

3 

313 

RYEEWKEVSTYI KKIG YNPDTVAFVPI SGWNGDNMLEPSANMPWFK 
GWKVTRKDGNASGTTLLEALDCILPPTRPTDKPLRLPLQDVYKIGGI 
GTVPVQKAK 

7786 

A 

13 

555 

INPPPPTVPVGRVETGVLKPGMVVTFAPVNVTTEVKSVEMHHEALSE 
ALPGDKVGFNVKI^SVKDVRRGWAGDSKNPPPMETUVGFTAQVI ILN 
HPGQISAGYAPVLDCHTAHIACKFGRVETGVIiKPGMVVTFAPVNVTT 
EVKSVEMHHEALSEALPGDNVGFQCQECVCQGCSSWQRCW 

7787 

A 

3 

366 

IGY /NPDTVACV/ P ILGWNGDNMLEQSANMPWFKGWKVTRKDGNASG 
TTLLEAli)CILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVL 
KPGMVVTFAPVNVTTEVKSVEMHHEAQKAK 

7788 

A 

1 

513 

E AG I S KNGQTREHALLAYTLGVKQL I VG VNKMDSTE P P Y S QKR YEE I 
VKEVSTYIKKIGYNPDTVAFVPISGWNIACKFAELKEKIDRRSG\KK 
LEDGPKFLKSG\DAAI VDMVPG\ KPMCVES FS \ DYPPLGRFAV\ RDM 
RQTVAVGVIKAVDKKAAGAGKVTKSAQ\ kaqkak 

/toy 



1099 

MQSERGITIDISLWKFETSKYYVTIIDAPGHRDFIQNMITGTSQADC 
AVXi I VAAGVGEFEAG I S KNGQTREHALLAYTLGVKQLI VGVNKMDST 
EPPYSQKRYEEIVKEVSTYIKKIGYNPDTVAFVPISGWNGDNMLEPS 
A^PWFKGWKVTRKDGNASGTTLIiEALDCILPPTRPTDKPLGLPLQD 
VYKIGGIGTVPVGRVETGVLKPG^T^A^FGPVNVTTEVKSVEMHHEAIi 
GEALPGDNVGFNVKNVS VKDVRRGNVAGDSKNDPPMEAAGFPAQVI I 
LNHPGQI S AGYAPVLDCHTAH I ACKFAELKEKIDRRSGKKLEDGPKF 
LKSGDAAIVDMVPGKPMCVESFSDWQGHQVCPESSEG 

7790 

A 

1253 

2560 

SLVMLPLFDMVPGQAHVLLRASQDYPPFGSLCWFVIMRTDSCGWVSI 
KSSGQKRLLELIDAPGHRDFIKNMITGTSQADCAVLIVAAGVGEFEA 
G I S KNGQTREHAIiLAYTLGVKQL I VGVNKMD S TE PP YS QKR YEE I VK 
EVSTYIKKIGYNPDTVAFVPISGWNGDNMLEPSANMPWFKGWKVTRK 
DGNASGTTLLEAVDCILPPTRPTDKPLRLPLQDVYKIGGIGTVPVGR 
VETGVLKPGIWVTFAPVNVTTEVKS V^MHHEALS EALPGDNVGFNVK 
NVSVKDVRRGNVAGDSKNDPPME\ AAGFTAQVI I \ LNHPGQI SA\GY 
AP\VLDCHTAHI ACKFAELKEKIDRRSGKKLEDGP\ KFLKSGDA/ AP 
LVDtyVP\ GKPHVCLRS F S \ D YPPLGRFAVRDMRQTVAVGVI KS S / VD 
KKAAGAGKVTKSAQKAQKAK 

7791 

A 

39 

362 

AEKWARNMPFFPDLQMTDQVSLLRLTWSELFGLNAAQCSMPLNEA\P 
LLAAAGLQAY PMS ADRRVAFMDH I R I FQEQAEKFRALHVGS PE KQCL 
KAIVLFTSNTWGRG 

7792 

A 

19 

386 

FISSIKFIMSFNFTSLFHFFISLGQFISSHYISSK/ SHFISFHVISS 
VHLIFSSHHFMGSSQQLFSSDHFIISCHRFMSYLLRSVQFHFNISFH 
YFRSSHLITSCHHISSCNQNSSSSQHFIS 

7793 

A 

16 

451 

TQGSHQSPPPGS\PHAAHTCCPGRGGEAGGTGSLCLLGQGHTREPAA 
LKDWPGGRPEGHKVRSDPPGRPASSTVQLKGLSGERDGCPPLRRGPG 
PHRAEGS PSRALRPPGTRGNTATRTPVS SGMRAASCQAWRNEVRSLM 
VQAP 

7794 

A 

1 

427 

VSCAPGYVLKGQASIKC\njGHPSHWSDPPPICRAASLDGFYNSRSLD 
VAKAPAASSTLDAAHTAAAIFLPLVAMVLLVGGVYFYFSRRRENMKS 
PSRWGQSREVNSDLHHSPAARLLASCCPSTSCIYHLGGDAAJCPVLRP 

L 

7795 

A 

1 

330 

FFFFFFQKHHFNKETTEIKDCSLAGAQTPYNTLHVQSGFQPAPGRLP 
GALLPPVPCQIPGSLDYWLLSRNMPTPSPYPITQWVKRESLTSGLGI 
GRRGLLGVKDVCNSLN 

7796 

A 

3 

1229 

SPPSPL\PGLASQDSRPVFPSPTPAMAAVPTQPQSKEGPWSPESESP 
MLRITAPLPPGPSMAVPTLGPGEIASTTPPSRAWTPTQEGPGDMGRP 
WVAEWSQGAGIGIQGTITSSTASGDDEETTTTTTIITTTITTVQTP 
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GPCSWNFSGPEGSLDSPTDLSSPTDVGLDCFFYISVYPGYGVEIKVQ 
NISLREGETVTVEGLGGPDPLPLANQSFLLRGQVIRSPTHQAALRFQ 
SLPPPAGPGTFHFHYQAYLLSCHFPRRPAYGDVTVTSLHPGGSARFH 
CATG YQLKG ARHLTCLNATQP FL / WDLKEPVC I AACGG VI RNATTG \ 
RIVSPGFPGNYSNNItTCHWIiLEAPEGQRLHLHFEKVSIiAEDDDRLII 
RNGDNVEAPPVYDSYEVEYPPPPPPLQPHYHRVSV 

7797 

A 

110 

343 

FVLFPTKVYEGERAMTKDNNLLGKFELTGIPPAP*GVPQIEVTFDID 
ANGILNVSAVDKSTGKENKITITNDKGKEA 

7798 

A 

1 

335 

NFCTREDHQYAPLDPKHGHLPPRMPPSERLMAAEAAFYSPPYHDRPT 
NSEGWEHTGLYE S FRAKLRS PRRKG PE KRNSGPSRSRNRS KSRGRS S 
SRSNSRSS*SSGSYSRS 

7799 

A 

110 

409 

PVTYLAQAGYEVVGLDWTVAPKKARECVGKTVTLQGNIjDPCALYASEE 
EIGQLVKQMLDDFGPHRYIANLGHGLYPDMDPEHVGAFVDAVHKHSR 
LLRQN 

7800 

A 

40 

621 

HTDTYPHPHLIARPQGFPELKNDTFLRAAWGEETDYTPVWCMRQAGR 
YLPEFRETRAAQDFFSTCRSPEACCELTLQPLRRFPLDAAIIFSDIL 
WPQALGMEVTMVPGKGPSFPEPLREEQDLERLRDPEWASELGYVF 
QAITLTRQRLAGRVPLIGFAGAPVMWDRAGTRGAGRSLWKVWGRQKE 
GSVWLL 

7801 

A 

1 

338 

GTS FYLYRKGYLS LS KGVPMSHYAGTIiLLLLAGVACLiRG I VRWTNPH 
YRKTMTI P* ASHRNQSSKYPRHVANYNCDFRSWPDDFHCDEPITRNE 
SQGGPIRRSVHTAS PKPM 

7802 

A 

1 

390 

ILDCHTDDSGTYRAVCTNYKGEASDYATLDVTGGDYTTYASQRIDEE 
VPRSDFPELTRTEAYAVSSFKKTSEMEASSSVREVKSQMTETRESLS 
SYEHSASA* MKSAAIi * EKS LEE KSTTRKI KTTLAAR 

7803 

A 

3 

235 

VNSAEEAEEQANTNLSKFRKVQHELDE / AVALPHFDLL I QLVLHLAD 
EAEERADIAESQVNKLRAKSRDIGTGGLNEE 

7804 

A 

2 

561 

MRPHRLDEAEQIAIiKGFFKQLQKLEARGRELENELEAEQKRNAESVK 
GMRKSERTPSKELTYQTEEDRKNLAAACRTLVDKLQLKA\KAYKRQA 
EEAEEQS PTPNL\ SKFRKV\ QARAGLRAEERADIAESQVNKLRAK\ S 
RD IGTKGLEMKELAFATS LNLAP PLGRCPANAPCWS LCNS FLGREAE 

7805 

A 

3 

303 

GRRPPAL IDSRSPS PAS P PTQ * CRPPS FPGPAPPPRLWLAPPFCPP P 
PPHGYGPFPARKRRGAALGLTRVLPSPGSAVRRSPGPGPGSACDKVF 
RTTSR 

7806 

A 

190 

389 

RRTYTSHLLACLRQGLAFS PRLECGGKI RAHCS LQLYG S SDPPT * AP 
QTAGTKQHNQR I AQCNADN 

7807 

A 

3 

271 

FFLRQSHS VT\RLECTGAI * AHCNLCLPGSSDSPASAS * VAGITGMC 
HHARL I FVFLVETGFHHVGQAVLELL / NIjM I CP PQP PKVLGLQA 

7808 

A 

610 

923 

FYLSEEAEFFFFFFETES/HLLSPRLECSGMMSAHCNLRLPGSSDSP 
ASAS * VAGITGGHHPAQLIFVAF/MYQPGFCHVGQAGQQLLA\ * VIC 
LPQPPKVIiGITGA 

7809 

A 

487 

796 

LSIFFFFFFEGRGAIVANVSLNFPGSKKPPPSLSKEPGTTGAAPPRE 
IFFPPKKTRLGFALHPGSPPPPGPWAPIiWSKGEKVGILKKHKVGRGR 
GFSPVSPPF*KPPPSLSKEPGTTGAAPPREIFFPPKKTRIiGFALHPG 
SPPPPGPWAPLWSKGEKVGIIiKKHKVGRGRGFSPVSPPF 

7810 

A 

1 

340 

GTSQVLTIiTIAEV* *NIiG*VGCK*CKHTEAKRMPCAEDYLSVVHEQI 
CVAHENTPVSDRVPIRCPESLAIRLPCFSALEVDETYGD*EYNAEAF 
TXiHAD I CTLSEKERP I KKQ 

7811 

A 

3 

337 

IESSLHQVESMHGAGNAKKNWQ/ RIQEHFFFATFHPLKDYCL/ EGTN 
LVEADNQAEWTDVQKKI IPWNSRVSDL\DL\ELLFQDRAARLGK\ S I 
SR\LIRCGPSLHSDKPTNFREG 

7812 

A 

3 

927 

YGEENLKTSICTFLAVLSHLDIITQNIPEKKLILKQALIVVLQWCFN 
HNFSVRLYALVALKKLWTVCKVLSVEEFDALTPVIESSIjHQVESMHG 
AGNAKKNWQRIQEHFFFATFHPLKDYCLETIFYILPRLSGLIEDEWI 
TIDKFTRFTDVPLAAGFQWYLSQTQLSKLKPGDWSCX5DIGTNLVEAD 
NQAEWT\ DVQKKI I PWNSRVSDLDLELLFQDRAARlA GKS I SRLI W 
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AS L I DKPTNIiGGLCRTCEVFGAS VL WG SLQC I SDKQFQHLS VS AEQ 
WLPLVEVKPPQLIDYLQQKKTEGYTI I 

7 813 

A 

3 

362 

ATCSSDQSVKVWBKSESGDWHCTASWKTHSGSVWRVTWAHPBFGQVL 
AFCTFDRSSAVWDEILSESRDNIj*GHTHWVKRTTLEDSIVSATCV*S 
FLSHVGIMLLSCGTFFPACI CGVFP 

7814 

A 

9 

307 

SHLLRSLRQENRLNPRGGGCNQPRSGHCTPAWATRAKLHLKKQTNKM 
LANLIQQHIKKVIHHYQVGFTPGMQG*FNICKSVSVIYHINRTTHKN 
VAAAA 

7815 

A 

3 

219 

AASTMPMSFEWQRQ*RFPPFYTLQPNVNTRQKQLTA*CSIiVLSLCRL 
HKQSSMTLIKTRLTSDSITVMIiDC 

7816 

A 

15 

335 

QHFWERFHVTKDRINTVERQETDWKIFATYLMKXV*ISRKCYNSYKL 
VRKKQLNRKWAKEKNKQSQKKKQSGQ * TKTVQPQ * NKRN AN * R YYLS 
PISLAKSNKLNY 

7817 

A 

1 

331 

GLIEILVIYVSDTPVK*DV*SDIFEQ*AYGKIDGGNDVCDLQSSEGT 
NTKMKNNEEMM I GEAMDETGHDGETENEG I STKTS KPDEAETNML I A 
EMDTFVCDTVEMSTEG 

7818 

C 

51 

200 

MSTPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

xs* 

7819 

A 

3 

219 

MLAGCSRTPELR*STHCSLPKCWDYRRAHNAQPFLTliN*RSETRRSL 
* F * IMWLPLEFLGFPLESQQGCTA 

7820 

A 

1 ! 

358 

VHCLVPDLLQSNNNPCYWGVMDKYAAEALIjEGKPEGTFIiIjRJDSAQED 
YL FS VS FRR YSRS LHAR I EQRNHNFS FD AHD P * VFHS PD I TGLLEH Y 
KDPSA*MIFEPLLSTPLIRTFPFCL 

7821 

A 

260 

482 

LILKFV*NHKIPQIAKTILSKENKAGGITLPDAKISCKAVVTKTALY 
WHTNRHIEQWNRIANTEINPNICNQLI 

7822 

A 

1 

306 

VKAKAKKLQIQPIQLELLYCSYDMQEEDFAHRAMDYFPKIENNLSTR 
^HMVSSFCIENWHQLESLSLLILHNMPHYEEDEEFEGLRIiDMVQ*R 
LTSCSDAG 

7823 

A 

3 

396 

SSLSLSLSSSVSVTFGFGTPFSAVSSSIPDGGLALAFSSSGLSTSFS 
SP*SHSSSSSPPVPPLSSDTRRSTSPADMAAPSAT*TRRSTAPSNPF 
PSLAAPRREKGGNPPTNERRKLETPIPGRRRQRRQR 

7824 

A 

3 

346 

HEHECYAKVLYELKPVGEEPQNVS KQNCDLFEQLGQYEF *NAIiliVRY 
TKNVPQGTTPALGQVLRI^RKVSSKCCKHPEAKTMPCABDYLSVILN 
QLCALHDRSAVSDRVSRCWT 

7825 

A 

1 

413 

RAGYRQKWGSLATVIGQLGLPSGEGLVWPQRRREGHRN^CSELVLD 
DLHVITQNRANHRPRCGGSLLSELKFAPLALQPGRQSKTLSQKKNKQ 
TTTKKNTKKQPGSVAHACNPSTLGG*GRQITLPSGVRDQPGQH 

7826 

A 

37 

363 

ASDWNQDRARGRGQGGCETRDSSEQSQGTNGQPPAEASKDSWSSGGK 
MAR*APSPQRQKAGYPGSPSPQPAPASMRSLGPPQFQEPPQALFPLE 
VPIPSPHTRNKMARA 

7827 

A 

3 

356 

HERAPLLNNSCFL* AFDPYQCDRYLFKI VTDVYYTP YVXi I YYI * APL 
PLLLQLTHTHFLSLFFI**VIiNIYALVYLSTALPLSIiFISISSPRYSL 
CLFFHFIiSLMLFLSSLSMYCSHY 

7828 

A 

2 

269 

ARVAPLKGVQEQEVRHEGEVTKPEKEKGEETRIENGKLIWSDSCGR 
VYVSGKIYYPM*AYNDGLILERYQIMPIDIIVSYVFIMICLD 

7829 

A 

2 

351 

KDIRVGHRQNRALPPQASPPTVAPGQPTALPPTGHSGPTMGWPSRLS 
QLSLITAPTSASPTVVKLPSGACWPLSSTFASSPLSLVQ*P*NSDSD 
HATSAMAPSSESRECLSPWPAR 

7830 

A 

2 

377 

SHCLEPKDIRVGHRQNRALPPQASPPTVAPGQPTALPPTGHSGPTMG 
WPSRIiYQLSLITAPTSASPTWKLPSGACWPLSSTFASSPLSLVQ*P 
♦NSDSDHATSAMAPS3ESRECLSPWPARPRA 

7831 

A 

176 

379 

SVTDHPIYNCNLSPSPNPGPSDPSDPVIiPFVVMTDYLLP*CVICCCF 
LSDPPHCSPHPVGGSFFGIN 

j 7832 

A 

1 

344 

NSGRVGSGNL I S PLPPGP PGPRPLEEATPV1»LSQG I PEREDS FRNLR 
TDL YIHQLKS * DS PE I S S LCQGEEATPRHSDKHNS * NAAS EGKGCGF 
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PS.PTMEVEIASHDEEDVIHS 

7833 

A 

355 

535 

PNRPITGEVTTKin J TKKSLRPDGFTDEFC*TFKEGFIPTLLKTVKEQ 
LHPTFLYEASITL 

/ O J "* 

A 

61 

250 

RSRLCAPVRESHTYLIMPGLCC*FFLYYIFFGFFFSWFCLLSFYSP 
FFSFVSSVSFV*FWVF 

7835 

A 

3 

66 

DAWBKKICETEDWLFEIIHQRISSSRMKRNKESLQDV*NTIKRTNIR 
IIGVIKEKREKGRKLI*EIMAENSPNLGKDMSIQVYEAQRSPYRIHS 
KKTSPKHMIIKLSKAKSRR*IRG 

7836 

A 

3 

146 

HlWAGAVAGILEHCV^PIDC^TRMQSLQPDPAARYRimiEALWRI 
I RTEGLWRPMRGLNVS AS CAGS AYALYFACSD * LTQLP AI AMCWRPS 
GGL 

7837 


2 

329 

VASGRRKNLLSKVTGLEVPEGSTPDRPCSGRQSLNPPVTSFPNPSQR 
NRTPPPTTRQMATL * EALHG SHLPPPGRAERCTS S PG PRTTRPQVSD 
RRAQSPTSYVGPHRP 

7838 

A 

2 

357 

EAGAHT* ITMNKQRLLKETVDDTVEVSDFTVSRSNAHEAFNTAFTTIj 
AACAAS IG* VLL YL YLTTCPC * CKTKRQNNMLHQSNANSS I LRPGPA 
SDASADERNACARKRAVLLEPLKD 

7839 

A 

3 

266 

ECWYTRHMPPHPAQCFFYNYAWNIiYFFPYRYLHLLDGNENYPCLVDA 
DGDVISFPPITNSEKTKVGWHVCEHGVSSRLFPVELTQVC 

7840 

A 

42 

226 

LFFSEMFFSFK\nOCTTSDLFLKVTSATSLQICKDVMDAIjILVSVGWL 
LLRNAFEREWCSAP 

7841 

A 

1 

507 

LQLEAEEQRKQKKRQS VSGLHRYLHLLDGNENYPCLVDADGDVI S FP 
PITNSEKTKVKKTTSDLFLEVTSATSLQICKDVl^ALILKMAEMKKY 
TLENKEEGSLSDTEADAVSGQLPDPTTNPSAGKDGPSLI.WEQVRW 
\DIiEGSLKWYPSK\ADL\ATAPPHVTWR 

7842 

A 

51 

342 

P ILVYRPDVD P * VAKDYAS FRNYT * GPLLDRVFTTY * LMHTHQTVD F 
I RSRHAQFGGFS YGKMTVMEVEDLLDGLMDESDPDLNFLKS FHA* KI 
AY 

7843 

A 

44 

1353 

KMLYKNFIMQSQELTALWEAKVHPAMLGLDELGRSGCGHCTQADLRF 
GDAAGRDPGQDNDRNTAEPAFPPPPRVMAAAAALRAPAQSSVTFEDV 
AVNFS LEEWS LLNEAQG CL YHDVKLETLTL I S S LGKRQTIiHTRERP Y 
ECIECGKAFAEKSSLINHRKVHSGAKRYECNECGKSFAYTSSLIKHR 
R IHTGERPYECS ECGRS FAENS S L I KHLRVHTGERP YECVECGKS FR 
RSS SLLQHQRVHTRERP YE CSECGKS FS LRSNL IHHQRVHTGERHEC 
GQCGKS FSRKS S L I IHLRVHTGERPYECSDCGKS FAENS SLI KHLRV 
HT(3ERPYECIDCGKSFRHSSSFRRHQRVHTGMRPYK*SKFWKFSCPG 
FLLLQGQRVHTGSRCYECDKWGIFFS*NASFFT*KSAPTEEVPFECN 
ECEKAFSPLSLVT 

7844 

A 

2 

354 

NLLYQFWIHTEVINNIjGPLELILNTRSHHRVHDGRNRYCIDKNYAGV 
LIIWDK5FGTFEAENEKVVYGLTHPINTFEPIKVQFHHLFSIWTTFW 
ATPGFFNKFSVI FKGP* WGPGKP 

7845 

A 

3 

388 

TVRVCACDHHGNMQS CHAEAL I HPAGLS TGALVAI LLC I VI LLETW 
LFAALRRQRKKEPLIISKEDIIDNIVRYNDEGGGEEDTQAIDIRTLM 
NPVP I * DTHLRRCFAP * TSLYYHRATPTRS CYHP 

7846 

A 

151 

352 

YCCTYYIFVLFIP*SSCGLTLIFITCIILFGSISFFLFFTIVFSIII 
VTTFKFRLLYSI IFLSYLLC 

7 847 

A 

2 

265 

VKPS P * PLTGALS ALLMTSGLAM * FHFHS ITLL ILGLLTNTPPVQKG 
LRYGIILFITSE/VFFFAGFF*AFYTPA*AYPQLGGTAQQDT 

7848 

A 

1 

156 

GIRHEAYHIVKPSP*PLTGALSALLMTSGLAM*FHFHLKKKKKIGSS 
FIAVG 

7849 

C 

512 

778 

MTSGLSHVDFTSNS ITLPHTRAYXTNTLNHIPMMARMLTRRKAHTKG 
HHTRTCS KKAFRYGDKS YLFTSG S FF S S QGFI LEP FLPLQA* 

7850 

A 

15 

613 

LKTTHLLFQVSVGQESRCGLAKSSGSKSLTRVKVRGQPRLWCHLKAQ 
LGKDLLLSSLTWLLAACGSSPWGLWASAPGLLLASHHPRFSATWVF 
HKVAHNTAAGFMSTGKQEG * RDKI TAKQKFAS YNLI SEVTTHHFLP Y 
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S I VEKOYTRS S PHTKERDS PRG * LSEGKDHWE PFQICLPATDVIALFY 
ENKHQTHY I HL 

7851 

A 

66 

357 . 

KSMTDGHIQTVTNIRQLPARTLYTHWPREFSP*SPPLERMHTRGARR 
EARQKAPRIRDKRRGKSRVHCLGEELAKGGGWDRIRWGKGERQGDSG 
RV 

7852 

A 

3 

360 

HASAHASGQGDDQLERDHEN*DRFVRQWGINNYRISEESGHPYNIP 
GKIFHDCADLDTSEH*LCKCDSFDKTLKPTINLVSYNRNFA*TNIDE 
NFRCGKSPSYSSCYSKHEELHNGM 

7853 

A 

79 

415 

LSKGTGRKRDLGTLPAHPKAKTDPAAAPQGQPGVGVRSSPPCPG/QR 
EQQGSAGSPGWMGGGCPCPVS*PLKGQNQPSPSSLGGSKGSFFSPPD 
PAVARGRRVRGEERGQVRE 

7854 

A 

171 

173 

LTAEACLS FPS PLPSSLGVSGCPHHGGLGPAHPS ALPQPPGCI FEAA 
WPPRALT*PLSSPLTPPAPGHCRIRRGEEGASAAFPGCLGTGLVLSF 
EDDHRLQD*GQAHTQSQPGASPQTLCCFRGLGHAGELLTPTPTVQPG 
LFPESASKWPQLFGPRILPGPPKGSPTTKAWPPGVCGQQLPLVKGHP 
G*PSPASGRSRCEVSVPSCPWLPLGWGCLAPAGCRSQHWRD 

7855 

A 

27 

263 

FHCF I SNF * VANRS SLLL IDFS VGLLKCKS FI FFNNAI LHYCLMMVC 
EYDSLFLSYMFLLIFFYFYYIFIYLLFIHPY 

7856 

A 

92 

355 

VLNS I * SGNQGKS YANVYRLLYLDPI PKI YAEAYTP*NVNSTNLETK 
SPKTIQKFPEDREFKNDFFEKTKNGDRETGAFHPFLLFLV 

7857 

A 

464 

2848 

G E TGM S TALTHTTVAMR C PMLGGGGG P TYG P PQ P WGH PD VH I MQHHV 
LP I QARLGS I AE IDIX3VPPPVMKTFKEFLLSLDDS VDETEAVKRYND 
YKLDFRRQQMQDFFLAHKDEEWFRSKYHPDEVGKRRQEARGALQNRL 
* RVFLSLMETGWFDNLLLDIDKADAI VKMLDAAVI KMEGGTENDLRIL 
EQEBEEEQAGKPGEPSKKEEGRAGAGLGDGERKTNDKDEKKEDGKQA 
ENDSSNDDKTKKBEGDGDKEEKKEDSEKEAKKSSKKRNRKHSGDDSF 
DEGSVSESESESESGQAEEEKEEAEALKEKEKPKEEEWEKPKDAAGL 
ECKPRPLHKTCSLFMRNIAPNISRAEIISLCKRYPGFMRVALSEPQP 
ERRFFRRGWVTFDRS VNI KE ICWNLQNIRL / RECELS PGVNWDLTRR 
VRNINGITQHKQIVRNDIKLAAKIjIHTLDDRTQLWASEPGTPPLPTS 
LPSQNPILKNITDYLIEEVSAEEEELLGSSGGAPPEEPPKEGNPAEI 
NVERDEKLIKVLDKLLLYIiRIVHSLDYYNTCEYPNEDEMPNRCGIIH 
VRGPMPPNRISHGEVLEWQKTFEEKLTPLLSVRESLSEEEAQKMGRK 
DPEQEVEKFVTSNTQELGKDKWLCPLSGKKFKGPEFARKHIFNKHAE 
KIEEVRKEVAFFNNFLTDAKRPALPE I KPAQPPGPAQ I LPPGLTPGL 
PYPHQTPQGLMPYGQPRPPILGYGAGAVRPAVPTGGPPYPHAPYGAG 
RGNYDAFRGQGGYPGKPRNRMVRGDPRAIVEYRDLDAPDDVDFF 

7858 

A 

1073 

3541 

GETGMSTALTHTTVAMRCPMLGGGGGPTYGPPQPWGHPDVHIMQHHV 

LP IQARLGS I AEIDLGVPPPVMKTFKEFLLSLDDSVDETEAVKRYND 1 

YKLDFRRQQMQDFFLAHKDEEWFRSKYHPDEVGKRRQEARGAL\QNR 

LRVFLSLMBTGWFDNLLLDIDKADAIVKMLDAAVIKMEGGTENDLRI 

LEQEEEEEQAGKPGEPSKKEEGRAGAGLGDGERKTNDKDEKKEDGKQ 

AENDSSNDDKTKKSEGDGDKEEKKEDSEKEAKKSSKICRNRKHSGDDS 

FDEGSVSESESESESGQAEEEKEEAEEALKEKEKPKEEEWEKPKDAA 

GLECKPRPLHKTCSLFMRNIAPNISRAEIISLCKRYPGFMRVALSEP 

QPERRFFRRGWVTFDRSVNI KE I CWNLQNIRGARSLINRWCSRGKGL 

RAAVSLASGLRECELSPGVNRDLTRRVRNINGITQHKQIVRNDIKLA 

AKLIHTLDDRTQLWASEPGTPPLPTSLPSQNPILKNITDYLIEEVSA 

EEEELLGSSGGAPPEEPPKEGNPAEINVERDEKLIKVLDKLLLYLRI 

VHSLDYYNTCEYPNEDEMPNRCGIIHVRGPMPPNRISHGEVLEWQKT 

FEEKLTPLLSVRESLSEEEAQKMGRKDPEQEVEKFVTSNTQELGKDK 

WLCPLSGKKFKGPEFVRKHI FNKHAEKIEEVKKEVAFFNNFLTDAKR 

PALPE I KPAQPPGPAQ 1 1 HVPVRWLPPGLTPGLPYPHQTPQGLMPY 

GQPRPPILGYGVPTGGPPYPHAPYGAGRGNYDAFRGQGGYPGKPRNR 

MVRGDPRAIVEYRDLDAPDDVDFF 

7859 

B 

145 

226 

MAVNCSEMRLKSSWNGCAIANEGDCHLEATSICSSTSFMIDMLGEFRH 


JL 01 0 S «+ 6. «4 3 ... O H JJ & O S 






CESSILLASTAX* 

7B60 

A 

136 

1559 

I LTMRE I VH I QAGQCGNQ I GAKFWEVI SD EHG IDPTGTYHGDSDLQL 
DR I S VYYNEATGGKYVPRAILVDLEPGTMDS VRSGPFGQIFRPDNFV 
PGQSGAGNNWAKGHYTEGAELVDSVLDWRKEAESCDCLQGFQLTHS 
LGGGTG/SPGMGTL\LISKIREEYPDRIMNTFSWP\SPKV\SDTW 
EP YN \ ATLS V\HQLVENT\ DETYC IDNE\ AL YD I CFRTLKLTTPT\ Y 
GDLNHLVS \ ATM SG V \ TT \ CLR FPG \ Q \ LNADLRKL VS / VNMVPFPR 
Ii\HFFMPGFAPLHQPVESQQYRGSHKCRELTQQ/VSFDAKEH*WAAC 
GPPATGRYL\TVAA\AFRGRMSMKEVDEQMLN\VQNKN*QLTLWEWF 
PQQCSRTACLFTSQPRGPSRWAVHLSLGNRHSHSRELFQSGIFGSSF 
TW PC FRRKAF/ L FHW/ YTQGEGHGTEMG VS TEG * EATLNDLRL * S I K 
QLPRMPTQEEEEDFGEEAEEEA 

7861 

A 

2 

3 04 

ARGLCPLGYTGLKCETDIDECSPLPCLNNGVCKDLVGEFICECPSGY 
TGQRCEENINECSSSPWLNKGICVDGVVCYCCTFV*GFCIFKFSCSS 
CPLYTI 

7862 

A 

1 

345 

GTSGTSRFL*DVLQMNDFGTRWRFYATVIYQKPQLKSLLILE*RETW 
VLGTDGTLQTKEEKDPRLRKTLLVYVAPLLVLGSDVLEALSGAAPLR 
LFLKDAIRDQARIGRAERTVL 

7863 

A 

2 

343 

VRSAYPRRLENSLAGQRRHHLCTPLLPPAGGARAPQPQPQPQLLGAR 
PGPGRCRMRRRPQAPRAGRGAALARAL*PLSSTASRRGACPGAAPSD 
SPEDPASDHGHSPRLGGRC 

7864 

A 

1 

338 

QIDEIVTETLLTEAERGAKVYLTTGYFNLTQAYMDLVLGTRAEYQIL 
LASPEVNGFFGAKGVAGAIPAAYVHIERQFFNDECSL*QSRAVQLSE 
YWRCSGRSLSMEFVRHDI 

7865 

A 

174 

363 

WQFTFTFSKSIKKDSKEEIYCQLPRDTKIEDFGTGPRSRYPLVALLT 
LSDEDDREIMYYFHG 

7866 

A 

1 

1260 

VRCRPVRNSRVDPRVRMAAVFLVTLYEYSPLFYIAVVFTCFIVTTGL 
VLGWFGWDVPVI LRNSEETQFSTRVFKKQMRQVKNPFGLE I TNPS S A 
SITTGITLTTDCLEDSLLTCYWGCSVQKLYEALQKHVYCFRISTPQA 
LEDALYSEYLYQEQY\FILSSDIPISTEN*TSYG*SVANLLIILGIL 
EACLLLQNKHS PCIRTCSVYRI S VSGTGFACFRLI YPLALI IRLLMD 
KVPLTY*SFLGY*KHVYCFRISTPQALEDALWSEYTjYQEQHFIKKDS 
KEEIYCQLPRDTNIDDFGTVPRSRYPLVALLTLSDEDDREIYDIISM 
VSVIHIPDRTYKLSCRILYQYLLLAQGQFHDLKQLFMSANNNFTPSN 
NSSSEEKNTDRSLLEKVGLSESEVEPSEENSKDCVVCQNGTVNW 

7867 

A 

106 

387 

RWACS AEQKAEPAGS SRTGGMGS AC I KVTKYFLFLFNL I FF I LGAVI 
LGFG VW I LADKS S F I S VLQTS S S S LRMG AYVF I G VGAVTMLMGFLG W 

7868 

A 

2 

767 

AVEFGLCS PTPTEARAGDAGPAARSRKQNPQS PPCCCVDDTWAQAEV 
GPVTSCTGFVEGSSRTGGMGSACIKVTKYFLFLFNXiIFFILGAVILG 
FG VW I LADKS S F I S VLQTS S SS LRMG AYVF I G VGAVTMLMG FLGC I G 
AVNEVRCLLGLYFAFLLLILIAQVTAGALFYFNMGKLKQEMGGIVTE 
L I RD YN S S RED S LQDAWD YVQAQVKCCG WVS F YNWTDN AELMNRPEV 
TYPCSCEVKGEEDNSLSVRK 

7869 

A 

149 

421 

CGTDKTFFWSCTVRENLNSYSEELLSRRQICKNKNDPPGDRD*FDYS 
RSDYEHSRRGRSYDSSMESRNRDREKRRERERDTDRKRSRKSP 

7870 

A 

1 

445 

GQDSRSRDNGPDGMEPEGVIESNWNEIVDSFDDMNLSESLLRGIYAY 
GFEKPSAIQQRAILPCIKGY/DCDLYETLTITQAVIFINTRRKVDWL 
TE KMHARDFTVS AMHGDMDQKERDVI MREFRS GS S R VL I TTDLLARG 
IDVQQVSL 

7871 

A 

2 

363 

ELDTLCDLYEP * PSPS 1 1 FINTRRK/ VDWLTEKMHARDFTVSAMHGD 
MDQKERDVIMREFRSGSSRVLITTDL\RIGRGGRFGRKGVAINMVTE 
'EDKRTLRDIETFYNTSIEEMPLNVADLI 

7872 

A 

3 

449 

NLQE WKLDTLCDLYETLTI TQAVI F INTRRKVDWLTEKMHARDFTVS 
AMVCLPAASLLWVCPSEVSYLKPGFLEPRCLPGLLHICFLFQHGDMD 
QKERDVIMREFRSGSSRVLITTDLLVSRGN**QRQKGGSKVIPSPRG 
HQCLSGK 
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7873 

A 

236 

534 

SYAIYWLFFSSLAPKYIKMFVIiDEADEMIiSRGFKDQiyDIFQKLNSN 
TQVVLLSATMPSDVLEVTKKFMRDPIRILVKKEEVTLEGIRQFYINV 
EREEW 

7874 

A 

1 

366 

AEIRDRSPCGELHVEREEWKLDTLCDLYETLTITQAVIFINTRRKVD 
WbTEKMHARDFTVSAMHGDMDQKERDVIMREFRSGSSRVLITTDLLV 
SRGN* *QRQKGGSKVIPSPRGHQCLSGK 

7875 

A 

5 

1285 

FLRIMSASQDSRSRDNGPDGMEPEGVIESNWNEIVDSFDDMNLSESL 
LRGIYAYGFEKPSAIQQRAILPCIKGYDVIAQAQSGTGKTATFAISI 
LQQ I ELDLKATQALVLAPTRELAQQ I QKVVMALGDYMGASCHAC I GG 
TNVRAEVQKLQMEAPHI I VGTPGRVFDML YR\ RYLAP \ KS IRMLVLN 
EADEMLSRGFKDQIYDIFQK\LNSNTQ\WLLSATMPFD\VLEVTKK 
FMRGP\IRILVKKE\ELTLEG\IRQFLHQPWNEEEWKL\DT*CD\LY 
ET\ LTI T\ QAV\ I FINTRRK/ VWDWPHPRKMHA\RDFTVSRHAMGDM 
GPKGNETVIMEGSFRSG\S*QSF*LPT*PCWARGIDVPARFSLVI\N 
YDLPTNR\ENYIHR\IGSTVDRFGRKGVAINMVTEEDK\RALRTLGA 
SYNTS IEEMPLNV\ ADLI 

7876 

A 

3 

380 

MDTTPLS YSES * SDEYLVTNM* LQDAFCRGLVKPGLNVLLKGPINAV 
IDVNGLKQCLA* FKRYLECL*RLDVTLCPVP* IGGSDAPLPHNKDHK 
AAD P * DYFQR * MS F YRQAHAAMLAVLPRLHH 

7877 

A 

2 

398 

RESESCAAADTPALETLSFHGDWEIIEVFNRDLGSTYPDDLAL*MED 
VDFEE E YEE * G YD * GWVL YPQEGVI GNMEGPDYS * VTFALHS S S VYR 
VS LD P KTYTLAVTGG YDDG AFVWLLT YG * LLFE * ASHE 

7878 

A 

3 

401 

NSQVLLDSPIQLSKITENYECITELSGLLKREQSSTVNASNLENDEA 
RLKAYLENRSEIDSLKTSNPIDL*EASISQAGSIN*SRF*SSGMNKS 
YDVRIDNFNGSIGDRILLTAANVILAWCRIiYGLMRQNV 

7879 

A 

1 

424 

PVLRVARERSLPGPQEGPHGRASESASPIiAPSIAPSPPGPPALSASH 
IMGSQALRSPETPW*SPWDPPAPLDLVSPWQFPPGPHASATP*RSPL 
GLQTPQAPGSWAPHVPATPQASGLPAPHVTETP*ASS*LAMPVPETP 

7880 

A 

156 

312 

LFSLACRYHVYPTCRNTSGILGFTVSWTNAIPFIS*LSSFYYIYFL 
YNFG 

7881 

A 

15 

396 

FGCLLQGRRPLKSHSPWQGPGASGRPAQGKHKNR*RGQNPSNLSHPA 
PTPPWPGIKCPLPRAGPSLSSLLPGRVLCARPEFPLHGRGIiEPSASL 
LPSALAPSPLRGLKGTERSRLKADQAWGGWKE 

7882 

A 

2 

102 

NKILI IKTNSRKK*KIE* * RDNNQSNNNKNHNK 

7883 

A 

3 

399 

NFLYTFLYFPNFV*SAHCF*NQKKKNKYFLLREKCLLNCFPHIiYVAF 
YVYIPSIFNLSGC*YLPSCSIFYVFIPLSHPLLFFFLCFSLSFSLPF 
FLSLFCFPFLAFLFHPSFILLLDSSACLLCISCFFHL 

7884 

A 

190 

341 

DGKGLLQVCLPLQDVYKIGGIGTVPVGRVETGVLKPG\MVVNFA\PV 
NVTT 

7885 

A 

377 

1073 

LQGHLRLTGAVIiIVAAGVGEFESWSPRMGRPESMPFIiAYTLGVKRIi 
\IVGVNKM\DSS*PPYS\QKRYEEIVKEGSTYIKKIGYNPDTVAFVP 
I SGWNGDNM\ LEPSANMPWFKGWKVTRKDG\NASGTTLLEAL\DC\ I 
LPPT\RPT*QSPCGLPSSRDV\YKIGG/lLGTVPVGPSGRIiGVLKPG 
MGGSPLAPVKT^rXTEVKICSEMXHHGRF^VEALSWGDNVGLSMFQGL 
LFCSK 

7886 

A 

2 

416 

SVGDPCPSTASVDQCTIVP*PMEGPLLI*YRRCIEGVGGIFLWDPQPS 
SPQHPGATPSTGEPGFPWHQDLSHIQQPAAWAALNNHKPPVRS*P 
PSATPEAFT*FSKCDDLDEELSLLRYCHAFSEC*HDIRPSHIHP 

7887 

A 

1 

360 

TERFGM*GGEAVCLYEEPVSELLRRCGNCTRESWWSFYIiSADHELL 
S PTNYHFLS S PKE AGGLCKAQ ITAII FQQGD I FVFDLETS AVRPFVW 
WQVEAIPVDYLPCSFVMI IPTLYCSP 

7888 

A 

152 

379 

WEGGGSGTILARRFWDLQAYRPHAQVPGFLR*SHLMAVGHAEATDLT 
ALTML I LTP EAWQLHT I L YLS GLLAMLP 

7889 

A 

302 

565 

YWLFYDFMIAJEKILNGYFCCCC*DRSCPVAQAGELWHERGSLQPRPP 
GVKQCSYFSLLSSWDRRHVPPHLANF*TFFCRDRVSPVLP 
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7890 

A 

1282 

4603 

GSBPCLEHVSLLiNNPLS 1 1 PDYRTKVLAQFGERASEVCLDDTVTTEK 
EI^TVEVLKAIQKAKEVKSKLSNPEKKGGEDSRLSAAPCIRPSSSPP 
TVAPAS AS L\ PQP I \ LSNQG I MFVQEEALAS S LS STDS LTPEHQP I A 
QGCSDSLES I PAGQAASDDLRDVPGAVGGASPEHAEPEVQVVPGSGQ 
1 1 FLP FTC I G YT ATNQD F I QRL S TL I RQ A I ERQLPAW I EAANQREEG 
QGEQGEEEDEEEEEEEDVAENRYFEMGPPDVEEEEGGGQGEEEEEEE 
EDEEAEEERLALEWALGADEDFLLEH I R I LKVLWCFLIHVQGS I RQF 
AACLVLTDFGIAVFEIPHQESRGSSQHILSSLRFVFCFPHGDLTEFG 
FLMPELCLVLKVRHSENTLFIISDAANLHEFHTVDLRSCFAPQHMAML 
CSPILYGSHTSLQEFLRQLLTFYKVAGGCQERSQGCFPVYLVYSDKR 
MVQTAAGD YS GN I EWAS CTLC S AVRRS CCAPS EAVKSAAI P YWLLLT 
PQHLNV I KAD FNPM PNRGTHNCRNRNS FKL S RV PLS TVXiLD PTRS CT 
QPRGAFADGHVLELLVGYRFVTAIFVLPHEKFHFLRVYNQLRASIiQD 
LKTWIAKTPGTGGSPQGSFADGQPAERRASNDQRPQEVPAEALAPA 
PVEVPAPAPAAASASGPAKTPAPAEASTSALVPEETPVEAPAPPPAE 
APAQYPSEHLIQATSEENQIPSHLPACPSLRHVASLRGSAIIELFHS 
S I AEVENEELRHLMWS S WF YQTPGLEVTACVIiLSTKAWFVIiHDGL 
RRYFSEPLQDFWHQKNTDYNNSPFHISQCFVLKLSDLQSVNVGLFDQ 
HFRLTGSTPMQVVTCLTRDSYLTHCFLQHLMVVL.SSLERTPSPEPVD 
KDFYSEFGNKTTGKMENYELIHSSRVKFTYPSEEEIGDLTFTVAQKM 
AEPEKAPALSILLYVQAFQVGMPPPGCCRGPLRPKTLLLTSSEIFLL 
DEDCVHYPLPEFAKEPPQRDRYRLDDGRRVRDLDRVLMGYQTYPQAL 
TLVFDDVQGHDLMGSVTLDHFGEVPGGPARASQGREVQWQVFVPSAE 
SREKLI SLLARQWEALCGRELPVELTG 

7891 

A 

1 

286 

S PPTWPAS ASLPQP I LSNQG ILGDE * VI LVS * ACLLARPARKRGGC 
AVPSGAEARDTQERTPP7VRAPGDAELGVRSWENSSTRCFKSQKNIPT 
P 

7892 

A 

3 

419 

PPPLWFNQDPHVESKGLAAGRPVPSTSQGRTQFRGSAPPPKDPLPRT 
GPS ANRRVPHTFQPD I S FPGS PKPPS * DRPHPNPSLRS PLVSHS VHN 
QAGRWAKVQATARERKSDTKSIWLWRWRRILKRTRSNSLYTLHG 

7893 

A 

2 

236 

KKSNLRMLAEHGGSHL * SQLCRRLRQENHL I SGGGGCS ESTSHHCTP 
AWGTE*DFVSEKKKKKLTGWPDPSGRIKWWC 

7894 

A 

3 

254 

LECNGTISAHCNLRIjMVSG*FSRIjSIjPSS*DYRRPRLRPANFCIIjVE 
MGFHHVGQAGLKLQTSGDLPLSLPKCWDYRRAHPLA 

7895 

A 

1 

185 

RQSLALSPRLECSGAIIAHCSLEPLGLRQLCCLSLPKC*DYRHEPLS 
LACLFLFYPPEHLT 

7896 

A 

1 

182 

KLVGLEPVSTYSIRVSAFTIVGNGNQFSNWKFTTQESG*IQFLSLK 
CFFLYLTPFFSFS 

7897 

A 

4 

329 

AGEGGLQALILRDGAWPQPPSGD*QVKGLEPLTPAPAGSSQGPGPGP 
PIASSPVGQSKTDTATTTKVGPVGGGQPAPMHQQLPPSPGDPAAEDA 
HTWGERSLSGMKGK 

7898 

A 

115 | 

408 

HWKERGSLYPA*TQGSACVLPGQGHLPRPQATPPTTAHLVPPQHCRR 
PGPAAPAP*QGMSTPTTATLFSRLQGRLQLPMGAPSGQGPWIiYCRGE 
GPRA 

7899 

C 

128 

253 

MNSPDWELWQDNS PQDS CVLI PRTCDHAASRGRRDS SGVIE * 

7900 

A 

856 

1071 

SFALVAQAGVQWHDLGSLQPLPPRFK*FSHLSLPSSWDFRCPEPYWL 
I FVFLVETGFHHVDQAGLELLTSGD 

7901 

A 

87 

395 

RSGGGVPPRNPPPLGGRGGGFPQGPGFQITLSPKGKPLFFLKSQNFP 
G*WAPPLKPPFKGG*TKKIFLTLEGGVPINGNFPPPPPPGGKNFFSK 
KKKKKKKK 

7902 

A 

28 

403 

GGT I YGNCWP ATVPAG S PGVPAVGGQPG PLPVL * RTS PETCPPLGV 
TTGSLCLAPYHSLS FGGHGLPGTPREAG *REGGNSLQLQLPNPKPFL 
PSRPEDPTWSPSSQGEELRRRKVRCDLKSLT 

7903 

A 

50 

543 

GLFQKPGGTHVLPPAPKAVQASTDPPLKGGSPLLQGPWRRGRVGCRG 
AQKGGPGDRGSSLLFKPGDQGGPSLTSGQHSKSEAGVWQGLSWSPLG 
CGFLPEPHQRPLEPREGPRRAGVTPGLEHSSQSPEAPGDRNK*QSET 
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TREKE S TVS TKS PTLPKHPGQLG 

7904 

A 

1 

610 

SHKTNVQQCTCYCGGPGDWYLKMLQCCKCKQWFHEACVQCLQKPMLF 
GDRFYTFICSVCSSGPDY\LKRLP\LQWVDIAHLCLY\NLSVIHKKK 
YFDSELELMTY\INENWDRLHPGELADTPKSERYEHVLE\ALNDYKT 
MFMSGKEIKKK\ KHLFGLR\ IRVPPVPPNVAFKAEKEPEGTSHEFKL 
KGRKAS KP I S DSREVSNG I 

7905 

A 

440 

608 

CVKLFLDTSQLADT PKS ER YEHVLE ALNX) YKTM * VDLFYVS L * GRHL. 
VSIRNNGIV 

7906 

A 

49 

796 

SHFIYFLTIW ISS LVNEVC * S S P I FLMNLQWLY I FWLLLLYELCCR 
I FSHFWCIiFTMF IN I VTF * F * YLQ I IVFLYG * LGFFYGYVKN I VNI 
LLLHAL* FCLSHFGFI PLGTI FNI * YEVIFLLL *G* S WPALFIEKK 
QKQKTFLFLLL*SAFSVVYWQYMQRSFMGSLLYWFFSCANVHSLHYF 
MI FKKI FDI * YQS PFTSLLFINSLWLFLVLCS SLYILEIAYHVSLKK 
AEH I ENL I GVAMNL 

7907 

A 

178 

565 

KGGATCPESPQDRKRRGNIiDMEKLYSENEGMASNHGKMENE*QPQDQ 
RKPQVTLYSGRQEVRKRGKDRKQGQNRR*GNV*RIKGKPESEGEAKE 
GKSEREGESEMEGGSEREGKPEIEGKPESEGEPLE 

7908 

A 

3 

415 

VTGVLVYLAVERL I SGDYE I DGGTMLSTSGCAEAEN I IMG * TLHQSG 
HGHSHGTTNQ*EENPSVRAVFIHVIGDFMQSMGVIaVAAYILYFKPEY 
KYVDPICTFVFSILRLGTTLTILRDVILVLMEGTPQGVYFTAV 

7909 

A 

119 

428 

LESLSHKRS S PRH I VIRLS KAKTKKS VLRAVRQKYQ ITYKRTPI SLS 
VDCS AQTLQARRDLGPTFCLLKENNNQP 1 1 LY P VK* S F INEGKI VFF 
REISPERIC 

7910 

A 

13 

235 

DLINKISKAAGYKINTEKSVAFLYTNSNQTEKEIKKAIPFTITTKNK 
TKYLGINLTKEV*ELYQENFETLMKGI 

7911 

A 

39 

141 

LSIRGLNIIIKRQRL*DWIKQQDSTLCCP*EIH 

7912 

A 

1 

285 

QQRRPPGLRPPHHPRGQDQLWLGKGWGAAQL*EGPSGGSRGRGVERE 
RGRRRKGRKGRA*RGTRKAGGEGRRETALASIiPLRGHRSPQERPRPT 
L 

7913 

A 

1 

452 

TAKTTFFRNAHSKHLS*NVCKPVEETQRPPTLQEIKQKIDSYNTREK 
NCLGMKLREDGTYTG F I KVHL KLRR PGTVP AG I RPQS I YDAIKEANL 
AVTTDKRTSFYL.PLDAIKQLHISSTTTVSEGIQGLVKKFNGVDNPQK 
VALFKRIHK 

7914 

A 

1 

484 

NVLCGNNQ ISDLG I LLPEVCMAPEEKGDKDDQLNKETED YLNSLFEG 
\FLKVTEDSLSYEDNQDDDSDLLQDLSPEEASYSIiQENIiPSDESCLS 
LDDLAKRIEIAEWPTEGLVS ILKKRNDTVGDHPAQMQHKPSKRRVR 
FQEIDDSLDQESLGFGDGAL 

7915 

A 

2 

1121 

HYYATDRFP FERVR I LL YYT YS YVP S FP S YTKCCFE FLT I Y FS S S LE 
SKTCLGTESSKESQHTVEPLGSSPCCHQMDVQTDSPSLSVTAGKDHM 
EELLCSAEATLALHTQSSETAGSPSGPDSSEDACEDDSRLQLAQTEA 
CQDVARIEGIAEDPKVFLSSKSKTEPLISPGCDRIPPALISEGKYSQ 
AQRKELRIiPLRDASEAIiPTDQLENNELNELQQPDLTDSDGKSPQAQA 
DSNGSENVLCGNNQISDLGILLPEVCMAPEEKGDKDDQL.NKETEDYL 
NSLLKGCLKDTEDSLSYEDNQDDDSDLLQDLSPEEASYSLQENLPSD 
ESCLSLDDLAKRIEIAEWQR*STIKSKMAFKKHIYVLNIALFL 

7916 

A 

2 

390 

SDSRETRWINLKDSVFCLYCKLFGEGKNQVKNENGCKDWQHLSHIL 
SKHEESEIWINNSVKYSKLKSDLKKNKAIDAAERRLYENEEK*WCAV 

T r\7 \J\TT VT OPCT PMMTDXTT TT T D VTCTTT TV TlTPTTtTT 

VVHV1 1 Jj£>Cr JjIjPIWXKJNJIjTIjLiRKTS VIjAI VSTIVTj 

7917 

A 

1 

240 

KRGEKAPETSKTTSTWRIPCRKRRLSNRDFPDLSTGEELPETCGSCL 
RRGSGREN*DAETWPAQCPECESKAAVSTRFLF 

7918 

A 

3 

422 

EPLSSPIiFCP*NQSKIHPPPIFIiAPTWPHPHHFPPGPGQPPFPLSTG 
SHLVL*QKPKGAHEHLSGIPPSLAQNPTWVPPHPGKRSSPHSATRPC 
KICPRIiLAPSTIiLTPSNIiTGP*PLLPGALPGLLGPPENFCTGCFL 

7919 

A 

2 

127 

RFWD YGR I ALVS * ADS ES RFQRLS S TS S SGQQDFENEL VG I 

7920 

A 

1 

346 

DSRSPSPASPPTQ*CRPSSFPGPAPPPRLWLAPPFCPPPPPHGYGPF 
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PARKRRGAALGLTRVLPPPGSAVRRSPGPGPGSAWDKTLKMKQKKFG 
RSVGLWREIjIRDCSLAELLSA 

7921 

A 

184 

291 

GRSLEASQSAGITGVSHSTRPIDILLIKAVYISLSH 

7922 

A 

481 

716 

VFFFFFETECRSIAQAGV*CSLRLPPPGSSDSRASVSRVAGIVGACH 
HAQLI FVFWAMGFHHVGEAGLEHLTSSDLP 

7923 

A 

4 

423 

VARERRLPGPQEGPHGRAAGSASPIiAPSIAPSPPGPPALPASHILGS 
QALPSPETPW*SPVTOPPAPLALVSPWQFPPGPPAPATP*RSPLGLQT 
PQAPGSWAPPVPATPQASGLPAPHVPETPWASS*LALPVPETPWDL 

7924 

A 

3 

437 

GYLSEWEENTPPKMEKEGLEIMIGKKKGIQGHYNSCYIiDSTLFCLF 
AFS S VLDTVLLRPKEKtfDVE YYS ETQELLRTE I VNPLR I YGYVCATK 
IMKLRKIIiEKVEAASGFTSEEKD/PAGQKVQDCYFYQIFMEKNEKVG 
VPTIQ 

7925 

A 

3 

187 

VLAGLELEDECAGCTDGTFRGTRYFTCALKKALFVKLKSCRPDSRFA 
LLQPVSNQIERCNSLAFGGYLSEWEENTPPKMEKEGLBIMIGKKKG 
IQGHYNSCYLDSTLFCLFAFSSVLDTVLLRPKEKNDVEYYSETQELL 
RTEIVNPLRIN*YCSRFPIRLSAVTL 

7926 

A 

4 

343 

KLEN* KMVLKE I KEDIiNKQTD I LFS * LQRLITVRMS I LPKLI YKFSA 
IPIQIPA*FL*IKIIIKCMRKGK*TRIAETASSSSSPSQSFILSPKIi 
DHRGGITANCTPPWAI KGKLL 

7927 

A 

1 

531 

F YHADHAALKDEMTE Y * VFLQ I KR YL YHGRIiliCKHPDAALLAAY HQ 
AEIGNHDSG*HPEGYSSKFQFFPKHSEKLERKIAEIHKTELSGQTPA 
TSELNFLRKAQTLETYGVDPHPCKDVSGNAAFLAFTPFGFWLQGNK 
RVHFIKWNEVTKLKFEGKTFYTjYVSQKKKGIGSCPV 

7928 

A 

115 

348 

KSCLNI FGLKQTTSSLRWNRVKK*AHTYG *QLIFNKGAKI -IHWIKYN 
PANG I C YW I S I CKKVNLDP YL» I P YT 1 1 QNG S 

7929 

A 

4 

430 

AWLSALNSSLPATWSCCPSLGASHKLQWEATSPTVPPRAPSQVKCAA 
CEPTPPTLPCLPFQPPPRSLSSQAQVLLTGHPG*GPLRRQGEGLLRS 
KGCQARLTQPNSGEGGAVS * TGRPGDGAVGEAAPETTRRDCPTAPPA 
S 

7930 

A 

1 

380 

GTRLLVRYTPKEPQVWTPTFVEVFGDLGQVGPE*C*HPEAKRMPCAQ 
DYLSWLNQLCVLYEKTPLSDRDTKCCTESLVNRRPCFSALEGDEAY 
DPQELNAETLTFH7UD I CTLS EKERQ I YKQTVL 

7931 

A 

2 

389 

NVQKSLAFLYTNNSQVESQIRNAIPFTPATKRIKYLGIPLTREVKGL 
YNENYKTLLKEFREDPNKWKNFPCSWGRINIIKMAMLSKAM*RFPR 
VHI KI PGTFFTELGKP I LKFI WNQKRAQI PRATRA 

7932 

A 

72 

373 

TS TLKKLEEKQTQSRRKE I KI PAE INAI ENKQIEKI SKTKSCS FK* S 
IKI TNP * LKQ * TKEKI * INNI KNKRGQVTMAFVA I KRI SRE YCQ * L Y 
TNKFNI 

7933 

A 

2 

259 

KQWKPAPPGPANPLPRGTSGPE * KGPHSDKAKPFSPFFKKRQWGGAP 
SRS KKARGKGQP PKGGHIiKGPLRWNGV * RAPLNRTAA 

7934 

A 

3 

335 

QIKPPRLRDTPHSFTPYHPVRRGLVYFPSAPSPAKGEQLHTRTRHNV 
LGSHPRRRLPRSS*SSHRVTRGSLGSVPPARSTSYGPSRQWDLACPL 
TP EGVWRTKNNKPEAL 

7935 

A 

37 

281 

GPSPYTDPTPTYP*LTIiLLTNPRTNSTLPPTLHIiPSRPTPSSPPSTS 
LPPSLVLRFFLKIQSGKRRAYPEHFWGQNGERKV 

7936 

A 

13 

234 

DLINKIS KAAGYKINTEKS VAFL YTNSNQTE KE I KKAI P FTITTKNK 
TKYLG INLTKEV * ELYQENFETLMKG I 

7 9^7 

7V 

* 

*± & o 

V JjE.il Lt V x b V Jj r W c>£> £> V FAW y WLiAH V YH SHGMMRAAQMCYIKTIjQIjAS 
QRRSWWAKLSSLWRIJUjLALKGCMANIS 

FCPljAVLLLAIjLQFKRKiWST*TLCLLERVVYHPGPSKFIGSNWPAV 

7938 

A 

132 

395 

ETNSLFGPQVKRQGNDLG * LNPPPRGFKKFSGLTLRRSGNYRPSPPR 
VFGF*GKRGLNWVAKGGLDLLS*NNPPPWPSQGARIMGKS 

7939 

C 

170 

394 

MYLGASSQLSPLYPELQAQEPMIQIPRPPHCGSIHWWGTSHSVPLQ 
PLKHRQRPLEYCPLPLHNTGQAALDQSG 

7940 

A 

107 

380 

PPFVAQLGGKGGNLG*LNPPPRG*RQFSSLGPPISGNYGPPPPSRVI 
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FGFLRKKGVSPSGPGGFGIPDLKIGPFNPPKGI#G*RGGPWGPNQ 

7941 

A 

83 

410 

PEI I SGFFFDHHGI KLETPKRNFGNYPNTWKLSNTLLNDQGVIEGI S 
SS SLKWLKTNEKGNPTYQS L * DTAKAVLKGNLTAI S AYIKNREKLQI 
NKQCIFNYPEKQEQM 

7942 

A 

15 

334 

QHFWENFHVTKDRINTGERQETDWKI F AT YLMKKV * I SRKCYNS YKL 
VRKKQLNRKWAKE KNKQS QKKKQSGQ * TKTVQPQ * NKRNAD * RYYLS 
PISLAKSNKLNY 

7943 

A 

1 

400 

ESRLICQQMYMGEKPFGCSCCEKAFSNKSYLLVHQQTHAEEKPYGCN 
ECGKDFSSKSYLIVHQRIHTGEKLHECSECRKTFSFHSQLVIHQRIH 
TGENPYECCECGKVFSRIDQIiVSHQKTHS*QKPYVCNEW 

7944 

A 

46 

364 

SHLPNSSFEQKYAYEFVPGYQKHYTLSKFPWPFFAEMEKLILKFKWK 
S*SHITEAILRRKNKVGDLTLSDVKSSL*RYDNQLLSV*Y*YKHRRI 
GPWDHGRPDIHL 

7945 

A 

3 

312 

VCPCSQLNKAI TSTTRCLRGFVCS S S CYRRRARERGMEKQTQRERER 
QR*RERQRRGEERQRGGKREPHSRARGGGGSRERERVREL*SDSDRS 
LEREALLR 

7946 

A 

225 

722 

GLAHSAVRHSSFWGHLQDSGPLVPPGAGQGSLGLLAAEWVGKASPWG 
PAPACPTPHS EALVCS FRTS P PS TPHAHPGPVSGP AS VLAALRLGLH 
LLRGCGGGTPTS PQTLQDEEAN* GHRGHGDRLLRLTQAPTAQPLWHR 
PWQKDWPNLLPRPLSFWCRPPPRP 

7947 

A 

43 

418 

AYWGRKTPWLTI SSFWPTPGSPYYDNGRPLCYSDSDAVLiIiCFDISRP 
ETGDSAIiKKVRLDQLSNIi*LSMRTKKESGPCHECLLLTSRNGSQGRK 
RFAQGHTISYSSGTGTWVSLPHCLPPFLLSH 

7948 

A 

2 

221 

AGPGGSPHGYVGSPLRISTRTHLCFAATRSLLHIDPLIVIiPLVDLKD 
RI 1 1 *NLWGLQPLPPASLLQPTAIiYS 

7949 

A 

1 

409 

SPGTCGRRPQTM/PPGAPDFPAVEGRSLGLSPSQGGPAGAGGDAGPQ 
EDVRQP PGELAHGPLLPAQTG/ S PG PQGLWLHRNPTS / GPPSQI GEG 
AEQGDEGVADAPQIQCKNVLKIRRRKttNH^ 

7950 

A 

2 

363 

YSTSPAGQVGRLSPSQGGPAGAGGDAGPQEDVRQPPG/EAGSRPAAS 
CPDW I PG/ AAGHVAPPQS YQCPPSQIGEGPGGTPETQADQVRERPEA 
HLAEGGAKGSPRRAGRPPRSTCGANESG 

7951 

A 

337 

1067 

NLKAHPRRPFNPPLASGPFPLILAGADTW*GIGGWKPTSSLRVPGIQ 
SGAGSERGREPSSPGG*RTSSWGPASPPAPAGPPCEGERPRIiPGPAG 
DVL * RETADPAAQHSMRQAKAGGRLPKERPSTAGKSGAPGAWS VGGG 
HRSQGSSTRLKLTVLRPPSGTSGPSPSRPRSAAATRRFDRTAPAGVIj 
AAPDSALAQAPPGPKGDG FRGCGPW I YPQVQNRFRTTQG PDSQRADG 
SAGSRLRK 

7952 

A 

127 

440 

SLRYTFLYTFPI I LEHI I VXjMQQTCCLHGFI S I YLLIHTFILQIHPF 
HTtiFQQTCSSIQQLYFLV*YFAYVYTFLFILYLSYPFLYYY*ACFET 
FDSFNEIGRV 

7953 

A 

2 

341 

MATSELSCEVSEENCERREAFWAEWKDLTLSTRPEEGCSLHEEDTQR 
HETYHQQGQCQVIiVQRS PWLMMRMG I U3RGLQE YQLP YQRVLPLP I F 
TPAKMGATKEERGVPSTLC 

7954 

A 

288 

802 

GVIMATSELSCEVSEENCERREAFWAEWKDLTLSTR/AEREGCSLHE 
EDTQRHETYHHCX5QGQV\LAQRSPWLMMRMGILGRGLQEYQLPYQRV 
LPLPI FTPAKMGATNEEREDTP IQLQELLALETALGGQRVDRQEVAE 
ITKQLPPWPVSKPGALRRSLSRSMSQEAQRG 

7955 

A 

72 

489 

HEAGSRMNGDHNSDVFSQD*QDFVQHFCQIGRVIiTEDEMGHPEIGDA 
IARLKEVLEYNAIGGKYNRGLTVWAFRELVEPRKQDADSLQRAWTV 
GWCVE/ADDIMDSSLTRRGQICWYQKPGVGLDAINDANL.LEACIYR 

7956 

A 

1 

236 

GTRWAFRELVEPRKQDADSLQRAWTVGWCVELVRGVGQGRRSFLQW 
W*LGWEGEGWA*NLFLGCVCVSLSLVCYGQI 

7957 

A 

317 

1395 

TGDQNSDVYAQEKQDFVQHFSQIVRVIiTEDEMGHPEIGDAIARLKEV 

L\ E yna IGGKYNRGLTW\ vafr\ elv\ eprkqda\ dslpagpwtvg 

\WCVELLQAF\FLVADDIMDSSLTPPGNRSCWYQKPGVGIjDAINDG\ 
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NL»L\ EAC I YRLLKL YCREQP YYLNLI \ ELFLQS S \ YQTE IGQTLDLL 
TAPP/QGNVDLSEFTEKRYKSIVKYKTAFYSFYLPIAAAMYMAGIDG 
EKEHANAKKILLEMGEFFQIQDDYLDLFGDPSVTGKIGTDIQDNKCS 
WLWQCLQRATPEQYQILKENYGQKEAEKVARVKALYEELDLPAVFL 
QYEEDSYSHIMALIEQYAAPLPPAVFLGLARKIYKRRK 

7958 

A 

3 

484 

ETPVPATSAAIRILVLLPSLRVTS PGRQGPSVDRWFPPFKFRPERS 
RHAAAGPNWRPQPKPHEVGGERRGRPNAPANSFRSCSNLSPQYSLP 
KQNKSSTNP*IPNKPTPPASPSGRTDGLSLLQPLPGSDGQASPTMRS 
SPAISSALEIEFGKTNPFH 

7959 

A 

3 

434 

SGSSRPRGSAQPRPGPGACGMAAPTSE*SSPPAWQPRPPCGGLHHCP 
ACVHPHRDSGGSVPAPAACCHTPHGRPWPEFEGQRPQTQPQPQGPAW 
VR PVLTLFAGRVAS VPLVS LCAWAS G S QPGPPS GGS GRALCTGLGAP 
PA 

7960 

A 

2 

332 

VCKLQPQPLPAHSGSPNIHISQLCPSHAWGLLVLSSGCVTFLRESFF 
LCFALVLQRKSSSLGILSCPGSIFTALNSTNLYIIjGFLGGFCLVFF* 
DRVSLCHPGWSEWQS 

7961 

A 

25 

364 

LFYLiYH1»LFFGFGLFLFFL»IKLGSSLFINFSKKSV1i*FTNWIYFSVL 

yftdfcfifiisffgfsliyfiivgfplkfigfilsfi*itvfygqa 
qwltpvipilwvaevrgsl 

7962 

A 

174 

470 

ngpp * rkkffls i prki wappgff * / egpplffffffltescs va* p 
gmrwhnpgllqplpsgfkqfsclslpsswdyrcplphpanffvwfl 

VETEFC 

7963 

A 

65 

2774 

PGKTGLGNMHAQRQFWAAVRAEVRRHEVAKQALm^LRKIiAERVDDP 
ELQDS IQASLDS IREAVINSQEWTLSRS IPELRLGVIiGDARSGKSSL 
IHRFLTGSYQVLEKTESEQYKKEMLVDGQTHLVIilREEAGAPDAKFS 
GWADAVI FVFSLEDENS FQAVS RLHGQLS S LRG EGRGGLAIaALVGTQ 
DRI S AS S PRVVGDARARALCADMKRCS YYETCATYGLNVDRVFQEVA 
QKVVTLRKQQQLLAACKSIiPSSPSHSAASTPVAGQASNGGHTSDYSS 
SLPSSPNVGHRELRAEAAAVAGLSTPGSLHRAAKRRTSLFANRRGSD 
SEKRSLDSRGETTGSGRAI PI KQS FLLKRSGNSLNKEWKKKYVTLSS 
NG Flili YHP S I ND Y I HSTHG KEMDLLRTTVKVPG KR PPRA I S AFGPS A 
SINGLVKDMSTVQMGEGLEATTPMPSPSPSPSSLQPPPDQTSKHLLK 
PDRNIiARALSTVLSGQDYYPYFRDKETEAQKIKCLAQGLKANCTPSG 
DLSPLSREPPPSPMVKKQRRKKLTTPSKTEGSAGQAEEENFEFLIVS 
STGQTWHFEAASFBERDAWVQAIESQILASLQCCESSKVKLRTDSQS 
EAYAIQA I RNAKGNS I CVDCGAPNPTWASIiNLGAL I C I E CSG IHRNL 
GTHLSRVRSLDLDDWPRELTLVLTAIGNDTANRVWESDTRGRAKPSR 
DSSREERESWIRAKYEQLLFLAPLSTSEEPLGRQLWAAVQAQDVATV 
-LLLLAHARHGPLDTSVEDPQLRSPLHIJ^EIJVHWITQIiliLW 
AARDAQVRTALFYARQAGSHLCADILLQHACPGEGGSAATTPSAATT 
PSITATPSPRRRSSAASVGRADAPVALHSWGRPTFTMGVXPPKAKGK 
EKGISIESTK 

7964 

A 

243 

687 

PHPIPHSEENFEFLIVSSTGQTWHFEAASFEERDAWVQAIESQILAS 
LQCCES S KVKVRV * GGVEDWPQELTLVLTAI GNDTAEW * EESRGV* 
EGGGAWRQSQCQPQPSLQLRTDSQSEAVAIQAIRNAKGNSICAHASA 
DAWVEA 

7965 

A 

3 

370 

PSDPEVAGDPIFPPGPPSCPEVKDKTKSSISIX3WKPPAKDGGSPIKG 
YIV*MQEEGTTDWKRWEPYKLITTWECGGPNLKELRKYRFRVKAVN 
EAGESEPRDTI\*E1PATDIQEEPEVF±D 

7966 

A 

3 

78 

FKIQNLIASTVRTVMVADCSRFDSPDLLLEAGDPATSPCRIFDLGSD 
NEEWAIPDPSHPKEGLSYSLSCVLLGRSLPRCLISCYIPLCLSVAS 
GPHKFY* PHCQHRPHGDGGRLQPLRQP 

7967 

A 

195 

345 

NDISANPAI KKFW KLPKCPTTEGWLNT * LY IKLVEYYATLKVCGDLY 
VR 

7968 

A 

2 

802 

WRNSADVWVnfCGGPLLDTLPSNWSGTCTLVQFAIPFALAFLQPEKEK 
PQHRKIREAPYGSFDSQVYLDATGVPQGVPHKFKAQDQIAAGFESIF 
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WVATTISKNIDWINYIYYNQQRFINYTRDAVKGIAEQLGPTSQMAWEN 
RMALDM ILAKKGGVCVM I KTQCCTF I PNNTAPSGS I TRALQGLTALS 
NELAKNSGVNDPFSGWLERWFGKWKGIIASILTSLAARIGVVILFGC 
*VTPCIRGLVQRLIETVLTKTSLSSPPPYSD 

7969 

A 

113 

362 

WLHAPECFSQCPSQPSPS*LGCQGPESDPDPRSWGPPLSQAGSPGR 
PGGTVITLCCLWPHLAPLFHLWSPGKCTSLLWQHS 

7970 

A 

1 

955 

MTQPFNYSAFERIIFAGSPGHTVFSSERSLLVRPRSHPEPKGEHYVT 
GSPTPENQRTSAAMSKPHSEAGTAFIQTQQLHAAMADTFLEHMCRLD 
IDSPP/RRPLPATPQKPPPWVPWRPPSSAAVGP*SSSPSLADVKSQV 
LVTCCLACTCHWQREEWVLVTLGQGCLSVSEFRV 

7971 

A 

113 

716 

RSLGPCQHSWQSSPGGPRPQPRPPGLGPCTAQGRCHGTGGPDE/TVW 
GFRWAMMGARGRYL/AHLVSRLEMGWGSYIRVHFTRPQEVPSGRSN 
SSPSHPAQ*GLWPLQHLLRRTPPEARAAPHPISPSAC*VTICPLPPF 
HRPRDQDHLAPARS AS PTVP \ PQPMVGRTPPTP * TLTQRGS PGPMS P 
AP I PLAANDSDKPS ST 

7972 

A 

244 

1510 

KFPPKKTPPHLGMESAITLWQFLLQLLLDQKHEHLICWTSNDGEFKL 
LKAEEVAKLWGLRKNKTNMNYDKLSRALRYYYDKNI I KKV I GQKFVY 
KFVSFPEILKMDPHAVEISRESLLLQDSDCKVSPEGR^AHKHGLAVL 
RSTSRNE Y IHS GLYS S FTINSLENPPDAFKAI KREKLEE PPEDS PPV 
EEVRTVIRFVTNKTDKHVTRPWSLPSTSRAAAASAFLASSVSAKIS 
SIJ4LPNAASISSASPFSSRSPSLSPKSPLPSEHRSLFLEAACHDSDS 
LEPLNLSSGSKTKSPSLPPKAKK\PKGLEISAPPLVLSGTDIGSIAL 
NSPALPSGSLTPAFFTAQTPNGLLL\TPS\PLL\SSIHFW\GSL*SK 
FVPLSPARLQGPSTLFQFPTLLNGHMPVPIPSLDRAASPVLLSSNSQ 
KS 

7973 

A 

1 

1230 

MGE I AFP I CAS VMVGFDLQHGMES AI TLWQFLLQLLLDQKHEHL I CW 
TSNDGEFKLLKAEEVAKLWGLRKNKTNMNYTDKLSRALRYTYDKNIIK 
KVT GQKFVY KFVS FPE I LKMDPHAVE I S RESLLLQDSDCKAS PEGRE 
AHKHGLAALRSTSRNEYIHSGLYSSFTINSLQNPPDAFKAIKTEKLE 
E PPEDS PPVEEVRTVIRFVTNKTDKHVTRPVVS LPS TSEAAAASAFL 
ASSVSAKISSLMLPNAASISSASPFSSRSPSLSPNSPLPSEHRSLFL 
BAACHDSDSLEPLNLSSGSKTKSPSLPPKAKKPKGLEISAPPLVLSG 
TD I GS I ALNS PALPS GSLTPAFFTAQEMMVHVYSDSTRYQDLWELGY 
VHPLVSSVLSHYCRPMVLKVSMQNNLLEDAHLHI 

7974 

A 

3 ! 

360 

LPAPGARPEWLLFPAPWGPVLCWTPQS I ISLHFFLNPVGQTLSSSS 
SSSSSP*RT*LPMTLS*VGAVSGFKPISRPPEPHSGHNSPSLSQPSL 
TILVS SNS PGSGKARGPQCPQAWL 

7975 

A 

194 

373 

LFG*LKAPPPGLKQFSGLSLRRSGNYGPPPPPPVYFFVFLKKNGFPL 
VTQGGLNLRTWG 

7976 

C 

238 

459 

MQGVISPSTSTSLFVFLALVLPASHXPPQPPPLPGPPSEIAHLTTVR 
RGAALANVNHENI SDTDENKLNAFVTA | 

7977 

A 

1 

505 

AGGAGEETPHQESQRQGCALPQEPPLYPPCPAIHPPPLPLGGLLLFP 
SLPSLALSCMVPPPWAAQEGI*MENHQPPEKKRLWGPPPANSPFPAR 
PLNPHLFQAGTGCPPCPREVPTPSPPWAEPPPLPGPPSEIAHLTTVR 
RGAALANVNHENI SDTDENKLNAFVTA 

7978 

A 

39 

141 

LSIRGLNIIIKRQRL*DWIKQQDSTLCCP*EIH 

7979 

A 

8 

533 

FNYLLKFKIRRPPHAFQMFPMSSQGKEGPETAESELSHHGCLSL*R* 
E I S WAPPTCGAFPS S AATLGAS PVLGAS VPLGAAGMQRGRRGHRI PG 
DTWGLLLPGAGGTDAESPAHNWEP*TSVPPSRPGHPSGPSAGLQSGR 
PPL*GDCEWGRKAGKGHAY*VPGPELHTLPLSDP 

7980 

A 

1 

9786 

DGRVKYTLNKNSLKI Bl PLP FGGKS SRDLKMLETVRTPAX»HFKS VGF 
HLPSREFQVPTFTIPKLYQLQVPLLGVLDLSTNVYSNLY1WSASYSG 
GNTSTDHFSLRARYHMKADSWDLLSYNVQGSGETTYDHKNTFTLSC 
DGSLRHKFLDSNIKFSHVEKLGNNPVSKGLLIFDASSSWGPQMSASV 
HLDSKKKQHLFVKEVKIDGQFRVS S F YAKGTYGLSCQRD PNTGRLNG 
ESNLRFNSS YLQGTNQITGRYEDGTLSLTSTSDLQSGI I KNTASLKY 
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ENYELTLKSDTNGKYKNFATSNKMDMTFSKQNALLRSEYQADYBSLR 
FFSLLSGSLNSHGLELNADILGTDKINSGAHKATLRIGQDGISTSAT 
TNXrKCSLLVLENELNAELGLSGASMKLTTNGRFREHNAKFSLDGKAA 
LTELSLGSAYQAMILGVDSKNIFNFKVSQEGLKLSNDMMGSYAEMKF 
DHTNSLNIAGLSLDFSSKLDNIYSSDKFYKQTVITLQLQPYSLVTTIiN 
SDLKYNALDLTNNGKLRLEPLKLHVAGNLKGAYQNNEIKHIYAISSA 
ALSASYKADTVAKVQGVEFSHRLNTDIAGLASAIDMSTNYNSDSLHF 
SNVFRSVMAPFTMTIDAHTNGNGKLALWGEHTGQLYSKFLLKAEPIA 
FTFSHDYKGSTSHHLVSRKSISAAIiEHKVSALLTPAEQTGTWKLKTQ 
FNNNEYSQDLDAYNTKDKIGVELTGRTLADIiTLLDSPIKVPLLLSEP 
INI IDALEMRDAVEKPQEFTIVAFVKYDKNQDVHS INLPFFETLQEY 
FERNRQTI IVVLENVQRNLKHINIDQFVRKYRAALGKLPQQANDYLN 
SFNWERQVSHAKEKLTALTKKYRITENDIQIALDDAKINFNEKLSQL 
QTYM IQFDQYI KDS YDLHDLKI AI AN I IDE 1 1 EKLKSLDEHYHIRVN 
LVKTIHDLHLFIENIDFNKSGSSTASWIQNVDTKYQIRIQIQEKLQQ 
LKRHIQNIDIQHLAGKLKQHIEAIDVRVLLDQLGTTISFERINDVLE 
HVKH FV INL I GD FE VAE KINAFRAKVHEL I ER YE VDQQ I QVLMDKLV 
ELAHQYKLKETIQKLSNVDQQVKIKDYFEKLVGFIDDAVKKLNELSF 
KTF I ED VNKFLDML I KKLKS FD YHQFVDETNDK I REVTQRIiNGE I QA 
LELPQKAEALKLFLEETKATVAVYLESLQDTKITLIINWLQEALSSA 
S IiAHMKAKFRETLEDTRDRMYQMD I QQELQRYL S LVGQVYS TLVTY I 
SDWWTLAAKNLTDFAEQYSIQDWAKRMKALVEQGFTVPEIKTILGTM 
PAFEVSLQALQKATFQTPDF I VPLTDLRI PS VQINFKDLKNI KI PSR 
FSTPEFTILNTFHIPSFTIDFVEMKVKIIRTIDQMLNSEIiQWPVPDI 
YLRDLKVEDI PLARITLiPDFRLPEI AI PEFQLPHI SHT I EVPTFGKL 
YSILKIQSPLFTLDANADIGNGTTSANEAGIAASITAKGESKLEVLN 
FDFQANAQLSNPKINPLALKESVKFSSKYLRTEHGSEMLFFGNAIEG 
KSNTVASLHTEKOTLELSNGVIVKINNQLTLDSNTKYFHKLNIPKLD 
FSSQADLRNEIKTLLKAGHIAWTSSGKGSWKWACPRFSDEGTHESQI 
SFTIEGPLTSFGLSNKINSKHLRVNQNLVYESGSLNFSKLEIQSQVD 
SQHVGHSVLTAKGMALFGEGKAEFTGRHDAHLNGKVIGTLKNSLFFS 
AQPFEITASTNNEGNLKVRFPLRLTGKIDFLNNYALFLSPSAQQASW 
QVSARFNQYKYNQNFSAGNNENIMEAHVGINGEANTjDFLNIPLTIPE 
MRLPYTIITTPPLKDFSLWEKTGLKEFLKTTKQSFDLSVKAQYKKNK 
HRHS I TNPLAVLCEFISQS I KS FDRHFEKNRNNALDFVTKS YNETKI 
KFDKYKAEKSHDELPRTFQIPGYTVPWNVEVSPFTIEMSAFGYVFP 
KAVSMPS FS II/3SDVLVPS YTLILPSLELPVLHVPRNIiKIiSIiPHFKE 
LCTI SHI FI PAMGNNTYDFS FKSSVITLNTOAELFNQSDIVAHLLSS 
SSSVIDALQYKLEGTTRLTRKRGLKLATALSLSNKFVEGSHNSTVSL 
TTKNMEVS VAKTTKAE I PI LRMNFKQELNGNTKS KPTVS SSMEFKYD 
FNSSMLYSTAKGAVDHKLSLESLTSYFSIESSTKGDVKGSVLSREYS 
GTIASEANTYI^SKSTRSSVKLC^TSKIDDIWNIjEVKENFAGEATLQ 
R I YS LWEHS TKNHLQLEGLF FTNGEHTS KATLELS P WQM S ALVQVHA 
SQPS S FHDFPDLGQEVALNANTKNQKI rwknevrihsgs fqsqvels 
NDQEKAHLDI AGSLEGHLRFLKNI ILPVYDKSLWDFLKLDVTTS IGR 
RQHLRVSTAFVYTKNPNGYSFS IPVKVLADKFITPGLKLNDLNSVLV 
MPTFHVPFTDIiQVPSCKLDFREIQIYKKLRTSSFALNLPTLPEVKFP 
EVDVLTKYSQPEDSLIPFFEITVPESQLTVSQFTLPKSVSDGIAALD 
LNAVANKIADFELPTIIVPEQTIEIPSIKFSVPAGIVIPSFQALTAR 
FEVDS PVYNATWS ASLKNKAD YVETVLDS TCS STVQFLE YELNVLGT 
HKIEDGTLASKTKGTFAHRDFSAEYEEDGKYEGLQEWEGKAHLNIKS 
PAFTDLHLRYQKDKKGI STS AAS PAVGTVGMDMDEDDD FSKWNFYYS 
PQSS PDKKLTI FKTELRVRESDEETQI KVNWEEEAASGLLTSLKDNV 
P KATG VI, YD YVN KYHWEHTG LTLREVS S KLRRNLQNNAE WVYQGA I R 
Q I DD I D VR FQKAASGTTGT YQE WKDKAQNLi YQELLTQEGQAS FQGLK 
DNVFDGLVRVTQEFHMKVKHLIDSLIDFLNFPRFQFPGKPGIYTREE 
LCTMFIREVGTVIiSQVYSKVHNGSEILFSYFQDLVITLPFELRKHKL 


.i. o o s "mi-bs h- 3 ,„ o a a & o 2 






IDVISMYRELLKDLSKEAQEVFKAIQSLKTTEVLRNLQDLLQFIFQL 
I EDNI KQLKEMKFTYL INY I QDE I NT I FND Y I P YVFKLLKENLCLNL 
HKFNEFIQNBLQEASQELQQIHQYIMALREEYFDPSIVGWTVKYYEL 
EEKIVSLIKNIiLVALKDFHSEYIVSASNFTSQLSSQVEQFLHRNIQE 
YLS ILTDPDGKGKEKIAELSATAQEI IKSQAIATKKI ISDYHQQFRY 
KLQDFSDQLSDYYEKFIAESKRLIDLSIQNYHTFLIYITELLKKLQS 
TTVMNPYMKLAPGELTI IL 

7981 

A 

27 

13822 

VPFSVAAAEEPAQPARAARPRPGRSPGAAPPQIAMDPPRPALLALLA 

LPAIiLLLLLAGARABEEMLENVSLVCPKDATRFKHIiRKYTYNYEAES 

S SGVPGTADSRS ATR INCKVELEVPQLCS F I LKTSQCI LKEVYGFNP 

EGKALLKKTKNSEEFAAAMSRYELKLAIPEGKQVFLYPEKDEPTYII, 

NIKRGIISALLVPPETEEAKQVLFLDT^GNCSTHFTVKTRKGNVAT 

SXSTERDLGQCDRFKPIRTGISPLALIKGMTRPLSTLISSSQSCQYT 

LDAKRKHVAEAI CKEQHLFLPFS YKNKYGMVAQVTQTLKLEDTPKIN 

SRFFGEGTKKMGLAFESTKSTSPPKQAEAVLKTVQELKKLTISEQNI 

QRANLFNKLVTELRGLS DEAVTSLLPQL 1 EVS S P I TLQALVQCGQPQ 

CS TH I LQWLKRVHANPLL I DWTYLVAL I PEPS AQQLRE I FNMARDQ 

RSRATLYALSHAVNNYHKTNPTGTQELLDIANYLMEQIQDDCTGDED 

YTYLILRVIGNMGQTMEQLTPELKSSILKCVQSTKPSLMIQKAAIQA 

LRKMEPKDKDQEVLLQTFLDDASPGDKRLAAYLMLMRSPSQADINKI 

VQILPWEQNEQVKNFVASHIANILNSEELDIQDLKKLVKEVLKESQL 

PTVMDFRKFSRNYQLYKSVSIPSLDPASAKIEGNLIFDPNNYLPKES 

MLKTTLTAFGFASADLIEIGLEGKGFEPTLEALFGKQGFFPDSVNKA 

L YWVNGQVPDGVS KVL VDHFG YTKDDKHEQDMVNG I ML SVEKL I KDL 

KSKEVPEARAYLRILGEELGFASLHDLQLLGKLLLMGARTLQGIPQM 

IGEVIRKGSKNDFFLHYIFMENAFELPTGAGLQLQISSSGVIAPGAK 

AGVKLEVANMQAELVAKPSVSVEFVTNMGI 1 1 PDFARSGVQMNTNFF 

HESGLEAHVALKPGKLKFIIPSPKRPVKXLSGGNTLHLVSTTKTEVI 

PPLIENRQSWSVCKQVFPGLNYCTSGAYSNASSTDSASYYPLTGDTR 

LELELRPTGEIEQYSVSATYELQREDRALVDTLKFVTQAEGAKQTEA 

TMTFKYNRQSMTLSSEVQIPDFDVDLGTILRVNDESTEGKTSYRLTL 

DI QNKKITEVALMGHLS CDTKEERKI KGVI S I PRIiQAEARSEILAHW 

SPAKLLLQMDSSATAYGSTVSKRVAWHYDEEKIEFEWNTGTNVDTKK 

MTSNFPVDLSDYPKSLHl^ANRLIiDHRVPQTDMTFRHVGSKLIVAMS 

SWLQKASGSLPYTQTLQDHLNSLKEFNLQNMGLPDFHIPENLFLKSD 

GRVKYTLNKNS LKI E I PLPFGGKS SRDLKMLETVRTPALHFKS VGFH 

LPSREFQVPTFTI PKLYQLQVPLLGVLDLSTNVYSNIiYNWSAS YSGG 

NTSTDHFSLRARYHMKADSWDLLSYNVQGSGETTYDHKNTFTLSCD 

GSIiRHKFLDSNIKFSHVEKLGNNPVSKGLLIFDASSSWGPQMSASVH 

LD S KKKQHLFVKE VK IDGQFR VS S F Y AKGT YGL S CQRD PNTGRLNGE 

SbXLRFNSSYLQGTNQITGRYEDGTLSLTSTSDLQSGIIKNTASLKYE 

NYELTLKSDTNGKYKNFATSNKMDMTFSKQNALLRSEYQADYESLRF 

FSLLSGSLNSHGLELNADILGTDKINSGAHKATLRIGQDGISTSATT 

NLKCSLLVLENEIJ^AELGLSGASMKLTTNGRFREHNAKFSIJX5KA^ 

TELS LG S AYQAM I LGVD S KN I FNF KVS QEGLKLSNDMMG S Y AEMKFD 

HTNSI^IAGLSLDFSSIOjDNIYSSDKFYKQTVNLQLQPYSLVTTLNS 

DLKYNALDLTNNGKLRLEPLKLHVAGKTLKGAYQNNEIKHIYAISSAA 

LSAS YKAD WAKVQGVEFSHRLNTD IAGIiASAIDMSTNYNSDSLtHFS 

NVFRSVMAPFTMTIDAHTNGNGKLAIiWGEHTGOLYSKFIiLKAEPLAF 

TFSHDYKGSTSHHXiVSRKS I S AALEHKVSALLTPAEQTGTWKLKTQF 

NNNEYSQDLDAYNTKDKIGVELTGRTIoADLTLLDSPIKVPLLLSEPI 

NIIDALEMRDAVEKPQEFTIVAFVKYDKNQDVHSINLPFFETLQEYF 

ERNRQTI I VVLENVQRNLKHINIDQFVRKYRAALGKLPQQANDYLNS 

FNWERQVSHAKEKLTALTKKYRITENDIQIALDDAKINFNEKLSQIjQ 

tymiqfdqyikdsydlhdlkiaianiideiieklksldehyhirvnl 
vktihdlhlfienidfnksgsstaswiqnvdtkyqiriqiqeklqql 
krhiqnidiqhlagklkqhieaidvrvlldqlgttisferindvleh 
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VKHFVINL IGDFE VAEKINAFRAKVHEL I ERYEVDQQI QVLMDKLVE 

LAHQYKLKETIQKIiSNVliQQVKIKDYFEKLVGFIDDAVKKIiNELSPK 

TF I EDVNKFLDML I KKLKS FDYHQFVDETNDKI REVTQRLNGE I QAL 

ELPQKAEAIiKLFLEETKATVAVYLESLQDTKITIillNWLQEALSSAS 

LAHMKAKFRETLEDTRDRMYQMDIQQELQRYLSLVGQVYSTLVTYIS 

DWWTLAAKNLTDFAEQYSIQDWAKRMKALVEQGFTVPEIKTILGTMP 

AFEVSLQALQKATFQTPDFIVPLTDLRIPSVQINFKDLKNIKIPSRF 

STPEFTILNTFHIPSFTIDFVEMKVKIIRTIDQMIiNSEIiQWPVPDIY 

LRDL KVED I PLAR ITLPDFRLPE I A I PEF 1 1 PTLNLNDFQVPDLHI P 

EFQLPHISHTIEVPTFGKLYSILKIQSPLFTLDANADIGNGTTSANE 

AG I AAS I TAKGES KLEVLNFDFQANAQLSNPKINPLALKES VKFS S K 

YLRTEHGSEMLFFGNAIEGKSNTVASLHTEKNTLELSNGVIVKINNQ 

LTLDSNTKYFHKLNIPKLDFSSQADLRNEIKTLLKAGHIAWTSSGKG 

SWKWACPRFSDEGTHESQISFTIEGPLTSFGLSNKINSKHLRVNQNIi 

VYESGSLNFSKLEIQSQVDSQHVGHSVLTAKGMAIiFGEGKAEFTGRH 

DAHLNGKVIGTLKNSLFFSAQPF^ITASTNNEGNLKVRFPLRIiTGKI 

DFLNl^ALFLSPSAQQASWQVSARFNQYKYNQNFSAGNNENIMEAHV 

GINGEANLDFLNIPLTIPEMRLPYTIITTPPLKDFSLWEKTGLKEFIj 

KTTKQSFDLSVKAQYKKNKHRHS ITNPLAVLCEFISQS IKSFDRHFB 

KNRNNALD FVTKS YNETKI KFDKYKAEKSHDELPRTFQ I PGYTVPW 

NVEVSPFTIEMSAFGYVFPKAVSMPSFSIIiGSDVRVPSYTLILPSLE 

LPVLHVPRNLKLSLPDFKELCTISHIFIPAMGNITYDFSFKSSVITTi 

NTNAELFNQSDIVAHLLSSSSSVIDAIiQYKLEGTTRIiTRKRGLKLAT 

ALSLSNKFVEGSHNSTVSLTTKNMEVSVATTTKAQIPILRMNFKQEL 

NGNTKSKPTVSSSMEFKYDFNSSMLYSTAKGAVDHKLSLESLTSYFS 

I E S S TKGDVKGS VLSRB YSGTI AS EANTYLNS KSTRS S VKLQGTSKI 

DDIWNLEVKENFAGEATLQRIYSLWEHSTKNHLQLEGLFFTNGEHTS 

KATLELS PWQMS ALVQVHAS QPSSFHDFPDLGQEVALNANTKNQKIR 

WKNEVRIHSGSFQSQVELiSNDQEKAHLDIAGSLEGHIjRFI»KNIILPV 

YDKSLWDFLKLDVTTS IGRRQHLRVSTAFVYTKNPNGYSFS I PVKVL 

ADKFIIPGLKLNDLNSVLVMPTFHVPFTDLQVPSCKLDFREIQIYKK 

IJ^TSSFALNLPTLPEVKFPEVDVLTKYSQPEDSIjIPFFBITVPESQL 

TVSQFTLPKSVSDGIAALDIiNAVANKIADFELPTIIVPEQTIEIPSI 

KFSVPAGIAIPSFQALTARFEVDSPVYNATWSASLKNKADYVETVLD 

STCS STVQFLEYELNVLGTHKI EDGTIiAS KTKGTFAHRDFS AE YEED 

GKYEGLQE WEGKAHLN I KS P AFTDLHLR YQKDKKG I STS AAS PAVGT 

VGMDMDEDDDFSKWNFYYSPQSSPDKKLTIFKTELRVRESDEETQIK 

VNWEEEAASGLLTSLKDNVPKATGVLYDYVNKYHWEHTGLTLREVSS 

KLRRNLQDHAEVATyQGAIREIDDIDERFQKGASGTTGTYQEWKDKAQ 

NLYQELLTQEGQASFQGLKDWFDGLVRVTQEFHMKVKHLIDSLIDF 

LNFPRFQFPGKPGIYTREELCTMFIREVGTVLSQVYSKVHNGSEILF 

SYFQDLVITLPFELRKHKLIDVISMYRELLKDLSKEAQEVFKAIQSL 

KTTEVLRNIiQDLLQFIFQIiIEDNIKQLKEMKFTYLINYIQDEINTIF 

ND Y I P YVFKLLKENLCLNLHKFNE F I QNELQEA S QELQQ I HQ Y IMAL 

REEYFDPSIVGWTVKYYELEEKIVSIiIKNLLVALKDFHSEYIVSASN 

FTSQLSSQVEQFLHRNIQEYLSILTDPDGKGKEKIAELSATAQEIIK 

SQAI\ATKKIISDYHQQFRYKLQDFSDQLSDYYEKFIAESKRLIDLS 

IQNYHT\FLIYITELLKKLQSTT\VMNPYMKLAPGELTIIL 

7982 

A 

40 

401 

ELFGIIALQTIAYSILWDLKFLMRNLALGGGIiLLLLAESRSEGKSMF 
AG\LMFMTLLHFDASFFSIVQNIVGTALMILVAIGFKTKIiAALTLVV 
WLFAINVYFNAFWTI PVYKPMHDFLKY 

7983 

A 

1 

900 

FRAAGAGADGRE PAS ERASRAE PPAVAMGQNDLMGMAED FADQFLRV 
TKQYLPHVARIiCLISTFLEDGIRMWFQWSEQRDYIDTTWNCGYLLAS 
SFVFLNIjLGQLTGCVLVLSRNFVQYACFGLFGI IFLY\T IAYS ILWD 
LKFLMRNLALGGGLLLLLAE S RS EG KSM F AG \ VPTMRE S S P KQYMQL 
GGRVLLVLMFMTLLHF\ DAS FLFLFVQN I VG\ TALMI \ L.VAIGFKTK 
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L\AALTLVVWLFAINVYFNAFWTIPVYK\ PMHDFLKYDF\ FQTMSVI 
GGLLLV\VALGPGGVSMDEKKKEW 

7984 

A 

2 

398 

GRVGRPTRPPTRPPTRPPTRSSSSSSSSSSSSSSSSSSSSSSSSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSFSSS 
SSGPFFGVSPCVPEILKSSSPRIGCPRN*AFPFKRGNL 

7985 

A 

5 

230 

IGNFKlfVFSIVNITTRQNTSTDIENLKlWTHV*NTVSNS*YTVFSSA 
CGTFTKISHLLRH*TSLDKF*KLDVLWS 

7986 

A 

1 

366 

NPSIVGGQDLCKGNYKTQIKDLLADGSSSSSSSSSSSSSSSSSSSSS 
SSSSSSSSPAGPMKTPTSFLIELGKTIVQFIi*NQKRV*ITKVILSKK 
PQKTENKTKLEAFHSTLCYIiPSNYATRIj 

7987 

A 

3 

363 

ISKKKFFLKGFFFIjGFF*KKPPFFFFPPPPPG*RGKKKIFFFFWVKK 
NPLEKNFFSLFLKKGGPPTPPGGFFFFKKKKRAPQGGGPPKIFFFFF 
FFFFFFFFFFFGDGVLLCCPGWSAV 

7988 

A 

2 

360 

GLPS*HYSPPQLPH*SHSHTIAPKAEEKANRKETHFSRSPQQISGRC 
PPQDKSLAKVDFLDQLRPMVGHSLGRGPGHLPKHMPTPEQIQEEPEA 
HIHLPLVCEQLPLLPWEDFVYGSSR 

7989 

A 

490 

583 

NH*SPMFGYCICVCFETESCSVAQAGVQWYNRGSLQPPI»PGPKGFVC 
LSLPSSWDYRRRPP 

7990 

A 

1 

129 

QPGPEGKIRFFLKIPNIiTPSGGKSLKFPLFKRVKPENCIiSLRG*GCN 
♦PGPEGKIRFFLKIPNLTPSGGKSLKFPIiFKRVKPENCLSLRG 

7991 

C 

96 

380 

MWIPFSKFLVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

F 

7992 

A 

202 

499 

I LGFVQWHDHS S PQPKTPGLKQS S CLNFLS S * DNRHMP PGLANFFI F 
VEMGSCYVAQACLELLGSSNPLASVSQNAGITSMSQCTQLFLLLLNR 
FPAFL 

7993 

A 

1 

206 

NS S VFLHLQ AI»G I PLHLAYVSVFP PTPG * GSGHGRS LLSI SVGQGLP 
PQRPS PAS PFHLD I IHQGP AW 

7994 

A 

2 

349 

TCHRKRKKRNPRHGEVGLGYPTDESAFPSAPALGPGEPRVPGSSSGR 
ASGH*WYPNAVESP*EEGEL*GRRGETLVAEKKKKPRHGEVGPGSPM 
EKKCLPISP*AGPRGPRRPWF 

7995 

A 

269 

365 

GHLS AIGHPFP * MS FNHGRKTRTWNYt»GSEEQ 

7996 

A 

1 

314 

PSRSFI SNH * PHQDV* PRFFNG IHL * YSHTPG INHVEPE PNSTRSKA 
CWEGFGS SNSGNGS FLGAR * VDVQEVLTTTWRV I RVWN * CFGRNSAP 
MAQSKSSSES 

7997 

A 

65 

355 

YFYYIFSIYHFIAPPYYFSLVFLHIiVCPYLSSFLSR*KIHVCLKDFK 
VNFS * TI PKLSKLLELLES I SLFFL *NVDFI ILEILI PLYSAILKRV 
YF 

7998 

A 

1 

957 

MGLLLKASSLASVTGFCLLQEGPQDQEVFYLLLFVNYFSAKQLLAYR 
CFVEMGSEIbGPTLSETRPEALPPPSSNETPAVSDSKEKKNAAXKKC 
LYNFQDAFMEANKWMATS SATSSVSCTATTVQS SNSQFRVSSKRPP 
SVARGDPDCEGHRCENGVYDPQQDDGDESADEDSCSEHSSSTSTSTN 
QKEGKYCDCCYCEFFGHGGNGFVIiVIiAIMRAAADTPALSAGEPSFRR 
KEPSTVEMIEERPSCCDCSTLKSSAHTSQSYSRFRLTIGPDWEALGE 
KPCPVPQKTRDCWSAENTVCPPFRSVQHRGTHDGPRP 

7999 

A 

97 

4146 

EKEPYEEISAEADREPQQLQNYWSEVRYTVRCIYRQAGTPLADDQDQ 
SLVPDKEGVKELVDRLCERDPYQLYQRLEQQAREYVLEMKVRLLRQL 
SAAAKVKAPSGLQGPPQAHQFISLLLEEYGALCQAARSISTFLGTLE 
NEHLKKFQVTWELHNKHLFENLVFSEPLLQSNLPALVSQIRLGTTTH 
DTCSEDTYSTLLQRYQRSEEELRRVAEEWLECQKRIDAYVDEQMTMK 
TKQRMLTEDWELFKQRRFIEEQLTNKKAVTGENNFTDTMRHVLSSRL 
SMPDCPNCNYRRRCACDDCSLSHILTCGIMDPPVTDDIHIHQLPLQV 
DPAPDYLAERSPPSVSSASSGSGSSSPITIQQHPRLILTDSGSAPTF 
C SDDEDVAPL S AKFAD I YPLSNYDDTE WANMNG I HS ELNGGGENMA 
LKDESPQISSTSSSSSEADDEEADGESSGEPPGAPKEDGVLGSRSPR 
TEESKADSPPPSYPTQQAEQAPNTCECHVCKQEASGLTPSAMTAGAL 
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PPGHQFLSPEKPTHPALHLYPHIHGHVPLHTVPHLPRPLIHPTLYAT 
PPFTHSKALPPAPVQNHTNKHQVFNASLQDHIYPSCFGNTPEWNSSK 
FISLWGSEVMNDKNWNPGTFLPDTISGSEILGPTLSETRPEALPPPS 
SNETPAVSDSKEKKNAAKKKCLYNFQDAFMEANKVVMATSSATSSVS 
CTATTVQSSNSQFRVSSKRPPSVGDVFHGISKEDHRHSAPAAPRNSP 
TGLAPLP7\IjSPAALSPAAljSPASTPHIxANIiAAPSFPKTATTTPGFVD 
TRKSFCPAPLPPATDGS I SAPPSVCSDPDCEGHRCENGVYDPQQDDG 
DESADEDSCSEHSSSTSTSTNQKEGKYCDCCYCEFFGHGGPPAAPTS 
RNYAEMREKLRLRLTKRKEEQPKKMDQISERESVVDHRRVEDLLQFI 
NSSETKPVSSTRAAKRARHKQRKLEEKARLEAEARAREHLHLQEEQR 
RREEEEDEEEEEDRFKEEFQRLQELQKLRAVKKKKKERPSKDCPKLD 
MLTRNFQAATESVPNSGNIHNGSLEQTEEPETSSHSPSRHMNHSEPR 
PGLGADGDAADPVDTRDSKFLLPKEVNGKQHEPLSFFFDIMQHHKEG 
NGKQKLRQTS KAS S EPARRPTEPPKATEGQSKPRAQTES KAKWDLM 
S I TEQKREERKVNSNNNNKKQLNH I KDEKSNPTPMEPTS PGEHQQNS 
KLVIJ^SPQPKGKNKKNKKKKGDRVNNSIDGVSLLLPSLGYNGAILA 
HClOiRLPGSSDCAASASQVVGITDDVFLPKDIDLDSVDMDETEREVE 
YFKRFCLDSARQTRQRLS INWSNFSLKKATFAAH 

8000 

A 

97 

2728 

EKEPYEEISAEADREPQQLQNYWSEVRYTVRCIYRQAGTPLADDQDQ 
SLVPDKEGVKELVDRLCERDPYQLYQRLEQQAREYVLEMKVRLLRQL 
SAAAKVKAPSGLQGPPQAHQFI SLLLEEYGAJLCQAARS ISTFLGTLE 
NEHLKKFQVTWELHNKHLFENIjWSEPLLQSNLPALVSQIRIjGTTTH 
DTCSEDTYSTLLQRYQRSEEELRRVAEEWLECQKRIDAYVDEQMTMK 
TKQRMLTEDWELFKQRRFIEEQLTNKKAVTGENNFTDTMRHVLSSRL 
SMPDCPNCNYRRRCACDDCSLSHILTCGIMDPPVTDDIHIHQLPLQV 
DPAPDYLAERSPPSVSSASSGSGSSSPITIQQHPRLII/TDSGSAPTF 
CSDDEDVAPLS7UCFADIYPLSNYDDTEWANMNGIHSELNGGGENMA 
LKDESPQISSTSSSSSEADDEEADGESSGEPPGAPKEDGVLGSRSPR 
TEESKADSPPPSYPTQQAEQAPNTCECHVCKQEASGLTPSAMTAGAL 
PPGHQFliSPEKPTHPALHLYPHIHGHVPLHTVPHLPRPLIHPTLYAT 
P P FTH S KAL P P AP VQNHTNKHQ VFNAS LQDH I Y P S C FGNTPE WNS S K 
FISLWGSEVMNDKNWNPGTFLPDTISGSEILGPTLSETRPEALPPPS 
SNETPAVSDSKEKKNAAKKKCLYNFQDAFMEANKVVMATSSATSSVS 
CTATTVQSSNSQFRVSSKRPPSVGDVFHGISKEDHRHSAPAAPRNSP 
TGl^PLPAIiSPAALSPAALSPASTPHIiANLAAPSFPKTATTTPGFVD 
TRKSFCPAPLPPATDGS IS APPSVCSDPDCEGHRCENGVYDPQQDDG 
DESADEDSCSEHSSSTSTSTNQKEASSCTNK 

8001 

A 

1 

2295 

MGLLLKASSLASVTGFCLLQEGPQDQEVFYLIiLFVNYFSAKQLLAYR 
CFVEMGSEILGPTLSETRPEALPPPSSNETPAVSDSKEKKNAAKKKC 
LYNFQDAFMEANKWMATS SATS S VS CTATTVQS SNSQFRVS SKRPP 
SVGDVFHGISKEDHRHSAPAAPRNSPTGLAPLPALSPAALSPAALSP 
ASTPHLANLAAPS F PKTATTTPGFVDTRKS FCPAPLPPATDGS I SAP 
PSVCSDPDCEGHRCENGVYDPQQDDGDESADEDSCSEHSSSTSTSTN 
QKEGKYCDCCYCEFFGHGGPPAAPTSRNYAEMREKLRLRLTKRKEEQ 
PKKMDQISERESVVDHRRVEDLLQFINSSETKPNGFVLVIiAIMRAAA 
DTPALSAGEPSFRRKEPSTVEMIEERPSCCDCSTLKS SAHTSQSYSR 
FRLTIGPDWEALGEKPCPVPQKTRBCWSAENTLEEKARIiEAEARARE 
HLHLQEEQRRREEEEDEEEEEDRFKEEFQRLQELQKLRAVKKXKKER 
PS KDCPKLDMLTRNFQAATE S VPNSGN I HNGS LEQTEEP ETS SHS PS 
RHMNHSE PRPGLGADGD AAD PVDTRD S KFLLPKE VNGKQHE PLS FFF 
DIMQHHKEGNGKQKLRQTSKASSEPARRPTEPPKATEGQSKPRAQTE 
SKAKWDLMS ITEQKREERKVNSNNNNKKQLNHI KDEKSNPTPMEPT 
S PGEHQQNS KLVIjAESPQPKGKNKKbnCKKKGDRVNKS IDSKNLDPI I 
HGAFTDLLS PTTQ 

8002 

A 

97 

4027 

EKEPYEEISAEADREPQQL.QNYWSEVRYTVRCIYRQAGTPLADDQDQ 
SLVPDKEGVKELVDRLCERDPYQLYQRLEQQAREYVLEMKVRLLRQL 
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SAAAKVKAPSGLQGPPQAHQFISLLIiEEYGALCQAARSISTFLGTLE 

NEHLKKFQVTWELHNKHLFENLWSEPLLQSNIiPALVSQIRIiGTTTH 

DTCSEDTYSTLLQRYQRSEEELRRVAEEWLECQKRIDAYVDEQMTMK 

TKQRMLTEDWELFKQRRFIEEQLTNKKAVTGENNFTDTMRHVLSSRL 

SMPDCPNCNYRRRCAC7DDCSLSHILTCGIMDPPVTDDIHIHQLPLQV 

DPAPDYLAERSPPSVSSASSGSGSSSPITIQQHPRLILTDSGSAPTF 

CSDDEDVAPLSAKFADIYPLSNYDDTEWANMNGIHSELNGGGENMA 

LKDESPQISSTSSSSSEADDEEADGESSGEPPGAPKEDGVLGSRSPR 

TEESKADSPPPSYPTQQAEQAPNTCECHVCKQEASGLTPSAMTAGAL 

PPGHQFLS PEKPTHPALHL Y PH IHGHVPLHTVPHLPRPL IHPTLYAT 

PPFTHSKALPPAPVQNHTNKHQVFNASLQDHIYPSCFGNTPEWNSSK 

FISLWGSEVMNDKNWNPGTFLPDTISGSEILGPTLSETRPEALPPPS 

SNETP AVSDS KEKKNAAKKKCL YNFQDAFMEANK\ WMATS S ATS S V 

S WTANTVQS SNSQFRVS S KRPPS VGDVFHGI SLGD\ QKNS APAAPRN 

S PTGLAPLPALS PAALS PAALS PASTPHLANLAAPS FPKTATTTPGF 

VDTRKSFCPAPLPPATDGS ISAPPSVCSDPDCEGHRCENGVYDPQQD . 

DGDESADEDSCSEHSSSTSTSTNQKEGKYCDCCYCEFFGHGGPPAAP 

FINSSETKPVSSTRAAKRARHKQRKLEEKAP^EAEARAREHLHIiQEE 
QPJU^EEEDEEEEEDRFKEEFQRLQELQKLRAVKKKKKERPSKDCPK 
LDMLTRNFQAATES VPNS GN I HNGS LEQTEE PETS SHSP SRHMNHS E 
PRPGLGADGDAAD PVDTRDS KFLLPKEVNGKQHE PLS FF FD IMQHHK 
EGNGKQKLRQTS KAS S E PARRPTE P PKATEGQS KPRAQTE S KAKWD 
LMSITEQKKEERKVNSNNNNKKQLNHIKDEKSNPTPMEPTSPGEHQQ 
NSKXiVLAESPQPKGKNKKNKKKKGDRVNNSIDDVFLPKDIDLDSVDM 
DETEREVEYFKRFCLDSARQTRQRLSINWSNFSLKKATFAAH 

8003 

A 

1 

450 

GVPYKRH I EGF * PGLNGPLKFNGPS SS S SSDGLLRLFSGGSGGQS PN 
WVPLGSYQGRSRPVPPGGSSAEPFPPSPASGGTHVPQLRALPPLHGH 
SAAHPHLSLALWPPPVVRTL*GHWGPGPPTLDAPPSPSHLQVPLSS 
QGGIPPHWT 

8004 

A 

16 

303 

GENPLMRWPEALPGVFKTCFSPPT*GYGGGLMFLCHHADSINFSAFQ 
PINNADFIVPIEIEGTTHQVSCQPDHTQPRGWGVCLLPHLYWGPPPI 
SG 

8005 

A 

146 

809 

RIFSLCSAALSKKHGNPQDCSRRAPGNVPAAISRAPSLVLGLRARTV 
TRKGGLRPRKGEQGP I PS P SDP AVKGPGAEPGQARFAPVYQI L IGAP 
RWE PLLSTGG YTVSGS S PWEVGTRANQPRQ * ALPASHLPPLASRS S 
S S S HL YTAAP PL VSRGSGG S AGRALDS S F PQHS LAKPRP S AGLEAPG 
HWGRTGGDRRRKGTGGEPHTFAEVEALPASARP 

8006 

A 

111 

371 

FGPGPPL*SPLF*GSSSSHSGVGK*NPSFLPGGTPVFFKNPKFPPPW 
GGPPVSPLFWGGETGNWGNPGGGTLEGPGYPPLALPGGK 

8007 

A 

85 

394 

DWRRRRRDSRSS * REDCQSTI YVRKRAKSNCWCWSKEKS KSHERDS I 
PIRIEEKRMRSRSKGRDHENVKEKEKQSDSIGNDQE6SRSREKSNQF 
DSKSNERDH 

8008 

A 

2 

2255 

AAC^RRRARVTWRRQMPLAQLKEPWPIiMEIiVPLDPENGQTSGEEAGL 
QPSKDEGVLKEISITHHVKAGSEKADPSHFELLKVLGQGSFGKVFLV 
RKVTR PDSGHL Y AMK\^KKATLKVRDRVRTKMERD I LADVNHP FWK 
LHYAFQTEGKLYL I LDFLRGGDLFTRLS KEVMFTEEDVKFYLAELAL 
GLDHLHSLGIIYRDLKPENILLDEEGHIKLTDFGLSKEAIDHEKKAY 
S FCGTVE YMAPEWNRQGHSHS AD WWS YGVLMFEMLTGS LPFQGKDR 
KETMTLILKAKLGMPQFLSTEAQSLLRALFKRNPANRLGSGPDGAEE 
IKRHVFYSTIDWNKLYRREITPPFKPAVAQPDDTFYFDTEFTSRPPY 
E S S WQP PPELGPNTWFWGVPS VATGLMEDDGKPRAPQAP IjHS WQQL 
HGKI^VFSDGYWKETIGVGSYSECK^CVHKATNMEYAVKVIDKSKR 
DP S EE I E I LLR YGQHPNI I TLKDVYDDGKHVYLVTELMRGGELLDKI 
LRQKFFSEREASFVLHTIGKTVEYLHSQGVVHRDLKPSNILYVDESG ^ 
HPECLRICDFGFAKQLRAENGLLMTPCYTANFVAPEVLKRQGYDEGC 
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DIWSLGILLYTMLAGYTPFANGPSDTPEEILTRIGSGKFTIjSGGNWN 
TVSETATOLVSKMLHVDPHQRLTAKQVLQHPWVTQKDKLPQSQLSHQ 
DLQLVKGAMAATYSALNSSKPTPQLKPIESSILAQRRVRKLPSTTL 

8009 

A 

1 

1845 

MAEGDAKGDKAKVKDEKPAPPKPEPKPKKAPAKKGBKVPKGKKGKA0 

AGKEGNNPAENGDAKTDQEGPAAQLQGVWTADSLQLSAPGIASDADS 

LLAQDH I LPHPS S S FPRAAP I RKRFTTI TWFSHINCSLRTGHDDNGG 

FVEQKGGKVRDRVKTKMERD I LADVNH P FWKLH YAFQTEGKLYL I L 

DFLRGGDLFTRLSKEVMFTEEDVKFYLAEIiALGLDHLHSLGIIYRDL 

KPENILLDEEGHIKLTDFGLSKEAIDHEKKAYSFCGTVEYMAPEGSA 

LMSPGGCFRAKLGMPQFLSTEAQSLLRALFKRNPANRIiGSGPDGAEE 

IKRHVFYSTIDWNKLYRREIKPPFKPAVAQPDDTFYFDTEFTSRTPK 

DSPGIPPSAGAHQLFRGFSFVATGLMEDDGKPRAPQAPLHSWQQLH 

GKlHjVFSDGYVVKETIGVGSYSECKRCrVHKATNMEYAV^ 

PS EE I E I LLRYGQHPNI I TLKDVYDDGKHVYLVTELMRGGELLDKIL 

RQKFFSEREASFVLHTIGKTVEYLHSQGEWKWALRPNYLRSCWLGRY 

PL S LRG SNSDS P S QLE F P SNKC S APLRG S ERG VGG AAAAGAVGTRAE 

RHKN 

8010 

A 

12 

452 

SEPP*ITSVDPRVRGNASTGYGKIWLDDVSCDGDESDLWSCRNSGWG 
NNDCSHSEDVGVICSDASDMELRLVGGSSRCAGKVEVNVQGAVGILC 
ANGWGMNIAEWCRQLECGSAIRVSREPHFTERTLHILMSNSGCTGG 
EASLW 

8011 

A 

121 

342 

QNESWHMKTFLAVFICLPLKIRQIKCIYIFYN*ELKLYSVILKYFFL 
YDFRPVCLVLSEIPFMSTWFLLVSTFR 

8012 

A 

3 

385 

EKGPLS PKVLFNHLCQKAPRFKDFQQQDSQELLHYIiLDAGRTE * TKR 
IQASILKAFNNPTTKTADDETRKKVKTYGKEGV1CMNFIDRIFIGELT 
STVMCEECiUaiSTVKDPFIDISLPIIEERVSKP 

8013 

A 

18 

280 

KCRSRN*TA*FHRIJ^AKALSLLTLIjNVYQKKHLVEILSYHNroSQTR 
NAPELDCL IRLQAQN I QQRH I VFLTGKE S TPKLSTMN I GM 

8014 

A 

3 

271 

FFLRQSHSVT\RLECTGAI * AHCNLCLPGSSDSPASAS * VAGITGMC 
HHARLIFVFLVETGFHHVGQAVLELL/NLMICPPQPPKVIiGLQA 

8015 

A 

609 

921 

FYLSEEAEFFFFFFETES/HLLSPRLECSGMMSAHCNIiRLPGSSDSP 
ASAS * VAG ITGGHHPAQIjI FVAF/MYQPGFCHVGQAGQQIjtiA\ * VIC 
LPQPPKVLGITGA 

8016 

A 

14 0 

347 

FPPICYLLEIGSHSWQAGVQWHRHSAQYP* IHEHKQSSCVSLMSS* j 
HYSHVPPSPGNILLL*RRGSCV 

8017 

A 

2 

349 

AGVPPGNPPLWGGEGGGSPRGGGLKPGFPQRGNPFFFKKSQPTPPGG 
GPPLIPPPWGGGAGGSPLPQGQRFQ*TKIGPFPSPRGKKKKPPFPKK 
KKKKKKRKEKSEMPGFMVLNA 

8018 

C 

282 

497 

MGSFQSHHTKWQSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^ 
XXXXXXXXXXXXXXPGGGPGAQFPL 

8019 

C 

267 

473 

MGS FQSHHTKWQSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXTGGGPAP I 

8020 

c 

435 

821 

MKSRVIGWASGLKNEIFFFFFFFFVmDRYRT^LPRLVLNIiLFLIEPP 
VSILVKHWDYRHEPPPPRXRLPFCMMRLEGPHCFRDFATIYKKQSVR 
TVAGLFS CL I KFS AHTNLDQR I WDLNP ISGYFPN* 

8021 

A 

121 

818 

H S KGSQVQS PGPEPS S AGPKPGAQPQPTWGLAAPEEGREGGKCGRHC 
RNSKPLSQPCAGVWSGRGAEEGGAGPDHPARPTGPPSSPSCSAGLPD 
L* *GLKPPLEASSMDPGFVLEGGPRVREGVCQDGI * A*ELEEVPSPG 
GS ATSGAATLSDRRGGNGASQARLP*PSHWLPRAK*APQG* AS PGSP 
TACPAHPHSWKGGSAGPGMGRGRWGDRGGAWGTGSHCNPTPSS 

8022 

A 

11 

369 

SFTCCFQMHTKLLCLNTLCNTFLSTGAFFLIVLTWLSPIAVRLSLNV 
ISERPFLFILSVVDPCAHILYLSLLFVS*VXiPQYVIALFVPHKRM*E 
TMYACL PVCP * LLTACLEHNCS I F I 

8023 

A 

3247 

5031 

EGLRGTALLLTGGGRGP * GQPQQ/ PGGPAS S PS PAAPGPAAAEPACS 
TALWPTVSLRAPSPCPPQRAGQGYGSWG/GEPG/NSMASGGDG/EPL 
SQGLAPAGTPGLARLQGPGHWAVSLWPSLRGFPGGPQTGQSPLSQPH 
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SALTGRGR S PGAI/GP PP PQRPHQR PAPDPARPQR * TPLTAGATTRPQ 
HLPQGNCRCSLLLLFLLLLFFLLLFLFLIiFFFLILFFLFLIFLLLLL 
FLLPISLRLNRSQWDLGGGBQHVREAHPPDPVEQPRPFSVIGEVSFE 
VLVGQTQRHSSWEAIHLKWGRHGEHLGLEVIGPLPNPRKAWMPHLLQ 
LGPRDAEHRGHE S WAS S EGRVAPVGR PHRAALS RAAALNEGTGGAR 

GAAAAGVRVMLRLQRGGGEHGGRQSSQGAVPREGVPRESVLLQLSLW 
ACGVRLSPSRAGSRVQG 

8024 

A 

87 

569 

KQPPPHGESAAWAATSRPGWWLLCRPAEGRTPGWTAFDVHPTLMQKL 
PEHSGNFSGQSPPCQIiEALPRPSTCPLGSSRGSGREGPLPPT*ELFP 
NGQSVRCQVGGPGYDSWCPGPGPAPCPPTEP*M*GPQALTSAPGPNK 
HPGLELGGRRGPERAPGPF 

8025 

A 

2 

348 

KALACPPTCC VLG VHHAVS AGGF PGRRPLVRRH I APAGHQVTLS FLH 
PTLCCHPKSNCTPCGSRRAVPTPTPLPAPGPQAFCPVPGSHWLAWSP 
RGRSPPTSVAPTQPY*QPPSC 

8026 

A 

3 

361 

HEGTVQQPQARGLYEDLLMSEGENDEDDAGSDDNGDNIFSAIHLSES 
GRI)SDA*FGGIRPKQSRMLKENTRMVMDNEERIjLFyEGDCGEASHGL 
BDSCLSYGSYEDPDPKWNTQNTSFC 

8027 

A 

2 

424 

LFW S PQf QREPTMVLS PADKTNVKAAWGKVGAHAGE YGAEALERHFD 
L S HG S AQ VKGHG KKVAD AL TN AVAHVDDM PNAL SAL S DLHAHKLR VD 
PVNFKLSHCLLVTLAAHIjPAEFTPAVHASLDKFLASVSTVXTSKYR 

8028 

A 

3 

456 

PQTQREPTMVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPT 
TKNYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSD 
LHAHKLRVD\ PVNFKLLSHCLLVTLAAHLPAEFTPAVHA\ SLDKILA 
SVSTVLTSKYR 

8029 

A 

3 

198 

RQGSCSVTQPGVQWNNLGLIi*P*PPGIiKQSSYLSSPSSWDYRCVPTR 
LANSYFKFI*YVFFCLF 

8030 

A 

1 

346 

LFAQPNPVTFKIRIYCMSKDIIDKVRTmPPEWEKICAiraVYNKGLGS 
RI YKVLLLB* KNIVRLNQKANAQFYAG YKLRCYRKFSNKS IVNE* KV 
LLFMR*EYIPVSTHFICCVIL 

8031 

A 

60 

525 

KLPLKAKMGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIE 
KFE KEAAE AD CAVL. I VAAG VG E FEAG I CKNG QTREHALLAYTLGVKQ 

LIVGVNK24DSTEPPYSQKRYEEIVKEVSTYIKKIGYNPDTVAFVPIS 
GWNGDNMLEPSAN 

8032 

B 

68 

679 

MGKEKXHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVI^KLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRD F I KNM I TGTSQADCAVIj I VAAG VGEFEAG I S KNGQTREHAIJjAY 

TLG\nCQLIVGVNKMDSTGIAILTGDFPSLEPRHVSTWLQHVVTIPTR 
NWHKCYCVGWANFLN* 

8033 

A 

53 

1497 

KLPLKAKMGKEKTYINL\AVIGHVDSGKSTTTGHLIYKCGGIDKRTI 
EKFEKEAAEMGKG S FKYAWVXDKLKAERERG ITIDISLW\ KFETSKY 
YVTIIGAPGHRDFIKNMITGTSQTVDWCCPEIjLLLGVGEFEAGISKNG 
QT\REHALLAYTLGVKQIjI\AGANKMDSTEPPYSQKRYEEIVKEV\S 

tyikkigy\npatvaf\vpisgwng\dnmlep\sanmpwfkgwkvs\ 
rki)wqcqlgttllealdcilpptrptdkllriafq\da*qnggigtv 
p\ vg\rvetgvlkpg\mgvtf\ apvnvyngk* ksvemhheafecklf 
pg\dkcgafnvk\nvsvkdv\rrgnvagdskndppmea\agftaqvi 
iiinhsgqisagyapvldchtahiackfaelkekidrrsgkkledgpk 
flksgdaaivdmvpgkpmcvesfsdypplgcfavrdmrqtvavgvik 

AVDKKAAGAGKVTKSAQKAQKAK 

8034 

A 

3 

309 

RFTGHYRMLSGEKPYEWIEWGKRLSSNTALTQHQRIHTGENPFECKE 

CGKAFNQKITLIQHQRVHTGEKPYEC*ACGKTFRWGGRFILHQNLPT 
QKTPVQA 

8035 

A 

3 

366 

RBSVVEILFEQDNKEQSVATLILDSLIQCPIDTRKQIAENLEMMFDV 
GKTQPPLLKRAFSTEK* KFD* /TINLAS YQIFNQL* ANCTKYVGCFV 
I EDYSGS ES ILCLLFAL I YNS FDFVSLV 

8036 

C 

310 

546 

MGIMSVPPHPQNVDYPLDGEKILHILGFNPEIQLWEILFEQDNGRAI 


I04.M7 
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SWPLLILGFPLYS VPIGHQGRQLGRRNLGRS * 

8037 

A 

97 

333 

HRIRPICSPQYMEKVYVTKNNNYRILEIDMNFFPPPETEGVGGQVLB 
VMPHEELKPSIFFFF*DRVSLCHPGWSAWQS 

8038 

A 

250 

378 

TESLYLNTIKATYEKFTANITLNNEMLKAFSLRLDTR*EHSI*S 

8039 

A 

280 

759 

GYINNGGRKRRTELKPTEPRSCRAEGGGVEGGPPACPPPTNCLQRLQ 
PHLG S CTE I S GQERE * G Y I NNGGRKRRTE LKPTE PRS CRAEGGGVEG 
GPPACPPPTNCLQRLQPHLGSCTEISGQERESGGGEGNSGPSNNRVH 
GFSKAGHGLGVGEEEEERQLWL*TGRKQRPNIPIRRLPQASLHLPPA 
SPCQPAVPGALSSPRRWPQPPASHSCPSPPRHTKS 

8040 

A 

1 

401 

ILVLDPPPQPPKLLSVKGNPEDFIFRQFFFSPYPLFPFPPFSY7UCPP 
VSX^TPFPPHPRFSKLLSQGPADMPPLPGSLP*LLHYSACMLHHLLF 
CSIAHCAFRLLSCQGCQYLHGLMHMFRSGHMHIYAAGRV 

8041 

A 

32 

379 

CVYTFIC*EISGLFPVWG*YRK*GSSSSSSSSSSSSSSSSHFNKYSS 
CMFNFF *NC * TVLQSGCTI LLLHHWCMRVLS PP * CQQHLAVFFS I I A 
ILLSV*LHLTVIFICIPQITS 

8042 

A 

30 

436 

DGS S LCRPGWS AEQDCVS S GGEKKKKGRDNI EDEACS SRPSR * LFEE 
NIHLVHALIEEDQIANIIDISAGSAYTILDEKLTLSKRFTRWVPKPIi 
PSDQLQTRAEFUyiEILNGWDQDPEAFIiQRIVTGDET*FYQR 

8043 

A 

3 

342 

IPKPMAKLPFMKYRDLHTNLLYSRMETGPKHIPMKTLLKIMEDIALG 
MDNLSLMNYLYLELTARNCMERDDMTDCEADFGLSRKRYSGENYLHC 
RIANRPV*GSAIESLAHR 

8044 

A 

3 

380 

L I KKHVD FDLATNVQEE KI AALED F ADQL I TAGHYAKGD I FS RRNEV 
LDRWRRVKTQM I E KRA * V * KSQTLQQFNRDVDENEAW I S EKLHTASD 
DSYKDPTNIQLCKLLSKHQKHQAF*TELPAT 

8045 

A 

169 

375 

SSSSLPLHITNNPLHQQPLPPQLPAYTGAEGDPCALLS*YCLIHIjIjN 
TEAVNNI ITFLGKQPCKRYDKV 

8046 

A 

27 

289 

RS PKTHGGGRRWGMVRPGTAPT/ PCCPHRPQLP/CQPLLGPPE * SLL 
EGRSSSLDSGMEESQTHAAPACRTWCSDGPR*PWMGPAARCSG 

8047 

A 

3 

284 

LFLWRRQLTLTENPRWWPTIiGDGQARDCSDACCPHRPQIjPSSHCSQT 
KSSPIPEVPRPPKHR*GGVCAGPSHLPGGPGSSPSPPCPWPGSGL.F 

8048 

A 

459 

1020 

EPPRGAEQGGVPGGKGPETSAAPAPTPAREPERVPGGA\ ppapecac 1 
AELVAAPAPAGGSIA PAGPRPWERPRWAAGAP/ QGDKGPGEAATSPA 
/RGDSGVEPRGGGVPLPRHAGPQARRCEPTLPHDLYSGQVQPEHRAA 
G/LHPRSSGTI*APCTTCR\AA*V*DSSIPESREELRPSKKRSFRRS 
EKEK 

8049 

A 

3 

419 

VTCGKAFAQKAQL IVHQ I IHTGKKPYDCGACGKDFREKVHLVRHQRT 
HTGEKPYDRSERGKAFSHKSDLIIHHRVHTGEKPYECCECGIAFSQK 
S PL I IHCRIHTGERP YECI KCGKAFYHK* QLS I AHRAHTGD I AG 

8050 

A 

29 

223 

L I KKLAS CGGEHLYS * LLKRLRWEN * S SLG V * GCGEL * S QHCTPTWV 
ME * G PAQKHKNKTKKLL 

8051 

A 

194 

373 

ICQPLSLSSTANHPVLCLCFTHKNLHIFIA7VLFIIAKSWKQP*CTSV 
DEWIWLGLAPLY 

8052 

A 

1 

325 

FFFFF * DRVS LLLPKLECNGTI S AHCNLRLPGS SDS PAS AS SSFFTI 
HVAPLPQLRHY * I FYDGTRS LYAG * YS QNLL I YS APRQPQ YHFKDMN 
KILDSIQRSYTKI 

8053 

A 

228 

481 

QFFRNT I FF *DRVSLLLPK1»ECNS AI S AHHNLCLPGS SD S PAS ASRV 
AGITGTCHHTRLL\FVFLVETGFYHVSQSGLELI.TSGD 


A 
n 

X / D 


ljJLVlu^Fr JjK. w £>WKIF*MSK* *VMI*IH*GFVFIjFVC\FF*DRVSLL»LP 
RLECNGPISAHRNIiHLPDSSDSPASAYIKGFVRQLSHEE 

8055 

A 

257 

453 

TKGGGYTQRTAIQFILFIYLFIY*DGVSLLLPRLE*NGAISAHCNI>H 
LPGSSDSQKKTKKNFCTQ 

8056 

A 

277 

1230 

ISFHLSTFGAPSFFFFFEMEFSLLLPRLECNGAISAHRNLRLPGSSD 
S PAS AS PVGWDYRHVHPRS ANFVFFF SRDGVS PCWSGLVSNS RPQM I 
PPSRPPK\njDTGLATMPGLCIiANFCGRNRVSLMCPSWSPELKQSTCL 
SLPKCWDYRRAAVPGLFILFFLRHRCPTLTQDEVQWCDHSSLQPSTP 
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EI KHPPASASQSS WDQRHAPJLHLANFYFYF * FFETES\ HSVT\RLEC 
SGA I LAHCNLCLPG S S YS PAPAS * VAGTTGAH / RRLANFFVFL VEMG 
FHHVRQVDARSLDLVICLPRPPKVLGLQDVSHHRPAYF 

an ci 

TV 

A 

1 

467 

FFFLF*EGVSLLLPRLECSGAISA/HCNLHLPGSSDSPASASQEDGI 
TGVKYHAWIilL/ VFLVE IGFHHVGQADLKP * PQVIHPPLFFFFLRQS 
FALVAQAGVQWCDLSSLQTPPPRVQGILIiAQPPEYLVAGFTGMRHHT 

T? T . T? T? 1? TT 7A T?T .\Tf? TV2 

A.ijf JC IT r JnE Jj v i!# 1 vjr xiirV_ 

8058 

A 

274 

321 

EEEEEEEEEEEEEDQE 

8059 

C 

69 

254 

MDHLYNXXXXXXXXXXXXXXXXXXXXXXXXXEIQFXXXXERERNKNP 
FPAGDD I I S RG VGQ * 

8060 

A 

32 

169 

NPVPPYPLCPALVFFLLLIYLLIY*PSPLLRMSAPAGKGFLFLSL 

8061 

A 

417 

490 

GDFFFFLRDRVLLCHPGWNAVVQS 

8062 

A 

424 

579 

RQVNMAEGIIHDTETKMEEFKDQLPADECNKIjEEEISKI^ELIiARKD 
SETGE 

8063 

A 

1 

2157 

- 

GFQKRCRHRIAQLFAVGALVCCLVLLHLSAMISASRAAAARLVGAAA 
SRGPTAARHQDSWNGLSHEAFRLVSRRDYASEAIKGAWGIDLGTTNT 
SCVAVMEGKQAKVLENAEGARTTPSWAFTADGERLVGMPAKRQAVT 
NPNNTFYATKRL IGRRYDDPEVQKD I KNVPFKI VRASNGDAWVEAHG 
KLYS P SQ IGAFVLMKMKETAENYLGHTAKNAVI TVPAYFNDSQRQAT 
KD AGQ I S GLNVXiRV I ME PTAAALA YGLD KS EDKV I AVYDLGGGTFD I 
SILEIQKGVFEVKSTNGDTFLGGEDFDQALLRHIVKEFKRETGVDLT 
KDNMALQRVREAAEKAKCELS S S VQTD INLP YLTMDSSGPKHLNMKL 
TRAQFEG I VTDL I RRT I APCQKAMQDAEVS KSD I GE VI LVGGMTRMP 
KVQQTVQDLFGRAPS KAVNPDEAVAIGAAI QGGVLAGDVTDVLLLDV 
TPLSLGIETLGGVFTKLINRNTTIPTKKSQVFSTAADGQTQVEIKVC 
QGEREMAGDNKLLGQFTLIGIPPAPRGVPQIEVTFDIDANGIVHVSA 
KDKGTGREQQIV\ IQSSGGLSKDDIENMVKNAEKYAEEDRRKKERVE 
AVNMAEG I IHDTETKME E FKDQLPC/ ADECNKLKEE I S KMRELL \ AR 
KD S ETGEN I R\ Q AA S S L \ QQAS \ L KLFRNGLTKKMG S * AEEGSG \ S S 
GHWGNKREDSKGREKPVN 

8064 

A 

500 

753 

NGVLAPAVQAGVQWRDPGSPPPPPPGFKRFFCLSLLSSWDYSHAPPR 
PANFVFLVEMRFLHVGQAGLELPTSGGLPALASQSAG 

8065 

B 

1 

1713 

MAIiWRGSAYAGFLAIiAIiTWRDRKKKFPGDDEAPGV^ 

RQWLRKRVEVVCTGRSANTVCAGVRAAGLVEKSPPPSLSRVGRRFRF 

CGDLDCPDRVLAEISTIJVKMVECTGSTLGGGGYKKILKLTADAKF^ 

EHEGKKD IDS PTFLS SAS S WWS S S S AYGSRRPGGFELKL IGQQGES G 

DVKATVAVLSFILSGAAKHSVDGKSLASELQQLGLPKEHAASPCCCY 

EEKQSPLQKHLRVCSLRTWSLAARVAEGTAETVDPSAAPKTSVWSSR 

VCWGRLLAT I SPS VNETDTCGMEDRRLVVCRPWRAQCHMKAWGWRHP 

EEGGTHSGAPSAVLiQALAVAIQLGGHLADPLIiQVDPLS S CGAVSLDI 

LIYLVFYYRTASVPETYIVKTLFKKLESQSLIQSNVLTRSNSMKAER 

GEEAAKEKSEASRGWFMRFKERSCLHNIKMQGEAASADVEAAASYPK 

DLAKITDEHGYTKQQVRCIQNSLLLEGNAEEVRKIIVCSMSPPLKPG 

GP VTL VEE FLCAS KASVS AVSG W I S KWRNWQE IRRQEEREDRAS LPD 

LQASES* 

8066 

A 

1 

3053 

MRPGFWSVGMVEGVPGEPDPYPVLCYTPWPGHDLPASRAALAQGRS 
WSLHRMIGQDSGRRRSRRQHFAPGTSSGLRSAPGIjTRAGPAPPEAVS 
PSHVI VDSADLAGPEKE I PGPWLPRAMYEAPGVKRAWAAGAGMRGRQ 
W JL»K ivK V h, V V C 1 (j R b ANTVC AG VRAAGL VE KS P P P S LS RMGRRFRFCG 
DtiDC PDRVXAE I S TLAKM VE CTGS TLGGGG YKKI LKLTADAKFVEEH 
EGKKDIDS PTFLS S ASS WWS SSSAYGSRRPGGFELKLIGQQGESGDV 
KATVAVLSFILSGAAKHSVDGKSLASELQQLGLPKGAQAQPPPATSR 
ASPFYPQSTRPARAAVMRRSKAPLQKHLRVCSLRTWSL7UVRVAEGTA 
ETVDPSAAPKTSVWSSRVCWGRLLATISPSVNETDTCGMEDRRLGVQ 
AIJ\CWPHDGLGWRHPEEGGTHSGGSPLEHLEPKKLSLTFAIPGTRP 
LIMMKVDPNPLRGLKEQSEPILSPDARKGSSGDGEVQRNLVPAAVNP 
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YVM I FLPALFRVLVLAAVFGQLKE YQQRKS PG I PAGAKTKKKKTDS S 
PETTTSGGCHSPGDSQYQEIAVALESSSVTINQLNENIESLKQQKKQ 
VEHQtiEEAKKTNNEIHKAQMEQLETINILTLEKADLKTTLYHTKRAA 
RHFEEESKDLAGRLQYSLQRIQELERALCAVSTQQQEEDRGHCLSSP 
DQNFS L FT I QS S SCRE AVLQRRLQQT I KERALLNAHVTQVTES LKQV 
QLERDEYAKHIKGERARWQER1WKMSVEARTLKEEKKRDIHRIQELE 
RSLSELKNQMAEPPSLAPPAVTSVVEQLQDEAKHLRQE/VGRSGGKA 
PVPGGKQSGLES P * QGTKAETPGAGGDAPRAGGAES AGAGETV* TKR 
EASGAAEDATGAG*EAAKAGAEATQTGGEAAKGGGEAAKAGKEAVGP 
GGEAVEEGGEATKAGGEARALPEPQARQAAGRATVQLRGSEQREKER 
TAVGAAS KG AAGEARRGEGDAVHGHLS AADL 

8067 

A 

330 

452 

EADQYLLWGPEPFPPTPDPEPTRIiPLGESPAGADRAGGHP 

8068 

A 

3 

347 

PEWVPPSSGWRQPQAFMWHCARQGLHTTSLLSSMPQVSVSFTEGEMV 
AKS LPQHTRLDHTEVLGAAEGS TVS AVP S ATRAARDHVRRLQTRKCF 
CKGALLLL I TAARAGRVLWG 

8069 

A 

3 

1279 

TAEETSGTSAGRNNVI FFAYNMTAGFGGHSDAENNEKKS ALQLEQQV 
KELQEKLDEVKEMSQYQELAVALESSSVTINQLNENIESLLALVEHS 
VKVAALDVAEGGAASGAGLQVAEAEALPLTHTVSAFVFGPVGRTGVG 
PSEGPAGGRRGPFIYVDIFEGHLRAWEKGQGTVLLHNSVPVADGQP 
HEVSVH I NAHRLE I S VDQ YPTHTSNRGVLS YLEPRGS LLLGGLDAEA 
SCHLQEHRIjGLIPGATNVSLIjGCMEDLRVNGQRQGLREALLTRNMAA 
GCRLEEEEYEDDAYGHYEAFSTLAPEAWLAMELPEPCVPEPGLPPVF 
ANFTQLLT I S PLWAKGGTAWLEWWHVQPTLDLIGAELRKSQCSAPC 
GSGVQRRLVKCVNTQTGLPEEDSDQCGHEAWPESSRPCGTEDCEPIE 
SP 

8070 

A 

1 

496 

LLGVAPSRAFQEEIIiR/DRASFHE/ RPNLFAXiKHPTSKAECTAEKCY 
RVTKGRG I FPSGS PFKS VTLEDGKTF I PGQGNNAYVFPQVALGV IAG 
G IRHI PDE I FLLTAEQ I AQEVFEQHLSQGRL Y P PL S T I RDVSLR I AI 
KVLDYAYKHNLDSYTWPKEAMNVQTV 

8071 

A 

228 

448 

RAG I PTTRLFPVGVAA I AGAFTEQ ILRDMAS FHERP 1 1 FALSNPTSK 
AECTAEKCYRVTEVSGESFLPPAERT 

8072 

A 

3 

1877 

EGEDRGLPRTMGAAIiGTGTRLAPWPGRACGALPRWTPTAPAQGCHSK 
PGPARPVPLKKRG YDVTRNPHLNKGMAFTLEERLQLG IHGLI PPCFL 
SQDVQLLRIMRYYERQQSDLDKYIILMTLQDRNEKLFYRVLTSDVEK 
FMPIVYTPTVGLACQHYGLTFRRPRGLFITIHDKGHI^ATMLNSWPED 
NIKAVWTDGER ILGLGDLGCYGMGI PVGKLAIiYTACGGVNPQQCLP 
VLLDVGTNNEELLRDPLYIGLKHQRVHGKAYDDLLDEFMQAVTDKFG 
INCLIQFEDFANANAFRLLNKYRNKYCMFNDDIQGTASVAVAGIU^A 
LRITmTKLSNHVFVFQGAGEAAMGIAHLLVMALEKEGVPKAEATRKI 
WMVDSKGLIVKGRSHLNHEKEMFAQD\HPESNSIJDEVVRLVXPTAII 
GVAAIAEA\FTEQILRNMASF\RRAPIIFALSNPPRKAECTA\EKCY 
RVTEGPRGFFASG\SPF*GVLIWEMGKTFIPGGRGNNA\YVFPG\VA 
TG\VIA\GGIRHIP\DEIFLLDSRAKLPQEVSEQHL\SQGRLYPPLS 
TIRDVSLRIAIKVLDYAYKHNLV\SYYPEPKDKEAFCKIPGSYTPDY 
DSFYT/VDSYIWAQGKAMNVQTV 

8073 

A 

325 

587 

VGHIILLRGTLGFPWFYTSWAELLRFLLFSGIGVGGVREQGLTLSPR 
LKSRGV ITAHCNLELLGS S S P PAS AFQG AR I TGVHHDPAN 

8074 

A 

124 

343 

NHFQHD FPTTSDQQHC I SNQPS PLLHPHPS QYQCQAAYPHPPHCNAH 
GTGAKLSPSPRPPQLPPQGPLTRISC 

8075 

A 

32 

487 

S RRHG S S L WGKVNVEDAGGETLGRLL VVY P WTQRFFDS FGN1»S S AS A 
IMGNPKVKAHGKKVLTSLGDAIKHIiDDLKGTFAQLSELHCDKLHVDP 
ENFKLLGNVLVTVLAIHFGKEFTPEVQASW/QEDGDWSGQCPVLQIP 
IiSSLPMMQSFQG 

8076 

A 

3 

573 

FCPRGQEFGEGNKLLVPDAHGVISQRRTKATI\TSLWGK\VNVE\DA 
GGET\LGRL\LWYP\WT\QRFFD\SFGNLSSASAI\MGNP\KVKAH 
G\KKVTjTILGRCQQSTLDDLKGTFAQL\SELHCDK\liHVDPENFKLL 
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G\NVLVTVL\AIHF\GKDFTPEGCRASW\QKMG0\FSGQCPV\LQ\I 
PLKPLGP * IQ\ S FQG 

8077 

A 

2 

347 

ARGRKI QTHMPKDHSD I PD INRAH I INDAFNIiTSAHTVPDTLAL INT 
LFLIDQRQYMPWEAVLSSLSYFKLMFDRSHVYSPMNNYLNTLVTPLF 
IHFTNNT INWME I PEHLMDHY 

8078 

C 

213 

263 

MGFHHVSQDCLDLLTS * 

8079 

C 

637 

1452 

MGIAAASVLGDLAWFGMHVPAQADGQDHPQELDHGHTHAHAEDDAQ 
VLLQPRLHLLHAALLVHRPVLRVVATLGPVAGGIJUBALPHRKAVVLF 
PLDFAGTGVGDLRAIHELSWCPWEADPAAALHLSLHWPGILQAV 
LATAVWSNELSHDATHLLLQLAHVEVEEG PGRHLGNEDQEQES KVQ 
PQQ AADLVDG ADAAQEAHEHSDCPHADED I GPHPKRAGGGLQDRDET 
ALVGQDPHPEAQDHCAQDKEDQVEEEEKVFGDFDTG* 

8080 

A 

226 

314 

KWNSRPGWAHTCNPSTLGGRGGQITRSG 

8081 

A 

2 

350 

KTLL F FFF FR FE PHNLLTS M I CGL VYNQP P PQHL VH I LLKRR I W PQG 
NILGINKKRPGTVAHTCNPSTLGGRGGRITRSGVRD*PGQHSESPSL 
PTIQKNYPGVMAGACNPSYSGG 

8082 

A 

172 

287 

IVYASEEKI*TFQEQGWPGAVAHTCNPSTLGGRGGRITRSGDRDHPG 
*HRPP 

8083 

A 

14731 

15032 

IAGCRPGAVAHSCNPSTLGGRGGQI / TLRSGV*DQPGQYGETPSLLK 
IQKLGRRGGRRLKSQLLREAEAGESLEPGRQFAVSRflRATAIiQPGNR 
ARLHLKK 

8084 

A 

1 

193 

GTSFYLYRKGYLSLSKWPFSHYAGTLLLLLARVACLLGIVRWAYPH 
FPQFLAISSPIHLYLTS 

8085 

A 

1 

346 

VAAEI PALNWNQNGLCLRPGHTGPLESPAS PRPPGCGDRPATDFLAR 
GVAG CHGRERQPD I GGHGAARP PQS KS PWGCVS ANCGAGHS CRLSGC 
RALPCLTPGSWSWRKDSQISC 

8086 

A 

3 

522 

LTDDEMRRLNIPVLQDDGFLFLWVTGRAMELGRECIiNLWGYERVDEI 
I WVKTNQLQR 1 1 RTGRTGHWLNHGKEHCLVGVKGNPQGFNQGLDCDV 
IVAEVRSTSHKPDEIYGMIERLSPGTRTIELFGRPHNVQPNWITLGN 
QLDG IHLLDPD WARFKQR YPDG IIS KPKNL 

8087 

A 

1 

840 

HMELPYGTLTDDEhlRRIjNIPVLQDDGFLFLWA/TGRAMELGRECIjNLW 
GYERVDEI IWVKTNQLQRI IRTGRTGHWLNHGKEHCLVS SSGAQFNR 
WS TKKNHL I S Y * EKVQRCG F I R * S VI LW * AG \ G VKENPQGFNQGLDL 
* CDRS *GMCFSGLQSFHI FVGYQLFMVGC IL I PRFFLI * I INI IGKG 
LGIAILCNLKKQLVFIS*VRSTSHKPDEIYG\MIERLSPG\TRKIEL 
FGRPHNVQPNWI TLGNQLDGI HI»LDPD WARFKQR YPDG 1 1 SKPKNI* 

8088 

A 

13 

323 

AESTCPYYNGTPGSGRRPPVPLPPFLWGGPLSSLRSHYQHPSPYCRL 
AGSGNADGWPVPAVPDPGIRWHAQAIiRHSSPCLTLQPQLLYPLIIiWQ 
ATYARbALA 

8089 

A 

3 

218 

QEGQKRDVCEHHGKKLQI FCKEDGKVI CWVCELSQEHQGHQTFRINE 
WKECQVGSKIEEERAESRRWYLM 

8090 

A 

3 

332 

HEVRFFWDCRPAEQYGAGHLS TAFHLDSDLMLQNPS E FAQS VKS LL 
EAQKQS IESGS IAGGEPILCFMGSGREEEDMYMNMVIAHFIiQKNTEYV 
SIASGGFMALQQHLA 

8091 

A 

3 

334 

HEDEFKPLVGDPHNLMDPNRELFDQIiGEYKFQNALLFHYTKTVPHVS 
TPTLVEVSRNLRKVGSTCWKHPESKRMPCAEYYLSAVXIHLCVLHEN 
TLFRDRVIKCCTESLV 

8092 

A 

1 

1431 

hfflKAGLLGLCARAWNSWMASSGOTRRDPLANKVALVTASTDGIGFA 
IARRLAQDGAH\AA^SSRKQQNVDQAVATLQGEGIiSVTGTVCHVGKAE 
DRERLVATAVKLHGG I D I LVSNAAVNPFFGS I MDVTEE VWDKTLD IN 
VKAPALMTKAWPEMEKRGGGSWIVSSIAAFSPSPGFS PYNVSKTA 
LLGLNNTLAIEIiAPRNIRVNC/LAPGLIKTSFSRM\liGEPEDCAGIV 
SFLCSEDASYITGE1WNLSVMFTGGGVCRAASWKEGGTGTPRTPRE 
SPRQREPGETSSTDTQENKVWNGLPANPQRPAAEGPVRRKTNKQKGI 
ASTSAKDS INIRTGKDIHTKTPS IGHQHQRPKVDKTTKMERNQSKKA 
ETSRNQNVSSLPKEYKSSPAREQNWMENKFDDLTDVSFRRSVITNYT 
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QLKEHVLTHCKEAKNIiDKMLNEWLTRM 
HEGYTSFNS 

8093 

A 

195 

434 

TSVPSCVRCRYIILRSSSALTNILGTHSNSSFFHASSSALHSCCFPF 
F S WLQTLD I NVKAPALMTKAVVPEMEKRG YRE 

8094 

A 

43 

1601 

SMHKAGLLGLCARAWNSVRMASSGMTRRDPLANKVAIjVTASTDGIGF 
AIARRLAQDGAHVWSSRKQQNVDQAVATLQGEGLSVTGTVCHVGKA 
EDR\ ERLVATAVKLHGG ID ILVSNAAVNPFFGS IMDVTEEVWDKTLD 
INVKAPALMTKAWPEMEKRGGGSWIVSSIAAFSPSPGFSPYNVSK 
TVLLGLTKTLAI ELAPRNI RVNCLAPGL I KTS F\ S RMLW\ MDKEKEE 
SMKETLR\ IRRLGEPEDCAGIVSFLCSEDASYITGETVVNLSVMFTG 
GGVCRAASWKEGGTGTPRTPRESPRQREPGENQARSSPRMALRPEFP 
PESASTSWMGPEPVGNRERPCKPARKVGASPRALRRLGSATRMLSRA 
ALRAGVGARAPRPGGGRGAHATATVWSGLASAAALTARALLKRSIiAC 
AAAPQRCGWRRLLCSSLGSRGQGGQAHRGAHLRWRYGTASVFGTPR 

LVASQIQHVPSGPAPRPGASHPATPSIQTLRQPLHLSPDIAVPGASS 
AGAGA 

8095 

A 

2 

172 

ARDNQHDLQEWEEHLQEHTNVTWGNPTIKPHSLPSHTVWTWCFSCF 
HTPCVAYRG 

8096 

A 

3 

825 

KPCMMDTTGMSCSDHVTVDSPYVVAVRIADETS INTRSFSKFVGMSA 
DATPLHVARSGNDTEDFSNPSACSDIYGESISSHFTESTGKLIESLS 
DCASSLPIKKIAGSNYNTFLDSELSMLNGTKVIiSDSAVGIDLGSTGD 
TTEAIiHELRTAEEFKTEEQDASGSIEFGVSFPDRESSSMETSIEPKA 
TETSRTEGITAIEESWESMFNDDGDCLDPRLLQELSGNTKSRESIQE 
PRSDYYNHEVPDIDLSDCEFPHVTEIYDFPQEFRTEDL 

8097 

A 

2 

469 

TVLCATGGGAYKFEKDFRTIGNLHIiHKLDELDCLVKGLLYIDSVSTO 
GQAECYYFANASEPERCQKMPFNLDDPYPLLWNIGSGVSILAVHSK 
DNYKRVTGTSLGGGTFLGLCSLLTGCESFEEAIiEMASKGDSTQADKIi 
VRDIYGGDYERFGL 

8098 

A 

1 

2694 

MTSEGVPSARAVI1ALYENDLKALRTLCSCPI1GFEGQRCEINPDDCED 
NDCENNATCVDGINNYVCICPPNYTGELCDEVIDHCVPELNLCQHEA 
KCIPLDKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGY 
TCTCPQGFSGPFCEHPPPMVLLQTSPCDQYECQNGAQCIWQQEPTC 
RCPPGFAGPRCEKLITVNFVGKDSYVELASAKVRPQANISLQVATDK 
DNGILLYKGDNDPLALELYQGHVRIjVYDSLSSPPTTVYSVBTVNDGQ 
FHSVELVTLNQTLNLVVDKGTPKSIiGKLQKQPAVGINSPLYLGAFPW 
FGMDIGGTLVKLS YFEP ID ITAEEEQEEVESLKS I RKYLTSNVAYGS 
TGIRDVHLELKDLTLFGRRGNLHFIRFPTQDLPTFIQMGRDKNFSTL 
QTVLCATGGGAYKFEKDFRTIGNLHLHKLDELDCLVKGLLYIDSVSF 
NGQAECYYFANASEPERCQKMPFNLDDPYPLLWNIGSGVSILAVHS 
KDNYKRVTGTSLLTGCESFEEALEMASKGDSTQADKLVRDIYGGDYE 
RFGLPGWAVASSFGNMIYTOSKRESVSKEDIJUUVTLVTITNNIGSVAR 
MCAVNEKINRWFVGNFLRVNTL SMKLLAYAIjDYWS KGQLKAL FLEH 
EFGYNIGVISAPETVIQDFLSYTLKERSEDSPSEVIiLMSPWPSSVAI 
FPNGDMIGSFLLHSLSTTLAAQNLLKLISNFSKVSGYKINAQKSQAF 
LYTNKRQTESQIMSELPFTIASKTIKYIiGIQLTRDVKDIjFKENYKPL 
LNEIKEDTNKWKNIPCSWVGRINIVK>1AILPKVIYRFNAIPIKLPMT 
FFRELEKTTLKFT^QKRAHITKSILSQKNKAGGITLPDFKLYYKAT 
VTKTA 

8099 

A 

346 

461 


8100 

A 

3 

1560 

SSLWRRQPELKAYRTMEYMAESTDRSPGHILCCECGVPISPNPANIC 
VACLRSKVDISQGIPKQVSISFCKQCQRYFQPPGTWIQCALESRELL 
ALCLKKIKAPLSKVRLVDAGFVWTEPHSKRLKVKLTIQKEVMNGAIL 
CK3VFVVDYWQSQMCGDGHESEKLKDFWK\AVIQVRQKTIiHKKTFYY 
LEQL I LKYGMHQNTLR I KE I HDG LDF YY S S KQHAQKMVE FLQCTVPC 
RYKASQRL I S QD IHSNTYNYKSTFS VE I VP I CKDNWCLS PKLAQSL 
GNMNQICVCIRVTSAIHLIDPNTLQVADIDGSTFWSHPFNSLCHPKQ 
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LEEFIVIECSIVQDIKRAAGAGMISKKHTLGEVWVQKTSEMNTDKQY 
FCRTHLGHLI^PGDLVIiGFDIjANCZNIjNDEHVNKMNSDRVPDVVIj I KK 
SYDRTKRQRRRNWKLKELARERENMDTDDERQYQDFLEDLEEDEAIR 
KNVNI YRDSAIPVESDTDDEGAPRI SLAEMLEDLHI SQDATGEEGAS 
MLT 

8101 

A 

1 

171 

GTSISKNRLLSIKTFLYWTILGFSHAFIFFFGSYLLIGKDTSLLGNG 
QMFRNCTFDA 

8102 

A 

3 

231 

RRQAQI AAGRVLWALFGGCGGLHSRLDGAAI / CLPGHCHGSRVEVT 
YETHQCGLVGLKLLRGGREQSDRGGSLGLT 

8103 

A 

48 

532 

NLMNICRGPCLLPGAPFMRDLAAVA/SVGLP/LVRRQAQVAAGRVIiV 
VALFEGCGGLHS\RLDGAAICLPGHCHGSRVEVTYEAPQCGLVGLKL 
LRGGREQSDEGGSLGLT*DAQLGPGAEHPEIRAAWE*TAVIISLVLS 
LEPRDGQGGRVARLGAREAIGNP 

8104 

A 

3 

421 

AAEASVGL/PLLRRQAQIAAGRVL\WALFGGCGGLHSRLDGAAICL 
PGHCHGSRVEVTYETHQCGLVGLKLLRGGREQSDRGGSLGLT*DGQL 
GPS AEDYTISSCPHLTGIVGLIPGLDSADGQGGRVPRLG ARES VRDT 

8105 

A 

94 

582 

LSHPLPL\LGRQAQIAA\GRVLVVAIiLGGCGGLHSRLDGAAICLPGH 
CHGSRVEVTYETHQCGLVGLKLLRGGREQSDRGGSLGLT*DGQIiGPS 
AEHQS VS VPS LTVI I S L I LRLE VGDGEVAVS PRAGAREAIRNS APLV 
VASTTFKPHQVPRGLSCLLLMPRDWPTVSTAQSAGEPDRGSQGSRG 
RG 

8106 

A 

2 

332 

GSLQGLPGQSFSFGGLPLLGRQVRVAAGRVLV\VALFGGFGGLHSR\ 
LIXjAAICLPGHCHSSRVEVTDQTH*CGLVGLELLRGGREQSDSGVGL 
GLTQDGDLRSTSEDKVC 

8107 

C 

30 

653 

MTCSPLLLTLLIHCTGSWAHSVLTQPPSVSAAPGQQITISCSGSRLN 

VGNYFVSWYQQLPGTAPKIIiIYDNDKRPSGVPARFSGSKSGTSATLA 

ITGLQTGDEA1TYYCGAWDTSLDNWWGGGTEVTVIX3QPKAAPSV^ 

PPSSEELQANKATLVCLISDFYPGAVTVAWKADSSPVKAGVETTTPS 

KQSNNKYAASSYLSLTPEQ* 

8108 

A 

484 

632 

PWSHFRRF * LTMR I F I EGLL S AGRRA/ WM PWGGRRDPS PG I LQL * S P 
V\GGVRDENL*TFCRGHCLLHGAPFMRDLAAVAFVGLPLIiRRQAQVA 
AGRVLWALFGGFGGLHSRLDGAAICLPGHCHSSRVEVTDQTH*CGL 
VGLELLRGGREQSDSGVGLGLT*DGDLGPTSEDSKGATTSKRAAVII 
SLWSLEPRDCQGGRVGRHGAREAI INP 

8109 

B 

27 

1337 

MAWTPLLLLFPLLLHCTGSLSQPVLTQSSSASASLGSSVTLTCTLSS 
GHSDYIIAWHQQLPGKAPRSLMKVEGDGSYTCTGRGVPDRFSGSSSGA 
NRYLSISNFQSEDEADYYCQAWDSSTAHSDTGRCGIVGRPPVRQCWP 
LLLLGLAMVAHGLLRPMVAPQSGDPDPGASVGSSRSSLRSLWGRLLL 
QPSPQRADPRCWPRGFWSEPQSLCYVFGTGTKVTVLGQPKANPTVTL 
FPPS SEELQANKATLVCL I SDF YPGAVTVAWKADG S PVKAGVBTTKP 
SKQSNNKYAASSYLSLTPEQWKSHRSYSCQLQWGLGLGASQGGFLYE 
PVSQCVVFGGGTKLTVLGQPKAAPSVTLFP PS SEELQANKATLVCL I 
SDFYPGAVTVAWKADSSPVKAGVETTTPSKQSNNKYAASSYLSLTPE 
QWKQAQKLQLPET* 

8110 

C 

54 ; 

410 

MEGAHEEWAVSSLLPSPAGGGGCEPARREESSYRIKMTQXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAC 
RQQEDRLGQCCSLLHLQCLAQSRCH 

8 111 

A 

1 

175 

LS HFD S VTGFLS AGRLTPS TP S WRWKEDAHRPFLMQAI DGNVRQCE S 

8112 

A 

1815 

2017 

GQGAVAHT\CNPSTLGGQGGRITLRSGVQNQPDQHGETLSLLNISGV 
WWRAPVIAAFQEAETQESLEP 

8113 

A 

1109 

1482 

YQGLAAWQRKK^LRHMIEDRQGCHNHNHGKDQDQKGPPSLTPNPWA 
DGSRTQNFPS CRKMGPGAVAHACNPS I LGGQGGADHLRS GVRDQPGQ 
HGETPVSTKKKKKKKKKKKKKLAGHGGSRL 

8114 

A 

266 

400 

WEEEVCLLLLPNDMTVYLEHPKDSSKKLLELVNAFSKVSRYIV 

8115 

A 

6 

363 

PFAPAAGIRHEVNMGQKQEILDLFSSVLYECVTQETPEMLPAYIAMD 
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QAIRRIXSRREMSETSELWQIKLVLEFFSSRSHQERIiQNHPKRGLFMN 
s E plpwkct t DNTT.nnwT.nvnnn 

war ±jtr v v ix x x lj x xxxvWrVXJW V vrV7XV 

8116 

A 

688 

1127 

RELHYPVPSRGGRWNDLSSSGSGEKRRLISKPVCSQIKSSFINPQSA 
PDSSSNISAGGGGHGKPAOORLiPDREAC7.T J pPT.FPMPQT.nr , T?Pr2QPC 
CGEGR 

8117 

A 

2 

312 

LKGGPFSDSYRLFOFHFHWGSTITOHGSEHTVnnvicvPAPT wpwwmq 
AKYS S LAEADS KADGLAI I G AM KKDDKANP P LDHVQVG VRADRNF YG 
IEVLSDALI 

8118 

A 

2 

51 

QNASCFCWCCCCCRNW 

8119 

A 

203 

366 

1 E S PE ANS C IDGQLL FDGG I R I AHW AKDS S SSSSSSSSSSS TCRRMK 
LDPYHTP 

8120 

A 

152 

315 

IESPETNSCIDGQLLFDGGIRITHWAKDSSSSSSSSSSSSSTCRRIQ 
LDPYLTP 

8121 

A 

11 

381 

RLHRFFLGESEQPSPASSSSSSSSSLTPSQTRQHGPLRSIMKDLHSD 
DYEEESYEVDDxTOOTSI<x^ERPANR*RTRSRTVSLSDGSDSESRTSS* 
PLHYEPPPPLLYTNNNQILEVESPITHSL 

8122 

A 

124 

1924 

ITAEGGRRRGGYHTTPQQTSGPPG^SSCSVQVKLx^ 
TVISGFTHDWMVFVRGPEHSNIQHFVE^ 

YKVEESGYAGFILPIEVYFKmc^EPRK^FDYDLFXxHIiEGHPPViraL 
RCElOiTFxTOPTEDFRRKLLKAGGDPNRSIHTSSSSSSSSSSSSSSSS 
SSSSSSSSSSSSSSSSSSSSSSSSSSTSFSKPHKLMKEHKEKPSKDS 

MSKEPKPDSNLLTITSGQDKKAPSKRPPISDSEELSAKKRKKSSSEA 
LFKSFSSAJPPLILTCSADKXQIKDKSHVKMGK\nCIESETSEKKKSTL 
rrruui VxJx^x>liJoJ-lvx1»r»W ±ooJvoJJoiis2PoPAS \SSSSSSSSFHTIPGP 
GQQGPIi \ RS IMKDLHSDD \ NEEEADEAEDSDNDS EMERPV\NRGGSR 

SRRV\SLS1X3SDSESSSASSPLHHEPPPPLLKTNNNQDSWKWKSPIK 
OSK\ J5D*GKTjKTJf?PP*or!T.T*MT?TA/nn T?T , r!P'c , TT^r , 'PT7T7DtrTT r\c\y\TKt 

W*^»*>- \ "J" v»x\_LJj\xvvJXiV. Ve^lix J *Jdxj V Unf X bKr J.4JX* AJ&Xt»XvTl .xxj^^JLVIv 
L I E E TGH FH I TNTTFD FDL.C S "LD KTTVR KT ,H «3 VT . T?T Q fVTQ 

8123 

A 

2 

319 

VPGAGLGRVAWE IAMLG YACQGNEWS FFMLFS SNFVLNRCSE INKYK 
LYPWIHQFSNNRRSADQIRPIFFPDVDPHSLPPGSNFSMTAGDFQEI 
YSECSKSKSSV 

8124 

A 

1 

323 

VHCLGADLLE INNNPC YWGVlvlDKYAAEAliliEGKPEGTFIiLRDSAQED 
YLFS GS FRR YS RS LHAR I EQWNHNFS FDAHD PCVFHS PD I TGIxLEHY 
KDPSACMFFEPLI* 

8125 

A 

31 

118 

TFLAQNVQLCFPYYPSPGVLFPKKEPDDS 

8126 

A 

1 

4938 

MEKKI TG YTTVD I SQWHRKEHFEAFQS VAQCT YNQTVQLD I TAFLKT 
VKKNI<HI<FYPAFIHIIxARLIWA^ 

FHEQTETFSSLWSEYHDDFRQFLHIYSQDVACYGENLAYFPKGFIEN 
MFFVSANPWVSFTSFDL1>TVANMDNFF 

IQLPVSQPVPTIQGEPQIPVATQPSWPVHSGAHFLPVGQPLPTPLL 
PQYPVS QI PIS TPHVSTAQTGFSS LP I TMAAG I TQPLLTLAS S ATTA 
AIPGVSTWPSQLPTLLQPVTQLPSQVHPQLLQPAVQSMGIPANLGQ 
AAEVPLSSGDVLYQGFPPRLPPQYPGDSNIAPSSNVASVCIHSTVLS 
PPMPTEVLATPGYFPTVVQPYVESNIiLVPMGGVGGQVQVSQPGGSLA 
QAPTTSSQQAVLESTQGVSQVAPA^PVAVAQTQATQPTTLASSVDSA 
HSDVASGMSDGNENVPSSSGRHEGRTTKRHYRKSVRSRSRHEKTSRP 
KLRILIWSNKGDRVVECQLETHiraKMV^ 

NDFILAIERESFVDQWEIIEKADEMLSEDVSVEPEGDQGLESLQGK 
DDYGFSGSQKLEGEFKQPIPASSMPQQIGIPTSSLTQWHSAGRRFI 
VSPVPESRLRES1WFPSEITOTVAASTAQSPGMNLSHSASSLSLQQA 
FSELRRAQMTEGPNTAPPNFSHTGPTFPVVPPFLSSIAGVPTTAAAT 
APVPATSSPPNDISTSVIQSEVTVPTEEGIAGVATSTGWTSGGLPI 
PPVSESPVLSSWSSITIPAWSISTTSPSIiQVPTSTSEIWSSTAL 
YPSVTVSATSASAGGSTATPGPKPPAWSQQAAGSTTVGATLTSVST 


1, 0 U S NhS Ml- 3; ,„ O a Ei ft O E! 






TTSFPSTASQLCIQLSSSTSTPTLABTVVVSAHSLDKTSHSSTTGLA 
FSLSAPSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGP 
TSTPLLPQVPSIPPLVQPVANVPAVQQTLIHSQPQPALLPNQPHTHC 
PEVDSDTQPKAPG IDD I KTLEEKLRS LFSEHS SSGAQHAS VSLETSI* 
VIESTVTPGIPTTAVAPSKLLTSTTSTCLPPTNLPLGTVALPVTPVV 
TPGQVSTPVSTTTSGVKPGTAPSKPPLTKAPVLPVGTELPAGTLPSE 
QLPPFPGPSLTQSQQPLEDLDAQLRRTLSPEMITVTSAVGPVSMAAP 
TAITEAGTQPQKGVSQVKEGPVLATSSGAGVFKMGRFQVSVAADGAQ 
KEGKNKSEDAKSVHFESSTSESSVLSSSSPESTLVKPEPNGITIPGI 
SSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTTANKVGRFSVSKTE 
DKITDTKKEGPVASPPFMDLEQAVLPAVIPKKEKPELSEPSHLNGPS 
SDPEAAFLSRDVDDGSGSPHSPHQLSSKSLPSQNLSQSLSNSFNSSY 
MSSDNESDIEDEDIiKLELRRLRDKHLKEIQDLQSRQKHEIESLYTKL 
GKVPPAVIIPPAAPLSGRRRRPTKSKGSKSSRSSSLGNKSPQLSGNL 
SGQSAASVLHPQQTLHPPGNIPESGQNQLLQPLKPSPSSDNLYSAFT 

SDGAISVPSLSAPGQGCAKFNCASEQVTFKPGGRRTRFLSTPCLALC 
V 

8127 

A 

472 

4356 

PPTCSNSPSSFLSQGFPPRLPPQYPGDSNIAPSSNVASVCIHSTVLS 

PPMPTBVLATPGYFPTWQPYVESNLLVPMGGVGGQVQVSQPGGStA 

QAPTTS S QQAVLE S TQGVS QVAPAEPVAVAQPQATQPTTLAS S VDS A 

HSDVASGMSDGNENVPSSSGRHEGRTTKRHYRKSVRSRSRHEKTSRP 

KLR I IiNVSNKGDRWECQLETHNRKMVTFKFDLDGDNPEE I ATIMVN 

NDFILAIERESFVDQVREIIEKADEMLSEDVSVEPEGDQGLESLQGK 

DDYGFSGSQKLEGEFKQP I PASSMPQQIGI PTSSLTQWHSAGRRFI 

VSPVPESRLRESKVFPSEITDTVAASTAQSPGMNLSHSASSLSLQQA 

FSELRRAQMTEGPNTAPPNFSHTGPTFPWPPFLSSIAGVPTTAAAT 

APVPATSSPPNDISTSVIQSEVTVPTEEGIAGVATSTGWTSGGLPI 

PPVSESPVLSSWSSITIPAWSISTTSPSLQVPTSTSEIWSSTAL 

YPSVTVSATSASAGGSTATPGPKPPAWSQQAAGSTTVGATLTSVST 

TTSFPSTASQLSIQLSSSTSTPTLAETVVVSAHSLDKTSHSSTTGLA 

FSLSAPSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGP 

TSTPLLPQVPSIPPLVQPVANVPAVQQTLIHSQPQPALLPNQPHTHC 

PEVDSDTQPKAPG IDDIKTLEEKLRSLFSEHSSSGAQHASVSLETSL 

VIESTVTPGIPTTAVAPSKLLTSTTSTCLPPTNIiPLGTVALPVTPVV 

TPGQVSTPVSTTTSGVKPGTAPSKPPLTKAPVLPVGTELPAGTLPSE 

QLPPFPGPSLTQSQQPLEDLDAQLRRTLSPEMITVTSAVGPVSMAAP 

TAI TEAGTQPQKGVSQVKEGPVLATS SGAGVFKMGRFQVS VAADGAQ 

KEGKNKSEDAKSVHFESSTSESSVLSSSSPESTLVKPEPNGITIPGI 

SSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTTANKVGRFSVSKTE 

DKITDTKKEGPVASPPFMDLEQAVLPAVIPKKEKPELSEPSHLNGPS 

SDPEAAFLSRDVDDGSGSPHSPHQLSSKSLPSQNIjSQSIjSNSFNSSY 

MSSDNESDIEDEDLKLELRRLRDKHLKEIQDLQSRQKHEIESLYTKIi 

GKVPPAVIIPPAAPLSGRRRRPTKSKGSKSSRSSSLGNKSPQLSGNIi 

SGQS AASVLHPQQTIiHPPGNI PESGQNQLLQPLKPS PSS DNLYSAFT 

SDGAISVPSLSAPGQGNKATI IVQKQ 

8128 

A 

2 

237 

ISGQEGLQAVLASDYS FAQFRYLQRLLLVHGRWS YFRMC KFLCYFFY 
KNFAFTLVHFWFGFFCGFSAQTVYDQWFITL 

8129 

A 

503 

639 

LLLLFSLLFHRSLLFIPFRRLLGI I TKKDILRHMAQTANQDPAS I 

8130 

A 

464 

2848 

GETGMS TALTHTTV AMP r* PMT >attf2aa id Tvn "D r> r\ Ttwnxj nrvtrtiTMrvtrcrtr 

LPIQARIX3SIAEIDLGVPPPVMKTFKEFLLSLDDSVDETEAVKRYND 
YKLDFRRQQMQDFFLAHKDEEWFRS KYHPDEVGKRRQEARGALQNRL 
RVFLSLMETGWFDNI1I1LD I DKADA I VKMLDAAVI KMEGGTENDLR I L 
EQEEEEEQAGKPGEPSKKEEGRAGAGLGDGERKTNDKDEKKEDGKQA 
ENDSSNDDKTKKSEGDGDKEEKi<EDSEKEAKKSSK3a^NRKHSGDDSF 
DEGSVSESESESESGQAEEEKEEAEALKEKEKPKEEEWEKPKDAAGL 
ECKPRPLHKTCSLFMRNIAPNI SRAEI ISLCI^YPGFMRVALSEPQP 
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ERRFFRRGWVTFDRSVN I KB I CWNLQN IRL/ RECELS PGVNWDLTRR 
VRNINGITQHKQIVRITOIKIiAAKIiIHTIiDDRTQLWASEPGTPPIjPTS 
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ISKQEYDESGPSIVHRKCF 

8196 

A 

80 

1281 

LPAQKLDTMCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGR 
PRHQGVMVGMGQ KDS YVGDEAQS KRG I LTLKYP I EHG 1 1 TNWDDMEK 
IWHHTFYNELRVAPEEHPTLLTEAPLNPKANREKMTQIMFETFNVPA 
hTYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYAIiPHAIMRIi 
DIAGRDLTDYIiMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFEN 
EMATG/APSIHPPMEKSYELPDG\QVITIGNERFRCPHTIi\FQPSFI 
GM\ESAGl\HETT\YNSIMKCDIDl\RKDL\YAN\NVMFGGHPRIrYP 
WEFAD\RMQERDSPAIiAPQAPMEDQGFIRPRRSAKYS\VWIGRI*PSW 
PSLSHLPSKMWITEAGSTDEAGPFPSSNRKCF 

8197 

A 

2 

228 

GRVDGNWAVNEGLSIFVILWIX3LNVFLFVWYYRVYDIPPKFFYTRK 
LLGVSINSIPCNIGWFTFLIRHSCSSSC 

8198 

A 

1 

369 

GTSSEPVFLLSSNSSLRPAEPRFSLVDATSLHIESLSLGDEGIYTCQ 
EILNVTQWFQVWLQVASGPYQIEVHIVATGTLPNGTLYAARGSQVDF 
SCNSSSRPPPWEWWFQALNSSSESFGHN 

8199 

A 

1 

416 

NQTHYALQASLISIiDFYEKHFDIYYPLSKLDLIAIPDFAPGAMENWG 
LITYRETSLLFDPKTSSASDKLWVTRVIMELAHQWFGNbVTMEWWN 
DIWLNEGFAKYMELIAVNATYPELQFDDYFLNVCFEVITKDSLN 

8200 

A 

2 

397 

LNTGEQKDMTEFFTDLITKIEEMSPELKNTVKSLFGGVITNNVVSLD 
CEHVSQTAEEFYTVRCQVADMKNIYESIJDEVTIKDTLEGDNMYTCSH 
CGKKVRAEKRACFKKLPRILSFNTMRYTFNMVIMMKE 

8201 

B 

1 

555 

MESTYMPINSGLDKEN\mSECKAYNPRPFCKTYTMDKQPLNTGEQKD 
MTE FFTDLITKI EEMS PELSLRCEELHLHAENLSRRVWELLMIiLPTC 
PNKLMAFQNI SDEQGS YSDLYPDSDDS SEDQVENS KNS WSCKI SSNF 
TSSSSKTIVnrFSAWEQLRYTLDNNVLKKSVIIQKDSESYEYD* 

8202 

A 

1 

901 

MSLSSGASGGKGVDANPVETYDSGDEWDIGVGNLIIDLDADLEKDQQ 
KLEMSGSKEVGIPAPNAVATLPDNIKFVTPVPGPQGKEGKSKSKRSK 
SGKDTSKPTPGTSLFTPSEGAASKKEVQGRSGDGANAGGLVAAIAPK 
GSEKAAKASRSVAGSKKEKENSSSKSKKERSEGVGTCSEKDPGVLQP 
VPLGGRGGQYDGSAGVlDTGAVEPLGSIAIEPGAALNPLGTKPEPEEG 
ENECRLLKKVKSEKMESPVSTPAVLPIHLLVPVVNNDISSPCEQIMV 
RTRSVGVNTCDVGSGHRA 

8203 

A 

1 

407 

AFFMARKAMGGHVTTFSWLLCNFQITIQDIWIGTDYLTVI^AMHR^ 
CLLRPVWNHNHHTVSPVPTGMIGPVA/MALLLTIjPVI irvttvpgktv 
TVACTFNFS PWTNDP I ER I NV AD AM LT ARG IIRFIIDFSAP 
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8204 

A 

2 

1149 

PDLELPRARPQLVNS PGADKMETNS SLPTNI SGGTPAVS AGYLFLD I 
I T XJjVF AVTF VXfU VJj GNGJj V I WVAG FRMTHTVTTISYLNLiAVADFCF 
TSTLPFFMVRKAMGGHWPFGWFLCKFVFTIVDINLFGSVFLIAIilAL 
DR CVCVLH P VWTQNHRTVS L AKKV I IGPWVMALLLTLPVI IRVTTVP 

r* VT l /^'T^7fir , T , 17MT?ODWTT l vTT^'Dlf T?I> T KT\7 2l\7 THAT Ttrnp TTDPTTPDCTk nnjto 
vjXN. 1 o 1 V/\C 1 r vir oJrW 1 JN U ir iVtK X IN V/W-MMIj Jl VKOl J.Kr J. Ibr oAPMS 

I\VAVSYGLIATKIHKQGLIKVQVRP*GALSFVA\KAFFLCWSP\YQ 
\WAP**PTSQESREVNCKGMYQRKLGIA\VDVNKVPLAFFNS/CAF 
NP^TVIJVF\MG\ODFRERLIHALPAKSGRGPLTFD™PTCPVP^T.'^^rQT 
LPS\AEVALQAK 

8205 

A 

2 

387 

NGMLGNRFKSGFIFKVYLQETAGETWSVBQVCSWLVEKNLGELVHRF 
QEEEVSGAALLALNDRMVQQLVKKI GHQAVLMDL I KKYKQNTQGLKS 
PENPKKAALVMQTEAARDYRDEESSS PARHGEQM 

8206 

A 

2 

294 

IQGGLDADLRATCLEEMTKRDVPGFAIGGLSGGESKSQFWRMVALST 
SRLPKDKPRYLMGVGYVVDREARALPVGSGFLGTPYPAWGGGIWGKR 
TQV 

8207 

A 

541 

713 

SEVSAMVFLFFFF/CFFEMESRSAAQAGVQWCNLGSLQALPPRFTPF 
SCLSLPSSWDY 

8208 

A 

2 

1730 

- 

KKSIQQLVPEYNEKQTPESLPQNSNPDWHPTNLTLSDETCQRSKNLK 
VGDKSPSVSPSMPENQSATKELGQ^lNLTEP^KMDTGVVI J LS 
DLCQSQLPENKESKEDYVFYLTVXSSLTERLDFAAEQDIiELTSEEEQ 
ERLKGCENKQPQDKAI FKTCSPTEKTPKKQNTKQ INLPLLHLQKTSQE 
PEMAKDCDREDI P I YPVLPHVQKSEEMR I EQGKLEWKNQLKLVINEL 
3\\2 K r ijii 1 1 tils, x KX PAL P Jir. Jc* PXj JjDN STRGTD V KD I P r NL» INN I PGCE 
EEDASEISVSVILLAYQESKKDTFCLCEIi^ 

RNADMLYNKDSEQLRIKEEECGKVVETKQQLKWl^njRRLVKELRTVRN 
NLDLWQERNDAQKQLS EEQDAR ILQDQI LTS KQKBLEMAQKKRNPE 

ptl O A 21 V"KTT ,Ci & IT TTO T T . C: T . OT? TCMTCPT .VTH TT VWWT.VT7P NTH /^r^PTTPtf INCH? 

KVVMREFQQEWTDLLKQQPTSEATSRCHIInTLDETQDSKKKLGQIRSE 
VCMKHNMSTVNI* 

8209 

A 

1 

183 

RTRGRTRGILCHMQISYAQVETMRQSIEELHYYELELYVLQFHHLQV 
VWENHLCI FFYYLK 

8210 

A 

3 

320 

RAALVLGGPKFAS KTPRPVTEPLVLLSGMQLAKLI RQRKVKC IDDVQ 

AYINRIKDWPMINGIDKHRFEEAMNEAHAVHQKLAEKHVDVI 

TWTFLWALNA 

8211 

A 

2 

198 

STLIVTHALHKVKVGKSPPVRGWLSGKVSLPCHFSTMPTLPPSYNTS 
EFLRIKWSKIEVDITGIH 

8212 

A 

110 

364 

QGD S LDDTWWNQQQ PLKE SLSWFQSLSPGIG FVDG S S CG AVREBLAA 
T .T\7Mr*T . t pp ppmT .m .niffiMnnnQT.pp t t .\tczj?t .t .tch 

8213 

A 

1 

849 ! 

MP ES WLAAFP PYGD I KKWGLQLKEGLHQNLTMQDS IQIFD FQLPGL 

LRPSSVLWTLKPYHTAGLASDRDSYKGILACMPCMFGSRRSPLRDIK 

SHQADLELEVKNSIjDTIHRLESELKKQSKIQSQMKVEKAHLEEEIAE 

LKKSQAQDKAKLLEMQESIKDLSAIRADLANKLAEEERAK^ 

SDLTAQAKSRDEETATIITQLKLEPJ5VHQRELIQ)LTSSIiQSVKTKHE 

QNIQELMKHFKKEKSEAENHIRTLKVPLL 

A 


7V 
>\ 

X 


I»JW VWiAr.br I -Liryr LnoVbKl VyAAISJ.fc'Oil.W V bfcMruiLoALwAliKVr 

I YYN I C PGRMGLHWLLNQE KHISI EEEHLRRMEEARLQL KDQLLCLE 
TEQES I LGV I GKE IDAACKTFS KDS VEKLKNI FMLAIDTVSTRVWL 
CSSIDYALRVYGGSNFTDYGLESELKKQSKIQSQMKVEKAHLEEEIA 
ELKKSQAQDKAKLLEMQES I KDLSAIPJ^LANIO^AEEEPJ^JCKAVLKD 
LSDLTAQAKSRDEETATIITQLKLERDVHQRELKDLTSSLQSVKTKH 
EQNIQELMKHFKXEKSE^ENHIRTLKVPLLNYPDDTEGIFLVAHEVN 
KA 

8215 

A 

23 

627 

S I KCTLVS I Y I L 1 1 LGWQRKNCF I F YHRTYKI KYS I L I L F S RLKKE A 
PAKEQEPVSKEIARPNMAERETETSNSESKQDKAASSKEKNGCNANS 
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FEGSSTTKSEESITVSDKENETCLADQETGSKNIVSCDSNIGADKVE 
KKKQ I QHVCQEMELKMCQS SEN I I L SDQ I KDHNS S EARF S S KN I KDL 
RLASDNVSIDPFF 

8216 

C 

79 

171 

MD I S GTKCD FTVQ VQ LR F CL S E TKATQPTN * 

8217 

A 

3 

632 

HAS ADLWAD I S QRFRETCFAFALTPQQVQQ I S S SMD I SGTKCDFTVQ 
VQLRFCLSETSCPQEDHFPPNLCVKVNTKPCSLPGYLPPTKNGVEPK 
RPSRPINITSLVRLSTTVPNTIWSWTAEIGRNYSMAVYLVKQLSST 
VLLQRLRAKGIRNPDHSRALIKEKLTADPDSEIATTSLRVSLLCPLG 
KMRLTI PCRALTCSHLQCFDG 

8218 

A 

1 

188 

NTIVVSWTAEIGRI^SMAVYLVKQLSSTVLLQRLRAKGIRNPDHSRA 
LSTYDKLISLIQLFC 

8219 

A 

1 

2295 

VFEVVERVEELRRKWPVAWGKLDDGSIGWTPYADQVFRIRAELRKK 
RLSDVhTVERVLNVQGKQFRVLFLSTXmTRHTCKHKQTPIKKKEQLLE 
DSTEDLDYGFLSNYKXLNTAITRAQSIiVAWGDPlALCSIGRCRKFW 
ERF IALCHENS SLHG ITFEQI KAQLEALELKKTYVLNPLAPEF I PRA 
LRLQHSGSTNKQQQSPPKGKSLHHTQNDHFQNDGIVQPNPSVLIGNP 
IRAYTPPPPLGPHPNLGKSPSPVQRIDPHTGTSILYVPAVYGGNWM 
SVPLPVPWTGYQGRFAVDPRIITHQAAMAYNMNLLQTHGRGSPIPYG 
LGHHPPVTIGQPQNQHQEKDQHEQNRNGKSDTNNSGPEINKIRTPEK 
KPTEPKQLPRPPFPI PQQHTLLNQQQNNLPEQPNQI PPQPNQAGPNN 
AFFNSAVAHRPQSPPAEAVIPEQQPPPMLQEGHSPLRAIAQPGPILP 
SHLNS F I DENPSGLP IGEALDRIHGS VALETLRQQQARFQQWS EHHA 
FLSQGSAPYPHHHHPHLQHLPQPPLGLHQPPVRADWKLTSSAEDEVE 
TTYSRLEIDEEAGEEKLEASRGWFMRFKERSHPHNIKVQGETASANI 
EAAASYPEDL7VKIIDEDGYTKLQIFCVDKTASYWKKMLSRTFSAPPK 
TVKPPEDQLKSENLEVSSSFNYSVIiQHLGQFPPIjMPNKQIAESANSS 
SPQSSAGGKPAMSYASALRAPPKPRPPPEQAKKSSDPLSLFQELSLG 
SSSGSNGFYSYFK 

8220 

A 

143 

5972 

VMEDRRAEKSCEQACESLKRQDYEMALKHCTEAIiLSLGQYSMADFTG 
PCPLEIERITIESLLYRIASFLQLKNYVQADEDCRHVLGEGLAKGED 
AFRAVLCCMQbKGKLQPVSTILAKSLTGESLNGMVTKDLTRLKTLLS 
ETETATSNALSGYHVEDLDEGSCNGWHFRPPPRGITSSEEYTLCKRF 
LEQGICRYGAQCTSAHSQEELAEWQKRYASRLIKliKQQNENKQIjSGS 
Yl^TLIEK^n^SLSPEKVLSECIEGVKVEHNPDLSVTVSTKKSHQTW 
TFALTCKPARML YRVALLYDAHRPHFS I IAISAGDSTTQVSQEVPEN 
CQE W I GGKMAQNGLDHYVYKVG I AFNTE I FGTFRQT I VFD FGLEP VL 
MQRVMIDAASTEDLEYLMHAKQQLVTTAKRWDSSSKTIIDFEPNETT 
DLEKSLLIRYQIPLSADQLFTQSVLDKSLTKSNYQSRLHDLLYIEEI 
AQYKE I S KFNLKVQLQI LAS FMLTGVSGG AKY AQNGQLFGRFKLTET 
LSEDTIAGRLVMTKVNAVYLLPVPKQKLVQTQGTKEKVYEATIEEKT 
KEYIFLRLSRECCEELNLRPDCDTQVELQFQLNRLPLCEMHYALDRI 
KDNGVL FPD I SMTPT I PWS PNRQWDEQLDPRLNAKQKEAVLAI TTPL 
AIQLPPVLIIGPYGTGKTFTLAQAVKHILQQQETRILICTHSNSAAD 
LY I KDYLHPYVEAGNPQARPLRVYFRNRWVKTVHPVVHQYCLISSAH 
STFQMPQKEDILKHRVVVVTLNTSQYLCQLDLEPGFFTHILLDEAAQ 
AMECETIMPIALATQNTRIVLAGDHMQLSPFVYSEFARERNLHVSIiL 
DRLYEHYPAEFPCRILLCENYRSHEAIINYTSELFYEGKLMASGKQP 
AHKDFYPIiTFFTARGEDVQEKNSTAFYNNAEVFEWERVEELRRKWP 
VAWGKLDDGS I GWTP YADQVFR I RAELRKKRLSDVNVERVLNVQGK 
QFRVLFLS TVRTRHTCKHKQTP I KKKEQLLEDSTEDLDYGFLSNYKL 
LNTAITRAQSLVAWGDPIALCSIGRCRKFWERFIAIjCHENSSLHGI 
TFEQIKAQLEALELKKTYVLNPLAPEFIPRALRLQHSGSTNKQQQSP 
PKGKSLHHTQNDHFQNDGIVQPNPSVLIGNPIRAYTPPPPLGPHPNL 
GKS PS PVQR IDPHTGTS I LYVPAVYGGNWMSVPLPVPWTGYQGRFA 
VDPR 1 1 THQAAMAYNMNLLQTHGRG S P I P YGLGHHP PVT 1 GQPQNQH 
QEKDQHEQNRNGKSDTNNSGPEINKIRTPEKKPTEPKQVDLESNPQN 


1. 0 O S H- 6, 4- 31 ,„ 01 iS! iE! 16 O B 






RS PESRPS WYPSTKFPRKDNLNPRHINLPLPAPHAQYAI PNRHFHP 

LPQLPRPPFP I PQQHTLLNQQQNNLPEQPNQI PPQPNQWQQQSQLN 

QQPQQPPPQLSPAYQAGPNNAFFNSAVAHRPQSPPAEAVIPEQQPPP 

MLQEGHSPLRAIAQPGPILPSHLNSFIDENPSGLPIGEALDRIHGSV 

ALETLRQQQARFQQWSEHHAFLSQGSVPYPHHHHPHLQHLPQPPLGL 

HQPPVRADWKLTSSAEDEVETTYSRFQDLIRELSHRDQSETRELAEM 

PPPQSRLLQYRQVQSRSPPAVPSPPSSTDHSSriFSNFNDNSRDIEVA 

SNP AFPQRLPPQ IFNSPFSLPS EHLAPPPLKYLAPDG AWTFANLQQN 

HLMGPGFPYGLPPLPHRPPQNPFVQIQNHQHAIGQEPFHPLSSRTVS i 

SSSLPSLEEYEPRGPGRPLYQRRISSSSVQPCSEEVSTPQDSLAQCK 

ELQDHSNQS S FNFS SPES WVNTTS STP YQN I PCNGS SRTAQPRELI A 

PPKTVKPPEDQLKSENLEVSSSFNYSVLQHLGQFPPLMPNKQIAESA 

JM b 0 b P w b b Avjvj ivP AM b 1 Ab AJLRAP P KPRPPPEQAKKSSDPLiSLtFQEIj 

SLGSSSGSNGFYSYFK 

aZZx 

A 


*7 c c 

JS.I VNNr xliKNi? PAxjJjbTCsrEA? JjKXjPF ERLAFVIjSSNSLKHCTExjEIjFK 
AACRWLRLEDPRMDYAAKIiMKNIRFPLMTPQDLINYVQTVpFMRTDN 

X \- VIM XJXJUXUn&n jl yrlT^irr X l*iv^±r VriyoUK IrtlKoUo IXlJLiV i Jjub VLiavvw 

WS KELRMYDERAO E WR S LiAPMDAP R YOHG T A V T ttW PT . YWC3f30 SMY 
DTKGKPAVDTVFRFD PR YNKWMQVAS LNEKRTF FHL S ACKAHLYAVG 
RHSAAGELATVECYN . 

8222 

A 

57 

272 

HIFLTSWNFYIITOELHLNN^ 

\7T .T .M\7r*T? WPP Q TfP A TTKT . C A 1? f2 TT T7 


A 



DTP Pf^TTV T VTY"2f2T?'n'nT V^FWWQ W\mt~*'D\TT A Df^TJOU VOUXf C\7\7 A UHDVT 
lr lA.roi\I X V X \J\JE*ULJl-i V A V nof VULK V lAKUnonJVo n vo V vf\r Uir X X 

TSVEEGDPMEFSGSDEDFQDLLHFG 

8224 

A 

180 

293 

WVPFFCDLRQGQPVYPITGGFRQPYPSSLSVDTSMSR 

8225 

A 

76 

551 

IKMSGKANASK1<2NJAQQLK3^ 

MTILSNSIi^KEEIQYHLNFIiKKRLLQQCETLKVPPKKMEDL 
LL1WERARDKANEEGLA 




TAR 
b 

blJvjbAbJKljl^YFbEPVHxIjKKAI: AK x VSGVSHVIjKi>njSFSLQSSDTF 

8227 

A 

16 

548 

QLNGRS IRHEVMS HRKFS APRHG SLGFLPRKRS S RHRGKVKS F PKDD 
PSKPVHLTAPLGYKAGMTOIVREVDRPGS1WNKKEWEAVTIVETPP 
MVVVGIVGYVETPRGLRTFKTVFAEHISDE/ CPJ^LPLRQKKAHLMEI 
HVNGGTVAE KLD WARERLEQQVPVNP VFGQDEM I D VI 

8228 

A 

3 

348 

TRH I VREVDRPGS KVNKKEVVEAVTIVENTPMVVVG IVG YV/ ETPR/ 

GLRTFKTVFAEHISDECKPJRFYKNW^^ 

QTMEEKKAFMGPLKKDRIAKEEGA 

8229 

A 

1 

956 

VNHAPAHASDGVMSHRKFSaPPJIC\SxjGFLPRKRSS\RHRGKVKSFP 
MDDPSK\ PVHLTAF\LGYKAG\MTHIVP^\VDR\PGSKVNKK\EVVE 
\AVTIV*DTTHWCLLGIVGYVGNPSEGLRD/TFKTCLLLEHISY*MP 
KKKrx* w KxWAxjKS*ECjRPxjPKxCKEIGl^ 
KLIAQVHPCSIAQTQ\^L^ 

DWARE KLE \ QQVPVNQVFGQDEM I \ DVI \ GVTKGK\ G FKGVTRS \ WP 

TM*T.PPV7A\ WT d\ T.QP\7Ar*17r* BUHD At? AUXJUAfA D HPr»VPVADVTT T V 

HSDL 



1 

X 

■37c 

ywiAni XUPFlNKv,EllAAWxxNxJxjrybv~KK^ 

ERGQGTWDTPGGQMSFSIPAHKSGLVIGLGGDNVKAITQQTGALVEI 
SRQLPPTGDPNYKLLIIRGSPQQIDHTKPRA 

8231 

A 

1 

399 

LDVKL IEGLWKAGTTVRFPAI IRGVPVPTAKWTTDGSE 1 KTDEHYT 
VETDNFS S VliTI KNCLRRDTGEYQITVSNAAGS KTVAVHIjTVXiDVPG 
PPTGPINILDVTPEHMtlSWQPPKDDGGSPVINYIVEIQ 

8232 

A 

2441 

2706 

FCFGLSCSLLPFFFFFFFEMESRSVAQAGVQWCDLSSLHPLPPG\SS 
NAPASASRVAGTTGAYHQCLANFGFVFLVETGVSLCWLGWS 

8233 

A 

1275 

1749 

ILFWIIQLNLECSEFSWRLRQGRGIMLFFFPFLFFFETESHSVAQTG 
VQWCDLGSLQPPPP\GLSDSPALASSVSWITDVRHHLWLIFVFLVET 
GFRHVGQASLKLPTSGDLPTLASQSAGITGVSHYAWLIFVFLVETEF 
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HHVGQAGLELLAPSDPPA 

8234 

A 

3 

200 

DALAFVIKVFAAEEVDGCSVTTGS/HNHSVASGSNHRTRFCFTVYLS 
S I VS WRY * SCRCRECRGRAF 

8235 

A 

1 

1214 

PGS P IVKS AKFVTDLLLHF I KDQTCYNI I PL YNSMKKKVXSDSEDEE 
KDADVPGTSTRK31KDHQPRRRLRNRAQSYDIQAWKKQCEELLNLIFQ 
CEDSEPFRQPVDLLEYPDYRDIIDTPMDFATVRETLEAGNYESPMEL 
CKDVRLIFSNSKAYTPSKRSRiYSMSLRLSAFFEEHISSVLSDYKSA 
LRF YKRNT I TKRRKKRNR SSSVSS SAAS S PERKKR I LKPQLKS ES S T 
SAFSTPTRS I PPRHNAAQINGKTESS S WRTRSNRVWDPWTEQPS 
lbbAAMr 1 rKANASAIPGKTILENSVKHSKAIJOTLSSPGQSSFS 
TRNNSAKENMfiKEKPVKRKMKSSVLPKASTLSKSSAVIEQGGCKEQR 
FLYQGPFQVNGHGGQPSKXiVKRGPGKET 


A 

O A 

1 Q "3 

183 

P V P E I KW PHJb P P I TMEE V P PL S G TD ADLLV P YD I YVLS PC CNLKCL 
IQACL 

8237 

A 

256 

491 

PPMLGKDKKEKEKALKRKITKEEKELRLKLRPLYQFMSCKEFDDLFE 
NMHKEKMLRAKI RELQR YRRNG I TKMEE S AE 

8238 

A 

243 

385 

DISHIiSADGYSSSDSFTSDPEQIGSNVTRQRSVFLSMKS^ILNYIlAAF 

8239 

A 

3 

404 

SCYGFNWGQV\SDGGQLRSINGEIjYAPLQH\HHVNVEGATHKQVVD 
LIRAREKELILTVLSVPPHEADNLYPSDDSLGQSFYDYTEKQAVPIS 
VPRYKHVEQNGEKFVVYNVYMAGRQLCS KRYRQFAI LHQNL 

8240 

A 

1 

706 

GGGGGG S VLHCAGNGGGGGGG PRWR I VKSE SG YG FNVRGQVS EGGQ 
LRS INGELYAPLQHVS AVLPGGADDRAGVRKGDRIIiEVNHVNVEGAT 
HKQWDLIRAGEKELILTVLSVPPHEADNLDPSDDSLGQSFYDYTEK 
QAVPISVPRYKHVEQNGEKFVVYNVYl^GRQLCSKKYREFAII^ 
KREFANFTFPRLPGKWPFSLSEQQLDARRRGLEEYLEKVCSIRVIGE 

8241 

A 

323 

497 

DRVSLCCPGVQWHNLGSLQPLPPWFKRFSLLSLPSSWDHRRMPPHPA 
NPCILSRHGVL 

8242 

A 

1 

483 

GRDLDWDEEMAKAQSHDCVPVLSEHPLYILYTSGTTGLPKGVIRPTG 
GYAVMLHWSMS S I YGLQPGEVWWAASDLSWWGHS YICYGPLLHGNT 
TVLYEGKPVGTPDAGAYFRVLAEHGVAALFTAPTAIRAIRQQDPGAA 
LiG KQ Y S liTR F KTL FVAGRTM 

8243 

A 

2 1 

740 

QMAHAVMIRDIGSNSPVSKIAGVLVTCHGIKKGDTWIYMPMIPQAMY 
TMLACARIGAIHSLIFGGFASKELSSRIDHVKPKVVVTASFGIEPGR 
RVE YVPLVEEALKI GQHKPDKIL I YNRPNME AVPLAPGRDLDWDEEM 
AKAQSHDCV^VLSEHPLYILYTSGTTGLPKGVIRPTGGYAVMLHWSM 
S S 1 YGLQPGES QTDPSGG I PEEVI I VI GDDS SMLALVEL PVRQ I VEM 
EDSDTDNPDPV 



*> A Q 


VAN I KKE5TH I VWY KDERE I S VDE KHDFKDG I CTLL I TEVGD YKCKM 

8245 

A 

24 

428 

GQEKLVPPSVELDVTCLIEGLVVKAGTTVRFPAI IRGVPVPTAKWTTD 
GSEIKTDEHYTVETDNFSSVIjTIKNCLRRDTGEYQITVSNAAGSKTV 
aahltvld v pg p ptg p i n i ld vtp ehmt i s wq p p kddgg s 

8246 

A 

1 

240 

TPSRCCCCRSAPTLVGVRTRGFNIPKQVSRSGLYRSPSMPDNTjNRPS 
lkqvr,jur kvgs 1 fdkv1uckyfsgqgklrkvrtf 

8247 

A 

935 

1150 

QRRRGRGWTNCGSTLREDGHIiPEPAFLHSAQEPRSGALACGKATGEH 
RAPRRPPGRVRSPAHPEPCPSSGV 

8248 

A 

190 

584 

KIKHRPEEEPRWAAAGAQSAGPGAAEVAPPRPGTVAPGANGMTDSAT 
ANGDDRDPEIELFVKAGIDGESIGNCPFSQRLFMILWLKGWFNVTT 
v i^jji\_rvi\^i^j^4_ijrxi>iXxrtir Vjr i ri.f Jr J? Jj 1. i? JMols vis. 1 JJ V IM JS^xjLn* 

8249 

A 

201 

364 

M I L YKS S LKYVFP YCNQLFLTAVNFTEVNEENKNDIjFQEVt'S S I E TL 
AFTFGNM 

8250 

A 

1 

1122 

MHKDWWFD VS GEHKE ALLHE LQLCDSTLKP AAHGQTRIi VDLS L S EH 
KAQGTGAHVYEERHQVSVSRTDDVGCWPSECLRACGAEVGCSNIAY 
PKLVMELMPIGLRGLMIAVMLAALMSSLTSIFNSSSTLFTMDIWRRL 
RPRSGERELLLVGRTGIPSTPPAPQSRLSFLLPETPPLERYLLGLW 
MDLWLVIVALIGVSVAWIPVLQDSNSGQLFIYMQSVTSSLiAPPVTAV 
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FVLGVFWRRANEQGAFWGL I AGLVVGATRIiVLE FLNPAP PCGEPDTR 
PAVLG S IHYLHFAVAIiFALSGAVWAG S LLTPPPQS VQI ENIjTWWTIj 
AQDVPLGTKAGDGQTPQKHAFWARVCG FNAI LLMCVN I F FYAYFA 

8251 

A 

1 

1197 

MHKDWWFDVSGEHKEALLHELQLCDSTLKPAAHGQTRLVDLSLSEH 
KAQGTGAHVYEERHQVSVSRTDDVGCWPSECL\RACGAEVGCSNIA 
YPKLVMELMPI/GFSGAGDFPGPPLSNPLFLEAFDPSGLRGLMIAVM 
LAAIiMSSLTSIFNSSSTLFTMDIWRRLRPRSGERELLLVGRTGIPST 
PPAPQSRLSFLLPETPPLERYLLGLWMDLWLVIVAIilGVSVAWIPV . 
LQDSNSGQLFIYMQSVTSSLAPPVTAVFVLGVFWRRANEQGAFWGLI 
AGL WG ATRL VXiE F L»NP AP PCGE PDTR P AVLG S I HYLH FGV AL F AL S 
6AWVAG S LLTPP PQS VQ I ENLTWWTL AQD VPLGTKAGDGQTPQKHA 
FWARVCG FNAI LLMCVN I FFYAYFA 

8252 

A 

1 

624 

MGCG I E FVTKS S KA I ATKTKI DNWDL I KELLH S KRNYQ YS KQT I YKE 
LCGCSERSQGEVEILNLEPIiDEGASMHPCDDTYCGPFPESEPEVKAV 
ANFLRKHRKHIRAYLSFHAYAQMLLYPYSYKYATIPNFRCVESAAYK 
AVNALQSVYGVRYRYGPASTTLCSSAGSGNQIDTRQVKQEKSIQILL 
I L CVQDLHNR VKS KEVAKAR 

8253 

A 

88 

627 

PHGPKTMEEGGSTGSAGSDSSTSGSGGAQQRELERMAEVLVTGEQLR 
LRLHEEKVIKDRRHHLKTYPNCFVAKELIDWLIEHKEASDRETAIKL 
MQKLJUDRGI IHHVCDEHKEFKDVKLFYRFRKDDGTFPIjDNEVKAFMR 
GQRLYEKLMSPENTLLQPREEEGVKYERTFMASEFLDWL 

8254 

A 

3 

439 

siyaspdkrnqthyalqaslklldfyekyfdiyyplskldliaipdf 
apgamenwglityretsllfdpktssasdklwvtrviahelahqwfg 
nlvtmewwotiwlkegfakymeliavnatypelqfddyflnvcfevi 

TKDS 

8255 

A 

1 

401 

s prtgggggfqs papagertmhclgae ylvs aegaprqrewrpqi yr 
kctdtawlflfflfwtglvfimgysvvagaagrllfgydsfgnmcgk 
knspVegaplsgqdmtlkkhvffmnscnlevkgtqlnrm 

8256 

A 

220 

415 

KYAFLVLHI YAS I WEERRLFTTENTPI KYGSQI FTLVQPIHLPQKVA 
WHCWGHQKGQDE I GQKN 

8257 

A 

2 

359 

HELMLDKAPVLKVIVNSLKNMINTFVPSGKVMQVVDEKLPGLLGNFP 
GPFEEEMKGIAAVTDIPLGKVHLEALKKKVIKFFYKFPLRCDIHTAQ 
VLYVXDIQHRCCMSLDIHTAQVHRS 

8258 

A 

1 

1270 

RTRGRTRGELVLESGEWRWLLERCRPGGAVAL ILLAAWS \ CAV\ SQ 
HAPPWTEDCRKSTYPPSRPTYRGAVPWDTI/HVLDLPPYKRWDELML 
DMAPV/ LS KVI VNS LKNM INTFVP \ SGKI MQ WDEK\ LPGLLGNFPG 
PFEEEMKGIAAV/ SLDIPLGEIISFNIFYELFTICTSIVAEDKKGHL 
I HGRNMG F \ GVF\ L VWN I NNDTWV I TEQLKPLTVNLDFQRNNKTVFK 
AS S F AG YVGMIjT\ G FKPGL FS LTLNERFS INGG YLG I LEW I LGKKDA 
MWIGFLTRTVLENSTSYEEAKNLLTKTKILAPAYFILGGNQSGEGCV 
ITRDRKESLDVYELDAKQGRWYWQTNYDRWKHPFFLDDRRTPAKMC 
IJNRTSQENISFET^lYDVIJSTKPVIJTKIiTVYTTLIDVTKGQFETYLRD 
CPDPCIGW 

8259 

A 

108 

988 

TARGSKARSGGTRWSLGPPSSSGGRRSWQPLVISRAV*H*GKSSRHQ 
PLYSGHPHLGPVSHRGGPGPVRQPAAT*RPACASKLVQPWH*WGPG* 
LGSQPALAGHLCRTGCEESHHTCFPGYVKP*RL*ALPGGRPV*VGER 
ARARPDEPREAAGFCGTEEQPHTDPAHPRLKRDCL/PSVSQAAVSTT 
ASAGFPPAWAASAGFPPTWTPNLSPATLGMQRSSCALTFRRICRFLS 
DNKWMQRSITSMAIWICNGSGGSTSGRVSFSSTLGGTDCSSSSLSKS 
ETFPSATS FSLR 

8260 

A 

1 

319 

GVKNWHPQIPAIiPRFKKSLPASVSQ/VAVSTTASAGFPPAWAASAGFP 
PAWAASAGFPPTWTPNIJIRARHSLVCKTERSRPTQSSKPYWAPTMCW 
STLRLEQKRPCH 

8261 

A 

19 

373 
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KVNWARSTSRQQLLESARTKSWAEREGGTRWRAW 
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JvJMr ijFWUoAJvKljr r r r r r r Fr Fr 1jXxUIj1jJLjJjxjYFIvF/MI 
PLI*AFILSPTAL*RGVFHH 

8263 

A 

400 

582 

VLVSFGVLLLLLFIFFTSFSFLFFFFLLFFFFFFFFFFFFFFFFFFF 

8264 

A 

1205 

1681 

SSLIFLLSSLISSMNRSISSPSPSIISMLSTSLLGTSSPKPGAGGAQ 
TPLTFPPGSWYSPPPKAGASFCTGAFSTTLFSLGTFSMLSFFPVFFF 

PPPTT?TT?T?PPPPPT?PI?PT?Piri5*I«*T?T?T T .T T T T T T VUVPr'ADPflT TiTJ nnA 

TLPLALQGVTSMAVIjMV 

8265 

A 

1 

403 

KSLQYSNTGDMILVVSGSCHAKVIDRDGFEVMECIKGDQYIVDMANT 
tvunxni'jun x vjorvxiir j\x j\.vjc*r i t i x ^- 0 inj j. v x x w ci v c*JMo JSJVHIyo V 1* ivPlv 
TMHGQKVIPTTCTYSRDGNLIAAACQNGS IQIWDRNLTVH 

8266 

A 

3 

399 

LEKQMEKAHEDSEKLREIVLPMEKEIEELKAKLLRAEELIQEIQRRP 
PJIAPSLHGSTELLPLSRDPSPPLEPLEELSGDGGPAAEAFAHNCDDS 

AQT QQPQT.f5nf?VnQQ^C?T.Pn < 5PrV^T.QT3'PO'PT? r r7ACT \7C 

X OO r OUOUO V O DOUr yO AyuLlD rCi^£>i!i iHoJjVO 

8267 

A 

36 

448 

NFMDVFWSLPFVGEKVTDLFL1WLSICSDDELMTEGEDQFDGSAAA 

RTCR TTPWK'TP A Tr?KM7APVPQ\7T.PTrT?C:PC!\7T. r rT.Trf2T TDTfiMLDCmTT A 
J\.Ivi-> X X r\X\ IN.X XVrt.X\JI\i T LrtJ\. V rOV JjX\£jJCjOIjO V Xl X XjISXjXjX ±r X ^VLLiir.iD^J VitM. 

GGRQTLQSAIRGFSPPHRICSFEEAKGLDRINERMPPRKDAVQ 

8268 




RHPPHVPHPPT .MT , 'P"{"^'R'nOTJTYl7r2<^ A 7\ A P VT7 T TDMVTDa TfTWM A QtrCOIT 
j\riE*T\TiCsi\ri.Ctr»Lji w i x a\jaLJ\^r u VooiwuUvAJSiJL X rvIN IV X xvH X 0 I\I v l/-i±\. V r 0 V 

LREESESVlVlXKGLTPTGMLPSGVIiAGGRQx^ 

nPQPPTTR TPQFPF.AICfiT.TjP TITFPMPPPKTi AV/^rir>fII« r NJCTA MT^UaTC 

NHGTGNHTAQ 

8269 


644 

809 

VWTtfPTiTFYPICWAOPTCT.PPPQT.FTPT.TrVf^Cian ZVPTrPT TPWTTTP7A TfiVM 

ARVFSVLR 

8270 

A 

37 

235 

SHSSLPCSASSPPALVSPGSFSGLVYKxWTWWTALKGVPJ^ 
GCL IRKGPWPHHPMTGAGL 

8271 
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ii J V 

TTT^ VT'T CPirMT* C CD T"17 T T'TV VTPi^ 7M(^\ DTVDri rTiTm 7 C T">t t Ar»TT tjt pxttdct 

£»ui\x ibuybbrKfiVl lAIYLU vnyr X JvEJb VWJiV^KijljAtj±ljrlijONXlir X 
TAGGAQVSFKTGKIMPACPPRSLCSPWET 

8272 

A 

516 

695 

GLSHQIIl^mFYLKSSPXGAMAHACNPSTLGGRGQLIMRSGVRGQPV 

nWnVTDYT T TTTTI 

8273 

A 

2 

435 

grdsdrnsseegtaekskklrt1^hsqtcdwgnllqdii1x3vfkyl 
plldrahasqvcri^jqvfhmpdlwrcfefei^qpatsylka™ 

lAyilMui&MnLUX Vor JlvVlJoolSJio/^AA^lJXJjoyjjVNCoJji^ 
STA 

8274 


■a 

X / 

KVEKRRVNLPRVLSMPPVAGTOCHXAYDREVHLRCELSPGYx^ 
TFIjKDAPGEVLTjRVFQTRPVQT.*? ATP avaKTJTTPna AT.PTXVPPRT 

x J. jji\x/rtxvjij v JJiJlv V r O i. VJXV V OiJOnXivrt V /-VX\_LM X X x V3>t_M. Xi tr J-\ V x_>xv\J X 

8275 

A 

308 

614 

hgshgiaxrprqsqaqplvhkvccfqeeqmgapegkgpecgsricrl 

LAQGLGFHLVCx^RV*H1xAJPH^ 

gplssspp 

8276 

A 

11 

161 

lffpdi^edetedddlfetefrqykrtyymtkmgvdwseyvfana 

T T.K 

8277 

A 

163 

437 

HLAGIWPPKQELGSGVGEIAMC^SEKFI^ 

GSMFLPP l^EAGATLYSIAOTGGLVLFSMFLLYDTQKVI KRAQV 

8278 

A 

1 

660 

MAXWTLRPTPLLVTCML I CAPGVMGAWAPLTI LGG PLL I RAAW YTA 
GIVGGLSWAMCAPSEKFLNMGAPLGVGLGLVFVSSLVDQMGRWFVA 
GGAAVGIiGALCYYGLGLSNEIGAI EKAVEYWFNS FVCHSNQQNACSH 
ELHDERLLGDMGLPILHAJ^LLpj^ 
ALTLQLKKCSSGLMLMEFTGLTMFPI ILKQLD 

8279 

A 

2 

1246 

SCSAPEELCSLRPTx>GAJU^GNCEAK^^^GDPJVFQICSVDLGAPPP^lIJA 
ARL\ VCLRTLPSRVFHPA^TKASP\ VVKNS ITKNQWI#LTP\ SREYAT 
*T*/RLGIPJlWGELGQl5LKEGSIGlOTSMGKNILKFDSDGEEWLVA\G 
GA\ WGLGS I CAYLLALGL/ GLN* DLGAI * KRL* FGPQ YVK\ DRI \H 
STYM\YI^GGVXGLTA\LSAIAIQQNRLFFMExjP*WRRIjIjGWTIGVD 

lwqpwlgagmlvrs i p ydqs pg pkhl awllh s gvmgawaplt i lgg 
plNlipjvawyXtagivgglsXtvaXmcapsekflnmgaplgsgagvs 
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VFVSSIGI*CFPPPTD/TVAGATI»\YSV\AMYGGISSFIjSMF1*LYDT 
PESKSSRAEVSPMYG\ VQKYDPI \NSML\ S I Y\MDTLNI FMRVATML 
ATGGNRKK 

8280 

A 

325 

432 

GVSGAIiTVLMKDALKPNLMQTLEVSRVTPAFLNWFW 

8281 

A 

1 

293 

E KGHTRTAQ FD I S VA S E I MAVLALTT S LEDMRERLG KMWAS S KKGE 
PVSAEDLGVSGALTVLMKDAIKPNLMQTLEVSRVTPAFIiNWFWTVRA 
EWL 

8282 

A 

1 

366 

PTRPRLGNLRPRLGNPPILRHQGVRESAVCAEAMAAAAPVAADDNKR 
RRRPGADSISSYEAQITALKQERQQQQQDCEEKERELGRLKQIiLSRA 
YPLDSLEKQMEKVGIGMASLLPWDLEEC 

8283 

A 

780 

891 

KYSIFSDFLADQAACYVQAIQWILHYYYHGVQSWSW 

8284 

A 

44 

155 

GSVPNKFYVHNDMFRYEDEVFGDSEPELDEGEVTLKI * 

8285 

A 

336 

416 

IjAARRQEQRATVRAIjGF WAFS LQAKE 

8286 

A 

1 

294 

KLDTWEFPIRGL1^SEFVCNMSARPYVYDLIAVSNHYGAMGVGHY\ 
KAAYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGDDEACS 
MDTN 

8287 

A 

302 

486 

MVFPPLVFQTKAAYVLFYQRRDDEFYKTPSLSRSGSSDGGTRPSSSQ 
QGFGDDEACSMDTN 

8288 

A 

1 

620 

AYEKHVSMLQPQKKKKTTVALRDCIELFTTMETLGEHDPWYCPNCKK 
HQQATKKFDLWSLPKILVVHLKRFSYNRYWRDKLDTVVEFPIRGLNM 
SEFVCm.SARPYVYDLIAVSNHYGAMGVGHYTAYAKNKLNGKVT^ 
uow v o iirto r»uy i v xj\j\j\± v Jjr I \JK±<JJL>li>r x Jvl iroJboooOoolJvjv? IKJr 
SSSQQGFGDDEACSMDTN 

8289 



804 

PHAPSREGLSTVGSICSPQVDIGLNNGDMSDSVGSLKYICPSSPKEK 
S LF S FLYTE I CLQGLVGKEGS QQMVEGL I S WRRL YCVLRGGKL YCFY 
S PEE I EAKVEP AL WP I NKMLEREREGERAERNPS QENKEWRR PGQF 
ALEMEGPSLRGPAIjRLAGLPTQQETRIRAMDKDAKKRIHNFSVINPV 
PGQAI TQI FAVDNREDLQKWMEAFWQHFFDLSK 

8290 

A 

164 

399 

TAISHSLSEWSYLCII^VIGALFSSFGEVMFSSMVLMLVDVYLCLV 
IKELGFYYSLCSLGLFVPILLGKTLQAFKGN 

8291 

A 

1 

2226 

MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSISRT 
EAADLCKAFNSTLPTMAQMEKALS IGFETCRYGF I EGHWI PRIHPN 
SICAANNTGVYILTYNTSQYDTYCFNAS7VPPEEDCTSVTDLPNAFDG 
PITITIVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSS 
TSGGYIFYTFSTVHPIPDEDSPWITDSTDRIPATTLMSTSATATETA 
TKRQEAWDWFSWLFLPSESKNHLHTTTQMAGTSSNTISAGWEPNEEN 
EDERDRHLSFSGSGIDDDEDFISSTISTTPRAFDHTKQNQDWTQWNP 
SHS NPEVLLQTTTRMTDVDRNGTTAYEGNWNPEAHPPL I HHEHHEEE 

W I'DUCTCTTrtli r PT3 O OTTDDTJV r POV1?ATilI?r , 'KTn T»TTJ"C/~» VT'TI /~\»T*T~»T} TT'T^v f™> TT c~» rn 

Cilrna lbl lyAi Jroo I IJ&JBi ±Aiyi\_fc»yWr oWKWrlJBGYRQTPKJBUSHST 
TGTAAASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRR 

EGLEEDKDHPTTSTLTSSNRl^VTGGRRDPlTOSEGSTTLIjEGYTSi^ 
PHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTFHP 
SGGSHTTHGSESDGHSHGSOEGGANTTSGPTRTPOTPRWT.TTT.a^T.T. 
ALALILAVCIAVNSRRRCGQKKKLVINSGNGAVEDRKPSGLNGEASK 
SQEhlVHLVNKESSETPDQFMTADETRNLQNVDMKIGV 

8292 

A 

876 

1104 

HFDNCSNCMVTAASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGR 
GHQAGRRMDMDSSHSTTLQPTASPNTGF 

8293 

A 

2 

427 

AIPraKPPESTQFSHHWYLTDWKSGDTYQSLSSSSTATGTTVIECTQ 
VQGSSESLASPSTSRATTPSQLSIEVEAREISSPGRPPGLMSPSSGY 
SSQSETPTPTVSMSLTLGHLPPPSSSVRVRPWPERKSSIjPPTSPME 

8294 

A 

1 

128 

DKEIFSKSFLSHFILLHISGSLPKVHPRYDLDGDASMAKGSI 

8295 

A 

33 

279 

PPVPNS SRRPTRPPTRPRTS YEKQGYLLLPPVFS I VLEVLARAI SQE 
TEIKSVQMGKEEVKLSLVCVCVYLCMRKSCGIHQ 
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A 
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QERLPSGIRPLSGNIRVATAMPPGTARPGSRGCPIGTGGVLSSQIKV 

MxlXtlr V 1 yyuliibnAlbl ISAar'y K.y J. JjUIvd X X x^LjXjXaKoIVX odIi A X JS VJM IS. 

LQKG I EMYNQENS VYLSY EKRAETLAVE I KELQGQLADYNMLVDKLN 
TNTEMEEVMNDxTJMLKAQNDRETQSLDVIFTERQAKEKQIRSVEEEI 
EQEKQA 

8297 

A 

1 

441 

AWKLSKVl^FWANADDSSKTELLFAALKALKYLFRFIIQSRVLYL 

ppvfiocvnrjnPTrHWCTonT t?t . ta ttxtivit .lvmi? dt t<t?7i\tv t vp a a t wt "d c 
Kr AJJoUor IMJMo IKybr LAr XMI v LLj| v H-'Kir Jj£iC*/\V 1\X I\.Vj>/\AXjJ^i Lilri> 

IINDV101»VFDPVELSVLFCKFIQSIPDNQLVRQKIiNCMTKIVESTLF 
RQSECK 

DTQQ 

JO 

A 
f\ 

u« 


nri nunv^Vw v oauvcvAvr j-/ x a r/ii x x a inuivr ax v a jvir vci xjxvt tr x x 
TQFFPTLLLWCFSALLPTIVYYSGFFEDHWTRSGElTOTTMHKCYTIiL 
I FMVLLLPSLGLS SLDLFFRWLFDKXFIiAEAAIRLESGFLPNNGAFL 
KNYV IAS PF I GNAMDLLRI PGLLMYM I RLCLTRS AAERRNVKRHQAY 
E 

8299 

A 

303 

617 

AKPSSGFECSRSAFLSKECGKQLGPSSSSALSLSGSFFHLGAPS * PR 
TPRKSDGAJICRPHPPSSMIGVQLLLGGCLAQHPHHRAQLIaDADVPTT 
ISIKHVKGQP 

8300 

A 

50 

258 

NHWFCCFCCFSDRGEIKLGIlWIKRIHVIRKIFFPFRTEASIiASRR 
EQKREQYRQVKAHVQKEDRRVQ 

8301 

A 

110 

180 

CFPAAPDEDSTTNITKKQKWTVE 

8302 

A 

187 

571 

GDSRSRVITIRQDTAWNAWSASHCRRI.SASSRSPLTGPASVGGMAATG 

ANAEKAESHITOCPVRLLNPNIAKMKED^ 

VKFVPVRflCJP^RMKAPTRCJRaPHrnHNPAR^RYT. 

V Xvr V v» V UuOrOru ll\rir X IvOUniulUUllivivnUOw X Xj 

8303 

A 

2 

150 

FVDLLISNIAKMKEDILYHFNLTTSRimFPALFGDVKVRGQVLTPWN 
LA 


TV 

TOT 
6 £. 1 

n cr "7 

ANAEKAESHNDCPVRLLNPNIAKMKEDILxTiFl^ 

GDV\KFVC\VGGSPSRNBKPSSGCRLGA\ELGIxDCPGRDYPNICAGT 
DRYAMYKVGP VLS VSHGMG I PS I S IMLHEL IKLL YYARCSNVT 1 1 R I 
GTSGGIGLEPGTWITEQAVDTCFK 

8305 

A 

2 

358 

MAFLWLVCCFALVGATFGCGVPAIQPGLTGLSRIVNGEDAIPGSWPW 
S/aoxjS^xJK. Xv?i*ilr l^ovsoXvXoiSJNVV V v x/\/vrll_Aj VJvl X JJV V Vf\\j&rlJ\2\>als 
EKNVQVLKIVQVFKNPKFNSFTVR 

ft Ofi 

O j v/ D 

n 

i 

X 

/ X*± 

X tr i\J\J-\jr I\J\S\rLr\ X o \J V X" V AjvjAt 17 X X/-1 VXj1 t 1oM\^±jO VY.M.XlVEi.E»ll V X lyHJir X 

lxNPDQSGEFMFDFDGDEIFHVD^lAKKExVWRLEEFGRFASFEAQGAL 
ANI AVDKANLE IMTKRSNYTP I xlTOKFTPPVVlWxT'^xxRNGKPVTTGV 
SETVFLPREDHLFRKFHYLPFLPSTEDVYDCRVEHWGLDEPLLKIIWE 
FDAPS PL PETTENWCALGLTVGLVG IIIGTIFII KGLRKSNAAERR 
GPL 

8307 

A 

225 

384 

ITFLFIKNPTFDSECYFFPHSLPCTLPYFPPEFDAPSPLPETTENVV 

\-~nXJV7XJ 

8308 

A 

2 

862 

FLS VLPHSRALLTPKRAPKKKMAI SGVPVLGFFI IAVLMSAQESWAI 

XVCaCtXl V liyrtTif X A-iAV rfy^ OXj JC l v iX7 Uc xJ\3UE* X At ti V IJrxrtXVTUC* X V rXrtXJlIilZi jT 

GRFASFEAQGALANIAVDKANLEIhlTKRSlTCTP 

PVELREPNVL I CF IDKFTPPVVNVTWLRNGKPVTTG\VSET\VFLPR 
\EDHFFR\KPHYLPF\LPSTEDV\YDCR\VEHWGIaDEPLLKHWEFDA 
PSPL\ PETTENWCALGL\AVGLVGI I IGTIFHHSRELRKSNCRQKR 
KGGLCKAQWR 

8309 

A 

2 

224 

INNSLF I KKGPTALQPQHRTRPWTATASPAJRGRFATSMPS SRliAATV 
AGCG I PDSGRPAPTRDSQNMAS PPDWH 

8310 

A 

1 

3253 

MELSVTLVSRAPRPGL^VFRQLNTAIAVSQMSSGQCRLAPLIQVIQD 

CSHLYHYTVKLLFKLHSCLPADTLQGHRDRFHEQFHSLRKFFRRASD 

ML YFKRL IQI PRLPEG PPN FLRAS ALAEH I KPVWI PEEAPEDEE PE 

NLIEISTGPPAGEPVWADLFDQTFGPPNGSVKDDRDLQIESLKREV 

EMLRSELEKIKLEAQRYIAQLKSQVNALEGELEEQRKQKQKA1.VDNR. 

QLRHEl^QLRAAQLEGERSQGLREEAERKASATEARx^TKLKEKHSEL 


,1, 0 O S «+6 "43 . o h a f » CP E 






VHVHAELIiRKNADTAKQLTVTQQSQEEVARVKEQLAFQVEQVKRESB 
LKLBEKSDQLEKLKRELEAKAGELARAQEALSHTEQSKSELSSRLDT 
LSAEKDALSGAVRQREADLLAAQS LDDFS PTS KLQRLLAESRQMVTD 
LELSTLLPISHENIiTGSATNIRKCCLQSCSHGLIAKPSATLIQHVtili 
LFIAEGTLRVEGSYSFTATAPDTALDSDGEPGQGPLCNFNAKLTSGC 
DRGRLWHPRQRPVRRFSEQQRSHRCGPATALFQAARAHPPSPGLQWP 
AISISKAINTQEAPVKEKHARRILGRLLDPGVGHHSGHTPREVGFTF 
WSTPLGCRCPAVHSQLEFCHVTjKGPRECTHGCLMPQQHVCCFRNSPA 
AMVQPQRLLHSRTRTVGTGLLSPANLPNPQRIPKRMPPAPPGDPAPP 

gigngacplqsmagpsgyiqdtgsaissiqqtstpqcsrenqakalc 

PPGTSLTPEVLQC I QI AGPPGLSGLTVCVIjKMQTS S PDS SAAGLGEP 
RNLRPATGSLTTEVPGLWSCHISQLESQEQELRQRLLDEQFSVFRGA 
AAEAAVILQDAVS KLDDPLHLRCTS S PDYLVS SAQEALDAVSTLEEG 
HAQYLTSLADASALVAALTRFSHLAADTI INGGATSHLAPTDPADRL 
IDTCRECGARALELMGQLQDQQALRHMQASLVRTPLQG I LQLGQAAV 
DGGGRRLPLSDSHPHQELKPKSLDVRQEELGAWDKEM/AATSAAIE 
DAVRR IEDPQLLHR/ H * * R* GAVTRGGLHLEGHQSLLS WSRQAIRLL 
VTTST 

8311 

A 

3 

3253 

ESRGRSRTKAGGGGRMNS I KNVPARVLSRRPGHSLEAEREQFDKTQA 
ISIS KAINTQEAPVKEKHARR 1 1 LGTHHEKGAFTFWS YAIGLPLPS S 
SILSWKFCHVLHKVLRDGHPNVLHDCQRYRSNIREIGDI»WGHLHDRY 
GQLVNVYTKLLLTKISFHLKHPQFPAGLEVTDEVIiEKAAGTDVNNIF 
QLTVEMFDYMDCELKLSESVFRQLNTAIAVSQMSSGQCRIAPLIQVI 
QDCSHLYHYTVTCLLFKLHSCLPADTLQGHRDRFHEQFHSLRNFFRRA 
SDMLYFKRLIQIPRLPEGPPNFLRASALAEHIKPVWIPEEAPEDEE 
PENL IEISTGP PAGE PVWADLFDQTFGP PNGS VKDDRDLQI ESLKR 
EVEMLRSELEKIKLEAQRYIAQLKSQVNALEGELEEQRKQKQKALVD 
NEQLRHELAQLRAAQLEGERSQGLREEAERKASATEARYNKLKEKHS 
ELVHVHAELLRKWADTAKQLTVTQQSQEEVARVKEQIJIFQVEQV1CRE 
SELKLEEKSDQLEKLKRELEAKAGELARAQEALSHTEQSKSELSSRIi 
DTLSAEKDALSGAVRQREADLLAAQSLVRETEAALSREQQRSSQEQG 
ELQGRLAERESQEQGLRQRLLDEQFAVLRGAAAEAAGILQDAVSKLD 
DPLHLRCTSSPDYLVSRAQEALDAVSTLEEGHAQYLTSLADASALVA 
ALTRFSHLAADTIINGGATSHIiAPTDPADRLIDTCRECGARAIiELMG 
QLQDQQALRHMQASLVRTPLQGILQLGQELKPKSLDVRQEELGAWD 
KEMAATSAAI EDAVRR IEDMMNQARHAS SGVKLEVNER I LNS CTDLM 
KAI RLLVTTSTSLQKE I VESGRGAATQQEFYAKNSRWTEGLI SASKA 
VGWGATQLVEAADKWLHTGKYEELIVCSHEIAASTAQLVAASKVKA 
NKHSPHLSRLQECSRTVNERAANWASTKSGQEQIEDRDTODFSGLS 
LIKLKKQEMETQVRVLELEKTLEAERMRIiGELRKQHYVIxAGASGSPG 
EEVAIRPSTAPRSVTTKKPPLAQKPSVAPRQDHQLDKKDGIYPAQLV 
NY 

8312 

A 

1 

673 

SKSELSSRLDTLSAEKDALSGAVRQREADLLAAQSLDDFSPTSKLQR 
LLAESRQMVTDLELS TLLP I SHENLTG SATNI RKCCLQS CSHGLI AK 
P S ATL I QHVLLLF I AEGTLRVEG S YS FTATAPDTALDSDGEARPGAA 
FATSMPSSRItAATVAGCG I PRQRPVAGGYGGGWHRCTRDSQNMGKPR 
LTGHGQLNRPVTAS FPLRKQLTVGVS S PQEQGMYGH 

8313 

A 

177 

395 

ETPGGFKI)NLHAVICLAENSVGPNATRPDDIHIjLFSGKTVEINNTDA 
EGTIiVLADGVPYACKDIjGADT TLiDM 

8314 


177 

402 

ETPGGFKDNLHAVICLAENSVGPNATRPDDIHLLHSGKTVEINNTDA 
EGTL VLADGVS YAC KDLGAD 1 1 LDMAT 

8315 

A 

3 

831 

QKNTHYMMIFDAFVIIiTCLVSLILCIRSVIRGLQLQQEFVNFFLLHY 
KKEVSVSDQMEFVNGWYIMI I ISDILTI IGS ILKMEIQAKSLTS YDV 
CSILLGTSTMLVWLGVlRYLrGFFAKYNLLILTLQAALPNVIRFCCCA 
AMIYLGYCFCGWIVLGPYHDKFRSLNMVSECLFSLINGDDMFATFAK 
MQQKSYLVWLFSRIYLYSFISLFIYMILSLFIALITDTYETIKQYQQ 
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DGFPETELRTFISECKDLPNSGKYRLEDDPPVSLFCCCKK 

8316 

A 

1 

473 

LIPSDRPIiIASTHACADSNDLKVIIISAVGPVFSSVHDLKBLTEEQG 
RDYHAEVFQTCS KVALEMLFTGEP I S AQEALLHGLLS KWPEAELQE 
ETMRIARKIASLSRPWSIiGKATFYKQLPQDLGTAYYLTSQAMVDNL 
ALRDGQEG I TGFLQKR 

8317 

B' 

549 

602 

MWWNSLDGIRNIVLSNPKKRNTLSLAMLKSLQSDILHDADSNDLKVI 
IISAEGPVFSSGHDLKELTEEQGRDYHAEVFQTCSKVMMHIRNHPVP 
VIAMWGLATAAGCQLVASCDICRGERQDLFCHSWVPRKVAIiEMLLT 
GEPISAQEALLHGLLNKGGPEAELQEETMRIARKIASLSRPGVSLGK 
ATFYKQLPQDLGTDYYLTSQAMVDNLALRDRQEGITAFLHKRKPVWS 
HEPV* 

8318 

A 

1 

350 

PCCSHPNAVLGVQQTLEEMDFQRGIWSAALNGDLGRVKHLIQKAEDP 
SQPDSAGYTALHYASRNGHYAVCQFLLESGAKCDAQTHGGATALHRA 
SYCGHTEIGRLLLSHGSNPKVG 

8319 

A 

272 

364 

YYKCGAQFALVTLGTLGTYTAFTVAVTRWR 

8320 

A 

1 

322 

I FGELLLFLHHQ Y PGD I GC FA I Y FLNLLTIiKPGEAMFLEANVPHAYD 
KGGE PHFS S E PHCHPCWAL FP YLDKGRRRWKETPKDLAAIjTS EGS W 
AQQATAGRDDECI 

8321 

A 

37 

344 

PPAPDTDPTPTYPGLTLLLTNPRTNSTLPPTLHIiPSRPTPSSPPSTS 
LPPSLVLGFFLKIQSGKRRAYPEHFWVQNGERKAQPSVHAPVNSASA 
YVDAAANS 

8322 

A 

3 

414 

SARLHTVYRVPRLNHPPEEIGCHSRVPGSAHPSEKPPSAPASAPQNV 
FSTTVS SGYNTKKIGPRLNI QLKKGTEGLGFS ITSRDVTIGGSAPI Y 
VKN I LPRGAAI QDGRLKAGDRL I EGNGVDLVGKSQEEWSIili 

8323 

A 

37 

438 

S L P RGGFRCCLCHVTTANRPS LD AHLGGRKHRHL VELRAARKAQGLR 
SVFVSGFPRDGDSAQLSEYFLAFGPVASWMDKDKGVFAIVEMGDVG 
AREAVLSQSQHSLGGHSLRVPPPEQKESRAPPPNPQRSGP 

8324 

A 

2 

425 

GRVGKLRKMMIIAENKKDMSGHYQNALYLGDGSNRGRILKNCGQKSL 
AYFTSATHGLYEDTESLKETFDPKKETIPHIYPNARLLQPPAPILPL 
DTNWTLLTVSNGFFEGTI ARKGKGGVLAAC ID I VTGWS IGWRRGVKL 

8325 

A 

543 

848 

KS QE S EC PDS VQRD VLS GGRHTHVKRKKVTFLEEVTE Y Y I SGDEDRK 
GPWEEFARIXSCSVOKRIQETEDAIGYCLTFEHRERIGNRIjQGTCCKG 
LNVLKQC 

8326 

A 

2 

242 

PPDCSRAACPVLCSGNGQYSKGRCLCFSGWKGTECDVPTTQCIDPQC 
GGRG X C X MG b CALN 5 G Y KG E b L. t» EArR Y X P .KJUs 

8327 

A 

2 

246 

AVPDCSRAACPVLCSGNGQYSKGRCLCFSGWKGTECDVPTTQCIDPQ 

LbbRblLlMbbLALWbblKbbbLbhAPKllr 

832 8 

A 

2 94 

402 

X IHIjYF CVr r EDHSDNPXjGAAV rLAHbLbH 1 r GW I 

8329 

A 

259 

350 

PSGESWPSMGHRLIPGIRCGPNNREESGDR 

8330 fj 

A 

2 

410 

RPPPPVSSVPGPGRDQLGVTPLSGRHSLLCCWVRIVFPESDGAP*MA 
SFPPRVNEKEIGKLLLNLVDHTEVVRDLTFAPDGSLILVSASRDKTTi 
RVWDIjKDDGNI^KVIjRGHQKIjV^ I 

8331 

A 

453 

684 

SLRFVLSVFCQSLLLGKAESSKYBTRETPKAWKESAAPELLGWA\HS 
SGLTCDIKCCRARDLPWSGPKIHTSVMCPSS 

8332 

A 

1 

92 

FFFFFFFFFFFFFFFFFFFFFYKSIiIKKLNKNI 

8333 

A 

67 

301 

LGIiLAIiATTPQL I FVFFVKMGPDWSR I PELKQSTHLGLPKCWNHRHK 
PLCNTFQSYFYPWTYNCHPSCKKIKVKWRQV 

8334 

A 

3 

276 

S D C TWT X NLP I G FG VHL»Q FVNF S TE T I HD Y IJsVRSGSSETSTVXGKIj 
SGPQIPSSLFSTTHETSLYFHSDYSQNKQGFHIVYQGKRNNKK 

8335 

A 

1 

564 

MSLGCFSETWTPPNGSVGMYTSAIQMSVFSFAALGGQTEVQFGNMPR 
RGTVSSNERSHYSRYGGGITVLREEKGRNRHIDTCHCVRIAYSVQYS 
NMLCRERVENTSHPGEMQVTIQNLMPATVYIFRVMAQNKHGSGES SA 
PLRVETQPEAISDVFHYLQILVLPNTTTLLSSSADELISYFRGYSSR 

8336 

A 

1 

364 

KSQLMCQQMYMGEKPFGCRCCEKAFSNKSYXLVHQQTHAEEKPYGCN 
ECGKDFSSKSYLIVHQRIHTGEKLHECSECRKTFSFHSQLGIHQRIH 
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TGENPYECCECGKVFSKKDOT.VSHOKT 

8337 

A 

3 

103 

GTLCCDS SQTQTEWMTS I F 1 AQVRDLFELS SRV 

8338 


1 • 

744 

V AMVJ V JC OXvOxrXvJU VrtOv?i/JJo V UC» JjJj^XU*! V yj>lx!»l^^xiXr MxJiAVO IJfnV 

APPSSSPGRSHSKDRTLGKPDSLLVPAVASDSCNNSISLLSEKLTSS 
CSPHHIKRSVVEAMOROARKMCNYDKILATKT^^ n 

RQARTGNNFVKRRPGRPRKCPLQAVVSMQAFQAAQFVNPELNRDEEG 
AALHL S PDTVTDV I EAWQS VNLN PEHKKGLKRKGWLLEEQTRKKQK 
PLPEEEEQENNKR 

8339 

A 

3 

266 

VGWQQGVPEPPPGKAVTVPLRPADPFKGKKAPARSGFPAIiFPGDPPG 
EESLGRRRPGSSPLTPLSIPEVIKIFVPPAPQKSGHFFSF 

8340 

A 

481 

637 

LFLPFSRVLLERL I VNR PHPWGLL I TF I ELI KNPAFKFWNHE FVHCA 
PGRVG 

8341 

A 

1 

3335 

LNLKFE I EVLCKNLALD INELKPGNLLKDKDRLKNLDEQLSAPKKDV 

KQPEELPPITTTTTSTTPATNTTCTATVPPQPQYSYHDINVYSLAGL 

APHITLNPTIPLFQAHPQLKQCVRQAIERAVQELVHPWDRSIKIAM 

TTCEQIVRKDFALDSEESRMRIAAHHMI^lRl^TAGMAMITCRE 

ISTNLKNSFASALRTASPQQRE^DQAAAQl^QDNCELACCFIQKTA 

VEKAGPE^KRLATEFELRKHARQEGRRYCDPWLTYQAERMPEQIR 

LKVGGVDPKQIaAVYEEFARIWPGFLPTNDLSQPTGFLAQPMKQAWAT 

DD VAQ I YDKC I TE LEQHLHA I PPTLAMNPQAQALRS LLE VVVLSRNS 

RDAIAALGLLQKAVEGLLDATSGADADLLLRYRECHLLVLKALQDGR 

AYGSPWCNKQITRCLIECRDEYKYNVEAVELLI 

AQSMENGLNY>1AVAFAMQLVKILLVDERSVAHVTE 

INAHSRGNAPEGLPQLMEVVRSl^EAMIDRAHG^PNF>llWSGISQi\S 

EYDDPPGLREKAEYLLREV7VNLYHSAAA.GRDSTKAFSAFVGQMHQQG 

I LKTDDL I TRF FRLCTEMCVE I SYRAQAEQQKNPAANPTMIRAKCYH 

NLDAFVRLI ALLVKHSGEATNTV^ 

QSEFQQLPYHRIFIMLLLELNAPEITV^ETINFQTLTAFOmTFHILRP 
l iS-Hi-'or V x/VWL»JiljJLi>jiKlr XAKMlAAHTPQQKGWPrlYAQIjL 
APFLR1WELTKPMQILYKGTLRVLLVLLHDFPEFLCTDYHYGFOTVIP 
PNC I QLRNL I LS AFPRNMRLPD PFTPNLKVDMLS E INI APR I LTNFT 
GVMPPQFKKDLDSYLKTRSPVTFLSDLRSNLQVSNEPGNRYNl^ 
ALVLYVGTOATAHTHMTCnQTPQMQTTTWc? awmtiT unxn twttyt T» r r t T7r , 'D 

viji vu A A. A. XAAM I\0 O A r Ol'lO A X A XI O/ATllTliJ X A* \Jvi LAV ULiU A HiLjK. 

YXFLNAI ANQLRYPN\ SHTHYF\ SCT\MLYPFCRGPFRKPSQEQITR 
VLLERLIVNRP\HPWGLLITFIELIKNP\AFK\FW\NHGICYTCAPG 
NSEKLFPVGSAQWLAWGQEGQGPSKLIGRGTGIiPS 

8342 

A 

1 

520 

KMD *5 TE P P Y ^OKP YPF T VKPVQ TV T ic T nvTvrDTvrt/ ft inra x or'WKrr'n'Kr 
xvl ai_/o Aorr x oyxuv a c»CiA v rs_d v o A x JLxvxvJlu a IN at1«> 1 V/A_r V Jr -L oumPJuJJIv 

MLEPSANMPWFKGWKVTOKDGNASGTTLLEALDCIL/PPTRPTO 
RLPLQDVYKIGGIGTVPVGRVETGVLKPGMWXMHHEALSEALPGDN 
VG FNVKNVD VKDVRRGNG AGD S TNE PTMEANG FT 

8343 

A 

1 

443 

AYTLG VKQL I VG VNKMD S TE P P YSQKR YE E I VKEVS TY I KKI G YNP0 
TVAFVPT cjGWTJn'H'WMT.PPc: ftNTrfPWPTfr2WTr\/TT?TrT , »r , "KTft or""i t tat t» 

A Vrt-C V AT A. i3\jri|HOJL/lMl i AJA> AT 0«AM 1 T 1 AT r» AT IVvjVV AvV A t\.J\XJ\3lHJ\^\D A, 1 lilir.AliU 

CILPPTRPTDKPLRLPLQDVYKIGGIGI^PVGFAPVNV^TEV^V^M 
HHEALS 

8344 

A 

41 

666 

PLHLAEEKNGYNPDTVAP^VPT ^RWMRnWMT.PP Q AWMPWI?Tff"IWTri7TPTf 
t uiiMrujijiuvu x ivru x v n.c V At a. OVj>infiMVjAVA»J*lAJAjA^OAnA>ll ir iAT V» JC IxunXVV 

DGNASGTTLLEAIaDCILPPTRPTDKPLRLPLQDDYKISGWNGDNMLE 
PSA1WPWFKGWKVTRKDGNASGTTLLEAIjDCILiPPTCPTDKPLRI»PL 
QDWKIGGIGTVPVGRV^TGVXKPGlWrc^ 
ALSETLPGDNVSFNVKNSVC 

8345 

A • 

2 

396 

AD I ERLPJ4RRVMAAATHAS Al SGWNGDNMLE PSA1WPWFKGWKVTRK 
DGNASGTTLLEALN\ CG I GTVPVGRVETGVLKPGMEVTFAPGNVTTE 
VTCSVEKHHEAXSEALSGDNAGFIWKNVSVx^ 

8346 

A 

1 

461 

MLLEAIjDCILPPTRPTOKPLHLPLQDVTKIGGIGSVPVGRv^TOvXK 
PGMv^FAPVTWTTEVKSVTSMHHEALSE 
RHG1WAGDSKNDPPMEAAG\RFAVTU)MRQaVAVGV^ 
KVTKSAQKAQKAK 
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8347 

A 

1 

743 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCSSIDKRTIEKFEKETA 
EMGKGS FKYGWVLDKLKAERERG I T I D I SLWKFETS KYYVTITDAPG 
HRDFI KNMITGTSQADCAVP I VAAGVGEFEAG I SKNGQTREHALLAY 
TLG VKQL I VGVNKMDS TE P P YS QKR YEEMWLRE VS TY I KKI G YNPDT 

TIHLWVALLFVI 

8348 

A 

208 

94 7 

NWFKGWKVTRKDGNASGTTLLEALDCILPPTRPTDKPLRLPLQDVYK 
IGGIGTVPVGRLETGVLKPGMVVTFATVNVTTEVTCSVENHLVAPLDC 
PRTP\FGPTRPSTYIKKIGYY\PDTLAFEPISGWNGDDMLEPSANMP 
WFKGWKVTRKDGNASGTTLLEALDCILPPTRPTDKPLRLTIQDVYKI 
GGIGTVTVGRVETGVLKPGMVVTFAPWGTT*VKSVAMHHEALSEAH 
SVDNVGFNVKNV 

8349 

A 

1 

1434 

RGITIDISLWKFETSKYYVTIIDAPGHRDFIKNMITGTSQADCAVLI 
VAAGVGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDSTEPPY 
SQKRYEE I VKEVSTY I KKI GYNPDTVAFVP I SGWNGDNMLEPSANMP 

WPK'r3WTfV r PPTrnf3'MA QRTTT.T nPTT DDTDDTnvDT dt dt fvrvirwT 
rtr tW3rvs\ v j. KIUJvjiNrtou J. J. uJj&AXjJJV*. J. Jurr i. xtJtr JL xJ 1ji\.Xj Jr Li\21J V X txX 

GGIGTVPVGRVETGV/LFVPISGWNGDNMLEPRA1WPWFKGWKVTRK 
DGNASGTTLIEALDCILPPTRPTDKPLRLiPLQDVYKIGGIGTGPVGR 
LETGVLKPGMVVTFAPGNVTTEVKSVETl^RALSEAFPGDNVGFNVK 
NVSVKDVRHGNVAGDSKNDPPMEAAGFTAQVIILNHPGQISTGYAPV 
LDCHTAHIACKFAELKEKIDHRSGKKLEDGPKFLKSVDAAIVDMVPG 
KPMCVESFSDYPSI^CFAVRDMRQIVTVGVIKAVDKKTAGAGKVTKS 
AQKAQKAK 

8350 

A 

1 

1254 

MG KEKTH I N I WI GHVDSG KS TTTGHL I YKCGG I DKRT I EKFEKEAA 

RMri KTJ Q PlCYft WVT .OTTT .VByDPDP TTTnTCT wvnrTOvwiprT ttmvtv"» 
nnyixoof ivi/\r» V JUL* A^Ix/-LiSKrlKw .L JL 1U1 oJLivVlvr £1 ± 0 J\.X 1 VI XXUAJro 

HRDFI KNM ITGTS QADCAVLI VAAGVGEFEAG I SKNGQTREHALLAY 
TLVAFVP I SGWNGDNMLEPSANMPWFKGWKVTRKDGNAS GTTLLEAL 
DCILPPTHPTDKPLRLPLQDVYKIGGIGTVPVGRVETGVIiKPGMVVT 
FAPVNVTTEVKS VEMHHEALS E AL PGDNVG FNVKNVS VKDVRRGNVA 
GDSKNDPPMEAAGFTAQVIILNHPGQISTGYAPVLDCHTAHIACKFA 
ELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVESFSDYPP 
LGRFAVRDMRQTVAVG V I KAVDKKAAGAG KVTKS AQKAQKAK 

8351 

A 

1 

1195 

MQSERGITIDISLWKFETSKYYWIIDAPGHRDFIQjNIMITGTSQADC 

A. VT ■ TV A A CZ\Td T? T?P ft. f2 T cifKmrVT'D'Ciia t t A vtt r^\rvr\T t x m is~tjrr\ 0 t» 
«»*JA vrtrtu voDr CiAuioivHuy 1 k r. mm 1 1 > iM y J. Jjvj V JvyLiX V^VNJ\J v lXJo X 

E P P YS QKR YE E I VKEVSTY I KK I G YN PDTVAFVPI S GWNGDNMLEPS 

ANMPWFKG WKVTRKDGNAS GTTLLEALDC I LP PTRPTD KPLGLPLQD 

VYKIGGIGTVTVGRVETGVI J KPG^TVVTFGPVNVTTEVKSV^ 

GEAL PGDNVG FNVKNVS VKDVRRGNVAGD S KND PPMEAAGFPAQVI I 

LNHPGQISAGYAPVIiDCHTAHIACKFAELKEKIDRRSGKKLEDGPKF 

LKSGDAAIVDMVPGK£MCVESFSDYPPIjG 

VDKKAAGAGKVTKSAQKAQKAK 

8352 

A 

1 

2471 

MGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAA 
EMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYVTIIDAPG 
HRDFIKNMITGTSQADCAVLIVAAGVGEFEAGISKNGQTREHALIaAY 

VAFVPISGWN\GDNMLEPSANMPWFKGWKVTRKDGNASATTJ^EA1^ 

CILPPTRPTDKPLRLPLQDVYKIGGIGTDPAGRVKTGVXiKPGMV/VA 

PFAPVNGTTEVKSVEMHHEALSE/ ALLLGDKGAFNVKNVSVKDVRRG 

\NVAGDSKOT>PPMEAS\GFTAQVIILNHP\SQKNARHMPLEIJD 

HIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIVDMVPGKPMCVB 

S FSDYPPLGRFAVRDMRQTVAVGVI KAVDKKAAGAGKVTKSAQKAQK 

AK 

8353 

A 

184 

819 

ETE I KMASRG KTETS KLKQNLEEQLDRLMQQLQDLEECREELDTDE Y 
EETKKETLEQLS EFNDSLKKI MSGNMTLVDELSGMQLA I QAAI SQAF 
KTPKVIRLCGKK/HHPVQLRPR * AEMDRDL MVGKLERDL YTQQKVE I 
LTALRKIX5EKLTADDEAFLSANAGAILSQFEKVSTDLGRPPSxTWxTLi 
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LDSHPSKNIDVPSKISFLLKIQDL 

8354 

A 

1 

387 

GTRSGNMTLVDELSGMQIiAIQAAISQAFKTPEVILLFAKKQPGQIjRT 
RLAEMDRDLMVGKLERDLYTQQKVEILTALRKLGEKLTADDEAFLSA 
NAGAILSQFEKVSTDLGSGDKILALASFEVEKTKK 

8355 

A 

3 

1022 

FLASLQWVQTSSFSSEKFVITDLLKPTSVNLSKSFSVQLCSIAGGSL 
RPTPEIQEVKTPEELETFMLKHGENIIDTLGAEVDRLEKELKSGSLV 
PCIPAAPAVTKRGQGTAPAIASVGGSPKPWQLPRGVGPVGAQKSRTE 
VWKSLPRFQRMYGKHLDAQNDRSTDSLHSAPGKATDTQCQPLKAAKS 
<jJjx Jt*/^JLjl^WoC-PWXjWriF 1 r (-X SVTWSDLiEQXiRKIRRRSPHEDTESF 
TVYLRSDVEAKSLEVWGSPEALAREKKLRKEAEIEYREKGS INSATL 

T /"PTTQ'PYT.Q'nT'PV /PTDCPaWPT TfPTPVTTDTMaXTCnODMTPCT PT T 
X / fir oKI boU J. rC I / K. 1 KbLHW^,JLiJ\L X V.1VX Xxr ifl/vNbLb JfcrN 1 CoJuv, JjJj 

YSSPALPWTLKFA 

8356 

A 

4 

516 

PLWWNSVI IAATCMGLTS ITGNPLYDSLGSLGVGTLLGMVSAFLIYT 
NTE7VLLGRSIQPEQVQRLTELLENDPSVRAIHDVKATDLGLGKVRFK 
AEVDFDGRWTRSYLEKQDFDQMLQEIQEVKTPEELETFMLKHGENI 
IDTLGAEVDRLE KELKKRNPEVRHVDLE I L 

8357 

A 

1 

248 

GSTHASERRQNKTTTRTKVIPAASEEQLLLVELVRSISVMRAETVIQ 
TVKEVLKQPPAIAKDKKHLSLEVCMLQFFYAYIQR 

8358 

A 

3 

4135 

IPMWSDFDLPDQQIEILQSSDSGCSQSSAGDNLSYEVDPETVNAQE 
DSQMPKESSPDDDVQQWFDLICKWSGLEVESASVTSQLEIEAMPP 
KCSDIDPDEETIKIEDDSIQQSQNAliLSNESSQFLSVSAEGGHECVA 
NGISRNSSSPCISGTTHTLHDSSVASIETKSRQRSHSSIQFSFKEKL 
SEKVSEKETIVKESGKQPGAKPKVTCIjARKKI>DDKKKSSNEKLKQTSV 
FFSDGLDLENWYSCGEGDISEIESDMGSPGSRKSPNFNIHPLYQHVL 
LYLQLYDSSRTLYAFSAIKAILKTNPIAFVNAISTTSVNNAYTPQLS 
LLQNLLARHRISVMGKDFYSHIPVDSNHNFRSSMYIEIDISLCLYYM 
RSHYPTHVKVTAQDLIGNRNMQMMS I EILTLLFTELAKVIESSAKGF 
PS F I SDMLS KCKVQKVI LHCLLS S I FSAQKWHS EKMAGKNLVAVEEG 
FSEDSLINFSEDEFDNGSTLQSQLLKVLQRLIVLEHRVMTIPEENET 
GFDFWSDLEHISPHQPMTSLQYLHAQPITCQGMFLCAVIRALHQHC 
ACKMHPQWIGLITSTLPYMGKVLQRVVVSVTLQLCRNLDNIiIQQYKY 
ETGLSDSRPLWMASIIPPDMILTLLEGITAIIHYCLLDPTTQYHQLL 
VSVDQKHLFEARSGILSILHMIMS SVTLLWSILHQADSSEKMTIAAS 
ASLTTINLGATKNLRQQI LELLGP I SMNHGVHFMAAI AFVWNERRQN 
KTTTRTKVI PAASEEQLLLVEL VRS I S VMRAETVI QTVKEVLKQPPA 
I AKDKKHLS LEVCMLQF F YAY I QR I P VPNLVD S WAS LL I LLKDS I QL 
SLPAPGQFLILGVLOTFIMKNPSLENKKDQRDLQDVTHKIVDAIGAI 
AGSSLEQTTWLRRNLEVKPSPKIMVDGTNLESDVEDMLSPAMETANI 
TPSVYSVHALTLLSEVI4AHLLDMVFYSDEKERVIPLLVNIMHYVVPY 
LRNHSAHNAPSYRACVQLLSSLSGYQYTRRAWKXEAFDLFMDPSFFQ 
MDASCVNHWRAIMDNLMTHDKTTFRDLMTRVAVAQSSSLNLFANRDV 
ELEQRAMLLKRLAFAIFSSEIDQYQKYLPDIQERLVESLiRLPQVPTL 
HSQVFLFFRVI.LLRMSPQHLTSLWPTMITELVQVFLLMEQELTADED 
loKlbbFb VAoijlil 1 x lCjCaNUr bTb YNSQRWIjNLYIjSACKFIjDIjAIiA 
LPSENLPQFQMYRWAFIPEASDDSGLEVRRQGIHQREFKPYVVRLAK 
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QVFNSKVTSRCGGHSGSPILYSNAFPNKDMKLENHKPCSSKARQKIE 
EMVEKDFLEGMI KT 
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KEFSELVARIDYN1X3EFLLIEAADFLPKWLDPENSTNRGFFCHGE1^ 
IIPAPIKFGAESWLPTTPPTIPQAI^IITAHSEKILASKSIPSVVNR 
RITGYPEKIQASLNRAHCFLPAGIVAVLKQRPRLVAAAVQAFLL 
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A 

43 

239 

AWGRKTPWLTISSFWPTPGSPYYDNVRPLCYSDSDAVLLCFDISRP 
KTVDSALKKVRLDQLSNL 
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8363 

A 

64 

240 

TLKFWKEGGPPISPPQKKGPFPKIPRGGL*GPPKKKKNFSFPPPVKL 

d P P K"f3 T P TCP & P P 

8364 

A 

19 

471 

TPRQARHEGED FVS VS PQHLHKFLAS I DLN I ENEKQVYNAAI KWLLA 
NPQHHSKWLDETLAQVRLPLLPVDFLMGVGAKEQIVKQNLKCRDLLD 
FARNYHLHLSSRAVPDFEYSIRTTPRKHTAGVLFCVSGRGGSGDPFR 
SIECYSINKN 

8365 

A 

3 

4 01 

ii>\£>>um>\V3Uok/\± jl vKlMvjr* J. ± FMbKbr I PJe*TERQRLQIjLGFLKPEjuxjG 
REFTHLEFPRRVLPKELGQRMLYRDQNMTGWAYKKIELEDLRFPIiVC 
GEGKKARVMATIGVTRGLGDHSLKVCSSTLPIKPFLSC 

8366 

A 

186 

840 

NFTLQKQTAGWIWPMSCSLTTLGLEHSLPRPATEPLERPSRSHIAPT 
EYSQLVPLSSCGCEGLLPLPQATITSHSHPEAGPGLEARTPELQLFS 
GQLSFWSAAEETKTRPPQARATKPPKPKSSWQRSCISSEESETPGLP 
IDFPSCP YALVRLPLRS SLS SQEVRVPQG S SRAKSL PARE WEAW PHP 
ATATVRAWAWPCSGVPLMERGGSRSGPGR 

8367 

A 

7 

633 

PTITNVWSETSQELADGLRRGSQGDSVQQNGPPRKHIVERYTEFYHV 
f i rtiJJAiisjsjCLii.JbU I E»OWKPKTGTTQSRSFRILAQITGTEHLKESEA 
DNTEKANNSQEPSPQLASSVASTRSMPESLDSPTSGRPGVTSLTTAA 
AFKPVGSTGVIKSPSWQRPNQGVPSTGRISNSAAYSGSVAPANSALG 

RKHIVERYTEFYHVPTHSDASKKRLIEDTEDWRPRTGTTQSRSFRIL 
AQI TGTEHLKES EADNTEKANNS QE PS PQLAS S VASTRSM PES LDS P 
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AYSGS VAPANSALGQTQPSDQDTLVQRAEHI PAGN 

8368 

C 

78 

386 

MTHLEAQNKIKGCTGSLNMTLQRASAAPKPEPGSCSKAHSHQRVFRD 

r ourtOttu x rtCjCUlrK. v l VIM X. yrvvjxVx FFiS^irr'KlSjr'HCoAIj YRVJjSCTHF 

TVMPARRD* 

8369 

A 

20 

443 

^ JX^VTVWf^PPfSPAOT.fST <*2TJT nD\TDr , VDPeoAiiJPmrnnec''mi*T t <«^r»r» 

PATASELIKFPLSSQAQNSCKWSYIRKGPARSSIPSPFLASADSCRV 
VSCPDOPRAPCRGOCWPnf?VnAA<5 AT.PWOPADCT/rPT cvoftDTBD 

8370 

A 

49 

354 

SCHIIPSLTFPTRFIQRSPVTLLRSQWILILQWAIASTTLDLRDAN 
CSVMRFVPJDLIHTGVANDVSIKMTPPT.T»Nc:PVT.T <5T rpQTrpwnHDC 
ISFLDLYW 

8371 

A 

239 

402 

IESPETNSCIDGQLLFDGGIRITHWAKDSSSSSSSSSSSSSTCRRMK 
LDPYLTP 

8372 

A 

141 

351 

HTTASPFWQGLTNVRCROTSLTTFQVIPDIjDGWFHAQPVQLQLLSHP 
VTEFILPPSLVLPLPGESHFLL 

8373 

A 

239 

474 

nKmv iwwKUVr LiAiorori' \KP^yijliKQxTAJjYRKATSPN*FAPNCY 
PDPQPHLG*IxAPVAHVPIi*QRRKAESPPSSN 

8374 

A 

1 

740 

i*i v oi/n\j\c*juoriiN v JLiJf Xjiyx X JrlM Vo VJNr or Ao v jl LAob 1 J_ixr uor AriPJSA 

TS RGAVATGTTHLAS AVE PNGDS W CKQRS PRVSVHYI R I N I TGWVCT 

CCTIERNIMLSPSIxNlRlWITARLRPKTGTQMPLREQRYTGIDENAH 
WERRVF / R VP ALHLR ft P53 OT ,P KOY T IV T , VP TC A T c dm * tt a dm r» v "dfitd n 

PHLDRLAPVAHFPL * Q * RKAES PPS SN* VARGTCTS * LS KPRTVWKD 
PVARInTRPPVGPT 

8375 

A 

803 

985 

GLALTPRTAGGGGLTAIxRRGSLEPGRGRGWLLLPVSQEVCTTPAILG 
vissspprilgtr 

8376 

A 

1 

319 

VDDLLLGHPTAVGCTKGTDALHRHLEDCGYKVSKKKAQICRQQPWAS 
VIDWEljYVIX5MSPPNP(V^PPnilon2lV\7T , T nT\A7T?aDOT DnaTanrivax? 

LI AF 1GALELSE 

8377 

A 

339 

808 

GSQSQGGNRGWFSVPASRGVPPPAVMGVLRARGGRGAGSQSPPPGGC 
LPPMRWGSGATRS1^I J WWETSDGLEPTPSRRWWGSS*TL/GGSPVQV 
Y * LVTPDSG S SH * KANS FCLSGDNL I GKKAS FRS KQ VNQLS SAGS KP 
NNVDGLGTVGCKVSVA 

8378 

A 

588 

2278 

GWAHGGEACFCVPAIxHLCRPSQLEKQYPVLY* KATSSN* FAPNYYPD 
P*PHLG*XiPPW 

8379 

A 

779 

986 

ELQLYWLCKKPPIDIRSHWKSC*AGISTSRLCQLLPKR*GSGVLPAD 
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SMMRGKV\SRSAWHRSLWSSPL 

8380 

A 

179 

795 

PAFFLIPGCAPPVG* * IPSNVPAVGRS\DFFSWTPYTHSPPDAGVEH 
PFPWRTRLPWGVPAKGHQRTGATRSLGFWWDTSGGLEPTPPRRWWGS 
S * TLEEARS KCTE * * HLTPDL PTERQTASGS QGR I * C * RKHL S / WS K 
QVKQLSSAGSNPSNVYGLG/DCWM*SHCSLLNQAQILSRPWLGLRL 
GNRQEGS VPWRLTRNYHS KGS 

8381 

A 

1 

1029 

MGS CMARAS PMGATPC FAAVG P ID CQ AE ECRHTVWD WWVALPT AAVG 
DPIGQPIQYARSTCEAGVGELHIQDALPEDHGTYTCLAENALGQVSC 
S ARVTVREGTS PG PKGKTGHCGLG S VS PLGMKELALMENVAE PG S VK 
LSARTVQVQPSLALSVAPGHGGEHGCGASGLGVDDLLLGHPTAVGCA 
TGTDAPHRHLEDCGYKVS KKKAQI CQQQHDARQGPAVPPGIQAYGAA 
PFEDLQVDFREMPKCGGNKYVLVLGRTYSGWVEAYPTRTEKTREVTP 
VLLRDLIRRFRPPLWIGSDNGPAFLAALVQKTAKGIQNNITGGVYTL 
CDIDSHIILFRSGY 

8382 

A 

989 

2798 

EENCRNRLGLQGRIHPQLLQIASQVFVNRYAVSRKENGKENGGQARP 
HADLFVSCSNQRGPRKEARERGPWERNSAWLSEFAALMTFCWDTPRQ 
SGVPREQMPYTAPGGLWVEGVQEESSDLPTADGVSKGWPPCWRALAA 
TALL VQEANKLTLGRNLN I KAS RAVVTLMNTKGHHWLTDARL I KYQT 
LLCENPRITIEVO^TLHPASLLPVSESTVEPDCVEVIJDSVDSSRPDL 
RDQAWASGDWKLYVYGSSFFNPQGERGLQVDFTFTEMPKCGGNKHVL 
VLGRTYSGWVEAYPTRTE KAGEVI PVLLRDV I PRFRPPLW IGSDNGP 
AFIJVALVQKTAKPAIiALLSLSLSMYLVTVLRNLFS I LAVS SDCPLHT 
PMYFFLSNLCWPDIGFTSAMVPKMIVDTQSHSRVISHAGCLTQMSFL 
LLVACIEGMLLTVMAYDCFVAICRPLHYPVIVNPHLCVFFVLVSFFL 
SLLDSQLHSWIVLQLTIIKNVEISNLVCDPSQLLNLACSDSVINNIF 
IYFDSTMFGFLPISGIFLSYYKIVPSILRISSSDGKYKAFSTCGCHL 
AWCW F YGTG IGMYLTS AVS P PPRNG WAS VMYAWTPC 

8383 

A 

2 

397 

S I LKLPQWLKGLLAFLVKPLL PRLS AFLSNM KSRSAGKLWELQHE I E 
VYRKTVIAQWRALDLDVVLTPMLGPALDLNAPGRATGVVPVTTVTAE 
DE AQMEHYRG Y FGD I WDKMLQKGMKKS VGL P VAVQCV 

8384 

A 

3 

1773 

PGGRQQQAEGIMVQYELWAALPGASGVALACCFVAAAVALRWSGRRT 
ARGAWRARQKQRAGLENMDRAAQRFRLQNPDLDSEALLALPLPQLV 
QKLHSRELAPEAVLFTYVGKAWEWKGTNCVTSYLADCETQLSQAPR 
QGLLYGVPVSLKECFTYKGQDSTLGLSLNEGVPAECDSVWHVLKLQ 
GAVPFVHTNVPQSMFSYDCSNPLFGQTVNPWKSSKSPGGSSGGEGAL 
IGSGGSPLGLGTDIGGSIRFPSSFCGICGLKPTGNRLSKSGLKGCVY 
GQEAVRLSVGPMARDVESLALCLRALLCEDMFRLDPTVPPLPFREEV 
YTS S QPLRVGYY ETDNYTMP S PAMRRAVLETKQS LEAAGHTLVP FLP 
SN I PHALETLS TGGL FS DGGHTFLQNFKGD FVDPCLGDLVS I LKLPQ 
WLKGLLAFLVKPLLPRLS AFLSNM KSRSAGKLWELQHE I EVYRKTVI 
AQ WRALDLD VVLTPMLAPALDLNAPGRATGAVS YTML YN CLD FPAGV 
\ Vx-'V 1 1 V 1/LblJElAyroxvrax^Uxr OJJX 

8385 

A 

660 

774 

PSWLVTTLLLPSSAGKLWELQHEIEVYRKTVICQWRA 

8386 

A 

27 

374 

SSPPAWQPRPPCGGLHHCPACVHPHRDSGGSVPAPAACCHTPHGRPW 
PEFEGQRPQTQPQPQGPAWVRPVLTLSAGRVASVPLVSLCAWASGSQ 

8387 

A 

685 

769 

EQHFLPLRILKQEHPELAVLADSVDLQE 
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8389 

A 

308 

411 

HWLKNKQ I DQW I R I KNPETNPHTYS EL I FDKGAK 

8390 

A 

3 

503 

LVILDYIIRNTDRGNDNWLVRYEKQKCEKEIDHKESKWIDDEEFLIK 
IAAIDNGLAFPFKHPDEWRAYPFHWAWLPQAKVPFSEEIRNLILPYI 
SD^FVQDLCEDLYELFKTDKGFDKATFESQMSVMRGQILNLTQALR 
DGKSPFQLVQ I PCVI VERS QGGSQG 

8391 

A 

4 

318 

NKKSQESEGQLSQGLKPASFQPGVWIHPPGVSRVPTVT^LGTCQPPS 
SQEQI PSHRSFS PDTEMGVS PAGSGSLETPANTHPAGPAGHLSCPGV 


.1, 0 O B 4-6 4-:5 . O i5 E! i& O S 






S OVKP AVAGGK 

8392 

C 

340 

474 

MSRQWIPXSRQKFEADCPGESGSSGKKTLTRSWCRSCCC^IiXNXS 

8393 

B 

317 

702 

LAEWRPREPMQRKLKVTSLBPGTVVITEQAVDTCFKAEFEQIVLGKR 
VIRKTDLNKKLVQELLLCSAELSEFTTWGNTMCTLDFYEGQGRLDG 
ALCS YTEKDKQAYLEAAYAAGVRNI EMES S VFAAI * 

8394 

A 

209 

582 

EQNWMLRVWMCGWICFFLPYALLCAPWLPCFNFILFYTPAISIiDVWL 
FFPPGLGSPGTWITEQAVDTCFKGRVLSRI\nW3KRVIRKTDIiNKKL 
VQELLLCSAELSEFTTWGNTMCTLDFYER 

83 95 

A 

r» o t 

227 

13 02 

GDoRoRVrRRyDrAWNAWfaAonLKRbSASS 

ANAEKAESHNDCPVRLLNPNIAKMKEDILYHFNLTTSRHNFPTLFWE 
FKFGLFGGNP \ S RMKPF I \ RCLGPE I GLDCPVRD YPN\ I CAGTDRYA 
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GIGLEPGTWITEQAV\DTCFKAEFKQl\VLGKRVIR\KT\DLNKKIi 
VQELLL\VSAELSEFT/TTWGNTMCTL\DFYE\GQGR\LDGALCSY 
TEKDKOAYliEAAY\ aarvpolfemes s vfa\ amcs acgloaawcvt 
LLNRLEGDQISSP\R1WLSEYQQ\RPQRLVSYFIKKKLSKA 

8396 

A 

3 

339 

QWYSF1TOQHVSRITQEDIKKTHGGSSGSRGYYSSAFASSTNAYMLIY 
RLKDPARNAKFLEVDEYPEHIKl^VQKERELEEQEKRQREIERNTCK 
I KLFCLHPTKQVMMENK 

8397 

A 

2 

376 

NHEKDDNSLKI ISNASCTTl^ 

ATQKTGDGPSG\NWGVMAAGLSKTSSLPLLG/LGIOVGGKVIPELNGK 
LTGMAFRVPTGNVAGVDLTCRLKKPGKYDDI K 

8398 

A 

3 

1141 

I RHE VRQS AAS S FAS PAEPHRSDTMGKVKVGVNGFGRIGRLVTRAAF 
NSGKVDIVAI1^PFIDL1TxMVYMFQxT)STH ENGKLV 
l\NGNPITIFQEPKIPSKIKVGGDAGRLKYV\Vx?raLAVFTT\MEKG 
LGAS FCRGGAKRVNQSLPPSG * CPPCFGHGV* TIEKYDNSLK\ IISN 
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ALPGNCG\VMGPRGSSRTSSLPLIxAAAKAVGKVNPLSx^G\EAHLGM 
G FRVPNANV\ S WDLTCRLEKPAK\ YDD I KKWKQASEGPLKG I LGY 

SNR\ VVDLMAHMASKE 

8399 

A 

1 

374 

QFEFSIDSFQIIIiDSLLLFGQGSSTPMSEAFHPTVTGESLYGDFTEA 
LEHLRHRVIATRSPEEIRGGGLLKYCHIiLVRGFRPRPSTDVRALQRY 

MPOp irp T TjT^PTjT ,VPn]?l?TT ,rp VT .J?7A'WirrcG7A 
rlV»Oi\r x lUc r xJxj V E*\^Jvrv.X XJdI\. X XJCfcrull* \JVJ^\ 

8400 

A 

127 

420 

VLLSIFFQPJDVDNLPCLQHLFLSFlWISSroRVSCLMSSSLSDITF 

DGNPIAQESWYKHTVLQNTlMQLRQLDMKRITEEERRMASVx^ 

KKRE 

8401 

A 

3 j 

509 

KNR I KKI SNLENLKSLDVLDLHGNQ ITKIEN INHLCELR VLNLARNF 
LSHVDl^NGLDSLTELl^^ 

FDS VS CLAD S S S L SD I TFDGNP I AQES W YKHTVLQNMMQLRH * DMEG 
E FTGRRKGVWHL.F * P KKGG KP KF 

8402 

A 

2 

385 

LKI IACLSLtlTNNMVGAI TGGLPELASPJLTR I SAVLLEGMNKETFNIi 
KEVIiNS I GI QTCVEVNKTLMERGLPTLNAE I QANL I GQ F S S I E EEDN 
P I WSL IDKR I KLYMRRLLCLPS PQKCMPPMPGG 

8403 

A 

1 

431 

ERVGNVCSLE I SNIQKGEGGEYMCHAVNI IGEAKSFANVDIMPQEER 
WALPPPEFCRSKYGFKGGRRQRTKN* FR\ IFEMPPRFIMPICDFKI 
PENSDAVFKCS VI G I PTPEVKW YKE YMC I E PDN I KYVI S EEKGSHTL 
IIR 

8404 

A 

1 

12064 

MVVEWLHDGKPLRAANP LPM TNTEFG YC <S T .DYRVAYSPDR GT T TCP AT 
NTCyGTDHTSATLIVKDEKSLVEESQL^ 

LTGVTTDQKEKQKPDIVLYPEPVRVLEGETARFRCRVTGYPQPKVNW 

YLNGQL I RKS KR FRVR YDG I H YLD I VBCKS YDTGEVKVTAENPEGVI 

EHKVKLEIQQREDFRSVLRRAPEPRPEFIT\^EPGKLQFEVQKVDRPV 

DTTETKEWKLK11AERITHEKVPEESEELRSKFKRRTEEGYYEAITA 

Vx5LKSRKKDESYEELLRKTKDELLHWTKELTEEEK3CAlx^ 

TFKPDKIELSPSMEAPKIFERIQSQTVGQGSDAHFRVRWGKPDPEC 
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EWKNGVKIERSDRIYWYWPEDNVCELVIRDVTAEDSASIMVKAINI 

AGETS SHAFLLVQAKQL I TFTQELQDVVAKBKDTMATFECETSEP FV 

KVKWYKDGMEVHEGDKYRMHSDRKVHFLSILTIDTSDAEDYSCVTjVE 

DENVKTTAKLIVEGAWEFVKELQ0IEVPESYSGELECIVSPENIEG 

KWYHNDVELKSNGKYT I TSRRGRQNLTVKDVTKEDQGE YS FVIDGKK 

TTCKLKMKPRPIAILQGLSDQKVCEGDIVQLEVKVSLESVEGVWMKD 

GQEVQPSDRVH I VI DKQSHMLLI EDMTKEDAGNYS FTI PALGLSTSG 

RVSWSWVITPLKDVNVIEGTKAVLECKVSVPDVTSVKWYLNDEQI 

KPDDRVQAIVKGTKQRLVIl^RTHASDEGPYKLIVGRVETNCNLSVEK 

I KI I RGLRDLTCTETQNWFE VELSHS G I DVLWN FKDKE IKPSSKYK 

IEAHGKIYKLTVIjNMMKDDEGKYTFYAGENMTSGKLTVAGGAISKPL 

TDQTVAESQEAVFECEVANPDSKGEWLRDGKHLPLTNNIRTVKIKKT 

LKNIiTVTETQDAVFTVELTHPNVKGVQW IKNGWLESNEKYAI SVKG 

TI YSLRI KNCAI VDESVYGFRLGRLGAS ARLHVETVKI I KKPKDVTA 

LENATVAFEVSVSHDTVPVKWFHKSVEIKPSDKHRLVSERKVHKLML 

QN I SPSDAGE YTAWGQLECKAKL.FVETLH ITKTMKNI EVPETKTAS 

FECEVSHFNVPSMWLKNGVEIEMSEKFKIWQGKLHQLIIMNTSTED 

SAEYTFVCGNDQVSATLTVTPIMITSMLKDINAEEKDTITFEVTVNY 

EGI SYKVTLKNGVE I KSTDKCQMRTKKLTHSLNIRNVHFGDAADYTFV 

AGKATSTATLYVEARHIEFRKHIKDIKVLEKKRAMFECEVSEPDITV 

QWMKDDQELQITDRIKIQKEKYVHRLLIPSTRMSDAGKYTWAGGNV 

STAKLFVEGRDVRIRS I KKEVQVIEKQRAWEFEVNEDDVDAHWYKD 

GIEINFQVQERHKYWERRIHRMFISETRQSDAGEYTFVAGRNRSSV 

TliYVNAPEPPQVLQELQPVTVQSGKPARFCAVI SGRPQPKI SWYKEE 

QLLSTGFKCKFLHDGQEYTLLLiIEAFPEDAAVYTCEPKNDYVPEVVS 

PDQEMPVYPPAIITPLQDTVTSEGQPARFQCRVSGTDLKVSWYSKDK 

KIKPSRFFRMTQFEDTYQLEIAEAYPEDEGTYTFVASNAVGQVSSTA 

NXiSLEVS FKKEPLGQKPS FI QPLS S LRVHNGETVRFHARVS GI PKPE 

IQWFHNQQLILPTKDVVFHFEESTGMALMLIVDAYSEHAGQYSCKAA 

NS AGEATCAATLTVTPKVQALDRQS SGKDVRES AKSQAVADSS FTKE 

ESKISQKEIKSFQGSSYEYEVQVFESVSQSSIHTAASVQDTQLCHTA 

SLSQIAESTELSKECAKESTDDSFIDVTWTHEGAKIEESERLKQSQN 

GNIQFLTICNVQLVDQGLYSCIVHNDCGERTTSAVLSVEAPESILHE 

R I EQE I EMEMKEL F S EG E S EHS ERDTRDA FSDS ED IDHKS MAAKR YA 

SRISSTSSWPEYFKPSFTQKLTFKYVLEGEPWFTCRLIACPTPEMT 

WFHNNRPIPTGLRRIIKAESDLHHHSSSIiEIKRVQDRDSGSYRIiLAI 

NSEGSAESTASLLVIQKGQDEKYLEFLKRAERTHENVEALVERGEDR 

I KVDLRFTGS PFNKKQDVEQKGMMRT I HFKTM S S AKKTD YM YDEE YL 

ESKSDIRGWLNVGESFLDKETKVKLQRLREARKTLMEKKKLSLLDTS 

SEISSRTLRSEASDKDILFSREDMKIRSMSDLAESYKVDHSAESIVQ 

NPHALSNQMDQNIESEELPTSFQTIVDEEIFQTEIRMSQEALVKESL 

PKDHLYGEILVNENTQARGQLEEIMANTTIGESSTYITNVCEKEEVY 

ETPENVSQAITPHASESFGTLVNVEESEEIASERIKKDDLRELQLSA 

STRIDEFKTEQKEENMRFFENSFRKRPQRCPPSFIiQEIESQEVYEGD 

SCNFVCHFQGYPQPIVTWYNNDMPIPRNQNFIIHSLENYSILTLSSV 

HHQNEGS ITCVLFNQYGTVKTTSMLKVKAXQKHDVKAHKVPVFHDYL 

DEEEELALVFDQAKGAHPSMSQEGQTNLHLLKTNPPVPPSGDTELLS 

FPVEIQVTAATPIPEQDKESKEVFQTEELEPKAMPQDQVTQSPKHRF 

WLSDITNEPPKMLQEMPKHARCREGDSIILECLISGEPQPVVTWFQ 

NGVLLKQNQKFQFEEVNCSHKLYIKDVNSQDSGKYKCVAENNSGAVE 

SVSDLTVEPVTYRENSQFENIGEIYGKYSRDQQLQDQGESVRAHFYD 

YPAGPFTPWTNVKEYSVRDYFQSLETIEQIDQKEQVRCIPSREKIPR 

FVHGASRTIKISKPIRAEFIQCQAEGKERHVSEKSKLHQAEGTVYPF 

VDDFSDVTIKKEIRNNFGKLGRSEKENVQECAQSDYLPNIHSERISD 

SYOTKDSSAIVYEESLGEEIHYPGKKVKHRIIEFEKLHVEKGVLEKR 

PTRTSIVNPPQKKIDDKAFSLKQRESRSSNLNANMYQ7^EKMSPNTES 

DSSNIAINLKLLSSQTHKEFDAQEREQQEKI SLIDKPAI SKRAEHES 
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PITFDLKQFHTQIKHTDVKFQELDSGQPEEAYFKIQHPADTENIVFD 
LKQMYSHIGDPALEFQGQETREQQE IHYKEKI PS PETLQPDTHNI S K 
S VQNNVFASQE I S SSQEL SNRTMVEKS S I DENS I S LEKEVRHVQEQN 
LDILKTDLSLKSFSEEIYSESCALLPTSSADIEETDLSEKSCPLENG 
GRSSISHLKKAASEEKPLGVGEMEEECTLEPELAAFPKQDGGTQEYT 
DATLEDHRGDVQEADTLHRQLSLSQCFPLLMTEEQQNPGEQISTNIH 
ASGEEKCYEEVQVQNEASFSTLEGEMIETSFSQNIPKLDEAHTTEAA 
ESETSLTQYLLAAGKREVPETKDTRDQAKLVQSESITSMEVEEVTFN 
TVYEYYNQKQESIiGRPLSPESDISIGVGSTTSEEISELDQFYTPPSS 
VEYFESPKSPDLYFNPSDITKQSSIHSGGETVERYSTPLGEVAERYS 
TPSEGEVGERYSTPPGETLERYSTPPGETLERYSTPPGETLERYSTP 
PGETLERYSTPPGETLERYSTPPGEALERYSIPTGGPNPTGTFKTYP 
SKIEREDGTPNEHFYTPTEERGSAYEIWRSDSFGTPNEAIEPKDNEM 
¥¥z>t X£5fjji2sj<JvV iliN 1 1 1a? r I VEVEGLPVPGVKVTx RNKSLLEPDER 
IKMERVGNVCSLEISNIQKGEGGEYMCHAVNIIGEAKSFANVDIMPQ 
EERWALPPPVTHQHVMEFDLEHTTSSRTPSPQEIVLEVELSEKDVK 

KIDLEVFEMPPRFIMPICDFKIPENSDAVFKCSVIGIPTPEVKWYIV 
YMC^PDNIGDVISEEKGSHTLTTRNVCLSDSATYRCRAVNCVGEAI 
CRGFLTMGDSE I FAVI AKKSKVTLS SLMEELVLKSNYTDS FFEFQW 
EGPPRFIKGISDCYAPIGTAAYFQCIi 

8405 

A 

2 

384 

DDTRYHTMQAAVDAVARVTGQHPEVPPAFWNNAFTLLSAVSLPRREP 
TVSSFYVKRAELWDTWKVAHLKEHRRVFQAMWLSFLKHKLPLSLYKK 
VLL I VHDA I LPQIiAQPTLMIDFLTRACDLGGAli 

8406 

A 

1 

1677 

KR P KLKKAS KRMTCHKR YK I QKKVREHHRKLRKEAKKQGHKKPRKD P 

GVPNSAPFKEALLREAELRKQRLEELKQQQKLDRQKELEKKRKLETN 

PDIKPSNVEPMEKEFGLCKTENKAKSGKQNSKKLYCQELKKVIEASD 

VVLEVLDARDPLGCRCPQVEEAIVQSGQKKLVLILNKSDLVPKENLE 

S> VVLtN YIjKJ^LPTVVFRASTKPKDKGKITKRVKAKKNAAPFRSEV 

KEGLWKLLGGFQETCSKAIRVGVIGFPNVGKSSIINSLKQEQMCNVG 

VSMGLTRSMQWPLDKQITIIDSPSFIVSPLNSSSALALRSPASIEV 

VKPMEAASAILSQADARQVVLKYTVPGYRNSLEFFTMLAQRRGMHQK 

GGIPNVEGAAKLLWSEWTGASLAYYCHPPTSWTPPPYFNESIVVDMK 

SGFNLEELEKNNAQSIRAIKGPHLANSILFQSSGLTNGIIEEKDIHE 

ELPKRKERKQEEREDDKDSDQETVDEEVDENSSGMFAAEETGEALLR 

RLQQWSLQGLLSWIKSLKRMMLMTSVQIMCNRTMAFYDFFF 

8407 

A 

2 

2580 

GQKKLVLILNKSDLVPKENIiESWLNYLKKELPTVVFRASTKPKDKGK 
ITKRVKAKKNAAPFRSEVCFGKEGLWKLLGGFQETCSKAIRVGVIGF 
PNVGKSS I INSLKQ\ EQMCNVGVIHGGLQGSMQWPLD\ KQITI ID\ 
SPSFIVSPLNSSSAIiALRSPASI\EVVKPM\EAASAILFQAD\VRQV 
VLKYTVPGLQG I L WE FFT\ MLAQRKEVWHQKGG\ I PKCLEGAAKLLW 
SEWTGASLAYYCHPPTSWTPPPYFNESIVVDMKSGFNLEELEKNNAQ 
S IRGENCVSLLS S S ADRPVELLPVYMGLLS FPAI KGPKLANS I LFQS 
SGLTNGI IEEKDIHEELPKRKERKQEERJEDDKDSDHETVDEEVDKKL 
DSMGSKRRRATSPSSSVSGDFDIXSHHSVSTPGPSRKRRRI/SNIiPTVI) 
P I AVCHELYNTIRD YKDEQGRLLCEIiF IRAPKRRTDS EI YEDAVELQ 
QFFIKI RDELCKNGE I LLS PALS YTTKHLHNDVEKERKEKLPKE I EE 
jjj\0j1nj<jc»£,c<iu<V£.IjX auKJjHYCTNPDSKPKYPAQRTPQIHLQGEVIjFY 
ESKFRTLEEVTVVl^THIISINQPDYYEWSQPIDLMKIQQKIiKMEE 
YDDVNLLTADFQLLFNNAKSYYKPDSPEYKAACKLWDLYLRTRNEFV 
QKGEADDEDDDEDGQDNQGTVTEGLLKI S I TQKIRPWI LFHYGES W 
NLLRADQRLIFAKSWPRASRYQQGHQDLFILRSDLPSQVFIRDKLME 
RRNRRTGRTEKAR I WEVTDRTVRTWIGEAVAAAAADGVT FSVPVTPH 
TFRHS YAMHMLYAG I PLKVLQSLMGHKS IS STEVYTKVFALDVAARH 
RVQFAMPESDAVAMLKQLS 

8408 

A 

2 

259 

VFLSEQDEVLLIQEAKRECRGSWYLPAGRMEPGETIVEAJLYREVKEE 
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8409 

A 

1 

257 

RTLLVPGLGGQRKPPPFRPGGANGDKAPPPAPTPPGIPRCAAQESLG 
AAASGQVPWCSRGPAPGPLRSVCRAEGRGRPPYHAPFF 

QA1 ft 


•9 

.3 © / 

T MTTHT? WMTJVT; PF FFFT . K"0 YTF DTiPF) FNTTYTtTP T T\ T M\7 T TM<'2T?7iDr'kl?TXm 

ALINQKYDEVMATYILLGRKPPEFEGGESLSSGNLCQRSRPSSDLNN 
STLOSPAHLKVORS I S ANOKORRFSDH 

8411 

A 

2 

244 

E FSDALGYLQLLNS CSDAAGAPAYS FSISS SMATTTGVDPVAKWWAS 
LTAWIHWLRRDEEAAERLCPLVEHLPRVLQES 

04 XZ 

TV 


1 ^ Q 

PP\7PPPVPPP\TRnPVnPQnA2Vf7aP71VQFQTQQCM71T'r , rf2T?Qtyi\ , D\7T 71 
rl\ v i\ri\ vi\rA V ivoj\VV7\ v OJ^rv\V'/iiir/il oro XOuOruil J. X U£»oriUrViin 

PPSAPPCSVTEGSGDPPSSAGVDPVAJCWWASLTAWIHWLRPJDEEAA 
ERLCPLVEHLPRVLQESE 

8413 

A 

2 

182 

RA^ATMGETVMSVKIIRNRLTGIPAGYCFVEFADLATAEKCLHKING 
KPLPGATPVRTFR 

8414 

A 

2 

382 

CRTLCEGPQRFEEYEYLGYKAGLYEAIAJ^HYMQVLVCQHECVRELAT 
RPGRLS P I ENFL PLHYD YLQFAYYRVGE YVKALECAKA YLLCHPDDE 
DVLDNVDYYESLLDDS IDPAS IEAREDLTMFV 

8415 

A 

59 

572 

SGRPFFFFFFSATGACLFPPGPGRLGAEYRQRHWGTWVWHGLEIiGRL 
GSNREGCARACRDWSHPPRTERGPSGHGGITSARLGTGTGERLRSGC 
VQGLVGMGRPVDRAC * SVLEPGGTPGRANWAJLDVEKLGNKY * ENKSL 
WVSRP/RQRCDCRQDRPWERPRLQVTPIARQ 

8416 

A 

3 

1735 

LCTRKIjGTRKIjGPLEGYSRAAAYYSGDYERAVRDLFJ^ 
IRTRCARHCAARHPLPPPPPGEGPGAELPLFRSLLGRARCYRSCETQ 
RLGGPASRHRVS EDVRSD FQRRVP YNYLQRAY I KLNQLEKAVEAAHT 
FFVAJ^PEHMEMQQNI F^TYRATAGVEALQLVDREAKPHM 
YEADDFEMAIRHFEQALREYFVEDTECRTLCEGPQRFEEYEYLGYKA 
GLYEAIADHYMQVTjVCQHECVTIEx^TRPGRLSPIEN 
AYYRGSGNFKEPIJKI^CFYVDALLPENRNQIL 
VGGQIAIYLCKAQELLVGEYVKALECAKAYl^^ 

ESDLDDSIDPASIFJVREDLTMFVIO^KIjESEIjIKSAAEGLGFSYTEP 
Pr ivDlvV 1 bARAr JjbFMKW VAbb 1 VVkbiiHTQF RAxjEVl\J\QvNfc»RNWy 
ASELPRGNOTEGQKENGLFGSSAHAQHKTL^ 

FT , OQTTJQM2iTJT.FOTnT'DWPTTKrn'K'r*FCQVT f2DT?VQTQVTr>nj'l'nT7TrrPT 

QIFETNYLRPRGD 

8417 

A 

2 

442 

LTSSVDLYFGAJfflPLHVTMLPNPSHLFAVNPVAVGKTRGRQQSRQDG 
DYS PDNS AQPGDRVI CLQVHGDAS FCGQG I VPETFTLSNLPHFKI GG 
SVmiVxWQx^YTTPA^RGRSSLYCSDIGKLVGCAIIHVNGDSPEPA^ 
VRATR 

8418 

A 

412 

569 

LSLSAAAAAGRYGGTFQNVSLQLPITLNKFFQPTEMASQDFFQRWKQ 
LSNPQ 

8419 

A 

2 

2812 

RGl^VFISDIRNCKSKEAEIKRINKELANIRSKFKGDKAL 
YVCKLLF I FLLGHD I DFGHMEAVxSTLLS SNRYTEKQ I GYLFI SVLVNS 
NSEL I RL INNAI KNDLAS RNPTFMGLAIuHCIASVGSREMAEAFAGEI 
PKVLVAGDTMDS VKQS AALCLLRL YRTS PDLVPMGDWTS RWHLLND 
QHLGVVTAATSLITTl^QKNPEEFICrSVSLAVSRLSRIVTSASTDLQ 
DYTYYFVPAJPWLSVKLLRLLQCYPPPDPAWGRLTECLETILNKAQE 
PPKSKKVQHSNAKNAVLFFAISLIIHHDSEPl^LVRACNQx^^ 
RETNLRYLAljESMCTxaA^SEFSHEAVKTHIETVINALK^ 
RAVDLLYAJ^CDRSNAPQIVAEML^ 

JrtVUI in I V U X X XAV4 JJXA.X riVJJJ X V OXjCj VWIivVXyXV XfMJtXJXV V Uw X/"W"IXV 

TVFEALQAJ>ACHF^VKVGGYILGEFGNLIAGDPRSSPLIQFHLLHS 
KFHLCSVPTRALLLSTOIKFVl^FPEVKPTIQDVLRSDSQLRNADVE 
LQQRAV^YLPXSTVASTOII^Tv^ 

S TVTDLEDTKRDRS VD VNGGPE PAPAS TS AVS TPS PS AJDLLGLGAAP 
PAPAGPPPSSGGSGLLVDVFSDSASWAJ>LAPGSEDNFARFVCKNNG 
VLFENQLLQIGLKSEFRQNLGRMFIFYGNKTSTQFLNFTPTLICSDD 
LQPNLNLQTIO>VDPTVEGGAQVQQVVNIECv/SDFTFJ\PVx^ 
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GGT\FQN\VSLQIiPITL\NKFFQP\TEMASQDFF\QRWEGSL\SNPQ 
QG S CRN I F S KAKHPMDTEVTKAKI I G FG S ALLEEVD PNPANFVG AG I 
IHTKTTQ IGCIiVRLEPNIiQAQMYRX LTLRTS KEAVS \ Q\ RLCELLS A 

8420 

A 

2 

398 

SKKRKRQEEKHVCKKHNQVLTFFCQKDLELLCPRCSLSTDHHHHCVW 
P I KKAAS YHRKKLEE YNAPRKERVELTEKV I TMQTRKSLELKKKVKH 

0 2V'P'P\7'K"PI?PT7OT.PT.'I?T rUsTPOT^T^/T ."DT4T .OTTPOMFiTT .OOT. 
KMnn v i^tr HjT E/\JLtr\.±jr L>\Jri Ej\JCj X v JjlUUiyU C yiw JLJU lr\£lJ 

8421 

A 

3 

440 

IASRNVPDLTLIDLPGIPTGAAGNQPADIGDNIKTLIKKLIQRHETI 
RLAWPRIVD I AATE ALTMAQ^VDPEGDRT I E I LTKPDLVDKGTDNK 

A^TTYVTVPTOT.VT.'HT.TCTr ZiWMT\7Trr , T?f3rvOr3'nnnDMQT DT? r T , T.DT?'nVT T?VP*TvTW 
i\v lj v v rci^J ±j v ijxiijivr\_rt-rii r i J. v A.v_r\.vjyW Vj - L ^^ J -' V* v ioXj irxi i. Xj irrCL/zvx r r xvi.vri 

PYIK 

8422 

A 

3 

356 

NDEQAPAQSLLTPPRPAAIMPLRSVPSASPSPPSASSTGLHAAPQAF 
PVPTfJPT? PnPRT.FT.TWP<?f5KT.02VPWVr , T.nTT.VTWTPc;PQPT.r?nr!P9P 
RSGKCriRPSOTjWSMHT.PFPSHRS 

8423 

A 

3 

341 

LKLYPNETIVGNGSPSQTNNGAMLIWIKRKIHFVGTKSYPASVSKVA 
GKKGTRLHAQRFLVHLERTGGKCHYTMIKGS IHLEYTAI I II YAPNI 
RTPKYT KOTLTKLKGE I K 

8424 

A 

1 

338 

LLRVYSFMALKMYHQLPTQVS S STALNLQHQLPTGS PDARPSKWNIN 
S P PRFPAPRPSTCNI S S PPGLQMHS PQTGTSTPRRVS RCTACKLEHQ 
LPSGIjOTjHS potataaas 

8425 

A 

1 

70 

GNYGGGPGYSSRGGYGGGGVDAA 

8426 

A 

62 

375 

GRGGFFLGPRGPKKKTPGGEGFTRFFDPGWGKGPEKKTPPPWGGKGG 
PKKMNLWGAGKIKFLEIKGKSFPRGTTPNFFGPPLFFSPPKTQGPGN 

T?T T"IT?CTTVT VN7 

8427 

A 

2 

375 

PRCQQPVFFAEKVSSLGKlvJWHRFCLKCERCHSILSPGGHAEHNGRPY 
CHKPCYGALFGPRGVNIGGVGS YLYNPPTPS PGCTTPLS PSSFSPPR 

8428 

A 

2 

460 

PGRPTRPTSPSNTPASISPKLNVEVKKETVDKEKAVTDEKPKQKDKPG 
EEEEKCD I S S KYHYLTENDLEE S PKGGLD I L KSLENTVTS AINKAQN 

KNQTLVSPPSS 

8429 

A 

1 

342 

RLTVLLPLLLFKVCFRSTTPCTSHPSQLCLPVSPSPPQACLSLTARI 
LPTPTPRQITPPKSLTGTNLGTLPIIRNTLDEAPTRRILAVAAEFSG 
MCLILLGWWLIIFFKSRISG 

8430 

B 

212 

587 

MGKGS FKYAWVLDKLKADRERG I T I D I S LGKFETS KYYVTI IDAPGH 
RD F I KNM I TGTS Q AD CAVL I VAAG VGE FEAG I S KNGQTR EHALLA YT 
LGVKQLIGGVTKMDSTEPPFRRKRYEEIVKGX* 

8431 

A j 

1 

1386 

MDIKKGITDISASLRVESGWEARTRKEKTHINTVIIGHVDSGKSTTT 
GHLIYKCGGVDKKTIEKFEKEAAEMGKCSFKYAWVl^KIjKAEREHGI 
T I D I SLiWKFETS KY YVT 1 1 DAPGHRDL I KNMITGTSQADCAVIjIVAA 
G FGE FEAG I S KNGQTREHALLA YTLGVKQL I VGVNKMD S TEP P Y SHK 
RYEE I VKE VS TY I KKI GHNTDTVAFVP VS G WNGDNTLEP S ANMPWFK 
GWKVTRIQ5GNASGTTLLEA1»DCILPPTRPTDKPLCLPLQDVYKIGGI 
VNVATE VKS VEMHHEAL S E VL PGDNVG FNVXNVS VKD VRRGNVAGD S 
KKDPPMEAAGFTAQVI I LNHPGQI S AGYAPVLDCHTAH I ACKFAELK 
EKTDRHSGKKLEDGPKFLKSGDAAIVDMVPGKPVYVESFSDYPPLGR 
FAVHD I RQTVAVGV I KAVDKKAAGAGKVTKS AQKAQKAK 

8432 

A 

66 

1574 

KLTPLKPKWESVKTHINL\AVIGH\VDSGKS\TTTGHLIYKCGGIDT 
KNT-TTiKTJTiTKKAAFMG\ Kf;c;F\ TCYAWVT.nKT.KAT?PFPf5TT , TIjT^T»WKF 
ETSKYYVTIIDAPG\HRDFIKN\MITGTSQA*PGLGGPIiFGG/CSPIi 
IVAAG\VGEFEAGISIOJGQ\TREHALlxAYT\LGVKQL\IVGC*QKWD 
ST\EPTPTAQKKILKEIVK/EKVSTFH*RKFGYNP\DTVAF\VPNFL 
VWN\GDNMLGAQVPNMPWFQGIX3KVTRra 

P\PTRPTDQAPLR\LPFQ\DVLPKIGGYLGTWFLVWPEWETGVL/RN 
PGMGVTF\APVNWT\EVKSVE^tHHEALSEA\L^ 

KDVRRGlWAGDSKl^PPMEAAGFTAQVIILNHPGQISAGYAPVxxDCH 
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lAn J. MA- JVT ADJjJSJbiVX JJ±utooJ\J\X«iUJuJrJ\J? iiJU>bl/iUil V JJJ*j V Jr *jXvfI*lL. 

VESFSDYPPLGRFAVRDMRQTVAVGVIKAVDKKAAGAGKVTKSAQKA 
QKAK 

8433 

A 

2 

265 

AIQPRLVAVSKTKPADMVIEAYGHGQRTFGENYVQELLEKASNPKIL 
SLCPEIKWHFIGHLQKQNVNKLMGKSEFLIKRENIMPRTE 

8434 

A 

3 

318 

GTRDLP AI Q PRLVAVS KTKPADMVI EAYGHGQRTFG ENYVQELLEKA 
SNPKI LS LCPE I KWHFI GHLQKQNVNKLMGKI KLNMKTKLFCHC IAS 

TTT DtTCMCf 

1 1 -LiUvjKijoJNJr 

8435 

A 

2 

. 896 

VG PRGMWRAGSMSAELGVGCALRAVNERVQQAVARRP/ RGDLPAIQP 

DT .\T\ tt\rOTfT\ I/"DA"P>M\ \7TT7 2iVr2TJr20'D r P'C*r2T?M'V\7r>'C , T \ T T7 V A O \ XT DVT 

T^T.PPRTKWHFTVnWT.nTCniJVTJKT.MZVVPlJT.FMTA PTV\ n<2V"KT.\ IYTiK" 
juo — tr i_» x x\ f» xi jt x \ vjxi u\( xv^im v m jvuiinv xtxmxjJt i \ c< j. v \ JU' > v i\_u \ r\ i j i\ 

VNSSW\QRK\GSPERLKVMGPGFNTSREEIYLFSVSLLEGKHG\LPP 
SETIAIVEHINAKC\PNL\EFVGL\MTLG\SFGHDLSQGPNPDFQLL 
LSL PEETWKS * R I PADQVELSMGL S ADMQHAVEVRSTDVW I RS TVF 
GEPDYSKKPTPDKCAADVKAPLEVAQEH 

8436 

A 

202 

349 

HRPROOSNTCF1TOKCHHSRPAPLRLPAPEPRPPGPTOPDDPPGIOPD 
LP 

8437 

A 

1 

129 

DLSOTSHGEATQKHSEKCKASCTHMFITALFLIAKHWNQPTCL 

8438 

A 

3 

389 

PQTVAATTVPTNTSWAPTTTSLGPAKDKPGLRKAAQGGGSTFTSQGG 
TPDATAASGAPVSPQAAPEPSKRPHHGDPQDGPSHRDSWLTVTPGTS 
RPLS TS SGVFTAATGPTPAAFDTS VS PPSRGFLR 

843 9 

A 

505 

693 

QAQNLS PVTPSTS FTV I RNVQFK I LDAWAQEPLHRGAGQ 1 1 PTARR 
WYSAFLMGARDLGCN 

8440 

A 

46 

3454 

AGESI^aDTDLYDEFGl^^YIGPELDSDEDDDELGRETKDIJDEMDDDDDD 

DDVGDHDDDHPGME^AnLiHEDIOCYYPTAEEVYGPEVETIVQEEDTQPL 

TEPI I KPVKTKKFTLMEQTLPVTVYEMDFLADLMDNSEIjIR^ 

HIiHHGKTCFVDCLIEQTHPEIRKRYDQDLCYTDILFTEQERGVGIKS 

TPVTVVLPDTKGKSYLFNIMDTPGHVNFSDEVTAGLRISDGVVLFID 

AAEGVMI^TERLIKHAVQERI^VTVCINK^ 

LRHIVDEVNGLISMYSTDEl^ILSPLl^ITVCFSSSQYSICFTLGSFA 
KI YADTFGD INYQE FAKRLWGD I YFNPKTRKFTKKAPTS S SQRS FVE 
F I LE PL YKI LiAQ WGD VDTS LPRTLDELG I HLTKEELKLN I RPLLRL 
VCKXFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAM 
SDCDPDGPLMCHTTKMYSTDMVQFI^FGRVLSGTIHAGQPVKVLGE 
NYTLEDEEDSQI CTVGRL W I S VARYH I EVNRVPAGNWVL I EGVDQP I 
VKTATI TEPRGNEEAQI FRPLKFNTTS VI KI AVEPVNPS ELPKMLDG 
LRKVNKS YPSLTTKVEESGEHVI LGTGELYLDCVMHDLRKMYSE IDI 
KVADP\T\TTFCETVVT2TSSLKCFAETPNKK^ 

ENEWQ I TWNRKKLGE FFQTKYD WDLLAARS I WAFG PD ATGPNI LVD 
DTLPSEVDKALLGSVKDSIVQGFQWGTREGPLCDELIRNVKFKILDA 
V VAyh P JjHKCKjGQI X PTARR VvYS AFLMATPRLMEP YYFVEVQAP AD 
CVSAVYTVLARRRG\HVTQGCTHPKASPL*TIKAF\IPAIDSFGFET 
DLRTHTQGQAFSLSVFHHWQIVPGDPLDKSIVIRPLEPQPAPHIxARE 
FM I KTRRRKGLS EDVS I S KFFDDPMLLELAKQDWLNYPMSQQLLQA 
SDHTTWGNHLSDSSRNTFTC I VYLRNLTAACI04DDDDDDDDVGDHDD 

DVQAQSRSSIDERRGPDHPYGCDHPKMNLKDSLDVAKPADAKNGNSI 
RLALTVSEA 

8441 

A 

2 

640 

AVVVICSDASDMELMLVGGFFRCAGKVEVNVQGAVGILCANGWGMNI 
AEVVCRQLECASAVTVSREPHFTERTLHILMSNSGCTGGEASLWDCI 
RWEWKQTAOILimEASLICSAHRQPRLVGADMPCSGRVEVKHAYTWR 
SVCDSDFSLHAAITVLCRELNCGDAISLSVGDHFGKGNGLTWT^EKFQC 
EGSETHLALCPIVQHPEDTCIHSR 

8442 

A 

60 

345 

VECPLFCSWEETSESGTGSFLRGPVATEREGGECSFPPALSSCLPGA 
SSRPPFPSLPPPPALSFHSPPPTPSDQVLPIAVPPPTPYLARVDLDL 

8443 

B 

26 

384 

MHHEALSEALPGD^GFNVKIWSVK^ 
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TAQVI ILNHPGQI SAGYALYWIAIVDMVPGKPMCVESFSDYPPliGRF 
AVRDMRQTVAVGVI KAVDKKAAGLAS * 

8444 

A 

33 

99 

KLPLKAKMGKEKTHINIWIGHVDSGKSTTTGHLIYKCGGIDKRTIB 
KFEKEAAEMGKGSFKYAWVLDKLKAERERGITIDISLWKFETSKYYV 
TI IDAPGHRDF I KNM ITGTSQADCAVLIVAAGVGEFEAGI SKNGQT\ 
REHALLAYTLGVKQLIVGVNKMDSTEPPYSQKRYEEIVKEVSTYIKK 
IGYNPDTVAFVPISGWNGDN\MLEL\CSNMPWFKGMK\VTRKDGNA\ 
SGTTLLEALDCILPPTRPTDKPLR\LPLQDVYKIGGIGTVPVGRVET 
GVLKPGMWTFAP VN\ VTTEVKS VEMHHEALS \ EALPGDNVGFNVKK 
CWSRMFRRGNVAG\DSKNDPPMEAAGFT\AQVIILNHP\SQKNARH 
MPLELDCHTAHIACKFA*A*RKRFDRRSG\KKL\EDGP*NSLKSG*C 
WPLVDMVPWASPMCV*EASQDYP\PLGRFWPFRYMRQTALRLGVNQK 
QLDKK\AAG\AGKVHQSLAQKA*KPKWERKRLISTLSSIjDT 

8445 

A 

2 

443 

ANAALCQHPTGCQTSADLHWTLPPPRHSPPPPPLHILPPPRHRPPPP 
PLHIIPPPPPPFHSQAQSSLRPLAPLHFSLSEGHGSARPPSTEHHPG 
QMGAGEAPPPPPLPPPPSQTRKYCRGRPPRRQNPANCSGCARPESLL 
PPLDPG 

8446 

A 

76 

413 

LRLWISNSFEYQNTGQDKMHVRSYIACELSVDFQWVLVTPHQCPGI 
VGVSEETSVSAKMMPRAVISWWEERRNRSQRKGDEKHNHESQGIKFG 
ENFGRDFERGALSQPWTY 

8447 

A 

524 

955 

FCSMSSQKWSWQAQPLSWRHWSQGPVPSLPAKLLFKGFIjPGTAKPAC 
SAFREAAALAFIQDNTCTAISEEKGNGSRFLGFPSARZiRGRPRAESPR 
PEPRARPRATQPGPAAPAAHATPPPGPAPAPYLVIRGASGGRGNVRG 
PK 

8448 

A 

3 

208 

AASTTPTRG CHAPS VHTAPTHTAYVYTRPSGASAFASPQQSTVSVPD 
SNS S S PNHQGDG AS QTSGEQ I 

8449 

A 

3 

208 

AASTTPraGCHAPSVHTAPTHTAYVNNKPVGASALASPQESTGRVSD 
SNSSSPNHQGDGASQTSGEQI 

8450 

A 

3 

329 

TRNDERPAAEYEQPWEWKKEQIVRALSVQFEGAERPSFREETVRQHH 
RQKS WTQKI LKPALSDHSEGEKVDPGLPLEKQPWYHGAI SRAEAESR 
LQPCKEAGYLVRNS 

8451 

A 

56 

325 

FPLDSTGSELKQNILSFTGLPPAMQKVMYKGLPPEDKTLREIKVTSG 
AK I MGGG S P INGVLS VHTP I YS AHLD AKAEKNKKE PLCRQKQ 

8452 

A 

33 

159 

QRLPRTPLTGQGTRRLHRSSQTLPVSKVYFKNYVPCRRLQS 

8453 

A 

2 

494 

TVTIYSLFLFLFVRNIHKNQCKTISVGVLFYITKLPLDISYKRGGLT 
GEKRDKLALQSTARNIVFKIDFQHMDPWGKEEGAFDTGRVCDERCSL 
LVPCPVSGVRGSLCYGLLYFRVKRNKGVSLSTPPPQPHPPPEPAVGC 
IWTKPAKPCFSLCSGGRLKPDMG 

8454 

A 

130 

716 

LFARLPEAIPWRQGKESSAQIRLEPPSRAQGETGFSWLGPDTTHSRF 
WWWLGL/CGEGWAGIPLCFFSPRKYNTPITR0FLGPPLTGQGNKKT\ 
NRSSQTLPVIEKPLSLL*KLLPSLRERE*LFIGLGVRAPSPGSLLIS 

GLGAQACPQ/ glpclclypclrgpgqdfpnqprpqhpps PQKPPSSF 

PHGSYVESLF 

8455 

A 

1 

578 

GGRSVLPVDPRVRSHFPMTHGNTGLSAGLDASAQTT\SHELTIPNDL 
IGCIIGRQGAKXNEIRQMSGAQIKIANPVEGSTDRQVTITGSAASIS 
LAQYLINVSLENAKPSSQAASVTIPDHIjS INLSQPSTPS SSFSSTTT 
PSLATAGTSDAPSSLPNPSSDRPLCLQSAWHETHPFPGSPPPRLLLP 
ATPPL 

8456 

A 

2 

875 

I ITLAGPTNAIFKAFAMI IDKLEEDI S \ SSMTNSTAASRPPVTLRLV 
VPASQ\CGSLIGKGGCKIKEYTREYRGFRSKVAGDMLPNLTERA\IT 
IAGIP\QSIIECVKQICWMLETLSQSPPKGVTIP\YRPK\PSSSPV 
ILQGGQPYTIQGQYAIPQPDLTKLHQLAMQQS\HFPMTHGNTGIQWH 
LNPASPRGSKAYW\AGL\DASAQTTSSWNSPFPNGFDWAGIIGAFKG 
AKI\NEIRQM\SGAQIKIANPVE\GST\DRQVT\IT\GSAASISPGV 
NIIINVRLSSE\TGGMGSS 

8457 

A 

3 

376 

HSSGLGGGGVMLVHDIRRNESHIilDFRESAPGALREETLQRSWETKP 


JL U O IB >Mh& N»3; „ O 2 •£!! 6 f Si H! 






GLLVGVPGMVKGLHEAHQLYGRLPWSQVLAFAAAVAQDGF1TVTHDLA 
RAlxAEQLPPNMSERFRETFLPFG\RPPLPGS 

8458 

A 

1 

1965 

MAAENEASQES ALGAYS PVDYMS ITS FPRLPEDEPAPAAPLRGRKDE 
DAFLGDPDTDPDSFLKSARLQRLPSSSSEMGSQDGSPLRETRKDPFS 
AAAAECSCRQDGLTVIVTACLTFATGVTVALVMQIYFGDPQIFQQGA 
WTD AARCTS LG I EVLS KQG S S VD AAVAAALCLG I VAPHS S GLGGGG 
VMLVHDIRRNESHLIDFRESAPGALREETLQRSWETKVGTLDLLHPIi 
LSGPPRLPWSQVLAFAAAVAQDGFNVTHDLARALAEQLPPNMSERFR 

AIT Jjrout\ r r Jjr OOlji-iflXV ir X/ J_li-VJ_» VUJJVJjO X SUirniir 1 rtUUlM JlJ X XjXjJ 1 V 

AEAQHAGGVITEEDFSlSTx-SALVEKPVCGVTRGHLVLSPPPPHTGPAIi 
T AI.N T T iEGFNLiTSLiVSREOAT,HWVAETTjK T ALAXiASRLGDPVYDST 
I TESMDDMIiS KVEAAYLRGH INDSQAAPAPLLPVYELDGAPTAAQVL 
IMGPDDFIVAIWSSLNQPFGSGLITPSGILLNSQMLDFSWPNRTAim 
SAPS LENS VQPGKRPLSFLLPT\A^PAEGLCGTYlxALGANGAARGLS 
GLTQVLL1WLTLNRNLSDS LARGRLH PDLQSNLLQVDSEFTEEEIEF 
LEA^GHHVEKVDVLSWVHGSRRTNNF 1 1 AVKDPRS PDAAGATI I» 

8459 

A 

4 

267 

DG VLLCHLGT I S AHCNLHLPGSS S S PAS VS QI AGVTGVHYHVWLNFF 
LLVETOFCHVGQADTPFLPKKKKKKAW 

8460 

A 

10 

465 

TGPVAMGRV I RGQRKGAGS VFRAilVKHRKGAARLRAVDFAERHGY IK 
G I VKD 1 1 HD PGRGAPIxAKVVFRDP YRFKKRTELF I AAEG IHTGQFVY 
PGKKAOTjNTGNVTjPVGTMPEGTTVCX PVTSKKVTSSANRAWGWAG 
GGRIDKPILKA 

8461 

A 

5 

807 

TGPVAJyiGRVIRGQRKGAGSVFRAilVKHRKGAARLRAVDFAERHGYIK 

vj x v s\u x x JiXyJtr vjjx\.vji k vir xxrvrv. v v jt xwr x rv.jr x\xvx\. x h>xjx x/vLDVjrxn xvyr v x 

CGKKAQLNIGlWLPVGTMPEGTIVCCLGGEAWKTRGKIiAl^^ 
VISHNP\ETKJCTRVKLPSGS\KKVISSA1TOAVVGVVAGGWPELTNPS 

l.kag\ pjxyhkykak/rorncwp\ rirg\ vamwpvehtfgggo /hpoh 

IASPPTIRRDAP/AGRK^GLIAARRTGRLRGTKTVQEKEN 

8462 

A 

11 

350 

SILQKAKAKILEFATPTTKKEAQKFIGLFGYWRHHMPHMGNILQPLR 
AVTRKRYDFHWGQKESVAFQQAEQAVQLTPDLWPIRNRPVELQVAVxj 
dohanws lrokoggkrvpl 

8463 

A 

305 

676 

RSKSDSAADLPSRRARLSGTGEKASENKGDPVHD*ADGGFVPLEDSL 
CM*FPAWASSPSAJU^SIGLTVQVAGLQGLPWAPWVVPLGSTGRRSSG 
ELEES EAG YLSTS STDGLLGS FKEGCS S C 

8464 

A 

352 

784 

DTAl^PPPYIQRlxACSAJaAPPTASPAPLHVPVPGVQGLIQAVLLVIilF 
VLRKR I KLTVELFQ I TNKAI S S APFLLFQPLWTFAI LI FFWVIiWVAV 
LLSLGTACGRPRCTGSLCSQRPFHQCQRAWLCGWADQQLNHFLIFRR 
CPS 

8465 

A 

3 

383 

HEAYAPRIFFEGSRPPWVTPRSQQDCSEYLRFLLDRLHEEEKILKVQ 
ASHKPSEILECSETSLQEVASKAAVLTETPRTSDGEKTL.IEKMFGGK 
LRTHIRCLNCTSTSQKVEAFTDLSLAFWPSSS 

8466 

A 

1 

233 

GTRQEGNTTVLVEVNCETDFVSRjNTL 

Q PS AY S KVS LGFVS S VLNLLS S LGLLL FLL 

8467 

A 

3 

358 

HEWMHLTQPELQELTSLLTVNSEFVDWRKFLLVTSMPWPIPLEEELL 
ETLQKFKAVDKEQLGTITFEQYMQAGLWFTGDEDIKIPENPLEPLPF 
NRQEHLIEFFFRLFADYEKDPPQL 

8468 

A 

2 

374 

AREGL WKS KFTPENTTKEL F YKADGE S CS TSMMYQEGKF P YRHMAES 
TQVLELPFKGDYITOGLILSKPDKILAPVDKELTPKVLPDWLDEDED 
MMLVVHMPRLLIEDGFSLKDQLPDNGLVHL 

8469 

A 

2 

484 

KFQNAIjLVRYTKKVPQVSTPTLVEVSRI^GKVGSKCCKHPEAKRM 

a^dylsvvlnqlcvlhektpvsdrvtkccteslvnrrpcfsalevde 

tyvpkefnaetftfhWkhkpkatkeqlkav^dfaafve 

ketcfaeegkklvaasqaal 

8470 

A 

2 

435 

GMFLYEYAPJUIPDYSVVLLLRIjAKTYETTLEKCCAAADPHECYAKVF 
DEFKPLVEEPQNLIKQNCELFEPDICTLSEKERQIKKQTALVELVKH 
KPKATKEQLKAVMDDFAAFVEKCCKADDra 
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LGL 

8471 

B 

73 

384 

MFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCAAADPHECYAKVFD 
BFKLLVEEPQNL I KQAWRVQI PECAI S SLHQESTPSVNSNSCRGLKK 
PRKSGQQML* 

8472 

C 

11 

475 

MFLYEYARRHPDYSVVLLLRIJUCTYDAEDYIjSVVIiNQLCVLHEKTPV 
q dp vtlfp ptp 9 t . vn r p p p f ^ a t . f vdf t yvp kp ftj a ft ftfw zvd t ptt . 
S E KE R Q I KKQT AL VE L VKH KP KATKE QL KA VXDD FXAFVE KC C KGDX 
KGELLXRRXVXKLL 

8473 

A 

1 

608 

EFSMLEKSHCIAEVENDEMPADLPSKAADFVESKDVCKNYAEAKDVF 
LGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCAAADPHECx'AKV 
FDEFVNRRPCFSALEVDETYVPKEFNAETFTFHADICTLSEKERQIK 
KQT AL VE L VKH KP KAT KE Q L KA VMDD FAAF VE KCC KADD KE TC F AE E 
GKKLVAASQAALGL 

D A 1 A 

o 4 / 4 

A 

i 
1 

C Q O 

71 W TPFlNJOriQ TC Q VT.VnTPPWDTJ.PVQWPT AP\7FWnPM"D7x , nT.PQT.7x 7x 

DFVESKDVCKNYAEAIOWFLGMFLYEYARIfflPDYSVV^^ 
xrrLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEY 
KFQNALLVRYTKKVPQVSTPTL\^VSRNLGKVGSKCCKHPEAKRMPC 
AED YL S RG PRTS RM FAKTMLRQRM S S WAC / CL YEYARRHPD YS WLL 
LRLAKTYETCLEKCCAAADPHECYAKVPDEFKPL 

LFEQLGEYKFQNALLVRYTKKVPQVSTPTLVEVSRNLGKVGSKCCKH 
PEAKRMPCAEDYLSRGP 

8475 

A 

1 

1428 

MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFKAtV 

LIAFAQYLQQCPFEDHVKLVNFArrEFAKTCVADESAENCDKSLHTLF 

GDKLCl^ATLRETYGEhlADCCAKQEPERNECFLQHKDDNPNLPRLVR 

Pfi VD VM CTAFJ1DNE E T FLKKY L x\EIARRHFYFYAPELLFFAKRYKAA 

FTECCQAADKAACLLPKLDELRDEGKASSAKQRIiKCASLQKFGERAF 

KAWAVARLSQRFPKAEFAEVSKX.VTDLTKVHTECCHGDLLECADDRA 

DIjAKY I CENQDS I SS KLKECCEKPLLEKSHC I AE/ HRNLGKVGSKCC 

KHPEJUOWPCAEDYLSWLNQLCVLHEKTPVSDR\T^^ 

PCFS ALE VDET YVPKE FNAETFTFHAD I CTLS EKERQI KKQTALVEL 

VTTH'K'P'K'JVT'K'FOT .TC7X\/M - nr*'C2l 7A F\7T*Vr , PK"7AT^DWT?Tr , l? 7AT?"G*f2 VVT VTA 7X Q 
V J\jl Ivlr Jvrt X XVCj^Xj V IMLJU-T rt/ir V r. m i . r\ m i ji j i\ r«. J. i r M r* A J IS A Li VhAO 

QAALGL 

8476 

A 

1 

748 

ADLP S LAADFVE S KDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLL 
LRLAKTYETTLEKCCAAAHPHECYAKVFDEFKPLVEEPQNLIKQNCE 
LFEQLGEYKFQNALLVRYTKKVPQVSTPTLVEVSRNI^KVGSKCCKH 
PEAKRMPVNRRPCFSALETOETx^PKEFNAE 

QIKKQTALVELVKHKPKATKEQLKAVMDDFAAFVEKCCKADDKETCF 
AEEGKKLVAASQAA 

8477 

A 

1 

1515 

MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFKALV 
LIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAF^CDKSl^TLF 
GDKLFVEVSR1\TLGKVGSKCCKHPEJVK^ 

jvx irvoiJKV x J\x*C i CiOJj vxviKKJrCr oAbtiVUbl x V ir Jvbr 1>J AiS JL r ArxlMJJ 
I CTL S EKERQI KKQTAL VELVKHKPKATKEQLKAVMDD FAAFVEKCC 
KADDKETCFAEEGKKLVAASQAALGLTCEACQEPGGLVVPPTDAPVS 
PTTLx^^DISEPPIxHDFYCSPJjLDLWLLDGSSRLSEAEFEVLKAFV 
VDMMERLR I S QKWVR V AWE YHDG SHAY I GLKDRKRPS E LRR IAS QV 
KYAGSQVASTSEVLKYTLFQIFSKIDRPEASRIALLLMASQEPQRMS 
RNFVRYVQGLKKKKVIVIPVGIGPPiANLKQIRilEKQAPENKAFVLS 
SVDELEQQRDEIVSYLCDLAPEAPPPTLPPDMAQV 

8478 

A 

1 

2931 

MKWVTF I S LLFLFS S AYS RGVFRRTPLGPAS SLPQS FLLKCLEQVRK 
IQGDGAALQEKLCATYKLCHPEELVLLGHSLGI PWAPLS SCPSQALQ 
LAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTI 
WQQMEELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVS 
YRVLRHLAQPGGGGDAHKSEVAHRFKDLGEENFKALVLIAFAQYLQQ 
CPFEDHVKLVNE\TTEFAKTCVADESAENCDKSLHTLFGDKLCTVATL 
RETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVDVMCTAF 
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HDNEETFLKKYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADK 
AACLLPKLDELRDEGKASSAKQRLKCASLQKFGERAFKAWAVARLSQ 
RFPKAEFAEVSKLVTDLTKVHTECCTGDLLECADDRADLAKYICENQ 
DSISSKLKECCEKPLLEKSHCIAEVENDEMPADLPSLAADFVESKDV 
CKITYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCCA 
AADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLV 

KY 1 J\.iV V Vb JL Jr A IiVIjVo x\X* jjulS.vu5 ivn tr iL/-\jvtvl*j tr x xjo v v 

LNQLCVLHEKTPVSDRVTKCCTESLVE1TOEMPADLPSLAADFVESKD 
VCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCC 
AAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALL 
VRYTKKVPQVSTPTLVEVSRNLGKVGSKCCKHPEAKRMPCAEDYLSV 
VLNQLCVLHEKTPVSDRVTKCCTESLVNRRPCFSALEVDETYVPKEF 
N7VETFTFHADICTLSEKERQIKKQTALVEIiVLCRCRSS*M\HAKAVM 
DDFAAFVEKCCKADDKETCFAEEGKKLVAASQAALGL 

8479 

B 

1 

6124 

MKWVTFISLLFLFSSAYSRGVFRRDAHKSEVAHRFKDLGEENFKALV 
L I AFAQYLQQC P FEDHVKL VNE VTE F AKTCV ADE S AENCDKS LHTLF 
GDKLCTVATLRETYGBMADCCAKQEPGRNECFLQHKDDNPNLPRLVR 
PEVDVMCTAFHDNEETFLKKYL YE I ARRHPY F YAPELLFFAKRYKAA 
FTECCQAADKAACLLiPKLDELRDEGKAS SAKQRLKCAS LQKFGERAF 
KAWAVARIjSQRFPKAEFAEVSKLVTDLTKVHTECCHGDLLECADDRA 
DLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPADLPSL 
AADFVESKDVCK}TYAEAKI)V 

YETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLG 
EYKFQNALLVRYTK^LQQCPFEDHVIOjWEVTEFAKTCVADESAEN 
CDK5LHTLFGDKLCTVATLRETYGEMADCCAKQEPGRNECFLQHKDD 
NPNLPRLVRPEVDVMCTAFHDNEETFLKKYLYEIARRHPYFYAPELL 
FFAKRYKAAFTECCQAADKAACL.LPKLDELRDEGKASSAKQRLKCAS 
LQKFGERAF KAWAVARLS QRFPKAEFAEVS KXiVTDLTKVHTECCHGD 
LLECADDRAD1AKY I CENQD S I S S KLKECCEKPLLEKSHC I AEVEND 
EMPADLPSLAADFVESKDVCKRYAEAKDVFLGMFLYEYARRHPDYSV 
VLLLRLAKTYETTLEKCCAAADPHECYAKVFDEFKPLVEEPQNLI KQ 
NCELFEQLGEYKFQNAAISSLHQEKVSRNLGKVGSKCCKHPEAKRMP 
CAED YLS WLNQLCVLHEKTPS FLLKCLEQVRKI QGDGAALQEKLCA 
TYKLCHPEELVLLGHSLG I PWAPLS SCPS QALQLAGCLSQLHSGLFL 
YQGLLQALEGI S PELGPTLDTLQLDVAD FATT I WQQME ELGMAPALQ 
PTQGAMP AFAS AFQRRAGGVLVASHLQS FLEVSYRVLRHLAQPGGGG 
DAHKSEVAHRFKDI^EENFKALVLIAFAQYIiQQCPFEDHVKLVNEVT 
EFAKTCVADESAENCDKSLHTLFGDKLCTVATLRETYGEMADCCAKQ 
EPERl^CFLQHKDDNPNLPRLVRPEVDVMCTAFHDNEETFLKKYLYE 
IARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPKLDELRDE 
GKASSAKQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLV 
TDLTKVHTECCHGDLLECADDRADLAKYI CENQDS I SS KLKECCEKP 
LLEKSHCIAEVENDEMPADLPSLAADFVESKDVCKNYAEAKDVFLGM 
FLYEYARRHPDYSVVIiLLRLAKTYETTLEKCCAAADPHECYAKVFDE 
FKPLVEEPQNLIKQNCELFEQIiGEYKFQNAIiLVRYTKKVPQVSTPTL 
VEVSRNLGCTGSKCCKHPEAKRMPCAEDYLSVVI^QLCVLHEKTPVS 

T^P\7T , VrY^TTTQTA7TtfPPPr'T?Q AT. PA/HPT YVPTC^ 

fkppntk"kntat,vkt.vkhkpkatkeolkavmddfaafvekcckaddk 
ETCFAEEDAPVSPTTLYVEDISEPPLHDFYCSRLLDLVFLIjDGSSRL 
SEAEFEVLKAFVVDMMERLRISQKWVRVAVVEYHDGSHAYIGLKDRK 
RPSELRRIASQVKYAGSQVASTSEVLKYTLFQIFSKIDRPEASRIAL 
LLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHANLKQIRLIE 
KQAPENKAFVLS S VDELEQQRDE I VS YLCDLAPEAPPPTLPPDMAQV 
# 

8480 

A 

241 

348 

SFFSQTQASEQLTKSKQGLALRTLPPVQFPPPPPRP 

8481 

A 

2 

191 

VGRVGLELLASSDLPTSGSPSAGITGMSHCPWPNFLLIFIGYFFTVL 


iotas' 
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TPCEYYSGGMTVPLWL 

8482 

A 

3 

372 

APQIKRHLEVKASPSSLEGESSLSQQPTPLPQTFMAPKRIiDKAKEAQ 
RTPSPGGGQPLSPLPHPFTQELFTVPPVMPGHADTLLLKCPYPILRP 
APRPPAYRIHPVCQEAIGTLaAILSALCS 

8483 

A 

1 

351 

TLSTLRYADRAKQIKCWAVINEDPNAKIiVRELKEEVTRLKDLLRAQG 
LGDIIDIDPLIDDYSGSGSKYLKDFQNNKHRYLLASENQRPGHFSTA 
SMGSLTSSPSSCSLSSQVGLTSV 

8484 

A 

44 

455 

HACERIHTGEKPYICFEWGSFFRKHSNLTQHQRIHTGEKPHKCDECG 
KTFQTKANLSQHQRIHSGEKPYKCKECGKAFCQSPSLIKHQRIHTGE 
KPYKCKECGKAFTQSTPLTKHQRIHTGERPYKCSECGKAFIQK 

8485 

A 

200 

409 

MRQDFEIILFLFVSTLVCCSLVIKRKMWPGAMAYACNPSTLGDRGRQ 
I TRSGVQDQPGQYGETS SLLKK 

8486 

A 

1 

110 

PTRS GTFKS GVL YKGHGE I PNW ADENS VDEAE VC AS 

8487 

A 

48 

464 

QLGTRHNISASQVNAWHVAESAVNYSLGDEQGFCGSPWPNCNFPEYF 
TYSVLLSLLACSVFLQINCIGKLVLMLAIELIYVLIVEVPGVTLFDN 
ADLLVTANAIDFFNNGTSQCPEHATKVALKWTPI I ISVFVLAIi 

8488 

A 

1 

721 

MHWGVGFAS SRPCAAH5LS WNQS I S FFG WWAGSEEPFSF YGD 1 1 AFPL 
QD YGG IMAG LG SDPWWKKTL YLTGGALLAAAAYLLHELIj V I RKQQE I 
DS KDAI I LHQFARPNNGVPSLS P FCLKMETYLRMADLP YQNYFGGKL 
SAQGKMPWIEYNHEKVSGTEFIIDFLEEKLGVNLNKNLGPHERAISR 
AVTKIWEEHFYXALSWRWYHPVPTCPLQKFLQFVRQLMIjFMRTPSPS 
ALPGH 

8489 

A 

3 

785 

SCAACTGGLALLCFRPCVVDLSWNQSISFFGWWAGSEEPFSFYGDII 
AFPLQDYGGIMAGLGSDPWWKKTLYLTGGALLAAAAYLLHELLVIRK 
QQE I DSKDAI I LHQFARPNNGVPS LS PFCLKMETYLRMADLP YQNYF 
GGKLSAQGKMPWIEYNHEKVSGTEFIIDFLEEKLGVNLNKNLGPHER 
AISRAVTKMVEEHFYCLPAVNLCFWNKVQLPVLRPVLIVPLPAPPAS 
PLTGFLKCSLYPRAPYRSSCSSSGS 

8490 

A 

5 

302 

FSNAERVGGLEEERESVGPLRQDFSMSSSALIGVLVIAVAIATVIVI 
SLVMLRKRQYGTISHGIVEVDPMLTPEERHVNKMQNHGYENPTYKYL 
EQMQI 

8491 

A 

1 

2086 

MAATGTAAAAATGRLLLLLLVGIjTAPALALAGYI EALAANAGTGFAV 
AEPQIAMFCGKiNMHVNIQTGKWEPDPTGTKSCFETKEEVLQYCQEM 
YPELQITNVMEANQRVSIDNWCRRDKKQCKSRFVTPFKCLVGEFVSD 
VIJ^VTEKCQFFHKERMEVCENHQHWHTVVKEACLTQGMTLYSYGMLL 
PCGVDQFHGTE YVCCPQTKI IGS VS KEEEEEDEEEEEEEDEEEDYDV 
YKSEFPTEADLEDFTEAAVDEDDEDEEEGEEWEDRDYYYDTFKGDD 
YNEENPTEPGSDGTMSDKEITHDVKVPPTPLPTNDVDVYFETSADDN 
EHARFQKAKEQLEIRHRNRhTORVKXEWEEAELQAKNLPKAERQTLIQ 
HFQAMVKALEKEAASEKQQLV^THLARVEAML^ 

QSDPPRPHRILQAIiRRYVRAENKDRLHTIRHYQHVLAVDPEKAAQMK 
SQVMTHLHVIEERRNQSLSLLYKVPYVAQEIQEEIDELLQEQRADMD 
QFTASISETPVDVRVSSEESEEIPPFHPFHPFPALPENEGSGVGEQD 
GGLIGAEEKVINSKNKVDENMVIDETIjDVKEMIFNAERVGGLEEERE 
SVGPLREDFSLSSSALIGLLVIAVAIATVIVISLVMLRKRQYGTISH 
GIVEVDPMLTPEERHLNKMQNHGYENPTYKYLEQMQI 

8492 

A 

3 

2011 

ANAGTGFAVAEPQIAMFCGKLNMHVNIQTGKWEPDPTGTKSCFETKE 

EVLQYCQEMYPELQITNVMEANQRVSIDNWCRRDKKQCKSRFVTPFK 

CLVGEFVSDVIiLVPEKCQFFHKERMEVCENHQHWHTVVKEACLTQGM 

TLYS YGMLLPCGVDQFHGTE YVCCPQTKI I GSVSKEEEEEDEEEEEE 

EDEEEDYDVYKSEFPTEADLEDFTEAAVDEDDEDEEEGEEWEDRDY 

YYDTFKGDDYNEENPtEPGSDGTMSDKEITHDVKVPPTPLPTNDVDV 

YFETSADDNEHARFQKAKEQLEIRHRNRMDRVKKEWEEAELQAKNLP 

KAERQTLIQHFQAMVKALEKEAASEKQQLVETHLARVE 

ALENYIiAALQSDPPRPHR I LQALRRYVRAENKDRLHTI RHYQHVXiAV 

DPEKAAQMKSQVMTHLHVIEERRNQSLSLLYKVPYVAQEIQEEIDEL 


\0 (o*o 
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LQEQRADMDQFTAS I SETP VDVRVS SEES EE I PPFHPFHPFPALPEN 
EDTQPELYHPMKKGSGVGEQDGGLIGAEEKVINSKNKVDENMVIDET 
LDVKEM I FNAERVGGLEEERES VG PLRED FS LS S SAL I GLLVI AVA I 
ATVI VT SLVMLRKRQYGTI SHGI VEVDPMLTPEERHLNKMQNHGYEN 
PTYKYLEQMQI 

8493 

A 

305 

506 

LFQVIDETLDVKEMIFNAERVGGLQEERESVGPLREDFSLSSSALIG 
LLVIAVAIATVIVISLVMLR 

8494 

A 

3 

1588 

EEEDYDVYKSEFPTEADLEDFTEAAVDEDDEDEEEGEEVVEDRDYYY 
DTFKGDDYNEENPTEP6SDGTMSDKEITHDVKAVCSQEAMTGPCRAV 
MPRWYFDLSKGKCVRFIYGGCGGNRNNFESEDYCMAVCKAMIPPTPL 
PTNDVDVYFETSADDNEHARFQKAKEQLE IRHRNRMDRVKKEWEEAE 
LQAKNLPKAERQTLIQHFQAMVKALEKEAASEKQQLVETHLARVEAM 
LNDRRRMALENYLAALQSDP PRPHRI LQALRRYVRAENKDRLHTI RH 
YQHVLAVDPEKAA\QMK\SQVMTHL\HVIEERRNQSLSLLYKVPYVA 
\QE\IQEEIDELLQEQRADMDQFTASISETPVDVRV\SSDESEEIPP 
FHPFHPFPALPENEGSGVGEQDGGLIGAEEKVINSK\NKVDEKHGPL 
NETLDVKEMIFNAERVGGL\EEERESVGPLREDFSL\SSSALIGLLV 
IAVAIAT\VIVISLVMLRKRQYGTI\SHGIVE/VVDPMLTPEERHLN 
KMQNHGYENPTYKYLEQMQI 

8495 

B 

73 

1845 

MAATGTAAAAATGRLLLLLLVGLTAPALALAG Y I EALAANAGTGF AV 
AE PQ I AMFCGKLNMHVN I QTGKWE PD PTGTKS C FETKEE VLQ YCQEM 
YPELQITNVMEANQRVSIDNWCRRDKKQCKSRFVTPFKCLVGEFVSD 
VLLVPEKCQFFHKERMEVCENHQHWHTVVKEACLTQGMTLYSYGMLL 
PCGVDQFHGTEYVCCPQTKIIGSVSKEEEEEDEEEEEEEDEEEDYDV 
YKSEFPTEADLEDFTEAAVDEDDEDEEEGEEWEDRDYYYDTFKGDD 
YNEENPTEPGSDGTMSDKEITHDVXAVCSQEAMTGPCRAVMPRWYFD 
LSKGKCVRFIYGGCGGNRNNFESEDYCMAVCKAMIPPTPLPTNDVDV 
YFETSADDNEHARFQKAKEQLE IRHRNRMDRVKKEWEEAELQAKWLP 
KAERQTLIQHFQAMVKALEKEAASEKQQLVETHLARVEAMLNDRRRM 
ALENYLAALQSD P PR PHR I LQ ALRR YVRAENKDRLHT I RH YQHVLAV 
D PEKAAQMKSQVMTHLHV I EERRNQ I LS LL YKVP YVAQE I QEE I DEL 
LQEQRADMDQFTAS ISEHPCGRPGEL* 

8496 

B 

55 

274 

MADANKAEVPGATGGDS PHLQPAEPPGEPRREPHPAEAEKQQPQHS S 
SSNGVKMENDESAKEEKSDLKEKSTGX* 

8497 

A 

3 

754 

FLFFEPHPAEARKQQPQHSSSSNGVKMENDESAKEEKSDLKEKSTGS 
KKANRFHPYSKDKNSGTGEKKGPNRNRVFISNIPYDMKWQAIKDLMR 
EKVGEVTY\^LFKDAEGKSRGCGVVEFKDEEFVKKALETMNKYDLSG 
RR VN I KED PDGEN ARRALQRTGTS FQG SHASD VG S GLVNL PPS I LNN 
PNI PPEVI SNLQAGRLGSTI FVANLDFKVGWKKLKEVFS I AGTVKAG 
RY\KEDKDGQSRGMGT 

8498 

A 

2 

347 

MPCDEEPSQLEELADFMEQLTPIEKYALNYLELFHTSIEQEKERTSE 
DAVMTAVRAWEFWNLKTLQEREARLRLEQEEADLLTYTREDAYSMEY 
VYEDVDGQTEVMPLWTPPTPP 

8499 

A 

1 

414 

AWHEETHKVDI/3LPEKXKNKJKWKJEPETRYSVLNOT 

TSPSKSVAHGQAPEMPLVKXKKKKKK\VSAALGKEVKRSQSKLRLRN 

TSP*VMTLRPPQRKR*SPKRR*SSQSSRSQL*KGRKRRRGKRGG 

8500 

A 

1 

1977 

MEVDFKIRKVGQWTITLQEHPVSVLLFIEETHPENKPTSTAVEESHI 
SRDTVMVNFQCQLDWSKGYLEAYLTKINSICSHRFLLDGSVTCHDET 
G S S RDARDTVGTGVR VNDE S VEQLGLRRTMWLG I CRG AAMAAVS TVT 
AFAGRPRPGRSRNPRGWAGDSKWTSGSRRSWLSRGGGEISPTGMITK 
THF\VYLGLPEKKECKI^VVKEPETRYSVIjNMDDYFADVSPIiRATSPS 
* \DVAHGQAPEMPLVIQQCKKKKKGVSTLCEEHVEPETTLPARRTEKS 
PSLRKQVFGHLEFL\RGKRKNKKSPLAMSHASGVKTSQ\DPRQGEEE 
TRVGKK\ SKKHKKEKKGGPGPHSL\ PVQDPWFCEAREARDVGDTCS V 
GKKDEEQAALGQKRKRKSPREHNGKVKKKKKIHQEGDALPGHSKPSR 
SMESSPRKGSKKKPVKVEAPEYIPISDDPKASAKXKMKSKKKVEQPV 
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IBEPALKIIKICKKKRKESGVAGDPWKBVVFPSEMDFAVGELRETDTDIj 
EVVLEKKGNMDEAHIDQVRRItALQEEIDRESGKTEASETIUCWTGTQF 
GQWDTAGFENEDQKLKFLRLMGGFKNLSPSFSRPASTIARPNMALGK 
KAADS LQQNIiQRD YDRAMS WKYS RGAGLGF STAPNK I FYI DRNAS KS 
VKLED 

8501 

A 

2 

376 

TAGFLLPQGLPCKYHLMYGSHLMPNVWRRTTTREPKPLGDAEPNTRS 
I^WEVKRSSSGPWVFKSNIACHIPIFRITSTSSAYD/WPFTHSF**F 
SGIIPPCPHGCHOPFSODMVTFPIX , OT,PAHT.W 

8502 

A 

2 

956 

GLSQPKPRKETPIPSDTEEEEPLLQAHPWCQKIEHEQPLGPQGLGQ 
GPPTVVEHTSYSAYTPTELWELAHPSLHQRLQVSRQLAQGKGDHILT 
EWLMAAMWTGWNDARELSKTVSKWQSYAELVEDTRDQRPHVELAIHW 
YPTNVQQVLVLVGTDADYSLVYGKRDKFLGKAAYTDGYRGQSVKMTV 
DYQELNKVTAPLHAAVLS ITDLMDHLTMELGQYHYAVDLANASLSDD 
IAPESQEQFGFTWEGRQWTFTVLPQGYVHSPTICHGLVATDLATWKF 
PKGIRLFHYIDDIMLTSDSPADLEAWPLLQQHLAA 
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SCI 


TISSUE 
ORIGIN 


RNA SOURCE 


adult brain 


GIBCO 


HYSEQ 
LIBRARY 
NAME 


AB3001 


SEQ ID NOS: 


8-9 105 140-142 144-145 180-183 220 230 292- 
293 296-297 488-490 536 546-551 559-560 579 
597-598 656-658 748-751 781 831-833 836-840 
974 1048-1051 1107-1109 1185-1186 1206-1209 
1287-1293 1297-1299 1305-1308 1316^1317 1334- 
1335 1354-1355 1369-1370 1413-1414 1425-1431 
1439-144i 1466-1470 1472-1477 1485-1489 1495- 
1497 1501-1502 1518-1520 1561 1592-1595 1602 
1607-1608 1610 1630-1634 1698-1700 1702-1710 
1718 1726-1728 1781-1789 1809-1810 1813-1815 
1824-1825 1854-1856 1890 1912 1950-1953 1970- 
1971 1980-1984 2016-2017 2050-2052 2059-2060 
2063 2068-2070 2079 2081 2131-2136 2186-2189 
2262-2264 2266-2267 2276-2278 2315-2322 2341- 
2346 2353-2357 2385-2386 2407 2417-2419 2430- 
2434 2436-2438 2440-2441 2445-2451 2453 2469- 
2470 2481 2487-2488 2511 2515-2516 2549-2550 
2559-2565 2567-2568 2571 2587-2590 2623 2636- 
2641 2644-2652 2655-2671 2679-2711 2716-2719 
2722-2724 2731 2794 2798 2810-2811 2826-2834 
2842-2848 2850-2851 2867 2913-2914 2917-2930 
3102-3105 3112-3114 3133-3136 3152-3155 3163- 
3164 3186 3262-3269 3276-3284 3287-3289 3298- 
3300 3305 3308-3312 3386-3389 3402-3406 3454 
3471-3474 3497 3508-3511 3524 3528-3530 3605- 
3610 3638 3642-3643 3669-3672 3678-3679 3724- 
3726 3733-3737 3782-3787 3840 3850-3860 3866- 
3870 3901-3904 3991-3998 4042-4043 4048-4051 
4055-4056 4061-4062 4080-4081 4128-4129 4170- 
4171 4182-4183 4203-4204 4208 4216-4217 4225- 
4227 4253 4266 4288-4289 4292 4327-4333 4415- 
4417 4422-4426 4436-4438 4441-4443 4445-4447 
4484 4502-4503 4510-4511 4514-4518 4553-4555 
4566 4614 4629-4634 4691-4692 4733 4737 4774- 
4776 4778-4779 4790-4795 4804-4805 4811 4843- 
4844 4910-4916 4922 4950-4953 5013 5029-5033 
5054-5055 5112-5113 5116 5146 5186-5189 5191- 
5192 5216-5220 5236-5240 5255 5258-5262 5334- 
5335 5343 5435-5440 5448-5451 5469-5471 5483- 
5493 5687-5690 5692 5710 5723-5729 5757-5758 
5818-5821 5829-5834 5848 5904-5914 5925-5930 
5999-6000 6016-6020 6023-6024 6078-6084 6087- 
6114 6119-6120 6180-6181 6305-6307 6348 6367- 
6368 6385-6389 6399 6440-6446 6455 6662 6670- 
6676 6691 6698 6751-6756 6780 6784 6811-6812 
6845 6890-6894 6949-6953 6957 6987-6989 7001- 
7002 7086-7087 7104 7152-7153 7281-7284 7310- 
7311 7375 7398. 7434-7447 7599-7600 7630-7633 
7730-7737 7739-7741 7770-7772 7797 7921-7922 
7958 8031-8033 8296 8361-8363 8372-8387 8430- 
8432 8455-8456 8466 


adult brain 


GIBCO 


ABD003 


6-9 51 85 93 104 140-142 
209 213-215 230 250-252 2 
298-321 330-340 345-349 3 


144-145 180-183 201- 
72 275-276 280-282 
60 370 383 395 402 


:i O d "+6 M-3 . O H iirli! 6 02 


416-418 425 466 488-494 498 527-529 537-538 
549-551 584 592 648 650-651 656-665 670 688 
714-715 718 726-728 735 752-756 759-760 767 
786-787 828-829 839-840 857-859 875-876 934- 
935 941-942 949-951 1000-1001 1022-1023 1037 
1185-1186 1190-1191 1206-1209 1216-1221 1227- 
1247 1261-1263 1287-1289' 1297-1305 1314-1317 . 
1324-1333 1341-1348 1354-1355 1357-1358 1368- 
1370 1374-1379 1394-1395 1410-1414 1439-1441 
1448-1450 1453-1454 1460-1462 1465-1470 1472- 
1480 1485-1491 1514-1515 1540-1548 1557-1570 
1572 7 1574 1579-1580 1583-1595 1602 1610 1619- 
1620 1624-1628 1630-1634 1637-1642 1655-1658 
1671-1672 1675 1712-1714 1716 1718 1728 1781- 
1783 1786-1789 1809-1810 1813-1815 1817-1821 
1824-1825 1832-1834 1839-1840 1850-1863 1870- 
1874 1876 1890 1896 1904-1906 19i2 1916 1921- 
1927 1937-1945 1950-1953 1958-1961 1969-1971 
1980-1984 1986-1991 2003-2004 2018-2019 2032- 
2036 2038-2044 2050-2058 2061 2063 2067-2073 
2079 2081 2087-2097 2112 2120-2121 2123 2131- 
2142 2155-2157 2165-2175 2182-2183 2186-2191 
2199-2200 2204-2207 2223-2224 2226-2230 2237- 
2239 2243 2255-2257 2266-2271 2276-2278 2282- 
2283 2290 2297-2305 2315-2322 2324-2327 2329- 
2333 2335-2336 2341-2358 2370-2372 2381 2385- 
2386 2398-2403 2407-2415 2433-2434 2440-2441 
2445-2451 2455 2458-2461 2469 2476-2477 2487- 
2490 2511 2514-2520 2526-2528 2543-2550 2557 
2567-2568 2571 2585-2586 2589-2590 2592-2596 
2619-2623 2625 2637-2641 2644-2647 2651-2655 
2658-2660 2665-2667 2671 2684-2688 2692-2696 
2698-2703 2712 2716-2743 2748-2756 2758-2771 
2776-2795 2801-2804 2810-2811 2814-2815 2823- 
2834 2850-2851 2867 2875-2879 2913-2914 2931- 
2946 2948-2971 2974-2979 3014-3016 3018-3023 
3034-3039 3043-3044 3066 3068-3071 3075-3078 
3088-3091 3093 3102-3105 3115-3116 3122-3133 
3137 3149-3150 3159-3160 3163-3172 3182-3183 
3251-3252 3274-3284 3287-3292 3298-3300 3305- 
3314 3346-3356 3371-3374 3384-3389 3400-3401 
3410-3411 3425-3428 3450-3452 3454 3456-3463 
3469-3474 3486-3488 3494-3497 3507 3512-3516 
3524-3527 3548-3553 3555-3557 3566-3567 3588- 
3593 3597-3599 3605-3610 3617-3619 3629-3632 
3638 3642-3643 3658 3660-3663 3675-3679 3687- 
3692 3705 3722 3728 3730-3731 3733-3739 3743- 
3752 3761-3764 3769-3770 3776-3784 3794-3795 
3798-3801 3841-3849 3866-3891 3901-3904 3926 
3931-3936 4009-4011 4042 4048-4051 4053-4056 
4058-4060 4065 r 4066 4073-4079 4082-4085 4092- 
4095 4097-4100^ 4107-4109 4128-4129 4132 4153 
4166 4170-4171 4173 4182-4183 4191-4193 4202 
4223-4224 4255 4286 4288-4289 4298-4322 4330- 
4333 4335-4353 4382-4384 4422-4426 4444 4448- 
4452 4484 4491-4494 4498-4503 4510-4511 4514- 
4518 4550-4553 4559 4566 4609 4621 4644-4645 
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7 7 7**: 
3 2 / b 

-3278 

3355; 

3356 

3358 

3383 

3481- 

3485 

3508- 




3 51b 

3605 

-3610' 

3617- 

-3619 

3634 

-3636 

3678 

-3679 




7 7 a 7 
3 / U / 

-3713 

3733- 

•3737 

3958 

-3960 

4048- 

4051 

4055- 




4056 

4058 

-4060 

4073- 

-4079 

4086 

-4088 

4163 

4223- 




4224 

4249 

-4250 

4289 

4361 

-4362 

4422- 

4424 

4445- 




4447 

4498 

-4501 

4551- 

-4552 

4671 

4678 

4759 

4775- 




4776 

4791 

-4792 

4840 

4972 

-4976 

4979 

5216 

-5220 
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adult brain 


adult brain 


Clontech 


Clontech 


ABR006 


ABR0 08 


5236-5240 5246 5312 5466 5475-5478 5497 5503 
5631 5639-5641 5829-5834 5875-5881 5956-5960 
5988-5992 6003-6005 6011-6012 6032 6157-6158 
6174-6176 6455 6476-6479 6491-6499 6662 6670- 
6676 6858 6897-6898 6960-6961 6970-6976 7014- 
7027 7188-7190 7265 7464 7539-7549 7630-7633 
7638-7639 7672-7675 7748 7757-7760 7773-7774 
7907 8021 8202 8261-8264 8418-8419 8427 8466 


3-5 105 111 113 230 272 2 
432-433 465 468 517 757-7 
833 1113-1117 1122-1147 1 
1175-1176 1287-1289 1316- 
1346 1399-1401 1421-1423 
1490-1491 1540-1548 1562- 
1608 1610 1626-1628 1702- 
1767 1809-1810 1819-1821 
1944-1945 1950-1953 2020 
2290 2302-2305 2329-2331 
2386 2398-2403 2428-2429 
2495-2499 2544-2548 2561 
2580 2589-2590 2592-2596 
2647 2671 2684-2688 2698 
2810-2811 2881-2888 2905 
2979 3017 3029-3031 3045- 
3116 3133 3165-3172 3218- 
3284 3287-3289 3305 3311- 
3348 3355-3356 3411 3454 
3610 3656-3657 3699-3701 
3776-3778 3782-3784 3841- 
3887 3904 4035-4040 4042 
4102-4106 4164 4182-4183 
4386-4387 4445-4447 4510- 
4727 4828 4848-4850 4906 
4967 5022-5028 5034-5035 
5255 5313-5314 5418-5420 
5459 5485-5488 5580-5581 
5834 5882-5885 5956-5960 
6002 6048-6050 6123-6127 
6582 6717 6936 6941-6942 
7162-7165 7226 7346-7347 
7586 7595-7596 7630-7633 
8253 8355-8356 


94-295 307-320 401 
58 805 814-821 831- 
151-1152 1160-1162 
1317 1334-1335 1341- 
1439-1441 1460-1462 
1567 1579-1580 1602- 
1710 1726-1727 1765- 
1824-1825 1896 1912 
2027 2114-2119 2137 
2341-2343 2379 2385- 
2442-2451 2458-2461 
2565 2567-2568 2574- 
2612-2615 2636 2644- 
2703 2716-2719 2721 
2907 2951-2952 2978- 
3048 3095-3096 3115- 
3219 3262-3269 3280- 
3312 3334-3344 3346- 
3497 3548-3550 3606- 
3728-3729 3733-3739 
3849 3881-3882 3886- 
4053 4074-4079 4096 
4281 4289-4290 4343 
4511 4562 4702 4724- 
4928 4950-4953 4962- 
5107-5109 5223-5224 
5428 5431-5434 5452- 
5634 5723-5729 5829- 
5986 5993-5994 6001- 
6159 6243 6473 6563- 
7050-7053 7060-7062 
7377-7384 7446 7585- 
7748 8139 8227-8229 


8-19 22-28 32 38 67 
111 113 123-126 149 
211 217 227 230 241 
287 294-295 298-299 
370 380-382 384-385 
442-445 454-455 467 
491-494 499-500 517 
556 559-560 576-578 
634 688 692 714-715 
833 909-912 918 928 
979-980 982-985 991 
1038 1041-1043 1052 
1113-1120 1122-1149 
1190-1192 1213-1221 
1276 1287-1289 1300 


77-79 88 90 93 
-151 172-175 193 
•243 250-251 269 
305 330-340 345 
389-394 401-406 
469-473 476-477 
527-531 534-535 
580-581 583 592 
719 757-758 811 
931-932 941-947 
-994 1011-1012 1 
-1053 1055-1059 
1151-1152 1173- 
1244-1245 1261- 
-1302 1305 1311- 


104-107 
-194 210- 
-270 286- 
-349 365 
430-433 
480-482 
541-545 
631 633- 
-813 830- 

960-971 
014 1016 
1070-1105 
1176 1188 
1263 1275- 
1313 1316- 
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1317 

1374- 

1421- 

1498- 

1561- 

1607- 

1641 

1702- 

1789 

1823- 

1876 

1932 

1970- 

2002 

2067- 

2138- 

2182- 

2231- 

2271 

2358 

2389- 

2454- 

2521 

2592- 

2641 

2682- 

2743- 

2811 

2853- 

2893 

3016 

3097- 

3152- 

3186- 

3284 

3346- 

3386- 

3463 

3507- 

3550 

3617- 

3646- 

3694 

3727 

3766 

3805 

3881- 

3919 

4015- 

4051 

4092- 

4162 

4217 

4289 

4367 

4436- 

4537 


1334- 
1379 
1423 
1500 
1570 
1608 
1653- 
1710 
1801- 
1825 
1878 
1944- 
1971 
2016- 
2073 
2141 
2183 
2233 
2291- 
2362- 
2390 
2455 
2544- 
2597 
2644- 
2688 
2747 
2821- 
2855 
2902 
3034- 
3105 
3155 
3187 
3287- 
3348 
3389 
3477- 
3516 
3554- 
3628 
3652 
3699- 
3730 
3779- 
3841- 
3882 
3924- 
•4018 
4053 
-4095 
4170 
4219 
4294 
4374 
-4438 
4547 


1335 
1391 
1439- 
1511- 
1572- 
1616- 
1658 
172 6- 
1804 
1835- 
1890- 
1945 
1980- 
2017 
2102- 
2143- 
2186- 
2237- 
2294 
2364 
2392- 
2464- 
2553 
2603- 
2652 
2698- 
2758 
2822 
2857- 
2931- 
3035 
3111 
3157- 
3221- 
3289 
3353- 
3412- 
3478 
3522- 
3559 
3631- 
3656- 
3701 
3733 
3780 
3849 
3886 
3926 
4020 
4058 
4097 
-4171 
•4220 
•4295 
4376 
4480 
-4549 


1341- 
1396- 
1444 
-1513 
-1574 
-1617 
1660- 
-1727 
1806- 
-1840 
-1893 
1950- 
-1984 
2020- 
-2107 
-2147 
-2198 
-2240 
2297 
2375 
-2405 
-2467 
2558 
-2609 
2658- 
-2710 
2776 
2836 
-2859 
-2946 
3054 
3115 
-3158 
-3226 
3305 
-3357 
-3413 
3481 
-3524 
3564 
-3632 
-3657 
3704 
-3737 
3782 
3855 
-3887 
3931 
-4026 
-4062 
-4100 
4180 
4223 
4342 
4382 
-4484 
4551 


1347 

1398 

1455- 

1525- 

1579- 

1619- 

1674 

1759- 

1808 

1843 

1896 

1953 

1986- 

2027 

2114- 

2155 

2204- 

2249- 

2299 

•2377 
2407 
2473 
2565 
2616- 

-2660 
2712 

-2783 

•2838 
2868- 
2948- 

-3063 

-3116 
3165- 
3250- 

-3312 
3361- 
3425- 

-3485 
3528- 

-3569 
3634- 
3675- 

-3705 
3743- 

-3787 

-3857 
3894- 

-3936 
4035- 
4067- 
4102- 

-4181 

-4224 
4349- 

-4384 
4504 

-4552 


1354- 

1402- 

1462 

1537 

1580 

1620 

1691- 

1760 

1813- 

1854- 

1906- 

1958- 

1991 

2032- 

2121 

2165- 

2207 

2250 

2302- 

2379 

2420- 

2495- 

2574- 

2617 

2664- 

2716- 

2798 

2842- 

2873 

2952 

3068- 

3122- 

3172 

3252 

3315- 

3366 

3428 

3489- 

3530 

3606- 

3636 

3679 

3707- 

3747 

3791 

3861- 

3895 

3950- 

4040 

4069 

4109 

4190 

4228 

4350 

4397 

4506 

4561 


1355 

1407 

1465 

1542- 

1583- 

1633- 

1692 

1765- 

1815 

1856 

1911 

1961 

1998- 

2035 

2123- 

2170 

2209- 

2262- 

2305 

2381 

2423 

2499 

2580 

2630- 

2667 

2719 

2801- 

2848 

2875- 

2978- 

3071 

3132 

3178- 

3262- 

3323 

3371- 

3435- 

3490 

3540- 

3610 

3638 

3683- 

3713 

3753- 

3798- 

3867 

3897- 

3955 

4045- 

4074- 

4128 

4203- 

4264 

4361 

4422 

4514 

4569 


1359- 

1413- 

1490- 

1548 

1595 

1634 

1698- 

1769 

1820- 

1861- 

1921 

1964- 

1999 

2050- 

2126 

2174- 

2217 

2264 

2340 

2385- 

2442- 

2509- 

2585- 

2633 

2671- 

2722- 

2804 

2850- 

2878 

2979 

3086- 

3149- 

3179 

3269 

3334- 

3372 

3452 

3497- 

3543 

3612- 

3642- 

3686 

3717- 

3759 

3801 

3878- 

3900 

3983- 

4046 

4085 

4X29 

4207 

4266 

4362 

4424 

4518 

4572 


1370 

1414 

1491 

1557 

1602 

1636- 

1700 

1781- 

1821 

1869 

1930- 

1966 

2001- 

2060 

2128 

2175 

2225 

2268- 

2347- 

2386 

2451 

2510 

2588 

2636- 

2678 

2724 

2807- 

2851 

2881- 

3014- 

3087 

3150 

3183 

3276- 

3344 

3375 

3456- 

3499 

3548- 

3615 

3643 

3693- 

3723 

3765- 

3804- 

3879 

3904- 

3987 

4048- 

4089 

4157- 

4216- 

4271 

4366- 

4429 

4520- 

4607- 


J- 
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4608 

4675 

4745- 

4811 

4880 

4949- 

5001 

5081 

5180- 

5226- 

5325 

5416 

5466 

5554- 

5660 

5698 

5744 

5845 

5952 

5994 

6050 

6149 

6200 

6407 

6646 

6717 

6811 

6887 

6969 

7099 

7184 

7295 

7384 

7422 

7509 

7586 

7650 

7700 

7761 

7893 

8 012 

8253 

8299 

8409 


4613 
4693 
•4749 
4817 
4892 
4953 
5015 
5107 
5183 
5227 
5343 
5418 
5467 
-5560 
5664 
-5701 
-5753 
-5847 
5954 
5998 
6084 
6151 
-6202 
-6412 
-6649 
-6721 
-6812 
6907 
7001 
-7102 
7188 
7305 
7391 
7424 
-7510 
7595 
7662 
7703 
7794 
7907 
8039 
8259 
8320 
8421 


4626- 

-4694 
4757- 
4823- 

-4898 
4957- 

-5017 

-5109 
5186- 
5233- 
5351 

-5420 
5472 
5569 

-5667 
5705- 
5757- 
5851- 
5956- 
6009- 
6091- 

-6154 
6236- 
6448 
6652- 
6724- 
6828- 

-6908 

-7004 
7145- 

-7190 
7310- 
7398- 

-7425 
7521 

-7596 

-7666 
7714 

-7796 
7914- 
8115- 

-8260 
8325 
8457- 


4627 

4702 

4758 

4824 

4922- 

4958 

5029- 

5147- 

5189 

5240 

5372- 

5431- 

5492- 

5588 

5673 

5709 

5758 

5852 

5960 

6010 

6113 

6157- 

6241 

6501- 

6660 

6725 

6832 

6926 

7014- 

7146 

7198- 

7311 

7400 

7431 

7532- 

7599- 

7672- 

7729 

7813 

7915 

8116 

8277- 

£329 

8458 


4659 
4721 
4765 
4830 
4923 
4960 
5031 
5148 
5196 
5275 
5373 
5434 
5493 
5596 
5678 
5714 
5798 
5864 
5964 
6016 
6117 
6159 
6245 
6504 
6665 
6762 
6846 
6936 
7027 
7159 
7199 
7317 
7402 
7437 
7537 
7600 
7675 
7735 
7820 
7919 
8179 
8279 
8332 
8483 


4660 
-4723 
4767 
-4835 
4927 
•4961 
5060 
5153- 
-5199 
•5278 
5393- 
5446- 
5504- 
5624- 
-5679 
5719- 
-5805 
-5869 
-5966 
-6021 
-6118 
6161- 
6256 
6598- 
-6668 
-6763 
-6848 
-6937 
7046- 
-7160 
7209 
7324 
-7403 
7465- 
7539- 
7604- 
7689 
-7737 
7823 
7955 
8208 
8282 
8353 
8490 


4662 
4732 
4783 
4848 
4937 
4968 
5065 
5154 
5208 
5284 
5395 
5447 
5505 
5626 
5691 
5720 
5823 
5882 
5984 
6027 
6133 
6169 
6353 
6599 
6682 
6784 
6868 
6954 
7053 
7166 
7226 
7329 
7408 
7468 
7549 
7606 
7693 
7745 
7843 
7957 
8221 
8283 
8354 
6495 


4672- 
4738- 
4793- 
4853 
•4939 
-4970 
5072 
5167 
5216 
-5285 
5402 
5452 
5517 
5648 
5694 
5723 
-5838 
-5885 
-5985 
6043 
-6134 
6182 
-6355 
6603 
-6683 
6805 
6877 
-6955 
7056 
-7167 
-7227 
-7330 
7412 
7470 
7561 
7634 
-7696 
7748 
7867 
7982 
8239 
8296 
8402 
8498 


-4673 
-4741 
-4794 
4873- 
4941 
4999- 
5080- 
-5173 
-5220 

5324- 
-5403 
-5459 
-5518 
-5649 
-5696 
-5729 
5840 
5950- 
5993- 
6048- 
6147 
-6191 
6398 
-6606 
6691 
-6807 
6882 
6964- 
-7059 
7179- 
7287 
7377- 
-7414 
7498 
7584- 
-7636 
7698- 
7755 
-7868 
-7983 
-8240 
8298- 
8405 
-8500 


adult brain 


Clontech 


ABR011 


384-385 1064-1065 1439-1441 1602 1950-1953 
2258-2261 2268-2271 2810-2811 3133 3276-3279 
3287-3289 3346-3348 3410 3623-3628 3642-3643 
4229 4648-4652 4702 4841-4842 5107-5109 5359 
5416 5829-5834 7392-7393 7509-7510 7599-7600 
8355-8356 


adult brain 


BioChain 
Inst . 


ABR012 


14-19 26-28 140-142 384-385 416-418 1022-1023 
1246-1247 1439-1441 1445 1448-1450 1575-1577 
1712-1713 1896 1921 2079 2262-2264 2658-2660 
2684-2688 2953-2959 3276-3278 3346-3348 5452- 
5459 5829-5834 5853-5855 6456-6459 8397-8398 


adult brain 


Invitrogen 


ABR013 


384-385 1439-1444 1448-1450 2083-2085 2268- 


lOkfli 
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adult brain 


Invitrogen 


ABR014 


2271 2656-2657 2960-2962 2974-2977 3276-3278 
3346-3348 3544-3547 3605-3610 4001-4008 4080- 
4081 4213-4215 4674 4702 5829-5834 6119-6120 
7274-7275 8397-8398 


264-265 345-349 786-787 2658-2660 2725-2727 
2810-2811 2856 3064-3066 3133-3137 3149-3150 
3760 4000 4048-4051 4061-4062 4092-4095 4429 
4823-4824 4910-4916 5593 6032 6119-6120 7099- 
7102 7152-7153 7158 7162-7165 7527-7531 7730 


adult brain 


Invitrogen 


ABR015 


549-551 559-560 1182-1183 2487-2490 2589-2590 
2592-2596 2671 2692-2696 2731 2794 2913-2914 
2974-2977 3017 3043 3065 3163-3164 3781 3901- 
3903 3910-3919 4288 4854-4872 5452-5459 5475- 
5478 5492-5493 5624-5626 6032 6119-6120 6464- 
6466 6950 6957 7599-7600 8443-8444 


adult brain 


Invitrogen 


ABR016 


546-548 1022-1023 1050 2487-2490 2511 2559- 
2560 2587-2588 2644-2647 2704-2710 2889-2893 
2948-2950 2953-2959 3066 3073-3074 3138-3141 
3779-3780 3798-3801 3904 3986-3987 4020-4026 
4390 4793-4794 4922 5624-5626 6119-6120 6142 
6954-6955 7099-7102 7162-7165 7470 7551-7554 


adult brain 


Invitrogen 


ABT004 


38 90 105-107 113 143 172-175 210-211 230 
286-287 302 321 330-340 345-349 370-372 386 
402 478 480-482 534-535 543-545 565-575 688 
736-737 765 811-827 830 1113-1117 1151-1152 
1181 1213-1221 1244-1245 1303-1304 1334-1335 
1341-1346 1354-1355 1442-1444 1448-1450 1455- 
1459 1465 1490-1491 1498-1500 1525-1528 1549- 
1551 1562-1570 1607-1608 1619-1620 1633-1634 
1691-1692 1728 1765-1767 1786-1789 1813-1815 
1850-1851 1861-1863 1890-1893 1912 1921 1944- 
1945 1998-1999 2020-2027 2067 2124-2126 2143- 
2146 2165-2170 2262-2264 2282-2283 2289 2300- 
2301 2307-2322 2328 2332-2333 2340 2347-2349 
2398-2403 2420-2423 2440-2451 2482-2488 2522- 
2525 2536-2537 2549-2550 2561-2565 2574-2580 
2585-2586 2630-2633 2637-2641 2655 2658-2663 
2671 2698-2710 2739-2743 2757 2776-2777 2810- 
2813 2835 2842-2848 2850-2851 2874-2879 2913- 
2915 2940-2946 3029-3031 3045-3048 3115-3116 
3175-3177 3186 3221-3226 3259-3260 3262-3269 
3276-3279 3308-3310 3346-3348 3386-3389 3416- 
3418 3494-3496 3512-3516 3602-3604 3631-3632 
3646-3652 3683-3686 3699-3701 3710-3713 3738- 
3739 3785-3787 3794-3795 3798-3801 3841-3849 
3886-3887 3904 3910-3919 3926 3938-3941 3950- 
3955 4020-4021 4048-4051 4053 4058-4060 4063- 
4064 4066 4082-4085 4092-4095 4102-4109 4143- 
4144 4157-4162 4170-4171 4203-4204 4223-4224 
4252 4263 4294-4295 4323 4366-4367 4378-4380 
4433 4436-4438.4502-4503 4507 4520-4528 4556- 
4558 4662 4672-4673 4676 4702-4703 4705 4715 
4724-4727 4738-4741 4744 4768-4769 4796-4803 
4806-4809 4826 4892-4898 4929 4943 4957-4958 
5019-5021 5043-5049 5056 5107-5109 5116 5139- 
5140 5147 5152 5226-5228 5284-5285 5288 5323 






5389 

5393 

-5395 

5435-5437 

5520 

-5522 

5565- 

5566 




5639 

-5641 

5658- 

5659. 

5715 

-5718 

5721 

5723- 

•5729 




5736 

-5743 

5746- 

5753 

5757 

-5758 

5776 

5798- 

•5805 




5818 

-5821 

5829- 

5834 

5840 

5845 

-5847 

5850 

5869 




5882 

-5885 

5925- 

5930 

5949 

5988 

-5992 

6025- 

•6026 




6069 

-6073 

6119- 

6120 

6150 

6174 

-6176 

6235 

6359- 




6360 

6397 

6414- 

6418 

6486 

-6488 

6548- 

6549 

6712 




6759 

-6761 

6799- 

6807 

6817 

- 6 621 

oo7z - 

c a *7 A 
DO /4 

oy o / - 




6909 

6911 

-6915 

6939 

6956 

7001 

-7002 

/ \j \j j 

t V JL J. 




7080 

-7082 

7188- 

7190 

7192 

-7193 

7327 

•7 *S A Q 

1 C F\ 




7375 

7402 

-7403 

7408 

7422 

7424 

-7425 

•7 A A A 

1 A A C 




7470 

7539 

-7549 

7622 

7630 

-7633 

7650 

7 © 72 - 

•7o7d 




7720 

-7721 

7743 

7766- 

^7769 

7864 

7890- 

7 891 

t o "> -a 
7 943 




7929 

7933 

7970- 

7971 

8021 

8115 

-8116 
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8395 

8397 

-8398 

8418- 

-8419 

8427 

8457- 

Q A C Q 

8458 


adipocytes 

Stratagene 

ADP001 

14-19 26-28 102 

-103 

121 

140-142 384 

- 385 

4 03 - 




406 408 463-464 

468 

527- 

529 5 

79 645 

650- 

•651 




696 

328-829 937 

-942 

1022 

-1023 

1033- 

1035 

1110 - 




1111 

1151 

-1152 

1160- 

-1161 

1206 

-1209 

1213- 

-1221 




1244 

-1245 

1294- 

1302 

1327 

-1333 

1354- 

1355 

1391 




1413 

-1414 

1439- 

1441 

1445 

1465 

1481- 

1484 

1488- 




1489 

1511 

-1513 

1558- 

-1560 

1562 

-1567 

1572- 

-1574 




1592 

-1595 

1602 

1607- 

-1608 

1610 

1619- 

1620 

1659 




1679 

1691 

-1692 

1712- 

-1713 

1726 

-1727 

1774- 

-1776 




1781 

-1785 

1824- 

1825 

1870 

-1874 

1894- 

1895 

1912 




1929 

1934 

-1935 

1962- 

-1966 

1968 

1970- 

1971 

1974- 




1978 

1998 

-1999 

2079 

2103 

-2107 

2112 

2128 

2138- 




2141 

2182 

-2183 

2223- 

-2224 

2231 

-2233 

2244- 

-2248 




2262 

-2264 

2268- 

2271 

2282 

-2283 

2285 

2315- 

-2322 




2332 

-2333 

2382- 

2384 

2487 

-2488 

2502- 

2508 

2522- 




2525 

2567 

-2568 

2574- 

-2580 

2603 

-2609 

2630- 

-2633 




2642 

-2643 

2656- 

2660 

2722 

-2727 

2731 

2739- 

-2742 




2748 

-2749 

2784- 

2786 

2788 

-2794 

2796 

2817 

2823- 




2825 

2867 

2879 

2974- 

-2977 

3014 

-3016 

3018-3023 




3036 

-3039 

3054- 

3057 

3073 

-3078 

3102- 

3105 

3111 




3138 

-3141 

3149- 

3150 

3152 

-3155 

3165- 

3172 

3217 




3259 

-3260 

3274- 

3275 

3280 

-3284 

3287- 

3289 

3306- 




3307 

3324 

-3332 

3346- 

-3348 

3414 

-3415 

3419- 

-3421 




3425 

-3428 

3454 

3469- 

-3470 

3498 

-3499 

3507- 

-3511 




3524 

3555 

-3557 

3570 

3600 

-3601 

3605 

3623- 

-3628 




3631 

-3633 

3683- 

3686 

3699 

-3703 

3722 

3728 

3733- 




3737 

3771 

-3774 

3794- 

-3795 

3986 

-3987 

3991- 

-3998 




4048 

-4051 

4057 

4092- 

-4095 

4157 

-4162 

4170- 

-4171 




4194 

-4195 

4233- 

4240 

4294 

-4295 

4422- 

4424 

4429 




4436 

-4438 

4484 

4498- 

-4501 

4644 

-4645 

4648- 

-4652 




4662 

4715 

4721- 

4723 

4729 

4734 

4854- 

4872 

4936 




5015 

-5017 

5019- 

5028 

5065 

-5072 

5080- 

5081 

5098- 




5100 

5177 

-5179 

5236- 

-5240 

5284 

-5285 

5435 

-5437 




5475 

-5478 

5485- 

5488 

5530 

-5545 

5565- 

5566 

5596 




5616 

-5617 

5639- 

5641 

5714 

5719 

-5720 

5757 

-5760 




5780 

-5782 

5953. 

5999-6000 

6038 

-6042 

6112 

-6113 




6123 

-6127 

6161- 

6164 

6297 

-6298 

6385- 

6388 

6407- 




6411 

6455 

-6459 

6464- 

-6466 

6548 

-6549 

6697 

6703- 




6704 

6718 

-6721 

6742 

6799 

-6804 

6828- 

6832 

6840 




6858 

6943 

6967- 

6969 

6997 

-6998 

7035 

7086 

-7087 




7105 

-7106 

7179- 

7180 

7248 

-7249 

7251- 

7259 

7281- 




7282 

7349 

-7350 

7377- 

-7384 

7457 

-7459 

7838 

7859- 
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7860 

7907 

7955- 

7957 

7982 

-7983 

8018- 

8020 

8031- 




8033 

8046- 

8048 

8077 

8194 

-8196 

8250- 

8251 

8430- 




8432 









adrenal 

Clontech 

ADR002 

3-5 14-19 

26-28 

48 67 10 

5-107 

121 140-14 

2 176 

gland 



201-209 212 221 

-222 

260 266-268 280 

-282 

305 



384-385 460-462 

468 

495 524-529 537 

546- 

548 




656-658 688 735 

739 

814- 

821 828-829 

836- 

838 




918 960-961 1005-1007 1010 1048 1110-1111 




1185- 

-1186 

1206- 

1209 

1244 

-1245 

1261- 

1263 

1297- 




1302 

1341- 

1346 

1439 

-1441 

1445 

1460- 

1462 

1466- 




1470 

1472- 

1477 

1481 

-1484 

1495 

-1497 

1507- 

-1513 




1531- 

-1539 

1552- 

1555 

1557 

-1560 

1562- 

1570 

1610 




1612- 

-1615 

1619- 

1620 

1624 

-1625 

1635- 

1641 

1655- 




1657 

1675- 

1677 

1693 

1726 

-1729 

1759- 

1760 

1765- 




1767 

1774- 

1776 

1813 

-1815 

1817 

-1818 

1839- 

-1840 




1850 

-1852 

1864- 

1874 

1903 

1913 

1916- 

1920 

1930- 




1932 

1937- 

1942 

1950 

-1953 

1958 

-1966 

1980- 

-1984 




1986 

-1991 

2003- 

2004 

2016 

-2017 

2050- 

2052 

2063- 




2066 

2068- 

2070 

2080 

2095 

-2097 

2112 

2122 

2124- 




2126 

2131- 

2136 

2142 

2148 

-2152 

2155 

2158- 

-2162 




2174 

-2175 

2223- 

2224 

2249 

-2250 

2262- 

2264 

2268- 




2275 

2286- 

2288 

2291 

-2294 

2300 

-2301 

2315- 

-2322 




2335 

-2336 

2344- 

2346 

2358 

2368 

2379 

2385- 

-2386 




2408 

-2415 

2433- 

2438 

2453 

2474 

-2475 

2487- 

-2488 


_ 


2495 

-2499 

2509- 

2510 

2512 

2515 

-2516 

2551- 

-2553 




2558 

2571 

2585- 

2586 

2619 

-2622 

2626- 

2629 

2648- 




2650 

2656- 

2660 

2682 

-2683 

2689 

-2691 

2722- 

-2724 




2758 

2778- 

2783 

2788 

-2794 

2796 

2801- 

2804 

2807- 




2813 

2826- 

2834 

2842 

-2848 

2850 

-2851 

2916 

2940- 




2946 

2948- 

2950 

2960 

-2971 

2973 

2978- 

2979 

3014- 




3016 

3036 

3049- 

3053 

3058 

-3063 

3072 

3080- 

-3085 




3095 

-3096 

3117- 

3121 

3138 

-3141 

3149- 

3150 

3217 




3221 

-3226 

3250 

3262 

-3273 

3276 

-3279 

3287- 

-3289 




3311 

-3312 

3315- 

3318 

3346 

-3354 

3371- 

3372 

3384- 




3385 

3410- 

3411 

3473 

-3476 

3508 

-3511 

3528 

-3530 




3551 

-3553 

3571- 

3587 

3597 

-3604 

3617- 

3 619 

3623- 




3628- 

3633 

3638 

3642 

-3643 

3660 

-3663 

3683- 

-3686 




3699 

-3701 

3707- 

3709 

3722 

3733 

-3739 

3779- 

-3780 




3788 

-3789 

3841- 

3849 

3886 

-3887 

3928- 

3936 

3983- 




3989 

3991- 

3998 

4012 

-4013 

4015 

-4018 

4027-4040 




4042 

-4043 

4061- 

4062 

4070 

-4072 

4082-4085 

4127- 
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4172 

4252 

4286 




4294 

-4297 

4331- 

4333 

4348 

4368 

4429- 

4431 

4498- 




4501 

4510- 

4513 

4520 

-4528 

4566 

4605 

4609 

4613 




4691 

4729 

4738- 

4741 

4745 

-4749 

4752 

4780 

-4782 




4785 

4791- 

•4792 

4804 

4845 

-4846 

4889 

4908- 

-4909 


- 


4937 

-4939 

4954- 

4955 

4993 

-4994 

4996-5001 

5005- 




5007 

5022- 

-5028 

5056 

5076 

-5079 

5098- 

5100 

5111- 




5113 

5129- 

•5131 

5150 

5161 

-5163 

5177- 

5179 

5194 




5205 

5209- 

-5212 

5247 

5284 

-5287 

5298 

5312 

5321- 




5322 

5349 

5352- 

5355 

5392 

5396 

-5401 

5405 

-5409 




5452 

-5459 

551^- 

5516 

5526 

-5529 

5553 

5569 

5585 




5588 

5594 

5619- 

5620 

5674 

-5676 

5683 

5693 

5697 




5723 

-5729 

5746- 

5753 

5757 

-5758 

5761- 

5763 

5777 \ 




5798 

-5805 

5835- 

5838 

5856 

-5863 

5872 

5899 

-5903 




5925 

-5930 

5939 

6008 

6028 

-6031 

6051- 

-6057 

6065- 




6068 

6147 

6165- 

6169 

6209 

-6210 

6221- 

6223 

6234 
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6244 

6282 

-6284 

6348 

6391 

6412 

6414- 

-6417 

6464- 




6466 

6501 

-6504 

6557 

-6559 

6600 

-6601 

6618 

6639- 




6642 

6670 

-6676 

6709 

-6710 

6718 

-6721 

6742 

6764- 




6765 

6863 

-6867 

6896 

6920 

-6921 

6925 

6941- 

-6942 




6957 

6970 

-6976 

6990 

-6992 

6997 

-6998 

7029- 

-7033 




7044 

7080 

-7082 

7086 

-7087 

7099 

-7102 

7114- 

-7115 




7162 

-7165 

7217 

7321 

7352 

7366 

-7367 

7375 

7408 




7446 

7509 

-7510 

7539 

-7550 

7584 

7603 

7620 

7662- 




7666 

7747 

7753- 

-7754 

7780 

-7790 

7869 

7907 

7946 




7987 

8031 

-8033 

8044 

8139 

8194 

-8196 

8211 

8265 




8296 

8413 

8463- 

-8464 

8483 

8490 

-8495 



adult heart 

GIBCO 

AHR001 

14-19 40- 

41 47 

85-94 104 

-107 

119 132-137 144- 




145 

164-169 172-175 

180- 

183 195 197-209 

233- 




237 

260 285 288-289 

298- 

299 322 330-340 

360 




373 

400 402 407 416 

-418 

425 432-433 442- 

445 




447- 

448 460-465 467 

-468 

474-475 480-486 

488- 




494 

504-5 

13 515-516 

518- 

523 527-529 539- 

•540 




546- 

554 582 635-639 

650- 

651 661-665 681- 

-683 




685- 

686 688 696 705 

712- 

713 720-721 726- 

•727 




735 

757-760 764 786 

-787 

790-792 814-821 

830 




839- 

840 877-878 883 

-898 

920 923-927 941- 

■942 




949- 

952 958 960-961 

967- 

973 984-985 992- 

•994 




1003 

1005 

-1007 

1022 

-1023 

1032 

1041- 

-1043 

1054- 




1061 

1070 

-1105 

1110 

-1111 

1148 

-1149 

1160- 

•1161 




1185 

-1186 

1194- 

-1196 

1200 

-1209 

1213- 

-1221 

1227- 




1253 

1261 

-1263 

1290 

-1299 

1305 

-1317 

1324- 

1333 




1341 

-1348 

1357- 

1370 

1374 

-1379 

1394- 

-1398 

1402- 




1409 

1413 

-1420 

1425- 

-1431 

1439 

-1441 

1445 

1448- 




1450 

1455 

-1462 

1465- 

-1470 

1472 

-1484 

1488- 

1491 




1505 

-1506 

1511- 

1515 

1525 

-1537 

1542- 

-1551 

1557- 




1567 

1575 

-1577 

1583- 

-1595 

1602 

1610 

1616- 

1617 




1619 

-1621 

1633- 

1635 

1637 

-1641 

1653- 

-1657 

1671- 




1672 

1680 

-1692 

1712- 

-1714 

1718 

1726- 

1728 

1759- 




1760 

1778 

1780- 

1785 

1790 

-1795 

1801- 

1810 

1819- 




1851 

1853 

1857- 

1863 

1870 

-1874 

1876 

1883 

1903 




1906 

-1912 

1916 

1921 

1929 

-1932 

1944- 

1945 

1950- 




1953 

1958-1969 

1980- 

-1984 

1992 

-1994 

2000- 

2008 




2014 

-2019 

2032- 

2035 

2038 

-2044 

2050- 

2052 

2059- 




2060 

2063 

2067- 

2070 

2074 

-2075 

2077- 

2079 

2081 




2087 

-2097 

2108- 

2110 

2112 

2120- 

-2121 

2124- 

2126 




2131 

-2141 

2148- 

2157 

2171 

-2175 

2178- 

2189 

2195 




2221 

-2228 

2231- 

2233 

2237 

-2242 

2255- 

2257 

2262- 




2264 

2266- 

-2271 

2276- 

•2280 

2297- 

-2299 

2302- 

2305 




2307 

-2322 

2328 

2334- 

-2336 

2341- 

-2343 

2347- 

2349 




2353 

-2358 

2362- 

2364 

2370 

-2372 

2375- 

2377 

2385- 




2386 

2398- 

-2405 

2407- 

-2415 

2420- 

-2423 

2428- 

2429 




2433 

-2434 

2442- 

2444 

2453 

2458- 

-2461 

2464- 

2470 




2476 

-2477 

2482- 

2486 

2489 

-2490 

2493- 

2494 

2509- 




2511 

2517- 

-2520 

2526- 

-2528 

2543- 

-2553 

2558- 

2565 




2569 

2573- 

-2580 

2583- 

-2588 

2591- 

-2596 

2616- 

2617 




2630 

-2633 

2636- 

2643 

2648 

-2652 

2658- 

2660 

2665- 




2667 

2672- 

-2678 

2682- 

-2683 

2689- 

-2691 

2704- 

2710 




2713 

-2720 

2722- 

2727 

2731 

-2738 

2748- 

2749 

2758 




2776 

-2777 

-2794 

2796 

2810-2811 

2814- 

2817 

2823- 




2825 

2835- 

-2838 

2842- 

2848 

2850- 

-2851 

2853- 

2855 




2860 

-2863 

2866- 

2867 

2881 

-2888 

2913- 

2 914 

2917 




2948 

-2971 

2978- 

2979 

3014 

-3016 

3036- 

3039 

3043- 
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6244 

6282 

-6284 

6348 

6391 

6412 

6414- 

•6417 

6464- 




6466 

6501 

-6504 

6557 

-6559 

6600 

-6601 

6618 

6639- 




6642 

6670 

-6676 

6709 

-6710 

6718 

-6721 

6742 

6764- 




6765 

6863 

-6867 

6896 

6920 

-6921 

6925 

6941- 

6942 




6957 

6970 

-6976 

6990 

-6992 

6997 

-6998 

7029- 

7033 




7044 

7080 

-7082 

7086 

-7087 

7099 

-7102 

7114- 

7115 




7162 

-7165 

7217 

7321 

7352 

7366 

-7367 

7375 

7408 




7446 

.7509 

-7510 

7539 

-7550 

7584 

7603 

7620 

7662- 




7666 

7747 

7753- 

-7754 

7780 

-7790 

7869 

7907 

7946 




7987 

8031 

-8033 

8044 

8139 

8194 

-8196 

8211 

8265 




8296 

8413 

8463- 

8464 

8483 

8490 

-8495 



adult heart 

GIBCO 

AHR001 

14-19 40- 

41 47 

85-94 104 

-107 

119 

144- 




145 

164-169 172-175 

180- 

183 195 197-209 

233- 




237 

260 2 

85 288-289 

298- 

299 322 330-340 

360 




373 

400 4 

02 407 416 

-418 

425 4 

32-433 442- 

445 




447- 

448 4 

60-465 467 

-468 

474-475 480-486 

488- 




494 

504-5 

13 515-516 

518- 

523 527-529 539- 

540 




546- 

554 5 

82 635-639 

650- 

651 661-665 681- 

683 




685- 

686 688 696 705 

712- 

713 720-721 726-727 




735 

757-760 764 786 

-787 

790-792 814-821 

830 




839- 

840 877-878 883 

-898 

920 923-927 941- 

942 




949- 

952 958 960-961 

967- 

973 984-985 992- 

994 




1003 

1005 

-1007 

1022 

-1023 

1032 

1041- 

1043 

1054- 




1061 

1070 

-1105 

1110 

-1111 

1148 

-1149 

1160- 

1161 




1185 

-1186 

1194- 

1196 

1200 

-1209 

1213- 

1221 

1227- 




1253 

1261 

-1263 

1290 

-1299 

1305 

-1317 

1324- 

1333 




1341 

-1348 

1357- 

1370 

1374 

-1379 

1394- 

1398 

1402- 




1409 
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14 Z b 

-1431 

1439 

-1441 

1 A A C 

14 4b 

1448- 




1450 

1455 

-1462 

1465 

-1470 

1472 

-1484 

1488- 

1491 




1505 

-1506 

1511- 

1515 

1525 

-1537 

1542- 

1551 

1557- 




1567 

1575 

-1577 

1583 

-1595 

1602 

1610 

1616- 
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1619 

-1621 

1633- 

1635 

1637 

-1641 

1653- 

1657 

1671- 




1672 

1680 

-1692 

1712 

-1714 

171B 

1726- 

1728 

1759- 




1760 

1778 

1780- 

1785 

1790 

-1795 

1801- 

1810 

1819- 




1851 

1853 

1857- 

1863 

1870 

-1874 

1876 

1883 

1903 




1906 

-1912 

1916 

1921 

1929 

-1932 

1944- 

1945 

1950- 




1953 

1958 

-1969 

1980 

-1984 

1992 

-1994 
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6557 

-6559 

6607 

6613 

-6615 

6665 

-6668 

6703- 




fi7 fi A 
O / V* 

6718 

-6721 

6766 

-6767 

6799 

-6804 

6859 

6862 




D O / D 

6895 

6909 

6916 

-6917 

6941 

-6942 

6957 

7009- 




/ U XX 

7014 

-7034 

7080 

-7082 

7088 

-7090 

7099 

-7102 




•71 nfi 

/ 1U J 

-7106 

7152 

-7153 

7188 

-7190 

7211 

7213 

7234 




7281 

-7282 

7310 

-7311 

7402 

-7403 

7408 

7464 

7481 




74 84 

-7485 

7509 

-7510 

7561 

7599 

-7600 

7705 

7749- 




7752 

7770 

-7772 1 

7916 

7923 

8018 

-8020 

8031 

-8033 




8102 

-8109 

8158 

8296 

8306 

-8308 

8320 

8420 

-8421 




8459 









lymph node 

Clontech 

ALN001 

14-19 144 

-145 

197-209 280-282 

380- 

382 469-473 



488- 

490 5 

27-52 

9 532 

549- 

551 5 

97-598 641 

-644 




648 

694 714-715 724 

790- 

792 834-835 857 

-859 



) 


:i oro p «+& «+3; . o 52 a & in ;s 


923-927 992-994 1015-1016 1022-1023 1028-1030 
1033-1035 1072-1105 1216-1221 1246-1247 1261- 
1263 1311-1313 1324-1326 1339-1340 1359-1367 
1413-1420 1439-1441 1465-1470 1481-1487 1498- 
1500 1511-1513 1558-1561 1575-1577 1602 1694 
1712-1713 1716 1786-1789 1864-1874 1885-1889 
1896 1916 1921 1930-1932 1944-1945 1962-1963 
1968 1979 2003-2004 2018-2019 2032-2035 2037 
2063-2066 2074-2075 2081 2083-2085 2114-2119 
2186-2189 2272-2275 2387 2407 2430-2432 2453 
2511 2526-2528 2543 2551-2553 2592-2596 2625 
2716-2719 2725-2727 2731 2794 2796 2835 2879 
3043 3066 3071 3095-3096 3133 3137 3149-3150 
3163-3164 3175-3179 3221-3226 3250 3253-3258 
3274-3275 3279-3284 3287-3292 3311-3312 3346- 
3348 3370 3386-3389 3407-3408 3425-3428 3486 
3497 3512-3516 3528-3535 3555-3557 3570 3600- 
3601 3605 3616 3678-3679 3683-3686 3697 3722 
3730 3738-3739 3761-3764 3850-3854 3904 3926- 
3927 3931-3936 3957 3966-3968 3986-3987 3991- 
3998 4042 4061-4062 4074-4079 4166-4167 4203- 
4207 4253 4344-4347 4349-4350 4369 4422-4424 
4445-4447 4498-4501 4514-4518 4538 4609-4610 
4644-4645 4648-4652 4704 4731 4783 4851-4853 
4873-4876 5032-5033 5036-5038 5043-5049 5168- 
5173 5177-5179 5232 5343 5381 5479-5482 5515- 
5516 5530-5545 5570-5572 5580-5581 5631 5744- 
5745 5775 5853-5855 5891 5942-5943 5945-5948 
5971-5983 6076 6112-6113 6137-6139 6155-6156 
6196 6312-6315 6353-6355 6382 6391 6456-6459 
6463-6466 6476-6479 6634-6638 6665-6668 6743- 
6744 6766-6767 6782-6784 6799-6804 6844 6872- 
6874 6911-6915 7014-7027 7035-7037 7281-1282 
7337 7340-7342 7396 7486 7488-7489 7508 7613- 
7614 7672-7675 7685 7753-7754 7780-7790 7924- 
7925 7936 7976-7978 8012 8031-8033 8102-8109 
8153 8227-8229 8321 8423-8425 8441 8459 


adult liver 


GIBCO 


ALV001 


6-7 73 75-76 83 
217 241-243 252 
419-424 426-428 
546-555 586-587 
767 831-833 836 
1011-1012 1022- 
1209 1249-1253 
1303-1305 1311 
1353 1359-1368 
1472-1480 1498- 
1537 1542-1551 
1585 1592-1599 
1672 1712-1713 
1815 1824-1825. 
1870-1874 1BB& 
1986-1991 2016- 
2074-2075 2086 
2171-2173 2182- 
2237-2239 2258 
2328 2332-2333 


85 140-142 144-145 
275-276 283-284 351 
469-473 498 518-523 
669 705-706 736-737 
-838 941-942 949-951 
1023 1106-1109 1173 
1261-1263 1273-1279 
1313 1327-1333 1341- 
1399-1401 1453-1459 
1500 1507-1513 1516- 
1557 1568-1570 1572 
1603-1608 1619-1620 
1728 1784-1785 1790 
1832-1834 1839-1840 
1912 1916 1950-1953 
2017 2050-2052 2059- 
2112 2124-2126 2137 
2183 2195 2223-2224 
2264 2300-2301 2307 
2375-2377 2382-2386 


164-169 
-355 402 
527-529 
759-760 
992-994 
1174 1200- 
1297-1299 
1347 1349- 
1466-1470 
1517 1531- 
1574 1583- 
1659 1671- 
1805 1813- 
1857-1860 
1958-1963 
2O60 2063 
2X58-2162 
2226-2227 
2322 2324- 
2388 2407 


/t>7<*| 


13 H 6 02 


2417 
2477 
2636 
2776 
2823 
2931 
3110 
3218 
3279 
3389 
3507 
3610 
3669 
3748 
4056 
4387 
4502 
4764 
5022 
5216 
5343 
5431 
5581 
5687 
5966 
6056 
6243 
6513 
6718 
6862 
7012 
7184 
7343 
7551 
7772 
7937 
8143 
8363 


2419 
2538 
2651 

•2777 

-2825 

■2934 
3134 

-3219 
3287 
3412 

-3511 
3623 

-3674 

•3752 
4070 
4422 

-4503 
4768 

-5028 

-5220 
5356 

-5434 
5616 

-5690 
5984 
6058 
6291 
6584 

-6721 
6872 

-7013 
7205 

-7344 

-7554 
7775 
7979 
8187 
8426 


2433- 
-2542 
-2 652 
2796 
2839- 
2978- 
-3136 
3221 
-3289 
-3413 
3522 
-3628 
3680 
3791 
-4072 
-4424 
4514 
-4769 
5034 
5284 
-5358 
5475 
-5617 
5825 
-5985 
-6060 
-6292 
6588 
6742 
-6874 
7038 
-7206 
7375 
7626 
-7778 
-7983 
-8188 
-8427 


2434 

2549- 

2658- 

2797 

2848 

2979 

3138- 

3228 

3305 

3454 

3523 

3633 

■3682 
3904 
4120- 
4426 

-4518 
4830- 

-5035 

-5285 
5364 

-5478 
5621- 

-5828 
5998 
6065- 
6396 

-6595 
6766- 
6897- 

-7040 
7231- 
7397 

-7629 
7780 
8031 
8259 
8430- 


2442- 
2550 
2660 
2810- 
2853- 
3036 
3141 
3250 
3346- 
3481- 
3560 
3637 
3702- 
3926 
4121 
4429 
4610 
4835 
5076- 
5297 
5381 
5547- 
5622 
5856- 
6003- 
6073 
6455- 
6634- 
6767 
6898 
7056- 
-7233 
7485 
7680- 
-7790 
8033 
8260 
8432 


2451 
2559- 
2669- 
2811 
2855 
3045- 
3152- 
3259- 
3348 
3485 
3570 
3644- 
3703 
4022- 
4128- 
4436- 
4613 
4873- 
5079 
5302- 
5393- 
5552 
5639- 
5869 
6005 
6112- 
6459 
6638 
6799- 
6911- 
7059 
7255- 
7490- 
•7683 
7859- 
8102 
8277 
8469 


2453 
2565 
2671 
2814- 
2879 
3 057 
3155 
3260 
3359 
3497 
3.588 
3645 
3728 
4026 
4129 
4438 
4657 
4877 
5186 
5305 
5395 
5554 
5641 
5932 
6025 
6113 
6464 
6662 
6804 
6915 
7114 
7259 
7495 
7706 
•7860 
•8113 
8279 
•8479 


2468 
2585- 
2731 
2815 
2881- 
3071 
3163- 
3274- 

•3360 
3500- 

•3590 
3660- 
3738- 
4042 
4289 
4484 
4695- 
4996- 

-5189 
5318 
5406- 

-5558 
5677 
5949 

-6026 
6117- 

-6466 
6665- 
6845- 
6987- 

-7115 
7274- 
7539- 

-7707 
7929- 
8139 
8324 


2476- 
2588 
2738 
2817 
2883 
3108- 
3164 
3275 
3386- 
3503 
3602- 
3663 
3739 
4055- 
4386- 
4486 
4697 
4998 
5205 
5323 
5408 
5573- 
5683 
5964- 
6051- 
6120 
6512- 
6668 
6848 
6989 
7179- 
•7277 
7549 
7770- 
•7930 
8142- 
8361- 


adult liver 


Invitrogen 


ALV002 


8-9 67 144-145 250-251 303 371-372 380-382 
403-406 419-424 451 456-458 469-473 527-529 
597-598 603-606 656-658 705 726-727 773 779- 
780 905-912 1061 1066 1173-1174 1179-1180 
1190-1191 1210 1244-1245 1249-1253 1261-1263 
1273-1279 1287-1289 1303-1304 1311-1313 1349- 
1353 1391 1442-1444 1448-1450 1460-1462 1498- 
1500 1516-1517 1542-1548 1568-1570 1572-1574 
1583-1591 1596-1599 1603-1608 1626-1628 1630- 
1632 1658-1659 1673-1674 1678 1702-1710 1718 
1728 1765-1767 1780 1784-1785 1791-1795 1813- 
1815 1852 1854-1856 1896-1900 1912 1937-1942 
1958-1961 1968. 2001-2002 2038-2044 2064-2067 
2074-2075 2095-2097 2112 2124-2126 2137 2148- 
2152 2158-2162 2192-2194 2199-2200 2208 2229- 
2230 2237-2239 2244-2250 2262-2264 2276-2278 
2300-2301 2315-2322 2334 2337-2339 2369 2381 
2417-2419 2436-2438 2454 2468 2482-2486 2513 
2538-2542 2549-2550 2570 2574-2580 2583-2584 


,:tooe«+6 | Mi- 


liS Hi 


i a 


2587- 

2641 

2698- 

2807- 

3066 

3108- 

3217- 

3389 

3605 

3683- 

3802- 

3989 

4106 

4438 

4606 

4958 

5079 

5250- 

5459 

5626 

5723- 

5864- 

6065 

6312 

6464 

6721 

6915 

7098 

7205 

7377 

7494 

7723 

7983 

8398 

8490 


2588 
2651 
2703 
2809 
3068 
3110 
3219 
3438 
3612 
3686 
3803 
4012 
4271 
4484 
4745 
5008 
5162 
5254 
5465 
5639 
5729 
5868 
6068 
6315 
6472 
6805 
6925 
7102 
7206 
7384 
7515 
7727 
8046 
8421 
8495 


2592- 
-2652 
2739- 
2817 
3070 
3122- 
3227- 
■3449 
-3615 
3702- 
3858 
-4013 
4296- 
4486 
-4749 
-5012 
-5163 
5343 
-5466 
-5641 
5746- 
5988- 
6117 
6346 
6613- 
-6807 
6937 
7108 
7231 
7418 
7599 
7770 
-8048 
8427 


2596 

2658- 

2742 

2823- 

3073- 

3132 

3228 

3454 

3639- 

3703 

3868 

4022- 

4322 

4505- 

4760- 

5014 

5184 

5389 

5517- 

5660 

5747 

5992 

6118 

6348 

6615 

6858 

6958- 

7110 

7233 

7428- 

7600 

7772 

8187- 

8430- 


2603- 

2660 

2744- 

2825 

3074 

3137 

3250 

3487- 

3640 

3730 

3962- 

4026 

4363- 

4507 

4762 

5022- 

5209- 

5396- 

5518 

5697 

5776 

6003- 

6151- 

6361- 

6627- 

6897- 

6959 

7114- 

7255- 

7429 

7662- 

7799- 

8188 

8435 


2609 
2682 
2747 
2884 
3080 
3152 
3274 
3488 
3644 
3753 
3963 
4043 
4364 
4512 
4764 
5028 
5212 
5401 
5565 
5705 
5798 
6005 
6154 
6362 
6628 
6898 
7001 
7115 
7259 
7451 
7666 
7800 
8227 
8446 


2626 
•2683 
2778 
•2893 
-3085 
-3156 
•3275 
3551 
-3652 
-3759 
3983 
4092 
4422 
-4513 
4890 
5034 
5228 
5435 
-5566 
-5709 
-5805 
6044 
6256 
6399 
6695 
6907 
-7002 
7159 
7274 
-7454 
7680 
7955 
-8229 
-8447 


2629 
2689- 

•2783 
3040- 
3102- 
3165- 
3305 

-3553 
3660- 
3794- 

-3985 

•4095 

•4424 
4556- 
4928 

•5035 
5241- 

-5437 
5598 
5719- 
5829- 

-6045 
6303- 
6407- 

-6696 

-6908 
7086 

-7160 

-7275 
7465 

-7683. 

-7957 
8391 
8469 


2636- 

2691 

2796 

3043 

3105 

3172 

3386- 

3560 

3663 

3795 

3988- 

4102- 

4436- 

4558 

4957- 

5076- 

5243 

5452- 

5624- 

5720 

5834 

6062 

63 04 

6411 

6718- 

6911- 

7087 

7191 

7375 

7468 

7703 

7979- 

8397- 

8479 


adult liver 


Clontech 


ALV003 


105 786-787 1277-1279 1596-1599 1602-1606 
1813-1815 2884-2888 3065 3218-3219 3566-3567 
4120-4121 4395-4396 4999-5001 5034-5035 6084 
8430-8432 8468 


ovary 


Invitrogen 


AOV001 


6-13 20-21 38 40-43 
113 139-145 160 164- 
209 213-215 220-222 
266-271 273-278 280- 
304-305 307-320 329- 
376 384-385 397-398 
432-433 442-445 449 
477 488-495 514-516 
562 576-579 597-598 
656-665 688-690 712- 
748-750 753 767-770 
827 831-833 836-840 
881-882 887-891 920 
952 960-961 967-969 
1002 1005-1007 1017- 
1037 1055-1059 1061 
1152 1160-1161 1163- 
1187-1188 1194-1196 


48 51-54 68 91 
169 180-183 193 
225 230 250-251 
282 288-289 298 
340 345-349 356 
400 408-409 416 
454-455 466 469 
527-535 537 543 
603-606 627-628 
717 719 728 733 
772 779-781 805 
842 857-859 874 
923-927 930 941 
974 982-983 992 
1018 1021-1023 
1072-1106 1110 
1165 1177-1178 
1206-1209 1213 


92 104-107 
-195 201- 

253-259 
-299 301 
-370 373 
-418 425 
-473 476- 
-554 556- 

650-651 
-735 739 

809 814- 

877-879 
-944 949- 

994 1000- 
1O32-1035 
1X21 1151- 
1182-1183 
1221 1227- 


:L O O ifjl M-6 4" 25 ,„ O HI E? 6 O £?! 


1253 

1261 

-1263 

1287 

-1293 

1316 

-1317 

1324- 

1335 

1341 

1370 

1374 

-1391 

1394 

-1398 

1446 

-1450 

1453- 

1459 

1465 

1502 

1505 

-1506 

1511 

-1513 

1557 

-1570 

1572- 

1577 

1579 

1602 

1610 

1612 - 

1617 

1619 

1634 

1636 

-1642 

1655 

-1669 

1692 

1695 

-1696 

1698 

-1700 

1718 

-1721 

1726- 

1728 

1759 

1773 

1777 

-1800 

1806 

-1810 

1820 

-1821 

1823- 

1825 

1839 

1874 

1876 

-1877 

1884 

1890 

1913 

1916 

1921- 

1927 

1929 

1953 

1958 

-1966 

1968 

1970 

1984 

1986 

-1994 

1998 

-1999 

2036 

-2044 

2046- 

2058 

2063 

2110 

2112 

2114 - 

2126 

2129 

2162 

2165 

-2173 

2182 

-2184 

2223 

-2224 

2226- 

2228 

2237 

2250 

2255 

-2257 

2266 

-2280 

2294 

2307 

-2322 

2324 

-2327 

2365 

-2366 

2369- 

2372 

2374 

2404 

-2423 

2428- 

2434 

2436 

2455 

-2456 

2458- 

2461 

2464 

2478 

2481 

-2490 

2493 

-2494 

2513 

2515 

-2531 

2543 

2551 

2567 

-2568 

2574- 

2580 

2585 

2601 

2603 

-2611 

2616 

-2617 

2636 

-2641 

2648- 

2650 

2653 

2667 

2671 

-2678 

2682 

-2683 

2716 

-2720 

2722- 

2727 

2731 

2772 

-2775 

2778- 

2783 

2794 

2809 

2812 

-2815 

2817 

-2819 

2884 

-2903 

2913- 

2914 

2916 

2962 

2972 

-2979 

3014 

-3016 

3043 

3049 

-3063 

3065 

-3066 

3080 

-3087 

3095- 

3096 

3102 

3141 

3149 

-3150 

3152 

-3155 

3165 

-3172 

3175- 

3179 

3217 

3258 

3261 

-3275 

3279 

-3284 

3306 

-3332 

3346- 

3356 

3358 

3386 

-3389 

3399- 

.3401 

3407 

3430 

3435 

-3449 

3454 

-3455 

3491 

-3499 

3504- 

3523 

3525 

3567 

3570 

3588- 

3590 

3595 

3643 

3646 

-3652 

3660 

-3663 

3683 

-3686 

3693- 

3694 

3696 

3713 

3717 

-3726 

3728 

-3730 

3761 

-3764 

3769- 

3775 

3779 

3798 

-3801 

3841 r 

3854 

3859 

3891 

3894 

-3921' 

3928 

-3949 

4009 

-4013 

4015- 

4018 

4022 

4043 

4045 

-4047 

4052 

4055 

4089 

4092 

-4095 

4102 

-4109 

4138 

4142 

4153 

4163 

4165 

4180 

-4183 

4191- 

4193 

4202 


1297- 
•1348 

1402- 

1470 

1525- 
-1580 
•1620 

1671- 

1702- 
-1760 

1813- 
-1840 
-1893 

1936- 
-1971 

2001- 
-2082 
-2142 

2186- 
-2239 

2282- 

2332- 
-2378 
-2438 
-2467 

2500- 
►2553 
-2586 

2619- 
-2654 

2689- 
-2742 

2796 

2823- 
-2946 

3018- 

3068- 
-3105 

3157- 

3220 

3287- 

3361- 
-3408 

3464- 
-3530 
-3610 

3669- 
-3697 

3733- 
-3780 
-3860 

3956- 
-4026 
-4064 

4128- 
-4167 
-4207 


1308 
1354 
1420 
1472 
1537 
1583 
1622 
1675 
1710 
1762 
1815 
1843 
1896 
1942 
1974 
2008 
2087 
2148 
2194 
2241 
2283 
2349 
2382 
2442 
2469 
2501 
2558 
2589 
2625 
2656 
2696 
2744 
2798 
2835 
2948 
3028 
3071 
3108 
3158 
3227 
3292 
3366 
3410 
3478 
3548 
3612 
3672 
3699 
3741 
3788 
3866 
3970 
4035 
4067 
4129 
4169 
4209 


1311 

•1355 
1425 

-1489 
1540 

-1595 
1628 
1678 
1714 

-1763 
1817 

•1860 

•1900 
1946 

•1978 
2014 

-2101 

•2152 
2196 

-2242 
2285 
2353 

•2386 

•2451 
2474 
2509 
2561 

-2590 
2630 
2660 
2704 

-2749 

-2804 
2839 
2950 
3034 
3075 

•3111 
3161 

-3248 
3296 
3368 

•3421 
3481 
3553 

•3640 
3675 

-3705 
3743 

-3789 

-3879 
3983 
4O40 
4072 
4X31 
4X74 
4211 


-1313 
1359- 

-1431 
1495- 

-1555 
1600- 
1630- 
1680- 

-1716 
1765- 

-1818 
1864- 
1907- 
1950- 
1980- 

-2019 
2103- 
2155- 

-2203 
2244- 
2291- 

-2357 
2388 
2453 

-2475 
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7770- 

7772 

7780- 




7790 

7814- 

7815 

7817- 

-7821 

7847* 

-7849 

7856- 

■7860 




7867- 

-7869 

7907 

7921- 

-7922 

7941- 

-7944 

8013- 

8020 




8027- 

-8028 

8031- 

-8033 

8043 

8064 

8089 

8096 

8102- 




8109 

8117- 

8118 

8121- 

-8123 

8125 

-8129 

8163- 

8166 




8176 

-8181 

8194- 

-8197 

8216 

-8220 

8227- 

8229 

8234- 




8235 

8267- 

8269 

8277- 

-8279 

8335 

8337 

8342- 

•8352 




8370 

8397- 

8398 

8421 

8430 

-8435 

8445- 

8447 

8449 




8459-8461 

8490- 

-8495 






bone marrow 

ciontecxi 


14-19 44-45 51 

84 94 132 

-137 144-145 162-163 




172-175 233-237 261 

275-276 283-284 

377- 

-378 




384-385 495 499-500 

586-587 650-651 755 

790- 




792 834-835 855-856 

973 

982-9 

33 102 

2-1023 




1110 

-1111 

1172 

1200- 

-1209 

1213 

-1221 

1227- 

-1243 




12 87 

-1289 

1297* 

-1299 

1303 

-1304 

1316- 

1317 

1341- 




1347 

1357- 

•1358 

1369 

-1370 

1374 

-1377 

1415- 

-1420 




1446 

-1447 

1465 

1478 

-1480 

1505-1506 

1542- 

-1548 




1557 

-1560 

1562 

-1567 

1575 

-1577 

1602 

1607- 

-1608 




1671 

-1672 

1694 

1702 

-1711 

1781 

-1783 

1827- 

-1830 




1839 

-1840 

1885 

-1889 

1891 

-1896 

1903 

1907- 

-1911 




1916 

1921 

1934 

-1935 

1944 

-1945 

1985- 

1991 

2003- 




2004 

2045 

2063 

2071 

-2073 

2103 

-2107 

2128 

2137 




2178 

-2181 

2223 

-2224 

2237 

-2239 

2262- 

2264 

2297- 




2299 

2302- 

-2305 

2315 

-2322 

2337 

-2339 

2350-2352 




2382 

-2384 

2392 

-2397 

2407 

2430 

-2432 

2493 

-2494 




2502 

-2508 

2511 

2551 

-2553 

2558 

2574- 

2580 

2587- 




2588 

2592- 

-2596 

2636 

2656 

-2660 

2671 

2731 

-2738 




2744 

-2747 

2817 

2835 

2853 

-2855 

2857- 

2863 

2867 




2881 

-2883 

2931 

-2934 

2953 

-2959 

2 972 

2974 

-2977 




3014 

-3016 

3037 

-3039 

3064 

-3066 

3071 

3088 

-3091 




3115 

-3116 

3122 

-3132 

3134 

-3136 

3138- 

•3141 

3163- 




3172 

3181 

3218 

-3219 

3221 

-3226 

3253- 

3258 

3274- 




3275 

3280- 

-3284 

3290 

-3292 

3306 

-3307 

3315 

-3318 




3334 

-3344 

3346 

-3348 

3370 

3375 

-3380 

3386 

-3389 




3410 

3412- 

-3413 

3419 

-3421 

3477 

-3478 

3481 

-3485 




3517 

-3520 

3540 

-3543 

3548 

-3550 

3605 

3623 

-3628 




3678 

-3679 

3699 

-3701 

3704 

3738 

-3739 

3748 

-3752 




3760 

3798- 

-3801 

3850 

-3854 

3861 

-3865 

3931 

-3936 




3950 

-3955 

3957 

3986 

-3989 

3991 

-3998 

4000 

-4008 




4015 

-4018 

4035 

-4040 

4061 

-4062 

4092- 

-4095 

4128- . 




4129 

4157- 

-4162 

4170 

-4171 

4205 

-4207 

4296 

-4297 




4348 

4377 

4659 

-4660 

4662 

4674 

4830- 

-4835 

4851- 




4853 

4873 

-4876 

4889 

4924 

4937 

-4939 

4984 

-4985 




5080 

-5081 

5114 

5129 

-5130 

5139 

-5140 

5201 

5232 




5236 

-5240 

5244 

-5245 

5250 

-5254 

5302- 

-5305 

5416 




5452 

-5459 

5485 

-5488 

5504 

-5505 

5588 

56 94 

-5695 


1 our 


'•-\ 
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C *7 o o 

C *7 *5 Q 

57217 

D / *± O - 

K7A1 

5757-5758 

5764 

5770 

5807- 




CQftQ 

CQ1 O. 

5821 

5825- 

5828 

5853 - 

5855 

5882- 

5885 




3 yj / 


5964- 

5966 

5971- 

5980 

6174-6176 

6236- 




CO A 1 
DZ J. 

GO A A 

6353- 

6355 

o45© - 


6464- 

-6466 

6476- 





6552 - 

6554 

6603- 

DOUD 

DO 

6642 

6652 

6665- 




6668 

6718- 

6721 

6897- 

6898 

6938 

6941- 

•6942 

6957- 




6959 

6964- 

6969 

6987- 

6989 

7008- 

•7011 

7060- 

7062 




7084- 

7085 

7274- 

7275 

7294 

7412- 

-7414 

7422 

7446 




7599- 

7600 

7609-7611 

7613- 

•7616 

7689 

7693- 

7696 




7714 

7780- 

7790 

7799- 

•7801 

7919 

7982- 

-7983 

8031- 




8033 

8046- 

8048 

8089 

8092- 

-8094 

8102- 

-8113 

8117 




8126- 

-8127 

8161 

8186 

8293 

8298- 

-8299 

8397- 

8398 




843 0- 

8432 

8442 







bone marrow 


RMD004 

499-500 855-856 1003 1005-1007 1160-1161 



1671- 

•1672 

1694 

1916 

1962- 

-1963 

3065 

3270- 

•3273 




3313- 

-3314 

3376- 

-3379 

3904 

5107- 

-5109 

5764 

6613- 




6615 

7615- 

-7616 

8031- 

-8033 

8430- 

-8432 



bone marrow 

t IT 

Clontecn 


36-37 499- 

500 676-680 786-787 

982-983 1022- 




1023 

1337- 

•1338 

1424 

1437- 

-1438 

1511- 

-1513 

1896 




2492 

3037- 

-3039 

3065 

3227 

-3228 

3346 

-3348 

3595- 




3596 

3605 

3760 

5999- 

-6000 

7615 

-7616 

8102- 

-8109 




8117 









colon 

. 

Invitrogen 


8-9 106-107 233-237 

266-268 403-406 645 

941- 



942 1290-1293 1300-1304 1374-1377 1439-1441 




1446 

-1447 

1616 

-1617 

1765 

-1767 

1784 

-1785 

1813- 




1815 

1912 

1921 

1930- 

-1932 

1958 

-1961 

1970 

-1971 




1985 

2059- 

-2060 

2124- 

-2126 

2158 

-2162 

2237 

-2239 




2353 

-2357 

2476- 

-2477 

2480 

2515 

-2516 

2574 

-2580 




2587 

-2588 

2598-2601 

2603 

-2609 

2669 

-2670 

2698- 




2703 

2817 

2842 

-2848 

2868 

-2873 

2915 

2978 

-2979 




3014 

-3016 

3033 

3043 

3095 

-3096 

3313 

-3314 

3349- 




3352 

3386 

-3389 

3623 

-3628 

3639 

-3640 

3693 

-3694 




3728 

3730 

3743 

-3747 

3841 

-3849 

3904 

3988 

-3989 




4128 

-4129 

4155 

4223 

-4224 

4252 

4662 

4672 

-4673 




4780 

-4782 

4910 

-4916 

5157 

5190 

5250 

-5254 

5280- 




5282 

5343 

5435 

-5437 

5452 

-5459 

5475 

-5478 

5517- 




5518 

5614 

-5617 

5621 

-5622 

5624 

-5626 

5660 

5686- 




5690 

5714 

5851 

-5852 

6013 

6065 

-6068 

6077 

6142 




6236 

-6241 

6353 

-6355 

6443 

-6444 

6501 

-6504 

6560- 




6562 

6586 

-6587 

6703 

-6704 

6734 

-6736 

6799 

-6804 




6925 

6932 

-6934 

6941 

-6942 

6990 

-6992 

7036 

7079- 




7082 

7099 

-7102 

7188 

-7190 

7255 

-7259 

7303 

-7304 




7398 

7402 

-7403 

.7433 

7662 

-7666 

7676 

-7677 

7770- 




7772 

8102 

-8109 

8227 

-8229 

8413 




Mixture of 

Various 

CTL016 

144- 

145 193-194 641 

-644 

1275- 

1276 

13 94- 

1395 

16 tissues- 

Vendors* 


1490 

-1491 

1511 

-1513 

1610 

1671 

-1672 

1986 

-1991 

mRNAs* 



2074 

-2075 

2095 

-2097 

2850 

-2851 

3217 

-3219 

3486 | 


* the 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontecn), 9) human bone marrow mRNA (Clontecn), 10) 
human leukemia lymphablastic mRNA (Clontecn), 11) human thymus mRNA (Clontecn), 12) human lymph node 
mRNA (Clontech), 13) human spinal cord mRNA (Clontecn), 14) human thyroid mRNA (Clontecn), 15) human 
esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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3986-3987 5425-5426 
7468 8031-8033 8430 


5554-5558 5829-5834 7465- 
-8432 


Mixture of 
16 tissues- 
mRNAs* 


Various 
Vendors * 


CTL021 


447 650-651 730-731 1003 1022-1023 1272 1349- 
1353 1402-1407 1439-1441 1446-1447 1495-1497 

1511-1513 1784-1785 1796-1797 1896 2262-2264 

2362-2364 2534-2535 2953-2959 3065-3066 3115- 

3116 3138-3141 3217 3302-3303 3456-3463 4120- 

4121 4180-4181 4484 5177-5179 5764 5829-5834 

5971-5980 6414-6417 6877 7014-7027 7159-7160 

7609-7611 7630-7633 8031-8033 8102-8109 8430- 

8432 8460-8461 


Mixture of 
11 tissues- 
mRNAs** 


Various 
Vendors** 


CTL02 8 


2814-2815 4840 4950-4953 8430-8432 


cervix 


BioChain 


CVX001 


6-7 14-19 40-41 
111 131 143 150 
253-254 266-268 
384-385 397-398 
498 527-529 534 
650-651 656-658 
803-804 814-821 
945-947 952 974 
1023 1033-1037 
1161 1163-1165 
1216-1221 1227- 
1293 1297-1302 
1355 1369-1370 
1448-1450 1453- 
1487 1501-1502 
1557 1561-1567 
1610 1612-1615 
1659 1671-1674 
1718-1721 1726- 
1778 1798-1805 
1844-1848 1850- 
1885-1889 1903 
1936-1942 1968 
2003-2004 2014 
2066 2071-2075 
2136 2156-2157 
2201-2207 2226- 
2266-2267 2276 
2299 2302-2305 
2341-2349 2353- 
2389-2390 2404- 
2461 2469 2487- 
2522-2525 2543 
2619-2622 2636- 
2704-2711 2716- 


47 52-54 74 81-82 88 105-107 
151 197-209 213-215 250-251 
277-278 280-282 301 351-352 
402 416-418 454-458 475 495 
535 539 546-548 588-591 645 
699-702 706 712-713 739 774 
857-859 875-878 887-891 918 
992-994 1005-1007 1016 1022- 
1050 1072-1105 1110-1111 1160- 
1182-1183 1194-1196 1206-1209 
1243 1246-1247 1249-1253 1287- 
1311-1315 1334-1335 1347 1354- 
1374-1379 1408-1420 1439-1441 
1454 1466-1470 1478-1480 1485- 
1514-1515 1529-1539 1542-1551 
1572-1574 1586-1591 1600-1602 
1619-1620 1622-1623 1653-1657 
1676-1677 1694 1712-1714 1716 
1727 1730 1759-1760 1762-1763 
1809-1810 1820-1821 1824-1825 
1851 1854-1856 1864-1874 1883 
1912-1913 1916 1921 1930-1932 
1970-1971 1980-1984 1998-1999 
2015 2018-2019 2046-2052 2062- 
2079-2080 2112 2114-2123 2131- 
2165-2173 2182-2183 2186-2189 
2227 2243 2249-2250 2255-2257 
2278 2282-2283 2291-2294 2297- 
2315-2322 2328-2331 2335-2339 
2357 2368 2375-2377 2381-2386 
2406 2416-2423 2428-2435 2458- 
2488 2495-2499 2511 2514-2516 
2567-2569 2585-2588 2603-2609 
2641 2651-2652 2658-2663 2671 
2724 2731-2738 2750-2755 2772- 


** The 11 tissue-mRNAs and their vendor source, are as follows: 1) adult trachea mRNA (Clontech), 2) adult 
pituitary gland mRNA (Clontech), 3) adult heart mRNA (Invitrogen), 4) fetal heart mRNA (Invitrogen), 5) adult 
placenta mRNA (Invitrogen), 6) adult brain mRNA (Invitrogen), 7) adult small intestine mRNA (Invitrogen), 8) fetal 
lung mRNA (Invitrogen), 9) adult lung mRNA (Invitrogen), 10) adult uterus mRNA (Invitrogen), 11) fetal brain 
mRNA (Invitrogen). 
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2775 

2817 

2889- 

3018- 

3080- 

3141 

3258 

3334- 

3419- 

3488 

3530 

3605 

3653- 

3710- 

3767- 

3867 

3960 

4054 

4163 

4208 

4297 

4367 

4429 

4496 

4558 

4645 

4692 

4765- 

4842 

4891 

4934^ 

5012 

5059- 

5146 

5192 

5268- 

5335 

5406- 

5471 

5526- 

5580- 

5598 

5685 

5709 

5777 

5881 

5996 

6047 

6119- 

6234 

6345 

6414- 

6459 

6601 

6668 

6706- 

6811 


2778- 

2823- 

2893 

3025 

3091 

3149- 

3270- 

3344 

3421 

3491- 

3540- 

3612- 

3655 

3713 

3770 

3883- 

3964- 

4061- 

4165- 

4225- 

4330- 

4369 

4436- 

4504 

4562 

4653 

4710 

4767 

4848- 

4899 

4936 

5015- 

5060 

5148 

5209- 

5273 

5349 

5408 

5479- 

5545 

5581 

5601 

5687- 

5713 

5797 

5920- 

5998 

6051 

6120 

6236- 

6349- 

6417 

6464- 

6613 

6670 

6708 

6813 


2783 
2838 
2915 
3036 
3095 
3150 
3273 
3349 
3424 
3493 
3543 
3615 
3660 
3717 
3788 
3884 
3968 
4062 
4167 
4227 
4333 
4377 
4438 
4510 
4606 
4658 
4719 
4771 
4850 
4901 
4940 
5017 
5076 
5153 
5213 
5286 
5352 
5416 
5482 
5547 
5583 
5630 
5690 
5723 
5809 
5923 
6003 
6056 
6122 
6241 
6350 
6427 
6466 
6615 
6676 
6718 
6817 


2788- 
2842- 
2948- 
-3043 
•3105 
3161- 
3276- 
•3356 
3438- 
3497 
3551- 
3629- 
-3663 
-3722 
-3789 
3888- 
4012- 
4067- 
4173- 
4263 
4336- 
4386- 
4441- 
-4511 
4609 
-4660 
4721- 
-4772 
4873- 
-4903 
-4941 
5019- 
-5079 
-5154 
5216- 
-5287 
-5355 
5435- 
5489- 
-5552 
-5584 
5658- 
5692 
-5729 
-5817 
5932 
-6005 
6058- 
-6127 
6267 
6369- 
-6431 
6512- 
6627- 
6688 
-6721 
-6821 


2794 
2848 
2950 
3054- 
3108 
3173 
3284 
3373 
3449 
3500 
3560 
3630 
3669 
3724 
3798 
3891 
4018 
4073 
4175 
4266 
4337 
4387 
4443 
4529 
4614 
4662 
4723 
4779 
4876 
4905 
4949 
5033 
5084 
5157 
5220 
5302 
5365 
5440 
5493 
5554 
5588 
5660 
5694 
5746 
5823 
5940 
6013 
6060 
6131 
6272 
6370 
6435 
6513 
6628 
•6689 
6776 
6828 


2796 
2853- 
2953- 
•3057 
•3110 
3175- 
3287- 
■3379 
3454 
•3503 
3566- 
3633 
-3679 
-3728 
-3801 
3896 
4027- 
4082- 
4182- 
4269- 
4343 
4391 
4445- 
•4537 
4625 
4671- 
4729 
4790 
4882 
4920- 
4954 
5039- 
-5097 
5159- 
5244- 
-5305 
5377 
5452- 
5515- 
-5558 
5590- 
5664- 
-5695 
-5747 
-5824 
-5941 
6028- 
6065- 
6165- 
6297- 
6395 
-6438 
6515 
6639- 
6698- 
-6779 
-6832 


2798- 

2855 

2959 

3066 

3122 

3179 

3289 

3399 

3475- 

3512- 

3567 

3638 

3687 

3730 

3850 

3927 

4044 

4088 

4183 

4270 

4353 

4418 

4452 

4550 

4636 

4673 

4734 

4804 

4886 

4921 

4957 

5042 

5134 

5160 

5245 

5323 

5390 

5459 

5516 

5561 

5592 

5667 

5698 

5757 

5853 

5988 

6031 

6068 

6169 

6298 

6397 

6443 

6586 

6642 

6700 

6784 

6840 


2800 
2875- 
2974 
3075 
3133 
3217 
3324- 
3410- 
3476 
•3516 
3588- 
3644- 
3692 
3733- 
•3854 
•3936 
4048- 
4107- 
4203- 
4286 
4360 
4420- 
4484- 
4553 
•4639 
4688- 
4738- 
4828 
-4888 
4925- 
-4958 
5050- 
5138- 
5184 
5250- 
-5325 
5392- 
5466 
5521- 
5567- 
5594 
5677- 
-5701 
-5758 
-5855 
-5992 
6038- 
6087 
6174 
6301 
6399 
-6444 
-6587 
6661 
6703 
6799 
6845 


2810- 
2878 
2979 
3078 
3137- 
3253- 
3332 
3415 
3479- 
3528- 
-3590 
-3645 
3705 
-3739 
3866- 
3958- 
-4051 
-4109 
-4204 
4294- 
4366- 
-4424 
-4485 
4556- 
4644- 
-4690 
-4741 
4841- 
4890- 
-4926 

5005- 
-5052 
-5140 

5191- 
-5254 

5334- 
-5401 

5469- 
-5522 
-5568 

5596- 
-5683 
5705- 
5776- 
5875- 
5995- 
-6043 
-6090 
-6176 
6342- 
6402 
6456- 
6600- 
6663- 
6704 
6809 
6849 
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6861-6862 

6872- 

6874 

6876 

6885 

-6886 

H O D O 
6000- 

C O Q A 




6896 

6907- 

-6909 

6935 

6941 

-6942 

6944- 

C Q A C 

6945 

0 952 




6990-6992 

7014- 

7027 

7088 

-7090 

7092 

F»n aq 
7099- 

•7102 




Till. 
/ JL X X 

/ J- x ^ 

7135 

7151- 

•7154 

7162 

-7 lo7 

71/4" 

. *7 1 "7 C 
• / 1 / D 




7219 

7225 

7229 

724 8- 

-7249 

7255 

*T *> C Q 

-7259 

7275*- 

• /Zo U 




7283- 

-7284 

7292 

73 3 4 

734 6 

-7347 

7349- 

•7 0 C A 

/3oo- 




7367 

7373 

7377 - 

73 84 

7 33z 

-7 "0 o o 

- / j y j 

/3 y 0 

T A O A - 

. *7 A O R 
• / 4 Z D 




7446 

7550 

7599- 

7600 

7620 

T> c *"i 

7626 

-7633 

*7 C A A 

7644 

/ D4 0 - 




7650 

7661 

7676- 

7677 

7705 

77 J.O 

7747 

7753- 

•7754 




7764 

-7765 

7780- 

7790 

7899 

T OA A 

-7900 

n n A T 

7903 

7907 

7935 




7937 

7963- 

- /17DD 


* / / 3 

•7 qt <r 
/ zt 1 0 

- / 5* / / 

7991- 

-7992 




8031 

-8033 

8102- 

8109 

8202 

8227 

-8229 

8306-8308 




8340 

-8341 

8353- 

8354 

8414 

-8416 

8427 

8436 

8455- 




8456 









diaphragm 

— . . 

BioChain 

DIA002 

94 1206-1209 13 

11-1313 1357-1358 14 

02-1407 



1561 

1659 

1870- 

1874 

1950 

-1953 

2156- 

Z XD / 





2264 

2453 

2493- 

2494 

374 0 

-3741 

4484 

MOO" 





5757 

-5758 

7472 

7953- 

•7954 





endothelial 

Stratagene 

EDT001 

8-9 20-25 

67 93 

102- 

-10*7 

121 132-137 

1 j A 

140- 

- 142 

cells 



144- 

145 172-175 

195- 

-196 

201-2 

39 213 

-2 lt> 

Z3U 




233-237 241-243 

253- 

-254 

273-274 277 

O *7 Q 
- 2 / O 

■7 U C\ - 

z 0 u — 




284 296-297 304 

345- 

-349 

371-374 397 

O QQ 

- 3 9o 

4 U3 - 




407 416-418 432 

-433 

449 

459-4 

54 513 

CI T 

3 ± / 

32 / — 




531 

546-551 580 

-581 

597- 

598 633-634 

641- 

C A C 




650- 

S51 656-658 

661- 

-665 

720-721 733 

- /3!> 

/ D / - 




760 765 767 779 

-780 

806- 

807 831-833 

OCT 

pCQ' 




868- 

871 877-878 

887- 

-891 

923-927 937 

_ QAO 

27/4 




1005 

-1007 

1016 

1019 

1022 

-1023 

1033- 


1 AAA- 




1046 

1050 

1070- 

1071 

1151 

-1152 

1160- 

1 lOl 

J. J. / ^ 




1176 

1194 

-1196 

1206- 

-1209 

1213 

-1221 

1 O O "7 . 

.1947 




1249 

-1253 

1261- 

1263 

1287 

-1302 

1306- 

i "inn 
j. j \> 0 

1*311 _ 
X O JL M 




1313 

1316 

-1317 

1324- 

-1335 

1339 

-1346 

1 .3 4 O 

11CQ_ 




1370 

1374 

-1379 

1391 

1394 

-1414 

1425- 


144 8 - 




1450 

1453 

-1459 

1465 

1472 

-1487 

1490- 


-L ZJ \J ZJ 




1506 

1511 

-1513 

1525-1528 

1531 

-1537 

1540 

-1541 j 




1549 

-1555 

1557- 

1570 

1572 

-1577 

1579- 

1580 

1583- 




1585 

1592 

-1595 

1600- 

-1602 

1507 

-1608 

1610 

1616- 




1617 

1619 

-1620 

1622- 

-1628 

1630 

-1634 

1637 

-1642 




1653 

-1658 

1671- 

1674 

1679 

1691 

-1692 

1698 

-1700 




1702 

-1710 

1712- 

1713 

1716 

1718 

1726- 

1728 

1759- 




1760 

1765 

-1767 

1779-1795 

1798 

-1800 

1805 

1809- 




1810 

1813 

-1815 

1817- 

-1818 

1820 

-1821 

1824 

-1825 




1849 

-1851 

1854- 

.1856 

1861 

-1869 

1881- 

1882 

1885- 




1889 

1894 

-1896 

1907- 

-1913 

1916 

1921- 

1927 

1930- 




1932 

1934 

-1935 

1937- 

-1946 

1950 

-1953 

1955 

-1963 




1967 

-1968 

1970- 

1971 

1980 

-1984 

1986- 

1994 

1998- 




1999 

2001 

-2008 

2016 

-2019 

2032 

-2044 

2046 

-2052 




2061 

2063 

-2075 

2077- 

-2082 

2087 

-2094 

2103 

-2107 j 




2112 

2114 

-2121 

2128 

2131 

-2136 

2138- 

2142 

2147- 




2154 

2156 

-2157 

2165 

-2175 

2182 

-2183 

2185 

-2194 




2199 

-2200 

2209 : 

2217 

2229 

-2230 

2237- 

2239 

2241- 




2242 

2249 

-2250 ; 

2255 

-2257 

2262 

-2264 

2266 

-2275 




2282 

-2283 

2285 

2297 

-2299 

2302 

-2306 

2315 

-2322 




2324 

-2327 

2329-2349 

2353 

-2358 

2365- 

2366 

2368- 




2372 

2381 

2385- 

2386 

2388 

-2390 

2392- 

2397 

2408- 




2415 

2417 

-2419 

2428 

-2429 

2433 

-2434 

2445 

-2453 




2458 

-2461 

2468 

2470 

2482 

-2488 

2493- 

2494 

2509- 
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2511 

2550 

2580 

2637- 

2671 

2742 

2796- 

2838 

2879 

2948- 

3045- 

3080- 

3111 

3157- 

3250 

3298- 

3344 

3375 

3438- 

3497 

3539 

3590 

3638- 

3669- 

3722 

3794- 

3901- 

3983- 

4062 

4150 

4187 

4252 

4350 

4382- 

4441- 

4544- 

4645 

4699 

4760- 

4838 

4939 

5008- 

5060 

5123- 

5160 

5196- 

5278 

5325 

5408 

5459 

5501 

5565- 

5615 

5664- 

5701 

5776 

5835- 


2513 
2558 
2587 
2641 
2704 
2744 
2798 
2842 
2884 
2950 
3048 
3085 
3117 
3158 
3253 
3300 
3346 
3386 
3449 
3500 
3544 
3595 
3640 
3679 
3724 
3795 
3903 
3998 
4070 
4166 
4191 
4269 
4352 
4385 
4447 
4549 
4648 
4703 
4762 
4848 
4950 
5012 
5065 
5124 
5162 
5199 
5280 
5336 
5415 
5466 
5507 
5566 
5621 
5667 
5705 
5780 
5838 


-2514 
2561- 

•2588 
2648- 

•2711 

-2747 
2806- 

-2848 

-2901 
2953- 
3054- 
3088t 

-3133 
3165- 

-3269 
3306- 

-3356 

-3389 
3454 

-3503 

-3547 

-3599 
3642- 
3683- 

-3726 
3850- 
3910- 
4001- 

-4072 
4170- 

-4193 

-4270 
4361- 
4415- 
4480- 
4565 

-4653 
4719 
4768- 

-4850 

-4953 
5015- 

-5072 
5129- 

-5163 
5236- 

-5282 
5340 
5431- 
5469- 

-5512 
557Q- 

-5622 
5677 

-5709 

-5782 
5851- 


2522- 

2565 

2592- 

2654 

2713- 

2772- 

2811 

2853- 

2908- 

2971 

3063 

3091 

3137- 

3172 

3274- 

3310 

3358 

3416- 

3471- 

3507- 

3551- 

3605- 

3643 

3686 

3728 

3854 

3919 

4021 

4102- 

4171 

4203- 

4272- 

4362 

4417 

4484 

4 603 

4 662 

4721- 

4769 

4873- 

4957- 

5017 

5076- 

5130 

5168- 

5243 

5284- 

5343 

5434 

5472 

5526- 

5580- 

5624- 

5683 

5713- 

5793- 

5863 


2525 

2567- 

2596 

2656- 

2715 

2775 

2814- 

2855 

2911 

2974- 

3066- 

3095- 

3141 

3175- 

3275 

3313- 

3361- 

3418 

3478 

3516 

3560 

3610 

3646- 

3699- 

3730 

3876- 

3926 

4027- 

4109 

4173 

4204 

4273 

4366- 

4420- 

4491- 

4606 

4664- 

4727 

4771- 

4877 

4958 

5022- 

5081 

5134 

5173 

5246 

5285 

5351 

5438- 

5475- 

5545 

5581 

5626 

5687- 

5714 

5796 

5875- 


2536 

2568 

2603 

2660 

2720 

2778- 

2817 

2860- 

2913- 

3025 

3067 

3096 

3149- 

3177 

3279- 

3314 

3366 

3425- 

3481- 

3521 

3564- 

3612 

3652 

3703 

3738- 

3877 

3928- 

4040 

4128- 

4177 

4216- 

4286 

4367 

4424 

4494 

4609 

4665 

4729 

4772 

4909 

4968- 

5033 

5101 

5139- 

5177- 

5255 

5296 

5377 

5440 

5478 

5547- 

5585 

5630- 

5690 

5746- 

5809- 

5881 


2537 

2543 

2549- 

2571 

-2572 

2574- 

2609 

2630 

-2633 

2665 

-2667 

2669- 

2722 

-2727 

2731- 

2786 

2788 

-2794 

2823 

-2834 

2836- 

2863 

2867 

2875- 

2915 

2918 

-2946 

3034 

-3039 

3043 

3071 

3073 

-3074 

3102 

-3105 

3108- 

3150 

3152 

-3155 

3181 

3220 

-3228 

3284 

3290 

-3292 

3324 

-3332 

3334- 

3368 

-3369 

3373- 

3428 

3430 

-3433 

3486 

3491 

-3493 

3524 

-3527 

3537- 

3567 

3570 

3588- 

3616 

3623 

-3630 

3658 

3660 

-3663 

3707 

-3713 

3717- 

3759 

3776 

-3780 

3885 

3888 

-3893 

3 936 

3957 

-3960 

4042 

4044 

4061- 

4129 

4132 

4142 

4180 

-4183 

4186- 

4217 

4230 

4249- 

4294 

-4322 

4349- 

4369 

4378 

-4380 

4429 

4436 

-4438 

4514 

-4535 

4538 

4614 

4625 

4644- 

4685 

-4687 

4697 

4743 

4745 

-4749 

4823 

-4824 

4837- 

4919 

-4926 

4937- 

4970 

4972 

-4976 

5036 

-5042 

5054 

5107 

-5109 

5121 

5140 

5146 

5159- 

5179 

5191 

-5192 

5258 

-5262 

5275- 

5302 

-5305 

5323- 

5393 

-5401 

5406- 

5443 

-5445 

5448- 

5485-5488 

5500- 

5552 

5559 

-5560 

5588 

5596 

5614- 

5632 

5658 

-5659 

5694 

-5695 

5698- 

5753 

57 57 

-5758 

5814 

5825 

-5828 

5892 

-5893 

5896- 
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5898 

5920- 

5923 

5932 

5940 

-5941 

5945 

-5949 

5969- 

5970 

5984- 

5986 

5988- 

5994 

5999 

-6000 

6003 

-6007 

6028 

-6031 

6048 

-6061 

6087 

-6090 

6119 

-6120 

6123- 

6127 

6131 

6165 

-6169 

6172 

-6176 

6180 

-6191 

6195- 

6196 

6203 

6209 

-6210 

6221 

-6223 

6234 

-6235 

6243- 

6244 

6291- 

6292 

6350 

6365 

-6366 

6369 

-6370 

6378 

6392 

-6395 

6400 

-6401 

6424 

-6425 

6445 

6456 

-6459 

6464 

-6472 

6489 

6501- 

6505 

6512 

-6513 

6519 

-6521 

6548 

-6549 

6555 

-6562 

6600 

-6601 

6623 

-6642 

6653- 

6660 

6665- 

6668 

6688- 

6689 

6697 

6734 

-6736 

6742- 

6744 

6766- 

6769 

6785 

6799 

-6807 

6813 

6828 

-6832 

6836 

-6838 

6849 

6858 

6861 

6863 

-6867 

6885 

-6886 

6890 

-6894 

6897 

-6898 

6911 

-6915 

6923 

^6924 

6941- 

6943 

6957 

6960 

-6961 

6990 

-6992 

6997 

-6998 

7001- 

7002 

7012- 

7027 

7029- 

7033 

7050 

-7053 

7056 

-7059 

7063 

-7067 

7080 

-7082 

7086 

-7090 

7098 

7104 

-7112 

7150 

-7153 

7181 

-7184 

7188 

-7190 

7248 

-7249 

7251- 

7254 

7281- 

7282 

7308- 

7309 

7321 

7344 

7346 

-7347 

7357 

-7359 

7366 

-7367 

7375 

7377 

-7384 

7396 

7398 

7402 

-7403 

7446 

7451- 

7454 

7463 

7470 

7527 

-7531 

7550 

-7554 

7556 

7599- 

7600 

7626 

-7629 

7650 

7686- 

7687 

7693- 

7696 

7698- 

7700 

7710 

7730 

7748 

7757- 

7760 

7770- 

7772 

7780- 

7790 

7797 

7839 

-7841 

7853- 

7854 

7859- 

7860 

7890- 

7891 

7907 

7955 

-7957 

7963- 

7964 

7972- 

7973 

7976- 

7977 

7982 

-7983 

8018 

-8020 

8031 

-8033 

8062 

-8064 

8070 

-8072 

8086 

-8087 

8096 

8115 

-8116 

8163 

-8166 

8194 

-8196 

8227 

-8229 

8234- 

8235 

8254 

8261 

-8265 

8289 

8320 

8353 

-8354 

8387 

8397 

-8398 

8413 

8418- 

8419 

8430 

-8432 

8443 

-8444 

8450 

-8456 

8465 

-8466 

8488 

-8495 

8499 

-8500 


565- 

575 3188-3214 3942-3945 4456-4479 4854- 


Genomic 
clones from 
the short 
arm of 
chromosome 
8 


DNA from 

Genetic 

Research 


EPM001 


4872 


Genomic 
clones from 
the short 
arm of 
chromosome 
8 


DNA from 

Genetic 

Research 


EPM003 


3188-3214 3942-3945 4456-4479 4563-4564 4854- 
4872 


Genomic 
clones from 
the short 
arm of 
chromosome 
8 


DNA from 

Genetic 

Research 


EPM004 


565-575 3188-3214 3942-3945 4456-4479 4563- 
4564 4854-4872 6712 7707 


esophagus 


BioChain 


ESO002 


144-145 973 1022-1023 1033-1035 1148-1149 
1579-1580 2059-2060 2063 3163-3164 4490 6552- 
6554 6703-6704 7953-7954 8430-8432 


fetal brain 


Clontech 


FBR001 


264-265 386 656-658 1160-1161 1300-1302 2064- 

2066 2148-2152 2353-2357 2369 2389-2390 2442- 

2444 2656-2657 2689-2691 2731 2814-2815 2960- 

2962 3080-3085 3137 3227-3228 3639-3640 3840 

3876-3877 3969-3970 4015-4018 4035-4040 4061- 
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4062 4101 4263 4508 

4809 4917-4918 4954 

5758 5777 6452-6453 

8229 8413 


-4509 4648-4652 4662 4806- 
5244-5245 5255 5686 5757- 
7416-7417 7626-7629 8227- 


fetal brain 


Clontech 


FBR004 


272 280-282 992-994 1033-1035 
1821 1972 2062 2315-2322 2612- 
2810-2811 2821-2822 2905-2907 
3269 3280-3284 3375 3410 3430 
3886-3887 3950-3955 4498-4501 
4835 4848-4850 4873-4876 4962- 
5428 5721 5731-5735 6406 7357- 
7837 8430-8432 


1048 1610 1820- 
2615 2681 2711 
3095-3096 3262- 
3620-3622 3729 
4629-4634 4830- 
4967 5413-5414 
7359 7794-7796 


fetal brain 


Clontech 


FBR00 6 


8-19 22-25 30 67 77 
180-183 193-194 279 
366-369 384-385 402 
488-490 517 527-529 
587 688 692 828-829 
982-985 992-994 101 
1190-1191 1206-1209 
1335 1339-1340 1359 
1413-1414 1455-1462 
1515 1518-1520 1525 
1579-1580 1592-1595 
1623 1636-1641.1671 
1781-1783 1813-1815 
1840 1844-1849 1857 
1945 1950-1953 1958 
2020-2027 2050-2060 
2107 2114-2119 2124 
2157 2165-2170 2174 
2223-2224 2262-2264 
2385-2386 2389-2390 
2444 2454 2493-2494 
2574-2580 2585-2588 
2644-2647 2651-2652 
2742 2744-2747 2776 
2848 2850-2851 2857 
2931-2934 2960-2962 
3035 3037-3039 3045 
3102-3105 3108-3111 
3162 3165-3172 3221 
3284 3287-3289 3305 
3358 3386-3389 3425 
3517-3519 3528-3530 
3559 3564-3565 3568 
3639-3640 3656-3657 
3695 3699-3701 3707 
3788-3789 3798-3801 
3857 3876-3877 3892 
3950-3955 3958-3960 
4018 4020-4026 4035 
4067-4072 4074-4079 
4170-4171 4177 4180 
4288 4361-4362 4369 
4514-4518 4520-4528 
4617 4629-4634 4662 
4773 4806-4809 4817 


-79 93 105 113 143 150-151 
286-289 294-295 362-363 
430-431 439-440 480-482 
534-535 580-581 583 586- 
880 899 934-935 941-947 
1-1012 1148-1149 1162 

1227-1243 1311-1313 1334- 
-1367 1396-1398 1402-1407 

1490-1491 1498-1500 1511- 
-1528 1531-1537 1558-1567 

1607-1608 1616-1617 1622- 
-1672 1680-1692 1702-1710 

1820-1821 1823-1825 1835- 
-1869 1876 1896 1912 1944- 
-1963 1970-1971 2016-2017 

2071-2073 2087-2094 2103- 
-2126 2128 2137-2141 2156- 
-2175 2204-2207 2209-2217 
2315-2322 2324-2327 2379 
2420-2423 2430-2432 2442- 
2509-2511 2551-2553 2558 
2597 2625 2630-2633 2636 
2656-2660 2722-2724 2739- 
-2783 2796 2821-2822 2842- 
-2859 2868-2878 2889-2893 

3018-3023 3029-3031 3034- 
-3048 3058-3063 3086-3087 

3122-3133 3142r3147 3161- 
-3228 3250 3276-3278 3280- 
-3307 3346-3352 3355-3356 
-3428 3438-3449 3481-3485 

3540-3543 3548-3550 3555- 
-3569 3605-3610 3620-3622 

3678-3679 3683-3686 3693- 
-3709 3743-3759 3779-3780 

3804-3805 3841-3849 3855- 
-3893 3904-3909 3931-3936 

3983-3987 4001-4008 4015- 
-4040 4047-4051 4061-4062 
4102-4109 4128-4129 4166 
-4181 4203-4204 4252 4271 
4382-4384 4397 4436-4438 
4536-4537 4559 4607-4608 
4702 4705 4729 4738-4741 
4830-4835 4873-4876 4917- 
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4918 

4936 

-4939 

4949 

-4953 

4968- 

4970 

4984 

-4985 

5015 

-5017 

5032 

-5033 

5056 

5061 

5080 

-5081 

5107- 

5109 

5180 

-5183 

5226 

-5227 

5241- 

5243 

5250 

-5254 

5265 

-5267 

5275 

-5278 

5280- 

5282 

5301 

5369 

-5371 

5389 

5416 

5465 

5469 

-5472 

5554- 

5558 

5569 

5664- 

5667 

5671 

5678 

-5679 

5694- 

5695 

5714 

-5718 

5723- 

5729 

5757 

-5758 

5825 

-5834 

5839 

5869 

5918 

-5919 

5964 

-5966 

-5988 

-5992 

6008 

6048- 

6056 

6161 

-6164 

6236 

-6241 

6245 

6309 

-6311 

6596- 

6597 

6634 

-6638 

6665 

-6668 

6682 

-6683 

6718- 

6721 

6724 

-6725 

6799- 

6804 

6811 

-6812 

6825 

-6827 

6875 

6899 

-6900 

6916- 

6917 

6954 

-6955 

6957 

6997- 

6998 

7056 

-7059 
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1 ft A A 
1044 

-1852 

1854- 

-1863 

1870- 




1874 

1876- 

-1877 

1879- 

1 o o r\ 
■ loo U 

1885- 

-1890 

1894- 

-1896 




1904 

-1905 

1907- 

•1913 

1 Q1 A 

iyio 

-1927 

1930- 

-1932 

1934- 




1942 

1944- 

-1968 

1973 

1 q a a 

-1984 

1986- 

-1991 

1997- 




2004 

2016- 

-2019 

2030- 


2036- 

-2037 

2045 

2050- 




2052 

2062- 

-2075 

2077- 


2081- 

2086 

2095- 

-2097 




2103 

-2110 

2112 

2114- 


2124- 

2126 

2129- 

-2137 




2143 

-2146 

2153- 

•2154 


-2175 

2186- 

2189 

2199- 




2203 

2208 

2222- 

2224 

ooo<; 

-2227 

2237- 

•2239 

2241- 




2242 

2244- 

2251 

2255- 

. 0*><C A 

2266- 

2280 

2285- 

•2288 




2291 

-2294 

2297- 

2305 

« jU / 

-2322 

2324- 

2327 

2337- 




2346 

2350- 

-2358 

2361- 


2368 

2370- 

2374 

2379 




2381 

-2387 

2389- 

2390 


: 2403 

2407 

2417-2423 




2428 

-2434 

2436- 

2438 

O A A *!> 

-2451 

2453- 

•2454 

2458- 




2463 

2468- 

2470 

2473 

O A *7 *C 
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-2477 

2482- 

2486 

2489- 




2490 

2492- 

2494 

2502- 

Z311 

2514- 

2521 

2526- 

2556 




2558 

-2560 

2567- 

2568 

2574 

-2580 

2583- 

2588 

2592- 




2596 

2603- 

2609 

2616- 

2 617 

2625- 

2633 

2636- 

•2641 




2651 

-2652 

2656- 

2660 

2665« 

-2667 

2671 

2682- 

•2683 




2704 

-2711 

2720-2727 

2731-2742 

2744- 

2749 

2757 




2776 

-2786 

2788- 

2794 

2796 

2799- 

2800 

2806 

2812- 




2819 

2826- 

2838 

2842- 

2848 

2850- 

2851 

2853- 

•2863 




2866 

•2879 

2881- 

2868 

2902 

2904 

2918- 

2946 

2948- 




2950 

2960- 

2962 

2974- 

2979 

3014- 

3023 

3034- 

3066 




3071 

3080- 

3091 

3095- 

3096 

3102- 

3105 

3108- 

•3114 
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3122-3141 3148-3150 3152-3158 3161-3172 3175- 
3177 3181 3188-3219 3221-3226 3249-3269 3274- 
3275 3279-3292 3295 3301 3305-3310 3313-3332 
3334-3352 3359-3366 3370-3372 3375 3380-3383 
3386-3389 3395-3397 3399 3410 3416-3418 3422 
3431-3433 3435-3437 3454 3464-3468 3471-3486 
3491-3493 3497 3500-3503 3507-3516 3520 3522- 
3527 3536 3551-3560 3564-3569 3595-3599 3602- 
3610 3612-3616 3623-3633 3637-3655 3658-3663 
3673-3674 3682-3686 3693-3695 3699-3701 3705- 
3713 3717-3722 3728 3730 3733-3739 3743-3766 
3771-3774 3776-3780 3785-3789 3792 r 3796 3798- 
3801 3804-3805 3841-3854 3861-3865 3868 3878- 
3879 3883-3884 3888-3891 3901-3909 3927-3936 
3946-3949 3958-3960 3964-3965 3969-3989 3991- 
3998 4000-4013 4027-4034 4041-4046 4048-4051 
4058-4062 4070-4072 4074-4081 4086-4088 4092- 
4095 4097-4100 4102-4118 4120-4121 4127-4129 
4141-4154 4163 4166 4169-4171 4177 4180-4181 
4189 4191-4200 4205-4207 4213-4217 4225-4227 
4231 4242-4245 4255 4266 4274-4285 4294-4297 
4336 4344-4348 4366-4367 4377. 4386-4396 4420- 
4424 4429 4436-4447 4453-4455 4480-4486 4489- 
4495 4498-4504 4510-4518 4529-4546 4606-4609 
4611 4614 4625 4644-4645 4648-4652 4662 4678- 
4679 4685-4687 4697 4703 4721-4723 4729 4738- 
4741 4745-4749 4756-4759 4764-4770 4780-4782 
4795 4825 4830-4838 4845-4853 4873-4877 4886- 
4890 4910-4918 4920-4922 4924 4929-4948 4955 
4957-4958 4968-4970 4978 4990-4994 4996-5001 
5005-5012 5014-5017 5019-5038 5043-5049 5053- 
5055 5060-5061 5065-5072 5076-5083 5098-5101 
5107-5109 5114 5120-5145 5147 5153-5154 5161 
5168-5173 5180-5184 5196-5200 5206-5207 5209 T 
5213 5216-5223 5226-5232 5236-5240 5244-5245 
5255 5258-5262 5268-5272 5275-5278 5280-5282 
5286-5288 5290-5292 5297 5302-5306 5316 5332 
5336 5343 5349-5351 5389 5393-5401 5406-5409 
5423-5442 5452-5459 5469-5471 5475-5478 5485- 
5488 5492-5493 5500-5501 5507-5512 5521-5522 
5526-5545 5547-5558 5565-5566 5579-5581 5585- 
5587 5594 5601 5616-5617 5619-5622 5624-5626 
5631 5634-5649 .5658-5659 5664-5667 5671 5683 
5687-5690 5694-5695 5698-5701 5705-5709 5714- 
5718 5723-5729 5746-5758 5764-5769 5773-5782 
5797 5815-5828 5842-5843 5853-5855 5864-5869 
5882-5890 5904-5914 5920-5921 5932 5937-5938 
5940-5941 5945-5948 5950-5952 5956-5960 5964- 
5966 5970 5988-5994 5999-6005 6021 6027 6038- 
6045 6048-6050 6058-6061 6063-6068 6084 6087- 
6090 6119-6120 6123-6127 6132 6145-6X46 6156- 
6158 6165-6171 6174-6176 6178-6191 6196 6203 
6209-6210 6213-6225 6235-6249 6251-6256 6286 
6289 6293 6297-6298 6303-6311 6335 6353-6360 
6364-6366 6385-6388 6397 6407-6419 6424-6425 
6443-6444 6452-6453 6456-6459 6464-6472 6501- 
6504 6506-6510 6528 6548-6554 6560-6562 6600- 
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6606 

6706- 

6744 

6828- 

6863- 

6911- 

6957 

7014- 

7083 

7144 

7181- 

7231- 

7286 

7347 

7465- 

7556 

7619 

7698- 

7797- 

7859- 

7917 

7983 

8070- 

8122 

8198 

8338- 

8387 

8440 

8495 


6613- 

6708 

6751- 

6832 

6867 

6917 

6967- 

7027 

7086- 

7151- 

7184 

7233 

7305 

7375 

7468 

7561- 

7638- 

7700 

7801 

7862 

7930- 

7998: 

8089 

8133- 

8227- 

8339 

8397- 

8443- 


6615 
6713- 
6756 
6836- 
6881 
6920- 
6969 
7029- 
7090 
7153 
.7188- 
7238- 
7308- 
7377- 
7477 
7583 
7639 
7703 
7805 
7867- 
7931 
8002 
8091- 
8140 
8230 
8342- 
8398 
8444 


6629 

6714. 

6766- 

6838 

6890- 

6924 

6990- 

7033 

7098- 

7159- 

7195 

7240 

7311 

7384 

7502 

7599 

7661 

7711 

7810 

7868 

7946 

8018- 

8095 

8142- 

'8277- 

8354 

8413 

8455- 


6668 

6718- 

6767 

6840 

6894 

6940- 

6992 

7050- 

7102 

7160 

7200 

7255- 

7315 

7411- 

7506 

7600 

7667- 

7753- 

7812 

7890- 

7949- 

8021 

8101- 

8145 

8279 

8364 

8418- 

8456 


6699- 

6723 

6776- 

6845- 

6897- 

6943 

7001- 

7053 

7108- 

7162- 

7205- 

7259 

7321 

7417 

7527- 

7609- 

7668 

7754 

7822- 

7891 

7951 

8026- 

8113 

8182- 

8298- 

8367- 

8419 

8459 


6700 

6727- 

6779 

6848 

6898 

6946- 

7002 

7063- 

7115 

7167 

7206 

7270 

7329- 

7424- 

7538 

7612 

7671 

7761 

7828 

7904- 

7955- 

8033 

8115- 

8189 

8299 

8368 

8430- 

8465- 


6703- 

6728 

6799- 

6858 

6901- 

6949 

7009- 

7078 

7133- 

7171- 

7222- 

7272- 

7330 

7425 

7551- 

7615- 

7680- 

7766- 

7830 

7905 

7957 

8062- 

8116 

8194- 

8306- 

8372- 

8432 

8482 


6704 

6739- 

6809 

6861 

6906 

6952 

7011 

7080- 

7135 

7175 

7229 

7284 

7346- 

7446 

7554 

7616 

7683 

7790 

7843 

7907 

7980- 

8063 

8121- 

8196 

8308 

8382 

8439- 

8490- 


fetal 
liver- 
spleen 


Soares 


FLS002 3-9 14-19 26-28 33 4 
107 111 123-126 143 
221-222 230 253-254 
276 280-282 285 296- 
379 384-385 408 416 
466 469-473 483-486 
551 563-564 597-598 
650-651 656-658 661 
713 718 733-735 740- 
836-838 860-861 909- 
969 974 984-985 992- 
1023 1033-1035 1037 
1147 1153-1158 1173- 
1213-1221 1227-1245 
1296 1303-1305 1309- 
1339-1348 1354-1355 
1390 1402-1414 1439- 
1490-1491 1501-1502 
1528 1531-1537 1542- 
1583-1599 1602-1606 
1629-1634 1637-1642 
1677 1679-1690 1695- 
1725-1728 1730 1759- 
1774-1776 1778 1781 
1817-1819 1824-1825 
1860 1870-1877 1879- 
1905 1907-1911 1915- 
1937-1945 1950-1953 


0-41 51 56 69 73 80 106- 
161 172-175 180-183 195 
259 262 264-268 271 275- 
297 307-320 330-342 375- 
424 432-433 441 463-464 
488-495 527-531 533 549- 
612-613 632 638-639 648 
665 698 705-706 709 712- 
741 806-807 809 831-833 
913 937-940 949-951 962- 
994 1008 1017-1018 1021- 
1064-1065 1070-1071 1122- 
1174 1185-1187 1194-1209 
1249-1257 1273-1279 1290- 
1310 1316-1317 1324-1335 
1357-1370 1374-1377 1380- 
1441 1455-1462 1472-1484 
1507-1510 1514-1515 1525- 
1555 1557-1567 1572-1577 
1610 1612-1615 1619-1620 
1658-1659 1671-1674 1676- 
1696 1702-1710 1719-1721 
1760 1762-1763 1765-1767 
1783 1786-1790 1813-1815 
1844-1848 1850-1851 1854- 
1880 1890-1893 1896 1904- 
1916 1922-1927 1934-1935 
1958-1961 1964-1966 1979- 
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1984 

1986 

-1991 

1997 

2035 

2037 

-2044 

2061 

2078 

2081 

-2082 

2086 

2122 

2129 

-2137 

2148 

2182 

-2184 

2190 

-2191 

2224 

2237 

-2240 

2243 

2267 

2286 

-2289 

2291 

2335 

-2346 

2362 

-2364 

2417 

-2419 

2430 

-2435 

2470 

2480 

2482 

-2486 

2520 

2538 

-2543 

2549 

2570 

2574 

-2580 

2583 

2616 

-2617 

2619 

-2623 

2661 

-2663 

2671 

2682 

2721 

2725 

-2727 

2731 

2783 

2794 

2797 

2799 

2817 

-2819 

2826 

-2834 

2863 

2866 

-2878 

2880 

2908 

-2911 

2917 

-2946 

3016 

3018 

-3023 

3036 

3063 

3067 

3071 

3073 

3102 

-3105 

3108 

-3111 

3150 

3152 

-3155 

3157 

3175 

-3177 

3187 

3217 

3262 

-3273 

3280 

-3284 

3310 

3315 

-3323 

3334 

3386 

-3389 

3395 

-3397 

3454 

3471 

-3472 

3475 

3498 

-3503 

3508 

-3511 

3540 

-3547 

3555 

-3557 

3590 

3595 

-3599 

3616 

3643 

3646 

-3655 

3658 

3686 

3696 

3699 

-3701 

3729 

-3730 

3733 

-3739 

3780 

3788 

-3789 

3791 

3850 

-3854 

3861 

-3870 

3893 

3901 

-3903 

3928 

3966 

-3968 

3971 

-3987 

4041 

4044 

4048 

-4051 

4074 

-4088 

4092 

-4095 

4129 

4136 

4139 

4141 

4165 

-4166 

4169 

4174 

4205 

-4207 

4216 

-4220 

4259 

4264 

-4266 

4271 

4288 

4293 

4296 

-4297 

4363 

-4364 

4366 

-4369 

4396 

4420 

-4421 

4429 

4483 

4485 

-4486 

4490 

4518 

. 4520 

-4535 

4542 

4564 

4607 

-4609 

4613 

4662 

4664 

-4665 

4675 

4692 

4699 

4716 

-4717 

4735 

4743 

4745 

-4749 

4795 

4825 

4837 

-4838 

4876 

4886 

-4889 

4901 

4926 

4929 

-4935 

4941 

4978 

4983 

-4985 

4993 


-2000 
2064^ 
2095- 
-2152 
2195- 
-2248 
-2294 
2378- 
2440- 
2495- 
-2550 
-2586 
2625 
-2683 
-2742 
-2800 
2836- 
-2883 
2948- 
3040- 
-3078 
3117- 
-3158 
-3226 
3287* 
-3344 
3399 
-3478 
3517- 
3560 
-3628 
3660- 
3705 
3748- 
3796 
3876- 
-3936 
4009- 
4054 
4097- 
-4142 
-4178 
4233- 
4274- 
4323- 
4377 
4436- 
-4495 
-4543 
-4614 
4679 
4719 
4757- 
4845- 
-4903 
4943- 
-4994 


2016- 

2067 

2097 

2156- 

2200 

2251 

2300- 

2379 

2441 

2499 

2558< 

2592- 

2637- 

2692- 

2744- 

2806 

2848 

2889- 

2950 

3042 

3088- 

3133 

3161- 

3250- 

3292 

3346- 

3416- 

3481- 

3519 

3564- 

3631- 

3663 

3707- 

3759 

3798- 

3877 

3958- 

4013 

4058- 

4100 

4145- 

4182- 

4240 

4275 

4325 

4386- 

4438 

4497- 

4547- 

4617 

4681- 

4721- 

4758 

4646 

4906 

4948 

4996- 


2017 

2071- 

2103- 

2162 

2204- 

2255- 

2305 

2381- 

2454 

2502- 

2560 

2596 

2641 

2711 

2755 

2809 

2853- 

2893 

2 960*- 

3045- 

3091 

3137- 

3162 

3252 

3298- 

3352 

3423 

3486 

3524 

3565 

3633 

3678- 

3709 

3761- 

3801 

3880- 

3960 

4015- 

4062 

4110- 

4151 

4183 

4242- 

4278- 

4331- 

4387 

4441- 

4501 

4552 

4629- 

4683 

4723 

4764- 

4848- 

4919- 

4955 

4998 


2028 

2032- 

2073 

2077- 

2107 

2114- 

2165 

-2173 

2208 

2222- 

2257 

2266- 

2328 

-2331 

2386 

2407 

2456 

2464- 

2510 

2514- 

2567 

-2568 

2603 

-2609 

2651 

-2652 

2716 

-2719 

2757 

2772- 

2814 

-2815 

2856 

2860- 

2902 

-2904 

2972 

2978- 

3048 

3058- 

3 095 

-3096 

3141 

3149- 

3165 

-3172 

3259 

-3260 

3300 

3305- 

3371 

-3379 

3425 

-3429 . 

3491 

-3493 

3528 

-3530 

3570 

3588- 

3637 

3642- 

3679 

3683- 

3717 

-3721 

3766 

3776- 

3 804 

-3805 

3882 

3888- 

3962 

-3963 

4018 

4020- 

4067 

-4069 

4118 

4120- 

4153 

4163 

4191 

-4193 

4245 

4255 

4279 

4285 

4333 

4348 

4391 

4394- 

4447 

4480- 

4504 

4510- 

4561 

4563- 

4639 

4647 

4685 

-4687 

4729 

4731 

4767 

4790 

4 850 

4873- 

4 922 

4925- 

4968 

-4976 

5005 

-5007 
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5014 

-5017 

5022 

-5028 

5034 

-5038 

5054 

5065 

-5072 




5076 

-5079 

5084 

-5100 

5107 

-5109 

5120 

5129 

-5131 




5134 

5136 

5138 

-5140 

5142 

5145 

5148 

5150 

-5151 




5159 

-5163 

5167 

-5176 

5185 

-5189 

5193 

5196 

-5199 




5206 

-5207 

5224 

5226 

-5228 

5236 

-5243 

5250 

-5255 




5258 

-5262 

5268 

-5272 

5290 

-5292 

5297 

5306 

.5316 




5321 

-5322 

5326 

-5332 

5336 

5341 

5345 

5349 

5352- 




5359 

5369 

-5371 

5390 

5392 

5396 

-5401 

5406 

-5409 




5415 

-5416 

5427 

-5428 

5431 

-5437 

5441 

-5445 

5466 




5469 

-5471 

5475 

-5482 

5485 

-5488 

5492 

-5493 

5504- 




5505 

5514 

5517 

-5518 

5521 

-5522 

5547 

-5558 

5567- 




5568 

5570 

5579 

-5581 

5585 

5590 

-5592 

5594 

5596 




5601 

-5602 

5614 

-5615 

5619 

-5622 

5624 

-5626 

5643- 




5645 

5648 

-5649 

5658 

-5660 

5683 

5685 

5687 

-5690 




5694 

-5695 

5697 

-5701 

5705 

-5709 

5714 

5723 

-5729 




5746 

-5753 

5757 

-5758 

5764 

-5769 

5773 

-5774 

5778- 




5779 

5798 

-5805 

5809 

-5814 

5818 

-5821 

5825 

-5828 




5839 

-5840 

5842 

-5843 

5845 

-5848 

5850 

5882 

-5885 




5891 

5904 

-5914 

5920 

-5923 

5925 

-5930 

5932 

5937- 




5938 

5940 

-5941 

5945 

-5949 

5953 

-5954 

5956 

-5960 




5962 

-5963 

5970 

5986 

5988 

-5992 

5997 

6001 

-6008 




6027 

6038 

-6042 

6044 

-6045 

6048 

-6050 

6065 

-6068 




6077 

6082- 

-6083 

6087- 

-6090 

6112 

-6113 

6117 

-6120 




6128 

-6130 

6140 

-6141 

6155 

6157 

-6158 

6161- 

-6164 




6170- 

-6176 

6179- 

-6191 

6209 

-6210 

6221- 

-6223 

6228- 




6229 

6234 

6236- 

-6241 

6247 

6253 

6268 

6272 

6285 




6287 

6293 

6303- 

-6304 

6308 

6316- 

-6322 

6346 

6356- 




6360 

6364 

6385- 

-6388 

6391- 

-6395 

6404 

6407- 

-6411 




6424- 

-6426 

6443- 

-6444 

6501- 

-6505 

6508- 

-6509 

6517- 




6521 

6525- 

■6526 

6596- 

-6597 

6602- 

-6606 

6613- 

-6615 




6617 

6629- 

-6642 

6646- 

-6650 

6661 

6663- 

-6668 

6670- 




6676 

6688- 

6689 

6698 

6711- 

-6712 

6718- 

•6723 

6727- 




6728 

6731- 

•6732 

6734- 

-6735 

6742 

6749 

6757- 

•6761 




6764- 

-6765 

6776- 

-6780 

6785 

6805- 

•6807 

6828- 

-6832 




6840 

6843- 

6844 

6849 

6861 

6872- 

6874 

6890- 

-6894 




6911- 

6915 

6923- 

6931 

6935 

6937 

6941- 

6942 

6954- 




6955 

6958- 

6959 

6990- 

6992 

6997- 

6998 

7001- 

-7002 




7014- 

7027 

7029- 

7033 

7038- 

7040 

7046- 

7049 

7056- 




7059 

7063- 

7078 

7083-7085 

7088- 

7090 

7099- 

•7102 




7105- 

7106 

7111- 

7112 

7135 

7140- 

7142 

7151 

7159- 




7160 

7162- 

7167 

7181- 

7184 

7191 

7205- 

7206 

7223- 




7224 

7227 

7230- 

7234 

7248- 

7249 

7279- 

7280 

7285 




7308- 

7311 

7321 

7329- 

7330 

7334 

7416- 

7417 

7424- 




7425 

7428- 

7429 

7446 

7465- 

7468 

7480 

7494 

7501 




7506 

7570- 

7573 

7575 

7579 

7582- 

7584 

7593- 

7594 




7599- 

7600 

7603 

7624 

7640- 

7641 

7645 

7650 

7658 




7661- 

7666 

7680- 

7683 

7693- 

7696 

7698- 

7700 

7702 




7717 

7723- 

7727 

7729 

7748 

7770- 

7774 

7805 

7809 




7825 

7839- 

7841 

7886 

7980- 

7981 

8029 

8031- 

8033 




8073 

8086- 

8087 

8089 

8095 

809?- 

8098 

8101 

8121- 




8122 

8126- 

8127 . 

8129 

8133- 

8135 

8163- 

8X66 

8175 




8187- 

8188 

8192- 

8193 

8203- 

8204 

8234- 

8235 

8238 




8267- 

8269 

8271 

8297 

8306- 

8308 

8353- 

8356 

8387 




8397- 

8398 

8406- 

8407 

8413 

8418- 

8419 

8427 

8433- 




8435 

8455- 

8456 

8463- 

8464 

8490- 

8495 



fetal 

Soares 

FLS003 

499-500 786-787 

793- 

794 855-856 1003 1022- 

liver - 



1023 

1200- 

1205 

1249- 

1253 

1277- 

1279 

1349- 

1355 


icn*>o 


± OCTp 4-3 . O iS 2 &XJ H 


spleen 1516-1517 1784-1785 1870-1874 2315-2322 3065 

3217 3507 3623-3628 3730 4070-4072 4395-4396 
j 4484 4486 5022-5028 5107-5109 7108-7110 7797 

8469-8479 ' 

fetal liver Invitrogen FLV001 3-5 20-21 105 140-142 191 210-211 230 280-282 

345-349 419-424 479 488-490 499-500 527-529 
583 625-626 656-658 712-715 733-734 765 786- 
787 793-794 831-833 905-908 918 945-947 989 


1003 

1044 

-1046 

1070 

-1071 

1200 

-1205 

1213 

-1221 

1244 

-1245 

1261 

-1263 

1273 

-1279 

1286 

-1293 

1311- 

1313 

1337 

-1338 

1348 

-1355 

1359 

-1368 

1424 

1437- 

1438 

1442 

-1444 

1455 

-1462 

1498 

-1502 

1542 

-1548 

1561 

1568 

-1570 

1572 

-1574 

1603 

-1608 

1619 

-1620 

1630 

-1632 

1658 

-1659 

1673 

-1674 

1691 

-1692 

1702- 

1710 

1759 

-1760 

1764 

-1767 

1780 

-1785 

1791 

-1795 

1813 

-1815 

1839 

-1840 

1861. 

-1863 

1890 

1907 

-1912 

1921 

1930 

-1932 

1944 

-1945 

1955 

-1961 

1970 

-1971 

1995 

-1996 

2059 

-2060 

2120 

-2121 

2124 

-2126 

2137- 

2141 

2147 

2156 

-2162 

2220 

2262 

-2264 

2300 

-2301 

2307 

-2322 

2340 

2365 

-2366 

2468 

2502 

-2510 

2534- 

2535 

2538 

-2542 

2549 

-2550 

2574 

-2580 

2587 

-2588 

2651 

-2652 

2658 

-2660 

2698 

-2703 

2711 

2725 

-2727 

2842 

-2848 

2850 

-2851 

2868 

-2873 

2881 

-2883 

2902 

2913 

-2914 

2918 

-2930 

3037 

-3039 

3043 

3054 

-3057 

3065 

3108 

-3iio 

3115 

-3116 

3217 

-3219 

3305 

-3307 

3313 

-3323 

3346 

-3352 

3358 

3395 

-3397 

3399 

3410 

3419 

-3422 

3454 

3522 

-3523 

3605 

3660 

-3663 

3722 

3742 

3926 

3942 

-3945 

3961 

3986 

-3987 

3991 

-3998 

4047 

4082 

-4085 

4102 

-4106 

4122 

-4126 

4209 

4242- 

4245 

4293 

4366 

-4367 

4378 

-4380 

4395- 

-4396 

4429 


4480-4483 4486 4547-4549 4556-4558 4674 4676 
4764 5019-5028 5034-5035 5065-5072.5076-5079 
5098-5100 5107-5109 5153-5154 5241-5243 5351 
5364 5393-5395 5416 5431-5437 5443-5445 5554- 
5558 5585 5616-5617 5687-5690 5694-5695 5723- 
5729 5748-5753 5780-5782 5856-5863 5882-5885 
5940-5941 6003-6005 6044-6045 6051-6056 6058- 
6060 6069-6073 6117-6120 6236-6241 6353-6355 
6485 6508-6509 6548-6549 6718-6721 6742 6766- 
6767 6805-6807 6872-6874 6909 6958-6959 7050- 

! 7053 7080-7082 7086-7087 7151 7159-7160 7188- 

7191 7205-7206 7231-7233 7373 7375 7418-7419 
7465-7468 7474 .7568 7570-7573 7615-7616 7658 
7662-7666 767.0 7672-7675 7698-7700 7748 7753- 
7754 7799-7800 7867-7868 7929 7972-7973 8046- 

I 8048 8077 8110-8113 8115-8116 8187-8188 8397- 

8398 8418-8419 8430-8432 8457-8458 8466 8490- 

8495 

fetal liver Clontech FLV002 14-19 26-28 384-385 1796-1797 3334-3344 4063- 

4064 4498-4501 4778 

fetal liver Clontech FLV004 8-9 407-408 62.5-626 790-792 909-912 945-947 

1022-1023 1055-1059 1070-1071 1166-1171 1277- 
1279 1357-1358 1507-1510 1557 1603-1606 1610 

j 1671-1672 1695-1696 1885-1889 1896 1944-1945 

I 1950-1953 2074-2075 2385-2386 2538-2542 2544- 
' 2548 2651-2652 2656-2660 2731 2881-2888 2963- 


ACID'S 46 M- 





2971 

3065 

3108- 

3110 

3133 

3138 

-3141 

3148 

3152- 




3155 

3165- 

-3172 

3218 

-3219 

3262 

-3269 

3305 

-3307 




3315 

-3318 

3507 

3522 

-3523 

3642 

-3643 

3738 

-3739 




3861 

-3865 

3871- 

-3875 

3888 

-3891 

3931- 

-3936 

4Q15- 




4018 

4061- 

-4062 

4120 

-4121 

4429 

4873- 

-4876 

4996- 




4998 

5034- 

-5035 

5485 

-5488 

5526 

-5529 

5554 

-5558 




5757 

-5758 

6119- 

-6120 

6161- 

-6164 

6634- 

-6638 

7014- 




7027 

7083 

7152- 

-7153 

7205- 

-7206 

7274-7275 

7465- 




7468 

7615- 

-7616 

7745 

7799- 

-7800 

8075-8076 

8187- 




8188 

8338 

8430- 

-8432 






fetal 

Invitrogen 

FMS001 

3-5 

90 94 

266-268 2 

30-282 290 

416-418 4 

32-433 

muscle 



447 

527-529 608-609 

636 648 650-651 764 

860- 




861 

973 1213-1215 1227-1243 1287-1289 1297- 




1302 

1309- 

-1313 

1337 

-1338 

1354 

-1355 

1439 

-1441 




1481 

-1487 

1514- 

-1515 

1529- 

-1530 

1607- 

•1608 

1619- 




1620 

1626- 

-1628 

1660 

-1669 

1673 

-1674 

1691 

-1692 




1702 

-1710 

1728 

1817 

-1819 

1827 

-1830 

1841 

-1842 




1844 

-1848 

1877 

1907 

-1911 

1998 

-1999 

2005 

-2008 




2020 

-2027 

2037- 

2044 

2053- 

-2058 

2063 

2067 

2U2 




2124 

-2126 

2148- 

2152 

2158- 

•2162 

2225 

2249 

-2250 




2266 

-2271 

2300- 

2301 

2315- 

-2322 

2332- 

2333 

2340 




2408 

-2415 

2420- 

2423 

2440- 

-2441 

2482- 

2486 

2513 




2543 

2616- 

2617 

2668 

2671 

2698 

-2703 

2713 

-2715 

- 



2806 

-2809 

2836- 

2838 

2856- 

•2859 

2874 

2915 

2918- 




2930 

2953- 

2959 

3024- 

-3025 

3066 

3097- 

3101 

3156 




3217 

3221- 

3226 

3346- 

-3348 

3454 

3464- 

3468 

3605 




3616 

3633 

3637- 

3638 

3653- 

•3655 

3683- 

3686 

3696 




3722 

3730 

3797- 

3801 

3841- 

•3854 

3861- 

3865 

3878- 




3879 

3920- 

3921 

3926 

3946- 

-3949 

3986- 

3987 

4080- 




4081 

4107- 

4109 

4213- 

-4215 

4232 

4294- 

4295 

4361- 




4362 

4373 

4422- 

4424 

4429 

4484 

4510- 

4511 

4614 




4676 

4721- 

4723 

4777 

4925- 

4926 

4999- 

5001 

5015- 




5017 

5060 

5080- 

5081 

5250- 

5254 

5343 

5389 

5393- 




5395 

5425- 

5426 

5430 

5452- 

5459 

5475- 

5482 

5515- 




5516 

5646 

5705- 

5709 

5714 

5723- 

-5729 

5764 

5783- 




5791 

5845- 

5847 

5925- 

-5930 

5945- 

•5948 

5988- 

-5992 




5999 

-6000 

6008 

6013 

6128- 

6130 

6268 

6305- 

-6307 




6359 

-6360 

6385- 

6388 

6491- 

6499 

6552- 

6554 

6616 




6646 

-6649 

6688- 

6689 

6695- 

6696 

6776- 

6779 

6785 




6805 

-6807 

6853- 

6854 

6907- 

6908 

6941- 

6942 

7045 




7159 

-7160 

7179- 

7184 

7191 

7328 

7398 

7472 

7551- 




7554 

7570- 

7573 

7630- 

7633 

7693- 

-7696 

7723- 

-7727 




7770 

-7772 

7864 

7867- 

-7868 

7923 

7953- 

7954 

7961 




8014 

-8015 

8031- 

8033 

8194- 

8196 

8227- 

8229 

8298- 




8299 

8306- 

8308 

8397- 

•8398 

8430- 

8432 



cecal 

Invitrogen 

FMS0O2 

3-5 144-14 

5 1000-1001 1619-1620 1827-1830 

muscle 



1870- 

-1874 

1876 

2003- 

2004 

2225 

2262- 

2264 

2268- 




2271 

2389- 

2390 

2630- 

2633 

2731 

3138- 

3141 

3163- 




3164 

3217 

3346- 

3348 

3620- 

3622 

3905- 

3 909 

5409 




6552 

-6554 

8397- 

8398 

8430- 

8432 




fetal skin 

Invitrogen 

FSK001 

3-5 8-9 14 

-21 3 

8 55 

67 85 

90 105-107 114-115 




140-142 144-145 

193- 

194 201-209 253 

-254 

259- 




260 266-27 

0 296 

-297 

304 3 

07-320 345 

-349 

355 




366-369 37 

1-373 

390- 

394 4 

68 476-477 

480- 

-482 




499-500 514 527 

-531 

543-548 586-587 

597- 

-598 




648 650-65 

1 671 

-672 

685-686 706 714 

-715 

726- 


!. O t Q^I! NhB MK3 , O IS a 6. 0 H 


727 733-735 763 831-833 875-876 920 941-942 
949-951 973 979-980 989 1003 1005-1007 1011- 
1012 1033-1035 1070-1071 1110-1111 1122-1147 
1160-1161 1213-1221 1227-1243 1286-1293 1300- 
1302 1311-1315 1341-1347 1349-1355 1359-1367 
1374-1377 1391 1415-1420 1439-1444 1455-1459 
1466-1470 1478-1480 1485-1489 1511-1515 1518- 
1520 1531-1537 1540-1541 1561-1567 1579-1580 
1600-1602 1607-1608 1610 1619-1620 1635 1637- 
1641 1655-1657 1659-1669 1671-1672 1711 1718 
1726-1728 1730 1770-1773 1780-1785 1791-1795 
1806-1810 1813-1815 1827-1831 1839-1840 1850- 
1851 1857-1860 1870-1874 1881-1882 1890-1893 
1896 1907-1912 1921-1927 1958-1961 1964-1966 
1972 1980-1984 1998-1999 2003-2008 2016-2017 
2020-2027 2059-2060 2062-2067 2082 2108-2110 
2il2 2124-2126 2147 2156-2157 2165-2173 2204- 
2207 2209-2217 2223-2224 2249-2250 2268-2271 
2279-2280 2282-2283 2291-2294 2315-2322 2332- 
2333 2340 2353-2357 2389-2390 2407 2417-2419 
2428-2429 2440-2441 2445-2451 2468 2491 2493- 
2494 2500-2508 2522-2528 2549-2550 2567-2568 
2570 2572 2589-2590 2603-2611 2619-2622 2626- 
2629 2636-2643 2653-2654 2656-2663 2697-2703 
2713-2715 2731 2739-2749 2794 2796 2806-2809 
2816 2823-2825 2836-2838 2842-2848 2850-2851 
2856 2867 2875-2878 2880 2904 2913-2915 2918- 
2934 2940-2946 2948-2950 2963-2971 2974-2979 
3014-3016 3018-3025 3043 3045-3053 3066 3080- 
3085 3097-3101 3108-3110 3122-3132 3138-3141 
3149-3150 3156-3158 3165-3172 3215-3217 3250 
3261 3280-3284 3287-3289 3305-3307 3311-3312 
3371-3372 3383 3399 3410 3414-3415 3435-3437 
3454 3487-3488 3520 3528-3530 3537-3539 3551- 
3553 3564-3565 3570-3587 3602-3610 3620-3622 
3631-3633 3637 3675-3677 3683-3692 3696 3707- 
3713 3728 3740-3747 3760 3791 3794-3795 3802- 
3803 3888-3891 3910-3921 3926 3931-3936 3961 
3983-3987 3990 4000 4015-4018 4022-4026 4035- 
4040 4043 4048-4051 4061-4062 4070-4072 4092- 
4095 4107-4109 4151 4163 4177 4191-4193 4223- 
4224 4242-4245 4252 4255 4289 4294-4297 4334 
4349-4350 4386-4387 4394 4420-4421 4430-4431 
4436-4438 4484 4504-4506 4512-4518 4544-4546 
4614 4618 4626-4627 4636-4639 4666-4667 4674- 
4676 4721-4723 4737-4741 4765-4767 4774 4790 
4793-4794 4840 4900 4910-4918 4925-4926 4936 
4941 4943 4949 4957-4958 4968-4970 4999-5001 
5015-5017 5019-5033 5061 5065-5075 5080-5081 
5084-5097 510$ 5110 5134 5168-5173 5180-5183 
5228 5236-5243 5284-5285 5297 5315 5323 5356- 
5358 5364 5372-5373 5389 5393-5401 5406-5408 
5413-5414 5416 5423 5425-5426 5428 5431-5437 
5466-5467 5475-5478 5483 5500-5501 5517-5518 
5526-5529 5561 5565-5566 5585 5596 5635 5639- 
5641 5680-5682 5686 5697 5714 5723-5729 5731- 
5735 5748-5753 5765-5768 5825-5828 5839 5873 


1 0-7*3 


2 iSHfi. O 


H! 


fetal skin 


Invitrogen 


FSK002 


5896- 

6011- 

6131 

6234 

6414- 

6548- 

6668 

6804 

6998 

7153 

7265 

7375 

7490- 

7573 

7693- 

7780- 

7907 

8110- 

8357- 

8460- 


5898 

6013 

6142 

6236- 

6417 

6549 

6688- 

6828- 

7000- 

7181- 

7276- 

7404 

7492 

7599- 

7696 

7790 

7923 

8113 

8358 

8461 


5920 

6044- 

6159 

6241 

6424« 

6560- 

6689 

6835 

7002 

7184 

7277. 

7424- 

7496 

7600 

7702 

7793 

7998- 

8162 

8383- 

8490- 


-5921 
•6045 
6182 
6291- 
6425 
6562 
6691 
6849 
7044 
7188- 
7303- 
7425 
7539- 
7623 
7723- 
7797 
8002 
8185 
8385 
8495 


5945- 

6077 

6191 

6292 

6455- 

6634- 

6739- 

6858 

7098- 

7191 

7305 

7461 

7549 

7630- 

7727 

7859- 

8027- 

8211 

8399 


-5949 
6085- 
6198- 
6297- 
6459 
6638 
6744 
6909 
7102 
7196- 
7310- 
7463 
7551- 
7633 
7731 
7860 
8028 
8227- 
8429- 


5986 

6086 

6199 

6298 

6462 

6653- 

6776- 

6967- 

7116- 

7197 

7311 

7474 

7554 

7638- 

7743 

7887 

8031- 

8229 

8432 


6003 

6119 

6203 

6385 

6464- 

6660 

6780 

6969 

7132 

7251- 

7321 

7480 

7568 

7639 

7770- 

7890- 

8033 

8309- 

8446- 


6005 

6120 

6208 

6388 

6466 

6662- 

6799- 

6997- 

7151- 

7259 

7373- 

7483 

7570- 

7689 

7772 

7891 

8039 

8312 

8447 


144-145 260 283-284 
407 425 552-555 580 
1111 1148-1149 1177 
1347 1369-1370 1488- 
1710 1781-1783 i854- 
1916 1930-1932 2138 
2417-2419 2502-2508 
2658-2660 2665-2667 
2804 2850-2851 2857- 
2978-2979 3014-3016 
3250 3412-3413 3507 
3788-3789 3841-3849 
3988-3989 4009-4011 
4081 4266 4629-4634 
4955 4957-4958 5236- 
5687-5690 5714 5757- 
6179 6182-6191 6455 
7004 7432 7474 7570- 
8031-8033 8414-8416 


307-320 330-340 351-352 
-581 909-912 1016 1110- 
•1178 1185-1186 1334-1335 
•1489 1505-1506 1659 1702- 
1856 1870-1874 1894-1896 
2141 2174-2175 2341-2343 
2543 2558 2603-2609 2636 
2682-2683 2698-2703 2801- 
2859 2889-2893 2931-2934 
3071 3138-3141 3165-3172 
3728 3733-3739 3748-3752 
3861-3865 3938-3941 3956 
4020-4021 4048-4051 4080- 
4783 4840-4842 4873-4876 
5240 5448-5459 5621-5622 
5758 5964-5966 6069-6073 
6743-6744 6911-6915 7001- 
7573 7720-7721 7955-7957 
8459 


fetal 
spleen 


BioChain 


FSP001 


546-548 625-626 790-792 920 1022-1023 1357- 
1358 1415-1420 1455-1459 1511-1513 1549-1551 
1602 1679 1896 2003-2004 2223-2224 2453 3115- 
3116 3163-3164 3274-3275 5416 6923-6924 7446 
8430-8432 


umbilical 
cord 


BioChain 


FUC001 


3-5 14-19 26-28 42- 
140-142 144-145 164 
221-222 296-297 307 
418 456-458 460-462 
527-531 533 537 541 
588-591 597-598 650 
786-787 793-801 803 
951 1003 1022^1023 
1111 1160-1161 1175 
1227-1247 1286-1289 
1313 1337-1340 1354 
1408-1409 1424-1431 


43 48 93 105 114-115 128 
-169 193-194 196 213-215 
-320 355 370-372 387 416- 

488-495 499-500 515-516 
-542 549-551 556 579 583 
-651 714-715 735 757-758 
-804 855-856 860-861 949- 
1033-1035 1064-1065 1110- 
-1176 1200-1209 1213-1215 

1300-1302 1306-1308 1311- 
-1355 1359-1368 1374-1390 
1439-1441 1465 1478-1480 


1485-1487 1529-1530 1540-1548 1552-1555 1557- 
1561 1572-1577 1583-1595 1600-1602 1607-1608 
1610-1615 1619-1620 1637-1642 1658-1659 1702- 
1710 1712-1713 1715 1718 1774-1776 1781-1783 
1801-1804 1809-1810 1813-1815 1820-1821 1831- 
1834 1844-1852 1857-1860 1870-1874 1876-1877 
1881-1882 1896 1907-1912 1916 1921 1929 1934- 
1935 1962-1963 1972 1980-1984 1986-1991 2003- 
2008 2014-2015 2018-2019 2030-2031 2036-2037 
2050-2052 2059-2060 2063-2066 2071-2075 2081 
2103-2107 2112 2114-2121 2138-2141 2156-2157 
2165-2170 2185 2199-2200 2209-2217 2223-2224 
2228 2262-2264 2266-2267 2279-2280 2297^2299 
2315-2322 2324-2327 2341-2343 2353-2357 2362- 
2364 2370-2372 2381-2386 2404-2405 2407-2415 
2417-2419 2430-2434 2453 2455 2458-2461 2479- 
2480 2482-2486 2489-2490 2493-2494 2515-2516 
2536-2537 2551-2553 2558 2567-2568 2570-2571 
2585-2586 2619-2622 2626-2633 2656-2660 2665- 
2667 2689-2697 2704-2711 2716-2719 2722-2724 
2732-2738 2750-2755 2778-2783 2794 2796 2810- 
2811 2814-2815 2817 2823-2825 2835-2838 2842- 
2848 2857-2859 2912 2915 2918-2930 2935-2946 
2948-2959 2974-2979 3018-3025 3037-3039 3049- 
3053 3065-3066 3073-3078 3080-3085 3088-3091 
3097-3105 3133-3136 3138-3147 3149-3150 3161- 
3162 3215-3217 3220 3227-3228 3250 3262-3275 
3279-3284 3287-3292 3301 3306-3307 3313-3314 
3349-3354 3357 3373-3374 3376-3380 3400-3401 
3410-3415 3419-3421 3424 3431-3433 3450-3452 
3454 3469-3470 3487-3488 3507 3551-3553 3555- 
3557 3560 3564-3567 3602-3605 3611-3615 3620- 
3622 3631-3633 3637 3642-3643 3646-3652 3658 
3660-3663 3669-3672 3675-3679 3699-3701 3717- 
3722 3724-3726 3728 3733-3739 3742-3747 3761- 
3766 3785-3787 3850-3854 3888-3891 3894-3895 
3901-3904 3931-3936 3946-3949 3966-3968 3986- 
3989 4015-4019 4022-4040 4042 4047-4051 4061- 
4062 4067-4069 4092-4095 4107-4109 4128-4129 
4165-4166 4169 4177 4180-4183 4190-4195 4202- 
4204 4230-4231 4233-4240 4242-4245 4247 4285 
4289 4294-4295 4360 4363-4364 4366-4367 4370 
4395-4396 4422-4424 4427 4430-4431 4436-4438 
4445-4455 4480-4484 4491-4494 4498-4503 4512- 
4513 4536-4538 4553 4561-4562 4609 4644-4645 
4688-4690 4692-4694 4719 4724-4727 4729 4753- 
4755 4786-4787 4795 4822 4873-4876 4883-4884 
4917-4918 4925-4926 4936 4946-4948 4957-4958 
4972-4976 5008-5012 5015-5017 5019-5031 5036- 
5038 5054 5060 5107-5109 5139-5140 5151 5159- 
5161 5168-5179 5202 5205 5208 5213 5241-5243 
5268-5272 5275-5278 5280-5282 5288 5294 5323 
5331 5336 5340 5349-5351 5406-5408 5415 5423 
5425-5426 5448-5459 5466 5472 5475-5478 5492- 
5493 5553 5559-5561 5564 5579-5581 5594 5597 
5631 5658-5660 5673 5677 5693-5695 5731-5735 
5757-5760 5775 5798-5805 5809-5814 5818-5821 
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-2242 

2255 

-2257 
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-2264 

2266- 

2271 

2276 

-2278 

2291- 

2294 

2297- 




2299 

2302 

-2305 

2307- 

2314 

2328 

-2333 

2340 

-2343 




2353 

-2357 

2370- 

2372 

2378 

2388- 

-2390 

2398 

-2403 




2417 

-2419 

2433- 

2438 

2440 

-2441 

2445- 

2451 

2455 




2482 

-2488 

2493- 

2494 

2511 

2513 

2515- 

2520 

2526- 




2528 

2543 

2549- 

2553 

2557 

-2560 

2567- 

2568 

2570- 




2573 

2583 

-2596 

2603- 

2609 

2626- 

-2629 

2644 

-2647 




2655 

-2660 

2665- 

2667 

2671 

2682- 

-2688 

2697 

2711 




2720 

2722 

-2724 

2731- 

2742 

2744- 

-2749 

2772 
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2794 2796 2799-2804 2810-2815 2818-2820 2823- 
2835 2842-2848 2850-2851 2889-2901 2908-2911 
2917-2930 2935-2939 2948-2962 2974-2979 3014- 
3016 3036-3039 3043 3045-3048 3058-3063 3065 
3067-3074 3079-3121 3133 3137-3141 3149-3172 
3175-3177 3182-3183 3221-3226 3253-3258 3262- 
3269 3274-3284 3287-3292 3305-3314 3324-3332 
3334-3344 3346^3354 3357 3361-3366 3373-3375 
3384-3385 3399 3410 3425-3428 3438-3452 3454 
3473-3476 3481-3485 3497 3508-3511 3517-3519 
3521 3524-3530 3551-3560 3568-3569 3588-3590 
3602-3610 3612-3615 3617-3619 3638 3642-3652 
3660-3663 3667-3679 3683-3686 3704-3705 3728- 
3729 3733-3739 3748-3752 3767-3781 3788-3789 
3794-3795 3798-3801 3804-3805 3850-3854 3861- 
3868 3888-3891 3894-3895 3897-3904 3931-3936 
3938-3941 4015-4018 4022-4026 4035-4040 4042- 
4043 4047-4060 4065-4089 4102-4106 4110-4118 
4128-4129 4131 4137-4138 4143-4144 4147 4163 
4166-4167 4173 4180-4183 4191-4193 4203-4207 
4229 4242-4245 4253 4264 4266 4286-4292 4330 
4338 4343 4363-4364 4369 4378-4380 4394 4397- 
4429 4436-4438 4441-4455 4484 4491-4494 4502- 
4504 4508-4509 4520-4535 4550 4554-4555 4566 
4611 4613-4614 4635 4644-4645 4648-4652 4662 
4666-4670 4678 4685-4692 4702 4706-4709 4718 
4721-4723 4729 4771-4777 4791-4792 4806-4809 
4812-4815 4817-4821 4873-4876 4910-4916 4919 
4936-4939 4949-4979 4984-4985 4989 4993-4994 
5005-5007 5014 5019-5021 5032-5033 5060 5076- 
5079 5101 5107-5110 5112-5113 5129-5130 5134 
5139-5140 5146 5161-5163 5168-5173 5177-5179 
5208 5213 5216-5220 5236-5243 5250-5255 5268- 
5272 5275-5278 5284-5285 5288-5292 5310 5323 
5334-5336 5340 5343 5435-5437 5446-5451 5460- 
5466 5469-5471 5492-5493 5503 5520-5522 5546- 
5553 5580-5581 5594 5596 5598 5618 5652 5656- 
5672 5674-5690 5692-5697 5715-5718 5721 5757- 
5758 5798-5805 5825-5834 5856-5863 5875-5881 
5896-5898 5904-5914 5920-5921 5932 5945-5948 
5950-5952 5956-5960 5984-5986 5988-5992 5999- 
6000 6003-6005 6027 6043 6046 6048-6068 6087- 
6090 6112-6113 .6119-6120 6123-6127 6159 6174- 
6176 6192-6199 6213-6217 6226 6243 6276-6278 
6285 6291-6292 6316-6321 6350 6353-6355 6359- 
6360 6365-6366 6385-6388 6392-6394 6420-6431 
6443-6444 6448 6464-6472 6511-6521 6525-6526 
6555-6556 6634-6638 6665-6668 6670-6676 6688- 
6689 6695-6696 6701-6702 6706-6708 6711 6718- 
6721 6724-6725 6751-6756 6766-6767 6776-6779 
6786-6804 6828 r 6832 6840 6846-6848 6853-6854 
6861 6872-6874' 6911-6917 6923-6936 6943-6945 
6957 6977-6978 6990-6992 7003-7004 7014-7027 
7050-7053 7056-7059 7080-7082 7086-7087 7104 
7108-7110 7135-7144 7147-7158 7174-7X75 7179- 
7180 7247 7261 7274-7275 7279-7280 7283-7284 
7303-7304 7310-7311 7324 7327 7366-7367 7377- 
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3085 3088-3091 3093 3097-3105 3108-3110 3115- 
3121 3133 3137-3141 3149-3150 3152-3155 3157- 
3158 3165-3214 3217 3227-3228 3262-3269 3274- 
3275 3279-3284 3287-3289 3296-3297 3305-3312 
3334-3348 3375 3386-3389 3399 3402-3408 3416- 
3418 3424 3438-3452 3454 3464-3468 3491-3493 
3497 3507-3519 3522-3524 3551-3553 3566-3567 
3597-3599 3602-3610 3617-3619 3623-3628 3631- 
3632 3642-3643 3646-3652 3658 3667-3668 3675- 
3677 3693-3694 3696 3699-3701 3705 3707-3713 
3717-3721 3728 3733-3737 3743-3747 3753-3759 
3769-3770 3775 3781-3789 3798-3801 3840-3849 
3861-3865 3868 3876-3879 3885 3892-3893 3896- 
3900 3904 3910-3919 3926-3936 3983-3985 4009- 
4013 4042-4043 4047-4051 4053 4057 4065-4066 
4074-4081 4089-4109 4128-4129 4131-4132 4143- 
4144 4163 4169 4177 4185 4189 4191-4193 4218 
4223-4224 4232 4242-4245 4266 4288-4290 4293 
4329-4330 4339 4344-4347 4349-4350 4382-4384 
4386-4387 4395-4396 4425 4428 4430-4435 4484 
4490 4496 4498-4501 4508-4518 4520-4528 4538- 
4541 4551-4552 4554-4555 4559 4565 4567 4603 
4626-4627 4629-4634 4666-4670 4672-4673 4675- 
4676 4680 4702-4703 4706-4709 4724-4727 4729 
4734 4760-4762 4773 4775-4776 4778-4779 4791- 
4792 4795 4806-4809 4811 4830-4836 4877 4889 
4892-4898 4909 4919 4924 4929 4937-4939 4949 
4955 4959 4971-4976 4980 4982-4989 4991-4994 
5005-5007 5034-5035 5054 5059 5065-5081 5084- 
5100 5107-5109 5134 5139-5140 5147 5161 5177- 
5183 5191-5192 5205 5208 5226-5227 5255 5258- 
5262 5264 5275-5278 5280-5282 5288 5290-5292 
5298 5318 5346-5347 5361-5362 5364 5366 5377 
5379 5389 5409 5416 5425-5426 5431-5437 5448- 
5459 5463-5464 5467-5473 5580-5581 5597 5616- 
5617 5621-5622 5644-5645 5652 5657 5660 5662- 
5667 5683 5691 5693-5695 5697-5713 5715-5730 
5748-5753 5757-5758 5780-5782 5798-5805 5809- 
5814 5823-5834 5856-5863 5875-5885 5904-5917 
5932 5944 5950-5952 5986 5988-5994 6003-6005 
6008 6016-6020 6046 6069-6075 6080-6081 6119- 
6120 6122-6127 6159 6174-6176 6180-6191 6200- 
6210 6213-6217.6236-6241 6243-6246 6272 6274 
6359-6360 6385-6388 6398-6399 6419 6432-6434 
6443-6444 6447 6476-6479 6483-6484 6491-6499 
6522-6527 6548-6549 6634-6638 6653-6660 6662 
6669-6676 6697 6701-6702 6711-6712 6724-6725 
6727-6728 6742 6759-6761 6766-6767 6776-6780 
6786-6804 6828-6832 6850-6854 6858 6890-6894 
6907-6908 6911-6915 6920-6921 6923-6924 6937 
6940 6946-6949 7050-7053 7056-7059 7063-7067 
7088-7090 709^-7102 7104-7106 7135 7148-7149 
7151 7159-7170 7174-7175 7215 7231-7233 7248- 
7249 7260 7274-7275 7292 7305 7392-7393 7398 
7405 7412-7414 7424-7425 7428-7429 7437-7438 
7461 7470 7474 7539-7549 7566 7570-7573 7587- 
7589 7595-7598 7601-7608 7630-7633 7672-7675 
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7630-7633 7662-7666 7712-7713 7794-7796 7798 
7801 7859-7860 7916 7923 7955-7957 7970-7971 
8008-8009 8031-8033 8062-8063 8227-8229 8277- 
8279 8397-8398 8405 8430-8432 8490-8495 8499- 
8500 


infant 
brain 


Soares. 


IBM002 


565-575 592 781 1197-1199 1394-1395 1439-1441 
1913 2067 2156-2157 2223-2224 2369 2603-2609 
2612-2615 2626-2629 2684-2688 2797 3029-3031 
3045-3048. 3305 3410 3605 3696 3861-3865 4053 
4163 4169 4253 4378-4380 4955 4980 4982 5288 
5379 5521-5522 5585 5721 5780-5782 5988-5992 
6203 7260 7398 7424-7425 7470 7598 8430-8432 


infant 
brain 


Soares 


IBS001 


144-145 172-175 280-282 307-320 384-385 681- 
683 883 1033-1035 1070-1071 1287-1289 1439- 
1441 1607-1608 1770-1773 1801-1804 1912 2016- 
2017 2158-2162 2223-2224 2255-2257 2369 2404- 
2405 2549-2550 2557 2626-2629 2637-2641 2658- 
2660 2850-2851 2953-2959 2963-2971 3043 3045- 
3048 3080-3085 3122-3132 3250 3346-3348 3551- 
3553 3606-3610 3683-3686 3696 3699-3701 3717- 
3721 3769-3770 3840 4009-4011 4080-4081 4185 
4189 4253 4429 4484 4551-4552 4666-4670 4703 
4729 4795 4980 5076-5079 5107-5109 5157 5364 
5393-5395 6069-6073 6075 6209-6210 7050-7053 
7161 7168 7260 7265 7292 7327 7375 7717 7743 
7923 8031-8033 8242-8243 8274-8275 8455-8456 
8498 


lung 

fibroblast 


Stratagene 


LFB001 


8-9 55 121 132-137 140-142 172-175 197-209. 
233-237 296-297 345-349 361 373 416-418 430- 
431 474 534-535 546-548 559-560 645 650-651 
803-804 857-859 941-942 1216-1221 1246-1248 
1297-1299 1306-1308 1348 1368 1394-1395 1399- 
1401 1413-1414 1425-1431 1439-1441 1453-1454 
1465 1481-1487 1540-1541 1557-1561 1568-1570 
1575-1577 1583-1585 1600-1602 1612-1615 1619- 
1620 1630-1632 1655-1657 1671-1672 1691-1692 
1716. 1726-1728 1774-1776 1801-1804 1857-1860 
1890 1896 1903 1912 1916 1955-1957 1967-1968 
1974-1978 1986-1991 2005-2008 2018-2019 2037 
2050-2052 2063 2079 2081 2087-2094 2129-2136 
2138-2141 2156-2157 2165-2170 2182-2183 2241- 
2242 2255-2257 2268-2271 2315-2322 2353-2357 
2385-2386 2389-2390 2404-2405 2407-2415 2417- 
2419 2428-2429 2458-2461 2470 2482-2488* 2511- 
2512 2515-2516 2558 2567-2568 2587-2588 2653- 
2654 2656-2660 2722-2724 2731-2738 2784-2786 
2794 2796-2797 2799-2800 2814-2816 2826-2834 
2836-2838 2850-2851 2879 2889-2893 2935-2939 
2953^2959 2978-2979 3014-3016 3054-3057 3066 
3071 3095-3096 3138-3141 3149-3150 3157-3158 
3253-3258 3270-3275 3279 3290-3292 3306-3307 
3313-3314 3324-3332 3349-3352 3375 3425-3428 
3454 3469-3470 3473-3476 3548-3553 3555-3557 
3560 3564-3565 3602-3610 3612-3615 3620-3622 
3633 3673-3674 3678-3679 3707-3709 3717-3722 
3724-3726 3728 3730 3742 3748-3752 3769-3770 


lung tumor 


Invitrogen 


LGT002 


3776-3780 3888-3891 3896 3904 3910-3919 3991- 
3998 4012-4013 4015-4018 4020-4021 4027-4041 
4048-4051 4070-4072 4107-4109 4132 4167 4170- 
4171 4180-4181 4194-4195 4253 4286 4294-4295 
4336 4349-4350 4385 4422-4424 4436-4438 4441- 
4443 4445-4447 4484 4644-4645 4648-4652 4889 
4922 4924 5015-5017 5019-5021 5036-5038 5147 
5250-5254 5268-5272 5302-5305 5350 5396-5401 
5409 5415 5466 5497 5526-5529 5547-5552 5631 
5658-5659 5677 5719-5720 5798-5805 5809-5817 
5853-5855 5875-5881 5920-5921 5932 5937-5938 
5949 5>999-6000 6131 6133-6134 6213-6217 6291- 
6292 6297-6298 6439 6448 6452-6453 6464-6466 
6560-6562 6608-6610 6663-6664, 6691 6742 6762- 
6763 6808-6809 6836-6838 6944-6945 6990-6992 
7014-7027 7162-7165 7281-7282 7297 7327 7334 
7377-7384 7446 7461 7686-7687 7807-7808 7845 
8086-8087 8129 8261-8264 8291-8292 


8-9 20-21 49-50 75-76 83 102-105 114-115 121 
140-145 164-169 180-183 185 196 201-209 213- 
215 230 233-237 250-251 253-254 264-265 273- 
276 279 283-284 286-287 298-299 307-320 330- 
340 366-369 375 395 416-418 441-445 452-453 
456-458 460-462 465 468 476-477 488-494 504- 
512 527-531 546-548 557 576-578 608-609 645 
656-658 705 707-708 712-713 767 809 814-821 
831-833 883 930 941-942 952 973 989 992-994 
1005-1007 1011-1012 1022-1023 1037 1039-1043 
1064-1111 1148-1149 1160-1161 1181-1183 1194- 
1205 1213-1215 1227-1243 1249-1253 1261-1263 
1275-1276 1287-1293 1297-1308 1311-1313 1316- 
1317 1324-1326 1339-1340 1347 1357-1358 1368- 
1370 1394-1401 1410-1412 1445-1450 1455-1462 
1466-1470 1472-1487 1490-1491 1505-1506 1514- 
1515 1525-1530 1540-1548 1557-1567 1571 1583- 
1595 1602 1610 1616-1617 1624-1625 1637-1642 
1653-1659 1673-1674 1676-1678 1694 1702-1710 
1716 1718-1721 1726-1727 1764-1767 1774-1776 
1780-1795 1798-1800 1806-1810 1813-1815 1817- 
1818 1823-1825 1831 1850-1851 1861-1863 1870- 
1874 1877 1885-1889 1894-1895 1904-1905 1907- 
1913 1916 1921-1927 1929 1934-1935 1944-1945 
1950-1953 1955-1966 1974-1978 1980-1991 2001- 
2002 2005-2008 2018-2019 2028 2032-2035 2037 
2046-2052 2063-2073 2079 2081-2085 2087-2097 
2112 2114-2119 2123 2137 2143-2146 2148-2152 
2155-2162 2165-2173 2184 2199-2200 2209-2217 
2223-2224 2226-2227 2237-2240 2262-2264 2268- 
2271 2276-2278 2300-2301 2315-2322 2324-2327 
2337-2339 2341-2343 2353-2357 2370-2372 2374 
2378 2381-2387.. 2398-2403 2407-2419 2433-2434 
2436-2438 2453' 2455 2464-2467 2482-2486 2511 
2514-2516 2521 2526-2531 2544-2550 2558 2561- 
2565 2574-2580 2585-2590 2592-2596 2616-2618 
2626-2633 2636-2641 2648-2654 2658-2660 2665- 
2667 2671-2678 2689-2691 2704-2711 2716-2720 
2722-2727 2731-2742 2757 2788-2794 2796 2817 
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2826-2835 2842-2848 2857-2863 2867-2879 2889- 
2901 2908-2911 2913-2915 2917 2940-2946 2948- 
2971 2973 2978-3025 3032 3034-3036 3043 3045- 
3048 3054-3057 3066-3072 3075-3078 3088-3091 
3097-3101 3111-3133 3137-3141 3149-3150 3163- 
3172 3188-3214 3218-3219 3227-3228 3250-3258 
3262-3269 3274-3275 3279-3284 328.7-3292 3306- 
3307 3311-3312 3324-3332 3334-3344 3346-3354 
3358 3361-3366 3386-3389 3399-3409 3411-3421 
3424 3435-3449 3454 3464-3470 .3473-3474 3477- 
3480 3487-3488 3491-3493 3497-3499 3507-3511 
3517-3519 3525-3530 3537-3543 3551-3553 3555- 
3560 3566-3569 3597-3601 3605-3610 3612-3628 
3633-3636 3638-3640 3642-3643 3660-3663 3675- 
3677 3683-3686 3697-3698 3714-3722 3724-3726 
3728 3730 3738-3739 3743-3759 3761-3766 3776- 
3781 3785-3789 3791 3796 3841-3854 3869-3870 
3881-3884 3888-3891 3896 3901-3904 3920-3923 
3926-3927 3950-3955 3958-3961 3983-3987 3990- 
3998 4001-4008 4012-4013 4015-4018 4035-4040 
4042 4048-4051 4054 4061-4062 4070-4072 4080- 
4085 4092-4095 4102-4109 4143-4144 4153 4163 
4165-4167 4170-4171 4173 4182-4183 4203-4207 
4209 4223-4224 4230 4233-4240 4252 4255 4267- 
4268 4272-4273 4277 4291 4344-4347 4352 4361- 
4362 4368-4369 4378-4380 4386-4387 4392 4415- 
4417 4420-4421 4429 4436-4438 4441-4443 4480- 
4484 4505-4507 4512-4518 4529-4535 4553 4556- 
4559 4561 4566 4573-4602 4604 4607-4609 4624 
4628 4644-4645 4659-4660 4662 4676 4692 4698 
4715-4717 4721-4727 4729 4731-4732 4736 4742 
4744 4760-4762 4765-4772 4780-4782 4791-4792 
4795 4817 4830-4836 4847 4873-4876 4882 4885- 
4890 4906 4925-4926 4943 4950-4953 4955 4957- 
4958 4962-4967 4972-4976 4978 4999-5012 5019- 
5021 5032-5033 5036-5038 5060 5065-5072 5076- 
5079 5084-5097 5107-5109 5129-5130 5134 5136 
5147 5155-5156 5161 5168-5173 5177-5179 5200 
5202 5205 5224 5236-5245 5250-5254 5258-5262 
5289 5313-5314 5356-5359 5361-5364 5374 5392- 
5401 5406-5408 5411 5413-5414 5416 5435-5440 
5443-5445 5448-5459 5472 5474 5485-5488 5492- 
5493 5504-5506 5553-5558 5565-5566 5580-5581 
5588 5590-5592 5594 5597-5598 5616-5617 5621- 
5622 5629 5631 5639-5641 5652 5658-5661 5664- 
5667 5673-5676 5683 5687-5690 5694-5695 5697- 
5701 5723-5729 5757-5758 5765-5768 5797-5805 
5809-5817 5839 5851-5855 5882-5885 5892-5893 
5932 5937-5938 5940-5941 5971-5980 5986 5988- 
5992 6001-6005 6027 6038-6042 6047 6058-6060 
6065-6068 6075-6076 6112-6113 6117-6120 6123- 
6127 6131 6137-6138 6155-6156 6172-6177 6180- 
6194 6213-6217 6234 6236-6241 6243 6260-6262 
6291-6292 6297-6299 6303-6304 6309-6311 6334 
6338-6339 6348 6353-6355 6361-6362 6385-6388 
6395 6399-6401 6414-6417 6424-6425 6435-6438 
6456-6459 6464-6466 6512-6513 6527 6548-6549 
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6923- 
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7150- 

7190 
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7812 
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8405 
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-6599 
6660 
6729 
6776 
6858 
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6992 
7053 
7153 
7198 
7275 
7355 
7416 
7531 
7613 
7675 
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7957 
8039 
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8413 
8466 


6613 
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6742 
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6182- 

6191 

6291- 

6292 

6381 

6456 

-6459 

6483 

-6484 

6608- 

6610 

6705 

6743 

-6744 

6766 

-6767 

6776 

-6779 

6799- 

6804 

6878 

6887 

-6889 

6916 

-6917 

6941 

-6942 

7009- 

7011 

7014- 

7027 

7279- 

7280 

7474 

7500 

7505 

7697- 

7700 

7912 

7928 

8031- 

8033 

8300 

8302 

-8308 

8397- 

8398 

8430- 

8432 

8439- 

8440 



melanoma 
from cell 
line ATCC 
#CRL 1424 


Clontech 


MEL004 


102-103 121 140-142 193- 
382 403-406 416-418 527- 
650-651 661-665 670 779- 
974 1005-1007 1022-1023 
1246-1248 1294-1299 1374 
1409 1415-1420 1425-1431 
1502 1511-1513 1529-1530 
1575-1577 1592-1595 1619 
1680-1690 1702-1710 1774 
1854-1856 1864-1869 1912 
1962-1963 1967-1968 2005 
2075 2081 2086 2114-2119 
2200 2315-2322 2337-2340 
2389-2390 2408-2415 2440 
2493-2494 2511 2522-2525 
2625 2630-2633 2653-2654 
2794 2796 2814-2815 2818 
2851 2867 2908-2911 2948 
3014-3016 3018-3023 3036 
3165-3172 3175-3177 3221 
3287-3289 3306-3307 3346 
3372 3419-3421 3487-3488 
3590 3597-3599 3605 3658 
3717-3722 3730 3733-3737 
3931-3936 3991-3998 4020 
4061-4062 4070-4072 4166 
4181 4230 4281 4348-4350 
4387 4422-4424 4436-4438 
4645 4648-4652 4674 4729 
4758 4806-4809 4825 4829 
5107-5109 5205.5226-5227 
5283 5323 5332' 5347 5390 
5529 5565-5566 5593 5664 
5718 5775 5778-5779 5849 
5993-5994 5998-6000 6147 
6467-6472 6485-6488 6519 
6663-6664 6858 6907-6908 


194 225 272 285 380- 
529 537 579 641-644 
780 806-807 811-813 
1054-1059 1160-1162 
-1379 1396-1398 1408- 
1465 1478-1484 1501- 
1540-1541 1558-1561 
1620 1622-1623 1675 
1776 1790-1795 1849 
1916 1929 1937-1942 
2009 2068-2070 2074- 
2122 2131-2136 2199- 
2353-2357 2382-2384 
2441 2445-2451 2470 
2592-2596 2603-2609 
2713-2715 2731-2742 
-2819 2826-2834 2850- 
-2950 2973 2978-2979 
3071 3095-3096 3111 
-3226 3274-3275 3279 
-3348 3353-3354 3371- 
3500-3503 3560 3571- 
3678-3679 3693-3694 
3769-3770 3861-3865 
-4026 4035-4040 4057 
4168 4170-4171 4180- 
4369 4382-4384 4386- 
4484 4538 4614 4644- 
4734 4745-4749 4757- 
4836 4889 4949 5060 
5250-5254 5258-5262 
5452-5459 5466 5526- 
-5667 5693 5697 5715- 
5886-5889 5925-5930 
6177 6343-6345 6395 
-6521 6618 6643-6644 
6916-6917 7014-7027 





7083 

7159 

-7160 

7188 

-7190 

7213 

7238 

-7240 

7308- 




7311 

7363 

7416 

-7417 

7439 

7465 

-7468 

7509- 

-7510 




7551 

-7555 

7797 

7843 

7867 

-7868 

8110 

-8113 

8140 




8153 

8227 

-8229 

8392 

-8395 

8413 




mammary 

Invitrogen 

MMG001 

3-5 

8-9 20-21 

67 102-103 

105- 

107 1 

14-115 143- 

gland 

- 


145 

172-175 193-194 

201- 

209 2 

13-215 225 

253- 




254 

264-268 271 275 

-276 

280-282 285 302 

345- . 




349 

360 366-370 375 

402 

416-418 44 

1 454- 

-455 




465 

468-473 47 

5-477 

491- 

495 527-52 

9 534- 

-535 




546- 

548 552-55 

4 558 

-560 

565-5 

78 58 

3 597- 

-598 




650- 

651 676-68 

0 692 

696 

705-7 

06 712-717 

720- 




721 

732-734 742-743 

748- 

751 759-760 767 

774 




781 

805 811-821 828 

-829 

831-833 881-883 

887- 




891 

930 941-944 949 

-951 

960-961 982-983 

989 




992- 

994 1005-1007 1014 1017-1018 1032-1035 




1038 

1041- 

-1043 

1067 

-1105 

1107 

-1111 

1118- 

-1120 




1122 

-1147 

1151 

-1152 

1182 

-1183 

1185 

-1186 

1190- 




1191 

1213- 

-1221 

1244 

-1245 

1254 

-1257 

1261- 

-1263 




1265 

-1269 

1273 

-1274 

1287 

-1289 

1297 

-1305 

1311- 




1313 

1316- 

-1317 

1334 

-1335 

1347 

1354 

-1355 

1359- 




1368 

1374-1377 

1391 

1394 

-1401 

1413 

-1420 

1439- 




1450 

1455- 

-1459 

1472 

-1477 

1488 

-1489 

1498- 

-1502 




1514 

-1515 

1518 

-1520 

1525 

-1528 

1531 

-1537 

1542- 




1551 

1557- 

-1567 

1571 

-1574 

1583 

-1591 

1607-1608 




1610 

1612- 

-1615 

1619 

-1620 

1635 

1637 

-1642 

1658- 




1659 

1671- 

-1674 

1678 

1694 

1702 

-1710 

1718 

1726- 




1728 

1759- 

-1760 

1765 

-1767 

1780 

-1789 

1791- 

•1797 




1801 

-1804 

1806 

-1810 

1813 

-1815 

1827- 

-1830 

1839- 




1840 

1844- 

-1849 

1852 

1857 

-1869 

1876 

1883 

1890- 




1893 

1897- 

-1900 

1903 

1906 

-1912 

1916 

1921- 

-1927 




1930 

-1932 

1934- 

-1935 

1944 

-1946 

1958- 

-1963 

1967- 




1968 

1970- 

•1972 

1985 

-1991 

1998- 

-2000 

2003-2004 




2016 

-2027 

2053- 

-2058 

2063 

-2066 

2071- 

-2075 

2077- 




2079 

2083- 

-2085 

2103- 

-2107 

2112 

2120- 

-2121 

2124- 




2126 

2128 

2137- 

-2141 

2148 

-2154 

2158- 

-2162 

2165- 




2170 

2174- 

-2175 

2184 

2190 

-2191 

2201- 

-2217 

2223- 




2224 

2237- 

2239 

2244- 

-2248 

2255- 

-2257 

2262- 

2264 




2266 

-2271 

2276- 

-2280 

2282 

-2283 

2286- 

-2288 

2307- 




2322 

2324- 

•2328 

2332- 

-2333 

2353-2357 

2361 

2370- 




2372 

2389-2390 

2407 

2417 

-2423 

2428-2434 

2436- 




2438 

2442- 

2451 

2453 

2455 

2464- 

-2467 

2471- 

2472 




2476 

-2477 

2481- 

-2486 

2521 

-2525 

2533 

2549- 

2553 




2559 

-2565 

2567- 

:2568 

2574 

-2580 

2585- 

-2586 

2589- 




2590 

2592- 

2596 

2603- 

-2609 

2625- 

-2641 

2 648- 

2652 




2656 

-2663 

2671 

2682- 

•2683 

2698- 

-2710 

2739- 

2742 




2748 

-2749 

2757 

2772- 

-2775 

2784- 

-2786 

2797 

2805 




2807 

-2813 

2817 

2835- 

•2838 

2842- 

•2848 

2850- 

2851 




*i o c *7 
2857 

-2859 

2864- 

-2865 

2868 

-2878 

2881- 

2888 

2903 




2905 

-2911 

2913- 

•2914 

2918 

-2934 

2940- 

2946 

2948- 





2953- 

2959 

2963- 

-2971 

3014- 

•3016 

3034- 

3035 




3037 

-3043 

3045- 

•3057 

3066 

3073- 

3078 

3080- 

3085 




3102 

-3105 

3108- 

•3111 

3115- 

-3116 

3122- 

3137 

3142- 




3147 

3157- 

3158 

3165- 

•3172 

3218- 

3219 

3227- 

3228 




3250 

3259- 

3278 

3280- 

•3284 

3296- 

3297 

3306- 

3307 




3311 

-3318 

3324- 

•3332 

3334- 

-3344 

3346- 

3352 

3355- 




3356 

3358 

3361- 

3366 

3370- 

-3372 

3375 

3386- 

3389 




3399 

-3401 

3407- 

3408 

3410 

3414- 

3415 

3419- 

3421 


;l □ OJS «+ £» Mi- 31 « O E ££ & O H 


3429 3454 3479-3480 3486 3494-3499 3504-3507 
3512-3516 3520-3523 3528-3530 3537-3539 3544- 
3553 3555-3557 3560 3566-3567 3591-3593 3597- 
3605 3612-3615 3623-3628 3631-3633 3639-3640 
3644-3652 3660-3663 3669-3672 3675-3679 3683- 
3686 3696 3698-3703 3705 3722 3724-3726 3730 
3740-3752 3779-3780 3794-3795 3798-3801 3841- 
3849 3868 3878-3879 3892-3893 3897-3903 3922- 
3926 3938-3941 3956 3961-3963 3983-3987 3991- 
3998 4020-4034 4042-4043 4045-4046 4048-4051 
4054 4061-4062 4080-4085 4102-4118 4128-4129 
4180-4181 4205-4207 4216-4217 4223-4224 4228 
4231 4242-4245 4252 4263 4265 4293-4323 4331- 
4333 4361-4364 4366-4369 4378-4380 4386-4387 
4422-4424 4429 4436-4438 4480-4484 4505-4506 
4514-4518 4520-4535 4547-4549 45S1-4552 4556- 
4558 4569-4572 4606-4608 4659-4660 4662 4672- 
4674 4693-4694 4697 4715 4721-4727 4729 4733 
4738-4741 4744 4757-4758 4765-4769 4780-4783 
4836 4845-4846 4873-4877 4879-4880 4886-4889 
4899 4905 4910-4916 4920-4922 4928 4936-4939 
4943 4949 4957-4958 4972-4976 4978 4984-4985 
4991-4992 4999-5001 5019-5028 5036-5038 5043- 
5049 5065-5072 5076-5081 5098-5100 5105 5107- 
5109 5112-5113 5134 5139-5140 5159-5160 5168- 
5173 5177-5179 5190 5206-5207 5213 5216-5220 
5226-5227 5236-5243 5258-5262 5280-5282 5284- 
5285 5289 5297 5302-5306 5323 5343 5351-5355 
5359 5364 5366 5374 5379 5389 5392-5395 5406- 
5409 5431-5437 5443-5445 5452-5459 5465-5466 
5475-5478 5554-5560 5562-5563 5565-5568 5585 
5590-5592 5598 5616-5617 5621-5622 5652 5673 
5678-5679 5687-5690 5694-5695 5698-5701 5714 
5719-5720 5723-5729 5746-5753 5775 5777 5797- 
5805 5809-5814 5829-5834 5850-5852 5856-5863 
5869-5871 5882-5885 5937-5938 5940-5941 5949 
5964-5966 5971-5980 5988-5992 5995-5996 5999- 
6000 6003-6005 6021 6032 6044-6045 6048-6056 
6058-6060 6062 6065-6068 6074 6077 6084 6119- 
6120 6133-6134 6149 6174-6176 6179-6181 6198 
6203 6213-6217 6234 6236-6241 6256 6294 6305- 
6308 6353-6355 6359-6360 6383-6388 6392-6395 . 
6400-6401 6424-6425 6451 6489 6491-6499 6501- 
6504 6517-6518 6548-6549 6629-6638 6646-6649 
6653-6660 6688-6689 6691 6703-6704 6712 6734- 
6735 6742 6751-6756 6766-6767 6770 6799-6807 
6828-6832 6858 6863-6867 6872-6874 6897-6898 
6907-6908 6911-6915 6923-6924 6941-6943 6958- 
6959 6987-6989 6997-6998 7001-7004 7007 7009- 
7011 7014-7027 7050-7053 7063-7067 7080-7082 
7086-7087 7098 7108-7110 7116-7132 7144-7146 
7151-7153 7155-7157 7171-7173 7181-7184 7188- 
7190 7219 7225 7231-7233 7251-7260 7274-7275 
7286 7291 7310-7311 7313 7335 7344-7345 7349- 
7350 7357-7359 7373 7375 7377-7384 7394 7396 
7398-7400 7402-7404 7412-7414 7416-7417 7424- 
7425 7461 7463 7465-7468 7470 7474 7498 7509- 


<©7SD 


) 


- - 


1 OH" 


fin O..' 

as o 

„,H pi l},„{i 


O ?J> 
Uu.ii IJ,^, 




7510 

7532 

-7537 

7539 

-7549 

7551 

-7554 

7568 

7584 




7592 

7599 

-7600 

7609 

-7612 

7638 

-7639 

*7 c c n 
7 65 0 

7661- 




7668 

7672 

-7675 

7689 

7698 

-7700 

7702 

7722 

-7727 




7748 

7755 

7766- 

7769 

7793 

7 813 

7 864 

7867 

-7868 




7870 

-7875 

7899- 

7900 

7923 

7933 

7955- 

7957 

7967 




7970 

-7971 

7982 - 

7983 

8046 

-8048 

8070- 

8072 

8077 




O "1 A *5 

qui 


Q1 *JC 

O 4.J J 

Ol A4 

O Xzf 4 

- o iy o 

8227- 

8229 

8241 





-8312 

8366 



ft n*7 

8392-8395 

8397- 




Q 1 Q Q 

8405 

8406 

Q. A 1 1 

O 4 lo 

_ O A 1 Q 

- o 4 ± y 

8436 

8457-8458 




U A C C 

84 do 

8486 

8490- 

O A Q C 

o4 y y 

q c n n 
- obUU 




Induced 

Stratagene 

NTD001 

201- 

209 272 386 


-651 

303 - 8 

04 809 946 

1033 - 

neuron 



1035 

1227 

-1247 

1^ 8 / 

i n o o 

1311 

-1313 

1380 

-1390 

cells 



13 96 

-1398 

1514- 

i e i c 

1515 

i c o e 
1525 

1 C 1 Q 

-1528 

1561 

1568 

-1570 




1652 

1718 

1798- 

1 Q A 

lo 00 

1813 

-1815 

1854- 

1856 

1912 




1950 

-1953 

1967 

2 003 

AAA J 

-2004 

*^ f\ t t\ 
2030 

-2031 

2223 

-2224 




2255 

-2257 

2341- 

O "i A "5 

2353 

-2357 

2385- 

2386 

2482- 




2488 

2561 

-2565 

A1 

2 603 

-2609 

2656 

-2657 

2671 

2732- 




2742 

2794 

2798 

2812 

-2813 

2935 

-2939 

2953 

-2962 




297 4 

-2977 

3014- 

3016 

3036 

T f\ *5 Q 

-3 03 9 

3138- 

3141 

3152- 




3155 

3253 

-3258 

3 262 

-3269 

3274 

-3275 

3279 

3290- 




32 92 

3296 

-3297 

3425 

-342 8 

3497 

3521 

3548 

-3550 




35oO 

3566 

-3567 

1 C Q C 

3 b y b 

-3 5 So 

3602 

-3605 

3673 

-3674 




372 8 

3733 

-3737 

3 / / y 

T *7 o rt 

-3 780 

a on/i 
3 804 

-3805 

3888 

-3891 




TOCO 

3958 

-3960 

3986- 

3 98 / 

4 A A1 

4 001 

^ AAQ 

-4008 

4043 

4061 

-4062 




4070 

-4072 

4080- 

4081 

4101 

-4106 

4177 

4292 

4329 




4484 

4553 

4827 

4 892 

-4898 

4909 

4922 

4929 

4950- 




4953 

5019 

-5021 

5177 

-5179 

5208 

5244- 

5245 

5315 




5352 

-5355 

5441- 

5442 

5446 

-5447 

5475- 

5478 

5521- 




5522 

5559 

-5560 

5664 

-5667 

5683 

5691 

5705 

-5709 




5999 

-6000 

6119- 

6120 

6279 

6362 

6385- 

6388 

6443- 




6444 

6455 

6464- 

6466 

6588 

-6595 

6695- 

6696 

6785 




/r *"» o n 
6799 

-6804 

6861 

6946 

-6348 

c c% tz r\ 
6950 

6957 

7014 

-7027 





7104 

7152-7153 

7162 

-7165 

7377- 

7384 

7439 




/bo U 

7735 

-7737 

7809 

7823 

7989 

8031- 

8033 

8277- 




Q O T Q. 

82 / y 

8413 

8430- 

8432 






Retinoid 

Stratagene 

NTR001 

144 - 

145 1005-1007 1022-1023 1160-1161 1246- 

acid 



1247 

1300 

-1302 

1399 

-1407 

1448 

-1450 

1518 

-1520 

induced 



1619 

-1620 

1784- 

1785 

1894 

-1895 

1912 

2018 

-2019 

neuronal 



2095 

-2097 

2120- 

2121 

2223 

-2224 

2262- 

2264 

2315- 

cells 



2322 

2732 

-2738 

2794 

3133 

3274 

-3275 

5032 

-5033 




5080 

-5081 

5634 

5673 

6119 

-6120 

7955- 

7957 

8397- 




8398 

8430 

-8432 







neuronal 

Stratagene 

NTU001 

85 144-14 

5 288- 

289 

383 4 

02 432-433 

468 

638- 

cells 



639 

650-651 803 

-804 

836- 

838 934-935 

970 

1022- 




1023 

1064 

-1065 

1072 

-1106 

1246 

-1247 

1316 

-1317 




1341 

-1346 

1455- 

1459 

1466 

-1470 

1511- 

1513 

1540- 




1548 

1561 

1619- 

1620 

1637 

-1641 

1658 

1712 

-1713 




1768 

-1769 

1781- 

1785 

1798 

-1800 

1813- 

1815 

1839- 




1840 

1850 

-1851 

1861 

-1863 

1894 

-1895 

1897 

-1900 




1907 

-1911 

1930- 

1932 

1950 

-1953 

1955- 

1957 

1964- 




1967 

2003 

-2004 

2074 

-2075 

2079 

2103- 

2107 

2192- 




2194 

2209 

-2217' 

2223 

-2224 

2262 

-2264 

2266 

-2267 




2307 

-2322 

2398- 

2403 

2417 

-2419 

2502- 

2508 

2522- 




2525 

2536 

-2537 

2549 

-2550 

2567 

-2568 

2658 

-2660 




2698 

-2703 

2796 

2810 

-2811 

2842 

-2848 

2908 

-2911 




2980 

-3013 

3049- 

3053 

3071 

3080 

-3085 

3X08 

-3110 


• ton 


1 00$$ M-iS 4-3 . O E:5 £2 6 ill S3 


3137 3165-3172 3274-3275 3279 3306-3307 3334- 
3344 3346-3348 3353-3354 3491-3493 3508-3511 
3522-3524 3558-3559 3605 3699-3701 3723 3730 
3802r3803 3871-3875 3892-3893 3946-3949 3958- 
3960 4048-4051 4057 4242-4245 4266 4547-4549 
4565 4573-4602 4854-4872 4950-4953 4957-4958 
5073-5075 5226-5227 5250-5254 5438-5440 5446- 
5447 5621-5622 5631 5634 5851-5852 5932 5964- 
5966 6203 6383-6384 6443-6444 6517-6518 6718- 
6721 6799-6804 6911-6915.6926 6941-6943 6950 
6957 7303-7304 7310-7311 7570-7573 7743 7907 
8031-8033 8151-8152 8159-8160 8163-8166 8397- 

i 8398 8408 8430-8432 8466 . 

pituitary Clontech PIT004 26-28 106-107 416-418 454-455 786-787 992-994 

9 land 1297-1299 1354-1355 1425-1431 1481-1484 1568- 

1570 1602 1619-1620 1922-1927 2003-2004 2112 
2148-2152 2171-2173 2208 2241-2242 2266-2267 
2511 2526-2528 2592-2596 2656-2657 2665-2667 
2722-2724 2732-2738 2806 2918-2930 2940-2946 
2978-2979 3037-3039 3165-3172 3274-3275 3279 
3454 3477-3478 3486-3488 3497 3551-3553 3560 
3605 3722 4015-4018 4043 4102-4106 4166 4353 
4422-4424 4445-4447 4484-4485 4662 4699 4721- 
4723 4806-4809 4839 4908 4928 4957-4958 4972- 
4976 5301 5351 5448-5459 5523 5559-5560 5562- 
5563 5594 5988-5992 5998 6057 6282-6284 6305- 
6307 6489 6766-6767 6776-6779 6896 6943 6977- 

6978 7561 7747 8413 8488-8489 

placenta Clontech PLA003 1185-1186 1261-1263 1357-1358 1511-1513 1784- 

1785 2074-2075 3054-3057 3138-3141 3904 3971- 
3982 4662 5022-5028 5241-5243 6464-6466 6634- 

6638 7068-7078 7188-7190 8430-8432 

prostate Clontech PRT001 94 121 195 221-222 253-254 355 441 534-535 

640 748-750 830 887-891 952 982-983 992-994 
1033-1035 1193 1246-1247 1261-1263 1297-1299 
1378-1379 1394-1395 1402-1407 1415-1420 1465 
1481-1484 1525-1528 1538-1541 1549-1551 1561 
1575-1577 1592-1595 1600-1602 1610 1624-1625 
1658 1675 1680-1690 1813-1815 1849 1864-1869 
1912 1930-1932 1962-1963 1980-1984 1986-1994 
2003-2004 2036 2050-2052 2063 2112 2114-2119 
2137 2165-2173 2268-2271 2315-2322 2324-2327 
2341-2343 2370-2372 2375-2378 2388 2407 2430- 
2432 2455 2493-2501 2509-2511 2559-2565 2589- 
2590 2619-2622 2630-2633 2689-2691 2722-2727 
2732-2738 2744-2747 2776-2783 2794 2810-2811 
2835 2850-2851 2856 2867 2884-2888 2908-2911 
2948-2950 3017-3023 3049-3053 3065-3066 3072 
3134-3136 3149-3150 3227-3228 3274-3275 3279 
3287-3289 3313-3314 3346-3352 3371-3372 3384- 
3385 3399 3411 3416-3418 3454 3469-3470 3473- 
3476 3481-3485' 3507-3511 3631-3632 3673-3674 
3678-3679 3722 3724-3726 3730 3765-3766 3788- 
3789 3869-3870 3892-3893 3904 3931-3936 4042 
4061-4062 4174-4176 4216-4217 4246 4248 4255 
I | 4331-4333 4366-4367 4422-4424 4441-4443 4508- 


) 


,:1 0 O B «+3 UJ 01 H E & O ££ 


rectum 


salivary- 
gland 




4511 

*t Z? o o 

/cin 

4^1 7 

*± D 

A C C Q 
40DO 

A c *7 r\ 
-4670 

4685 

-4687 



4697 

4710 

4 71 

— 471 7 
*± f J. 1 

All A 

A 7 CI 


-4792 

4999- 



5001 

5005 

Zj Vw / 


— Zj v O ± 

ci r\n 
31U / 

ci no 

5205 

-5207 



5255 

5268 

-5272 

52 98 

Zj O \J t. 

zj z> \j Zj 


b 6 1 J 

-5314 



5369 

-5371 

5377 

54 06 

-5409 



C C A C 

5546 

5590- 



5592 

5594 

5639 

-5641 

C£CO 

jo 

-j D Z? 


-boly 

5835- 



5838 

5870 

-5871 

5875 

-5881 

5925 

-5930 

5956 




5999 

-6000 

6028 

-6031 

6234 

6253 

6267 

6287 

6349 



6391 

6424 

-6425 

6731 

6743 

-6744 

6747 

-6748 

6781 



6784 

6840 

6957 

6960 

-6961 

6990 

-6992 

7043 

7080- 



7082 

7181 

-7184 

7213 

7334 

7402 

-7403 

7446 

7770- 



7772 

7907 

7916 

7949 

-7951 

8031 

-8033 

8110 

-8113 | 



8129 

8144 

8261 

-82 64 

8367 

-8368 




Invitrogen 

REC001 

253- 

254 264-265 416 

-418 

459 4 

69-473 580 

-581 



736- 

737 775 1033-1035 1110-1111 1244-1245 



1261 

-1263 

1300 

-1304 

1359 

-1367 

1374 

-1391 

1439- 



1444 

1446 

-1450 

1465 

1481 

-1484 

1518 

-1520 

1542- 



1548 

1571 

-1574 

1626 

-1628 

1691 

-1692 

1694 

1765- 



1767 

1781 

-1783 

1831 

1835 

-1838 

1912 

1921 

1936 



1972 

1985 

1998 

-1999 

2063 

-2066 

2068 

-2070 

2158- 



2162 

2282 

-2283 

2407 

2436 

-2438 

2511 

2513 

2522- 



2525 

2551 

-2553 

2559-2560 

2585 

-2588 

2603 

-2609 



2623 

2630 

-2633 

2636 

-2643 

2658 

-2660 

2672 

-2678 



2692 

-2696 

2731 

2796 

2812 

-2813 

2820 

2842 

-2848 



2850- 

-2851 

2915 

2948- 

-2950 

2953 

-2959 

2974 

-2977 



3049- 

-3053 

3066 

3080- 

-3085 

3115 

-3116 

3313- 

-3314 



3346- 

-3348 

3410 

3454 

3555- 

-3557 

3566- 

-3567 

3597- 



3599 

3605 

3623- 

-3628 

3633 

3641 

3705 

3717- 

-3722 



3740- 

-3741 

3798- 

-3801 

3888- 

-3891 

3904 

3926 

3931- 



3936 

4020- 

-4021 

4102- 

-4106 

4180- 

-4181 

4254 

4331- 



4333 

4336 

4388 

4394 

4498- 

-4501 

4556- 

•4558 

4648- 



4652 

4706- 

-4709 

4738- 

-4741 

4845- 

-4846 

4873- 

-4877 



4907 

4957- 

-4958 

4962- 

-4967 

5111 

5236- 

5243 

5280- 



5282 

5297 

5306 

5343 

5356- 

5358 

5366 

5402- 

•5403 



5406- 

•5408 

5435- 

5437 

5452- 

5459 

5475- 

5478 

5504 - 



5505 

5561 

5621^5622 

5673 

5697- 

5701 

5714 

5723- 



5729 

5748- 

5753 

5869 

5882- 

5885 

5971- 

5980 

5988- 



5992 

6013 

6198 

6203 

6244 

6476- 

6479 

6489 

6613- 



6615 

6653- 

6660 

6688- 

6689 

6703- 

6704 

6766- 

6767 



6960- 

6961 

6997- 

6998 

7001- 

7002 

7086- 

7087 

7098 



7251- 

7254 

7310- 

7311 

7321 

7398 

7424- 

7425 

7496 



7532- 

7537 

7676- 

7677 

7907 

7914- 

7915 

7923 

8102- 



8109 

8185 

8309- 

8312 

8383- 

8385 

8455- 

8456 


Clontech 

SAL001 

10-13 

55 105 144-145 

172- 

175 272 283-284 

307- 



320 342-34 

4 416 

-418 

468 527-531 539 

748- 

750 



881-882 94 

5-947 

1160 

-1161 

1163 

-1165 

1213 

-1215 



1227- 

1243 

1261- 

1263 

1300- 

1302 

1305- 

1308 

1311- 



1313 

1316- 

1323 

1439- 

1441 

1446- 

1447 

1472- 

1477 



1505- 

1506 

1511- 

1513 

1561 

1602 

1616- 

1617 

1659 



1694 

1779 

1784- 

1785 

1801- 

1804 

1813- 

1815 

1958- 



1961 

1968 

1985 . 

1997 

2028 

2071- 

2073 

2083- 

2085 



2112 

2148- 

2154 

2156- 

2157 

2220 

2266- 

2267 

2291- 



2294 

2307- 

2322 

2385- 

2387 

2420- 

2423 

2453 

2603- 



2609 

2626- 

2629 

2651- 

2654 

2682- 

2683 

2731 

2794 



2796 

2826- 

2834 

2842- 

2848 

2875- 

2878 

2913- 

2914 



2948- 

2950 

2978- 

2979 

3065 

3163- 

3172 

3280- 

3284 



3287- 

3292 

3425- 

3428 

3469- 

3470 

3473- 

3474 

3479- 







ii ill IF 

? 'Pi! {L il 



11 IP 1 iP 

[{".., o P 1 

"tuP ".,,}« il«t.t 




34 80 

34 87- 

-3488 

34 97 

3522 

-3 3 4, J 







3605 

3612 * 

-3615 

3623 

-3628 

3722 

J / O X 

-3764 

37 ft A- 




3789 

3791 

3958 

-3960 

3986 

-3987 

3 991 

_ 1QQO 





4061 

-4062 

4110 

-4118 

4128 

-412 9 

4264 

4 331- 

-4333 

** J O J 




4356 

-4359 

43 61 

-4362 

44 80 

-44 84 

4498 

*± O 

t 3 V X 

3 O X 




4573 

-4602 

4629 

-4634 

4731 

4765 

-4767 

4 93 fi 

*± J -J o 

4 94 3 




4972 

-4976 

5129 

-5130 

5139 

-5140 

5185 

5290- 
j *~ *j \j 

.CO 




5441 

-5442 

5523 

5588 

5614 

-5615 

5624 

-5626 

5631 

J w J X . 




5678 

-5679 

5694 

-5695 

5697 

5757 

-5758 

-J Q J X 

.coco 





602 3 - 

-6024 

6117 

-6120 

61 7 R 

v X f © 



-£311 




6326 

-633 3 

6359 

-63 60 

6456 

-6459 

6464 

. 646 6 

6560 - 




6562 

6627- 

-6628 

6663 

-6664 

6718 

-6721 

6925 

7044 




7150 

7181- 

-7184 

7276 

-7277 

7551 

-7554 

8102- 

-8109 





8133 

-8135 

8157 

8403 

-8404 

8430 

-8432 



saliva 

Clontech 

n t\ t c? r\ *a 
SALS 0 3 

343- 

344 740-74 

1 822 

-827 

1446- 

1447 

1568-1570 j 

gland 



1827 

-1830 

2123 

2315 

-2322 

2974 

-2977 

8031- 

-8033 

skin 

ATCC 

SFB001 

416- 

418 1841-1 

342 1896 2315-2322 2823-2825 

fibroblast 



3454 

3560 

3605 

3678 

-3679 

3983 

-3985 

4203- 

•4204 




4349 

-4350 

5425 

-5426 

5466 

6957 

7014 

-7027 


skin 

ATCC 

SFB002 

416- 

418 650-651 1540-1541 1791-1795 1841-1842 

fibroblast 



2005 

-2008 

2279 

-2280 

2315 

-2322 

2567 

-2568 

3319- 




3323 

3560 

3983 

-3985 

5032 

-5033 

5280 

-5282 

5425- 




5426 

5631 

6957 

7014 

-7027 





skin 

ATCC 

SFB003 

144- 

145 1540-1541 1791-1795 2174-2175 2315- 

f ibrob last 



2322 

2953- 

-2959 

5425 

-5426 

6703- 

-6704 

6957 


small 

Clontech 

SIN001 

38 42-43 119 195 283-284 

296-297 432-433 534- 




535 

546-548 645 698 

712-713 735 960-969 

1072- 




1105 

1439- 

-1441 

1446- 

-1447 

1466- 

-1470 

1478- 

1480 




1511 

-1513 

1540 

-1541 

1571 

1637- 

1641 

1658 

1671- 




1672 

1675 

1680- 

-1690 

1694 

1770- 

-1773 

1781- 

1783 




1813 

-1815 

1819 

1827- 

-1830 

1839- 

-1840 

1844- 

1848 




1854 

-1856 

1906 

1912 

1922- 

-1927 

1962- 

-1963 

2018- 




2019 

2122 

2196- 

-2198 

2243 

2300- 

-2301 

2307- 

2314 




2350 

-2352 

2369 

2379 

2430- 

-2432 

2598- 

-2601 

2626- 




2629 

2636 

2658* 

-2660 

2689- 

-2691 

2725- 

-2728 

2748- 




2749 

2799- 

2800 

2842- 

-2848 

2879 

2913- 

-2914 

2953^- 




2959 

3066 

3161- 

-3162 

3280- 

-3284 

3287- 

•3289 

3324- 




3332 

3410 

3487- 

-3488 

3537- 

-3539 

3620- 

•3622 

3633 




3683 

-3686 

3707- 

-3709 

3738- 

-3739 

3782- 

3784 

3796 




3878 

-3879 

3883- 

-3884 

3888- 

-3891 

3904 

3966- 

3968 




4097 

-4100 

4163 

4165 

4242- 

•4245 

4353 

4422- 

4424 




4444 

4484 

4542 

4550 

4573- 

-4602 

4605 

4648- 

4652 




4662 

4666- 

4667 

4710 

4735 

4745- 

4749 

4765- 

4767 




4841 

-4842 

4920- 

-4921 

4936 

5123- 

5124 

5X74- 

5176 




5191 

-5192 

5200 

5255 

5351 

5366 

5392 

5507- 

5512 




5526 

-5529 

5554- 

-5558 

5594 

5599- 

•5601 

5664- 

5667 




5698 

-5701 

5891 

5895 

5981- 

5983 

6069- 

6073 

6087- 




6090 

6276- 

6278 

6342 

6378 

6448 

6491- 

6499 

6596- 




6597 

6634- 

6638 

6863- 

6867 

6941- 

6942 

7310- 

7311 




7335 

7343 

7398 

7423 

7461 

7483 

7761 

7770- 

7772 




7780 

-7790 

7845 

7907 

8035- 

8036 

8102- 

8X09 

8353- 




8354 

8367- 

8368 







skeletal 

Clontech 

SKM001 

94 144-145 

172-175 3 

90-394 447 495 

973 1003 

muscle 



1160 

•1161 

1206- 

•1209 

1227- 

1243 

1300- 

13 02 

1309- 




1310 

1374- 

1377 

1396- 

1398 

1439- 

1441 

1561 

1583- 




1585 

1691- 

1692 

1695- 

1696 

1718 

1805 

1819- 

1821 


:i O O i& 1{ + £» «*3 . O a 2 <i O 2 


skeletal 
muscle 


Clontech 


SKM002 


1827 

-1830 

1839 

-1842 

1850 

-1851 

1861 

-1863 

1936 

2079 

2095- 

•2097 

2231- 

-2233 

2262 

-2264 

2329 

-2331 

2353 

-2357 

2453 

2470 

2544 

-2548 

2668 

2682 

-2683 

2704 

-2710 

2744 

-2747 

2796 

2835 

2842 

-2848 

2963- 

2971 

3036 

3122 

-3132 

3165 

-3172 

3346 

-3348 

3373- 

3374 

3386-3389 

3450- 

•3452 

3507 

3644 

-3645 

3678- 

3679 

3910-3919 

3926 

4015 

-4018 

4221 

4360 

-4362 

4395 

-4396 

4484 

4502- 

4503 

4505 

-4506 

4551 

-4552 

4873 

-4876 

4944 

-4945 

5098 

-5100 

5162 

-5163 

5236- 

5240 

5284- 

5285 

5352- 

5355 

5360 

5546 

5599 

-5600 

5616 

-5617 

5624 

-5626 

5723 

-5729 

5769 

5845 

-5849 

6272 

6305- 

6307 

6407- 

6411 

6552-6554 

6814 

-6816 

6861 

7181- 

7184 

7281- 

7282 

7324 

7328 

7428 

-7429 

7625 

7802- 

7804 

7863 

7953-7954 

8157 

8161 

8194- 

8196 

8367- 

8368 

8403- 

8404 






447 1540-1541 1827-1830 1 
2836-2838 3983-3985 4514 


841-1842 1962-1963 
4518 7014^7027 


skeletal 
muscle 


Clontech 


SKMS03- 


1540-1541 1827-1830 1841-1842 2836-2838 8194- 
8196 


skeletal 
muscle 


Clontech 


SKMS04 


1827-1830 1841-1842 2143-2146 2836-2838 3560 
5723-5729 7780-7790 


spinal cord 


Clontech 


SPC001 


3-9 106-107 109 112 140-142 144-145 193-194 
201-209 241-243 269-270 294-299 321 350 375 
380-382 390-394 430-431 475 515-516 549-551 
562 596 603-606 608-609 650-651 718 765 786- 
787 984-985 1005-1007 1033-1035 1048 1064- 1 
1066 1072-1105 1110-1111 1160-1161 1175-1176 
1227-1243 1246-1247 1261-1263 1290-1293 1305 
1311-1313 1348 1354-1355 1368 1380-1390 1394- 
1395 1410-1412 1439-1441 1465 1472-1480 1490- 
1491 1505-1506 1511-1513 1540-1541 1552-1555 
1557 1586-1591 1600-1601 1612-1615 1619-1620 
1637-1641 1659 1678 1718 1726-1727 1817-1818 
1870-1874 1876 1896 1903-1905 1912 1916 1921 
1930-1932 1944-1945 1947-1948 1950-1953 1958- 
1963 1980-1984 2016-2017 2020-2027 2062-2063 
2074-2075 2079 2086 2148-2152 2190-2191 2262- 
2264 2268-2271 2282-2283 2302-2305 2307-2322 
2324-2327 2353-2357 2408-2415 2440-2441 2454 
2478 2480 2487-2488 2559-2560 2567-2568 2592- 
2596 2603-2611 2651-2652 2655-2660 2671 2684- 
2688 2698-2711 2725-2727 2732-2738 2744-2747 
2788-2794 2796 2810-2811 2816-2817 2823-2825 
2842-2848 2856 2868-2873 2879 2881-2883 2905- 
2907 2918-2930 2935-2939 2948-2950 2953-2959 
2978-2979 3049-3053 3065 3071 3086-3087 3137 
3149-3150 3152-3155 3165-3172 3221-3226 3250- 
3252 3274-3284 3287-3289 3355-3356 3386-3389 
3410 3425-3429 3455-3463 3475-3476 3481-3485 
3487-3488 3497 3507-3511 3555-3557 3560 3602- 
3610 3612-3615 3642-3643 3658-3659 3675-3677 
3705 3722 3733-3739 3742 3765-3766 3776-3778 
3782-3784 3868 3878-3880 3885 3897-3900 3904 
3910-3919 3931-3936 3938-3941 3966-3968 3986- 
3987 4001-4008 4027-4040 4042 4048-4051 4061- 
4062 4065 4092-4095 4101-4106 4131 4142 4150 


\pisr 


■J 




:tOC 

;ire ni 

•6. «+ 


11 in!! 2! 

B O ic!! 




4169 

-4171 

4180 

-4181 

4191 

-4193 

4242 

-4245 

4286 




4289 

4327 

4363 

-4364 

4382 

-4384 

4422 

-4425 

4429 




4445 

-4447 

4484 

4542 

4612 

-4613 

4644 

-4645 

4691 




4724 

-4727 

4738 

-4741 

4745 

-4749 

4768 

-4769 

4771- 




4772 

4793 

-4794 

4796 

-4803 

4806 

-4809 

4825 

4848- 




4850 

4919 

4922 

4930 

-4931 

4944 

-4945 

5032 

-5033 




5054 

5107 

-5109 

5116 

5166 

5206 

-5207 

5284 

-5285 




5306 

5336 

5369 

-5371 

5377 

5386 

5392 

5448 

-5451 




5492 

-5493 

5530 

-5545 

5547 

-5553 

5635 

5715 

-5718 




5723 

-5729 

5757 

-5758 

5765 

-5768 

5770 

5777 

5829- 




5834 

5853 

-5855 

5875 

-5881 

5981 

-5983 

6003 

-6005 




6014 

-6020 

6069 

-6073 

6075 

6080 

-6083 

6087 

-6090 




6170 

-6171 

6174 

-6176 

6213 

-6217 

6267 

6272 

6282- 




62 84 

6291 

-6292 

6305 

-6307 

6322 

6356 

-6358 

6443- 




6444 

6446 

6448 

6476 

-6479 

6511 

6517 

-6518 

6583 




6653 

-6660 

6662 

6759 

-6761 

6764 

-6765 

6780 

6810 




6909 

6916 

-6917 

6970 

-6976 

6990 

-6994 

7001 

-7002 




7014 

-7027 

7091 

7099 

-7102 

7152 

-7153 

7162 

-7165 




7169 

-7170 

7196 

-7197 

7224 

7238 

-7240 

7283 

-7284 




7352 

7404 

7443 

7457 

-7459 

7561 

7609 

-7611 

7661 




7729 

7732 

-7734 

7766 

-7772 

7780 

-7790 

7843 

7882 




7906 

7958 

8013 

8031 

-803a 

8158 

8198 

8430- 

-8432 




8490 

-8495 








adult 

Clontech 

SPLcOl 

6-7 56 105 121 

345-349 351-352 432 

-433 941- j 
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5558 5614-5615 5748-5753 5869 5953 6003-6005 
6063-6064 6115-6116 6182-6191 6213-6217 6260- 
6262 6268 6326-6333 6348 6452-6453 6743-6744 
6757-6758 6836-6838 6858 7188-7190 7375 7421 
7613-7614 7626-7629 7907 7987 8309-8312 


Mixture of 
16 tissues- 
mRNAs*** 


Various 
Vendors*** 


SUP002 


384-385 546-548 1022-1023 1070-1071 1113-1117 
2636 2656-2657 2731 3066 4014 4061-4062 4929 
5416 5466 7003-7004 7470 7551-7554 7764-7765 
7949-7951 8430-8432 


Mixture of 
16 tissues- 
mRNAs * * * 


Various 
Vendors*** 


SUP005 


305 365 384-385 726-727 757-758 786-787 811- 
813 2534-2535 2570 2656-2657 2665-2667 2807- 
2811 2823-2825 3095-3096 3115-3116 3753-3759 
3983-3985 4266 4377 4395-4396 4949 5446-5451 
5554-5558 5694-5695 5705-5709 6957-6959 7029- 
7033 7780-7790 7949-7951 8031-8033 8397-8398 
8430-8432 8469-8479 


Mixture of 
16 tissues- 
mRNAs * * * 


Various 
Vendors*** 


SUP008 


576-578 726-727 730-731 806-807 1110-1111 
2538-2542 2585-2586 2636 2642-2643 2651-2652 
2656-2657 2671 2810-2811 2857-2859 2935-2939 
2978-2979 4102-4106 4847 4873-4876 5547-5552 
5554-5558 7091 7465-7468 7773-7778 8031-8033 
8329-8332 8430-8432 8490-8495 


Mixture of 
16 tissues- 
mRNAs * * * 


Various 
Vendors*** 


SUP0 09 


726-727 730-731 786 
2567-2568 2636 2656 
3037-3039 3043 3064 
5554-5558 5631 6065 
7949-7951 8397-8398 


-787 793-7 
2657 2823 
3066 3133 

-6068 6990 
8469-8479 


94 2534-2535 
-2825 2935-2939 
4000 5521-5522 
-6992 7152-7153 


thalamus 


Clontech 


THA002 


14-19 67 90 180-183 283-284 612-613 716-717 
719 831-833 945-947 1151-1152 1324-1335 1347 
1354-1355 1402-1407 1439-1441 1455-1459 1490- 
1491 1538-1539 1558-1560 1619-1620 1691-1692 
1702-1710 1726-1727 1759-1760 1781-1783 1791- 
1795 1864-1869 1890 1907-1912 1921 1970-1971 
2020-2027 2068-2070 2112 2131-2136 2148-2154 
2251 2255-2257 2268-2271 2282-2283 2332-2333 
2344-2346 2353-2357 2435 2521 2574-2580 2585- 
2586 2610-2611 2637-2641 2658-2660 2682-2683 
2697 2711 2739-2742 2748-2749 2796 2850-2851 
2868-2873 2875-2878 2905-2907 2913-2914 2935- 
2939 3122-3132 3137-3141 3173 3221-3226 3276- 
3278 3280-3284 3287-3289 3308-3310 3346-3348 
3456-3463 3487-3488 3497-3499 3605 3612-3615 
3644-3645 3667-3672 3705 3717-3721 3748-3752 
3798-3801 3983-3985 4035-4040 4043 4048-4051 
4082-4085 4176 4180-4181 4218 4252 4296-4322 
4378-4380 4415-4417 4419 4422-4424 4435 4520- 
4528 4539-4541 4646 4721-4723 4784 4793-4794 


The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal Over mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 10) 
human leukemia mRNA (Clontech), 1 1) human thymus mRNA (Clontech), 12) human lymph node mRNA 
(Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) human adult 
esophagus mRNA (BioChain), 16) human fetal umbilical cord mRNA (BioChain). 
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-5992 

6021 

6032 

6084 

6182 

-6191 




6287 

6309 

-6311 

6412 

6600 

-6601 

6634 

-6638 

6665- 




6668 

6682 

-6683 

6718 

-6721 

6726 

6766 

-6767 

6828- 




6832 

6875 

6881 

6895 

6907 

-6908 

6949 

7080 

-7083 




7179 

-7184 

7186 

-7190 

7238 

-7240 

7274 

-7275 

7287 




7290 

7303 

-7304 

7317 

7329 

-7330 

7346 

-7347 

7394 




7412 

-7414 

7446 

7539 

-7549 

7595 

-7596 

7609 

-7611 




•7626 

-7629 

7689 

7698 

-7701 

7723 

-7727 

7745 

7755 




7770 

-7772 

7780 

-7790 

7802 

7867 

-7868 

7913 

7923 




8110 

-8113 

8130 

-8132 

8216 

-8218 

8227 

-8229 

8277- 




8279 

8296 

8315 

8335 

8372 

-8382 

8402 

8445 

8498 

— r ; 

thyroid 

Clontech 

THR001 

6-9 

14-19 

48 52-55 

67 87 

94 105 113 121 

123- 

91 ana 



126 

150-151 161 195 

201- 

209 2 

12-215 231 

253- 




254 

266-270 272 277 

-278 , 

286.-287 304 321 

351- 




352 

373 3 

94-385 407 

416-418 426-42 

8 430 

-431 




454- 

455 468-473 495 

513 . 

527-529 53 

I 537 

539 




556 . 

559-560 579 588 

-591 603-606 60 

3-609 

645 




650-651 670 706 712 

-713 733-737 753 757 

-758 




765 774 788-789 803 

-804 828-829 866-867 

884 




887-891 921 923-927 

941-942 952 971 974 

982- 




983 1022-1023 1037-1038 1160-1161 1185-1186 




1206 

-1209 

1227- 

-1248 

1287- 

-1289 

1297- 

-1304 

1309- 




1313 

1316- 

-1317 

1324- 

-1335 

1339- 

-1340 

1347 

1349- 




1353 

1357- 

-1370 

1378- 

-1379 

1396- 

-1414 

1425- 

-1431 




1439- 

-1441 

1478- 

-1480 

1485- 

-1491 

1498- 

-1500 

1511- 




1513 

1518- 

-1520 

1531- 

-1539 

1542- 

1551 

1557 

1561- 




1567 

1571 

1579- 

-1580 

1586- 

-1595 

1600- 

-1602 

1607- 




1608 

1610 

1619- 

-1620 

1622- 

•1628 

1636 

1655- 

1658 




1660- 

-1669 

1671- 

-1672 

1694 

1714- 

•1715 

1717- 

•1721 




1726- 

1728 

1780- 

1785 

1790-1795 

1798- 

•1800 

1813- 




1815 

1819- 

1821 

1824- 

1825 

1839- 

1851 

1853- 

1856 




1864- 

-1874 

1876 

1884- 

1889 

1896 

1907- 

1913 

1916 




1921 

1944- 

1945 

1947- 

1948 

1950- 

1953 

1958- 

1961 




1964-1969 

1980- 

1984 

1986- 

1991 

2001- 

2 0 04 

2014- 




2017 

2028 

2032- 

2035 

2037- 

2044 

2050- 

2052 

2059- 




2060 

2063 

2067- 

2070 

2080 

2086 

2112 

2114- 

2119 




2122 

2138- 

2141 

2155 

2158- 

2162 

2171- 

2173 

2182- 




2183 

2186- 

2189 

2195- 

2198 

2204- 

2208 

2240 

2255- 




2257 

2266- 

2267 

2276- 

2280 

2282- 

2283 

2285 

2291- 




2294 

2302- 

2305 

2315- 

2322 

2329- 

2333 

2341- 

2349 




2353- 

2357 

2361 

2370- 

2372 

2374- 

2378 

2382- 

2386 




2404- 

2415 

2417- 

2423 

2433- 

2434 

2442- 

2444 

2453- 




2454 

2458- 

2461 

2464- 

2467 

2469 

2489- 

2490 

2511- 




2512 

2515- 

2516 

2543 

2549- 

2550 

2558 

2573 

2597 




2603- 

2609 

2623 . 

2636- 

2641 

2651- 

2652 

2656- 

2660 




2665- 

2667 

2682- 

2683 

2704- 

2711 

2720 

2722- 

2727 




2729- 

2742 

2748- 

2749 

2778- 

2783 

2794 

2796 

2806 




2814- 

2815 

2820 

2826- 

2835 

2842- 

2848 

2850- 

2851 




2857- 

2863 

2867- 

2873 

2875- 

2879 

2902 

2913- 

2915 




2931- 

2946 

2948- 

2971 

2978- 

2979 

3014- 

3016 

3018- 




3023 

3033- 

3039 

3043 

3045- 

3048 

3065- 

3067 

3071 


I0-7(p0 


± O On;| Nl-6 H+3 - O 2 E! ife O *2 


3080-3091 3095-3096 3102-3105 3111 3122-3137 
3142-3147 3149-3150 3152-3155 3157-3172 3187- 
3214 3221-3226 3259-3260 3262-3273 3279-3284 
3287-3289 3298-3300 3306-3310 3315-3318 3346- 
3352 3357-3358 3361-3366 3370-3375 3384-3385 
3395-3397 3399 3407-3408 3422 3431-3433 3435- 
3437 3450-3452 3454 3456-3463 3475-3478 3486- 
3488 3491-3493 3498-3499 3507-3519 3521-3523 
3528-3530 3551-3553 3558-3559 3564-3567 3570 
3595-3599 3602-3605 3617-3619 3629-3630 3633 
3637-3638 3644-3645 3675-3677 3683-3694 3697 
3699-3701 3707-3713 3717-3722 3730 3733-3739 
3742-3747 3753-3766 3769-3774 3779-3780 3782- 
3784 3788-3789 3850-3854 3866-3867 3869-3870 
3888-3893 3901-3904 3910-3919 3926-3936 3956 
3958-3960 3962-3968 3983-3987 4015-4018 4020- 
4040 4042 4055-4056 4070-4072 4082-4088 4090- 
4091 4096-4100 4128-4129 4132 4134 4142 4150 
4166-4169 4177 4182-4183 4219-4220 4228 4241 
4251 4253 4264 4271 4296-4297 4331-4333 4336 
4353 4355-4359 4366-4367 4386-4387 4390 4392 
4422-4424 4429 4436-4438 4441-4444 4484 4498- 
4501 4512-4518 4529-4535 4544-4546 4554-4555 
4561 4610 4613-4614 4622 4625 4644-4646 4662 
4666-4671 4674-4675 4691 4710 4731-4733 4745- 
4749 4765-4769 4771-4772 4774 4791-4792 4840- 
4842 4873-4876 4889 4899 4909-4916 4920-4923 
4927 4972-4976 4978 5008-5012 5036-5038 5060 
5082-5097 5101 5107-5109 5118-5119 5141 5146 
5150 5153-5155 5159-5160 5162-5163 5167-5179 
5185-5189 5191-5193 5200 5205 5213 5247 5258- 
5262 5273 5284-5285 5288 5290-5292 5302-5305 
5331 5334-5335 5343 5351 5360-5362 5396-5401 
5406-5409 5415-5416 5425-5426 5428 5435-5440 
5452-5459 5461 5466 5472 5500-5501 5515-5518 
5526-5529 5546-5558 5565-5566 5569 5580-5581 
5583-5584 5590-5593 5624-5626 5629 5639-5641 
5658-5659 5671 5673 5677 5680-5682 5694-5695 
5698-5701 5744-5747 5757-5758 5761-5763 5765- 
5768 5770 5775 5792 5798-5805 5809-5817 5825- 
5828 5848 5851-5852 5873 5875-5889 5899-5914 
5920-5921 5925-5930 5932 5949-5952 5964-5966 
5969 5998 6001-6005 6013 6028-6031 6058-6060 
6063-6068 6080-6081 6115-6118 6123-6127 6137- 
6138 6140-6141 6156 6165-6169 6174-6176 6178 
6182-6191 6197 6213-6217 6243 6254-6256 6265 
6272 6276-6278 6287 6289 6291-6292 6297-6298 
6301 6305-6311 6316-6321 6338-6339 6343-6345 
6348-6349 6353-6355 637_8 6398 6418 6427-6431 
6445 6462 6464-6466 6476-6479 6516 6552-6556 
6598-6599 6613-6615 6618 6629-6633 6662-6676 
6698 6703-6704 6718-6721 6734-6735 6742 6759- 
6770 6776-6779 6784-6798 6828-6835 6839 6846- 
6848 6858 6862 6896-6898 6922-6924 6927-6931 
6937 6944-6945 6952 6954-6955 6957-6959 6990- 
6992 7001-7002 7014-7027 7029-7033 7043-7044 
7050-7053 7099-7102 7108-7110 7144 7155-7157 



t 
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7169 

- / X f 3 

7196 

-7197 

7225 

7230 

727C 

) 7300 

7308- 




7311 

/ J Z 1 

7426 

7428 

-7429 

7439 

7443 

7505 

7509- 




7510 

T c a *a 
7593 

-7594 

7621 

7630 

•7633 

7662 

-7666 

7693 -„ 




7696 

*7 *7 o a 

77 2 3 

7770 

-7772 

7780 

-7790 

7805 

7852 

7893 




7904 

- /905 

7907 

7953 

-7954 

7989 

8031 

-8033 

8043 

- 



8102 

-8113 

8145 

8162 

8192 

-8196 

8216 

-8218 

8227- 




8229 

8234 


8253 

8277 

-8279 

8329 

-8332 

8372- 




8382 

8397 

-8398 

8403 

-8404 

8413 

8421 

8430 

-8432 




8443 

-8444 

8466 

8490 

-8495 





trachea 


TRC001 

55 69 161 

230 

241-243 264-265 

293 

397-3 

98 




411- 

413 416-418 491 

-494 

533 549-551 753 

759- 




760 

857-859 945-947 

1014 

1049 

-1050 

1160 

-1161 




1216 

-1221 

1334 

-1335 

1339 

-1340 

1415 

-1420 

1446- 




1447 

1465 

1542 

-1548 

1558 

-1560 

1568 

-1574 

1610 




1616 

-1617 

1626 

-1628 

1694 

1712 

-1713 

1728 

1778 




1813 

-1815 

1832 

-1834 

18.53 

1884 

-1889 

1922 

-1927 




2018 

-2019 

2037 

2050 

-2052 

2063 

-2066 

2087 

-2094 




2103 

-2107 

2120 

-2121 

2163 

-2164 

2182 

-2183 

2255- 




2257 

2266 

-2267 

2370 

-2372 

2378 

2382 

-2384 

2387 




2404 

-2405 

2551 

-2553 

2567 

-2568 

2748 

-2749 

2796 




2805 

2948 

-2952 

3033 

3049 

-3053 

3065 

3073 

-3074 




3088 

-3091 

3152 

-3155 

3221 

-3226 

3274 

-3275 

3279 




3290 

-3292 

3361 

-3366 

3409 

3414 

-3415 

3435 

-3437 




3479 

-3480 

3500 

-3503 

3508 

-3511 

3536 

3555 

-3557 




3597 

-3599 

3616 

3638 

3678 

-3679 

3707 

-3709 

3742 




3771 

-3774 

3776- 

-3778 

3785. 

-3787 

3931 

-3936 

3986- 




3987 

4042 

4044 

4061- 

-4062 

4131 

4205 

-4207 

4255 




42.63 

4272 

-4273 

4415- 

-4417 

4444 

4536 

-4537 

4613 




4662 

4731 

4750 

4900 

5043- 

-5049 

5062 

5065- 

-5072 




5107- 

-5109 

5129- 

-5130 

5151 

5156 

5161 

5167 

5193 




5206- 

-5207 

5284- 

-5285 

5290- 

-5292 

5431- 

-5434 

5452- 




5459 

5526- 

•5529 

5546 

5583- 

-5584 

5614- 

-5615 

5631 




5660 

5677 

5693 

6200- 

-6202 

6213- 

-6217 

6305- 

-6307 




6645 

6662 

6665- 

•6668 

6726- 

-6728 

6762- 

-6763 

6868 




6979- 

-6982 

6997- 

•6998 

7014- 

-7027 

7046- 

-7049 

7465- 




7468 

7555 

7720- 

•7721 

7801 

7989 

8234- 

•8235 


uterus 


UTR001 

6-7 10-13 

114-115 121 144-145 

201-209 213-215 




246 345-349 441 527- 

529 537 814-821 845 

982- ! 




983 1024 1067-1069 1192 1297-1302 1311-1313 




1380- 

1390 

1478- 

1480 

1490- 

1491 

1514- 

1515 

1540- 




1551 

1575- 

1577 

1600- 

1602 

1612- 

1615 

1716 

1780 




1798- 

1800 

1820- 

1821 

1835- 

1840 

1849 

1854- 

1856 




1870- 

1874 

1916-1920 

1967 

1974- 

1978 

2003- 

2008 




2029 

2046- 

2049 

2063 

2074- 

2075 

2081 

2113 

2122 




2131- 

2136 

2148- 

2152 

2241- 

2242 

2266- 

2267 

2315- 




2322 

2353- 

2357 

2381- 

2386 

2388 

2404- 

2405 

2433- 




2434 

2478 

2511 

2558 

2587- 

2588 

2592- 

2596 

2619- 




2622 

2636 

2656- 

2657 

2672- 

2678 

2692- 

2696 

2713- 




2715 

2725- 

2727 

2732- 

2738 

2748- 

2749 

2778- 

2783 




2810- 

2811 

2874 

2931- 

2934 

2948- 

2950 

3 014- 

3016 




3065 

3086- 

3087 

3108- 

3110 

3149- 

3150 

3227- 

3228 




3298- 

3300 

3306- 

3307 

3349- 

3352 

3359- 

3360 

3416- 




3418 

3487- 

3488 

3494- 

3496 

3508- 

3511 

3 605 

3638 




3641 

3707- 

3709 

3717- 

3722 

3730 

3850- 

3854 

4042 




4173 

4182- 

4183 

4208 

4269- 

4270 

4286 

4372 

4422- 




4424 

4448- 

4452 

4480- 

4483 

4491- 

4494 

4508- 

4509 




4614 

4688- 

4690 

4757- 

4758 

4804 

4885 

4 957- 

4958 


■■) ± O DB^ft H-3 ., D H K i& O H 


5073 

-5075 

5098 

-5100 

5184 

5206- 

5207 

5216 

-5220 

5284 

-5285 

5302 

-5305 

5331 

5377 

5392 


-54 82 

5515 

-5516 

5561 

5567 

-5568 

5633 

5678- 

5679 

5731- 

5735 

5769 

5853 

-5855 

5945 

-5948 

5964- 

5966 

6013 

6058 

-6060 

60B7 

-6090 

6213 

-6217 

6236- 

6241 

6251 

6260 

-6262 

6291 

-6292 

6627 

-6628 

6703- 

6704 

6799- 

6804 

6828 

-6832 

6840 

§949 

7003- 

7004 

7111 

-7112 

7151 

7274 

-7275 

7392 

-7393 

7443 

7480 

7671 

7853- 

7854 

7937 

7987 

8052 

-8057 

8086- 

8087 

8110 

-8113 

8436 

8459 
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SEQ 

ACCESSION 

DESCRIPTION 

SMITH- 

% IDENTITY 

ID 

NUMBER 


WATERMAN 


NO. 



SCORE 


1 

V00488 

Homo sapiens alpha globin 

776 

90.141 

2 

V00488 

Homo sapiens alpha globin 

529 

86.400 

3 

M60832 

Homo sapiens alpha-2 type VIII 
collagen 

96 

31.343 

5 

M60832 

Homo sapiens alpha-2 type VIII 
collagen 

266 

31.092 

6 

Z93394 

Caenorhabditis elegans similar to 
Probable rabGAP domains 

54 

33 .333 

7 

Z93394 

Caenorhabditis elegans similar to 
Probable rabGAP domains 

782 

46.324 

8 

X55777 

Homo sapiens put . ORF 

56 

32.432 

9 

X55777 

Homo sapiens put. ORF 

318 

63.750 

10 

S72008 

Homo sapiens CDC10 homolog=hCDC10 

48 

25.000 

11 

S72008 

Homo sapiens CDC 10 homolog=hCDC10 

572 

71.429 

12 

S72008 

Homo sapiens CDC10 homolog=hCDC10 

452 

67.797 

14 

X76088 

Mus musculus DNA binding protein RFX1 

49 

33 .333 

16 

X76088 

Mus musculus DNA binding protein RFX1 

49 

23.077 

17 

X76088 

Mus musculus DfrA binding protein RFX1 

54 

28.205 

18 

X76088 

Mus musculus DNA binding protein RFX1 

56 

29.730 

19 

X76088 

Mus musculus DNA binding protein RFX1 

112 

41.071 

21 

U25281 

Rattus norvegicus SH3 domain binding 
protein 

222 

32.710 

22 

L24521 

Homo sapiens transformation-related 
protein 

47 

33.333 

23 

L24521 

Homo sapiens transformation-related 
protein 

65- 

26.596 

24 

L24521 

Homo sapiens transformation-related 
protein 

297 

63 .855 

26 

AJ004810 

Zea mays cytochrome P4 50 monooxygenase 

105 

49.091 

27 

AJ004810 

Zea mays cytochrome P4 50 monooxygenase 

54 

36.111 

28 

AJ004810 

Zea mays cytochrome P450 monooxygenase 

442 

56.477 

29 

X55777 

Homo sapiens put. ORF 

219 

52 .055 

30 

X74370 

Homo sapiens mucin 

130 

38 .554 

31 

AF189307 

Homo sapiens unknown 

136 

45.000 ! 

32 

X55777 

Homo sapiens put . ORF 

347 

67 .089 

33 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

291 

65 .476 

34 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

982 

54 .315 

35 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

385 

43 .580 

36 

V00488 

Homo sapiens alpha globin 

776 

90 .141 

38 

AF182293 

Homo sapiens U6 snRNA- associated Sm- 
like protein LSm7 

213 

67 . 857 

4 0 

D90041 

Homo sapiens arylamine N- 
acetyl transferase 

481 

79.000 

41 

D90041 

Homo sapiens arylamine N- 
acetyl transferase 

1320 

97 .500 

42 

AL078468 

Arabidopsis thaliana putative protein 

194 

26 .425 

43 

AL078468 

Arabidopsis thaliana putative protein 

438 

52 .518 

44 

M15530 

Homo sapiens B-cell growth factor 

145 

59 .091 


unci 
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47 

X55777 

Homo sapiens put* ORP 

337 

73.684 

48 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

256 

68.333 

49 

A31036 

Nicotiana alata PRP2 

101 

28.037 

50 

A31036 

Nicotiana alata PRP2 

125 

32.258 

51 

AL031177 

Homo sapiens COL4A6 (Collagen Alpha 
6 (IV)) 

118 

28.906 

54 

X55777 

Homo sapiens put . ORF 

377 

76.316 

55 

AF084256 

Homo sapiens beta glucuronidase 
isof orm d ' 

153 

56.522 

3D 

X65165 

Volvox carteri extensin 

276 

31.925 

-> / 

Add / / / 

Homo sapiens put . ORF 

47 

29.032 

c o 

X55777 

Homo sapiens put* ORF 

48 

28.261 

59 

X55777 

Homo sapiens put. ORF 

49 

52.941 

^ A 

6 0 

X55777 

Homo sapiens put. ORF 

48 

31.429 

61 

X55777 

Homo sapiens put . ORF 

52 

25.926 

62 

X55777 

Homo sapiens put . ORF 

44 

40.000 

63 

X55777 

Homo sapiens put. ORF 

50 

25.000 

64 

X55777 

Homo sapiens put. ORF 

47 

30.357 

65 

X55777 

Homo sapiens put. ORF 

51 

17.647 

66 

X5577 7 

Homo sapiens put. ORF 

281 

54.945 

68 

X55777 

Homo sapiens put. ORF 

317 

67.089 

69 

X55777 

Homo sapiens put. .ORF 

318 

69.444 

71 

M88700 

Homo sapiens dopa decarboxylase 

483 

73.000 

72 

U79260 

Homo sapiens unknown 

118 

39.394 

73 

M19301 

Homo sapiens alpha -keto acid 
dehydrogenase precursor 

468 

75.352 

74 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

368 

50.633 

75 

X84692 

Mus musculus spermatid perinuclear RNA 
binding protein 

459 

97.260 

76 

X84692 

Mus musculus spermatid perinuclear RNA 
binding protein 

483 

95.122 

78 

AF196972 

Homo sapiens JM24 protein 

48 

36.364 

79 

AF196972 

Homo sapiens JM24 protein 

347 

55.769 

81 

L26953 

Homo sapiens chromosomal protein 

147 i 

35.652 

82 

L26953 

Homo sapiens chromosomal protein 

224 

34.225 1 

83 

U04627 

Homo sapiens 78 kDa gastrin-binding 
protein 

643 

78.417 

85 

U41806 

Homo sapiens p60 

795 

84.967 

87 

X55777 

Homo sapiens put. ORF 

243 

56.000 

89 

U76992 

Homo sapiens Tat-SFl 

335 

77.333 

90 

AF071081 

Mycobacterium tuberculosis proline- 
rich mucin homolog 

333 

29.805 

91 

AF083501 

Macaca mulatta rhadinovirus 17577 
latent nuclear antigen 

202 

25.000 

93 

X71442 

Rattus norvegicus ORF 1; putative 

162 

37.391 

94 

W\ 0"X 04. 

Kauuus norvegicus cardiac myosin lignt 
chain 2 

529 

79 .290 

95 

AB013606 

Oryzias latipes elongation factor 1 
alpha 

2355 

79.870 

97 

AB013606 

Oryzias latipes elongation factor 1 
alpha 

1284 

59.845 

98 

AB013606 

Oryzias latipes elongation factor 1 

982 

8X .818 


± O iff III Mh& . O S! a ift O ).:•:!: 




alpha 



99 

AB013606 

Oryzias latipes elongation factor 1 
alpha 

1832 

74.333 

100 

AC003027 . 

Arabidopsis thaliana lcl|prt_seq No 
definition line found 

303 

37.991 

101 

AC003027 

Arabidopsis thaliana lcl|prt_seq No 
definition line found 

386 

50.735 

103 

M81757 

Homo sapiens S19 ribosomal protein 

434 

80. 159 

104 

M34371 

Nicotiana plumbaginifolia extensin 
precursor 

197 

26.207 

105 

AF030001 

Mus musculus unknown 

102 

28.713 

107 

AC006232 

Arabidopsis thaliana putative proline- 
rich protein PRP2 precursor 

344 

26.837 

108 

X79802 

Drosophila melanogaster WM6 

244 

54.255 

109 

U65092 

Homo sapiens melanocyte -specific gene 
1 nuclear protein 

159 

55.932 

111 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

190 

40.909 

113 

M83179 

Gallus gallus alpha-3 type IX collagen 

142 

28.571 

114 

M68867 

Homo sapiens retinoic acid binding 
protein II 

35 

50.000 

115 

M68867 

Homo sapiens retinoic acid binding 
protein II 

852 

99.275 

116 

X55777 

Homo sapiens put. ORF 

102 

73.684 

117 

X55777 

Homo sapiens put. ORF 

368 

73.077 

119 

U58658 

Homo sapiens unknown 

209 

49.485 

121 

AC006233 

Arabidopsis thaliana unknown protein 

436 

28.288 

123 

AP000061 

Aeropyrum pernix 114 aa long 
hypothetical protein 

73 

31.707 

125 

AP000061 

Aeropyrum pernix 114 aa long 
hypothetical protein 

42 

40.000 

12 6 

AP000061 

Aeropyrum pernix 114aa long 
hypothetical protein 

128 

46.429 

127 

X55777 

Homo sapiens put . ORF 

302 

68.919 

128 

X55777 

Homo sapiens put. ORF 

256 

61.039 

130 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

147 

45.763 

131 

Z71614 

Saccharomyces cerevisiae ORF YNL33 8w 

84 

42.857 

135 

M12937 

Homo sapiens ferritin heavy subunit 

337 

82.857 

136 

M12937 

Homo sapiens ferritin heavy subunit 

49 

54.545 

137 

M12937 

Homo sapiens ferritin heavy subunit 

313 

59.391 

138 ; 

L24521 

Homo sapiens transformation-related 
protein 

239 

58.889 

139 

S80905 

Homo sapiens Conl=*salivary 
concanavalin-A binding protein fexon 
3} 

290 

27.594 

140 

M12937 

Homo sapiens ferritin heavy subunit 

337 

82.857 

141 

M12937 

Homo sapiens ferritin heavy subunit 

252 

42.442 

142 

M12937 

Homo sapiens ferritin heavy Bubunit 

298 

55.941 

143 

X55777 

Homo sapiens put. ORF 

160 

36.607 

144 

X68142 

Oryctolagus cuniculus elongation 
factor 1 gamma 

862 

97.674 

145 

X6B142 

Oryctolagus cuniculus elongation 
factor 1 gamma 

2265 

87.557 
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146 

X92098 

Homo sapiens transmembrane protein 

417 

70.476 

147 

M86511 

Homo sapiens monocyte antigen CD14 

276 

74 . 627 

149 

AB011169 

Homo sapiens KIAA0597 protein 

269 

41.000 

150 

Z77132 

Unknown Similarity to Worm protein 
C05C12.3; cDNA EST yk224bl0.3 comes 
from this gene; cDNA EST yk2 

460 

44.382 

151 

Z77132 

Unknown Similarity to Worm protein 
C05C12.3; cDNA EST yk224bl0.3 comes 
from this gene; cDNA EST yk2 

54 

35.714 

152 

AF039694 

Homo sapiens antigen NY-CO- 16 

369 

81.818 

153 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

871 

48.866 

154 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1520 

69.895 

155 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

349 

90.000 

156 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2923 

98.681 

157 

AJ001019 

Homo sapiens ring finger protein 

53 

36.000 

158 

AJ001019 

Homo sapiens ring finger protein 

380 

46.667 

159 

AJ001019 

Homo sapiens ring finger protein 

14 3 

62.222 

160 

S80119 

Rattus sp. reverse transcriptase 
homo log 

193 

27.974 

161 

AF008551 

Homo sapiens aurora-related , kinase 1 

236 

35.602 

162 

U93569 

Homo sapiens putative pl50 

1209 

57.944 

163 

U93569 j 

Homo sapiens putative pl50 

1346 

61.877 

164 

X53581 

Rattus norvegicus ORF3 

43 

22.222 

165 

X53581 

Rattus norvegicus ORF3 

58 

57.143 

166 

X53581 

Rattus norvegicus ORF3 

58 

57.143 

167 

X53581 

Rattus norvegicus ORF3 

53 

32.500 

168 

X53581 

Rattus norvegicus ORF3 

50 

33.333 

170 

U58658 

Homo sapiens unknown 

190 

55.556 

172 

Y15173 

Human papillomavirus type 75 E4 
protein 

133 

35.955 | 

173 

Y15173 

Human papillomavirus type 75 E4 
protein 

63 

40.909 

174 

Y15173 

Human papillomavirus type 75 E4 
protein 

44 

28.125 

175 

Y15173 

Human papillomavirus type 75 E4 
protein 

196 

34.965 

176 

U49974 

Homo sapiens mariner transposase 

813 

68.511 

177 

Z97628 

Caenorhabditis elegans Similarity to 
Brugia peptidylprolyl isomerase 
(TR:G984562) 

112 

50.000 

178 

Z97628 

Caenorhabditis elegans Similarity to 
Brugia peptidylprolyl isomerase 
(TR:G984562) 

889 

50.173 

179 

Z97628 

Caenorhabditis elegans Similarity to 
Brugia peptidylprolyl isomerase 
(TR:G984562) 

553 

50.955 

180 

U23484 | 

Caenorhabditis elegans weakly similar 
to serine/threonine protein kinase 

58 

27.619 

181 

U23484 

Caenorhabditis elegans weakly similar 
to serine/threonine protein kinase 

47 

31.944 



183 

U23484 

Caenorhabditis elegans weakly similar 
to serine/threonine protein kinase 

219 

35.294 

185 

X55777 

Homo sapiens put . ORF 

363 

65.882 

186 

X98494 

Homo sapiens M phase phosphoprotein 10 

138 

20.936 

187 

X98494 

Homo sapiens M phase phosphoprotein 10 

572 

66.667 

189 

X98494 

Homo sapiens M phase phosphoprotein 10 

607 

57.965 

190 

X55777 

Homo sapiens put. ORF 

306 

65.000 

191 

L24521 

Homo sapiens transformation-related 
protein 

335 

65.957 

193 

U79260 

Homo sapiens unknown 

189 

43.421 

194 

U79260 

Homo sapiens unknown 

49 

31.250 

195 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

254 

34.704 

196 

U79260 

Homo sapiens unknown 

164 

53.226 

199 

X55777 

Homo sapiens put. ORF 

115 

80.769 

200 

X55777 

Homo sapiens put. ORF 

310 

68.354 

201 

U00001 

Homo sapiens Human homologue of S . 
pombe nuc2+ and A. nidulans bimA 

59 

23.729 

202 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

80 

23.200 

203 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

57 

29. 167 

204 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

52 

29. 630 

205 

U00001 

Homo sapiens Human homologue of S . 
pombe nuc2+ and A. nidulans bimA 

55 

29.032 

206 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

56 

20.588 

207 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

55 

29.032 

208 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

57 

29.167 

209 

U00001 

Homo sapiens Human homologue of S. 
pombe nuc2+ and A. nidulans bimA 

91 

28.099 

210 

AF145615 

Drosophila melanogaster BcDNA.GH03377 

577 

38.542 

211 

AF145615 

Drosophila melanogaster BcDNA.GH03377 

337 

56.452 

212 

M15530 

Homo sapiens B-cell growth factor 

140 

77.143 

213 

L23646 

Caenorhabditis elegans homology with 
dnaJ; putative 

84 

36.585 

214 

L23646 

Caenorhabditis elegans homology with 
dnaJ; putative 

247 

29.224 

215 

L23646 

Caenorhabditis elegans homology with 
dnaJ; putative 

46 

25.000 

216 

M15530 

Homo sapiens B-cell growth factor 

156 

61.364 

217 

D90901 

Synechocystis sp. hypothetical protein 

219 

39.456 

218 

L10908 

Mus musculus Gcapl gene product 

126 

33.333 

219 

AF010144 

Homo sapiens neuronal thread protein 

J\U / C JM Xlr 

351 

8X.333 

220 

U93569 

Homo sapiens putative pl50 

195 

37.288 

221 

L27428 

Homo sapiens reverse transcriptase 

35 

50.000 

222 

L27428 

Homo sapiens reverse transcriptase 

498 

45 .525 

225 

U21603 

Jembrana disease virus tat protein 

116 

38 .462 

226 

X55777 

Homo sapiens put . ORF 

235 

61.111 

227 |AL022120 

Mycobacterium tuberculosis 

183 

25 .869 
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228 

L24521 

Homo sapiens transformation-related 
protein 

279 

53.448 

230 

X51394 

Xenopus laevis APEG precursor protein 

132 

25.789 

231 

L24521 

Homo sapiens trans format ion -related 
protein 

415 

69.000 

232 

M22334 

Homo sapiens unknown protein 

349 

44 .944 

233 

X87212 

Homo sapiens cathepsin C 

63 

100.000 

234 

XB7212 

Homo sapiens cathepsin C 

503 

98.667 

235 

X87212 

Homo sapiens cathepsin C 

1052 

95.152 1 

236 

X87212 

Homo sapiens cathepsin C 

3084 

99.352 

237 

X87212 

Homo sapiens cathepsin C 

466 

94 .340 

238 

L24521 

Homo sapiens transformation-related 
protein 

410 

65.517 

239 

X61712 

Homo sapiens HLA-A*6602 antigen 

227 

52.273 

240 

X61712 

Homo sapiens HLA-A*6602 antigen 

842 

95.455 

243 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

170 

40.278 

245 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

191 

52.174 

246 

X53581 

Rattus norvegicus ORF2 

154 

45.000 

247 

U82987 

Homo sapiens Bel -2 binding component 3 

134 

29.891 

250 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

435 

55.970 

251 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

303 

53.846 

252 

X55777 

Homo sapiens put . ORF 

286 

69.697 

253 

M13043 

Mus musculus alpha- 1 (IV) collagen 

74 

30.612 

254 

M13043 

Mus musculus alpha-1 (IV) collagen 

14 9 

39. 080 

256 

X55777 

Homo sapiens put . ORF 

169 

74.359 

258 

X55777 i 

Homo sapiens put . ORF 

281 

66.667 

259 

M80344 

Homo sapiens ORF1 codes for a 40 kDa 
product 

151 

26.471 

260 

X58523 

Mus musculus MIPP 

233 

30.882 

264 

AF016448 

Caenorhabditis elegans No definition 
line found 

90 

29.762 

265 

AP016448 

Caenorhabditis elegans No definition 
line found 

171 

25.346 

266 

X65063 

Giardia muris unknown 

134 

24.528 

267 

X65063 

Giardia muris unknown 

108 

29.703 

268 

X65063 

Giardia muris unknown 

109 

30.198 

269 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

113 

35.185 

270 

AF010144 

Homo sapiens neuronal thread protein j 
AD7C-NTP 

485 

56.805 

271 

AF187988 

Homo sapiens zinc finger protein 
ZNF222 

595 

76.552 



Homo sapiens p4 0 

286 

32.338 i 

273 

X55777 

Homo sapiens put . ORF 

47 

35.484 

274 

X55777 

Homo sapiens put . ORF 

317 

65.789 

276 

M25282 

Paracentrotus lividus alpha collagen 
type 1 precursor 

212 

27.624 

277 

AC003973 

Homo sapiens ZNF91L 

67 

25.600 

278 

AC003973 

Homo sapiens ZNF91L 

1223 

46.833 
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279 

X51394 

Xenopus laevis APEG precursor protein 

162 

33.108 

281 

M64793 

Rattus norvegicus salivary proline- 
rich protein 

89 

35.385 

282 

M64793 

Rattus norvegicus salivary proline- 
rich protein 

255 

32.061 

283 

D86971 

Homo sapiens no similarities to 
reported gene products 

52 

24.324 

284 

D86971 

Homo sapiens no similarities to 
reported gene products 

788 

55. 022 

285 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

201 

53.731 

286 

V01555 

Human herpesvirus 4 BKRF1 encodes ' 
EBNA-1 protein, latent cycle gene. 

80 

35.556 ; 

287 

V01555 

Human herpesvirus 4 BKRF1 encodes 
EBNA-1 protein, latent cycle gene. 

950 

65.468 

288 

Z46242 

Caenorhabditis elegans similar to WD 
domain, G-beta repeat; cDNA EST 
yk283e3.3 comes from this gene; cDNA 
EST yk236e2.3 comes from this gene; 
cDNA EST yk283e3.5 comes from this 
gene; cDNA EST yk238e2.5 comes from 
this gene 

311 

40.881 

289 

Z46242 

Caenorhabditis elegans similar to WD 
domain, G-beta repeat; cDNA EST 
yk283e3.3 comes from this gene; cDNA 
EST yk238e2.3 comes from this gene; 
CDNA EST yk283e3.5 comes from this 
gene; cDNA EST yk238e2.5 comes from 
this gene 

873 

4 8.718 

290 

X73675 

Human herpesvirus 6 ORF5 

259 

57.143 

293 

M15530 

Homo sapiens B-cell growth factor 

139 

64.665 | 

295 

D13951 

Nicotiana tabacum extensin precursor 

183 

34.677 

297 

X55777 

Homo sapiens put. ORF 

265 

70.769 

299 

U00032 

Caenorhabditis elegans No definition 
line found 

336 

43.243 

301 

L24521 

Homo, sapiens transformation-related 
protein 

331 

54.918 

3 02 

X55684 

Lycopersicon esculentum extensin 
(class I) 

83 

37.879 

303 

M12099 

Mus musculus proline-rich protein 

180 

34.320 

3 04 

U47856 

Araneus diadematus fibroin- 4 

179 

41.558 i 

3 05 

X88799 

Oryza sativa DNA binding protein 

129 

41.791 

307 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

67 

31.250 

308 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

98 

32.184 

309 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

769 

100.000 

310 

Z25535 

n^iiiu sdpiens nucxear pore complex 
protein hnupl53 

65 

31.731 

311 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

58 

22 .751 

312 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

65 

37.931 

313 | 

Z25535 

Homo sapiens nuclear pore complex 

83 

30.159 
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314 

Z25535 

protein hnupl53 

Homo sapiens nuclear pore complex 
protein hnupl53 

52 

32 30fi 1 

315 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

233 

1 00 nftn 1 

J.UU . UUU I 

316 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

60 

3 o oon f 

317 

225535 

Homo sapiens nuclear pore complex 
protein hnupl53 

j 70 

22 "Ml 1 

318 

Z25535 

Homo sapiens nuclear pore complex 
protein hnupl53 

9658 

100.000 

319 

Z25535 

Homo sapiens nuclear pore complex 
protein hnup!53 

70 

23.333 

320 

Z25535 

nuillu oapiens nuciear pore complex 
protein hnupl53 

1579 

40.405 j 

321 

Z81138 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:D65543 comes 
from this gene 

130 

33.824 | 

324 
325 
326 

L76159 
L76159 
1*76159 

Homo sapiens FRG1 gene product 
Homo sapiens FRG1 gene product 
Homo sapiens FRG1 oene nrnHnrf 

391 
74 

73.469 
81.250 1 

327 
328 
329 
332 
333 
334 
335 

L76159 
L76159 
X55777 
L19267 
L1926? 
L19267 
L19267 

Homo sapiens FRG1 gene product 
Homo sapiens FRG1 gene product 
Homo sapiens, put. ORF 
Homo sapiens putative 
Homo sapiens putative 
Homo sapiens putative 

Homo sapiens putative n 

1 534 
540 
1029 
331 
55 
59 
65 

77.698 | 

78.125 1 

79.565 j 

71.429 I 

60.000 

2 8.205 j 

27.731 1 

336 
337 
339 
340 
341 

L19267 
L19267 
L19267 
L19267 
M11902 

Homo sapiens putative 
Homo sapiens putative 

Homo sapiens putative j 
Homo sapiens putative | 
Mus musculus proline-rich salivary 
protein 1 

104 

86 

90 

380 

69 

192 

62.963 J 
2 6.271 j 
32.258 f 
50.694 1 
29.851 j 

-> / . \J Z? 

342 

M19537 

Drosophila melanogaster RNA polymerase 
II largest subunit (, EC 2.7.7.6) | 

168 

2 6.829 ] 

343 

K03204 

Homo sapiens salivary proline-rich 
protein precursor 1 

755 

95.413 1 

344 

K03204 

Homo sapiens salivary proline-rich 
protein precursor | 

1223 

84.865 

345 

AF010144 

Homo sapiens neuronal thread protein j 
AD7C-NTP 

56 

4 1 . 667 I 

346 

AF010144 

Homo sapiens neuronal thread protein S 

AD7C-NTP 

56 

41.667 j 

347 

AF010144 

Homo sapiens neuronal thread protein J 
AD7C-NTP 

53 

34.884 j 

348 

AF010144 

riKJixwj ticipxeiiu neuronal cnreao protein 1 
AD7C-NTP j 

126 

56.250 

349 
350 

AF010144 
X55777 

Homo sapiens neuronal thread protein f 
AD7c-NTP 

Homo sapiens put. ORF 

342 

46.445 I 

351 
352 

AF000195 
AF000195 

Caenorhabditis elegans similar to I 
oxysterol -binding proteins I 
Caenorhabditis elegans similar to 

307 
68 

SB. SSI 
39.286 I 




1021 

40.909 J 
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oxysterol -binding proteins 



353 

AF167320 

Mus musculus zinc finger protein 
ZFP113 

846 

67.660 

355 

AF109402 

Mus musculus neurotrophic factor 
artemin precursor 

152 

35.766 

356 

L27428 

Homo sapiens reverse transcriptase 

51 

20. 183 

358 

L27428 

Homo sapiens reverse transcriptase 

452 

54 .745 

360 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

348 

66.346 

361 

X55777 

Homo sapiens put. ORF 

388 

75.000 

362 

A31800 

Homo sapiens bile salt-stimulated 
lipase 

4908 

95.861 

363 

A31800 

Homo sapiens bile salt-stimulated 
lipase 

1998 

73.585 

364 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

381 

52.151 

365 

V00148 

Caenorhabditis elegans unnamed protein 
product 

170 

30.457 

366 

AF003540 

Homo sapiens Krueppel family zinc 
finger protein 

47 

28.889 

367 

AF003540 

Homo sapiens Krueppel family zinc 
finger protein 

81 

36.364 

368 

AF003540 

Homo sapiens Krueppel family zinc 
finger protein 

56 

25.000 

369 

AF003540 

Homo sapienB Krueppel family zinc 
finger protein 

151 

37.984 

370 

AJ010099 

Homo sapiens NKp44RGl 

171 

27.411 

371 

AE001904 

Deinococcus radiodurans hypothetical 
protein 

170 

23.011 

372 

AE001904 

Deinococcus radiodurans hypothetical 
protein j 

165 

33.333 ! 

373 

AF008554 

Rattus norvegicus. implantation- 
associated protein i 

537 

79.213 

374 

S58722 

Homo sapiens X-linked retinopathy { 
protein {c- terminal, clone XEH.8c} ! 

243 j 

70.370 

375 

AF010144 

Homo sapiens neuronal thread, protein 
AD7c-NTP 1 

286 

53.077 

J / / 

AF132 3,77 

Drosophila melanogaster unknown | 

94 j 

61.905 

378 

AF132177 

Drosophila melanogaster unknown j 

199 j 

52.564 i 

382 

L24521 

Homo sapiens transformation-related 
protein j 

334 

5 8.4 62 

TOO 

Jo J 

U4 7856 

Araneus diadematus fibroin-4 | 

245 j 

31.193 

384 

AJ004810 

Zea mays cytochrome P450 mohooxygenase 

54 i 

36.111 

385 

AJ004810 

Zea mays cytochrome P450 monooxygenase j 

408 

55.026 1 

386 

L23429 

Canis familiaris beta-galactbsides- 1 
binding lectin 1 

160 1 

34.783 

0 *7 
JO/ 


Mus musculus ORF j 

150 1 

32.061 

388 

AF177942 

Xenopus laevis Icatanin p60 I 

1 1 oc 1 

5 3 . 864 

389 

M96232 

Litomosoides carinii major 
microfilaria! sheath protein j 

13 8 

27.612 

390 

Z48045 

Caenorhabditis elegans sre-2 \ 

432 j 

48.175 ■ 

391 

Z48045 

Caenorhabditis elegans sre-2 | 

165 

56.757 

3 92 

Z48045 

Caenorhabditis elegans sre-2 ] 

202 

29.437 

393 

Z48045 

Caenorhabditis elegans sre-2 j 

670 ( 

49.515 


IOT72. 



3 94 

248045 

Caenorhabditis elegans sre-2 

57 

22 .500 

395 

X55777 

Homo sapiens put. ORF 

328 

60.920 

396 

AL133087 

Homo sapiens hypothetical protein 

124 

23 . 203 

397 

Z82268 

Unknown predicted using Genefinder ; 
similar to CUTICLE COLLAGEN 34; cDNA 
EST EMBL:D6562 9 comes from 

83 

30.889 

398 

Z8?268 

Unknown predicted using Genefinder; 
gimilar to CUTICLE COLLAGEN 34; cDNA 
EST EMBL:D65629 comes from 

116 

30 . 435 

399 

U63542 

Homo sapiens FAP protein 

176 

41.346 

400 

S58722 

Homo sapiens X-linked retinopathy 
protein {C- terminal, clone XEH.8c} 

209 

70.000 

401 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

377 

66.346 

402 

Z49209 

Saccharomyces cerevisiae unknown 

159 

27.687 

403 

L24521 

Homo sapiens transformation-related 
protein 

434 

38.416 

405 

L24521 

Homo sapiens transformation-related 
protein 

354 

66.667 

406 

L24521 

Homo sapiens transformation-related 
protein 

369 

59.524 

407 

X82200 

Homo sapiens gpStafSO 

606 

57.988 

408 

AF071081 

Mycobacterium tuberculosis proline - 
rich mucin homolog 

221 

28.918 

409 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

307 

48.322 

410 

M22332 

Homo sapiens unknown protein 

140 

35.165 

411 

X55777 

Homo sapiens put . ORF 

47 

30.000 

412 

X55777 

Homo sapiens put. ORF 

141 

54.348 

413 

X55777 

Homo sapiens put . ORF 

157 

54.545 

414 

K02550 

Oncorjiynchus mykiss 70-kilodalton heat 
shock protein 

114 

59.524 

415 

K02550 

Oncorhynchus mykiss 70-kilodalton heat 
shock protein 

552 

67.421 

416 

D17554 

Homo sapiens TAXREB 107 

43 

31.818 

417 

D17554 

Homo sapiens TAXREB 10 7 

1656 

98.962 

418 

D17554 

Homo sapiens TAXREB107 

484 

87. 379 

419 

X14690 

Homo sapiens lambda HuHITI-13 

3406 * 

96.514 

420 

X14690 

Homo sapiens lambda HuHITI-13 

409 

35.740 

421 

X14690 

Homo sapiens lambda HuHITI-13 | 

355 

100.000 

422 

X14690 

Homo sapiens lambda HuHITI-13 S 

387 

79.07a 

423 

X14690 ! 

Homo sapiens lambda HuHITI-13 | 

3266 

93.939 

424. 

X14690 

Homo sapiens lambda HuHITI-13 

656 

91.270 

425 

Z81038 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk488a2.5 comes 
from this gene 

216 

46.000 

426 

X57960 

Mus musculus ribosomal protein L7 

422 

48.026. 

427 

X57960 

Mus musculus ribosomal protein L7 

397 

48.716 

428 

X57960 

Mus musculus ribosomal protein L7 

239 

53.409 

430 

U96174 

Onchocerca volvulus 0vB8 

153 

35.135 

431 

U96174 

Onchocerca volvulus OvB8 

489 

39.908 

432 

AP000399 

Oryza sativa EST AU056133 (S20320) 
corresponds to a region of the 
predicted gene; similar to 

126 

63.636 
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Caenorhabditis elegans cosmid D1054; 
hypothetical protein (Z74030) 


~] 1 

433 

AP000399 

Oryza sativa EST AU056133 (S20320) 
corresponds to a region of the 
predicted gene; similar to 
Caenorhabditis elegans cosmid D1054; 
hypothetical protein (Z74030) 

414 

52 .239 1 

/l "J A 

1*76159 

Homo sapiens FRG1 gene product 

361 



74.725 _ 

AIR 


Homo sapiens FRG1 gene product 

1033 

93.333 1 



Homo sapiens FRG1 gene product 

599 

78.378 | 

A ^ *7 


Mus musculus embigin precursor 

242 

53 .750 

4 1 Q 

^ J J 

lull C "5 D (T 

M1538 6 

Homo sapiens gamma -globin 

897 

93.421 f 

440 

M15386 

Homo sapiens gamma -globin 

631 

77.640 

4 i 1 

L10110 

Octopus dofleini alpha tubulin 

358 

73.333 

/I A O 

44^ 

U6384 0 

Rattus norvegicus nucleoporin p54 

701 

67.039 

>i /i i 
44 3 

U6384 0 

Rattus norvegicus nucleoporin p54 

192 

51.852 

444 

U63840 

Rattus norvegicus nucleoporin p54 

1162 

69.381 | 

445 

U63840 

Rattus norvegicus nucleoporin p54 

1155 

88.835 

447 

AJ011118 

Mus musculus skeletal muscle and 
cardiac protein 

1408 

I 85.714 j 

448 

AB033021 

Homo sapiens KIAA1195 protein 

3187 

98.952 j 

449 

AL133204 

Homo sapiens hypothetical protein 

473 

67.480 j 

450 

U60269 

Homo sapiens putative envelope 
protein; orf similar to env of Type A 
and Type B retroviruses and to class 
II HERVs 

495 

78.689 j 

453 

AC007842 

Homo sapiens BC331191 1 

649 

50.192 j 

454 

AB033055 

Homo sapiens KIAA1229 protein 

4386 

100.000 j 

455 

AB033055 

Homo sapiens KIAA122 9 protein 

501 

92.000 | 

456 

M14676 

Homo sapiens src-like tyrosine kinase 
(put. ) ; putative 

396 

77.922 

457 

M14676 

Homo sapiens src-like tyrosine kinase 
(put . ) ; putative | 

3580 | 

99.628 

j 

458 

M14676 

Homo sapiens src-like tyrosine kinase - 
(put.) ; putative 

384 

100.000 | 
1 

ACQ 

AL03 1852 

Schizosaccharomyces pombe conserved ] 
hypothetical protein 1 

320 1 

39.041 

A d f\ 

460 

AB020499 

Homo sapiens MD-1 homologue f 

43 

34.783 j 

4ol 

AB020499 

Homo sapiens MD-l homologue | 

40 

50.000 | 

AC* 

4o2 

AB020499 

Homo sapiens MD-1 homologue 

120 

43.333 j 

A C A 

4o4 

AF109134 

Homo sapiens 7-60 i 

427 i 

41.079 | 

A C C 

D25215 

Homo sapiens KIAA0032 

253 1 

5X.316 | 

4oo 

S79410 

Mus sp. nuclear localization signals 
(NLS) -binding protein=spot-l 

117 

46.939 I 

A £Z Q 

X96722 

Saccharomyces cerevisiae ORF N0885 | 

224 

55.263 | 

470 

X96722 

Saccharomyces cerevisiae ORF NO 8 85 

53 

35.294 I 

4 /l 

X96722 

Saccharomyces cerevisiae ORF N0885 I 

106 

30.435 i 

472 

X96722 

Saccharomvces eeirfwf rn?i? vrnoQc 1 

714 J 

35.669 

473 

X96722 

Saccharomyces cerevisiae ORF N0885 | 

64 

40.000 I 

474 

M11902 

Mus musculus proline-rich salivary 
protein | 

157 

42.857 

475 

AF010144 

Homo sapiens neuronal thread protein [ 
AD7C-NTP J 

329 

70.652 

476 

AF036705 

Unknown Similar to phytoene 

90 | 

60.000 j 
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desaturase; coded for by C. elegans 
CDNA CEESX74F; coded for by C. el 



477 

AF036705 

Unknown Similar to phytoene 
desaturase; coded for by C. elegans 
cDNA CEESX74F; coded for by C. el 

613 

50.971 

478 

AB015630 

Homo sapiens type. 11 membrane protein 

280 

38. 158 

479 

AF146793 

Mus musculus H5AR 

360 

57.258 

481 

AF114816 

Homo sapiens KRAB-zinc finger protein 
SZF1-1 

2445 

100. 000 

482 

AF114816 

Homo sapiens KRAB-zinc finger protein 
SZF1-1 

703 

95.455 

483 

AF021833 

Mus musculus protein kinase A 
anchoring protein 

2330 

94.086 

484 

AF021833 

Mus musculus protein kinase A 
anchoring protein 

1828 

69.933 

485 

AF021833 

Mus musculus protein kinase A 
anchoring protein 

434 

48.454 

486 

AF021833 

Mus musculus protein kinase A 
anchoring protein 

570 

86.275 

488 

M32334 

Homo sapiens intercellular adhesion 
molecule 2 (ICAM-2) 

404 

87.640 

489 

M32334 

Homo sapiens intercellular adhesion 
molecule 2. (ICAM-2) 

571 

88 . 000 

4 90 

M32334 

Homo sapiens intercellular adhesion 
molecule 2 (ICAM-2) 

1149 

81.597 

491 

AF113251 

Homo sapiens putative 2-hydroxyacid 
dehydrogenase 

644 

100.000 

492 

AF113251 

Homo sapiens putative 2-hydroxyacid 
dehydrogenase 

572 

97.701 

493 

AF113251 

Homo sapiens putative 2-hydroxyacid 
dehydrogenase 

253 

95.122 

494 

AF113251 

Homo sapiens putative 2-hydroxyacid 
dehydrogenase 

1321 

94.378 

A Q C 

4 95 

X83413 

Human herpesvirus 6 U88 

527 

54.872 

4 96 

AC004528 

Homo sapiens R32184 3 

933 

60.357 

497 

AC004528 

Homo sapiens R32184 3 

624 

91.176 

A Q Q 

U79260 

Homo sapiens unknown 

266 

63.855 

499 

V00488 

Homo sapiens alpha globin 

925 

100.000 

500 

V00488 

Homo sapiens alpha globin 

320 

67.939 

301 

N15386 

Homo sapiens gamma -globin 

897 

93.421 

502 

M153 86 

Homo sapiens gamma -globin 

456 

74.453 

504 

U93571 

Homo sapiens p40 

371 

55.085 

505 

U93571 

Homo sapiens p40 

326 

47.321 

506 

U93571 

Homo sapiens p40 

184 

46.032 

507 

U93571 

Homo sapiens p4 0 

302 

55.319 

509 

U93571 

Homo sapiens p40 

346 

50. 893 

510 

U93571 

Homo sapiens p40 

363 

2 7.972 

511 


Homo sapiens p40 

396 

47.590 

513 

Z37139 

Unknown similar to guanine nucleotide 
binding protein; cDNA EST EMBI/:T00917 
comes from this gene; c 

920 

60.435 

514 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

188 

67.925 

515 

L19871 

Homo sapiens activating transcription 

242 

72.881 


toll* 
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factor 3 



516 

L19S71 

Homo sapiens activating transcription 
factor 3 

980 

78.947 

517 

AL033514 

Caenorhabditis elegans Y75B8A.12 

244 

35.455 

519 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

301 

83 .929 

520 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

142 

78.571 

521 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

31 

36.842 

522 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

174 

83 .333 

523 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

54 

35.714 

524 

AF003535 

Homo sapiens 0RF2-like protein 

816 

80.000 

525 

AF003535 

Homo sapiens ORF2-like protein 

96 

42.000 

526 

AF003535 

Homo sapiens 0RF2-like protein 

1161 

74.793 

527 

Y12431 

Mus musculus 5S ribosomal protein 

353 

100.000 

528 

Y12431 

Mus musculus 5S ribosomal protein 

231 

94.444 

530 


Plasmodium falciparum 3* end., gene 
product 

37 

30.556 

531 


Plasmodium falciparum 3* end., gene 
product 

206 

28.636 

532 

X78933 

Homo sapiens zinc finger protein 

1627 

60.574 

533 

L24521 

Homo sapiens transformation-related 
protein 

254 

63.043 

534 

AL050321 

Homo sapiens dJ717M23.1 (novel gene) 

5126 

100.000 

535 

AL050321 

Homo sapiens dJ717M23.1 (novel gene) 

352 

96.386 

536 

X53581 

Rattus norvegicuB ORF2 

139 

54. 000 

53 7 

X83413 

Human herpesvirus 6 U8 8 

446 

30.000 

539 

X92485 

Plasmodium vivax pval 

144 

56.863 

541 

AJ235271 

Rickettsia prowazekii unknown 

47 

35.294 

542 

AJ235271 

Rickettsia prowazekii unknown 

265 

41.284 

543 

AF061346 

Mus musculus Edpl protein 

195 

35.211 I 

544 

AF061346 

Mus musculus Edpl protein 

153 

30.464 

545 

AF061346 

Mus musculus Edpl protein 

50 

26.471 

546 

M22463 

Bos taurus ATP synthase gamma subunit 
precursor 

39 

26. 087 

547 

M22463 

Bos taurus ATP synthase gamma subunit 
precursor 

1533 

88.255 

548 

M22463 

Bos taurus ATP synthase gamma subunit 
precursor 

701 

76.923 

549 

X57351 

Homo sapiens 1-8D 

527 

90.909 

550 ! 

X57351 

Homo sapiens 1-8D 

488 i 

76.119 

551 

X57351 

Homo sapiens 1-8D 

383 

76.984 

552 

AJ005870 

Xenopus laevis RalB -binding protein 

157 

54.762 

553 

AJ005870 

Xenopus laevis RalB-binding protein 

2802 

77.632 

554 

AJ"O05 87O 

V V ~J O i V 

Aciiupuo laevis K.a jljd —oinQing protein 

1014 

64 . 746 

555 

L24521 

Homo sapiens transformation-related 
protein 

388 

58. 120 

556 

AF045640 

Caenorhabditis elegans No definition 
line found 

201 

42.353 

557 

U79260 

Homo sapiens unknown 

254 

59.494 

558 

X57960 

Mus musculus ribosomal protein L7 

200 

75. 000 
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559 

AL033512 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:M88962 comes 
from this gene 

118 

55.556 

560 

AL033512 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:M88962 corned 
from this gene 

370 

31.716 

562 

Z83227 

Caenorhabditis elegans predicted using 
Generinaer; weaK similarity to high- 
suipaur Jveracina . ; cjjxvia tsoi yKbbiai. J 
comes from this aene 

127 

40.000 

564 

U792 60 

Homo s r5T) i £*Tn fi unknown 

/ jj 

ol. 429 

565 

M60177 

Escherichia coli enterobart"in 

CO 


566 

M60177 

Escherichia coll pnfprnharhin 

o / 


567 

M60177 

CiOV>UCi Xv^Uia L.U± X CIA l~ t: J. UJUd v«- L- -LH 

*7 1 
/ -L 

25 . 532 

568 

M60177 

pRrhpri r*V» i A rnl i pnhproharh { n 

uQUUCl i. L>lii.a *_0_L_L CII LC.L. UiJdV U. JLU 

JJ 

40 . 74 1 

569 

M60177 

Escherichia col i pnfprnharhin 

■313 

34 .Job 

570 

M60177 

Escherichia coll pnfprnharhi n 

Off 

95 .035 

571 

M60177 

Escherichia coll pnfproharfin 

O -3 

Z*> . / 24 

572 

M60177 

Escherichia coli pntPT*r»harf i r» 

r> d 

J 1 • 818 

573 

M60177 

Escherichia coli enterobactin 

2057 

94.627 

574 

M60177 


r- O "3 n 

97 . 564 

575 

M60177 

Escherichia coli enterobactin 

1825 

78.426 

_> / O 

£j 1 *4 O D O 

Saccharomyces cerevisiae ORF YOH24C 

137 

26.667 

577 

Z74866 

Saccharomyces cerevisiae ORF YOL124c 

530 

37.553 

578 

r» «-» y« O /? /T 

£* /4o66 

Saccharomyces cerevisiae ORF YOL124c 

245 

32.061 

579 

S58722 

Homo sapiens X- linked retinopathy 
protein Jj^-terminal, clone XEH.8c) 

192 

69.492 

580 

L21936 

Homo sapiens succinate dehydrogenase 
flavoprotein subunit 

681 

92.381 

581 

L21936 

Homo sapiens succinate dehydrogenase 
flavoprotein subunit 

685 

70.21S 


o rr i o c *> 

Homo sapiens NF2=neurof ibromatosis 
type ^ \aicernativeiy spliced, rorm 
A4} 

108 

65.625 

CD/ 
o o *± 

AXi U V V O J* 2* 

Methanobacterium thermoautotrophicum 
conserved protein 

110 

33.333 

JO J 


Homo sapiens hypothetical protein 

343 

76. 923 

586 

D86979 

Homo sapiens KIAA022 6 

158 

72.000 

3D / 

DB6979 

Homo sapiens KIAA0226 

253 

61.017 j 

588 

AC002045 

Homo sapiens Unknown protein product 
CIT987SK-A-589H1 1 splice form 2 

48 

28.750 

589 

AC002045 

Homo sapiens Unknown protein product 
C1T987SK-A-589H1 1 splice form 2 

88 

51.724 

con 

«V U U ,£ U 3 

Homo sapiens Unknown protein product 
CIT987SK-A-589H1 1 splice form 2 

200 

68.085 


tvitioo ft a c 

Homo sapiens Unknown protein product 
CIT987SK-A-589H1 1 splice form 2 

126 

43.662 

592 

U60882 

Rattus norvegicus protein arginine N- - 
methyl transferase 

■1. <£* f -J 

C C TOO 

593 

X55777 

Homo sapiens put . ORF 

310 

67.442 

594 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

111 

31.250 

595 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

332 

70.732 


lorn 
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596 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP . 

310 

66.667 

597 

U79260 

Homo sapiens unknown 

46 

37,037 

599 

M15386 

Homo sapiens gamma -globin 

897 

93.421 

600 

M15386 

Homo sapiens gamma -globin 

528 

76.812 

601 

M15386 

Homo sapiens gamma-globin 

897 

93.421 

602 

M15386 

Homo sapiens gamma -globin 


90.647 

604 

U16031 

Homo sapiens IL-4 Stat 

47 

44.444 

6 05 

U16031 

Homo sapiens IL-4 Stat 

42 

25.7J.4 

608 

M15530 

Homo sapiens B-cell growth factor 

121 

75. 000 

C C\ Q 

M15530 

Homo sapiens B-cell growth factor 

188 

61.702 

610 . 

M15386 

Homo sapiens gamma-globin 

897 

93.421 

611 

M153 86 

Homo sapiens gamma -globin 

531 

71.429 

612 

X55777 

Homo sapiens put . ORF 

284 

61.842 

613 

X55777 

Homo sapiens put . ORF 

173 

59.574 

616 

X55777 

Homo sapiens put - ORF 

37 

55.556 

617 

X55777 

Homo sapiens put . ORF 

239 

55.556 

621 

AJ238096 

Homo sapiens Lsm4 protein 

134 

28.030 

623 

M15386 

Homo sapiens gamma -globin 

897 

93.421 

624 

M15386 

Homo sapiens gamma -globin 

627 

81.132 

625 

U01317 

Homo sapiens G-gamma globin 

889 

99.259 

626 

U01317 

Homo sapiens G- gamma globin 

695 

92.647 

627 

Z98763 

Schizosaccharomyces pombe hypothetical 
protein 

116 

37.255 

628 

Z98763 

Schizosaccharomyces pombe hypothetical 
protein 

255 

33.571 

632 

X55777 

Homo sapiens put . ORF 

242 

53.333 

633 

AC004557 

Arabidopsis thaliana F17L21.21 

87 

45.455 

634 

AC004557 

Arabidopsis thaliana F17L21.21 

260 

32.787 

635 

AF044953 

Homo sapiens NADH: ubiquinone 
oxidoreductase PGIV subunit 

217 

67.925 ! 

638 

M32865 

Homo sapiens Ku protein subunit 

150 

100.000 

639 

M32865 

Homo sapiens Ku protein subunit 

742 

94.355 

640 

D89150 

Schizosaccharomyces pombe similar to 
Saccharomyces serevisiae hypothetical 
52.9KD protein in CDC26-YMR31 
intergenic region, SWISS -PROT 
Accession Number P43616 

327 

62.162 

641 

X77689 

Danio rerio translational elongation 
factor- 1 alpha 

2477 

83.550 

643 

X77689 

Danio rerio translational elongation 
factor- 1 alpha 

1018 

81.818 

644 

X77689 

Danio rerio translational elongation 
factor-1 alpha 

1909 

74.074 

645 

AL035259 

Schizosaccharomyces pombe very 
hypothetical protein 

96 

37.037 

648 

U27842 

Xenopus laevis insulin receptor 
subs trate- 1 - like nrnh <=> i r» 

122 

26.891 

650 

X03342 

Homo sapiens rpL32 (aa 1-135) 

353 

100.000 

651 

X03342 

Homo sapiens rpL32 (aa 1-13 5) 

783 

97.015 

652 

J04765 

Homo sapiens osteopontin 

531 

73.874 

653 

J04765 

Homo sapiens osteopontin 

1638 

89.251 

654 

J04765 

Homo sapiens osteopontin 

1578 

85.047 

657 

M99435 ! 

Homo sapiens transducin-like enhancer 

964 

85.870 


urn* 
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protein 



658 

M99435 

Homo sapiens transducin-like enhancer 
protein 

501 

88,750 

660 

D00570 

Mus musculus. open reading frame (251 
AA) 

109 

25.342 

661 

Z81138 

Caenorhabditis elegans predicted using 
Gene finder; cDNA EST EMBL:D65543 comes 
from this gene 

47 

50. 000 

662 

ZB1138 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBI*;D65543 comes 
from this gene 

41 

29.630 

663 

Z81138 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:D65543 comes 
from this gene 

44 

34.211 

664 

Z81138 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:D6554 3 comes 
from this gene 

160 

39.394 

665 

Z81138 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:D65543 comes 
from this gene 

149 

38.384 

666 

L24521 

Homo sapiens transformation- related 
protein 

344 

65.476 

668 

V00488 

Homo sapiens alpha globin 

601 

77.049 

669 

AF162270 

Homo sapiens catalase interacting 
protein 

117 

77.273 

670 

X55777 

Homo sapiens put . ORF 

334 

69.863 

671 

M15386 

Homo sapiens gamma -globin 

897 

93.421 

673 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2658 

90.043 

674 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1520 

69.895 

675 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1555 

84.775 | 

676 

Y14040 

Homo sapiens CASH beta protein 

1297 

94.521 

677 

Y14040 

Homo sapiens CASH beta protein 

1297 

94.521 

678 

Y14040 

Homo sapiens CASH beta protein 

40 

24 .444 

679 

Y14040 

Homo sapiens CASH beta protein 

1376 

99.548 

680 

Y14040 

Homo sapiens CASH beta protein 

1237 - 

98.039 

681 

X81824 

Dictyostelium discoideum putative RNA 
helicase 

42 

46.667 

682 

X81824 

Dictyostelium discoideum putative RNA 
helicase 

60 

39.394 

683 

X81824 

Dictyostelium discoideum putative RNA 
helicase 

110 

42.857 ! 

684 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

154 

62 .791 

685 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2658 

90.043 

686 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2225 

85.300 

689 

AC004537 

Homo sapiens similar to tumor 
suppressor p33INGl; similar to 
AF044076 (PID :g2829208) 

599 

74 .101 

690 

AC004537 

Homo sapiens similar to tumor 
suppressor p33INGl; similar to 

93 9 

93 .902 
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AF044076 (PID:g2829208) 



692 

U13066 

Nicotiana alata arabinogalactan- 
protein precursor 

133 

38.542 

696 

X55777 

Homo sapiens put. ORF 

196 

71.429 

698 

X55777 

Homo sapiens put . ORF 

292 

.59.459 

699 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

. 119 

51.515 

700 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

207 

50. 000 

701 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

178 

53.061 

702 

U55376 

Caenorhabditis elegans F16H11.2 gene 
product 

124 

44.444 

703 

U58658 

Homo sapiens unknown 

148 

45.161 

704 

D00570 

Mus musculus open reading frame (251 
AA) 

150 

22.561 

705 

AL078579 

Arabidopsis thaliana putative proline- 
rich protein 

175 

34.815 

706 

X55777 

Homo sapiens put . ORF 

276 

63.514 

708 

AF048977 

Homo sapiens Ser/Arg-related nuclear 
matrix protein 

632 

73.965 

709 

X55777 

Homo sapiens put. ORF 

277 

64.615 

712 

M26708 

Homo sapiens prothymosin alpha 

152 

96.000 

713 

M26708 

Homo sapiens prothymosin alpha 

374 

62.281 

714 

AL050369 

Homo sapiens hypothetical protein 

3160 

100.000 

715 

AL050369 

Homo sapiens hypothetical protein 

1887 

68.657 

716 

X52164 

Mus musculus Q300 protein (AA 1-77) 

41 

26. 087 

717 

X52164 

Mus musculus Q300 protein (AA 1-77) 

125 

65.217 

718 

X55777 

Homo sapiens put. ORF 

214 

52.941 

720 

U24169 

Homo sapiens JTV-1 

545 

78.333 

721 

U24169 

Homo sapiens JTV-1 

772 

94.030 

723 

AF034636 

Plasmodium falciparum merozoite 
surface protein 1 

120 

33 .784 

726 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2658 

90.043 

727 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2087 

81.276 1 

728 

AJ133034 

Chlamydia pneumoniae outer membrane 
protein 12 

140 

28.333 

730 

M15386 

Homo sapiens gamma -globin 

897 I 

93.421 

731 

M15386 

Homo sapiens gamma -globin 

873 ! 

96.129 

732 

Y15913 

Homo sapiens COL1A1 and PDGFB fusion 
transcript 

95 

34.524 

733 

AF079527 

Mus musculus IER5 

64 

39.130 

734 

AF079527 

Mus musculus IER5 

209 

34.807 

735 

M63279 

Plasmodium falciparum malaria antigen 

107 

56.667 

736 

M17375 

Gallus gallus type XII collagen 

114 

38.028 

tot 

Ml nnr 

Ml / 3 / 5 

Gallus gallus type XII collagen 

114 

36.250 

738 

U79302 

Homo sapiens unknown 

145 | 

31.210 

739 

M15530 

Homo sapiens B-cell growth factor 

135 

76.923 

740 

L24521 

Homo sapiens transformation-related 
protein 

44 

25.000 

741 

L24521 

Homo sapiens transformation-related 
protein 

170 

45.161 : 
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743 

AF170583 

Homo sapiens ets homologous factor 

581 

73.043 

744 

U04609 

Bradyrhizobium elkanii NodK 

53 

30.120 

745 

U04609 

Bradyrhizobium elkanii NodK 

56 

33.333 

746 

U04609 

Bradyrhizobium elkanii NodK 

48 

34.375 

747 

U04609 

Bradyrhizobium elkanii NodK 

123 

25.510 

748 

AJ133273 

Hylobates lar atrophin-1 

115 

36.082 

749 

AJ133273 

Hylobates lar atrophin-1 

81 

29.630 

750 

AJ133273 

Hylobates . lar atrophin-1 

138 

32.143 

753 

AF175738 

Abax parallelepipedus cytochrome c 
oxidase subunit II 

123 

20.440 

754 

L24521 

Homo sapiens transformation-related 
protein 

203 

45.783 

755 

M15530 

Homo sapiens B-cell growth factor 

154 

69.697 

756 

L24521 

Homo sapiens transformation-related 
protein 

196 

46.667 

757 

AF077599 

Homo sapiens hypothetical SBBI03 
protein 

223 

60.656 

758 

AF077599 

Homo sapiens hypothetical SBBI03 
protein 

2142 

99 .369 

759 

Z81490 

Unknown similar to WD domain, G-beta 
repeats (2 domains) ; cDNA EST 
EMBL:T00482 comes from this gene 

1295 

45.187 

760 

281490 

Unknown similar to WD domain, G-beta 
repeats (2 domains) ; cDNA EST 
EMBL:T00482 comes from this gene 

779 

64 .706 

763 

AF045012 

Hepatitis B virus surface antigen 

102 

34 .722 

764 

AF148223 

Plasmodium falciparum merozoite 
surface protein 2 

135 

35.514 

765 

M19419 

Mus musculus proline-rich salivary 
protein 

133 

34 .074 

768 

U79260 

Homo sapiens unknown 

150 

64 .103 

771 

1*24521 

Homo sapiens transformation- related 
protein 

192 

40.244 

772 

L24521 

Homo sapiens transformation- related 
protein 

192 

40.244 

773 

X55777 

Homo sapiens put* ORF 

242 

55.844 

774 

X92485 

Plasmodium vivax pval 

171 

44 .444 

775 

AB032916 

Hylobates klossii dopamine receptor D4 

143 

36.364 " 

777 

X67703 

Drosophila melanogaster Mst84Db 

92 

30.769 

778 

U03470 

Rattus norvegicus ligand for Fas 
antigen 

152 

38.889 

779 

M84443 

Homo sapiens galactokinase 

281 

100. 000 

780 

M84443 

Homo sapiens galactokinase 

324 

69.512 

781 

AB032910 

Hylobates muelleri dopamine receptor 
D4 

146 

32.000 

782 

M15530 

Homo sapiens B-cell growth factor 

211 

83.784 

783 

M15530 

Homo sapiens B-cell growth factor 

207 

58.491 

784 

M15530 

Homo sapiens B-cell growth factor 

240 

8O.00Q 

785 

AF189307 

Homo sapiens unknown 

108 

77.273 

786 

V00488 

Homo sapiens alpha globin 

776 

90.141 

787 

V00488 

Homo sapiens alpha globin 

869 

1O0.000 

788 

M15386 

Homo sapiens gamma -globin 

897 

93.421 

789 

M15386 

Homo sapiens gamma -globin 

515 

82.222 

793 

M15386 

Homo sapiens gamma-globin 

897 

93.421 
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794 

M15386 

Homo sapiens gamma-globin 

492 

86.525 

795 

M15386 

Homo sapiens gamma- globin 

159 

88.462 

796 

M15386 

Homo sapiens gamma -globin 

460 

64.122 

797 

M15386 

Homo sapiens gamma -globin 

896 

92 .763 

798 

M15386 

Homo sapiens gamma- globin 

724 

81.646 

799 

M15386 

Homo sapiens gamma-globin 

669 

81.250 

800 

M15386 

Homo sapiens gamma-globin 

938 

88.889 

801 

M15386 

Homo sapiens gamma-globin 

9S4 

100.000 

802 

X55777 

Homo sapiens put . ORF 

338 

64 .474 

803 

X55777 

Homo sapiens put. ORF 

123 

80.000 j 

. 804 

X55777 

Homo sapiens put. ORP 

351 

63.953 

805 

X55777 

Homo sapiens put . ORF 

304 

63.889 

806 

D87453 

Homo sapiens KIAA02 64 

40 

35.000 

807 

D87453 

Homo sapiens KXAA0264 

172 

23.214 

809 

D64005 

Synechocystis sp. transposase 

111 

43.590 

811 

X01655 

Homo sapiens type III procollagen (aa 
892-1023) 

42 

37.500 

812 

X01655 

Homo sapiens type III procollagen (aa 
892-1023) 

67 

35.714 

813 

X01655 

Homo sapiens type III procollagen (aa 
892-1023) 

135 

35.514 

814 

X55777 

Homo sapiens put. ORF 

311 

59.259 

815 

X55777 

Homo sapiens put . ORF . 

302 

63.636 

816 

X55777 

Homo sapiens put. ORF 

299 

65.333 

818 

X55777 

Homo sapiens put. ORF 

252 

51.190 

819 

X55777 

Homo sapiens put. ORF 

327 

68.919 

820 

X55777 

Homo sapiens put . ORF 

204 

70.732 

821 

X55777 

Homo sapiens put. ORF 

371 

53.509 

822 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

55 

55.556 

823 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

55 | 

55.556 

825 

JO 2 96 3 

Homo sapiens platelet glycoprotein lib 
precursor 

42 

24.000 

826 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

73 

28.148 

827 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

176 

29.902 

828 

M61870 

Homo sapiens Krueppel- related DNA- 
binding protein 

1242 

92.821 

829 

M61870 

Homo sapiens Krueppel -related DNA- 
binding protein 

574 

81.373 

830 

Z71178 

Unknown cDNA EST EMBL:M88812 comes 
from this gene; cDNA EST EMBL:Z14521 
comes from this gene; cDNA 

176 

32.847 

831 

AL031349 

Schizosaccharomyces pombe hypothetical 
protein 

221 

60.000 

832 

AL031349 

Schizosaccharomyces pombe hypothetical 
protein 

169 

58.000 

833 

AL031349 

Schizosaccharomyces pombe hypothetical 
protein 

69 

36.923 

835 

AF062240 

Homo sapiens immunoglobulin heavy 
chain variable region 

550 

84.211 

836 

D31767 

Homo sapiens KIAA0058 

53 

28.947 
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837 

D31767 

Homo sapiens KIAA0058 

328 

97.778 

838 

D31767 

Homo sapiens KIAA0058 

1004 

98.204 

839 

D87666 

Homo sapiens heat shock protein 90 

557 

96.667 

840 . 

D87666 

Homo sapiens heat shock protein 90 

353 

68.387 

842 

L01114 

Klebsiella aerogenes unusual codon 
usage. ORF X 

108 

37.500 

844 

V00488 

Homo sapiens alpha globin 

270 

73.333 

845 

Z93785 

Unknown similar to ATPase; cDNA EST 
BMBL:D27734 comes from this gene; cDNA 
EST EMBli:D27733 comes fr 

369 

48.739- 

ft A £ 
Oft D 


Homo sapiens gamma -globin 

760 

100.000 

0 1/ 

Ml 51 ft C 

Homo sapiens gamma-globin 

666 

76,316 ! 

010 

HI j J ob 

Homo sapiens gamma -globin 

932 

95.395 


Ml CI QC 

Homo sapiens gamma -globin 

613 

98.925 

ft t; n 

O 3 v 

Ml CIRC 

P1JL3 Job 

Homo sapiens gamma-globin 

835 

88.816 

0 3 x 

Ml cibc 

Homo sapiens gamma-globin 

882 

100.000 

ft ^ 9 
O 3 ^ 

Ml C "3 ft £ 
WIS j O t> 

Homo sapiens gamma-globin 

926 

81.283 

ft R 0 
O j j 

Ml ClflC 
1*1X3 J 0 O 

Homo sapiens gamma -globin 

850 

99.225 I 

OC.A 

Ml cior 

Homo sapiens gamma -globin 

672 

63.404 

855 

M15386 

Homo sapiens gamma -globin 

519 

62.092 


M15386 

Homo sapiens gamma-globin 

433 

84.810 

857 

X79238 

Homo sapiens ribosomal protein L3 0 

323 

100.000 

. 858 

X79238 

Homo sapiens ribosomal protein L3 0 

207 

100.000 

860 

V00488 

Homo sapiens alpha globin 

776 

90.141 

861 

V00488 

Homo sapiens alpha globin 

416 

78.226 

863 

AL031670 

Homo sapiens dJ681N20.2 (ferritin, 
light polypeptide -like 1) 

317 

98.039 

864 

AL031670 

Homo sapiens dJ681N20.2 (ferritin, 
light polypeptide-like 1) 

1062 

95 .977 

865 

AF123880 

multiple sclerosis associated 
retrovirus element unknown protein 
U5/1 

150 

69.697 

866 

U09368 

Homo sapiens zinc finger protein 
ZNF140 

1232 

45.498 

867 

U09368 

Homo sapiens zinc finger protein 
ZNF14 0 

296 

62.500 

868 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

115 

47.222 

869 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

166 

75.000 

870 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

118 

76.190 

871. 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

192 

33.987 

872 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

128 

35.185 

875 | 

AC004973 

Homo sapiens carrier protein-like ; 
similar t~r* om ftftft (t>tt\ .r*o<rec7/i \ 

1520 

100.000 

876 

AC004973 

Homo sapiens carrier protein-like; 
similar to Q01888 (PID :g266574*J 

169 

1O0.000 

877 

U29488 

Caenorhabditis elegans No definition 
line found 

455 

42.593 

878 

U29488 

Caenorhabditis elegans No definition 
line found 

551 

45.205 
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881 

AF062385 

Homo sapiens unknown 

925 

82.584 

882 

AF062385 

Homo sapiens unknown 

1117 

81.641 

883 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

510 

73.874 

885 

X14332 

Homo sapiens ventricular myosin light 
chain 2 (AA 1-165) 

661 

85.039 

886 

X14332 

Homo sapiens ventricular myosin light 
chain 2 (AA 1-165) 

781 

83 .140 

887 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

44 

60 .000 

889 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) . 

90 

30.909 

890 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

60 

32 .500 

891 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

62 

32 .353 

893 

U57092 

Homo sapiens Rab30 

44 

30.303 

894 

U57092 

Homo sapiens Rab30 

42 

26.415 

895 

U57092 

Homo sapiens Rab30 

40 

36.364 

896 

U57092 

Homo sapiens Rab3 0 

48 

43 .750 

897 

U57092 

Homo sapiens Rab30 

49 

50 .000 

898 

U57092 

Homo sapiens Rab30 

127 

95-455 

899 

D67035 

Homo sapiens SCP-1 

189 

22.000 ! 

900 

U70214 

Escherichia coli glutamate 5 -kinase 

41 

41.176 

902 

U70214 

Escherichia coli glutamate 5-kinase 

41 

39.286 

903 

U70214 

Escherichia coli glutamate 5-kinase 

914 

100. 000 

904 

U39317 

Homo sapiens UbcHSB 

130 

100.000 

905 

U12089 

Human immunodeficiency virus type 1 
envelope glycoprotein 

41 

40.000 

906 

U12089 

Human immunodeficiency virus type 1 
envelope glycoprotein 

39 

45.455 

907 

U12089 

Human immunodeficiency virus type 1 
envelope glycoprotein 

70 

42.308 

908 

U12089 

Human immunodeficiency virus type 1 
envelope glycoprotein 

46 

21.277 

909 

D49387 

Homo sapiens NADP dependent 
leukotriene b4 12-hydroxydehydrogenase 

281 

77.612 

910 

D49387 

Homo sapiens NADP dependent 
leukotriene b4 12-hydroxydehydrogenase 

682 

85.185 

911 

D49387 

Homo sapiens NADP dependent 
leukotriene b4 12-hydroxydehydrogenase 

1092 

92.708 

912 

D49387 

Homo sapiens NADP dependent 
leukotriene b4 12-hydroxydehydrogenase 

443 

93.151 

913 

S58722 

Homo sapienB X- linked retinopathy 
protein {c-terminal, clone XEH.8c} 

120 

51.613 

914 

U93571. 1 

Homo Bapiens p4 0 

258 

55.422 

915 

U93571 

Homo sapiens p40 

497 

57.554 
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Homo sapiens p4 0 

471 

58.915 

920 

M14549 

Drosophila melanogaster segmentation 
protein PRD 4 

117 

61.905 

921 

AF169035 

Homo sapiens protein kinase 

164 

41.667 

923 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2684 

90.217 

924 

U09823 

Oryctolagus cuniculus elongation 

2658 

90. 043 
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factor 1 alpha 



925 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1021 

100. 000 

927 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2182 

83 . 093 

929 

AF159055 

Homo sapiens leucine zipper-like 
protein 

115 

75 . 000 

930 

L24521 

Homo sapiens transformation-related 
protein 

228 

91.176 

931 

AF121360 

Drosophila melanogaster DNZDHHC/NEW1 
zinc finger protein 11 

59 

26.415 

932 

AF121360 

Drosophila melanogaster DNZDHHC/NEW1 
zinc finger protein 11 

357 

32.692 

934 

J00314 

Homo sapiens beta -tubulin 

640 

95.833 

936 

X16134 

Drosophila melanogaster Abd-B protein 
(AA 1-491) 

160 

29.752 

937 

AL117434 

Homo sapiens hypothetical protein 

161 

96.296 

93 8 

AL117434 

Homo sapiens hypothetical protein 

4186 

100.000 i 

93 9 

AL117434 

Homo sapiens hypothetical protein 

49 

21.168 

94 0 

AL117434 

Homo sapiens hypothetical protein 

698 

99.107 

941 

D45834 

Size marker plasmid pKF33 9 protein 
resposible for tetracycline resistance 

47 

22 .581 

942 

D45834 

Size marker plasmid pKF33 9 protein 
resposible for tetracycline resistance 

561 • 

94 . 624 

943 

U05237 

Homo sapiens FAC1 gene product 

5415 

100.000 ! 

944 

U05237 

Homo sapiens FAC1 gene product 

455 

78.022 

945 

U48248 

Rattus norvegicus protein kinase C- 
binding protein Beta 15; RBCK1 

69 

36.066 

946 

U48248 

Rattus norvegicus protein kinase C- 
binding protein Beta 15; RBCK1 

48 

25.806 

947 

U48248 

Rattus norvegicus protein kinase C- 
binding protein Beta 15; RBCK1 

180 

25.581 

949 

AF151857 

Homo sapiens CGI -99 protein 

778 

92.437 

950 

AF151857 

Homo sapiens CGI -99 protein 

437 

100.000 

951 

AF151857 

Homo sapiens CGI- 99 protein 

1040 

80.000 

952 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

150 

80.769 

953 

X52332 

Homo sapiens- zinc finger protein 10 

129 

53.846 ! 

954 

X52332 

Homo sapiens zinc finger protein 10 

1038 

41.737 

955 

AB022277 

Homo sapiens tubulointerstitial 
nephritis antigen 

4 92 

66 . 087 

956 

AB022277 

Homo sapiens tubulointerstitial 
nephritis antigen 

1999 

68 . 067 

957 

AB022277 

Homo sapiens tubulointerstitial 
nephritis antigen 

3332 

99.580 

960 

X99699 

Homo sapiens XIAP associated factor-1 
(ZAP-1) . 

2204 

9 8 . 738 

961 

X99699 

Homo sapiens XIAP associated factor-1 
(ZAP-1) 

406 

98.214 

962 

AL009147 

Unknown 

/prediction (method: " "genscan" '» , 
version: " "1 . 0" " , score : " "184 . 75" ") ; 
/predict ion= (method: 

158 

41.538 

964 

AL009147 

Unknown 

458 

55.147 


;.«.. O O p 4-3 „, 01 ii? H ft O 




/prediction^ (method : " "genscan" " , 
version : » » 1 . 0 " » , score : » " 184 . 7 5 " » ) ; 
/prediction^ (method: 



965 

AXiQU9147 

Unknown 

/predictions (method : " "genscan" '• , 
version: " "1 . 0"", score : " "184 . 75" " ) ; 

/picUILUlvli 0 ( IIIC LilUU • 


z ± . y / o 

966 

AL009147 

Unknown 

/prediction^ (method : " "genscan" " , 
vprnion* " "1 0"" score* ""184 75" 11 ) • 

/prediction^ (method: 

747 

39.031 

967 

L34291 

Pisum sativum ribulose-1,5 
N-methyl transferase 

45 

25.000 . 

9fi ft 

/DO 

JU -J A J7 X 

D"i Riim nahi viim bin! nap — 1 ^ 

bisDhosnhate carboxvlase larae subunit 
N-methyl transf erase • 

53 

29 .268 

969 

L34291 

Pisum sativum ribulose-1,5 
bisphosphate carboxylase large subunit 
N-methyl transferase 

159 

27.184 

971 

AF031903 

Mus musculus ADP-ribosylation-like 
factor homolog ARL6 

366 

93.651 

974 

U79260 

Homo sapiens unknown 

273 

61.333 

979 

X78925 

Homo sapiens zinc finger protein 

173 

71.429 

980 

X78925 

Homo sapiens zinc finger protein 

162 

60.870 

982 

US9415 

Mus musculus elongation factor 2 

1488 

95.367 

983 

U89415 

Mus musculus elongation factor 2 

134 

95.455 

984 

AF151824 

Homo sapiens CGI- 66 protein 

43 

44.000 

985 

AF151824 

Homo sapiens CGI- 66 protein 

190 

66 . 038 

986 

U58658 

Homo sapiens unknown 

170 

68 . 571 

988 

Z78542 

Unknown cDNA EST EMBL:D72182 comes 
from this gene; cDNA EST EMBL:D72353 
comes from this gene; cDNA 

149 

29 . 000 

989 

X67703 

Drosophila melanogaster Mst84Dd 

97 

38.095 

991 

U44731 

Mus musculus purine nucleotide binding 
protein 

576 

76.271 

992 

U70063 

Homo sapiens acid ceramidase 

54 

20.000 

993 

U70063 

Homo sapiens acid ceramidase 

47 

42.857 

994 

U70063 

Homo sapiens acid ceramidase 

50 

26.923 

995 

U70063 

Homo sapiens acid ceramidase 

44 

21.212 

996 ! 

U70063 

Homo sapiens acid ceramidase 

47 

33.333 

997 

U70063 

Homo sapiens acid ceramidase 

740 

84.496 

999 

U70063 

Homo sapiens acid ceramidase 

2260 

94.710 

1000 

Z99260 

Schizosaccharomyces pombe putative 
microsomal dipeptidase precursor 

45 

23.762 

1001 

299260 ] 

Schizosaccharomyces pombe putative 
microsomal dipeptidase precursor 

171 

44.068 

1002 

M68864 

Homo sapiens QRF 

152 

47.826 

1004 

S79410 

Mus sp. nuclear localization signals 
(NLS) -binding protein=spot-l 

139 

85.714 

1005 

Z37166 

Homo sapiens nuclear RNA helicase 
(DEAD family) 

716 

90.769 

1006 

Z37166 

Homo sapiens nuclear RNA helicase 
(DEAD family) 

445 

100.000 

1007 

Z37166 

Homo sapiens nuclear RNA helicase 

2459 

93.533 
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(DEAD family) 



.1011 

AF016448 

Caenorhabditis elegans No definition 
line found 

48 

26. 190 

1012 

AF016448 

Caenorhabditis elegans No definition 
line found 

901 

35.561 

1013 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

112 

73. 684 

1017 

D79988 

Homo sapiens There are three putative 
hydrophobic domains in the central 
region. 

586 

100 .000 

1018 

D79988 

Homo sapiens There are three putative 
hydrophobic domains in the central 
region . 

169 

81.250 

1019 

AL132896 

Caenorhabditis elegans predicted using 
Genef inder; preliminary prediction 

192 

28.148 

1021 

U82303 

Homo sapiens unknown 

109 

80.952 

1022 

M10119 

Homo sapiens ferritin light subunit 

353 

62.295 

1023 

M10119 

Homo sapiens ferritin light subunit 

1080 

98. 857 

1024 

U79260 

Homo sapiens unknown 

170 

56. 000 

1025 

AF125462 

Caenorhabditis elegans No definition 
line found 

138 

38. 710 

1026 

AF125462 

Caenorhabditis elegans No definition 
line found 

107 

40.476 

1027 

AF125462 

Caenorhabditis elegans No definition 
line found 

88 

45.238 

1028 

U89439 

Bos taurus ubiqui tin- like protein 

578 

98.851 

1029 

U89439 

Bos taurus ubiquitin-like protein 

539 

86.316 

1030 

U89439 

Bos taurus ubiquitin-like protein 

381 

80.412 

1031 

AF084530 

Homo sapiens cyclin-D binding Myb-like 
protein 

218 

94.444 

1032 

X55777 

Homo sapiens put . ORF 

335 

66.667 

1033 

Y17454 

Homo sapiens LSFR1 protein 

585 

100.000 

1035 

Y17454 

Homo sapiens LSFR1 protein 

602 

100. 000 ! 

1036 

AF107406 

Homo sapiens GW12 8 

124 

73.913 

1037 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

241 

60.938 

103 9 

U93564 

Homo sapiens putative pl50 j 

1882 

84.330 

1040 

U93564 

Homo sapiens putative pl50 

2284 

93.931 

1041 

AF186469 

Rattus norvegicus TM6P1 

391 

54.422 

1042 

AF186469 

Rattus norvegicus TM6P1 

1482 

90.336 

1043 

AF186469 

Rattus norvegicus TM6P1 

1052 

91.975 

1044 

AL117629 | 

Homo sapiens hypothetical protein 

1253 

100. 000 

1045 

AL117629 ! 

Homo sapiens hypothetical protein 

771 

73.016 

1046 

AL117629 

Homo Bapiens hypothetical protein 

471 

53.191 

1048 

AF025664 

Bos taurus Na-Ca+K exchanger 

157 

71.429 

1049 

Z34278 

Homo sapiens mucin 

144 

29.126 

1050 

AF151831 

Homo sapiens CGI- 73 protein 

413 

33.696 

1052 

U4 3 5 8 5 

Mus musculus protein kinase related to 
Raf protein kinases; Method: 
conceptual translation supplied by 
author 

348 

76.623 

1053 

U43585 

Mus musculus protein kinase related to 
Raf protein kinases; Method: 
conceptual translation supplied by 

351 

74.684 
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author 



1055 

AF054180 

Homo sapiens hematopoietic cell 
derived zinc finger protein 

JVC 

OA ftn/ 1 

1056 

AF054180 

Homo sapiens hematopoietic cell 
derived zinc finger protein 

4 9 

37. 143 

1057 

AF0541BO 

Homo, sapiens hematopoietic cell 
derived zinc finger protein 

49 


1058 

AF05418.0 

Homo sapiens hematopoietic cell 
derived zinc finger protein 

498 

75 . 000 

1059 

AF054180 

Homo sapiens hematopoietic cell 
derived zinc finger protein 

193 

78.788 

1060 

U43701 1 

Homo sapiens ribosomal protein L23a 

315 

84.483 

1061 

AL035309 

Homo sapiens hypothetical protein 

152 

88.462 

1062 

M83248 

Homo sapiens nephropontin 

1111 

92.670 

1063 

M83248 

Homo sapiens nephropontin 

1072 

100 . 000 

1064 

J04765 

Homo sapiens osteopontin 

1414 

84 .516 

1065 

J04765 

Homo sapiens OBteopontin 

1627 

84 . 424 

1066 

U05812 

Herpetomonas muse arum 3 ' -end 

56 

70.000 

1067 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

69 

7f\ coo 

1068 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

235 

1 *> CA Q 

1069 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

250 

74 ci n 

1070 

AF114430 

Homo sapiens phosphatidylcholine 
transfer protein 

198 

90.000 

1071 

AF114430 

Homo sapiens phosphatidylcholine 
transfer protein 

1405 

100.000 

1072 

AF151794 

Phascolarctos cinereus pol protein 

628 

49.495 

1088 

AF151794 

Phascolarctos cinereus pol protein 

523 

~j yj • oil 

1105 

AF151794 

Phascolarctos cinereus pol protein 

1214 

4 1 7fit? 

1106 

U53366 

Oncorhynchus mykiss terminal 
deoxynucleotidyl transferase 

473 

^* ~* ' ~> ~j j 

1107 

Y14486 

Homo sapiens cytosolic serine 
hydroxymethyltransf erase 

1926 

97 . 068 

1109 

Y14486 

Homo sapiens cytosolic serine 
hydroxymethyltransf erase 

1882 

98.714 

1110 

L36342 

Morone saxatilis beta-actin ~^ 

253 

92.105 

1114 

AB024366 

TT virus ORF1 

113 

57 . 895 

1115 

AB024366 

TT virus ORF1 

107 

64.000 

1116 

AB024366 

TT virus ORF1 

35 


1117 

AB024366 

TT virus ORF1 

109 

64.000 

1118 

AL022157 

Homo sapiens SPIN (SPINDLIN HOMOLOG 
(PROTEIN DXF34) ) 

153 

^ 714 

1119 

AL022157 

Homo sapiens SPIN . (SPINDLIN HOMOLOG 
(PROTEIN DXF34)) 

1291 

/ / • JOD 

1120 

AL022157 

Homo sapiens SPIN (SPINDLIN HOMOLOG 
(PROTEIN DXF34) ) 

274 

72.000 

1148 

AF003151 

Caenorhabditis elegans No definition 
line found 

46 

54.545 

1149 

AF003151 

Caenorhabditis elegans No definition 
line found 

156 

29.508 

1151 

U96721 

Homo sapiens alternative Hermansky- 
Pudlak syndrome associated protein 

245 

62.500 . 
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1152 

U96721 

Homo sapiens alternative Hermansky- 

614 

98.958 



Pudlak syndrome associated protein 


- 

1153 

U93571 

Homo sapiens p40 

647 

68.667 

1154 

U93571 

Homo sapiens p40 

250 

55.263 

1155 

U93571 

Homo sapiens p40 

194 

61. Ill 

1156 

U93571 

Homo sapiens p40 

1054 

92. 000 

1157 

U93571 

Homo sapiens p40 

43 

31. 818 

1158 

U93571 

Homo sapiens p40 

770 

71. 667 

1160 

D14531 . 

Homo sapiens 'human homologue of rat 

910 

78 . 646 



ribosomal protein L9 1 



1161 

D14531 

Homo Bapiens 'human homologue of rat 

442 

65.366 



ribosomal protein L9 1 



1163 

AC002535 

Arabidopsis thaliana putative G-beta- 

60 

47.059 



repeat containing protein, 5 1 partial 



1164 

AC002535 

Arabidopsis thaliana putative G-beta- 

44 

24.211 



repeat containing protein, 5* partial 



1165 

AC002535 

Arabidopsis thaliana putative G-beta- 

409 

36.744 



repeat containing protein, 5 1 partial 



1166 

X55777 

Homo sapiens put . ORF 

195 

47.674 

1167 

X55777 

Homo sapiens put. ORF 

315 

63 .095 

1168 

X55777 

Homo sapiens put, ORF 

57 

47.619 

1169 

X55777 

Homo sapiens put . ORF 

276 

66.234 

1170 

X55777 

Homo sapiens put. ORF 

383 

64.444 

1171 

X55777 

Homo sapiens put. ORF 

310 

58 . 537 

1172 

D37786 

Felis catus Max 

135 

100 . 000 

1173 

U34973 

Mus musculus protein tyrosine 

240 

65 000 

\J ~J . \I \J V 



phosphatase -like 



1174 

U34973 

Mus musculus protein tyrosine 

458 

47 . 826 



phosphatase -like 



1175 

AL032657 

Unknown predicted using Genefinder; 

262 

58 .571 



similar to DnaJ domain ; Thioredoxin; 





cDNA EST yk43 3f3.5 come 



1176 

AL032657 

Unknown predicted using Genefinder; - 

183 

41.791 



similar to DnaJ domain ; Thioredoxin; 





cDNA EST yk433f3.5 come 



1177 

AF140227 

Streptococcus pyogenes M protein 

120 

28.125 

1178 

AF140227 

Streptococcus pyogenes M protein 

14 5 

30.612 

1180 

X55777 

Homo sapiens put. ORF 

159 

53.846 

1181. 

X99099 

Medicago sativa auxin induced proline 

94 

40.541 



rich protein 



1182 

D89667 

Homo sapiens c-myc binding protein 

191 

80.556 

1183 

D89667 

Homo sapiens c-myc binding protein 

734 

93.671 

1184 

U79260 

Homo sapiens unknown 

146 

52.941 

1185 

D49382 | 

Mus musculus septin 

260 

75.439 

1186 

D49382 

Mus musculus septin 

1911 

87.366 

1187 

S58722 

Homo sapiens X-linked retinopathy j 

138 

45.614 



protein {C-terminal, clone XEH.8c} 



1189 

AL080206 

Homo sapiens hypothetical, protein 

128 

74.074 i 

1191 

AC006234 

Arabidopsis thaliana unknown protein 

141 

24 .476 

1192 

X99518 

Saimiriine herpesvirus 2 StpCl^ 

111 

51 .163 

1193 

S58722 

Homo sapiens X-linked retinopathy 

109 

59.259 



protein {C-terminal, clone XEH.8c} 



1194 

AF125053 

Homo sapiens pyruvate dehydrogenase El 

700 

99 .074 



alpha subunit 
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1195 

AF125053 

Homo saniena ovruvatp HphvHrnnpnp rp» ri 
alpha subunit 

i n 
ui 

DC 71 J 

OD . / 

1196 

AF125053 

Homo RanienH nvnivafp H a Vi vH map n a a p 171 

alpha subunit 

821 

/ . 03/ 

1197 

Z34286 

Orvctolaous cuniculua ORP miahh pxtpnd 
further in 5 1 -direction 

917 

. /o o 

1198 

Z34286 

Oryctolagus cuniculuB ORF might extend 
further in 5' -direction 

1420 

*7 A *7 K.A 

1199 

Z34286 

Oryctolagus cuni cuius ORF might extend 
further in 5' -direction 

168 

*7 1 n cl i 

1201 

Z26248 

Homo sapiens eosinophil granule major 
basic pre-proprotein 

45 

J D • _> O *k 

1202 

Z26248 

Homo sapiens eosinophil granule major 
basic pre-proprotein 

45 

~> ^> m J _> D 

1203 

Z26248 

Homo sapiens eosinophil granule major 
basic pre-proprotein 

1423 

99 .507 

1204 

Z26248 

Homo sapiens eosinophil granule major 
basic pre-proprotein 

1310 

99.457 

1205 

Z26248 

Homo sapiens eosinophil granule major 
basic pre-proprotein 

584 

67.647 

1206 

AF070664 

Homo sapiens HSPC008 

681 

74.648 

1207 

AF070664 

Homo sapiens HSPC008 

584 

69. 853 

1208 

AF070664 

Homo sapiens HSPC008 

182 

100.000 

1209 

AF070664 

Homo sapiens HSPC008 

1610 

94 .479 

1210 

A14829 

Homo sapiens preproapolipoprotein 

601 

61.644 

1211 

V00488 

Homo sapiens alpha globin 

905 

i no nnn 

1212 

V00488 

Homo sapiens alpha globin 

321 

/ -J . O J 3 

1213 

AF128406 

Homo sapiens prenyl -dependent prelamin 
A binding protein Narf 

690 

24 . 878 

1214 

AF128406 

Homo sapiens prenyl -dependent prelamin 
A binding protein Narf 

551 

95 . 604 

1215 

AF128406 

Homo sapiens prenyl -dependent prelamin 
A binding protein Narf 

1663 

80.819 

1216 

AL110239 

Homo sapiens hypothetical protein 

470 

98.507 

1217 

AL110239 

Homo sapiens hypothetical protein 

510 

100.000 

1218 

AL110239 

Homo sapiens hypothetical protein 

157 

96.552 

1219 

AL110239 

Homo sapiens hypothetical protein 

881 

71.233 

1220 

AL110239 

Homo sapiens hypothetical protein 

44 • 

35.000 

1221 

AL110239 

Homo sapiens hypothetical protein 

727 

72.691 

1222 

AJ242797 

Bos taurus hemoglobin alpha chain | 

759 

86.620 

1223 

AJ242797 

Bos taurus hemoglobin alpha chain 

551 

75.410 

1224 

AJ242797 

Bos taurus hemoglobin alpha chain [ 

278 

66.071 

1225 

J00044 

Capra hircus alpha- ii globin 

797 

85.915 

1226 

J00044 

Capra hircus alpha-ii globin 

348 

66.923 

1228 

AL117237 

Homo sapiens hypothetical protein 

822 

74 .586 

1230 

AL117237 

Homo sapiens hypothetical protein 

198 

96.667 

1232 

AL117237 

Homo sapiens hypothetical protein 

105 

100. 000 

1233 

AL117237 

Homo sapiens hypothetical protein 

1592 

89.085 

1236 

AL117237 

Homo sapiens hypothetical protein 

1122 

54.722 | 

1237 

AL117237 

Homo sapiens hypothetical protein 

707 

45 .161 

1239 

AL117237 

Homo sapiens hypothetical protein 

218 

64 .286 

1240 

AL117237 

Homo sapiens hypothetical protein 

2069 

81.546 

1241 

AL117237 

Homo sapiens hypothetical protein 

1907 

64 .151 

1242 

AL117237 

Homo sapiens hypothetical protein 

4888 

80.164 
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1243 

AL117237 

Homo sapiens hypothetical protein 

5468 

94.673 

1244 

U27112 

Homo sapiens HT-1080 protein 

67 

71.429 

1245 

U27112 

Homo sapiens HT-1080 protein 

1649 

86.550 

1246 

X13794 

Homo sapiens lactate dehydrogenase B 

542 

67.361 

1247 

X13794 

Homo sapiens lactate dehydrogenase B 

907 

75.783 

1248 

AJ225898 

Oryctolagus cuniculus translationally 
controlled tumor protein 

349 

67.630 

1249 

M60618 

Homo sapiens nuclear autoantigen 

218 

41.379 

1250 

M60618 

Homo sapiens nuclear autoantigen 

263 

50.943 

1251 

M60618 

Homo sapiens nuclear autoantigen 

328 

75.641 

1252 

M60618 

Homo sapiens nuclear autoantigen 

3195 

99 . 582 

1253 

M60618 

Homo sapiens nuclear autoantigen 

2121 

82 . 258 

1254 

X06749 

Gallus gallus metallthionein (AA 1 - 
63) 

321 

59.211 

1255 

X06749 

Gallus gallus metallthionein (AA 1 - 
63) 

263 

71.795 

1256 

X06749 

Gallus gallus metallthionein (AA 1 - 
63) 

147 

60.526 

1258 

X05289 

Papio hamadryas anuhis alpha -1 globin 

848 

92.254 

1259 

X05289 

Papio hamadryas anubis alpha-1 globin 

307 

74.603 

1260 

A78803 

unidentified unnamed protein product 

359 

74.242 

1261 

M28583 

Homo sapiens MHC class II DR-beta 

1539 

80.519 

1262 

M28583 

Homo sapiens MHC class II DR-beta 

1445 

87.640 

1263 

M28583 

Homo sapiens MHC class II DR-beta 

1373 

92.135 

1264 

A78803 

unidentified unnamed protein product 

286 

68. 939 

1265 

X73460 

Homo sapiens ribosomal protein L3 

1209 

80. 972 

1266 

X73460 

Homo sapiens ribosomal protein L3 

1909 

79. 167 

1267 

X73460 

Homo sapiens ribosomal protein L3 

1710 

83.019 

1268 

X73460 

Homo sapiens ribosomal protein L3 

2265 

87.841 

1269 

X73460 

Homo sapiens ribosomal protein L3 

1882 

88.321 

1270 

X05289 

Papio hamadryas anubis alpha-1 globin 

848 

92.254 

1271 

X05289 

Papio hamadryas anubis alpha-1 globin 

265 

64.655 

1272 

X51706 

Rattus rattus ribosomal protein L9 (AA 
1-192) 

295 

54.500 

1274 

J03048 

Homo sapiens hemopexin precursor 

1484 

72.967 

1275 

M74220 

Homo sapiens plasminogen 

4801 

95.007 | 

1276 

M74220 

Homo sapiens plasminogen 

3096 

74.590 ! 

1277 

M16961 

Homo sapiens alpha- 2 -HS -glycoprotein 

826 

68.812 

1278 

M16961 

Homo sapiens alpha- 2 -HS -glycoprotein 

979 

80 . 628 

1279 

M16961 

Homo sapiens alpha- 2 -HS -glycoprotein 

1425 

77.344 

1280 

X52977 

Xenopus laevis elongation factor 1- 
alpha (454 AA) 

2420 

82 . 119 

1281 

X52977 

Xenopus laevis elongation factor 1- 
alpha (454 AA) 

1386 

62 . 827 

1283 

X52977 

Xenopus laevis elongation factor 1- 
alpha (454 AA) 

1782 

74.586 

1284 

X52977 

Xenopus laevis elongation factor 1- 
alpha (454 AA) 

2433 

81.236 

1285 

X52977 

Xenopus laevis elongation factor 1- 
alpha (454 AA) 

1461 

68.790 

1286 

M15386 

Homo sapiens gamma -globin 

214 

60.927 

1287 

Z83110 

Unknown cDNA EST yk472b5.3 comes from 
this gene; cDNA EST yk474a7.3 comes 
from this gene; cDNA EST 

218 

38.462 


io-m 
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12 88 

Z83110 

v/iiAiiv/ wax i_/i.v/-i x?o x y / ^xjz? v .3 com" o x. X7wm 
this oene : cDNA EST vk4 74a7 ^ mm&a 

from this gene; cDNA EST 

IXC 

X4 J> 

41 • 509 

1?89 

Z8311Q 

Unknown cDNA EST yk472b5.3 comes from 
this gene; cDNA EST yk474a7.3 comes 
from this gene; cDNA EST . 

211 

29.932 

1290 

AJ243310 

Homo sapiens C14orf3 protein 

103 

28.889 

1291 

AJ243310 

Homo sapiens C14orf3 protein 

528 

30.667 

1292 

AJ243310 

Homo sapiens, C14orf 3 protein 

87 


1293 

AJ243310 

Homo sapiens C14orf3 protein 

312 

33 .535 

1295 

AF078819 

SDalax ehrenberai hiah mnlri 1 i t-\/ nmnn 
protein 

"J7Q 

i Art n rt n 
1UU . uuu 

1296 

AF078819 

Scalane ehrenbera i hiah mnh "i 1 ■? t-vr nrmm 
protein 

ft ft Q 
D O 27 

Q 1 O 1 O 

o X . oXo 

1297 

X53777 

Homo saoiens outativp T i hnflnma 1 
protein (AA 1-184) 

27 O 27 

Q *7 ACQ 

1298 

X53777 

Homo sapiens putative ribosomal 
protein (AA 1-184) 

QTC 
^ / J 

qa 'yea 
27*k . zoo 

1299 

X53777 

Homo saoiens "nutative ri hoqom^ 1 
protein (AA 1-184) 

£ n q 

O \J 27 

7Q A O C 

1300 

U4 3 9 1 8 

Mng mil s cuius orolif erahion-ARflftfi ahprf 
protein 1 

1 JO 


1301 

U43918 

Mus musculus proliferation-associated 
protein 1 

1244 

89.671 

1302 

U43918 

Mus musculus proliferation-associated 
protein 1 

1576 

85.354 

1304 

S56599 

Mus sp. AP56=acetaminophen-binding 
protein 

1543 

65.051 

1305 

J03037 

Homo sapiens carbonic anhydrase II 

1298 

87.925 

1307 

AF0132 14 

Rhr t~ sti t*"ii r "i H i p riKnenmal 

phosphoprotein PO 

IX / O 

74 . 744 

1308 

AP013214 

Bos taurus acidic ribosomal 
phosphoprotein PO 

670 

60.185 

-L ~> KJ J 

HT.n^ 1 7*7 A 
rtUu JI / /O 

xiomo sapiens auo4i3^x.2 IPUTATIVE novel 
protein similar to part of APOBEC1 
\riiuiuuiiii jl , rtponpuprotein r> nucNA 
editing protein) ) 

886 

86.747 

1310 

AL03 177 8 

Homo qani *»nn H.T^dR91 f 'DT7T , zi r PT\n? nr»\/ial 

protein similar to part of APOBEC1 
(Phorbolin 1. AooliDODrotein R mRNA 
editing protein) ) 

lOOl 

86 .580 

1311 

M21142 

Homo sapiens guanine nucleotide 1 - j 
binding protein G-s-alpha-2 

1374 

on q q n 

27 \J . 27 O \J 

1312 

M21142 

Homo sapiens guanine nucleotide - 
binding protein G-s-alpha-2 

2108 

98 . 576 

1313 

M21142 

Homo sapiens guanine nucleotide - 
binding protein G-s-alpha-2 

1691 

86 . 047 

1314 

AF151825 

Homo sapiens CGI -67 protein 

659 

50.000 

1315 

AP151825 

Homo sapiens CGI- 67 protein 

732 

62.983 

1316 

S85655 

Homo sapiens prohibitin 

347 

87 .324 

1317 

S85655 

Homo sapiens prohibitin 

829 

79.505 

1318 

L10391 

Oryctolagus cuniculus flavin- 
containing monooxygenase FM03 

54 

58 .333 

1319 

L10391 

Oryctolagus cuniculus flavin- 
containing monooxygenase FM03 

538 

68 .852 


,i, o ens m-3 - 01 s a 6 o a 

/ 


1320 

L10391 

Oryctolagus cuniculus flavin- 
containing monooxygenase FM03 

2830 

74 . 765 

1321 

L10391 

Oryctolagus cuniculus flavin- 
containing monooxygenase FM03 

777 

72 . 222 

1322 

L10391 

Oryctolagus cuniculus flavin- 
containing monooxygenase FM03 

57 

17 fl«?7 

1323 

L10391 

Oryctolagus cuniculus flavin- 
containing monooxygenase FM03 

652 

82.759 

1326 

X99920 

Homo sapiens S100 calcium-binding 
protein A13 (S100A13) 

216 

65 714 

1327 

AF150089 

Homo sapiens small zinc finger- like 
protein 

37 

29 . 167 

1328 

AF150089 

Homo sapiens small zinc finger-like 
protein 

42 

36.000 [ 

1329 

AF150089 

Homo sapiens small zinc finger-like 
protein 

36 

19 . 149 

1330 

AF150089 

Homo sapiens small zinc finger-like 
protein 

46 

25 . 000 

1332 

AF150089 

Homo sapiens small zinc finger-like 
protein 

560 

97 . 619 

1333 

AF150089 

Homo sapiens small zinc finger-like 
protein 

338 

84 . 000 

1334 

D63480 

Homo sapiens The KIAA014 6 gene product 
is novel . 

757 

72.067 

1335 

D63480 

Homo sapiens The KIAA0146 gene product 
is novel. 

4971 

95.022 

1336 

U01317 

Homo sapiens A-gamma globin 

242 

61 .594 

1337 

A78803 

unidentified unnamed protein product 

750 

8 8 . 652 

1338 

A78803 

unidentified unnamed protein product 

313 

73 . 134 

1339 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

222 

50 . 000 

1340 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

717 

4 6.524 

1341 

AB002310 

Homo sapiens KIAA0312 

949 

98.773 

1342 

AB002310 

Homo sapiens KIAA0312 

928 

67.829 

1343 

AB002310 

Homo sapiens KIAA0312 

567 

100.000 

1344 

AB002310 

Homo sapiens KIAA0312 

1677 

87.781 

1345 

AB002310 

Homo sapiens KIAA0312 

4324 

84.217 

1346 

AB002310 

Homo sapiens KIAA0312 

10980 

96.025 

1347 

U47101 

Homo sapiens NifU-like protein 

327 | 

72.868 

1348 

AC011560 

Arabidopsis thaliana hypothetical 
protein 

351 

41.935 

1350 

M11791 

Homo sapiens proapo-A-I protein 

1065 

95 .402 

1351 

M11791 

Homo sapiens proapo-A-I protein 

232 

100.000 

1353 

M11791 

Homo sapiens proapo-A-I protein 

790 

70.175 

1354 

S62928 

Homo sapiens PRB1M precursor 
protein=basic proline-rich proteins 
(Ps, PmF, PmS, and Pe) precursor (Ps 
l=basic proline-rich protein) {c- 
terminal} 

105 

32.394 

1355 

S62928 

Homo sapiens PRB1M precursor 
protein=basic proline-rich proteins 
(Ps, PmF, PmS, and Pe) precursor (Ps 
l=basic proline-rich protein) {c- 
terminal} 

137 

29.605 
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1356 

AJ005562 

Mus musculus SPR2D protein 

134 

43.750 

1357 

X17042 

Homo sapiens hematopoetic proteoglycan 
core protein (AA 1 - 158) 

554 

98. 810 

1358 

X17042 

Homo sapiens hematopoetic proteoglycan 
core protein (AA 1 - 158) 

712 

86.875 

1359 

AJ242910 

Homo sapiens N-Acetylglucosamine 
kinase 

107 

100. 000 

1360 

AJ242910 . 

Homo sapiens N-Acetylglucosamine 
kinase 

623 

68.590 

1361 

AJ242910 . 

Homo sapiens N-Acetylglucosamine 
kinase 

289 

82.812 

1362 

A0T242910 

Homo sapiens N-Acetylglucosamine 
kinase 

65 

100.000 

1363 

AJ242910 

Homo sapiens N-Acetylglucosamine 
kinase 

39 

44.444 

1364 

AJ242910 

Homo sapiens N-Acetylglucosamine 
kinase 

56 

52.941 . 

1365 

AJ242910 

Homo sapiens N-Acetylglucosamine 
kinase 

56 

52.941 

1366 

A.J242910 

Homo sapiens N-Acetylglucosamine 
kinase 

53 

47.059 

1367 

AJ242910 

Homo sapiens N-Acetylglucosamine 
kinase 

1743 . 

91.453 

1368 

AJ003023 

Oryctolagus cuniculus hnRNP-Bl protein 

1272 

78.378 

1369 

X76488 

Homo sapiens sterol esterase 

246 

92.308 | 

1370 

X76488 

Homo sapiens sterol esterase 

2349 

93.000 

1373 

X12525 

Xenopus laevis alpha 3 -act in 

1275 

81.818 

1376 

AF121009 

Mycobacterium tuberculosis H37Rv 
hypothetical protein JV0534 

70 

26.316 

1377 

AF121009 

Mycobacterium tuberculosis H3 7Rv 
hypothetical protein Jv0534 ; 

248 

30.263 

1378 

X70476 

Homo sapiens subunit of coatomer 
complex 

339 

78.161 

1379 

X70476 

Homo sapiens subunit of coatomer 
complex 

5128 

91.338 

1381 

S80864 

Homo sapiens cytochrome c-like 
polypeptide 

39 

37.500 

1383 

S80864 

Homo, sapiens cytochrome c-like 
polypeptide 

254 

84 .444 

1384 

S80864 

Homo sapiens cytochrome c-like 
polypeptide 

135 

82.609 

1387 

S80864 

Homo sapiens cytochrome c-like 
polypeptide 

310 

72.000 

1388 

S80864 

Homo sapiens cytochrome c-like 
polypeptide 

3 97 

60.305 

1389 

S80864 

Homo sapiens cytochrome c-like 
polypeptide 

146 j 

76.316 

1390 

S80864 

Homo sapiens cytochrome c-like i 
polypeptide 

818 

83.077 

1391 

Z69720 

Homo sapiens 3 -methyl -adenine t DNA 
glycosylase 

1177 

82.895 

1392 

X05289 

Papio hamadryas anubis alpha -1 globin 

848 

92.254 

1393 

X05289 

Papio hamadryas anubis alpha- 1 globin 

257 

56.391 | 

1394 

D31767 

Homo sapiens KIAA0 05 8 

739 

98.165 

1395 

D31767 

Homo sapiens KIAA005 8 

704 

89.881 
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13 96 

L10678 

Homo sapiens profilln II 

751 

86,131 

1397 

L10678 

Homo sapiens prof il in II 

752 

100. 000 

1398 

L10678 

Homo sapiens profilin II 

582 

86.331 

1399 

M24842 

Homo sapiens keratin 18 

243 

95.122 

1400 

M24842 

Homo sapiens keratin 18 

1931 

88.950 

1401 

M24842 

Homo sapiens keratin 18 

1334 

78.396 

1402 

U50733 

Homo Bapiens dynamitin 

645 

93 . 578 

1403 

U50733 

Homo sapiens dynamitin 

421 

98.551 

1404 

U50733 

Homo sapiens dynamitin 

654 

93 . 694 

1405 

U50733 

Homo sapiens dynamitin 

2285 

95 .431 

1406 

U50733 

Homo sapiens dynamitin 

252 

97.368 

1407 

U50733 

Homo sapiens dynamitin 

1372 

77.412 

1408 

U19718 

Homo sapiens microf ibril-associated 
glycoprotein 

1067 

100.000 

1409 

U19718 

Homo sapiens microf ibril-associated 
glycoprotein 

837 

90.217 

1410 

Z11694 

Homo sapiens 41kD protein kinase 

236 

97.436 

1411 

Z11694 

Homo sapiens 41kD protein kinase 

41 

21.000 

1412 

Z11694 

Homo sapiens 41kD protein kinase 

1269 

79.784 

1413 

AF047439 

Homo sapiens unknown 

251 

100.000 i 

1414 

AF047439 

Homo sapiens unknown 

1096 

75.719 

1416 

L13635 

Rattus rattus growth response protein 

420 

92.000 

1417 

L13635 

Rattus rattus growth response protein 

791 

99.180 

1418 

L13635 

Rattus rattus growth response protein 

662 

87.582 

1419 

L13635 

Rattus rattus growth response protein 

1267 

68 . 320 

1420 

L13635 

Rattus rattus growth response protein 

989 

72.763 

1421 

AF071491 

Homo sapiens potassium channel 

1893 

99 . 648 

1422 

AF071491 

Homo sapiens potassium channel 

957 

97. 987 

1423 

AF071491 

Homo sapiens potassium channel 

54 35 


1424 

V00488 

Homo sapiens alpha globin 

328 

75.676 

1426 

D12618 

Mus musculus nucleosome assembly 
protein-1 

777 
til 

J 3 . O 

1427 

D12618 

Mus musculus nucleosome assembly 
protein-1 

277 

d / . o y <, 

1428 

D12618 

Mus musculus nucleosome assembly 
protein-1 

277 

O / . OiJZ 

1429 

D12618 

Mus musculus nucleosome assembly 
protein-1 

788 

96. 183 

1430 

D12618 

Mus musculus nucleosome assembly 
protein-1 

45 

26.389 

1431 

D12618 

Mus musculus nucleosome assembly 
protein-1 

1871 

88.101 

1432 

M25660 

Homo sapiens beta-hemoglobin 

200 

70.175 

1433 

X54101 

Homo sapiens NKG5 product 

495 

100.000 

1434 

X54101 

Homo sapiens NKG5 product 

150 

100.000 

1435 

X54101 

Homo sapiens NKG5 product 

402 

77.108 | 

1436 

X54101 

Homo sapiens NKG5 product 

526 1 

87.023 j 

1437 

A78803 

unidentified unnamed protein product 

918 

100.000 

1438 

A78803 

unidentified unnamed protein product 

336 

71.429 

1439 

AF000381 

Homo sapiens non- functional folate 
binding protein 

57 

20.588 

1440 

AF000381 

Homo sapiens non- functional folate 
binding protein 

57 

28.889 

1441 

AF000381 

Homo sapiens non-f unctional folate 

1042 

84.942 
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binding protein 



1442 

L76200 

Homo sapiens guanylate kinase 

38 

35.000 

1443 

1*76200 

Homo sapiens guanylate kinase 

208 

78.049 

1444 

1/76200 

Homo sapiens guanylate kinase 

511 

67.431 

1445 

1*38941 

Homo sapiens ribosomal protein L34 

242 

71.717 

1447 

AF108225 

Monodelphis domes tica immunoglobulin 
alpha heavy chain precursor 

1084 

42.938 

1449 

X00318 

Homo sapiens apoferritin H chain 

42 

19.540 

1450 

X00318 

Homo sapiens apoferritin H chain 

467 

71.220 

1451 

M15386 

Homo sapiens gamma-globin 

897 

93 .421 

1452 

M15386 

Homo sapiens gamma-globin 

403 

77.019 

1453 

X61123 

Homo sapiens BTG1 

962 

67.589 

1454 

X61123 

Homo sapiens BTG1 

425 

65.193 

1455 

U39412 

Homo sapiens alpha SNAP 

36 

66.667 

1456 

U39412 

Homo sapiens alpha SNAP 

267 

100.000 

1457 

U39412 

Homo sapiens alpha SNAP 

400 

100. 000 

1458 

U39412 

Homo sapiens alpha SNAP 

42 

39.130 

1459 

U39412 

Homo sapiens alpha SNAP 

871 

74.671 

1460 

AB012918 

Canis familiaris Catalase 

2807 

80.000 

1462 

AB012918 

Canis familiaris Catalase 

1758 

71.619 

1463 

A78803 

unidentified unnamed protein product 

918 

100.000 

1464 . 

A78803 

unidentified unnamed protein product 

341 

67.969 

1465 

D88010 

Homo sapiens ribosomal protein S13 

386 . 

83 .562 

1466 

M23613 

Homo sapiens nucleophosmin 

534 

59.639 

1467 

M23613 

Homo sapiens nucleophosmin 

1029 

88.780 

1468 

M23613 

Homo sapiens nucleophosmin 

877 

100.000 

1469 

M23613 

Homo sapiens nucleophosmin 

789 

85.517 

1470 

M23613 

Homo sapiens nucle opho smi n 

1174 

78 .710 

1471 

M14567 

Dasyurus viverrinus alpha globin 

306 

58.974 

1472 

D86850 

Homo sapiens HADHB 

2742 

100.000 

1473 

D86850 

Homo sapiens HADHB 

1877 

78.537 

1474 

D86850 

Homo sapiens HADHB 

173 

76.190 

1475 

D86850 

Homo sapiens HADHB 

48 

30.189 

1476 

D86850 

Homo sapiens HADHB 

260 

100.000 

1477 

D86850 

Homo sapiens HADHB 

1909 

80.042 

1478 

S73591 

Homo sapiens VDUPlol,25- 
dihydroxyvitamin D-3 up-regulated 

758 

98.214 

1479 

S73591 

Homo sapiens VDUP1=1,25- 
dihydroxyvitamin D-3 up- regulated 

1074 

94.022 . 

1480 

S73591 

Homo sapiens VDUP1=*1,25- 
dihydroxyvitamin D-3 up -regulated 

1664 

87.121 

1482 

M60854 

Homo sapiens RPS16 

338 

53.472 

1483 

M60854 

Homo sapiens RPS16 

367 

5 6.173 | 

1484 

M60854 

Homo sapiens RPS16 

237 

67.066 

1486 

M17885 

Homo sapiens acidic ribosomal 
phosphoprotein <P0) 

1209 

76.027 

1487 

M17885 

Homo sapiens acidic ribosomal 
phosphoprotein (P0) 

785 

63.205 

1489 

AJ012008 

Homo sapiens RNCC protein 

741 

66.792 

1490 

U08291 

Cercopithecus aethiops prion protein 

1731 

95.918 

1491 

U08291 

Cercopithecus aethiops prion protein 

1614 

90.514 

1492 

X55656 

Homo sapiens gamma -G globin 

958 

97.333 

1493 

X55656 

Homo sapiens gamma -G globin 

256 

62.424 


Ion* 
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1494 

M15386 

Homo sapiens gamma -globin 

275 

65.714 

14 9$ 

AF042746 

Homo sapiens cytochrome c oxidase 
sub unit IV 

676 

79.167 

1496 

AF042746 

Homo sapiens cytochrome c oxidase 
subunit IV 

356 

100 . 000 

1497 

AF042746 . 

Homo sapiens cytochrome c oxidase 
subunit IV 

200 

47 . 239 

1499 

AF010494 

Callithrix sp. sorbitol dehydrogenase 

310 

75.862 

1500 

AF010494 

Callithrix sp. sorbitol dehydrogenase 

1547 . 

82.477 

1501 

AB025357 

Anguilla japonica proliferating cell 
nuclear antigen 

751 

88 . 372 

1502 

AB025357 

Anguilla japonica proliferating cell 
nuclear antigen 

557 

64.207 

1503 

X55656 

Homo sapiens gamma -G globin 

958 

97.333 

1504 

X55656 

Homo sapiens gamma -G globin 

390 

64.072 

1505 

L20688 

Homo sapiens GDP dissociation 
inhibitor 

401 

100.000 

1506 

L20688 

Homo sapiens GDP dissociation 
inhibitor 

744 

78.365 

1507 

K03020 

Homo sapiens phenylalanine hydroxylase 

542 

95.699 

1508 

K03020 

Homo sapiens phenylalanine hydroxylase 

35 

42.105 

1509 

K03020 

Homo sapiens phenylalanine hydroxylase 

995 

90.698 

1510 

K03020 

Homo sapiens phenylalanine hydroxylase 

2149 

87.905 

1511 

AF076191 

Trichosurus vulpecula gamma-actin 

347 

87.302 

1512 

AF076191 

Trichosurus vulpecula gamma-actin 

466 

82.456 

1513 

AF076191 

Trichosurus vulpecula gamma-actin 

1556 

82.429 

1515 

X83218 

Homo sapiens ATP synthase, oligomycin 
sensitivity conferring protein 

492 

67 . 982 

1516 

D29832 

Homo sapiens anti thrombin III (AT- III) 
variant 

2827 

91 . 057 

1517 

D29832 

Homo sapiens antithrombin III (AT- III) 
variant 

1562 

78 . 395 

1518 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

612 

75. 676 

1519 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

394 

56.757 

1520 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

806 

51.937 

1521 

X55656 

Homo sapiens gamma -G globin 

141 

44.253 

1522 

M32723 

Homo sapiens G- gamma -hemoglobin 

298 

87.671 

1523 

M32723 I 

Homo sapiens G- gamma -hemoglobin 

567 

96.809 

1524 

M32723 

Homo sapiens G- gamma -hemoglobin 

113 

50.467 

1525 

AF078848 

Homo sapiens BUP 

677 

67.179 

1526 

AF078848 

Homo sapiens BUP 

791 

72.340 

1527 

AF078848 

Homo sapiens BUP 

293 

95.918 

1528 

AF078848 

Homo sapiens BUP 

532 

77.295 

1529 

D11428 

Homo sapiens PMP-22 (PAS.-II/SR13/Gas-3) 

255 

83.333 


D11428 

Homo sapiens PMP-22 (PAS-II/SR13/Gas-3) 

761 

88.889 

1531 

AF015913 

Homo sapiens SkblHs 

691 

88.496 

1532 

AF015913 

Homo sapiens SkblHs 

926 

7 6.963 

1534 

AF015913 

Homo sapiens SkblHs 

1021 

56.250 

1535 

AF015913 

Homo sapiens SkblHs 

486 

97.101 

1536 

AF015913 

Homo sapiens SkblHs 

890 

90.789 

1537 

AF015913 

Homo sapiens SkblHs 

3255 

89.325 


JL O O M-6 Ni-3 „, O S! iE! i& li e 


1538 

X5 S6 87 

Lycopersicon esculentum extensin 
(class II) 

43 

I 31 - 250 

1539 

X55687 

Lycopersicon esculentum extensin 
(class II) 

95 

30 . 000 

1541 

U94855 

Homo sapiens translation initiation 
factor 3 47 kDa subunit 

1292 

80.688 

1542 

U91318 

Homo sapiens pM5 (3* partial) 

6449 

85.726 

1543 

U91318 

Homo sapiens pM5 (3 1 partial) 

2232 

88 .725 

1544 

U91318 

Homo sapiens pM5 (3* partial) 

246 

70.968 

1545 

U91318 

Homo sapiens pM5 (3 1 partial) 

284 

88.000 

1546 

U91318 

Homo sapiens pM5 (3' partial) 

214 

100. 000 

1547 

U91318 

Homo sapiens pM5 (3» partial) 

1561 

1 87 - 197 

1548 

U91318 

Homo sapiens pM5 (3» partial) 

5972 

| 91.810 

1549 

AJ011915 

Homo sapiens synaptosome associated 
protein of 23 kilodaltons , isoform A 

889 

74 . 408 

1550 

AJ011915 

Homo sapiens synaptosome associated 
protein of 23 kilodaltons, isoform A 

1064 

92.398 

1551 

AJ011915 

Homo sapiens synaptosome associated 
protein of 23 kilodaltons, isoform A 

945 

88.557 

1552 

U77665 

Homo sapiens RNaseP protein P3 0 

497 

98. 684 

1553 

U77665 

Homo sapiens RNaseP protein P3 0 

243 

100.000 

1554 

U77665 

Homo sapiens RNaseP protein P30 

613 

1 100 . 000 

1555 

U77665 

Homo sapiens RNaseP protein P30 

1076 

78.417 

1556 

AF110493 

Homo sapiens hemoglobin gamma -G 

316 

62 . 698 

1557 

U58658 

Homo sapiens unknown 

231 

1 70. 149 

1558 

X74776 

Drosophila melanogaster ribosomal 
protein L19 

658 


1560 

X74776 

Drosophila melanogaster ribosomal 
protein L19 

331 


1562 

M87503 

Homo sapiens I FN -alpha responsive 
transcription factor 

880 

O / • v JL J 

1563 

M87503 

Homo sapiens IFN-alpha responsive 
transcription factor 

826 | 

98.400 

1564 

M87503 

Homo sapiens IFN-alpha responsive 
transcription faqtor 

345 

ft* At* O 

1565 

M87503 

Homo sapiens IFN-alpha responsive 
transcription factor 

55 1 

31.429 

1566 

M87503 

Homo sapiens IFN-alpha responsive 
transcription factor 

57 

39.474 

1567 

M87503 

Homo sapiens IFN-alpha responsive 
transcription factor 

2148 1 


1569 

D16111 

Homo sapiens human homologue of rat 
phosphatidylethanolamine binding 
protein 

48 j 

28.000 

1570 

D16111 

Homo sapiens human homologue of rat 
phosphatidylethanolamine binding 
protein 

536 I 

75 . 978 

1571 

Z83838 

Homo sapiens GTPASE- ACTIVATING PROTEIN 

564 

62.069 

1572 

Z14136 

Homo sapiens spermidine/ spermine Nl- 
acetyl transferase 

396 

39.286 

1573 

Z14136 

Homo sapiens spermidine/ spermine Nl- 
acetyl transferase 

285 

30.769 

1574 

Z14136 

Homo sapiens spermidine/spermine Nl- 
acetyl transferase 

206 

36.872 ! 
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1575 

S79980 

Bos taurus ribosomal protein L37 

540 

97.468 

1576 

S79980 

Bos taurus ribosomal protein L37 

628 

97.849 

1577 

S79980 

Bos taurus ribosomal protein L37 

423 

86.364 

1578 

M15386 

Homo sapiens gamma-globin 

328 

69.939 

1579 

X52882 

Homo sapiens t -complex polypeptide 1 
<AA 1-556) 

50 

72.727 

1580 

X52882 

Homo sapiens t -complex polypeptide 1 
(AA 1-556) 

2908 

94.118 

1581 . 

M23984 

Rattus norvegicus mitochondrial 
phosphate transporter precursor 

595 

74 .497 

1582 

M23984 

Rattus norvegicus mitochondrial 
phosphate transporter precursor 

1206 

72.776 

1583 

M83654 

Homo sapiens cytoplasmic 
phosphotyrosyl protein phosphatase 

390 

61.261 

1584 

M83654 

Homo sapiens cytoplasmic 
phosphotyrosyl protein phosphatase 

1065 

100.000 

1585 

M63654 

Homo sapiens cytoplasmic 
phosphotyrosyl protein phosphatase 

686 

82.738 

1586 

AB007922 

Homo sapiens KIAA0453 protein 

1078 

80.258 

1587 

AB007922 

Homo sapiens KIAA0453 protein 

52 

71.429 

1589 

AB007922 

Homo sapiens KIAA0453 protein 

47 

22.222 j 

1590 

AB007922 

Homo sapiens KIAA04 53 protein 

613 

73.026 

1591 

AB007922 

Homo sapiens KIAA0453 protein 

6686 

98.769 

1593 

X87176 

Homo sapiens 1 7bet a -hydroxys teroid 
dehydrogenase 

300 

80.357 

1594 

X87176 

Homo sapiens 17beta -hydroxys teroid 
dehydrogenase 

120 

95.000 

1595 

X87176 

Homo sapiens 17beta- hydroxys teroid 
dehydrogenase 

3682 

87.071 

1596 

M62839 

Homo sapiens apolipoprotein H 

640 

100. 000 

1597 

M62839 

Homo sapiens apolipoprotein H 

882 

99.180 

1598 

M62839 

Homo sapiens apolipoprotein H 

651 

82.836 

1599 

M62839 

Homo sapiens apolipoprotein H 

903 

64.644 

1600 

J02621 

Homo sapiens high mobility group 
protein 14 

262 

100.000 

1601 

J02621 

Homo sapiens high mobility group 
protein 14 

526 

95.000 

1602 

L08048 

Homo sapiens HMG-1 

931 

86.364 

1603 

M10014 

Homo sapiens fibrinogen gamma chain | 

387 

98.276 

1604 

M10014 

Homo sapiens fibrinogen gamma chain 

753 

96.460 

1605 

M10014 

Homo sapiens fibrinogen gamma chain 

47 

19.802 

1606 

M10014 j 

Homo sapiens fibrinogen gamma chain j 

1687 

77.655 

1607 

Y11435 

Homo Bapiens b4 integrin interactor 

897 

73.059 1 

1608 

Y11435 

Homo sapiens b4 integrin interactor 

527 

61.049 

1610 

S7029Q 

Homo sapiens glutamine synthetase, GS 
{EC 6.3.1.2} 

1802 

88.398 

1611 

AJ005894 

Homo sapiens JM2 7 

384 

80.612 

1612 

D84108 

Homo sapiens RBP-MS/type 2 

1370 

100.000 

1613 

D84108 

Homo sapiens RBP-MS/type 2 

726 

98.230 i 

1614 

D84108 

Homo sapiens RBP-MS/type 2 

980 

86.070 

1615 

D84108 

Homo sapiens RBP-MS/type 2 

946 

92.973 | 

1616 

Z81038 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk488a2.5 comes 
from this gene 

63 

32.500 
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1617 

Z81038 

1 Caenorhabditis elegans predicted using 

Genefinder; cDNA EST yk488a2.5 comes 
I from this gene 

1 201 

ii - 929 

1618 

X55656 

1 Homo sapiens gamma -G globin 

J 413 

70 1 fi 

1619 

U27315 

Mus musculus adenine nucleotide 
1 translocase-1 

369 


1620 

U27315 

1 Mus musculus adenine nucleotide 
1 translocase-l 

696 

61 .589 

1622 

AC003973 

1 Homo sapiens ZNF91L 

1256 

43 . 371 

1623 

AC003973 

Homo sapiens ZNF91L 

3211 

43 .592 

1624 

AL050373 

I Homo sapiens hypothetical protein 

] 744 

8 1 91 5 

1625 

AL050373 

1 Homo sapiens hypothetical protein 

1 2078 

87 . 047 

1626 

X55683 

J Lycopersicon esculentum extensin 
(class I) 

85 

34.783 

1627 

X55683 

I Lycopersicon esculentum extensin 
(class I) 

48 

25.714 

1628 

X55683 

Lycopersicon esculentum extensin 
1 (class I) 

120 

45.652 

1629 

X69151 

J Homo sapiens vacuolar proton- ATPase 

j 1684 

O . A O O 

1630 

L35546 

1 Homo sapiens gamma -glutamyl cysteine 
synthetase light subunit 

51 

17 1 A1 

1631 

L35546 

1 Homo sapiens gamma -glutamyl cysteine 
synthetase light subunit 

1274 

Qfl Q 0£ 

1632 

L35546 

Homo sapiens gamma -glutamyl cysteine 
synthetase light subunit 

1242 

OA AQQ 
O \J • *fc o o 

1634 

AF039698 

Homo sapiens antigen NY-CO- 33 

4175 


1635 

X55777 

Homo sapiens put. ORF 

303 J 

68.966 

1637 

AC004401 

Arabidopsis thaliana putative casein 
kinase II catalytic subunit 

3 7 

A A AAA 

4 4. 444 

1638 

AC004401 

Arabidopsis thaliana putative casein 
kinase II catalytic subunit 

789 

CO CIO 

1639 

AC004401 

Arabidopsis thaliana putative casein 
kinase II catalytic subunit 

1458 

69 . 967 

1640 

AC004401 

Arabidopsis thaliana putative casein 
kinase II catalytic subunit 

44 

39.130 

1641 

AC004401 1 

Arabidopsis thaliana putative casein | 
kinase II catalytic subunit | 

1363 

67.705 

1642 

U58658 

Homo sapiens unknown ! 

271 

65.789 

1643 

M32724 | 

Homo sapiens A-gamma-hemoglobin ! 

815 

100 ooo 

1644 

M32724 j 

Homo sapiens A-gamma-hemoglobin 

202 

54.861 

1645 

X17459 | 

Mus musculus J kappa RS-binding 

protein | 

525 

on fine; 

1647 

X17459 | 

Mus musculus J kappa RS-binding 
protein j 

380 

7fl -a i -> 

/ O i Jlj 

1648 

X17459 | 

Mus musculus J kappa RS-binding 
protein 

974 

7 ^ . O J A 

1649 

X17459 

Mus musculus a kappa RS-binding 

protein j 

1035 

98.851 

1650 

X17459 

Mus musculus J kappa RS-binding 
protein 

892 

95.322 

1651 

X17459 | 

Mus musculus J kappa RS-binding j 
protein 

2483 

83.778 

1652 

X92485 j 

Plasmodium vivax pval i 

175 

50.000 j 

1653 

Z75331 1 

Homo sapiens nuclear protein SA-2 

52 

4 8.000 
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1654 

Z75331 

Homo sapiens nuclear protein SA-2 

7303 

98.883 

1655 

AF126736 

Homo sapiens ubiquitin processing 
protease 

5044 

OA 1 CD 

1656 

AF126736 

Homo sapiens ubiquitin processing 
protease 

46 

nnn 
*o . uuu 

1657 

AF126736 

Homo sapiens ubiquitin processing 
protease 

4893 

95.995 

1658 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

634 

65.217 

1659 

M76979 

Homo sapiens pigment epithelial- 
differentiating factor 

1276 

73.258 

1660 

AF093673 

Cricetulus griseus layilin 

606 

85.437 

1661 

AF093673 

Cricetulus griseus layilin 

496 

69.231 

1662 

AF093673 

Cricetulus griseus layilin 

97 

100.000 

1664 

AF093673 

Cricetulus griseus layilin 

45 

29.545 

1665 

AF093673 

Cricetulus griseus layilin 

41 

27.273 

1666 

AF093673 

Cricetulus griseus layilin 

44 

27.273 

1667 

AF093673 

Cricetulus griseus layilin 

43 

83.333 

1668 

AF093673 

Cricetulus griseus layilin 

51 

27.660 

1669 

AF093673 

Cricetulus griseus layilin 

1928 

81.948 

1670 

AF174599 

Homo sapiens F-box protein Fbxll 

1279 

95.431 

1672 

X80638 

Mus musculus p21RhoC 

545 

61.722 

1673 

AB000407 

Gallus gallus DNA topoisomerase I 

477 

46.396 

1674 

AB000407 

Gallus gallus DNA topoisomerase I 

2736 

70.536 

1675 

L10908 

Mus musculus Gcapl gene product 

83 

47. 059 

1676 

Z48166 

Schizosaccharomyces pombe gar 2 

53 

18.248 

1677 

Z48166 

Schizosaccharomyces pombe gar2 

140 

25.287 

1678 

L24521 

Homo sapiens transformation-related 
protein 

334 

62.766 

1679 

U17332 

Mus musculus ribosomal protein L9 

245 

52.356 

1681 

X92485 

Plasmodium vivax pval 

68 

37.931 

1683 

X92485 

Plasmodium vivax pval 

68 

41.176 

1684 

X92485 

Plasmodium vivax pval 

94 

59.091 

1685 

X92485 

Plasmodium vivax pval 

67 

30.882 

1686 

X92485 

Plasmodium vivax pval 

76 

28.873 

1687 

X92485 

Plasmodium vivax pval 

65 

31.343 

1689 

X92485 

Plasmodium vivax pval 

58 

26. 667 

1690 

X92485 

Plasmodium vivax pval 

284 

47.059 

1691 

X55777 

Homo sapiens put. ORF 

51 

32 . 143 

1692 

X55777 

Homo sapiens put. ORF 

360 

57.426 

1693 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

695 

62 . 842 

1694 

X51754 

Homo sapiens lambda- immunoglobulin 
light chain 

925 

7 7 .366 

1695 

X99699 

Homo sapiens XIAP associated factor-1 
(ZAP-l) 

406 

98.214 

1696 

X99699 

Homo sapiens XIAP associated factor-1 
(ZAP-l) 

2184 

98.738 

1697 

Z47547 

Chondrus crispus unique orf 

79 

38.298 

1698 

AL117608 

Homo sapiens hypothetical protein 

39 

47.059 

1699 

AL117608 

Homo sapiens hypothetical protein 

800 

100.000 

1700 

AL117608 

Homo sapiens hypothetical protein 

344 

84.783 

1701 

AL031910 

Leishmania major hypothetical protein 
L2743.10 

114 

42.857 


lb*0| 
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1702 

X90857 

Homo sapiens -14 . 

717 

84 . 138 

1703 

X90857 

Homo sapiens -14 

861 

98 . 630 

1704 

X90857 

Homo sapiens -14 

957 

82 .979 

1705 

X90857 

Homo sapiens -14 

239 

100. 000 

1706 

X90857 

Homo sapiens -14 

900 

30 . 323 

1707 

X90857 

Homo sapiens -14 

43 

35.484 

1708 

X90857 

Homo sapiens -14 

250 

88.636 

1709 

X90857 

Homo sapiens -14 

188 

80 000 

1710 

X90857 

Homo sapiens -14 

2759 

91 .379 

1711 

X02345 

Pan troglodytes beta-qlobin 

138 - 

54.545 

1712 

X02490 

Homo sapiens longest open reading 
frame 

558 

81 . 132 

1713 

X02490 

Homo sapiens longest open reading 
frame 

389 

62.500 

1714 

X55777 

Homo sapiens put. ORP 

349 

62.921 

1715 

X17576 

Homo sapiens nek protein (AA 1-377) 

2018 

90.237 

1716 

X88799 

Oryza sativa DNA binding protein 

115 

34.409 

1717 

L10908 

Mub musculus Gcapl gene product 

132 

33.846 

1718 

J04973 

Homo sapiens core protein II precursor 

1292 

72.708 

1719 

AB008164 

Homo sapiens ST1C2 

657 

86. 667 

1720 

AB008164 

Homo sapiens ST1C2 

1799 

70.476 

1721 

AB008164 

Homo sapiens ST1C2 

1337 

91.030 

1722 

370290 

Sordaria macrospora alpha- tubulin 

807 

59.000 

1723 

Z70290 

Sordaria macrospora alpha -tubulin 

1594' 

A 1 1 
D^k . J .5 o 

1724 

270290 

Sordaria macrospora alpha -tubulin 

1700 

66.524 

1725 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

J. / o 

5 9. 615 

1726 

U07973 

Gallus gallus alpha- 1 collagen type 
III 

77 

3 t . / / o 

1727 

U07973 

Gallus gallus alpha-1 collagen type 
III 

265 


1728 

L77967 

Ovis aries small proline-rich protein 
with paired repeat 

84 


1729 

U79260 

Homo sapiens unknown 

268 

A A 9n^ 

1730 

U58658 

Homo sapiens unknown 

232 

O / . X. J 

1759 

U53831 

Homo sapiens putative collagen homolog 
protein-b 

58 

46. 154 

1760 

U53831 | 

Homo sapiens putative collagen homolog 
protein-b 

133 

41.270 

1762 

AC0053$6 

Arabidopsis thaliana putative proline- 
rich cell wall protein 

37 

3 7 . 500 

1764 

U14009 

Pyrus communis arabinogalactan-protein [ 

138 

3 5 . 156 

1766 

D50579 

Homo sapiens carboxylest erase 
precursor 

1286 

91 . 038 

1767 

D50579 

Homo sapiens carboxylest erase 
precursor 

2704 

89. 860 

1768 

L33989 

Homo sapiens ORF; putative 

67 

31.081 


T "3 "5 Q Q Q 

Homo sapiens ORF; putative 

113 

32.456 

1770 

AJ243460 

Leishmania major proteophosphoglycan 

72 

23.622 

1771 

AJ243460 

Leishmania major proteophosphoglycan 

58 

26.786 

1772 

AJ243460 

Leishmania major proteophosphoglycan 

178 

26.104 

1773 

AJ243460 

Leishmania major proteophosphoglycan 

178 

26.697 

1774 

U07974 

Gallus gallus unknown 

71 

28 . 846 

1775 

U07974 

Gallus gallus unknown 

100 

30.435 


\o?ov 
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1776 

U07974 

Gallus gallus unknown 

124 

27 . 211 

1777 

K02576 

Homo sapiens salivary proline-rich 
protein 1 

172 

37 . 748 

1778 

X55687 

Lycopersicon esculentum extensin 
(class II) 

87 

31.579 

1779 

M15530 

Homo sapiens B-cell growth factor 

169 

39.167 

1780 

L17318 

Rattus norvegicus proline-rich 
proteoglycan 

185 

30.542 

1782 

Z28396 

Sus scrofa glucosephosphate isomerase 

1444 

83.582 

1783 

Z28396 

Sus scrofa glucosephosphate isomerase 

2091 

78.191 

1785 

M17885 

Homo sapiens acidic ribosomal 
phosphoprotein (P0) 

839 

60.641 

1786 

AC002073 

Homo sapiens Putative gene. Genscan 
predictions confirmed by EST 
splicing.; coded for by human cDNAs 
AA122029 (NID:gl678048) , D31562 

(NID:g644442) , AA158721 

(NID:gl733515) , R59640 (NID :g830335) 
and F13082 (NID:g709111) 

741 

100.000 

1787 

AC002073 

Homo sapiens Putative gene. Genscan 
predictions confirmed by EST 
splicing.; coded for by human cDNAs 
AA122029 (NID:gl678048) , D31562 
(NID:g644442) , AA158721 
(NID:gl733515) , R59640 (NID :g83 0335 ) 
and F13082 (NID : g709111) 

1683 

97.338 

1788 

AC002073 

Homo sapiens Putative gene. Genscan 
predictions confirmed by EST 
splicing.; coded for by human cDNAs 
AA122029 (NID:gl678048) , D31562 
(NID:g644442) , AA158721 
(NID:gl733515) , R59640 (NID : g830335) 
and F13082 (NID :g709111) 

38 

20.312 

1789 

AC002073 

Homo saoiens Putative optip npnaran 
predictions confirmed by EST 
splicing.; coded for by human cDNAs 
AA122029 (NID:gl678048) , D31562 

(NID:g644442) , AA158721 

(NID:gl733515) , R59640 (NID:g830335) 
and F13082 (NID : g709111) 

13/ / 

94 . 697 

1790 

U90446 

Mus musculus RNAse L inhibitor 

3265 

92.257 

1791 

X71129 

Homo sapiens electron transfer 
flavoprotein beta subunit 

465 

73 . 729 

1792 

X71129 

Homo sapiens electron transfer 
flavoprotein beta subunit 

656 

64 . 878 

1793 

X71129 

Homo sapiens electron transfer 
flavoprotein beta subunit 

336 

45.509 

1794 

X71129 

Homo sapiens electron transfer 
flavoprotein beta subunit 

48 

19.588 

1795 

X71129 

Homo sapiens electron transfer . 
flavoprotein beta subunit 

973 

79.921 

1796 

X78992 

Homo sapiens ERF- 2 

2655 

86.032 

1797 

X78992 

Homo sapiens ERF -2 

2815 

94.366 ! 

1798 

AB020724 

Homo sapiens KIAA0917 protein 

447 

75.000 

1800 

AB020724 

Homo sapiens KIAA0917 protein 

2983 

87.500 | 



. G plpi 4-6.H-3 . OB E* i& O 


1801 

D13286 

Homo sapiens glucocerebrosidase 

428 

100 . 000 

1802 

D13286 

Homo sapiens glucocerebrosidase 

892 

98 .561 

1803 

D13286 

Homo Sapiens glucocerebrosidase 

3227 

95.379 

1804 

D13286 

Homo sapiens glucocerebrosidase 

2728 

86 . 989 

1805 

L24521 

Homo sapiens transformation-related 
protein 

376 

62.385 

1806 

AF107406 

Homo sapiens GW128 

42 

38.462 

1807 

AF107406 

Homo sapiens GW128 

46 

83 . 333 

1808 

AF107406 

Homo sapiens GW12 8 

185 

48. 980 

1809 

L22253 

Homo sapiens 9G8 splicing factor 

1200 

57 . 664 

1810 

L22253 

Homo sapiens 9G8 splicing factor 

1203 

90. 909 

1811 

J04765 

Homo sapiens osteopontin 

907 

90 . 476 

1812 

J04765 

Homo sapiens osteopontin 

1090 

81 . 481 

1813 

J04794 

Homo sapiens aldehyde reductase (EC 
1.1.1.2) 

471 

98 . 592 

1814 

J04794 

Homo sapiens aldehyde reductase (EC 
1.1.1.2) 

318 

87.719 

1815 

J04794 

Homo sapiens aldehyde reductase (EC 
1.1.1.2) 

890 

71.637 

1817 

AF151874 

HomO sapiens CGI-116 protein 

177 

93.103 

1818 

AF151874 

Homo sapiens CGI-116 protein 

618 

85.500 

1819 

L10908 

Mus mus cuius Gcapl gene product 

116 


1820 

AF110763 

Homo sapiens skeletal muscle LIM- 
protein 1 

1863 

78.386 

1821 

AF110763 

Homo sapiens skeletal muscle LIM- 
protein 1 

1735 

93.262 

1822 

M69297 

Homo sapiens ORF 3 

224 

3 7 . 121 

1823 

AF088982 

Homo sapiens heat shock protein hsp40- 
3 

1872 

92.045 

1824 

U27460 

Homo sapiens uridine diphosphoglucose 
py ropho sphory 1 a s e 

277 

100 . 000 

1825 

U27460 

Homo sapiens uridine diphosphoglucose 
py ropho sphory 1 a s e 

2122 

85.575 

1826 

X79855 

Homo sapiens troponin T 

341 

72.477 

1827 

L49470 

Mus musculus troponin T fast skeletal 
muscle isoform 

3 01 

7 7 £\1 O 

1828 

L49470 

Mus musculus troponin T fast skeletal 
muscle isoform 

481 

95 . 062 

1829 

L49470 

Mus musculus troponin T fast skeletal 
muscle isoform 

533 

88 . 889 

183 0 

L49470 

Mus musculus troponin T fast skeletal 
muscle isoform 

970 

7 8 . 810 

1831 

A75737 

Homo sapiens unnamed protein product 

1642 

71.498 

1832 

U46756 

Gallus gallus CLE 7 

511 

87.619 

1833 

U46756 i 

Gallus gallus CLE 7 

793 

73 . 626 

1834 

U46756 

Gallus gallus CLE 7 

527 

59.922 

1835 

AF174429 

Arabidopsis thaliana FH protein 
interacting protein FIP2 

206 

34.043 

1836 

AF174429 

Arabidopsis thaliana FH protein 
interacting protein FIP2 

170 

41.250 

1837 

AF174429 

Arabidopsis thaliana FH protein 
interacting protein FIP2 

206 

34.043 

1838 

AF174429 

Arabidopsis thaliana FH protein 
interacting protein FIP2 

430 

31.683 
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1839 

U60882 

Rattus norvecricus orotein arai r\i n*» N- 
methyltransf erase 

195 

an nnn 

1840 

U60882 

RattuS norveaicus orotein arainin^ KT- 
methyl transferase 

965 

/ \J • / jZ 

1841 

M37984 

Homo sapiens slow twitch 
skeletal/cardiac muscle troponin C 

101 

1 on nnn 

1842 

M37984 

Homo sapiens slow twitch 
skeletal/cardiac muscle troponin C 

414 


1843 

AL049483 

Arabidopsis thaliana putative leucine- 
rich-repeat protein 

34 9 

A (ZA 

1844 

M14156 

Homo sapiens insulin- like growth 
factor IB 

421 

63 . 303 

1845 

M14156 

Homo sapiens insulin- like growth 
factor IB 

599 

71 . 223 

1846 

M14156 

Homo sapiens insulin-like growth 
factor IB 

599 

71.223 

1847 

M14156 

Homo sapiens insulin- like growth 
factor IB 

1356 

100 ooo 

184 8 

M14156 

Homo sapiens insulin- like growth 
factor IB 

560 

71 OOfi 

1849 

Z50109 

Unknown similar to 60S ribosomal 
protein; cDNA EST EMBL:T01767 comes 
from this gene; cDNA EST EMBL: 

292 


1850 

Z74167 

Saccharomyces cerevisiae ORF YDIill9c 

102 

37.736 

1852 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

304 

62 821 

1853 

U01849 

Trypanosoma brucei ORF2 

148 

27 .559 

1854 

L19783 

Homo sapiens GPI-H 

806 

82.927 

1855 

L19783 

Homo sapiens GPI-H 1 

309 

97 .77 8 

1856 

L19783 

Homo sapiens GPI-H i 

841 

84.694 

1857 

X13482 

Homo sapiens U2 snRNP- specific A* 
protein (AA 1-255) 


/ j . jjj 

1858 

X13482 

Homo sapiens U2 snRNP- specif ic A' 
protein (AA 1-255) 

827 

83 . 52 9 

1859 

X13482 

Homo sapiens U2 snRNP -specific A» 
protein (AA 1-255) 

107 

100 . 000 

1860 

X13482 

Homo sapiens U2 snRNP- specific A* 
protein (AA 1-255) 

1028 

82.101 

1861 

D21235 

Homo sapiens HHR23A protein 

586 

90.000 

1862 

D21235 

Homo sapiens HHR23A protein 

634 

97.115 

1863 

D21235 

Homo sapiens HHR23A protein 

1167 

78.364 

1864 

AB028956 

Homo sapiens KIAA1033 protein 

629 

91.129 

1865 

AB028956 

Homo sapiens KIAA1033 protein 

1170 

80.658 | 

1866 

AB028956 

Homo sapiens KIAA1033 protein 

276 

79.630 

1867 

AB028956 

Homo sapiens KIAA1033 protein 

175 

90.909 

1868 

AB028956 

Homo sapiens KIAA1033 protein 

889 

86.624 

1869 

AB028956 

Homo sapiens KIAA1033 protein \ 

7118 

96.669 

1871 

L01131 

Homo sapiens decor in 

849 

91.083 

1872 

L01131 

Homo sapiens decorin 

1017 

55.710 

1873 

L01131 

Homo sapiens decorin * \ 

1699 

92.113 

1874 

L01131 

Homo sapiens decorin 

1206 

78.706 

1875 

U50545 

Cavia porcellus cytosolic beta- 
glucosidase 

2621 

80.810 

1876 

AF038953 

Homo sapiens E25 protein 

949 

77.978 
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1877 

U79260 

Homo sapiens unknown 

189 

49.485 

1878 

AOT005559 

Mus musculus SPR2A protein 

113 

29.762 

1880 

X71355 

Homo sapiens Sodium- Phosphate 
Transport System 1 

2564 

91.755 

1881 

AJ223980 

Homo sapiens BCL7C 

305 . 

70.588 

1882 

AJ223980 

Homo sapiens BCL7C 

978 

85.068 

1883 

M15530 

Homo sapiens B-cell growth factor 

186 

45.570 

1884 

X66285 

Mus musculus HC1 ORF 

157 

33.846 

1885 

AF053233 

Homo sapiens endobrevin 

330 

94.643 

1886 

AF053233 

Homo sapiens endobrevin 

348 

92.424 

1887 

AF053233 

Homo sapiens endobrevin 

373 

100. 000 

1888 

AP053233 

Homo sapiens endobrevin 

334 

84.615 

1889 

AF053233 

Homo sapiens endobrevin 

329 

79.048 

1890 

D21820 

Lilium longiflorum ORF 

172 

37.681 i 

1893 

AF071081 

Mycobacterium tuberculosis proline- 
rich mucin homolog 

353 

29.579 

1894 

X71980 

Torpedo marmorata alpha- tubulin 

1010 

89.362 j 

1895 

X71980 

Torpedo marmorata alpha- tubulin 

1689 

84.096 

1896 

M11147 

Homo sapiens ferritin light chain 

316 

65.241 

1897 

M20472 

Homo sapiens clathrin light-chain A 

525 

78.761 

1898 

M20472 

Homo sapiens clathrin light -chain A 

374 

98.276 

1899 

M20472 

Homo sapiens clathrin light -chain A 

861 

74.000 

1900 

M20472 

Homo sapiens clathrin light -chain A 

919 

77,533 

1901 

D14531 

Homo sapiens 'human homologue of rat 
ribosomal protein L9 1 . 

865 

78.836 

1902 

D14531 

Homo sapiens 'human homologue of rat 
ribosomal protein L9 1 

371 

63 .473 

1903 

U79260 

Homo sapiens unknown 

376 

66.337 i 

1905 

X97276 

Sparus aurata, metallothionein A 

86 

43.750 

1906 

AC006592 

Arabidopsis thai i ana unknown protein 

134 

33.333 

1907 

AF115850 

Homo sapiens PAR protein 

523 

87.640 

1908 

AF115850 

Homo sapiens PAR protein 

53 

25.600 

1909 

AF115850 | 

Homo sapiens PAR protein 

1418 

100.000 

1910 

AF115850 

Homo sapiens PAR protein 

212 

89.583 

1911 

AF115850 

Homo sapiens PAR protein 

756 

68.966 

1912 

D00682 

Homo sapiens cofilin 

414 

68.605 

1913 

M20372 

Homo sapiens ornithine decarboxylase 
(EC 4.1.1.17) 

1307 

81.766 

1914 

X92485 

Plasmodium vivax pval | 

359 

62.000 

1915 

AF026689 

Homo sapiens prostate-specific 
transglutaminase 

98 

55.000 

1916 

X69392 

Homo sapiens ribosomal protein L26 

314 ! 

64.706 

1917 

L48937 

Homo sapiens ptp-IVlb, PTP-IV1 gene 
product 

923 

83 .234 

1918 

Ii48937 

Homo sapiens ptp-IVlb, PTP-IV1 gene 
product 

406 

98.413 

1919 

L4 8937 

Homo sapiens ptp-IVlb, PTP-IV1 gene 
product 

41 

20.370 

1920 

L48937 

Homo sapiens ptp-IVlb, PTP-IVlgene 
product 

580 

71.910 

1921 

Z81029 

Unknown Similarity to S.pombe 
hypothetical protein C1D4.09C 
(SW:Q10154); cDNA EST EMBL:T00543 
comes 

281 

34.343 


low* 


1. 0 O m 1+3 . O H c! i& O k£ 


1922 

AL021366 

Homo sapiens CICK0721Q.5 (polypeptide 
from patented cDNA EMBL : E06811) 

298 

ioo.ooo ! 

V w * W W V 

1924 

AL021366 

Homo sapiens CICK0721Q.5 (polypeptide 
from patented cDNA EMBL:E06811) 

668 

86.466 

1925 

AL021366 

Homo sapiens CICK0721Q.5 (polypeptide 
from patented cDNA EMBL:E06811) 

407 

100. 000 

1927 

AL021366 

Homo sapiens CICK0721Q.5 (polypeptide 
from patented cDNA EMBL : E06811) 

348 

67. 857 

1926 

M15530 

Homo sapiens B-cell growth factor 

166 

69.048 

1929 

X55777. 

Homo sapiens put . ORP 

324 

56.989 

1930 

Z32554 

Cricetulus griseus epsilon-COP; ldlPp 

970 

64.961 

1931 

Z32554 

Cricetulus griseus epsilon-COP; ldlFp 

485 

54.974 

1932 

Z32554 

Cricetulus griseus epsilon-COP; ldlFp 

693 

65.741 

1934 

X65550 

Homo sapiens antigen of the monoclonal 
antibody Ki-67 

21240 

99.908 

1935 

X65550 

Homo sapiens antigen of the monoclonal 
antibody Ki-67 

17404 

8.946 

1936 

U97553 

murine herpesvirus 68 unknown 

12 0 

37.931 

1937 

AB030905 

Homo sapiens Heterochromatin protein 1 
gamma 

1157 

99.422 

1938 

AB0309O5 

Homo sapiens Heterochromatin protein 1 
gamma 

42 

25 . 926 

1939 

AB030905 

Homo sapiens Heterochromatin protein 1 
gamma 

36 

24 . 324 

1940 

AB030905 

Homo sapiens Heterochromatin protein 1 
gamma 

297 

97 . 826 

1941 

AB030905 

Homo sapiens Heterochromatin protein 1 
gamma 

44 

35. 714 

1942 

AB030905 

Homo sapiens Heterochromatin protein 1 
gamma 

542 

79. 775 

1943 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

237 

66 . 667 

1944 

D88378 

Homo sapiens proteasome inhibitor 
hPI31 subunit 

756 

81. 818 

1945 

D88378 

Homo sapiens proteasome inhibitor 
hPI31 subunit 

1277 

83.154 

1946 

Y07566 

Homo sapiens RIT (Ric-related gene 
expressed in many tissues) 

744 

77.729 

1947 

M15530 

Homo sapiens B-cell growth factor 

49 

39.130 

1948 

M15530 

Homo sapiens B-cell growth factor 

152 

68.421 

1949 

U79260 

Homo sapiens unknown 

207 

37.398 

1950 

AF119297 

Homo sapiens neuroendocrine-specif ic 
protein-like protein 1 

327 

96.078 

1952 

AF119297 

Homo sapiens neuroendocrine -specific 
protein-like protein 1 

486 

78.846 

1953 

AF119297 

Homo sapiens neuroendocrine- specif ic 
protein- like protein 1 

1074 

87.866 

1954 

M92296 

Pongo pygmaeus gamma -1 globin 

219 

59.712 

1955 

AB020981 

Homo sapiens cyclin B2 

254 

100.000 

1956 

AB020981 

Homo sapiens cyclin B2 

638 

98.039 

1957 

AB020981 

Homo sapiens cyclin B2 

1643 

81.509 

1958 

D90861 

Escherichia coli similar to 

141 

36.905 

1959 

D90861 

Escherichia coli similar to 

65 

30.357 

1960 

D90861 

Escherichia coli similar to 

321 

34.528 
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1961 

D90861 

Escherichia coli similar to 

196 

42.241 

1962 

M64716 

Homo sapiens ribosomal protein 

587 

97.872 

1-963 

M64716 

Homo sapiens ribosomal protein 

126 

38.060 

1964 

AC004382 

Homo sapiens Unknown gene product 

947 . 

98.052 

1965 

AC004382 

Homo sapiens Unknown gene product 

834 

68 . 376 

1966 

AC004382 

Homo sapiens Unknown gene product 

1653 

81.013 

1967 

X54941 

Homo sapiens Cksl protein homologue 

304 

62 . 393 

1968 

X56932 

Homo sapiens 23 kD highly basic 
protein 

469 

75.829 

1969 

M14820 

Trypanosoma brucei ORF2 bases 1807- 
2850; first start codon at 2032; 
putative 

119 

29.167 

1971 

AL035064 

Schizosaccharomyces pombe queuine 
trna-ribosyl transferase 

1057 

43 . 867 

1972 

S62936 

Homo sapiens PRB1S precursor 
protein=basic proline-rich proteins 
(Ps, PmF, PmS, and Pe) precursor {C- 
terminal} 

222 

28.241 

1973 

M25660 

Homo sapiens beta-hemoglobin 

209 

71.930 

1974 

D26482 

Homo sapiens SCR2 

327 

100.000 

1975 

D28482 

Homo sapiens SCR2 

287 

100.000 

1976 

D2 8482 

Homo sapiens SCR2 

1183 

62 .147 

1977 

D28482 

Homo sapiens SCR2 

1785 

84.412 

1978 

D28482 

Homo sapiens SCR2 

2033. 

76.221 

1979 

U79260 

Homo sapiens unknown 

168 

57.377 

1980 

D16217 

Homo sapiens calpastatin 

621 

98.980 

1981 

D16217 

Homo sapiens calpastatin 

61 

19.780 

1982 

D16217 

Homo sapiens calpastatin 

4082 

93 .723 

1983 

D16217 

Homo sapiens calpastatin 

2485 

94 .444 

1984 

D16217 

Homo sapiens calpastatin 

3586 

93 .287 

1985 

X57801 

Homo sapiens immunoglobulin lambda 
light chain 

538 

54.878 

1986 

AF155114 

Homo sapiens NY-REN- 5 7 antigen 

107 

58.065 

1988 

AF155114 

Homo sapiens NY-REN- 57 antigen 

43 

28.000 

1989 

AF155114 

Homo sapiens NY-REN- 57 antigen 

39 

29.268 

1990 

AF155114 

Homo sapiens NY-REN- 57 antigen 

503 

100.000 

1991 

AF155114 

Homo sapiens NY-REN-57 antigen 

2284 

89.796 

1992 

X79417 

Sus scrofa 40S ribosomal protein S12 

240 

97.222 

1993 

X79417 

Sus scrofa 40S ribosomal protein S12 

147 

70.270 | 

1994 

X79417 

Sus scrofa 4 OS ribosomal protein S12 

263 

68.696 1 

1995 

J04495 

Macaca mulatta alpha-globin 

895 

96.479 

1996 

J04495 

Macaca mulatta alpha-globin 

290 

63 .281 

1997 

AF044958 

Homo sapiens NADH: ubiquinone 
oxidoreductase ASHI subunit 

216 

60.000 

1998 

S85655 

Homo sapiens prohibitin 

1224 

82.288 

1999 

S85655 

Homo sapiens prohibitin 

680 

93 .333 

2000 

AF083929 

Mus musculus ESI 8 

187 

39.216 

2001 

AL023286 

Schizosaccharomyces pombe hypothetical 
protein 

270 

57.333 

2002 

AL023286 

Schizosaccharomyces pombe hypothetical 
protein 

259 

43.094 

2003 

AF051137 

Monodelphis domestica ribosomal 
protein S4 Y isoform 

676 

68.966 

2004 

AF051137 

Monodelphis domestica ribosomal 

675 

67.491 
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protein S4 Y isoform 



2005 

L05092 

Homo sapiens ribosomal protein Lll 

756 

77.914 

2006 

L05092 

Homo sapiens ribosomal protein Lll 

38 

66.667 

2007 

L05092 

Homo sapiens ribosomal protein Lll 

320 

100 . 000 

2008 

L05092 

Homo sapiens ribosomal protein Lll 

406 

62.694 

2009 

M15530 

Homo sapiens B-cell growth factor 

205 

57 .407 

2010 

L41558 

Rattus rattus rBBl gene product 

122 

58.333 

2011 

AF043183 

Homo sapiens T cell receptor beta 
chain 

245 

66 . 667 

2012 

AF043183 

Homo sapiens T cell receptor beta 
chain 

1196 

94 . 924 

2013 

AF043183 

Homo sapiens T cell receptor beta 
chain 

1772 

92 . 038 

2014 

D83489 

Tokudaia osimensis part of Zfy 
containing 11 zinc finger motifs 

46 

44.444 

2015 

D83489 

Tokudaia osimensis part of Zfy 
containing 11 zinc finger motifs 

192 

25.263 

2016 

AF047659 

Caenorhabditis elegans No definition 
line found 

331 

47.748 

2017 

AF047659 

Caenorhabditis elegans No definition 
line found 

875 

42 . 998 

2018 

X73974 

Homo sapiens ribosomal protein L4 

784 

93.130 

2019 

X73974 

Homo sapiens ribosomal protein L4 

1155 

77.852 

2020 

AF085693 

Rattus norvegicus G protein-coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 


o o Ada 

2021 

AF085693 

Rattus norvegicus G protein-coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 


38 - 039 

2022 

AF085693 

Rattus norvegicus G protein-coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 

ft ^ ^ 

O J J 

q -> oca 

2023 

AF085693 

Rattus norvegicus G protein-coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 

61 

j \j . \j \j \j 

2024 

AF085693 

Rattus norvegicus G protein-coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 



2026 

AF085693 

Rattus norvegicus G protein-coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 

66 

X7.JDD 

2027 

AF085693 

Rattus norvegicus G protein- coupled 
receptor kinase-associated ADP 
ribosylation factor GTPase-activating 
protein 

2624 

87.928 

2028 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

366 

40.175 

2029 

M15530 | 

Homo sapiens B-cell growth factor 

197 

43.689 

2030 

S47413 

Arabidopsis thaliana, C24, Peptide 

43 

3V.500 


100^4643 




Partial, 112 aa glycine-rich protein,. 
atGRP {clone atGRP-4) 



2031 

S47413 

Arabidopsis thaliana, C24, Peptide 
Partial, 112 aa glycine-rich protein, 
atGRP {clone atGRP-4}. 

111 

31.776 

2032 

U83176 

Mus musculus ROSA26AS 

43 

40.000 

2033 

U83176 

Mus musculus ROSA2 6 AS 

52 

22.727 

2034 

U83176 

Mus musculus ROSA2 6AS 

2049 

82.500 

2035 

U83176 

Mus musculus ROSA26AS 

1952 

79.419 

2036 

U41654 

Homo sapiens adenovirus E3-14.7K 
interacting protein 1 

925 

75.767 

2037 

X91911 

Homo sapienB rtvp-1 

992 

86.131 

2038 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

147 

68 .571 

2039 

AF010144 

Homo sapiens neuronal thread protein 
AP7C-NTP 

61 

29.412 

2040 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

61 

23 . 932 

2041 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

61 

36.000 

2042 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

42 

30.612 

2043 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

74 

31.783 

2044 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

397 

32.500 

2045 

X61109 

Gorilla gorilla beta-globin 

501 

80.952 

2046 

X87212 

Homo sapiens cathepsin C 

503 

98.667 

2047 

X87212 

Homo sapiens cathepsin C 

1052 

95.152 

2048 

X87212 

Homo sapiens cathepsin C 

734 

73.885 

2049 

X87212 

Homo sapiens cathepsin C 

1746 

87.703 

2051 

M74090 

Homo sapiens TB2 

212 

96.970 

2052 

M74090 

Homo sapiens TB2 

683 

70.388 

2054 

M19419 

Mus musculus proline-rich salivary 
protein 

105 

32.394 

2055 

M19419 

Mus musculus proline-rich salivary 
protein 

105 

37.778 

2056 

M19419 

Mus musculus proline-rich salivary 
protein 

141 

29.167 

2057 

M19419 

Mus musculus proline-rich salivary 
protein 

141 

29.050 

2058 

M19419 

Mus musculus proline-rich salivary 
protein 

165 

35.417 

2059 

Ali050019 

Homo sapiens hypothetical protein 

733 

65.455 

2060 

AL050019 

Homo sapiens hypothetical protein 

4704 

98. 143 

2061 

X15606 

Homo sapiens ICAM-2 preprotein (AA -21 

623 

72.759 

2062 

X73541 

Saccharomyces cerevisiae- ORF YKR412 

165 

21.394 

2063 

AB021288 

Homo sapiens beta 2 -microglobulin 

539 

85.833 

2064 

M69043 

Homo sapiens MAD3 

310 

95.918 

2065 

M69043 

Homo sapiens MAD3 

606 

50 .730 

2066 

M69043 

Homo sapiens MAD3 

1164 

75.301 

2068 

U46690 

Mus musculus ATP -dependent RNA 
helicase 

663 

90.909 


I0SIO 
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I 2069 

U46690 

1 Mus imisculus ATP-dependent RNA 
1 helicase 

342 

1 DO . oo / 

2070 

■ ; 

U46690 

1 Mus musculus ATP-dependent RNA 
helicase 

1134 

1 63 fiqn 

2071 

D80010 

Homo sapiens KIAA0188 

621 

89.381 

2072 

D80010 

Homo sapiens KIAA0188 

51 

37.931 

2073 

D80010 

1 Homo sapiens KIAA0188 

5542 

1 9S 

2074 

L20868 

1 Homo sapiens ribosomal protein L4 

784 

93.130 

2075 

L20868 

1 Homo sapiens ribosomal protein L4 

1206 

1 7rt 7T) 1 

2076 

L28677 

1 Tetrahymena pyriformis 0RF3 

80 


2077 

AF151876 

1 Homo sapiens CGI-118 protein 

414 

97.333 

2078 

AF151876 

Homo sapiens CGI-118 protein 

751 

\ / 4Z • O OD | 

2079. 

X02490 

1 Homo sapiens longest open reading 
frame 

313 

55.689 | 

2080 

L24521 

1 Homo sapiens transformation-related 
protein 

262 

I 67.857 | 

2081 

D23661 

j Homo sapiens ribosomal protein L37 

363 

78. 652 | 

2082 

AF060228 

1 Homo sapiens retinoic acid receptor 
j responder 3 


1 76 . 471 1 

2083 

AF043181 

1 Homo sapiens T cell receptor beta 
chain 

A 1 & 

1 38 . 095 1 

2084 

AF043181 

1 Homo sapiens T cell receptor beta 
chain 

729 

88.000 

2085 

AF043181 

1 Homo sapiens T cell receptor beta 
chain 

1755 


2086 

U30221 

Crithidia fasciculata NADH 
dehydrogenase subunit 5 

105 

29.630 

2088 

X94335 

Saccharomyces cerevisiae YOR3160w 

38 

**7 t;nn 

2089 
2090 
2092 

X94335 | 
X94335 j 
X94335 ! 

Saccharomyces cerevisiae YOR3160w 
Saccharomyces cerevisiae YOR3160w 
Saccharomyces cerevisiae YOR3160w 

49 
49 
267 

28.571 | 
60.000 

2093 
I 2094 
2095 

X94335 
X94335 
AP031079 

Saccharomyces cerevisiae YOR3160w 
Saccharomyces cerevisiae YOR3160w 
Camptotheca acuminata isopentenyl 
diphosphate isomerase I 

61 j 
658 

197 I 

23.762 ! 

48.819 

^ y . 23 1 \ 

2096 

AF031079 

Camptotheca acuminata isopentenyl 
diphosphate isomerase I 

147 

4,1.121 J 

2097 

2098 
2099 

AF031079 

U80744 1 
U80744 ; 

Camptotheca acuminata isopentenyl 
diphosphate isomerase I 
Homo sapiens CTG4a 
Homo sapiens CTG4a 

607 I 
403 

796 \ 

en nnn 1 

7C R49 1 

68.421 j 

2100 
2101 
2102 

U80744 
U80744 
AB004316 

Homo sapiens CTG4a 
Homo sapiens CTG4a 

Bos taurus mitochondrial methionyl- 
tRNA transformylase 

921 1 
701 

1 Q1 K i 
1 

luU. UUU | 

80.690 j 
8 3. 967 [ 

2103 

i "■> i r\ a 

D50918 j 

Homo sapiens The KIAA0128 gene is 
related to cdclO. 

407 

96.970 j 

1 2104 

D50918 

Homo sapiens The KIAA0128 gene is 
related to cdclO. 

682 ; 

98.230 J 

2105 

D50918 

Homo sapiens The KIAA0128 gene* is 
related to cdclO . 

222 j 

100.000 I 

2106 
1 2107 

D50918 [ 
050918 

Homo sapiens The KIAA0128 gene is 
related to cdclO . 

Homo sapiens The KIAA0128 gene is | 

1973 j? 
2413 _L 

94.277 j 
95.305 ! 
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2108 

AJ011916 

Homo sapiens hypothetical protein 

1170 

79.098 

2109 

AiJ011916 

Homo sapiens hypothetical protein 

901 

97.887 

2110 

AJ011916 

Homo sapiens hypothetical protein 

1179 

90.909 

2111 

U28917 

Mus musculus 60S ribosomal protein 

548 

67.453 

2112 

X64707 

Homo sapiens BBC1 

644 

69.091 

2114 

AF098798 

Homo sapiens unknown 

363 

62.931 

2115 

AF098798 

Homo sapiens unknown 

987 

99.342 

2116 

AF098798 

Homo sapiens unknown 

701 

61.395 

2117 

AF098798 

Homo sapiens unknown 

469 

98.551 

2118 

AF098798 

Homo sapiens unknown 

304 

92.453 

2119 

AF098798 

Homo sapiens unknown 

1780 

88.301 

2120 

M32865 

Homo sapiens Ku protein subunit 

402 

96.970 

2121 

M32865 

Homo sapiens Ku protein subunit 

1747 

59. 744 

2122 

X92485 

Plasmodium vivax pval 

284 

42.857 

2123 

AF172066 

Homo sapiens retinoic acid repressible 
protein 

886 

85.484 

2124 

U10414 

Caenorhabdi t is elegans Contains 
similarity to Pfam domain: PF00005 
(ABC_tran) , Score=245.2, E-value«3e- 
70, N=2 

326 

60 526 i 

2 125 

U10414 

rapnorh^hdi t" i a eleaans Cnnta ins 

similarity to Pfam domain: PF00005 
(ABC__tran) , Score=245.2, E-value=3e- 
70, N=2 

433 

*x W • £. £. 

2126 

U10414 

Caenorhabditis elegans Contains 
similarity to Pfam domain: PF00005 
(ABC_tran) , Score=245.2, E-value=3e- 
70, N=2 

1865 

48.378 

2128 

AF071317 

Mus musculus COP9 complex subunit 7b 

1318 

72.836 

2129 

AC002388 

Arabidopsis thaliana hypothetical 
protein, 5* partial 

54 

53.333 

2130 

AC002388 

Arabidopsis thaliana hypothetical 
protein, 5» partial 

177 

37.415 

2131 

X69556 

Bos taurus cpnlO protein 

418 

100.000 

2132 

X69556 

Bos taurus cpnlO protein 

418 

1O0.000 

2133 

X69556 

Bos taurus cpnlO protein 

303 

97.826 

2134 

X69556 

Bos taurus cpnlO protein 

569 

78.571 

2135 

X69556 

Bos taurus cpnlO protein 

207 

63.514 

2137 

U62673 

Mus musculus histone H2a(B)-613 

427 

77.612 

2138 

X94335 

Saccharomyces cerevisiae YOR3513C 

38 

22.857 

2139 

X94335 

Saccharomyces cerevisiae YOR3513C 

35 

25.000 

2140 

X94335 

Saccharomyces cerevisiae YOR3513C 

56 

28.205 

2141 

X94335 

Saccharomyces cerevisiae YOR3513C 

530 

57.297 

2142 

U05313 

Trypanosoma brucei CR3 

84 

37.143 

2143 

X73434 

Ovis aries KAP5 . 4 keratin protein 

97 

40.698 j 

2145- 

X73434 

Ovis aries KAP5.4 keratin protein 

201 

32.022 

2146 

X73434 

Ovis aries KAP5.4 keratin protein 

197 

36.571 

2147 

X91809 

Homo sapiens GAIP 

1312 

93.839 

2148 

AF045244 

Klebsiella pneumoniae ribitol kinase 

213 

36.567 

2149 

AF045244 

Klebsiella pneumoniae ribitol kinase 

314 

47.000 

2150 

AF045244 

Klebsiella pneumoniae ribitol kinase 

175 

71.875 

2151 

AF045244 

Klebsiella pneumoniae ribitol kinase 

154 

34.021 

2152 

AF045244 

Klebsiella pneumoniae ribitol kinase 

1248 

44.358 



2153 

Z71261 

Caenorhabditis elegans Similarity to 
Bovine protein kinase C inhibitor 1 
(SW:IPK1 BOVIN) 

350 

40. 523 

2154 

Z71261 

Caenorhabditis elegans Similarity to 
Bovine protein kinase C inhibitor 1 
(SW;IPK1 BOVIN) 

215 

44.048 

2155 

X15081 

Crithidia fasciculata MURF2 protein 
(AA 1-348) 

197 

25.714 

2157 

U15174 

Homo sapiens BCL2/ adenovirus E1B 19kD- 
interacting protein 3 

969 

86.500 

2158 

AJ007418 

Homo sapiens phenol sulf ©transferase 

623 

94.898 

2159 

AJ007418 

Homo sapiens phenol sulfotransf erase 

604 

100.000 

2160 

AJ007418 

Homo sapiens phenol sulfotransf erase 

822 

97.600 

2161 

AJ007418 

Homo sapiens phenol sulfotransf erase 

331 

60.638 

2162 

AJ007418 . 

Homo sapiens phenol sulfotransf erase 

1095 

82.237 

2163 

Y00144 

Lycopersicon esculentum biotin-binding 
protein 

131 

36.923 

2164 

Y00144 

Lycopersicon esculentum biotin-binding 
protein 

122 

45.283 

2166 

U4 3 7 0 1 

Homo sapiens ribosomal protein L23a 

297 

80.702 | 

2167 

U43701 

Homo sapiens ribosomal protein L23a 

513 

62 .238 

2168 

U43701 

Homo sapiens ribosomal protein L23a 

436 

100. 000 

2169 

U43701 

Homo sapiens ribosomal protein L23a 

425 

88.095 ! 

2170 

U43701 

Homo sapiens ribosomal protein L23a 

640 

78.981 

2171 

AF172095 

Picea rubens unknown 

154 

27.206 

2172 

AF172095 

Picea rubens unknown 

68 

27.119 

2173 

AF172095 

Picea rubens unknown 

147 

28.947 

2175 

AF115345 

Homo sapiens calcium-regulated heat 
stable protein CRHSP-24 

456 

71.875 

2176 

Z30423 

Caenorhabditis elegans T20G5.9 

51 

25.000 

2177 

Z30423 

Caenorhabditis elegans T20G5.9 

136 

28. 169 

2178 

X54162 

Homo sapiens 64 Kd autoantigen 

152 

31.522 

2179 

X54162 

Homo sapiens 64 Kd autoantigen 

151 

32.967 

2180 

X54162 

Homo sapiens 64 Kd autoantigen 

95 

44.118 

2181 

X54162 

Homo sapiens 64 Kd autoantigen 

952 

38.447 

2182 

AB007941 

Homo sapiens KIAA0472 protein 

221 

100.000 

2183 

AB007941 

Homo sapiens KIAA0472 protein 

2296 

97.820 

2184 

M16715 

Gallus gallus collagen type IX 

104 

40.541 

2185 

X83703 

Homo sapiens nuclear protein 

1686 

90. 123 

2186 

AL132880 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

90 

41.667 

2187 

AL132880 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

46 

26.190 

2189 

AL132880 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

167 

27.083 

2190 

AF039568 

Homo sapiens vesicle trafficking 
protein 

641 

67.836 


Ar 03 95 68 

Homo sapiens vesicle" trafficking 
protein 

1304 

89.167 

2192 

X79444 

Homo sapiens endonuclease 

1884 

88.288 

2193 

X79444 

Homo sapiens endonuclease 

466 

84.211 

2194 

X79444 

Homo sapiens endonuclease 

1758 

9V .315 

2195 

AF000412 

Plasmodium berghei merozoite surface 
protein- 1 

151 

36.364 


10? l?j 
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2196 

AC004955 

Homo sapiens supported by ESTs T61992 
(NID :a665235) and W26450 
(NID:gl307167) and Genscan 

693 

87.500 

2197 

AC004955 

Homo sapiens supported by ESTs T61992 
(NID:g665235) and W26450 
(NID:gl307167) and Genscan 

159 

100 ooo 

^ \I \J . WWW 

2198 

AC004955 

Homo sapiens supported by ESTs T61992 
(NID:g665235) and W26450 
(NID:gl307167) and Genscan 

3159 

84.992 

2199 

X92485 

Plasmodium vivax pval 

50 

43.333 

2200 

X92485 

Plasmodium vivax pval 

174 

42 . 708 

2202 

AF103731 

Homo sapiens putative glycolipid 
transfer protein 

2529 

100.000 

2203 

AF103731 

Homo sapiens putative glycolipid 
transfer protein 

852 

± \J » O J J 

2204 

Z75550 

Unknown limited similarity with some 
myosins; cDNA EST EMBL:C084 02 comes 
from this gene; cDNA EST E 

58 

22.222 

2205 

Z75550 

Unknown limited similarity with some 
myosins : CDNA EST EMBIj*C0 84 02 comes 
from this, gene; cDNA EST B 

58 

22.222 

2206 

Z75550 

Unknown limited similarity with some 
myosins; cDNA EST EMBL:C08402 comes 
from this gene; cDNA EST E 

53 

34.615 

2207 

Z75550 

Unknown limited similarity with some 
myosins; cDNA EST EMBL:C08402 comes 
from this gene; cDNA EST E 

374 

on diQ 

2208 

AJ243938 

Homo sapiens G18.2 protein 

111 

33.333 

2209 

AF053368 

Mus musculus lysyl oxidase -related 
protein 2 

356 

6 1 . 000 

2210 

AF053368 

Mus musculus lysyl oxidase-related 
protein 2 

268 

7 1 . 429 

2211 

AF053368 

Mus musculus lysyl oxidase -related 
protein 2 

341 

80 . 303 

2212 

AF053368 

Mus musculus lysyl oxidase-related 
protein 2 

329 

7 8.788 

2214 

AF053368 

Mus musculus lysyl oxidase-related 
protein 2 

849 

8 1 . 879 

2215 

AF053368 

Mus musculus lysyl oxidase-related 
protein 2 

1113 

59 . 365 

2216 

AF053368 

Mus musculus lysyl oxidase-related 
protein 2 

1966 

84 .503 

2217 

AF053368 

Mus musculus lysyl oxidase-related 
protein 2 

4677 

88 . 173 

2218 

J03191 

Homo sapiens profilin 

412 

100 . 000 

2219 

J03191 

Homo sapiens profilin 

300 

58.503 

2220 

AL031009 

Homo sapiens C431H6.1.2 {PUTATIVE 
novel protein) (PUTATIVE isoform 2) 

594 

94.898 

2221 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

395 

48.372 

2222 

U79260 

Homo sapiens unknown 

243 

59.155 

2223 

V00599 

Homo sapiens beta- tubulin 

757 

98.261 

2224 

V00599 

Homo sapiens beta -tubulin 

1512 

82.183 

2225 

L13291 

Homo sapiens ADP-ribosylarginine 
hydrolase 

962 

44.789 
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2226 

AF000944 

Rattus norvegicus TFIIA small subunit 

376 

1 G*7 QQft 

1 o/. oyu 

2227 

AF000944 

Rattus norvegicus TFIIA small subunit 

195 


2228 

U05313 

Trypanosoma brucei CR3 

88 


2229 

X83218 

Homo sapiens ATP synthase, oligomycin 
sensitivity conferring protein 

399 

100.000 

2230 

X83218 

Homo sapiens ATP synthase, oligomycin 
, sensitivity conferring protein 

381 

66.957 

2231 

AC002398 

pir B53814 Homo sapiens p20 protein 

121 

30.097 

2232 

AC002398 

pir B53 814 Homo sapiens p20 protein 

175 


2233 

AC002398 

pir B53814 Homo sapiens p20 protein 

180 

33.858 

2234 

M37275 

Drosophila melanogaster ND4-L gene 
product 


1 15 . 3 85 

2235 

M37275 

Drosophila melanogaster ND4-L gene 
product 

4 9 

1 Jo . 842 

2236 

M37275 

Drosophila melanogaster ND4-I* gene 
product 

108 

28.947 

2238 

AF047181 

Homo sapiens NADH- ubiquinone 
oxidoreductase subunit CI-SGDH 

852 

1 OA ji-ic 

2239 

AF047181 

Homo sapiens NADH-ubiquinone 
oxidoreductase subunit CI-SGDH 

599 

1 69 849 

2240 

U79260 

Homo sapiens unknown 

148 

52 . 083 

2242 

AF153612 

Homo sapiens peroxisomal D3,D2-enoyl- 
CoA isomerase 

1253 

76 86? 

2245 

Y11739 

Homo sapiens transcription factor 

59 

35.556 

2246 

Y11739 

Homo sapiens transcription factor 

92 

[ 23 . 177 

2247 

Y11739 

Homo sapiens transcription factor 

150 

32 . 086 

2248 

Y11739 

Homo sapiens transcription factor 

97 

39.394 

2249 

A27282 

Homo sapiens TGR-CL3C 

85 

31.193 

2250 

A27282 

Homo sapiens TGR-CL3C 

86 | 

D Q<7 

2251 

M69297 

Homo sapiens ORF 3 

297 

51.111 

2252 

M22332 

Homo sapiens unknown protein 

159 3 

80.645 

2253 

M22332 

Homo sapiens unknown protein 

ICO 

ID O 

77 .419 

2254 

M22332 

Homo sapiens unknown protein 

203 

60.000 t 

2255 

D61391 

Homo sapiens | 
phosphoribosypyroohosnhate svnfhph^fip. 
associated protein 39 

577 

100.000 

2256 

D61391 

Homo sapiens 

phosphoribosypyrophosphate synthetase- 
associated protein 39 

477 1 


2257 

D61391 

Homo sapiens 

phosphoribosypyrophosphate synthetase - 
associated protein 39 

1647 r 


2260 

M83248 

Homo sapiens nephropontin 

983 

88 . 235 

2261 

M83248 

Homo sapiens nephropontin 

1482 

89 . 809 

2262 

X53778 

Homo sapiens uracil DNA glycosylase 

870 i 

61.039 1 

2263 

X53778 

Homo sapiens uracil DNA glycosylase 

59 

2 6 000 

2264 

X53778 

Homo sapiens uracil DNA glycosylase 

976 

77 . 971 

*5 O C K 

U62326 

Rattus norvegicus macrophage migration 
inhibitory factor 

244 

61.864 

2266 

AB024935 

Mus musculus Sid3177p * i 

316 

56.757 

2267 

AB024935 

Mus musculus Sid3177p 

460 | 

72.941 I 

2268 

Y13374 

Homo sapiens putative prenylated 
protein 

446 

96.774 

2269 

Y13374 

Homo sapiens putative prenylated 

514 _L 

98.529 


1, 0 OB H- & m» : •::!; ,„ O 2 S! S O ii? 




protein 



2270 

Y13374 

nuiuu odpieim putative preny_L ctceQ 

protein 

123 

65.714 

2271 

Y13374 

Hnmn n^ni pna mihahi' va nrnnvl *- ^ j 

protein 

1106 

88 . 679 

2272 

AB0019.93 

Homo 0 ati 1 pnn alia ma f-n ♦* \ e-\r\ fa/«i>A«> 

homologous protein 

221 

100 . 000 

2274 

AB001993 

HomO Q A 1 pnq nl "i r> mat'iirai* \ t-w\ 4? -\ 

1 oa^icuo vj-LXa ilict L UI dLlUIl L aCCOl 

homologous protein 

611 

78. 195 

2275 

AB001993 

Homo Rani pn 0 rt 1 4 a ma f-n vn £ -» ^ +- 

homologous protein 

445 

76. 871 

2276 

AF110812 

Mus musculus 25 kDa FK506 -binding 
protein 

**D J. 

89 . 041 

2277 

AF110812 

Mus musculus 25 kDa FK506-binding 
protein 

233 

97.222 

2278 

AF110812 

MUS mUSCUluS kHa T^TfCft Yirl-I nrf 

protein 

c a c 
685 

74 . 249 

2279 

Z49148 

Homo saoiens riboaomal o-roh^in t.^q 


77 . 987 

2280 

Z49148 

Homo saoiens ribosomal nrnt-p^n too 

315 . 

61 . 453 

2281 

S71129 

Homo sat) ipnfl APpfvl rVinl inAaf pmop 

acetylcholine acetvlhvdrolase arhR 
{I4-E5 domain, alternatively spliced} 
{EC 3.1.1.7} 

80 

42 . 188 

2282 

L03814 

Mus musculus regulatory protein 

388 

on n n n 

2283 

L03814 

Mub musculus regulatory protein 

1383 


2284 

AJ224997 

Rattus norvegicus huntingtin 

91 

42.308 

2285 

M15530 

Homo sapiens B-cell growth factor 

129 

-j zr .acq 

2286 

AF007170 

Homo sapiens unknown 

154 

85.714 

2287 

AF007170 

Homo sapiens unknown 

l/Oi 

58 . 176 

2288 

AF007170 

Homo sapiens unknown 

3584 

99.290 

2289 

U79260 

Homo s aniens unknown 

245 

43 . 103 

2290 

AB020725 

Homo sapiens KIAA0918 protein 

2558 

47.707 

2291 

AF200715 

Homo Rp^Ti T P>Tl <3 PTR Hrtma i n a *-»■•»• 

protein CED-6 

1606 

84.539 

2293 

AF200715 

Homo saDiena PTE Hnma *{ n aHs»»*»*-/~k-»- 
protein CED-6 

392 

98 . 333 

2294 

AF200715 

Homo s aniens PTB domain ^Hanhnr 
protein CED-6 


96 . 440 

2295 

AB032917 

Hylobates muelleri dopamine receptor 
D4 

135 

30.508 

2296 

AB032917 

Hylobates muelleri dopamine receptor 
D4 

123 

31.452 

2297 

U70734 

Homo sapiens 38 KDa Mov34 homo log 

72 8 

ion r\ r\ i\ 

2298 

U70734 

Homo sapiens 38 kDa Mov34 homo log 

1 7fT 

ion rtnn 

2299 

U70734 

Homo sapiens 38 kDa Mov34 homolog 



2300 

AF056184 

Homo sapiens WS basic-helix-loop-helix 
leucine zipper protein 

O 13 

a "3 "> ft ft 

2301 

AF056184 

Homo sapiens WS basic-helix-loop-helix 
leucine zipper protein 

1171 

84 .307 

2302 

Y08319 

Homo sapiens kinesin-2 

1137 

89.848 

2303 

Y08319 

Homo sapiens kinesin-2 

179 

93.333 

2304 

Y08319 

Homo sapiens kinesin-2 

285 

93.750 

2305 

Y08319 

Homo sapiens kinesin-2 

4002 

96.916 

2306 

M15530 

Homo sapiens B-cell growth factor 

140 

54.717 


lime 
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2308 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

53 

46 . 667 

2309 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

49 

38 .462 

2310 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

41 

40 . 000 

23X1 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

70 

37 . 500 

2312 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

51 

30.303 . 

2313 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

66 

30.645 

2314 

A32622 

synthetic construct HBsAg ayw Pre-Sl 
sequence 

106 

30.120 

2315 

X73460 

Homo sapiens ribosomal protein L3 

949 

74.545 

2316 

X73460 

Homo sapiens ribosomal protein L3 

676 

71.667 

2317 

X73460 

Homo sapiens ribosomal protein L3 

1209 

80.972 

2318 

X73460 

Homo sapiens ribosomal protein L3 

1909 

79.167 S 

2319 

X73460 

Homo sapiens ribosomal protein L3 

1710 

83.019 

2320 

X73460 

Homo sapiens ribosomal protein L3 

2210 

85.750 | 

2321 

X73460 

Homo sapiens ribosomal protein L3 

2265 

87.841 

2322 

X73460 

Homo sapiens ribosomal protein L3 

1519 

80.288 

2323 

D14688 

Rattus norvegicus myosin regulatory 
light chain 

371 

59.474 

2324 

X81109 

Homo sapiens tumor-associated antigen 

371 

93.548 

2325 

X81109 

Homo sapiens tumor-associated antigen 

437 

100.000 

2326 

X81109 

Homo sapiens tumor-associated antigen 

402 

100.000 

2327 

X81109 

Homo sapiens tumor-associated antigen 

602 

66 . 540 

2328 

Y15061 

Homo sapiens GalT4 protein 

2204 

92.913 

2329 

U64444 

Homo sapiens ubiquitin fusion- 
degradation 1 like protein 

749 

87.248 

2330 

U64444 

Homo sapiens ubiquitin fusion- 
degradation 1 like protein 

2162 

99.688 

2331 

U64444 

Homo sapiens ubiquitin fusion- 
degradation 1 like protein 

888 

69.081 

2332 

X58833 

Streptomyces coelicolor actVA 4 

44 

24.324 

2333 

X58833 

Streptomyces coelicolor actVA 4 

169 

27.987 

2334 

U10903 

Mus musculus zinc finger protein 

196 

31.366 

2335 

AL050300 

Arabidopsis thaliana putative protein 

199 

53.030 

2336 

AL050300 

Arabidopsis thaliana putative protein 

481 

43.243 

2338 

X83617 

Homo sapiens RanBPl 

328 

100.000 

2339 

X83617 

Homo sapiens RanBPl 

494 | 

69.378 

2341 

AB012190 

Homo sapiens Nedd8 -activating enzyme 
hUba3 

182 

100.000 

2342 

AB012190 

Homo sapiens Nedd8 -activating enzyme 
hUba3 

531 

92.683 

2343 

AB012190 

Homo sapiens Nedd8- activating enzyme 
hUba3 

1839 

86.801 ^ . 

2346 

U58658 

Homo sapiens unknown 

187 

50.602 

2348 

AB033105 

Homo sapiens KIAA1279 protein ' 

2047 

93.732 

2349 

AB033105 

Homo sapiens KIAA1279 protein 

3199 

95.138 

2350 

D50930 

Homo sapiens The KIAA0140 gene product 
is novel . 

60 

26.957 

2352 

D50930 

Homo sapiens The KIAA0140 gene product 

368 

36.889 
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is novel. 



2353 

AB017018 

Homo sapiens JKTBP2 

502 

60.811 

2354 

AB017018 

Homo sapiens JKTBP2 

1178 

60.054 

2355 

AB017018 

Homo sapiens JKTBP2 

71 

35.849 

2356 

AB017018 

Homo sapiens JKTBP2 

1930 

92 .962 

2357 

AB017018 

Homo sapiens JKTBP2 

2436 

94 .104 

2358 

AF026977 

Homo sapiens microsomal glutathione S- 
transferase 3 

242 

52 .695 

2359 

M13100 

Rattus norvegicus unknown protein 

221 

40.000 j 

2361 

U40766 

Meloidogyne incognita COL-1 

129 

31.515 i 

2362 

A78803 

unidentified unnamed protein product 

769 

90.071 

2363 

A78803 

unidentified unnamed protein product 

918 

100. 000 

2364 

A78803 

unidentified unnamed protein product 

337 

66.418 

2365 

A27282 

Homo sapiens TGR-CL3C 

67 

30.909 

2366 

A27282 

Homo sapiens TGR-CL3C 

72 

30.508 

2367 

AB000584 

Homo sapiens TGF-beta superfamily 
protein 

1090 

77 .116 

2368 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

101 

29.703 

2369 

AF181640 

Drosophila melanogaster BcDNA.GH09817 

704 

41.566 | 

2370 

AB007141 

Mus musculus AZ2 

404 

82.432 

2371 

AB007141 

Mus musculus AZ2 

522 

82 .653 

2372 

AB007141 

Mus musculus AZ2 

1450 

72.596 

2374 

U79260 

Homo sapiens unknown 

328 

67.532 

2375 

AF151877 

Homo sapiens CGI-119 protein 

499 

98 . 810 

2376 

AF151877 

Homo sapiens CGI -119 protein 

256 

67 .442 

2377 

AF151877 

Homo sapiens CGI -119 protein 

905 

87.554 

2378 

U09368 

Homo sapiens zinc finger protein 
ZNF14 0 

2803 

93 .506 

2379 

U58658 

Homo sapiens unknown 

212 

51 . Ill 

2380 

AP000061 

Aeropyrum pernix 175aa long 
hypothetical protein 

101 

28.788 

2381 

AL080092 

Homo sapiens hypothetical protein 

261 

61 . 654 

2382 

M77693 

Homo sapiens spermidine/ spermine Nl- 
acetyl transferase 

14 0 

100. 000 

2383 

M77693 

Homo sapiens spermidine/ spermine Nl- 
acetyltransf erase 

466 

98.551 

2384 

M77693 i 

Homo sapiens spermidine/ spermine Nl- 
acetyl transferase 

551 

62.755 

2385 

AB011145 

Homo sapiens KIAA0573 protein 

684 

97.059 ! 

2386 

AB011145 

Homo sapiens KIAA0573 protein 

1817 

77.662 | 

2387 

AF004561 

Homo sapiens p21-Arc 

570 

74 .211 

2388 

AF060087 

Mus musculus proteasome subunit iota 

555 

64.885 1 

2389 

AF127761 

Homo sapiens ribonucleoprotein RBM8 

444 

64 . 844 

2390 

AF127761 

Homo sapiens ribonucleoprotein RBM8 

876 

87.006 

2392 

L17318 

Rattus norvegicus proline- rich 
proteoglycan 

142 

34 . 932 

2393 

L17318 

Rattus norvegicus proline-rich 
proteoglycan 

142 

29.891 

2394 

L17318 

Rattus norvegicus proline-rich 
proteoglycan 

52 

33.333 

2395 

L17318 

Rattus norvegicus proline-rich 
proteoglycan 

136 

31.915 

2396 

L17318 

Rattus norvegicus proline-rich 

135 

32.292 
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proteoglycan 



2397 

L17318 

Rattus norvegicus proline-rich 
proteoglycan 

239 

31.553 

2398 

AF106941 

Homo sapiens beta-arrestin 2 

560 

77.099 

2399 

AF106941 

Homo sapiens beta-arrestin 2 

783 

80.240 

2400 

AF106941 

Homo sapieris beta-arrestin 2 

37 

44.444 

2401 

AF106941 

Homo sapiens beta-arrestin 2 

320 

97.917 

2402 

AF106941 

Homo sapiens beta-arrestin 2 

709 

100.000 

2403 

AF106941 

Homo sapiens beta-arrestin 2 

2171 

93.253 

2404 

AF056035 

Rattus norvegicus s-nexilin 

476 

94.203 

2405 

AF056035 

Rattus norvegicus s-nexilin 

2991 

84.072 

2407 

A01769 

synthetic construct vascular 
anticoagulating protein 

975 

79.819 

2408 

AL034428 

Homo sapiens dJ705D16.1 (U2 small 
nuclear RNA- associated B 1 ' antigen) 

56 

40.000 

2410 

AL034428 

Homo sapiens dJ705D16.1 (U2 small 
nuclear RNA-associated B ,f antigen) 

56 

40.000 

2411 

AL034428 

Homo sapiens dJ705D16.1 (U2 small 
nuclear RNA-associated B • • antigen) 

635 

69.461 

2412 

AL034428 

Homo sapiens dJ705D16.1 (U2 small 
nuclear RNA-associated B 1 1 antigen) 

40 

54.545 

2413 

AL034428 

Homo sapiens dJ705D16.1 <U2 small 
nuclear RNA-associated B 1 1 antigen) 

51 

33.333 

2414 

AL034428 

Homo sapiens dJ705D16.1 (U2 small 
nuclear RNA-associated B ,( antigen) 

56 

20.000 

2415 

AL034428 

Homo sapiens dJ705D16.1 (U2 small 
nuclear RNA-associated B 1 1 antigen) 

1000 

84.556 

2416 

U58658 

Homo sapiens unknown 

183 

46.988. 

2417 

D63486 

Homo sapiens The KIAA0152 gene product 
is related to a putative C.elegans 
gene encoded in cosmid F44E2 . 

621 

91.346 

2418 

D63486 

Homo sapiens The KIAA0152 gene product 
is related to a putative C.elegans 
gene encoded in cosmid F44E2 . 

38 

30.556 

2419 

D63486 

Homo sapiens The KIAA0152 gene product 
is related to a putative C.elegans 
gene encoded in cosmid F44E2 . 

1376 

86.242 

2420 

AF18O920 

Homo sapiens cyclin L ania-6a 

845 

80.000 

2421 

AF180920 

Homo sapiens cyclin L ania-6a 

597 

80.315 

2422 

AF180920 

Homo sapiens cyclin L ania-6a 

491 

95.000 

2423 

AF180920 

Homo sapiens cyclin L ania-6a 

2694 

91.902 

2424 

U68105 

Homo sapiens poly (A) -binding protein 

3245 

85.270 | 

2425 

U68105 

Homo sapiens poly (A) -binding protein 

669 ! 

68.902 

2426 

U68105 

Homo sapiens poly (A) -binding protein 

3871 

94.182 

2427 

U68105 

Homo sapiens poly (A) -binding protein 

4101 

99.371 

2428 

D84316 

Drosophila melanogaster rabl4 

406 

72.340 

2429 

D84316 

Drosophila melanogaster rabl4 

564 

69.271 

243 0 

U82303 

Homo sapiens unknown 

117 

80.769 

2431 

U82303 

Homo sapiens unknown 

45 

28.571 

2432 

U82303 

Homo sapiens unknown 

169 

56.897 

2433 

AJ238094 

Homo sapiens Lsml protein 

713 

98.261 | 

2434 

AJ238094 

Homo sapiens Lsml protein 

465 

88.889 

2435 

J04564 

Homo sapiens snRNP polypeptide B 

160 

2 8.736 

2436 

AL080159 

Homo sapiens hypothetical protein 

86 

30.337 
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2437 

AL080159 

Homo sapiens hypothetical protein 

101 

33.333 

2438 

AL080159 

Homo sapiens hypothetical protein 

389 

40 . 230 

2440 

X61046 

Hydra sp. mini -collagen 

177 

38.462 

2441 

X61046 

Hydra sp. mini -collagen 

154 

35.185 

2442 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glutamyltranspeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Fgenesh predictions.) 

496 

78.899 

2443 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glut amyltranspeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Fgenesh predictions.) 

679 

100.000 

2444 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glutamyltranspeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Fgenesh predictions . ) 

3877 

91. 742 

2445 

AB029022 

Homo sapiens KIAA1099 protein 

52 

36.364 

2446 

AB029022 

Homo sapiens KIAA1099 protein 

266 

67.797 i 

2447 

AB029022 

Homo sapiens KIAA1099 protein 

249 

80.851 i 

2448 

AB029022 

Homo sapiens KIAA1099 protein 

71 

24.561 

2449 

AB029022 

Homo sapiens KIAA1099 protein 

57 

60 .000 

2450 

AB029022 

Homo sapiens KIAA1099 protein 

1894 

55.609 

2451 

AB029022 

Homo sapiens KIAA1099 protein 

2364 

50.932 

2452 

U79260 

Homo sapiens unknown 

338 

68.817 

2453 

U31118 

Xenopus laevis cytoplasmic myosin II 
regulatory light chain 

485 

70.000 

2454 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

620 

48 .197 

2455 

AC005328 

Homo sapiens R26660 1, partial CDS 

201 

78 .462 

2456 

X06026 

Homo sapiens T-cell receptor T3 gamma 
chain 

589 

74 . 346 

2457 

AF159055 

Homo sapiens leucine zipper- like 
protein 

154 

61 .224 

2458 

X16064 

Homo sapiens tumor protein (AA 1 - 
172) 

617 

98 .969 

2460 

X16064 

Homo sapiens tumor protein (AA 1 - 
172) 

41 

24 .000 

2461 

X16064 

Homo sapiens tumor protein (AA 1 - 
172) 

366 

59.887 

2462 

U66372 

Bos taurus ribosomal protein S29 

215 

67.308 | 

2463 

U66372 

Bos taurus ribosomal protein S29 

184 

96.875 

2464 

X56389 

Canis familiaris rab4b 

914 

97 .973 

2465 

X56389 

Canis familiaris rab4b 

775 

83 .051 

2466 

X56389 

Canis familiaris rab4b 

874 

88 .199 

2467 

X56389 

Canis familiaris rab4b 

1084 

90 v741 

2468 

Z25820 

Homo sapiens dodecenoyl-CoA delta- 
isomerase , 

1254 

80 .769 

2470 

AF047183 

Homo sapiens NADH- ubiquinone 
oxidoreductase subunit CI-B12 

266 

66.981 ; 

2471 

Y14586 

Cricetulus griseus kinesin light chain 
isoform D 

97 

64 .000 
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2472 

Y14586 

Cricetulus qriseus kinesin liaht chain 
isoform D 

115 

*k & . / 

2475 

X14968 

Homo saoiens RI I -aloha sufoun i t ( aa 
404) 

JIG** 


2476 

U67934 

Homo sapiens 44.9 kDa. protein C18B11 
homo log 

56 


2477 

U67934 

Homo sapiens 44.9 kDa protein C18B11 
homolog 

963 

90 . 052 

2478 

U43360 

Peromyscus maniculatus reverse 
transcriptase 

300 

34 . 021 

2479 

U97553 

murine herpesvirus 68 unknown 

112 

30.000 

2480 

U58658 

Homo sapiens unknown 

179 

51.351 

2481 

U79260 

Homo sapiens unknown 

253 

50 .467 

2482 

AF151826 

Homo sapiens CGI-68 protein 

320 

65.385 

2483 

AF151826 

Homo sapiens CGI-68 protein 

269 

86 . 667 

2484 

AF151826 

Homo sapiens CGI-68 protein 

494 

95.000 

2485 

AF151826 

Homo sapiens CGI-68 protein 

52 

37 . 500 

2486 

AF151826 

Homo sapiens CGI-68 protein 

1244 

80.571 

2487 

D10699 

Rattus norvegicus ubiquitin carboxyl- 
terminal hydrolase PGP9-5 

44 6 

Q*7 ni c 

27 / m U ±0 

2488 

D10699 

Rattus norvegicus ubiquitin carboxyl- 
terminal hydrolase PGP9.5 

672 

70 . 192 

2489 

U96915 

Homo sapiens sin3 associated 
polypeptide pi 8 

47 

32.353 

2490 

U96915 

Homo sapiens sin3 associated 
polypeptide pl8 

600 

83.436 

2491 

L47668 

Homo sapiens alpha-2 collagen type I 

134 

44 .737 

2492 

U01317 

Homo sapiens beta-globin 

343 

6 0.736 

2494 

AF004561 

Homo sapiens p21~Arc 

498 

73 .684 

2495 

D50840 

Homo sapiens ceramide 
glucosyltransf erase 

816 


2496 

D50840 

Homo sapiens ceramide 
glucosyltransf erase 

848 

QfZ peg 

2497 

D50840 

Homo sapiens ceramide 
glucosyltransf erase 

939 


2498 

D50840 

Homo sapiens ceramide 
glucosyltransf erase 

389 

i no onn 

2499 

D50840 

Homo sapiens ceramide 
glucosyltransf erase 

19*57 

Z> f 

do cno 

2500 

L24521 

Homo sapiens transformation-related 
protein 

143 

-3 J ■ © *X 

2501 

L24521 

Homo sapiens transformation-related 
protein 

278 

GO 701 

2502 

AL031588 

Homo sapiens dJ1163J1.3 (novel protein 
similar to mouse B99) 

768 

CQ QC ft 

2503 

AL031588 

Homo sapiens d*J1163J1.3 (novel protein 
similar to mouse B99) 

454 

68 .254 

2504 

AL031588 

Homo sapiens dJ1163J1.3 (novel protein 
similar to mouse B99) 

905 

76.119 

2505 

AL031588 

Homo sapiens dJX163J1.3 (novel* protein 
similar to mouse B99) 

93 

22 .909 

2506 

AL031588 

Homo sapiens dJ1163J1.3 (novel protein 
similar to mouse B99) 

661 

87 .903 

2507 

AL031588 

Homo sapiens d«J1163J1.3 (novel protein 

3198 

73 .684 
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similar to mouse B99) 



2508 

AL031588 

Homo sapiens dJ1163J1.3 (novel protein 
similar to mouse B99) 

4125 

96.639 

2509 

Y11651 

Homo sapiens phosphate cyclase 

497 

55.897 

2510 

Y11651 

Homo sapiens phosphate cyclase 

1999 

92.000 

2511 

X51706 

Rattus rattus ribosomal protein L9 (AA 
1-192) 

400 

63 ,317 

2512 

X55777 

Homo sapiens put. ORF 

351 

69.048 

2513 

X96713 

Globodera pallida collagen 

174 

39.167 

2514 

X67703 

Drosophila melanogaster Mst84Db 

84 

35.000 

2515 

AF160728 

Homo sapiens sex comb on midleg-like-1 
protein 

1027 

79.048 j 

2516 

AF160728 

Homo sapiens sex comb on midleg-like-1 
protein 

1001 

89.573 

2517 

AC004539 

Homo sapiens unknown function; similar 
to Y09105 (PID:gl666171) 

40 

42 . 857 

2519 

AC004539 

Homo sapiens unknown function; similar 
to Y09105 (PID:gl666171) 

46 

22.581 

2520 

AC004539 

Homo sapiens unknown function; similar 
to Y09105 (PID:gl666171) 

820 

90.909 

2521 

X59720 

Saccharomyces cerevisiae YCR053w, 
len:514 

878 

37 . 472 

2522 

AL117204 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk381b7.5 comes 
from this gene 

57 

43.478 

2523 

AIill7204 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk381b7.5 comes 
from this gene 

74 

32.143 

2524 

AL117204 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk381b7.5 comes 
from this- gene 

175 

4 5 . 614 } 

2525 

AL117204 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk381b7.5 comes 
from this gene 

1082 

34.412 

2526 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

189 

78 . 947 

2527 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

189 

78 . 947 . 

2528 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

268 

52.294 

2530 

U80559 

Human immunodeficiency virus type 1 
tat protein 

100 

38.710 

2532 

S58722 

Homo sapiens X-linked retinopathy 
protein {C- terminal, clone XEH.8c} 

245 

64.062 

2533 

AP000060 

Aeropyrum pernix lOOaa long 
hypothetical protein 

109 

34.066 

2534 

X55656 

Homo sapiens gamma-G globin 

843 j 

89.542 

2535 

X55656 

Homo sapiens gamma-G globin 

421 

81.699 

2536 

AJ223953 

Homo sapiens hPTTG 

469 

100.000 

2537 

AJ223953 

Homo sapiens hPTTG 

469 

63.514 

2538 

X69532 

Homo sapiens inter-alpha- trypsin 
inhibitor heavy chain HI 

4 94 

62.143 

2539 

X69532 

Homo sapiens inter-alpha- trypsin 
inhibitor heavy chain HI 

1070 

94 .118 

2540 

X69532 

Homo sapiens inter-alpha-trypsin 

554 

83 .178 
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inhibitor heavy chain HI 



2541 

X69532 

Homo sapiens inter -alpha* trypsin 
inhibitor heavy chain HI 

728 

54 . 110 

2542 

X69532 

Homo sapiens inter- alpha- trypsin 
inhibitor heavy chain HI 

4793 

89 . 915 

2543 

AJ243721 

Homo sapiens dTDP-4 -keto-6-deoxy-D- 
glucose 4 -reductase 

1093 

74 . 419 

2544 

D13 892 

Homo sapiens carboxyl 
methyl trans f erase 

509 

93 . 103 

2545 

D13892 

Homo sapiens carboxyl 
methyl transferase 

38 

30 . 000 

2546 

D13892 

Homo sapiens carboxyl 
methyltransf erase 

41 

33 . 333 

2548 

D13892 

Homo sapiens carboxyl 
methyltransf erase 

785 

80 . 085 

2549 

M16447 

Homo sapiens dihydropteridine 
reductase (EC 1.6.99.7) 

713 

92 .500 

2550 

M16447 

Homo sapiens dihydropteridine 
reductase (EC 1.6.99.7) 

1022 

89 .474 

2552 

U35376 

Homo sapiens repressor transcriptional 
factor 

534 

60.526 

2553 

U35376 

Homo sapiens repressor transcriptional 
factor 

1868 

50.000 

2554 

S42658 ' 

Homo sapiens S3 ribosomal protein 

1032 

87.500 

2555 

S42658 

Homo sapiens S3 ribosomal protein 

847 

65 . 021 

2556 

S42658 

Homo sapiens S3 ribosomal protein 

689 

77.647 

2557 

AF041330 

Bodo saltans NADH dehydrogenase 
subunit 5 

166 

28.671 

2558 

AF030162 

Homo sapiens inner mitochondrial 
membrane translocase Tim23 

750 

79.535 

2559 

U27488 

Pseudorabies virus glycoprotein gX 

57 

44 . 000 

2560 

U27488 

Pseudorabies virus glycoprotein gX 

148 

39.326 

2561 

AF034580 

Mus musculus TSG118.1 

64 

34.375 

2562 

AF034580 

Mus musculus TSG118.1 

270 

43.796 

2563 

AF034580 

Mus musculus TSG118.1 

2 94 

75.410 

2564 

AF034580 

Mus musculus TSG118.1 

64 

34 .375 

2565 

AF034580 

Mus musculus TSG118.1 

1622 

58.378 

2566 

X61046 

Hydra sp. mini -collagen 

160 

44 . 156 

2567 

AF126062 

Homo sapiens Arf-like 2 binding 
protein BART1 

624 

97.030 

2568 

AF126062 

Homo sapiens Arf-like 2 binding 
protein BART1 

673 

89.222 

2569 

U26162 

Homo sapiens myosin regulatory light 
chain 

269 

61.622 

2570 

U79260 

Homo sapiens unknown 

195 ] 

52.381 

2571 

U05313 

Trypanosoma brucei CR3 

78 

36.364 

2572 

Z14134 

Dictyostelium discoideum ras -related 
protein 

247 

29.648 

2573 

U79260 

Homo sapiens unknown 

309 

64.894 

2574 

AF044127 

Homo sapiens peroxisomal short'- chain 
alcohol dehydrogenase 

385 

98.361 

2575 

AF044127 

Homo sapiens peroxisomal short -chain 
alcohol dehydrogenase 

1359 

90.837 

2576 

AF044127 

Homo sapiens peroxisomal short-chain 

46 

30.303 
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alcohol dehydrogenase 



2577 

AF044127 

Homo sapiens peroxisomal short -chain 
alcohol dehydrogenase 

162 

92 . 593 

2578 

AF044127 

Homo sapiens peroxisomal short-chain 
alcohol dehydrogenase 

153 

100.000 

2579 

AF044127 

Homo sapiens peroxisomal short -chain 
alcohol dehydrogenase 

706 

61.450 

2580 

AF044127 

Homo sapiens peroxisomal short -chain 
alcohol dehydrogenase 

1286 

92.490 

2581 

AB030236 

Canis familiaris D4 dopamine receptor 

147 

41.270 

2582 

AB030236 

Canis familiaris D4 dopamine receptor 

105 

42 .424 

2583 

M19419 

Mus musculus proline-rich salivary . 
protein 

189 

30.682 

2584 

M19419 

Mus musculus proline-rich salivary 
protein 

205 

38.583 

2585 

Z81039 

Unknown predicted using Genefinder; 
cDNA EST EMBL:T01209 comes from this 
gene; cDNA EST yk278all.3 

300 

33.803 

2586 

Z81039 

Unknown predicted using Genefinder; 
CDNA EST EMBL:T01209 comes from this 
gene; cDNA EST yk278all.3 

208 

28.994 

2587 

AF151875 

Homo sapiens CGI -117 protein 

276 

62.921 

2588 

AF151875 

Homo sapiens CGI-117 protein 

660 

85.165 

2589 

M27444 

Bos taurus phosphoprotein 

511 

78.846 

2590 

M27444 

Bos taurus phosphoprotein 

923 

81.053 

2591 

S54563 

Mus sp. alpha 2 <XI) collagen 

111 

40.000 

2592 

X73608 

Homo sapiens testican 

42 

27.586 

2593 

X73608 

Homo sapiens testican 

42 

27.586 

2594 

X73608 

Homo sapiens testican 

46 

35.897 

2595 

X73608 

Homo sapiens testican 

1481 

87.747 

2596 

X73608 

Homo sapiens testican 

2423 

90.265 

2597 

M57290 

Gallus gallus non-tyrosine protein 
kinase 

155 

28.761 

2599 

D78255 

Mus musculus PAP-1 

45 

27.273 

2600 

D78255 

Mus musculus PAP-1 

1240 

92.821 

2601 

D78255 

Mus musculus PAP-1 

770 

51.125 

2603 

AF173378 

Homo sapiens 60S acidic ribosomal 
protein PO 

47 

28.571 

2604 

AP173378 

Homo sapiens 60S acidic ribosomal 
protein PO 

44 

77.778 

2605 

AF173378 

Homo sapiens 60S acidic ribosomal 
protein PO 

44 

77.778 

2606 

AF173378 

Homo sapiens 60S acidic ribosomal 
protein PO 

364 

XOO.000 

2607 

AF173378 

Homo sapiens 60S acidic ribosomal 
protein PO 

51 

23.684 

2608 

AF173378 

Homo sapiens 60S acidic ribosomal 

orohein PO 

545 

100. 000 

2609 

AF173378 

Homo sapiens 60S acidic ribosomal 
protein PO 

1082 

87.288 

2610 

X67703 

Drosophila melanogaster Mst84Db 

79 

38.889 

2611 

X67703 

Drosophila melanogaster Mst84Db 

152 

45.455 

2612 

M13974 

Bos taurus beta type protein kinase C 

517 

81.928 

2613 

M13974 

Bos taurus beta type protein kinase C 

991 

58.983 


P. 


2614 

M13974 

Bos taurus beta type protein kinase C 

791 

38.095 

2615 

M13974 

Bos taurus beta type protein kinase C 

3455 

77.581 

2616 

U03886 

Homo sapiens a gene isolated from a 
CpG island between STS and KAL 

1574 

97.610 

2617 

U03886 

Homo sapiens a gene isolated from a 
CpG island between STS and KAL 

1099 

89.961 

2618 

AJ005894 

Homo sapiens JM27 

158 

44.304 

2619 

X03145 

Homo sapiens pot- ORF III 

45 

44.444 

2620 

X03145 

Homo sapiens pot. ORF III 

44 

30.000 

2621 

X03145 

Homo sapiens pot. ORF III 

130 

37.168 

2622 

X03145 

Homo sapiens pot. ORF III 

162 

34 .400 

2653 

X55683 

Lycopersicon esculentum extensin 
(class I) 

159 

36 . Ill 

2624 

Z98204 

Hordeum vulgare extensin 

354 

29.499 

2625 

S76877 

Danio rerioazebraf ish | embryos , 
Peptide, 215 aa casein kinase 2 beta 
subunit, CK2 beta=protein kinase 

434 

64.348 

2627 

AF135421 

Homo sapiens GDP-mannose 
pyrophosphorylase B 

50 

23.529 

2628 

AF135421 

Homo sapiens GDP-mannose 
pyrophosphorylase B 

2150 

82.394 

2629 

AF135421 

Homo sapiens GDP-mannose 
pyrophosphorylase B 

2279 

98.333 

2631 

Y16610 

Homo sapiens paraplegin 

528 

98.795 

2632 

Y16610 

Homo sapiens paraplegin 

1781 

56.792 

2633 

Y16610 

Homo sapiens paraplegin 

4874 

98.996 

2634 

D86864 

Homo sapiens acetyl LDL receptor 

733 

59.091 

2635 

D86864 

Homo sapiens acetyl LDL receptor 

2083 

37.377 

2636 

221507 

Homo sapiens human elongation fac tor- 
i-delta 

1095 

82.353 

2637 

AF151863 

Homo sapiens CGI-105 protein 

621 

91.589 

2638 

AF151863 

Homo sapiens CGI-105 protein 

1812 

89.172 

2639 

AF151B63 

Homo sapiens CGI-105 protein 

696 

53.571 

2640 

AF151863 

Homo sapiens CGI-105 protein 

356 

47.191 

2641 

AF151863 

Homo sapiens CGI-105 protein 

683 

67.000 

2642 

AF132181 

Drosophila melanogaster unknown 

730 

46.209 

2643 

AF132181 

Drosophila melanogaster unknown 

1478 

35.006 

2644 

AB029029 

Homo sapiens KIAA1106 protein 

304 

94.118 

2645 

AB029029 

Homo sapiens KIAA1106 protein 

293 

97.959 

2646 

AB029029 

Homo sapiens KIAA1106 protein 

4806 

97.305 

2647 

AB029029 

Homo sapiens KIAA1106 protein 

6761 

99.113 

2648 

AL021916 

Homo sapiens b34I8.1 (Kruppel related 
Zinc Finger protein 184) 

100 

25.000 

2649 

AL021918 

Homo sapiens b34I8.1 (Kruppel related 
Zinc Finger protein 184) 

204 

55.357 

2650 

AL021918 

Homo sapiens b34I8.1 (Kruppel related 
Zinc Finger protein 184) 

1604 

40.859 

2651 

L39874 

Homo sapiens deoxycyt idyl ate deaminase 

38 

26. 154 

2652 

L39874 

Homo sapiens deoxycyt idylate deaminase 

605 

69.149 

2653 

U77381 

Arabidopsis thaliana WD-4 0 repeat 
protein 

296 

58.333 

2654 

U77381 

Arabidopsis thaliana WD-40 repeat 
protein 

524 

46.041 

2655 

AP000060 

Aeropyrum pernix 128aa long 

137 

39.623 
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hypothetical protein 



2656 

S57432 

XenODUB laevis Rnl pnnrvhpa pr^r^t- ■{ 

165 aa 4 OS ribosomal small stibunit 
protein S10 


90 . 090 

2657 

S57432 

Xenopus laevis, splehocytes , Peptide, 
165 aa 40S ribosomal small subunit 
protein S10 

279 

58.915 

2658 

AJ005962 

Drosophila melanogaster clathrin- 
associated protein 

746 

78.873 

2660 

AJ005962 

Drosophila melanoaaster clathrin- 
associated protein 

"I ft A Q 

7 8.378 

2661 

Z93241 

Homo sapiens dJ222E13 . la. 1 (C-terminal 
part of novel protein dJ222E13.1) 
(partial isoform 1) 

120 

57.576 

2662 

Z93241 

Homo sapiens dJ222B13 . la. 1 (C-terminal 
part of novel protein dJ222E13.1) 
(partial isoform 1) 

796 

89 . 510 

2663 

Z93241 

Homo sapiens dJ222E13 . la. 1 (C-terminal 
part of novel protein dJ222B13.1) 
(partial isoform 1) 

186 

61.538 

2665 

M27024 

Homo sapiens heat shock protein 

473 

49.167 

2666 

M27024 

Homo sapiens heat shock protein 

458 

98. 649 

2667 

M27024 

Homo sapiens heat shock protein 

3109 

87 . 919 

2668 

U5865B 

Homo sapiens unknown 

301 

64 .286 

2670 

AP000060 

Aeropyrum pernix 14 3aa long 
hypothetical protein 

HI 

30. 894 

2671 

D10653 

Homo sapiens cell surface glycoprotein 

1226 

93.305 1 

2672 

L48546 

Homo sapiens tuberin 

956 

95.455 

2673 

L48546 

Homo sapiens tuberin 

283 

97.727 

2674 

L48546 

Homo sapiens tuberin 

258 

97.500 

2675 

L48546 

Homo sapiens tuberin 

452 

97.222 

2676 

L48546 

Homo sapiens tuberin 

11312 

88.681 

2677 

L48546 

Homo sapiens tuberin 

10891 

96.152 

2678 

L48546 

Homo sapiens tuberin 

11286 

97.573 

2679 

AJ005564 

Mus musculus SPR2P protein 

48 

58.824 

2680 

AJ005564 

Mus musculus SPR2F protein 


n icq 

2681 

L43361 

Mycoplasma gallisepticum 
haemagglut inin 

109 

35.556 

2682 

Z69637 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to E.coli 
hypothetical protein YCAC 
(SW: YCAC_ECOLI) ; cDNA EST yk555d!2.3 
comes from this gene 

309 

63.291 

2683 

Z69637 

.— :* 

Caenorhabditis elegans predicted using 
Genefinder; Similarity to E.coli 
hypothetical protein YCAC 
(SW:YCAC_ECOLI) ; cDNA EST yk555dl2.3 
comes from this gene 

476 

4 8.131 

2686 

AB023185 

Homo sapiens KIAA0968 protein 

278 

56.098 ! 

2687 

AB023185 

Homo sapiens KIAA0968 protein / 

3181 

97.546 

2688 

AB023185 

Homo sapiens KIAA0968 protein 

3273 

96.947 

2690 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

181 

59.701 

2691 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

219 

59.701 
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2692 

AF151886 

Homo sapiens CGI -12 8 protein 

630 

96.040 

2693 

AF151886 

Homo sapiens CGI -12 8 protein 

352 

91.525 

2694 

AF151886 

Homo sapiens CGI -12 8 protein 

854 

76.923 

2695 

AF151B86 

Homo sapiens CGI -12 8 protein 

678 

69.945 

2696 

AF151886 

Homo sapiens CGI -12 8 protein 

472 

70.349 

2697 

AF132970 

Homo sapiens CGI-36 protein 

543 

71.176 

2698 

U54638 

Mus musculus rhotekin 

726 

66.292 

2699 

U54638 

Mus musculus rhotekin 

2192 

76.821 

2700 

U54 63 8 

Mus musculus rhotekin 

339 

60.396 

2701 

U54638 

Mus musculus rhotekin 

55 

28 .767 

2702 

U54638 

Mus musculus rhotekin 

756 

62.755 

2703 

U54638 

Mus musculus rhotekin 

2257 

79.667 

2704 

U97006 

Caenorhabditis elegans No definition 

214 

43 . 000 



line found 



2705 

U97006 

Caenorhabditis elegans No definition 

131 

39.130 



line found 



2706 

U97006 

Caenorhabditis elegans No definition 

120 

36. 957 



line found 



2707 

U97006 

Caenorhabditis elegans No definition 

122 

36 . 207 



line found 



2708 

U97006 

Caenorhabditis elegans No definition 

142 

29 . 600 



line found 



2709 

U97006 

Caenorhabditis elegans No definition 

327 

44 . 737 



line found 



2710 

U97006 

Caenorhabditis elegans No definition 

2671 

2 9 . 978 



line found 



2711 

AJ238629 

Chlorella protothecoides putative 

66 

31. Ill 



extensin-like protein 



2712 

S58722 

Homo sapiens X- linked retinopathy 

185 

61.818 



protein {C-terminal, clone XEH.8c} 



2713 

Z49199 

Homo sapiens RNA polymerase II subunit 

427 

72.277 ! 



HRPB17 ; 



2714 

Z49199 

Homo sapiens RNA polymerase II subunit 

210 

82.500 



hRPB17 



2715 

Z49199 

Homo sapiens RNA polymerase II subunit 

701 

88. 816 



hRPB17 



2716 

D50926 

Homo sapiens The KIAA0136 gene product 

4748 

84 . 701 



is novel. 



2717 

D50926 

Homo sapiens The KIAA0136 gene product 

1947 

81. 888 



is novel. 



2718 

D50926 

Homo sapiens The KIAA0136 gene product- 

328 

94.340 



is novel. 



2719 

D50926 

Homo. sapiens The KIAA0136 gene product 

5808 

97. 275 



is novel. 



2720 

L24521 

Homo sapiens transformation-related 

176 

58.108 



protein 



2721 

AF010144 

Homo sapiens neuronal thread protein 

283 

42.442 



AD7C-NTP ' : 



2722 

AJ000644 

Homo sapiens SPOP 

780 

92.424 

2723 

AJ000644 

Homo sapiens SPOP 

90 

100.000 

2724 

AJ000644 

Homo sapiens SPOP 

1837 

92.308 

2726 

J04196 

Bos taurus GTP-binding protein 

1026 

100.000 

2727 

J04196 

Bos taurus GTP-binding protein 

763 

80.214 

2728 

M15530 

Homo sapiens B-cell growth factor 

219 

66.038 
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2730 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

465 

■ 

52.308 

2731 

M34671 

Homo sapiens antigen CD59 precursor 
(CD59) 

299 

66. 6G7 

■ ni*) 
i / Jz 

Woo DO/ 

Homo sapiens NAP 

658 

99.020 

*3 *7 "* >l 

4/34 

MBooo/ 

Homo sapiens NAP 

263 

90.000 

OTIC 

Moooo / 

Homo sapiens NAP 

783 

99.194 

•3 "7 ^ £ 

Moooo/ 

Homo sapiens NAP 

277 

67.692 

"5 1 1 *7 
/.lot 

MO £ f f7 

Homo sapiens NAP 

291 

70.769 

2738 

M86667 

Homo sapiens NAP 

1638 

85.600 

273 9 

AL0B0155 

Homo sapiens hypothetical protein 

804 

75.132 

2740 

AL080155 

Homo sapiens hypothetical protein 

832 

88.312 

2741 

AL080155 

Homo sapiens hypothetical protein 

2026 

94.805 

2742 

AL080155 

Homo sapiens hypothetical protein 

2339 

95.852 

2743 

AF020261 

Santalum album proline rich protein 

245 

31.270 

2744 

M94383 

Drosophila melanogaster suppressor of 
hairless protein 

209 

61.224 

2745 

M94383 

Drosophila melanogaster suppressor of 
hairless protein 

322 

90.385 

2746 

M94383 

Drosophila melanogaster suppressor of 
hairless protein 

681 

85.345 

2747 

M94383 

Drosophila melanogaster suppressor of 
hairless protein 

1699 

67.086 

2748 

X01655 

Homo sapiens type III procollagen (aa 
892-1023) 

43 

39.130 

2749 

X01655 

Homo sapiens type III procollagen (aa 
892-1023) 

106 

38.596 

2750 

AB025008 

Homo sapiens novel member of chitinase 
family 

144 

58.537 

2751 

AB025008 

Homo sapiens novel member of chitinase 
family 

44 

22.772 

2752 

AB025008 

Homo sapiens novel member of chitinase 
family 

861 

47.703 

2753 

AB025008 

Homo sapiens novel member of chitinase 
family 

44 

36.364 , 

2754 

AB025008 

Homo sapiens novel member of chitinase 
family 

508 

34.225 

2756 

S74659 

Homo sapiens aggrecan keratan sulfate 
domain {hexapeptide repeats} 1 

110 

35.870 


A85785 

Homo sapiens duffy antigen/ cherokine 
receptor 

1539 

66.350 

07CO 


Sus scrofa FOSB protein 

99 

43.860 



Homo sapiens unknown protein 

1528 

87.589 

2763 

M22334 

Homo sapiens unknown protein 1 

897 

94.000 

4 f P / 

M22334 

Homo sapiens unknown protein 

714 

90^625 

2768 

M22334 

Homo sapiens unknown protein 

49 

37.500 ! 


M2 2334 

Homo sapiens unknown protein 

1181 

94.898 

2770 

M22334 

Homo s aniens unknown r»"rot- pin ~ ■ 

"925 

90.854 

2771 

M22334 

Homo sapiens unknown protein 

2804 

88.690 

2772 

AL031427 

Homo sapiens dJ167A19.3 (novel 
protein) 

506 

78.505 

2773 

AL031427 

Homo sapiens dJ167A19.3 (novel 
protein) 

385 

59.028 

2774 

AL031427 

Homo sapiens dJ167A19.3 (novel 

236 1 97.500 
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protein) 



2775 

AL031427 

Homo sapiens dJ167A19.3 (novel 
protein) 

576 

85.714 

2776 

Y17328 

Rattus norvegicus mu crystallin 

121 

91.667 

2777 

Y17328 

. Rattus norvegicus mu crystallin 

1339 

76.398 

2778 

X71440 

Homo sapiens peroxisomal acyl-CoA 
oxidase 

4141 

95.008 

2779 

X71440 

Homo sapiens peroxisomal acyl-CoA 
oxidase 

289 

95.000 

2780 

X71440 

Homo sapiens peroxisomal acyl-CoA 
oxidase 

450 

92.958 

2781 

X71440 

Homo sapiens peroxisomal acyl-CoA 
oxidase 

385 

90.000 

2782 

X71440 

Homo sapiens peroxisomal acyl-CoA 
oxidase 

423 

98.507 

2783 

X71440 

Homo sapiens peroxisomal acyl-CoA 
oxidase 

3736 

95.152 

2784 

SS4563 

Mus sp. alpha 2 (XI) collagen 

70 

30.769 

2785 

S54563 

Mus sp. alpha 2 (XI) collagen 

70 

30.769 

2786 

S54563 

Mus sp. alpha 2 (XI) collagen 

112 

33.663 

2787 

AF189307 

Homo sapiens unknown 

133 

44.828 

2788 

AF132961 

Homo sapiens CGI -27 protein 

220 

91.892 

2790 

AF132961 

Homo sapiens CGI -27 protein 

597, 

77.311 

2791 

AF132961 

Homo sapiens CGI-27 protein 

1386 

72.121 

2792 

AF132961 

Homo sapiens CGI -27 protein 

416 

52.023 

2793 

AF132961 

Homo sapiens CGI -27 protein 

1838 

98.969 

2794 

Z47087 

Homo sapiens RNA polymerase II 
elongation factor-like protein 

506 

70.760 

2795 

J04655 

Ascaris suum collagen 

125 

38.235 

2796 

X77337 

Homo sapiens phosphate carrier 

1404 

82.703 

2797 

Z79694 

Unknown predicted using Gene finder; 
similar to collagen; cDNA EST 
EMBL:D65535 comes from this gene; 

154 

33.333 

2798 

AF022985 

Caenorhabditis elegans Similar to 
collagen 

175 

30.070 

2799 

U44111 

Homo sapiens histamine N- 
methyltransf erase 

1448 

85.502 

2600 

U44111 

Homo sapiens histamine N- 
methyl transferase 

1355 

90.236 . 

2801 

AB011084 

Homo sapiens KIAA0512 protein 

626 

34.106 

2802 

AB0110B4 

Homo sapiens KIAA0512 protein 

39 

22.222 

2803 

AB011084 

Homo sapiens KIAA0512 protein 

644 

27.087 

2804 

AB011084 

Homo sapiens KIAA0512 protein 

445 

32.787 

2805 

X73435 

Ovis aries KAP5.5 keratin protein 

115 

33.708 

2806 

M90657 

Homo sapiens L6 

386 

41.053 

2807 

X60957 

Homo sapiens receptor tyrosine kinase 

7394 

95.958 

2808 

X60957 

Homo sapiens receptor tyrosine kinase 

406 

95.588 

2ff09~ 

X60957 

Homo sapiens receptor tyrosine kinase 

7014 

94.952 " 

2811 

X56468 

Homo sapiens 14 . 3 . 3 protein 

787 

85.214 

2812 

X59244 | 

Homo sapiens ZNF43 

38 

50.000 

2813 

X59244 

Homo sapiens ZNF43 

1918 

41.124 

2814 

AC004544 

Homo sapiens cytochrome C oxidase; 
match to P14406 (PID:gll7121) 

324 

64.486 

2815 

AC004544 

Homo sapiens cytochrome C oxidase; 

154 

59.140 
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O O 1 C 

V A O vl a A 

Homo 8 3pl ens longest open reading 

ti alHC 

446 

70 . 807 

zox / 

TTfl ft no 1 

xionio sapiens nicotinainiQe im- 

ma f riv 1 t* T*A Tlrt T 7 ^ T~ A O 
11 iu wiiy a w& ouio 1.CA1 as c 

671 

63 . 793 

2818 



1 O A A 

97 .461 

2 819 

AP16118 1 

■/"VF X O X X O X 

M11 a mnflPiilna DC CT nyftfo^ n 
nuo iiiubcuius r j j 1 jJi. ULvlll 

O Q O A 

Oft AAA 

89 . 908 

2 820 

AiT005564 

J^KI \J \S ~J —f \J ** 

Mil A mil ftr*ii 1 11 fx Q.T5P0I? nrnhp^ n 

/ / 

VI A 01 /T 

40 . 816 

2821 

X67703 

Drosori'hiia melanoaaBtPT Mflfft4Da 

XUj 

O A A vl O 

-54 . 043 

2822 

X67703 

Drosoohila melanoaantpr mrt* fl4Da 


O Q T Q T 

2 824 

M86564 

Rattus norveaicus alnha — r> Y~ot* Vi vmrj r S n 

^ JX 

Ol 701 

2825 

M86564 

Rattus norveaicus • altsha — wi'Oi'Hvmon 1 r» 

24 6 

' /re O Q Q 

2826 

U18385 

Caenorhabd i tt i h eleaans coll aapn 


AO 

2828 

U18385 

Gaenorhabditis elegans collagen 

73 

23.611 

2 829 

U183 85 

C*Zi onoT" A a Vifl ^ h 4 o a1 Anana ha! 1 anon 
u a cixui lid UU 1 L i d cic^aiib LU1 1 dy ell 


66 . 667 

2 830 

Til ft 1 ft ^ 

Pa onrir ViaVn"? ^ t*1 fl ol Adano rnl 1 anon 


^ . 632 

0 ft"*! 

<£. O J X 

TT1 ft T. ft 

Pa an^TVl a V"»/^ ^ K i q A I Afta n a ca! 1 arron 

i_cit;iiox xidxjcix c x t3 cxeyaiis coiidyen 

46 

28 .571 

"? ft 0 

T71 ft ^ ft c; 

LdenotflaoaiLis eiegans collagen 

117 

20 . 904 

^ O .3 *x 

m 8"^ ft ^ 

W J. O J O ZJ 

Pa On /""»t"Vi a Vi/I -| t~ "{ t3 a 1 Afra n a 1 aft An 
^aCilUIllcUJUJLUXo CXcy alio L.OXldycIl 

1X4 

20. 690 

* 0 j 

v^i O X D . 

nuiuu ttcipicim [iiyuoxu xcyuidtory X xyxxt, 

chain 

•a T 

6 5 . 169 

2 836 

AL05017 9 

Homo RAr>1 pinq hvriAt*h Ah ^ r»a 1 riT-nf-** t n 

f* D _> 

—1 -T c Q C 

2837 

AL05 0179 

Unmn oar>i our hvnnh hph { pal nrAho "4 n 

xiwinw Oayxcilo liy uuuilCUXk/dX UXULCXU 


TC O O A 

2838 

AI.050179 

#1 Jul V ^ w JL t ^ 

Homo RATii otifi hvnnt*hphi ral nrrthoi'n 

O / X 

C»7 CAT 

2839 

X57432 

paf^f iiq T*rai f- t*n a n'hnnnnial nrnhoin QO 

XV C* V» V. UO X. d. UO X, Xi/U DUlUa X £S A. V> V? XXI O A 

OA 
O *± 

C Q QOX 

o o • Ois4 

2 840 

X574 32 

Pat*hllfl Y" ri t" t~ 1 1 Q ■{ Knonma 1 nT-r*\i- ~1 n 0"> 
J\ c* ^ lud XdlLUo X XJJUoUlliaX pxULtrlll o ^5 


J4 . 483 

2 841 

X57432 

O f- f 1 1 fl T*at*T*llfJ Y*i l^rtQftmal nT**"!** o 4 n QO 
AdUtUD XaUUUo x xx/uouiuai pxuLcxu 

■311 
Oil 

*7 A AAA 

/ 0 . 000 

2 842 

niJv j ull/1 

Wninrt cj ani on a Vi\/nAt* V> ^ «a 1 T-\>-/-\t~ a 4 n 
X1UIIK-/ oapxcJlo XiyiJt-/l_Xlt;v. -XQ^clX pxOLcin 


22 . 3 08 

2843 

AL050101 

Homo sapiens hypothetical protein 

49 

34.286 

9 ft A A 

Z O 4 4 


xiomo sapiens nypocnetxcax protGin 

59 

26.000 

2845 

AL050101 

Homo sapiens hypothetical protein 

447 

98.611 


7VT ACAT AT 

Homo sapiens hypothetical protein 

47 

29.787 

2847 

AL050101 

Homo sapiens hypothetical protein 

46 

34.483 

2848 

at r\ C 1 A1 

ALiOb 0101 

Homo sapiens hypothetical protein 

3185 

97.253 

2 849 

AF041330 

Bodo saltans NADH dehydrogenase 

BUDUniL i> 

158 

28.333 

O ft P <\ 

Z 0 D U 

VI CIOQ 

Paracent rotus lividus beta-tuhulin (AA 

•1 — £L A.H \ 
X — *x*x f / 

677 

99 . 029 

? ft n 

Y1 CO QO 

rdxdceiicroLus iiviauB Deta-cuDUiiii \aa 
1 - 447) 

1737 

77 . 007 

2853 


Tlrann aan^ ono C2. 1 *7 y*f*rkt* a ■{ n 

xivvuiw ooipxexis virix / proLcin 

-S r~ l~ 

92 . 857 

2854 

AF064603 

Homo b i pnn GA17 nrnt" oin 

X? 

2*Z> . t> D ^ 

2855 

AF064603 

Homo Rani on r OA! 7 nrnhp^n 


<c<c cu *a 
oo.ooj 

2856 

X52235 

Homo SaTii on g ORPTT 

OUMJ.V1IO J. J> 


O. 1 OCA 

J 1 . you 

2857 

V01184 

Mus muBCulus readino framp v-f no fncc) 

7 Aft 

7fl 710 
/ u . / lO 

2858 

V01184 

Mus musculua readincr f ramp v-fnq ^n^^^ 

fiO ft 

A A O CtfS 
./DO 

2859 

V01184 

Mus musculus reading frame v-fos (p55) 

1140 

69.159 

2861 

A48861 

Homo sapiens uiiriafne'd protein product 

2066 

93.294 

2862 

A48861 

Homo sapiens unnamed protein product 

1478 

91.103 

2863 

A48861 

Homo sapiens unnamed protein product 

1277 

77.273 

2864 

AJ005564 

Mus musculus SPR2F protein 

67 

32.203 

2865 

AJ005564 

Mus musculus SPR2F protein 

105 

37.313 

2866 

M15530 

Homo sapiens B-cell growth factor 

155 

56.522 
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2867 

AL080080 

Homo sapiens hypothetical protein 

1746 

96.864 

2868 

AF128625 

Homo sapiens CDC4 2 -binding protein 
kinase beta 

905 


2869 

AP128625 

Homo sapiens CDC4 2 -binding protein 
kinase beta 

842 

97 1 fil 

2870 

AF128625 ' 

Homo sapiens CDC42 -binding protein 
kinase beta 

9242 

ft 7 on Q 

2871 

AF128625 

Homo sapiens CDC4 2 -binding protein 
kinase beta 

302 

100 . 000 

?872 

AF128625 

Homo sapiens CDC42 -binding protein 
kinase beta 

7 95 

97.458 

2873 

AF128625 

Homo sapiens CDC4?-binding protein 
kinase beta 

10874 

98.660 | 

2874 

J04657 

Ascaris suum collagen 

230 

36.287 

2875 

AF042386 

Homo sapiens cyclophilin-33B 

580 

72.441 

2876 

AF042386 

Homo sapiens cyclophilin-33B 

2000 

100.000 

2877 

AF042386 

Homo sapiens cyclophilin-33B 

103 

88.889 

2878 

AF042386 

Homo sapiens cyclophilin-33B 

434 

56.213 ! 

2879 

AF101074 

Homo sapiens step II splicing factor 
SLU7 

3499 

97.952 

2880 

AP000060 

Aeropyrum pernix lOlaa long 
hypothetical protein 

132 

41.250 

2881 

AF125101 

Homo sapiens HSPC040 protein 

415 

82.143 \ 

2 882 

AF125101 

Homo sapiens HSPC04 0 protein 

158 

100.000 

2883 

AF125101 

Homo sapiens HSPC04 0 protein 

323 

81. 905 

2884 

X98126 

Homo sapiens enoyl-CoA hydratase 

241 

71.642 

2885 

X98126 

Homo sapiens enoyl-CoA hydratase 

383 

94 . 737 

2886 

X98126 

Homo sapiens enoyl-CoA hydratase 

421 

75.248 

2887 

X98126 

Homo sapiens enoyl-CoA hydratase 

1235 

85 . 441 

2888 

X98126 

Homo sapiens enoyl-CoA hydratase 

1048 

77.888 

2889 

AF098499 

Caenorhabditis elegans No definition 
line found 

181 

32 . 026 

2890 

AF098499 

Caenorhabditis elegans No definition 
line found 

242 

4 1 . Ill 

2891 

AF098499 

Caenorhabditis elegans No definition 
line found 

315 

40. 000 

2892 

AF098499 

Caenorhabditis elegans No definition 
line found 

291 

37.500 

2893 

AF098499 

Caenorhabditis elegans No definition 
line found 

221 

32.353 

2894 

M28297 

Rattus norvegicus alpha- 1- inhibitor 3 

36 

46.154 

2895 

M28297 

Rattus norvegicus alpha- 1 -inhibitor 3 

231 

43.689 

2896 

M28297 

Rattus norvegicus alpha- 1- inhibitor 3 j 

212 

39.362 j 

2897 

M28297 

Rattus norvegicus alpha-l-inhibitor 3 

350 

37.696 | 

2898 

M28297 

Rattus norvegicus alpha-l-inhibitor 3 

93 

55.556 

2899 

M28297 | 

Rattus norvegicus alpha-l-inhibitor 3 

146 

36.111 

2900 

M28297 

Rattus norvegicus alpha-l-inhibitor 3 

43 

23.214 

2901 

M28297 

Rattus norvegicus alpha-l-inhibitor 3 

247 

34.951 

2902 

AJ005560 

Mus musculus SPR2B protein 

133 

32.292 

2903 

S83341 

Homo sapiens prion protein 
{intervening sequence, repetitive 
element} 

125 

38.372 

2904 

X80035 

Oryctolagus cuniculus cysteine rich 
hair keratin associated protein 

183 

32.117 
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2905 

AB025186 

Homo sapiens EB3 protein 

1009 

84.492 

2906 

AB025186 

Homo sapiens EB3 protein 

617 

92.188 

2907 

AB025186 

Homo sapiens EB3 protein 

1275 

79.567 

2908 

AF168008 

Danio rerio PEA3 

52 

24.000 

2909 

AF168008 

Danio rerio PEA3 

334 

81.356 

2910 

AF168008 

Danio rerio PEA3 

134 

90.476 

2911 

AF168008 

Danio rerio PEA3 

1722 

60.163 

2912 

X53064 

Homo sapiens small proline -rich 
protein 

110 

39.344 

2913 

AF060862 

Homo sapiens unknown 

749 

88.235 

2914 

AF060862 

Homo sapiens unknown 

401 

50. 000 

2915 

U68729 

Meloidogyne incognita cuticle 
preprocol lagen 

186 

30.808 

2916 

AF026689 

Homo sapiens prostate-specific 
transglutaminase 

111 

52.083 

2917 

X55777 

Homo sapiens put. ORF 

277 

51.807 

2919 

AF044670 

Homo saDiens 33 kDa VaniD-aBsociated 
protein; VAP-33 

417 

67 961 

2920 

AF044670 

Homo saoienB 33 kDa Vamo-associated 
protein; VAP-3 3 

51 

c o ooo 

—J \J . U VV 

2921 

AF044670 

Homo saoiens 3 3 kDa VamD-associated 
protein; VAP-33 

48 

35 . 088 

2922 

AF044670 

Homo saoiens 3 3 kDa Vamo- associated 
protein; VAP-33 

32 

too 000 

2923 

AF044670 

Homo saoiens 33 kDa Vamo— assori at pd 
protein; VAP-33 


O Q 167 

^ _7 . ID / 

2924 

AF044670 

Homo saoiens 3 3 kDa Vamo-associated 
protein; VAP-33 

46 

4 7 05 Q 

2925 

AF044670 

Homo saoiens 33 kDa VamD-a<?flori afpd 
protein; VAP-33 

46 

O Q 6^ n 

2926 

AF044670 ! 

Homo sapiens 33 kDa Vamp-associated 
protein; VAP-33 

48 

22 . 222 

2927 

AF044670 

Homo sapiens 3 3 kDa Vamp -associated 
protein; VAP-33 

47 

3 2 . 000 

2928 

AF044670 

Homo sapiens 33 kDa Vamp -associated 
protein; VAP-33 

59 

2 0 . 833 

2929 

AF044670 

Homo sapiens 33 kDa Vamp- associated 
protein; VAP-33 

48 

43 . 750 

2 930 

AF044670 

Homo sapiens 33 kDa Vamp-associated 
protein; VAP-33 

1255 

8 8 . 235 

2931 

U47686 

Homo sapiens signal transducer and 
activator of transcription StatSB 

762 

7 0 . 899 

2932 

U47686 

Homo sapiens signal transducer and 
activator of transcription StatSB 

1006 

91 . 875 

2933 

U47686 

Homo sapiens signal transducer and 
activator of transcription StatSB 

430 

73.684 

2934 

U47686 

Homo sapiens signal transducer and 
activator of transcription StatSB 

4969 

98.861 

2936 

X57346 

Homo sapiens HS1 

481 

84.000 

2937 

X57346 

Homo sapiens HS1 

202 

100.000 

2938 

X57346 

Homo sapiens HS1 

1040 

80.784 | 

2939 

X57346 

Homo sapiens HS1 

795 

71.698 

2940 

D10026 

Rattus norvegicus glutathione S- 
transf erase Yrs-Yrs 

477 j 

60.741 


1 0**>L~ 
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2941 

D10026 

Rattus norvegicus glutathione S- 
transferase Yrs-Yrs 

521 

69.565 

2942 

D10026 

Rattus norvegicus glutathione S- 
transf erase Yrs-Yrs 

175 

80.645 

2943 

D10026 

Rattus norvegicus glutathione S- 
transferase Yrs-Yrs 

268 

53.165 

2945 

D10026 

Rattus norvegicus glutathione S- 
transferase Yrs-Yrs 

219 

86. 842 

2946 

D10026 

Rattus norvegicus glutathione S- 
transferase Yrs-Yrs 

742 

68.016 

2947 

M15530 

Homo sapiens B-cell growth factor 

156 

54.717 

2948 

X63563 

Homo sapiens RNA polymerase II 140 kDa 
subunit 

49 

31.034 

2949 

X63563 

Homo sapiens RNA polymerase II 140 kDa 
subunit 

48 

29.268 

2950 

X63563 

Homo sapiens RNA polymerase II 140 kDa 
subunit 

6745 

93.361 

2 952 

U23452 

Caenorhabditis elegans No definition 
line found 

367 

31.410 

2953 

X54977 

Bos taurus 17,000 dalton myosin light 
chain 

777 

84.106 

2954 

X54977 

Bos taurus 17,000 dalton myosin light, 
chain 

652 

75.839 

2955 

X54977 

Bos taurus 17,000 dalton myosin light 
chain 

582 

91.667 

2956 

X54977 

Bos taurus 17,000 dalton myosin light 
chain 

55 

100.000 

2957 

X54977 

Bos taurus 17,000 dalton myosin light 
chain 

444 

86.076 

2958 

X54977 

Bos taurus 17,000 dalton myosin light 
chain 

605 

86.986 

2959 

X54977 

Bos taurus 17,000 dalton myosin light 
chain 

454 

74.684 


AL050272 

Homo sapiens hypothetical protein 

48 

37.500 

A Sol 

AJL050272 

Homo sapiens hypothetical protein 

151 

100.000 

y 62 

AIj050272 

Homo sapiens hypothetical protein 

704 

94.776 

1 Q £T "» 

Y14040 

Homo sapiens CASH beta protein 

259 

100.000. 

O or /I 

^ 9 64 

Y1404 0 

Homo sapiens CASH beta protein 

669 

100. 000 

^2 y 6b 

Y14040 

Homo sapiens CASH beta protein 

1297 

94.521 

£ y 66 

Y14040 

Homo sapiens CASH beta protein 

1107 

97.778 

z y o / 

VI A f\ A f\ 

x 1404 0 

Homo sapiens CASH beta protein 

1285 ! 

99.507 

2968 

Y14040 

Homo sapiens CASH beta protein 

1424 

100.000 

2969 

Y14040 

Homo sapiens CASH beta protein 

1297 

94.521 

2970 

Y14040 

Homo sapiens CASH beta protein 

1192 

98.522 

2971 

Y14040 i 

Homo sapiens CASH beta protein 

1076 

92.825 

2972 

U79260 

Homo sapiens unknown 

166 

47.059 

2973 

X16660 

Homo sapiens p25 protein 

140 

37.500 

2 974 

Ajj do / 

I*ycopersicon esculentum extensin s 
(class II) 

39 

37.500 

2976 

X55687 

Lycopersicon esculentum extensin 
(class II) 

39 

37.500 

2977 

X55687 

Lycopersicon esculentum extensin 
(class II) 

87 

33 . 803 

2978 

D25542 

Homo sapiens human gcp372 \ 

53 

40.000 


io«33 



HQ Ni»i& HI- 3 ,„ 02H 


lb I 


I ICS 


2979 

D25542 

Homo sapiens human gcp372 

19354 

98.299 

2980 

U92819 

Homo sapiens unnamed HERV-H protein 

480 

100.000 

2988 

U92819 

Homo sapiens unnamed HERV-H protein 

263 

63.235 

2989 

U92819 

Homo sapiens unnamed HERV-H protein 

242 

81.395 

2993 

U92819 

Homo sapiens unnamed HERV-H protein 

221 

78.049 

3000 

U92819 

Homo sapiens unnamed HERV-H protein 

259 

73.214 

3012 

U92819 

Homo sapiens unnamed HERV-H protein 

247 

83.721 

3015 

AJ130978 

Homo sapiens Ariadne -2 jprotein (ARI2) 

40 

23.333 

3016 

AJ130978 

Homo sapiens Ariadne-2 protein (ARI2) 

2249 

89. 084 

3017 

AF151905 

Homo sapiens CGI-147 protein 

663 

78.610 

3018 

AF132947 

Homo Bapiens CGI -13 protein 

606 

83.47.1 

3019 

AF132947 

Homo sapiens CGI- 13 protein 

142 

91.304 

3020 

AF132947 

Homo sapiens CGI -13 protein 

140 

88. 889 

3021 

AF132947 

Homo sapiens CGI -13 protein 

1471 

75.516 

3022 

AP132947 

Homo sapiens CGI-13 protein 

453 

98.592 

3023 

AF132947 

Homo sapiens CGI-13 protein 

2508 

94.433 

3024 

D42073 

Homo sapiens reticulocalbin 

194 

54.000 

3025 

D42073 

Homo sapiens reticulocalbin 

1176 

58.446 

3026 

U46188 

Homo sapiens KRAB A domain 

28 

26.316 

3027 

U46188 

Homo sapiens KRAB A domain 

149 

72.500 

3028 

X52949 

Giardia intestinalis unidentified 
reading frame; alternative codon use 

145 

27 604. 

3029 

Z38112 t 

Unknown cDNA EST EMBL:D35592 comes 
from this gene; cDNA EST EMBL:D68073 
comes from this gene; cDNA 

308 

38.462 

3030 

238112 

Unknown cDNA EST EMBL:D35592 comes 
from this gene; cDNA EST EMBL:D68073 
comes from this gene; cDNA 

630 

3 9 . 589 

3031 

Z38112 

Unknown cDNA EST EMBL:D35 592 comes 
from this gene; cDNA EST EMBL:D68073 
comes from this gene; cDNA 

476 j 

26.571 

3032 

M68864 

Homo sapiens ORP 

1078 

85.959 

3033 

AB016592 

Mus musculus GOB- 4 

563 

60;714 

3034 

AJ001810 

Homo sapiens mRNA cleavage factor I 25 
kDa subunit 

265 

100.000 

3035 

AJ001810 

Homo sapiens mRNA cleavage factor I 25 
kDa subunit 

698 

72 .840 

3036 

Y17394 

Homo sapiens prefoldin subunit 3 J 

687 

76.777 

3037 

L11566 

Homo sapiens ribosomal protein LI 8 

755 

78.035 

3038 

L11566 

Homo sapiens ribosomal protein LI 8 

1135 

81.385 

3039 

L11566 

Homo sapiens ribosomal protein LI 8 

417 

62.694 

3040 

AF107727 

Rattus norvegicus sertolin 

92^ 

68.182 

3041 

AF107727 j 

Rattus norvegicus sertolin 

72 

30.588 

3042 

AF107727 

Rattus norvegicus sertolin j 

291 

47.482 

3043 

U36598 

synthetic construct HSOD-GlyProGly-A+ 

419 

68.391 

3044 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

484 

59.172 

3045 

AC009322 

Arabidopsis thaliana Unknown protein 

35 j 

33.333 

3047 

AC009322 

Arabidopsis thaliana Unknown protein 

47 

57.143 | 

3048 

AC009322 

Arabidopsis thaliana Unknown protein 

199 

34.211 

3050 

X80038 

Homo sapiens Polio virus receptor 
protein 

71 

50.000 

3051 

X80038 

Homo sapiens Polio virus receptor 
protein 

2251 

76.531 
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3052 

X80038 

Homo sapiens Polio vims receptor 
protein 

3457 

98.513 

3053 

X80038 

Homo sapiens Polio virus receptor 
protein 

2715 

70.856 

3 054 

D78152 

Homo sapiens annex in IV 
(carbohydrtate-binding protein p33/41) 

647 

97 .170 

3055 

D78152 

Homo sapiens annexin IV 
(carbohydrtate -binding protein p33/41) 

218 

97.297 

3056 

D78152 

Homo sapiens annexin IV 
(carbohydrtate-binding protein p33/41) 

676 

85.135 

3057 

D78152 

Homo sapiens annexin IV 
(carbohydrtate-binding protein p33/41) 

1218 

85.061 

3058 

AB028966 

Homo sapiens KIAA1043 protein 

6924 

78.369 

3059 

AB028966 

Homo sapiens KIAA1043 protein 

316 

54.808 

3061 

AB028966 

Homo sapiens KIAA104 3 protein 

587 

100.000 

3062 

AB028966 

Homo sapiens KIAA1043 protein. 

6707 

90.262 

3063 

AB028966 

Homo. sapiens KIAA1043 protein 

8008 

98.420 

3064 

A01592 

Homo sapiens haemoglobin A beta chain 

267 

54.375 

3065 

L48932 

Homo sapiens beta-globin 

323 

69.173 

3066 

U28917 

Mus musculus 60S ribosomal protein 

549 

60.930 

3067 

AF067619 

Caenorhabditis elegans contains 
similarity to CUB domains (Pfam; CUB , 
score; 101.9 and 42.78) 

390 

33.597 

3068 

Y18206 

Homo sapiens serine-threonine specific 
protein phosphatase 

60 

44 .737 

3069 

Y18206 

Homo sapiens serine-threonine specific 
protein phosphatase 

58 

42.857 

3070 

Y18206 

Homo sapiens serine -threonine specific 
protein phosphatase 

429 

37.676 

3071 

AF006636 

Homo sapiens melanoma differentiation 
associated protein-9 

1310 

85.526 

3072 

AF113944 

Muntiacus muntjak prion protein 

108 

40.244 j 

3073 

AP000006 

Pyrococcus horikoshii 249aa long 
hypothetical protein 

202 

52.727 

3074 

AP000006 

Pyrococcrus horikoshii 249aa long 
hypothetical protein 

331 

38.350 

3075 

AC005031 

Homo sapiens neuronal apoptosis 
inhibitory protein 

126 

24.324 

3076 

AC005031 

Homo sapiens neuronal apoptosis 
inhibitory protein 

77 

26.667 

3077 

AC005031 

Homo sapiens neuronal apoptosis j 
inhibitory protein 

612 

30.490 

3078 

AC005031 

Homo sapiens neuronal apoptosis 
inhibitory protein 

383 

25.664 

307 9 

M26312 

Oryctolagus cuniculus unknown protein 

174 

36.170 

3080 

X75756 

Homo sapiens protein kinase C mu 

62 

30.435 ! 

3081 

X75756 

Homo sapiens protein kinase C mu 

574 

58.084 

"3082 

X75756 

Homo sapiens protein kinase C mu 

46 

46.667 

3083 

X75756 

Homo sapiens protein kinase C mu 

496 

64.602 

3084 

X75756 

Homo sapiens protein kinase C mu 

53 

33 .333 

3085 

X75756 

Homo sapiens protein kinase C mu 

3208 

62 .891 

3086 

AF033566 

Mus musculus cdc2/CDC28-like protein 
kinase 4 1 

673 

77.419 

3087 

AF033566 

Mus musculus cdc2/CDC28-like protein 

2680 

84 .742 





kinase 4 



3088 

AL035071 

Homo sapiens dJ1085F17.2 (EB1 (APC 
binding protein) ) 

361 

1 on nnn 

X V V . \J\J u 

3089 

AL035071 

Homo sapiens dJ1085F17.2 (EB1 (APC 
binding protein) ) 

511 

92 . 683 

3090 

AL035071 

Homo sapiens dJ1085F17.2 (EB1 (APC 
binding protein) ) 

350 

98.305 

3091 

AL035071 

Homo sapiens dJ1085F17.2 (EB1 (APC 
binding protein) ) 

1366 

93 . 727 

3092 

U49973 

Homo sapiens ORF2 : function unknown 

537 . 

72.868 

3093 

X92485 

Plasmodium vivax pval 

171 

40.496 

3095 

AC005521 

Homo sapiens similar to yeast SSU72; 
similar to P53538 (PID :gl711532) 

364 

72.973 

3096 

AC005521 

Homo sapiens similar to yeast SSU72 ; 
similar to P53538 (PID:gl711532) 

477 

62.755 

3098 

AB032953 

Homo sapiens KIAA1127 protein 

558 

93.407 

3099 

AB032953 

Homo sapiens KIAA1127 protein 

843 

100.000 

3100 

AB032953 

Homo sapiens KIAA1127 protein 

783 

96.800 

3101 

AB032953 

Homo sapiens KIAA1127 protein 

10807 

97.589 

3102 

AP007548 

Homo sapiens golgi SNARE 

40 

42.857 

3103 

AF007548 

Homo sapiens golgi SNARE 

215 

100.000 

3104 

AF007548 

Homo sapiens golgi SNARE 

838 

82.075 

3105 

AF007548 

Homo sapiens golgi SNARE 

690 

67.907 

3106 

M84237 

Homo sapiens integrin beta-1 subunit 

29 

33.333 

3107 

M84237 

Homo sapiens integrin beta-1 subunit 

114 

56.098 

3108 

Z11922 

Canis familiaris Ran 

1269 

94.737 

3109 

Z11922 

Canis familiaris Ran 

1109 

81.019 

3110 

Z11922 

Canis familiaris Ran 

706 

74.123 

3111 

L34587 

Homo sapiens RNA polymerase II 
elongation factor SIII, pl5 subunit 

250 

68.571 

3112 

D31769 

Mus musculus amelogenin precursor 

63 

28.571 

3113 

D31769 

Mus musculus amelogenin precursor 

93 

30.488 

3114 

D31769 

Mus musculus amelogenin precursor 

135 

32.000 

3115 

D44497 

Homo sapiens human p57 

455 

100.000 

3116 

D44497 

Homo sapiens human p57 

2059 

86.596 

3117 

X86779 

Homo sapiens FAST kinase 

159 

28.319 

3118 

X86779 

Homo sapiens FAST kinase 

90 

28.125 

3119 

X86779 ! 

Homo sapiens FAST kinase 

177 

28.743 

3120 

X86779 

Homo sapiens FAST kinase 

.325 

28.852 

3121 

X86779 

Homo sapiens FAST kinase 

349 

29.688 

3123 

AC007018 

Arabidopsis thaliana unknown protein 

152 

28.395 

3124 

AC007018 

Arabidopsis thaliana unknown protein 

421 

3 9.234 

3125 

AC007018 

Arabidopsis thaliana unknown protein 

331 

50.000 

3126 

AC007018 

Arabidopsis thaliana unknown protein 

1209 

33.877 

3127 

AC007018 

Arabidopsis thaliana unknown protein 

122 

45.833 

3128 

AC007018 

Arabidopsis thaliana unknown protein j 

76 

24.359 

3129 

AC007018 

Arabidopsis thaliana unknown protein j 

257 

43 .038 


r*v. uu / uxo 

Arabidopsis thaliana unknown protein 

58 

57.143 ! 

3131 

AC007018 

Arabidopsis thaliana unknown protein 

55 

34.483 

3132 

AC007018 

Arabidopsis thaliana unknown protein 

1855 

40.157 | 

3133 

L25899 

Homo sapiens ribosomal protein L10 

702 

74.222 

3134 

M97796 

Homo sapiens Id-2 

553 

93 .407 

3135 

M97796 1 

Homo sapiens Id-2 

738 

98.276 

3136 

M97796 

Homo sapiens Id-2 

293 

64.626 
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3137 

U89439 

Bob tauniB ixbiquitin-like protein 

1 383 

j 85.417 

3138 

X79536 

Homo sapiens hnRNPcore protein Al 

508 

| 92 . 405 

3139 

X79536 

Homo sapiens hnRNPcore protein Al 

1725 

96.498 

. 3140 

X79536 

Homo sapiens hnRNPcore protein Al 

851 

1 56 . 727 

3141 

X79536 

Homo sapiens hnRNPcore protein Al 

874 

47.515 

3143 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

121 

1 on ah cl 

3144 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

110 

1 3 1 5**^ 

3145 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

157 

I 68.421 

3146 

- 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

116 

86.364 

3147 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

655 

47.453 

3148 

AF021255 

fowl adenovirus 8 unknown 

142 

33.929 

3149 

AF151885 

Homo sapiens CGI-127 protein 

666 

100.000 

3150 

AF151885 

Homo sapiens CGI -12 7 protein 

517 

82.540 

3151 

AF027140 

Homo sapiens zinc finger protein 

166 

50.769 

3152 

AB014601 

Homo sapiens KIAA0701 protein 

48 

36.667 

3153 

AB014601 

Homo sapiens KIAA0701 protein 

56 

22 . 414 

3154 

AB014601 

Homo sapiens KIAA0701 protein 

45 

22.222 

3155 

AB014601 

Homo sapiens KIAA0701 protein 

3242 

94.671 

3156 

U58658 

Homo sapiens unknown 

231 

60.274 

3157 

D14694 

Homo sapiens Whole ORF continues from 
bpl9 (right after 'tag 1 ) to bpl596 
Ctga 1 ).; similar to Chinese hamster 
phosphatidylserine synthase. 

1 A 1 1 

100 . 000 

3158 

D14694 

Homo sapiens Whole ORF continues from 
bpl9 (right after 'tag') to bpl596 
(•tga*).; similar to Chinese hamster 
phosphatidylserine synthase. 

2714 

92 7 97 

3160 

X55683 

Lycopersicon esculentum extensin 
(class I) 

146 

45.205 j 

3161 

AB018319 | 

Homo sapiens KIAA0776 protein j 

1786 j 

71.975 

3162 

AB018319 

Homo sapiens KIAA0776 protein | 

4434 1 

94.907 

3163 

AB021288 

Homo sapiens beta 2 -microglobulin j 

627 1 

83 . 193 

3164 

AB021288 

Homo sapiens beta 2 -microglobulin | 

667 1 

99.160 

3165 

U36764 

Homo sapiens TGF-beta receptor I 
interacting protein 1 

433 

9 q m 551 

3166 

U36764 

Homo sapiens TGF-beta receptor 
interacting protein 1 

952 

75 . 369 

3168 

U36764 

Homo sapiens TGF-beta receptor 
interacting protein 1 | 

51 

23 .256 

3169 

U36764 

Homo sapiens TGF-beta receptor 
interacting .protein 1 j 

647 

98 . 936 

3170 

U36764 

Homo sapiens TGF-beta receptor j 
interacting protein 1 | 

50 

25.455 

3171 

U36764 

Homo sapiens TGF-beta receptor 
interacting protein 1 ' j 

41 

34.043 

3172 

U36764 

Homo sapiens TGF-beta receptor 1 
interacting protein 1 1 

993 

72.754 

3173 

AF083115 

Homo sapiens paraneoplastic cancer- 1 
testis -brain antigen | 

344 | 

36.364 



3174 

L28"918 

Bos taurue cyanogen bromide 

111 

31.395 

3175 

AC004 521 

Aiavxuv^Dltf Lila± laIJa il y £>t> UIlC? L. 1 1- cl x 

protein 

0*7 

zo. 421 

3176 

AC004521 

protein 


25 . 833 

3177 

AC004521 

ArabidoDsis thaliana hvoothetical 
protein 

396 


3178 

AF004715 

Homo sapiens jerky gene product 
homo log 

45 

30 000 

3179 

AF004715 

Homo sapiens jerky gene product 
homo log 

636 

33 . 249 

3160 

X55777 

Homo sapiens put. ORF 

311 

63.291 

3181 

AJ388557 

Canis familiaris zinc finger protein 

1552 

37 . 834 

3182 

M13100 

Rattus norvegicus unknown protein 

223 

33 . 168 

3184 

M88365 

Homo sapiens DNA-binding protein 

106 

43 . 182 

3185 

X55777 

Homo sapiens put. ORF 

253 

47 . 253 

3186 

X51394 

Xenopus laevis APEG precursor protein 

179 

27 . 946 

3187 

X99962 

Homo sapiens rab-related GTP-binding 
protein 

916 

71 . 560 

3188 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

2988 

100*000 

3194 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

1493 

90 . 076 

3199 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

871 

98.519 

3200 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

1939 

72 . 747 

3201 

U80929 

Cloning vector pBACe3.6 levansucrase 

1508 

98.712 

3202 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

2641 

92.222 

3203 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

1499 

96.250 

3206 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

1244 

100 . 000 

3207 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

2974 

99.778 

3208 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

3105 

100 . 000 

3209 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

2778 

100.000 

3211 

U80929 j 

Cloning vector pBACe3 . 6 levansucrase 

2622 

98 . 010 

3213 

U80929 

Cloning vector pBACe3 . 6 levansucrase 

2110 

84.461 

3214 

U80929 

Cloning* vector pBACe3 . 6 levansucrase 

2 040 

90 . 368 

3215 

A78803 

unidentified unnamed protein product 

918 

100 000 ' 

3216 

A78803 

unidentified unnamed protein product 

543 

78.295 

3217 

X59864 

Rattus rattus ORF 


K f\ Q£ fl 

3218 

M19684 

Homo sapiens alpha-l-antitrypsin- 
related protein 

689 

94.828 

3219 

M19684 

Homo sapiens alpha- 1-ant it ryps in- 
related protein 

2676 

99.286 

3220 

U64033 

Mus musculus Tera 

1257 

90.254 

3221 

D85818 

Mus musculus RNA nolvmpra r f* • t T 
subuunit RPB14 

A ft Q 

•7 rt A*7£ 
/ U . *4 / O 

3222 

D85818 

Mus musculus RNA nnl vmpra rp t T 
subuunit RPB14 

A fl 


3223 

D85818 

Mus musculus RNA DolvmpraRP TT 

subuunit RPB14 

.j r> 

in die 

3224 

D85818 

Mus musculus RNA polymerase II 
subuunit RPB14 

40 

35.294 

3225 

D85818 

Mus musculus RNA polymerase II 
subuunit RPB14 

240 

83.673 

3226 

D85818 

Mus musculus RNA polymerase II 
subuunit RPB14 

480 

80.488 

3227 

L54057 

Homo sapiens deduced protein product 

41 

35.714 
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shows significant homology to 
coactosin from Dictyostelium 
discoideum 



322B 

L540S7 

Homo sapiens deduced protein product 
shows significant homology to 
coactosin from Dictyostelium 
discoideum 

850 

95 . 070 

3249 

V00488 

Homo sapiens alpha globin 

357 

73.810 

3250 

J04599 

Homo sapiens proteoglycan I precursor 

1977 

94.102 

3251 

AJ011863 

Homo sapiens homeobox protein LSX 

1799 

76.010 

3252 

AJ011863 

Homo sapiens homeobox protein LSX 

4728 

98.037 

3254 

AF044773 

Homo sapiens breakpoint cluster region 
protein 1 

51 

53 . 846 

3255 

AF044773 

Homo sapiens breakpoint cluster region 
protein 1 

58 

22 .619 

3256 

AF044773 

Homo sapiens breakpoint cluster region 
protein 1 

574 

100. 000 

3257 

AF044773 

Homo sapiens breakpoint cluster region 
protein 1 

53 

24.138 ! 

3258 

AF044773 

Homo sapiens breakpoint cluster region 
protein 1 

497 

71. 622 

3259 

AJ238629 

Chlorella protothecoides putative 
extensin-like protein 

101 

37.838 

3260 

AJ238629 

Chlorella protothecoides putative 
extensin-like protein 

81 

38.776 

3261 

X67703 

Drosophlla melanogaster Mst84Dd 

120 

45.283 

3262 

AF083391 

Homo sapiens putative WHSC1 protein 

916 

98.621 

3263 

AF083391 

Homo sapiens putative WHSC1 protein 

4107 

100.000 j 

3264 

AF083391 

Homo sapiens putative WHSC1 protein 

73 

19.167 | 

3265 

AF083391 

Homo sapiens putative WHSC1 protein j 

4107 

100. 000 

3266 

AF083391 

Homo sapiens putative WHSC1 protein 

156 

100.000 

3267 

AF083391 

Homo sapiens putative WHSC1 protein 

64 

23 .273 

3268 

AF083391 

Homo sapiens putative WHSC1 protein 

3205 

91.439 

3269 

AF083391 

Homo sapiens putative WHSC1 protein 

3968 

99.048 

3270 

AF063605 

Homo sapiens brain my047 protein 

152 

76.667 

3271 

AF063605 

Homo sapiens brain my047 protein 

198 

71.053 

3272 

AF063605 

Homo sapiens brain my047 protein ! 

152 

76.667 | 

3273 

AF063605 

Homo sapiens brain my047 protein 

411 

62.687 

3274 

X67247 

Homo sapiens ribosomal protein S8 

1160 

90. 865 

3275 

X67247 

Homo sapiens ribosomal protein S8 

402 

60.965 

3277 

AJ133469 

Equus cabal lus prostaglandin D2 
synthase 

301 

70 . 312 

3278 

AJ133469 

Equus caballus prostaglandin D2 
synthase 

680 

69.744 

3279 

AF084445 

synthetic construct calmodulin mutant 
SYNCAM17 

243 

71.429 

3280 

AF036145 

Homo sapiens meningioma-expressed 
antigen 5 

275 - 

97.826 

3281 

AF036145 

Homo sapiens meningioma-expressed 
antigen 5 

4537 

92.999 

3282 

AF036145 

Homo sapiens meningioma-expressed 
antigen 5 

1202 

75.297 

3284 

AF036145 

Homo sapiens meningioma-expressed 
antigen 5 

4999 

98.977 
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3285 

A78803 

vm identified unnampd nrnfA-f n -nyor1iir»t- 

y lo 

100 . 000 

3286 

A78803 

unidentified unnamed protein product 

J 2*1 

71 . 654 

3288 

AF063866 

Melanoplus sanguinipes entomoppxvirus 
ORF MSV233 hypothetical protein 

73 

26.667 

3289 

AF063866 

Melanoplus sanguinipes entomopoxvirus 
ORF MSV233 hvnothetical orotein 

129 

33.684 

3290 

U66372 

Bos taurus ribosomal protein S2 9 

^ j j 

94 • 595 

3291 

U66372 

Bos ta\irus ribosomal protein S2£ 

412. 

98.214 

3292 

U66372 

Bos taurus ribosomal protein S29 


96 . 154 

3293 

K01899 

Homo sapiens hbbm fused globin protein 
( beta chain secpuence to base 268) 

675 

98.020 

3294 

K01899 

Homo sapiens hbbm fused globin protein 
( beta chain sequence to base 268) 

512 

92.784 

3295 

AF117710 

Homo sapiens hemoglobin beta chain 

461 

75.573 

3296 

AF062529 

Homo saoiens d i nhnfjnhm' nnnl t*nl 

polyphosphate phosphohydrolase 

O O Q 

100 . 000 

3297 

AF062529 

Homo a aT"\ -J f»ri n H ■! T^hrtonVioi nna 4 
DOlvDhost)hafce nhoflnhoVivHi-ol aa*» 

741 

90. 419 

3298 

U34343 

Homo sapiens 13 kD differential- ion- 
associated protein 


95 . 918 

3299 

U34343 

Homo sapiens 13kD differentiation- 
associated protein 

641 

98.947 

3300 

U34343 

Homo sapiens 13kD differentiation- 
associated protein 

n "\ 

85 . 217 

3301 

M19434 

Macaca mulatta gamma-globin 

309 

61.029 

3302 

M12429 

Pongo pygmaeus alpha -2 globin 

907 

97.887 

3303 

M12429 

Pongo pygmaeus alpha- 2 globin 

329 

70.492 

3304 

M26650 

ntjimj sdpiens lbieu amyioiQ. polypeptide 
(hIAPP) 

530 

97. 753 

3305 

K02215 

Homo sapiens preangiotensinogen 

2352 

91.718 

3307 

D26068 

Homo sapiens KIAA003 8 

1255 

96.983 I 

3308 

A "RAO QOQ-5 
«-T \) z> i* O J7 3 

Homo sapiens i-beta-1, 3 -N- 
acetylglucosaminyl transferase; poly-N- 
acetyllactosamine extension enzyme i- 
antigen; iGnT 

627 

100.000 

3309 

/VC J O J J 

xiomo sapiens l-Deta-l, 3-N- 
acetylglucosaminyl transferase; poly-N- 
dLetyiiacLOBamine extension enzyme i - 
antigen; iGnT 

2366 

99.429 

3310 

AF029893 

nomo sapiens l-JDcta-i, J*JN* 

ac&tvl Ct 1 1 1 pnfl ami n\rT t~ rana f a%*s e» a • vr 
c*v-^ '■j y iaw woct in j. ii jr X HdlloIcraBc ; pOXV W — 

acetvllactosamine extension pn7vmp *f « 
antigen; iGnT 

2294 

94.498 

3311 

Z68179 

Homo sapiens retinoic acid induced 
gene E 

529 

94 .048 

3312 

Z68179 

Homo sapiens retinoic acid induced 
gene E 

374 

76.978 

3313 

X75315 

Homo sapiens SEB4B 

144 

100.000 

3314 

X75315 

Homo sapiens SEB4B 

1441 

94 .783 

3315 

AC006042 

Homo sapiens supported by human ESTs 
AI681256. l(NID:g4891438) , N32168 . 1 (NID : 
gll52567) , and genscan 

57 

27 .273 

3317 

AC006042 

Homo sapiens supported by human ESTs 
AI681256.1 (NID :g4 891438) , N32168 . 1 (NID : 
gll52567) , and genscan 

504 

44 .167 
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3318 

AC006042 

Homo sapiens supported by human ESTs 
AI681256. 1 (NID:g4 891438) ,N32168 . 1 (NID: 
gll52567) , and genscan 

348 

49.242 

3319 

AC005035 

Homo sapiens nik-like;' similar to 
U88984 (PID:gl872546) 

663 

ox* / 0 u 

3320 

AC005035 

Homo sapiens nik-like; similar to 
U88984 (PID:gl872546) 

265 


3321 

AC005035 

Homo sapiens nik-like; similar to 
U88984 (PID:gl872546) 

772 

91 . 912 

3322 

AC005035 

Homo sapiens nik-like; similar to 
U88984 (PID:gl872546) 

318 

98 . 039 

3323 

AC005035 

Homo sapiens nik-like; similar to 
U88984 (PID:gl872546) 

7547 

84.632 

3324 

M64716 

Homo sapiens ribosomal protein 

53 

47.619 

3325 

M64716 

Homo sapiens ribosomal protein 

799 

100. 000 

3326 

M64716 

Homo sapiens ribosomal protein 

194 

93.750 

3327 

M64716 

Homo sapiens ribosomal protein 

40 

35.556 

3328 

M64716 

Homo sapiens ribosomal protein 

33 

34.375 

3329 

M64716 

Homo sapiens ribosomal protein 

43 

20.588 

3330 

M64716 

Homo sapiens ribosomal protein 

215 

100.000 

3331 

M64716 

Homo sapiens ribosomal protein 

42 

24.561 

3332 

M64716 

Homo sapiens ribosomal protein " 

306 

68.966 

3333 

X60730 

Rattus norvegicus beta T globin chain 

153 

43.537 

3334 

M74493 

Homo sapiens ADP-ribosylation factor 3 

57 

31.579 

3335 

M74493 

Homo sapiens ADP-ribosylation factor 3 

33 

41.176 

3336 

M74493 

Homo sapiens ADP-ribosylation factor 3 

341 

100.000 

3337 

M74493 

Homo sapiens ADP-ribosylation factor 3 

297 

100.000 

3338 

M74493 

Homo sapiens ADP-ribosylation factor 3 

53 

22.222 

3339 

M74493 

Homo sapiens ADP-ribosylation factor 3 

43 

26. 190 

3340 

M74493 

Homo sapiens ADP-ribosylation factor 3 

45 

25.000 

3341 

M74493 

Homo sapiens ADP-ribosylation factor 3 

56 

22.857 

3342 

M74493 

Homo sapiens ADP-ribosylation factor 3 

47 

33.333 

3343 

M74493 

Homo sapiens ADP-ribosylation factor 3 

56 

22.857 j 

3344 

M74493 

Homo sapiens ADP-ribosylation factor 3 

686 

81.622 

3345 

X61046 

Hydra sp. mini -collagen 

150 

39.706 

3346 

D45131 

Homo sapiens basigin 

116 

52.174 

3347 

D45131 

Homo sapiens basigin 

1491 

81 . 126 

3348 

D45131 

Homo sapiens basigin 

1373 

95.556 

3349 

X15734 

Rattus norvegicus s -adenosylmethionine 
synthetase 

49 

0 ft «;7i 

3350 

X15734 

Rattus norvegicus s -adenosylmethionine 
synthetase 

44 

3 1 . 250 

3351 

X15734 

Rattus norvegicus s -adenosylmethionine 
synthetase 

52 

35.556 

3352 

X15734 

Rattus norvegicus s -adenosylmethionine 
synthetase 

2209 

83 . 879 

3353 

AF132941 

Homo sapiens CGI -07 protein 

197 

93,939 

3354 

AF132941 

Homo sapiens CGI -07 protein 

2917 

92.157 

3355 

U53154 

Caenorhabditis elegans No definition 
line found 

40 

71.429 

3356 

U53154 

Caenorhabditis elegans No definition 
line found 

301 

28 .326 

3357 

AC003083 

Homo sapiens mitochondrial carrier 
protein-like; similar to Q09461 

1862 

97 .727 
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(PID:g2497990) 



3358 

AE000757 

Aquifex aeolicus hypothetical protein 

320 

42 . 953 

3359 

U64028 

Homo sapiens NADPH : ubiquinone 
oxidoreductase subunit B13 

426 

65 .487 

3360 

U64028 

Homo sapiens NADPH : ubiquinone 
oxidoreductase subunit B13 

401 

96.522 

3361 

L76159 

Homo sapiens FRG1 gene product 

433 

56.742 

3362 

L76159 

Homo sapiens FRG1 gene product 

391 

73.469 

3363 

L76159 

Homo sapiens FRG1 gene product 

361 

74.725 

3364 

L76159 

Homo sapiens FRG1 gene product 

538 

73.228 

3365 

L76159 

Homo sapiens FRG1 gene product 

961 

71.255 

3366 

L76159 

Homo sapiens FRG1 gene product 

1358 

94.961 

3367 

M34059 

Homo sapiens beta-globin 

418 

82.308 

3368 

AL117608 

Homo sapiens hypothetical protein 

800 

100.000 

3369 

AIill7608 

Homo sapiens hypothetical protein 

214 

97.368 

3370 

M74019 

Homo sapiens immunoglobulin light 
chain variable region 

636 

81.102 

3371 

D38305 

Homo sapiens Tob 

349 

96.296 

3372 

D38305 

Homo sapiens Tob 

1773 

95.652 

3373 

M69175 

Homo sapiens H-protein 

278 

100.000 

3374 

M69175 

Homo sapiens H-protein 

735 

88.571 

3375 

D23662 

Homo sapiens ubiqui tin -like protein 

200 

86.792 

3376 

Y00278 

Homo sapiens CFAg (AA 1-94) 

596 

100.000 

3377 

Y00278 

Homo sapiens CFAg (AA 1-94) 

171 

100.000 

3378 

Y00278 

Homo sapiens CFAg (AA 1-94) 

463 

65.278 

3379 

Y00278 

Homo sapiens CFAg (AA 1-94) 

350 

96. 154 

33B0 

L48220 

Homo sapiens beta-globin 

173 

79.245 

3381 

M15386 

Homo sapiens gamma -globin 

897 

93.421 

3382 

M15386 

Homo sapiens gamma -globin 

317 

69.375 

3383 

AE000036 

Mycoplasma pneumonia H08_orfl57a 
Protein 

106 

- 

38.095 

3384 

AP054181 

Homo sapiens CI-MNLL homolog gene 

323 

97.917 

3385 

AF054181 

Homo sapiens CI-MNLIj homolog gene 

336 

93.548 

3386 

AF169481 

Homo sapiens cytochrome b5 reductase 1 

345 

100.000 

3387 

AF169481 

Homo sapiens cytochrome b5 reductase 1 

344 

60.000 

3388 

AF169481 

Homo sapiens cytochrome b5 reductase 1 

720 

81.481 

3389 

AF169481 

Homo sapiens cytochrome b5 reductase 1 

1725 

93.069 

3390 

M64247 

Homo sapiens cardiac troponin I 

218 

72.059 

3391 

M64247 

Homo sapiens cardiac troponin I 

522 

71.318 

3392 

M64247 

Homo sapiens cardiac troponin I 

642 

87.603 

3393 

M64247 

Homo sapiens cardiac troponin I 

599 

100.000 

3394 

M64247 

Homo sapiens cardiac troponin I 

1017 

91.346 

3395 

J03910 

Homo sapiens metallothionein-IG 

169 

77 . 778 

3397 

J03910 

Homo sapiens metallothionein-IG 

176 

89. 655 

3398 

U93162 

Homo sapiens C4- sterol methyl oxidase 
homolog 

674 

83.088 

3399 

AC007633 

Unknown similarity to several 
hypothetical proteins- Arabidopsis 
thaliana 

523 

45.217 

3400 

AJ223980 

Homo sapiens BCL7C 

305 

70.588 

3401 

AJ223980 

Homo sapiens BCL7C 

983 

85.202 

3402 

AJ225089 

Homo sapiens 2' -5* oligoadenylate 
synthetase (p590AS) 

191 

1O0.000 

3404 

AJ225089 

Homo sapiens 2' -5' oligoadenylate 

62 

25.000 
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synthetase (p590AS) 



3406 

AJ225089 

Homo sapiens 2" -5* oligoadenylate 
synthetase (p590AS) 

2969 

98 . 586 

3407 

AJ006068 

Homo sapiens dTDP-D-glucose 4,6- 
dehydratase 

469 

76.190 

3408 

AJ006068 

Homo sapiens dTDP-D -glucose 4,6- 
dehydratase 

2181 

97.167 

3409 

X66285 

Musmusculus HC1 ORF 

152 

41.176 

3410 

M29039 

Homo sapiens transactivator 

1667 

84.507 

3411 

AB018257 

Homo sapiens KIAA0714 protein 

7252 

99.820 

3412 

AF004161 

Oryctolagus cuniculus peroxisomal Ca- 
dependent solute carrier 

370 

77.333 

3413 

AF004161 

Oryctolagus cuniculus peroxisomal Ca- 
dependent solute carrier 

2674 

87.421 

3414 

Y14551 

Homo sapiens DIF-2 protein 

478 

97.297 

3415 

Y14551 

Homo sapiens DIF-2 protein 

792 

96. 815 

3416 

Z14961 

Homo sapiens codes for truncated alpha 
Ig chain of patient BEN 

119 

41.818 

3417 

Z14961 

Homo sapiens codes for truncated alpha 
Ig chain of patient BEN 

119 

41. 818 

3418 

Z14961 

Homo sapiens codes for truncated alpha 
Ig chain of patient BEN 

57 

35.484 

3419 

AL050044 

Homo sapiens hypothetical protein 

572 

100.000 

3420 

AL050044 

Homo sapiens hypothetical protein 

95 

100.000 

3421 

AL050044 

Homo sapiens hypothetical protein 

466 

89.516 

3422 

M93311 

Homo sapiens metallothionein-III 

147 

59.091 

3423 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

166 

64.516 

3424 

AB014561 

Homo sapiens KIAA0661 protein 

3543 

59.462 

3425 

AL050051 

Homo sapiens hypothetical protein 

48 

33.333 

3426 

AL050051 

Homo sapiens hypothetical protein 

557 

98.780 

3427 

AL050051 

Homo sapiens hypothetical protein 

43 

33 .333 

3428 

AL050051 

Homo sapiens hypothetical protein 

715 

94 . 068 

3429 

M15530 

Homo sapiens B-cell growth factor 

172 

75.610 

3430 

D87973 

Mus musculus Impact 

1611 

76.582 

3433 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

230 

62 . 687 

3434 

AL032653 

Caenorhabditis elegans similar to 
Ubiquitin- conjugating enzymes; cDNA 
EST EMBL:C08678 comes from this gene 

499 

56.463 

3435 

AC004255 

Arabidopsis thaliana T1F9.23 

221 

39.683 

3436 

AC004255 1 

Arabidopsis thaliana T1F9.23 

76 

39.286 

3437 

AC004255 

Arabidopsis thaliana T1F9.23 

235 

33.621 

3438 

J01566 

Plasmid ColEl 13.8 kd ORF 

85 

38.462 

3439 

J01566 

Plasmid ColEl 13.8 kd ORF 

212 

36.975 j 

3440 

J01566 

Plasmid ColEl 13.8 kd ORF 

64 

56.250 

3441 

J01566 

Plasmid ColEl 13.8 kd ORF 

58 

32.727 

3442 

J01566 

Plasmid ColEl 13.8 kd ORF 

66 

40.541 

3443 

J01566 

Plasmid ColEl 13.8 kd ORF 

184 

35.965 

3444 

J01566 

Plasmid ColEl 13.8 kd ORF 

54 

28.261 

3445 

J01566 

Plasmid ColEl 13.8 kd ORF 

181 

37.273 

3446 

J01566 j 

Plasmid ColEl 13.8 kd ORF 

171 

35.294 

3447 

J01566 j 

Plasmid ColEl 13.8 kd ORF 

204 

36.975 

3448 

J01566 

Plasmid ColEl 13.8 kd ORF 

129 

37.500 
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3449 

J01566 

Plasmid ColEl 13.8 kd ORP 

165 

36 . 937 

3450 

S76965 

Homo sapiens protein kinase inhibitor, 
PKI 

463 

100 . 000 

3451 

S76965 

Homo sapiens protein kinase inhibitor, 
PKI 

32 

31.579 

3452 

S76965 

Homo sapiens protein kinase inhibitor, 
PKI 

403 

100.000 

3453 

AJ005565 

Mus musculus SPR2G protein 

164 

41.270 

3454 

M15664 

Oryza sativa hi stone 3 

457 

80.000 

3455 

AB030234 

Canis familiaris D4 dopamine receptor 

128 

40.000 

3456 

U33837 

Homo sapiens gp3 3 0 precursor 

692 

98.824 

3457 

U33837 

Homo sapiens gp330 precursor 

1458 

100.000 

3459 

U33837 

Homo sapiens gp330 precursor 

1644 

55.198 

3461 

U33837 

Homo sapiens gp330 precursor 

1598 

32.513 

3462 

U33837 

Homo sapiens gp330 precursor 

24096 

90.617 

3463 

U33837 

Homo sapiens gp330 precursor 

33391 

99.399 

3464 

U67085 

Homo sapiens TcD37 homo log 

184 

45.349 

3465 

U67085 

Homo sapiens TcD37 homolog 

817 

72.864 

3466 

U67085 

Homo sapiens TcD37 homolog 

587 

90.566 

3467 

U67085 

Homo sapiens TcD37 homolog 

478 

94. 118 

3468 

U67085 . 

Homo sapiens TcD37 homolog 

2575 

99.768 

3469 

J04513 

Homo sapiens basic fibroblast growth 
factor (ctg start codon) ; putative 

39 

3 8.889 

3470 

J04513 

Homo sapiens basic fibroblast growth 
factor (ctg start codon) ; putative 

1370 

98.558 

3471 

AF176514 

Mus musculus B2F1- inducible protein 

1337 

90. 955 

3472 

AP176514 

Mus musculus E2Fl-inducible protein 

391 

94 . 828 

3473 

AF002697 

Homo sapiens E1B 19K/Bcl -2 -binding 
protein Nip3 

317 

98. 039 

3474 

AF002697 

Homo sapiens E1B 19K/Bcl -2 -binding 
protein Nip3 

891 

83.920 

3475 

AB033045 

Homo sapiens KIAA1219 protein 

2110 

99.685 

3476 

AB033045 

Homo sapiens KIAA1219 protein 

3449 

99.437 

3477 

AL110238 

Homo sapiens hypothetical protein 

53 

32.558 

3478 

AL110238 

Homo sapiens hypothetical protein 

936 

99.275 

3479 

U21049 

Homo sapiens DD96 

397 

68.421 

3480 

U21049 

Homo sapiens DD96 

595 

94.828 

3481 

AL031427 

Homo sapiens dJ167A19.1 (novel 
protein) 

610 

77 . 612 

3482 

AL031427 

Homo sapiens dJ167A19.1 (novel 
protein) 

767 

99.123 

3483 

AL031427 

Homo sapiens dJ167A19.1 (novel 
protein) 

51 

30.233 

3484 

AL031427 

Homo sapiens dJ167A19.1 (novel 
protein) 

597 

96.117 

3485 

AL031427 

Homo sapiens dJ167A19.1 (novel 
protein) 

1997 

99.673 

3486 

AF042081 

Homo sapiens SH3 domain binding 
glutamic acid-rich-like protein 

450 

81.356 

^3487 

U79260 

Homo sapiens unknown 

51 

25.000 

3488 

U79260 

Homo sapiens unknown 

175 

51.000 

3489 

Y13141 

Bromheadia f inlaysoniana extensin j 

94. 

34.615 ! 

3490 

Y13141 

Bromheadia f inlaysoniana extensin 

73 

27.273 

3491 

AF081258 

Homo sapiens testis-specif ic 

39 

3 8.095 ! 
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chromodomain Y-like protein 



3492 

AF0812S8 

Homo sapiens testis- specif ic 
chromodomain Y-like protein 

1473 

99.556 

3493 

AF081258 

Homo sapiens testis-specif ic 
chromodomain Y-like protein 

3478 

93 .613 

3494 

AC004770 

Homo Bapiens BC269730 1 

888 

80 .571 

3495 

AC004770 

Homo sapiens BC269730 1 

50 

32 .258 

3496 

AC004770 

Homo sapiens BC269730 1 

608 

98.095 

3497 

D13315 

Homo sapiens lactoyl glutathione lyase 

942 

95.028 

3498 

AF009356 

Homo sapiens regulator of G-protein 
signaling-16 

488 

83.516 

3499 

AF009356 

Homo sapiens regulator of G-protein 
signaling-16 

1027 

94.608 

3500 

J03561 

Homo sapiens gro protein 

219 

55.319 

3501 

J03561 

Homo sapiens gro protein 

444 

77.670 

3502 

J03561 

Homo sapiens gro protein 

500 

66.187 

3503 

J03561 

Homo sapiens gro protein 

596 

91.667 

3504 

AF174592 

Homo sapiens F-box protein Fbl6 

897 

95.000 

3505 

AF174592 

Homo sapiens F-box protein Fbl6 

457 

74 .257 

3506 

AF174592 

Homo sapiens F-box protein Fbl6 

1888 

97.578 

3507 

D14696 

Homo sapiens KIAA0108 

1025 

88.511 

3506 

U51205 

Homo sapiens HCOP9 

587 

78 .358 

3509 

U51205 

Homo sapiens HCOP9 

541 

100.000 

3510 

U51205 

Homo sapiens HCOP9 

48 

50 . 000 

3511 

U51205 

Homo sapiens HCOP9 

787 

84 .862 

3512 

AL031907 

Schizosaccharomyces pombe hypothetical 
protein 

138 

25 . 373 

3513 

AL031907 

Schizosaccharomyces pombe hypothetical 
protein 

48 

2 9.630 

3514 

AL031907 

Schizosaccharomyces pombe hypothetical 
protein 

96 

26 . 761 

3515 

AL031907 

Schizosaccharomyces pombe hypothetical 
protein 

68 

36.667 

3516 

AL031907 

Schizosaccharomyces pombe hypothetical 
protein 

235 

33 .333 

3517 

D89077 

Homo sapiens Src-like adapter protein 

45 

.62 .500 

3518 

D89077 

Homo sapiens Src-like adapter protein 

603 

90.196 

3519 

D89077 

Homo sapiens Src-like adapter protein 

1592 

97 . 834 

3520 

L11566 

Homo sapiens ribosomal protein L18 

474 

68 .229 

3521 

AF011564 

Oryctolagus cuniculus splicing factor 
SRp55 homolog 

181 

50 . 633 

3522 

X76105 

Homo sapiens DAP-1 

46 

27 . 907 

3523 

X76105 

Homo sapiens DAP-1 

704 

100.000 

3524 

X60673 

Homo sapiens nucleoside -triphosphate- - 
adenylate kinase 

1076 

94 . 643 

3525 

U64856 

Caenorhabditis elegans weak similarity 
to TPR domains 

314 

47.664 

3526 

U64856 

Caenorhabditis elegans weak similarity 
to TPR domains 

547 

42 .640 

3527 

U64856 

Caenorhabditis elegans weak similarity 
to TPR domains 

701 

34 .550 

3529 

AL050197 

Homo sapiens hypothetical protein 

228 

97 .500 

3530 

AL050197 

Homo sapiens hypothetical protein 

1034 

85 . 830 

3531 

AL023657 

Homo sapiens SH2D1A 

315 

96 .154 


3532 

AL023657 

Homo sapiens SH2D1A 

301 

60.784 

3533 

AL023657 

Homo sapiens SH2D1A 

301 

94 .000 

3534 

AL023657 

Homo sapiens SH2D1A 

665 

85.938 

3535 

AL023657 

Homo sapiens SH2D1A 

660 

92 .248 

3536 

S82297 

Homo sapiens beta 2 -microglobulin 

227 

82 .692 

3537 

M85183 

Rattus norvegicus vasopressin receptor 

48 

38.889 

3538 

M85183 

Rattus norvegicus vasopressin receptor 

48 

38 .462 

3539 

M85183 

Rattus norvegicus vasopressin receptor 

1721 

59.091 j 

3540 

AL049667 

Homo sapiens hypothetical protein 

424 

78 .313 

354X 

AL049667 

Homo sapiens hypothetical protein 

713 

84.615 

3542 

AL049667 

Homo sapiens hypothetical protein 

1194 

73 . 868 

3543 

AL049667 

Homo sapiens hypothetical protein 

3455 

99.058 

3544 

M12 886 

Homo saoiens T-cell recentor beta 
chain 

232 

i nn nnn 

3545 

M12886 

Homo saniens T-cell receotoi* beta 
chain 

250 

do -a 

3546 

M12886 

Homo sapiens T-cell receptor beta 
chain 

1225 

fZ R A04 

O O * ** W *i 

3547 

M12886 

Homo sapiens T-cell receptor beta 
chain 

1457 

7 O Q74 

3548 

J05071 

Bos taunia GTP-bindinei rpcnil ahorv 

protein gamma -6 subunit 

Ol A 
aIU 

i nn nnn 

3549 

J05071 

nnq taurua C3TP-bi ndi ncr rponil ahnrv 

protein gamma- 6 subunit 

O 0 1 

inn nnn 

3550 

JO 5071 

Pnq haiiniR f?TP — h i tiH t ncf rpnul ahnrv 

protein gamma- 6 subunit 

n«? 

J \J o 


3552 

AF092878 

Homo sapiens zinc RING finger protein 
SAG 

654 


3553 

AF092878 

Homo sapiens zinc RING finger protein 
SAG 

689 

91.228 

3554 

X95073 

Homo sapiens Translin associated 
protein X 

1850 

100.000 

3555 

AB007157 

Homo sapiens ribosomal protein S21 

276 

100 . 000 

3556 

AB007157 

Homo sapiens ribosomal protein S21 

313 

94.444 

3557 

AB007157 

Homo sapiens ribosomal protein S21 

269 

100. 000 

3558 

AC006585 

Arabidopsis thaliana putative 
extragenic suppressor protein 

196 

33.333 

3559 

AC006585 

Arabidopsis thaliana putative 
extragenic suppressor protein 

1189 

45.810 

3560 

J01175 

Lytechinus pictus histone H3 

289 

75.207 

3561 

■T\±J\J i J. 1 U U 

UlUUlUWll 

/prediction^ (method : " "genscan" " , 
vers ion : M n 1 . 0 " n , score : " " 5 3 . 3 6 " " ) ; 
/prediction^ (method: n 

z y y 

bo. oZ*k 

3 562 


KJ 1 LrLLlKJ WI1 

/prediction^ (method: n "genscan" " , 
version: Mn 1.0 nM , score : " "53 . 36" ") ; 
/prediction^ (method: " 

«-> o tr 

60 . 294 

3563 

AL021106 

Unknown 

/prediction^ (method: " "genscan nn , 
version : " " 1 . 0 w " , score : " " 53 . 36 " " ) ; 
/prediction- (method: " 

618 

54.124 

3564 

AB001022 

Schizosaccharomyces pombe unnamed 
protein product 

53 

71.429 


ItttMU 


3565 

AB001022 

Schizosaccharomyces pombe unnamed 
protein product 

147 

48.780 

3566 

AL117419 

Homo sapiens hypothetical protein 

2356 

91.484 

3567 

AL117419 

Homo sapiens hypothetical protein 

2490 

97.922 

3568 

AF151820 

Homo sapiens CGI -62 protein 

183 

100. 000 

3569 

AF151820 

Homo sapiens CGI -62 protein 

1844 

94 .545 

3570 

AF047384 

Rattus norvegicus postsynaptic protein 
CRIPT 

500 

98.020 

3571 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

55 

27.083 

3572 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

61 

26.923 

3574 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

54 

28.947 

3575 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

68 

26.437 

3576 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

52 

25.882 

3578 

U7 9745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

55 

33.333 

•JCTQ 
J J / J 

U79745 

Homo s aniens monocarboxvlate 
transporter homologue MCT6 

59 

36.000 

J> J O w 

U7 9745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

50 

40. 000 

•icoo 
-J 0 0 


Homo saiaiena monocarboxvlate 
transporter homologue MCT6 

50 

83 . 333 


U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

54 

28.000 

3584 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

47 

37 . 500 

3585 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

3279 

99.426 

3586 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

2734 

90.574 

3587 

U79745 

Homo sapiens monocarboxylate 
transporter homologue MCT6 

69 

30.526 

3588 

X79238 

Homo sapiens ribosomal protein L30 

739 

100.000 

3589 

X79238 

Homo sapiens- ribosomal protein L3 0 

455 

81.481 

3590 

X79238 

Homo sapiens ribosomal protein L30 

214 

65.289 

3591 

AF064462 

Gallus gallus transcription factor 
LEF-1 

260 

90.244 

3592 

AF064462 

Gallus gallus transcription factor 
LEF-1 

952 . 

56.902 

3593 

AF064462 

Gallus gallus transcription factor 
LEF-1 

2120 

84.962 

3594 

S58722 . 

Homo sapiens X- linked retinopathy 
protein. {c-terminal, clone XEH.8c} 

188 

67.925 

3595 

AJ010973 

Homo sapiens DEDD protein 

72 

39.286 

3596 

AJ010973 

Homo sapiens DEDD protein 

535 

56.291 

3597 

AJ002308 

Homo sapiens synaptogyrin 2 

662 

72.189 

3598 

AJ002308 

Homo sapiens synaptogyrin 2 

732 

96.396 

3599 

AJ002308 

Homo sapiens synaptogyrin 2 

1361 

98.206 

3600 

AF129534 

Homo sapiens F-box protein Fbx4 

192 

46.903 

3601 

AF129534 

Homo sapiens F-box protein Fbx4 

2289 

91.731 




3602 

Y12711 

Homo saDiena tiutative Droaesterone 
binding protein 

212 

48 . 148 

3603 

Y12711 

Homo saDiena Dutative oroaesterone 
binding protein 

421 

92.500 

3604 

Y12711 

Homo sapiens putative progesterone 
binding protein 

939 

92 . 929 

3605 

X56932 

Homo sapiens 23 kD highly basic 
protein 

545 

80 .660 

3606 

Y13736 

Homo sapiens Protein Enriched in 
Diabetes 

385 

100. 000 

3607 

Y13736 

Homo sapiens Protein Enriched in 
Diabetes 

464 

100 . 000 

«k W V ♦ w V V 

3608 

Y13736 

Homo sapiens Protein Enriched in 
Diabetes 

37 

50 . 000 

3609 

Y13736. 

Homo sapiens Protein Enriched in 
Diabetes 

35 

100.000 

3610 

Y13736 

Homo sapiens Protein Enriched in. 
Diabetes 

765 

98.462 

361X 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

117 

38.824 

3612 

U39318 

Homo sapiens UbcHSC 

101 

93.333 

3613 

U39318 

Homo sapiens UbcHSC 

43 

42.105 

3614 

U39313 

Homo sapiens UbcHSC 

301 

100.000 

3615 

U39318 

Homo sapiens UbcHSC 

666 

92.617 

3616 

AL110265 

Homo sapiens hypothetical protein 

808 

95.775 

3617 

U63131 

Homo sapiens CDC3 7 homolog 

46 

32.000 

3618 

U63131 

Homo sapiens CDC37 homolog 

256 

37.705 

3619 

U63131 

Homo sapiens CDC 3 7 homolog 

730 

40.938 

3620 

Z82285 

Caenorhabdit is elegans predicted using 
Genef inder 

130 

2 1 . 930 

3621 

Z82285 

Caenorhabditis elegans predicted using 
Genef inder 

134 

27 . 119 

3622 

Z82285 

Caenorhabditis elegans predicted using 
Genef inder 

208 

27 . 861 

3623 

AF100754 

Homo sapiens ancient ubiquitous - 
protein AUP1 isoform 

773 

100 . 000 

3624 

AF100754 

Homo sapiens ancient ubiquitous 
protein AUP1 isoform 

674 

59 . 292 

3625 

AF100754 

Homo sapiens ancient ubiquitous 
protein AUP1 isoform 

667 

85 . 075 

3626 

AF100754 

Homo sapiens ancient ubiquitous 
protein AUP1 isoform 

496 

100 . 000 

3627 

AF100754 

Homo sapiens ancient ubiquitous 
protein AUP1 isoform 

57 

27 . 273 

3628 

AF100754 

Homo sapiens ancient ubiquitous 
protein AUP1 isoform 

2492 

81 . 784 

3629 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

124 

83 .333 

3630 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

407 

78.571 

3633 

X52022 

Homo sapiens collagen type VI, alpha 3 
chain 

20067 

99.057 

3 634 

AF156098 

Homo sapiens RNA binding motif protein 
7 

131 

55.000 

3635 

AF156098 

Homo sapiens RNA binding motif protein 

129 

65.625 


)C»HB 
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3636 

AF156098 

Homo sapiens RNA binding motif protein 
7 

363 

40,099 

3637 

214 014 

"W-f rnh i ana haKa mim "D ■{ a h ■! 1 <avt*An a «? r\ 
iiiwutiaua Lauauuui rioUXX cALciio 1X1 

like protein, partial CDS only 

11Q 


3638 

X64229 

Homo sapiens outative oncooene 

2231 

-7 / » O U V 

3640 

X07311 

Drosophila tnelanogaster heat shock 
protein 

161 

1 ft no ft 

3641 

AC002301 

Homo sapiens Homo log of rat Zymogen 
granule membrane protein 

943 

89.349 

3642 

AL080097 

Homo sapiens hypothetical protein 

212 

100 . 000 

3643 

AL0B0097 

Homo sapiens hypothetical protein 

1303 

88 .519 

3644 

AP151887 

Homo sapiens CGI -12 9 protein 

148 

100.000 

3645 

AF151887 

Homo sapiens CGI -12 9 protein 

281 

75.000 

3646 

AJ237946 

Homo sapiens DEAD Box Protein 5 

302 

100.000 

3647 

AJ237946 

Homo sapiens DEAD Box Protein 5 

637 

86 . 154 

3648 

AJ237946 

Homo sapiens DEAD Box Protein 5 

126 

87.500 ! 

3649 

AJ237946 

Homo sapiens DEAD Box Protein 5 

290 

/ ZJ * \J \J \J 

3650 

AJ237946 

Homo sapiens DEAD Box Protein 5 

452 

94.444 

3651 

AiJ237 94 6 
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3652 

AJ237946 

Homo sapiens DEAD Box Protein 5 

2439 

93.555 

3653 

U37283 

Homo sapiens microf ibril-associated 
giycoprotein-z PiAvjr-/ 

808 

88.652 

3654 

U37283 

Homo sapiens microf ibril-associated 
glycoprotein- 2 MAGP-2 

285 

95.000 



Homo sapiens microf ibril-associated 
glycoprotein -2 MAGP-2 

591 

81 . 868 

i em.c 


Homo sapiens Genscan gene prediction; 

(NTD -al4875QO) 
\ i. » J- t-f . y J. *± o i zj ^ \j / 

310 

100 . 000 

3657 

AC00252 8 

TTorno Rani ptiq Hpnnpa n o^n#a> n>*oH t > <~»y^ * 

90% similarity to AA023673 
(NID:gl487590) 

/ O M 

Q 1 ICC 

o 1 . Job 

3658 

AF126743 

Homo sapiens DNAJ domain -containing 
protein MCJ ! 

509 

88 . 742 

3659 

U62940 

Rattus norvegicus mt-GrpE#l precursor 

495 

65 .714 

3661 

U96876 

Homo sapiens insulin induced protein 1 

473 

100.000 

3662 

U96876 

Homo sapiens insulin induced protein 1 

535 

95.294 

3663 

U96876 

Homo sapiens insulin induced protein 1 

1520 

87.814 

3664 

AL050170 

Homo sapiens hypothetical protein. 

367 

54 .887 

3665 

AL050170 

Homo sapiens hypothetical protein 

342 

100.000 

3666 

AL050170 

Homo sapiens hypothetical protein 

681 

93 .284 

3667 

AB018292 

Homo sapiens KIAA074 9 protein 

45 ! 

34 .483 

3668 

AB018292 

Homo sapiens KIAA074 9 protein 

4692 

99.705 

3669 

Z97i84 

Homo sapiens HKE2 

435 

75 .229 

3670 

Z97184 

Homo sapiens HKE2 

423 

67 . 442 

3671 

Z97184 

Homo sapiens HKE2 

389 

46.875 

3672 

Z97184 

Homo sapiens HKE2 

714 

100.000 ; 

3673 

D85758 

Homo sapiens human protein homologous 
to DROER protein 

35 

71 .429 

3674 

D85758 

Homo sapiens human protein homologous 
to DROER protein 

358 

87.255 | 

3675 

AF151906 

Homo sapiens CGI -14 8 protein 

1067 

92 .090 

3676 

AF151906 

Homo sapiens CGI -14 8 protein 

40 

24.194 j 


3677 

AF151906 

Homo sapiens CGI -14 8 protein 

868 

90.055. 

3678 

226876 

Homo sapienB ribosomal protein 

279 

100.000 

3679 

Z26876 

Homo sapiens ribosomal protein 

178 

79.688 

3680 

X15001 

Bacteriophage 186 computer protein 79 

35 

54.545 

3681 

X15001 

Bacteriophage 186 computer protein 79 

86 

45.098 

3682 

X15954 

Homo sapiens mannose binding protein 

1514 

98.387 

3683 

AE000775 

Aquifex aeolicus hypothetical protein 

260 

48.101 

3684 

AE000775 

Aquifex aeolicus hypothetical protein 

221 

32 .571 

3685 

AE000775 

Aquifex aeolicus hypothetical protein 

74 

25.490 

3686 

AE000775 

Aquifex aeolicus hypothetical protein 

1543 

41.390 

3687 

AJ005569 

Mus musculus SPR2K protein 

29 

100.000 

3688 

AJ005569 

Mus musculus SPR2K protein 

79 

29.668 

3691 

AJ005569 

Mus musculus SPR2K protein 

32 

33.333 

3692 

AJ005569 

Mus musculus SPR2K protein 

87 

42 .500 

3693 

U29380 

Caenorhabditis elegans similar to 
adenylate cyclase 

50 

45.833 

3694 

U29380 

Caenorhabditis elegans similar to 
adenylate cyclase 

450 

41.509 

3695 

AB024370 

TT virus ORF1 

128 

54 . 545 

3696 

M21302 

Homo sapiens small proline rich 
protein 

94 

41.304 

3697 

X02530 

Homo sapiens early response precursor 
polypeptide (aa-21 to 77) 

591 

98.980 

3698 

Z35227 

Homo sapiens small G protein 

918 

93.229 

3700 

AF004814 

Mesocricetus auratus ubiquitin 
conjugating enzyme 

989 

Q Q O 7 "1 

3701 

AF004814 

Mesocricetus auratus ubiquitin 
conjugating enzyme 

500 

71.341 

3703 

AF143676 

Homo sapiens mult ispanning nuclear 
envelope membrane protein nurim 

1683 

QQ 617 

3704 

AF036718 

Homo sapiens FGFR signalling adaptor 
SNT-2 

3403 

99 . 187 

3705 

L29028 

Unknown amino acid feature: N- 
glycosylation sites, aa 41 . . 43, 46 
48, 51 . . 53, 72 . . 74, 107 . 

142 

31.008 

3707 

AL05027B 

Homo sapiens hypothetical protein 

346 

57.983 

3708 

AL050278 

Homo sapiens hypothetical protein 

39 

62 .500 

3709 

AL050278 

Homo sapiens hypothetical protein 

571 

88.430 ! 

3710 

AB014543 

Homo sapiens KIAA0643 protein 

164 . 

95.833 

3711 

AB014543 

Homo sapiens KIAA0643 protein 

390 

100.000 

3712 

AB014543 

Homo sapiens KIAA0643 protein 

68 

36.364 

3713 

AB014543 

Homo sapiens KIAA0643 protein 

2237 

98.511 

3714 

AC004262 

Homo sapiens R29368 2 

1204 

65.484 1 

3715 

AC004262 

Homo sapiens R2 93 68_2 

1698 

8O.409 

3716 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

325 

59.690 | 

3717 

AF076974 

Homo sapiens TRRAP protein 

12447 

92.864 

3718 

AF076974 

Homo sapiens TRRAP protein 

590 

88.288 

3719 

AF076974 

Homo sapiens TRRAP protein 

10666 1 

88.345 j 

3720 

AF076974 

Homo sapiens TRRAP protein 

24935 

99.348 

3721 

AF076974 j 

Homo sapiens TRRAP protein 

25166 

99.197 

3722 

AB000911 

Sus scrofa ribosomal protein 

242 

61.151 

3723 

X52354 

Homo sapiens KOX 23 protein (56 AA) 

141 

39.655 | 

3724 

M14300 

Homo sapiens 2A9 peptide 

342 

100.000 | 
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3726 

M14300 

Homo sapiens 2A9 peptide 

184 

61.957 

3727 

D86984 

Homo sapiens similar to yeast 
adenylate cyclase (S56776) 

1516 

53.092 

3728 

M36341 

Homo sapiens ADP-ribosylation factor 4 

830 

90.608 

3729 

AF097994 

Homo sapiens L-kynurenine/alpha- 
aminoadipate aminotransferase 

2525 

96.028 

3730 

AJ249731 

Homo sapiens putative G8 . 1 protein 

620 

96 . 970 

3732 

AJ005567 

Mus musculus SPR2I protein 

102 

41.463 

3733 

AF075587 

Homo sapiens protein associated with 
Myc 

1029 

96.622 

3734 

AF075587 

Homo sapiens protein associated with 
Myc 

633 

81.203 

3735 

AF075587 

Homo sapiens protein associated with 
Myc 

373 

98.214 

3736 

AF075587 

Homo sapiens protein associated with 
Myc 

3777 

92.718 

3737 

AF075587 

Homo sapiens protein associated with 
Myc 

30049 

98.370 

3738 

AF106518 

Homo sapiens sialomucin CD164 

61 

29.508 

3739 

AF106518 

Homo sapiens sialomucin CD164 

820 

85.393 

3740 

AF132950 

Homo sapiens CGI- 16 protein 

277 

£7.089 

3741 

AF132950 

Homo sapiens CGI-16 protein 

1883 

87.470 

3742 

Z50194 

Homo sapiens PQ-rich protein 

2797 

99.750 

3743 

AF148509 

Homo sapiens alpha 1 , 2 -mannosidase 

579 

90.291 

3744 

AF148509 

Homo sapiens alpha 1 , 2 -mannosidase 

216 

45.263 

3745 

AF148509 

Homo sapiens alpha 1, 2 -mannosidase 

289 

60.976 

3746 

AF148509 

Homo sapiens alpha 1, 2 -mannosidase 

1562 

99.142 

3747 

AF148509 

Homo sapiens alpha 1 , 2 -mannosidase 

4566 

98.426 

3748 

AF203978 

Homo sapiens MAX-like bHLHZIP protein 

172 

96.154 

3749 

AF203978 

Homo sapiens MAX-like bHLHZIP protein 

1379 

81.720 1 

3750 

AF203978 

Homo sapiens MAX-like bHLHZIP protein i 

188 

96.296 

3751 

AF203978 

Homo sapiens MAX-like bHLHZIP protein 

743 

64.865 

3752 

AF203978 

Homo sapiens MAX-like bHLHZIP protein 

1376 

93.117 

3754 

AB033117 

Homo sapiens KIAA1291 protein 

259 

100.000 

3755 

AB033117 

Homo sapiens KIAA1291 protein 

264 

75.000 

3756 

AB033117 

Homo sapiens KIAA1291 protein 

580 

64.118 

3757 

AB033117 

Homo sapiens KIAA12 91 protein 

412 

95.455 

3758 

AB033117 

Homo sapiens KIAA1291 protein 

1719 

73.902 [ 

3759 

AB033117 

Homo sapiens KIAA1291 protein 

6696 

96.917 | 

3760 

L48218 

Homo sapiens beta-globin 

111 

77.273 

3762 

D14572 

Mus musculus , PEBP2bl protein 1 - 

1121 

98.780 

3763 

D14572 

Mus musculus ' PEBP2bl protein* 

1133 

96.471 

3764 

D14572 

Mus musculus l PBBP2bl protein 1 

1139 

88.325 

3765 

AJ001417 

Homo sapiens extraneuronal monoamine 
transporter 

192 

90.909 

3766 

AJ001417 

Homo sapiens extraneuronal monoamine 
transporter 

3670 

98.921 

o / o / 


Homo sapiens unknown 

43 

31. Ill 

3768 

AF148856 

Homo sapiens unknown 

947 

89.080 

3769 

U39317 

Homo sapiens UbcHSB 

101 

93.333 

3770 

U39317 

Homo sapiens UbcHSB 

763 

94.631 

3771 

U58658 

Homo sapiens unknown 

35 

50.000 

3772 

U58658 

Homo sapiens unknown 

84 

66.667 ! 

3773 

U58658 

Homo sapiens unknown 

93 

33.333 
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3774 

U58658 

Homo sapiens unknown 

j 237 

1 73.438 

3775 

X55777 

Homo sapiens put. ORP 

332 

62.500 

3776 

AL049705 

Homo sapiens hypothetical protein 

36 

18.919 

3777 

AL049705 

Homo sapiens hypothetical protein 

64 

31.034 

3778 

AL049705 

Homo sapiens hypothetical protein 

702 

100.000 

3779 

AL035494 

Homo sapiens dJ635G19.2.2 (novel 
protein (PUTATIVE PARTIAL isoform 2)) 

308 

60. 638 

3780 

AL035494 

Homo sapiens dJ635G19 . 2 . 2 (novel 
protein ( PUTATIVE PARTIAL isoform 2) j 

143 

55.128 

3781 

A0T002309 

Homo sapiens synaptogyrin 3 

1498 

j 98.690 

37 82 

X59727 

Homo sapiens 63kDa protein kinase 

339 

38.189 

3783 

X59727 

Homo sapiens 63kDa protein kinase 

1070 

94.767 

3784 

X59727 

Homo sapiens 63kDa protein kinase 

3483 

97.853 

3785 

Y15054 

Rattus norvegicus 70 kD tumor- specific 
antigen 

645 

76.033 

3786 

Y15054 

Rattus norvegicus 70 kD tumor- specif ic 
antigen 

1278 

68.106 

3787 

Y15054 

Rattus norvegicus 70 kD tumor- specif ic 
antigen 

2410 

83.790 

3788 

AB020715 

Homo sapiens KIAA0908 protein 

47 

29.545 

3789 

AB020715 

Homo sapiens KIAA0908 protein 

2743 

97.234 

3791 

Z36909 

Acinetobacter calcoaceticus 
phenol hydroxylase component 

182 

27.826 

3792 

M2 6 1 6 7 

Homo sapiens platelet factor 4 

350 

62.832 

3793 

M26167 

Homo sapiens platelet factor 4 

288 

66.667 

3794 

M57567 

Homo sapiens ADP-ribosylation factor 

28 

27.778 

3795 

M57567 

Homo sapiens ADP-ribosylation factor 

812 

92.308 

3796 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

175 

69.231 

3797 

J04040 

Homo sapiens preproglucagon 

901 

92.265 

3799 

Z38011 

Mus musculus DMR-N9 ~| 

371 

91.935 

3800 

Z38011 

Mus musculus DMR-N9 

589 

85.484 

3801 

Z38011 

Mus musculus DMR-N9 

4017 

89.153 

3802 

Z11898 

Homo sapiens octamer binding protein 
3A 

2082 1 

88.116 

3803 

Z11898 

Homo sapiens octamer binding protein j 
3A j 

2077 I 

95.890 

3804 

AL050157 

Homo sapiens hypothetical protein j 

379 j 

98.718 

3 805 

AL050157 

Homo sapiens hypothetical protein | 

454 j 

70.199 

3 806 

U62039 

Elephantulus edwardii reverse 1 
transcriptase 1 

111 j 

48.936 

1 O A O 

3 8 0.7 

M2 01B9 

synthetic construct tet protein | 

847 

93.662 

o on q 

MO A 1 o ri 

M2 018 9 

synthetic construct tet protein i 

817 

100. 000 

i Q1 o 

018 9 

synthetic construct tet protein ! 

1795 

99.281 



synthetic construct tet protein 

1072 

100.000 

O. Q 1 O 

M2 0189 

synthetic construct tet protein 

518 j 

100.000 

1 ft 11 

M201B9 

synthetic construct tet protein 

927 i 

89.474 

3815 

M20189 

synthetic construct tet protein ! 

O / -5 | 

85 . 185 

3816 

M20189 

synthetic construct tet protein | 

2507 

100. 000 

3817 

M20189 

synthetic construct tet protein 

1155 1 

94.845 

3818 

M20189 

synthetic construct tet protein 

1150 | 

99.438 

3819 

M20189 

synthetic construct tet protein | 

2558 

99.495 

3820 

M20189 

synthetic construct tet protein | 

927 | 

89.474 

3821 

M20189 

synthetic construct tet protein 

1293 1 

1O0.000 
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3822 

M20189 

synthetic construct tet protein 

1315 

100.000 

3823 

M20189 

synthetic construct tet protein 

1150 

91.220 

3824 

M20189 

synthetic construct tet protein 

1767 

93.980 

3827 

M20189 

synthetic construct tet protein 

1147 

100,000 

3833 

M20189 

synthetic construct tet protein 

1767 

93.980 

3834 

M20189 

synthetic construct tet protein 

1150 

91.220 

3835 

M20189 

synthetic construct tet protein 

540 

100.000 

3836 

M20189 

synthetic construct tet protein 

930 

88.439 

3837 

M20189 

synthetic construct tet protein 

1335 

88.655 

3839 

M20189 

synthetic construct tet protein 

1672 

93.310 

3840 

AF164680 

Mus musculus precerebellin-1 

907 

76.166 

3841 

AF078832 

Homo sapiens methyl-CpG binding 
protein splice variant 3 

831 

92.143 

3843 

AF078832 

Homo sapiens methyl-CpG binding 
protein splice variant 3 

2149 

69.940 

3844 

AF078832 

Homo sapiens methyl -CpG binding 
protein splice variant 3 

3177 . 

83.128 

3845 

AF078832 

Homo sapienB methyl -CpG binding 
protein splice variant 3 

3931 

100.000 

3846 

AF078832 

Homo sapiens methyl -CpG binding 
protein splice variant 3 

3676 

90.429 

3847 

AF078832 

Homo sapiens methyl -CpG binding 
protein splice variant 3 

3863 

95.470 | 

3848 

AF078832 

Homo sapiens methyl -CpG binding 
protein splice variant 3 

3602 

86.246 

3849 

AF078832 

Homo sapiens methyl -CpG binding 
protein splice variant 3 

3271 

88.225 

3850 

AL050405 

Homo sapiens hypothetical protein 

701 

77.564 

3851 

AL050405 

Homo sapiens hypothetical protein 

161 

78.125 

3852 

AL050405 

Homo sapiens hypothetical protein 

390 

100. 000 

3853 

AL050405 

Homo sapienB hypothetical protein 

1951 

81.026 

3854 

AL050405 

Homo sapiens hypothetical protein 

1904 1 

97.826 

3855 

AF119835 

Homo sapiens stem cell factor 
precursor 

81 

100. 000 

3856 

AF119835 

Homo sapiens stem cell factor 
precursor 

340 

69.136 

3857 

AF119B35 

Homo sapiens stem cell factor 
precursor 

1157 

98.895 

385 6 

AL050159 

Homo sapiens hypothetical protein 

925 

57.269 

3859 

U97553 

murine herpesvirus 68 unknown 

107 

32.394 

3860 

U97553 

murine herpesvirus 68 unknown 

123 

35.714 

3861 

X74504 

Mus musculus T10 

191 

80.000 

3862 

X74504 

Mus musculus T10 

741 

61.905 

3 863 

X74504 

Mus musculus T10 

44 | 

41.176 

3 864 

X74504 

Mus musculus T10 

378 

85.135 

3865 

X74504 

Mus musculus T10 

1199 

66.000 

3866 

AB006746 

Homo sapiens hMmTRAlb 

50 

36.000 

J oo / 

ADUUO /4D 

xiomo sapiens oriiniKAiD 

969 

47 . 756 

3868 

AC008075 

Arabidopsis thaliana F24J5.4 

126 

29.457 

3869 

U69161 

Homo sapiens CC3 

456 

90.196 

3870 

U69161 

Homo sapiens CC3 

1283 

96.667 ! 

3871 

L31840 

Rattus norvegicus nuclear pore complex 
protein NUP107 

61 

23 .881 

3872 

L31840 

Rattus norvegicus nuclear pore complex 

55 

21.379 
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protein NUP107 



3873 

L31840 

Rattus norvegicus nuclear pore complex 
protein NUP107 

57 

24.675 

3874 

L31840 

Rattus norvegicus nuclear pore complex 
protein NUP107 

1415 

50.732 

3875 

L31B40 

Rattus norvegicus nuclear pore complex 
protein NUP107 

57 

22 .000 

3876 

AF060862 

Homo sapiens unknown 

240 

100.000 

3877 

AF060662 

Homo sapiens unknown 

650 

97.479 

3878 

U08018 

Bos taurus cartilage leucine-rich 
protein 

769. 

38.378 

3879 

U08018 

Bos taurus cartilage leucine-rich 
protein 

76 

28.440 

3880 

AJ010103 

Homo sapiens IRClb 

99 

36.232 

3881 

AF094583 

Homo sapiens putative HIV-1 infection 
related protein 

589 

98.851 

3882 

AF094583 

Homo sapiens putative HIV-1 infection 
related protein 

737 

100.000 

3884 

U97006 

Caenorhabditis elegans No definition 
line found 

2203 

41.540 

3885 

AL133045 

Homo sapiens hypothetical protein 

1147 

91.045 

3887 

X67703 

Drosophila melanogaster Mst84Dd 

73 

38.000 

3888 

AF151886 

Homo sapiens CGI -12 8 protein 

352 

91.525 

3889 

AF151886 

Homo sapiens CGI -12 8 protein 

854 

76.923 

3890 

AF151B86 

Homo sapiens CGI -128 protein 

444 

61.438 

3891 

AF151886 

Homo sapiens CGI -12 8 protein 

471 

70.349 

3892 

AL033514 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:D71127 comes 
from thiB gene; cDNA EST EMBL:D73731 
comes from this gene; cDNA EST 
yk527c3.3 comes from this gene; cDNA 
EST yk645b5.3 comes from this gene 

34 

33.333 

3893 

AL033514 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:D71127 comes 
from this gene; cDNA EST EMBL:D73731 
comes from this gene; cDNA EST j 
yk527c3.3 comes from this gene; cDNA 
EST yk645b5.3 comes from this gene 

635 

47.037 

3894 

U01878 

Saccharpmyces cerevisiae unknown 

93 

46.875 

3895 

U01878 

Saccharomyces cerevisiae unknown 

190 

40.650 

3896 

AL080058 

Homo sapiens hypothetical protein 

2140 

9X.008 

3897 

AL050008 

Homo sapiens hypothetical protein 

127 

64.286 

3898 

AL050008 

Homo sapiens hypothetical protein 

163 

42 .353 

3899 

AL050008 

Homo sapiens hypothetical protein 

125 

66.667 

3900 

AL050008 

Homo sapiens hypothetical protein 

709 

57.436 

3901 

Z66521 

Caenorhabditis elegans similar to 
mitochondrial RNA splicing MSR4 like 
protein; cDNA EST EMBL:C09217 comes 
from this gene 

202 

36.735 

3902 

Z66521 

Caenorhabditis elegans similar, to 
mitochondrial RNA splicing MSR4 like 
protein; cDNA EST EMBL:C09217 comes 
from this gene 

313 

65.217 

3903 

Z66521 

Caenorhabditis elegans similar to 
mitochondrial RNA splicing MSR4 like 

4 94 

50 .515 
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protein; cDNA . EST EMBI»:C09217 comes 
from this gene 

- 

- 

3904 

AF187305 

Myxine glutinosa calmodulin 

699 

92.617 

3905 

AJ010103 

Homo sapiens IRClb 

42 

30.882 

3906 

AJ010103 

Homo sapiens IRClb 

61 

30.233 

3907 

AJ010103 

Homo sapiens IRClb 

61 

63.636 

3908 

AJ010103 

Homo sapiens IRClb 

53 

26.154 

3909 

AJ010103 

Homo sapiens IRClb 

102 

39. 189 

3910 

AB032999 

Homo sapiens KIAA1173 protein 

433 

100.000 

3911 

AB032999 

Homo sapiens KIAA1173 protein 

52 

34.783 

3912 

AB032999 

Homo sapiens KIAA1173 protein 

458 

97.333 

3913 

AB032999 

Homo sapiens KIAA1173 protein 

66 

23.404 

3914 

AB032999 

Homo sapiens KIAA1173 protein 

142 

100.000 

3915 

AB032999 

Homo sapiens KIAA1173 protein 

53 

37.500 

3916 

AB032999 

Homo sapiens KIAA1173 protein 

53 

30.769 

3919 

AB032999 

Homo sapiens KIAA1173 protein 

54 

28.571 

3919 

AB032999 

Homo sapiens KIAA1173 protein 

1893 

92.604 

3921 

D83146 

Mus musculus Six5 

3740 

85.714 

3922 

U38544 

Mus musculus alpha -1 type 1 collagen 

47 

41.667 

3923 

U38544 

Mus musculus alpha -1 type 1 collagen 

113 

34.343 

3924 

AF037440 

Edwardsiella ictaluri putative 26 kDa 
protein 

66 

24.000 

3925 

AF037440 

Edwardsiella ictaluri putative 2 6 kDa 
protein 

131 

23.301 

3926 

AF176520 

Mus musculus WD repeat -containing F- 
box protein FBW5 

2857 

85.442 

3927 

AF098066 

Homo sapiens squamous cell carcinoma 
antigen recognized by T cell 

2235 

51.811 

3928 

AL031287 ! 

Homo sapiens match: ESTs : Em:N54 3 88 
Em:R14260 Em:H08089 Em:Z24943 
Em:AA130053 

575 

96.739 

3929 

AL031287 

Homo sapiens match: ESTs: Em:N54388 
Em:R14260 Em:H08089 Em:Z24943 
Em:AA130053 

431 

63.793 

3930 

AL031287 

Homo sapiens match: ESTs: Em:N5438 8 
Em:R14260 Em:H08089 Em:Z24943 
Em:AA130053 

2758 

94.467 

3931 

D28877 

Homo sapiens hnRNP Bl protein 

2065 

91.892 

3932 

D28877 

Homo sapiens hnRNP Bl protein 

488 

100.000 

3933 

D28877 

Homo sapiens hnRNP Bl protein 

160 

85.714 

3934 

D28877 

Homo sapiens hnRNP Bl protein 

59 

24.000 

3935 

D28877 

Homo sapiens hnRNP Bl protein 

2314 

96.893 

3936 

D28877 

Homo sapiens hnRNP Bl protein 

2168 

95.882 

3938 

AF152497 

Homo sapiens protocadherin beta 4 

1212 

75.397 

3939 

AF152497 

Homo sapiens protocadherin beta 4 

859 

74.093 

3940 

AF152497 

Homo sapiens protocadherin beta 4 

1291 

96.209 

3941 

AF152497 

Homo sapiens protocadherin beta 4 

4747 I 

97.442 

3942 

D90870 

Escherichia coli similar to 

47 

24 .638 

3943 

D90870 

Escherichia coli similar to 

65 

31.746 

3944 

D90870 

Escherichia coli similar to 

1618 

99.618 

3945 

D90870 

Escherichia coli similar to 

1712 

98.889 

3946 

AF133913 

Mus musculus ARL-6 interacting 
protein-6 

35 

35.000 

3947 

AF133913 

Mus musculus ARL-6 interacting 

78 

92 .308 
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protein-6 



3948 

AF133913 

Mus musculus ARL-6 interacting 
protein-6 

78 

92.308 

3949 

AF133913 

Mus musculus ARL-6 interacting 
protein-6 

455 

66.087 

3950 

AB017800 

Homo sapiens nolp 

53 

45.000 

3951 

AB017800 

Homo sapiens nolp 

291 

67.213 

3952 

AB017800 

Homo sapiens nolp 

137 

54.286 . 

3953 

AB017800 

Homo sapiens nolp 

233 

44 .037 

3954 

AB017800 

Homo sapiens nolp 

173 

58.333 

. 3955 

AB017800 

Homo sapiens nolp 

1115 

53.810 i 

3956 

AF059336 

Plasmodium vivax circumsporozoite 
protein 

132 

27.317 

3957 

U89439 

Bos taurus ubiquitin-like protein 

398 

79.381 

3958 

AL133078 

Homo sapiens hypothetical protein 

39 

24.242 

3959 

AL133078 

Homo sapiens hypothetical protein 

63 

30.508 

3960 

AL133078 

Homo sapiens hypothetical protein 

942 

67.308 

3961 

Y18206 

Homo sapiens serine -threonine specific 
protein phosphatase 

352 

36.420 

3962 

AF078164 

Homo sapiens Ku70 -binding protein 

436 

100.000 | 

3963 

AF078164 

Homo sapiens Ku70-binding protein 

1772 

97.213 

3965 

U79260 

Homo sapiens unknown 

313 

60.000 

3967 

AF039687 

Homo sapiens antigen NY-CO- 1 

756 

99. 174 

3968 

AF039687 

Homo sapiens antigen NY-CO- 1 

2067 . 

96.739 

3969 

AF110532 

Homo sapiens uncoupling protein UCP-4 

1379 

100.000 

3970 

AF110532 

Homo sapiens uncouplinq protein UCP-4 

201 

54.795 

3971 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

1582 

59.449 

3972 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

1371 

74 . 828 

3973 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

2387 

87. 198 

3974 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

1906 

61.525 

3975 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

2402 

87.440 

3976 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

2296 

78.158 ; 

3977 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

1856 

53.695 

3978 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

1625 

67.391 

3979 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

1436 

61.667 

3980 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

2286 

85.132 

3981 

X17098 

Homo sapiens precursor (AA -34 to - 
ion) 

1955 

78.788 

3982 

X17098 

Homo sapiens precursor (AA -34 to - 
390) 

2188 

88.221 1 

3984 

AC004798 

Homo sapiens R31546 1 

4508 

86.418 

3985 

AC004798 

Homo sapiens R31546 1 

4647 

99.590 

3986 

AF110520 | 

Mus musculus NG28 

60 J 

30.645 

3987 

AF110520 

Mus musculus NG28 

3317 

72.453 
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3988 

AB015343 

Homo sapiens HRIHFB2122 

41 

50.000 

3989 

AB015343 

Homo sapiens HRIHFB2122 

1455 

100.000 

3990 

X00824 

Gallus gallus collagen 

96 

45.312 

3993 

AJ251595 

Homo sapiens transmembrane 
glycoprotein 

1039 N 

99.367 

3994 

AJ251595 

Homo sapiens transmembrane 
glycoprotein 

536 

100.000 | 

3995 

AJ251595 

Homo sapiens transmembrane 
glycoprotein 

156 

90.909 

3996 

AJ251595 

Homo sapiens transmembrane 
glycoprotein 

5029 

100. 000 

3997 

AJ251595 

Homo sapiens transmembrane 
glycoprotein 

.2743 

66.173 

3998 

AJ251595 

Homo sapiens transmembrane 
glycoprotein 

1404 

99.550 

3999 

X04601 

Mus mus cuius short ORF (AA 1-62) 

65 

41.667 

4000 

L48220 

Homo sapiens beta-globin 

162 

68.519 

4001 

X85545 

Homo sapiens protein kinase 

198 

86.667 

4002 

XB5545 

Homo sapiens protein kinase 

436 

96.721 

4003 

X85545 

Homo sapiens protein kinase 

1261 

97.382 

4004 

X85545 

Homo sapiens protein kinase 

716 

67.027 

4005 

X85545 

Homo sapiens protein kinase 

2098 

87.430 

4006 

X85545 

Homo sapiens protein kinase 

797 

64 .706 

4007 

X85545 

Homo sapiens protein kinase 

852 

68 .500 

4008 

X85545 

Homo sapiens protein kinase 

2315 

99.721 

4010 

AF1B7987 

Homo sapiens zinc finger protein 
ZNF221 ~ . 

2845 

76.588 

4011 

AF187987 

Homo sapiens zinc finger protein 
ZNF221 

4111 

96.812 

4012 

AF116909 

Homo s api ens unknown 

39 

26 .667 

4013 

AF116909 

Homo sapiens unknown j 

838 

93 .985 

4014 

D49474 

Mus musculus HMG-box transcription 
factor 

2364 

82 .185 

4015 

AF013969 

Mus musculus antigen containing 
epitope to monoclonal antibody MMS- 
85/12 

55 

22 .581 

4016 

AF013969 

Mus musculus antigen containing 
epitope to monoclonal antibody MMS- 
85/12 

54 

27 .907 

4017 

AF013969 

Mus musculus antigen containing 
epitope to monoclonal antibody MMS- 
85/12 

911 

71.569 

4018 

AF013969 

Mus musculus antigen containing 
epitope to monoclonal antibody MMS- 
85/12 

4894 

,57.293 

4019 

X88799 

Oryza sativa DNA binding protein 

138 | 

38 .667 

4020 

Z98602 

Schizosaccharomyces pombe conserved 
uncharacterized protein domain- 
containing protein 

306 

33 .146 

4021 

Z98602 

Schizosaccharomyces pombe conserved 
uncharacterized protein domain- 
containing protein 

687 

43 .443 

4022 

AC008015 

Homo sapiens unknown 

251 

59.459 

4023 

AC008015 

Homo sapiens unknown 

337 

100.000 
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4024 

AC008015 

Homo sapiens unknown 

226 

94 .286 

4025 

AC008015 

Homo sapiens unknown 

296 

53.488 

4026 

AC008015 

Homo sapiens unknown 

703 

92 . 857 

4UZ / 

til /I Q ^ O 

Homo sapiens ribosomal protein L28 

514 

68 . 613 

VI A *5 O 

Ttt /I Q C Q 

Homo sapiens ribosomal protein L2 8 

160 

92 . 000 

A n 9 Q 


Homo sapiens ribosomal protein L2 8 

257 

100 . 000 

Ann 
*± U J X 

TT"I il QCO 

Homo sapiens ribosomal protein L28 

457 

98 . 571 



Homo sapiens ribosomal protein L2 8 

431 

97 . 059 

A ft 1 


Homo sapiens ribosomal protein L2 8 

514 

93 . 458 


TTl /I Q C. O 

Homo sapiens ribosomal protein L»2 8 

243 

57.364 1 

4 UJt> 

T TCI C Q 

Homo sapiens FRG1 gene product 

36 

25 . 806 

4036 

L76159 

Homo sapiens FRG1 gene product 

337 

64.286 

4 037 

L76159 

Homo sapiens FRG1 gene product 

1200 

87.259 

4038 

L76159 

Homo sapiens FRG1 gene product 

873 

70.485 

4039 

L76159 

Homo sapiens FRG1 gene product 

1046 

78.298 

4040 

L76159 

Homo sapiens FRG1 gene product 

543 

77.344 

4041 

L27065 

Homo sapiens NF2 

201 

76.923 

4042 

AF092128 

Homo sapiens putative transmembrane 
protein E3-16 

1175 

85.348 

4043 

AJ133104 

Rattus norvegicus vesicle associated 
membrane protein 2B 

601 

96.396 

A A A A 

At 0 / /2 04 

Homo sapiens HSPC018 

906 

92 . 353 

4045 

I"T"T O *T ^ f\ 

U79260 

Homo sapiens unknown 

117 

81. 818 

4046 

U79260 

Homo sapiens unknown 

321 

72.368 

4047 

U31382 

Homo sapiens G protein gamma -4 subunit 

438 

97.333 

4048 

M11749 

Homo sapiens Thy-1 

592 

96.875 

4049 

M11749 

Homo sapiens Thy-1 

965 

98.052 

4050 

M11749 

Homo sapiens Thy-1 

44 

26.923 

4051 

M11749 

Homo sapiens Thy-1 

805 

95.062 

4052 

AF159055 

Homo sapiens leucine zipper- like 
protein 

88 

59.375 

4053 

S79410 

Mus sp. nuclear localization signals 
(NLS) -binding protein=spot- 1 

169 

56.000 

4054 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

146 

48.077 

4056 

AJ005559 

Mus musculus SPR2A protein 

95 

36.735 

4057 

X62844 

Pygmy chimpanzee papillomavirus type 1 
E5 

99 

38.806 

4058 

U20897 

Homo sapiens melanoma ubiquitous 
mutated protein 

460 

96.053 

a r\ c a 

4059 

U2 0897 

Homo sapiens melanoma ubiquitous 
mutated protein 

2900 

99.542 

A Ct C f\ 

4U60 

TTl A o on 

U20897 

Homo sapiens melanoma ubiquitous 
mutated protein 

2667 

87.607 

4061 

X99404 

Homo sapiens Berg36 

40 

71.429 

4062 

X99404 

Homo sapiens Berg3 6 

2161 

99.375 . 

4063 

Y13141 

Bromheadia f inlaysoniana extensin 

71 

88.889 

4064 

Y13141 

Dj.uiiuicaui.a <l -Liix ay oUilXalla CALCimi.Il 

T T A 

114 

3 3 • 871 

4065 

X52164 

Mus musculus Q3 00 protein (AA 1-77) 

122 | 

50.000 

4066 

X74142 

Homo sapiens transcription factor 

3120 

97.500 

4067 

AF155105 

Homo sapiens putative zinc finger 
protein NY-REN-34 antigen 

48 

32.500 

4068 

AF155105 

Homo sapiens putative zinc finger 
protein NY-REN-34 antigen 

86 

29.508 
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4069 

AF155105 

Homo sapiens putative zinc finger 

197 

30,579 



protein NY-REN-34 antigen 



4070 

AL079310 

Homo sapiens hypothetical protein 

804 

72.775 

.4071 

AL079310 

Homo sapiens hypothetical protein 

1588 

95.667 

4072 

AL079310 

Homo sapiens hypothetical protein 

3096 

98.381 

4073 

L05188 

Homo sapiens small proline-rich 

121 

36.538 



protein 2 



4074 

D87470 

Homo sapiens KIAA0280 

802 

92.593 

4075 

D87470 

Homo sapiens KIAA02 80 ' 

646 

96.117 

4077 

D87470 

Homo sapiens KIAA02 80 

49 

29.630 

4078 

D87470 

Homo sapiens KIAA02 80 

211 

60.714 

4079 

D87470 

Homo sapiens KIAA02 80 

1969 

100.000 

4080 

AB018566 

Homo sapiens Proline synthetase 

590 

96.296 



associated 



4081 

AB018566 

Homo sapiens Proline synthetase 

1279 

90.311 



associated 



4082 

U69127 

Homo sapiens FUSE binding protein 3 

883 

83.648 

4083 

U69127 

Homo sapiens FUSE binding protein 3 

2438 

79.093 

4084 

U69127 

Homo sapiens FUSE binding protein 3 

237 

100.000 

4085 

U69127 

Homo sapiens FUSE binding protein 3 

3830 

97.655 . 

4086 

AF151883 

Homo sapiens CGI - 12 5 protein 

481 

72.500 

4087 

AF151883 

Homo sapiens CGI- 125 protein 

642 

100.000 

4088 

AF151883 

Homo sapiens CGI -12 5 protein 

560 

82.222 

4089 

AF042713 

Rattus norvegicus neurexophilin 3 

1602 

94.841 

4090 

X67507 

Medicago sativa ENOD12B 

57 

32.558 

4091 

X67507 

Medicago sativa ENOD12B 

128 

36.765 

4092 

AL022238 

Homo sapiens dJ1042K10.3 (novel 

58 

29.730 



protein) 



4093 

AL022238 

Homo sapiens dJ1042K10.3 (novel 

48 

35.714 



protein) 



4094 

AL022238 

Homo sapiens dJ1042K10.3 (novel 

52 

50.000 



protein) 



4095 

AL022238 

Homo sapiens dJ1042K10.3 (novel 

1453 

93.416 



protein) 



4096 

AF191687 

Homo sapiens alanine -glyoxylate 

83 

39.286 



aminotransferase homolog 



4097 

AF100772 

Homo Bapiens tenascin-Ml 

70 

32.653 

4098 

AF100772 

Homo Bapiens tenascin-Ml 

2557 

99.155 

4099 

AF100772 

Homo sapiens tenascin-Ml 

8869 

97. 129 

4100 

AF100772 

Homo sapiens tenascin-Ml 

18627 

99.670 

4101 

L24521 

Homo sapiens transformation- related 

359 

58.824 



protein 



4102 

AF006264 

. Homo sapiens recombination and sister 

939 

87.356 



chromatid cohesion protein homolog j 



4104 

AF006264 

Homo sapiens recombination and sister 

320 

60.360 



chromatid cohesion protein homolog 



4105 

AF006264 

Homo, sapiens recombination and sister 

14 6 

100.000 



chromatid cohesion protein homolog 



4106 

AF006264 

Homo sapiens recombination and sister 

3147 

96.903 



chromatid cohesion protein homolog 



4107 

Y14391 

Homo sapiens GTP-binding protein 

395 

71.717 

4108 

Y14391 

Homo sapiens GTP-binding protein 

62 

43.243 

4109 

Y14391 

Homo sapiens GTP-binding protein 

2386 

95.302 

4111 

U63542 

Homo sapiens FAP protein 

200 

81.818 



4112 

U63542 

Homo sapiens FAP protein 

200 

81.818 

4113 

U63542 

Homo sapiens FAP protein 

200 

81,818 

4114 

U63542 

Homo sapiens FAP protein 

47 

24.000 

4115 

U63542 

Homo sapiens FAP protein 

52 

29.630 

4116 

U63542 

Homo sapiens FAP protein ■ 

203 

80 .000 

4117 

U63542 

Homo sapiens FAP protein 

200 

81.818 

4118 

U63542 

Homo sapiens FAP protein 

215 

78.947 

4119 

A27282 

Homo sapiens TGR-CL3C . 

98 

39.394 

4120 

M64983 

Homo sapiens fibrinogen beta chain 

767 

41.667 

4121 

M649S3 

Homo sapiens fibrinogen beta chain 

46 

33.333 

4122 

Z49825 

Homo sapiens hepatocyte nuclear factor 
4 alpha (HNF4alpha4) 

3334 

95 .810 

4123 

Z4982S 

Homo sapiens hepatocyte nuclear factor 
4 alpha (HNF4alpha4) 

2940 

91.633 

4124 

Z49825 

Homo sapiens hepatocyte nuclear factor 
4 alpha (HNF4alpha4) 

2582 

91.589 

4125 

Z49825 

Homo sapiens hepatocyte nuclear factor 
4 alpha (HNF4alpha4) 

526 

67.133 

4126 

Z49825 

Homo sapiens hepatocyte nuclear factor 
4 alpha (HNF4alpha4) 

3079 

93 .849 

4127 

M15530 

Homo sapiens B-cell growth factor 

153 

44 .828 

4128 

X55687 

Lycopersicon esculentum extensin 
(class II) 

158 

38 .356 

4129 

X55687 

Lycopersicon esculentum extensin 
(class II) 

124 

36 .232 

4131 

L10908 

Mus musculus Gcapl gene product 

87 

30 .357 

4132 

L10908 

Mus musculus Gcapl gene product 

88 

22 .857 

4134 

U14913 

Saccharomyces cerevisiae Ylrl98cp 

125 

36 .471 

4136 

X53581 

Rattus norvegicus ORF4 

248 

51 .389 

4137 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

35 

60 .000 

4138 

AF084256 

Homo Bapiens beta glucuronidase 
isoform d 

191 

57 .143 

4139 

M69297 

Homo sapiens ORF 3 

149 

62 .500 

4140 

AF133849 

Plasmodium falciparum DBL alpha 
protein 

107 

37 .500 

4141 

AF076469 

Canis familiaris cyclin-dependent 
kinase inhibitor 

107 

39 .474 

4142 

A31038 

Nicotiana alata PRP3 

145 

36 .620 

4143 

M60732 

Hordeum vulgare cold- regulated 

101 

36 .842 

4144 

M60732 

Hordeum vulgare cold- regulated 

142 

33 .696 

4146 

M15530 

Homo sapiens B-cell growth factor 

174 

54 .098 

4148 

D16632 

Mus musculus Al 

37 

57 .143 

4149 

D16632 

Mus musculus Al 

108 

38 .889 

4150 

M15530 

Homo sapiens B-cell growth factor 

167 

63 .158 

4151 

M15530 

Homo sapiens B-cell growth factor 

145 

44 .828 

4152 

M15530 

Homo sapiens B-cell growth factor 

14 8 

50 .000 

4153 

X55777 

Homo sapiens put* ORF 

243 

53 .012 

4155 

AF023530 

Mus musculus RNA helicase A 

371 

72 .152 

4156 

U01317 

Homo sapiens beta-globin 

636 

75 .385 

4157 

AF039853 

Mus musculus anti-activating 
transcription factor 1 Ig variable 
heavy chain 

46 

32 .432 

4158 

AF039853 

Mus musculus anti-activating 

42 

36 .364 


/O8b0 





transcription factor 1 Ig variable 
heavy chain 



4159 

AF03.9853 

Mus musculus ant i -activating 
transcription factor 1 Ig variable 
.heavy chain 

42 

53.333 

4160 

AF039853 

Mus musculus anti -activating 
transcription factor 1 Ig variable 
heavy chain 

78 

42. 857 

4161 

AF039853 

Mus musculus anti -activating 
transcription factor 1 Ig variable 
heavy chain 

59 

41.667 

4162 

AF039853 

Mus musculus anti -activating 
transcription factor 1 Ig variable 
heavy chain 

131 

31.395 

4163 

Z73102 

Caenorhabditis elegans Similarity to 
B.subtilis DNAJ protein 
( S W : DNAJJBACSU) ; CDNA EST yk4 3 7 a 1 . 5 
comes from this gene 

225 

33.898 

4164 

AJ005559 

Mus musculus SPR2A protein 

103 

32.609 j 

4165 

AF107406 

Homo sapiens GW128 

12 9 

50.000 

4166 

M36914 

Zea mays cell wall protein (put.); 
putative 

129 

33.333 

4167 

AF034551 

Zalophotrema at 1 anti cum NADH 
dehydrogenase subunit 3 

96 

33 .929 

4168 

X81900 

Homo sapiens NADH oxidoreductase 
subunit MWFE 

268 

75.806 

4169 

K03208 

Homo sapiens salivary proline-rich 
protein precursor 

157 

29.839 

4170 

A27282 

Homo sapiens TGR-CL3C 

61 

37.500 

4171 

A27282 

Homo sapiens TGR-CL3C 

104 

45 .283 

4172 

S79410 

Mu3 sp. nuclear localization signals 
(NLS) -binding protein=spot-l 

130 

47.500 

4173 

M15530 

Homo sapiens B-cell growth factor 

142 

42 .857 

4174 

Z14014 

Nicotiana tabacum Pistil extensin 
like protein, partial CDS only 

45 

29.787 

4175. 

Z14014 

Nicotiana tabacum Pistil extensin 
like protein, partial CDS only 

122 

38.356 

4176 

L20934 

Anopheles gambiae NADH dehydrogenase 
subunit 4L 

110 

31.343 

4177 

A3 103 8 

Nicotiana alata PRP3 

136 

36.486 

4178 

U03161 

Pseudomonas aeruginosa predicted 
protein of 93 amino acids, MW of 9569 

78 

40 .476 

4160 

AB030237 

Canis familiaris D4 dopamine receptor 

89 

35 .000 

4181 

AB030237 

Canis familiaris D4 dopamine receptor 

138 

30.526 

4182 

D29833 

Homo sapiens proline rich peptide P-B 

117 

39.394 

4183 

D29833 

Homo sapiens proline rich peptide P-B 

98 

42 .857 

4184 

M22332 

Homo sapiens unknown protein 

134 

44 .000 

4185 

U20239 

Mus musculus fibrosin 

97 

35.938 

4186 

AJ242540 

Volvox carteri f . nagariensis 
hydroxyproline-rich glycoprotein DZ- 
HRGP 

144 

42 .466 

4187 

AT242540 

Volvox carteri f . nagariensis 
hydroxyproline-rich glycoprotein DZ- 
HRGP 

448 

34 .855 

4188 

AF186275 

Echinometra mathaei bindin precursor 

111 

40 .000 


JL OO B M-3 . O 5 Ei! 6 O H 


% xoy 


Honio sapiens B-cell growth factor 

165 

54 .386 

A 1 Qfl 

Zn?rtOO "IPC 

Cyanidium caldairium unknown 

100 

33 .784 


M.JT V / vODZ 

Homo sapiens HSPC006 

527 

83 . 654 

- A 1 QO 

AT U / UDOZ 

Homo sapiens HSPC006 

1023 

89 . 005 

41 DO 
7 17 J 

/Vr U / Uooz 

Homo sapiens norLOOo 

537 

95 . 238 

4194 

U58658 

Homo sapiens unknown 

102 

50.000 

4 195 

U5 8658 

Homo sapiens unknown 

294 

66.197 

4196 

AF008196 

Homo sapiens bax epsilon 

35 

100.000 ! 

4197 

AP008196 

Homo sapiens bax epsilon 

35 

100.000 

4199 

AF008196 

Homo sapiens bax epsilon 

50 

50.000 

4200 

AF008196 

Homo sapiens bax epsilon 

142 

59.459 

4202 

U63332 

Homo sapiens super cysteine rich 
protein; SCRP 

128 

35.556 

4204 

U89695 

Rattus norvegicus non-hiBtone 
chromosomal architectural protein 

rlrltiJL - L. 

117 

36.667 


HT U ± 7407 

Homo sapiens uncoupling protein 2 

443 

88 . 608 

4206 

AF019409 

Homo sapiens uncoupling protein 2 

417 

100.000 

42 07 

AF019409 

Homo sapiens uncoupling protein 2 

601 

66.092 

42 08 

U05438 

Human immunodeficiency virus type 1 
envelope glycoprotein, v4v5 region 

109 

31.481 

A O f* Q 

AoU lb 4 J Z 

gallid herpesvirus 1 ORF 94 

51 

55. 556 

401 A 

4 / JIU 

Ar U^z 7Z 3 

Mus musculus beta-def ensin 3 

88 

35 .484 

4211 

AP000060 

Aeropyrum pernix 150aa long 
hypothetical protein 

119 

30.769 

4212 

M15530 

Homo sapiens B-cell growth factor 

176 

53.448 

4213 

AB032918 

Hylobates moloch dopamine receptor D4 

95 

34.211 

4214 

AB032918 

Hylobates moloch dopamine receptor D4 

78 

24.561 

4215 

AB032918 

Hylobates moloch dopamine receptor D4 

89 

29.630 

4216 

AF005370 

Alcelaphine herpesvirus 1 capsid 
protein 

87 

26.549 

4217 

AF005370 

Alcelaphine herpesvirus 1 capsid 
protein 

170 

33.071 

4218 

M19419 

Mus musculus proline-rich salivary 
protein 

195 

33.333 

4219 

AP000061 

Aeropyrum pernix 114aa long 
hypothetical protein 

82 

36.207 

4220 

AP000061 

Aeropyrum pernix 114aa long 
hypothetical protein 

118 

30.769 

*±ZZ X 

Ulobo b 

Homo sapiens T cell receptor alpha 
chain 

107 

31.148 

A O O O 


Homo sapiens B-cell growth factor 

161 

58.140 . 

A OO ^ 

JipAi T /r O O 

Arabidopsis thai i ana unknown protein 

124 

61.538 

4224 

AC011623 

Arabidopsis thaliana unknown protein 

122 

36.697 

A O O C 


Plasmodium vivax pval 

77 

31.373 

A OO C 

A924 85 

Plasmodium vivax pval 

185 

53.247 

4229 

U11682 

Trypanoplasma borreli unassigned 
readina frame 

68 

27.273 

4230 

M15530 

Homo sapiens B-cell growth factor 

185 

56.604 

4231 

AF159055 

Homo sapiens leucine zipper-like 
protein 

117 

59.375 

4232 

AC003981 

Arabidopsis thaliana F22013.17 

47 

38.889 

4233 

AF003534 

Chilo iridescent virus hypothetical 
protein 030L 

31 

44.444 
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4234 

AF003534 

Chilo iridescent virus hypothetical 
protein 030L 

35 

42.857 

4235 

AF003534 

Chilo iridescent virus hypothetical 
protein 03 0L 

77 

35.484 

4236 

AF003534 

Chilo iridescent virus hypothetical 
protein 030L 

38 

35.294 

4237 

AF003534 

Chilo iridescent virus hypothetical 
protein 030L 

40 

19.355 

A O 1 o 

Ar 0 03 53 4 

Chilo iridescent virus hypothetical 
protein 030L 

40 

70.000 

4240 

AF003534 

Chilo iridescent virus hypothetical 
jprotein 030li 

112 

30.303 



Plasmodium falciparum rps8 

97 

31.776 

a a *a 

AO 0055 6 4 

Mus musculus SPR2F protein 

75 

36.842 

4245 

AJ005564 

Mus musculus SPR2F protein 

89 

33.333 

4246 

U28971 

Caenorhabditis elegans similar to kD 
tandem repeat region of RD protein 
(nuclear rna-binding protein) 

146 

59.459 

4z4 / 

t mono 
JjO 7803 

Homo sapiens dynamin 

138 

41.667 

424 8 

AF107406 

Homo sapiens GW128 

164 

47.059 

4250 

L77967 

Ovis aries small proline-rich protein 
with paired repeat 

88 

34.483 

4251 

AJ005564 

Mus musculus SPR2F protein 

102 

41.860 

4252 

X90732 

Brassica napus Biotin carboxyl carrier 
protein 

133 

25.342 

4Zjj 

Ar 0o05o0 

beak and feather disease virus unknown 

110 

24.742 

4254 

AB032980 

Homo sapiens KIAA1154 protein 

1449 

99.554 | 

4255 

M77663 

Homo sapiens keratin 10 

230 

29.134 

4256 

U3 1928 

Plasmodium vivax merozoite surface 
protein- 1 

104 

25.954 

/ 

Xy 84 85 

Plasmodium vivax putative 

230 

43.617 

"± o o 

Ml CCJft 

Homo sapiens B-cell growth factor 

162 

72.222 

*x ^ _) ^ 

Ar Uo425b 

Homo sapiens beta glucuronidase 

IdULUI III U 

113 

36.145 

4261 

M88366 

Homo sapiens DNA-binding protein 

70 

43.750 

O 

1*1 J. 3 3 .3 U i 

Homo sapiens B-cell growth factor 

168 

42.222 

4263 

AJ005565 

Mus musculus SPR2G protein 

97 

35.417 

/l "> c a 

AFQ07826 

Homo sapiens bax epsilon 

167 

50.000 

4265 

M15530 

Homo sapiens B-cell growth factor 

155 

68.571 

4266 

X55777 

Homo sapiens put. ORF 

240 

54.286 j 

4267 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

168 

75.000 

4268 

J02963 

Homo sapiens platelet glycoprotein lib 
precursor 

164 

58.537 

4270 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

425 

74.074 

A O "7 1 

Y1 ^ O O *7 

A16827 

Dictyostelium discoideum 2C gene 
product (AA 1-98) 

110 

33.846 

4273 

AL023781 

Schizosaccharomyces pombe Caffeine - 
induced death protein Cid2p 

12 0 

211875 

4274 

M15530 

Homo sapiens B-cell growth factor 


100.000 

4275 

M15530 

Homo sapiens B-cell growth factor 

120 

67.857 

4276 

M15530 

Homo sapiens B-cell growth factor 

182 

60.465 

4277 

S58431 

Gallid herpesvirus 2 orf 24 

118 

45.283 j 


Ife8(e3 


..to en g <+e f-ii- :.;••!; ,„ o e a & o e 


A "3 7 Q 


Drosophila melanogaster segmentation 
protein fKU *± 

56 

75.000 


MI4C4Q 

urosopmia melanogaster segmentation 

100 

52.381 

4280 

M1553 0 

rnjimj BdpxcnB d-ccjli growtfl, laccor 

132 

57.895 

4281 

f^-C U J O X J J 

xiuiiiciii xiiuiiunuuciiciency virus cype x 
envelope alvcoDrotpin 

72 

46.667 

4282 

U5865 8 

Homo saoiena unknown 

o / 

8 0 . 0 0,0 

4283 

U58658 

Homo s aniens unknown 

f z 

90 . 000 

4284 

U58658 

Homo sapiens unknown 

157 

46.774 

4285 

L10908 


107 

26.230 

4286 

M15530 

Homo sat) ipriR R-rpl 1 nrouth f ar t. nr 

187 

71 . 053 

4287 

M15530 

Homo BaDiPTlR R-rpl 1 nrnwHi f onh r*»-r- 

181 

55 . 932 

4288 

AJ005559 


107 

33 . 824 

4290 

U05313 


l-^.D 

36.364 

4291 

X77858 

putative E5 

87 

66 . 667 

4292 

M15530 

Homo sapiens B-cell arowth fachnr 

7 n a 

DU . 000 

4293 

U63542 

Homo sapiens FAP protein 

x o u 

65 .789 

4294 

L23927 

Escherichia coli putative 

39 

41.667 [ 

4295 

L23927 

Juov^iJ^JL ±^ll±a t_ ^ X X UULdLX vc 

87 

34 . 091 

4296 

Z22 864 

d v J. ail uUCl 1U V XX Uq UK v O 

. 85 

33.962 

4297 

Z22864 

avi ^n arlpnov^ run OPPQ 
av xau aucilvviiUci v/iv r O 

86 

36.364 

4319 

AF126163 

iiuiHw oaijxciio fin 1 irtj pxUtcin 

153 

75 . 862 

4322 

AF126163 

*T>X^ X^OID J 

nvJdivJ bctpicllS xIJxLlRj prOtGXIl 

150 

75 . 000 

4323 

D50915 

nuuiu bapienH ine mmuxz!) gene product 
is novel. 

80 

44 . 444 

4325 

M62415 

Pseudopleuronectes americanus HPLC6 

89 

52 .000 

4327 

A3 * 1D1 

Mus musculus Q300 protein (AA 1-77) 

146 

65 .517 

4328 

M15530 

Homo sapiens B-cell growth factor 

136 

59.459 ! 

4329 

L10908 

Mus musculus Gcapl gene product 

95 

31.818 

X -j t n 

4J3U 

X95276 

Plasmodium falciparum rpl23 

76 

30 .159 

4 j j 1 

AC0044 16 

Homo sapiens 

wuvou : n_KCjOXJNi2 . gw. 1335199 . a gene 

Lyx UUUL- L. 

30 

38 .462 

4332 

AC004416 

*AV_ V V/ ^* X O 

numu odpxcno 

»* • xvvj v x jivxo i y w . x«5Joxi72i'.a gene 
product 

31 

40 .000 

4335 

L10908 

Mus musculus Gcapl gene product 

OA 

3 8. 462 

4336 

D87437 

Homo sapiens KIAA0250 

5405 

99 .751 

4337 

AF061244 

Agrocybe aegerita unknown 

117 

30 .667 

4338 

X66285 

Mus musculus HC1 ORF ( 

125 

29 .508 

4339 

X7 7 816 

Kautus norvegxeus PR-vnetal 

145 

48 .869 

4340 

w j 0 0 j 0 

Homo sapiens unknown 

220 

62 .162 

4341 

UD J Jl^ 

womo sapxens r ap protein 

188 

71.429 

4343 

OCO4C7 

fiasmouium raxcxparnm, Malayan Camp 

o v^-L a L. t; , repuiae rdxt iai, 1/b aa 

PfEMP2, MESA, PP33 0=large antigenic 
protein {repeating structure) 

108 

31 .707 

4345 

AF004561 

Homo sapiens p21~Arc * 

562 

92 .308 

4346 

AF004561 

Homo sapiens p21-Arc i 

151 

81 .250 

4347 

AF004561 

Homo sapiens p21-Arc 

847 

85 .870 

4346 

M15530 

Homo sapiens B-cell growth factor 

142 

49 .057 

4349 

M15530 

Homo sapiens B-cell growth factor 

214 

73 .810 


.1. 0 Olfc): NO „, 0! iE fl! 6. ill s 


4350 

M15530 

Homo sapiens B-cell growth factor 

55 

50.000 

4351 

AC004416 

Homo sapiens 

WUGSC:H_RG013N12 .gw. 1335199. a gene, 
product 

153 

58.333 

4352 

U58755 

Caeriorhabditis elegans C34D4 . 11 gene 
product 

124 

40. 816 



Trypanosoma brucei ORF2 

100 

36.066 

4354 

M15530 

Homo sapiens B-cell growth factor 

167 

71.795 

4356 

M69297 

Homo sapiens ORP 2 

367 

73 . 810 

4357 

M69297 

Homo sapiens ORF 2 

440 

78.261 

4358 

M69297 

Homo sapiens ORF 2 

425 

81.176 

4359 

M69297 

Homo sapiens ORF 2 

407 

77.647 

4360 

M15S30 

Homo sapiens B-cell growth factor 

176 

40.506 

4361 

AC007168 

Arabidopsis thaliana hypothetical 
protein 

85 

57.895 

4362 

AC007168 

Arabidopsis thaliana hypothetical 
protein 

254. 

30.698 

4363 

AFQ10144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

108 

33.088 

4364 

AF010144 

Homo sapiens neuronal thread protein 

AD7C-NTP 

441 

72.619 

4365 

M11759 

Lycopersicon esculent urn cell wall 
hydroxyproline-rich glycoprotein 

152 

34.940 

4366 

U97553 

murine herpesvirus 68 unknown 

121 

36.082 

4367 

U97553 

murine herpesvirus 68 unknown 

145 

36.264 

4370 

AF107406 

Homo sapiens GW12 8 

150 

50.000 

4372 

X52X64 1 

Mus musculus Q300 protein (AA 1-77) 

127 

37.500 

4373 

D00570 

Mus musculus open reading frame (251 
AA) 

119 

23.506 

4374 

U16359 

Rattus norvegicus nitric oxide 
synthase 

42 

35.294 

4375 

U16359 

Rattus norvegicus nitric oxide 
synthase 

112 

100.000 

4376 

U16359 

Rattus norvegicus nitric oxide 
synthase 

112 

71.429 

4377 

U96418 

Dennyus somadikartai cytochrome b 

143 

25.253 

4378 

X97667 

Gallus gallus homologous to Drosophila 
homeobox gene tinman 

47 

26.829 

4379 

X97667 

Gallus gallus homologous to Drosophila 
homeobox gene tinman 

91 

46.154 

4380 

X97667 

Gallus gallus homologous to Drosophila 
homeobox gene tinman 

149 

42.857 

4382 

AF022985 

Unknown Similar to collagen; coded for 
by C. elegans cDNA yk55f3.3; coded for 
by C. elegans cDNA 

102 

34.711 

4383 

AF022985 

Unknown Similar to collagen; coded for 
by C. elegans cDNA yk55f3.3; coded for 
by C. elegans cDNA 

125 

32.609 

A "5 a A 

7VTJAOO nor 

Unknown Similar to collagen; coded for 
by C. elegans cDNA yk55f3.3; coded for 
by C. elegans cDNA 

157 

33 .636 

4386 

L23852 

Homo sapiens Putative 3 1 end of coding 
region; putative 

77 

36.538 

4387 

L23852 

Homo sapiens Putative 3 1 end of coding 
region; putative 

77 

36.538 


lOgfeT 
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4388 

S58722 

Homo sapiens X- linked retinopathy 

113 

40.000 



protein {C- terminal , clone XEH.8c} 



4389 

X55777 

Homo sapiens put. ORP 

213 

54.098 

4390 

X52164 

Mus musculus Q300 protein (AA 1-77) 

121 

37.736 

4391 

M15530 

Homo sapiens B-cell growth factor 

174 

68.421 

4393 

AF026689 

Homo sapiens prostate-specific 

131 

62.791 



transglutaminase 



4394 

X92485 

Plasmodium vivax pval 

178 

46.753 

4395 

M15386 

Homo sapiens gamma-globin 

897 

93.421 

4396 

M15386 

Homo sapiens gamma-globin 

544 

77.083 

4398 

S79410 

Mus sp. nuclear localization signals 

212 

.85.294 



(NLS) -binding protein=»spot-l 

- 


4399 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot- 1 



4400 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4401 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4402 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4403 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4404 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NIiS) -binding protein=*spot- 1 


- 

4405 

S79410 

Mus sp. nuclear localization signals 

219 

87.879 



(NLS) -binding protein=spot- 1 



4406 

S79410 

Mus sp. nuclear localization signals 

204 

96.429 



(NLS) -binding protein=spot-l 



4407 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4408 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4410 

S79410 

Mus sp. nuclear localization signals 

212 

85.294 



(NLS) -binding protein=spot-l 



4411 

S79410 

Mus sp. nuclear localization signals 

215 

82.857 



(NLS) -binding protein=spot-l 



4413 

S79410 

Mus sp. nuclear localization signals 

187 

39.831 



(NLS) -binding protein=spot-l 



4414 

S79410 

Mus sp. nuclear localization signals j 

236 

84.211 



(NLS) rbinding protein«spot-l 



4415 

AF014955 

Homo sapiens TFAR19 

436 

74.590 

4416 

AF014955 

Homo sapiens TFAR19 

218 

89.744 

4418 

U16359 

Rattus norvegicus nitric oxide 

96 

68.182 



synthase 



4419 

X07858 

Saccharomyces cerevisiae ORF (l is 3rd 

107 

30.488 



base in codon) (266 is 1st base in 





codon) 



4420 

X92485 

Pi AnmrtH^ iim vi vay nval 
r laoiiiuuxum v^voa y-f v a. JL 


53 .226 

4421 

X92485 

Plasmodium vivax pval 

55 

53.846 | 

4422 

X79238 

Homo sapiens ribosomal protein L30 

739 | 

100.000 

4423 

X79238 

Homo sapiens ribosomal protein L30 

291 

94 .231 

4424 

X79238 

Homo sapiens ribosomal protein L30 

3 97 

78.814 

4425 

S79410 

Mus sp. nuclear localization signals 

147 

54.054 



(NLS) -binding protein=spot-l 




.1. 0 O 8 HhiL-i. «+3 . o e a i& u e 


4426 

AF 063 bob 

Melanoplus sanguinipes entoraopoxvirus 
ORF MSV233 Hypothetical protein 

110 

34.483 

A A *> *7 

C"7 Q A 1 ft 

Mus sp. nuclear localization signals 

/ XTT O \ "V"\ 4 4 v\i~t t-\ -V /-n o { n v— . * 4— I 

\xmjuo / -Dinamy prouein=spot-i 

142 

52.273 

4428 

npi c one q 

nomo sapiens xeucine zipper- 11X6 
protein 

147 

80 . 000 

4429 

U344 67 

rjS i q 3 v* i i inn an 1 "NTD'D T.O HA CO 1 U Q — *_ _ % , 1 J _ 
ruoax jLUiii op • niu\JjZ U 4 D 7 DcUa* LuDUlin 

110 

36. 170 

4430 

X63417 


586 

49 . 457 

4431 

X63417 

Homo sapiens IRLB 

142 

36.066 

4432 

TT1 A** Q 

UIOJ J? 


125 

53 . 846 

4433 

M12668 

Measles virus unknown protein 

124 

35.526 

4 A 1 A 

** rk J ^ 


Rattus norvegicus nitric oxide 
synthase 

109 

68.000 

A A 1 R 

rlDD J U U 

Rattus sp. synapsin I 

106 

35.385 j 

A A *3 <T 


Caenorhabditis elegans No definition 
line found 

57 

25.000 

4437 

U/i J DID 

LaenornaDuiciB eiegans no aetlnition 

70 

32 . 609 

4438 

U2 3 5 1 6 

v^acuui uauui t x 0 cxcyaUD 1M \J dCtllllUlOn 

line found 

1 0 1 a 

29 . 909 

4439 

V00488 

Homo sapiens alpha globin 

925 

100.000 

4440 

V004 Rft 

nuiuu Bcipicns dipna. y j. ODin 

359 

80 . 645 

4441 

A06407 

synthetic construct tac-sod hybrid 

288 

64 .286 

A A AO 

u / 

synthetic construct tac-sod hybrid 

.488 

100.000 

4443 

A06407 

synthetic construct tac-sod hybrid 

266 

69.565 

444 4 

AF106329 

Escherichia coli unknown; orf86 

117 . 

55.172 

444 5 

M2673 0 

Homo sapiens ubiquinone -binding 
protein \QP) 

582 

68.987 

4446 


Homo sapiens ubiquinone -binding 

th» rrit"Pi / AO ^ 

168 

100 . 000 

4447 

M2 67**0 

nomo Bdpiens uxjiquinoiie-Dinuing 
nrotein (OP) 

319 

74 . 359 

4448 

X69962 

Homo a 3 T-) T*l R imnn rnp»r1 Y\Trr"%t~& t n nvrk/^nr«f 
tiuinw Dapxcjio uiuiamcu piV/Lclll pxuUuCC 

O/l 

8 9 . 655 

4449 

X69962 

Unmn oari^ priq unTi^rnfar? nrrit-o-l n T-\w»^r3iis-«+» 
nuiiiv Dajpxciio -Ullilallicu piOLclIl pxO^UC L 

114 

35 . 644 i 

4450 

XS9962 

Homo sapiens unnamed protein product 

43 

70.000 

4451 

X69962 

nuiuo Hdjpxciifc* uxiiidiTieci prone in product 

3819 

93 .513 

4452 

X69962 

Homo sapiens unnamed protein product 

4220 

100.000 

A A "5 

unn a n o 

Homo sapiens alpha globin 

.830 

100.000 

4454 

V00489 

Homo sapiens alpha globin 

741 1 

88.652 

44CC 

von A A Q 

Homo sapiens alpha globin 

94 

50.820 

4460 

D90825 

Escherichia coli Probable ATP- 
dependent helicase dinO homolog. 

53 

23.256 

4461 

D90825 

Escherichia coli Probable ATP- 
dependent helicase dinG homolog. 

81 

37.255 

4465 

D90825 

Escherichia coli Probable ATP- 
dependent helicase dinG homolog. 

57 

53.333 

A AC O 


Escherichia coli Probable ATP- 
dependent helicase dinG homolog. 

60 

27.778 

4478 

D90825 

Escherichia coli Probable ATP- 
dependent helicase dinG homolog. 

68 

27.778 

4479 

D90825 

Escherichia coli Probable ATP- 
dependent helicase dinG homolog. 

2727 

99.523 

4480 

AB030483 

Mus musculus B9 

42 

4 0.000 

4481 

AB030483 

Mus musculus B9 

78 

25.253 


I08<p"i 


1, 0 O B NH& «+3 ,„.o e K £„ o' id; 


4482 

AB030483 

Mus musculus B9 

44 

25.862 

4483 

AB030483 

Mus musculus B9 

163 

26.543 

4484 

L26247 

Homo sapiens isolog of yeast suil and 
rice gos2; putative 

361 

100.000 

4485 

X52164 

Mus musculus Q300 protein (AA 1-77) 

101 

71.429 

4486 

A00279 

synthetic construct Human serum 
albumin 

548 

39.286 

4488 

AL049685 

Homo sapiens hypothetical protein 

1193 

100. 000 

44 89 

U18922 

Saccharomyces cerevisiae YerlSlcp 

100 

26.263 

4490 

S45142 

Pisum sativum ENOD3=metal -binding 
protein 

104 

38.889 

4491 

L02426 

Homo sapiens 26S protease (S4) 
regulatory subunit 

1774 

83.631 

4492 

L02426 

Homo sapiens 26S protease (S4) 
regulatory subunit 

371 

89.394 

44 93 

L02426 

Homo sapiens 2 6S protease (S4) 
regulatory subunit 

342 

93.220 

4494 

L02426 

Homo sapiens 26S protease (S4) 
regulatory subunit 

2120 

88.616 

4495 

M84237 

Homo sapiens integrin beta-1 subunit 

136 

64.706 

4496 

M37679 

Mus musculus Ig heavy chain precursor 

75 

60.000 1 

4497 

X15081 

Crithidia fasciculata ctyochrome c 
oxidase subunit I (2034 is 3rd base in 
codon) 

68 

62.500 

4498 

AP000062 

Aeropyrum pernix 106aa long 
hypothetical protein 

57 

53.846 

4499 

AP000062 

Aeropyrum pernix 106aa long 
hypothetical protein 

58 

38.710 

4500 

AP000062 

Aeropyrum pernix 106aa long 
hypothetical protein 

57 

53.846 

4501 

AP000062 

Aeropyrum pernix 106aa long 
hypothetical protein 

111 

32.432 

4503 

AP000062 

Aeropyrum pernix 92aa long 
hypothetical protein 

91 

34.848 

4504 

X72879 

Homo sapiens ORF 

145 

84.615 

4505 

AF107406 

Homo sapiens OW128 

46 

36.000 

4506 

AF107406 

Homo sapiens GW128 

164 

47.273 j 

4507 

X00822 

Gallus gallus collagen type III 

70 

34.091 

4509 

S79410 

Mus sp. nuclear localization signals 
(NliS) -binding protein=spot-l 

136 

79.167 

4510 

M93698 

Oncorhynchus mykiss ependymin 

185 

25.490 

4511 

M93698 

Oncorhynchus mykiss ependymin 

249 

24.752 

4512 

X55687 

Lycopersicon esculentum extensin 
(class II) 

43 

4 6.154 

4 513 

X55687 

Lycopersicon esculentum extensin 
(class II) 

82 

32.432 



Homo sapiens proteasome subunit z 

715 

64.220 

4516 

D38048 

Homo sapiens proteasome subunit z 


1 OO . 000 

4517 

D38048 

Homo sapiens proteasome subunit z 

328 

98.077 

4518 

D38048 

Homo sapiens proteasome subuni^t z 

1039 

83.883 

4519 

AJ131555 

Hepatitis B virus X protein 

74 

32.558 

4521 

AF095350 

Homo sapiens RAB-like protein 2A 

417 

82.927 

4522 

AF095350 

Homo sapiens RAB-like protein 2A 

327 

54.386 

4523 

AF095350 

Homo sapiens RAB-like protein 2A 

46 

2 3.077 j 


108698 


± O O &4' 6 <4»3 „, 10 H H 6 O B 


4524 


XllJIIlV' odpXCim AnJD XXJVC piULcJ.Il ^Si-L 

1 Q A 

7o . 190 

4525 

AF095350 

Homo Bapiens RAB-like protein 2A 

38 

44.444 

fijZD 


riomo sapiens KAo-iuce protexn 2A 

691 

60 . 759 

4527 

AF095350 

Homo sapiens RAB-like protein 2A 

1418 

93.724 


TV «> A Q C *a C A 

Homo sapiens RAB-like protein 2A 

1308 

75. 166 

4530 

AL021747 

Schizosaccharomyces pombe hypothetical 
protein 

152 

33.721 

4531 

AL021747 

Schizosaccharomyces pombe hypothetical 
protein 

38 

28.125 

4532 

AL021747 

Schizosaccharomyces pombe hypothetical 
protein 

127 

36.364 



ocnizosaccnaromyceB pomoe nypocnecicax 

• t*» v*#~> hoi n 
JL/J.UI.CXU 

40 

27.586 

4 534 

AL021747 

O^IiX^Uoav,UIiaXU(liyi..co LJvJlIlXJti IiypULllcLlCaX 

protein 


21 . 429 

4535 

AL021747 

Schizosaccharomyces pombe hypothetical 

155 

29.703 

4536 

AB013367 

Bacillus halodurans unknown 

152 

43.103 

4537 

AB013367 

Bacillus halodurans unknown 

671 

38.217 

4538 

X76268 

Bacteriophage phi-C31 ORF 14 

74 

38.889 

4539 

Z14963 

Homo sapiens codes for truncated alpha 
mRNA of alpha heavy chain disease 
patient LTB 

80 

51.613 

4540 

214963 

Homo sapiens codes for truncated alpha 
mRNA of alpha heavy chain disease 

pdCl CllL. -Li 1 if. 

80 

51.613 

4 54X 

7.1 4 Qfi"* 

rioTiio Bapiens coues ror cruncauea axpna 

liuvlMrt. Ul clJ»£Jllcl XXGctVy l_ilclJ.ll ulScdbc 

patient LTE 

86 

30 . 556 

4542 

L10908 

VI U o IIILlDl^ Ul Lit* VjU-cIUJX (jciit; px UUUwL 

± Xz> 

4 0.541" 

4543 

AJ242956 

Homo sapiens El fusion protein 

147 

66.667 

4544 

T.l OQOR 

rnis idudouiud oCapi gene produce 

34 

3 6 . 364 

4545 

L10908 

Mus musculus Gcapl gene product 

44 

29.167 


T 1 A Q A O 

Mus musculus Gcapl gene product 

84 

57. 143 


KOODOO 

Schizosaccharomyces pombe hypothetical 
trp-asp repeats containing protein 

59 

33 .333 

4 ^4 fl 

oOODOO 

Schizosaccharomyces pombe hypothetical 
trp-asp repeats containing protein 

57 

27 . 500 

4*^4 Q 

KOODOO 

Schizosaccharomyces pombe hypothetical 
trp-asp repeats containing protein 

938 

54 . 741 


LIU JUO 

raus muscuxus i?capi gene product 

240 

50 . 000 

4551 

D26462 

Nicotiana glauca X Nicotiana 
langsdorf f ii tumor-related protein 

38 

24.000 

4552 

D26462 

Nicotiana glauca X Nicotiana 

langsdorf fii tumor-related protein ! 

84 

37.143 

4553 

D16387 

Asterina pectinifera cytochrome 
oxidase subunit III 

75 

38.462 

4554 

U16359 

Rattus norvegicus nitric oxide 
synthase 

70 

4 4 444 

4555 

U16359 

Rattus norvegicus nitric oxide- 
synthase 

123 

63.333 

4556 

S77099 

Drosophila pseudoobscura, Peptide, 14 9 j 
aa Jan A 

53 

41.379 

4557 

S77099 

Drosophila pseudoobscura, Peptide, 14 9 

260 

45.745 
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4558 

S77099 

LTObopxixia pseuaoooscurd , Jreptxae, 149 
aa. * iTan A 

160 

45.872 

4559 

AF001904 

Homo sapiens 3-hydroxyacyl-CoA 
dehvdroaenase isnform 9 

104 

47.727 

4560 

A50313 

unidentified SEGMENT ALPHA OP HUMAN 
APOMYOGLOB IN 

294 

100. 000 

4561 

X52164 

Mus musculus Q300 protein (AA 1-77) 

131 

38.000 

4562 

AL049638 

iuaivxuu^Diu tnanana puLdClve protein 

388 

41.011 

4563 

U16359 

Rattus norvegicus nitric oxide 

O y XX LlidOC 

52 

30 .303 

4564 

U16359 

Rattus norvegicus nitric oxide 

O Jf XI UilCloC 

106 

47.500 

4565 

U31788 

Human papillomavirus type 4 4 

86 

40.000 

4566 

M22659 

MU9 musculus TCR alpha-chain (Va8- 
Jal4T-C) 

79 

32.653 

4567 

U16359 

Rattus norvegicus nitric oxide 

Rvnh \\ Pi Q#a 

125 

94.118 

4568 

U16359 

Rattus norvegicus nitric oxide 

125 

94.118 

4569 

AC003034 

Homo sapiens Gene with similarity to 
iat Kianey-specitic ( ks J gene 

211 

56.757 

4570 

AC003 034 

xiomo sapiens bene wicn similarity to 

rat }r i dnf^v ~ Rnpr^i f ■{ p /vq^ noma 
■*-***» d]jcc xi.iu \ ivo } gene 

261 

55.446 

4571 

AC003034 

rat kidnev-soecif ic (K*3) a#»n*> 

518 

100. 000 

4572 

AC003034 

Homo saniens Gene with Rimi larifv t-r* 
rat kidney-specific (KS) gene 


99 . 451 

4573 

AF110315 

Homo sapiens unknown 

80 

42.500 

4582 

AF110315 

Homo sapiens unknown 

ob 

3 1 . 132 

4587 

AF110315 

Homo sapiens unknown 

o / 

2 0 . 635 

4592 

AF110315 

Homo sapiens unknown 

Q -I 

81 

25 . 466 

4599 

AF110315 

Homo sapiens unknown 

86 

24.6X5 

4601 

AF110315 

Homo can^ <=»n«a nnV n num 

2669 

100. 000 

4603 

U50403 

Homo r ri n i ri ps Kypa at* pan/^o -v m\t-\v\-w «. «-» <-> « 
iiwuivy oapicuo uicdoL vaflCcx Slipp IcSSOlT 

element Ishmael Upper RP2 

118 

64.000 

4604 

X70050 

Giardia intestinalis ubiquitin 

156 

32.857 

4605 

X66285 

Mus musculus HC1 ORF 

14 0 

28.814 

4606 

L41675 

Human T-cell lymphotropic virus type 2 

TiT-of*** - ! n 11 -ir\7 

^XULCXil XX A V 

84 

39.286 

4607 

AB033064 

Homo sapiens KIAA1238 protein 

193 

53.077 

4608 

/VOVJ JUD^t 

Homo sapiens KIAA1238 protein 

2681 

100.000 

460? 

Z83227 

Caenorhabditis elegans predicted using 
uenerinaer; weax similarity to high- 
sulphur keratins.; cDNA EST yk663al.3 
comes from fhi r nt^n p» 

wviiiwo *>*- VIII LUXO MCUC 

92 

38.889 

4610 

K01664 

Drosophila melanogaster Bkm-like 
protein 

92 

34.000 

4611 

S79410 

Mus sp. nuclear localization signals 
(NLS) -binding protein=spot-l 

143 

79.167 

4612 

Y08982 

Homo sapiens synaptonemal complex 
lateral element protein 

9956 

100.000 

4613 

AF030429 

Canis familiaris dentatorubro- 

129 

29.032 
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pallidoluysian atrophy protein 



4614 

AF043703 

Caenorhabditis elegans No definition 
line found 

104 

38.636 

4615 

L10908 

Mus musculus Gcapl gene product 

92 

60.000 

4617 

AL117669 

StreDtomvces coelicolor A"* (?\ 
hypothetical protein 

j 126 

27 .473 

4620 

Z70222 

hypothetical 9.8 kD protein ZK652.3 of 
Caenorhabditis elegans, spP34661 

1 loo 

93 . 939 

4621 

AF159055 

Homo sapiens leucine zipper-like 
protein 

lift 

c o /r »"i ^ 

58 . 621 

4622 

AF068302 

Homo sapiens 

choline/ ethanolaminephosphotransf erase 

1 Q7Q 

97 . OoO 

4623 

X52560 

Homo sapiens nuclear factor NF-IL6 (AA 
1-345) 

1 2385 

QQ A •> f\ 

4624 

AB023483 

Homo sapiens docking protein BRDG1 

1927 

go fifii 

4625 

S58722 

Homo sapiens X-linked retinopathy 
protein (C-terminal, clone XEH.8c) 

130 

J o . o o / 

4626 

AL121804 

Drosophila melanogaster BACR7C10 . z 

[ 45 


4627 

AL121804 

Drosophila melanogaster BACR7C10 . z 

1 103 

36 111 

4628 

D90764 

Escherichia coli ORF_ID : o253#13 ; 
similar to 

74 

JO. / 

4629 

AB032974 

Homo sapiens KIAA1148 protein 

1 53 

2 9 7 fl7 

• /Of 

4630 

AB032974 

Homo sapiens KIAA114 8 protein 

628 

79.365 

4631 

AB032974 

Homo sapiens KIAA114 8 protein 

49 


4632 

AB032974 

Homo sapiens KIAA114 8 protein 

628 

79.365 

4633 

AB032974 

Homo sapiens KIAA114 8 protein 

4 0 

AC A c e 

4 5 . 455 

4634 

AB032974 

Homo sapiens KIAA114 8 protein 

1032 

100.000 1 

4635 

M11221 

Daucus carota extensin 

•7 Q 
/ O 

52.381 

4636 

AF066061 

Mus musculus glucosidase II beta- 
subunit 

46 

21.212 

4637 

AF066061 

Mus musculus glucosidase II beta- 
subunit 

64 

75 . 000 

4638 

AF066061 

Mus musculus glucosidase II beta- j 
subunit | 

J-VO 

c o err 

4639 

AF066061 

Mus musculus glucosidase II beta- j 
subunit 

147 

4 ft Ann 

4640 

U00039 

Escherichia coli livF 

224 

96.970 

4641 

U00039 

Escherichia coli livF 

759 

QQ 1 #TfJ 

4642 

U00039 

Escherichia coli livF | 

1504 

-7 » 3 / X 

4643 

U00039 

Escherichia coli livF 

1121 

Q"7 74fi 

4644 

U22229 

Felis catus ribosomal protein L41 

90 

i nn nnn 

4645 

U22229 

Felis catus ribosomal protein L41 I 

76 


4646 

AF159055 

Homo sapiens leucine zipper-like 
protein | 

128 

6*7 74*> ' 

4647 

X02375 

Homo sapiens pro-alpha 1(11) collagen | 

69 

4 6 . 429 

4648 

X77770 

Homo sapiens ribosomal protein S26 1 

37 

3 6 .364 

4649 

X77770 

Homo sapiens ribosomal protein S26 | 

785 

98.261 


A77770 

Homo sapiens ribosomal protein S26 

647 

87.387 

4651 

X77770 

Homo sapiens ribosomal protein S26 | 

3 92 

98.333 

4652 

X77770 

Homo sapiens ribosomal protein S26 

314 

65.079 

4653 

U18339 

Variola virus D4L 

97 

40.000 

4654 

AE000659 

Homo sapiens TCRAV10S1 

620 

84.211 

4655 

U16359 

Rattus norvegicus nitric oxide 

109 

57.143 


I0VH 
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4656 

AL022299 

ocnizoBaccnaroinyces pornoe hypothetical 

144 

29-885 



RNA—hind i ner nrn^p^n 


4657 

X80914 

Ft H 1 TTIOTI ^ d hPTT»»OV^ rilfl O <~\ 
Da^mviiJLU IJCipCp V X X UB 46 Oil 

130 

38.596 

4658 

M13941 

Human hprnpovl 1*11 0 A mml n >- ' n « 1 

j. 11x11 id 1.1 iicipcovHUo ** ilUC JLcal anulQcit X 

105 

36.923 

4659 

AO 014 2 

Homo oani p^t* a 1 \mrr»Vir"Nl A n a. T 7VO 0 
xiwiiiw DapxCHo X jr UipilUX XIlC J tf\\y 

495 

100.000 

4660 

A00142 

Homo RADI Ana 1 vmriftnl 1 na T 7\P *5 

408 

78.313 

4661 

M24743 

Homo qnni ona MTJO Uoat- DVin^b- *\v>a4>a<!« 

xiwiut^ uapicuD i v ixik«. neac shock proccm 

120 

62.857 



HSP70-1 


4665 

Z68218 

Caenorhabdit i s plpa^nn ifniHi 9 i 

34 0 

1 69.231 

4666 

X96401 

Homo can^ ana DOY a -I i-> 

civ^uii^ oapiciiu KvA protein 

66 

46.154 

4667 

X96401 

xioino sapiens kua protsin 

3808 

100.000 

4669 

M37679 

fjus muscuius ig neavy chain precursor 

33 

44.444 

4670 

M37679 

mus muBcuius ig neavy cnain precursor 

102 

53.333 

4671 

U11682 

x xypanopiasma Dorreii unassigned 

62 

29.412 



rpaHi Tin •FY-amo 


4673 


Arabidopsis thaliana contains 

1014 

51.227 



Bimiidtity t.o niscone aeacetyiases ; 




xrxctiii rruuoou, DCOretslj.3 / E^Se-lO, N=l 



4674 

Y11514 

Homo sapiens transcription factor 

912 

94.937 

4 675 


vaenornajDaitis elegans F35C11.4 

176 

23.792 

4 676 

uu 4 / 

Homo sapiens furin 

95 

61.905 

*a D / / 


Homo sapiens leucine zipper- like 

105 

54.167 



procem 


4678 

U5 8755 

uaenornaDaicis elegans C34D4.11 gene 

123 

47.826 



)^ UUUU L> 


4679 

O / ?1 lu 

Mus sp. nuclear localization signals 

105 

60.000 



\nxio/ -jDinaing protein=spot-l 


4680 


Homo sapiens beta glucuronidase 

199 

53 .704 



isotox in a 


4681 

J\. f O / -L / 

Homo sapiens MT-11 protein 

413 

92.157 

4682 

Y7C7 t 7 

Homo sapiens MT-11 protein 

470 

88.525 

4685 


Mus musculus glycine tyrosine -rich 

31 

50.000 



hair nrr»t*f>^ n 

**«-»-X ^1 UUC 111 , 


4686 

D86422 

tiuo uiubLuiuo glycine tyrosine— ri cn 

38 

44.444 



hair Drotein 


4687 

D86422 

MUS milRCllltIR nlvr^no H/moino v! »K 

* muo^uiuo y J.y u jLiic cyiOBiiic~riCfl 

73 | 

41.935 



hair Drotein 


4688 

X02156 

4ja.\-^.ixcxxkJiuyv^trtf LclcVlSldc UKr 1 

61 1 

32.000 

4689 

X02156 

flarrharnmvrpo <-«<a-r-»-tr^ o<4 a a tttsi? -i 
wa^uimiuiuycco LcicvlSlac UK.XT 1 

58 

21.053 j 

4690 

X02156 

oacLnaxomyces cerevisiae urf l 

69 

52.632 

4691 

U79260 

HfMTIO C3 ^a f*A "1 Ann tmlr-n/*\i.Tvi 

bapiens unxnowa 

245 

60.317 

4692 

X70050 

«j.axuid liitcuuinaiis uDiquitln 

156 | 

32.857 

4693 

L77967 

UV1U dxxes Bman proiine-ricn protein 

103 

32.609 



with "Dairc^d rpnpa t* 


4694 

L77967 

VVAH «axxes smaii proline— rich protein 

103 

32.609 



with paired repeat 


4695 

X66901 

Mus musculus En-2/lacZ fusion protein 

41 i 

19.444 

4696 

X66901 i 

Mus musculus En-2/lacZ fusion protein 

307 

90.196 

4697 

AJ010857 

Homo sapiens MAFF protein 

1055 

100.000 1 

4698 

AF159055 

Homo sapiens leucine zipper- like 

132 | 

64.516 



protein 


4699 

AF001160 

Homo sapiens G-protein gamma subunit 

44 8 

100.000 

4700 

AJ005559 

Mus musculus SPR2A protein 

53 [ 

53.333 
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4701 

AJ005559 

Mus mus cuius SPR2A protein 

109 

30 . 769 

4702 

AC004991 

Homo sapiens match to spliced EST 
yc79ell.rl (NID: 674336) ; supported by 
Gens can 

482 

100.000 

4703 

S4 84 06 

Mus sp. alpha 1 (XII) collagen 
{triple-helical domain COL2} 

109 

43 . 478 

4704 

U50403 

Homo sapiens breast cancer suppressor 
element Ishmael Upper RP2 

143 

68 . 966 

4705 

U34749 

Bos taurus immunoglobulin heavy chain 
variable region 

66 

31.111 

4706 

X70070 

Homo sapiens neurotensin receptor 

990 

100.000 

4707 

X70070 

Homo sapiens neurotensin receptor 

431 

100.000 

4708 

X70070 

Homo sapiens neurotensin receptor 

46 

50.000 

4709 

X70070 

Homo sapiens neurotensin receptor 

2743 

100 . 000 

4710 

U18339 

Variola virus D4L 

92 

66.667 

4711 

AF093098 

Homo sapiens transcription factor 
TBLYM* T-box transcrinti on f artoi* 
family member 

3821 

100.000 

4712 

AF093098 

Homo sapiens transcription factor 
TBLYM * T-box traiiBcri'Dt l on fart or* 
family member 

1357 

99.465 

4713 

AF093098 . 

Homo sapiens transcription factor 
TBLYM; T-box transcription factor 
family member 

844 

83 . 893 

4714 

AF093098 

Homo sapiens transcription factor 
TBLYM; T-box transcription factor 
family member 

3574 

83 . 178 

4715 

M74421 

human herpesvirus 1 ORF2 

110 

46.512 

4716 

Z71614 

Saccharomyces cerevisiae ORF YNL33 8w 

48 

50.000 

4717 

Z71614 

Saccharomyces cerevisiae ORF YNL338W 

93 

50 . 000 

4718 

U28944 

Caenorhabditis elegans C18A3.7 gene 
product 

92 

27.536 

4719 

X92485 

Plasmodium vivax pval 

169 

81.481 

4720 

U63332 

Homo sapiens super cysteine rich 
protein; SCRP 

168 

62.963 

4721 

AF045940 

Mus mus cuius nitric oxide synthase 

49 

58.333 

4722 

AF045940 

Mus muBCulus nitric oxide synthase 

44 

36 . 842 

4723 

AF045940 

Mus musculus nitric oxide synthase 

72 

55.000 

4724 

Y18423 

Homo sapiens vasoactive intestinal 
polypeptide receptor 2 

1630 

100 000 

4725 

Y18423 

Homo sapiens vasoactive intestinal 
polypeptide receptor 2 

3008 

100 000 

4727 

Y18423 

Homo sapiens vasoactive intestinal 
polypeptide receptor 2 

511 

46 .667 

4728 

S58722 

Homo sapiens X-linxed retinopathy 
protein {c-terminal , clone XEH.8c} 

96 

60.000 

4729 

AE000848 

Methanobacterium thermoautotrophicum 
ribosomal protein L37a 

102 

40.000 

4731 

AB001684 

Chlorella vulgaris ORF54d 

102 

52.632 

4733 

L08193 

Ricinus communis sugar carrier protein 

94 

36.957 

4734 

Z36002 

Saccharomyces cerevisiae ORF YBR134w 

105 

37.931 

4735 

AC003058 

Arabidopsis thaliana hypothetical 
protein 

209 

52.632 

4736 

U26748 

Brassica napus protein kinase BNPK-4 

129 

43 .860 
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4737 

X77639 

Sus scrofa cellular retinol binding 
protein II 

569 

59.398 



Rattus norvegicus MEGF2 

75 

31.707 

4739 

AB011528 

Rattus norvegicus MEGF2 

62 

33.898 

4740 

AB011528 

Rattus norvegicus MEGF2 

8832 

93.353 

4741 

AB011528 

Rattus norvegicus MEGF2 

20430 

90.994 

4743 

X63508 

Mycobacterium tuberculosis predicted 
ORF 

127 

46.809 

4744 

X14776 

Rattus norvegicus TP2 

116 

30.488 

4746 

Z71267 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk275h2.3 comes 
from this gene; cDNA EST yk309gll.3 
comes from this gene; cDNA EST 
yk309gll.5 comes from this gene; cDNA 
EST yk275h2.5 comes from this gene 

77 

45.833 

4747 

271267 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk275h2.3 comes 
from this gene; cDNA EST yk309gll.3 
comes from this gene; cDNA EST 
yk309gll.5 comes from this gene; cDNA 
EST yk275h2.5 comes from this gene 

45 

28.571 

4748 

Z71267 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk275h2.3 comes 
from this gene; cDNA EST yk309gll.3 
comes from this gene; cDNA EST 
yk309gll.5 comes from this gene; cDNA 
EST. yk275h2.5 comes from this gene 

79 

45.833 

4749 

Z71267 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk275h2.3 comes 
from this gene; cDNA EST yk309gll.3 
comes from this gene; cDNA EST 
yk309gll.5 comes from this gene; cDNA 
EST yk275h2.5 comes from this gene 

184 

41.892 


AIi034488 

Unknown similar to DEAD/DEAH box 
helicase ; Helicases conserved C- 
terminal domain; cDNA EST yk313 

1304 

43.568 

4751 

U97553 

murine herpesvirus 68 unknown 

112 

35.632 

Aid 
4752 

Y10696 

Homo sapiens INE1 

381 

100.000 | 

4753 

U28738 

Caenorhabditis elegans T28D9.11 gene 
product 

29 

31.818 

4754 

U28738 

Caenorhabditis elegans T28D9.11 gene 
product 

48 

56.522 

4755 

U28738 

Caenorhabditis elegans T28D9.11 gene 
product 

106 

39.655 

4756 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

187 

55.769 

4757 

M35611 

Hordeum vulgare C-hordein storage 
protein j 

47 

30.000 

4758 

M35611 

Hordeum vulcrare C-hordein atoraap 
protein 

J-JLO 

3 6. 842 

4759 

AF159055 

Homo sapiens leucine zipper-like 
protein 

160 

77.419 

4760 

X99226 

Homo sapiens Fanconi anemia 
complementation group A protein 

62 

25.641 

4761 

X99226 

Homo sapiens Fanconi anemia 

56 

53.846 
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complementation group A protein 



4762 

X99226 

Homo sapiens Fanconi anemia 
complementation group A protein 

9678 

100.000 

4763 

1*10908 

Mus musculus Gcapl gene product 

77 

45.833 

4764 

X52164 

Mus musculus Q300 protein (AA 1-77) 

115 

46.667 

4765 

U93215 

Arabidopsis thai i ana* glycine-rich 
protein isolog 

98 

51.724 

4766 

U93215 

Arabidopsis thaliana glycine-rich 
protein isolog 

88 

56 .522 

4767 

U93215 

Arabidopsis thaliana glycine-rich 
protein isolog 

162 

37 .778 

4768 

Z75532 

Caenorhabditis elegans similar to 
mitochrondrial carrier protein; cDNA 
EST yk264h5.5 comes from this gene 

803 

48 .473 

4769 

Z75532 

Caenorhabditis elegans similar to 
mitochrondrial carrier protein; cDNA 
EST yk264h5.5 comes from this gene 

804 

44.898 

4770 

U57237 

Human immunodeficiency virus type 1 
Tat protein 

106 

36.667 

4772 

M11759 

Lycopersicon esculentum cell wall 
hydroxyproline-rich glycoprotein 

182 

34 .831 

4773 

AF083116 

Homo sapiens paraneoplastic cancer- 
testis-brain antigen 

671 

31 .579 

4774 

AC004381 

Homo sapiens Unknown gene product 

2866 

100.000 

4775 

AL031910 

Leishmania major hypothetical -protein 
L2743.10 

29 

25 .000 

4776 

AL031910 

Leishmania major hypothetical protein 
L2743.10 

80 

73 .333 

4777 

AF025642 

Human immunodeficiency virus type 1 
tat protein 

74 

• 

64 .706 

4778 

L77968 

Ovis aries type II small proline-rich 
protein 

83 

37.838 

4779 

AB029612 

Lactobacillus gasseri gassericin Tl 

96 

53 .333 

4780 

AF153830 

Sus scrofa CD3 zeta chain 

184 

97 .222 

Vl -t D *1 

4781 

AF153830 

Sus scrofa CD3 zeta chain 

146 

100.000 

4782 

AF153830 

Sus scrofa CD 3 zeta chain 

1062 

98.171 

4783 

AC005328 

Homo sapiens R26660 1, partial CDS 

1514 

92 .975 

4784 

X71350 

Xenopus laevis ribosomal protein S27 
homologue 

106 

36.000 

ft / o !d 

Z4 8613 

Saccharomyces cerevisiae unknown 

100 

35 .000 


TTC 0 0 0 0 

Cowpox virus unknown orf 

51 

30.233 

4787 

U59898 

Cowpox virus unknown orf 

100 

38.298 

4/88 

AJ3 88550 

Canis familiaris hypothetical protein 

62 

45.283 

4789 

AJ388550 

Canis familiaris hypothetical protein 

102 

37.097 

4790 

U58658 

Homo sapiens unknown 

238 

60.938 ! 

4791 

X53511 

Chlamydia pneumoniae 9kDa CrP 

60 

34 .091 

4792 

X53511 

Chlamydia pneumoniae . 9kDa CrP 

104 

40 .426 

4794 

AP000061 

hypothetical protein 

125 

27 . 551 

4795 

267753 

Odontella sinensis 50S ribosomal 
protein L23 

106 

29.310 

4796 

Y10148 

Homo sapiens neurotensin receptor 2 

194 8 

77.073 

4799 

Y10148 

Homo sapiens neurotensin receptor 2 

69 

35 .714 

4801 

Y10148 

Homo sapiens neurotensin receptor 2 

55 

2X.277 
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4802 

Y10148 

Homo sapiens neurotensin receptor 2 

j 2733 

100.000 1 

4803 

Y10148 

Homo sapiens neurotensin receptor 2 

50 

39.130 | 


7VT nOAO^C 

Homo sapiens hypothetical protein 

977 

98.667 1 

4805 
4806 

4 Q f\ *7 

U69537 
D29833 

T-v ■-> o Q 1 

Dz9833 

Homo sapiens forkhead transcription 
factor HFH-4 

Homo sapiens proline rich peptide P-B 

2930 
89 

99.525 j 
30.556 | 

*k o U o 
A Q rt Q 

1M9833 

Homo sapiens proline rich peptide P-B 
Homo sapiens proline rich peptide P-B 
Homo sapiens proline rich peptide P-B 

83 
68 
111 

28.750 j 
3 8.235 I 
32.836 

4810 

X51987 

Pisum sativum early nodule protein 
(ENOD2) C-terminal (112 AA) 

j H4 

1 OP Ot A 1 

j Z o . o 14 I 

4811 

AF062529 

Homo sapiens diphosphoinositol 
polyphosphate phosphohydrolase 

738 

76.871 

4812 

U18466 

African swine fever virus pE66L 

35 

38.095 

^ O 13 

TT*1 O A C f 

U184 66 

African swine fever virus pE66L 

44 

23 .077 j 

4814 
4815 

U18466 
U18466 

African swine fever virus pE66L 
African swine fever virus pE66L 

100 

51.852 

4817 

AF084256 

Homo sapiens beta glucuronidase 
isoform d 

51 
121 

25.714 
j 59.375 

4818 

AC003058 

Arabidopsis thaliana hypothetical 
protein 

45 

31.S79 j 

4819 

AC003058 

Arabidopsis thaliana hypothetical 
protein 

37 

20.000 

4820 

AC003058 

Arabidopsis thaliana hypothetical 
protein 

38 

[ 60.000 | 

4821 
4822 

AC003058 
AF107406 

Arabidopsis thaliana hypothetical 
protein 

Homo sapiens GW12 8 

121 

76.000 j 

4Q01 
** O j£ J 

4824 

M17375 
Aif \JZ /bl4 

Gallus gallus type XII collagen 
Gallus gallus type XII collagen 
Homo sapiens zinc finger protein 

107 | 
34 

126 J 
3562 

44.118 I 
71.429 I 
33.721 j 
100. 000 

4826 
4827 
4828 

X60435 
X92896 
X66236 

Homo sapiens PACAP precursor 
Homo sapiens ITBA2 
Mus musculus 185 kJDa 
glycophosphoprotein 

1176 1 
124 j 
98 

100. 000 
32.000 j 
40.741 I 

4829 

U16359 

Rattus norvegicus nitric oxide ! 
synthase 

146 

51.111 j 

4830 

L02785 

Homo sapiens Nuclear localization I 
signal at AA 569-573, 576-580, 579- 
583; acidic transcr. activ. domain 
620-640,; homeobox motif 653-676 j 

46 

50.000 

4831 

L02785 

Homo sapiens Nuclear localization j 
signal at AA 569-573, 576-580, 579- 
583; acidic transcr. activ. domain 
620-640,; homeobox motif 653-676 | 

44 

23.077 

4832 

L02785 

Homo sapiens Nuclear localization 
signal at AA 569-573, 576-580, 579- 
583; acidic transcr. activ. domain 
620 — 640. .* homeoboY mrv^i f ceo _/r"7<r 1 

45 

42.105 j 

4833 

L02785 

Homo sapiens Nuclear localization J 
signal at AA 569-573, 576-580, 579- 
5 83; acidic transcr. activ. domain 
620-640,; homeobox motif 653-676 1 

332 

35.567 | 

4834 

L02785 

Homo sapiens Nuclear localization i 
signal at AA 569-573, 576-580, 579- | 

57 

24.138 
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583; acidic transcr. activ. domain 
620-640, ; homeobox motif 653-676 



4835 

L02785 

Homo sapiens Nuclear localization 
signal at AA 569-573, 576-580, 579- 
583; acidic transcr. activ. domain 
620-640, ; homeobox motif 653-676 

648 

30.853 


ACQ 044 16 

Homo sapiens 

WUGSC:H_RG013N12.gw. 1335199. a gene 
proQucc 

109 

51.613 

4837 

S79410 

Mus sp. nuclear localization signals 
\nijs>) -Dinaing protein=spot- 1 

43 

23 .913 

4838 

S79410 

Mus sp. nuclear localization signals 
ixvijos -ninuing protein=spot-l 

157 

88 .000 

4839 

AF146738 

Rattus norvegicus testis specific 
protein 

119 

30.000 

4 84 0 

U w J J OO 

iiomo sapiens zinc ringer protein 

*-*vi r 1 j j 

1864 

52.834 

4841 

X55683 

Lycopersicon esculentum extensin 

41 

66.667 

4842 

X55683 

Lycopersicon esculentum extensin 

135 

46.341 

4843 

AF018639 

Dictyostelium discoideum PprA 

135 

34.091 . 

*± O *± *4 

MX UiODJ 2* 

Dictyostelium discoideum PprA 

154 

37.234 

** 3 

\T(\ 1CT) 
VUlO J ^5 

Cercopithecidae gen. sp* 
luctaiiOLiiionein i^J 

307 

92.105 
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cercopitneciaae gen. sp. 

metallothionpin f } 

314 

47.368 

4847 

AF174487 

xai_^iu«^ oapicllU ~rclatcQ OV a. i 1 cl 11 

xiller protein -. 

13 99 

100 . 000 

4848 

U93023 

Drosophila soonae hunchback protein 

91 

25.806 

4849 

U93023 


32 

50 .000 

4850 

U93023 

Drosophila soonae hunchback protein 

99 

30.769 

4851 

AB000792 

sua acroia mecai loumonein isororm 

139 

81.818 

4852 

AB000792 

Sus scrofa metallothionein isoform 

65 

29.630 

4853 


Sus scrofa metallothionein isoform ! 

141 

78.261 

4855 

U82598 

Escherichia coli 2,3- 
dihydrosybenzoate-AMP ligase 

59 

31.818 

4857 

U82598 

Escherichia coli 2,3- 
dihydrosybenzoate -AMP ligase 

60 

3 0.928 

4859 

U82598 

Escherichia coli 2,3- 
ainyarosyoenzoate -amp ligase 

49 

50.000 

4660 

U82598 

Escherichia coli 2,3- 
tiiiiyciiosyjDenzoace — amjt ugase 

69 

37.209 

4661 

U82598 

Escherichia coli 2,3- 

ui jiyarosyDenzoa te - AMr ligase 

64 

26.415 

4862 

U82598 

Escherichia coli 2,3- 
amyarosyoenzoace -amp ligase 

52 

35.000 I 

4863 

U82598 

Escherichia coli 2,3- 

di hydro sybenz oat e -AMP ligase 

58 

52.632 

4864 

U82598 

Escherichia coli 2,3- 
dihydrosybenzoate-AMP ligase 

52 

21.875 

4866 

U82598 

Escherichia coli 2,3- 
dihydrosybenzoate-AMP ligase 

62 

31.325 

4868 

U82598 

Escherichia coli 2,3- 
dihydrosybenzoate-AMP ligase 

51 

42.105 
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4869 

U82598 

JCi^i- Jlc!X JLUnia. COll jL t ^ — 
dihvdros vben 7 r>a i* p - AMP T-tnaao 

j 52 

38.462 

4870 

U82598 

Escherichia rnl \ O "X - 
dihydrosybenzoate -AMP ligase 

1 ^ 

36. 000 

4872 

U82598 

Escherichia coli 2,3- 
dihvdrosvbenzoate-AMP liaA«e» 

2233 

100.000 

4873 

AF017777 

Drosophila melanogaster helicase 

43 

33.333 

4874 

AF017777 

Drosoohila mslanoaaRfpr HaI {raaA 

1 40 

25 . 000 

4875 

AF017777 

Drosophila melanogaster helicase 

1448 

49.012 

4876 

AP017777 

DrOsonh i 1 a mp» 1 annrta ohor Kal 1' nae^ 

1 1490 

49.798 

4877 

M31607 

Homo sapiens H-chain C-region MAL 
Drotein mRNA 

87 

58.333 

4878 

AF057285 

Mus musculus intersectin-EH binding 

iUUCXU J. l~f£J Jl. 

69 

37.500 

4879 

Y10141 

Homo sapiens variable number tandem 
reoeat (VNTR) 

100 

31.579 j 

4880 

Y10141 

Homo sapiens variable number tandem 
reneat ( VWTR ^ 

123 

32.353 

4881 

AF159055 

Homo sapiens leucine zipper- like 
orotein 

151 

82.759 

4882 

U16359 

synthase 

1 9 * 

57.143 

4883 

X70811 

receptor 

1 2074 

82.090 

4884 

X70811 

Homo saoiens Beta aHrpuprrr-i «-> 
receptor 

2812 

100. 000 

4885 

U79260 

Homo sapiens unknown 

All 

63 .492 

4886 

D00596 

Homo sapiens thymidylate synthase 

DDI 

81 . 250 

4887 

D00596 

Homo sapiens thymidylate synthase 

oi .1 n 

98 .4 08 

4888 

D00596 

Homo sapiens thymidylate synthase 

•3/1 

8 8 . 525 

4889 

Y08265 

Homo sapiens DAN26 

CO*) 

100 . 000 

4890 

AF008196 

Homo sapiens bax epsilon | 


=>*k . 545 

4891 

L10908 

Mus musculus Gcapl gene product 


3 0. 263 

4892 

AP105371 

Bos taurus neutrophil beta-def ens in 12 I 

jri 

34 . 211 

4893 

AF105371 

Bos taurus neutrophil beta-def ensin 12 | 

67 

35.000 

4894 

AF105371 

Bos taurus neutroDhil beta-defpnflin i*? 1 


33 .333 

4895 

AF105371 

Bos taurus neutrophil beta-def ensin 12 j 

46 

45.833 

4896 

AF105371 

Bos taurus neutiror*VH 1 hpta-Hofonn-Cn io 1 

48 

33.333 j 

4898 

AF105371 

Bos taurus neutrophil beta-def ensin 12 | 

110 1 

39.216 

4899 

AB019241 


108 

32.857 

4900 

AF159055 

Homo sapiens leucine zipper-like 
protein f 

115 

52.500 

4901 

M14170 

Homo sapiens preplacental alkaline 
DhosDhataRp (IZC 1 1 ** i } 1 

434 

98.507 

4902 

M14170 

phosphatase (EC 3.1.3.1) 

1125 

77 . 682 

4903 

M14170 

Homo nairl <=»n*3 y\y~ot~\ 1 -a £i-r-» t- i -«i v^i i 1 

uvjtuw oca^jxcj.*» ^»xrepj-a.cenuajL aiKanne 
phosphatase (EC 3.1.3.1) | 

3553 

100.000 

4904 

U16359 

Rattus norvegicus nitric oxide 

synthase I 

124 

85.000 

4905 

S71386 

Sus scrofa glucose transporter 4 | 

82 

51.852 

4906 

AB001111 

Archaeoglobus fulgidus A. fulgidus 
predicted coding region AF23 92 | 

56 

46.667 

4907 

S58722 

Homo sapiens X-linked retinopathy 

210 

46.970 


10878 
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protein {C-terminal, clone XEH.8c} 



4908 

S79410 

Mus sp. nuclear localization signals 
(NLS) -binding protein=spot-l 

125 

44.444 

4909 

Z81105 

Caenorhabditis elegans R05D7.3 

155 

36.364 

4910 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

40 

33.333 

4911 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

40 

33.333 

4912 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

36 

33.333 

4913 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

47 

27.500 

4914 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

40 

33 .333 

4915 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

387 

50.000 

4916 

AB016080 

Mus musculus calcium binding protein 
Kip 2 

44 

46.667 


Y 12670 

Homo sapiens leptin receptor gene- 
related protein 

196 

100.000 

4918 

Y12670 

Homo sapiens leptin receptor gene- 
related protein 

657 

87.857 

4919 

Y13141 

Bromheadia f inlaysoniana extensin 

90 

56.522 

4920 

X52164 

Mus musculus Q300 protein (AA 1-77) 

39 

31.250 j 

4921 

X52164 

Mus musculus Q300 protein (AA 1-77) 

111 

78.571 

4922 

S81809 

Homo sapiens basic fibroblast growth 
factor, bFGF 

104 

39.706 

4923 

U16359 

Rattus norvegicus nitric oxide 
synthase 

96 

80.000 

4924 

AL034567 

Arabidopsis thaliana nodulin-like 
protein 

134 

40 .000 

4925 

AB025355 

Homo sapiens SOX20 

71 

50.000 ! 

4927 

U31238 

Hepatitis C virus core protein 

111 

33 .962 

4928 

AF016446 

Caenorhabditis ejegans similar to C. 
elegans cuticlin 2 precursor CUT-2 
\SP : P34682 ) 

82 

57.895 

4929 

AC004022 

Homo sapiens Neurabin-like; similar to 

U72994 \PID : g2623757) 

464 

44 .898 

v 

Cj£, DZ 

Saccharomyces cerevisiae ORF YKR047w 

69 

36.066 

"x J j X 


Saccharornyces cerevisiae ORF YKR047w 

94 i 

37.209 C 



Homo sapiens hypothetical protein I 

31 

22 .727 



Homo sapiens hypothetical protein 

275 

100. 000 

4935 

X55686 

Lycopersicon esculentum extensin 
(class II) 

91 

52 .381 

*± J O t> 


Gallus gallus alpha-1 collagen type IX 

107 

36.364 

4938 

U60024 

Ovis aries BIIIA3 

41 

66 .667 

493 9 

U60024 

Ovis aries BIIIA3 

72 

37.143 ! 

, 4940 

U78969 

Qti^wwvf^uo v/'-'yciico uujuiowxx 

96 

37 . 879 

4941 

AL032637 

Caenorhabditis elegans cDNA EST 
yk367gl2.5 comes from this gene; cDNA 
EST yk367g!2.3 comes from this gene 

105 

51.515 

4942 

U23523 

Caenorhabditis elegans F53A9.1 gene 
product [ 

96 

71 .429 

4943 

AL022729 

Homo sapiens hypothetical protein 

212 

96.875 
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4944 

D83735 

Homo sapiens neutral calponin 

1347 

84. 921 

4945 

D83735 

Homo sapiens neutral calponin 

314 

71.429 

4946 

AJ005564 

Mus musculus SPR2F protein 

64 

29.730 

4947 

AJ005564 

Mus musculus SPR2P protein 

141 

39.394 

4948 

AJ005564 

Mus musculus SPR2F protein 

84 

52.174 

4949 

Y00664 

Homo sapiens open reading frame 1 (AA 
1-86) 

110 

45.652 

4950 

AB033017 

Homo sapiens KIAA1191 protein 

44 

66.667 

4952 

AB033017 

Homo sapiens KIAA1191 protein 

57 

53.846 

4953 

AB033017 

Homo sapiens KIAA1191 protein 

623 

94.118 

4954 

X97278 

Danio rerio metallothionein 

77 

38.235 

4955 

U27113 

Simulium nyssa NADH dehydrogenase 
subunit 4 

65 

47.059 

4957 

M21353 

Homo sapiens alpha-2 type I collagen 

44 

37.500 

4958 

M21353 

Homo sapiens alpha-2 type I collagen 

82 

58.333 

4959 

U63332 

Homo sapiens super cysteine rich 
protein; SCRP 

71 

36.364 

4960 

D90857 

Escherichia coli menD protein 

34 

30.000 

4961 

D90857 

Escherichia coli menD protein 

106 

34.000 

4962 

AB028992 

Homo sapiens KIAA1069 protein 

68 

28.333 

4963 

AB028992 

Homo sapiens KIAA1069 protein 

49 

19.231 

4964 

AB028992 

Homo sapiens KIAA1069 protein 

57 

39.130 

4965 

AB028992 

Homo sapiens KIAA1069 protein 

2525 

94.595 

4966 

AB028992 

Homo sapiens KIAA1069 protein 

57 

30.000 

4967 

AB028992 

Homo sapiens KIAA1069 protein 

5068 

100. 000 

4968 

Z97210 

Schizosaccharomyces pombe hypothetical 
protein 

75 

22.222 

4969 

Z97210 

Schizosaccharomyces pombe hypothetical 
protein 

37 

38.889 

4970 

Z97210 

Schizosaccharomyces pombe hypothetical 
protein 

201 

40.196 

4971 

U80569 

Human immunodeficiency virus type 1 
tat protein 

75 

30.769 

4972 

M24900 

Homo sapiens triiodothyronine receptor 

29 

16.000 

4974 

M24900 

Homo sapiens triiodothyronine receptor 

413 

88.312 

4975 

M24900 

Homo sapiens triiodothyronine receptor 

585 

96.774 

4976 

M24900 

Homo sapiens triiodothyronine receptor 

444 

88.235 

4978 

L32783 

Gallus gallus protein- tyrosine 
phosphatase 

71 

53.846 

4979 

AF110963 

Human immunodeficiency virus type 1 
Tat 

120 

32.000 

4980 

U42581 

Rattus norvegicus p85 

71 

32.258 

4983 

Y14967 

Mycobacterium leprae probable ESAT.-6- 
like protein 

102 

38.298 

4984 

AB007861 

Homo sapiens KIAA0401 

63 

43.333 

4985 

AB007861 

Homo sapiens KIAA0401 

2399 

100.000 

4986 

X17622 

Homo sapiens put. HBK2 protein (AA 1- 
529) 

3570 

100.000 

4987 

X17622 

Homo sapiens put. HBK2 protein . (AA 1- 
529) 

3570 

100.000 

4988 

A65268 

unidentified unnamed protein product 

69 

48.000 

4989 

X79828 

Mus musculus NK10 1 

3084 

8 0.855 

4990 

U79260 

Homo sapiens unknown 

285 

50.806 

4991 

AB023188 

Homo sapiens KIAA0971 protein 

4094 

99.692 


I0V«5 
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4992 

AB023188 

Homo sapiens KIAA0971 protein 

345 

72.381 

4993 

X92485 

Plasmodium vivax pval 

46 

22.222 

4994 

X92485 

Plasmodium vivax pval 

345 

55.670 

4996 

X61046 

Hydra sp. mini -collagen 

68 

37.037 

4997 

X61046 

Hydra sp. mini -collagen 

161 

33.981 

4998 

X61046 

Hydra sp. mini-collagen 

154 

36.264 

5001 

AF071081 

Mycobacterium tuberculosis proline- 
rich mucin homolog 

345 

31.398 

5002 

246970 

Leishmania mexicana secreted acid 
phosphatase 2 (SAP2) 

93 

23.103 

5003 

Z46970 

Leishmania mexicana secreted acid 
phosphatase 2 (SAP 2 ) 

.99 

21.466 

5004 

Z46970 

Leishmania mexicana secreted acid 
phosphatase 2 (SAP2) 

773 

38.202 

5005 

AF146793 

Mus mus cuius PFT27 

525 

'59.669 

5006 

AF146793 

Mus musculus PFT27 

324 

76.119 

5007 

AF146793 

Mus musculus PFT27 

1357 

85.449 

5008 

AF190797 

Mus musculus actin-related protein 11 

52 

23.077 

5009 

AF190797 

Mus musculus actin-related protein 11 

258 

50.000 

5010 

AF190797 

Mus musculus actin-related protein 11 

45 

66.667 

5012 

AF190797 

Mus musculus actin-related protein 11 

493 

59.355 

5013 

AJ225898 

Oryctolagus cuniculus translationally 
controlled tumor protein 

250 

67.470 

5014 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

316 

59.813 

5015 

D86983 

Homo sapiens similar to D.melanogaster 
peroxidasin(U11052) 

2781 

60.089 

5016 

D86983 

Homo sapiens similar to D.melanogaster 
peroxidasin (U11052) 

5275 

93.510 

5017 

D86983 

Homo sapiens similar to D.melanogaster 
peroxidasin (U11052 ) 

9621 

98.663 

5019 

U44839 

Homo sapiens UHX1 protein 

311 

96.154 

5020 

U44839 

Homo sapiens UHX1 protein 

698 

59.746 

5021 

U44839 

Homo sapiens UHX1 protein 

3567 

84.767 

5022 

X02761 

Homo sapiens fibronectin precursor 

3014 

83.538 

5023 

X02761 

Homo sapiens fibronectin precursor 

6971 

95.175 

5024 

X02761 

Homo sapiens fibronectin. precursor 

574 

100.000 

5025 

X02761 

Homo sapiens fibronectin precursor 

5136 

96.091 

5026 

X02761 

Homo sapiens fibronectin precursor 

4869 

87.068 

5027 

X02761 

Homo sapiens fibronectin precursor 

15815 

96.238 

5028 

X02761 

Homo sapiens fibronectin precursor 

14788 

97.469 

5029 

J04628 

Rattus norvegicus 3- hydroxy! sobutyr ate 
dehydrogenase 

456 

97.143 

5030 

J04628 

Rattus norvegicus 3-hydroxyisobutyrate 
dehydrogenase 

805 

83.226 

5031 

J04628 

Rattus norvegicus 3-hydroxyisobutyrate 
dehydrogenase 

1325 

68.280 

5033 

X57958 

Homo sapiens ribosomal protein L7 

720 

75.000 

5034 

X05006 

Homo sapiens S-protein 

302 

100.000 

5035 

X05006 

Homo sapiens S-protein 

2003 

84.940 

5037 

M19419 

Mus musculus proline-rich salivary 
protein 

66 

38.462 

5038 

M19419 

Mus musculus proline-rich salivary 
protein 

176 

40.741 
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5039 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

415 

62.698 

5040 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

54 

28.000 

5041 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

157 

68.421 

5042 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

580 

56.650 

5043 

M11902 

Mus musculus proline -rich salivary 
protein 

51 

40.000 

5044 

M11902 

Mus musculus proline-rich salivary- 
protein 

117 

35.000 

5046 

M11902 

Mus musculus proline-rich salivary 
protein 

198 

32.353 

5047 

M11902 

Mus musculus proline-rich salivary 
protein 

121 

36.082 

5048 

M11902 

Mus musculus proline-rich salivary 
protein 

184 

39.130 

5049 

M11902 

Mus musculus proline-rich salivary 
protein 

140 

31.250 

5050 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

81 

71.429 

5051 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

108 

55.882 

5052 

AF010144 . 

Homo sapiens neuronal thread protein 
AD7C-NTP 

181 

51.316 

5053 

U79260 

Homo sapiens unknown 

340 

65.217 

5054 

U79260 

Homo sapiens unknown 

193 

48.454 

5055 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

395 

69.149 

5056 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

276 

61.538 

5057 

U58652 

Oryctolagus cuniculus ubiquitin- 
conjugating enzyme E2-32k 

57 

46.875 

5058 

U58652 

Oryctolagus cuniculus ubiquitin- 
conjugating enzyme E2-32k 

892 

7 0 . 612 

5059 

U88968 

Homo sapiens alpha enolase like 1 

370 

4 9 . 749 

5060 

?v i~* A A "i A c 0 

Arahidopsis thaliana unknown protein 

o c c 

-a a (no 

5061 

U22376 

Homo sapiens alternatively spliced 
proauCL us my cxon ijh 

40 J 

/ 4 . lZ> / 

C A C *1 

5 062 

D25215 

UnmA tn -s v\ A a n a VTAAAA"i*> 

Homo sapiens jviaauuj^ 


a n 1 1 ft 

4U. 11U 

c a £ *a 
3 Ob J 


nomo sapiens JM>yiiuji protein 

^ QO 

c n ft ft n 

DUO* 

A15U2 

xiomo sapiens Mnnxu^x protein 

DD O 

t» O • D^flk 



nomo sapiens maaujsu 


1 ft R £ A 
.3 \J . DDI 

JUDO 

AoUuZJ^ 0 

xiomo sapiens rvx.tt>iu j d u 

1 £A 


!>Uo/ 

A 13 A A O "3 A O 

nomo sapiens js.±i\i\\j jd \j 

1/3 

1 ftft ftftft 

Z> U OO 

AT5 A AO *5 vl 0 

xiomo sapiens mahujdu 


. % J J 

5070 

AB00234 8 

Homo sariipris KTAA035G 

1026 

63.696 

5071 

AB002348 

Homo sapiens KIAA0350 

2554 

69.711 

5072 

AB002348 

Homo sapiens KIAA0350 

5705 

97.289 

5073 

AF086790 

Homo sapiens aconitase precursor 

4046 

100. 000 

5074 

AF086790 

Homo sapiens aconitase precursor 

246 

69.444 

5075 

AF086790 

Homo sapiens aconitase precursor 

3541 

91.405 

5076 

AL023854 

Caenorhabditis elegans similar to 

45 

38.710 


x o cp m H + "+ s . o e es is o b 




Regulator of chromosome condensation 
(RCC1) ; cDNA EST yk246c2.5 comes from 
this gene 



5077 

AL023854 

Caenorhabditis elegans similar to 
Regulator of chromosome . condensation 
(RCC1) ; cDNA EST yk246c2.5 comes from 
this gene 

52 

27.119 

5079 

AL023854 

Caenorhabditis elegans similar to 
Regulator of chromosome condensation 
(RCC1) ; cDNA EST yk246c2.5 comes from 
this gene 

312 

37.104 

C ADA 

M15 577 J 

Human herpesvirus 4 unknown protein 

91 

28.302 


Ml c Oil 

Human herpesvirus 4 unknown protein 

127 

26. 891 

DUO/ 


Homo sapiens alternatively spliced, 

327 

66.292 


U<£ A J / O 

fiomo sapiens aiternativei.y spiiceu 

JjJ JL UV* Uo -L 1 1 ^ CAUli J J r\ 

401 

66 . 000 

5084 

anno ■> o^i 

xiomo oopicHo ivj./\m. j. ^ u / protein 


100 . 000 

5085 


nomu oapiciiB ivi/\M.izis / protein 

9434 

96 . 230 

J v OD 


xioiiio sapiens a.i/\a.i^u / protein 

63 

22 . 115 

3 v o / 

nOU J juj j 

nomo sapiens j\.i/\/iizu / protein 

57 

28 . 125 

5086 

nOUJ JUJ J 

numo o dp j. cim i\.a.*\j-\ x ^ t> / piotcin 

446 

100 . 000 

5089 


ntHiiu octpiciio Airtrtizu / protein 

54 

2 1 . 978 

5090 

/VD u J J v J J 

XlOlllO bapxcuD IVlnnX ^ f / piOLcin 

746 

97 . 273 

5091 

/vd v/ j j y j j 

xiumo oapiclJU w / protein 


20 . 213 

5093 

nuV O O \I o ^ 

nuuiu DapiciiQ i\.iiAHizu / protein 

5 1 

32 . 500 

5094 

AV-D W _J _j \J O O 

nuiiiu bapiciis itinni « u / protein 

47 

25 . 287 

5095 

AB033033 

Homo sapiens KIAA1207 protein 

55 

30.233 


zvnm inn 

Homo sapiens KIAA1207 protein 

60 

31.034 

5097 

AB033033 

Homo sapiens KIAA1207 protein 

1632 

89.103 

5098 

AF159297 

Zea mays extensin-like protein 

81 

36.000 

5099 

AF159297 

Zea mays extensin-like protein 

109 

35.294 

5100 

AF159297 

Zea mays extensin-like protein 

324 

24.430 

5101 

X06323 

Homo sapiens put. ribosomal protein L3 
(AA 1 - 348) 

590 

67.232 

5103 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

379 

67.000 

5104 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

407 

70.103 

5105 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

318 

63.736 

5106 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

390 

72.527 

r i a»j 
blU / 

D87914 

Homo sapiens ornithine decarboxylase 
antizyme 

797 

74.611 

5108 

D87914 

Homo sapiens ornithine decarboxylase 
antizyme 

494 

65.333 

5109 

087914 

HnitlO Rani P71R nm ~\ t"V)i np HprarVw^wl aeo 

antizyme 

Aft! 

53 . 710 

5110 

U79260 

Homo sapiens unknown 

370 

71.739 j 

5111 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

308 

70.588 

5112 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

400 

72.632 


5113 

AF010144 

Homo .sapiens neuronal thread protein 
AD7c-NTP 

481 

54.419 

5114 

U79260 

Homo sapiens unknown 

383 

68.817 

5116 

AC005532 

Homo sapiens supported by GENSCAN 
prediction and spliced EST; similar to 
Z35641 (PID:g3874821) and AI059600 
(NID : g3 33 3377) 

1657 

93.455 

5117 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

175 

48.235 

5118 

AF010144 

Homo sapiens neuronal thread protein 

AD7C-NTP 

91 

26.087 

5119 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

445 

48.333 

5120 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

344 

58.416 

5121 

AF010144 

Homo sapiens neuronal thread protein 

AD7C-NTP 

455 

58.228 

5122 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

195 

60.000 

5123 

AF010144 

Homo sapiens neuronal thread protein 

t\ Y-\ ft \|||||> 

AD7C-NTP 

51 

31.507 

5124 

AF010144 

Homo sapiens neuronal' thread protein 
AD7C-NTP 

340 

44.937 

5125 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

260 

55 . 238 

5126 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

112 

3 7 . 647 

5127 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

362 

63 . 000 

5128 

X92485 

Plasmodium vivax pval 

283 

45 . 775 

5129 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

52 

47.368 

5131 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

193 

47.674 

5133 

AL031349 

Schizosaccharomyces pombe conserved 
nypotnetxcai protein 

560 

51.337 

5134 

U58652 

Oryctolagus cuniculus ubiquitin- 
conjugating enzyme E2-32k 

947 

75.502 

5136 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

401 

78.750 

5137 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

271 

64.557 

5138 

Z68302 

Caenorhabditis elegans similar to 
Histidine acid phosphatases; cDNA EST \ 
yk432al.3 comes from this gene; cDNA 
EST yk432al.5 comes from this gene; 
cdna est yK504a7 . 3 comes irora cms 
gene; cDNA EST yk561d!2.3 comes from 
thi s * gene * " 

202 

25.466 

5139 

Z71316 

Saccharomyces cerevisiae ORF YNL04 Ow 

80 

31.579 

5140 

Z71316 

Saccharomyces cerevisiae ORF YNL04 0W 

398 

28.256 

5141 

AF138957 

Bos taurus type II collogen cyanogen 
bromide fragment CB8 

107 

36.782 

5142 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

305 

57.292 
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5143 

U22376 

Hpmo sapiens alternatively spliced 
product using exon 13 A 

385 

64 .948 

5144 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

?03 

62 .500 

5145 

U79260 

Homo sapiens unknown 

142 

62.500 

5146 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP . 

361 

69.072 

5147 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

374 

71 .429 

5148 

S60312 

Mus sp. DMR-N9 (C- terminal} 

688 

52.913 

5150 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

394 

75 . 000 

5151 

AL109731 

Schizosaccharomyces pombe puative ATP- 
dependent RNA helicase cdc2 8 

132 

31.313 

5152 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

304 

63.014 

5154 

L21990 

Homo. sapiens spiceosomal protein 

262 

30.385 

5155 

AF163772 

Leishmania major 713 8.2 

403 

39 . 698 

5156 

M11902 

Mus musculus proline-rich salivary 
protein 

236 

31.731 

5157 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

429 

72 . 9.17 

5158 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

396 

66.000 

5160 

U79260 

Homo sapiens unknown 

327 

64 . 894 

5161 

AF170303 

Mus musculus nuclear body associated 
kinase 2a 

2786 

84 .865 

5162 

M74495 

Mus musculus adenylosuccinate 
synthetase 

124 

51.724 

5163 

M74495 

Mus musculus adenylosuccinate 
synthetase 

1780 

85 . 965 

5164 

X92485 

Plasmodium vivax pval 

198 

64 . 000 

5166 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

667 

60 . 847 

5167 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isof orm 

■a e o 

3 DO 

O O • bUO 

5168 

AJ24 8288 

Pyrococcus abyssi hypothetical protein 

1 O *7 

1 1 . DO / 

5169 

AJ248288 

Pyrococcus abyssi hypothetical protein 



5170 

AJ248288 

Pyrococcus abyssi hypothetical protein 

D*i 

Z 7 . zoo 

5171 

A.J248288 

Pyrococcus abyssi hypothetical protein 

3D 

TO OOQ 
Z O . O O J 

5172 

AJ248268 

Pyrococcus abyssi hypothetical protein 

"5 O T 

j 3 * y t x 

5173 

AJ248268 

Pyrococcus abyssi hypothetical protein 

o *a ft 

1 A ft QA 

5174 

U90908 

Homo sapiens unknown 

83 

37.500 

5175 

U90908 

Homo sapiens unknown 

265 


5176 

U90908 

Homo sapiens unknown 

382 

31.429 j 

5177 

AL049929 

Homo sapiens hypothetical protein 

765 

64 . 384 

5178 

AL049929 

Homo sapiens hypothetical protein 

865 

7 8.947 

5179 

AL049929 

Homo sapiens nypocneticax protein 



5180 

U41557 

Caenorhabditis elegans proline and 
glycine-rich 

141 

27.976 

5181 

U41557 

Caenorhabditis elegans proline and 
glycine-rich 

73 

29.592 

5182 

U41557 

Caenorhabditis elegans proline and 
glycine -rich 

179 

31.765 


i M8T 


5183 

U41557 

Caenorhabditis elegans proline and 
glycine-rich 

310 

30.263 

5184 

U97191 

Caenorhabditis elegans similar to 
nucleoporins 

266 

29.515 

5185 

AJ011812 

Homo sapiens transcription factor NRF 

287 

28.470 

5187 

AJ003087 

Homo sapiens liver glycogen synthase 

4341 

95.352 

5188 

AJ003087 

Homo sapiens liver glycogen synthase 

1283 ! 

77.778 

5189 

AJ003087 

Homo sapiens liver glycogen synthase 

285 

79.032 

5190 

283227 

Caenorhabditis elegans predicted using 
Genefinder; Weak similarity to high- 
sulphur keratins.; cDNA EST yk663al.3 
comes from this gene 

205 

32.026 

5191 

U56964 

Caenorhabditis elegans weak similarity 
to S. cerevisiae intracellular protein 
transport protein US)1 (SP:P25386) 

192 

50.820 

5192 

U56964 

Caenorhabditis elegans weak similarity 
to S. cerevisiae intracellular protein 
transport protein US)1 (SP:P253 86) 

760 

36.034 

5193 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

403 

68 . 421 

5194 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

356 

73 . 864 

5195 

U93564 

Homo sapiens p4 0 

749 

94 . 531 

5196 

U09874 

Mus musculus SKD3 

1223 

97.884 

5197 

U09874 

Mus musculus SKD3 

1052 

97 .531 

5198 

U09874 

Mus musculus SKD3 

49 

45.833 

5199 

U09874 

Mus musculus SKD3 

1183 

89.069 

5200 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

276 

42 .328 

5201 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

357 

64 .948 

5202 

AF092449 

Heterodera glycines mucin- like protein 

152 

31.250 

5205 

U02020 

Homo sapiens pre-B cell enhancing 
factor 

437 

93 .902 

5206 

AL080220 

Homo sapiens hypothetical protein 

313 

100 . 000 

5207 

AL080220 

Homo sapiens hypothetical protein 

2197 

96.089 

5208 

M15888 

Bos taurus endozepine-related protein 
precursor 

1906 

76 . 804 

5210 

AF104413 

Homo sapiens large tumor suppressor 1 

44 

62 . 500 

5211 

AF104413 

Homo sapiens large tumor suppressor 1 

3434 

52 .845 

5212 

AF104413 

Homo sapiens large tumor suppressor 1 " 

84 

29.412 

5213 

L26290 

Caenorhabditis elegans clathrin- 
associated protein homblogue 

237 

70.513 

5214 

M27826 

Homo sapiens neutral protease large 
subunit 

244 

57 . 333 

5215 

M27826 

Homo sapiens neutral protease large 
subunit 

646 

71 . 523 

D Z. ± D 

UO / / 


861 

99 . 270 

5217 

U67734 

Homo sapiens cPLA2 interacting protein 

47 

33 .333 

5218 

U67734 

Homo sapiens cPLA2 interacting- protein 

3125 

99.783 

5219 

U67734 

Homo sapiens cPLA2 interacting protein 

2748 

87.084 

5220 

U67734 

Homo sapiens cPLA2 interacting protein 

616 

62 .000 

5221 

X92485 

Plasmodium vivax pval 

270 

57.955 

5222 

S58722 

Homo sapiens X- linked retinopathy 

269 

67.164 
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protein {C- terminal, clone XEH.8c) 



5223 

S5 9722 

Homo sapiens X- linked retinopathy 
protein {c-terminal f clone XEH.8c} 

28Q 

76. 119 

5224 

AF098504 

Caenorhabdit is elegans contains 

similarity to protein kinases 

( Pf am : pkinase . hmm, score: 14 9.36) 

391 

32 .484 

5226 

J05031 

Rattus norvegicus isovaleryl-CoA 
aenyarogenase precursor i .i . . 1U/ 

2238 

87. 79* 

J dt •& / 


KaLLUb IIUL VcyiCUb iaOValciy X'LOn 

Q C A 

70 . 035 

5228 

AF002281 

♦ »Jt W V mm mm W A* 

Rattus norveaictifl aloha -Aft* iTiin-? 
associated LIM protein 



5229 

AF026689 

Homo saoiens Drostate-softcif ic 
transglutaminase 

101 

4 Q no ft 

5232 

X92485 

Plasmodium vivax pval 

231 

54 . 545 

5233 

AF132150 

Drosophila melanogaster unknown 

351 

44 . 615 

5234 

AF132150 

Drosophila melanogaster unknown 

586 

41.126 

5235 

AF132150 

Drosophila melanogaster unknown 

352 

44 .604 

5236 

Y10372 


f OS 


5237 

Y10372 

Bos taurus actin-binding protein CP3 

604 

94.949 

5238 

Y10372 

Bos taurus actin-binding protein CP3 

326 

100.000 


1 10 J 72 

Bos taurus actin-binding protein CP3 

1217 

78 . 641 

5241 

U46751 

Homo sapiens phospho tyrosine 
independent ligand for the Lck SH2 
domain p62 

318 

67.857 

5242 

U4 6751 

Homo sapiens phospho tyrosine 
maepenaent _Ligana tor tne LcK SH2 

/-3 aim n •* « £1 O 

620 

54.J53 

5243 

U46751 

T-T/~\TTif-\ o ra t*"> \ pncj T^Vir^ ct%Vi ajno 

nuiiiu Dctpxciia piiubpxiuLy xubine 
i ndpDfindRnt liaand for the* T.r*tr 
domain p62 


90 . 625 

5244 

AF057019 

Dictyostelium discoideum interaptin 

74 

21, 569 

5245 

AF057019 

Dictyostelium discoideum interaptin 

275 

19 . 666 

5249 

X92485 

Plasmodium vivax pval 

345 

60.825 

5250 

X71442 

Rattus norveaicua ORF 1 • nn t~ pi t~ S vp 

o o 

4 o rinn 

5251 

X71442 

Rattus norveaicus ORP 1 • nntat i vp 

4 Q 


5252 

X71442 

Rattus norveaicua ORP 1 • riii r* a t* i 

^ fl 
J O 

■in nnn 

5253 

X71442 

Rattus norveaicua ORP 1? rjutativp 


O ^ m JUU 

5254 

X71442 

Rattus norvegicus ORP 1; putative 

192 

4 ^ 1 *?4 

5255 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

422 

45.856 

5258 

AB020656 

Homo sapiens KIAA084 9 protein 

867 

86.228 

5259 

AB020656 

Homo nan^ns TCTAA0R4Q r^r*ot**»4« 

D / ^5 • 

XUQ. 000 

5.260 

AB020656 

Homo sapiens KIAA084 9 protein 

320 

100.000 

5262 

rVD v O DO 

nuiuu uapicuo ivihauo^j protein 

"I O Q "7 

92 . 969 

5264 

U22376 

Homo sapiens alternatively spliced 
proauct. using exon ija 

257 

64.045 

5265 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

180 

76 923 

5266 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

180 

76.923 

5267 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

199 

56.897 

5269 

AL030996 

Homo sapiens dJ1189B24.4 (novel 

190 

60.784 
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PUTATIVE protein similar to 
hypothetical proteins S. pombe 
C22F3.14C and C. elegans C16A3.8) 



5270 

AL030996 

Homo sapiens dJ1189B24.4 (novel 
PUTATIVE protein similar to 
hypothetical proteins S. pombe 
C22F3.14C and C. elegans C16A3.8) 

75 

60.000 

5271 

AL030996 

Homo sapiens dJ1189B24.4 (novel 
PUTATIVE protein similar to 
hypothetical proteins S. pombe 
C22F3.14C and C. elegans C16A3.8) 

1097 

55.501 

5272 

AL030996 

Homo sapiens dJ1189B24.4 (novel 
PUTATIVE protein similar to 
hypothetical proteins S. pombe 
C22F3.14C and C. elegans C16A3.8) 

6855 

96.619 

5273 

M87306 

Tetrahymena thermophila micronuclear 
linker hi Stone polyproteih 

143 

23.154 

5274 

X78327 

Rattus norvegicus ribosomal protein 
L13 

229 

61.957 

5276 

U34958 

Rattus norvegicus O protein beta 1 
subunit 

495 

100.000 

5277 

U34958 

Rattus norvegicus G protein beta 1 
subunit 

951 

Q C 111 

5.278 

U34958 

Rattus norvegicus Q protein beta 1 
subunit 

570 

84.354 

5279 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

230 

5 8.120 

5280 

U14971 

Homo sapiens ribosomal protein S9 

885 

77.202 

5282 

U14971 

Homo sapiens ribosomal protein S9 

511 

•71 1 C A 

5283 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

281 

C C /TOO 

5284 

D89052 

Homo sapiens proton-ATPase-like 
protein 

437 

55.689 

5285 

D89052 

Homo sapiens proton-ATPase-like 
protein 

549 

59.836 

5286 

AL117626 

Homo sapiens hypothetical protein 

4 y l 

»7 O £ OA 
/ J • © O *k 

5287 

AI/L17626 

Homo sapiens hypothetical protein 

O / 2* 

cn i *7fT 
D X • X / O 

5288 

U22376 

Homo sapiens alternatively spliced 
proQucu usxng exon i^h 

1 A ft 


5289 

U553 do 

cuticle collagen 

150 

31.250 

5290 

AF 06193 o 

nOIRO Sapicllo BLaUJLCU JUiULCXU 

92 

40.541 

5291 

Tl T?A C 1 Q *2 O 

U/\nm aan^ ona a ♦* 1 1 ■#* t>Y\ Y\y~f*it~ f* ~\ n 

49 

2 7 .2 73 


7V CftC 1 0"3 O 

AirUo 19 J o 

XJr^mrs aan^ ona at" rii 1 ~F f*r\ r\f n^P - ! Tl 
nuinu bapicUo ouauicu piutciu 

971 

48 .526 

5293 

X53581 

Rattus norvegicus ORF4 

140 

32.867 

5294 

Y0063 8 

Homo sapiens jlaja \j\a to i<ioi/ 


81 . 934 

5295 

U87607 

Rattus norvegxcus putative t\j>ti\. oiiiqid(j 
protein 1 

AO 0 

4 0 . 000 


1J ^ J J U 

Mus musculus proline-rich protein 

268 

32.877 

5297 

AB011532 

Rattus norvegicus MEGF6 

2066 

74.143 

5298 

U09411 

Homo sapiens zinc finger protein 
ZNF132 

266 

34.066 

5301 

X92485 

Plasmodium vivax pval 

280 

52.727 

5302 

AL032637 

Caenorhabditis elegans cDNA EST 
EMBL:D66101 comes from this gene; cDNA 

134 

32.558 
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EST EMBL:D69769 comes from this gene 



5303 

AL032637 

Caenorhabditis elegans cDNA EST 
EMBL:D66101 comes from this gene; cDNA 
EST EMBL:D69769 comes from this gene 

47 

25.641 

5304 

AL032637 

Caenorhabditis elegans cDNA EST 
EMBL:D$6101 comes from this gene; cDNA 
EST EMBL:D69769 comes from this gene 

59 

40.000 

5305 

AL032637 

Caenorhabditis elegans cDNA EST 
• EMBL : D66101 comes from this gene; cDNA 
EST EMBL:D69769 comes from this gene 

135 

30.709 

5306 

L11231 

Xenopus laevis myosin heavy chain B 

305 

38 . 863 

5307 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

365 

65.979 

5308 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

53 

25 .397 

5309 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

466 

58.940 1 

5310 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

193 

50 . 000 

5311 

X92485 

Plasmodium vivax pval 

198 

49 . 524 

5312 

S58722 

Homo sapiens X- linked retinopathy 
protein (C- terminal, clone XEH.8c} 

226 

63.855 1 

5313 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

168 

62 .791 

5314 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

497 

49. 153 

5315 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

230 

61.644 | 

5316 

U79260 

Homo sapiens unknown 

165 

50.633 

5317 

M26361 

Mus musculus LINE/Ig H-chain fusion 
protein 

144 

50.000 

5318 

AF055985 

Onchocerca volvulus pyrrol idone- rich 
antigen 

169 

36 . 082 

oil? 

UZ2 3 / o 

Homo sapiens alternatively spliced 
product using exon 13 A 

*> a f\ 

71. 212 


t to *>nc 

iiomo sapiens aJLucrnauivexy spiiccu 

piUUUUl. UDlJiy CAU1J X _> r\ 


C C TOO 

X 


Mi i a tni 10 /"*t 1 1 1 1 a hrtiifl 1 — 1 "5 If Una qp 

J IUo IHUqCUXUO tUUDiCU X X JVC iVJUldOC 

*± O A 

1 O . OiZ 

5322 

r\~C J <i J J 

Miia mil orn 1 ii q fmtnl pH»1 { Vp k^rtaap 

623 

inn nnn 

5323 

U97553 

mi it- -i np Vt » Tnp rs'v "i t~iio fiR iiTiVnnwn 

llllii. XilC llCi^CoVXiUU DO U11JV11UWU 

1 ft? 


5324 

X924 85 

"P"l aqmnHiiim vi vav tyv;» 1 

IT XaDlllUUlUIII V _L V aA p vaX 

A ft 

ID . Do / 

5325 

X92485 

Plasmodium vivax pval 

312 

65.432 



nuinu odpiciib o/ii x 

*7 
J / 

** O • 1D4 

5327 

AF029061 

Homo sapiens BAT1 

54 

30.435 

c *a o 
5328 

AF029061 

Homo sapiens BAT1 

3 8 

71 . 429 

C 1 *> o 

AF029061 

Homo sapiens BAT1 

173 

43 . 956 

cr *5 *5 /\ 
533 0 

AF0290bl 

Homo sapiens BAT1 

49 

25 . 806 

J J J X 

AT \-r O -J O O O 

i ¥ it; x aiiu uyx u o t>diiy uxiixkJtp^ti cuuumupuAVixuo 
ORF MSV23 3 hypothetical protein 

i n c 

1UQ 

i!D. / ^ J 

5332 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

605 

52.823 

5335 

X66285 

Mus musculus HC1 ORF 

120 

37.500 

5336 

AB028981 

Homo sapiens KIAA1058 protein 

623 

38.674 

5337 

AF149422 

Homo sapiens unknown 

872 

52.537 
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5340 

Z67990 

Caenorhabditis elegans similar to 
cuticle collagen 

154 

37.288 

5341 

U22376 ! 

Homo sapiens alternatively spliced 
product using exon 13 A 

163 

45.570 

5342 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

330 

67.532 

5343 

U12822 

Peromyscus maniculatus major 
histocompatibility complex class I 
antigen 

1171 

62 . 147 

5344 

U79260 

Homo sapiens unknown 

171 

50 . 000 

5345 

AF041054 

Mus musculus E1B 19K/Bcl -2 -binding 
protein homo log 

282 

61 . 364 

5347 

X92485 

Plasmodium vivax pval 

1 C ft 

ISO 

4 3* 590 

5348 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

219 

3 9 . 048 

5350 

U7 92 6 0 

Homo sapiens unknown- 

J ** J 

in ft n n 

5351 

U4 8852 

Cricetulus griseuB HT protein 



5352 

AB007862 

Homo sapiens KIAA04 02 

60 

50.000 

5353 

AB007862 

Homo sapiens KIAA0402 

O Q ""7 T 

y y / 1 

GO *y D "> 
DO. J OZ 

5354 

AB007862 

Homo sapiens KIAA0402 

10962 

95.259 

5355 

AB007862 

Homo sapiens KIAA0402 

10930 

98 . 906 

5357 

AF049588 

Canis familiaris synapsin I 

109 

29.310 

5358 . 

AF049588 

Canis familiaris synapsin I 

137 

31.132 

5359 

M17463 

Human papillomavirus type 5 ORF E4 
from bp 32 85 to 4 022; putative 

131 

28.502 

5360 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

354 

68.750 

5361 

AL132862 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

353 

28.772 

5362 

AL132862 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

1129 

2 6 . 759 

5363 

AF101074 

Homo sapiens step II splicing factor 
SLU7 

1730 

84 . 881 

5364 

D87515 

Rattus norvegicus aminopeptidase-B 

1657 

4 8 .362 

5365 

AL133084 

Homo sapiens hypothetical protein 

792 

92.701 

5366 

U49973 

Homo sapiens ORF2 : function unknown 

237 

67.742 

5369 

U37143 

Homo sapiens cytochrome P450 
monooxygenase CYP2J2 

212 

47.059 

5370 

U37143 

Homo sapiens cytochrome P450 
monooxygenase CYP2J2 

207 

44.286 

5371 

U37143 

Homo sapiens cytochrome P450 
monooxygenase CYP2J2 

637 

35.811 

5372 

AC006233 

Arabidopsis thaliana hypothetical 
protein 

94 

68.182 

5373 

AC006233 

Arabidopsis thaliana hypothetical 
protein 

104 

30.435 

5374 

AF047002 

Homo sapiens transcriptional 
coactivator ALY 

1090 

92 . 373 

5376 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

392 

61.321 

5377 

U79260 

Homo sapiens unknown 

278 

56.383 

5378 

X56015 

Crithidia oncopelti NADH dehydrogenase 
subunit 5 

152 

30.370 

5379 

AF010144 

Homo sapiens neuronal thread protein 

163 

35.938 
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AD7C-NTP 



5380 

U22376 

Homo. sapiens alternatively spliced 
product using exon 13 A 

330 

76.812 

5381 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

312 

59.140 

5382 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

411 

74 .444 

5383 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

407 

64.000 

5386 

L11672 . 

Homo sapiens zinc finger protein 

187 

48.611 

5388 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

248 

62.025 

5389 

Z99709 

Caenorhabditis elegans cDNA EST 
EMBIj:D73217 comes from this gene; cDNA 
EST yk478c5.3 comes from this gene; 
cDNA EST EMBL»:M89187 comes from this 
gene 

210 

27.941 

5390 

U79260 

Homo sapiens unknown 

192 

46 . 154 

5391 

U79260 

Homo sapiens unknown 

175 

40.171 

5392 

U27196 

Gallus gallus zinc finger protein 

484 

64.567 

5393 

AF133124 

Homo sapiens transcription factor 
IIIC63 

366 

89.394 

5394 

AF133124 

Homo sapiens transcription factor 
IIIC63 

305 

100.000 

5395 

AF133124 

Homo sapiens transcription factor 
IIIC63 

1499 

79.211 

5399 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

133 

37.895 

5400 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

363 

68.000 

5401 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

298 

50.000 

5402 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

52 

25.000 

5403 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

337 

59.783 j 

5404 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

400 

53.020 

5405 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

337 

6 6. 279* 

5406 

X92485 

Plasmodium vivax pval 

46 

2 3 . 256 

5407 

X92485 

Plasmodium vivax pval 

48 

28.571 

5408 

X92485 

Plasmodium vivax pval 

265 

58. 140 

5409 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

340 

68. 000 

5410 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

328 

6 0 . 000 

5411 

M94043 

J£attus norvecjicus rab-related GTP- 
binding protein 

395 

57.627 

5412 

X66285 

Mus musculus HC1 ORF 

121 

32.787 

5413 

AF116463 

Streptomyces lincolnensis unknown 

103 

29. 060 

5414 

AF116463 

Streptomyces lincolnensis unknown 

185 

28.213 

5415 

X55777 

Homo sapiens put. ORF 

222 

44.444 

5416 

U40420 

Caenorhabditis elegans weak similarity 
to procollagen alpha chain 1 (V) chain 

359 

41.317 
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5418 

Z38061 

Saccharomyces cerevisiae mal5, stal, 
len: 1367, CAI : 0.3, AMYHJfEAST P08640 
GLUCOAMYLASE SI (EC 3.2.1.3) 

108 

27.660 

5419 

Z38061 

Saccharomyces cerevisiae mal5, stai, 
len: 1367, CAI : 0.3, AMYHJfEAST P08640 
GLUCOAMYLASE SI (EC 3.2.1.3) 

56 

24,419 

5420 

Z38061 

Saccharomyces cerevisiae mal5, stal, 
len: 1367, CAI: 0.3, AMYH__YEAST P0864 0 
GLUCOAMYLASE SI (EC 3.2.1.3) 

370 

22.662 

5421 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isoform 

315 

65 . 060 

5422 

U79260 

Homo sapiens unknown 

291 

58.511 

5423 

U93569 

Homo sapiens p40 

566 

50.838 

5424 

M69297 

Homo sapiens ORF 3 

242 

41.221 

5425 

M77349 

Homo sapiens transforming growth 
factor induced protein 

376 

98.333 

5426 

M77349 

Homo sapiens transforming growth 
factor induced protein 

3242 

90.691 

5427 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

.197 

40.594 

5428 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

243 

51.402 

5429 

X92485 

Plasmodium vivax pval 

259 

61.039 

5430 

U22376 - 

Homo sapiens alternatively spliced 
product using exon 13 A 

362 

77.778 

5432 

L07925 

Rattus rattus guanine nucleotide 
dissociation stimulator 

318 

70.833 

5433 

L07925 

Rattus rattus guanine nucleotide 
dissociation stimulator 

402 

79.747 

5434 

L07925 

Rattus rattus guanine nucleotide 
dissociation stimulator 

3898 

69.353 

5435 

AF180350 

Rattus norvegicus protein phosphatase 
2A B regulatory subunit delta isoform 

862 

85.366 

5436 

AF180350 

Rattus norvegicus protein phosphatase 
2A B regulatory subunit delta isoform 

283 

92.063 

5437 

AF180350 

Rattus norvegicus protein phosphatase 
2A B regulatory subunit delta isoform 

1769 

79. 830 

5438 

M19419 

Mus musculus proline-rich salivary 
protein 

85 

25.664 

5439 

M19419 | 

Mus musculus proline-rich salivary 
protein 

44 

27.027 

5440 

M19419 

Mus musculus proline-rich salivary 
protein 

191 

33.121 

5442 

*ti r*\ *\ ***** r\ 

U79260 

Homo sapiens unknown 

360 

69.474 ! 

5443 

U73199 

Mus musculus Rho-guanine nucleotide 
exchange factor 

126 

47.826 

C A A A 

5444 

U73 199 

Mus musculus Rho-guanine nucleotide 
exchange factor 

125 

48.980 

5445 

U73199 

Mus musculus Rho-guanine nucleotide"" 
exchange factor 

83 8 


5446 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

116 

86.364 

5448 

AC005306 

Homo sapiens R27216 1 

41 

40.000 

5449 

AC005306 

Hpmo sapiens R27216 1 

270 

52.439 

5450 

AC005306 

Homo sapiens R27216 1 

69 

27.273 
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5451 

AC005306 

Homo sapiens R27216 1 

1169 

78.138 

5452 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

326 

53.043 

5454 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

54 

27.500 

5455 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

1223 

86.726 

5457 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

625 

85.047 

5458 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

1740 

92.642 

5459 

AF132984 

Homo sapiens nuclear pore complex 
interacting protein NPIP 

756 

95.161 

5461 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

229 

35.795 

5463 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

367 

67.089 

5464 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

339 

67.816 

5465 

D32202 

Homo sapiens alpha 1C adrenergic 
receptor isoform 2 

252 

63.095 

5466 

AL023893 

Drosophila melanogaster 
/prediction (method : n "genef inder" n , 
version: n "0B4 nn , score :" n lll . 64 - n ) ; 
/predictions (method: n "genBcan" n , 
version: nn 1.0 nu ) 

500 

32.530 

- 

5467 

Z47357 

Caenorhabditis elegans ZK1128.2 

517 

34.129 

5468 

Z81479 

Unknown predicted using Genef inder; 
similar to collagen; cDNA EST 
EMBL:D654 95 comes from this gene; 

158 

28.230 

5469 

AJ000506 

Mus musculus Homeodomain protein 
Meis2c 

54 

33.333 

5470 

AJ000506 

Mus musculus Homeodomain protein 
Meis2c 

181 

92.857 

5471. 

AJ000506 

Mus musculus Homeodomain protein 
Meis2c 

2481 

84.449 

5472 

U76638 

Homo sapiens BRCA1 -associated RING 
domain protein 

252 

29.811 

5473 

X55777 

Homo sapiens put. ORF 

167 

37.857 

5474 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

396 

73.563 

5475 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

612 

75.676 

5476 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

360 

69.811 

5477 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

806 

65.126 

5478 

D1374 8 

Homo sapiens eukaryotic initiation 

1 avUUI *±Jn.±. 

1591 

79.524 

5480 

U78597 

Plectonema boryanum kinesin light 
chain 

552 

35.759 

5481 

U78597 

Plectonema boryanum kinesin light 
chain 

61 

34.694 

5482 

U78597 

Plectonema boryanum kinesin light 
chain 

447 

40.359 
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54 83 

X55777 

Homo sapiens put. ORF 

265 

58.442 

54 85 

V ft ft ft r* ft 

X89858 

drosophila melanogaster actin binding 

240 

76.316 



protein 



5486 

v ft n o r ft 

X8 9858 

Drosophila melanogaster actin binding 

61 

28.261 



protein 



C4D7 
b4 O / 


Drosophila melanogaster actin binding 

466 

49.324 



JL KJ L. t2 J. 11 



J ** o o 


jjruaopnua meianogasuer acLin oinamg 

763 

31. 546 



Ul U LCJ.11 





noino sapiens aihauo / / protein 

274 

48.819 

CAQl 
J *X J. 


Homo sapiens K1AA0 877 protein 

429 

54.730 

O r* ^7 Z 

7x1711*7 ^ n*i o 

Homo sapiens sec 61 homo log 

451 

94.521 

b4 

AJr U / / UJz 

Homo sapiens sec61 homolog 

1306 

33.519 

b4 i?4 

7\ D A 1 1 ft "I 

ABO OJ 102 

Homo sapiens 26S proteasome subunit 

255 

41. 830 



P*l 4 . 3 





Mus muBculus plenty— of -prolines — 101 ; 

256 

29. 703 



ruri vji , oxij pniio protein 



QA OfC 

b4 yo 

tto o *7 <r 
UiiZ J / b 

Homo sapiens alternatively spliced 

353 

71.264 



proaucc using exon u a 



tZA Q"7 
b4 U / 

m QO iTn 

Homo sapiens unknown 

158 

42.308 - 


TIT C70 
lillb / ^ 

Homo sapiens zinc finger protein 

2471 

57.288 

cm 
_> _>u ± 

IjIId / 2 

Homo sapiens zinc finger protein 

2827 

55. 116 

J Jv J 

t nil') 
1j -J 2 -3 / 2 

Mus musculus AM PA selective glutamate 

232 

80. 392 



receptor 



CCAC 
j 3 3 


nomo sapiens neuronal unread protein 

272 

3 9. 614 



AD7c-NTP 



5506 

AL117428 

Hnmn nan -i prtn Vi\/T~»r^t~ Vi c» t* -1 ral f-^i^^t- « -i n 

A AC. 

39 . 070 

™? J V o 

nT -L O JL ^ U / 

mus muscuiuB u^i procein 

1837 

70. 044 


M.r J. j ±Z\J / 

Mus musculus G21 protein 

884 

67 . 797 


TV t?1 1 1 **> l*> "T 

AJ? 1 J 1^07 

Mus musculus G21 protein 

859 | 

65.789 

boll 

7\ 17 1 ii ft ft "7 

ArlJ 120 7 

Mus musculus G21 protein 

241 

53-097 

b512 

AF131207 

Mus musculus G21 protein 

2136 

96.588 

5513 

U2 23 76 

Homo sapiens alternatively spliced 

386 

75.000 



product using exon 13A 



C C 1 A 

S58722 

Homo sapiens X- linked retinopathy 

272 

52.727 



protein {C-terminal, clone XEH.8c} 



5515 

AF010144 

Homo sapiens neuronal thread protein 

381 

68.571 



AD7c-NTP 



5516 

AF010144 

Homo sapiens neuronal thread protein 

54 

29.268 



AD7c-NTP 



5517 

AIi008726 

Homo sapiens dJ337018.2 (Lysosomal 

264 

49.265 



Protective Protein precursor (EC 





3.4.16.5, Cathepsin A, 





Carboxypeptidase C) ) 



5518 

ik * ft ft ft «-i «-» s- 

AL008726 

Homo sapiens dJ337018.2 (Lysosomal 

1492 

97.211 



Protective Protein precursor (EC 





3.4. Id. 5, catnepsin A, 





Carboxvpeptidase C) ) 



5519 

X92485 

Plasmodium vivax pval 

194 

48.913 

5520 

M12100 

Mus musculus proline -rich protein MP-3 

411 

38.535 

5521 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 

50 

31.429 



isof orm 



5522 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 

276 

57.317 



isof orm 




,:i o o p "+6 "+3; M rji a s & ii a 



ADD ID J 

voivox carten exceDSin 

219 

32 .474 

5526 

AF182645 

Homo sapiens chondrosarcoma- associated 
protein 2 

2980 

86.535 

JOZ / 

BUI Q O C A C 

Homo sapiens chondrosarcoma- associated 

1541 

82.392 

ft 

AF1 ft'*? fiA R 

xiomo sapiens cnonarosarcoma- associated 

J- w t_ C J- J. X ^ 

214 

100. 000 

5529 

AF1B2645 

iiuiuu oapicllD CllUnUlUbaXLUiIla dBSOClaUGu 

protein 2 

*5 C Q 't 

2597 

78 ; 956 

5530 

M12100 

Mils musculus proline- rich protein MP-3 

^ -P 

J 3 . 13 0 

5531 

M12100 

Mus musculus proline-rich protein MP-3 


J U . U U V 

5532 

M12100 

Mus musculus proline-rich protein MP-3 

66 

4 / . jjb 

. 5533 

M12100 

Mus musculus proline-rich protein MP-3 

55 


5534 

M12100 

Mus musculus proline-rich protein MP-3 

62 


5536 

M12100 

Mus musculus proline-rich protein MP -3 

65 

^ O • / 1^ 

5537 

M12100 

Mus musculus proline-rich protein MP- 3 

£i A- 

3 Z . 4 DQ 

5538 

M12100 

Mus musculus proline-rich protein MP-3 

V/ 

** 1 . 1 / o 

5539 

M12100 

Mus musculus proline-rich protein MP-3 

63 

24.658 

5541 

M12100 


^4 

54 . 545 

5542 

M12100 

Mus musculus proline-rich protein MP-3 

46 

83.333 

5543 

M12100 

"jua muscuius piyiine ricn protein Mjt — j 

155 

31 . 356 

5544 

M12100 

Mus musculus proline-rich protein MP-3 

352 

38.860 



nomo sapiens uKir/ : tunction unJcnown 

252 

78. 846 

5547 

AE001799 

Thermotoga maritima translation 
elongation laccor v3 

42 

31.579 

5550 

AE001799 

Thermotoga maritima translation 
elongation factor G 

588 

42.857 

5551 

AE001799 

Thermotoga maritima translation 
elongation factor G 

53 

31.818 


AE0Q1799 

Thermotoga maritima translation 
elongation factor G 

656 

33.333 j 

jjjj 

tvt ft n ftn 

UnJcnown 

/predict ion= (method: " "genef inder " " , 
version: " "084 nn , score: n "67. 72" ") ; 
/ px cuiLuiou- \ uitj tno 

285 

35,404 

5554 

D00632 

Homo sapiens glutathione peroxidase 

37 

28.125 


L/UU DJ« 

Homo sapiens glutathione peroxidase 

1379 

92.035 

5558 

D00632 

Homo sapiens glutathione peroxidase 

1057 

86.580 


AiJ^s 4z b4 (J 

Voivox carteri f . nagariensis 
hydroxyproline-rich glycoprotein DZ- 

397 

32.203 

5561 

M35297 

Rattus norvegicus G-protein coupled 
receptor 

401 

82.143 

5562 

TJ2 0 ^ 7 K 1 

xiuino sapiens alternatively spliced 

piUUU^U Ho _L iiy CAUIl X<7x\ 

292 - 

57.609 

5563 

U22376 

Homo sapiens alternatively spliced 

435 

43.500 

5564 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

275 

64 ♦ 103 

5565 

U12141 

Saccharomyces cerevisiae Ynl24 i 30p 

43 

100.000 

5566 

U12141 

Saccharomyces cerevisiae Ynl243 0p 

279 

29.936 

5567 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

91 

84.615 

5569 

D87459 

Homo sapiens Similar to Volbox carteri 

153 

33.083 j 
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extensin (S22697) 



5570 

AF095737 

Homo sapiens unknown 

195 

48.421 

5572 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

274 

60.811 

5573 

D88440 

Gallus gallus high molecular mass 
nuclear antigen 

158 

24.219 - 

5574 

D88440 

Gallus gallus high molecular mass 
nuclear antigen 

82 

26.733 

5575 

D88440 

Gallus gallus high molecular mass 
nuclear antigen 

74 

.24.242 

5577 

D88440 

Gallus gallus high molecular mass 
nuclear antigen 

236 

27.211 

5578 

D88440 

Gallus gallus high molecular mass 
nuclear antigen 

114 

25.260 j 

5579 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

200 

43.243 

5580 

U37429 

Caenorhabditis elegans similar to M. 
musculus MER5 ana other AHPC/TSA 
proteins 

1005 

45. 052 

5581 

TTO Til 

U3742 9 

Caenorhabditis elegans. similar to M. 

muBCulUB rlDKj cilAQ OlOci AnrL/ loA 
nrnt" pinq 

349 

42 .484 

CCQO 
J J O ■£ 


ii 1 1 DCtUXCLtO UJiTvllvWH 

0 n ^ 

<c 0 z> 

O -J . 7J J i 

J JO J 

TT7Q2 fiO 

/ 7« vu 

Wftmn qani pn o unlmnun 



5584 

U79260 

Homo sapiens unknown 

225 

54.444 

croc 

AdU IDjj o - 

nomo sapiens xvx>\rt.u / i7z> protein 

0 1\ 0 

2 5 ♦ UUU 

5586 

U7 92 60 

Homo sapiens unknown 

190 

82.353 

5587 

U79260 

Homo sapiens unknown 

190 

82 .353 

5588 

X92485 

Plasmodium vivax pval 

239 

52.688 

5589 

X92485 

Plasmodium vivax pval 

302 

58 . 621 

5590 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

113 

35.185 

5591 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

88 

75 . 000 

5592 

AP010144 

Homo sapiens neuronal thread protein 

PAJ / C _ 1M ± f 

413 

69 . 903 

5593 

AF063866 

Melanoplus sanguinipes entomopoxvirus 
OKr jiovzjj nypocnecicai protein 

117 

29.114 

5594 

U79260 

Homo sapiens unknown 

339 

62 . 637 

5595 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

401 

75.281 

5596 

AJ243459 

Leishmania major proteophosphoglycan 

178 

24.658 

5597 

D45913 

Mus musculus leucine-rich- repeat 
protein 

4285 

93 . 733 

f- r- O O 

bb9o 

L11672 

Homo sapiens zinc finger protein 

442 

36.466 s 

C C Q Q 

d _> y y 

U22 3 76 

Homo sapiens alternatively spliced 
product using exon 13 A 

100 

65 .385 



nuiilvs Dapicilo aJ.LClllaLJ.vciy opiJLCU 

product using exon 13A 

jjJ 

fo tf • (£.V_ 

5601 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

362 

51.572 

5602 

X92485 

Plasmodium vivax pval 

352 

59.596 

5603 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isof orm 

307 

61.905 
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5605 

S58722 

Homo sapiens X-linked retinopathy 
protein {c- terminal , clone XEH.8c} 

180 

76 . 923 

5606 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c) 

180 

76 . 923 

5607 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

201 . 

57 . 143 

5608 

X92485 

Plasmodium vivax pval . 

155 

44 . 444 

5609. 

X52574 

Mus musculus GTP binding protein 

542 

56.696 

5611 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

234 

68 . 852 

5612 

AL049628 

Streptomyces coelicolor hypothetical 
proline-rich protein 

67 

35 . 849 

5613 

AL049628 

Streptomyces coelicolor hypothetical 
proline-rich protein 

179 

36.190 

5615 

AB032904 

Hylobates syndactylus dopamine 
receptor D4 

128 

29.630 

5616 

U77415 

Mus musculus Bopl 

626 

69.178 

5617 

U77415 

Mus musculus Bopl 

3572 

82.749 

5618 

AF131833 

Homo sapiens Unknown 

187 

63.333 

5620 

235597 

Caenorhabditis elegans C36E8.4 

219 

24.675 

5622 

AF070572 

Homo sapiens unknown 

1052 

39. 841 

5623 

M15530 

Homo sapiens B-cell growth factor 

128 

51.852 

5624 

S77099 

Drosophila pseudoobscura, Peptide, 14 9 
aa Jan A 

53 

4 1 . 379 

5625 

S77099 

Drosophila pseudoobscura. Peptide, 14 9 
aa Jan A 

260 

45 . 745 

5626 

S77099 

Drosophila pseudoobscura, Peptide, 149 
aa Jan A 

164 

39 . 450 

5628 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

278 

67 . 568 

5629 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

352 

68.605 

5630 

U79260 

Homo sapiens unknown 

325 

66. 667 

5631 

AJ388521 

Canis familiaris Ribosomal protein 

146 

50.413 

5632 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

269 

71.014 

5633 

M13100 

Rattus norvegicus unknown protein 

223 

41.481 

5634 

AF159297 

Zea mays extensin-like protein 

383 

23 .791 

5635 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

301 

67.568 

5636 

U79260 

Homo sapiens unknown j 

99 

82.353 

5637 

U79260 

Homo sapiens unknown 

280 

63 .218 

5638 

U79260 

Homo sapiens unknown 

346 

69.388 

5640 

AL035064 

Schizosaccharomyces pombe queuine 
trna-ribosyltransf erase 

1036 

61 . 847 

5641 

AL035064 

Schizosaccharomyces pombe queuine 
trna-ribosyltransf erase 

513 

47 .345 

CCA ■ 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

330 j 

63 .36^6 

5643 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

274 

41 .830 

5644 

AL117513 

Homo sapiens hypothetical protein 

2392 

100.000 

5645 

AL117513 

Homo sapiens hypothetical protein 

210 

78 .571 

5646 

AF010144 

Homo sapiens neuronal thread protein 

164 

73 .171 
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AD7C-NTP 



5647 

U22376 

Homo sapiens altezmatively spliced 
product using exoh 13A 

457 

70 ICC 
'« • IDS 

5648 

AF049588 

Canis familiaris synapsin I 

70 


5649 

AF049588 

Canis familiaris synapsin I 

232 

30.085 

5651 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

216 

04 . X l *k 

5652 

U55366 

Caenorhabditis elegans Similar to 
cuticle collagen 

179 

J . J X X- 

5653 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

94 

72.727 

5654 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

160 

61 . 905 

5655 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

624 

58 . 421 

5656 

X92485 

Plasmodium vivax pval 

217 

49.451 

5657 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

400 

77.647 

5658. 

M20471 

Homo sapiens clathrin light -chain a 

644 

83.465 

5659 

M20471 

HomO sapiens clathrin light ^ chain a 

1169 

98 . 889 

5660 

S70011 

Rattus sp. tricarboxylate carrier 

378 

78.571 | 

5661 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

316 


5662 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

124 


5663 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 


A A A Qrt 

5664 

X15014 

Homo sapiens RAI». A polypeptide (AA 1- 
206) 

63 8 

8 0 . 303 

5666 

X15014 

Homo sapiens RAL A polypeptide (AA 1- 
206) 

1216 

85 . 062 

5667 

X15014 

Homo sapiens RAL A polypeptide (AA 1- 
206) 

939 

88.942 

5668 

X92485 

Plasmodium vivax pval 

128 

72 .414 

5669 

X92485 

Plasmodium vivax pval 

80 

37.931 ! 

5670 

X92485 

Plasmodium vivax pval 

375 

64 .444 

5671 

X78933 

Homo sapiens zinc finger protein i 

759 

58 .911 

5672 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

257 

52 . 000 

5673 

M58378 

Homo sapiens synapsin lb 

187 

27 . 863 

5674 

U79260 

Homo sapiens unknown 

71 

64 .706 

5675 

U79260 | 

Homo sapiens unknown 

99 

82 . 353 

5676 

U79260 

Homo sapiens unknown ] 

360 

64 .356 

5677 

AF000996 

Homo Bapiens ubiquitous TPR motif, Y 
isof orm 

368 

70 . 238 

5678 

AL132862 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

53 

4b *± • X -3 O 

5679 

AL132862 

Caenorhabditis elegans predicted using 
Genefinder; preliminary prediction 

876 

27.368 

5680 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isof orm 

124 

73 .333 

5681 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isof orm 

65 

26 .562 

5682 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 

310 

63 .529 ! 
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isoform 



5683 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

185. 

50.549 

5684 

U22S76 

Homo sapiens alternatively spliced 
product using exon 13 A 

298 

56.522 

5685 

X92485 

Plasmodium vivax pval 

208 

49.515 

5686 

AC002074 

Homo sapiens collagen I alpha -2 
prepropeptide 

126. 

-34.884 

5687 

AF067728 

Rattus norvegicus trans activating 
protein BRIDGE 

157 

88.462 

5688 

AF067728 

Rattus norvegicus transactivating 
protein BRIDGE 

355 

79.452 

5689 

,AF067728 

Rattus norvegicus transactivating 
protein BRIDGE 

986 

76.829 

5690 

AF067728 

Rattus norvegicus transactivating 
protein BRIDGE 

265 

66.957 

5691 

AF085809 

Mus musculus synapsin lb 

195 

29.555 

5692 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

327 

62.376 

5693 

D86978 

Homo sapiens similar to a C.elegans 
protein encoded in cpsmid 
K12D12 (249069) 

705 . 

63.200 

5694 

U88154 

Homo sapiens proline and glutamic acid 
rich nuclear protein isoform 

83 

36.232 

5695 

U88154 

Homo sapiens proline and glutamic acid 
rich nuclear protein isoform 

244 

2B.986 

5696 

X83413 

Human herpesvirus 6 US 8 

241 

31.731 

5697 

AF126867 

Mus musculus calpain-like protease 

906 

60.223 

5698 

S64860 

Mus sp. NonO=non-POU domain -containing 
octamer-binding protein 

92 

36.735 

5699 

S64860 

Mus sp. NonO=mon-POU domain -containing 
octamer-binding protein 

343 

78.788 

5700 

S64860 

Mus sp. NonO=non-POU domain- containing 
octamer-binding protein 

354 

61.111 

5701 

S64860 

Mus sp. NonO=non-POU domain-containing 
octamer-binding protein 

1135 

50.909 

5702 

M69297 

Homo sapiens ORF 3 

162 

39.024 

5704 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

242 

69.091- 

5705 

AP010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

59 

26.957 

5706 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

50 

32 .609 

5707 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

167 

47.761 

C »7 a a 

5708 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

72 

25.510 

5709 


nuiuu . sapiens neuronal enreau proccin 
AD7C-NTP 

142 

48 . 980 

5710 

L77968 

Ovis aries type II small proline -rich 
protein 

90 

38.182 

5711 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

448 

72 .449 

5712 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

244 

64 .179 
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5713 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

392 

77. 108 

5714 

AF134726 

Homo sapiens NG22 

813 


5717 . 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

371 

66.019 

5718 

AF010144 

Homo sapiens neuronal thread protein 
AD7o-NTP 

321 

42 . 857 

5719 

AF187072 

Homo sapiens N-acetylglucosamine-1- 
phosphodiester alpha-N- 
acetylglucosaminidase 

438 

98.529 

5720 

AF187072 

Homo sapiens N-acetylglucosamine-1- 
phosphodiester aipha-N- 
acetylglucosaminidase 

3438 

97 . 683 

5721 

AB011094 

Homo sapiens KIAA0522 protein 

96 

26.966 

5722 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

340 

54.088 

5723 

Z11583 

Homo sapiens NuMA protein 

8771 

94.095 

5725 

Z11583 

Homo sapiens NuMA protein 

493 

70.677 

5727 

Z11583 

. Homo sapiens NuMA protein 

542 

75.833 

5728 

Z11583 

Homo sapiens NUMA protein 

696 

100 . 000 

5729 

Z11583 

Homo sapiens NuMA protein 

9382 

93 .337 

5730 

X92485 

Plasmodium vivax pval 

273 

4 7 . 368 

5731 

AF121893 

Rattus norvegiqus sequence-specific 
single- stranded-DNA-binding protein 

955 

65 . 816 

5735 

AF121893 

Rattus norvegicus sequence-specific 
single -stranded-DNA-binding protein 

1176 

52.401 

5737 

AB020686 

Homo sapiens KIAA0 879 protein 

41 

31.250 

5738 

AB020686 

Homo sapiens KIAA0879 protein 

321 

29. 876 

5739 

AB020686 

Homo sapiens KIAA0879 protein 

45 

28.571 j 

5740 

AB020686 

Homo sapiens KIAA0879 protein 

213 

31.111 

5741 

AB020686 

Homo sapiens KIAA0879 protein 

38 

40.000 I 

5742 

AB020686 

Homo sapiens KIAA0879 protein 

272 

27.700 

5743 

AB020686 . 

Homo sapiens KIAA0879 protein 

49 

29.167 1 

5744 

AJ005071 

Gallus gallus Tapasin 

48 


5745 

AJ005071 

Gallus gallus Tapasin j 

207 

34.848 

5746 

Z46787 

Caenorhabditis elegans similar to 
Glutaredoxin, Zinc finger, C3HC4 type 
(RING finger) 

572 


5747 

Z46787 

Caenorhabditis elegans similar to 
Glutaredoxin, Zinc finger, C3HC4 type 
(RING finger) 

1186 

51.266 

5749 

U48251 

Homo sapiens protein kinase C-binding 
protein RACK7 

1037 

7 O 11 c. 

5750 

U48251 

Homo sapiens protein kinase C-binding 
protein RACK7 

1726 

0 ** • 0 j 0 

5751 

U48251 

Homo sapiens protein kinase C-binding 
protein RACK7 

615 

68 . 156 


U4825 1 

Homo sapiens protein kinase C-binding 
protein RACK7 

3245 

90.150 

5753 

U48251 

Homo sapiens protein kinase C-binding 
protein RACK7 

2985 

92.844 

5754 

AF110103 

Tupaia belangeri beta-actin 

347 

87.302 

5755 

AF110103 

Tupaia belangeri beta-actin 

865 

1O0.000 

5756 

AF110103 

Tupaia belangeri beta-actin 

312 

64.286 


I o9oo 
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5757 

L24775 

Rattus norvegicus tropomyosin 

912 

90.541 

5758 

L24775 

Rattus norvegicus tropomyosin 

1160 

95.673 

5759 

AF021873 

Caiiis familiaris beta-actin 

369 

76.316 

5760 

AF021873 

Canis familiaris beta-actin 

1991 

88.451 

5761 

D37984 

Homo sapiens DNA helicase Ql 

42 

46.667 

5762 

D37984 

Homo sapiens DNA helicase Ql 

827 

100. 000 

5763 

D37984 

Homo sapiens DNA helicase Ql 

219 

89. 130 

5764 

AF184157 

Macacamulatta demidefensin 2 

281 

86.207 

5765 

AF016270 

Homo. sapiens thyroid hormone receptor 
coactivating protein 

5462 

93 . 696 

5766 

AF016270 

Homo sapiens thyroid hormone receptor 
coactivating protein 

471 

75.497 

5767 

AF016270 

Homo sapiens ' thyroid hormone receptor 
coactivating protein 

193 

97.143 

5768 

AF016270 

Homo sapiens thyroid hormone receptor 
coactivating protein 

5776 

98.914 

5769 

Z22658 

Homo sapiens thrombin inhibitor 

612 

72.000 

5770 

AL034433 

Schizosaccharomyces pombe ptr3 or 
ubiquitin-activating enzyme el 

390 

45.989 

5771 

X52851 

Homo sapiens peptidylprolyl isomerase 

292 

97.917 

5772 

X52851 

Homo sapiens peptidylprolyl isomerase 

357 

56 . 383 

5774 

AJ242540 

Volvox carter! f . nagariensis 
hydroxyproline-rich glycoprotein DZ- 
HRGP 

264 

38.168 

5775 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

285 

71. 014 

5776 

X70223 

Rattus norvegicus peroxisomal membrane 
protein 

584 

65 . 896 

5777 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isof orm 

311 

64 . 634 

5778 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

203 

53 . 704 

5779 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

236 

56.098 

5781 

X89633 

Saccharomyces cerevisiae hypothetical 
protein 

71 

52.632 

5782 

X89633 

Saccharomyces cerevisiae hypothetical 
protein 

229 

3 0 . 058 

5783 

AC005724 

Arabidopsis thaliana putative DNA 
repair and recombination protein of 
the SNF2 family 

55 

50.000 

5784 

AC005724 

Arabidopsis thaliana putative DNA 
repair and recombination protein of 
the SNF2 family 

46 

35.088 i 

5785 

AC005724 

Arabidopsis thaliana putative DNA 
repair and recombination protein of 
the SNF2 family 

66 

34.884 

5786 

AC005724 

Arabidopsis thaliana putative DNA 
repair and recombination protein of 
the SNF2 family 

51 

19.149 

5787 

AC005724 

Arabidopsis thaliana putative DNA 
repair and recombination protein of 
the SNF2 family 

47 

42.857 

5788 

AC005724 

Arabidopsis thaliana putative DNA 
repair and recombination protein of 

50 

28.125 
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the SNF2 family 



5789 

AC005724 

Arabidopsis thaliana putative DNA 
x. cpa j.1 anu i cuuiiLL/xnauioii protein or 
the SNF2 familv 

64 

37.500 

5790 

AC005724 

Arabidoosi s thaliana mi t* a t- i dm a 
repair and recombination protein of' • 
the SNF2 family 

<cn 

27.027 

5791 

AC005724 

Arabidopsis. thaliana putative DNA 
repair and recombination protein of 
the SNF2 family 

loin 

IT OTA 

5794 

M90657 

Homo sapiens L6 

166 

74 . 194 

5795. 

M90657 

Homo sapiens L6 

734 

7 8. 199 

5796 

M90657 

Homo sapiens L6 

342 

84 . 848 

5797 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

395 

69. 697 

5798 

AB014596 

Homo sapiens KIAA0696 protein 

956 

83 . 060 

5799 

AB014596 

Homo sapiens KIAA0696 protein 

1300 

85.597 

5800 

AB014596 

Homo sapiens KIAA0696 protein 

2780 

80 . 622 

5801 

AB014596 

Homo sapiens KIAA0696 protein 

268 

86. 000 

5802 

AB014596 

Homo sapiens KIAA06 96 protein 

265 


5803 

AB014596 

Homo sapiens KIAA0696 protein 

191 


5804 

AB014596 

Homo sapiens KIAA0696 protein 

725 

c n Ann 

5805 

AB014596 

Homo sapiens KIAA0696 protein 



5807 

U82163 

Oryctolagus cuniculus No definition 
line found 

4S V JL 

^> y . z ± t? 

5808 

U82163 

Oryctolagus cuniculus No definition 
line found 

289 

5 1 . 327 

5809 

AB023175 

Homo sapiens KIAA0958 protein 

763 

ion ooo 

5810 

AB023175 

Homo sapiens KIAA0958 protein 

251 

1O0 000 

v v . \s \J \J 

5811 

AB023175 

Homo sapiens KIAA0958 protein 

2644 

96. 02 0 

5812 

AB023175 

Homo sapiens KIAA0958 protein 

49 

4 6 . 154 

5813 

AB023175 

Homo sapiens KIAA0958 protein 

1812 


5814 

AB023175 

Homo sapiens KIAA095 8 protein 

2943 

100 .000 

5817 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

334 


5818 

AF004107 

Mus musculus unknown 

215 

64 .286 

5819 

AF004107 

Mus musculus unknown 

1 O OR 

b y . j j j 

5820 

AF004107 

Mus musculus unknown 

405 

50.920 

5821 

AF004107 

Mus musculus unknown 

646 

70.000 

5822 

AF055085 

Mus musculus f»i~vt~ViT~r»i rl 

* * *A fc-J ill %A- *J V— \JL JL \JL O \Zt J. V w X-J 2m V-/ JL \JL 

differentiation-related factor 

i r% f\ 
J 00 

4 8 . 544 

5823 

Z93244 

Homo sapiens bK116F5 2 fPUTATTVR 
RhoGAP (CDC42 GTPAse Activating 
Protein) LIKE protein) 

lie 

54 . 348 

5824 

Z93244 

Homo sapiens bK116F5.2 (PUTATIVE 
RhoGAP (CDC42 GTPAse Activating 
Protein) LIKE protein) 

140 


5825 

AL117482 

Homo sapiens hypothetical protein 

71 

36.000 

5826 

AL117482 

Homo sapiens hypothetical protein 

55 

30.233 

5827 

AL117482 

Homo sapiens hypothetical protein 

533 

1O0.000 

5828 

AL117482 

Homo sapiens hypothetical protein 

1901 

1O0.000 

5829 

L18865 

Homo sapiens , gene product 

41 

71.429 

5830 

L18865 

Homo sapiens , gene product 

45 

43.750 

5831 

L18865 

Homo sapiens , gene product ! 

45 

43.750 
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5832 

L18865 

Homo sapiens , gene product 

494 

97.403 

5833 

L18865 


1261 

100.000 

5834 

L18865 

Homo sapiens , gene product 

1256 

100.000 

5835 

AC004668 

Homo sapiens similar to murine cell 
cycie regulator miuai; similar to 
A57591 (PID:g2137417) 

391 

71,579 

5836 

AC004668 

Homo sapiens similar to murine cell 

1; leyuidtor toiuai/ similar to 
A57591 (PID:g2137417) 

537 . 

73 .846 

5837 

AC004668 

Homo sapiens similar to murine cell 
cycle regulator MIDA1; similar to 

A57591 (PTD«a51 77417^ 

3664 

90.866 

5838 

AC004668 

Homo sapienB similar to murine cell 
wyt>xc icyuidLoi vixuiw. , similar to 
A57591 (PID:g2137417) 

4150 

100. 000 

5839 

AF149414 

2i t*;^ V> \ Hnn a i a hVial ^ ana <- ■£ -»-, n 
mauiuupoXo Lllalldna contains 

similaritv to Pfam f am-i 1 v ppnm /r»-c 
cvtosine-soecif ic DNA- methvl aqp) - 
score=*10.4. E=0.051, N=l 

584 

59.333 

5840 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

JlD 

77 .419 

5841 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

J O 

C "5 ICQ 

5842 

U79260 

Homo sapiens unknown 

38 

1 1 A "> Q 

5843 

U79260 

Homo sapiens unknown 

250 

63.514 

5844 

X92485 

Plasmodium vivax pval 

1^ 

6 8 . 293 

5846 

AB014513 

Homo sapiens KIAA0613 protein 

55 

22.222 

5847 

AB0145X3 

Homo sapiens KIAA0613 protein 

4900 

99 . 591 

5848 

L16547 

BOS taurus chloride channpl nrn f ai'n 


87 . 571 

5849 

AF041206 

Homo s aniens midline l rprAhpi 1 a-r 
isoform 1 

199 

23 . 469 

5850 

AF106062 

Homo SatJienS Wiskohf -Al Hr-i rh avnHrnmo 

protein interacting protein 

277 

45 . 385 

5852 

Y08134 

HomO S3ni pnfl api fl Rnh"! nnftmvol 4 n 

like phosphodiesterase 

2747 

95. 494 

5853 

L38941 

Homo saniens ribosomal nrnh ■? t 

199 

85.366 

5855 

L38941 

Homo sapiens ribosomal protein L34 

344 

71.311 

5856 

AF086624 

x\a\.vuo iiwi vcyxLUd oci lllc LnicOnXIlc 

kinase 

55 

70 . 000 

5857 

AF086624 

Rattllfl noTVpai run o/a>>"i no t-)i vorw\^ no 

kinase 

39 

27.778 

5862 

AF086624 

a lluo uuivcyiLUH at-i lne unreonine 
kinase 

51 

39.130 

5863 

AF086624 

RattllR nnrvpni run nan'na 4-Vi>>n(-\n-{ 

kinase 

14 86 

92 . 116 

5864 

U04301 

OrVCtol ACniQ nini nilna mannAmrl 
vxjrubuiav^us uuij j.v,u J. Uo UlciJli JCJoy J. — 

oligosaccharide alpha-1, 2-mannosidase 

567 

62 . 308 

5865 

U04301 

Oryctolagus cuniculus mannosyl- 
oligosaccharide alpha-1 , 2-mannosidase 

568 

49 . 083 

5866 

U04301 

Oryctolagus cuniculus mannosyl- 
oligosaccharide alpha-1 , 2-mannosidase 

428 

36.735 

5867 

U04301 

Oryctolagus cuniculus mannosyl- 
oligosaccharide alpha-1, 2-mannosidase 

563 

66. 142 

5868 

U04301 

Oryctolagus cuniculus mannosyl- 
oligosaccharide alpha-1, 2-mannosidase 

1334 

63 .352 


:iL O O B, "+ & H-7S ,„ id E H 6. 0 a 


5869 

U21324 

Caenorhabditis elegans similar to 
entire S. cerevisiae ABC1 protein 
(Swiss-Prdt Acc: P27697) 

1144 

45.985 

5870 

U59299 

Homo sapiens MCT 

3185 

96.654 

5873. 

U59299 

Homo sapiens MCT 

250 

62.025 

5872 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

268 

75.862 

5873 

228201 

Saccharomyces cerevisiae ORF YKL201C 

271 

40 .288 

5874 

J04204 

Bos taurus 32 kd accessory protein 

867 

66.995 

5875 

AL023553 

Homo sapiens dJ347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

52 

30.303 

5876 

AL023553 

Homo sapiens d*J347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

43 

42.857 

5877 

AL023553 

Homo sapiens dJ347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

43 

43.750 

5878 

AL023553 

Homo sapiens dJ347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

46 

53.333 

5879 

AL023553 

Homo sapiens dJ347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

53 

35.714 

5880 

AL023553 

Homo sapiens dJ347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

46 

31 . 579 

5881 

AL023553 

Homo sapiens dJ347H13.5 (novel protein 
similar to yeast DNA-directed RNA 
Polymerase III 25 kD Polypeptide) 

262 

84.906 

5882 

M15182 

Homo sapiens beta-glucuronidase 
precursor (EC 3.2.1.31) 

367 

83. 333 

5883 

M15182 

Homo sapiens beta-glucuronidase 
precursor (EC 3.2.1.31) 

1030 

70.079 

5884 

M15182 

Homo sapiens beta-glucuronidase 
precursor (EC 3.2.1.31) 

533 

68.531 

5885 

M15182 

Homo sapiens beta-glucuronidase 
precursor (EC 3.2.1.31) 

480 

39.840 

5886 

AC004382 

Homo sapiens Unknown gene product 

1214 

66.987 

5887 

AC004382 

Homo sapiens Unknown gene product 

864 

91.758 

5888 

AC004382 

Homo sapiens Unknown gene product 

1443 

73.902 

5889 

AC004382 

Homo sapiens Unknown gene product 

1097 | 

95.745 

5891 

AF017369 

Mus musculus faciogenital dysplasia 
protein 3 

824 

61.275 

5892 

AIill0151 

Homo sapiens hypothetical protein 

1889 

98.294 

5893 

AL110151 

Homo sapiens hypothetical protein 

2327 

99.164 

5894 

AJ388545 

Canis familiaris hypothetical protein 

94 

41.935 

5895 

AF010144 

Homo sapiens neuronal thread protein 
AD7e-NTP 

236 

52.830 

5896 

AC005594 

Homo sapiens R26984 1 

382 

47.619 

5897 

AC005594 

Homo sapiens R26984 1 

952 

70.899 

5898 

AC005594 

Homo sapiens R26984 1 

819 

67.358 

5899 

AF042838 

Homo sapiens MEK kinase 1 

235 

88 .636 

5900 

AF042838 

Homo sapiens MEK kinase 1 

267 

73 .077 

5901 

AF042838 

Homo sapiens MEK kinase 1 

352 

56.522 


5902 

AF042838 

Homo sapiens MEK kinase 1 

2753 

61.538 

5903 

AF042838 

Homo sapiens MEK kinase 1 

9906 

99.599 

5904 

M22332 

Homo sapiens unknown protein 

181 

93.333 

5905 

M22332 

Homo sapiens unknown protein 

186 

93.548 

5906 

M22332 

Homo sapiens unknown protein 

1107 

96.815 

5907 

M22332 

Homo sapiens unknown protein 

1101 

96.178 

5908 

M22332 

Homo sapiens unknown protein 

1034 

92.308 

5909 

M22332 

Homo sapiens unknown protein 

1097 

94.904 

5910 

M22332 . 

Homo sapiens unknown protein 

1050 

92.903 

5911 

M22332 

Homo sapiens unknown protein . 

1063 

92.994 j 

5912 

M22332 

Homo sapiens unknown protein 

1101 

98. 052 

5913 

M22332 

Homo sapiens unknown protein 

1040 

92.357 

5914 

M22332 

Homo sapiens unknown protein 

1072 

92.994 

5915 

M14676 

Homo sapiens src-like tyrosine kinase 
(put. ) ; putative 

281 

80.000 

5916 

M14676 

Homo sapiens src-like tyrosine kinase 
(put . ) ; putative 

1331 

99.524 

5917 

M14676 

Homo sapiens src-like tyrosine kinase 
(put.); putative 

512 

59. 146 

5916 

Z36948 

Caenorhabditis elegans contains 3 
cysteine rich repeats 

113 

27.559 

5919 

Z36948 

Caenorhabditis elegans contains 3 
cysteine rich repeats 

156 

36.283 

5920 

AL050150 

Homo sapiens hypothetical protein 

257 

60.135 

5921 

AL050150 

Homo sapiens hypothetical protein 

1937 

94.385 

5922 

AF153879 

. Fugu rubripes BAW 

92 

60.000 

5923 

AF153879 

Fugu rubripes BAW 

325 

72 .414 

5924 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

348 

67 . 500 

5925 

M13751 

Escherichia coli branching enzyme (EC 
2.4.1.18) 

56 

29.032 

5926 

M13751 

Escherichia coli branching enzyme (EC 
2.4.1.18) 

61 

35.484 

5927 

M13751 

Escherichia coli branching enzyme (EC 
2.4.1.18) 

1202 

96.571 

5928 

M13751 

Escherichia coli branching enzyme (EC 
2.4.1.18) 

1826 

94 .139 

5930 

M13751 

Escherichia coli branching enzyme (EC 
2.4.1.18) 

1744 

91.727' 

5931 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isoform 

385 

49.645 

5932 

AF179286 

Homo sapiens death effector domain- 
associated factor 

380 

88.235 | 

5933 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1298 

55.844 

5934 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

871 

48.866 

5935 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1313 

95 .814 

5936 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

752 

71.304 

5937 

U79260 

Homo sapiens unknown ] 

103 

75 .000 

5938 

U79260 

Homo sapiens unknown 

313 

68 .182 

5939 

AL049628 

Streptomyces coelicolor hypothetical | 

156 

51 .220 




proline-rich protein 



5940 

AB012726 

Mub musculus C184M protein 

238 

59.459 

5941 

AB012726 

Mus musculus C184M protein 

880 

94 . 366 

5942 

L1X015 

Homo sapiens lymphotoxin-beta 

1468 

93.852 

5943 

L11015 

Homo sapiens lymphotoxin-beta 

1359 

-J * J \J 

5944 

AF145688 

Drosophila melanogaster BcDNA.LD26050 

212 

31.967 

5945 

AF124394 

Mus musculus f ragile-X-related protein 
1 isoform f 

685 

33 . 33b 

5946 

AP124394 

Mus musculus f ragile-X-related protein 
1 isoform f 

3775 

86 7"* 

5947 

AF124394 

Mus musculus f ragile-X-related protein 
i isoform f 

3665 

92 .769 

5948 

AF124394 

Mus musculus f ragile-X-related protein 
1 isoform f 

702 

95 . 161 

5949 

AL133105 

Homo sapiens hypothetical protein 

890 

59.447 

5950 

AB020682 

Homo sapiens KIAA0875 protein 

137 

100.000 

5952 

AB020682 

Homo sapiens KIAA0875 protein 

3405 

93 .701 

5953 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

228 

65.000 

5954 

X92485 

Plasmodium vivax pval 

151 

68.421 

5955 

S58722 

Homo sapiens X- linked retinopathy 
protein {c- terminal, clone XEH.8c} 

13 9 

O J . / o / 

5956 

AF010144 

Homo sapiens neuronal thread" protein 
AD7C-NTP 

46 

A£Z 1 CA 

5957 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

189 

"7 R Q A *7 

5958 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

1 ft 9 

JL O J7 

*7 ft OA *7 
/ O . 27*1 / 

5959 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

373 

4 (\ ft 97 

5960 

AF0X0144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

466 

53.005 

5963 

M36654 

Mus musculus homeo box 2.6 (Hox-2.6) 
gene product 

1614 

86.447 

5964 

AB014570 

Homo sapiens KIAA0670 protein 

926 

97 . 973 

5965 

AB014570 

Homo sapiens KIAA0670 protein 

8208 

99. 532 

5966 

AB014570 

Homo sapiens KIAA0670 protein 

7017 

92.869 

5967 

S58722 

Homo sapiens X- linked retinopathy 
protein {c-terminal, clone XEH.8c} 

145 

75 . 000 

5968 

AF126780 

Homo sapiens retinal short -chain 
dehydrogenase/reductase retSDR2 

1763 

95 . 440 

5969 

X78926 

Homo sapiens zinc finger protein 

2804 

98 . 500 

5970 

AB012726 

Mus musculus C184M protein 

170 

42.857 

5972 

X51755 

Homo sapiens open reading frame (458 
AA) 

360 

63 . 441 

5973 

X51755 

Homo sapiens open reading frame (458 
AA) 

461 


5974 

X51755 

Homo sapiens open reading frame (458 
AA) 

634 

78.125 

5975 

X51755 

Homo sapiens open reading frame (458 
AA) 

522 

56.886 

5976 

X51755 

Homo sapiens open reading frame (458 
AA) 

124 

34.021 

5978 

X51755 

Homo sapiens open reading frame (458 

433 

61.538 
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AA) 



CQ7Q 

YC1 TCC 

Homo sapiens open reading frame (458 

TV TV \ 

AA; 

387 

75.862 


AD ± / ?9 

Homo sapiens open reading frame (458 

J 1915 

94 .792 

5983 


Homo sapiens neuronal thread protein 

t\xj f — in x Jr 

262 

53.448 

5984 

AF155115 

"wuiu tsajpiens jw x — kjesjm - 3 o antigen 

605 

86.726 

5985 

AF155115 

mv/iiiu ua^JicilO I* i - I\.I2»1>I 3 O allLlC}Cn 

1 2766 

97.660 

5 986 

AB020643 

A-n-miv/ oa^xcuo IvXrtrtU O J O J-/ lOLcin 

1 4 04 0 

99.836 

5987 

U79260 

Homo a pi r*> ^ prin nnVnrkun 

1 260 

60.000 

5988 

AF129756 

Homo Ran 1 ! Ann "R 7A r T c » 

651 

73.077 | 

5989 

AF129756 

Homo saoienfl rati; 

1 731 

87.603 

5990 

AF129756 

Homo sanipnq "RTVTR 

1 1395 

92.827 

5991 

AF129756 

Homo sapiens BATS 

54 . 

47.826 

5992 

AF129756 

flomo sapiens mid 

2368 

90.200 

5993 

AF145634 

Drosophila melanogaster BcDNA. GH06193 

120 

60.000 

5994 

AT It SO ji 

Drosophila melanogaster BcDNA. GH0 6 193 

335 

54.237 

5995 


Homo sapiens TFAR19 

436 

74.590 

5996 


Homo sapiens TFAR19 

133 

82.857 

5 997 
•j j ^ t 

MJ? U / J / / 1 

Homo sapiens RNA polymerase II 
termination ractor 

393 

80.723 

5998 

AL03177 I S 

nomo sapiens au J0M3 . 3 (novel protein 
Bimnar to t, . eiegans Y63D3A.4) 

886 

95.425 

6000 

J03191 

xivjtuvj sapiens proniin 

341 

71.831 

6001 

X92485 

* laomuux uiii vivax pva 1 

50 

28.571 

6002 

X92485 

Plasmodium vivax pval 

270 

75.410 

6003 

AiTOl 1 Q1 C 

Homo sapiens hypothetical protein 

1170 

79.098 

6004 

-r*i«» v 1 J. j ID 

Homo sapiens hypothetical protein 

856 

97.872 

6005 

AiTO 1 1 9 1 

Homo sapiens hypothetical protein j 

1191 

90.119 

6006 

U79260 

nomo sapiens unxnown i 

208 

54.651 

6007 

U79260 

Homo sapiens unknown j 

379 

67.347 

6008 

AL031115 

Homo sapiens ZXDA, ZXDB (zinc finger 

y _!•(%-, \f a «3 -»— /^+— a 4 m \ 9 
■«»■ J- -L u prOuelQ; | 

1202 

60.571 

6009 

D17554 

Homo sapiens TAXREB107 

37 

38.462 

6010 

A'X / -J *± 

Homo sapiens TAXREB107 | 

193 

75.000 

6012 

U79260 

Homo sapiens unknown | 

226 

4 7.692 | 

O VJL j 

a Trt ft ~i A O >l 

Rattus norvegicus p65 protein 

422 

62.585 

6014 

S58544 

Homo sapiens 75 kda infertility- 
related sperm protein | 

786 

98.374 

6015 

S58544 

Homo sapiens 75 kda infertility- 
related sperm protein 

3379 

98.866 

6016 

A00328 

synthetic construct beta -endothelial 
cell growth factor | 

888 

95.484 

6017 

A00328 

synthetic construct beta -endothelial j 
cell growth factor 

287 

62.195 

6018 

A00328 

synthetic construct beta-endothelial 
cell growth factor | 

390 

93.548 

6019 

A00328 

synthetic construct beta-endothelial 
cell growth factor | 

723 

76.433 

6020 

A00328 

synthetic construct beta-endothelial 
cell growth factor 

668 

56.716 j 

6021 

AF053003 

Homo sapiens diphthamide biosynthesis 
protein-2; DPH2 

3222 

96.933 
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6023 

X58826 

Drosophila melanogaster RNA polymerase 
III second-largest subunit 

1190 

73 . 593 

6024 

X58826 

Drosophila melanogaster RNA polymerase 
III second- largest subunit 

405 

67 . 045 

6025 

AB000216 

Rattus norvegicus CCA3 

184 

80.556 

6026 

AB000216 

Rattus norvegicus CCA3 

3458 

66. 834 

6027 

S58722 

Homo sapiens X-linked retinopathy 
protein { C- terminal , clone XEH.8c} 

170 

68.421 

6028 

AL049783 

Homo sapiens hypothetical protein 

671 

100.000 

6629 

AL049783 

Homo sapiens hypothetical protein 

3259 

100.000 

6030 

AL049783 

Homo sapiens hypothetical protein 

4111 

100.000 

6031 

AL049783 

Homo sapiens hypothetical protein 

289 

95.455 

6032 

U79260 

Homo sapiens unknown 

295 

61.364 

6033 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

374 

68.132 

6037 

U79260 

Homo sapiens unknown 

203 

58.462 

6038 

AL117662 

Homo sapiens hypothetical protein 

56 

22.222 

6039 

AL117662 

Homo sapiens hypothetical protein 

207 

96.970 

6040 

AL117662 

Homo sapiens hypothetical protein 

718 

65.641 

6041 

AL117662 

Homo sapiens hypothetical protein 

1318 

97.573 

6042 

AL117662 

Homo sapiens hypothetical protein 

1384 

100 . 000 

6043 

AF155117 

Homo sapiens NY-REN- 62 antigen 

3962 

100 000 

6044 

AF125176 

Mus musculus angiopoietin-related 
protein-2 

364 

56.122 

6045 

AF125176 

Mus musculus angiopoietin-related 
protein-2 

962 

54.874 

6046 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

298 

68.571 

6047 

AL050285 

Homo sapiens hypothetical protein 

621 

95.876 

6048 

AC002400 

Homo sapiens Gene product with 
similarity to Ubiquitin binding enzyme 

458 

96.000 

6049 

AC002400 

Homo sapiens Gene product with 
similarity to Ubiquitin binding enzyme 

341 

90.741 

6050 

AC002400 

Homo sapiens Gene product with 
similarity to Ubiquitin binding enzyme 

J O X 

./4i 

6053 

AP000559 

Oryza sativa ESTs 

AU078277 (S1084) ,AU078273 (S13585) , 
D47857 (S13585) correspond to a region 
of the predicted gene.; Similar to 
Caenorhabditis elegans cosmid C15H9; 
hypothetical protein (U56965) 

97 

44.444 

6054 

AP000559 

Oryza sativa ESTs 

AU078277 (S1084) , AU078273 (S13585) , 
D47857 (S13585) correspond to a region 
of the predicted gene.; Similar to 
Caenorhabditis elegans cosmid C15H9; 
hypothetical protein (U56965) 

381 

52.525 

6055 

AP000559 

Oryza sativa ESTs 

AU078277 (S1084) , AU078273 (S13585) , 
D47857(S13585) correspond to a. region 
of the predicted gene.; Similar to 
Caenorhabditis elegans cosmid C15H9; 
hypothetical protein (U56965) 

486 

36.905 

6056 

AP000559 

Oryza sativa ESTs 

AU078277 (S1084) ,AU078273 (S13585) , 

445 

47.059 
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D47857 (S13585) correspond to a region 
of the predicted gene.; Similar to 
Caenorhabditis elegans cosmid C15H9; 
hypothetical protein (U56965) 



6057 

M22414 

Homo sapiens ribonuclease inhibitor 
precursor 

375 

87 . 324 

6058 

AL031187 

Arabidopsis thaliana kinesin-related 
protein katA (fragment) 

78 

34 . 545 

6059 

AL031187 

Arabidopsis thaliana kinesin-related 
protein katA (fragment) 

88 

32.039 

6060 

AL031187 

Arabidopsis thaliana kinesin-related 
protein katA (fragment) 

175 

24.324 

6061 

U79260 

Homo sapiens unknown 

373 

73.034 

6062 

AL110147 

Homo sapiens hypothetical protein 

797 

97.500 

6063 

U79260 

Homo sapiens unknown 

71 

64.706 

6064 

U79260 

Homo sapiens unknown 

266 

58.947 j 

6065 

D17629 

Homo sapiens GALNS 

778 

98.165 

6066 

D17629 

Homo sapiens GALNS 

60 

100.000 

6067 

D17629 

Homo sapiens GALNS 

530 

92.308 

6068 

D17629 

Homo sapiens GALNS 

3560 

99.808 

6069 

AB001914 

Homo sapiens PACE4C 

4294 

99.679 

6070 

AB001914 

Homo sapiens PACE4C 

4294 

99.679 

6071 

AB001914 

Homo sapiens PACE 4 C 

4152 

97.753 

6072 

AB001914 

Homo sapiens PACE4C 

4294 

99.679 

6073 

AB001914 

Homo sapiens PACE4C 

730 

84 .444 

6074 

AJ225124 

Mus musculus hyperpolarizat ion- 
activated cation channel, HAC3 

151 

31.754 

6075 

AF180728 

Drosophila melanogaster sulfate 
transporter 

484 

36.237 

6077 

AC004794 

Homo sapiens F02569 2 

2936 

73.025 ! 

6078 

U22376 

Homo Bapiens alternatively spliced 
product using exon 13A 

142 

79. 167 

6079 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

415 

66.667 

6080 

AP091047 

Mus musculus KH domain RNA binding 
protein QKI-7B 

1415 

100.000 

6081 

AF091047 

Mus musculus KH domain RNA binding 
protein QKI-7B 

441 

100.000 

6082 

AF127765 

Homo sapiens calpain 3; calcium 
activated neutral protease; CAPN3 ; CL1 

725 

92.366 

6083 

AP127765 

Homo Bapiens calpain 3; calcium 
activated neutral protease; CAPN3; CL1 

508 

80.357 

6084 

AB020720 ! 

Homo sapiens KIAA0913 protein 

8478 

99.216 

6085 

AF036549 

Homo sapiens RGC32 

129 

76.667 

6086 

AF036549 

Homo sapiens RGC32 

641 

100.000 

6087 

AL050018 

Homo sapiens hypothetical protein 

36 

40.000 

6088 

AL050018 

Homo sapiens hypothetical' protein 

490 

77.119 

6089 

AIi050018 

Homo sapiens hypothetical protein 

370 

96.721 

6090 

AL050018 

Homo sapiens hypothetical protein 

2294 

94.256 

6093 

L25125 

Mus musculus RNA helicase 

61 

38.462 

6094 

L25125 

Mus musculus RNA helicase 

58 

26.316 

6095 

L25125 

Mus musculus RNA helicase 

69 

26.786 

6096 

L25125 

Mus. musculus RNA helicase 

65 

21.854 

6098 

L25125 

Mus musculus RNA helicase 

330 

63 .636 
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6100 

L25125 

Mus musculus RNA helicase 

48 

29.630 

6104 

L25125 

Mus musculus RNA helicase 

61 

38.462 

6105 

L25125 

Mus musculus RNA helicase 

56 

23.077 

6110 

L25125 

Mus musculus RNA helicase 

70 

23.871 

6111 

L25125 

Mus musculus RNA helicase 

821 

63.547 

6112 

Z92832 

Caenorhabditis elegans F31D4.2 

324 

35.862 

6113 

Z92832 

Caenorhabditis elegans F31D4.2 

556 

45.455 

6115 

AC005594 

Homo sapiens R26984 1 

910 

67.876 

6116 

AC005594 

Homo sapiens R26984 _1 

808 

66.321 

6117 

Z75526 

Caenorhabditis elegans Weak similarity 
to Staphyloccus autolysin gene 
(TR:G765072) ; cDNA EST EMBL:M89336 
comes from this gene; cDNA EST 
yk505dl2.3 comes from this gene 

290 

50.588 

6118 

Z75526 

Caenorhabditis elegans Weak similarity 
to Staphyloccus autolysin gene 
(TR:Q765072) ; cDNA EST EMBL:M89336 
comes from this gene; cDNA EST 
yk505dl2.3 comes from this gene 

672 

48.485 

6119 

AB010959 

Cyprinus carpio natural killer cell 
enhancing factor 

376 

51.034 

6120 

AB010959 

Cyprinus carpio natural killer cell 
enhancing factor 

325 

48.858 

6122 

Z11115 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST yk281d4.5 comes 
from this gene; cDNA EST yk288c!2.5 
comes from this gene; cDNA EST 
yk406gll.5 comes from this gene 

194 

40.909 

6123 

AF144757 

Homo sapiens PR-domain zinc -finger 
protein PFM1 

453 

47 . 291 

6124 

AF144757 

Homo sapiens PR-domain zinc- finger 
protein PFM1 

48 

32.258 

6125 

AF144757 

Homo sapiens PR-domain zinc- finger 
protein PFM1 

308 

50.450 

6126 

AF144757 

Homo sapiens PR -domain zinc -finger 
protein PFM1 

1266 

63.969 

6127 

AF144757 

Homo sapiens PR- domain zinc -finger 
protein PFM1 

5078 

98. 500 

6128 

U08819 

Mus musculus microtubule -associated 
protein 4 

1606 

47. 013 

6129 

U08819 

Mus musculus microtubule -associated 
protein 4 

2261 

44.151 

6130 

U08819 

Mus musculus microtubule -associated 
protein 4 

1840 

51.398 

6131 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

396 

69.000 

6132 

U79260 | 

Homo sapiens unknown 

343 

64.894 

6133 

AF036705 

Unknown Similar to phytoene 
desaturase; coded for by C. elegans 
cDNA CEESX74F; coded for by C./el 

1753 

50.903 

6134 

AF036705 

Unknown Similar to phytoene 
desaturase; coded for by C. elegans 
cDNA CEESX74F; coded for by C. el 

361 

56.122 

6136 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

204 

59.615 
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6137 

U22376 

Homo saDiens altemativelv nnl 1rpH 
product using exon 13 A 


76 . 667 

6138 

U22376 

Homo aat5leii3 alternat*ivplv nnl p&rl 
product using exon 13 A 


74 . 648 

6140 

AP010144 

Homo saoiens neuronal' thread nrnhA'ln 
AD7c-NTP 


64 . 000 

6141 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

553 

60.000 

6142 

AP060228 

Homo sapiens retinoic acid receptor 
responder 3 

200 

71.154 ! 

6143 

AJ243136 

Trachemys scripta- elegans putative 
pyruvate kinase 


o 0 . 227 

6144 

AJ242540 

Volvox carteri f . nagariensis 
hydroxyproline-rich glycoprotein DZ- 
HRGP 

« V J 

C A "lor 

6146 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

431 

73.469 

6147 

U79260 

Homo sapiens unknown 

344 

64.835 

6149 

AP000996 

Homo sapiens ubiquitous TPR motif, Y 
isoform 

J? / 

70. lib 

6151 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

^ A 
3 O 

72 . 727 

6152 

U22376 

Homo saoiens altemativplv ot~»1 S ppH 
product using exon 13A 

C VI 

26 . 316 

6153 

U22376 

Homo saniens alhprnflh{vplv anl i' r«o^ 
product using exon 13 A 

c a 
DO 

72 . 727 

6154 

U22376 

Homo saoiens altemativelv qnl -1 m=»H 
product using exon 13A 

1 Q1 

44 . 000 

6155 

S58722 

Homo sapiens X- linked retinopathy 
protein {c-terminal, clone XEH.8c} 

ion 

64 . 000 

6156 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 


70.103 

6157 

AF169301 

Homo sapiens Na+/ sulfate cotransporter 
SUT-1 

168 

78.125 

6158 

AF169301 

Homo sapiens Na+/sulfate cotransporter 
SUT- 1 

4047 

100. 000 

6159 

U49973 

Homo sapiens ORF1; MER37; putative 
transposase similar to pogo element 

210 

34.167 

6160 

ABO33021 

Homo sapiens KIAA1195 protein 

294 

43.269 

6161 

U49954 

Caenorhabrf'i t" i a elenano r*r^rl c*rl -F r^v* r* 
ua&uwiiicu/uiuio cxc^auo UUUcU tor Dy l_. . 

elegans cDNA CEESG19F; short region 
of weak similarity to Thermomonospora 
protein kinase (GB:U23 820) 

54 

29.730 

6162 

U49954 

Caenorhabditl Alpnann r-nrloH ^*-\*» K-ir r* 
wa&uviucwuxtlo C J. cy CtjLLt> l_l_/(Jfc;vJ. IOl DV V* * 

elegans cDNA CEESG19P; short region 
of weak similarity to Thermomonospora 
protein kinase (GB:U23820) 

46 

29.412 

6163 

U49954 

Caenorhabditis elegans coded for by C. 
elegans cDNA CEESQ19F; short region 
of weak similarity to Thermomonospora 
protein kinase (GB:U23 82 0) 

256 

43 . 750 

6164 

U49954 

Caenorhabditis elegans coded for by C. 
elegans cDNA CEESG19F; short region 
of weak similarity to Thermomonospora 
protein kinase (GB:U23820) 

211 

40.323 



.::LO M:«-lhS, H-3< „ O 5! H i& O H 


6165 

U37376 

Xenopus laevis MAM domain protein 

443 

53.125 

6166 

U37376 

Xenopus laevis MAM domain protein 

173 

58.000 

6167 

U37376 

Xenopus laevis MAM domain protein 

347 

46.970 

6168 

U37376 

Xenopus laevis MAM domain protein 

574 

42 .400 

6169 

U37376 

Xenopus laevis MAM domain protein 

1645 

63.243 

61.70 

AL078630 

Mus musculus 573K1.5 (novel protein 
similar to worm, yeast and plant 
proteins) 

375 

51.639 

6171 

AL078630 

Mus musculus 573K1.5 (novel protein 
similar to worm, yeast and plant 
proteins) 

558 

66.667 

6172 

Z81561 

Caenorhabditis elegans cDNA EST 
yk338f6.5 comes from this gene; cDNA 
EST EMBL:D75296 comes from this gene 

50 

3 8 . 095 

6173 

281561 

Caenorhabditis elegans cDNA EST 
yk338f6.5 comes from this gene; cDNA 
EST EMBL:D752 96 comes from this gene 

250 

35.537 

6174 

Z14020 

Nicotiana tabacum Pistil extensin like 
protein, partial CDS only 

82 

33.333 

6175 

Z14020 

Nicotiana tabacum Pistil extensin like 
protein, partial CDS only 

205 

42.045 

6176 

Z14020 

Nicotiana tabacum Pistil extensin like 
protein, partial CDS only 

197 

42.683 

6177 

U79260 

Homo sapiens unknown 

376 

71.591 

6178 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

362 

62.376 

6179 

Y08715 

Mus musculus vascular cadherin-2 

283 

26.439 | 

6180 

AF000152 

Homo sapiens OS -4 protein 

288 

88.235 

6181 

AF000152 

Homo sapiens OS -4 protein 

934 

95.333 | 

6182 

AF171938 

Homo sapiens NUMB isoform 1 

4408 

100. 000 

6183 

AF171938 

Homo sapiens NUMB isoform 1 

406 

95.161 

6184 

AF171938 

Homo sapiens NUMB isoform 1 

424 

67.961 

6186 

AF171938 

Homo sapiens NUMB isoform 1 

564 

34.568 

6187 

AF171938 

Homo sapiens NUMB isoform 1 

3014 

82.753 

6189 

AF171938 

Homo sapiens NUMB isoform 1 

4274 

98.310 

6190 

AF171938 

Homo sapiens NUMB isoform 1 

3529 

87.023 

6191 

AF171938 

Homo sapiens NUMB isoform 1 

3666 

9O.505 

6192 

X53581 

Rattus norvegicus ORF4 

40 

26.923 

6193 

X53581 

Rattus norvegicus 0RP4 

47 

27.273 

6194 

X53581 

Rattus norvegicus ORF4 

180 

52.239 

6195 

X16623 

Rattus sp. neuraxin (AA 1-863) 

512 

34.286 

6196 

S63758 | 

Mus sp. metallothionein-I gene 
transcription activator 

250 

93.103 

6197 

AF010144 

Homo sapiens neuronal / thread protein 
AD7C-NTP 

511 

38.579 

6198 

X86779 

Homo sapiens FAST kinase 

620 

60.476 

6199 

AB032983 

Homo sapiens KIAA1157 protein 

355 

57.609 

6200 

L12147 

Mus musculus early B-cell factor 

207 

91.176 

6201 

L12147 

Mus musculus early B-cell factor 

214 

100.000. 

6202 

L12147 

Mus musculus early B-cell factor 

1646 

99.219 ! 

6203 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

283 

58.763 

6204 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

1066 

51.422 
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6205 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

181 

36.943 

6206 

AF010144 - 

Homo sapiens neuronal thread protein 
AD7C-NTP 

481 

35.984 

6207 

AF055666 

Mus musculus kinesin light chain 2 

224 

70.690 

6208 

D45371 

Homo sapiens a novel adipose specific 
collagen-like factor, apMl (adipose 
most abundant gene transcript 1) 

241 

42 . 537 

6209 

X70476 

Homo sapiens subunit of coatomer 
complex 

66 

30.000 

6210 

X70476 

Homo sapiens subunit of coatomer 
complex 

335 

78.824 

6211 

Z86109 

Saccharomyces pastorianus unknown 

123 

35.227 

6216 

U79260 

Homo sapiens unknown 

58 

44.444 | 

6217 

U79260 

Homo sapiens unknown 

308 

58.889 

6218 

M25113 

Homo sapiens sickle bet a -hemoglobin 

720 

89. 600 

6221 

AJ223814 

Homo sapiens striatin 

5149 

100.000 

6222 

AJ223814 

Homo sapiens striatin 

3862 

96.753 

6223 

AJ223814 

Homo sapiens striatin 

402 

82.432 

6229 

X924B5 

Plasmodium vivax pval 

242 

50.000 

6230 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

343 

62.376 

6231 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

338 

61.321 

6233 

Z93244 

Homo sapiens bK116F5.2 (PUTATIVE 
RhoGAP (CDC42 GTPAse Activating 
Protein) LIKE protein) 

132 

58 . 824 

6234 

AF071081 

Mycobacterium tuberculosis proline- 
rich mucin homolog 

152 

26.255 

6235 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isoform 

187 

41. 176 

6236 

AB032907 

Hylobates lar dopamine receptor D4 

78 

26.027 

6237 

AB032907 

Hylobates lar dopamine receptor D4 

73 

29.032 

6238 

AB032907 

Hylobates lar dopamine receptor D4 

57 

54.545 

6239 

AB032907 

Hylobates lar dopamine receptor D4 

89 

35.000 

6240 

AB032907 

Hylobates lar dopamine receptor D4 

87 

29.333 

6241 

AB032907 

Hylobates lar dopamine receptor D4 

137 

32.632 | 

6242 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

336 

56.522 

6243 

U53153 

Caenorhabditis elegans one short 
region of weak similarity to S. 
cerevisiae protease A inhibitor 3 
(SP:P01094) and another short region 
of weak similarity to S. cerevisiae 
glucose repression mediator protein 
(SP:P14 922) 

185 

2 9 . 457 

6244 

M15530 

Homo sapiens B-cell growth factor 

149 

62.791 

6245 

AB033034 

Homo sapiens KIAA12 08 protein 

171 

73.529 

6246 

X92485 

Plasmodium vivax pval 

154 | 

70.968 

6247 

U58658 

Homo sapiens unknown j 

188 

4 8.052 

6248 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

424 

68.000 

6249 

U32462 

Ithomia sp. NADH dehydrogenase subunit 
1 

104 

26.829 
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6250 

AF095737 

Homo sapiens unknown 

256 

54.945 

6252 

AF159055 

Homo sapiens leucine zipper- like 

144 

70.000 



protein 



6253 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 

259 

51.163 



isof orm 



6254 

U09823 

Oryctolagus cuniculus elongation . 

619 

77.600 



factor 1 alpha 



6255 

U09823 

Oryctolagus cuniculus elongation 

1307. 

.94.628 



factor 1 alpha 



6256 

AF010144 

Homo sapiens neuronal thread protein 

346 

65.432 



AD7C-NTP 



6257 

AC002451 

Homo sapiens pyruvate dehydrogenase 

2405 

87.302 



kinase isoform 4 



6258 

AC002451 

Homo sapiens pyruvate dehydrogenase 

182 

45.714 



kinase isoform 4 



6259 

AC002451 

Homo sapiens pyruvate dehydrogenase 

2722 

100.000 



kinase isoform 4 



6261 

AC004473 

Arabidopsis thaliana T13D8.31 

519 

33.163 

6262 

AC004473 

Arabidopsis thaliana T13D8.31 

494 

46.471 ! 

6265 

AF033339 

Caenorhabditis briggsae UNC-45 

149 

23.077 

6266 

X67863 

Mus musculus T2 

252 

49.398 

6267 

AE001875 

Deinococcus radiodurans hypothetical 

141 

22.609 



protein 



6268 

AL034556 

Plasmodium falciparum predicted using 

127 

41.304 



hexExon; MAL3P5.9 (PFC0615w) , 





Hypothetical protein, len: 332 aa 



6270 

X92485 

Plasmodium vivax pval 

218 

59.322 

6272 

AP010144 

Homo sapiens neuronal thread protein 

346 

64.935 



AD7C-NTP 



6274 

AF010144 

Homo sapiens neuronal thread protein 

425 

68.317 



AD7C-NTP 



6276 

AF116553 

Drosophila melanogaster antennal- 

226 

30.216 



specific short-chain 





dehydrogenas e / r educ t a s e 



6277 

AF116553 

Drosophila melanogaster antennal- 

44 

22.857 



specific short -chain 





dehydrogenase/reductase 



6278 

AF116553 

Drosophila melanogaster antennal- 

358 

28.339 



specific short-chain 





dehydrogenase/ reductase 



6279 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 

307 

58.333 



isoform 



6280 

U73522 

Homo sapiens AMSH 

690 

98.990 

6281 

U73522 

Homo sapiens AMSH 

266 

69.231 

6282 

L26953 

Homo sapiens chromosomal protein 

124 

54.717 

6283 

L26953 

Homo sapiens chromosomal protein 

44 

30.000 

6284 

L26953 

Homo sapiens chromosomal protein 

165 

45.283 

6 V 2 85 

U22376 

Homo sapiens alternatively spliced 

200 

46.988 



product using exon 13A 



6286 

U16359 

Rattus norvegicus nitric oxide. 

121 

53.333 



synthase 



6287 

X92485 

Plasmodium vivax pval 

178 

45.455 

6289 

X92485 

Plasmodium vivax pval 

329 i 

44.872 

6291 

AF013213 

Bos taurus elongation factor 1 alpha 

132 

95.833 
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6292 

AP013213 

Bos taurus elongation factor 1 alpha 

732 

100.000 

6293 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isoform 

271 

53.165 

6294 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

148 

35.955 

6295 

AF010144 

Homo sapienB neuronal thread protein 
AD7C-NTP 

197 

53.333 

6296 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

333 

66.234 

6297 

X16064 

Homo sapiens tumor protein (AA 1 - 
172) 

230 

100.000 

6298 

X16064 

Homo sapiens tumor protein (AA 1 - 
172) 

603 

75.138 ] 

6299 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

310 

56.322 

6300 

X92485 

Plasmodium vivax pval 

192 

74.286 j 

6301 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

261 

72.727 

6302 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

359 

71.605 

6303 

L24804 

Homo sapiens p23 

969 

98.551 

6304 

L24804 

Homo sapiens p23 

161 

82.759 1 

6305 

L40357 

Homo sapiens thyroid receptor 
interactor 

467 

91.250 

6306 

L40357 

Homo sapiens thyroid receptor 
interactor 

109 

65.517 

6307 

L40357 

Homo sapiens thyroid receptor 
interactor 

625 

90.625 

6308 

U39208 

Rattus norvegicus cytochrome P450 4F6 

130 

34.568 

6309 

AF182070 

Bdelloura Candida calcium channel 
alpha- 1 subunit 

71 

20.168 

6310 

AF182070 

Bdelloura Candida calcium channel 
alpha- 1 subunit 

49 

26.531 

6311 

AP182070 

Bdelloura Candida calcium channel 
a lpha - 1 subuni t 

43 

27.273 

6312 

Z30425 

Homo sapiens orphan nuclear hormone 
receptor 

43 

40.000 

6313 

Z30425 

Homo sapiens orphan nuclear hormone 
receptor 

873 

79.348 

6314 

Z30425 

Homo sapiens orphan nuclear hormone 
receptor 

2295 

97.734 

6315 

Z30425 

Homo sapiens orphan nuclear hormone 
receptor 

246 

93.023 

6316 

U18997 

Escherichia coli ORF o622; reading 
frame open far upstream of start; 
possible frameshift, linking to 
previous ORF 

65 

28 .261 

6317 

U18997 

Escherichia coli ORF_o622; reading 
frame open far upstream of start; 
possible frameshift, linking to 
previous ORF 

64 

37.500 

6318 

U18997 

Escherichia coli 0RF_o622; reading 
frame open far upstream of start; 
possible frameshift, linking to 
previous ORF 

60 

16.667 


Mis' 
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6320 

U18997 

Escherichia coji ORF_o622; reading 
frame open far upstream of start; 
possible frameshift, linking to 
previous ORF 

59 

26. 667 

6321 

U18997 

Escherichia coli ORF_o622; reading 
frame open far upstream of start; 
possible frameshift, linking to 
previous ORF 

691 

38.992 

6323 

AF047441 

Homo sapiens RNA polymerase I 40kD 
subunit 

170 

35.417 

6324 

AF149B09 

Oryza sativa circumsporozoite protein 
precursor homolog 

116 

30.851 

6325 

X92485 

Plasmodium vivax pval 

235 

59.701 

6326 

AF117831 

Mus musculus putative protein 
phosphatase type 2C 

40 

35:000 

6327 

AF117S31 

Mus musculus putative protein 
phosphatase type 2C 

285 

65.753 

6328 

AF117831 

Mus musculus putative protein 
phosphatase type 2C 

49 

25.806 

6329 

AF117831 

Mus musculus putative protein 
phosphatase type 2C 

37 

38. 889 

6330 

AP117831 

Mus musculus putative protein 
phosphatase type 2C 

34 

63.636 

6331 

AF117831 

Mus musculus putative protein 
phosphatase type 2C 

44 

21.154 

6332 

AF117831 

Mus musculus putative protein 
phosphatase type 2C 

48 

31.579 

6333 

AF117831 

Mus musculus putative protein 
phosphatase type 2C 

53 

42.857 

6334 

AL049730 

Arabidopsis thaliana p EARL I 1-like , 
protein 

143 

36.765 

6335 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

399 

73.333 

6336 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

346 

73.529 

6337 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

160 

67.568 

6338 

U23515 

Caenorhabditis elegahs weak similarity 
to adenylyl cyclase-associated protein 
(CAP) and to P. chabaudi adami major 
merozoite surfae antigen protein 
(PIR:A32555) . Final exon overlaps 
gene predicted on other strand. 

76 

31.081 

6339 

U23515 

Caenorhabditis elegans weak similarity 
to adenylyl cyclase-associated protein 
(CAP) and to P. chabaudi adami major 
merozoite surfae antigen protein 
(PIR:A32555) . Final exon overlaps 
gene predicted on other strand. 

121 

5X.351 

6340 

Z81113 

Unknown similar to WD domain, G-beta 
repeats (4 domains) ; cDNA EST 
EMBL:D7522 8 comes from this gene 

153 

46.296 

6341 

Z81113 

Unknown similar to WD domain, G-beta 
repeats (4 domains) ; cDNA EST 
EMBL:D75228 comes from this gene 

151 

23.118 
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6342 

X61048 

Hydra sp. mini -collagen 

147 

38-272 

6343 

AF063866 

Melanoplus sanguinipes entomopoxvirus 
ORP MSV249 hypotehtical protein 

4 8 

22.581 

6344 

AF063866 

Melanoplus sanguinipes entomopoxvirus 
ORP MSV24 9 hypotehtical protein 

41 

25.000 

6345 

AF063866 

Melanoplus sanguinipes entomopoxvirus 
ORF MSV24 9 hypotehtical protein 

67 

22.581 

6346 

M15530 

Homo sapiens B-cell growth factor 

114 

60.526 

6347 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

218 

50.000 

6348 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

434 

77.333 

6349 

AF067216 

Caenorhabditis elegans No definition 
line found 

223 

44 .248 

6350 

AL121745 

Schizosaccharomyces pombe hypothetical 
zinc finger protein 

187 

38.202 

6351 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

382 

58.407 

6352 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

73 

23.810 

6353 

D83516 

Homo sapiens HLA-A9HH antigen 

630 

84.404 

6354 

D83516 

Homo sapiens HLA-A9HH antigen 

290 

58.947 

6356 

AF027144 

Homo sapiens zinc finger protein 

270 

51.471 | 

6357 

AF027144 

Homo Bapiens zinc finger protein 

139 

45.283 

6358 

AF027144 

Homo sapiens zinc finger protein 

225 

44.595 

6359 

AF080217 

Sinorhizpbium meliloti acetoacetyl-CoA 
synthetase; acetoacetyl-CoA ligase; 
acyl -activating enzyme 

95 

32 .203 

6360 

AF080217 

Sinorhizobium meliloti acetoacetyl-CoA 
synthetase; acetoacetyl-CoA ligase; 
acyl -activating enzyme 

772 

47.699 

6361 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

112 

37.647 

6362 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

379 

63.366 

6364 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

296 

63.514 

6365 

AF030357 

Arabidopsis thaliana C-8,7 sterol 
isomerase; aSIl 

136 

50.000 

6366 

AF030357 

Arabidopsis thaliana C-8,7 sterol 
isomerase; aSIl 

194 

38.462 

6367 

X66171 

Homo sapiens CMRF-35 antigen 

179 

43.023 

6368 

X66171 

Homo sapiens CMRF-35 antigen 

800 

62.212 

6369 

AJ000644 

Homo sapiens SPOP 

163 

89.286 | 

6370 

AJ000644 

Homo sapiens SPOP 

2290 | 

95.225 

6371 

AF199413 

Pseudorabies virus thymidine kinase 

72 

33.010 | 

6372 

AF199413 

Pseudorabies virus thymidine kinase 

140 

28.571 1 

6373 

X51605 

Xenopus laevis nuclear actin-binding 
protein chain a 

35 | 

37.500 

6374 

X51605 

Xenopus laevis nuclear actin-.binding 
protein chain a j 

40 

44.444 

6375 

X51605 

Xenopus laevis nuclear actin-binding 
protein chain a 

33 

29.412 

6376 

X51605 

Xenopus laevis nuclear actin-binding 

43 

2 6.667 
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protein chain a 



6377 

X51605 

Xenopus laevis nuclear actin-binding 
protein chain a 

236 

97.368 

6378 

D87969 

Homo sapiens CMP-sialic acid 
transporter 

264 

66.176 

£ "5 o rt 

63 80 

U2 2 3 76 

Homo sapiens alternatively spliced 
product using exon 13 A 

322 

75.000 

Ri 

At V JL U X *i 1 

Homo sapiens neuronal thread protein 

/ C Vi X Mr 

268 

51.111 

6383 

M82829 

Homo sapiens fusion protein 

1828 

83.143 

63 84 

MOO Q*> Q 

Homo sapiens fusion protein 

4517 

76.618 

6385 

K00557 

Homo Bapiens alpha- tubulin 

775 

83.333 

OJtJ/ 

V n nee? 

Homo sapiens alpha- tubulin 

1328 

94.660 

6388 

K00557 

Homo sapiens alpha-tubulin 

1755 

89.174 

bio? 

X92485 

Plasmodium vivax pval 

226 

46.667 

6390 

U22376 

Homo, sapiens alternatively spliced 
product UBing exon 13A 

212 

67.308 


AE001910 

Deinococcus radiodurans hypothetical 
protein 

42 

28.571 


AE001910 

Deinococcus radiodurans hypothetical 
protein 

101 

36.667 


AE001910 

Deinococcus radiodurans hypothetical 
protein 

106 

36.923 

DO J J 

Y7PQOQ 

Homo sapiens zinc finger protein 

183 

34.694 

6396 

AF083132 . 

Porcine adenovirus 3 putative 13.1 kDa 
E3 protein 

128 

30.120 

6398 

L04673 

Saccharomyces cerevisiae phosphatase 

206 

30.178 


AF095737 

Homo sapiens unknown 

201 

50.000 

6400 

X12701 

Homo sapiens PAI precursor polypeptide 

954 

98.039 

6401 

X12701 

Homo sapiens PAI precursor polypeptide 

367 

88.235 

6402 

Z48955 

Didelphis virginiana ORF-2, putative 
RT 

14 5 

41.509 


7AT?rti n i yi/i 
AlrU ZUX44 

Homo sapiens neuronal thread protein 

AJJ /C-WTP 

383 

72.603 

6404 

AF010144 

Homo sapiens neuronal thread protein 

TV T*Y.'"7 KFPT1 

416 

75.610 | 

6406 


Homo sapiens chromosomal protein 

120 

71.429 

6407 

U47856 

Araneus diadematus fibroin-4 

45 

50.000 ! 

6. a. n o 

IT A 1 Q C C 

U4 / 856 

Araneus diadematus fibroin-4 [ 

30 

41.667 

6411 

U47856 

Araneus diadematus fibroin-4 

172 

35.385 1 

6412 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

313 

60.274 

CA T 1 

64 IjJ 

X92485 

Plasmodium vivax pval 

131 

51.282 1 

6*1 JL4 

AB0035 

Oryctolagus cuniculus cysteine rich 
hair keratin associated protein 

156 

41.176 

64 X6 

A80035 

Oryctolagus cuniculus cysteine rich 
hair keratin associated protein 

187 

36.000 j 

6417 

X80035 

Orvetolacrus cuniculus rvqfpi'np ri rVt 
hair keratin associated protein 

JLb J. 

3 7.143 

6419 

L35013 

Homo sapiens spliceosomal protein 

206 

32.934 

6420 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

278 

58.571 

6421 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

54 

25.806 
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6423 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

229 

1 76.190 

6424 

AJ243459 

Leishmania major proteophosphoglycan 

102 

29.565 

6425 

AJ243459 

Leishmania major proteophosphoglycan 

164 

24 .424 

6426 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isof orm 

347 

67.442 

6431 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c) 

273 

50.495 

6432 

U41007 

Caenorhabditis elegans similar to 
mitochondrial precursor proteins 
import receptors 

78 

57. 895 

6434 

U41007 

Caenorhabditis elegans similar to 
mitochondrial precursor proteins 
import receptors 

142 

32.381 

6435 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

189 

78.947 

6436 

AP010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

189 

78.947 

6437 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

157 

68.421 

6438 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

408 

52.899 

6440 

U01849 

Trypanosoma brucei ORF2 

143 

39.286 j 

6443 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

56 

35.000 

6444 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

354 

69.333 

6445 

U41557 

Caenorhabditis elegans histidine-rich 

137 

31.783 

6446 

AF051098 

Mus musculus seven transmembrane 
domain orphan receptor 

137 

28.736 

044 / 

Af 010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

371 

69.512 

6448 

s ft 7 *? o 

Homo sapiens X-linked retinopathy 
procein (L-cerminal, clone XEH.8c} 

251 

64.407 

6451 

X92485 1 

Plasmodium vivax pval j 

195 ! 

57 . 143 

6452 

AB001684 

Chlorella vulgaris ORF49b 

41 | 

28.571 

6453 

AB001684 

Chlorella vulgaris ORF49b 

68 , j 

47.368 

6455 

21171 ti 1 Oil *a 

Homo sapiens CGI -85 protein 

190 1 

83.784 

6456 

X79417 

Sus scrofa 40S ribosomal protein S12 

340 1 

100.000 

04D / 

X79417 

Sus scrofa 40S ribosomal protein S12 

471 [ 

66.187 

6458 

CA C n 

X79417 f 
X79417 

Sus scrofa 40S ribosomal protein S12 

240 

97.222 

C A £1 A 

AF153208 

Sus scrofa 40S ribosomal protein S12 
Homo sapiens GC-rich sequence DNA- 
binding factor candidate 

847 
175 

98.485 
43.056 

(1a (Zi 

o c o *7 o o 
obo 722 

Homo sapiens X-linked retinopathy 
protein {c-terminal, clone XEH.8c) 

234 

56.923 

6462 

Z14014 

Nicotiana tabacum Pistil extensin 
like protein, partial CDS only 

111 

41.463 

6463 

M69297 

Homo sapiens ORF 3 

159 

44.231 

6464 

X16064 

Homo sapiens tumor protein (AA 1 - 
172) 

776 

100.000 

6465 
6466 

X16064 
X16064 ! 

Homo sapiens tumor protein (AA 1 — 
172) 

Homo sapiens tumor protein (AA 1 - 

230 

1080 j 

100.000 
98.837 
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6467 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

61 

23.932 

6468 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

62 

34.694 

6469 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

61 

36.000 

6470 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

108 

33.088 

C A *7 1 

6471 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

366 

65. 854 

D 4 f Z 

TV "Oft *i r» 1 Jl A 

ArOlO 144 

Homo sapiens neuronal thread protein 

AJJ /C~W xP 

61 

29.412 


TTO One 

Homo sapiens alternatively spliced 
proQuct using exon ija 

307 

59.259 

6475 

API 

nomo sapiens unxnown 

143 

39.535 . 

6476 

AFO R1 fiRO 

nomo sapiens supervillin 

6925 

92.048 

6477 

AF051850 

Homo sapiens supervillin 

11827 

100.000 

&A~7 ft 


Homo sapiens supervillin 

10929 

80.759 

6479 

APfi ft Qrt 

Homo sapiens supervillin 

366 

61.616 

64ft 1 

v O X 

AOO J 

Mus musculus HC1 ORF 

134 

34.884 

GA ft 


Mycobacterium bovis unknown 

160 

27.074 

04 

-Air UU^oz9 

Mycobacterium bovis unknown 

203 

28.205 ! 

6485 

AL031133 

Homo sapiens dJ281H8.3 (PUTATIVE novel 
protein) 

260 

43.137 

c a a c 

D64 154 

Homo sapiens Mr 110,000 antigen 

347 

85.484 

o4o / 

D64 154 

Homo sapiens Mr 110,000 antigen 

306 

52.985 


Do4 154 

Homo sapiens Mr 110,000 antigen 

568 

91.667 

CAR Q 

AJoU 1U4 J U 

Aspergillus oryzae HAPB 

163 

30.435 i 

£ a ft n 

O *± 27 V 

Ar U 10 144 

Homo sapiens neuronal thread protein 

J\xJ /C-JM xtr 

289 

58.228 

6491 


Homo sapiens FAP protein 

178 

72.727 

6492 

rjg 3542 

riomo Bapiens rAr protein 

200 

81.818 

6493 

U63 542 

nomo Bdpieno rj\jtr protein 

200 

81.818 

6494 

U63542 

Homo sapiens FAP protein 

200 

81.818 


UDJ D4Z 

Homo sapiens FAP protein 

200 

81.818 

6496 

U63542 

Homo sapiens FAP protein 

200 

81.818 

o4 y i 

U63542 

Homo sapiens FAP protein 

190 

76.471 

6498 

U63542 

Homo sapiens FAP protein 

209 

80.000 

6499 

U63 54 2 

Homo sapiens FAP protein 

200 

81.818 

6502 

D87831 

Mus musculus myocyte enhancer factor 
2B type B-l 

130 I 

33.846 

6503 

D87831 

Mus musculus myocyte enhancer factor 
2B type B-l 

127 

29.258 

6504 

D87831 

Mus musculus myocyte enhancer factor 
2B type B-l 

89 

26.496 

6506 

U22376 

Homo sapiens alternatively spliced 
product usinor exon 13A 

247 

63.014 

6508 

L17308 

Gossypium hirsutum proline-rich cell 
wall protein 

58 

37.037 

6509 

L17308 

Gossypium hirsutum proline-rich cell 
wall protein 

170 

30.000 

6511 

S79410 

Mus Bp. nuclear localization signals ; 
(NLS) -binding protein=spot-l 

122 

56.250 
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6512 

U39318 

Homo sapiens UbcHSC 

101 

93.333 

6513 

U39318 

Homo sapiens UbcHSC 

425 

93.939 

6514 

AF095737 

Homo sapiens unknown 

298 

57.609 

6515 

X72963 

Nicotiana tabacum pAP8 product 

115 

29.412 

6516 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

354 

63.529 

6517 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

114 . 

83.333 

6518 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

182 

62.500 

6521 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

262 

60.294 

6523 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

205 

53.968 

6524 

M15530 

Homo sapiens B-cell growth- f actor 

143 

62.857 

6526 

Z98980 

Schizosaccharomyces pombe wiskott- 
aldrich syndrome protein homolog 1 

215 

28.571 

6527 

A4 6482 

unidentified TAX 

134 

33.750 

6529 

X02954 

Escherichia coli parA (aa 1-398) 

1611 

87.582 

6530 

X02954 

Escherichia coli parA (aa 1-398) 

1031 

87.234 

6531 

X02954 

Escherichia coli parA (aa 1-3 98) 

236 

25.769 

6532 

X02954 

Escherichia coli parA (aa 1-398) 

61 

34.545 

6533 

X02954 

Escherichia coli parA (aa 1-398) 

1797 

80.577 

6534 

X02954 

Escherichia coli parA (aa 1-398) 

758 

94.309 

6535 

X02954 

Escherichia coli parA (aa 1-398) 

989 

54.286 

6536 

X02954 

Escherichia coli parA (aa 1-398) 

1775 

90.968 

6537 

X02954 

Escherichia coli parA (aa 1-398) 

756 

99.138 

6542 

X02954 

Escherichia coli parA (aa 1-398) 

2281 

100.000 

6543 

X02954 

Escherichia coli parA (aa 1-398) 

1782 

99.640 

6544 

X02954 

Escherichia coli parA (aa 1-398) 

2406 

98.681 

6545 

X02954 

Escherichia coli parA (aa 1-398) 

62 

26.190 

6546 

X02954 

Escherichia coli parA (aa 1-398) 

2406 

98.681 

6548 

AE001863 

Deinococcus radiodurans hypothetical 
protein 

55 

38.095 

6549 

AE001863 

Deinococcus radiodurans hypothetical 
protein 

126 

35.616 

6550 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

130 

73.913 

6551 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

488 

69.725 

6552 

U16368 

Sus scrofa skeletal alpha actin 

492 

98.667 

6553 

U16368 

Sus scrofa skeletal alpha actin 

372 

96.429 

6555 

D83485 

Homo sapiens ER-60 protease 

363 

64.706 

6556 

D83485 

Homo sapiens ER-60 protease 

885 

85.326 

6557 

U19247 

Homo sapiens interferon -gamma receptor 
alpha chain 

185 

96.429 

6558 

U19247 

Homo sapiens interferon -gamma receptor 
alpha chain 

35 

23.077 

6559 

U19247 

Homo sapiens interferon- gamma receptor 
alpha chain 

2985 i 

97.741 

6560 

X94438 

Homo sapiens transcription factor 

57 

32.407 | 

6561 

X94438 

Homo sapiens transcription factor 

2807 

98.595 | 

6562 

X94438 

Homo sapiens transcription factor 

1459 

91.429 

6563 

AF009668 

multiple sclerosis associated 

670 

70.803 
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retrovirus polyprotein 



6564 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

326 

56.989 

6566 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

379 

67.368 ! 

6567 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

282 

62.121 

6569 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

432 

68.041 

6573 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

1231 

74.059 

6577 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

276 

61.905 

6579 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

1299 

75.100 

6560 

AF009668 

multiple sclerosis associated, 
retrovirus polyprotein 

995 

53.529 

6581 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

2849 

61.178 

65B2 

AF009668 

multiple sclerosis associated 
retrovirus polyprotein 

1299 

75.100 

6583 

AF117897 

Bos taurus rabll binding protein 

278 

88.000 

6584 

AB033040 

Homo sapiens KIAA1214 protein 

349 

46-457 

6585 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

312 


6586 

M34551 

Homo sapiens 52 -kD Ro/SSA 
ribonucleoprotein 

658 

100.000 

6587 

M34551 

Homo sapiens 52 -kD Ro/SSA 
ribonucleoprotein 

3098 

96.894 

6588 

X52604 

Artemia sp . actin (329 AA) 

692 

82.812 

6589 

X52604 

Artemia sp. actin (329 AA) 

360 

75.000 

6590 

X52604 

Artemia sp. actin (329 AA) 

1172 

70.213 

6591 

X52604 

Artemia sp . actin (329 AA) > 

1172 

70.213 

6592 

X52604 

Artemia sp. actin (329 AA) 

1981 

89.602 

6593 

X52604 

Artemia sp. actin (32 9 AA) 

2013 

95.719 

6595 

X52604 

Artemia sp. actin (329 AA) 

1648 

85.976 

6596 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

421 

68.421 

6597 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

366 

55.932 

6598 

U73819 

Mus musculus polypeptide GalNAc 
transf erase -T4 

1772 

67.102 

6599 

U73819 

Mus musculus polypeptide GalNAc 
transf erase-T4 

1690 

48.947 

6600 

AF061025 

Homo sapiens leucine zipper-EF-hand 
containing transmembrane protein 1 

60 

21.649 

6601 

AF061025 

Homo sapiens leucine zipper-EF-hand j 
containing transmembrane protein 1 

4759 

99.729 

6602 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

258 

56.338 

6603 

M76595 

Homo sapiens erythropoietin receptor 

34 

44 .444 

6604 

M76595 

Homo sapiens erythropoietin receptor 

64 

35.294 

6605 

M76595 

Homo sapiens erythropoietin receptor 

688 

100 .000 

6606 

M76595 

Homo sapiens erythropoietin receptor 

173 

7 8.947 


vat • 


OE!cii&Ob?! 


6607 

X92963 

Homo sapiens ubiqui tin- conjugating 
enzyme UbcH6 

153 

75.862 

6608 

M27364 

Homo sapiens elongation factor 1 alpha 

132 

95.833 

6609 

M27364 

Homo sapiens elongation factor 1 alpha 

125 

87.500 

6610 

M27364 

Homo sapiens elongation factor 1 alpha 

1041 

88.718 

6611 

AF191020 

Homo sapiens E2IG5 

130 

100.000 

6612 

AF191020 

Homo sapiens E2IG5 

303 

59.873 

6613 

S39127 

human, testis, Peptide, 331 aa 
cathepsin S°cysteine proteinase 

467 

100.000 

6614 

S39127 

human, testis, Peptide, 331 aa 
cathepsin S=cysteine proteinase 

2005 

96.096 

6615 

S39127 

human, testis, Peptide, 331 aa 
cathepsin S=cysteine proteinase 

598 

90.526 

DDI / 

O C dill 

ob 8722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.Bc} 

117 

56.667 

DOIO 

Air Zr\j / ddu 

Homo s ap i e n s unknown 

163 

26.271 

O O «L ^ 


Homo sapiens ribosomal protein L38 

215 

100.000 | 

6620 


Homo sapiens ribosomal protein L3 8 

220 

97.222 

6622 

AB007185 

Homo sapiens ribosomal protein L38 

214 

100.000 

DOZ J 

AB007185 

Homo sapiens ribosomal protein L3 8 

215 

100.000 

6624 

AB007185 

Homo sapiens ribosomal protein L3 8 

220 

97.222 

6626 

AB007185 

Homo sapiens ribosomal protein L3 8 

214 

100.000 

6627 

A62731 

unidentified unnamed protein product 

202 

94.118 

6628 

A62731 

unidentified unnamed protein product 

153 

72.727 

6629 

M62387 

Oryctolagus cuniculus ubiqui tin 
conjugating-protein 

862 

92.466 

6630 

M62387 

Oryctolagus cuniculus ubiquitin 
conj ugating-protein 

622 

73 .649 

6631 

M62387 

Oryctolagus cuniculus ubiquitin 
conj ugating-protein 

626 

83.186 

6632 

M62387 

Oryctolagus cuniculus ubiquitin 
conj uga t ing -prot e in 

199 

100.000 

6633 

M62387 

Oryctolagus cuniculus ubiquitin 
conjugating-protein 

538 

79.747 

6634 

X91656 

Mus musculus alternatively spliced; 
truncated 

325 

65.060 

6635 

X91656 

Mus musculus alternatively spliced; 
truncated 

280 

86.667 

6636 

X91656 

Mus musculus alternatively spliced; 
truncated 

701 | 

92.381 

6 63 7 

X91656 

Mus musculus alternatively spliced; 
truncated 

641 

100.000 

6638 

X91656 

Mus musculus alternatively spliced; 
truncated 

405 

73.109 

6639 

AB002330 | 

Homo sapiens KIAA0332 

49 

38.462 

6640 

AB002330 

Homo sapiens KIAA0332 

60 

24.468 I 

6641 

AB002330 

Homo sapiens KIAA0332 

51 

47.619 

6642 

AB002330 

Homo saoieng KTAA01^5 


8 1 . 690 

6643 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

124 

83.333 

6644 

U22376 | 

Homo sapiens alternatively spliced 
product using exon 13A 

175 

44.681 

6645 

U79260 

Homo sapiens unknown 

137 

68.750 

6646 

AE000710 i 

Aquifex aeolicus hypothetical protein 

65 

24.561 j 
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DOf* / 

AcuyO 7 10 

Aquifex aeolicua hypothetical protein 

59 

39.286 

6649 

AB000710 

Aquifex aeolicue hypothetical protein 

262 

26.695 

6652 

X72963 

Nicotiana tabacum pAP8 product 

115 

40.816 

6653 

Y10487 

Homo sapiens stress -activated protein 
kinase-3 

47 

40.741 

6654 

Y10487 

Homo sapiens stress -activated protein 
kinase -3 

978 

55.840 

6655 

Y10487 

Homo sapiens stress-activated protein 
kinase-3 

308 

76.471 

6656 

Y10487 

Homo sapiens stress -activated protein 
kinase-3 

145 

91.667 

6657 

Y10487 

Homo sapiens stress-activated protein 
kinase-3 

43 

83.333 

6658 

Y10487 

Homo sapiens stress-activated protein 
kinase-3 

43 

46.154 

. do zjy 

X 104 87 

Homo sapiens stress-activated protein 

2413 

99.728 


I 1U40 / 

Homo sapiens stress-activated protein 

A.j.ilaoc — J 

585 

61.326 

6661 

JVP0101 4 A 

numo BdpienB neuronax unread protein 

r\u / y^. —Ail x. jtr 

217 

60.714 

6662 

AF007826 


206 

55 . 769 

6663 

M90054 

Homo sapiens ribosomal protein L3 

1209 

80.972 

6664 

Manned 

Homo sapiens ribosomal protein L3 

1918 

84 .223 

6665 

M12759 

Homo sapiens Ig J-chain 

359 

100.000 

DODO 


Homo sapiens Ig J-chain 

30 

36.364 

6667 

M12759 

Homo sapiens Ig J-chain 

681 

91.367 

ez c a o 
Dooo 

M12759 

Homo sapiens Ig J-chain 

179 

100.000 

6669 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

177 1 

66.667 

6670 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

189 

78.947 

6671 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

189 

78.947 

6672 

AF010144 

Homo sapiens neuronal thread protein 
AD7c -NTP 

77 

29.730 

6673 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

122 

90.909 

DO / i 

Ar U1U144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

125 

90.909 

OO/j 

AJf U1U144 

Homo sapiens neuronal thread protein 

t\xj / C— iM lr 

123 

57.576 

W o / o 


Homo sapiens neuronal thread protein 
AD7c-NTP 

373 

44.286 

6678 

** O O 

Plasmodium vivax pval 

136 

70.000 

6679 

S58722 

Homo sapiens X- linked retinopathy 
procem \L-cerminai, clone XEH ,8c) 

217 

64.407 

6682 

D90901 

Synechocystis sp. hypothetical protein 

75 


6683 

D90901 

Synechocystis sp. hypothetical protein 

117 

31.111 

6684 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

190 

63.830 

6685 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

124 

83.333 

6686 

M60119 

Homo sapiens HIV-EP2/Schnurri-2 

12463 

99.945 
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6687 

M60119 

Homo sapiens HIV-BP2/Schnurri-2 

12288 

95.568 

6688 

Z82062 

Caenorhabditis elegans cDNA EST 
yk415cl2.5 comes from this gene; cDNA 
EST yk526h3.3 comes from this gene; 
cDNA EST yk599bl.3 comes from this 
gene 

484 

43.094 

6689 

Z82062 

Caenorhabditis elegans cDNA EST 
yk415cl2.5 comes from this gene; cDNA 
EST yk526h3.3 comes from this gene; 
cDNA EST yk599bl.3 comes from this 
gene 

139 

39.130 

. . . 

6690 

AB002375 

Homo sapiens KIAA0377 

169 

82 .353 

6691 

AF070968 

Caenorhabditis elegans PAR- 6 

135 

35.938 

6695 

M33519 

Homo sapiens HLA-B-assOciated 
transcript 3 (BAT3) 

185 

83.784 

6696 

M33519 

Homo sapiens HLA-B-associated 
transcript 3 (BAT3) 

7379 

99.029 

6697 

AF015043 

Homo sapiens EH-binding protein 

240 

97.368 j 

6698 

AF090326 

Mus musculus AE-1 binding protein 
AEBP2 

298 

100.000 

6699 

AF008442 

Homo sapiens RNA polymerase I subunit 
hRPA3 9 

204 

53.571 

6700 

AFO08442 

Homo sapiens RNA polymerase I subunit 
hRPA39 

2152 

98.559 

6701 

AF121860 

Homo sapiens sorting nexin 10 

40 

40 . 000 

6702 

AF121860 

Homo sapiens sorting nexin 10 

127 

85 . 714 

6703 

D84342 

Homo sapiens SM22 alpha 

736 

99.099 

6704 

D84342 

Homo sapiens SM22 alpha 

644 

65 . 854 

6705 

AB011109 

Homo sapiens KIAA0537 protein 

495 

42 .157 

6706 

AJ005458 

Bos taurus protein Phosphatase 2C beta 

903 

94.964 

6707 

AJ005458 

Bos taurus protein Phosphatase 2C beta 

2525 

96.364 

6708 

AJ005458 

Bos taurus protein Phosphatase 2C beta 

609 

94.681 

6709 

Z34286 

Oryctolagus cuniculus ORF might extend 
further in 5' -direction 

41 

31.579 

6710 

Z34286 

Oryctolagus cuniculus ORF might extend 
further in 5 1 -direction 

416 

39 .535 

6711 

AJ005560 

Mus musculus SPR2B protein 

129 

42 . 000 

6712 

X92968 

Streptococcus pyogenes protein SIC 

172 

4 0.845 

6715 

U93197 

Rattus norvegicus RN protein 

215 

87 . 179 

6716 

U93197. 

Rattus norvegicus RN protein 

2011 

97 . 476 

6717 

X68392 

Borna disease virus p38 /39 

142 

34 . 021 

6718 

AB033007 

Homo sapiens KIAA1181 protein 

57 

3 7. 838 

6719 

AB033007 

Homo sapiens KIAA11B1 protein 

2282 

100.000 

6720 

AB033007 

Homo sapiens KIAA1181 protein 

57 

37 . 838 

6721 

AB033007 

Homo sapiens KIAA1181 protein 

1364 

69.643 

6723 

M15530 

Homo Bapiens B-cell growth factor 

115 

55.000 

6724 

AB007925 

Homo sapiens KIAA0456 protein 

955 

98.592 

6725 

AB007925 

Homo sapiens KIAA0456 protein 

319 

98 * 000 

6726 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

202 

69.231 

6727 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

39 

52.941 

6728 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

299 

52.577 


ions' 
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Ar UJ.Uj.44 

Homo sapiens neuronal thread protein 

AD/C-Ni F 

304 

69,231 

O / -5 X 

TTR Q C c: Q 
U3 OO jO 

Homo sapiens unknown 

166 

65.789 



LaenornaDai lib eiegans T07F12 . 1 gene 

M-i- WUU^. L- 

426 

33 ♦ 617 

6733 

X76116 

v^acnux. iiduui l, x a cicydiiB CclxiTier protein 

<c2) 

ion 

56 . 364 

■ 6734 

AF095737 

Homo Qani pno unVn rium 

69 

55 . 556 

6735 

AF095737 

Homo sapiens unknown 

336 

60.440 

O / J 


Homo sapiens cull in 1 

209 

97.059 

6740 

AC006323 

Homo sapiens cull in 1 

3283 

100.000 

O / ** 1 

i\ /Tinea 

Homo sapiens cullin 1 

3223 

100.000 

O / ** A 


Homo sapiens . UDP-(3alNAc : polypeptide N- 

:a p» f vl nal a /-• 4- /-x q am <t t-i"\.f 1 frano'Pav^na 
aucLyj.yaJ.dL. LUoctullJiyi LraJiBIcraSG 

592 

95.652 i 

6743 


ncjino sapiens xjec.a-sj.gnax sequence 

.1- V— Ji»/ \- W -1- 

310 

100 . 000 

6744 

D37991 

eapjcua JJCLd u j. y lid i. ocilUcIlCc 

receptor 

10 01 

94 . 565 

6745 

AF029061 

Homo sapiens BAT1 

to* 


6746 

AF029061 

Homo s aniens BAT1 


1 f\ t\ AAA 

6747 

Y07569 

Homo Rani ptib DH2iOT^sa rtT*nho-i n 
nuiuu oajf xciiD r iifVr X ^ cl pLULc xll 

9U 

45 . 455 

6748 

Y07569 

Homo satii^nf? pxja p t o -» nrrthpln 

o n 
ZUb 

4 6. 429 

6749 

AL133161 

Homo C5ar^ ^ f»n }rvroot*Vif»t* i r»» 1 ^Vvr-ot-f* -5 -n 

i >i <r 
14D 

49 . 254 

6750 

X92485 

Plasmodium vivax pval 

181 * 

46.739 

6751 

X58099 

DdLL,iidxuinyL.es ccreviBidc 6X^43 

3 84 

74 . 667 

6753 

X58099 

Saccharomyces cerevisiae eIF4a 

455 

76.136 


Yconqq 

Saccharomyces cerevisiae eXF4a 

455 

76.136 

w / 3 3 

YCOAQQ 
A3 O \f j j 

oacLiiaroniyceB cerevisiae exr 4a 

203 

91. 176 

O / 3 O 

A3 OUJ? 

Saccharomyces cerevisiae eIF4a 

167 

60. 563 

O / 3 / 

.TO f» 1 QA 

Homo sapiens hla-dr antigen alpha 

1475 

89.764 

6758 

J00194 

nuinu sapiens nia. — or ancigen aipna 
chain 

1498 

97 . 244 

6759 

D86980 

Homo saDiena 

11 / 

33 . 333 

6760 

D86980 

Homo saoiens KIAA0227 

jDO 

q O OCA 

6761 

D86980 

Homo sapiens KTAAOP ^ 7 

one 

/3 

1 A AAA 

1U0 . ooo 

6762 

L46722 

Homo sapiens BTG1 binding factor 1 

306 

93.878 

6763 

L46722 

Homo sapiens BTG1 binding factor 1 

1366 

96.186 

6764 

/iDU V/ ID O** 

Lnioreiia vuigari s URr o 3 

50 

33 . 333 

6765 

AB001684 

Chlorella vulgaris ORF63 \ 

72 

83.333 

6766 

U12404 

Homo sapiens Csa-19 

594 

68.310 

CI C *7 

U12404 

Homo sapiens Csa-19 

696 

83.111 

o/oo 

Zz2 lo 1 

Caenorhabditis eiegans cDNA EST 

comes rrom tills gene; cDNA 
yjs.z3 joiii . 3 comes rrom cms gene 

47 

25.676 

6769 

Z22181 

L^deiiurndjjai li s eiegans cjjjna ruol 

vk253bl2 3 nnmpn t 7 rnrn fVi< o n pr>A . onuii 
jf . j ^.wiilczo x. i. uut luid y true / 

EST yk253bl2.5 comes from this gene 

118 

28 . 947 

6771 

AB001684 

Chlorella vulgaris ORF49b 

32 

66.667 

6773 

AB001684 

Chlorella vulgaris ORF49b 

62 

42.308 

6774 

AB001684 

Chlorella vulgaris ORF49b 

45 

38.095 

6775 

AB001684 

Chlorella vulgaris ORF49b 

67 

66.667 

6776 

A58791 

unidentified unnamed protein product 

273 

93 . 182 

Sill 

A58791 

unidentified unnamed protein product 

2829 

100.000 


6778 

A58791 

unidentified unnamed protein product 

353 

97.917 

6779 

A58791 

unidentified unnamed protein product 

2446 

88.262 

6780 

AB001684 

Chlorella vulgaris ORF54d 

108 

66. 667 

6781 

X92485 

Plasmodium vivax pval 

259 

55.056 

6782 

AF160864 

Tetrahymena pyriformis orf98 

66 

50.000 

6783 

AF160864 

Tetrahymena pyriformis orf98 

74 . 

52.941 

6784 

AB002371 

Homo sapiens KIAA0373 

3704 

94.896 

6785 

Z77666 

Unknown cDNA EST EMBIi:T01059 comes 
from this gene; cDNA EST EMBL-.D71534 
comes from this gene; cDNA 

502 

37.500 

6786 

AL033502 

Candida albicans hypothetical protein 

74 

22.162 

6787 

AL033502 

Candida albicans hypothetical protein 

61 

24.590 

6788 

AL033502 

Candida albicans hypothetical protein 

63 

21.951 

6789 

AL033502 

Candida albicans hypothetical protein 

54 

43.750 

6791 

AL033502 

Candida albicans hypothetical protein 

50 

38.889 

6792 

AL033502 

Candida albicans hypothetical protein 

49 

21.429 

6793 

AL033502 

Candida albicans hypothetical protein 

60 

22.131 

6794 

AL033502 

Candida albicans hypothetical protein 

54 

18.919 

6795 

AL033502 

Candida albicans hypothetical protein 

57 

22.404 

6797 

AL033502 

Candida albicans hypothetical protein 

270 

27.451 

6799 

X07507 

Xenopus borealis actin 

717 

86.719 

6800 

X07507 

Xenopus borealis actin 

355 

73.684 

6801 

X07507 

Xenopus. borealis actin 

715 

77.848 

6802 

X07507 

Xenopus borealis actin 

396 

95.161 

6803 

X07507 

Xenopus borealis actin 

944 

97.945 

6804 

X07507 

Xenopus borealis actin 

2182 

98.663 

6805 

U39621 

Gallus gallus type V collagen 

84 

34.848 

6806 

U39621 

Gallus gallus type V collagen 

67 

32.530 

6807 

U39621 

Gallus gallus type V collagen 

419 

34 .583 

6808 

Z12840 

Oryctolagus cuniculus protein of 
unknown function 

308 

29.834 

6809 

Z12840 

Oryctolagus cuniculus protein of 
unknown function 

377 

32.955 

6810 

AF047826 

Homo sapiens cadherin-*7 

2059 

100.000 

6811 

U92545 

Gallus gallus tubby 

45 

41.667 

6812 

U92545 

Gallus gallus tubby 

202 

87.500 

6813 

AF139166 

Cavia porcellus prion protein 

129 

31.818 

6814 

X04201 

Homo sapiens skeletal muscle 
tropomyosin (AA 1-2 85) \ 

1012 

77.778 

6815 

X04201 

Homo sapiens skeletal muscle 
tropomyosin (AA 1-285) 

320 

98.148 

6816 

X04201 

Homo sapiens skeletal muscle 
tropomyosin (AA 1-285) 

1550 

99.298 

6817 

M12140 

Homo. sapiens envelope protein 

420 

37.037 

6818 

M12140 

Homo sapiens envelope protein 

186 

30.435 

6819 

M12140 

Homo sapiens envelope protein 

323 

42.029 

6820 

M12140 

Homo sapiens envelope protein 

47 

36.538 

6821 

M12140 

Homo sapiens envelope protein 

1433 

47.863 

6822 

M15841 

Homo sapiens U2 small nuclear . 
ribonucleoprotein B ■ 1 j 

129 

86.364 

6823 

AF152996 

Mus musculus peroxisomal assembly 
protein PEX3 

212 

1O0.0Q0 | 

6825 

X02585 

Xenopus laevis unidentified open 
reading frame 1 (166 aa) 

199 

76.316 



io9v? 


■JL O VHbf «4i& H-31 ,„ O 2 ES i& OS 


6826 

X02585 

Xenopus laevis unidentified open 
reading frame 1 (166 aa) 

201 

53.448 j 

6827 

X02585 

Xenopus laevis unidentified open 
reading frame 1 (166 aa) 

233 

41.129 

. 6828 

AF098788 

Gallus gallus nuclear calmodulin- 
binding protein 

346 

52.893 

6829 

AF098788 

Gallus gallus nuclear . calmodul in- 
binding protein 

64 

29.167 

6830 

AF098788 

Gallus gallus nuclear calmodulin- 
binding protein 

752 

76. 923 

6831 

AF098788 

Gallus gallus nuclear calmodulin- 
binding protein 

65 

35.185 

6832 

AF098788 

Gallus gallus nuclear calmodulin- 
binding protein 

808 

62.255 

6833 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

211 

63.630 

6834 

U22376 

Homo sapiens alternatively spliced . 
product using exon 13 A 

211 

63.830 

6835 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

428 

61.765 

6836 

AF155832 

Homo sapiens adenosine 5 ' - 
diphosphosugar pyrophosphatase 

89 

83.333 

6837 

AF155832 

Homo sapiens adenosine 5 l - 
diphosphosugar pyrophosphatase 

341 

100.000 

6838 

AF155832 

Homo sapiens adenosine 5 1 - 
diphosphosugar pyrophosphatase 

970 

87.611 

6839 

M76744 

Homo sapiens BGP 

113 

62.500 

6840 

U77494 

Homo sapiens RANBP8 

357 

100.000 

6841 

AF070663 

Homo sapiens HSPC007 

223 

100.000 

6843 

X92485 

Plasmodium vivax pval 

145 

60.606 

6844 

X92485 . 

Plasmodium vivax pval 

348 

56.667 

6845 

Y10389 

Xenopus laevis nuclear protein 

134 

24.561 

6846 

U44083 

Human immunodeficiency virus type 1 
rev gene product 

42 

34.483 

6847 

U44083 

Human immunodeficiency virus type 1 
rev gene product 

50 

39.130 

6848 

U44083 

Human immunodeficiency virus type 1 
rev gene product 

91 

55.000 

6849 

X55777 

Homo sapiens put- ORF 

181 

49.153 

6850 

AB001111 

Archaeoglobus fulgidus conserved 
hypothetical transmembrane protein 

42 

25.000 

6851 

AE001111 

Archaeoglobus fulgidus conserved 
hypothetical transmembrane protein 

53 

34.211 

6852 

AE001111 

Archaeoglobus fulgidus conserved 
hypothetical transmembrane protein 

138 

24.603 

6853 

U12204 

Mus musculus phosphoprotein 
phosphatase 

158 

36.667 

6854 

U12204 

Mus musculus phosphoprotein 
phosphatase 

251 

36.429 

6855 

AF103540 

Homo sapiens immunoglobulin kappa 
light chain variable region 

528 

82.653 

6856 

AF103540 

Homo sapiens immunoglobulin kappa 
light chain variable region 

627 

94.000 

6858 

A44601 

Bos taurus unnamed protein product 

266 

88.636 

6859 

AF139768 

Homo sapiens type II transmembrane 

252 

87.805 
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protein MDL-1 



6860 

AL031290 

Homo sapiens dJ774I24.1 (similar to 
PREGNANCY-ASSOCIATED PLASMA PROTEIN A 
PRECURSOR) 

308 

93.182 

6861 

M76744 

Homo sapiens BGP 

101 

68.000 

6862 

AL021748 

Schizosaccharomyces pombe similar to 
dna -damage -Inducible protein 

206 

40.541 

6863 

U39648 

Caenorhabditis elegans T13C5.6 gene 
product 

52 

36.364 

6864 

U39648 

Caenorhabditis elegans T13C5.6 gene 
product 

121 

52.632 

6865 

U39648 

Caenorhabditis elegans T13C5.6 gene 
product 

413 

56.198 

6866 

U39648 

Caenorhabditis elegans T13C5.6 gene, 
product 

309 

53.922 

6867 

U39648 

Caenorhabditis elegans T13C5.6 gene 
product 

208 

39.370 

6668 

AF067219 

Caenorhabditis elegans contains 
similarity to human homeobox protein 
CDX-1 (SW: CDX1 HUMAN) 

109 

35.484 

6869 

M22332 

Homo sapiens unknown protein 

146 

46.341 

6870 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

198 

46.250 

6871 

AB014537 

Homo sapiens KIAA0637 protein 

147 • 

33.784 

6872 

Z72499 

Homo sapiens herpesvirus associated 
ubiquitin-specif ic protease (HAUSP) 

598 

78.740 

6873 

272499 

Homo sapiens herpesvirus associated 
ubiquitin-specif ic protease (HAUSP) 

262 

100.000 

6875 

AF162472 

Mus musculus inducible protein 

199 

80.000 

6876 

AF038535 

Homo sapiens synaptotagmin VII 

239 

24.254 I 

6877 

AF112476 

Chlamydomonas reinhardtii outer arm 
dyne in light chain 1 

120 

36.508 

6879 

AB002349 

Homo sapiens KIAA0351 

56 

29.032 

6880 

AB002349 

Homo sapiens KIAA0351 

229 

100.000 

6881 

U93567 

Homo sapiens p40 

454 

50.735 

6882 

AL021747 

Schizosaccharomyces pombe hypothetical 
protein 

162 

36.842 

6883 

247087 

Homo sapiens RNA polymerase II 
elongation factor-like protein 

214 ; 

100.000 

6885 

M25246 

Homo sapiens vimentin 

266 

100.000 

6886 

M25246 

Homo sapiens vimentin 

927 

72.318 

6887 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

319 

67.606 . 

6888 

AF153208 

Homo sapiens GC-rich sequence DNA- 
binding factor candidate 

295 

92.857 

6889 

AF153208 

Homo sapiens GC-rich sequence DNA- 
binding factor candidate 

159 

95.833 

6890 

U77948 

Homo sapiens Bruton's tyrosine kinase- 
associated protein-135; BAP-135 

6189 

95.491 

6891 

U77948 

Homo sapiens Bruto^s tyrosine kinase - 
associated protein-135; BAP-135 

355 

73.077 

6892 

U77948 

Homo sapiens Bruto^s tyrosine kinase- 
associated protein-135; BAP-135 

6242 

97.444 

6893 

U77948 

Homo sapiens Bruton'B tyrosine kinase- 
associated protein-135; BAP-135 

133 

100.000 
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6B94 

U77948 

Homo sapiens Bruton's tyrosine kinase- 
associated protein- 135; BAP-135 

6209 

99.791 

6896 

AB015630 

Homo sapiens type II membrane protein 

388 

48.819 

6897 

D26076 

Oryctolagus cuniculus chloride channel 

384 

89.394 

6898 

D26076 

Oryctolagrus cuniculus chloride channel 

1235 

90.377 

6900 

X76538 

Homo sapiens hMpvl7 

130 

100.000 

6901 

J03756 

Homo sapiens hGH-V2 

373 

62.393 

6903 

J03756 

Homo sapiens hGH-V2 

508 

60.927 

6904 

J03756 

Homo sapiens hGH-V2 

961 

79.670 

6905 

J03756 

Homo sapiens hGH-V2 

886 

87.117 

6906 

J03756 

Homo sapiens hGH-V2 

361 

62.821 

6907 

AiJ004832 

Homo sapiens neuropathy target 
esterase 

183 

59.524 

6908 

AJ004832 

Homo sapiens neuropathy target 
esterase 

1635 

63.683 

6909 

AF032666 

Rattus norvegicus rsec5 

507 

90.805 

6912 

AB002368 

Homo sapiens KIAA0370 

56 

41.667 

6913 

AB002368 

Homo sapiens KIAA0370 

300 

74.324 

6914 

AB002368 

Homo Bapiens KIAA03 70 

429 

100.000 

6915 

AB002368 

Homo sapiens KIAA0370 

4330 

96.076 

6916 

X91242 

Xenopus laevis XrpFIbetal 

38 

38.889 

6917 

X91242 

Xenopus laevis XrpFIbetal 

134 

42 .424 

6919 

AL050018 

Homo sapiens hypothetical protein 

2504 

100.000 

6920 

U57317 

Homo sapiens p300/CBP-associated 
factor 

5276 

97.962 

6921 

U57317 

Homo sapiens p300/CBP-associated 
factor 

371 

100.000 

6922 

Z74990 

Saccharomyces cerevisiae ORF YOR082C 

115 j 

52.632 

6923 

AL049925 

Homo sapiens hypothetical protein 

407 

68 .354 

6924 

AL049925 

Homo sapiens hypothetical protein 

199 

56.000 

6925 

AC007182 

Homo sapiens unknown 

415 

100.000 

6926 

AF003535 

Homo sapiens ORF2-like protein 

200 

60.417 

6927 

AF007826 | 

Homo sapiens bax epsilon 

186 

70.588 

6928 

AF007826 i 

Homo sapiens bax epsilon 

186 

70.588 

6929 

AF007826 | 

Homo sapiens bax epsilon 

205 

42.308 

6930 

AF007826 

Homo sapiens bax epsilon 

127 

77.273 

6931 

AF007826 

Homo sapiens bax epsilon 

192 

63.043 

6932 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

43 

26.733 

6934 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

238 

70.213 

6935 

L08483 

Drosophila melanogaster ring canal 
protein 

869 

31.327 

6936 

AB032984 

Homo sapiens KIAA1158 protein 

334 

100.000 | 

6937 

AL022018 

Unknown 

/predict ion« (method : " "genscah" " , 
version: WM 1.0 wn # score: n w 294 . 38 nw ) ; 
/match= (desc : " "THIAZ 

338 

27.273 

6938 

AF045646 j 

Caenorhabditis elegans contains 
similarity to collagens 

125 

38.889. 

6940 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

310 

61.957 

6941 

U76759 

Mus musculus nuclear protein NIP45 

295 

65.823 ! 

6942 

U76759 

Mus musculus nuclear protein NIP45 

1199 

80.702 
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6943 

U88 964 


o o 
232 

51 • 136 

6944 

AB028994 

Homo sapiens KIAA1071 protein 

2729 

95.043 



Homo sapiens KIAA1071 protein 

3001 

100,000 

O O 


Homo sapiens transmembrane receptor 

UXM LOv, f 

54 . 

21.875 

6947 

AF055634 

Homo sapiens transmembrane, receptor 
UNC5C 

6322 

100.000 

6948 


nvuiu Ddpiclib U J. ciilcflllcllUJX cili t: rcCcptOr 

UNC5C 

524 

76 . 636 

6949 

AB018324 

Homo s aniens KTAA07fi1 nrohp^ 


100 .000 

6950 

X00824 

Gallus aal 1 tin pnl lanpn 

ft ^ 

43 . 333 

6951 

U2 2 3 7 6 

Homo saDipna. al f^rnaf i vpI v qnl {roH 
Droduct usina exon 13 A 

4 X / 

71 . 264 

6952 

AF061022 

Homo sapiens CTH 

859 

96.350 

6953 

S79410 

Mil *t qt") nurl paT 1 f^r* a 1 H i i-vti o -5 a 1 a 
» juo d^s. iiuLicoi. lVwd,Ll^citlUli t> Xgi la X 3 

(NLS) -binding protein=spot-l 

11 o 

35 . 593 

6954 

L26953 

W1IIU DCIJ^XCHCI l—XXX. UlUwoUf IlCt X pi OLclIl 

43 

30 .233 

6955 

L26953 

Homo sapiens chromosomal protein 

123 

55.172 

6957 


nomo sapiens maai^Uo protein 

140 

71.429 

6958 

J05016 

Homo sapiens protein disulfide 
iBotuerase-reiatca procexn 

306 

100.000 

6959 

J05016 

Homo sapiens protein disulfide 
isomerase-reiatea protexn 

3798 

96.918 

6961 

Z49199 

Homo sapiens RNA polymerase II subunit 
nxirxJi / 

774 

97.333 



Human papillomavirus type 4 7 E1/E4 

XUoJ.(JU pruLcin 

174 

54.000 

6965 

U2 2376 

nuiuu bdpieiiB aiternaEiveiy spiiC6Q 
product using exon 13A 

66 

27 . 586 

6966 

U2 2376 

nuiiicj bapieno aicernauivej.y spj-icect 
product using exon 13A 

431 

71.579 

6967 

X03145 

Hrtffin oani An a nnf" HDP \TT 
Dapxcllo pub • \Jt\.t* VI 

48 

32 .258 

6968 

X03145 

Homo sapiens pot. ORF VI 

380 

£8.235 

6969 

X03145 

«omo uapxens pot . sjkj* vx 

367 

69.620 ] 

6970 

U22376 

Homo sapiens alternatively spliced 
prouuci. usxng exon ua 

54 

24.000 

6971 

U22376 

Homo sapiens alternatively spliced 
pxoaucu usxng exon jlja 

56 

24.324 

6972 

U22376 

Homo sapiens alternatively spliced 
pxouuut usxng exon ua 

80 

27.273 

6973 

U22376 

Homo sapiens alternatively spliced 
pxuaucL usxng exon i ja 

70 

24.286 

6974 

U22376 

Homo sapiens alternatively spliced 
tr^- vJLXLii^ u usx jig exon ija 

68 | 

32.558 

6976 

U22376 1 

xivviuu oopxcns ax Lcrnat xvexy spxxceu 
Droduct nnina pvnn i 7fl 

226 

72 .549 

6977 

U22376 

nviiiw Dapxciio aXLCXUaLXVcxy SpXXCeQ 

product using exon 13A 

100 

65 .385 

6978 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

269 

79.630 

6979 

AF109906 

Mus musculus NG22 

302 

34.351 

6981 

AF109906 

Mus musculus NG22 

402 

29.524 

6982 

AF109906 

Mus musculus NG22 

285 

37.226 

6984 

X86031 

Sus scrofa homologue of 

132 

37.209 
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proline/arginine-rich antibacterial 
protein 


! 

CQQ7 

TV x?n i i m 

Bos taurus chitinase-like protein 1 . 

1930 

83.735 

by oo 

AJrOlU /3 

Bos taurus chitinase-like protein 1 

1943 

84.940 



Bos taurus chitinase-like protein 1 

246 

78.261 

6991 

AF008196 

Homo sapiens bax epsilon 

110 

54.167 

6992 

AF008196 

Homo sapiens bax epsilon 

43 

40.000 

6993 

U79260 

. Homo sapiens unknown ' 

140 

73.077 

6994 

U79260 

Homo sapiens unknown 

338 

66.667 

6997 

X58197 

Zea mays 27kDa storage protein, zeih 

105 

32.222 

6998 

X58197 

Zea mays 27kDa storage protein, zein 

128 

34.884 

6999 

M35258 

Rattus norvegicus malic enzyme (EC 
1.1.1.40) 

77 

63.158 

7000 

AC004883 

Homo sapiens similar to KIAA0766; 
similar to PID:g3882253 

283 

26.718 

7001 

AF125176 

Mus musculus angiopoiet in- related 
protein-2 

61 

53.333 

7002 

AF125176 

Mus musculus angiopoietin-related 
protein-2 

430 

58.763 

7003 

AB032956 

Homo sapiens KIAA113 0 protein 

3936 

100.000 

7004 

AB032956 

Homo sapiens KIAA113 0 protein 

1868 

85.758 

7005 

AF139923 

Mus musculus receptor-activated 
calcium channel 

576 

76.119 

7006 

AF139923 

Mus musculus receptor-activated 
calcium channel 

1560 

97.166 

7007 

M38435 

Brugia malayi collagen-like antigen 

111 

33.981 

7008 

*r y\ r* y\ a 

L05090 

Homo sapiens ribosomal protein S13 

103 

100.000 

7010 

UO9608 

Homo sapiens cell surface protein 

774 

100.000 

7011 

U09608 

Homo sapiens cell surface protein 

870 

91.018 

7012 

D50372 

Homo sapiens myosin regulatory light 
chain 

879 

85.549 

7013 

D50372 

Homo sapiens myosin regulatory light 
chain 

549 

73.913 

7014 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

41 

54.545 

7015 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

55 

34.783 

7016 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

55 

34.783 

7017 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

55 

34.783 

7018 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

55 ; 

34.783 

7019 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

55 

34.783 

7020 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

610 

69^784 

7021 

U19940 

Ft*ao A T""i it ^n^ris ooa niit-3t"tTra a rvo 
*■ »yc*i -Lev jv» c*iiciiJcifc>fc>Gi putative 4Uo 

ribosomal protein sl2 

49 

27 .778 

7022 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

45 

66.667 

7023 

U19940 

Fragaria x ananassa putative 4 OS 
ribosomal protein sl2 

46 

27.027 

7024 

U19940 

Fragaria x ananassa putative 4 OS 

58 

3 8.095 
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ribosomal protein sl2 



7025 

U19940 

Fragaria x ananas s a putative 4 OS 
ribOBomal protein sl2 

232 

81.818 

7026 

U19940 

Fragaria x ananassa putative 40S 
ribosomal protein sl2 

57 

40.741 

7028 

U09116 

Homo sapiens ORF1, encodes a 40 kDa 
product 

122 

66.667 

/ U ^ J 


womo sapiens kiaaoxo/ 

385 

60.284 

7030 

D80009 

Homo sapiens KIAA0187 

743 

66.176 

/Q31 

D80009 

Homo sapiens KIAA0187 

663 

98.131 

/U32 

T^O AAA n 

Homo sapiens KIAA0187 

8360 

99.766 

7033 

D80009 

Homo sapiens KIAA0187. 

257 

100.000 

7034 

M22332 

Homo sapiens unknown protein 

238 

67.308 

7035 

AF09S737 

Homo sapiens unknown 

300 

56.322 

7036 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

182 

77.500 

7037 

X92485 

Plasmodium vivax pval 

154 

67.568 

7038 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

99 

50.000 

7039 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

61 

42.222 

7040 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

357 

73.034 

7041 

D82361 

Homo sapiens MSSP-2 

1734 

92 .708 

7042 

D82361 

Homo sapiens MSSP-2 

1856 

98.264 

7043 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

183 

71.429 

7045 

AL117646 

Homo sapiens hypothetical protein 

570 

69.828 

7046 

AF103939 

Homo sapiens echinoderm microtubule - 
associated protein-like EMAP2 

244 

31.624 

7047 

AF103939 

Homo sapiens echinoderm microtubule - 
associated protein-like EMAP2 

107 

42.000 

7048 

AF103939 

Homo sapiens echinoderm microtubule- 
associated protein-like EMAP2 

658 

28.656 

7049 

AF103939 

Homo sapiens echinoderm microtubule - 
associated protein- like EMAP2 

813 

32.037 

7050 

AF085357 

Homo sapiens flotillin 

456 

64.130 j 

7051 

AF085357 

Homo sapiens flotillin 

2649 | 

99.766 I 

7052 

AF085357 

Homo sapiens flotillin 

335 

100.000 

7053 

AF085357 

Homo sapiens flotillin 

2067 

89.838 

7054 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

362 

61.856 

7055 

AP000060 | 

Aeropyrum pernix lOOaa long 
hypothetical protein 

102 

47.500 

7056 

Z99753 

Schizosaccharomyces pombe putative 
beta-mannosyltransf erase 

282 

4 6.875 

7057 

Z99753 

Schizosaccharomyces pombe putative 

u c*. luauiiuo y -l cuioicx aoc 

45 

30.000 

7058 

Z99753 

Schizosaccharomyces pombe putative 
beta-mannosyltransf erase r 

232 

38.136 

7059 

Z99753 

Schizosaccharomyces pombe putative i 
beta -mannosyl trans f eras e 

319 

3 6.123 

7060 

AB007888 

Homo sapiens KIAA0428 

1674 

69.415 

7061 

AB007888 

Homo sapiens KIAA042 8 

204 

65.909 
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7062 

AB007888 

Homo sapiens KIAA0428 

217 

47.761 

7063 

AF004327 

Homo sapiens angiopoietin-2 

465 

100.000 

7064 

AF004327 

Homo sapiens angiopoietin-2 

52 

38.095 

7065 

AF004327 

Homo sapiens angiopoietin-2 

449 

89.189 

7066 

AF004327 - 

Homo sapiens, angiopoietin-2 

3294 

100.000 

7067 

AF004327 

Homo sapiens angiopoietin-2 

1240 

68.354 

7068 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

601 

94.565 

7069 

M33666 

Homo sapiens pregnancy-specific beta- 
1-glycoprotein 6 

530 

100.000 

7070 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

980 

75.234 

7071 

M33666 

Homo sapiens pregnancy- specif ic beta- 
1 -glycoprotein 6 

1101 

81.308 

7072 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

853 

56.727. 

7073 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

125 

77.778 

7074 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

1595 

59.881 

7075 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

535 

86.957 

7076 

M33666 

Homo sapiens pregnancy- specific beta- 
i -glycoprotein 6 

2496 

90.338 

7077 

M33666 

Homo sapiens pregnancy-specific beta- 
1 -glycoprotein 6 

926 

52.890 

7078 

M33666 

Homo sapiens pregnancy- specific beta- 
1 -glycoprotein 6 

1344 

71. 667 

7079 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

374 

70.330 

7080 

AB028972 

Homo sapiens KIAA1049 protein 

847 

76.966 

7081 

AB028972 

Homo sapiens KIAA1049 protein 

368 

96.296 

7082 

AB028972 

Homo sapiens KIAA1049 protein 

3481 

99.093 

7083 

X17652 

Homo sapiens IgG Fc receptor 

359 

82.812 

7084 

AF170726 

Myxoma virus ml47R 

217 

34.167 

7085 

AF170726 

Myxoma virus ml47R 

399 

31.905 

7086 

AJ010063 

Homo sapiens telethonin 

225 

86.842 

7 0 87 

AJ010063 

Homo sapiens telethonin 

59 

30.667 

7088 

AL117579 

Homo sapiens hypothetical protein 

61 

34.783 

7089 

AL117579 

Homo sapiens hypothetical protein 

877 

70.536 

7090 

AL117579 

Homo sapiens hypothetical protein 

1729 

100.000 

7091 

Z81490 

Unknown similar to WD domain, G-beta 
repeats (2 domains); cDNA EST 
EMBL:T00482 comes from this gene 

134 . 

47.059 

7092 

S77772 

Homo sapiens aspartylglucosaminidase , 
AGA {C- terminal, alternatively 
spliced} {EC 3.5.1.26} 

135 

81.818 

7093 

AC0043 92 

Arabidopsis thaliana Similar to 
gb|AF072908 calcium- dependent protein 
kinase from Nicotiana tabacum. 

59 

32.500 

7094 

AC0043 92 

Arabidopsis thaliana Similar to 
gb|AF072908 calcium- dependent protein 
kinase from Nicotiana tabacum. 

116 

27.049 j 

7095 

AC004392 

Arabidopsis t"haliana Similar to 

299 

40.351 
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gb|AF072908 calcium -dependent protein 
kinase from Nicotiana tabacum. 



7096 

AC004392 

Arabidopsis thaliana Similar to 
gb|AF072908 calcium-dependent protein 
kinase from Nicotiana tabacum. 

71 

28.846 

7097 

L31920 

Mub spretus recognized by HS-63 
monoclonal antibody- 

161 

27.368 

7099 

D30648 

Homo sapiens f lavoprotein subunit of 
complex II 

43 

32.353 

7100 

D30648 

Homo sapiens flavoprotein subunit of 
complex II 

43 

34.483 

7101 

D30648 

Homo sapiens flavoprotein subunit of 
complex II 

178 

89.655 

7102 

D30648 

Homo sapiens flavoprotein subunit of 
complex II 

290 

65.672 

7103 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

133 

74.194 

7105 

X13338 

Homo sapiens plasminogen activator 
inhibitor 1 

270 

100. 000 

7106 

X13338 

Homo sapiens plasminogen activator 
inhibitor 1 

529 

91.111 

7107 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

361 

68.919 

7108 

M76744 

Homo sapiens BGP 

34 

22.500 

7110 

M76744 

Homo sapiens BGP 

99 

61.538 

7112 

AL031447 

Homo sapiens dJ126A5-2.1 (novel 
protein) (isoform 1) 

236 

23 .506 

7114 

K02569 

Homo sapiens gamma fibrinogen type A 
(AA at 202) 

213 

100. 000 

7115 

K02569 

Homo sapiens gamma fibrinogen type A 
(AA at 202) 

924 

96.078 

7116 

S58722 

Homo sapiens X- linked retinopathy 
protein {c-terminal, clone XEH.8c} 

266 

62.651 \ 

7118 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

288 

63.095 

7119 

S58722 

Homo sapiens X-linked retinopathy 
protein {c-terminal, clone XEH.8c} 

251 

67.123 

7121 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

192 

52.564 

7122 

S5B722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

328 

70.130 

7123 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

290 

52.427 

7124 

S58722 

Homo sapiens X-linked retinopathy 
protein { C-terminal, clone XEH.8c} 

336 

68.675 

7125 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH»8c} 

287 

63.855 

7126 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XKH.8c} 

265 

65.278 

7127 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XErt.8c} 

279 

65.060 

7128 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

273 

65.789 

7129 

S58722 

Homo sapiens X-linked retinopathy 
protein {c-terminal, clone XEH.8c} 

282 

64.286 



7130 

S58722 

Homo sapiens X- linked retinopathy 
protein { C- terminal , clone XEH.8c} 

113 

68.000 j 

7132 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

302 

62.791 

7135 

U46859 

Yersinia enterocolitica (type 0:8) 
Wbcl 

13 8 

29.348 

7137 

L07810 

Homo sapiens dynamin 

319 

95.556 

7138 

L07810 

Homo s api ens dynami n 

866 

100. 000 

7139 

D83206 

Mus musculus P24 protein 

220 

91.667 

7140 

U10485 

Homo sapiens lymphoid- restricted 
membrane jprotein 

57 

40.625 

7141 

U10485 

Homo sapiens lymphoid-restricted 
membrane protein 

3535 

98.746 

7142 

U10485 

Homo sapiens lymphoid-restricted 
membrane protein 

311 

94.231 

7143 

AF163302 

Homo sapiens somatostatin receptor 
interacting protein splice variant a 

6066 

97.773 

7144 

X16576 

Homo sapiens KUP protein 

461 

32.493 

7145 

AL031431 

Homo sapiens dJ462023.1 (novel 
protein) . 

751 

75.145 

7146 

AL031431 

Homo sapiens d«J462023.1 (novel 
protein) 

1767 

77.528 

7147 

L27428 

Homo sapiens reverse transcriptase 

167 

62.500 

7148 

X58338 

Arabidopsis thaliana glycine rich 
protein 

440 

58.559 

7149 

X58338 

Arabidopsis thaliana glycine rich 
protein 

438 

58.716 

7150 

AF021351 

Homo sapiens RNA polymerase III 
largest subunit 

441 

100.000 

7151 

S67247 

Homo sapiens smooth muscle myosin 
heavy chain isoform SMemb 

159 

88.889 

7152 

Z85981 

Brugia malayi actin » 

225 

94.118 

7153 

Z85981 

Brugia malayi actin 

2197 

95.467 

7154 

S58722 

Homo sapiens X- linked retinopathy 
protein {C- terminal, clone XEH.8c} 

101 

60.714 

7155 

AF072508 

Homo sapiens envelope protein 

36 

38.462 

7156 

AF072508 

Homo sapiens envelope protein 

47 

60.000 

7157 

AF072508 

Homo sapiens envelope protein 

14 6 

46.667 

7158 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

363 

71.233 

7160 

U20106 

Rattus norvegicus synaptotagmin VII 

460 

83.908 

7161 

X52164 

Mus musculus Q300 protein (AA 1-77) 

130 

76.190 

7162 

U79278 

Homo sapiens protein disulfide 
isomerase-related protein 5 

149 

96.000 

7163 

U79278 

Homo sapiens protein disulfide 
isomerase-related protein 5 

2767 

99.525 

7164 

U79278 

Homo sapiens protein disulfide 
isomerase-related protein 5 

250 

87.500 

7165 

U79278 

Homo sapiens protein disulfide 
isomerase-related protein 5 

2414 

94.588 

7166 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

40 

23.077 

7167 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

326 

51.695 



7168 

X62839 

Rattus rattus voltage-gated potassium 
channel 

142 

75*000 

7169 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

142 

79.167 

7170 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

502 

86.905 

7173 

AF048977 

Homo sapiens Ser/Arg-related nuclear 
matrix protein . 

216 

36.250 

7174 

U23502 

Plasmodium chabaudi chabaudi P0M1 

679 

46.383 

7175 

U23502 

Plasmodium chabaudi chabaudi POM1 

329 

41.071 

7176 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

184 

67.857 

7177 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

110 

31.532 

7178 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

402 

71.154 

7179 

U58736 

Caenorhabditis elegans Similar to 
cuticular collagen 

141 

37.681 

7180 

U58736 

Caenorhabditis elegans Similar to 
cuticular collagen 

85 

31.034 

7181 

X12883 

Homo sapiens cytokeratin 18 (424 AA) 

1197 

81.181 

7182 

X12883 

Homo sapiens cytokeratin 18 (424 AA) 

1197 

81.181 

7183 

X12883 

Homo sapiens cytokeratin 18 (424 AA) 

43 

23.256 

7184 

X12883 

Homo sapiens cytokeratin 18 (424 AA) 

2237 

97.362 

7185 

S58722 

Homo sapiens X-linked retinopathy 
protein (c-terminal, clone XEH. 8c} 

302 

59.596 

7186 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH. 8c} 

200 

73.333 

7187 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal # clone XEH. 8c} 

321 

62.651 

7188 

X14420 

Homo sapiens prepro- alpha -1 type 3 
collagen 

101 

33.333 

7189 

X14420 

Homo sapiens prepro- alpha -1 type 3 
collagen 

62 

35 .714 

7190 

X14420 

Homo sapiens prepro- alpha -1 type 3 
collagen 

238 

32.153 

7191 

AF071081 

Mycobacterium tuberculosis proline - 
rich mucin homolog 

202 

27.188 

7192 

Z46913 

Streptomyces ambofaciens polyketide 
synthase 

165 

36.301 | 

7193 

Z46913 

Streptomyces ambofaciens polyketide 
synthase 

144 

34 .307 

7197 

U97553 

murine herpesvirus 68 unknown 

221 

33 .898 

7199 

X60382 

Homo sapiens collagen Bubunit ( alpha - 
1 (X)) 3 

165 j 

29.123 

/2 02 

X55777 

Homo sapiens put . ORF 

234 

55 .263 


K02zl2 

Homo sapiens alpha- 1-anti trypsin 

168 

96 .296 

7206 

K02212 
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04 . /UD 

7208 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

330 

63.830 

7211 

AF163302 

Homo sapiens somatostatin receptor 
interacting protein splice variant a 

156 

33.884 

7213 

AF079967 

Phytomonas serpens MURF1 

154 

24.691 

7219 

U22376 | 

Homo sapiens alternatively spliced 

309 

56.190 
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product using exon 13 A 



7222 

AF003535 

Homo sapiens ORF2-like protein 

168 

33 .333 

7223 

AJ245453 

Mus musculus putative protein kinase 

321 

35.909 

7225 

U22376 . 

Homo sapiens alternatively spliced 
product using exon 13 A 

149 

40.541 

7228 

M13002 

Mus musculus 2 855 is the position of 
the first start codon in ORF 2; 
putative 

171 

60.784 

7230 

AB018705 

Mus musculus ORF2 

208 

34 . 862 

7231 

D38595 

Homo sapiens inter-alpha-trypsin 
inhibitor family heavy chain-related 
protein (IHRP) 

5175 

89.032 

7232 

D38595 

Homo sapiens inter-alpha-trypsin 
inhibitor family heavy chain-related 
protein (IHRP) 

333 

57.692 

7233 

D38595 

Homo sapiens inter-alpha-trypsin 
inhibitor family heavy chain-related 
protein (IHRP) 

5875 

98.498 

7237 

X61048 

Hydra sp. mini-collagen 

153 

33.750 

7238 

AF028832 

Homo sapiens Hsp89-alpha-delta-N 

665 

89.076 

7239 

AF028832 

Homo sapiens Hsp89-alpha-delta-N 

600 

54.086 

7240 

AF028832 

Homo sapiens Hsp89-alpha-delta-N 

2535 

92.650 

7242 

AF145679 

Drosophila melanogaster BcDNA.LD22726 

219 

37.363 

7248 

AF042800 

Homo sapiens suppressor of white 
apricot homolog 2 

83 

32.308 

7249 

AF042800 

Homo sapiens suppressor of white 
apricot homolog 2 

201 

30.579 

7251 

AF058807 

Bos taurus GTP-binding protein rah 

649 

72.121 j 

7252 

AF058807 

Bos taurus GTP-binding protein rah 

221 

54 .762 

7253 

AF058807 

Bos taurus GTP-binding protein rah 

645 

65.104 

7254 

AP058807 

Bos taurus GTP-binding protein rah 

652 

69.500 

7255 

S63654 

Mus Bp. type VII collagen 

61 

37.500 

7256 

S63654 

Mus sp. type VII collagen 

92 

30.275 

7260 

AF053091 

Drosophila melanogaster eyelid 

259 

30.000 

7261 

D50857 

Homo sapiens DOCK180 protein 

881 

72.959 | 

7262 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

115 

90.000 

7263 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

387 

62.264 

7266 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

382 

57.391 

7267 

U61538 

Homo sapiens calcium-binding protein 
chp 

133 

31.579 

7269 

AF010144 i 

Homo sapiens neuronal thread protein 
AD7C-NTP 

388 

60.377 

7273 

AF003535 

Homo sapiens ORF2-like protein 

151 

59.322 

7274 

X89267 

Homo sapiens uroporphyrinogen 
decarboxylase ; 

661 

100.000 

7275 

X89267 

Homo sapiens uroporphyrinogen 
decarboxylase 

1416 

95.720 

7276 

K03332 

Human herpesvirus 4 nuclear antigen 2 

104 

31.333 

7277 

K03332 

Human herpesvirus 4 nuclear antigen 2 

152 

31.707 

7279 

M22334 

Homo sapiens unknown protein 

153 

40.000 

7280 

M22334 

Homo sapiens unknown protein 

261 

28.280 
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7281 

M14144 

Homo sapiens vimentin 

552 

72.932 

7282 

M14144 

Homo sapiens vimentin 

2018 

85.895 

7283. 

U93572 

Homo sapiens putative p!50 

173 

28.829 

7284 

U93572 

Homo sapiens putative pi 50 

1247 

48.430 

7285 

AB012223 

Canis f amiliaris ORF2 

321 

42.775 

7287 

AL033534 

Schizosaccharomyces pombe serine-rich 
protein 

237 

29.492 

7288 

AB033031 

Homo sapiens KIAA1205 protein 

103 

33 .613 

7289 

AB033031 

Homo sapiens KIAA1205 protein. 

87 

28.571 

7292 

X55777 

Homo sapiens put . ORF 

234 

55.263 

7297 

AC004665 

Arabidopsis thaliana unknown protein 

186 

31.818 

7302 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

314 

61.165 

7303 

AC002291 

Arabidopsis thaliana Similar ATP- 
dependent RNA Helicase 

197 

26.887 

7304 

AC002291 

Arabidopsis thaliana Similar ATP- 
dependent RNA Helicase 

674 

39.634 

7305 

U07932 

Homo sapiens AF-17 

118 

24.573 

7307 

Z95976 

Homo sapiens RNA editing deaminase 1 

174 

67.442 

7308 

Z78542 

Unknown cDNA EST EMBL:D72182 comes 
from this gene; cDNA EST EMBL:D72353 
comes from this gene; cDNA 

57 

32,075 

7309 

Z78542 

Unknown cDNA EST EMBL:D72182 comes 
from this gene; cDNA EST EMBL:D72353 
comes from this gene; cDNA 

201 

31.788 

7310 

U85494 

Zea mays LON1 protease 

1519 

55.625 

7315 

AF010144 . 

Homo sapiens neuronal thread protein 
AD7C-NTP 

182 

42.400 

7316 

L24521 

Homo sapiens transformation-related 
protein 

291 

68.182 

7321 

X03484 

Homo sapiens raf protein (aa 1-648) 

3688 

95.699 

7327 

AL079308 

Streptomyces coelicolor putative 
serine/threonine protein kinase 

95 

29.375 

732 8 

AF051944 

Gallus gallus Xin 

346 

25.000 

7330 

L20249 

Streptomyces coriofaciens homologous 
to Saccharopolyspora erythraea beta- 
ketoacyl synthase 

150 

32.812 

7332 

J03770 

Mus musculus homeobox protein 

150 

36.782 

7334 

AF000996 

Homo sapiens ubiquitous TPR motif, Y 
isoform 

250 

62.319 

7335 

U07224 • 

Brugia malayi a2 (IV) basement 
membrane collagen 

191 

31.667 

7338 

AF010144 . 

Homo sapiens neuronal thread protein 
AD7C-NTP 

314 

45.699 

7342 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

297 

58.163 

7343 

U25281 

Rattus norvegicus SH3 domain binding 
protein 

203 

27.706 

7344 

AB001971 

Deinococcus radiodurans hypothetical 
protein 

172 

27.551 

7345 

AL133220 

Streptomyces coelicolor A3 (2) putative 
multi- domain regulatory protein. 

170 

2 8.814 

7346 

AF062655 

Mus musculus plenty- of -prolines -101; 
POP101; SH3-philo-protein - 

171 

24.046 
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7347 

AF062655 

Mus musculus plenty-of -prolines-101; 
POP101; SH3-philo-protein 

99 

30.303 

7349 

X72579 

Homo sapiens collagen X 

89 

33.333 

7350 

X72579 

Homo sapiens collagen X 

223 

32.121 

73S1 

X83413 

Human herpesvirus 6 U88 

175 

38.971 

7352 

M22334 

Homo sapiens unknown protein 

401 

44 .444 

7353 

M13002 

Mus musculus 2 855 is the position of 
the first start codon in ORF 2; 
putative 

321 

57 .522 

7355 

AF183139 

Cercopithecine herpesvirus 15 . 
trucnated type 2 BBNA2 

155 

28.458 

7356 

AF183139 

Cercopithecine herpesvirus 15 
trucnated type 2 EBNA2 

205 

3 0.742 

7358 

X13885 

Nicotiana tabacum extenBin (AA 1-62 0) 

243 

35.745 

7359 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

156 

32.389 

7363 

U93567 

Homo sapiens putative pl50 

316 

50.388 

7364 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

344 

65.217 

7366 

U67557 

Methanococcus jannaschii cell division 
control protein 48 (cdc48), AAA family 

146 

36.190 

7367 

U67557 

Methanococcus jannaschii cell division 
control protein 4 8 (cdc4 8) , AAA family 

690 

40.237 

7368 

D80009 

Homo sapiens KIAA0187 

971 

57.525 

7369 

D80009 

Homo sapiens KIAA0187 

7181 

91.510 

7370 

D80009 

Homo sapiens KIAA0187 

1086 

78.970 

7372 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

289 

52.381 

7373 

AB032965 

Homo sapiens KIAA1139 protein 

168 

33.987 

7375 

X15332 

Homo sapiens alpha- 1 (III) collagen 

425 

31.818 

7377 

AB007923 

Homo sapiens KIAA04 54 protein 

82 

22.605 

7378 

AB007923 

Homo sapiens KIAA0454 protein 

490 

54.237 

7379 

AB007923 

Homo sapiens KIAA0454 protein i 

226 

41.129 

7380 

AB007923 

Homo sapiens KIAA0454 protein 

12006 

100.000 

7381 

AB007923 

Homo sapiens KIAA04 54 protein 

71 

25.676 

7382 

AB007923 

Homo sapiens KIAA0454 protein 

62 

28.000 

7383 

AB007923 

Homo sapiens KIAA0454 protein 

266 

26.829 

7384 

AB007923 

Homo sapiens KIAA0454 protein 

532 

67.123 

7385 

AF003535 ! 

Homo sapiens ORF2-like protein 

200 

51.515 

7386 

AF003535 i 

Homo sapiens 0RF2-like protein 

200 

51.515 

7387 

AB011164 

Homo sapiens KIAA0592 protein 

1275 

93.213 

7388 

AB01X164 

Homo sapiens KIAA0592 protein 

931 

51.020 

7389 

AB011164 

Homo sapiens KIAA0592 protein 

499 

79.798 

7390 

AB011164 

Homo sapiens KIAA0592 protein 

1496 

78.370 

7392 

U76112 

Mus musculus translation repressor 
NAT1 

457 

48.707 

7393 

•- 

U76112 

Mus musculus translation repressor 
NAT1 

53BS 

93.668 

7394 

AF062655 

Mus musculus plenty-of -prolines-101 ; 
POP101; SH3-philo-protein 

219 

26.415 

7395 

X61295 

Rattus norvegicus LI retroposon, a 
portion of its ORF2 sequence 

392 

50.340 

7396 

U07973 

Gallus gallus alpha-1 collagen type 
III 

14 8 

33.824 

7398 

X13783 

Homo sapiens alpha -1 type 2 collagen 

209 

27.660 





(714 AA) 



7399 

Z29560 

Caenorhabditis elegans similar to 
Transmembrane and sushi domain; cDNA 
EST EMBL:D67777 comes from this gene; 
cDNA EST EMBL:D64 676 comes from this 
gene 

133 

28.571 

7400 

Z29560 

Caenorhabditis elegans similar to 
Transmembrane and sushi domain; cDNA 
EST EMBL:D67777 comes from this gene; 
cDNA EST EMBL:D64676 comes from this 
gene 

291 

29.412 

7402 

X57527 

Homo sapiens alpha 1 (VIII) . collagen 

14 8 

33.884 

7403 

X57527 

Homo sapiens alpha l(VIII) collagen 

159 

28.571 

7404 

U88154 

Homo sapiens proline and glutamic acid 
rich nuclear protein isoform 

183 

28.796 

7405 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

329 

46,632 

7408 

J02009 

Abelson murine leukemia virus pl20- 
gag-abl polyprotein 

174 

25.773 

7410 

X03145 

Homo sapiens pot. ORF I 

209 

38.806 

7412 

AF017369 

Mus musculus faciogenital dysplasia 
protein 3 

500 

77.876 

7413 

AF017369 

Mus musculus faciogenital dysplasia 
protein 3 

603 

77.953 

7414 

AF017369 

Mus musculus faciogenital dysplasia 
protein 3 

2521 

73.779 

7415 

X97675 

Homo sapiens plakophilin 2b 

179 

67.442 

7416 

U41557 

Caenorhabditis elegans proline and 
glycine-rich 

96 

33.735 

7417 

U41557 

Caenorhabditis elegans proline and 
glycine-rich 

282 

30.932 

7420 

AF038600 

Sus scrofa pol protein 

310 

32.218 

7421 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

411 

43.701 

7422 

Z72875 

Saccharomyces cerevisiae ORF YGR090W 

212 

33.708 

7423 

M80341 

Homo sapiens 0RF2 contains a reverse 
transcriptase domain.; ORF2 

655 

61.350 

7425 

AF169388 

Mus musculus alpha 4 collagen IV 

292 

28.049 

7426 

AB027251 

Homo sapiens zinc finger protein 
(ZFD25) 

356 

39.691 

7427 

U83303 

Homo sapiens line-1 reverse 
transcriptase 

216 

42.105 

7428 

AJ001701 

Homo sapiens deoxyhypusine synthase 

358 

66.038 

7429 

AJ001701 

Homo sapiens deoxyhypusine synthase 

875 

77.928 

7432 

AL021086 

Unknown 

/prediction^ (method : " "genef inder ,! " , 
version: »"Q84 nn , score: n "147 . 90" n ) ; 
/match= (desc : n "LI 

213 

54.667 

7433 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP ! 

353 

65.957 

7435 

U93563 

Homo sapiens putative pl50 

163 

33.628 

7437 

AF010144 

Homo sapiens neuronal thread protein ! 
AD7C-NTP 

332 

52.903 

7439 

AF169387 

Mus musculus alpha 3 collagen IV 

169 

32.258 

7440 

AF106929 

Medicago truncatula putative cell wall 

78 

28.000 
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7443 

X15081 

Crithidia f asciculata MURF2 protein 
(AA 1-348) 

192 

25 .446 

7444 

AF000298 

Caenorhabditis elegans weak similarity 
"to collagens; glycine- and proline - 
rich 

94 

30 . 723 

744 5 

AF000298 

Caenorhabditis elegans weak similarity 
to collagens; glycine- and proline- 
rich 

185 

31.818 

7446 

AB028236 

Tricholoma nauseosum Pol (reverse 
transcriptase-RNase H-integrase) 

162 

29.299 

7448 

AF160864 

Tetrahymena pyriformis orf443 

143 

28.125 

7450 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

210 

52.041 

7451 

X83413 

Human herpesvirus 6 U87 

84 

23.789 

7454 

X83413 

Human herpesvirus 6 U87 

138 

30.000 

7456 

X17403 

human herpesvirus 5 HCMVUL61 

145 

29.545 

7458 

L27841 

Homo sapiens pericentriol material 1 

1707 

96.503 

7459 

L27841 

Homo sapiens pericentriol material 1 

1096 

84 .685 

7461 

AB011167 

Homo sapiens KIAA0595 protein 

266 

25.575 

7462 

M13100 

Rattus norvegicus unknown protein 

144 

58.824 

7464 

Y17794 

Gallus gallus winged-helix 
transcription factor 

158 

30.317 

7465 

AB002362 

Homo sapiens KIAA03 64 

68 

25.00Q 

7466 

AB002362 

Homo sapiens KIAA0364 

55 

34.783 

7468 

AB002362 

Homo sapiens KIAA0364 

438 

32.536 

7469 

AF043944 

Mytilus edulis nongradient byssal 
precursor 

353 

29.279 

7470 

X52046 

Mus musculus type III collagen 

265 

28.036 

7474 . 

D87744 

Mus musculus DRPLA protein 

181 

24 .409 

7480 

Y17397 

Homo sapiens collagen alpha 4 type IV 

170 

26.546 

7483 

AB033026 

Homo sapiens KIAA1200 protein 

1037 

57.934 

7484 

U79260 

Homo sapiens unknown 

176 

54 .930 

7485 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

135 

27.429 

7489 

X69838 

Homo sapiens G9a 

487 

52.349 

7490 

AF039052 

Caenorhabditis elegans No definition 
line found 

205 

3 9.394 

7491 

AF039052 

Caenorhabditis elegans No definition 
line found 

201 

31.203 

7492 

AF039052 

Caenorhabditis elegans No definition 
line found 

217 

38.356 

7493 

L24521 

Homo sapiens transformation-related 
protein 

234 

45.000 

7494 

AB035541 

Homo sapiens pancreatic zymogen 
granule membrane associated protein 
GP2 alpha form 

342 

45.968 

7496 

AB017337 

Mus musculus sterol regulatory 
element -binding protein-1 (SREBP-1) 

150 

26.577 

7498 

U56862 

Rattus norvegicus pancreas only zinc 
finger protein 

770 

50.973 

7500 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

341 

49.359 

7503 

AL021813 

Schizosaccharomyces pombe hypothetical 
protein 

53 j 

37 .500 
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7504 

AL021813 

Schizosaccharomyces pornbe hypothetical 
protein 

169 

22.342 

7505 

X59244 

Homo sapiens ZNF43 

871 

61. 017 

7506 

M14123 

Homo sapiens gag 2 protein 

603 

42.920 ( 

7509 

Z82268 

Unknown cDNA EST yk338gl0 . 5. comes from 
this gene; cDNA EST EMBL:D27 934 comes 
from this gene; cDNA E 

158 

27. 320 

7510 

Z82268 

Unknown cDNA EST yk33 8gl0.5 comes from 
this gene; cDNA EST EI4BL :D27934 comes 
from this gene; cDNA E 

208 

33.333 

7511 

S80119 

Rattus sp. reverse transcriptase 
homolog 

188 

49.206 

7512 

S80119 

Rattus sp. reverse transcriptase 
homolog 

299 

39.394 

7513 

S80119 

Rattus sp f reverse transcriptase 
homolog 

188 

49.206 

7514 

S80119 

Rattus sp. reverse transcriptase 
homolog 

299 

39.394 

7515 

Y18890 

Human endogenous retrovirus K env 
protein 

292 

37.121 

7521 

X15332 

Homo sapiens alpha tl (III) collagen 

421 

30.191 

7522 

AF053538 

Alvinella pompejana fibrillar collagen 
chain FApl alpha 

73 

40.000 

7523 

AF053538 

Alvinella pompejana fibrillar collagen 
chain FApl alpha 

64 

33.333 

7524 

AF053538 

Alvinella pompejana fibrillar collagen 
chain FApl alpha 

129 

29.560 

7525 

AF053538 

Alvinella pompejana fibrillar collagen 
chain FApl alpha 

180 

29.348 

7526 

AF053538 

Alvinella pompejana fibrillar collagen 
chain FApl alpha 

312 

29.661 

7528 

X04758 

Homo sapiens pro- alpha (V) collagen 
(AA 1099) 

90 

32.941 

7529 

X04758 

Homo sapiens pro- alpha (V) collagen 
(AA 1099) 

93 

34.746 

7530 

X04758 

Homo sapiens pro- alpha (V) collagen 
(AA 1099) j 

83 

32.624 

7531 

X04758 

Homo sapiens pro- alpha (V) collagen 
(AA 1099) 

186 

30.061 

7532 

AF140675 

Homo sapiens* zinc metalloprotease . 
ADAMTS7 

64 

26.250 

7535 

AF140675 

Homo sapiens zinc metalloprotease 
ADAMTS7 

71 

25.217 

7536 

AF140675 

Homo sapiens zinc metalloprotease 
ADAMTS7 

60 

39.130 

7537 

AF140675 

Homo sapiens zinc metalloprotease 
ADAMTS7 

334 

46.154 

7539 

AF071172 

Homo sapiens HERC2 

1208 

95.604 

7540 

AF071172 

Homo sapiens HERC2 

53 

28.571 

7542 

AF071172 

Homo sapiens HERC2 

60 

34.375 

7543 

AF071172 

Homo sapiens HERC2 

55 

25.758 

7544 

AF071172 

Homo sapiens HERC2 

1516 

98.291 

7545 

AF071172 

Homo sapiens HERC2 

68 

25.984 

7546 

AF071172 

Homo sapiens HERC2 

66 

30.108 

7547 

AFQ71172 

Homo sapiens HERC2 

66 

27.273 
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7548 

AF071172 

Homo sapiens HERC2 

2680 

85.328 

7549 

AF071172 

Homo sapiens HERC2 

3574 

66.077 

7550 

U67056 

Acanthamoeba castellanii myosin I 
heavy chain kinase 

191 

36.842 

7552 

M87306 

Tetrahymena thermophila micronuclear 
linker histone polyprotein 

193 

26.768 

7553 

M87306 

Tetrahymena thermophila micronuclear 
linker histone polyprotein 

61 

18.557 

7554 

M87306 

Tetrahymena thermophila micronuclear 
linker histone polyprotein 

244 

27.236 

7555 

M22334 

Homo sapiens unknown protein 

513 

49.761 

7556 

AB021641 

Homo sapiens gonadotropin inducible 
transcription represBor-1 

635 

48.691 

7561 

AF181631 

Drosophila melanogaster BcDNA.GH04929 

202 

29.012 

7562 

AF010144 

Homo sapiens neuronal thread protein 
AD7c-NTP 

112 

37.647 

7563 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

126 

74.074 

7564 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

350 

53.595 

7566 

S64572 

Strongylocentrotus purpuratus=*sea 
urchin, Peptide Partial, 907 aa 
nonfibrillar collagen 

164 

29.956 

7567 

AB012223 

Canis familiar is ORF2 

286- 

42.975 

7570 

AB023143 

Homo sapiens K1AA0 92 6 protein 

165 

27.500 

7571 

AB023143 

Homo sapiens KIAA0926 protein 

94 

31.522 

7572 

AB023143 

Homo sapiens KIAA092 6 protein 

314 

26.025 

7573 

AB023143 

Homo sapiens KIAA092 6 protein 

331 

26.409 

7575 

L26953 

Homo sapiens chromosomal protein 

164 

57.143 

7577 

L26953 

Homo sapiens chromosomal protein 

102 

3 2.927 

7578 

L26953 

Homo sapiens chromosomal protein 

295 

50.000 

7579 

X83413 

Human herpesvirus 6 U88 

202 

60.000 

7581 

L26953 

Homo sapiens chromosomal protein 

132 

39.706 

7583 

AB014607 

Homo sapiens KIAA0707 protein 

414 

38.462 | 

7585 

AB023167 

Homo sapiens KIAA0950 protein 

637 

79.508 

7586 

AB023167 

Homo sapiens KIAA0950 protein 

1004 

99.346 j 

7587 

AF053091 

Drosophila melanogaster eyelid 

104 

35.000 

7588 

AF053091 

Drosophila melanogaster eyelid 

79 

30.556 

7589 

AF053091 

Drosophila melanogaster eyelid 

216 

27.985 

7593 

L20216 

Cercopithecus aethiops UV-damaged DNA- 
binding protein 

7425 

99.649 

7594 

L20216 

Cercopithecus aethiops UV-damaged DNA- 
binding protein 

456 

90. 244 

7596 

X16468 

Homo sapiens prepropeptide (AA 1-1418) 

424 

27.953 j 

7597 

U15647 

Mus muBculus reverse transcriptase 

802 

43.956 1 

7598 

AB007407 

Mus musculus myeloid zinc finger 
protein-2 

139 

45.000 

7cqq 

iUSUUOOOJ 

Bos taurus alpha2(X) collagen 

112 

37 . 313 

7600 

AB008683 

Bos taurus alpha2(I) collagen 

114 

27.711 

7601 

Z72496 

Homo sapiens mucin MUC5B 

157 

23.907 

7603 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

342 

32.734 

7604 

U93572 

Homo sapiens putative pl50 

466 

56.934 

7605 

U93572 

Homo sapiens putative p!50 

480 

56.391 
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7606 

U93572 

Homo sapiens putative pl50 

513 

66 .418 

7609 

X14254 

Rattus rattus invariant chain (AA 1- 
280) 

659 

69 .767 

7610 

X14254 

Rattus rattus invariant chain (AA 1- 
280) 

459 

44 .891 

7611 

X14254 

Rattus rattus invariant chain (AA 1- 
280) 

1000 

69.930 

7613 

AB018333 

Homo sapiens KIAA0790 protein 

428 

52.083 

7614 

AB018333 

Homo sapiens KIAA0790 protein 

788 

40.618 

7615 

U62315 

Rattus norvegicus alpha globin . 

724 

74 .648 

7616 

U62315 

Rattus norvegicus alpha globin 

286 

57.600 

7617 

AC004990 

Homo sapiens atrophin-1 interacting 
protein 1 

95 

34 .211 

7618 

AC004990 

Homo sapiens atrophin-1 interacting 
protein 1 

138 

33 .577 

7621 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

158 

55.556 

7622 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

235 

68.966 

7624 

L24521 

Homo sapiens transformation-related 
protein 

246 

62.195 

7626 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

125 

29.452 

7627 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

52 

26.786 

7628 

X13885 

Nicotiana tabacum extensin (AA 1-620) 

125 

26.357 

7630 

X15334 

Homo sapiens creatine kinase B 

763 

74.033 

7631 

X15334 

Homo sapiens creatine kinase B 

320 

94.444 

7632 

X15334 

Homo sapiens creatine kinase B 

800 

86.131 

7633 

X15334 

Homo sapiens creatine kinase B 

1762 

87.532 

7634 

AIill7466 

Homo sapiens hypothetical protein 

273 

82 ; 143 

7635 

AL.117466 

Homo sapiens hypothetical protein 

151 

88.889 

7636 

AL117466 

Homo sapiens hypothetical protein 

4080 

100 . 000 

7637 

AJ242540 . 

Volvox carter! f. nagariensis 
hydroxyproline-rich glycoprotein DZ- 
HRGP 

183 

3 7.895 

7638 

U66619 

Homo sapiens SWI/SNF complex 60 KDa 
subunit 

366 

56 . 115 

7639 

U66619 

Homo sapiens SWI/SNF complex 60 KDa 
subunit 

2486 

93.450 

7640 

AF061742 

Bos taurus retinal short-chain 
dehydrogenase/reductase retSDRl 

277 

92.000 

7641 

AF061742 

Bos taurus retinal short -chain 
dehydrogenase/ reductase retSDRl 

1167 

82.609 

7643 

L11672 

Homo sapiens zinc finger protein 

3305 

5 1 . 579 

7646 

S58722 

Homo sapiens X- linked retinopathy 
protein {C-terminal, clone XEH.8c} 

268 

66.667 

7649 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

132 

41.250 

7650 

U00049 

Caenorhabditis elegans weakly similar 
to ANK repeat region of Fowlpox virus 
BamHI-orf7 protein 

268 

62.500 

7651 

M20681 

Homo sapiens glucose transporter- like 
protein 

1735 

95.053 

7652 

M20681 

Homo sapiens glucose transporter- like 
protein 

258 

74.667 
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7654 

Y07632 

Zea mays potassium channel 

59 

34.483 

7655 

Y07632 

Zea mays potassium channel 

169 

35.052 

7658 

AF167295 

Gallus gallus reversion- induced LIM 
protein 

215 

46.429 

7661 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

346 

66.292 

7663 

AF155110 

Homo sapiens NY-REN-45 antigen 

86 

31.111 

7665 

AF155110 

Homo sapiens NY-REN-4 5 antigen 

239 

40.972 

7666 

AF155110 

Homo sapiens NY-REN-4 5 antigen 

2082 

73.821 

7667 

AF068625 

Mus musculus DNA cytosine-5 
methyltransf erase 3 A 

92 

30.864 

7668 

AF068625 

Mus musculus DNA cytosine-5 
methyltransf erase 3A 

209 

39.604 

7669 

AF172288 

Mus musculus hairy-related 
transcription factor 3 

311 

50.000 

7673 

S72008 

Homo sapiens CDC10 homolog=hCDC10 

440 

66.667 

7674 

S72008 

Homo sapiens CDC 10 homolog=hCDC10 

453 

81.319 

7675 

S72008 

Homo sapiens CDC 10 homolog=hCDC10 

1099 

97.884 

7676 

AB002388 

Homo sapiens KIAA0390 

359 

42.742 

7677 

AB002388 

Homo sapiens KIAA0390 

622 

39.697 

7678 

Z68108 

Caenorhabditis elegans similar to 
phosphoinositide- specific 
phospholipase C; cDNA EST EMBL:D28105 
comes from this gene; cDNA EST 
EMBL:D2 8104 comes from this gene 

205 

29.333 

7680 

AB002584 

Rattus norvegicus beta-alanine- 
pyruvate aminotransferase 

326 

58.252 

7681 

AB002584 

Rattus norvegicus beta- alanine - 
pyruvate aminotransferase 

146 

59.459 

7683 

AB002584 

Rattus norvegicus beta-alanine- 
pyruvate aminotransferase 

262 

76.812 

7684 

U79260 

Homo sapiens unknown 

156 

56.667 

7686 

Z49909 

Caenorhabditis elegans C14A4 . 14 

232 

35.652 

7687 

Z49909 

Caenorhabditis elegans C14A4 . 14 

402 

32.468 

7689 

AF159356 

Rattus norvegicus Muncl3-4 protein 

4440 

85.888 

7693 

D80009 | 

Homo sapiens KIAA0187 

593 

60.513 

7694 

D80009 

Homo sapiens KIAA0187 

760 

83.571 

7695 

D80009 j 

Homo sapiens KIAA0187 

1166 

75.676 

7696 

D80009 

Homo sapiens KIAA0187 

2029 

94 .813 

7698 

L28808 

Oryctolagus cuniculus manganese 
superoxide dismutase 

886 

48.294 

7699 

L28808 

Oryctolagus cuniculus manganese 
superoxide dismutase 

246 

90.000 

7 700 

L26808 

Oryctolagus cuniculus manganese 
superoxide dismutase 

1109 

91.584 

7703 

AF030131 

Mus musculus Plenty of SH3s; POSH 

291 

28.904 | 

7704 

U79260 

Homo sapiens unknown 

156 

56.667 

7705 

U79260 

Homo sapiens unknown 

156 

56.667 

7706 

AB005549 

Rattus norvegicus atypical PKC 
specific binding protein 

838 

80.702 

7710 

U49385 

Mus musculus CTP synthetase homolog 

780 

86.765 | 

7711 

AF103801 

Homo sapiens unknown 

292 

97.727 

7713 

X97675 

Homo sapiens plakophilin 2b 

173 

74.286 

7715 

AF030131 

Mus musculus Plenty of SH3s; POSH 

811 

93.548 
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7716 

AB027520 

Mus musculus TAP-like ABC transporter . 

810 

91.367 

7717 

AB002306 

Homo sapiens KIAA0308 

349 

35.565 

7718 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

285 

65.385 

7720 

AF053356 

Homo sapiens leucin rich neuronal 
protein 

5713 

99.880 

7721 

AF053356 

Homo sapiens leucin rich neuronal 
protein 

570 m 

78.906 

7723 

AIi035656 

Arabidopsis thaliana putative protein 

498 

63.303 

7724 

AL035656 

Arabidopsis thaliana putative protein 

426 

41.579 

7725 

AL035656 

Arabidopsis thaliana putative protein 

143 

41.176 

7726 

AL035656 

Arabidopsis thaliana putative protein 

686 

46.617 

7727 

AL035656 

Arabidopsis thaliana putative protein 

833 

52.795 

7729 

X56203 

Plasmodium falciparum liver stage 
antigen 

204 

24.375 

7730 

L26953 

Homo sapiens chromosomal protein 

148 

73;333 

7732 

D16226 

Oryctolagus cuniculus one of the 
members of sodium-glucose 
cotransporter family 

677 

69.006 

7733 

D16226 

Oryctolagus cuniculus one of the 
members of sodium-glucose 
cotransporter family 

646 

91.346 

7734 

D16226 

Oryctolagus cuniculus one of the 
members of sodium-glucose 
cotransporter family 

1310 

90.367 

7735 

M76670 

Lycopersicon esculentum extensin 
(class I) 

129 

24.510 

7737 

M76670 

Lycopersicon esculentum extensin 
(class I) 

170 

31.250 

7739 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

211 

73 .333 

7740 

AF000298 

Caenorhabditis elegans weak similarity 
to collagens; glycine- and proline- 
rich 

144 

38.043 

7741 

X64712 

Gallus gallus collagen -alpha -3 type IX 

149 

37.113 

7742 

AB032961 

Homo sapiens KIAA1135 protein 

375 

52.448 

7743 

AL022373 

Arabidopsis thaliana putative protein 

201 

28.622 

7745 

Z50142 

Schizosaccharomyces pombe hypothetical 
PHD- finger protein 

164 

33.036 

7746 

AB020642 

Homo sapiens KIAA0835 protein 

210 

84.091 

7747 

X97675 

Homo sapiens plakophilin 2b 

159 

86.207 

7748 

AF030131 

Mus musculus Plenty of SH3s; POSH 

2613 

87.879 

7749 

Z11584 

Homo sapiens NuMA protein 

13102 

99.054 

7750 

Z11584 

Homo sapiens NuMA protein 

10652 

98.907 

7751 

Z11584 

Homo sapiens NuMA protein 

13010 

99.810 

7752 

Z11584 

Homo sapiens NuMA protein 

520 

86 .408 

7753 

X16318 

Canis familiaris 54k protein (AA 1- 
504) . 

367 

1O0.000 

7754 

X16318 

Canis familiaris 54k protein (AA 1- 
504) 

3085 

97.222 

7755 

AF083501 

Macaca mulatta rhadinovirus 17577 
latent nuclear antigen 

186 

30.075 

7757 

AF181250 

Homo sapiens Ras guanine nucleotide 
exchange factor 2 

951 

- 

51.196 


.i. o tm hi- 3; u o 2 a 6. in a 


7758 

AF181250 

Homo sapiens Ras guanine nucleotide 
exchange factor 2 

942 

98-611 

7759 

AP181250 

Homo sapiens Ras guanine nucleotide 
exchange factor 2 

8176 

99.919 

7760 

AF181250 

Homo sapiens Ras guanine nucleotide 
exchange factor 2 

271 

88.679 

7761 

AF026205 

Caenorhabditis elegans No definition 
line found 

162 

25.589 

7764 

AJ006278 

Mus musculus 

acetylglucosaminyltransf erase- like 
protein 

916 

58.498 

7765 

AJ006278 

Mus musculus 

acetylglucosaminyltransf erase -like 
protein 

1768 

67.574 

7766 

AF010144- 

Homo sapiens neuronal thread protein 
AD7C-NTP 

56 

42.857 

7769 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

161 

56.604 

7770 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

46 

23.529 

7771 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

46 

35.294 

7772 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

235 

68.966 

7773 

AF099989 

Homo sapiens Ste-20 related kinase 
SPAK 

547 

94 .565 

7774 

AF099989 

Homo sapiens Ste-20 related kinase 
SPAK 

274 

69.136 

7775 

AJ005162 

Homo sapiens UDP- 
glucuronosyl transferase 

3531 

98.864 

111S 

AJ005162 

Homo sapiens UDP- 
glucuronosyltransf erase 

677 

69.512 

1111 

AJ005162 

Homo sapiens UDP- 
glucuronosyl transferase 

3531 ! 

98.864 

111S 

AJ005162 

Homo sapiens UDP- 
glucuronosyl trans f erase 

3502 

99.432 

1119 

U79260 

Homo sapiens unknown 

239 

6 6.102 

7782 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1038 

93.443 

7785 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

684 

95.146 

7786 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

746 

94.118 

7787 

U09823 

Oryctolagus cuniculus elongation, 
factor 1 alpha 

741 

97.458 

7788 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

625 

49.853 

7789 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2307 

97.734 

7790 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2228 

96.335 

7794 

D64009 

Mus musculus seizure-related gene 
product 6 type 2 precursor 

551 

94.186 

7795 

D64009 

Mus musculuB seizure- related gene 
product 6 type 2 precursor 

67 

31.250 


X 01 0 £j! 4-6, 4-3 . O BS! 6. 0 2 


7796 

D64009 

Mus musculus seizure -related gene 
product 6 type 2 precursor 

2259 

83.582 

7799 

AF047383 

Homo sapiens uroporphyrinogen 
decarboxylase 

€61 

100.000 

7800 

AF047383 

Homo sapienB uroporphyrinogen 
decarboxylase 

1046 

88.701 

7803 

X07273 

Mesocricetus auratus beta-myosin heavy 
chain (974 AA) ; S2 fragment and LMM 
region 

245 

70.270 

7804 

X07273 

Mesocricetus auratus beta-myosin heavy 
chain (974 AA) ; S2 fragment and LMM 
region 

561 

83 .803 

7805 

X70944 

Homo sapiens PTB- associated splicing 
factor 

206 

40.230 

7806 

L20321 

Homo sapiens protein serine/threonine 
kinase 

14 6 

57.895 

7807 

U79260 

Homo sapiens unknown 

382 

71.910 

7808 

U79260 

Homo sapiens unknown 

267 

61.290 

7809 

AF174427 

Arabidopsis thaliana formin-like 
protein AHF1 

107 

39.024 

7811 

U38847 

Homo sapiens TAR RNA loop binding 
protein 

395 

59.756 

7812 

U38847 

Homo Bapiens TAR RNA loop binding 
protein 

2005 

100.000 

7 814 

U93568 

Homo sapiens putative pl50 

250 

73.585 

7819 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

123 

41.667 

7821 

M2 2 3 3 4 

Homo sapiens unknown protein 

264 

62.500 

7823 

U10281 

Sus scrof a gastric mucin 

14 6 

33.071 

. 7825 

X97675 

Homo sapiens plakophilin 2b 

201 

50.000 

7839 

AB033011 

Homo sapiens KIAA118 5 protein 

249 

97.297 

7840 

AB033011 

Homo sapiens KIAA1185 protein 

179 

96.774 

7841 

AB033011 

Homo sapiens KIAA1185 protein 

1010 

100.000 

7843 

M27878 

Homo sapiens DNA binding protein 

1209 

53.074 

7847 

J01415 

Homo sapiens cytochrome oxidase 
subunit 3 

369 

68.269 

7848 

J01415 

Homo sapiens cytochrome oxidase 
subunit 3 

191 

90.625 

7850 

M87583 

Equine infectious anemia virus 
deoxyuridine triphosphatase 

118 

23.770 

7853 

Y15918 

Homo sapiens COL1A1 and PDGPB fusion 
transcript 

126 

36.364 

7854 

Y15918 

Homo sapiens COL1A1 and PDGFB fusion 
transcript 

89 

27.215 

7857 

AL096723 

Homo sapiens hypothetical protein 

5411 

100.000 

7858 

AL096723 

Homo sapiens hypothetical protein 

5205 

94.667 

7860 

AF070561 

Homo sapiens beta- tubulin 

1756 ! 

79.657 

•7865 

U66469 

Homo sapiens cell growth regulator 
CGR19 

289 

95.652 

7866 

U66469 

Homo sapiens cell growth regulator 
CGR19 

1600 j 

69.727 

7867 

X53795 

Homo sapiens inducible membrane 
protein 

463 

100.000 

7868 

X53795 

Homo sapiens inducible membrane 

1265 

100.000 j 
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protein 



7869 

M21332 

Mus musculus 42 kd polypeptide (RD) , 
(first expressed exon) 

133 

31.868 

7870 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

558 

43.068 

7871 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

612 

75.676 

7872 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

410 

62.698 

7873 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

554 

97.727 

7874 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

528 

91.209 

7875 

D13748 

Homo sapiens eukaryotic initiation 
factor 4AI 

1732 

85.817 

7876 

U86602 

Homo sapiens nucleolar protein p40 

593 

74.400 

7881 

Z49127 

Unknown Serine/proline rich protein; 
cDNA EST EMBL:D69024 comes from this 
gene; cDNA EST EMBL:D698 9 

14 6 

33.333 

7882 

AL034559 

Plasmodium falciparum predicted using 
hexExon; MAL3P7.14 (PFC0925w) , 
Hypothetical protein, len: 489 aa 

78 

46.154 

7884 

AF013213 

Bos taurus elongation factor 1 alpha 

213 

97.500 

7885 

AF013213 

Bos taurus elongation factor 1 alpha 

582 

79.661 

7889 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

213 

53.731 

7890 

AF058290 

Homo sapiens imidazoline receptor 
antisera-selected protein 

3988 

100.000 

7891 

AF058290 

Homo sapiens imidazoline receptor 
antisera-selected protein 

148 

44.595 

7893 

X55777 

Homo sapiens put . ORF 

243 

64.407 

7894 

L24521 

Homo sapiens transformation-related 
protein 

286 

60.759 

7895 

L24521 

Homo sapiens transformation-related 
protein 

178 

60.000 

7900 

U22376 

Homo sapiens alternatively spliced 
product using exon 13 A 

335 

74.627 

7903 

S63654 

Mus sp. type VII collagen 

163 

32.051 

7904 

S78454 

Mus musculus metal response element 
DNA-binding protein M96 

1243 

96.985 

7905 

S78454 

Mus musculus metal response element 
DNA-binding protein M96 

154 

58.696 

7906 

M10126 

Leishmania tarentolae NH2 terminus 
uncertain 

169 

24.201 

7907 

AF176688 

Rattus norvegicus 

sodium/calcium/potassium exchanger 
NCKX1 

193 

33.333 

7914 

Z70690 

Schizosaccharomyces pombe hypothetical 
coiled-coil protein 

102 

29.060 

7915 

Z70690 

Schizosaccharomyces pombe hypothetical 
coiled-coil protein 

149 

23.009 

7916 

AL049487 

Arabidopsis thaliana putative protein 

173 

30.769 

7920 

X70944 

Homo sapiens PTB- associated splicing 
factor 

195 

39.744 

7921 

S58722 

Homo sapiens X- linked retinopathy 

66 

47.826 
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protein { terminal # clone XEH.8c) 



7922 

S58722 

Homo sapiens X- linked retinopathy 
protein {c-terminal, clone XEH.8c} 

258 

67.692 

7924 

AB020656 

Homo sapiens KIAA0849 protein 

867 

86.228 

7925 

AB020656 

Homo sapiens KIAA0849 protein 

998 

98.052 

7927 

D28818 

Mus musculus NBL4 

456 

45.238 

7945 

X67863 

Mus musculus T2 

177 

52.000 

7946 

AE002089 

Deinococcus radiodurans hypothetical 
protein 

138 

30.208 

7949 

V00503 

Homo sapiens procollagen (1 is 3rd 
base in codon) 

142 

37.736 . 

7950 

V00503 

Homo sapiens procollagen (1 is 3rd 
base in codon) 

100 

36.458 

7951 

V00503 

Homo sapiens procollagen (1 is 3rd 
base in codon) 

191 

28.000 

7953 

AJ010063 

Homo sapiens telethonin 

742 

100.000 

7954 

AJ010063 

Homo sapiens telethonin 

1016 

95.833 ! 

7955 

M29863 

Homo sapiens farnesyl pyrophosphate 
synthetase 

771 

89.552 

7956 

M29863 

Homo sapiens farnesyl pyrophosphate 
synthetase 

214 

48.315 

7957 

M29863 

Homo sapiens farnesyl pyrophosphate 
synthetase 

1930 

96.275 

7958 

AF046873 

Homo sapiens synapsin Ilia 

144 

27.068 

7960 

X92485 

Plasmodium vivax pval 

126 

73.913 

7962 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

258 

49.020 

7963 

U81031 

Homo sapiens KIAA0167 

5287 

95.607 

7964 

U81031 

Homo sapiens KIAA0167 

359 

55. 970 

7968 

AF038599 

Sus scrofa env protein 

226 

24.521 

7970 

Z86099 

human herpesvirus 2 UL2 6.5 

80 

47.059 ( 

7971 

Z86099 

human herpesvirus 2 UL26.5 

161 

30.682 

7972 

Z36715 

Homo sapiens Net 

2500 

98.775 

7973 

Z36715 

Homo sapiens Net 

2129 

98.503 

7977 

D88461 

Rattus rattus N-WASP 

246 

3 6.601 

7979 

Y14690 

Homo sapiens procollagen alpha 2 (V) 

169 

32.919 

7980 

M36676 

Homo sapiens apolipoprotein B100 

2164 

99.710 

7981 

M36676 

Homo sapiens apolipoprotein B100 

2104 

99.710 ! 

7982 

AF078866 

Homo sapiens SURF -4 

685 

84.397 

7983 

AF078866 

Homo sapiens SURF- 4 

1557 

97.778 

7989 

AF010144 . 

Homo sapiens neuronal thread protein 
AD7C-NTP 

236 

61.667 

7992 

U22376 

Homo sapiens alternatively Bpliced 
product using exon 13 A 

243 

53.165 

7996 

D50086 

Mus musculus neuropil in 

49 

23.077 

7998 

AB001563 

Homo sapiens RES4-22D 

842 

61.044 

7999 

AB001563 

Homo sapiens RES4-22D 

4596 

100.000 

8000 

AB001563 

Homo sapiens RES4-22D 

4411 

99.533 

8001 

AB001563 

Homo sapiens RES4-22D 

1759 

100.000 

8002 

AB001563 

Homo sapiens RES4-22D 

4493 

98.943 

8003 

D87744 

Mus musculus DRPLA protein 

154 

36.667 

8005 

AF046873 

Homo sapiens synapsin Ilia 

152 

27.976 

8008 

L07597 

Homo sapiens ribosomal protein S6 
kinase 1 

4756 

97.690 
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8009 

L07597 

Homo sapiens ribosomal protein S6 
kinase 1 

2480 

86.313 

8014 

U79260 

Homo sapiens unknown 

382 

71.910 

8015 

U79260 

Homo sapiens unknown 

267 

61.290 

8021 

M25282 

Paracentrotus lividus alpha collagen 
type 1 precursor 

220 

30.342 

8023 

AFQ36699 

Caenorhabditis elegans Similar to 
cuticular collagen; P58F6.2 

161 

33.529 

8024 

AF036699 

Caenorhabditis elegans Similar to 
cuticular collagen; F58F6.2 

182 

34.783 

8027 

A78803 

unidentified unnamed protein product 

769 

90.0^1 

8028 J 

A78803 

unidentified unnamed protein product 

865 

98.582 

8029 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

199 

56.250 

8031 

U09823 

Oryctolagus cuniculuB elongation 
factor 1 alpha 

816 

70.531 

8033 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2333 

90.578 

8035 

AF190797 

Mus musculus actin-related protein 11 

258 

50.000 

8037 

X92485 

Plasmodium vivax pval 

134 

67.742 

8039 

Z81592 

Caenorhabditis elegans predicted using 
Genefinder; cDNA EST EMBL:M89005 comes 
from this gene . 

128 

25.967 

8042 

U49974 

Homo sapiens mariner transposase 

491 

70.690 

8046 

AB032905 

Hylobates concolor dopamine receptor 
D4 

68 

30.952 

8047 

AB032905 

Hylobates concolor dopamine receptor 
D4 

155 

38.462 

8048 

AB032905 

Hylobates concolor dopamine receptor 

D4 

139 

35.714 

8050 

X55777 

Homo sapiens put . ORF 

165 

48.333 

8052 

U79260 

Homo sapiens unknown 

194 

43.678 

8053 

U79260 

Homo sapiens unknown 

282 

57.692 

8054 

U79260 

Homo sapiens unknown 

141 

42.857 

8055 

U79260 

Homo sapiens unknown 

138 

48.869 

8056 

U792 60 

Homo sapiens unknown . 

351 

70.213 

8057 

U79260 

Homo sapiens unknown 

263 | 

60.241 

8060 

X92485 

Plasmodium vivax pval 

61 

25.532 | 

8061 

X92485 

Plasmodium vivax pval 

146 

86.364 

8062 

L15189 

Homo sapiens MTHSP75 

316 

98. 000 

8063 

L15189 

Homo sapiens MTHSP75 

4014 

97.496 

8064 

U22376 

Homo sapiens alternatively spliced 
product using exon 13A 

377 

78.082 

8066 

A61971 

unidentified MCSP 

76 

23.167 

8067 

A61971 

unidentified MCSP 

56 

33.333 

8068 

A61971 

unidentified MCSP 

66 

37.209 

8069 

A61971 

unidentified MCSP 

1433 

93.478 

8070 

X79440 

Homo sapiens NADP+ -dependent malic 
enzyme 

865 

77.320 

8071 

X79440 

Homo sapiens NADP+ -dependent malic 
enzyme 

318 

94.231 

8072 

X79440 

Homo sapiens NADP+ - dependent malic 
enzyme 

3477 

93.290 

8073 

L20321 

Homo sapiens protein serine/ threonine 

160 

58.696 
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kinase 



8074 

D25220 

Bos taurus selenoprotein P like 
protein 

130 

35.849 

8075 

X55656 

Homo sapiens gamma -G globin 

958 

97.333 

8076 

X55656 

Homo sapiens gamma -G globin 

435 

87.261 

8080 

X97675 

Homo sapiens plakophilin 2b 

151 

77.778 

8081 

X97675 

Homo sapiens plakophilin 2b 

198 

84 .375 

8082 

X97675 

Homo sapiens plakophilin 2b 

213 

62.222 

8083 

X97675 

Homo sapiens plakophilin 2b 

166 

66.667 

8086 

AF135789 

Mus fmisculus m6a methyltransf erase 

1201 

99.419 

8087 

AF135789 

Mus musculus m6a methyltransf erase 

942 

62.724 

8092 

AF044127 

Homo sapiens peroxisomal short -chain 
alcohol dehydrogenase 

1359 

90.837 

8093 

AF044127 

Homo. sapiens peroxisomal short-chain 
alcohol dehydrogenase 

162 

92.593 

8094 

AF044127 

Homo sapiens peroxisomal short -chain 
alcohol dehydrogenase 

1501 

98.805 

8096 

AL050297 

Homo sapienB hypothetical protein. 

117 

30.233 

8097 

U88170 

Caenorhabditis elegans No definition 
line found 

401 

44.156 

8098 

U88170 

Caenorhabditis elegans No definition 
line found 

375 

32.258 

8099 

AF132941 

Homo sapiens CGI -07 protein 

197 

93 .939 

8100 

AF132941 

Homo sapiens CGI -07 protein 

3349 

99.008 | 

8102 

X51755 

Homo sapiens open reading frame (458 
AA) 

461 

93.333 

8103 

X51755 

Homo sapiens open reading frame (458 
AA) 

564 

65.823 

8104 

X51755 

Homo sapiens open reading frame (458 
AA) 

503 

70.866 

8105 

X51755 

Homo sapiens open reading frame (458 
AA) 

479 

71.053 

8106 

X51755 

Homo sapiens open reading frame (458 
AA) 

421 

83.951 

8108 

X51755 

Homo sapiens open reading frame (458 
AA) 

750 

75.625 

8111 

X97675 

Homo sapiens plakophilin 2b 

46 

2 3.913 

8112 

X97675 

Homo sapiens plakophilin 2b 

177 

87.879 

8113 

X97675 

Homo sapiens plakophilin 2b 

186 

76.316 

8115 

X80169 

Mus musculus tsg24 

607 

80.374 

8116 

X80169 

Mus musculus tsg24 

76 

25.974 

8121 

AJ006295 

Rattus nprvegicus AF-9 

601 

80.702 

8122 

AJ006295 

Rattus norvegicus AF-9 

1941 

88.830 

8126 

AB002342 

Homo sapiens KIAA0344 

7872 

99.838 

8127 

AB002342 

Homo sapiens KIAA0344 

7947 

100.000 

8129 

AF182215 

Tilapia mossambica chloride channel 
CLC-3 1 

154 

66.667 

8130 

AL096723 

Homo sapiens hypothetical protein = 

5411 

100. 000 

8131 

AL096723 

Homo sapiens hypothetical protein 

835 

80.899 

8132 

AL096723 

Homo sapiens hypothetical protein 

5103 

93.826 

8133 

M22333 

Homo sapiens unknown protein 

42 

50.000 

8134 

M22333 

Homo sapiens unknown protein 

1442 

83.271 

8135 

M22333 

Homo sapiens unknown protein 

609 

79.310 

8136 

Z66511 

Unknown similar to ribokinase; cDNA 

14 0 

32.632 
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EST EMBIi:D^95S3 comes from this gene; 
cDNA EST EMBL:D65938 come 



8142 

A00279 

synthetic construct Human serum 
albumin 

864 

97.674 

8143 

A00279 

synthetic construct Human serum 
albumin 

548 

39.286 

8144 

AF003540 

Homo sapiens Krueppel family zinc 
finger protein 

120 

59.459 

8150 

AL035064 

Schizosaccharomyces pombe queuine 
trna-ribosyltransf erase 

601 

56.000 

8152 

AF007170 

Homo sapiens unknown 

1781 

58.176 

8158 

U79260 

Homo sapiens unknown 

397 

63.636 

8159 

AC005396 

Arabidopsis thaliana putative proline- 
rich cell wall protein 

122 

28.378 

8160 

AC005396 

Arabidopsis thaliana putative proline - 
rich cell wall protein 

202 

34.234 

8163 

Z66496 

Unknown cDNA EST EMBL:D71941 comes 
from this gene; cDNA EST EMBL:D74691 
comes from this gene; cDNA 

14 4 

33.708 

8164 

Z66496 

Unknown cDNA EST EMBL:D71941 comes 
from this gene; cDNA EST EMBIj:D74 691 
comes from this gene; cDNA 

97 

23.596 

8165 

Z66496 

Unknown CDNA EST EMBL:D71941 comes 
-from this gene; cDNA EST EMBL:D74691 
comes from this gene; cDNA 

47 

58.333 

8167 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

158 

64.706 

8169 

U64856 

Caenorhabditis elegans weak similarity 
to TPR domains 

219 

41.346 

8170 

U64856 

Caenorhabditis elegans weak similarity 
to TPR domains 

705 

43.089 

8171 

S58722 

Homo sapiens X- linked retinopathy . 
jarotein {c-terminal, clone XEH.8c} 

278 

71.212 

8172 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal, clone XEH.8c} 

297 

54.808 

8175 

X97675 

Homo sapiens plakophilin 2b 

194 

62.500 

8177 

Y09085 

Mus musculus hypoxia -inducible factor 
one alpha 

505 

60.355 

8178 

Y09085 

Mus musculus hypoxia- inducible factor 
one alpha 

1122 

80.000 

8182 

AL032653 

Caenorhabditis elegans similar to 
Ubiquitin-conjugating enzymes; cDNA 
EST EMBL:C08678 comes from this gene 

442 

74.684 

8183 

AL032653 

Caenorhabditis elegans similar to 
Ubiquitin-conjugating enzymes; cDNA 
EST EMBL:C08678 comes from this gene 

499 

56.463 | 

8184 

1*20321 

Homo sapiens protein serine/threonine 
kinase 

186 

66.667 

old / 

K02212 

Homo sapiens alpha- 1- antitrypsin 

605 

90.179 1 

8188 

K02212 

Homo sapiens alpha-l-antitrypsin 

1278 

78.214 

8192 

U66469 

Homo sapiens cell growth regulator 
CGR19 

289 

95.652 

8193 

U66469 

Homo sapiens cell growth regulator 
CGR19 

1340 

66.484 

8194 

D00654 

Homo sapiens enteric smooth muscle 

1047 

94.545 
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gamma-actin 



8195 

D00654 

Homo sapiens enteric smooth muscle 
gamma -act in 

1012 

93.750 

8196 

D00654 

Homo sapiens enteric smooth muscle 
gamma-actin 

1808 

84 .319 

8200 

AB028980 

Homo sapiens KIAA1057 protein 

253 

32 .787 

8202 

Z46267 

CaeriorhabditiB elegans cDNA EST 
yk251d5.5 comes from this gene; cDNA 
EST yk356d9.5 gomes .from this gene; 
cDNA EST yk377hll.5 comes from this 
gene 

133 

25.410 

8203 

M60626 

Homo sapiens N-f ormylpeptide receptor 
fMLP-R98 

686 

74.627 

8204 

M60626 

Homo sapiens N-f ormylpeptide receptor 
fMLP-R98 

1753 

86.908 

8207 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

242 

70.909 

8208 

AB028997 

Homo sapiens KIAA1074 protein 

713 

38.202 

8212 

AL117448 

Homo sapiens hypothetical protein 

177 

86.667 

8213 

L40157 

Homo sapiens .endosome -associated 
protein 

168 

27.053 

8214 

L40157 

Homo sapiens endosome-associated 
protein 

159 

29.310 

8215 

Z48166 

Schizosaccharomyces pombe gar 2 

113 

26.471 

8217 

U78524 

Homo sapiens Gu binding protein 

1339 

98.522 

B218 

U78524 

Homo sapiens Gu binding protein 

300 

100.000 

8219 

D29677 

Homo sapiens KIAA0054 

4168 

68.302 

8220 

D29677 

Homo sapiens KIAA0054 

13192 

100.000 

8221 

AC006963. 

Homo sapiens similar to Kelch 
proteins; similar to BAA77027 
(PID:g4650844) 

1563 

92.800 

8224 

AF198532 

Homo sapiens lymphoid enhancer binding 
f actor- l 

193 

100.000 

8225 

AB000528 

Helicobacter pylori 26695 H. pylori 
predicted coding region HP0059 i 

107 

24.359 

8227 

L22453 

Homo sapiens TARBP-b gene product 

946 

74.091 

8228 

L22453 

Homo sapiens TARBP-b gene product 

420 

95.890 

8229 

L22453 

Homo sapiens TARBP-b gene product 

758 

70.934 

8232 

S58722 

Homo sapiens X- linked retinopathy 
protein {c-terminal, clone XEH.8c} 

278 

71.212 

8233 

S58722 

Homo sapiens X-linked retinopathy 
protein {C-terminal , clone XEH.8c} 

297 

54.808 

8234 

S66385 

Mus sp. CRBB-binding protein, CBP 

58 

35.294 

8235 

S66385 

Mus sp. CREB-binding protein, CBP 

272 

33.742 

8239 

Z79754 

Caenorhabditis elegans Similarity to 
some phosphatases and kinases; cDNA 
EST EMBL:Z14 643 comes from this gene; 
cDNA EST yk531b4.3 comes from this 
gene; cDNA EST yk642e5.3 comes from 
this gene 

354 

41.935 

8240 

Z79754 

Caenorhabditis elegans Similarity to 
some phosphatases and kinases; cDNA 
EST EMBL:Z14643 comes from this gene; 
cDNA EST yk531b4.3 comes from this 
gene; cDNA EST yk642e5.3 comes from 

913 

60.748 


ioisr 
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this gene 



8241 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

265 

70.370 

8242 

M84911 

Pseudomonas aeruginosa ORF located 
downstream of mmsAB operon, has 
sequence similarity to an acetyl -CoA 
synthetase; ORF1; putative 


58.385 

8243 

M84911 

Pseudomonas aeruginosa ORF located 
downstream of mmsAB operon, has 
sequence similarity to an acetyl -CoA 
synthetase; 0RF1; putative 

679 


8247 

L16547 

Bos taurus chloride channel protein 

55 

36.364 j 

8248 

L16547 

Bos taurus chloride channel protein 

521 

80.612 

8250 

U08813 

Oryctolagus cuniculus 597 aa protein 
related to Na/glucose cotransporters 

1509 


8251 

U08813 

Oryctolagus cuniculus 597 aa protein 
related to Na/glucose cotransporters 

1401 

69 . 486 

8252 

D78348 

Canis familiar is zymogen granule 
membrane associated protein 

344 

48 . 039 

8253 

AF115480 

Mus musculus cAMP -dependent Rapl 
guanine-nucleotide exchange factor 

226 

35 . 849 

8257 

U70063 

Homo sapiens acid ceramidase 

431 

95 sfl« 

8258 

U70063 

Homo sapiens acid ceramidase 

2293 

95.214 

8259 

X54422 

Mytilus edulis polyphenolic adhesive 
protein 

85 

Z V . Z \J A 

8260 

X54422 

Mytilus edulis polyphenolic adhesive 
protein 

143 

o q n*7n 

.8261 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

186 

JO. 144 

8262 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

119 


8263 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

183 


8264 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

188 


8267 

M81483 

Mus musculus calmodulin dependent 
phosphatase catalytic subunit 

822 

89 . 189 

8268 

M81483 

Mus musculus calmodulin dependent 
phosphatase catalytic subunit 

869 

97 . 183 

8269 

M81483 

Mus musculus calmodulin dependent 
phosphatase catalytic subunit 

172 

100.000 

8270 

AB033026 

Homo sapiens KIAA1200 protein 

213 

89.189 

8271 

AJ249706 

Mus musculus myosin X 

318 

87.931 

8272 

X97675 

Homo sapiens plakophilin 2b 

172 

78.125 

8274 

AF126867 

Mus musculus calpain-like protease 

558 

70.000 

8275 

AF126867 

Mus musculus calpain-like protease 

66 

30.000 

8277 

AF131820 

Homo sapiens Unknown 

453 

90.909 i 

8278 

AF13 1 ft? 0 

riomo sapiens unknown 

407 

100.000 

8279 

AF131820 

Homo sapiens Unknown 

1134 

76.022 

8280 

AL117452 

Homo sapiens hypothetical protein 

136 

72.727 

8281 

AL117452 

Homo sapiens hypothetical protein 

398 

75.862 

8282 

U34932 

Rattus norvegicus Fos -related antigen 

301 

68.421 

8283 

D88026 

Mus musculus Dhm2 protein 

219 

84.848 

8286 

AF017306 

Homo sapiens UnpES 

560 

75.591 
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8287 

AF017306 

Homo sapiens UnpES 

343 

98.077 

8286 

AF017306 

Homo sapiens UnpES _J 

1415 

100.000 

8289 

1754638 

Mus musculus rhbtekin 

149 

38.983 

8291 

L05424 

Homo sapiens cell surface glycoprotein 
CD44 

4568 

99.555 

8292 

L05424 

Homo sapiens cell surface glycoprotein 
CD44 

449 

87.671 

8296 

U23147 

Caenorhabditis elegans No definition 
line found 

323 

31.282 

8298 

AB018335 

Homo sapiens KIAA07 92 protein 

777 

67.456 

8299 

AB018335 

Homo sapiens KIAA07 92 protein 

47 

37.500 

8300 

X91879 

Homo sapiens sperm specific protein 

190 

86.111 

8302 

X90858 

Homo sapiens uridine phosphorylase 

481 

85.882 

8303 

X90858 

Homo sapiens uridine phosphorylase 

209 

91.667 

8304 

X90858 

Homo sapiens uridine phosphorylase 

93 9 

94.611 

8306 

U51576 

Felis catus MHC class II antigen 

1332 

79.921 

8307 

U51576 

Felis catus MHC class II antigen 

13 9 

95.238 

8308 

U51576 

Felis catus MHC class II antigen 

1269 ! 

84.375 

8309 

AB013384 

Homo sapiens HIP1R 

47 

36.364 

8310 

AB013384 

Homo sapiens HIP1R 

2810 

56.262 

8311 

AB013384 

Homo sapiens HIP1R 

5628 j 

100.000 

8312 

AB013384 

Homo sapiens HIP1R 

207 

100.000 

8315 

AF118125 

Homo sapiens polycystin-2 -like protein 

190 

21.456 

8316 

Z77660 

Homo sapiens Similarity to Human 
enoyl-CoA hydratase (SW : ECHM_HUMAN) ; 
cDNA EST EMBL:T00611 comes from th 

270 

44.898 

8319 

AF078777 

Homo sapiens ABC transporter 

199 

100.000 

8325 

X51829 

Mus musculus MyD116 protein (AA 1-657) 

147 

37.143 

8329 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

48 

40.000 

8330 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

48 

40.000 

8331 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

57 

36.842 

8332 

AP000616 

Oryza sativa similar to RING-H2 finger 
protein RHAla (AF078683) 

159 

7 0.000 

8333 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

141 

42.857 

8335 

U61262 

Homo sapiens neogenin 

352 

82.609 

8338 

AB007897 

Homo sapiens KIAA0437 

335 

56.701 

8340 

AB023224 

Homo, sapiens KIAA1007 protein 

307 

97.674 

8341 

AB023224 

Homo sapiens KIAA1007 protein 

6808 

96.621 

8342 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1051 

88.830 

8343 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

948 

91.304 

8344 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

807 

97.541 

8345 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

612 

78.195 

8346 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

764 

63.404 

8347 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

1441 

94. 805 
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8396 

X99211 

Drosophila melanogaster ubiquitin- 
specif ic protease 

281 

55.000 

8397 

X53778 

Homo sapiens uracil DNA glycosylase 

622 

87. 903 

8398 

X53778 

Homo sapiens uracil DNA glycosylase 

1251 

83.673 

8400 

X13482 

Homo sapiens U2 snRNP- specific A 1 
protein (AA 1-255) 

134 

29.870 

8401 

X13482 

Homo sapiens U2 snRNP- specific A 1 
protein (AA 1-255) 

167 

26.712 

8403 

X90568 

liomo sapiens Protein sequence and 
annotation available soon via Swiss - 
Prot; available at present via e-mail 
from LABEIT@EMBL-Heidelberg.DE 

133 

25.610 

8404 

X90568 

Homo sapiens Protein sequence and 
annotation available soon via Swiss- 
Prot; available at present via e-mail 
from LABEiT®EPiBLi^xiexaejLi>erg . vis 

10724 

n -7 oil 
87 .211 

8406 

AF191018 

Homo sapiens E21G3 

3666 

100 000 

8407 

AF191018 

Homo sapiens E2XG3 



8411 

U00968 

Homo sapiens SREBP-1 

532 

100 000 

A V V/ . W V w 

8412 

U00968 

Homo sapiens SREBP-1 


Q 792 

8413 

AF181856 

14 Rattus norvegicus tRNA selenocysteine 
associated protein 

ion 


8414 

AF165163 

Mus musculus growth suppressor 1L 

461 

52.893 

8415 

AF165163 

Mus musculus growth suppressor 1L 

A Q 

G, 1 AA 

8416 

AF165163 

Mus musculus qrowth suppressor 1L 

916 

40.097 

8418 

AB020706 - 

Homo sapiens KIAA0899 protein 

2 87 


8419 

AB020706 

Homo sapiens KIAA0899 protein 

5715 

98.499 

8428 

AF039698 

Homo sapiens antigen NY-CO- 3 3 

393 

47 . 619 

8431 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2684 

90.217 

8432 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha . 

1960 

85.124 

8433 

AB018566 

Homo sapiens Proline synthetase 
associated 

468 

100.000 

8434 

AB018566 

Homo sapiens Proline synthetase 
associated 

500 

93 . 827 

8435 

AB018566 

Homo sapiens Proline synthetase 
associated 

I/O 1 


8436 

Z83864 

Mycobacterium tuberculosis 
hypothetical protein Rv3835 

11 / 

^ 1 ft A £ 

8437 

M22334 

Homo sapiens unknown protein 

IOC 

/ U . D J J 

8439 

U97079 

Mus musculus U5-116kD 

239 

92.500 

8440 

U97079 

Mus musculus U5-116kD 

• 6344 

98 . 561 

8444 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2253 

87.898 

8445 

AL031032 

Arabidopsis thaliana extensin-like 
protein 

323 

40.157 

O A A C 

o44o 

At Uoo / U2 

r UcUUUlcLUlco ViiUo y xy v y^a.** y*-* 

precursor 

42 

28.571 

8447 

AF086702 

Pseudorabies viruB glycoprotein gD 
precursor 

158 

47.692 

8452 

X04655 

Rattus norvegicus synapsin I (AA 1- 
691) 

50 

47.826 

8453 

X04655 

Rattus norvegicus synapsin I (AA 1- 

58 

70.000 
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8348 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

838 

72.081 

8349 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2307 

79.874 

8350 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2658 

90.043 j 

8351 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2575 

97.985 

8352 

U09823 

Oryctolagus cuniculus elongation 
factor 1 alpha 

2735 

94.635 

8353 

U29244 

Caenorhabditis elegans No definition 
line found 

319 

36.806 

8354 

U29244 

Caenorhabditis elegans No definition 
line found 

254 

43.956 j 

8355 

AF006621 

Homo sapiens embryonic lung protein 

373 

65.049 

8356 

AF006621 

Homo sapiens embryonic lung protein 

1042 

100.000 

8357 

AB029040 

Homo sapiens KIAA1117 protein 

468 

98.701 

8358 

AB029040 

Homo sapiens KIAA1117 protein 

8967 

100.000 1 

8363 

M76546 

He li an thus annuus hydroxyproline-rich 
protein 

129 

33.898 

8366 

X80397 

Streptococcus pyogenes orfl 

131 

29.814 

8367 

AF061258 

Homo sapiens LIM protein 

1304 

95.238 

8372 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

74 

27.083 

8373 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

66 

30.986 

8374 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

61 

24.167 

8375 

AF069602 

Homo sapiens myosin light, chain kinase 
isoform 3A 

75 

57.692 

8377 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

74 

34.177 

8378 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A j 

68 

35.556 

8379 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

65 

29.825 

8380 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

103 

27.933 

8381 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

330 

57.944 

8382 

AF069602 

Homo sapiens myosin light chain kinase 
isoform 3A 

61 

20.213 

8383 

U82535 

Homo sapiens fatty acid amide 
hydrolase 

817 

88.435 

8384 

U82535 

Homo sapiens fatty acid amide 
hydrolase 

3884 

99.827 

8385 

U82535 

Homo sapiens fatty acid amide 
hydrolase 

163 

96.000 

8387 

AB011157 

Homo sapiens KIAA0585 protein 

131 

100.000 

8390 

U64601 

Caenorhabditis elegans Gene probably 
begins in the next cosmid 

547 

63.025 

8391 

Y19188 

Homo sapiens aczonin 

148 

33.333 | 

8394 

X90858 

Homo sapiens uridine phosphorylase 

381 

87.671 

8395 

X90858 

Homo sapiens uridine phosphorylase 

1551 

93.269 


> 
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8454 

X04655 

Rattus norvegicus synapsin I (AA 1- 
691) 

98 

24.561 

8455 

X75947 

Mus musculus mCBP 

488 

83 .654 

8456 

X75947 

Mus musculus mCBP 

1071 

82.807 

8457 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glutamyltranspeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Pgenesh predictions.) 

796 

96 . 094 

8458 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glutamyltranspeptidase, contains CCA 
trinucleotide repeat, based on GenBcan 
and Fgenesh predictions.) 

4177 

96.997 

8459 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

194 

60.345 

8460 

Z28407 

Homo sapiens ribosomal protein L8 

846 

79.121 

8461 

Z28407 

Homo sapiens ribosomal protein L8 

1379 

91.603 

8463 

AF109906 

Mus musculus NG22 

69 

31.667 

8464 

AF109906 

Mus musculus NG22 

138 

28.767 

8469 

A03758 

Homo sapiens serum albumin 

996 

87.912 

8470 

A03758 

Homo sapiens serum albumin 

705 

55.039 

8473 

A03758 

Homo sapiens serum albumin 

1099 

65.033 

8474 

A03758 

Homo sapiens serum albumin 

1334 

68.293 

8475 

A0375B 

Homo sapiens serum albumin 

2767 

77.094 

8476 

A03758 

Homo sapiens serum albumin 

1566 

87.279 

8477 

A03758 

Homo sapiens serum albumin 

1083 

99.387 

8478 

A03758 

Homo sapiens serum albumin 

3469 

74.031 

8481 

AB002375 

Homo sapiens KIAA0377 

143 

67,647 

8482 

X99711 

Spodoptera littoralis nuclear 
polyhedrosis virus orf 1035 

140 

32.692 

8485 

X97675 

Homo sapiens plakophilin 2b 

191 

70.270 

8488 

AF096709 

Drosophila virilis failed axon 
connections protein 

237 

35.971 

8489 

AF096709 

Drosophila virilis failed axon 
connections protein 

235 

40.952 

8490 

L23113 

Homo sapiens binding protein 

598 

95.876 

8491 

L23113 

Homo sapiens binding protein 

2918 

89.101 

8492 

LL23113 

Homo sapiens binding protein 

2881 

87.103 

8493 

L23113 

Homo sapiens binding protein 

371 

98.438 

8494 

L23113 

Homo sapiens binding protein 

3158 

98.476 ! 

8497 

AF106685 

Homo sapiens myelin gene expression 
factor 2* 

1442 

99.107 

8499 

AF034580 

Mus musculus TSG118 . 1 

270 

43.796 

8500 

AF034580 

Mus musculus TSG118.1 

1622 

58.378 

8501 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

66 

21.505 

8502 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

400 

36.723 
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8454 

X04655 

Rattus norvegicus synapsin I {AA 1- 
691) 

98 

24.561 

8455 

X75947 

Mus musculus mCBP 

488 

83.654 

8456 

X75947 

Mus musculus mCBP 

1071 

82.807 

8457 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glut amy 1 transpeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Fgenesh predictions.) 

796 

96.094 

8458 

/UjC/4 9/09 

Homo sapiens d*J18C9.2 (novel gene 

(lOCUS UZUblvl/ BlITillar CO OalTUTla ~ 
yxuuaiiiyjL UJL ail o|JC:}J i — LudDC f LyiiLaiuy v-v^M. 

trinucleotide reoeat . based on Genscan 
and Fgenesh predictions.) 

4177 

96 . 997 

8459 

AF010144 

Homo sapiens neuronal thread ■ protein 
AD7c-NTP 

194 

60 . 345 

8460 

Z28407 

Homo sapiens ribosomal protein L8 

846 

79.121 

8461 

Z28407 

Homo sapiens ribosomal protein L8 

1379 

91.603 

8463 

AF109906 

Mus musculus NG22 

69 

31 . 667 

8464 

AF109906 

Mus musculus NG22 

138 

28. 767 

8469 

A03758 

Homo sapiens serum albumin 

996 

87.912 

8470 

A03758 

Homo saoiena seinim albumin 

705 

55 . 039 

8473 

A03758 

Homo saoiens serum albumin 

1099 

65 . 033 

8474 

A03758 

Homo sapiens serum albumin 

1334 

68.293 
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8476 

A03758 

Homo sapiens serum albumin . 

1566 

87.279 
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Homo sapiens serum albumin 
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8485 

X97675 

"Homo a^jT~i "{ f^n r r>1 aVot^HH 1 "I n 0 V> 

191 

70 . 270 

8488 

AF096709 

Drosophila virilis failed axon 
connections oirotein 

237 

35.971 

8489 

AF096709 

DroaoDhila virilis failed axon 
connections protein 

235 

4 0 . 952 

8490 

L23113 

Homo sapiens binding protein 

598 

95.876 

8491 

L23113 

Homo s aniens bindina orotein 

2918 

89.101 • 

8492 

L23113 

Homo sapiens binding protein 

2881 

87.103 

8493 

L23113 

Homo sapiens binding protein 

371 

98.438 

8494 

L23113 

Homo sapiens binding protein 

3158 

98.476 

8497 

AF106685 

Homo sapiens myelin gene expression 
factor 2 

1442 

99.107 

8499 

AF034580 

Mus musculus TSG118.1 

270 

43.796 

8500 

AF034580 

Mus musculus TSG118.1 

1622 

58.378 

8501 
» 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

66 

21.505 

8502 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

400 

36.723 
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8454 

X04655 

Rattus norvegicus synapsin I (AA 1- 
691) 

98 

24.561 

8455 

X75947 

Mus musculus mCBP 

488 

83 .654 

8456 

X75947 

Mus mueculus mCBP 

1071 

82.807 

8457 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glut amyl transpeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Fgenesh predictions.) 

796 

96.094 

8458 

AL049709 

Homo sapiens dJ18C9.2 (novel gene 
(locus D20S101) similar to Gamma- 
glutamyl transpeptidase, contains CCA 
trinucleotide repeat, based on Genscan 
and Fgenesh predictions.) 

4177 

96.997 

8459 

AF010144 

Homo sapiens neuronal thread protein 
AD7C-NTP 

194 

60.345 

8460 

Z28407 

Homo sapiens ribosomal protein L8 

846 

79 . 121 

8461 

Z28407 

Homo sapiens ribosomal protein L8 

1379 

91 . 603 

8463 

AF109906 

Mus musculus NG22 

69 

31 . 667 

8464 

AF109906 

Mus musculus NG22 

138 

28.767 

8469 

A03758 

Homo sapiens serum albumin 

996 

87.912 

8470 

A03758 

Homo sapiens serum albumin 

705 

55.039 1 

8473 

A03758 

Homo sapiens serum albumin 

1099 

65.033 

8474 

A03758 

Homo sapiens serum albumin 

1334 

68.293 

8475 

A03758 

Homo sapiens serum albumin 

2767 

77 .094 

8476 

A03758 

Homo sapiens serum albumin 

1566 

87.279 

8477 

A037.58 

Homo sapiens serum albumin 

1083 

99.387 | 

8478 

A03758 

Homo sapiens serum albumin 

3469 

74.031 

8481 

AB002375 

Homo sapiens KIAA0377 

143 j 

67.647 

8482 

X99711 

Spodoptera littoralis nuclear 
polyhedrosis virus orf 1035 

14 0 

32.692 

8485 

X97675 

Homo sapiens plakophilin 2b 

191 

70.270 

8488 

AF096709 

Drosophila virilis failed axon 
connections protein 

237 

35.971 

8489 

AF096709 

Drosophila virilis failed axon 
connections protein 

235 

40.952 

8490 

L23113 

Homo sapiens binding protein 

598 

95.876 

8491 

L23113 

Homo sapiens binding protein 

2918 

89.101 

8492 

L23113 

Homo sapiens binding protein 

2881 

87.103 

8493 

L23113 

Homo sapiens binding protein 

371 

98.438 

8494 

L23113 

Homo sapiens binding protein 

3158 

98.476 

8497 

AF106685 

Homo sapiens myelin gene expression 
factor 2 

1442 

99.107 

8499 

AF034580 

Mus musculus TSG118.1 

270 

43.796 

8500 

AF034580 

Mus musculus TSG118.1 

1622 

58.378 

8501 
» 

Y12713 

Mus musculus Pro-Pol -dUTPase 
polyprotein 

66 

21.505 

8502 

Y12713 

Mus musculus Pro-Pol-dUTPase 
polyprotein 

400 

36.723 


